Clc09G00280 (gene) Watermelon (cordophanus) v2

Overview
Name: Clc09G00280
Type: gene
Organism: Citrullus lanatus subsp. cordophanus cv. cordophanus (Watermelon (cordophanus) v2)
Description: protein CHUP1, chloroplastic
Location: ClcChr09: 253777 .. 257344 (+)
RNA-Seq Expression: Clc09G00280
Synteny: Clc09G00280
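
The Location field gives the start and end of the gene on ClcChr09. As a minimal sketch, the genomic span can be derived from those two numbers, assuming the coordinates are 1-based and inclusive (the GFF3 convention; the record itself does not state which convention it uses). The variable names are illustrative only.

```python
# Feature coordinates as listed in the Location field of this record.
start, end = 253777, 257344

# Assumption: coordinates are 1-based and inclusive (GFF3 convention),
# so the span is end - start + 1.
span_bp = end - start + 1

print(f"ClcChr09:{start}..{end} (+) spans {span_bp} bp")
```

Under that assumption the span should match the length of the gene sequence (with introns) listed under Sequences.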
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTAAATCCTCAAAAACGAAAAACCATCCAAAATTAAACCATTTCCATAACAAAACCCATCTTCTTCTTCCTCGTGATCCTTATTTTAATGTACGTTTAATTATCCCTATCTCATTTAACTTAACTGTTCCAAATTGGAATCCATTAAGATCCCTTTCTTTCCCTGTTCCATCAACCAAAATCCTTCTGGGTAGACCCGCTCTTGCACCTCAACATCGAGACAAAACGTGGCTATTGTTTGCCAAAGAAGAAAGAAAAGAAAAAGGGATTTTGAATTGGAATTTTCTACACTTGGAATCGTGTTTAATCAGCCTATGACACGGACGAAAAAGCACCCAATAGAAACAACAATCGCAGTGATCCGCCCAACGTCAAAGCGTAGCTCTCCCGCTGATTTCATGCGTTCCCAAATGCTTTTCTGTTACAAAATCAAAGTAAGCTCTCCACCCTTCTCCTCCATAACTCATTTGTGTTTCCTCTAATCTTCCCTCCTCTTCCGAATCTGCAATGGGGTTTGGTTGATTTTTCCTCTTCTCTTACCAATCTGAAATGGGTTTCGGCTGATTTTTCGAAGTATTCCTTCTTTCTGCAGCAGTTGATCGTTTCATTGTCTTTTCTTATTGAATGGGTTTCCAGAGTTTTTATGGGGTTGATTTGAAAACAACTTGTTTATGTGCTTTGATCTCAGATGGGTCATCTGATTCTCTTTGGTTTATTCAGATGGAGGACGATGAAGACGAGGACCTATAATATTTTGTGATCCTTCAGTTTATTAATTTTGTTAGATTAAAGGTTCTTGATATCCAATAATGGAAAACAAGAGGGATTTGATGAAGCCTATATTATTCAAATTTGGGGTTGTTCTGGCTATCTCCTTAGCTGGTTTTCTCTGTTCCCGATCCAGACAGAGAAATAAAAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGGTTCTTATATTCAATCTCTTATGCTTCGTCAATTTCTTAATGTTAATTGTTAAGTCATGTATTTGGTGTTTGTGAGAGTTCGAATCTCTAATCTTTTGGTCTTTAACTGATTGAGATATATTGTACTATGTTTAGTTGGCAAGTTTTAAGTCGTGTACTTCGTTTGCTTTTAGCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGAGGAAGAGGCCCTAGACTTGATAAGCAAGGAATGAAGGCAGCAACAACAGCATCCTCCAATGTTGTTCTTTTTGCAGTTGATGCCTATGTAAGTTTCCTAAAGATGGTTTTTACATTTTGATAATATGAATAACTTGGAATGTTCAAAAAGGTTGGATACCAGGTAGGTTTTGTTGAAGTTGGTTGTTCAAAATTTGTGTAATGAACCCATTTGTTTAACTGTTGGGTGATTGAAAATGCTAATACCTTATGATGAAAGTATTTTGTTAATGAAGAAACCATCAGGACAGTTGATTGTTCTTTCTCAGTCTCTGTCCTTGGCAATGGTTTGTTTTCAGCATGTTGGTCTTTGATTAAGACATAAGCTTCATGTTATGTGTTATGGTCTTTTCTAAGCTCGTGACTTTGTTGTTTGATTTCTTTCTCAGGAAGAAATGTGTATTCCAAAAGTCAATGTTGGTGATTCAATTGTTGGTCTCTGTCCTAGCAATAAGAATAAGCATGGTGTAGATAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTTCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAATGTTGAAGCACCAAGGTCGGGGCTCGAAACTCCAAAAGCTTATAAGACTGTTGAGGATGATCAATATGAACAAGAGATCAGGCACCTCAAATGCAAGGTGAAAATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCTGTAATGGAGCTCCAAAATAGGTTGAAGATTAACAACATGGAAGCCAAGCTTTTCAACCTCA
AGATCGAGTCCCTTCAGGCTGAAAACCGACGATTAGAGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCTGCAAAAGCAAAAATTAAGTTTCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGAAAGAAAGCAATAAAGATGCCCAAATCAAGCTGCAAAAGATTGAAGAATTGGAGAAGGAGACAGAGGACTTGAGAAAGTCGAATTTGAGATTACAAAGAGAAAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAGTACGTTTTTCTTCCTAGCATAGCTATTTTACCTTTTTTATAATTTTAAGGTGATGTAAAATTGAATGCCATGCCCGTCTAATTTGTGCAGAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCACAAGAGAACAAGGCGTTGACTAAGGAAATCGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTCCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATTTTCAGCCTCCAGCTGGGAAAACAGCAGCAAGAGATCTAAGCAAAACATTAAGTCCCAAATCTGAGGAGAAAGCGAAGAAGCTGATCCTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATTAATGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCAGATGATTCAGCTGTTGATTTTCCATCAACAGCCAAAACAAGTTCAAACAAAGTCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAATCTGCAGCATCTGTAGAAGATAGCGATTCTTCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCTCATAATTCATCAATACATTCAATGGATTTTCACAGATTGCAGGGCCAAAAGGAAGATGACGGAAAAACTGAAGACTCCATAAGAAGGAATAGTGACGTTGGCTATGTTAACAAGAGATTTGTTTCAGGGAGCGACCGATCGAGCAACTCGTCATATAGATCTCAAAGTCATGACACAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGTCACATAGGAAGGCTGCATCCATTGGTTCGGTTTGAACATAAAAAAAAGCTTGTCTGGCTTTCAATGGCTTCATCATCACCTGTTCAGATGTGTATAATTAAGGACCATACTGTTCACAAAAGATAGGTAGAAACTAATTCCACTGATACGGAAACACTTTGTAACATAAATAACTTACTGTTGTAATGTTACTTGTACGAGCTAATCCTAATACGAAACTAATCTTATCAAGTTTTCTCTTACTATTAATGTGTCAATC

mRNA sequence

GTAAATCCTCAAAAACGAAAAACCATCCAAAATTAAACCATTTCCATAACAAAACCCATCTTCTTCTTCCTCGTGATCCTTATTTTAATGTACGTTTAATTATCCCTATCTCATTTAACTTAACTGTTCCAAATTGGAATCCATTAAGATCCCTTTCTTTCCCTGTTCCATCAACCAAAATCCTTCTGGGTAGACCCGCTCTTGCACCTCAACATCGAGACAAAACGTGGCTATTGTTTGCCAAAGAAGAAAGAAAAGAAAAAGGGATTTTGAATTGGAATTTTCTACACTTGGAATCGTGTTTAATCAGCCTATGACACGGACGAAAAAGCACCCAATAGAAACAACAATCGCAGTGATCCGCCCAACGTCAAAGCGTAGCTCTCCCGCTGATTTCATGCGTTCCCAAATGCTTTTCTGTTACAAAATCAAAATGGAGGACGATGAAGACGAGGACCTATAATATTTTGTGATCCTTCAGTTTATTAATTTTGTTAGATTAAAGGTTCTTGATATCCAATAATGGAAAACAAGAGGGATTTGATGAAGCCTATATTATTCAAATTTGGGGTTGTTCTGGCTATCTCCTTAGCTGGTTTTCTCTGTTCCCGATCCAGACAGAGAAATAAAAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGAGGAAGAGGCCCTAGACTTGATAAGCAAGGAATGAAGGCAGCAACAACAGCATCCTCCAATGTTGTTCTTTTTGCAGTTGATGCCTATAAACCATCAGGACAGTTGATTGTTCTTTCTCAGTCTCTGTCCTTGGCAATGGAAGAAATGTGTATTCCAAAAGTCAATGTTGGTGATTCAATTGTTGGTCTCTGTCCTAGCAATAAGAATAAGCATGGTGTAGATAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTTCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAATGTTGAAGCACCAAGGTCGGGGCTCGAAACTCCAAAAGCTTATAAGACTGTTGAGGATGATCAATATGAACAAGAGATCAGGCACCTCAAATGCAAGGTGAAAATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCTGTAATGGAGCTCCAAAATAGGTTGAAGATTAACAACATGGAAGCCAAGCTTTTCAACCTCAAGATCGAGTCCCTTCAGGCTGAAAACCGACGATTAGAGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCTGCAAAAGCAAAAATTAAGTTTCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGAAAGAAAGCAATAAAGATGCCCAAATCAAGCTGCAAAAGATTGAAGAATTGGAGAAGGAGACAGAGGACTTGAGAAAGTCGAATTTGAGATTACAAAGAGAAAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCACAAGAGAACAAGGCGTTGACTAAGGAAATCGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTCCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATTTTCAGCCTCCAGCTGGGAAAACAGCAGCAAGAGATCTAAGCAAAACATTAAGTCCCAAATCTGAGGAGAAAGCGAAGAAGCTGATCCTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATTAATGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCAGATGATTCAGCTGTTGATTTTCC
ATCAACAGCCAAAACAAGTTCAAACAAAGTCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAATCTGCAGCATCTGTAGAAGATAGCGATTCTTCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCTCATAATTCATCAATACATTCAATGGATTTTCACAGATTGCAGGGCCAAAAGGAAGATGACGGAAAAACTGAAGACTCCATAAGAAGGAATAGTGACGTTGGCTATGTTAACAAGAGATTTGTTTCAGGGAGCGACCGATCGAGCAACTCGTCATATAGATCTCAAAGTCATGACACAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGTCACATAGGAAGGCTGCATCCATTGGTTCGGTTTGAACATAAAAAAAAGCTTGTCTGGCTTTCAATGGCTTCATCATCACCTGTTCAGATGTGTATAATTAAGGACCATACTGTTCACAAAAGATAGGTAGAAACTAATTCCACTGATACGGAAACACTTTGTAACATAAATAACTTACTGTTGTAATGTTACTTGTACGAGCTAATCCTAATACGAAACTAATCTTATCAAGTTTTCTCTTACTATTAATGTGTCAATC

Coding sequence (CDS)

ATGGAAAACAAGAGGGATTTGATGAAGCCTATATTATTCAAATTTGGGGTTGTTCTGGCTATCTCCTTAGCTGGTTTTCTCTGTTCCCGATCCAGACAGAGAAATAAAAGACCTCCTCTGCCTCCTCCCTCGTCGAGTTCTTCAGATGATCAGGGCAATAAAGTTGACTTGGGAAGAGGAAGAGGCCCTAGACTTGATAAGCAAGGAATGAAGGCAGCAACAACAGCATCCTCCAATGTTGTTCTTTTTGCAGTTGATGCCTATAAACCATCAGGACAGTTGATTGTTCTTTCTCAGTCTCTGTCCTTGGCAATGGAAGAAATGTGTATTCCAAAAGTCAATGTTGGTGATTCAATTGTTGGTCTCTGTCCTAGCAATAAGAATAAGCATGGTGTAGATAAAGATGGCTTGCTTCTCCCAGAGTTTCAGGAACTTGTCAAGGAATTTGATTTTTCTGCAGCAAATGCTGGGCTTTCTCCTAAGAAAAATGTTGAAGCACCAAGGTCGGGGCTCGAAACTCCAAAAGCTTATAAGACTGTTGAGGATGATCAATATGAACAAGAGATCAGGCACCTCAAATGCAAGGTGAAAATGCTGCGAGAGAGGGAGAGGAACCTTGAGGTTCAACTACTTGAGTATTATGGTCTGAAAGAGCAAGAAACTGCTGTAATGGAGCTCCAAAATAGGTTGAAGATTAACAACATGGAAGCCAAGCTTTTCAACCTCAAGATCGAGTCCCTTCAGGCTGAAAACCGACGATTAGAGTCACAAGTTTGCGATCATGCTAAGTCAGTGTCCGACCTCGAGGCTGCAAAAGCAAAAATTAAGTTTCTCAAGAAAAAACTTAGATATGAAGCAGAACAGAACAGGGGACAGATCTTAAATCTTCAGCAAAGAGTTGTTAAGCTGCAAGATCAAGAACATAAGAAAGAAAGCAATAAAGATGCCCAAATCAAGCTGCAAAAGATTGAAGAATTGGAGAAGGAGACAGAGGACTTGAGAAAGTCGAATTTGAGATTACAAAGAGAAAATTCTGATCTGGGTCGGAGATTAGATGCTACTCAATTTCTTGCAAATTCTATTTTGGAAGACCAAGAAAAAGAATCACTCAAAGAAGAAAGGGAGCGTTTGGCACAAGAGAACAAGGCGTTGACTAAGGAAATCGAGCAGCTTCAAGCACACCGGTGTGCAGATGTTGAAGAGCTAGTCTATCTCCGCTGGATTAATGCTTGCTTAAGATATGAACTGCGGAATTTTCAGCCTCCAGCTGGGAAAACAGCAGCAAGAGATCTAAGCAAAACATTAAGTCCCAAATCTGAGGAGAAAGCGAAGAAGCTGATCCTCGAATATGCAAATACAGAAGGAATTGAAGGGAAGGGCATTAATGTTGTGGATTTCGATTCAGATCAATGGTCGTCTTCACAAGCTTCCTCTCATACTGATCCTGGAGATCCAGATGATTCAGCTGTTGATTTTCCATCAACAGCCAAAACAAGTTCAAACAAAGTCAAATTCATTAGTAAACTCAGAAAACTCTTGAGGGGAAAAGGTAGTCAACAAAACCTGACTTTGTTAGCAGAAAAATCTGCAGCATCTGTAGAAGATAGCGATTCTTCTCGTTACAGTTCAAGTAATTCTACTGGGACCAATGCTACTCGAGCCGAGGGGCAGGGTATTGGATACACAACTCCATCTCATAATTCATCAATACATTCAATGGATTTTCACAGATTGCAGGGCCAAAAGGAAGATGACGGAAAAACTGAAGACTCCATAAGAAGGAATAGTGACGTTGGCTATGTTAACAAGAGATTTGTTTCAGGGAGCGACCGATCGAGCAACTCGTCATATAGATCTCAAAGTCATGACACAGAATCCACCGAAAAGTCTGAGTTGATGAAATATGCTGAAGTTTTGAAAGACACTCGAGGAGCTAAGAACCGGTCACATAGGAAGGCTGCATCCATTGGTTCGGTTTGA

Protein sequence

MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRGRGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIVGLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTVEDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFNLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKKESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLANSILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSSRYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVGYVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGSV
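
The protein sequence corresponds to the CDS read codon by codon. The sketch below illustrates this on the first 30 and last 12 nucleotides of the CDS above, assuming the standard nuclear genetic code; `translate`, `cds_start`, and `cds_end` are hypothetical names introduced for this example, not part of the database.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed in this record.
cds_start = "ATGGAAAACAAGAGGGATTTGATGAAGCCT"
cds_end = "GGTTCGGTTTGA"

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues before the TGA stop
```

The two fragments give the residues that open and close the protein sequence listed above (MENKRDLMKP and GSV).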
Homology
BLAST of Clc09G00280 vs. NCBI nr
Match: XP_038898688.1 (protein CHUP1, chloroplastic [Benincasa hispida] >XP_038898689.1 protein CHUP1, chloroplastic [Benincasa hispida] >XP_038898690.1 protein CHUP1, chloroplastic [Benincasa hispida])

HSP 1 Score: 1045.4 bits (2702), Expect = 2.0e-301
Identity = 580/660 (87.88%), Positives = 601/660 (91.06%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           M++KRDLMKPILFKFG  LAIS AGFLCS+ R RNKRPPL PPSSSSSDDQ +KVDLGRG
Sbjct: 1   MDDKRDLMKPILFKFGFALAISFAGFLCSQFRLRNKRPPLLPPSSSSSDDQSSKVDLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLD QG+KAAT ASSNVV FAVDAY                 E+ CIPKVN  DS +
Sbjct: 61  RGPRLDNQGLKAATAASSNVVHFAVDAY-----------------EKKCIPKVNFDDSNI 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GL PS  NKHGVDKDG LLPEFQELVKEFDFSAANAGL PKKNVEAPRSGLETPKAYKTV
Sbjct: 121 GLRPS--NKHGVDKDG-LLPEFQELVKEFDFSAANAGLPPKKNVEAPRSGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           EDD+YEQEIRHLK KVK LRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 EDDEYEQEIRHLKSKVKTLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
            LKIESLQA+NRRLESQVCDHAKSVSDLEAAKAKIKFLKKK+RYEAEQNRGQILNLQQRV
Sbjct: 241 TLKIESLQADNRRLESQVCDHAKSVSDLEAAKAKIKFLKKKIRYEAEQNRGQILNLQQRV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           VKLQDQEHK  ESNKDAQI+LQKIEELEKE EDLRKSNL+LQ ENSDL RRLDATQFLAN
Sbjct: 301 VKLQDQEHKTNESNKDAQIRLQKIEELEKEIEDLRKSNLKLQIENSDLSRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL++EN+ALTKEIEQLQAHRCAD+EELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEEMERLSRENEALTKEIEQLQAHRCADIEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEGIEGK IN+ DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGIEGKSINITDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDS S 
Sbjct: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSGSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSI-RRNSDV 600
           RYSSSNS GTNATRAEGQGIGYTTPS NSS HSMDFHRL  QKEDDGKTEDSI RRNSDV
Sbjct: 541 RYSSSNSPGTNATRAEGQGIGYTTPSRNSSRHSMDFHRLNSQKEDDGKTEDSIRRRNSDV 600

Query: 601 GYVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
           GYVNK+FV GSD SSNSSYRSQS DTESTEKSELMKYAEVLKDTRGAKN+S RKAASIGS
Sbjct: 601 GYVNKKFVLGSDESSNSSYRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSQRKAASIGS 640
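
The identity and positives percentages in an HSP header are the match counts divided by the alignment length (which includes gap columns). A minimal sketch that reproduces the figures for the Benincasa hispida hit above; `percent` is a hypothetical helper, not a BLAST function.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100 * matches / aligned_length, 2)


# Counts taken from the HSP header of the XP_038898688.1 hit.
identity = percent(580, 660)   # identical residues / alignment columns
positives = percent(601, 660)  # identical + conservatively substituted residues

print(f"Identity {identity}%, positives {positives}%")
```

The same calculation applies to every other hit on this page by substituting its own counts.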

BLAST of Clc09G00280 vs. NCBI nr
Match: KAA0059471.1 (protein CHUP1 [Cucumis melo var. makuwa] >TYK03852.1 protein CHUP1 [Cucumis melo var. makuwa])

HSP 1 Score: 1011.9 bits (2615), Expect = 2.5e-291
Identity = 561/659 (85.13%), Positives = 591/659 (89.68%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           ME+K +LMKP+L KFGVVLAIS A FL SR R +NKRPPLPPP SSSSDDQGNKV+LGRG
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSSDDQGNKVNLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLD QGMKAAT ASSNVVLFAVDAY                 EEMCIPKVNV DS +
Sbjct: 61  RGPRLDNQGMKAATAASSNVVLFAVDAY-----------------EEMCIPKVNVDDSNL 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GLCPS  NKHGVDKDGLLLPEFQE VKEFD SAANA  SPKKNVEAPRSGLETPKAYKTV
Sbjct: 121 GLCPS--NKHGVDKDGLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           EDD+YEQEIRHLK KVKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 EDDEYEQEIRHLKSKVKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
             KIESL+A+NRRLESQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V
Sbjct: 241 TFKIESLEADNRRLESQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           +KLQDQEHK  ESNKDAQIKLQKIE+LEKE E+LRK N RLQ ENSDLGRRLDATQFLAN
Sbjct: 301 LKLQDQEHKTNESNKDAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL QEN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEETERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDSA +FPSTAKTSSNK+KFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDS 
Sbjct: 481 TDPGDPDDSAAEFPSTAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVG 600
            YSSSNSTGTNATRAEGQ IGY T S NSS +S+DF RL  QKED+ KTEDS RRNSDVG
Sbjct: 541 CYSSSNSTGTNATRAEGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVG 600

Query: 601 YVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
           YVNKRFV GSD+SSNSS RSQS DTESTEKSELMKYAEVLKDTRGAKN+SHRKAASIGS
Sbjct: 601 YVNKRFVLGSDQSSNSSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGS 640

BLAST of Clc09G00280 vs. NCBI nr
Match: XP_008462405.1 (PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] >XP_008462406.1 PREDICTED: protein CHUP1, chloroplastic [Cucumis melo])

HSP 1 Score: 1008.4 bits (2606), Expect = 2.8e-290
Identity = 560/659 (84.98%), Positives = 590/659 (89.53%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           ME+K +LMKP+L KFGVVLAIS A FL SR R +NKRPPLPPP SSSSDDQGNKV+LGRG
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSSDDQGNKVNLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLD QGMKAAT ASSNVVLFAVDAY                 EEMCI KVNV DS +
Sbjct: 61  RGPRLDNQGMKAATAASSNVVLFAVDAY-----------------EEMCIRKVNVDDSNL 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GLCPS  NKHGVDKDGLLLPEFQE VKEFD SAANA  SPKKNVEAPRSGLETPKAYKTV
Sbjct: 121 GLCPS--NKHGVDKDGLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           EDD+YEQEIRHLK KVKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 EDDEYEQEIRHLKSKVKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
             KIESL+A+NRRLESQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V
Sbjct: 241 TFKIESLEADNRRLESQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           +KLQDQEHK  ESNKDAQIKLQKIE+LEKE E+LRK N RLQ ENSDLGRRLDATQFLAN
Sbjct: 301 LKLQDQEHKTNESNKDAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL QEN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEETERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDSA +FPSTAKTSSNK+KFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDS 
Sbjct: 481 TDPGDPDDSAAEFPSTAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVG 600
            YSSSNSTGTNATRAEGQ IGY T S NSS +S+DF RL  QKED+ KTEDS RRNSDVG
Sbjct: 541 CYSSSNSTGTNATRAEGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVG 600

Query: 601 YVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
           YVNKRFV GSD+SSNSS RSQS DTESTEKSELMKYAEVLKDTRGAKN+SHRKAASIGS
Sbjct: 601 YVNKRFVLGSDQSSNSSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGS 640

BLAST of Clc09G00280 vs. NCBI nr
Match: XP_004141788.1 (protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] >KGN45575.1 hypothetical protein Csa_015974 [Cucumis sativus])

HSP 1 Score: 993.4 bits (2567), Expect = 9.2e-286
Identity = 553/659 (83.92%), Positives = 584/659 (88.62%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           ME+K +L +PILFKFGVVLAIS AGFL SR R +NKRPPLPPPS SSSDDQGNKV+LGRG
Sbjct: 1   MEDKGNLRRPILFKFGVVLAISFAGFLYSRFRLKNKRPPLPPPSYSSSDDQGNKVNLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLDKQG        SNVVLFAVDAY                 EE CIPKVN  DS +
Sbjct: 61  RGPRLDKQG------TPSNVVLFAVDAY-----------------EETCIPKVNFDDSNL 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GLCPS  NKHGVDKDGLL PEFQEL+KEFD SAANA  S KKNVEAPR GLETPKAYKTV
Sbjct: 121 GLCPS--NKHGVDKDGLLPPEFQELLKEFDLSAANAEFSSKKNVEAPRYGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           E+D+YEQEIR+LK KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 ENDEYEQEIRYLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
             KIESL+A+NRRLESQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV
Sbjct: 241 TFKIESLEADNRRLESQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           +KLQDQEHK  +SNKDAQIKLQKIE+LEKE E+LRKSNLRL+ ENSDLGRRLDATQFLAN
Sbjct: 301 LKLQDQEHKTNQSNKDAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL +EN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEETERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDS  DFPSTAKT SNK+KFISKLRKLL+GKGSQQN+TLLAEKSAASVEDSDS 
Sbjct: 481 TDPGDPDDSTTDFPSTAKTGSNKIKFISKLRKLLKGKGSQQNMTLLAEKSAASVEDSDSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVG 600
            YS+SNSTGTNATRAEGQ IGY TP  NSS HSMDFHRLQ QKEDD K EDSIRRNSDVG
Sbjct: 541 CYSTSNSTGTNATRAEGQAIGYATPLLNSSGHSMDFHRLQSQKEDDVKIEDSIRRNSDVG 600

Query: 601 YVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
            VNKRFV GSD+ S+SSYRSQ+ DTESTEKSELMKYAEVLKDTRGAKNRSHRK ASIGS
Sbjct: 601 CVNKRFVVGSDQLSDSSYRSQNQDTESTEKSELMKYAEVLKDTRGAKNRSHRKTASIGS 634

BLAST of Clc09G00280 vs. NCBI nr
Match: XP_031744947.1 (protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >XP_031744948.1 protein CHUP1, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 989.9 bits (2558), Expect = 1.0e-284
Identity = 553/660 (83.79%), Positives = 585/660 (88.64%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPS-SSSSDDQGNKVDLGR 60
           ME+K +L +PILFKFGVVLAIS AGFL SR R +NKRPPLPPPS SSS+DDQGNKV+LGR
Sbjct: 1   MEDKGNLRRPILFKFGVVLAISFAGFLYSRFRLKNKRPPLPPPSYSSSADDQGNKVNLGR 60

Query: 61  GRGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSI 120
           GRGPRLDKQG        SNVVLFAVDAY                 EE CIPKVN  DS 
Sbjct: 61  GRGPRLDKQG------TPSNVVLFAVDAY-----------------EETCIPKVNFDDSN 120

Query: 121 VGLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKT 180
           +GLCPS  NKHGVDKDGLL PEFQEL+KEFD SAANA  S KKNVEAPR GLETPKAYKT
Sbjct: 121 LGLCPS--NKHGVDKDGLLPPEFQELLKEFDLSAANAEFSSKKNVEAPRYGLETPKAYKT 180

Query: 181 VEDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL 240
           VE+D+YEQEIR+LK KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL
Sbjct: 181 VENDEYEQEIRYLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL 240

Query: 241 FNLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQR 300
           F  KIESL+A+NRRLESQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+R
Sbjct: 241 FTFKIESLEADNRRLESQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKR 300

Query: 301 VVKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLA 360
           V+KLQDQEHK  +SNKDAQIKLQKIE+LEKE E+LRKSNLRL+ ENSDLGRRLDATQFLA
Sbjct: 301 VLKLQDQEHKTNQSNKDAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLA 360

Query: 361 NSILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRN 420
           NS+LEDQEKESLKEE ERL +EN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRN
Sbjct: 361 NSLLEDQEKESLKEETERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRN 420

Query: 421 FQPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASS 480
           FQPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASS
Sbjct: 421 FQPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASS 480

Query: 481 HTDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDS 540
           HTDPGDPDDS  DFPSTAKT SNK+KFISKLRKLL+GKGSQQN+TLLAEKSAASVEDSDS
Sbjct: 481 HTDPGDPDDSTTDFPSTAKTGSNKIKFISKLRKLLKGKGSQQNMTLLAEKSAASVEDSDS 540

Query: 541 SRYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDV 600
             YS+SNSTGTNATRAEGQ IGY TP  NSS HSMDFHRLQ QKEDD K EDSIRRNSDV
Sbjct: 541 PCYSTSNSTGTNATRAEGQAIGYATPLLNSSGHSMDFHRLQSQKEDDVKIEDSIRRNSDV 600

Query: 601 GYVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
           G VNKRFV GSD+ S+SSYRSQ+ DTESTEKSELMKYAEVLKDTRGAKNRSHRK ASIGS
Sbjct: 601 GCVNKRFVVGSDQLSDSSYRSQNQDTESTEKSELMKYAEVLKDTRGAKNRSHRKTASIGS 635

BLAST of Clc09G00280 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 8.6e-53
Identity = 154/389 (39.59%), Positives = 239/389 (61.44%), Query Frame = 0

Query: 135 DGLLLPEFQELVK-EFDFSAANAGLSPKKNVEAPRSGLETPKAYKTVEDDQYEQEIRHLK 194
           D  +LPEF++L+  E ++        P  + +      E  + Y+ VE    + E+  LK
Sbjct: 85  DDDILPEFEDLLSGEIEY--------PLPDDDNNLEKAEKERKYE-VEMAYNDGELERLK 144

Query: 195 CKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFNLKIESLQAENRR 254
             VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI  +E  + N+ I SLQAE ++
Sbjct: 145 QLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKK 204

Query: 255 LESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKKESN 314
           L+ ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E ++  N
Sbjct: 205 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE-EEAMN 264

Query: 315 KDAQI--KLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQ---FLANSILEDQEKE 374
           KD ++  KL+ +++LE +  +L++ N  LQ E  +L  +LD+ +      +++ E  +  
Sbjct: 265 KDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVA 324

Query: 375 SLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAA 434
            ++EE   L   N+ L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +A
Sbjct: 325 KVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISA 384

Query: 435 RDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDS 494
           RDLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D++
Sbjct: 385 RDLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDFDNA 444

Query: 495 AVDFPSTAKTS-SNKVKFISKLRKLLRGK 517
           ++D  ++  +S S K   I KL+K  + K
Sbjct: 445 SMDSSTSRFSSFSKKPGLIQKLKKWGKSK 455

BLAST of Clc09G00280 vs. ExPASy TrEMBL
Match: A0A5A7V182 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G00460 PE=4 SV=1)

HSP 1 Score: 1011.9 bits (2615), Expect = 1.2e-291
Identity = 561/659 (85.13%), Positives = 591/659 (89.68%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           ME+K +LMKP+L KFGVVLAIS A FL SR R +NKRPPLPPP SSSSDDQGNKV+LGRG
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSSDDQGNKVNLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLD QGMKAAT ASSNVVLFAVDAY                 EEMCIPKVNV DS +
Sbjct: 61  RGPRLDNQGMKAATAASSNVVLFAVDAY-----------------EEMCIPKVNVDDSNL 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GLCPS  NKHGVDKDGLLLPEFQE VKEFD SAANA  SPKKNVEAPRSGLETPKAYKTV
Sbjct: 121 GLCPS--NKHGVDKDGLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           EDD+YEQEIRHLK KVKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 EDDEYEQEIRHLKSKVKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
             KIESL+A+NRRLESQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V
Sbjct: 241 TFKIESLEADNRRLESQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           +KLQDQEHK  ESNKDAQIKLQKIE+LEKE E+LRK N RLQ ENSDLGRRLDATQFLAN
Sbjct: 301 LKLQDQEHKTNESNKDAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL QEN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEETERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDSA +FPSTAKTSSNK+KFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDS 
Sbjct: 481 TDPGDPDDSAAEFPSTAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVG 600
            YSSSNSTGTNATRAEGQ IGY T S NSS +S+DF RL  QKED+ KTEDS RRNSDVG
Sbjct: 541 CYSSSNSTGTNATRAEGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVG 600

Query: 601 YVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
           YVNKRFV GSD+SSNSS RSQS DTESTEKSELMKYAEVLKDTRGAKN+SHRKAASIGS
Sbjct: 601 YVNKRFVLGSDQSSNSSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGS 640

BLAST of Clc09G00280 vs. ExPASy TrEMBL
Match: A0A1S3CGW9 (protein CHUP1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103500772 PE=4 SV=1)

HSP 1 Score: 1008.4 bits (2606), Expect = 1.3e-290
Identity = 560/659 (84.98%), Positives = 590/659 (89.53%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           ME+K +LMKP+L KFGVVLAIS A FL SR R +NKRPPLPPP SSSSDDQGNKV+LGRG
Sbjct: 1   MEDKGNLMKPLLLKFGVVLAISFASFLYSRFRLKNKRPPLPPPLSSSSDDQGNKVNLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLD QGMKAAT ASSNVVLFAVDAY                 EEMCI KVNV DS +
Sbjct: 61  RGPRLDNQGMKAATAASSNVVLFAVDAY-----------------EEMCIRKVNVDDSNL 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GLCPS  NKHGVDKDGLLLPEFQE VKEFD SAANA  SPKKNVEAPRSGLETPKAYKTV
Sbjct: 121 GLCPS--NKHGVDKDGLLLPEFQEHVKEFDLSAANAEFSPKKNVEAPRSGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           EDD+YEQEIRHLK KVKMLRERERNLE QLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 EDDEYEQEIRHLKSKVKMLRERERNLEFQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
             KIESL+A+NRRLESQVC+HAK+VSDLEAA+AKIKFLKKKLR+EAEQNR QILNLQQ+V
Sbjct: 241 TFKIESLEADNRRLESQVCNHAKTVSDLEAARAKIKFLKKKLRHEAEQNRRQILNLQQKV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           +KLQDQEHK  ESNKDAQIKLQKIE+LEKE E+LRK N RLQ ENSDLGRRLDATQFLAN
Sbjct: 301 LKLQDQEHKTNESNKDAQIKLQKIEDLEKEIEELRKLNSRLQIENSDLGRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL QEN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEETERLTQENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGKGI+V DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKGISVTDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDSA +FPSTAKTSSNK+KFI KL+KLLRGKGSQQNLTLLAEKSAAS+EDSDS 
Sbjct: 481 TDPGDPDDSAAEFPSTAKTSSNKIKFIGKLKKLLRGKGSQQNLTLLAEKSAASIEDSDSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVG 600
            YSSSNSTGTNATRAEGQ IGY T S NSS +S+DF RL  QKED+ KTEDS RRNSDVG
Sbjct: 541 CYSSSNSTGTNATRAEGQAIGYATSSRNSSRYSIDFQRLHSQKEDEVKTEDSARRNSDVG 600

Query: 601 YVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
           YVNKRFV GSD+SSNSS RSQS DTESTEKSELMKYAEVLKDTRGAKN+SHRKAASIGS
Sbjct: 601 YVNKRFVLGSDQSSNSSDRSQSQDTESTEKSELMKYAEVLKDTRGAKNQSHRKAASIGS 640

BLAST of Clc09G00280 vs. ExPASy TrEMBL
Match: A0A0A0K799 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G452300 PE=4 SV=1)

HSP 1 Score: 993.4 bits (2567), Expect = 4.4e-286
Identity = 553/659 (83.92%), Positives = 584/659 (88.62%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQGNKVDLGRG 60
           ME+K +L +PILFKFGVVLAIS AGFL SR R +NKRPPLPPPS SSSDDQGNKV+LGRG
Sbjct: 1   MEDKGNLRRPILFKFGVVLAISFAGFLYSRFRLKNKRPPLPPPSYSSSDDQGNKVNLGRG 60

Query: 61  RGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSIV 120
           RGPRLDKQG        SNVVLFAVDAY                 EE CIPKVN  DS +
Sbjct: 61  RGPRLDKQG------TPSNVVLFAVDAY-----------------EETCIPKVNFDDSNL 120

Query: 121 GLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTV 180
           GLCPS  NKHGVDKDGLL PEFQEL+KEFD SAANA  S KKNVEAPR GLETPKAYKTV
Sbjct: 121 GLCPS--NKHGVDKDGLLPPEFQELLKEFDLSAANAEFSSKKNVEAPRYGLETPKAYKTV 180

Query: 181 EDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240
           E+D+YEQEIR+LK KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF
Sbjct: 181 ENDEYEQEIRYLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLF 240

Query: 241 NLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRV 300
             KIESL+A+NRRLESQVCDHAKSVSDLEAA+AKIKFLKKKLRYEAEQNRGQILNLQ+RV
Sbjct: 241 TFKIESLEADNRRLESQVCDHAKSVSDLEAARAKIKFLKKKLRYEAEQNRGQILNLQKRV 300

Query: 301 VKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLAN 360
           +KLQDQEHK  +SNKDAQIKLQKIE+LEKE E+LRKSNLRL+ ENSDLGRRLDATQFLAN
Sbjct: 301 LKLQDQEHKTNQSNKDAQIKLQKIEDLEKEIEELRKSNLRLEIENSDLGRRLDATQFLAN 360

Query: 361 SILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNF 420
           S+LEDQEKESLKEE ERL +EN+ALTKEIEQLQAHR ADVEELVYLRWINACLRYELRNF
Sbjct: 361 SLLEDQEKESLKEETERLTRENEALTKEIEQLQAHRLADVEELVYLRWINACLRYELRNF 420

Query: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSH 480
           QPPAGKTAARDLSKTLSPKSEEKAKKLIL+YANTEG EGK +NV DFDSDQWSSSQASSH
Sbjct: 421 QPPAGKTAARDLSKTLSPKSEEKAKKLILDYANTEGNEGKSMNVTDFDSDQWSSSQASSH 480

Query: 481 TDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDSS 540
           TDPGDPDDS  DFPSTAKT SNK+KFISKLRKLL+GKGSQQN+TLLAEKSAASVEDSDS 
Sbjct: 481 TDPGDPDDSTTDFPSTAKTGSNKIKFISKLRKLLKGKGSQQNMTLLAEKSAASVEDSDSP 540

Query: 541 RYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDVG 600
            YS+SNSTGTNATRAEGQ IGY TP  NSS HSMDFHRLQ QKEDD K EDSIRRNSDVG
Sbjct: 541 CYSTSNSTGTNATRAEGQAIGYATPLLNSSGHSMDFHRLQSQKEDDVKIEDSIRRNSDVG 600

Query: 601 YVNKRFVSGSDRSSNSSYRSQSHDTESTEKSELMKYAEVLKDTRGAKNRSHRKAASIGS 659
            VNKRFV GSD+ S+SSYRSQ+ DTESTEKSELMKYAEVLKDTRGAKNRSHRK ASIGS
Sbjct: 601 CVNKRFVVGSDQLSDSSYRSQNQDTESTEKSELMKYAEVLKDTRGAKNRSHRKTASIGS 634

BLAST of Clc09G00280 vs. ExPASy TrEMBL
Match: A0A6J1HMC2 (protein CHUP1, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111464273 PE=4 SV=1)

HSP 1 Score: 883.6 bits (2282), Expect = 5.0e-253
Identity = 505/664 (76.05%), Positives = 548/664 (82.53%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDD-QGNKVDLGR 60
           ME K DL+KP+LFKFGVVLAIS A F+ SR R RNKRP L PPSSSSSD+ + NKV+LGR
Sbjct: 2   MEEKTDLVKPVLFKFGVVLAISFASFMYSRFRIRNKRPSLAPPSSSSSDEWRNNKVELGR 61

Query: 61  GRGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSI 120
           GRG +LD Q MK AT ASSN ++ A DAY                 EEMCI K N  DS 
Sbjct: 62  GRGHKLDDQTMKVATAASSNAIILAADAY-----------------EEMCIQKANGDDSS 121

Query: 121 VGLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKT 180
            G   S  N H VD++GLLLPEFQELVK+FD SAANAG SPKKN  A R G+ETPKAYK 
Sbjct: 122 AGF--STGNDHIVDEEGLLLPEFQELVKQFDLSAANAGFSPKKNAGALRLGIETPKAYKR 181

Query: 181 VEDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL 240
           VE D YE EI+HLK KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL
Sbjct: 182 VESDGYEHEIKHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL 241

Query: 241 FNLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQR 300
           F LKIESLQA+NRRLESQV D AKS SDLEAA+  IKFLKKKLR+EAEQNR QI+NLQQR
Sbjct: 242 FTLKIESLQADNRRLESQVSDQAKSASDLEAARTTIKFLKKKLRHEAEQNREQIVNLQQR 301

Query: 301 VVKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLA 360
           V KL DQE K  ES K+ QIKLQ IE+LEKE E+L+K+N RLQ+ENSDLGRRLDATQFLA
Sbjct: 302 VTKLLDQECKINESTKNDQIKLQNIEDLEKEIEELKKANSRLQKENSDLGRRLDATQFLA 361

Query: 361 NSILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRN 420
           NSILEDQEKESLKEER+R AQEN+ LTKEIEQLQAHRCADVEELVYLRWINACLRYELRN
Sbjct: 362 NSILEDQEKESLKEERDRFAQENETLTKEIEQLQAHRCADVEELVYLRWINACLRYELRN 421

Query: 421 FQPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASS 480
           FQP AGKTAARDLSKTLSPKSE KAKKLILEYANTEGIEGK IN+ DFDSDQWSSSQASS
Sbjct: 422 FQPAAGKTAARDLSKTLSPKSEHKAKKLILEYANTEGIEGKSINLTDFDSDQWSSSQASS 481

Query: 481 HTDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDS 540
           HTDPGD D SAVD   TAK SSNK+KF+SKLR LLRGK +QQ+  LL EKSAA+V D DS
Sbjct: 482 HTDPGDLDYSAVDSRLTAKPSSNKIKFMSKLRSLLRGKSNQQSSALLPEKSAAAVGDVDS 541

Query: 541 SRYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDV 600
            RYSSS+STGTNATRA+G G GYTTPS NSS  SMDFHRL  QKEDD KTEDS+RRNSDV
Sbjct: 542 PRYSSSHSTGTNATRADGHGTGYTTPSQNSSRRSMDFHRLNSQKEDDVKTEDSLRRNSDV 601

Query: 601 GYVNKRFVSGSDRSSNSSYRSQSHDTEST---EKSELMKYAEVLKDTRGAKNRSHRKAAS 660
           GY+NKRFVSGSDRSSNS YRS S +TEST   EKSEL+KYAEVLK++RG KN+S RK A 
Sbjct: 602 GYINKRFVSGSDRSSNSLYRSSSQETESTDKSEKSELLKYAEVLKNSRGDKNQSRRKVAP 646

BLAST of Clc09G00280 vs. ExPASy TrEMBL
Match: A0A6J1KFL7 (protein CHUP1, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111494600 PE=4 SV=1)

HSP 1 Score: 872.5 bits (2253), Expect = 1.1e-249
Identity = 502/667 (75.26%), Positives = 543/667 (81.41%), Query Frame = 0

Query: 1   MENKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKRPPLPPPSSSSSDDQG-NKVDLGR 60
           ME K DL+KP+LFKFGVVLAIS A F+ SR R RNKRP L PPSSSSSD+ G NKV+LG 
Sbjct: 2   MEEKTDLVKPVLFKFGVVLAISFASFMYSRFRIRNKRPSLAPPSSSSSDEWGNNKVELGT 61

Query: 61  GRGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVGDSI 120
           GRG +LD Q MK AT ASSN ++ A DAY                 EEMCI K N GD  
Sbjct: 62  GRGHKLDDQTMKVATAASSNAIILAADAY-----------------EEMCIQKAN-GDDS 121

Query: 121 VGLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKT 180
                S  N H VDK+G+LLPEFQELVK+FD SAANAG SPKKN    RSG+ETPKAYK 
Sbjct: 122 NAAGFSTGNDHIVDKEGMLLPEFQELVKQFDLSAANAGFSPKKNAGELRSGIETPKAYKR 181

Query: 181 VEDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL 240
           VE D YE EI+HLK KVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL
Sbjct: 182 VETDGYEHEIKHLKSKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKL 241

Query: 241 FNLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQR 300
           F LKIESLQA+NRRLESQV D AKS SDLEAA+  IKFLKKKLR+EAEQNR QI+NLQQR
Sbjct: 242 FTLKIESLQADNRRLESQVSDQAKSASDLEAARTTIKFLKKKLRHEAEQNREQIVNLQQR 301

Query: 301 VVKLQDQEHK-KESNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLA 360
           V KL DQE+K  ES K+ QIKLQ IE+LEKE E+L+K N RLQ+ENSDLGRRLDATQFLA
Sbjct: 302 VTKLLDQEYKINESTKNDQIKLQNIEDLEKEIEELKKVNSRLQKENSDLGRRLDATQFLA 361

Query: 361 NSILEDQEKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRN 420
           NSILEDQEKESLKEER+  AQENK LTKEIEQLQAHRCADVEELVYLRWINACLRYELRN
Sbjct: 362 NSILEDQEKESLKEERDCFAQENKTLTKEIEQLQAHRCADVEELVYLRWINACLRYELRN 421

Query: 421 FQPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASS 480
           FQP AGKTAARDLSKTLSPKSE KAKKLILEYANTEGIEGK IN+ DFDSDQWSSSQASS
Sbjct: 422 FQPAAGKTAARDLSKTLSPKSEHKAKKLILEYANTEGIEGKSINLTDFDSDQWSSSQASS 481

Query: 481 HTDPGDPDDSAVDFPSTAKTSSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAASVEDSDS 540
           HTDPGD D SAVD   TAK SSNK+KF+SKLR LLRGK +QQ+  LLAEKSAA+V D DS
Sbjct: 482 HTDPGDLDYSAVDSRLTAKPSSNKIKFMSKLRSLLRGKSNQQSSALLAEKSAAAVGDVDS 541

Query: 541 SRYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDSIRRNSDV 600
            RYSSS+STGTN TRA+G   GYTTPS NSS  SMDFHRL  QKEDD KTEDS+RRNS+V
Sbjct: 542 PRYSSSHSTGTNTTRADGHCTGYTTPSQNSSRRSMDFHRLNSQKEDDVKTEDSLRRNSNV 601

Query: 601 GYVNKRFVSGSDRSSNSSYRSQSHDTEST------EKSELMKYAEVLKDTRGAKNRSHRK 660
           GY+NKRFVSGSDRSSNS YRS S +TEST      EKSEL+KYAEVLK+TRG KN+  RK
Sbjct: 602 GYINKRFVSGSDRSSNSLYRSSSQETESTDKSNKSEKSELLKYAEVLKNTRGDKNQPRRK 650

BLAST of Clc09G00280 vs. TAIR 10
Match: AT1G52080.1 (actin binding protein family )

HSP 1 Score: 322.8 bits (826), Expect = 6.5e-88
Identity = 255/669 (38.12%), Positives = 368/669 (55.01%), Query Frame = 0

Query: 3   NKRDLMKPILFKFGVVLAISLAGFLCSRSRQRNKR-----PPLPPPSSSSS-DDQGNKVD 62
           +KRD+   ++ + G  LA+S AGFL +R R+  KR     PPLPP SS +   D  NK  
Sbjct: 6   HKRDI-NLLVLQLGAALAVSFAGFLFARFRKNTKRIGPTLPPLPPHSSDNGYRDYSNKSI 65

Query: 63  LGRGRGPRLDKQGMKAATTASSNVVLFAVDAYKPSGQLIVLSQSLSLAMEEMCIPKVNVG 122
             R  G     +                                                
Sbjct: 66  DRRDEGTEKTDE------------------------------------------------ 125

Query: 123 DSIVGLCPSNKNKHGVDKDGLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKA 182
           ++++G+ P  +     +KD  LLPEF+E  K+ D    +       + E PRS +  P A
Sbjct: 126 ETLIGVSPRRECDLD-EKDVFLLPEFEEEAKKLDLLVCD-------DCETPRSDITAPLA 185

Query: 183 YKTVEDDQYEQEIRHLKCKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNME 242
           + + E+  +E EI  L+  V+ LRERER LE +LLEYY LKEQ+   MEL++RLK+N ME
Sbjct: 186 FPSEEEADHENEINRLRNTVRALRERERCLEDKLLEYYSLKEQQKIAMELRSRLKLNQME 245

Query: 243 AKLFNLKIESLQAENRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNL 302
            K+FN KI+ LQAEN +L+++  +H+K + +L+ AK++++ LKKKL    +Q+  QIL+L
Sbjct: 246 TKVFNFKIKKLQAENEKLKAECFEHSKVLLELDMAKSQVQVLKKKLNINTQQHVAQILSL 305

Query: 303 QQRVVKLQDQEHKKE-SNKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQ 362
           +QRV +LQ++E K    + +A   +Q++ +LE E  +L  +N RLQ EN +L  +L++ Q
Sbjct: 306 KQRVARLQEEEIKAVLPDLEADKMMQRLRDLESEINELTDTNTRLQFENFELSEKLESVQ 365

Query: 363 FLANSILEDQEK-ESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRY 422
            +ANS LE+ E+ E+L+E+  RL  EN+ L K++EQLQ  RC D+E+LVYLRWINACLRY
Sbjct: 366 IIANSKLEEPEEIETLREDCNRLRSENEELKKDVEQLQGDRCTDLEQLVYLRWINACLRY 425

Query: 423 ELRNFQPPAGKTAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSS 482
           ELR +QPPAGKT ARDLS TLSP SEEKAK+LILEYA++E          + D D+WSSS
Sbjct: 426 ELRTYQPPAGKTVARDLSTTLSPTSEEKAKQLILEYAHSED---------NTDYDRWSSS 485

Query: 483 QASSH--TDPGDPDDSAVDFPSTAKT-SSNKVKFISKLRKLLRGKGSQQNLTLLAEKSAA 542
           Q  S   TD    DDS+VD     KT  + K K + KL K+L GK ++      ++K A 
Sbjct: 486 QEESSMITDSMFLDDSSVDTLFATKTKKTGKKKLMHKLMKILHGKDTKD-----SKKRAG 545

Query: 543 SVEDSDSSRYSSSNSTGTNATRAEGQGIGYTTPSHNSSIHSMDFHRLQGQKEDDGKTEDS 602
           S E        SS++TG            ++TP    S HSMDF  L   K+++   ++ 
Sbjct: 546 SSE-------PSSSNTGV-----------HSTPRQLRSTHSMDFQMLMRGKDEEEDFKNH 570

Query: 603 I---RRNSDVGYVNKRFVSGSDRSSNSSYRSQSH--DTESTEKSELMKYAEVLKDTRGAK 656
           I   RR S+              ++ SS   + H  +T+   K EL+K A+ L  +R  K
Sbjct: 606 IVMLRRKSE--------------AAGSSTYGEEHCLETDQNGKKELIKLADALTKSRSTK 570

BLAST of Clc09G00280 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 209.9 bits (533), Expect = 6.1e-54
Identity = 154/389 (39.59%), Positives = 239/389 (61.44%), Query Frame = 0

Query: 135 DGLLLPEFQELVK-EFDFSAANAGLSPKKNVEAPRSGLETPKAYKTVEDDQYEQEIRHLK 194
           D  +LPEF++L+  E ++        P  + +      E  + Y+ VE    + E+  LK
Sbjct: 85  DDDILPEFEDLLSGEIEY--------PLPDDDNNLEKAEKERKYE-VEMAYNDGELERLK 144

Query: 195 CKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFNLKIESLQAENRR 254
             VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI  +E  + N+ I SLQAE ++
Sbjct: 145 QLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKK 204

Query: 255 LESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKKESN 314
           L+ ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E ++  N
Sbjct: 205 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE-EEAMN 264

Query: 315 KDAQI--KLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQ---FLANSILEDQEKE 374
           KD ++  KL+ +++LE +  +L++ N  LQ E  +L  +LD+ +      +++ E  +  
Sbjct: 265 KDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVA 324

Query: 375 SLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAA 434
            ++EE   L   N+ L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +A
Sbjct: 325 KVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISA 384

Query: 435 RDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDS 494
           RDLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D++
Sbjct: 385 RDLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDFDNA 444

Query: 495 AVDFPSTAKTS-SNKVKFISKLRKLLRGK 517
           ++D  ++  +S S K   I KL+K  + K
Sbjct: 445 SMDSSTSRFSSFSKKPGLIQKLKKWGKSK 455

BLAST of Clc09G00280 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 209.9 bits (533), Expect = 6.1e-54
Identity = 154/389 (39.59%), Positives = 239/389 (61.44%), Query Frame = 0

Query: 135 DGLLLPEFQELVK-EFDFSAANAGLSPKKNVEAPRSGLETPKAYKTVEDDQYEQEIRHLK 194
           D  +LPEF++L+  E ++        P  + +      E  + Y+ VE    + E+  LK
Sbjct: 85  DDDILPEFEDLLSGEIEY--------PLPDDDNNLEKAEKERKYE-VEMAYNDGELERLK 144

Query: 195 CKVKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFNLKIESLQAENRR 254
             VK L ERE  LE +LLEYYGLKEQE+ ++ELQ +LKI  +E  + N+ I SLQAE ++
Sbjct: 145 QLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSLQAERKK 204

Query: 255 LESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKKESN 314
           L+ ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E ++  N
Sbjct: 205 LQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE-EEAMN 264

Query: 315 KDAQI--KLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQ---FLANSILEDQEKE 374
           KD ++  KL+ +++LE +  +L++ N  LQ E  +L  +LD+ +      +++ E  +  
Sbjct: 265 KDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESDKVA 324

Query: 375 SLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGKTAA 434
            ++EE   L   N+ L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK +A
Sbjct: 325 KVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGKISA 384

Query: 435 RDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDPDDS 494
           RDLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D D++
Sbjct: 385 RDLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDFDNA 444

Query: 495 AVDFPSTAKTS-SNKVKFISKLRKLLRGK 517
           ++D  ++  +S S K   I KL+K  + K
Sbjct: 445 SMDSSTSRFSSFSKKPGLIQKLKKWGKSK 455

BLAST of Clc09G00280 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 148.7 bits (374), Expect = 1.7e-35
Identity = 106/272 (38.97%), Positives = 171/272 (62.87%), Query Frame = 0

Query: 251 NRRLESQVCDHAKSVSDLEAAKAKIKFLKKKLRYEAEQNRGQILNLQQRVVKLQDQEHKK 310
           ++ L+ ++  +     +LE A+ KIK L+++++ +A Q +GQ+L L+Q V  LQ +E ++
Sbjct: 52  DKNLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE-EE 111

Query: 311 ESNKDAQI--KLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQ---FLANSILEDQ 370
             NKD ++  KL+ +++LE +  +L++ N  LQ E  +L  +LD+ +      +++ E  
Sbjct: 112 AMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTESD 171

Query: 371 EKESLKEERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYELRNFQPPAGK 430
           +   ++EE   L   N+ L K++E LQ +R ++VEELVYLRW+NACLRYELRN+Q PAGK
Sbjct: 172 KVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPAGK 231

Query: 431 TAARDLSKTLSPKSEEKAKKLILEYANTEGIEGKGINVVDFDSDQWSSSQASSHTDPGDP 490
            +ARDLSK LSPKS+ KAK+L+LEYA +E   G+G      D+D  S+    S     D 
Sbjct: 232 ISARDLSKNLSPKSQAKAKRLMLEYAGSE--RGQG------DTDLESNYSQPSSPGSDDF 291

Query: 491 DDSAVDFPSTAKTS-SNKVKFISKLRKLLRGK 517
           D++++D  ++  +S S K   I KL+K  + K
Sbjct: 292 DNASMDSSTSRFSSFSKKPGLIQKLKKWGKSK 314

BLAST of Clc09G00280 vs. TAIR 10
Match: AT2G36650.1 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 50.8 bits (120), Expect = 4.8e-06
Identity = 65/284 (22.89%), Positives = 143/284 (50.35%), Query Frame = 0

Query: 136 GLLLPEFQELVKEFDFSAANAGLSPKKNVEAPRSGLETPKAYKTVEDDQYEQEIRHLKCK 195
           GL+L  F  + +  D    ++  +P+ +    R   E  +A      +Q +QEI  LK +
Sbjct: 30  GLILARF--VSRNEDNEVTSSTSNPESSSSPSRENDEEEEA---ESPNQQKQEILSLKSR 89

Query: 196 VKMLRERERNLEVQLLEYYGLKEQETAVMELQNRLKINNMEAKLFNLKIESLQAENRRLE 255
            + L+ +E  +E+    +  LK+QE  ++E ++ L +   +   F  ++ +++ E++R +
Sbjct: 90  FEELQRKEYEMELHFERFCNLKDQEVMLIEHKSILSLEKAQLDFFRKEVLAMEEEHKRGQ 149

Query: 256 SQVCDHAKSVSDLEAAKAKIKFLK---KKLRYEAEQNRGQILNLQQRVVKLQDQEHKKES 315
           + V  + K V +++  +++   L+   KKLR +++Q   +++N  ++++ ++ +  K   
Sbjct: 150 ALVIVYLKLVGEIQELRSENGLLEGKAKKLRRKSKQ-LYRVVNESRKIIGVEKEFLK--C 209

Query: 316 NKDAQIKLQKIEELEKETEDLRKSNLRLQRENSDLGRRLDATQFLANSILEDQEKESLKE 375
             + + K   ++ELE + +D+      LQ E  +L        F+ +S   +   E +  
Sbjct: 210 VDELETKNNIVKELEGKVKDMEAYVDVLQEEKEEL--------FMKSS---NSTSEMVSV 269

Query: 376 ERERLAQENKALTKEIEQLQAHRCADVEELVYLRWINACLRYEL 417
           E      + + + +E E+L+      V+E++ LRW NACLR+E+
Sbjct: 270 E------DYRRIVEEYEELKKDYANGVKEVINLRWSNACLRHEV 288

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038898688.1  2.0e-301  87.88     protein CHUP1, chloroplastic [Benincasa hispida] >XP_038898689.1 protein CHUP1, ...
KAA0059471.1    2.5e-291  85.13     protein CHUP1 [Cucumis melo var. makuwa] >TYK03852.1 protein CHUP1 [Cucumis melo...
XP_008462405.1  2.8e-290  84.98     PREDICTED: protein CHUP1, chloroplastic [Cucumis melo] >XP_008462406.1 PREDICTED...
XP_004141788.1  9.2e-286  83.92     protein CHUP1, chloroplastic isoform X2 [Cucumis sativus] >KGN45575.1 hypothetic...
XP_031744947.1  1.0e-284  83.79     protein CHUP1, chloroplastic isoform X1 [Cucumis sativus] >XP_031744948.1 protei...

Match Name      E-value   Identity  Description
Q9LI74          8.6e-53   39.59     Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

Match Name      E-value   Identity  Description
A0A5A7V182      1.2e-291  85.13     Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold347G00460 ...
A0A1S3CGW9      1.3e-290  84.98     protein CHUP1, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103500772 PE=4 SV=1
A0A0A0K799      4.4e-286  83.92     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_7G452300 PE=4 SV=1
A0A6J1HMC2      5.0e-253  76.05     protein CHUP1, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC1114...
A0A6J1KFL7      1.1e-249  75.26     protein CHUP1, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111494...

Match Name      E-value   Identity  Description
AT1G52080.1     6.5e-88   38.12     actin binding protein family
AT3G25690.1     6.1e-54   39.59     Hydroxyproline-rich glycoprotein family protein
AT3G25690.2     6.1e-54   39.59     Hydroxyproline-rich glycoprotein family protein
AT3G25690.3     1.7e-35   38.97     Hydroxyproline-rich glycoprotein family protein
AT2G36650.1     4.8e-06   22.89     unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...
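
The Identity and Positives percentages in the alignment headers above are the matched residue counts divided by the alignment length (for example, 553/659 for the Cucumis sativus TrEMBL hit). The following is a minimal Python sketch for recomputing them from a header line; the function name parse_hsp_stats and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool.

```python
import re

# Pattern for an HSP statistics line as printed on this page.
# "Pos\w+" tolerates spelling variants of the "Positives" label.
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Pos\w+\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP header line.

    Hypothetical helper: it assumes the 'Identity = a/b (x%), ... = c/d (y%)'
    layout used in this record.
    """
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": int(ident),
        "positives": int(pos),
        "length": int(ident_len),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100.0 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100.0 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the record, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    header = "Identity = 553/659 (83.92%), Positives = 584/659 (88.62%), Query Frame = 0"
    print(parse_hsp_stats(header))
```

Comparing the recomputed and reported values is a quick sanity check when copying figures from the alignment headers into the summary tables.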
InterPro
Analysis Name: InterPro Annotations of Watermelon (cordophanus) v2
Date Performed: 2022-01-31
IPR Term   IPR Description     Source       Source Term    Source Description            Alignment
None       No IPR available    COILS        Coil           Coil                          coord: 268..346
None       No IPR available    COILS        Coil           Coil                          coord: 360..394
None       No IPR available    COILS        Coil           Coil                          coord: 182..209
None       No IPR available    COILS        Coil           Coil                          coord: 216..257
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 535..575
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 32..71
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 576..597
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 640..659
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 527..632
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 606..621
None       No IPR available    MOBIDB_LITE  mobidb-lite    disorder_prediction           coord: 470..495
None       No IPR available    PANTHER      PTHR31342:SF4  ACTIN BINDING PROTEIN FAMILY  coord: 2..656
IPR040265  Protein CHUP1-like  PANTHER      PTHR31342      PROTEIN CHUP1, CHLOROPLASTIC  coord: 2..656
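
Several of the mobidb-lite rows above overlap (for example, 527..632 contains 535..575), so adding up the row lengths would count some residues more than once. The following is a minimal Python sketch, assuming the "coord: start..end" notation used in this table with 1-based inclusive positions, that merges the ranges before counting. The helper names parse_coords and merge_intervals are hypothetical and not part of InterProScan or the database.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coords(text):
    """Extract (start, end) pairs from text containing 'coord: a..b' entries."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent 1-based inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# The mobidb-lite coordinates listed in the table above.
mobidb_rows = (
    "coord: 535..575 coord: 32..71 coord: 576..597 coord: 640..659 "
    "coord: 527..632 coord: 606..621 coord: 470..495"
)

disordered = merge_intervals(parse_coords(mobidb_rows))
# Number of distinct residues covered by at least one predicted region.
covered = sum(end - start + 1 for start, end in disordered)

if __name__ == "__main__":
    print(disordered, covered)
```

Treating adjacent ranges as one region is a design choice of this sketch; with the coordinates above it does not change the result, because no two merged regions are directly adjacent.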

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Clc09G00280.2  Clc09G00280.2  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
cellular_component  GO:0009707      chloroplast outer membrane
cellular_component  GO:0016021      integral component of membrane