CmUC01G025020 (gene) Watermelon (USVL531) v1

Overview
NameCmUC01G025020
Typegene
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionprotein CHUP1, chloroplastic-like
LocationCmU531Chr01: 36383968 .. 36389618 (-)
RNA-Seq ExpressionCmUC01G025020
SyntenyCmUC01G025020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAAGGAAGATAACCCATCAGAAAACAGAGGGAAACCATCTAGGTTCGCTGATCAAAATCAGAATCCCAAGTGTCTAAATCAGAATAATGCCAAAGGAACTACTGGGAATGGTTCGAAATTGAGGGCTGCTTCTTCCTGGGGTTCTCACATTGTTAAAGGTTTCTCCACAGACAAGAGAACTAAAGCTCAGAGCAATCTTCAACCCAAGAAAGCACCACCACTTGGGAATTCGGATTTAGCTAATCAGAAGGAGAAGTTTGTTCCTTCCCATTCTCGCATCAAGCGTTCTATTATTGGGGATTTAGCTTGTTCGGCCAATCCTGCTCAAGTTCATCCACAGTCTTATCAGACCCATCGCAGACAATCATCTCGTGATTTGTTCGTCGAGCTCGATCAACTCAGAAGTTTGCTAAACGAATCTAAGCAGAGGGAATTCGAACTTCAGAATGAACTTGCAGAACTTAAGCGGAATACTAGAAATTATGAACTTGAAAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGGCCTTACTCAAAAAGTTAGTGTATTGGAAGAAGATAGAAGAGCACTGTCCGAACAATTAGTGGCACTATCATCGATTCCTGAGAAGCAGGAAGAGCCGCAGACTGCGCCTGTAAACGTAGAGGTGGAAGTTGTTGAGTTGAGACGCTTGAATAAGGAACTTCAGCTTCAGAAGAGGAACCTCGCTTGTAGGCTTTCTGCGGTGGAGTCTGAGTTGGCTTGTCTAGCAAAGAATTCCGAGGTAACATTTAGTCATTTACTAGGGTCTTTGTGTTGAATGGTCAAATTTTTTAAATGTATATGAAACATTCCACAACACTCTTTCACAACACCAAAAAGCTAACATTTTACCTTTGTTACTACCAGAGTGAAGCTGTAGCAAAGATCAAAGCAGAGGCATCCTTGCTAAGACACACAAATGAAGATTTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGATTGAATGAGGTTGAGGAACTTGCATACCTTAGGTGGGTTAATTCCTGTTTAAGGAGCGAGCTTCGAAACTCTTGTCCCTCGGCGAATTCTGGTAGCCCATCCAGCCCTCAGCCAATTGAGAGGAATACTGAATCAGTTGGTTCATTATCCAGCCAAAAGGAGAACATGGAGTACAGTAGTGCAAAGAGAATAAATCTAATTAAGAAGTTGAAGAAATGGCCTATTACTGATGAAGACTTGTCTAATTTAGATTGTTCGGATAATAGTCTTTTAGACAAAAATTGGGTTGACATAGAGGAAGGAAGAAGCCCCAGAAGAAGACATTCCATTAGTGGAGCCAAATGCTGGCCTGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAAGAAGCAGATCCTCTATCCTCTCAGAAATATGATTTGGGTGTTATTCAGAGGCCTCATGTTTTGGGAAATTGCCATGAAAATAACAGGAGTTTTGCTTCTTTGGATGTGGAGAAACGAGCATTGCGTATACCAAATCCCCCTCCGAGGCCTTCTTGCTCGATTTCTAGTGAACCTAAAGAAGAAAACACAGCTCAAGTCCCGCCACCTCTGCCACCGCCTCCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGAGAAGCGCCACAGGAATGGTACAGCGAGCTCCACAAGTTGTTGAATTCTACCATTCACTAATGAAGAGAGATTCCAGAAAAGATTCTTCTAATGGAGCCATATGCAATGTTCCAGATGTTTCAAATGTCCGGAGCAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTGTAAGCTCTCAGTCATATTGTCTTCCATTCTGATTGTAGTTCTCTCATCAGAAGACATAATAATATTTCATACTTTTGCAGATAAAGGCAGATATTGAGACACAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCAGTTTATCTGAAGATCGAAGATATTGTGGAATTTGTGAAGTGGCTTGACGATGAACTTTGCTTTCTGGTATTCCTTTGATTCTCTTAACCTTGGTGCATTTGATTTAAACGTTATAGGAAAAAAATGAACTTAAAAGAGAAGCAAGTAAGTGAATGCATGCTGTAGAATGTAGAGTTGATATTTCTATGATAATCATATGAAGGCTGCATAATGAATGGATCATTTTATTGAAGAGTTGAGTAATTTGCTGATCGATTAATTTTCTCAAAGGTGGATGAAAGGGCAGTTCTTAAGCACTTTGATTGGCCAGAGAGAAAGGCTGACACCTTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTCAAAAAAATGGTTGCTTTATCAGAGAAGTAAGGATCTAAAGCTTCCCGCATCCAAATTTGTTTATATTCTTTGAAGTCAGTGATGATAAACAATGTCTTTATGAAATCAGGATGGAGCGTAGTAGTTATAACCTTCTCCGGATGAGGGAATCATTGATGCGAAATTGCAAAGAGTTCCAAATTCCCACAGATTGGATGCTTGACAATGGAATCATAAGCAAGGTAAAACTTCTTAATATATCATAAGCCTCTGGAAAAGCAGTCTGAAGAGTCGACATGACGATGTCTTGTTATTGCTGGATAATTTCTTAATAACAACCATAGTCTATGATGTATTTGATCAAAACCAAAATCAAACTCTATAATGTAGACAAACACAGATGTCCTGTTATTGCTGGATAATTTCTCGATAGCAACCAAAATCTAGATCTATATGAACAAATCAAAATACTCTATGTTGTAGATAAAGTTGGGTTCCGTGAAGTTGGCAAAAATGTACATGAAGAGAGTAGCAATGGAACTTCAATCAAAGGCTTCATCAGAGAAAGATCCCGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGAATCCATCAGGTAGCAATTAGTTAAGCCTTCGTTCACTGCATTTTTCTTCAGGAAAAGACAAAGATTGTTGCTGATTGTGTTTTGCCTTTCGCGGCTACAGTTTGCAGGAGGGTTCGATGCCGAAACAATGCATGCATTTGAGGATCTGAGAAACTTGGCCAACCTTCTGAGCAAAAAGTGAAAGTTATACACTACAGAACAAAGCAAATAGAAATCAGTTGGTTTAGGTTTACTAACTCGGTAAAGCAGCTAGCTACCATGCTGGCAATACATTCATCAGGTCATCTGAACAATATTTTTCTCTTCATGGGAAGTTTTTGGATTTGAAACTTAGAGCTGGGCTGCGGGGGAAGCCAGATCTTTAGCGAACTGTATTTTTGTCCTCTGCCATCCTTCCTCCTCTCTTCATGAAGCAAGCTCCTCCAAAGCACAGGCAGGTCCTACGGAAAGAGAGGTGGCAAACTGCCAGAGGGAGGAAGATCGTTTTAATATATATTAATGTAAATTGTGTGTTAACTGAAAATCAAACGTCTTTGTGGAATGAGTGTTCCTTTGTTAATGAAATTTTTTCTCTTGATTTCTGTGTTTCTAAAGTAAGATTGCTGAGAGTAGAGGGCATGTTTCGAAATGTGTTCAATAAGAAAAAAAAGAGGCTTCAATTTTCAATTTAAGAAATGTATCACAGGTTGTGTGTAACTGGTTAAGATTTAGATAGTTTGAAATCACAAATAAAATCAGTGTGAATTTAGTGTCATGGTTTTAAAGTTAAATTTAGTATCTATAATTTGATAAAGTCTCCCACATAATTTTTATGGTTTAATAAAATAAGAACTAAATTATAATTATAACTACAAGAATTAAATTCTAACTTTCAAGGAGACTCATTGTAATTATAAATCTTTTTTTAGTAAAAGAAACATTTCATTGAAAAACTAAATCAGGAAAAAACCCCAAGTACGTGAAGGTGGTTATAAAAGAGAATGCTAATTATTGAGTAGAAAAGATAAACAGTATAGCTAACCAAATTTGTAATTGAACTTCTGTTCTATATAAATGTTATAAAGTTGTGGTTGTATATCTCGTTAAGATTGAGATTACTTGAAATAACAAAACACAGATATATGGTTTTCAAACCACATCACCTTCAATAATTTCTAAGAACAAAGGTACTCCAGAAAACAACTTAAAACAAGCTCTAAAATATCTACCAGCACTTACTTCTCCTGATATTTTTCAATAAATCCAAAAACAAGTTGAGAAAACACCATTCAGACAACCTTTATGTGATTTTCTTAAAAAGATGAACACTCTCTAACCAAAAAGATCATAAAACAAAGCAACTTAAAGAAAAGAAACGAATTGAGCGATCCAATAAATGACGAGAGGGGAGTGGATTTGGATGATTGATGAAATCTAAAGCAGTGGAGAAAAGGGTAATATGAAAGGACTGGAAGATTTGACATACAGAAAAGACCTAACATTGCCTGTATTTACCCCATCTTGGGCACCAAACTACAAAAGAGACAGACAAACCAGCCACTCTGCCAACCCCATAAACAATTAAGAAAAAAAAAAATTAAAGACAGAGAAGAAATCTCCGCTCCCATCACCCTATTCTGTTTCTTCCAAGTTCTACTTCTACTATCCTTTCTTTTCTTTCTCCCTTGATTTTGGCTAACTTCTGGCTGTAGCCCACCTCCAACCTCTTTACAACCAACATTTTTCTGAAAATAATCTCATTTTGGATGCACATGATTTGGTAATTTCTGATGCTGTTGCAGACGAAAAGACCCATGTCTGGTGCAACAGAGTACCCTTTGTGTGTCAATATATTAAATATCCGGTATCGCGGTTAAATTATTAATTGACCCAAATTTATAAGATTTATTTAGATGATCTAGTGATCCAAAAACTTAAGTTAACAGATGAAACTAAATTTAATATCATATTAAATAATAATAATACGTAGGAGGTTAAACACAAAACCTCGCCTACTATCACTAAAAATTTAAACTAATGAGTGAAGCAAATTTAATTTTATATCATCAACAACCACGTTCATAGTATGTGTTCTAACTTCTAACTCTAACTACTCTCTTCAATTCAATTCAATTCAATTCATCTTCATCCTTTCCTTTCACAGATATCAACTATTTGATTACTACTTATCCTCACATCACACAATACCACAAGCATTATCATACAGTGAAAATGTAATGACTATGGTCGGAATTCATATATCTTAAAAGGAAAAACTAGGGTTGGTAAATTAGGGTGTGGGAAATACGGAAAGAACAGTAAAGTGAGTTAGAATTGAAGTGATGTGACCTGTTAATTGGGGATTTTAGGCGAAATTTTGGAGCCATTCCCATCCCATCGCATCGATCTGCCACGGAAAGCTTAGTAGTCACATCCACCAGGATCCCACGGCAGAGGAAATTTTACTTTCTTTCTTAGATCTCACCGGCGAAAAATGGTGCTTTTATGGCTTGTTCTGTCCCGATTCACGAACCTCCCCTTCCATTTTTCCATTCCCAACTGCTGTTTTCCGTCTGCTTCTGCTTATCTTTTTCGGGAATTCCTCTTAGAATATCTGATCTCCTCCATCAACCAGACCGCCATCATGATACCACTTCAAAATAA

mRNA sequence

ATGAAGGAAGATAACCCATCAGAAAACAGAGGGAAACCATCTAGGTTCGCTGATCAAAATCAGAATCCCAAGTGTCTAAATCAGAATAATGCCAAAGGAACTACTGGGAATGGTTCGAAATTGAGGGCTGCTTCTTCCTGGGGTTCTCACATTGTTAAAGGTTTCTCCACAGACAAGAGAACTAAAGCTCAGAGCAATCTTCAACCCAAGAAAGCACCACCACTTGGGAATTCGGATTTAGCTAATCAGAAGGAGAAGTTTGTTCCTTCCCATTCTCGCATCAAGCGTTCTATTATTGGGGATTTAGCTTGTTCGGCCAATCCTGCTCAAGTTCATCCACAGTCTTATCAGACCCATCGCAGACAATCATCTCGTGATTTGTTCGTCGAGCTCGATCAACTCAGAAGTTTGCTAAACGAATCTAAGCAGAGGGAATTCGAACTTCAGAATGAACTTGCAGAACTTAAGCGGAATACTAGAAATTATGAACTTGAAAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGGCCTTACTCAAAAAGTTAGTGTATTGGAAGAAGATAGAAGAGCACTGTCCGAACAATTAGTGGCACTATCATCGATTCCTGAGAAGCAGGAAGAGCCGCAGACTGCGCCTGTAAACGTAGAGGTGGAAGTTGTTGAGTTGAGACGCTTGAATAAGGAACTTCAGCTTCAGAAGAGGAACCTCGCTTGTAGGCTTTCTGCGGTGGAGTCTGAGTTGGCTTGTCTAGCAAAGAATTCCGAGAGTGAAGCTGTAGCAAAGATCAAAGCAGAGGCATCCTTGCTAAGACACACAAATGAAGATTTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGATTGAATGAGGTTGAGGAACTTGCATACCTTAGGTGGGTTAATTCCTGTTTAAGGAGCGAGCTTCGAAACTCTTGTCCCTCGGCGAATTCTGGTAGCCCATCCAGCCCTCAGCCAATTGAGAGGAATACTGAATCAGTTGGTTCATTATCCAGCCAAAAGGAGAACATGGAGTACAGTAGTGCAAAGAGAATAAATCTAATTAAGAAGTTGAAGAAATGGCCTATTACTGATGAAGACTTGTCTAATTTAGATTGTTCGGATAATAGTCTTTTAGACAAAAATTGGGTTGACATAGAGGAAGGAAGAAGCCCCAGAAGAAGACATTCCATTAGTGGAGCCAAATGCTGGCCTGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAAGAAGCAGATCCTCTATCCTCTCAGAAATATGATTTGGGTGTTATTCAGAGGCCTCATGTTTTGGGAAATTGCCATGAAAATAACAGGAGTTTTGCTTCTTTGGATGTGGAGAAACGAGCATTGCGTATACCAAATCCCCCTCCGAGGCCTTCTTGCTCGATTTCTAGTGAACCTAAAGAAGAAAACACAGCTCAAGTCCCGCCACCTCTGCCACCGCCTCCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGAGAAGCGCCACAGGAATGGTACAGCGAGCTCCACAAGTTGTTGAATTCTACCATTCACTAATGAAGAGAGATTCCAGAAAAGATTCTTCTAATGGAGCCATATGCAATGTTCCAGATGTTTCAAATGTCCGGAGCAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTATAAAGGCAGATATTGAGACACAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCAGTTTATCTGAAGATCGAAGATATTGTGGAATTTGTGAAGTGGCTTGACGATGAACTTTGCTTTCTGGTGGATGAAAGGGCAGTTCTTAAGCACTTTGATTGGCCAGAGAGAAAGGCTGACACCTTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTCAAAAAAATGGTTGCTTTATCAGAGAAGATGGAGCGTAGTAGTTATAACCTTCTCCGGATGAGGGAATCATTGATGCGAAATTGCAAAGAGTTCCAAATTCCCACAGATTGGATGCTTGACAATGGAATCATAAGCAAGATAAAGTTGGGTTCCGTGAAGTTGGCAAAAATGTACATGAAGAGAGTAGCAATGGAACTTCAATCAAAGGCTTCATCAGAGAAAGATCCCGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGAATCCATCAGTTTGCAGGAGGGTTCGATGCCGAAACAATGCATGCATTTGAGGATCTGAGAAACTTGGCCAACCTTCTGAGCAAAAACTGGGCTGCGGGGGAAGCCAGATCTTTAGCGAACTGTATTTTTGTCCTCTGCCATCCTTCCTCCTCTCTTCATGAAGCAAGCTCCTCCAAAGCACAGGCAGGTCCTACGGAAAGAGAGATCTCACCGGCGAAAAATGGTGCTTTTATGGCTTGTTCTGTCCCGATTCACGAACCTCCCCTTCCATTTTTCCATTCCCAACTGCTGTTTTCCGTCTGCTTCTGCTTATCTTTTTCGGGAATTCCTCTTAGAATATCTGATCTCCTCCATCAACCAGACCGCCATCATGATACCACTTCAAAATAA

Coding sequence (CDS)

ATGAAGGAAGATAACCCATCAGAAAACAGAGGGAAACCATCTAGGTTCGCTGATCAAAATCAGAATCCCAAGTGTCTAAATCAGAATAATGCCAAAGGAACTACTGGGAATGGTTCGAAATTGAGGGCTGCTTCTTCCTGGGGTTCTCACATTGTTAAAGGTTTCTCCACAGACAAGAGAACTAAAGCTCAGAGCAATCTTCAACCCAAGAAAGCACCACCACTTGGGAATTCGGATTTAGCTAATCAGAAGGAGAAGTTTGTTCCTTCCCATTCTCGCATCAAGCGTTCTATTATTGGGGATTTAGCTTGTTCGGCCAATCCTGCTCAAGTTCATCCACAGTCTTATCAGACCCATCGCAGACAATCATCTCGTGATTTGTTCGTCGAGCTCGATCAACTCAGAAGTTTGCTAAACGAATCTAAGCAGAGGGAATTCGAACTTCAGAATGAACTTGCAGAACTTAAGCGGAATACTAGAAATTATGAACTTGAAAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGGCCTTACTCAAAAAGTTAGTGTATTGGAAGAAGATAGAAGAGCACTGTCCGAACAATTAGTGGCACTATCATCGATTCCTGAGAAGCAGGAAGAGCCGCAGACTGCGCCTGTAAACGTAGAGGTGGAAGTTGTTGAGTTGAGACGCTTGAATAAGGAACTTCAGCTTCAGAAGAGGAACCTCGCTTGTAGGCTTTCTGCGGTGGAGTCTGAGTTGGCTTGTCTAGCAAAGAATTCCGAGAGTGAAGCTGTAGCAAAGATCAAAGCAGAGGCATCCTTGCTAAGACACACAAATGAAGATTTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGATTGAATGAGGTTGAGGAACTTGCATACCTTAGGTGGGTTAATTCCTGTTTAAGGAGCGAGCTTCGAAACTCTTGTCCCTCGGCGAATTCTGGTAGCCCATCCAGCCCTCAGCCAATTGAGAGGAATACTGAATCAGTTGGTTCATTATCCAGCCAAAAGGAGAACATGGAGTACAGTAGTGCAAAGAGAATAAATCTAATTAAGAAGTTGAAGAAATGGCCTATTACTGATGAAGACTTGTCTAATTTAGATTGTTCGGATAATAGTCTTTTAGACAAAAATTGGGTTGACATAGAGGAAGGAAGAAGCCCCAGAAGAAGACATTCCATTAGTGGAGCCAAATGCTGGCCTGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGCTTTATATGTGCAAAAGAGATGGAAAAAGAAGCAGATCCTCTATCCTCTCAGAAATATGATTTGGGTGTTATTCAGAGGCCTCATGTTTTGGGAAATTGCCATGAAAATAACAGGAGTTTTGCTTCTTTGGATGTGGAGAAACGAGCATTGCGTATACCAAATCCCCCTCCGAGGCCTTCTTGCTCGATTTCTAGTGAACCTAAAGAAGAAAACACAGCTCAAGTCCCGCCACCTCTGCCACCGCCTCCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGAGAAGCGCCACAGGAATGGTACAGCGAGCTCCACAAGTTGTTGAATTCTACCATTCACTAATGAAGAGAGATTCCAGAAAAGATTCTTCTAATGGAGCCATATGCAATGTTCCAGATGTTTCAAATGTCCGGAGCAGCATGATTGGAGAAATTGAGAATCGATCATCTCATTTGCTTGCTATAAAGGCAGATATTGAGACACAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCAGTTTATCTGAAGATCGAAGATATTGTGGAATTTGTGAAGTGGCTTGACGATGAACTTTGCTTTCTGGTGGATGAAAGGGCAGTTCTTAAGCACTTTGATTGGCCAGAGAGAAAGGCTGACACCTTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAGTGTGAAATCTCAGCCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTCAAAAAAATGGTTGCTTTATCAGAGAAGATGGAGCGTAGTAGTTATAACCTTCTCCGGATGAGGGAATCATTGATGCGAAATTGCAAAGAGTTCCAAATTCCCACAGATTGGATGCTTGACAATGGAATCATAAGCAAGATAAAGTTGGGTTCCGTGAAGTTGGCAAAAATGTACATGAAGAGAGTAGCAATGGAACTTCAATCAAAGGCTTCATCAGAGAAAGATCCCGCAATGGATTACATGCTTCTTCAAGGAGTGAGATTTGCCTTTAGAATCCATCAGTTTGCAGGAGGGTTCGATGCCGAAACAATGCATGCATTTGAGGATCTGAGAAACTTGGCCAACCTTCTGAGCAAAAACTGGGCTGCGGGGGAAGCCAGATCTTTAGCGAACTGTATTTTTGTCCTCTGCCATCCTTCCTCCTCTCTTCATGAAGCAAGCTCCTCCAAAGCACAGGCAGGTCCTACGGAAAGAGAGATCTCACCGGCGAAAAATGGTGCTTTTATGGCTTGTTCTGTCCCGATTCACGAACCTCCCCTTCCATTTTTCCATTCCCAACTGCTGTTTTCCGTCTGCTTCTGCTTATCTTTTTCGGGAATTCCTCTTAGAATATCTGATCTCCTCCATCAACCAGACCGCCATCATGATACCACTTCAAAATAA

Protein sequence

MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKRTKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHRRQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQKVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACRLSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKKWPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSCSISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLSKNWAAGEARSLANCIFVLCHPSSSLHEASSSKAQAGPTEREISPAKNGAFMACSVPIHEPPLPFFHSQLLFSVCFCLSFSGIPLRISDLLHQPDRHHDTTSK
Homology
BLAST of CmUC01G025020 vs. NCBI nr
Match: XP_038881875.1 (protein CHUP1, chloroplastic-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1474.5 bits (3816), Expect = 0.0e+00
Identity = 763/786 (97.07%), Postives = 773/786 (98.35%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKG +GNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGNSGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQSNLQPKK PPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQPKKVPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLF+ELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ
Sbjct: 121 RQSSRDLFIELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEE+RRALSEQLV LSSI EKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEERRALSEQLVTLSSISEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAEASLLRH NEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEASLLRHRNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNSC SANSGSPSSPQPIER+ ES+GSLSSQKE MEY+SAKRINL+KKLKK
Sbjct: 301 VNSCLRSELRNSCSSANSGSPSSPQPIERSGESLGSLSSQKEYMEYNSAKRINLVKKLKK 360

Query: 361 WPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420
           WPITDEDLSNLDCSDNSLLDKNWVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI
Sbjct: 361 WPITDEDLSNLDCSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420

Query: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSCS 480
           CAKEMEKEADPLSSQKYDLGVIQRPHV GNCHE NRSF SL+VEKRALRIPNPPPRPSCS
Sbjct: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVFGNCHETNRSFTSLEVEKRALRIPNPPPRPSCS 480

Query: 481 ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540
           ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS
Sbjct: 481 ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540

Query: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600
           SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED
Sbjct: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600

Query: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660
           IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR
Sbjct: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660

Query: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720
           LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK
Sbjct: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720

Query: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780
           LAKMYMKRVAMELQSKA SEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL
Sbjct: 721 LAKMYMKRVAMELQSKALSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780

Query: 781 ANLLSK 787
           ANLL+K
Sbjct: 781 ANLLNK 786

BLAST of CmUC01G025020 vs. NCBI nr
Match: XP_038881874.1 (protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1458.0 bits (3773), Expect = 0.0e+00
Identity = 763/818 (93.28%), Postives = 773/818 (94.50%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKG +GNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGNSGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQSNLQPKK PPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQPKKVPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLF+ELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ
Sbjct: 121 RQSSRDLFIELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEE+RRALSEQLV LSSI EKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEERRALSEQLVTLSSISEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAEASLLRH NEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEASLLRHRNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNSC SANSGSPSSPQPIER+ ES+GSLSSQKE MEY+SAKRINL+KKLKK
Sbjct: 301 VNSCLRSELRNSCSSANSGSPSSPQPIERSGESLGSLSSQKEYMEYNSAKRINLVKKLKK 360

Query: 361 WPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420
           WPITDEDLSNLDCSDNSLLDKNWVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI
Sbjct: 361 WPITDEDLSNLDCSDNSLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420

Query: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSCS 480
           CAKEMEKEADPLSSQKYDLGVIQRPHV GNCHE NRSF SL+VEKRALRIPNPPPRPSCS
Sbjct: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVFGNCHETNRSFTSLEVEKRALRIPNPPPRPSCS 480

Query: 481 ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540
           ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS
Sbjct: 481 ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540

Query: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600
           SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED
Sbjct: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600

Query: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660
           IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR
Sbjct: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660

Query: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720
           LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK
Sbjct: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720

Query: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIH-------------------- 780
           LAKMYMKRVAMELQSKA SEKDPAMDYMLLQGVRFAFRIH                    
Sbjct: 721 LAKMYMKRVAMELQSKALSEKDPAMDYMLLQGVRFAFRIHQVAINQACLRSLHFSSGKHK 780

Query: 781 ------------QFAGGFDAETMHAFEDLRNLANLLSK 787
                       QFAGGFDAETMHAFEDLRNLANLL+K
Sbjct: 781 DCLLIVFCLSRLQFAGGFDAETMHAFEDLRNLANLLNK 818

BLAST of CmUC01G025020 vs. NCBI nr
Match: XP_004134549.1 (protein CHUP1, chloroplastic [Cucumis sativus] >KGN49492.1 hypothetical protein Csa_003596 [Cucumis sativus])

HSP 1 Score: 1454.9 bits (3765), Expect = 0.0e+00
Identity = 752/786 (95.67%), Postives = 767/786 (97.58%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E RGKPSRFADQNQNPKCLNQNNAKG+TGNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSTGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQSNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRRALSEQLV L S+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLPSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNS PSANSGSPSSPQP+ER++E++GSLSSQKE MEYSSAKRINLIKKLKK
Sbjct: 301 VNSCLRSELRNSSPSANSGSPSSPQPVERSSEAIGSLSSQKEYMEYSSAKRINLIKKLKK 360

Query: 361 WPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420
           WPITDEDLSNLDCSDN+LLDKNWVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF+
Sbjct: 361 WPITDEDLSNLDCSDNNLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFM 420

Query: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSCS 480
           CAKEMEK+ DPLSSQKYDLGVIQRPHVLGNCHE NR+FASLDVEKRALRIPNPPPRPSCS
Sbjct: 421 CAKEMEKDVDPLSSQKYDLGVIQRPHVLGNCHETNRNFASLDVEKRALRIPNPPPRPSCS 480

Query: 481 ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540
           ISSEPKEEN AQVPPPLPPPPPPPPLPKF+VRSATGMVQRAPQVVEFYHSLMKRDSRKDS
Sbjct: 481 ISSEPKEENRAQVPPPLPPPPPPPPLPKFSVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540

Query: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600
           SNG ICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED
Sbjct: 541 SNGTICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600

Query: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660
           IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR
Sbjct: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660

Query: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720
           LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK
Sbjct: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720

Query: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780
           LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL
Sbjct: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780

Query: 781 ANLLSK 787
           ANLL+K
Sbjct: 781 ANLLNK 786

BLAST of CmUC01G025020 vs. NCBI nr
Match: XP_008439508.1 (PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo] >KAA0052457.1 protein CHUP1 [Cucumis melo var. makuwa] >TYK13365.1 protein CHUP1 [Cucumis melo var. makuwa])

HSP 1 Score: 1438.7 bits (3723), Expect = 0.0e+00
Identity = 748/787 (95.04%), Postives = 763/787 (96.95%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E RGKPSRFADQNQNPKCLNQNNAKG++GNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSSGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
            K QSNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  AKTQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRRALSEQLV LSS+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLSSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAK-NSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300
           LS+VESELACLAK NSESEAVAK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR
Sbjct: 241 LSSVESELACLAKNNSESEAVAKVKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300

Query: 301 WVNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLK 360
           WVNSCLRSELRNSCPSANSGSPSSPQP+ER++E V SLSSQKE MEYSSAKRINLIKKLK
Sbjct: 301 WVNSCLRSELRNSCPSANSGSPSSPQPVERSSEPVCSLSSQKEYMEYSSAKRINLIKKLK 360

Query: 361 KWPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420
           KWPITDEDLSNLDCSDN+LLDK WVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF
Sbjct: 361 KWPITDEDLSNLDCSDNTLLDKKWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420

Query: 421 ICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSC 480
           +CAKEMEK+ DPLSSQKYDLGVIQRPHVLGN HE NR+FASLDVEKRALRIPNPPPRPSC
Sbjct: 421 MCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNFHETNRNFASLDVEKRALRIPNPPPRPSC 480

Query: 481 SISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540
           SISSEPKEEN AQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD
Sbjct: 481 SISSEPKEENRAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540

Query: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600
           SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE
Sbjct: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600

Query: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660
           DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP
Sbjct: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660

Query: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSV 720
           RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLD+GIISKIKLGSV
Sbjct: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDSGIISKIKLGSV 720

Query: 721 KLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780
           KLAKMYMKRVA ELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN
Sbjct: 721 KLAKMYMKRVATELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780

Query: 781 LANLLSK 787
           LANLL+K
Sbjct: 781 LANLLNK 787

BLAST of CmUC01G025020 vs. NCBI nr
Match: XP_023518667.1 (protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1367.1 bits (3537), Expect = 0.0e+00
Identity = 718/789 (91.00%), Postives = 747/789 (94.68%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP+ENRGKPSRFADQNQ          KG +GNGSKLRAASSWGSHIVKGFSTDK+
Sbjct: 1   MKEDNPAENRGKPSRFADQNQ--------YTKGGSGNGSKLRAASSWGSHIVKGFSTDKK 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQSNLQ KKA PL +S+L NQKEK VPSH+RIKRS+IGDL CS NPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQSKKA-PLTSSNLVNQKEKSVPSHTRIKRSLIGDLTCSPNPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAEL+RNTRN+ELERELEEKKAEL+GLTQ
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELQRNTRNFELERELEEKKAELEGLTQ 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           K  +LEEDRRALSEQLVA SSI EK EEPQTAP+NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KFGLLEEDRRALSEQLVAASSISEKPEEPQTAPLNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNSCPSANS SPSSPQ +ER +E VGSLSSQKE+M+Y++AKRIN IKKLKK
Sbjct: 301 VNSCLRSELRNSCPSANSDSPSSPQAMERTSEPVGSLSSQKEHMDYNNAKRINAIKKLKK 360

Query: 361 WPITDEDLSNLDCSDN--SLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDG 420
           WPITDEDLSNLDCSDN  SLL KNWVD EE RSPRRRHSISGAKCWPEELEPNKRRQSDG
Sbjct: 361 WPITDEDLSNLDCSDNNDSLLGKNWVDTEEERSPRRRHSISGAKCWPEELEPNKRRQSDG 420

Query: 421 FICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPS 480
           FICAKE+EKEADPLSSQKYDLGVIQRPH+L N HE NR+FASLDVEKRALRIPNPPPRPS
Sbjct: 421 FICAKELEKEADPLSSQKYDLGVIQRPHILENSHETNRNFASLDVEKRALRIPNPPPRPS 480

Query: 481 CSISSEPKEENTAQVPPPLPPPPPPPP-LPKFAVRSATGMVQRAPQVVEFYHSLMKRDSR 540
           CSISSEPKEENT +VPPPLPPPPPPPP LPKFA RS+TGMVQRAPQVVEFYHSLMKRDSR
Sbjct: 481 CSISSEPKEENTGRVPPPLPPPPPPPPLLPKFAARSSTGMVQRAPQVVEFYHSLMKRDSR 540

Query: 541 KDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLK 600
           KDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLK
Sbjct: 541 KDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLK 600

Query: 601 IEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKD 660
           IED+V FVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKD
Sbjct: 601 IEDVVAFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKD 660

Query: 661 DPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLG 720
           DPRLPC+IALKKMV LSEKMERSSYNLLRMRESLMRNCKEFQIP DWMLDNGIISKIKLG
Sbjct: 661 DPRLPCEIALKKMVTLSEKMERSSYNLLRMRESLMRNCKEFQIPIDWMLDNGIISKIKLG 720

Query: 721 SVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDL 780
           SVKLAKMYMKRVAMELQSK+SSEKDPAMDYMLLQGVR+AFRIHQFAGGFDAETMHAFEDL
Sbjct: 721 SVKLAKMYMKRVAMELQSKSSSEKDPAMDYMLLQGVRYAFRIHQFAGGFDAETMHAFEDL 780

Query: 781 RNLANLLSK 787
           RNLANLL+K
Sbjct: 781 RNLANLLNK 780

BLAST of CmUC01G025020 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 421.8 bits (1083), Expect = 1.9e-116
Identity = 323/857 (37.69%), Postives = 444/857 (51.81%), Query Frame = 0

Query: 130 ELDQLRSLLNESKQREFELQNELAEL----KRNTRNYELERELEEKKAELDGLTQKVSVL 189
           EL++L+ L+ E ++RE +L+ EL E     ++ +   EL+R+L+ K  E+D L   ++ L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 190 EEDRRALSEQL-----------VALSSIPEKQEEPQ------------------------ 249
           + +R+ L E+L           VA + I E Q + Q                        
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 250 --------------TAPVNVEVEVVELRRLNKELQLQKRNLACRLSAVESELACLAKNSE 309
                          A  ++EV+V+EL+R N+ELQ +KR L+ +L + E+ +A L+  +E
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTE 309

Query: 310 SEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRSELRN-SCPS 369
           S+ VAK++ E + L+H NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN   P+
Sbjct: 310 SDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPA 369

Query: 370 ----------------------------------------ANSGSPSSPQPIERNTESVG 429
                                                   +N   PSSP   + +  S+ 
Sbjct: 370 GKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMD 429

Query: 430 SLSSQKENMEYSSAKRINLIKKLKKWPITDEDLS---------------NLDCSDN---- 489
           S +S+      S +K+  LI+KLKKW  + +D S                L  S N    
Sbjct: 430 SSTSRFS----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQRG 489

Query: 490 ---SLLDKN-----------WVDIEEGRSP----------RRRHSISG------------ 549
              SL+ +N            VD E   +P          +++ S  G            
Sbjct: 490 PLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFHV 549

Query: 550 -AKCWPEELEPNKRRQSDGFICAKEMEK----EADPLSSQKYDLGVIQRPHVL------- 609
            +K     L+       D    A E EK    +AD   ++++   V   P +        
Sbjct: 550 MSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRV 609

Query: 610 -------------GNCHENNRSFAS-----------LDVEKRALRIPNPPPRPSCS---- 669
                           +E+N   AS           +D+EKR  R+P PPPR +      
Sbjct: 610 VVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKST 669

Query: 670 --ISSEPKEENTAQVPPPLPP-----------PPPPPPLPKFAVRSATG--MVQRAPQVV 729
              S+ P        PPP PP           PPPPPP P    R A G   V RAP++V
Sbjct: 670 NLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELV 729

Query: 730 EFYHSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 782
           EFY SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV 
Sbjct: 730 EFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQ 789

BLAST of CmUC01G025020 vs. ExPASy TrEMBL
Match: A0A0A0KMA9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G526260 PE=4 SV=1)

HSP 1 Score: 1454.9 bits (3765), Expect = 0.0e+00
Identity = 752/786 (95.67%), Postives = 767/786 (97.58%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E RGKPSRFADQNQNPKCLNQNNAKG+TGNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSTGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQSNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRRALSEQLV L S+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLPSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNS PSANSGSPSSPQP+ER++E++GSLSSQKE MEYSSAKRINLIKKLKK
Sbjct: 301 VNSCLRSELRNSSPSANSGSPSSPQPVERSSEAIGSLSSQKEYMEYSSAKRINLIKKLKK 360

Query: 361 WPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFI 420
           WPITDEDLSNLDCSDN+LLDKNWVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF+
Sbjct: 361 WPITDEDLSNLDCSDNNLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGFM 420

Query: 421 CAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSCS 480
           CAKEMEK+ DPLSSQKYDLGVIQRPHVLGNCHE NR+FASLDVEKRALRIPNPPPRPSCS
Sbjct: 421 CAKEMEKDVDPLSSQKYDLGVIQRPHVLGNCHETNRNFASLDVEKRALRIPNPPPRPSCS 480

Query: 481 ISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540
           ISSEPKEEN AQVPPPLPPPPPPPPLPKF+VRSATGMVQRAPQVVEFYHSLMKRDSRKDS
Sbjct: 481 ISSEPKEENRAQVPPPLPPPPPPPPLPKFSVRSATGMVQRAPQVVEFYHSLMKRDSRKDS 540

Query: 541 SNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600
           SNG ICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED
Sbjct: 541 SNGTICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 600

Query: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660
           IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR
Sbjct: 601 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 660

Query: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720
           LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK
Sbjct: 661 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 720

Query: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780
           LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL
Sbjct: 721 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 780

Query: 781 ANLLSK 787
           ANLL+K
Sbjct: 781 ANLLNK 786

BLAST of CmUC01G025020 vs. ExPASy TrEMBL
Match: A0A5A7UD87 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009920 PE=4 SV=1)

HSP 1 Score: 1438.7 bits (3723), Expect = 0.0e+00
Identity = 748/787 (95.04%), Postives = 763/787 (96.95%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E RGKPSRFADQNQNPKCLNQNNAKG++GNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSSGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
            K QSNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  AKTQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRRALSEQLV LSS+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLSSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAK-NSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300
           LS+VESELACLAK NSESEAVAK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR
Sbjct: 241 LSSVESELACLAKNNSESEAVAKVKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300

Query: 301 WVNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLK 360
           WVNSCLRSELRNSCPSANSGSPSSPQP+ER++E V SLSSQKE MEYSSAKRINLIKKLK
Sbjct: 301 WVNSCLRSELRNSCPSANSGSPSSPQPVERSSEPVCSLSSQKEYMEYSSAKRINLIKKLK 360

Query: 361 KWPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420
           KWPITDEDLSNLDCSDN+LLDK WVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF
Sbjct: 361 KWPITDEDLSNLDCSDNTLLDKKWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420

Query: 421 ICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSC 480
           +CAKEMEK+ DPLSSQKYDLGVIQRPHVLGN HE NR+FASLDVEKRALRIPNPPPRPSC
Sbjct: 421 MCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNFHETNRNFASLDVEKRALRIPNPPPRPSC 480

Query: 481 SISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540
           SISSEPKEEN AQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD
Sbjct: 481 SISSEPKEENRAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540

Query: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600
           SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE
Sbjct: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600

Query: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660
           DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP
Sbjct: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660

Query: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSV 720
           RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLD+GIISKIKLGSV
Sbjct: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDSGIISKIKLGSV 720

Query: 721 KLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780
           KLAKMYMKRVA ELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN
Sbjct: 721 KLAKMYMKRVATELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780

Query: 781 LANLLSK 787
           LANLL+K
Sbjct: 781 LANLLNK 787

BLAST of CmUC01G025020 vs. ExPASy TrEMBL
Match: A0A1S3AZK1 (protein CHUP1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103484287 PE=4 SV=1)

HSP 1 Score: 1438.7 bits (3723), Expect = 0.0e+00
Identity = 748/787 (95.04%), Postives = 763/787 (96.95%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNP E RGKPSRFADQNQNPKCLNQNNAKG++GNGSKLRAASSWGSHIVKGFSTDKR
Sbjct: 1   MKEDNPLEIRGKPSRFADQNQNPKCLNQNNAKGSSGNGSKLRAASSWGSHIVKGFSTDKR 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
            K QSNLQPKKAPPLGNSDL NQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR
Sbjct: 61  AKTQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKK ELD L +
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKVELDSLAK 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           KVSVLEEDRRALSEQLV LSS+ EKQEE QTAP NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KVSVLEEDRRALSEQLVTLSSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAK-NSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300
           LS+VESELACLAK NSESEAVAK+KAE SLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR
Sbjct: 241 LSSVESELACLAKNNSESEAVAKVKAEVSLLRHTNEDLCKQVEGLQMSRLNEVEELAYLR 300

Query: 301 WVNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLK 360
           WVNSCLRSELRNSCPSANSGSPSSPQP+ER++E V SLSSQKE MEYSSAKRINLIKKLK
Sbjct: 301 WVNSCLRSELRNSCPSANSGSPSSPQPVERSSEPVCSLSSQKEYMEYSSAKRINLIKKLK 360

Query: 361 KWPITDEDLSNLDCSDNSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420
           KWPITDEDLSNLDCSDN+LLDK WVD EEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF
Sbjct: 361 KWPITDEDLSNLDCSDNTLLDKKWVDTEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420

Query: 421 ICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSC 480
           +CAKEMEK+ DPLSSQKYDLGVIQRPHVLGN HE NR+FASLDVEKRALRIPNPPPRPSC
Sbjct: 421 MCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNFHETNRNFASLDVEKRALRIPNPPPRPSC 480

Query: 481 SISSEPKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540
           SISSEPKEEN AQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD
Sbjct: 481 SISSEPKEENRAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRKD 540

Query: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600
           SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE
Sbjct: 541 SSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIE 600

Query: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660
           DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP
Sbjct: 601 DIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDP 660

Query: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSV 720
           RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLD+GIISKIKLGSV
Sbjct: 661 RLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDSGIISKIKLGSV 720

Query: 721 KLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780
           KLAKMYMKRVA ELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN
Sbjct: 721 KLAKMYMKRVATELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRN 780

Query: 781 LANLLSK 787
           LANLL+K
Sbjct: 781 LANLLNK 787

BLAST of CmUC01G025020 vs. ExPASy TrEMBL
Match: A0A6J1KYE4 (protein CHUP1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111497484 PE=4 SV=1)

HSP 1 Score: 1365.1 bits (3532), Expect = 0.0e+00
Identity = 717/788 (90.99%), Postives = 747/788 (94.80%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDN SENRGKPSRFADQNQ          KG +GNGSKLRAASSWGSHIVKGFSTDK+
Sbjct: 1   MKEDNASENRGKPSRFADQNQ--------YTKGASGNGSKLRAASSWGSHIVKGFSTDKK 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQ+NLQ KKA PL NS+L NQKEK VPSH+RIKRS+IGDL CS NPAQVHPQSYQTHR
Sbjct: 61  TKAQTNLQSKKA-PLTNSNLVNQKEKSVPSHTRIKRSLIGDLTCSPNPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAEL+RNTRN+ELERELEEKKAEL+GLTQ
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELQRNTRNFELERELEEKKAELEGLTQ 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           K S+LEEDRRALSEQLVA SSI EK EEPQTAP+NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KFSLLEEDRRALSEQLVAASSITEKPEEPQTAPLNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNSCPSANS SPSSPQ +ER++E VGSLSSQKE+M+Y++AKRIN IKKLKK
Sbjct: 301 VNSCLRSELRNSCPSANSDSPSSPQAMERSSEPVGSLSSQKEHMDYNNAKRINAIKKLKK 360

Query: 361 WPITDEDLSNLDCSD-NSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDGF 420
           WPITDEDLSNLDCSD NSLL KNWVD EE  SPRRRHSISGAKCWPEELEPNKRRQSDGF
Sbjct: 361 WPITDEDLSNLDCSDNNSLLGKNWVDTEEETSPRRRHSISGAKCWPEELEPNKRRQSDGF 420

Query: 421 ICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPSC 480
           +CAKE+EKEADPLSSQKYDLGVIQRPH+L N HE NR+FASLDVEKRALRIPNPPPRPSC
Sbjct: 421 LCAKELEKEADPLSSQKYDLGVIQRPHILENNHETNRNFASLDVEKRALRIPNPPPRPSC 480

Query: 481 SISSEPKEENTAQVPPPLPPPPPPPP-LPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRK 540
           SISSEPKEENT +VPPPLPPPPPPPP LPKFA RS+TGMVQRAPQVVEFYHSLMKRDSRK
Sbjct: 481 SISSEPKEENTGRVPPPLPPPPPPPPLLPKFAARSSTGMVQRAPQVVEFYHSLMKRDSRK 540

Query: 541 DSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKI 600
           DSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKI
Sbjct: 541 DSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKI 600

Query: 601 EDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDD 660
           ED+V FVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDD
Sbjct: 601 EDVVAFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDD 660

Query: 661 PRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGS 720
           PRLPC+IALKKMV LSEKMERSSYNLLRMRESLMRNCKEFQIP DWMLDNGIISKIKLGS
Sbjct: 661 PRLPCEIALKKMVTLSEKMERSSYNLLRMRESLMRNCKEFQIPIDWMLDNGIISKIKLGS 720

Query: 721 VKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLR 780
           VKLAKMYMKRVAMELQSK+SSEKDPAMDYMLLQGVR+AFRIHQFAGGFDAETMHAFEDLR
Sbjct: 721 VKLAKMYMKRVAMELQSKSSSEKDPAMDYMLLQGVRYAFRIHQFAGGFDAETMHAFEDLR 779

Query: 781 NLANLLSK 787
           NLANLL+K
Sbjct: 781 NLANLLNK 779

BLAST of CmUC01G025020 vs. ExPASy TrEMBL
Match: A0A6J1EE76 (protein CHUP1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111433508 PE=4 SV=1)

HSP 1 Score: 1364.4 bits (3530), Expect = 0.0e+00
Identity = 718/789 (91.00%), Postives = 747/789 (94.68%), Query Frame = 0

Query: 1   MKEDNPSENRGKPSRFADQNQNPKCLNQNNAKGTTGNGSKLRAASSWGSHIVKGFSTDKR 60
           MKEDNPSENRGKPSRFADQNQ          KG +GNGSKLRAASSWGSHIVKGFSTDK+
Sbjct: 1   MKEDNPSENRGKPSRFADQNQ--------YTKGGSGNGSKLRAASSWGSHIVKGFSTDKK 60

Query: 61  TKAQSNLQPKKAPPLGNSDLANQKEKFVPSHSRIKRSIIGDLACSANPAQVHPQSYQTHR 120
           TKAQSNLQ KKA PL NS+L NQKEK VPSH+RIKRS+IGDL CS NPAQVHPQSYQTHR
Sbjct: 61  TKAQSNLQSKKA-PLTNSNLVNQKEKSVPSHTRIKRSLIGDLTCSPNPAQVHPQSYQTHR 120

Query: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKAELDGLTQ 180
           RQSSRDLFVELDQLRSLLNESKQREFELQNELAEL+RNTRN+ELERELEEKKAEL+GLTQ
Sbjct: 121 RQSSRDLFVELDQLRSLLNESKQREFELQNELAELQRNTRNFELERELEEKKAELEGLTQ 180

Query: 181 KVSVLEEDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQLQKRNLACR 240
           K S+LEEDRRALSEQLVA SSI EK EEPQTAP+NVEVEVVELRRLNKELQLQKRNLACR
Sbjct: 181 KFSLLEEDRRALSEQLVAASSISEKPEEPQTAPLNVEVEVVELRRLNKELQLQKRNLACR 240

Query: 241 LSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300
           LS+VESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW
Sbjct: 241 LSSVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRW 300

Query: 301 VNSCLRSELRNSCPSANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKK 360
           VNSCLRSELRNSCPSANS SPSSP+ +ER++E V SLSSQKE+M+Y++AKRIN IKKLKK
Sbjct: 301 VNSCLRSELRNSCPSANSDSPSSPRAMERSSEPVESLSSQKEHMDYNNAKRINAIKKLKK 360

Query: 361 WPITDEDLSNLDCSD--NSLLDKNWVDIEEGRSPRRRHSISGAKCWPEELEPNKRRQSDG 420
           WPITDEDLSNLDCSD  NSLL KNWVD EE RSPRRRHSISGAKCWPEELEPNKRRQSDG
Sbjct: 361 WPITDEDLSNLDCSDNNNSLLGKNWVDTEEERSPRRRHSISGAKCWPEELEPNKRRQSDG 420

Query: 421 FICAKEMEKEADPLSSQKYDLGVIQRPHVLGNCHENNRSFASLDVEKRALRIPNPPPRPS 480
           FICAKE+EKEAD LSSQKYDLGVIQRPH+L N HE NR+FASLDVEKRALRIPNPPPRPS
Sbjct: 421 FICAKELEKEADTLSSQKYDLGVIQRPHILENSHETNRNFASLDVEKRALRIPNPPPRPS 480

Query: 481 CSISSEPKEENTAQVPPPLPPPPPPPP-LPKFAVRSATGMVQRAPQVVEFYHSLMKRDSR 540
           CSISSEPKEENT +VPPPLPPPPPPPP LPKFA RS+TGMVQRAPQVVEFYHSLMKRDSR
Sbjct: 481 CSISSEPKEENTGRVPPPLPPPPPPPPLLPKFAARSSTGMVQRAPQVVEFYHSLMKRDSR 540

Query: 541 KDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLK 600
           KDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLK
Sbjct: 541 KDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLK 600

Query: 601 IEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKD 660
           IED+V FVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKD
Sbjct: 601 IEDVVAFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKD 660

Query: 661 DPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLG 720
           DPRLPC+IALKKMV LSEKMERSSYNLLRMRESLMRNCKEFQIP DWMLDNGIISKIKLG
Sbjct: 661 DPRLPCEIALKKMVTLSEKMERSSYNLLRMRESLMRNCKEFQIPIDWMLDNGIISKIKLG 720

Query: 721 SVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDL 780
           SVKLAKMYMKRVAMELQSK+SSEKDPAMDYMLLQGVR+AFRIHQFAGGFDAETMHAFEDL
Sbjct: 721 SVKLAKMYMKRVAMELQSKSSSEKDPAMDYMLLQGVRYAFRIHQFAGGFDAETMHAFEDL 780

Query: 781 RNLANLLSK 787
           RNLANLL+K
Sbjct: 781 RNLANLLNK 780

BLAST of CmUC01G025020 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 421.8 bits (1083), Expect = 1.4e-117
Identity = 323/857 (37.69%), Postives = 444/857 (51.81%), Query Frame = 0

Query: 130 ELDQLRSLLNESKQREFELQNELAEL----KRNTRNYELERELEEKKAELDGLTQKVSVL 189
           EL++L+ L+ E ++RE +L+ EL E     ++ +   EL+R+L+ K  E+D L   ++ L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 190 EEDRRALSEQL-----------VALSSIPEKQEEPQ------------------------ 249
           + +R+ L E+L           VA + I E Q + Q                        
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 250 --------------TAPVNVEVEVVELRRLNKELQLQKRNLACRLSAVESELACLAKNSE 309
                          A  ++EV+V+EL+R N+ELQ +KR L+ +L + E+ +A L+  +E
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTE 309

Query: 310 SEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRSELRN-SCPS 369
           S+ VAK++ E + L+H NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN   P+
Sbjct: 310 SDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPA 369

Query: 370 ----------------------------------------ANSGSPSSPQPIERNTESVG 429
                                                   +N   PSSP   + +  S+ 
Sbjct: 370 GKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMD 429

Query: 430 SLSSQKENMEYSSAKRINLIKKLKKWPITDEDLS---------------NLDCSDN---- 489
           S +S+      S +K+  LI+KLKKW  + +D S                L  S N    
Sbjct: 430 SSTSRFS----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQRG 489

Query: 490 ---SLLDKN-----------WVDIEEGRSP----------RRRHSISG------------ 549
              SL+ +N            VD E   +P          +++ S  G            
Sbjct: 490 PLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFHV 549

Query: 550 -AKCWPEELEPNKRRQSDGFICAKEMEK----EADPLSSQKYDLGVIQRPHVL------- 609
            +K     L+       D    A E EK    +AD   ++++   V   P +        
Sbjct: 550 MSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRV 609

Query: 610 -------------GNCHENNRSFAS-----------LDVEKRALRIPNPPPRPSCS---- 669
                           +E+N   AS           +D+EKR  R+P PPPR +      
Sbjct: 610 VVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKST 669

Query: 670 --ISSEPKEENTAQVPPPLPP-----------PPPPPPLPKFAVRSATG--MVQRAPQVV 729
              S+ P        PPP PP           PPPPPP P    R A G   V RAP++V
Sbjct: 670 NLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELV 729

Query: 730 EFYHSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 782
           EFY SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV 
Sbjct: 730 EFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQ 789

BLAST of CmUC01G025020 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 421.8 bits (1083), Expect = 1.4e-117
Identity = 323/857 (37.69%), Postives = 444/857 (51.81%), Query Frame = 0

Query: 130 ELDQLRSLLNESKQREFELQNELAEL----KRNTRNYELERELEEKKAELDGLTQKVSVL 189
           EL++L+ L+ E ++RE +L+ EL E     ++ +   EL+R+L+ K  E+D L   ++ L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 190 EEDRRALSEQL-----------VALSSIPEKQEEPQ------------------------ 249
           + +R+ L E+L           VA + I E Q + Q                        
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 250 --------------TAPVNVEVEVVELRRLNKELQLQKRNLACRLSAVESELACLAKNSE 309
                          A  ++EV+V+EL+R N+ELQ +KR L+ +L + E+ +A L+  +E
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTE 309

Query: 310 SEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRSELRN-SCPS 369
           S+ VAK++ E + L+H NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN   P+
Sbjct: 310 SDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPA 369

Query: 370 ----------------------------------------ANSGSPSSPQPIERNTESVG 429
                                                   +N   PSSP   + +  S+ 
Sbjct: 370 GKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMD 429

Query: 430 SLSSQKENMEYSSAKRINLIKKLKKWPITDEDLS---------------NLDCSDN---- 489
           S +S+      S +K+  LI+KLKKW  + +D S                L  S N    
Sbjct: 430 SSTSRFS----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQRG 489

Query: 490 ---SLLDKN-----------WVDIEEGRSP----------RRRHSISG------------ 549
              SL+ +N            VD E   +P          +++ S  G            
Sbjct: 490 PLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFHV 549

Query: 550 -AKCWPEELEPNKRRQSDGFICAKEMEK----EADPLSSQKYDLGVIQRPHVL------- 609
            +K     L+       D    A E EK    +AD   ++++   V   P +        
Sbjct: 550 MSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKRV 609

Query: 610 -------------GNCHENNRSFAS-----------LDVEKRALRIPNPPPRPSCS---- 669
                           +E+N   AS           +D+EKR  R+P PPPR +      
Sbjct: 610 VVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKST 669

Query: 670 --ISSEPKEENTAQVPPPLPP-----------PPPPPPLPKFAVRSATG--MVQRAPQVV 729
              S+ P        PPP PP           PPPPPP P    R A G   V RAP++V
Sbjct: 670 NLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELV 729

Query: 730 EFYHSLMKRDSRKDSSNGAICN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 782
           EFY SLMKR+S+K+ +   I +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV 
Sbjct: 730 EFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQ 789

BLAST of CmUC01G025020 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 409.1 bits (1050), Expect = 9.3e-114
Identity = 315/822 (38.32%), Postives = 436/822 (53.04%), Query Frame = 0

Query: 114 QSYQTHRRQSSRDLFVELDQLRSLLNESKQREFEL-QNELAELKRNTRNYELERELEEKK 173
           QS       + ++L  EL Q     N   ++E E+ +N++ EL+R     +++ +  + K
Sbjct: 42  QSVDPDYNLNDKNLQEELSQ-----NGIVRKELEVARNKIKELQR-----QIQLDANQTK 101

Query: 174 AELDGLTQKVSVLE-EDRRALSEQLVALSSIPEKQEEPQTAPVNVEVEVVELRRLNKELQ 233
            +L  L Q VS L+ ++  A+++          + E    A  ++EV+V+EL+R N+ELQ
Sbjct: 102 GQLLLLKQHVSSLQMKEEEAMNKD--------TEVERKLKAVQDLEVQVMELKRKNRELQ 161

Query: 234 LQKRNLACRLSAVESELACLAKNSESEAVAKIKAEASLLRHTNEDLCKQVEGLQMSRLNE 293
            +KR L+ +L + E+ +A L+  +ES+ VAK++ E + L+H NEDL KQVEGLQM+R +E
Sbjct: 162 HEKRELSIKLDSAEARIATLSNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSE 221

Query: 294 VEELAYLRWVNSCLRSELRN-SCPS----------------------------------- 353
           VEEL YLRWVN+CLR ELRN   P+                                   
Sbjct: 222 VEELVYLRWVNACLRYELRNYQTPAGKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQG 281

Query: 354 -----ANSGSPSSPQPIERNTESVGSLSSQKENMEYSSAKRINLIKKLKKWPITDEDLS- 413
                +N   PSSP   + +  S+ S +S+      S +K+  LI+KLKKW  + +D S 
Sbjct: 282 DTDLESNYSQPSSPGSDDFDNASMDSSTSRFS----SFSKKPGLIQKLKKWGKSKDDSSV 341

Query: 414 --------------NLDCSDN-------SLLDKN-----------WVDIEEGRSP----- 473
                          L  S N       SL+ +N            VD E   +P     
Sbjct: 342 QSSPSRSFYGGSPGRLSSSMNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPETPNL 401

Query: 474 -----RRRHSISG-------------AKCWPEELEPNKRRQSDGFICAKEMEK----EAD 533
                +++ S  G             +K     L+       D    A E EK    +AD
Sbjct: 402 PRIRTQQQASSPGEGLNSVAASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKAD 461

Query: 534 PLSSQKYDLGVIQRPHVL--------------------GNCHENNRSFAS---------- 593
              ++++   V   P +                        +E+N   AS          
Sbjct: 462 QARAERFGGNVALPPKLAQLKEKRVVVPSVITATGDQSNESNESNEGKASENAATVTKMK 521

Query: 594 -LDVEKRALRIPNPPPRPSCS------ISSEPKEENTAQVPPPLPP-----------PPP 653
            +D+EKR  R+P PPPR +         S+ P        PPP PP           PPP
Sbjct: 522 LVDIEKRPPRVPRPPPRSAGGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPP 581

Query: 654 PPPLPKFAVRSATG--MVQRAPQVVEFYHSLMKRDSRKDSSNGAICN-VPDVSNVRSSMI 713
           PPP P    R A G   V RAP++VEFY SLMKR+S+K+ +   I +   + S  R++MI
Sbjct: 582 PPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMI 641

Query: 714 GEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDER 773
           GEIENRS+ LLA+KAD+ETQG+FV SL  EV  + +  IED++ FV WLD+EL FLVDER
Sbjct: 642 GEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLVDER 701

Query: 774 AVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMER 782
           AVLKHFDWPE KAD LREAAF Y+DL KLE +++++ DDP L C+ ALKKM  L EK+E+
Sbjct: 702 AVLKHFDWPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLEKVEQ 761

BLAST of CmUC01G025020 vs. TAIR 10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 333.2 bits (853), Expect = 6.5e-91
Identity = 193/361 (53.46%), Postives = 236/361 (65.37%), Query Frame = 0

Query: 450 NCHENNRSFASLDVEKRALRIPNPPPRPSCSIS------SEPKEENTAQVPPPLP----- 509
           N  E   S +   V  R  R+P PPP+ S S+       ++P  + +   PPP P     
Sbjct: 262 NSEELTESSSLSTVRSRVPRVPKPPPKRSISLGDSTENRADPPPQKSIPPPPPPPPPPLL 321

Query: 510 ------------PPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRD---SRKDSSNG 569
                       PPPPPPP P  ++  A+  V+R P+VVEFYHSLM+RD   SR+DS+ G
Sbjct: 322 QQPPPPPSVSKAPPPPPPPPPPKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGG 381

Query: 570 AICNVPDV---SNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLKIED 629
                  +   SN R  MIGEIENRS +LLAIK D+ETQG+F+  LI+EV NA +  IED
Sbjct: 382 GNAAAEAILANSNAR-DMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIED 441

Query: 630 IVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPR 689
           +V FVKWLDDEL +LVDERAVLKHF+WPE+KAD LREAAF Y DLKKL  E S +++DPR
Sbjct: 442 VVPFVKWLDDELSYLVDERAVLKHFEWPEQKADALREAAFCYFDLKKLISEASRFREDPR 501

Query: 690 LPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVK 749
                ALKKM AL EK+E   Y+L RMRES     K FQIP DWML+ GI S+IKL SVK
Sbjct: 502 QSSSSALKKMQALFEKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVK 561

Query: 750 LAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNL 782
           LA  YMKRV+ EL+  A     P  + +++QGVRFAFR+HQFAGGFDAETM AFE+LR+ 
Sbjct: 562 LAMKYMKRVSAELE--AIEGGGPEEEELIVQGVRFAFRVHQFAGGFDAETMKAFEELRDK 619

BLAST of CmUC01G025020 vs. TAIR 10
Match: AT1G48280.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 298.1 bits (762), Expect = 2.3e-80
Identity = 156/314 (49.68%), Postives = 220/314 (70.06%), Query Frame = 0

Query: 469 RIPNPPPRPSCSISSE----PKEENTAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQV 528
           R+P  PP P   +S       ++EN++   PP PPPPPPPP P+   ++A    Q++P V
Sbjct: 228 RLPPTPPLPKFLVSPASSLGKRDENSSPFAPPTPPPPPPPPPPRPLAKAA--RAQKSPPV 287

Query: 529 VEFYHSLMKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 588
            + +  L K+D+ ++ S     N   V++  +S++GEI+NRS+HL+AIKADIET+GEF+N
Sbjct: 288 SQLFQLLNKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIETKGEFIN 347

Query: 589 SLIREVNNAVYLKIEDIVEFVKWLDDELCFLVDERAVLKHFDWPERKADTLREAAFGYRD 648
            LI++V    +  +ED+++FV WLD EL  L DERAVLKHF WPE+KADTL+EAA  YR+
Sbjct: 348 DLIQKVLTTCFSDMEDVMKFVDWLDKELATLADERAVLKHFKWPEKKADTLQEAAVEYRE 407

Query: 649 LKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYNLLRMRESLMRNCKEFQIPTDW 708
           LKKLE E+S+Y DDP +   +ALKKM  L +K E+    L+R+R S MR+ ++F+IP +W
Sbjct: 408 LKKLEKELSSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEW 467

Query: 709 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 768
           MLD+G+I KIK  S+KLAK YM RVA ELQS  + +++   + +LLQGVRFA+R HQFAG
Sbjct: 468 MLDSGMICKIKRASIKLAKTYMNRVANELQSARNLDRESTKEALLLQGVRFAYRTHQFAG 527

Query: 769 GFDAETMHAFEDLR 779
           G D ET+ A E+++
Sbjct: 528 GLDPETLCALEEIK 539

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038881875.10.0e+0097.07protein CHUP1, chloroplastic-like isoform X2 [Benincasa hispida][more]
XP_038881874.10.0e+0093.28protein CHUP1, chloroplastic-like isoform X1 [Benincasa hispida][more]
XP_004134549.10.0e+0095.67protein CHUP1, chloroplastic [Cucumis sativus] >KGN49492.1 hypothetical protein ... [more]
XP_008439508.10.0e+0095.04PREDICTED: protein CHUP1, chloroplastic-like [Cucumis melo] >KAA0052457.1 protei... [more]
XP_023518667.10.0e+0091.00protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Q9LI741.9e-11637.69Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0KMA90.0e+0095.67Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G526260 PE=4 SV=1[more]
A0A5A7UD870.0e+0095.04Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009920... [more]
A0A1S3AZK10.0e+0095.04protein CHUP1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103484287 PE=4 S... [more]
A0A6J1KYE40.0e+0090.99protein CHUP1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111497484 PE... [more]
A0A6J1EE760.0e+0091.00protein CHUP1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111433508 ... [more]
Match NameE-valueIdentityDescription
AT3G25690.11.4e-11737.69Hydroxyproline-rich glycoprotein family protein [more]
AT3G25690.21.4e-11737.69Hydroxyproline-rich glycoprotein family protein [more]
AT3G25690.39.3e-11438.32Hydroxyproline-rich glycoprotein family protein [more]
AT4G18570.16.5e-9153.46Tetratricopeptide repeat (TPR)-like superfamily protein [more]
AT1G48280.12.3e-8049.68hydroxyproline-rich glycoprotein family protein [more]
InterPro
Analysis Name: InterPro Annotations of Watermelon (USVL531) v1
Date Performed: 2022-01-31
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 131..195
NoneNo IPR availableCOILSCoilCoilcoord: 216..247
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..88
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 491..509
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 467..510
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 311..342
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 58..72
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 12..44
NoneNo IPR availablePANTHERPTHR31342:SF41PROTEIN CHUP1, CHLOROPLASTIC-LIKEcoord: 1..786
IPR040265Protein CHUP1-likePANTHERPTHR31342PROTEIN CHUP1, CHLOROPLASTICcoord: 1..786

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmUC01G025020.1CmUC01G025020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
cellular_component GO:0009707 chloroplast outer membrane