CmoCh03G014690 (gene) Cucurbita moschata (Rifu) v1

Overview
Name: CmoCh03G014690
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu) v1)
Description: protein CHUP1, chloroplastic-like
Location: Cmo_Chr03: 10551195 .. 10554333 (-)
RNA-Seq Expression: CmoCh03G014690
Synteny: CmoCh03G014690
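
The location string in the overview can be turned into a genomic span with a few lines of Python. This is a minimal sketch, not part of the database: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the usual GFF convention), which this page does not state.

```python
import re

# Pattern for strings such as "Cmo_Chr03: 10551195 .. 10554333 (-)".
LOCATION_RE = re.compile(
    r"(?P<seqid>\S+):\s*(?P<start>\d+)\s*\.\.\s*(?P<end>\d+)\s*\((?P<strand>[+-])\)"
)


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return (
        match["seqid"],
        int(match["start"]),
        int(match["end"]),
        match["strand"],
    )


seqid, start, end, strand = parse_location("Cmo_Chr03: 10551195 .. 10554333 (-)")

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end - start + 1
print(seqid, strand, span)
```

The `(-)` suffix marks the gene as lying on the reverse strand, so any sequence sliced from the chromosome at these coordinates would need to be reverse-complemented to match the gene sequence listed for this feature.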
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTCCCATTACTCACACACTTTTCTTCTTCTCCACTTCACTCTCCGAACCTTCTTTTCAAACAAAACAAAACAAAACAAAACAAGCCCCGCTCCTTCTTCCACATTTCCTGTTCGTTTTTTCGACAATGCAGCTGACGAACTTGTCGTGAACTCTCTTGCTTCTTGGCTGTTTCTCCACAGGGAGGGATCACTCTTTGTGTCTGCTTTTTTTGTTCATATAGAATGAAGGAGGATAACCCATCTGAAAGCAGAGGGGGGAAACCATCTAGGTTTGCTGATCAGAATCACAATCCCAATCCCAATCCCAATCCCAAGTCTCTAAATCACAATGCCAAAGGAACTACTGGGAATGCTTCCAAATTCAGGGCTGCTTCTTCCTGGGGTTCTAACATTGTCAAAGGTTTCTCCACAGACAAGAGAACTAAAACCACTCTCCAACCCAAGAAACCACCTCCACTTGCTACTTCGGATTTCGCATATCACAAGGACAAGCTCCCTCCTTCCCAGTCTCGCATCAAGCGTTCTCTCATTGGGGATTCACCCTGCTCTCCAAATCCTGCTCAAGTTCATCCACACTCTTATCACACCCACCGCAGACAGTCTTCTCGGGACTTGTTCGTCGAGCTCGATCAACTCAGAACTTTGCTCAACGAATCTAAGAACAGGGAATTCGAACTTCAGAACGAACTTGCAGAACTTAAGCGGAATACTACAAATTATGAACTGGAGAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGCTCTTACTCAGAAACTGAATCTATTGGAAGAAGATAGAAGATCCCTGTCCCAGCAATTGGTAGCTTCATCCTCAATTTCTGAGAAGCAAGAAGAGCCGCAGACCGCACCTCTAAACATAGAAGTGGAAGTTGTTGAGTTGAGACGCTTGAACAAGGAACTCCAGCTCCAGAAGAGGAATCTCGCTTGCAGGCTTTCTTCTGTGGAGTCCGAGCTGGCTTGTGTAGCAAAAAGTTCCGAGGTAACACTTGGTCATTTGTTAGGATCTTTGTGTTTCACTGTATACCACAACCCAACATGCTTATGCTTGTGTGTTTGTTTTGTTACTACTAGAGTGAAGGTGTGGCAAAAATCAAAGCAGAGGCATCTTTGCTGAGACAAACAAATGAAGATTTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGATTAAACGAGGTTGAGGAGCTTGCATACCTTAGGTGGGTCAATTCCTGTTTAAGGAACGAGCTTCGCAATTCCTGCCCCTCCGCCAATTCTAACAGCCCATCCAGCCCTCAGACAATCGACAGGACAAGTGAAGCTGTTGGTTCATTATCCAGCCAAAAGGAGCACATGGTGGATTGCAACAGTGCGAAGAGAATAAATCTAATTAAAAAGTTGAAGAAATGGCCTATTACTGATGAGGAATTGTCTAATTTAGATTGCTCAGATAACAGTGCTGTAGACAAAAATTGGGTTGATACAGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGTGGAGCCAAGTGCTGGGGCGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGGTTTATATGTGCAAAAGAGATGGATAAAGAAGCAGATCCTCTTTCCTCTCAGACTAACAGGAGTTCTGCCTCCTTAGATGTGGAGAAACGAGCCTTGCGTGTACCCAATCCCCCTCCGAGGCCTTCTTGCTCAATTTCTAATGAACCCAAAGATGAAAACACAGCTATAATCCCACCACCTCTGCCACCTCCTCCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGAGGAGCGCCACGGGGATGGTGCAACGAGCTCCACAAGTTGTTGAATTCTACCATTCACTTATGAAAAGAGATTCAAGAAGAGATTCATCCCATGGAGCCATGTGCAATGTTCCAGATGTTTCAAATGTCCGGAGCAGCATGATTGGAGAGATTGAGAATCGATCATCTCATTTGCTTGCCGTAAGTTCTCA
CTCATATAATTCATCTTCCATTCTTCTCATTTGGTGCTTGCCGTAAATAAGGTTTCATACTTGTGCAGATAAAGGCAGATATTGAGACCCAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCGGTGTACTTGAACATTGAAGATGTTGTGGCATTCGTGAAGTGGCTTGATGATGAACTTTGCTTTCTGGTACTCTCTCTTAACTTTGGTACACTTGATTTAAACAAAACATTACAGGAAGAATTTGAAAGAATGCATGCTGTAATATTGTGGAAGGTGGACGAAAGGGCAGTTCTGAAGCACTTCGATTGGCCAGAGAGAAAGGCTGACACATTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAATGTGAAATCTCAGGCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTGAAAAAAATGGTTTCTTTATCAGAGAAGTAAGGACCTAAAGCTTCCCATATCTGAATTTGGTTGTTTTCCTTGACCTCAATTATGCTAAACAATGTGTTAATGCCAATCAGAATGGAACGTAGTATTTACAATCTTCTCCGGATGAGAGAATCATTGATGAGGCATTGCAAAGAATTCCAAATTCCCACAGATTGGATGCTCGACAACGGGATCATAAGCAAGGTAAAACTTCATGAAATATCATCATAAGTCTGTAAGAAAAGAAGAAGATGTCATGTTATTGGTAGATCATTTATTAATAACAACCATAGTCTATTTGAACGATGGGTTCAAGAACACAGAAGTTGGATAAATCTTAAAAACAACCAAAATTGAACTCTTATGTTGTAGATAAAGTTGGGTTCGGTGAAGTTGGCAAAGATGTACATGAAGAGAGTAGCTATGGAACTTCAGTCAAAGGCGTCATCAGAGAAAGATCCCGCAATGGACTACATGCTTCTTCAAGGAGTAAGATTTGCGTTCAGAATTCATCAGGTACCGATTAATTAATCATTTTTGTTCAGGAAAACAAAAAGAGATTGTTGCTGATTGTGTTTTGCCTTTTCCCGGCCACAGTTTGCAGGAGGGTTGGATGCTGATACAATGCATGCATTTGAGGATCTTCGGAACTTGGCCAACCTTCTGAACAAAAAGTGA

mRNA sequence

CTTCCCATTACTCACACACTTTTCTTCTTCTCCACTTCACTCTCCGAACCTTCTTTTCAAACAAAACAAAACAAAACAAAACAAGCCCCGCTCCTTCTTCCACATTTCCTGTTCGTTTTTTCGACAATGCAGCTGACGAACTTGTCGTGAACTCTCTTGCTTCTTGGCTGTTTCTCCACAGGGAGGGATCACTCTTTGTGTCTGCTTTTTTTGTTCATATAGAATGAAGGAGGATAACCCATCTGAAAGCAGAGGGGGGAAACCATCTAGGTTTGCTGATCAGAATCACAATCCCAATCCCAATCCCAATCCCAAGTCTCTAAATCACAATGCCAAAGGAACTACTGGGAATGCTTCCAAATTCAGGGCTGCTTCTTCCTGGGGTTCTAACATTGTCAAAGGTTTCTCCACAGACAAGAGAACTAAAACCACTCTCCAACCCAAGAAACCACCTCCACTTGCTACTTCGGATTTCGCATATCACAAGGACAAGCTCCCTCCTTCCCAGTCTCGCATCAAGCGTTCTCTCATTGGGGATTCACCCTGCTCTCCAAATCCTGCTCAAGTTCATCCACACTCTTATCACACCCACCGCAGACAGTCTTCTCGGGACTTGTTCGTCGAGCTCGATCAACTCAGAACTTTGCTCAACGAATCTAAGAACAGGGAATTCGAACTTCAGAACGAACTTGCAGAACTTAAGCGGAATACTACAAATTATGAACTGGAGAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGCTCTTACTCAGAAACTGAATCTATTGGAAGAAGATAGAAGATCCCTGTCCCAGCAATTGGTAGCTTCATCCTCAATTTCTGAGAAGCAAGAAGAGCCGCAGACCGCACCTCTAAACATAGAAGTGGAAGTTGTTGAGTTGAGACGCTTGAACAAGGAACTCCAGCTCCAGAAGAGGAATCTCGCTTGCAGGCTTTCTTCTGTGGAGTCCGAGCTGGCTTGTGTAGCAAAAAGTTCCGAGAGTGAAGGTGTGGCAAAAATCAAAGCAGAGGCATCTTTGCTGAGACAAACAAATGAAGATTTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGATTAAACGAGGTTGAGGAGCTTGCATACCTTAGGTGGGTCAATTCCTGTTTAAGGAACGAGCTTCGCAATTCCTGCCCCTCCGCCAATTCTAACAGCCCATCCAGCCCTCAGACAATCGACAGGACAAGTGAAGCTGTTGGTTCATTATCCAGCCAAAAGGAGCACATGGTGGATTGCAACAGTGCGAAGAGAATAAATCTAATTAAAAAGTTGAAGAAATGGCCTATTACTGATGAGGAATTGTCTAATTTAGATTGCTCAGATAACAGTGCTGTAGACAAAAATTGGGTTGATACAGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGTGGAGCCAAGTGCTGGGGCGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGGTTTATATGTGCAAAAGAGATGGATAAAGAAGCAGATCCTCTTTCCTCTCAGACTAACAGGAGTTCTGCCTCCTTAGATGTGGAGAAACGAGCCTTGCGTGTACCCAATCCCCCTCCGAGGCCTTCTTGCTCAATTTCTAATGAACCCAAAGATGAAAACACAGCTATAATCCCACCACCTCTGCCACCTCCTCCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGAGGAGCGCCACGGGGATGGTGCAACGAGCTCCACAAGTTGTTGAATTCTACCATTCACTTATGAAAAGAGATTCAAGAAGAGATTCATCCCATGGAGCCATGTGCAATGTTCCAGATGTTTCAAATGTCCGGAGCAGCATGATTGGAGAGATTGAGAATCGATCATCTCATTTGCTTGCCATAAAGGCAGATATTGAGACCCAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCGGTGTACTTGAACATTGAAGATGTTGTGGCATTCGTGA
AGTGGCTTGATGATGAACTTTGCTTTCTGGTACTCTCTCTTAACTTTGGTACACTTGATTTAAACAAAACATTACAGGAAGAATTTGAAAGAATGCATGCTGTAATATTGTGGAAGGTGGACGAAAGGGCAGTTCTGAAGCACTTCGATTGGCCAGAGAGAAAGGCTGACACATTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAATGTGAAATCTCAGGCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTGAAAAAAATGGTTTCTTTATCAGAGAAAATGGAACGTAGTATTTACAATCTTCTCCGGATGAGAGAATCATTGATGAGGCATTGCAAAGAATTCCAAATTCCCACAGATTGGATGCTCGACAACGGGATCATAAGCAAGATAAAGTTGGGTTCGGTGAAGTTGGCAAAGATGTACATGAAGAGAGTAGCTATGGAACTTCAGTCAAAGGCGTCATCAGAGAAAGATCCCGCAATGGACTACATGCTTCTTCAAGGAGTAAGATTTGCGTTCAGAATTCATCAGTTTGCAGGAGGGTTGGATGCTGATACAATGCATGCATTTGAGGATCTTCGGAACTTGGCCAACCTTCTGAACAAAAAGTGA

Coding sequence (CDS)

ATGAAGGAGGATAACCCATCTGAAAGCAGAGGGGGGAAACCATCTAGGTTTGCTGATCAGAATCACAATCCCAATCCCAATCCCAATCCCAAGTCTCTAAATCACAATGCCAAAGGAACTACTGGGAATGCTTCCAAATTCAGGGCTGCTTCTTCCTGGGGTTCTAACATTGTCAAAGGTTTCTCCACAGACAAGAGAACTAAAACCACTCTCCAACCCAAGAAACCACCTCCACTTGCTACTTCGGATTTCGCATATCACAAGGACAAGCTCCCTCCTTCCCAGTCTCGCATCAAGCGTTCTCTCATTGGGGATTCACCCTGCTCTCCAAATCCTGCTCAAGTTCATCCACACTCTTATCACACCCACCGCAGACAGTCTTCTCGGGACTTGTTCGTCGAGCTCGATCAACTCAGAACTTTGCTCAACGAATCTAAGAACAGGGAATTCGAACTTCAGAACGAACTTGCAGAACTTAAGCGGAATACTACAAATTATGAACTGGAGAGGGAACTTGAGGAAAAGAAAGCCGAATTAGACGCTCTTACTCAGAAACTGAATCTATTGGAAGAAGATAGAAGATCCCTGTCCCAGCAATTGGTAGCTTCATCCTCAATTTCTGAGAAGCAAGAAGAGCCGCAGACCGCACCTCTAAACATAGAAGTGGAAGTTGTTGAGTTGAGACGCTTGAACAAGGAACTCCAGCTCCAGAAGAGGAATCTCGCTTGCAGGCTTTCTTCTGTGGAGTCCGAGCTGGCTTGTGTAGCAAAAAGTTCCGAGAGTGAAGGTGTGGCAAAAATCAAAGCAGAGGCATCTTTGCTGAGACAAACAAATGAAGATTTGTGCAAGCAAGTGGAAGGTCTGCAGATGAGCAGATTAAACGAGGTTGAGGAGCTTGCATACCTTAGGTGGGTCAATTCCTGTTTAAGGAACGAGCTTCGCAATTCCTGCCCCTCCGCCAATTCTAACAGCCCATCCAGCCCTCAGACAATCGACAGGACAAGTGAAGCTGTTGGTTCATTATCCAGCCAAAAGGAGCACATGGTGGATTGCAACAGTGCGAAGAGAATAAATCTAATTAAAAAGTTGAAGAAATGGCCTATTACTGATGAGGAATTGTCTAATTTAGATTGCTCAGATAACAGTGCTGTAGACAAAAATTGGGTTGATACAGAGGAAGGAAGAAGCCCCAGAAGAAGACACTCCATTAGTGGAGCCAAGTGCTGGGGCGAAGAATTGGAGCCAAACAAGAGGAGGCAATCTGATGGGTTTATATGTGCAAAAGAGATGGATAAAGAAGCAGATCCTCTTTCCTCTCAGACTAACAGGAGTTCTGCCTCCTTAGATGTGGAGAAACGAGCCTTGCGTGTACCCAATCCCCCTCCGAGGCCTTCTTGCTCAATTTCTAATGAACCCAAAGATGAAAACACAGCTATAATCCCACCACCTCTGCCACCTCCTCCTCCGCCCCCTCCTCTTCCAAAGTTCGCTGTGAGGAGCGCCACGGGGATGGTGCAACGAGCTCCACAAGTTGTTGAATTCTACCATTCACTTATGAAAAGAGATTCAAGAAGAGATTCATCCCATGGAGCCATGTGCAATGTTCCAGATGTTTCAAATGTCCGGAGCAGCATGATTGGAGAGATTGAGAATCGATCATCTCATTTGCTTGCCATAAAGGCAGATATTGAGACCCAGGGAGAGTTTGTAAATTCACTGATAAGAGAGGTCAACAATGCGGTGTACTTGAACATTGAAGATGTTGTGGCATTCGTGAAGTGGCTTGATGATGAACTTTGCTTTCTGGTACTCTCTCTTAACTTTGGTACACTTGATTTAAACAAAACATTACAGGAAGAATTTGAAAGAATGCATGCTGTAATATTGTGGAAGGTGGACGAAAGGGCAGTTCTGAAGCACTTCGATTGGCCAGAGAGAAAGGCTGACACATTGCGAGAAGCAGCCTTTGGGTACAGAGATCTAAAGAAATTGGAATGTGA
AATCTCAGGCTACAAAGATGATCCCAGATTGCCTTGTGACATTGCTCTGAAAAAAATGGTTTCTTTATCAGAGAAAATGGAACGTAGTATTTACAATCTTCTCCGGATGAGAGAATCATTGATGAGGCATTGCAAAGAATTCCAAATTCCCACAGATTGGATGCTCGACAACGGGATCATAAGCAAGATAAAGTTGGGTTCGGTGAAGTTGGCAAAGATGTACATGAAGAGAGTAGCTATGGAACTTCAGTCAAAGGCGTCATCAGAGAAAGATCCCGCAATGGACTACATGCTTCTTCAAGGAGTAAGATTTGCGTTCAGAATTCATCAGTTTGCAGGAGGGTTGGATGCTGATACAATGCATGCATTTGAGGATCTTCGGAACTTGGCCAACCTTCTGAACAAAAAGTGA

Protein sequence

MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKGFSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSYHTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELDALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRNLACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLIKKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQSDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAIIPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGLDADTMHAFEDLRNLANLLNKK
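
The relationship between the coding sequence and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a small sketch under that assumption; the function name `translate` is illustrative, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon terminates translation
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS listed above.
cds_start = "ATGAAGGAGGATAACCCATCTGAAAGC"
# Last four codons of the CDS, including the TGA stop codon.
cds_end = "AACAAAAAGTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the complete CDS (with its line break removed) to the same function would be the natural way to compare the result against the full protein sequence.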
Homology
BLAST of CmoCh03G014690 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 1.7e-106
Identity = 314/887 (35.40%), Positives = 431/887 (48.59%), Query Frame = 0

Query: 134 ELDQLRTLLNESKNREFELQNELAE---LKRNTTN-YELERELEEKKAELDALTQKLNLL 193
           EL++L+ L+ E + RE +L+ EL E   LK   ++  EL+R+L+ K  E+D L   +N L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 194 EEDRRSLSQQL-----------VASSSISEKQEEPQ------------------------ 253
           + +R+ L ++L           VA + I E Q + Q                        
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 254 --------------TAPLNIEVEVVELRRLNKELQLQKRNLACRLSSVESELACVAKSSE 313
                          A  ++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A ++  +E
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTE 309

Query: 314 SEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRN-SCPS 373
           S+ VAK++ E + L+  NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN   P+
Sbjct: 310 SDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPA 369

Query: 374 ----------------------------------------ANSNSPSSPQTIDRTSEAVG 433
                                                   +N + PSSP + D  + ++ 
Sbjct: 370 GKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMD 429

Query: 434 SLSSQKEHMVDCNSAKRINLIKKLKKWPITDEELS---------------NLDCSDN--- 493
           S +S+       + +K+  LI+KLKKW  + ++ S                L  S N   
Sbjct: 430 SSTSRFS-----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQR 489

Query: 494 --------------------SAVDKNWVDTEEGRSPRRRHSISGAKCWGEELE------- 553
                                 VD+    T E  +  R  +   A   GE L        
Sbjct: 490 GPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFH 549

Query: 554 ---------------PNKRRQSDGFICAKEMDKEADPL---------------------- 613
                            K R        K +  +AD                        
Sbjct: 550 VMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKR 609

Query: 614 ----------------SSQTNRSSAS-----------LDVEKRALRVPNPPPRPSCS--- 673
                           S+++N   AS           +D+EKR  RVP PPPR +     
Sbjct: 610 VVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKS 669

Query: 674 ---ISNEPKDENTAIIPPPLPP-----------PPPPPPLPKFAVRSATG--MVQRAPQV 733
               S  P        PPP PP           PPPPPP P    R A G   V RAP++
Sbjct: 670 TNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPEL 729

Query: 734 VEFYHSLMKRDSRRDSSHGAMCN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFV 793
           VEFY SLMKR+S+++ +   + +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV
Sbjct: 730 VEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFV 789

Query: 794 NSLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILW 798
            SL  EV  + + +IED++AFV WLD+EL FL                            
Sbjct: 790 QSLATEVRASSFTDIEDLLAFVSWLDEELSFL---------------------------- 849
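
The percentages in an HSP summary line are simply the identical (or positive-scoring) positions divided by the alignment length. As a rough sketch, the summary line of the hit above can be parsed and the figures recomputed; the regular expression and the helper name `check_hsp` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches the counts and percentages in an HSP summary line.
# "Pos\w+" tolerates spelling variants of the second label.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp(line):
    """Return (recomputed, reported) percentage pairs for identity and positives."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"no HSP summary found in: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    identity = (round(100 * int(ident) / int(ident_len), 2), float(ident_pct))
    positives = (round(100 * int(pos) / int(pos_len), 2), float(pos_pct))
    return identity, positives


summary = "Identity = 314/887 (35.40%), Positives = 431/887 (48.59%), Query Frame = 0"
identity, positives = check_hsp(summary)
print(identity, positives)
```

The alignment length (887) is larger than the aligned stretch of either protein because gap columns count toward it, which is why a gapped alignment like this one reports a lower identity than the conserved blocks alone would suggest.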

BLAST of CmoCh03G014690 vs. ExPASy TrEMBL
Match: A0A6J1ECG5 (protein CHUP1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111433055 PE=4 SV=1)

HSP 1 Score: 1477.6 bits (3824), Expect = 0.0e+00
Identity = 774/803 (96.39%), Positives = 774/803 (96.39%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG
Sbjct: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY
Sbjct: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
           HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD
Sbjct: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI
Sbjct: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ
Sbjct: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII
Sbjct: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN
Sbjct: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC
Sbjct: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 601 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 660

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 774

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 781 GLDADTMHAFEDLRNLANLLNKK 774

BLAST of CmoCh03G014690 vs. ExPASy TrEMBL
Match: A0A6J1ITW4 (protein CHUP1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111478380 PE=4 SV=1)

HSP 1 Score: 1426.8 bits (3692), Expect = 0.0e+00
Identity = 750/803 (93.40%), Positives = 760/803 (94.65%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPS++RGGKPSRFADQNH  NPNPNPKSLNHNAKGTTGNASKFRAASSWGS+IVKG
Sbjct: 1   MKEDNPSDNRGGKPSRFADQNH--NPNPNPKSLNHNAKGTTGNASKFRAASSWGSHIVKG 60

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFA HKDKLPPSQSRIKRSLIGDSPCSPNPAQ+HPHSY
Sbjct: 61  FSTDKRTKTTLQPKKPPPLATSDFANHKDKLPPSQSRIKRSLIGDSPCSPNPAQLHPHSY 120

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
            THRRQSSRDLF+ELDQLRTLLNESK+REFELQNEL ELKRNT NYELERELEEKKAELD
Sbjct: 121 QTHRRQSSRDLFLELDQLRTLLNESKHREFELQNELTELKRNTRNYELERELEEKKAELD 180

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALT+KLNLLEEDRRSLSQQLVASSSISEKQEE QTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 181 ALTRKLNLLEEDRRSLSQQLVASSSISEKQEESQTAPLNIEVEVVELRRLNKELQLQKRN 240

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEH VDCNSAKRINLI
Sbjct: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHTVDCNSAKRINLI 360

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLDCSDNS V+KNWVD EEGRSPRRRHSISGAKCW EELEPNKRRQ
Sbjct: 361 KKLKKWPITDEELSNLDCSDNSLVEKNWVDAEEGRSPRRRHSISGAKCWAEELEPNKRRQ 420

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRS  SLDVEKRALRVPNPPPRPSCSISNEPKDENTA +
Sbjct: 421 SDGFICAKEMDKEADPLSSQTNRSFVSLDVEKRALRVPNPPPRPSCSISNEPKDENTAQV 480

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN
Sbjct: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC
Sbjct: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 601 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 660

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           +KKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 661 VKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 772

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 781 GLDADTMHAFEDLRNLANLLNKK 772

BLAST of CmoCh03G014690 vs. ExPASy TrEMBL
Match: A0A0A0KMA9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G526260 PE=4 SV=1)

HSP 1 Score: 1246.1 bits (3223), Expect = 0.0e+00
Identity = 671/824 (81.43%), Positives = 716/824 (86.89%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLN-HNAKGTTGNASKFRAASSWGSNIVK 60
           MKEDNP E R GKPSRFADQN       NPK LN +NAKG+TGN SK RAASSWGS+IVK
Sbjct: 1   MKEDNPLEIR-GKPSRFADQNQ------NPKCLNQNNAKGSTGNGSKLRAASSWGSHIVK 60

Query: 61  GFSTDKRTK--TTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHP 120
           GFSTDKRTK  + LQPKK PPL  SD    K+K  PS SRIKRS+IGD  CS NPAQVHP
Sbjct: 61  GFSTDKRTKAQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHP 120

Query: 121 HSYHTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKA 180
            SY THRRQSSRDLFVELDQLR+LLNESK REFELQNELAELKRNT NYELERELEEKK 
Sbjct: 121 QSYQTHRRQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKV 180

Query: 181 ELDALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQ 240
           ELD+L +K+++LEEDRR+LS+QLV   S+SEKQEE QTAP N+EVEVVELRRLNKELQLQ
Sbjct: 181 ELDSLAKKVSVLEEDRRALSEQLVTLPSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQ 240

Query: 241 KRNLACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVE 300
           KRNLACRLSSVESELAC+AK+SESE VAKIKAE SLLR TNEDLCKQVEGLQMSRLNEVE
Sbjct: 241 KRNLACRLSSVESELACLAKNSESEAVAKIKAEVSLLRHTNEDLCKQVEGLQMSRLNEVE 300

Query: 301 ELAYLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRI 360
           ELAYLRWVNSCLR+ELRNS PSANS SPSSPQ ++R+SEA+GSLSSQKE+M + +SAKRI
Sbjct: 301 ELAYLRWVNSCLRSELRNSSPSANSGSPSSPQPVERSSEAIGSLSSQKEYM-EYSSAKRI 360

Query: 361 NLIKKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNK 420
           NLIKKLKKWPITDE+LSNLDCSDN+ +DKNWVDTEEGRSPRRRHSISGAKCW EELEPNK
Sbjct: 361 NLIKKLKKWPITDEDLSNLDCSDNNLLDKNWVDTEEGRSPRRRHSISGAKCWPEELEPNK 420

Query: 421 RRQSDGFICAKEMDKEADPLSSQ------------------TNRSSASLDVEKRALRVPN 480
           RRQSDGF+CAKEM+K+ DPLSSQ                  TNR+ ASLDVEKRALR+PN
Sbjct: 421 RRQSDGFMCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNCHETNRNFASLDVEKRALRIPN 480

Query: 481 PPPRPSCSISNEPKDENTAIIPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLM 540
           PPPRPSCSIS+EPK+EN A +PPPLPPPPPPPPLPKF+VRSATGMVQRAPQVVEFYHSLM
Sbjct: 481 PPPRPSCSISSEPKEENRAQVPPPLPPPPPPPPLPKFSVRSATGMVQRAPQVVEFYHSLM 540

Query: 541 KRDSRRDSSHGAMCNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNN 600
           KRDSR+DSS+G +CNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNN
Sbjct: 541 KRDSRKDSSNGTICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNN 600

Query: 601 AVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLK 660
           AVYL IED+V FVKWLDDELCFL                             VDERAVLK
Sbjct: 601 AVYLKIEDIVEFVKWLDDELCFL-----------------------------VDERAVLK 660

Query: 661 HFDWPERKADTLREAAFGYRDLKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYN 720
           HFDWPERKADTLREAAFGYRDLKKLECEIS YKDDPRLPCDIALKKMV+LSEKMERS YN
Sbjct: 661 HFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSYN 720

Query: 721 LLRMRESLMRHCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDP 780
           LLRMRESLMR+CKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDP
Sbjct: 721 LLRMRESLMRNCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDP 780

Query: 781 AMDYMLLQGVRFAFRIHQFAGGLDADTMHAFEDLRNLANLLNKK 804
           AMDYMLLQGVRFAFRIHQFAGG DA+TMHAFEDLRNLANLLNKK
Sbjct: 781 AMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK 787

BLAST of CmoCh03G014690 vs. ExPASy TrEMBL
Match: A0A5A7UD87 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009920 PE=4 SV=1)

HSP 1 Score: 1236.1 bits (3197), Expect = 0.0e+00
Identity = 669/825 (81.09%), Positives = 714/825 (86.55%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLN-HNAKGTTGNASKFRAASSWGSNIVK 60
           MKEDNP E R GKPSRFADQN       NPK LN +NAKG++GN SK RAASSWGS+IVK
Sbjct: 1   MKEDNPLEIR-GKPSRFADQNQ------NPKCLNQNNAKGSSGNGSKLRAASSWGSHIVK 60

Query: 61  GFSTDKRTKT--TLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHP 120
           GFSTDKR KT   LQPKK PPL  SD    K+K  PS SRIKRS+IGD  CS NPAQVHP
Sbjct: 61  GFSTDKRAKTQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHP 120

Query: 121 HSYHTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKA 180
            SY THRRQSSRDLFVELDQLR+LLNESK REFELQNELAELKRNT NYELERELEEKK 
Sbjct: 121 QSYQTHRRQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKV 180

Query: 181 ELDALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQ 240
           ELD+L +K+++LEEDRR+LS+QLV  SS+SEKQEE QTAP N+EVEVVELRRLNKELQLQ
Sbjct: 181 ELDSLAKKVSVLEEDRRALSEQLVTLSSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQ 240

Query: 241 KRNLACRLSSVESELACVAK-SSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEV 300
           KRNLACRLSSVESELAC+AK +SESE VAK+KAE SLLR TNEDLCKQVEGLQMSRLNEV
Sbjct: 241 KRNLACRLSSVESELACLAKNNSESEAVAKVKAEVSLLRHTNEDLCKQVEGLQMSRLNEV 300

Query: 301 EELAYLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKR 360
           EELAYLRWVNSCLR+ELRNSCPSANS SPSSPQ ++R+SE V SLSSQKE+M + +SAKR
Sbjct: 301 EELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPVERSSEPVCSLSSQKEYM-EYSSAKR 360

Query: 361 INLIKKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPN 420
           INLIKKLKKWPITDE+LSNLDCSDN+ +DK WVDTEEGRSPRRRHSISGAKCW EELEPN
Sbjct: 361 INLIKKLKKWPITDEDLSNLDCSDNTLLDKKWVDTEEGRSPRRRHSISGAKCWPEELEPN 420

Query: 421 KRRQSDGFICAKEMDKEADPLSSQ------------------TNRSSASLDVEKRALRVP 480
           KRRQSDGF+CAKEM+K+ DPLSSQ                  TNR+ ASLDVEKRALR+P
Sbjct: 421 KRRQSDGFMCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNFHETNRNFASLDVEKRALRIP 480

Query: 481 NPPPRPSCSISNEPKDENTAIIPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL 540
           NPPPRPSCSIS+EPK+EN A +PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL
Sbjct: 481 NPPPRPSCSISSEPKEENRAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL 540

Query: 541 MKRDSRRDSSHGAMCNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN 600
           MKRDSR+DSS+GA+CNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN
Sbjct: 541 MKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN 600

Query: 601 NAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVL 660
           NAVYL IED+V FVKWLDDELCFL                             VDERAVL
Sbjct: 601 NAVYLKIEDIVEFVKWLDDELCFL-----------------------------VDERAVL 660

Query: 661 KHFDWPERKADTLREAAFGYRDLKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIY 720
           KHFDWPERKADTLREAAFGYRDLKKLECEIS YKDDPRLPCDIALKKMV+LSEKMERS Y
Sbjct: 661 KHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSY 720

Query: 721 NLLRMRESLMRHCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKD 780
           NLLRMRESLMR+CKEFQIPTDWMLD+GIISKIKLGSVKLAKMYMKRVA ELQSKASSEKD
Sbjct: 721 NLLRMRESLMRNCKEFQIPTDWMLDSGIISKIKLGSVKLAKMYMKRVATELQSKASSEKD 780

Query: 781 PAMDYMLLQGVRFAFRIHQFAGGLDADTMHAFEDLRNLANLLNKK 804
           PAMDYMLLQGVRFAFRIHQFAGG DA+TMHAFEDLRNLANLLNKK
Sbjct: 781 PAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK 788

BLAST of CmoCh03G014690 vs. ExPASy TrEMBL
Match: A0A1S3AZK1 (protein CHUP1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103484287 PE=4 SV=1)

HSP 1 Score: 1236.1 bits (3197), Expect = 0.0e+00
Identity = 669/825 (81.09%), Positives = 714/825 (86.55%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLN-HNAKGTTGNASKFRAASSWGSNIVK 60
           MKEDNP E R GKPSRFADQN       NPK LN +NAKG++GN SK RAASSWGS+IVK
Sbjct: 1   MKEDNPLEIR-GKPSRFADQNQ------NPKCLNQNNAKGSSGNGSKLRAASSWGSHIVK 60

Query: 61  GFSTDKRTKT--TLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHP 120
           GFSTDKR KT   LQPKK PPL  SD    K+K  PS SRIKRS+IGD  CS NPAQVHP
Sbjct: 61  GFSTDKRAKTQSNLQPKKAPPLGNSDLVNQKEKFVPSHSRIKRSIIGDLACSANPAQVHP 120

Query: 121 HSYHTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKA 180
            SY THRRQSSRDLFVELDQLR+LLNESK REFELQNELAELKRNT NYELERELEEKK 
Sbjct: 121 QSYQTHRRQSSRDLFVELDQLRSLLNESKQREFELQNELAELKRNTRNYELERELEEKKV 180

Query: 181 ELDALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQ 240
           ELD+L +K+++LEEDRR+LS+QLV  SS+SEKQEE QTAP N+EVEVVELRRLNKELQLQ
Sbjct: 181 ELDSLAKKVSVLEEDRRALSEQLVTLSSVSEKQEEQQTAPGNVEVEVVELRRLNKELQLQ 240

Query: 241 KRNLACRLSSVESELACVAK-SSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEV 300
           KRNLACRLSSVESELAC+AK +SESE VAK+KAE SLLR TNEDLCKQVEGLQMSRLNEV
Sbjct: 241 KRNLACRLSSVESELACLAKNNSESEAVAKVKAEVSLLRHTNEDLCKQVEGLQMSRLNEV 300

Query: 301 EELAYLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKR 360
           EELAYLRWVNSCLR+ELRNSCPSANS SPSSPQ ++R+SE V SLSSQKE+M + +SAKR
Sbjct: 301 EELAYLRWVNSCLRSELRNSCPSANSGSPSSPQPVERSSEPVCSLSSQKEYM-EYSSAKR 360

Query: 361 INLIKKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPN 420
           INLIKKLKKWPITDE+LSNLDCSDN+ +DK WVDTEEGRSPRRRHSISGAKCW EELEPN
Sbjct: 361 INLIKKLKKWPITDEDLSNLDCSDNTLLDKKWVDTEEGRSPRRRHSISGAKCWPEELEPN 420

Query: 421 KRRQSDGFICAKEMDKEADPLSSQ------------------TNRSSASLDVEKRALRVP 480
           KRRQSDGF+CAKEM+K+ DPLSSQ                  TNR+ ASLDVEKRALR+P
Sbjct: 421 KRRQSDGFMCAKEMEKDVDPLSSQKYDLGVIQRPHVLGNFHETNRNFASLDVEKRALRIP 480

Query: 481 NPPPRPSCSISNEPKDENTAIIPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL 540
           NPPPRPSCSIS+EPK+EN A +PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL
Sbjct: 481 NPPPRPSCSISSEPKEENRAQVPPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSL 540

Query: 541 MKRDSRRDSSHGAMCNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN 600
           MKRDSR+DSS+GA+CNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN
Sbjct: 541 MKRDSRKDSSNGAICNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN 600

Query: 601 NAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVL 660
           NAVYL IED+V FVKWLDDELCFL                             VDERAVL
Sbjct: 601 NAVYLKIEDIVEFVKWLDDELCFL-----------------------------VDERAVL 660

Query: 661 KHFDWPERKADTLREAAFGYRDLKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIY 720
           KHFDWPERKADTLREAAFGYRDLKKLECEIS YKDDPRLPCDIALKKMV+LSEKMERS Y
Sbjct: 661 KHFDWPERKADTLREAAFGYRDLKKLECEISAYKDDPRLPCDIALKKMVALSEKMERSSY 720

Query: 721 NLLRMRESLMRHCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKD 780
           NLLRMRESLMR+CKEFQIPTDWMLD+GIISKIKLGSVKLAKMYMKRVA ELQSKASSEKD
Sbjct: 721 NLLRMRESLMRNCKEFQIPTDWMLDSGIISKIKLGSVKLAKMYMKRVATELQSKASSEKD 780

Query: 781 PAMDYMLLQGVRFAFRIHQFAGGLDADTMHAFEDLRNLANLLNKK 804
           PAMDYMLLQGVRFAFRIHQFAGG DA+TMHAFEDLRNLANLLNKK
Sbjct: 781 PAMDYMLLQGVRFAFRIHQFAGGFDAETMHAFEDLRNLANLLNKK 788

BLAST of CmoCh03G014690 vs. NCBI nr
Match: XP_022925727.1 (protein CHUP1, chloroplastic-like [Cucurbita moschata] >XP_022925728.1 protein CHUP1, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1477.6 bits (3824), Expect = 0.0e+00
Identity = 774/803 (96.39%), Positives = 774/803 (96.39%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG
Sbjct: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY
Sbjct: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
           HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD
Sbjct: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI
Sbjct: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ
Sbjct: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII
Sbjct: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN
Sbjct: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC
Sbjct: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 601 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 660

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 774

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 781 GLDADTMHAFEDLRNLANLLNKK 774

BLAST of CmoCh03G014690 vs. NCBI nr
Match: KAG6581496.1 (Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 1450.3 bits (3753), Expect = 0.0e+00
Identity = 761/803 (94.77%), Positives = 766/803 (95.39%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG
Sbjct: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFA HKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY
Sbjct: 61  FSTDKRTKTTLQPKKPPPLATSDFANHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
            THRRQSSRDLFVELDQLRTLLNESK+REFELQNELAELKRNT NYELERELEEKKA+ D
Sbjct: 121 QTHRRQSSRDLFVELDQLRTLLNESKHREFELQNELAELKRNTRNYELERELEEKKAQFD 180

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALTQKLNLLEEDRRSLSQQLVASSSISEKQEE QTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEESQTAPLNIEVEVVELRRLNKELQLQKRN 240

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI
Sbjct: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLDCSDN AV+KNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ
Sbjct: 361 KKLKKWPITDEELSNLDCSDNCAVEKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRS ASLDVEKRALRVPNPPPRPSCSISNEPKDENTA +
Sbjct: 421 SDGFICAKEMDKEADPLSSQTNRSFASLDVEKRALRVPNPPPRPSCSISNEPKDENTAQV 480

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN
Sbjct: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN+AVYLNIEDVVAFVKWLDDELC
Sbjct: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNDAVYLNIEDVVAFVKWLDDELC 600

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 601 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 660

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 774

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 781 GLDADTMHAFEDLRNLANLLNKK 774
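
The percentages in the HSP summary line above are the match counts divided by the alignment length (761/803 and 766/803 for this hit). The following minimal Python sketch recomputes them from a summary line; the helper name `hsp_percentages` and the regular expression are illustrative assumptions, not part of the database or of any BLAST tool, and the pattern accepts either spelling of the "Positives" label.

```python
import re

# Hypothetical helper (an assumption for illustration, not a database API).
# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line;
# "Posi?tives" also accepts the label when it is printed without the "i".
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


summary = "Identity = 761/803 (94.77%), Positives = 766/803 (95.39%), Query Frame = 0"
print(hsp_percentages(summary))
```

Recomputing from the counts rather than trusting the printed percentage is a cheap consistency check when summary lines are scraped from a page.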

BLAST of CmoCh03G014690 vs. NCBI nr
Match: XP_023544570.1 (protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023544572.1 protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1438.3 bits (3722), Expect = 0.0e+00
Identity = 758/803 (94.40%), Positives = 763/803 (95.02%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPSESRGGKPSRFADQNH  NPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG
Sbjct: 1   MKEDNPSESRGGKPSRFADQNH--NPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFA HKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY
Sbjct: 61  FSTDKRTKTTLQPKKPPPLATSDFANHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
            THRRQSSRDLFVELDQLRTLLNESK+REFELQNEL ELKRNT NYELERELEEKKAELD
Sbjct: 121 QTHRRQSSRDLFVELDQLRTLLNESKHREFELQNELTELKRNTRNYELERELEEKKAELD 180

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALTQKLNLLEEDRRSLSQQLVASSSISEKQEE QTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEESQTAPLNIEVEVVELRRLNKELQLQKRN 240

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSA+SNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI
Sbjct: 301 YLRWVNSCLRNELRNSCPSASSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLDCSDN  V+KNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ
Sbjct: 361 KKLKKWPITDEELSNLDCSDNIPVEKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRS ASLDVEKRALRVPNPPPRPSCSISNEPKDENTA +
Sbjct: 421 SDGFICAKEMDKEADPLSSQTNRSFASLDVEKRALRVPNPPPRPSCSISNEPKDENTAQV 480

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGA+CNVPDVSN
Sbjct: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAICNVPDVSN 540

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC
Sbjct: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 601 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 660

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 772

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 781 GLDADTMHAFEDLRNLANLLNKK 772

BLAST of CmoCh03G014690 vs. NCBI nr
Match: KAG7034785.1 (Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1427.9 bits (3695), Expect = 0.0e+00
Identity = 754/803 (93.90%), Positives = 759/803 (94.52%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPSESRGGKPSRFADQNH    NPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG
Sbjct: 14  MKEDNPSESRGGKPSRFADQNH----NPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 73

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFA HKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY
Sbjct: 74  FSTDKRTKTTLQPKKPPPLATSDFANHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 133

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
            THR QSSRDLFVELDQLRTLLNESK+REFELQNELAELKRN  NYELERELEEKKA+ D
Sbjct: 134 QTHRTQSSRDLFVELDQLRTLLNESKHREFELQNELAELKRNARNYELERELEEKKAQFD 193

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALTQKLNLLEEDRRSLSQQLVASSSISEKQEE QTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 194 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEESQTAPLNIEVEVVELRRLNKELQLQKRN 253

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 254 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 313

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI
Sbjct: 314 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 373

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLD SDN AV+KNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ
Sbjct: 374 KKLKKWPITDEELSNLDYSDNCAVEKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 433

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRS ASLDVEKRALRVPNPPPRPSCSISNEPKDENTA +
Sbjct: 434 SDGFICAKEMDKEADPLSSQTNRSFASLDVEKRALRVPNPPPRPSCSISNEPKDENTAQV 493

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN
Sbjct: 494 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 553

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVN+AVYLNIEDVVAFVKWLDDELC
Sbjct: 554 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNDAVYLNIEDVVAFVKWLDDELC 613

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 614 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 673

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 674 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 733

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 734 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 783

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 794 GLDADTMHAFEDLRNLANLLNKK 783

BLAST of CmoCh03G014690 vs. NCBI nr
Match: XP_022978368.1 (protein CHUP1, chloroplastic-like [Cucurbita maxima] >XP_022978369.1 protein CHUP1, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1426.8 bits (3692), Expect = 0.0e+00
Identity = 750/803 (93.40%), Positives = 760/803 (94.65%), Query Frame = 0

Query: 1   MKEDNPSESRGGKPSRFADQNHNPNPNPNPKSLNHNAKGTTGNASKFRAASSWGSNIVKG 60
           MKEDNPS++RGGKPSRFADQNH  NPNPNPKSLNHNAKGTTGNASKFRAASSWGS+IVKG
Sbjct: 1   MKEDNPSDNRGGKPSRFADQNH--NPNPNPKSLNHNAKGTTGNASKFRAASSWGSHIVKG 60

Query: 61  FSTDKRTKTTLQPKKPPPLATSDFAYHKDKLPPSQSRIKRSLIGDSPCSPNPAQVHPHSY 120
           FSTDKRTKTTLQPKKPPPLATSDFA HKDKLPPSQSRIKRSLIGDSPCSPNPAQ+HPHSY
Sbjct: 61  FSTDKRTKTTLQPKKPPPLATSDFANHKDKLPPSQSRIKRSLIGDSPCSPNPAQLHPHSY 120

Query: 121 HTHRRQSSRDLFVELDQLRTLLNESKNREFELQNELAELKRNTTNYELERELEEKKAELD 180
            THRRQSSRDLF+ELDQLRTLLNESK+REFELQNEL ELKRNT NYELERELEEKKAELD
Sbjct: 121 QTHRRQSSRDLFLELDQLRTLLNESKHREFELQNELTELKRNTRNYELERELEEKKAELD 180

Query: 181 ALTQKLNLLEEDRRSLSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRN 240
           ALT+KLNLLEEDRRSLSQQLVASSSISEKQEE QTAPLNIEVEVVELRRLNKELQLQKRN
Sbjct: 181 ALTRKLNLLEEDRRSLSQQLVASSSISEKQEESQTAPLNIEVEVVELRRLNKELQLQKRN 240

Query: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300
           LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA
Sbjct: 241 LACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELA 300

Query: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHMVDCNSAKRINLI 360
           YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEH VDCNSAKRINLI
Sbjct: 301 YLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSEAVGSLSSQKEHTVDCNSAKRINLI 360

Query: 361 KKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRSPRRRHSISGAKCWGEELEPNKRRQ 420
           KKLKKWPITDEELSNLDCSDNS V+KNWVD EEGRSPRRRHSISGAKCW EELEPNKRRQ
Sbjct: 361 KKLKKWPITDEELSNLDCSDNSLVEKNWVDAEEGRSPRRRHSISGAKCWAEELEPNKRRQ 420

Query: 421 SDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALRVPNPPPRPSCSISNEPKDENTAII 480
           SDGFICAKEMDKEADPLSSQTNRS  SLDVEKRALRVPNPPPRPSCSISNEPKDENTA +
Sbjct: 421 SDGFICAKEMDKEADPLSSQTNRSFVSLDVEKRALRVPNPPPRPSCSISNEPKDENTAQV 480

Query: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540
           PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN
Sbjct: 481 PPPLPPPPPPPPLPKFAVRSATGMVQRAPQVVEFYHSLMKRDSRRDSSHGAMCNVPDVSN 540

Query: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600
           VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC
Sbjct: 541 VRSSMIGEIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELC 600

Query: 601 FLVLSLNFGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRD 660
           FL                             VDERAVLKHFDWPERKADTLREAAFGYRD
Sbjct: 601 FL-----------------------------VDERAVLKHFDWPERKADTLREAAFGYRD 660

Query: 661 LKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720
           +KKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW
Sbjct: 661 VKKLECEISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDW 720

Query: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 780
           MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG
Sbjct: 721 MLDNGIISKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAG 772

Query: 781 GLDADTMHAFEDLRNLANLLNKK 804
           GLDADTMHAFEDLRNLANLLNKK
Sbjct: 781 GLDADTMHAFEDLRNLANLLNKK 772

BLAST of CmoCh03G014690 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 388.7 bits (997), Expect = 1.2e-107
Identity = 314/887 (35.40%), Positives = 431/887 (48.59%), Query Frame = 0

Query: 134 ELDQLRTLLNESKNREFELQNELAE---LKRNTTN-YELERELEEKKAELDALTQKLNLL 193
           EL++L+ L+ E + RE +L+ EL E   LK   ++  EL+R+L+ K  E+D L   +N L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 194 EEDRRSLSQQL-----------VASSSISEKQEEPQ------------------------ 253
           + +R+ L ++L           VA + I E Q + Q                        
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 254 --------------TAPLNIEVEVVELRRLNKELQLQKRNLACRLSSVESELACVAKSSE 313
                          A  ++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A ++  +E
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTE 309

Query: 314 SEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRN-SCPS 373
           S+ VAK++ E + L+  NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN   P+
Sbjct: 310 SDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPA 369

Query: 374 ----------------------------------------ANSNSPSSPQTIDRTSEAVG 433
                                                   +N + PSSP + D  + ++ 
Sbjct: 370 GKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMD 429

Query: 434 SLSSQKEHMVDCNSAKRINLIKKLKKWPITDEELS---------------NLDCSDN--- 493
           S +S+       + +K+  LI+KLKKW  + ++ S                L  S N   
Sbjct: 430 SSTSRFS-----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQR 489

Query: 494 --------------------SAVDKNWVDTEEGRSPRRRHSISGAKCWGEELE------- 553
                                 VD+    T E  +  R  +   A   GE L        
Sbjct: 490 GPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFH 549

Query: 554 ---------------PNKRRQSDGFICAKEMDKEADPL---------------------- 613
                            K R        K +  +AD                        
Sbjct: 550 VMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKR 609

Query: 614 ----------------SSQTNRSSAS-----------LDVEKRALRVPNPPPRPSCS--- 673
                           S+++N   AS           +D+EKR  RVP PPPR +     
Sbjct: 610 VVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKS 669

Query: 674 ---ISNEPKDENTAIIPPPLPP-----------PPPPPPLPKFAVRSATG--MVQRAPQV 733
               S  P        PPP PP           PPPPPP P    R A G   V RAP++
Sbjct: 670 TNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPEL 729

Query: 734 VEFYHSLMKRDSRRDSSHGAMCN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFV 793
           VEFY SLMKR+S+++ +   + +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV
Sbjct: 730 VEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFV 789

Query: 794 NSLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILW 798
            SL  EV  + + +IED++AFV WLD+EL FL                            
Sbjct: 790 QSLATEVRASSFTDIEDLLAFVSWLDEELSFL---------------------------- 849

BLAST of CmoCh03G014690 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 388.7 bits (997), Expect = 1.2e-107
Identity = 314/887 (35.40%), Positives = 431/887 (48.59%), Query Frame = 0

Query: 134 ELDQLRTLLNESKNREFELQNELAE---LKRNTTN-YELERELEEKKAELDALTQKLNLL 193
           EL++L+ L+ E + RE +L+ EL E   LK   ++  EL+R+L+ K  E+D L   +N L
Sbjct: 130 ELERLKQLVKELEEREVKLEGELLEYYGLKEQESDIVELQRQLKIKTVEIDMLNITINSL 189

Query: 194 EEDRRSLSQQL-----------VASSSISEKQEEPQ------------------------ 253
           + +R+ L ++L           VA + I E Q + Q                        
Sbjct: 190 QAERKKLQEELSQNGIVRKELEVARNKIKELQRQIQLDANQTKGQLLLLKQHVSSLQMKE 249

Query: 254 --------------TAPLNIEVEVVELRRLNKELQLQKRNLACRLSSVESELACVAKSSE 313
                          A  ++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A ++  +E
Sbjct: 250 EEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATLSNMTE 309

Query: 314 SEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRN-SCPS 373
           S+ VAK++ E + L+  NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN   P+
Sbjct: 310 SDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRNYQTPA 369

Query: 374 ----------------------------------------ANSNSPSSPQTIDRTSEAVG 433
                                                   +N + PSSP + D  + ++ 
Sbjct: 370 GKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFDNASMD 429

Query: 434 SLSSQKEHMVDCNSAKRINLIKKLKKWPITDEELS---------------NLDCSDN--- 493
           S +S+       + +K+  LI+KLKKW  + ++ S                L  S N   
Sbjct: 430 SSTSRFS-----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSSMNKQR 489

Query: 494 --------------------SAVDKNWVDTEEGRSPRRRHSISGAKCWGEELE------- 553
                                 VD+    T E  +  R  +   A   GE L        
Sbjct: 490 GPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSVAASFH 549

Query: 554 ---------------PNKRRQSDGFICAKEMDKEADPL---------------------- 613
                            K R        K +  +AD                        
Sbjct: 550 VMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQLKEKR 609

Query: 614 ----------------SSQTNRSSAS-----------LDVEKRALRVPNPPPRPSCS--- 673
                           S+++N   AS           +D+EKR  RVP PPPR +     
Sbjct: 610 VVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSAGGGKS 669

Query: 674 ---ISNEPKDENTAIIPPPLPP-----------PPPPPPLPKFAVRSATG--MVQRAPQV 733
               S  P        PPP PP           PPPPPP P    R A G   V RAP++
Sbjct: 670 TNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPEL 729

Query: 734 VEFYHSLMKRDSRRDSSHGAMCN-VPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFV 793
           VEFY SLMKR+S+++ +   + +   + S  R++MIGEIENRS+ LLA+KAD+ETQG+FV
Sbjct: 730 VEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFV 789

Query: 794 NSLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILW 798
            SL  EV  + + +IED++AFV WLD+EL FL                            
Sbjct: 790 QSLATEVRASSFTDIEDLLAFVSWLDEELSFL---------------------------- 849

BLAST of CmoCh03G014690 vs. TAIR 10
Match: AT3G25690.3 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 375.2 bits (962), Expect = 1.3e-103
Identity = 303/832 (36.42%), Positives = 413/832 (49.64%), Query Frame = 0

Query: 142 LNESKNREFELQNELA----ELKRNTTNYELERELEEKKAELDALTQK--LNLLEEDRRS 201
           LN+   +E   QN +     E+ RN    EL+R++     +LDA   K  L LL++   S
Sbjct: 50  LNDKNLQEELSQNGIVRKELEVARNKIK-ELQRQI-----QLDANQTKGQLLLLKQHVSS 109

Query: 202 LSQQLVASSSISEKQEEPQTAPLNIEVEVVELRRLNKELQLQKRNLACRLSSVESELACV 261
           L  +   + +   + E    A  ++EV+V+EL+R N+ELQ +KR L+ +L S E+ +A +
Sbjct: 110 LQMKEEEAMNKDTEVERKLKAVQDLEVQVMELKRKNRELQHEKRELSIKLDSAEARIATL 169

Query: 262 AKSSESEGVAKIKAEASLLRQTNEDLCKQVEGLQMSRLNEVEELAYLRWVNSCLRNELRN 321
           +  +ES+ VAK++ E + L+  NEDL KQVEGLQM+R +EVEEL YLRWVN+CLR ELRN
Sbjct: 170 SNMTESDKVAKVREEVNNLKHNNEDLLKQVEGLQMNRFSEVEELVYLRWVNACLRYELRN 229

Query: 322 -SCPS----------------------------------------ANSNSPSSPQTIDRT 381
              P+                                        +N + PSSP + D  
Sbjct: 230 YQTPAGKISARDLSKNLSPKSQAKAKRLMLEYAGSERGQGDTDLESNYSQPSSPGSDDFD 289

Query: 382 SEAVGSLSSQKEHMVDCNSAKRINLIKKLKKWPITDEELS---------------NLDCS 441
           + ++ S +S+       + +K+  LI+KLKKW  + ++ S                L  S
Sbjct: 290 NASMDSSTSRFS-----SFSKKPGLIQKLKKWGKSKDDSSVQSSPSRSFYGGSPGRLSSS 349

Query: 442 DN-----------------------SAVDKNWVDTEEGRSPRRRHSISGAKCWGEELE-- 501
            N                         VD+    T E  +  R  +   A   GE L   
Sbjct: 350 MNKQRGPLESLMIRNAGESVAITTFGQVDQESPGTPETPNLPRIRTQQQASSPGEGLNSV 409

Query: 502 --------------------PNKRRQSDGFICAKEMDKEADPL----------------- 561
                                 K R        K +  +AD                   
Sbjct: 410 AASFHVMSKSVDNVLDEKYPAYKDRHKLAVEREKHIKHKADQARAERFGGNVALPPKLAQ 469

Query: 562 ---------------------SSQTNRSSAS-----------LDVEKRALRVPNPPPRPS 621
                                S+++N   AS           +D+EKR  RVP PPPR +
Sbjct: 470 LKEKRVVVPSVITATGDQSNESNESNEGKASENAATVTKMKLVDIEKRPPRVPRPPPRSA 529

Query: 622 CS------ISNEPKDENTAIIPPPLPP-----------PPPPPPLPKFAVRSATG--MVQ 681
                    S  P        PPP PP           PPPPPP P    R A G   V 
Sbjct: 530 GGGKSTNLPSARPPLPGGGPPPPPPPPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVH 589

Query: 682 RAPQVVEFYHSLMKRDSRRDSSHGAMCN-VPDVSNVRSSMIGEIENRSSHLLAIKADIET 741
           RAP++VEFY SLMKR+S+++ +   + +   + S  R++MIGEIENRS+ LLA+KAD+ET
Sbjct: 590 RAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARNNMIGEIENRSTFLLAVKADVET 649

Query: 742 QGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMH 798
           QG+FV SL  EV  + + +IED++AFV WLD+EL FL                       
Sbjct: 650 QGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFL----------------------- 709

BLAST of CmoCh03G014690 vs. TAIR 10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 303.9 bits (777), Expect = 3.8e-82
Identity = 239/610 (39.18%), Positives = 319/610 (52.30%), Query Frame = 0

Query: 224 VVELRRLNKELQ-----LQKRNLACRLSSVESELACVAKSSESEGVAKIKAEASLLRQTN 283
           V ELRR  +EL+     L+  NL  +L     E   V    ES+ +A    E   LR+  
Sbjct: 87  VSELRRQVEELREREALLKTENLEVKLL---RESVSVIPLLESQ-IADKNGEIDELRKET 146

Query: 284 EDLCKQVEGL--QMSRLNEVEELAYLRWVNSCLRNELRNSCPSANSNSPSSPQTIDRTSE 343
             L +  E L  +  R  E+      R     +  E+       +S S     ++ +  +
Sbjct: 147 ARLAEDNERLRREFDRSEEMRRECETR--EKEMEAEIVELRKLVSSESDDHALSVSQRFQ 206

Query: 344 AVGSLSSQKEHMVDCNSAKRINLIKKLKKWPITDEELSNLDCSDNSAVDKNWVDTEEGRS 403
            +  +S+ K +++   S KR+  ++ L + PIT++E +N   S +   D        G  
Sbjct: 207 GLMDVSA-KSNLI--RSLKRVGSLRNLPE-PITNQENTNKSISSSGDAD--------GDI 266

Query: 404 PRRRHSISGAKCWGEELEPNKRRQSDGFICAKEMDKEADPLSSQTNRSSASLDVEKRALR 463
            R+           +E+E   R  +                S +   SS+   V  R  R
Sbjct: 267 YRK-----------DEIESYSRSSN----------------SEELTESSSLSTVRSRVPR 326

Query: 464 VPNPPPRPSCSISN------EPKDENTAIIPPPLP-----------------PPPPPPPL 523
           VP PPP+ S S+ +      +P  + +   PPP P                 PPPPPPP 
Sbjct: 327 VPKPPPKRSISLGDSTENRADPPPQKSIPPPPPPPPPPLLQQPPPPPSVSKAPPPPPPPP 386

Query: 524 PKFAVRSATGMVQRAPQVVEFYHSLMKRD---SRRDSSHGAMCNVPDV---SNVRSSMIG 583
           P  ++  A+  V+R P+VVEFYHSLM+RD   SRRDS+ G       +   SN R  MIG
Sbjct: 387 PPKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNAR-DMIG 446

Query: 584 EIENRSSHLLAIKADIETQGEFVNSLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLN 643
           EIENRS +LLAIK D+ETQG+F+  LI+EV NA + +IEDVV FVKWLDDEL +L     
Sbjct: 447 EIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYL----- 506

Query: 644 FGTLDLNKTLQEEFERMHAVILWKVDERAVLKHFDWPERKADTLREAAFGYRDLKKLECE 703
                                   VDERAVLKHF+WPE+KAD LREAAF Y DLKKL  E
Sbjct: 507 ------------------------VDERAVLKHFEWPEQKADALREAAFCYFDLKKLISE 566

Query: 704 ISGYKDDPRLPCDIALKKMVSLSEKMERSIYNLLRMRESLMRHCKEFQIPTDWMLDNGII 763
            S +++DPR     ALKKM +L EK+E  +Y+L RMRES     K FQIP DWML+ GI 
Sbjct: 567 ASRFREDPRQSSSSALKKMQALFEKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGIT 619

Query: 764 SKIKLGSVKLAKMYMKRVAMELQSKASSEKDPAMDYMLLQGVRFAFRIHQFAGGLDADTM 798
           S+IKL SVKLA  YMKRV+ EL+  A     P  + +++QGVRFAFR+HQFAGG DA+TM
Sbjct: 627 SQIKLASVKLAMKYMKRVSAELE--AIEGGGPEEEELIVQGVRFAFRVHQFAGGFDAETM 619

BLAST of CmoCh03G014690 vs. TAIR 10
Match: AT1G48280.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 290.0 bits (741), Expect = 5.7e-78
Identity = 160/343 (46.65%), Positives = 221/343 (64.43%), Query Frame = 0

Query: 456 RVPNPPPRPSCSISNE----PKDENTAIIPPPLPPPPPPPPLPKFAVRSATGMVQRAPQV 515
           R+P  PP P   +S       +DEN++   PP PPPPPPPP P+   ++A    Q++P V
Sbjct: 228 RLPPTPPLPKFLVSPASSLGKRDENSSPFAPPTPPPPPPPPPPRPLAKAA--RAQKSPPV 287

Query: 516 VEFYHSLMKRDSRRDSSHGAMCNVPDVSNVRSSMIGEIENRSSHLLAIKADIETQGEFVN 575
            + +  L K+D+ R+ S     N   V++  +S++GEI+NRS+HL+AIKADIET+GEF+N
Sbjct: 288 SQLFQLLNKQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIETKGEFIN 347

Query: 576 SLIREVNNAVYLNIEDVVAFVKWLDDELCFLVLSLNFGTLDLNKTLQEEFERMHAVILWK 635
            LI++V    + ++EDV+ FV WLD EL  L                             
Sbjct: 348 DLIQKVLTTCFSDMEDVMKFVDWLDKELATL----------------------------- 407

Query: 636 VDERAVLKHFDWPERKADTLREAAFGYRDLKKLECEISGYKDDPRLPCDIALKKMVSLSE 695
            DERAVLKHF WPE+KADTL+EAA  YR+LKKLE E+S Y DDP +   +ALKKM +L +
Sbjct: 408 ADERAVLKHFKWPEKKADTLQEAAVEYRELKKLEKELSSYSDDPNIHYGVALKKMANLLD 467

Query: 696 KMERSIYNLLRMRESLMRHCKEFQIPTDWMLDNGIISKIKLGSVKLAKMYMKRVAMELQS 755
           K E+ I  L+R+R S MR  ++F+IP +WMLD+G+I KIK  S+KLAK YM RVA ELQS
Sbjct: 468 KSEQRIRRLVRLRGSSMRSYQDFKIPVEWMLDSGMICKIKRASIKLAKTYMNRVANELQS 527

Query: 756 KASSEKDPAMDYMLLQGVRFAFRIHQFAGGLDADTMHAFEDLR 795
             + +++   + +LLQGVRFA+R HQFAGGLD +T+ A E+++
Sbjct: 528 ARNLDRESTKEALLLQGVRFAYRTHQFAGGLDPETLCALEEIK 539

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
Q9LI74          1.7e-106  35.40         Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

Match Name      E-value   Identity (%)  Description
A0A6J1ECG5      0.0e+00   96.39         protein CHUP1, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111433055 ...
A0A6J1ITW4      0.0e+00   93.40         protein CHUP1, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111478380 PE...
A0A0A0KMA9      0.0e+00   81.43         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G526260 PE=4 SV=1
A0A5A7UD87      0.0e+00   81.09         Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G009920...
A0A1S3AZK1      0.0e+00   81.09         protein CHUP1, chloroplastic-like OS=Cucumis melo OX=3656 GN=LOC103484287 PE=4 S...

Match Name      E-value   Identity (%)  Description
XP_022925727.1  0.0e+00   96.39         protein CHUP1, chloroplastic-like [Cucurbita moschata] >XP_022925728.1 protein C...
KAG6581496.1    0.0e+00   94.77         Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]
XP_023544570.1  0.0e+00   94.40         protein CHUP1, chloroplastic-like [Cucurbita pepo subsp. pepo] >XP_023544572.1 p...
KAG7034785.1    0.0e+00   93.90         Protein CHUP1, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperm...
XP_022978368.1  0.0e+00   93.40         protein CHUP1, chloroplastic-like [Cucurbita maxima] >XP_022978369.1 protein CHU...

Match Name      E-value   Identity (%)  Description
AT3G25690.1     1.2e-107  35.40         Hydroxyproline-rich glycoprotein family protein
AT3G25690.2     1.2e-107  35.40         Hydroxyproline-rich glycoprotein family protein
AT3G25690.3     1.3e-103  36.42         Hydroxyproline-rich glycoprotein family protein
AT4G18570.1     3.8e-82   39.18         Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G48280.1     5.7e-78   46.65         hydroxyproline-rich glycoprotein family protein
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR Term   IPR Description     Source       Source Term     Source Description                 Alignment
None       No IPR available    COILS        Coil            Coil                               coord: 135..199
None       No IPR available    COILS        Coil            Coil                               coord: 220..251
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 1..92
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 316..345
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 431..493
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 478..493
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 14..67
None       No IPR available    MOBIDB_LITE  mobidb-lite     disorder_prediction                coord: 105..127
None       No IPR available    PANTHER      PTHR31342:SF41  PROTEIN CHUP1, CHLOROPLASTIC-LIKE  coord: 1..803
IPR040265  Protein CHUP1-like  PANTHER      PTHR31342       PROTEIN CHUP1, CHLOROPLASTIC       coord: 1..803

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh03G014690.1  CmoCh03G014690.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009658      chloroplast organization
cellular_component  GO:0009707      chloroplast outer membrane