Sgr015918 (gene) Monk fruit (Qingpiguo) v1

Overview
Name: Sgr015918
Type: gene
Organism: Siraitia grosvenorii (Monk fruit (Qingpiguo) v1)
Description: protein CHUP1, chloroplastic
Location: tig00006297: 680800 .. 683309 (-)
RNA-Seq Expression: Sgr015918
Synteny: Sgr015918
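
The Location field above packs the contig, the coordinates, and the strand into one string. A minimal sketch of splitting it apart and computing the span length follows; the `Location` type and both function names are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig: start .. end (strand)' string."""
    seqid: str
    start: int
    end: int
    strand: str


# Matches strings such as "tig00006297: 680800 .. 683309 (-)".
_LOCATION_RE = re.compile(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("tig00006297: 680800 .. 683309 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under the inclusive-coordinate assumption the gene spans 2510 bases on the minus strand of tig00006297.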
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCCGCAGGAAGAAGATGATGATATGGCCATGGAGATCAACAGCTTGAGGAAACAACTGGAAATTGCTGTGGGGAAATCCAACTTTCTCGAGAAAGAGAATCAAGAACTGAGACAAGAAGTTGGTCGTCTGAAATCTCAGATTCAGTCTCTGAAAGCCCACAGCAATGAGAGAAAATCCATTCTCTGGAAGAAATTCCACAACTCCATGGATGTCTCCGTCACGTTCGCCGACTCGTCGCCGCAGAAGCCACCGGAGCAGAGTCCCGCGGCGAATGATAAACCGATAAGAAGAACCGGAGACTTCCCGGAAACAACCGATAAACGGGAGCCAACCAGATCGCCGAAACAGCTTCCTCCGATAACTGCTTGGGCCGTCGTGAAAGAGAACCAGAGAAAGCCGGCTCCGGCTCCGCCCCCACCTCCGCTTCCGACGAAGCTCCTCGGCGGATCAAAGGCAGTGCGTCGAGTCCCGGAAGTGCTGGAGCTGTACCGTTCACTGACGAAACGAGACGCCCAGAAGGAAAACAAGGGCAACGCGGGAGGATATCCGGCGGTGGCATTCAGCAAAAACATGATCGGAGAGATCGAGAACCGGTCAGCGTATCTGTCAGCGGTAAGTAAACAGTAAAAGCAAATATTTTTCCATGACAAAAACGAAAACCCAGTTGAATGGAACGATGGGCAGATAAAATCGGAGGTGGAGACGCATGGGGAGTTCGTGAACTGGCTGATAAAGGAAGTGGAAGCGGCGGCGCCGAGGGAGATAACGGAGGTGGAGAGGTTCGTGAAGTGGCTGGACGGGGAGCTGGCGGCGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCGCGGTGGCCGGAGGGGAAGGCGGATGCGCTGCGGGAGGCGGCGTTCAGTTACAGGGATCTGAAGGGGCTGGAGAGTGAAGTGCGTTCGTTCAAAGACAATCCGAAGGAGGAGATGGGTGTGGTTGTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAAAGTACAATCTGTGGTTGGATTGAAAATTGTTGTGTTTGATTGATTGAGGAGATTTGGGAGTTTGATTTTGAATTTTGAAATGTGCAGGTTGGAGCAGAGCGTTGGGAATGTGGAGAGGACGAGGGAGTTCAGTTGTAAGAAGTACGAGAGTTTTCAAATCCCCTGCGAATGGATGTTGGACTCTGGCCTCCTCGGTCAGGTACGTGCCATTCCAATTTCACCCTTTTTCTTTATTATTATTATTATTATTATAAATTTAGCTATTCAACTCTCCACACTAAAATAAACACATTCTTAAAAGTTAAAAAAATCAAACTTGTAATTTAATCTCTTCCACTTTTAATCATTTAACCATGTGTTAATAAGAATATGTTTATTAAAAACTTAATGGTTTAAGTCTCATTCTTGTAACTTTCAAAAATATCTTTTTTTTCTTACTAATCTTAACCATTAGAGCTCTGGAAAAGTTGGCAAAATAATAATAATAATTTTGATTGATAATGCAGGCATGTGTTGGAACTTAATAAATTGGCTAAGACAATTATTAGGTTAGGAGATGTGAAGAACCGAACTCAAAAAAAAAAAAAAAAAGCTAGAGAAAATAGACAATTTTAAAATTCAAAAACAAAAAATAAAAATAAAAAGTCTGAAAATTCAAAAACTCAAATAAATTTAAACCAATTATAGTTGAATTGTGGTAACAACCGCATTAACAATGATAACCACAGAAACAAAATAAAATATAATAAAAATTTTAAAGACATAATTGAAAATTTTTATAAGATATTAACAATCAAATTTCATAATTAAATTTATGATGAAAGCTAATATAAGATTAATTCTACCAACTGGGCAAGTAAAATCATAAACGCAGCCAAAATTACTTTCATTTAGAAAAATCATAACAAATCATCAGTTGCTTGCATCTTCCAGAATCAGGCACATGCAAATAAAAAACCTAACTTGGTCATTGTTTGCAGATGAAGTTG
AGCTCATTGAGGCTAGCCAAGGAATACATGCGGAGGATAACAAGAGAACTACATTCAACCGAAACCCCGCACCCAGAAAACCTCTTTCTTCAAGGTGTTCGATTTGCGTACAGGGTTCACCAGGTAAACTAGAATGATCAGTAGAATGATGAGCTAATCTGATCATTCACATAATAACACTCAGAACCTTTTCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCATTCGAAGGGCTGAGGAAAGTCGGGCTGAGCTGGGGAGGAGGTCAGAGAAAATAGGCTTCTTGGTGATAAATTGTAGTGTTTGAATCTTGTCGGTTGTTGTGGCAAAACTTTACTAGTAAACAAACAGCTATTGTAAGAATTAGCATTGCAGAGCACATTCAACAAAAAGATGTGATACAAATGCTTGATCAAAAAAATTGGGAATTGTATACCTTAATCAAATCTATCGCAACTTGTTCAAACCATCGCAAGGCAAAGGCAAATGTCATAACAGCATAA

mRNA sequence

ATGCCGCAGGAAGAAGATGATGATATGGCCATGGAGATCAACAGCTTGAGGAAACAACTGGAAATTGCTGTGGGGAAATCCAACTTTCTCGAGAAAGAGAATCAAGAACTGAGACAAGAAGTTGGTCGTCTGAAATCTCAGATTCAGTCTCTGAAAGCCCACAGCAATGAGAGAAAATCCATTCTCTGGAAGAAATTCCACAACTCCATGGATGTCTCCGTCACGTTCGCCGACTCGTCGCCGCAGAAGCCACCGGAGCAGAGTCCCGCGGCGAATGATAAACCGATAAGAAGAACCGGAGACTTCCCGGAAACAACCGATAAACGGGAGCCAACCAGATCGCCGAAACAGCTTCCTCCGATAACTGCTTGGGCCGTCGTGAAAGAGAACCAGAGAAAGCCGGCTCCGGCTCCGCCCCCACCTCCGCTTCCGACGAAGCTCCTCGGCGGATCAAAGGCAGTGCGTCGAGTCCCGGAAGTGCTGGAGCTGTACCGTTCACTGACGAAACGAGACGCCCAGAAGGAAAACAAGGGCAACGCGGGAGGATATCCGGCGGTGGCATTCAGCAAAAACATGATCGGAGAGATCGAGAACCGGTCAGCGTATCTGTCAGCGATAAAATCGGAGGTGGAGACGCATGGGGAGTTCGTGAACTGGCTGATAAAGGAAGTGGAAGCGGCGGCGCCGAGGGAGATAACGGAGGTGGAGAGGTTCGTGAAGTGGCTGGACGGGGAGCTGGCGGCGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCGCGGTGGCCGGAGGGGAAGGCGGATGCGCTGCGGGAGGCGGCGTTCAGTTACAGGGATCTGAAGGGGCTGGAGAGTGAAGTGCGTTCGTTCAAAGACAATCCGAAGGAGGAGATGGGTGTGGTTGTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAAAGTTGGAGCAGAGCGTTGGGAATGTGGAGAGGACGAGGGAGTTCAGTTGTAAGAAGTACGAGAGTTTTCAAATCCCCTGCGAATGGATGTTGGACTCTGGCCTCCTCGGTCAGGCACATGCAAATAAAAAACCTAACTTGGTCATTGTTTGCAGATGAAGTTGAGCTCATTGAGGCTAGCCAAGGAATACATGCGGAGGATAACAAGAGAACTACATTCAACCGAAACCCCGCACCCAGAAAACCTCTTTCTTCAAGGTGTTCGATTTGCGTACAGGGTTCACCAGAACCTTTTCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCATTCGAAGGGCTGAGGAAAGTCGGGCTGAGCTGGGGAGGAGCTATTGTAAGAATTAGCATTGCAGAGCACATTCAACAAAAAGATGTGATACAAATGCTTGATCAAAAAAATTGGGAATTGTATACCTTAATCAAATCTATCGCAACTTGTTCAAACCATCGCAAGGCAAAGGCAAATGTCATAACAGCATAA

Coding sequence (CDS)

ATGCCGCAGGAAGAAGATGATGATATGGCCATGGAGATCAACAGCTTGAGGAAACAACTGGAAATTGCTGTGGGGAAATCCAACTTTCTCGAGAAAGAGAATCAAGAACTGAGACAAGAAGTTGGTCGTCTGAAATCTCAGATTCAGTCTCTGAAAGCCCACAGCAATGAGAGAAAATCCATTCTCTGGAAGAAATTCCACAACTCCATGGATGTCTCCGTCACGTTCGCCGACTCGTCGCCGCAGAAGCCACCGGAGCAGAGTCCCGCGGCGAATGATAAACCGATAAGAAGAACCGGAGACTTCCCGGAAACAACCGATAAACGGGAGCCAACCAGATCGCCGAAACAGCTTCCTCCGATAACTGCTTGGGCCGTCGTGAAAGAGAACCAGAGAAAGCCGGCTCCGGCTCCGCCCCCACCTCCGCTTCCGACGAAGCTCCTCGGCGGATCAAAGGCAGTGCGTCGAGTCCCGGAAGTGCTGGAGCTGTACCGTTCACTGACGAAACGAGACGCCCAGAAGGAAAACAAGGGCAACGCGGGAGGATATCCGGCGGTGGCATTCAGCAAAAACATGATCGGAGAGATCGAGAACCGGTCAGCGTATCTGTCAGCGATAAAATCGGAGGTGGAGACGCATGGGGAGTTCGTGAACTGGCTGATAAAGGAAGTGGAAGCGGCGGCGCCGAGGGAGATAACGGAGGTGGAGAGGTTCGTGAAGTGGCTGGACGGGGAGCTGGCGGCGCTGGTGGACGAGAGGGCGGTGCTGAAGCACTTCCCGCGGTGGCCGGAGGGGAAGGCGGATGCGCTGCGGGAGGCGGCGTTCAGTTACAGGGATCTGAAGGGGCTGGAGAGTGAAGTGCGTTCGTTCAAAGACAATCCGAAGGAGGAGATGGGTGTGGTTGTGAAGAGGGCTCAGGCGCTGCAAGACAGGCGAGAAAGTTGGAGCAGAGCGTTGGGAATGTGGAGAGGACGAGGGAGTTCAGTTGTAAGAAGTACGAGAGTTTTCAAATCCCCTGCGAATGGATGTTGGACTCTGGCCTCCTCGGTCAGGCACATGCAAATAAAAAACCTAACTTGGTCATTGTTTGCAGATGAAGTTGAGCTCATTGAGGCTAGCCAAGGAATACATGCGGAGGATAACAAGAGAACTACATTCAACCGAAACCCCGCACCCAGAAAACCTCTTTCTTCAAGGTGTTCGATTTGCGTACAGGGTTCACCAGAACCTTTTCAGTACGCAGGTGGTTTTGATTCGGAGGCTATAGTGGCATTCGAAGGGCTGAGGAAAGTCGGGCTGAGCTGGGGAGGAGCTATTGTAAGAATTAGCATTGCAGAGCACATTCAACAAAAAGATGTGATACAAATGCTTGATCAAAAAAATTGGGAATTGTATACCTTAATCAAATCTATCGCAACTTGTTCAAACCATCGCAAGGCAAAGGCAAATGTCATAACAGCATAA

Protein sequence

MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPFQYAGGFDSEAIVAFEGLRKVGLSWGGAIVRISIAEHIQQKDVIQMLDQKNWELYTLIKSIATCSNHRKAKANVITA
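
The protein sequence corresponds to the coding sequence read three bases at a time. The sketch below illustrates that relationship with the standard genetic code; the `translate` function and its `to_stop` parameter are names chosen here for illustration, and the page does not say which tool produced its translation.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 0 with the standard code.

    Unknown codons (for example ones containing N) become 'X'.
    With to_stop=True, translation ends before the first stop codon.
    """
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons and last 5 codons of the CDS listed above.
print(translate("ATGCCGCAGGAAGAAGATGATGATATGGCCATGGAG"))
print(translate("GTCATAACAGCATAA"))
```

The two fragments are the start and end of the CDS above; their translations match the first and last residues of the listed protein.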
Homology
BLAST of Sgr015918 vs. NCBI nr
Match: KAA0060029.1 (protein CHUP1 [Cucumis melo var. makuwa])

HSP 1 Score: 474.2 bits (1219), Expect = 1.4e-129
Identity = 272/443 (61.40%), Positives = 320/443 (72.23%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP+E+D+++AMEIN L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKS
Sbjct: 1   MPKEKDEELAMEINCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFH+SMD++V  ADS P  P   +                  DKRE T+ PKQ   
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATVA-----------------GDKREVTKFPKQ--- 120

Query: 121 ITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKE
Sbjct: 121 -SSWDDVKESQRMTAVPASAPPPPPPPLPKKLLGGSKAVRRVPEVLDLYRTLTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG P VAF+KNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I++V
Sbjct: 181 NKVAHGGAPVVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISDV 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           E+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Sbjct: 241 EKFVKWLDVKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPK 300

Query: 301 EEMGVVVKRAQALQDRRE----SWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVR 360
           EEM VV+KRAQALQDRRE    S  +++           +  + F+ P    +  A   +
Sbjct: 301 EEMNVVLKRAQALQDRRECTINSVEQSVSNMERTREFNCKKYQAFQIPCQWMFDSALPTQ 360

Query: 361 HMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF 420
           +   K+        +VE IEA +GIH +DNKRTT NRN   RKP   R S+C+QGS    
Sbjct: 361 NSHTKH--------KVEHIEAGKGIHDKDNKRTTINRNLTSRKPFPPRGSLCIQGS---- 408

Query: 421 QYAGGFDSEAIVAFEGLRKVGLS 435
             +GGFDSEAI AFEGL+K GLS
Sbjct: 421 --SGGFDSEAIEAFEGLKKAGLS 408
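
The percentages on each HSP statistics line are the identical and positive counts divided by the alignment length. The sketch below recomputes them from the counts; `parse_hsp_stats` is a hypothetical helper, and the pattern is written only against the line layout used on this page (it accepts both spellings of the "Positives" label).

```python
import re

# Matches the statistics line of an HSP as it is laid out on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts and recompute the percentages from them."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, _, positives, aligned_pos, _ = match.groups()
    identical, aligned = int(identical), int(aligned)
    positives, aligned_pos = int(positives), int(aligned_pos)
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        "pct_identity": round(100 * identical / aligned, 2),
        "pct_positives": round(100 * positives / aligned_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 272/443 (61.40%), Positives = 320/443 (72.23%), Query Frame = 0"
)
print(stats)
```

For the first hit, 272 identical and 320 positive positions over 443 aligned columns give the reported 61.40% and 72.23%.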

BLAST of Sgr015918 vs. NCBI nr
Match: TYJ97286.1 (protein CHUP1 [Cucumis melo var. makuwa])

HSP 1 Score: 469.5 bits (1207), Expect = 3.4e-128
Identity = 275/439 (62.64%), Positives = 320/439 (72.89%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP+E+D+++AMEI+ L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFH+SMD++V  ADS P  P   + AA               DKRE T+ PKQ   
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNP---ATAAG--------------DKREVTKFPKQ--- 120

Query: 121 ITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKE
Sbjct: 121 -SSWDDVKESQRMTAVPASAPPPPPPPLPKKLLGGSKAVRRVPEVLDLYRTLTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG P VAF+KNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I+E 
Sbjct: 181 NKVAHGGAPVVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEA 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           E+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Sbjct: 241 EKFVKWLDVKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPK 300

Query: 301 EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQI 360
           EEM VV+KRAQALQDR E       M R R  +  +  + F+ P   C  +  S    Q 
Sbjct: 301 EEMNVVLKRAQALQDRVE--QSVSNMERTREFN-CKKYQAFQIP---CQWMFDSALPTQT 360

Query: 361 KNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPFQYAG 420
                +   ++VE IEA +GIH +DNKRTT NRN   RKP   R S+C+QGS      +G
Sbjct: 361 VPQDNNQVDNKVEHIEAGKGIHDKDNKRTTINRNLTSRKPFPPRGSLCIQGS------SG 406

Query: 421 GFDSEAIVAFEGLRKVGLS 435
           GFDSEAI AFEGL+K GLS
Sbjct: 421 GFDSEAIEAFEGLKKAGLS 406

BLAST of Sgr015918 vs. NCBI nr
Match: XP_022150972.1 (protein CHUP1, chloroplastic [Momordica charantia])

HSP 1 Score: 457.6 bits (1176), Expect = 1.3e-124
Identity = 269/444 (60.59%), Positives = 311/444 (70.05%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MPQEED+++AMEI SLRK+L+IAV KS+FLEKENQELRQE+GRLKSQIQSLKAH+N+RKS
Sbjct: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           +LWKKF+NSMD                             + P  TDKRE T+S  + P 
Sbjct: 61  LLWKKFYNSMD----------------------------AESPPATDKREATKSSPKQP- 120

Query: 121 ITAWAVVKENQR------KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQK 180
              W  VKE+QR       PAPAPPPPPLPTKLL GSKAVRRVPEVLELYRSLTKRDAQK
Sbjct: 121 --VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQK 180

Query: 181 ENKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITE 240
           ENK   GG+PAVAF+KNMIGEIENRSAYL+AIKSEVETHGEFVNWLIKEVE AAPR+ITE
Sbjct: 181 ENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITE 240

Query: 241 VERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNP 300
           VERFV WLD EL +LVDERAVLKHFPRWPEGKADALREAAFSYRDLK LESEV SF+DNP
Sbjct: 241 VERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNP 300

Query: 301 KEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASS-VRHM 360
           KEEMGVV+KRAQALQDR E     +   R    +  R+   F+ P    W   S  V  M
Sbjct: 301 KEEMGVVLKRAQALQDRLEQSVSNVEKTREFSCNKYRN---FRIPCE--WMFESGLVGQM 360

Query: 361 QIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQG---SPEP 420
           ++ +L  +    +  +   ++ + + DN +   N              + +QG   +   
Sbjct: 361 KLSSLRLA----KEYMRRITRELQSIDNTQQADN--------------LLLQGVRFAYRV 390

Query: 421 FQYAGGFDSEAIVAFEGLRKVGLS 435
            QYAGGFDS+AI AFEGL+KVGLS
Sbjct: 421 HQYAGGFDSDAIAAFEGLKKVGLS 390

BLAST of Sgr015918 vs. NCBI nr
Match: XP_023523072.1 (protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_023523080.1 protein CHUP1, chloroplastic isoform X2 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 450.3 bits (1157), Expect = 2.1e-122
Identity = 267/446 (59.87%), Positives = 309/446 (69.28%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP EED+++AMEI++L+++LEI++ KSNFLEKENQEL+QE+ R KS IQSLKAH+N+RKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHIQSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFHNSMDV+V   DSSPQ PP                    TDK E TR+ KQ   
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPP-------------------ATDKWETTRTQKQ--- 120

Query: 121 ITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            + WAVVKENQR     P PA PPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKE
Sbjct: 121 -SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG+PAVAF+KNMIGEIENRSAYLSAIKSEVETHGEFVN LI+EVEAAAPR+I EV
Sbjct: 181 NKAANGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEV 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           ERFVKWLDGEL +LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EV SF++NPK
Sbjct: 241 ERFVKWLDGELGSLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPK 300

Query: 301 EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSV 360
           EE   ++KRAQALQDR E           +  S V  TR F           C  +  S 
Sbjct: 301 EETNAMLKRAQALQDRLE-----------QSVSNVERTREFNCKKYNKFQIPCQWMLDSG 360

Query: 361 RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQG---S 420
              Q+K  +  L  + +  I           K    N  P          ++ +QG   +
Sbjct: 361 LPAQMKLSSLRLVKECMRRI----------TKEIQLNETPQTE-------NLFLQGVRFA 395

Query: 421 PEPFQYAGGFDSEAIVAFEGLRKVGL 434
               QYAGGFDSEAIVAFEG+++VGL
Sbjct: 421 YRVHQYAGGFDSEAIVAFEGMKQVGL 395

BLAST of Sgr015918 vs. NCBI nr
Match: XP_022998607.1 (protein CHUP1, chloroplastic [Cucurbita maxima])

HSP 1 Score: 449.5 bits (1155), Expect = 3.6e-122
Identity = 266/446 (59.64%), Positives = 309/446 (69.28%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP EED+++AMEI++L+++LEI++ KSNFLEKENQEL+QE+ R KS +QSLK H+N+RKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFHNSMDV+V   DSSPQ PP                    TDK E TR+ KQ   
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPP-------------------ATDKWETTRTQKQ--- 120

Query: 121 ITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            + WAVVKENQR     P PA PPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKE
Sbjct: 121 -SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG+PAVAF+KNMIGEIENRSAYLSAIKSEVETHGEFVN LI+EVEAAAPR+I EV
Sbjct: 181 NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEV 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           ERFVKWLDGELA+LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EV SF++NPK
Sbjct: 241 ERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPK 300

Query: 301 EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSV 360
           EE   ++KRAQALQDR E           +  S V  TR F           C  +  S 
Sbjct: 301 EETNAMLKRAQALQDRLE-----------QSVSNVERTREFNCNKYNKFQIPCQWMLDSG 360

Query: 361 RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQG---S 420
              Q+K  +  L  + +  I           K    N  P          ++ +QG   +
Sbjct: 361 LPAQMKLSSLRLVKECMRRI----------TKELQLNETPQTE-------NLFLQGVRFA 395

Query: 421 PEPFQYAGGFDSEAIVAFEGLRKVGL 434
               QYAGGFDSEAIVAFEG+++VGL
Sbjct: 421 YRVHQYAGGFDSEAIVAFEGMKQVGL 395

BLAST of Sgr015918 vs. ExPASy Swiss-Prot
Match: Q9LI74 (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 166.0 bits (419), Expect = 1.1e-39
Identity = 121/308 (39.29%), Positives = 169/308 (54.87%), Query Frame = 0

Query: 134 PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSK 193
           P P PPPP    +  GG   V R PE++E Y+SL KR+++KE      ++G   + A   
Sbjct: 698 PPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARN 757

Query: 194 NMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALV 253
           NMIGEIENRS +L A+K++VET G+FV  L  EV A++  +I ++  FV WLD EL+ LV
Sbjct: 758 NMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLV 817

Query: 254 DERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQD 313
           DERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       +K+   L +
Sbjct: 818 DERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLE 877

Query: 314 RRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFA 373
           + E    AL   R R  ++ R  + F  P +  W   + V        +Q+        A
Sbjct: 878 KVEQSVYAL--LRTRDMAISR-YKEFGIPVD--WLSDTGVVGKIKLSSVQLAKKYMKRVA 937

Query: 374 DEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA 430
            E++ +  S             +++P       +R  + +QG    F   Q+AGGFD+E+
Sbjct: 938 YELDSVSGS-------------DKDP-------NREFLLLQGVRFAFRVHQFAGGFDAES 979

BLAST of Sgr015918 vs. ExPASy TrEMBL
Match: A0A5A7V2M1 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold340G00750 PE=4 SV=1)

HSP 1 Score: 474.2 bits (1219), Expect = 6.7e-130
Identity = 272/443 (61.40%), Positives = 320/443 (72.23%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP+E+D+++AMEIN L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKS
Sbjct: 1   MPKEKDEELAMEINCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFH+SMD++V  ADS P  P   +                  DKRE T+ PKQ   
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNPATVA-----------------GDKREVTKFPKQ--- 120

Query: 121 ITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKE
Sbjct: 121 -SSWDDVKESQRMTAVPASAPPPPPPPLPKKLLGGSKAVRRVPEVLDLYRTLTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG P VAF+KNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I++V
Sbjct: 181 NKVAHGGAPVVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISDV 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           E+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Sbjct: 241 EKFVKWLDVKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPK 300

Query: 301 EEMGVVVKRAQALQDRRE----SWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVR 360
           EEM VV+KRAQALQDRRE    S  +++           +  + F+ P    +  A   +
Sbjct: 301 EEMNVVLKRAQALQDRRECTINSVEQSVSNMERTREFNCKKYQAFQIPCQWMFDSALPTQ 360

Query: 361 HMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF 420
           +   K+        +VE IEA +GIH +DNKRTT NRN   RKP   R S+C+QGS    
Sbjct: 361 NSHTKH--------KVEHIEAGKGIHDKDNKRTTINRNLTSRKPFPPRGSLCIQGS---- 408

Query: 421 QYAGGFDSEAIVAFEGLRKVGLS 435
             +GGFDSEAI AFEGL+K GLS
Sbjct: 421 --SGGFDSEAIEAFEGLKKAGLS 408

BLAST of Sgr015918 vs. ExPASy TrEMBL
Match: A0A5D3BE56 (Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G00800 PE=4 SV=1)

HSP 1 Score: 469.5 bits (1207), Expect = 1.6e-128
Identity = 275/439 (62.64%), Positives = 320/439 (72.89%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP+E+D+++AMEI+ L+K LEI++ KS FLE+ENQELR E+ RLKSQIQSLKA +NERKS
Sbjct: 1   MPKEKDEELAMEIDCLKKDLEISLQKSIFLERENQELRLELNRLKSQIQSLKALNNERKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFH+SMD++V  ADS P  P   + AA               DKRE T+ PKQ   
Sbjct: 61  ILWKKFHSSMDMAVAGADSPPLNP---ATAAG--------------DKREVTKFPKQ--- 120

Query: 121 ITAWAVVKENQR-----KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            ++W  VKE+QR       AP PPPPPLP KLLGGSKAVRRVPEVL+LYR+LTKRDAQKE
Sbjct: 121 -SSWDDVKESQRMTAVPASAPPPPPPPLPKKLLGGSKAVRRVPEVLDLYRTLTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG P VAF+KNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVE  APR+I+E 
Sbjct: 181 NKVAHGGAPVVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEMIAPRDISEA 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           E+FVKWLD +LA+LVDERAVLKHFPRWPE KADALREAAFSYRDLK LES+V  F+DNPK
Sbjct: 241 EKFVKWLDVKLASLVDERAVLKHFPRWPEAKADALREAAFSYRDLKSLESKVCMFRDNPK 300

Query: 301 EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQI 360
           EEM VV+KRAQALQDR E       M R R  +  +  + F+ P   C  +  S    Q 
Sbjct: 301 EEMNVVLKRAQALQDRVE--QSVSNMERTREFN-CKKYQAFQIP---CQWMFDSALPTQT 360

Query: 361 KNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPFQYAG 420
                +   ++VE IEA +GIH +DNKRTT NRN   RKP   R S+C+QGS      +G
Sbjct: 361 VPQDNNQVDNKVEHIEAGKGIHDKDNKRTTINRNLTSRKPFPPRGSLCIQGS------SG 406

Query: 421 GFDSEAIVAFEGLRKVGLS 435
           GFDSEAI AFEGL+K GLS
Sbjct: 421 GFDSEAIEAFEGLKKAGLS 406

BLAST of Sgr015918 vs. ExPASy TrEMBL
Match: A0A6J1DC83 (protein CHUP1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018994 PE=4 SV=1)

HSP 1 Score: 457.6 bits (1176), Expect = 6.5e-125
Identity = 269/444 (60.59%), Positives = 311/444 (70.05%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MPQEED+++AMEI SLRK+L+IAV KS+FLEKENQELRQE+GRLKSQIQSLKAH+N+RKS
Sbjct: 1   MPQEEDEELAMEITSLRKELQIAVDKSDFLEKENQELRQELGRLKSQIQSLKAHNNDRKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           +LWKKF+NSMD                             + P  TDKRE T+S  + P 
Sbjct: 61  LLWKKFYNSMD----------------------------AESPPATDKREATKSSPKQP- 120

Query: 121 ITAWAVVKENQR------KPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQK 180
              W  VKE+QR       PAPAPPPPPLPTKLL GSKAVRRVPEVLELYRSLTKRDAQK
Sbjct: 121 --VWVAVKESQRMPEGAPAPAPAPPPPPLPTKLLAGSKAVRRVPEVLELYRSLTKRDAQK 180

Query: 181 ENKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITE 240
           ENK   GG+PAVAF+KNMIGEIENRSAYL+AIKSEVETHGEFVNWLIKEVE AAPR+ITE
Sbjct: 181 ENKAAHGGFPAVAFTKNMIGEIENRSAYLTAIKSEVETHGEFVNWLIKEVEGAAPRDITE 240

Query: 241 VERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNP 300
           VERFV WLD EL +LVDERAVLKHFPRWPEGKADALREAAFSYRDLK LESEV SF+DNP
Sbjct: 241 VERFVNWLDRELGSLVDERAVLKHFPRWPEGKADALREAAFSYRDLKSLESEVCSFRDNP 300

Query: 301 KEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASS-VRHM 360
           KEEMGVV+KRAQALQDR E     +   R    +  R+   F+ P    W   S  V  M
Sbjct: 301 KEEMGVVLKRAQALQDRLEQSVSNVEKTREFSCNKYRN---FRIPCE--WMFESGLVGQM 360

Query: 361 QIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQG---SPEP 420
           ++ +L  +    +  +   ++ + + DN +   N              + +QG   +   
Sbjct: 361 KLSSLRLA----KEYMRRITRELQSIDNTQQADN--------------LLLQGVRFAYRV 390

Query: 421 FQYAGGFDSEAIVAFEGLRKVGLS 435
            QYAGGFDS+AI AFEGL+KVGLS
Sbjct: 421 HQYAGGFDSDAIAAFEGLKKVGLS 390

BLAST of Sgr015918 vs. ExPASy TrEMBL
Match: A0A6J1K8G4 (protein CHUP1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493194 PE=4 SV=1)

HSP 1 Score: 449.5 bits (1155), Expect = 1.8e-122
Identity = 266/446 (59.64%), Positives = 309/446 (69.28%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP EED+++AMEI++L+++LEI++ KSNFLEKENQEL+QE+ R KS +QSLK H+N+RKS
Sbjct: 1   MPMEEDEELAMEIDALKRELEISLQKSNFLEKENQELKQELARFKSHLQSLKPHNNDRKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFHNSMDV+V   DSSPQ PP                    TDK E TR+ KQ   
Sbjct: 61  ILWKKFHNSMDVAVAGTDSSPQSPP-------------------ATDKWETTRTQKQ--- 120

Query: 121 ITAWAVVKENQR----KPAPA-PPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            + WAVVKENQR     P PA PPPPPLPTKLLGGSKAVRRVPEVLELYR +TKRDAQKE
Sbjct: 121 -SNWAVVKENQRMAAAAPTPAPPPPPPLPTKLLGGSKAVRRVPEVLELYRLVTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG+PAVAF+KNMIGEIENRSAYLSAIKSEVETHGEFVN LI+EVEAAAPR+I EV
Sbjct: 181 NKATNGGFPAVAFTKNMIGEIENRSAYLSAIKSEVETHGEFVNRLIREVEAAAPRDIAEV 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           ERFVKWLDGELA+LVDERAVLKHFPRWPEGKADALREAAFSY+DLK LE+EV SF++NPK
Sbjct: 241 ERFVKWLDGELASLVDERAVLKHFPRWPEGKADALREAAFSYKDLKSLEAEVCSFRENPK 300

Query: 301 EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANG-----CWTLASSV 360
           EE   ++KRAQALQDR E           +  S V  TR F           C  +  S 
Sbjct: 301 EETNAMLKRAQALQDRLE-----------QSVSNVERTREFNCNKYNKFQIPCQWMLDSG 360

Query: 361 RHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQG---S 420
              Q+K  +  L  + +  I           K    N  P          ++ +QG   +
Sbjct: 361 LPAQMKLSSLRLVKECMRRI----------TKELQLNETPQTE-------NLFLQGVRFA 395

Query: 421 PEPFQYAGGFDSEAIVAFEGLRKVGL 434
               QYAGGFDSEAIVAFEG+++VGL
Sbjct: 421 YRVHQYAGGFDSEAIVAFEGMKQVGL 395

BLAST of Sgr015918 vs. ExPASy TrEMBL
Match: A0A0A0LVK7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1)

HSP 1 Score: 441.4 bits (1134), Expect = 4.8e-120
Identity = 266/442 (60.18%), Positives = 313/442 (70.81%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           MP+EED+ +AMEIN L+K+LEI++ KS FLEKENQELRQE+ RL+SQIQS KA +NERKS
Sbjct: 1   MPKEEDEVLAMEINCLKKELEISLQKSIFLEKENQELRQELNRLRSQIQSFKAQNNERKS 60

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           ILWKKFH+S+D+SV  ADS P  P   +                  DKRE T+SPKQ   
Sbjct: 61  ILWKKFHSSIDISVAGADSPPLSPATVA-----------------GDKRESTKSPKQ--- 120

Query: 121 ITAWAVVKENQRK---PA--PAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKE 180
            ++W  VKE+ R    PA  P PPPPPLPTKLLGGSKAVRRVPEVLELYR+LTKRDAQKE
Sbjct: 121 -SSWDDVKESHRMTGVPASPPPPPPPPLPTKLLGGSKAVRRVPEVLELYRTLTKRDAQKE 180

Query: 181 NKGNAGGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEV 240
           NK   GG PAVAF+KNMIGEIENRSAYLSAIKSEVETHG+FVNWLIKEVE  APR+I+EV
Sbjct: 181 NKVAHGGAPAVAFTKNMIGEIENRSAYLSAIKSEVETHGDFVNWLIKEVETIAPRDISEV 240

Query: 241 ERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPK 300
           ERFVKWLDG+LA+LVDERAVLK+FPRWPE KADALREAAFSYRDLKGLES+V  F+DNPK
Sbjct: 241 ERFVKWLDGKLASLVDERAVLKYFPRWPEAKADALREAAFSYRDLKGLESKVCMFRDNPK 300

Query: 301 EEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSVRHMQI 360
           EEM VV+KRAQALQDR E       M R R  +  R  + F+ P   C  +  S    QI
Sbjct: 301 EEMNVVLKRAQALQDRVE--QSVSNMERTREFN-CRKYQAFQIP---CQWMFDSALPTQI 360

Query: 361 KNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---Q 420
           K  T  L  +   +I  ++ + + +  +               R ++ +QG+   +   Q
Sbjct: 361 KMSTLRLAKE--YMIRITRELQSTETPQ---------------RENLFLQGARFAYRVHQ 398

Query: 421 YAGGFDSEAIVAFEGLRKVGLS 435
           YAGGFDSE I AFEGL+K GLS
Sbjct: 421 YAGGFDSETIEAFEGLKKAGLS 398

BLAST of Sgr015918 vs. TAIR 10
Match: AT1G07120.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast envelope; EXPRESSED IN: inflorescence meristem, petal, leaf whorl, flower; EXPRESSED DURING: 4 anthesis, petal differentiation and expansion stage; BEST Arabidopsis thaliana protein match is: Tetratricopeptide repeat (TPR)-like superfamily protein (TAIR:AT4G18570.1); Has 288 Blast hits to 260 proteins in 50 species: Archae - 0; Bacteria - 8; Metazoa - 27; Fungi - 15; Plants - 163; Viruses - 0; Other Eukaryotes - 75 (source: NCBI BLink). )

HSP 1 Score: 255.4 bits (651), Expect = 9.4e-68
Identity = 150/313 (47.92%), Positives = 203/313 (64.86%), Query Frame = 0

Query: 1   MPQEEDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKS 60
           +P  EDD    ++  L K+L+  + +++ LEKEN ELRQEV RL++Q+ +LK+H NERKS
Sbjct: 2   LPNGEDDS---DLLRLVKELQAYLVRNDKLEKENHELRQEVARLRAQVSNLKSHENERKS 61

Query: 61  ILWKKFHNSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPP 120
           +LWKK  +S D S T  D S  K PE                 ++  K +  R+P   P 
Sbjct: 62  MLWKKLQSSYDGSNT--DGSNLKAPES---------------VKSNTKGQEVRNPNPKPT 121

Query: 121 ITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKGNA 180
           I       + Q      PPPPPLP+K   G ++VRR PEV+E YR+LTKR++   NK N 
Sbjct: 122 I-------QGQSTATKPPPPPPLPSKRTLGKRSVRRAPEVVEFYRALTKRESHMGNKINQ 181

Query: 181 GGYPAVAFSKNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVK 240
            G  + AF++NMIGEIENRS YLS IKS+ + H + ++ LI +VEAA   +I+EVE FVK
Sbjct: 182 NGVLSPAFNRNMIGEIENRSKYLSDIKSDTDRHRDHIHILISKVEAATFTDISEVETFVK 241

Query: 241 WLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGV 300
           W+D EL++LVDERAVLKHFP+WPE K D+LREAA +Y+  K L +E+ SFKDNPK+ +  
Sbjct: 242 WIDEELSSLVDERAVLKHFPKWPERKVDSLREAACNYKRPKNLGNEILSFKDNPKDSLTQ 287

Query: 301 VVKRAQALQDRRE 314
            ++R Q+LQDR E
Sbjct: 302 ALQRIQSLQDRLE 287

BLAST of Sgr015918 vs. TAIR 10
Match: AT4G18570.1 (Tetratricopeptide repeat (TPR)-like superfamily protein )

HSP 1 Score: 196.4 bits (498), Expect = 5.2e-50
Identity = 162/464 (34.91%), Positives = 231/464 (49.78%), Query Frame = 0

Query: 5   EDDDMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSI--- 64
           E DD A+ ++   + L     KSN +        + VG L++  + +    N  KSI   
Sbjct: 187 ESDDHALSVSQRFQGLMDVSAKSNLIRS-----LKRVGSLRNLPEPITNQENTNKSISSS 246

Query: 65  --------------LWKKFHNSMDVSVTFADSS--------PQKPPEQSPAANDKPIRRT 124
                          + +  NS +++ + + S+        P+ PP++S +  D    R 
Sbjct: 247 GDADGDIYRKDEIESYSRSSNSEELTESSSLSTVRSRVPRVPKPPPKRSISLGDSTENRA 306

Query: 125 GDFPETTDKREPTRSPKQLPPITAWAVVKENQRKPAPAPPPPPLPTKLLGGSKAVRRVPE 184
              P+   K  P   P   PP+        +  K  P PPPPP P  L   S  VRRVPE
Sbjct: 307 DPPPQ---KSIPPPPPPPPPPLLQQPPPPPSVSKAPPPPPPPPPPKSLSIASAKVRRVPE 366

Query: 185 VLELYRSLTKRDAQKENKGNAGGYPAVA-------FSKNMIGEIENRSAYLSAIKSEVET 244
           V+E Y SL +RD+    + + GG  A A        +++MIGEIENRS YL AIK++VET
Sbjct: 367 VVEFYHSLMRRDSTNSRRDSTGGGNAAAEAILANSNARDMIGEIENRSVYLLAIKTDVET 426

Query: 245 HGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALRE 304
            G+F+ +LIKEV  AA  +I +V  FVKWLD EL+ LVDERAVLKHF  WPE KADALRE
Sbjct: 427 QGDFIRFLIKEVGNAAFSDIEDVVPFVKWLDDELSYLVDERAVLKHF-EWPEQKADALRE 486

Query: 305 AAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRS 364
           AAF Y DLK L SE   F+++P++     +K+ QAL ++ E    +L   R   ++  +S
Sbjct: 487 AAFCYFDLKKLISEASRFREDPRQSSSSALKKMQALFEKLEHGVYSLSRMRESAATKFKS 546

Query: 365 TRVFKSPANGCWTLASSVRHMQIKNLTWSLFADEVELI----EASQGIHAEDNKRTTFNR 424
              F+ P +  W L + +   QIK  +  L    ++ +    EA +G   E+ +      
Sbjct: 547 ---FQIPVD--WMLETGIT-SQIKLASVKLAMKYMKRVSAELEAIEGGGPEEEE------ 606

Query: 425 NPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEAIVAFEGLR 430
                        + VQG    F   Q+AGGFD+E + AFE LR
Sbjct: 607 -------------LIVQGVRFAFRVHQFAGGFDAETMKAFEELR 616

BLAST of Sgr015918 vs. TAIR 10
Match: AT1G48280.1 (hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 171.8 bits (434), Expect = 1.4e-42
Identity = 137/443 (30.93%), Positives = 220/443 (49.66%), Query Frame = 0

Query: 8   DMAMEINSLRKQLEIAVGKSNFLEKENQELRQEVGRLKSQIQSLKAHSNERKSILWKKFH 67
           D+ +++ +L+ +LE A   +  LE  N++L Q++   +++I SL ++    K     +F 
Sbjct: 134 DLQLQVLNLKTELEEARNSNVELELNNRKLSQDLVSAEAKISSLSSNDKPAKEHQNSRF- 193

Query: 68  NSMDVSVTFADSSPQKPPEQSPAANDKPIRRTGDFPETTDKREPTRSPKQLPPI------ 127
              D+    A    Q   ++  A             E++    P+ SP +LPP       
Sbjct: 194 --KDIQRLIASKLEQPKVKKEVAV------------ESSRLSPPSPSPSRLPPTPPLPKF 253

Query: 128 ---TAWAVVKENQRK-----PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQ 187
               A ++ K ++       P P PPPPP P + L  +   ++ P V +L++ L K+D  
Sbjct: 254 LVSPASSLGKRDENSSPFAPPTPPPPPPPPPPRPLAKAARAQKSPPVSQLFQLLNKQDNS 313

Query: 188 KENKGNAGGYPAVAFS--KNMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPRE 247
           +    +  G  +   S   +++GEI+NRSA+L AIK+++ET GEF+N LI++V      +
Sbjct: 314 RNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIETKGEFINDLIQKVLTTCFSD 373

Query: 248 ITEVERFVKWLDGELAALVDERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFK 307
           + +V +FV WLD ELA L DERAVLKHF +WPE KAD L+EAA  YR+LK LE E+ S+ 
Sbjct: 374 MEDVMKFVDWLDKELATLADERAVLKHF-KWPEKKADTLQEAAVEYRELKKLEKELSSYS 433

Query: 308 DNPKEEMGVVVKRAQALQDRRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLAS--- 367
           D+P    GV +K+   L D+ E   R L   RG   S +RS + FK P    W L S   
Sbjct: 434 DDPNIHYGVALKKMANLLDKSEQRIRRLVRLRG---SSMRSYQDFKIPVE--WMLDSGMI 493

Query: 368 -SVRHMQIKNLTWSLFADEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGS 427
             ++   IK L  +        +++++ +  E  K     +               V+ +
Sbjct: 494 CKIKRASIK-LAKTYMNRVANELQSARNLDRESTKEALLLQG--------------VRFA 540

Query: 428 PEPFQYAGGFDSEAIVAFEGLRK 431
               Q+AGG D E + A E +++
Sbjct: 554 YRTHQFAGGLDPETLCALEEIKQ 540
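
The percentages in each HSP statistics line are the match counts divided by the alignment length. A minimal sketch of that arithmetic follows, assuming the statistics line keeps the layout shown in the hits above; the function name `parse_hsp_stats` and the dictionary keys are illustrative and not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP block as printed on this page.
# "Posi?tives" also tolerates the "Postives" spelling that some report
# generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed values: count / alignment length, as a percentage.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 137/443 (30.93%), Positives = 220/443 (49.66%), Query Frame = 0"
    )
    print(stats["identity_pct"], stats["positive_pct"])
```

For the AT1G48280.1 hit, 137 identical positions over an alignment length of 443 gives 30.93%, and 220 positive positions gives 49.66%, in agreement with the reported values.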

BLAST of Sgr015918 vs. TAIR 10
Match: AT3G25690.1 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 166.0 bits (419), Expect = 7.5e-41
Identity = 121/308 (39.29%), Positives = 169/308 (54.87%), Query Frame = 0

Query: 134 PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSK 193
           P P PPPP    +  GG   V R PE++E Y+SL KR+++KE      ++G   + A   
Sbjct: 698 PPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARN 757

Query: 194 NMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALV 253
           NMIGEIENRS +L A+K++VET G+FV  L  EV A++  +I ++  FV WLD EL+ LV
Sbjct: 758 NMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLV 817

Query: 254 DERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQD 313
           DERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       +K+   L +
Sbjct: 818 DERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLE 877

Query: 314 RRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFA 373
           + E    AL   R R  ++ R  + F  P +  W   + V        +Q+        A
Sbjct: 878 KVEQSVYAL--LRTRDMAISR-YKEFGIPVD--WLSDTGVVGKIKLSSVQLAKKYMKRVA 937

Query: 374 DEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA 430
            E++ +  S             +++P       +R  + +QG    F   Q+AGGFD+E+
Sbjct: 938 YELDSVSGS-------------DKDP-------NREFLLLQGVRFAFRVHQFAGGFDAES 979

BLAST of Sgr015918 vs. TAIR 10
Match: AT3G25690.2 (Hydroxyproline-rich glycoprotein family protein )

HSP 1 Score: 166.0 bits (419), Expect = 7.5e-41
Identity = 121/308 (39.29%), Positives = 169/308 (54.87%), Query Frame = 0

Query: 134 PAPAPPPPPLPTKLLGGSKAVRRVPEVLELYRSLTKRDAQKENKG---NAGGYPAVAFSK 193
           P P PPPP    +  GG   V R PE++E Y+SL KR+++KE      ++G   + A   
Sbjct: 698 PPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSLISSGTGNSSAARN 757

Query: 194 NMIGEIENRSAYLSAIKSEVETHGEFVNWLIKEVEAAAPREITEVERFVKWLDGELAALV 253
           NMIGEIENRS +L A+K++VET G+FV  L  EV A++  +I ++  FV WLD EL+ LV
Sbjct: 758 NMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLAFVSWLDEELSFLV 817

Query: 254 DERAVLKHFPRWPEGKADALREAAFSYRDLKGLESEVRSFKDNPKEEMGVVVKRAQALQD 313
           DERAVLKHF  WPEGKADALREAAF Y+DL  LE +V SF D+P       +K+   L +
Sbjct: 818 DERAVLKHFD-WPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSCEPALKKMYKLLE 877

Query: 314 RRESWSRALGMWRGRGSSVVRSTRVFKSPANGCWTLASSV------RHMQIKNLTWSLFA 373
           + E    AL   R R  ++ R  + F  P +  W   + V        +Q+        A
Sbjct: 878 KVEQSVYAL--LRTRDMAISR-YKEFGIPVD--WLSDTGVVGKIKLSSVQLAKKYMKRVA 937

Query: 374 DEVELIEASQGIHAEDNKRTTFNRNPAPRKPLSSRCSICVQGSPEPF---QYAGGFDSEA 430
            E++ +  S             +++P       +R  + +QG    F   Q+AGGFD+E+
Sbjct: 938 YELDSVSGS-------------DKDP-------NREFLLLQGVRFAFRVHQFAGGFDAES 979

The following BLAST results are available for this feature:
Match Name        E-value     Identity   Description
KAA0060029.1      1.4e-129    61.40      protein CHUP1 [Cucumis melo var. makuwa]
TYJ97286.1        3.4e-128    62.64      protein CHUP1 [Cucumis melo var. makuwa]
XP_022150972.1    1.3e-124    60.59      protein CHUP1, chloroplastic [Momordica charantia]
XP_023523072.1    2.1e-122    59.87      protein CHUP1, chloroplastic isoform X1 [Cucurbita pepo subsp. pepo] >XP_0235230...
XP_022998607.1    3.6e-122    59.64      protein CHUP1, chloroplastic [Cucurbita maxima]

Match Name        E-value     Identity   Description
Q9LI74            1.1e-39     39.29      Protein CHUP1, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=CHUP1 PE=1 SV=1

Match Name        E-value     Identity   Description
A0A5A7V2M1        6.7e-130    61.40      Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold340G00750 ...
A0A5D3BE56        1.6e-128    62.64      Protein CHUP1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold194G00800 ...
A0A6J1DC83        6.5e-125    60.59      protein CHUP1, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111018994 PE=4...
A0A6J1K8G4        1.8e-122    59.64      protein CHUP1, chloroplastic OS=Cucurbita maxima OX=3661 GN=LOC111493194 PE=4 SV...
A0A0A0LVK7        4.8e-120    60.18      Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G532360 PE=4 SV=1

Match Name        E-value     Identity   Description
AT1G07120.1       9.4e-68     47.92      FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT4G18570.1       5.2e-50     34.91      Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G48280.1       1.4e-42     30.93      hydroxyproline-rich glycoprotein family protein
AT3G25690.1       7.5e-41     39.29      Hydroxyproline-rich glycoprotein family protein
AT3G25690.2       7.5e-41     39.29      Hydroxyproline-rich glycoprotein family protein
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR Term    IPR Description      Source        Source Term      Source Description             Alignment
None        No IPR available     COILS         Coil             Coil                           coord: 13..54
None        No IPR available     MOBIDB_LITE   mobidb-lite      disorder_prediction            coord: 130..149
None        No IPR available     MOBIDB_LITE   mobidb-lite      disorder_prediction            coord: 97..114
None        No IPR available     MOBIDB_LITE   mobidb-lite      disorder_prediction            coord: 74..120
None        No IPR available     PANTHER       PTHR31342:SF48   CHUP1-LIKE PROTEIN             coord: 5..314
IPR040265   Protein CHUP1-like   PANTHER       PTHR31342        PROTEIN CHUP1, CHLOROPLASTIC   coord: 5..314
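
The three mobidb-lite rows above report overlapping ranges (97..114 lies inside 74..120), so counting predicted disordered residues requires merging the intervals first. The sketch below shows one way to do that, assuming the `coord: start..end` values are 1-based, inclusive residue positions; that convention and the helper name `merge_intervals` are assumptions, not something this page states.

```python
import re

# Extracts the start and end from an alignment cell such as "coord: 74..120".
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def merge_intervals(intervals):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(intervals):
        # With inclusive coordinates, an interval starting at end + 1
        # is contiguous with the previous one, so it is merged as well.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


# Alignment cells of the three mobidb-lite rows, in the order listed.
cells = ["coord: 130..149", "coord: 97..114", "coord: 74..120"]
intervals = [tuple(int(x) for x in COORD.search(c).groups()) for c in cells]

merged = merge_intervals(intervals)
covered = sum(end - start + 1 for start, end in merged)

if __name__ == "__main__":
    print(merged)   # merged, non-overlapping regions
    print(covered)  # number of residues covered by at least one region
```

Under these assumptions the rows collapse to two regions, 74..120 and 130..149, covering 67 residues in total.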

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name   Type
Sgr015918.1    Sgr015918.1   mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009658 chloroplast organization
cellular_component GO:0009707 chloroplast outer membrane