CmoCh20G003970 (gene) Cucurbita moschata (Rifu) v1

Overview
NameCmoCh20G003970
Typegene
OrganismCucurbita moschata (Cucurbita moschata (Rifu) v1)
DescriptionUPF0503 protein At3g09070, chloroplastic-like
LocationCmo_Chr20: 1896622 .. 1898325 (-)
RNA-Seq ExpressionCmoCh20G003970
SyntenyCmoCh20G003970
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATCTGCAGGCCAAAACTGTATCGCATCGGCTTTCCTCATGTCACCGTCATCCTACCAAGCCAGTAACCGGATTCTGCGCCTCCTGCCTCCGTGAGCGCCTTGCTGGGATTGATCTCGATACGCAACAGGAATCGCCTCTTCAGAAACACCTTTCATCATTGGAGCTCCGTCGGAGCAAATCCTTCTCTGCAGCGAAGCGTGATGCCAGCATCGGGCGAATGCAGGTGCAGCATCGGAGATCGTGCGATGTTCGCTCTGGAAACTCCTTGTCGGACCTTTTCTGTCGAGAAGATAAGCCGAGATGTAGGAATCAGCAGGTGGAGATCGAATCCGAGAATTTAGGTTTTGAATTGCATGAGGTTGTGGCAAATGAGAGACAATTTAGGGCATCCGGAGGGACAATTGGACCGGCTCTTGGTACGATCGATGATTTTGCTGGAGAGGAGGCTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTCGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAAAAAAGAAAGAATCTCGGTAACAATTGCAATGTAGATGCGGTGACAACAGAGGCTATCAAGTCGAGAGTGCTTGAAATTGGAGAGACTCGTTCGGAGGTTGGAGACTATGGATTGGGTAGAAGGTCTTGTGATACAGATCCAAGATTCTCCGCCGATACAGGTAGAATGTCGTTGGATGATTCACGGTATTCATTCGATGAGGCAAGGGCTTCTTGGGATGGATATCTGATTGGAAAAACTTATCCGAAGATTACGCCGATGGTTTCAGTTTTGGAGGAGGCCAAATTCCCCGGTACTGAATTTGAGAAAGACAATCCTTCTGATGAAGCAGAAGGCTCTCCGAAGAATGTAGGAGATAAAATCCCTGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGCTCAAGTTCACACAGAAAAGGGGCATCAGCGGACTTTGATGACTTGAAATTGATATCAAACGCAAAGGTATCTCCTGCAACAACTGAGCTGTTCTATGGGGCAAAGGTGCTACTTACAGAGAAAGATTTGAACAAGTCTCGCTCAAAAGCCACCAGAGATGGCGAATTGAGTGGCACTGATGTTACTTCCAAAGATTCTGTTCTTGATGCAGCTGGGATTGACCGAAAGACGTTCAAGAAAGTGCATAGATGGCGTAAAGTATTAAGTGTTCTTGGTATGATGCATAAGCGAAGTGGTGAAAGCAAGTCTGATGATGAAGAAAGTTGTGATCGGCCTATTGCTGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAATGGCGAAGCGAACTCTTGCGTTAGCCAGAAGCTCATTCGTAGTTACAGCGTAAGCTGTCGAGATCGGAGCAAAGTAGCTGGATTTAATGGCGGGAATGATTTAAAACTGAACGGTTCGAGGTGGAGAGACGATTTTACGTTGCGGAGGAATCGGAGTGTAAGGTATTCGCCAAATAACTTTGATAATGGCTTGTTAAGGTTCTATTTGACACCATTGAGGAGCCACAGCAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTTCTTTCAATGTCAAGCATGTCGTGTAA

mRNA sequence

ATGAATCTGCAGGCCAAAACTGTATCGCATCGGCTTTCCTCATGTCACCGTCATCCTACCAAGCCAGTAACCGGATTCTGCGCCTCCTGCCTCCGTGAGCGCCTTGCTGGGATTGATCTCGATACGCAACAGGAATCGCCTCTTCAGAAACACCTTTCATCATTGGAGCTCCGTCGGAGCAAATCCTTCTCTGCAGCGAAGCGTGATGCCAGCATCGGGCGAATGCAGGTGCAGCATCGGAGATCGTGCGATGTTCGCTCTGGAAACTCCTTGTCGGACCTTTTCTGTCGAGAAGATAAGCCGAGATGTAGGAATCAGCAGGTGGAGATCGAATCCGAGAATTTAGGTTTTGAATTGCATGAGGTTGTGGCAAATGAGAGACAATTTAGGGCATCCGGAGGGACAATTGGACCGGCTCTTGGTACGATCGATGATTTTGCTGGAGAGGAGGCTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTCGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAAAAAAGAAAGAATCTCGGTAACAATTGCAATGTAGATGCGGTGACAACAGAGGCTATCAAGTCGAGAGTGCTTGAAATTGGAGAGACTCGTTCGGAGGTTGGAGACTATGGATTGGGTAGAAGGTCTTGTGATACAGATCCAAGATTCTCCGCCGATACAGGTAGAATGTCGTTGGATGATTCACGGTATTCATTCGATGAGGCAAGGGCTTCTTGGGATGGATATCTGATTGGAAAAACTTATCCGAAGATTACGCCGATGGTTTCAGTTTTGGAGGAGGCCAAATTCCCCGGTACTGAATTTGAGAAAGACAATCCTTCTGATGAAGCAGAAGGCTCTCCGAAGAATGTAGGAGATAAAATCCCTGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGCTCAAGTTCACACAGAAAAGGGGCATCAGCGGACTTTGATGACTTGAAATTGATATCAAACGCAAAGGTATCTCCTGCAACAACTGAGCTGTTCTATGGGGCAAAGGTGCTACTTACAGAGAAAGATTTGAACAAGTCTCGCTCAAAAGCCACCAGAGATGGCGAATTGAGTGGCACTGATGTTACTTCCAAAGATTCTGTTCTTGATGCAGCTGGGATTGACCGAAAGACGTTCAAGAAAGTGCATAGATGGCGTAAAGTATTAAGTGTTCTTGGTATGATGCATAAGCGAAGTGGTGAAAGCAAGTCTGATGATGAAGAAAGTTGTGATCGGCCTATTGCTGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAATGGCGAAGCGAACTCTTGCGTTAGCCAGAAGCTCATTCGTAGTTACAGCGTAAGCTGTCGAGATCGGAGCAAAGTAGCTGGATTTAATGGCGGGAATGATTTAAAACTGAACGGTTCGAGGTGGAGAGACGATTTTACGTTGCGGAGGAATCGGAGTGTAAGGTATTCGCCAAATAACTTTGATAATGGCTTGTTAAGGTTCTATTTGACACCATTGAGGAGCCACAGCAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTTCTTTCAATGTCAAGCATGTCGTGTAA

Coding sequence (CDS)

ATGAATCTGCAGGCCAAAACTGTATCGCATCGGCTTTCCTCATGTCACCGTCATCCTACCAAGCCAGTAACCGGATTCTGCGCCTCCTGCCTCCGTGAGCGCCTTGCTGGGATTGATCTCGATACGCAACAGGAATCGCCTCTTCAGAAACACCTTTCATCATTGGAGCTCCGTCGGAGCAAATCCTTCTCTGCAGCGAAGCGTGATGCCAGCATCGGGCGAATGCAGGTGCAGCATCGGAGATCGTGCGATGTTCGCTCTGGAAACTCCTTGTCGGACCTTTTCTGTCGAGAAGATAAGCCGAGATGTAGGAATCAGCAGGTGGAGATCGAATCCGAGAATTTAGGTTTTGAATTGCATGAGGTTGTGGCAAATGAGAGACAATTTAGGGCATCCGGAGGGACAATTGGACCGGCTCTTGGTACGATCGATGATTTTGCTGGAGAGGAGGCTGAGTTCAAGACGGTGAAAGAGTTTATAGATCTCGAATTTCGGAGGAAGAAGAATGCAGGTCGAGATTTAAGAGAAATTGCAGGGAGTGTCTGGGAAGCGGCTTCAGTCTTCAGCAAGAAACTCGGAAAATGGAGGAAAAAGCAAAAAAGAAAGAATCTCGGTAACAATTGCAATGTAGATGCGGTGACAACAGAGGCTATCAAGTCGAGAGTGCTTGAAATTGGAGAGACTCGTTCGGAGGTTGGAGACTATGGATTGGGTAGAAGGTCTTGTGATACAGATCCAAGATTCTCCGCCGATACAGGTAGAATGTCGTTGGATGATTCACGGTATTCATTCGATGAGGCAAGGGCTTCTTGGGATGGATATCTGATTGGAAAAACTTATCCGAAGATTACGCCGATGGTTTCAGTTTTGGAGGAGGCCAAATTCCCCGGTACTGAATTTGAGAAAGACAATCCTTCTGATGAAGCAGAAGGCTCTCCGAAGAATGTAGGAGATAAAATCCCTGGTGGATCGGCTCAGACTAAAGATTACTATATGGATTCATTGTCTTCTCTGAGGCGGAGGAAGAGTTTTGATCGCTCAAGTTCACACAGAAAAGGGGCATCAGCGGACTTTGATGACTTGAAATTGATATCAAACGCAAAGGTATCTCCTGCAACAACTGAGCTGTTCTATGGGGCAAAGGTGCTACTTACAGAGAAAGATTTGAACAAGTCTCGCTCAAAAGCCACCAGAGATGGCGAATTGAGTGGCACTGATGTTACTTCCAAAGATTCTGTTCTTGATGCAGCTGGGATTGACCGAAAGACGTTCAAGAAAGTGCATAGATGGCGTAAAGTATTAAGTGTTCTTGGTATGATGCATAAGCGAAGTGGTGAAAGCAAGTCTGATGATGAAGAAAGTTGTGATCGGCCTATTGCTGAGTCTTGGGAAAAGCTGAGGCGTGTTGCTAATGGCGAAGCGAACTCTTGCGTTAGCCAGAAGCTCATTCGTAGTTACAGCGTAAGCTGTCGAGATCGGAGCAAAGTAGCTGGATTTAATGGCGGGAATGATTTAAAACTGAACGGTTCGAGGTGGAGAGACGATTTTACGTTGCGGAGGAATCGGAGTGTAAGGTATTCGCCAAATAACTTTGATAATGGCTTGTTAAGGTTCTATTTGACACCATTGAGGAGCCACAGCAGAGGCAAACCAGGAAAGAGCAGGCCAAGAAGTTCTTCTTTCAATGTCAAGCATGTCGTGTAA

Protein sequence

MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRSKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELHEVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRRSCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEFEKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQKLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYLTPLRSHSRGKPGKSRPRSSSFNVKHVV
Homology
BLAST of CmoCh20G003970 vs. ExPASy Swiss-Prot
Match: Q9SS80 (Protein OCTOPUS OS=Arabidopsis thaliana OX=3702 GN=OPS PE=1 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 1.4e-56
Identity = 222/668 (33.23%), Postives = 305/668 (45.66%), Query Frame = 0

Query: 10  HRLS-SCHRHPTKPVTGFCASCLRERLAGID------LDTQQESPLQKHLSSL------- 69
           HRLS SC+RHP +  TGFC SCL ERL+ +D        +  + P     ++L       
Sbjct: 25  HRLSTSCNRHPEERFTGFCPSCLCERLSVLDQTNNGGSSSSSKKPPTISAAALKALFKPS 84

Query: 70  ----------------------ELRRSKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSD 129
                                 ELRR+KSFSA+K +     +    RRSCDVR  +SL +
Sbjct: 85  GNNGVGGVNTNGNGRVKPGFFPELRRTKSFSASKNNEGFSGVFEPQRRSCDVRLRSSLWN 144

Query: 130 LFCRED---------------KPRCR---------NQQVEIESENLGFELHE----VVAN 189
           LF +++               +PR           N + E ES++   E  E    V A 
Sbjct: 145 LFSQDEQRNLPSNVTGGEIDVEPRKSSVAEPVLEVNDEGEAESDDEELEEEEEEDYVEAG 204

Query: 190 ERQ-FRASGGTIGPALGTIDDFAGE------------EAEFKTVKEFIDLEFRRKKNAGR 249
           + +    SG  +      I +   E            E E K +K++IDL+ + KK +  
Sbjct: 205 DFEILNDSGELMREKSDEIVEVREEIEEAVKPTKGLSEEELKPIKDYIDLDSQTKKPS-- 264

Query: 250 DLREIAGSVWEAASVFSKKLGKWRKKQKRKNL--GNNCNVDAVTTEAIKSRVLEIGETRS 309
               +  S W AASVFSKKL KWR+ QK K    G +    +      K    ++ +T+S
Sbjct: 265 ----VRRSFWSAASVFSKKLQKWRQNQKMKKRRNGGDHRPGSARLPVEKPIGRQLRDTQS 324

Query: 310 EVGDYGLGRRSCDTDP--------------RFSADTGRMSLDDSRYSFDEARASWDGYLI 369
           E+ DYG GRRSCDTDP              RFS D GR+SLDD RYSFDE RASWDG LI
Sbjct: 325 EIADYGYGRRSCDTDPRFSLDAGRFSLDAGRFSVDIGRISLDDPRYSFDEPRASWDGSLI 384

Query: 370 GKTY-------PKITPMVSVLEEAKFP----GTEFEKDNPSDEAEGSPKNVGDK------ 429
           G+T        P    M+SV+E+A  P     T  +   P +E    P  V         
Sbjct: 385 GRTMFPPAARAPPPPSMLSVVEDAPPPVHRHVTRADMQFPVEEPAPPPPVVNQTNGVSDP 444

Query: 430 --IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPAT 489
             IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+   AD D+ KL     VS A 
Sbjct: 445 VIIPGGSIQTRDYYTD--SSSRRRKSLDRSSSSMRKTAAAVVADMDEPKL----SVSSAI 504

Query: 490 TELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFKKVHRWRKV 549
           +   Y   +    +D N    +   +G      +   D  +++        KK  RW K 
Sbjct: 505 SIDAYSGSL----RDNNNYAVETADNGSFREPAMMIGDRKVNS----NDNNKKSRRWGK- 564

BLAST of CmoCh20G003970 vs. ExPASy Swiss-Prot
Match: Q9LFB9 (Protein OCTOPUS-like OS=Arabidopsis thaliana OX=3702 GN=OPSL1 PE=2 SV=1)

HSP 1 Score: 194.5 bits (493), Expect = 3.2e-48
Identity = 193/593 (32.55%), Postives = 272/593 (45.87%), Query Frame = 0

Query: 10  HRLS-SCHRHPTKPVTGFCASCLRERLAGIDLD------TQQESPLQKHLSSL------- 69
           HRLS SC  HP +  +GFC SCL +RL+ +D +      +    P      SL       
Sbjct: 23  HRLSTSCDLHPEERFSGFCPSCLCDRLSVLDHNAAPPPSSSSRKPPSISAVSLKALFKPS 82

Query: 70  -------------------ELRRSKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFC 129
                              ELRR+KSFSA   +   G  + Q RRSCDVR  +   +L  
Sbjct: 83  SSGTNNSNGNGRVRPGFFPELRRTKSFSAKNNEGFSGGFEPQ-RRSCDVRLRDDERNLPI 142

Query: 130 REDKPRCRNQQVEIESENLGFELHEVVANERQFRASGGTIGPALGTI-----DDFAGEEA 189
            E     + ++   ES      L      E +     G   P  G I      +   EE 
Sbjct: 143 NEAASVDKIEEEARESSVSEIVLEVTEEAEIEEDEENGEKDP--GEIVEEKSSEIGEEEE 202

Query: 190 EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLGNNCNVD 249
           E K +K+++DL  + KK +   +++ AGS + AASVFSKKL KW++KQK K   N     
Sbjct: 203 ELKPMKDYMDLYSQTKKPS---VKDFAGSFFSAASVFSKKLQKWKQKQKVKKPRNGVGG- 262

Query: 250 AVTTEAIKSRVLEIGETRSEVGDYGLGRRSCDTDP-------RFSADTGRMSLDDSRYSF 309
                         G  +SE+   G+GRRS DTDP       RFS D GR+S+DDSRYS 
Sbjct: 263 --------------GRPQSEI---GVGRRSSDTDPRFSLDAGRFSVDIGRISMDDSRYSL 322

Query: 310 DEARASWDGYLIGKTYPKITP----MVSVLEEAKFPGTEFE-KDNPSDEAEGSPKNVGDK 369
           DE RASWDG+LIG+T     P    M+SV+E A    ++ +   +PS +      +    
Sbjct: 323 DEPRASWDGHLIGRTTAARVPLPPSMLSVVENAPLNRSDMQIPSSPSIKPISGDSDPIII 382

Query: 370 IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYG 429
           IPGGS QT+DYY    SS RRRKS DRS+S RK    + +D+K +SN+  +         
Sbjct: 383 IPGGSNQTRDYYTGPPSS-RRRKSLDRSNSIRK-IVTELEDVKSVSNSTTT--------- 442

Query: 430 AKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFKKVHRWRKVLSVLGM 489
               +    +  + +K  ++G+                       KK  RW K  S+LG 
Sbjct: 443 ----IDSNSMETAENKGNQNGD-----------------------KKSRRWGK-WSILGF 502

Query: 490 MHKRSGESKSDDEES-------CDRPIAESWEKLRRVANGEANSCVSQKLIRSYS-VSCR 545
           ++++  + + +D  S        +R ++ESW ++R   NGE       K+ RS S VS R
Sbjct: 503 IYRKGKDDEEEDRYSRSNSAGMVERSLSESWPEMR---NGEGG---GPKMRRSNSNVSWR 525

BLAST of CmoCh20G003970 vs. ExPASy TrEMBL
Match: A0A6J1FY88 (UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=LOC111448423 PE=4 SV=1)

HSP 1 Score: 1121.3 bits (2899), Expect = 0.0e+00
Identity = 567/567 (100.00%), Postives = 567/567 (100.00%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS
Sbjct: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH
Sbjct: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF
Sbjct: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
           EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD
Sbjct: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480
           RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ
Sbjct: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480

Query: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540
           KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL
Sbjct: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540

Query: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           TPLRSHSRGKPGKSRPRSSSFNVKHVV
Sbjct: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 567

BLAST of CmoCh20G003970 vs. ExPASy TrEMBL
Match: A0A6J1JH66 (UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC111484453 PE=4 SV=1)

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 554/567 (97.71%), Postives = 561/567 (98.94%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKH S LELRRS
Sbjct: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHHSPLELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KSFSAAKRDASIGRMQVQHR+SCDVRSGNSLSDLFCREDKPRC NQQVEIES+NLGFELH
Sbjct: 61  KSFSAAKRDASIGRMQVQHRKSCDVRSGNSLSDLFCREDKPRCTNQQVEIESKNLGFELH 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGR 
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRM 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFSAD GR+SLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF
Sbjct: 241 SCDTDPRFSADAGRISLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
            KD+PSDE+EGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD
Sbjct: 301 MKDDPSDESEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDG+LSGTDVTSKDSVLDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGDLSGTDVTSKDSVLDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480
           RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCD+PIAESWEKLRRVANGEANSCVSQ
Sbjct: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDQPIAESWEKLRRVANGEANSCVSQ 480

Query: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540
           KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL
Sbjct: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540

Query: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           TPLRSHSRGKPGKSRPRSSSFNVKHVV
Sbjct: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 567

BLAST of CmoCh20G003970 vs. ExPASy TrEMBL
Match: A0A5A7V3J1 (UPF0503 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold571G00450 PE=4 SV=1)

HSP 1 Score: 954.5 bits (2466), Expect = 2.0e-274
Identity = 489/574 (85.19%), Postives = 516/574 (89.90%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPL-QKHLSSLELRR 60
           MNLQ K+VSHRLS+CHRHP+KPVTGFCASCLRERLAGID D Q ESPL   H SS ELRR
Sbjct: 1   MNLQLKSVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDLQHESPLPNNHSSSAELRR 60

Query: 61  SKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFEL 120
           SKS+SAAK +A IG+ ++QHR+SCDVRSGNSLSDLFCREDKPRC N +VEIESENLGFEL
Sbjct: 61  SKSYSAAKCEAGIGQSELQHRKSCDVRSGNSLSDLFCREDKPRCTNPEVEIESENLGFEL 120

Query: 121 HEVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180
            EVV N RQFRAS G IGP LGTIDDFAGE+AEFKTVKEFIDLEFRRKKNAGRDLREIAG
Sbjct: 121 REVVGNGRQFRASEGIIGPGLGTIDDFAGEDAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180

Query: 181 SVWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGR 240
           SVWEAASVFSKKL KWRKKQKRKNLGNN NV AV  E IK R LEI ETRSEVG+YGLGR
Sbjct: 181 SVWEAASVFSKKLSKWRKKQKRKNLGNNSNVGAVKVEDIKPRALEIRETRSEVGEYGLGR 240

Query: 241 RSCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTE 300
           RSCDTDPRFS D GRMSLDDSRYSFDE RASWDGYLIGKTYP+ITPMVSVLEEAKF G  
Sbjct: 241 RSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGAG 300

Query: 301 FEKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFD 360
           FEKD+PSDEAEGSP NVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGAS DFD
Sbjct: 301 FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSCSHRKGASGDFD 360

Query: 361 DLKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGI 420
           +LKLISNAKVSPATTELFYGAKVL+TEKDLN SR KAT DG+LSGTDVTSKDSV DA  I
Sbjct: 361 ELKLISNAKVSPATTELFYGAKVLITEKDLNSSRPKATGDGDLSGTDVTSKDSVPDAPVI 420

Query: 421 DRKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEES------CDRPIAESWEKLRRVANGE 480
           DRK+FKKVHRWRKVLSVLGM+ KR+GESKSDDEES       DRP+ ESWEKLRRVANGE
Sbjct: 421 DRKSFKKVHRWRKVLSVLGMIQKRNGESKSDDEESSVAGNVVDRPVVESWEKLRRVANGE 480

Query: 481 ANSCVSQKLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDN 540
           ANSCVSQKLIRSYSVSCRD SK+AGFNGGND KLN +RWRDDFTL+RNRSVRYSPNNFDN
Sbjct: 481 ANSCVSQKLIRSYSVSCRDPSKLAGFNGGNDSKLNVTRWRDDFTLQRNRSVRYSPNNFDN 540

Query: 541 GLLRFYLTPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           GLLRFYLTPLRS+SRGK GKSRPR+S FNVKHV+
Sbjct: 541 GLLRFYLTPLRSYSRGKLGKSRPRNSPFNVKHVI 574

BLAST of CmoCh20G003970 vs. ExPASy TrEMBL
Match: A0A1S3CKL0 (UPF0503 protein At3g09070, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502039 PE=4 SV=1)

HSP 1 Score: 954.5 bits (2466), Expect = 2.0e-274
Identity = 489/574 (85.19%), Postives = 516/574 (89.90%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPL-QKHLSSLELRR 60
           MNLQ K+VSHRLS+CHRHP+KPVTGFCASCLRERLAGID D Q ESPL   H SS ELRR
Sbjct: 1   MNLQLKSVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDLQHESPLPNNHSSSAELRR 60

Query: 61  SKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFEL 120
           SKS+SAAK +A IG+ ++QHR+SCDVRSGNSLSDLFCREDKPRC N +VEIESENLGFEL
Sbjct: 61  SKSYSAAKCEAGIGQSELQHRKSCDVRSGNSLSDLFCREDKPRCTNPEVEIESENLGFEL 120

Query: 121 HEVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180
            EVV N RQFRAS G IGP LGTIDDFAGE+AEFKTVKEFIDLEFRRKKNAGRDLREIAG
Sbjct: 121 REVVGNGRQFRASEGIIGPGLGTIDDFAGEDAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180

Query: 181 SVWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGR 240
           SVWEAASVFSKKL KWRKKQKRKNLGNN NV AV  E IK R LEI ETRSEVG+YGLGR
Sbjct: 181 SVWEAASVFSKKLSKWRKKQKRKNLGNNSNVGAVKVEDIKPRALEIRETRSEVGEYGLGR 240

Query: 241 RSCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTE 300
           RSCDTDPRFS D GRMSLDDSRYSFDE RASWDGYLIGKTYP+ITPMVSVLEEAKF G  
Sbjct: 241 RSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGAG 300

Query: 301 FEKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFD 360
           FEKD+PSDEAEGSP NVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGAS DFD
Sbjct: 301 FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSCSHRKGASGDFD 360

Query: 361 DLKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGI 420
           +LKLISNAKVSPATTELFYGAKVL+TEKDLN SR KAT DG+LSGTDVTSKDSV DA  I
Sbjct: 361 ELKLISNAKVSPATTELFYGAKVLITEKDLNSSRPKATGDGDLSGTDVTSKDSVPDAPVI 420

Query: 421 DRKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEES------CDRPIAESWEKLRRVANGE 480
           DRK+FKKVHRWRKVLSVLGM+ KR+GESKSDDEES       DRP+ ESWEKLRRVANGE
Sbjct: 421 DRKSFKKVHRWRKVLSVLGMIQKRNGESKSDDEESSVAGNVVDRPVVESWEKLRRVANGE 480

Query: 481 ANSCVSQKLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDN 540
           ANSCVSQKLIRSYSVSCRD SK+AGFNGGND KLN +RWRDDFTL+RNRSVRYSPNNFDN
Sbjct: 481 ANSCVSQKLIRSYSVSCRDPSKLAGFNGGNDSKLNVTRWRDDFTLQRNRSVRYSPNNFDN 540

Query: 541 GLLRFYLTPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           GLLRFYLTPLRS+SRGK GKSRPR+S FNVKHV+
Sbjct: 541 GLLRFYLTPLRSYSRGKLGKSRPRNSPFNVKHVI 574

BLAST of CmoCh20G003970 vs. ExPASy TrEMBL
Match: A0A0A0KBP2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G187980 PE=4 SV=1)

HSP 1 Score: 937.9 bits (2423), Expect = 1.9e-269
Identity = 484/575 (84.17%), Postives = 514/575 (89.39%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQ K+VSHRLS+CHRHP+KPVTGFCASCLRERLAGID DTQ ESP+Q + SS ELRRS
Sbjct: 1   MNLQLKSVSHRLSTCHRHPSKPVTGFCASCLRERLAGIDPDTQHESPVQNNHSSAELRRS 60

Query: 61  KSFSAAKRDA-SIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFEL 120
           KS SAAK +A  IG+ +VQHR+SCDVRSGNSLSDLFCREDKPRC N++VEIESENLGFEL
Sbjct: 61  KSHSAAKCEAGGIGQSEVQHRKSCDVRSGNSLSDLFCREDKPRCTNREVEIESENLGFEL 120

Query: 121 HEVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180
            EVV N RQFRAS G IGP LGTID F+GEEAEFKTVKEFIDLEFRRKKNAGRDLREIAG
Sbjct: 121 REVVGNGRQFRASEGIIGPGLGTIDGFSGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAG 180

Query: 181 SVWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGR 240
           SVWEAAS FSKKLGKWRKKQKRKNL NN  V AV  E IK R  EI ETRSEVG+YGLGR
Sbjct: 181 SVWEAASGFSKKLGKWRKKQKRKNLSNNSYVGAVKAEDIKPRAHEIRETRSEVGEYGLGR 240

Query: 241 RSCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTE 300
           RSCDTDPRFS D GRMSLDDSRYSFDE RASWDGYLIGKTYP+ITPMVSVLEEAKF GT 
Sbjct: 241 RSCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTG 300

Query: 301 FEKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFD 360
           FEKD+PSDEAEGSP NVGDKIPGGSAQTKDYYMDSLSS+RRRKSFDRS SHRKGAS DFD
Sbjct: 301 FEKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSMRRRKSFDRSCSHRKGASGDFD 360

Query: 361 DLKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGI 420
           +LKLISNAKVSPATTELFYGAKVL+TEKDLN SR K T DG+LSGTDVTSKDSV DA  I
Sbjct: 361 ELKLISNAKVSPATTELFYGAKVLITEKDLNSSRPKTTGDGDLSGTDVTSKDSVPDAPVI 420

Query: 421 DRKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEES------CDRPIAESWEKLRRVANGE 480
           DRKTFKKVHRWRKVLSVLGM+ KR+GESKSDDEES       DRP+ ESWEKLRRVANGE
Sbjct: 421 DRKTFKKVHRWRKVLSVLGMVQKRNGESKSDDEESSVGGNVVDRPVVESWEKLRRVANGE 480

Query: 481 ANSCVSQKLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDN 540
           ANSCVSQKLIRSYSVSCRD SK+AGFNGGND KLN +RWRDDFTL+RNRSVRYSPNNFDN
Sbjct: 481 ANSCVSQKLIRSYSVSCRDPSKLAGFNGGNDSKLNVTRWRDDFTLQRNRSVRYSPNNFDN 540

Query: 541 -GLLRFYLTPLRSHSRGKPGKSRPRSSSFNVKHVV 568
            GLLRFYLTPLRS++RGK GK+RPR+S FNVKHV+
Sbjct: 541 GGLLRFYLTPLRSYNRGKLGKNRPRNSPFNVKHVI 575

BLAST of CmoCh20G003970 vs. NCBI nr
Match: XP_022943770.1 (UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata])

HSP 1 Score: 1121.3 bits (2899), Expect = 0.0e+00
Identity = 567/567 (100.00%), Postives = 567/567 (100.00%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS
Sbjct: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH
Sbjct: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF
Sbjct: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
           EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD
Sbjct: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480
           RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ
Sbjct: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480

Query: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540
           KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL
Sbjct: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540

Query: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           TPLRSHSRGKPGKSRPRSSSFNVKHVV
Sbjct: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 567

BLAST of CmoCh20G003970 vs. NCBI nr
Match: KAG6570672.1 (Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia] >KAG7010518.1 UPF0503 protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1112.1 bits (2875), Expect = 0.0e+00
Identity = 562/567 (99.12%), Postives = 565/567 (99.65%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS
Sbjct: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KSFSAAKRDASIGR+QVQHR+SCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH
Sbjct: 61  KSFSAAKRDASIGRLQVQHRKSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFSAD GRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF
Sbjct: 241 SCDTDPRFSADAGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
           EKD+PSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD
Sbjct: 301 EKDDPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480
           RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ
Sbjct: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480

Query: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540
           KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL
Sbjct: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540

Query: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           TPLRSHSRGK GKSRPRSSSFNVKHVV
Sbjct: 541 TPLRSHSRGKSGKSRPRSSSFNVKHVV 567

BLAST of CmoCh20G003970 vs. NCBI nr
Match: XP_022986829.1 (UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima])

HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 554/567 (97.71%), Postives = 561/567 (98.94%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKH S LELRRS
Sbjct: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHHSPLELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KSFSAAKRDASIGRMQVQHR+SCDVRSGNSLSDLFCREDKPRC NQQVEIES+NLGFELH
Sbjct: 61  KSFSAAKRDASIGRMQVQHRKSCDVRSGNSLSDLFCREDKPRCTNQQVEIESKNLGFELH 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGR 
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRM 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFSAD GR+SLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF
Sbjct: 241 SCDTDPRFSADAGRISLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
            KD+PSDE+EGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD
Sbjct: 301 MKDDPSDESEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDG+LSGTDVTSKDSVLDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGDLSGTDVTSKDSVLDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480
           RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCD+PIAESWEKLRRVANGEANSCVSQ
Sbjct: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDQPIAESWEKLRRVANGEANSCVSQ 480

Query: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540
           KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL
Sbjct: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540

Query: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           TPLRSHSRGKPGKSRPRSSSFNVKHVV
Sbjct: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 567

BLAST of CmoCh20G003970 vs. NCBI nr
Match: XP_023512723.1 (UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1094.7 bits (2830), Expect = 0.0e+00
Identity = 550/567 (97.00%), Postives = 561/567 (98.94%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQ KTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKH SSLELRRS
Sbjct: 1   MNLQGKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHHSSLELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KSFSAAKRDASIGR+QVQHR+SCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH
Sbjct: 61  KSFSAAKRDASIGRLQVQHRKSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           E VANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EAVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGR 
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRM 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFSAD GR+SLDDSRYSFDEARASWDGYL+GKTYPKITPMVSVLEEAKFPGTEF
Sbjct: 241 SCDTDPRFSADAGRISLDDSRYSFDEARASWDGYLVGKTYPKITPMVSVLEEAKFPGTEF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
           EKD+PSDEAEGSPKNVGDK+PGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD
Sbjct: 301 EKDDPSDEAEGSPKNVGDKVPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTD TSKDSVLDAAGID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDATSKDSVLDAAGID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480
           RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ
Sbjct: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEESCDRPIAESWEKLRRVANGEANSCVSQ 480

Query: 481 KLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540
           KLIRSYSVSC+D+SKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL
Sbjct: 481 KLIRSYSVSCQDQSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNGLLRFYL 540

Query: 541 TPLRSHSRGKPGKSRPRSSSFNVKHVV 568
           TPLRS+SRGKPGKSRPRSSSF+VKHV+
Sbjct: 541 TPLRSYSRGKPGKSRPRSSSFSVKHVM 567

BLAST of CmoCh20G003970 vs. NCBI nr
Match: XP_038901013.1 (protein OCTOPUS [Benincasa hispida])

HSP 1 Score: 968.4 bits (2502), Expect = 2.7e-278
Identity = 500/574 (87.11%), Postives = 523/574 (91.11%), Query Frame = 0

Query: 1   MNLQAKTVSHRLSSCHRHPTKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRS 60
           MNLQ KT+SHRLS+CHRHP+KPVTGFCASCLRERLAGID DTQQESP+  + SS ELRRS
Sbjct: 1   MNLQLKTLSHRLSTCHRHPSKPVTGFCASCLRERLAGIDTDTQQESPVPNNHSSSELRRS 60

Query: 61  KSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFELH 120
           KS+SAAKR+A I + +VQHR+SCDVRSGNSLSDLFCREDKPRC  ++VEIESENLG EL 
Sbjct: 61  KSYSAAKREAGIEQSEVQHRKSCDVRSGNSLSDLFCREDKPRCTIREVEIESENLGSELR 120

Query: 121 EVVANERQFRASGGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180
           EVVANER FRAS G IGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS
Sbjct: 121 EVVANERLFRASEGIIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRKKNAGRDLREIAGS 180

Query: 181 VWEAASVFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRR 240
           VWEAASVFSKKLGKWRKKQKRKNL NN NV  V  E IK RVLEI ETRSEVGDYGLGRR
Sbjct: 181 VWEAASVFSKKLGKWRKKQKRKNLSNNGNVGTVKAEDIKPRVLEIRETRSEVGDYGLGRR 240

Query: 241 SCDTDPRFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEF 300
           SCDTDPRFS D GRMSLDDSRYSFDE RASWDGYLIGKTYP+ITPMVSVLEEAKF GT F
Sbjct: 241 SCDTDPRFSVDAGRMSLDDSRYSFDEPRASWDGYLIGKTYPRITPMVSVLEEAKFSGTGF 300

Query: 301 EKDNPSDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDD 360
           EKD+PSDEAEGSP NVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRS SHRKGAS DFDD
Sbjct: 301 EKDDPSDEAEGSPMNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSCSHRKGASGDFDD 360

Query: 361 LKLISNAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGID 420
           LKLISNAKVSPATTELFYGAKVL+TEKDLN S SKATR+G+LSGTDVTSKDSV DAA ID
Sbjct: 361 LKLISNAKVSPATTELFYGAKVLITEKDLNNSHSKATREGDLSGTDVTSKDSVPDAAVID 420

Query: 421 RKTFKKVHRWRKVLSVLGMMHKRSGESKSDDEES------CDRPIAESWEKLRRVANGEA 480
           RKTFKKVHRWRKVLSVLGM+ KRSGESKSDDEES       DRPIAESWEKLRRVANGEA
Sbjct: 421 RKTFKKVHRWRKVLSVLGMIQKRSGESKSDDEESSVGGNVVDRPIAESWEKLRRVANGEA 480

Query: 481 NSCVSQKLIRSYSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNRSVRYSPNNFDNG 540
           NSCVSQKLIRSYSVSCRD SK+AGFNG ND KLN SRWRDDFTL+RNRSVRYSPNNFDNG
Sbjct: 481 NSCVSQKLIRSYSVSCRDPSKLAGFNGSNDSKLNVSRWRDDFTLQRNRSVRYSPNNFDNG 540

Query: 541 LLRFYLTPLRSH-SRGKPGKSRPRSSSFNVKHVV 568
           LLRFYLTPLRS+ SRGKPGKSRPR+S FNVKHV+
Sbjct: 541 LLRFYLTPLRSYSSRGKPGKSRPRNSPFNVKHVM 574

BLAST of CmoCh20G003970 vs. TAIR 10
Match: AT5G58930.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 258.5 bits (659), Expect = 1.3e-68
Identity = 214/593 (36.09%), Postives = 296/593 (49.92%), Query Frame = 0

Query: 15  CHRHP-TKPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRSKSFSAAKRDASIG 74
           CHRHP +KP TGFCA+CLRERL+ I+  +   S      +S ELRR +S+S   RDAS  
Sbjct: 17  CHRHPSSKPTTGFCATCLRERLSTIEALSSSVS------ASTELRRVRSYSV--RDASAS 76

Query: 75  RMQVQHRRSCDVRSGNSLSDLFCREDKPRCRNQQVEIESENLGFEL-HEVVANERQFRAS 134
            +    RRSCDVRS +   D               E+   ++ F +  +++ +E +    
Sbjct: 77  VLDQPRRRSCDVRSNHDDDD-------------DDELLKSSIRFPIVPDLIEDEEEEDDE 136

Query: 135 GGTIGPALGTIDDFAGEEAEFKTVKEFIDLEFRRK--KNAGRDLREIAGSVWEAASVFSK 194
           G  +      +++   E+ E KT+KE IDLE R +  KN G+D            SVFS+
Sbjct: 137 GKKL------VEEEI-EDGEQKTMKELIDLESRNQQLKNNGKD------------SVFSR 196

Query: 195 KLGKWRKKQKRK--NLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRRSCDTDPRF 254
            L K+  K  RK  + GN+                             LGRRSCD DPR 
Sbjct: 197 TLRKFSLKHHRKIPDSGNS-----------------------------LGRRSCDVDPRL 256

Query: 255 SADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFPGTEFEKDNPSDE 314
           S D GR+       SFDE RASWDG LIGKTYPK+ P+ SV E+ K    +   +   ++
Sbjct: 257 SLDAGRV-------SFDEPRASWDGCLIGKTYPKLIPLSSVTEDVKASPEKITGEKVEED 316

Query: 315 AEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDDLKLISNAK 374
            + +        PGG+AQT+DYY+DS    RRR+SFDRSS H      + D+LK ISNAK
Sbjct: 317 EKNN--------PGGTAQTRDYYLDS----RRRRSFDRSSRH---GLLEVDELKAISNAK 376

Query: 375 VSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRK-----T 434
           VSP T  LF+GAK+L+TE++L  S   + ++ +    ++ SK     AAG  +K      
Sbjct: 377 VSPETVGLFHGAKLLVTERELRDSNWYSIKNYKPESLELGSKGVGCVAAGEVKKQDGFGL 436

Query: 435 FKKVHRWRKVLSVLGMMHKRSGESKSDDE---------ESCDRPIAESWEKLRRVANGEA 494
            K    W K  +  G++ +++  +K++ +          + +  +AES  KLRRVA GE 
Sbjct: 437 KKSGKNWSKGWNFWGLIQRKTDVAKNEMKTEQSLKLGGNTMEGSLAESLLKLRRVAKGET 496

Query: 495 NSCVSQKLIRSYSVSC--------RDRSKVAGFNGG-----------------------N 554
           N  VS+KLIRSYSVS         R  S V GF GG                        
Sbjct: 497 NGDVSEKLIRSYSVSARKSCDGMLRGASIVNGFEGGRSSCDGLFHGSITGVETGRRSLCE 518

Query: 555 DLKLNGSRWRDDFTLRRNRSV-RYSPNNFDNGLLRFYLTPLRSHSRGKPGKSR 556
           D   +G   + +  L+ +  +  YSP+N  NG++RFYLTPL SH   K GKSR
Sbjct: 557 DGMFHGVEGKRNHLLQSDDKLGTYSPDNLRNGMVRFYLTPLNSHMTSKSGKSR 518

BLAST of CmoCh20G003970 vs. TAIR 10
Match: AT3G46990.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 246.9 bits (629), Expect = 3.9e-65
Identity = 215/596 (36.07%), Postives = 303/596 (50.84%), Query Frame = 0

Query: 13  SSCHRHPT-KPVTGFCASCLRERLAGIDLDTQQESPLQKHLSSLELRRSKSFSAAKRDAS 72
           SSCHRHP+ KP +GFCASCLRERL  I+  +   + +Q    + ELRR +S+S   R+AS
Sbjct: 14  SSCHRHPSAKPTSGFCASCLRERLVTIEAQSSSLAAVQ----TPELRRIRSYSV--RNAS 73

Query: 73  IGRMQVQHRRSCDVR-SGNSLSDLFCREDKPRCRNQQVEIESENLGFELHEVVANERQFR 132
           +       RRSCDVR S +SL DLF  +D+ R  +   +    +L  E  E    E  + 
Sbjct: 74  VSVSDQPRRRSCDVRSSASSLLDLFVDDDEERVDSSIRKPLVPDLKEEEEEEEEEEDYYD 133

Query: 133 ASGGTIGPALGTIDDFAGEE--AEFKTVKEFIDLEFRR--KKNAGRDLREIAGSVWEAAS 192
                 G  +   D+    +   E KT+KEFIDL++R   KKN G+DL+EI       AS
Sbjct: 134 ------GEDIKGFDEGKPRKIVEENKTMKEFIDLDWRNQIKKNNGKDLKEI-------AS 193

Query: 193 VFSKKLGKWRKKQKRKNLGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRRSCDTDP 252
           V S++L              N  ++    E   SR   I            GR S D DP
Sbjct: 194 VLSRRL-------------KNFTLNKRNDEKSDSRFAGIVN----------GRHSSDVDP 253

Query: 253 RFSADTGRMSLDDSRYSFDEARASWDGYLIGKTYPKITPMVSVLEEAKFP-GTEFEKDNP 312
           R S D GR+       SF++ R+SWDG LI K+Y K+T + +V E+AK   G E      
Sbjct: 254 RLSFDGGRI-------SFEKPRSSWDGCLIEKSYHKLTTLSTVTEDAKAKCGVE------ 313

Query: 313 SDEAEGSPKNVGDKIPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDDLKLIS 372
             E E   K   +K PGG+ QTK+YY DS    RRR+SFDRS S ++    + D+L+ IS
Sbjct: 314 --EEEVEEK---EKSPGGTVQTKNYYSDS----RRRRSFDRSVSIKRQGLLEVDELRGIS 373

Query: 373 NAKVSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFK 432
           NAKVSP T  LF+GAK+L+TEK+L  S   + ++ +    ++ SK  +  AAG + K   
Sbjct: 374 NAKVSPETVGLFHGAKLLVTEKELRDSNWYSIKNVKPESKELVSKGKICIAAGGEGKKQD 433

Query: 433 KVH------RWRKVLSVLGMMHKRSGESKSD---------DEESCDRPIAESWEKLRRVA 492
            V       +W K  ++ G++ +R  E+K++         +  + +  +AES  KLRRV 
Sbjct: 434 SVELKKPRKKWPKGWNIWGLI-QRKNEAKNEIKTEQILKLEGNAVEGSLAESLLKLRRVG 493

Query: 493 NGEANSCVSQKLIRSYSVSCR--------DRSKVAGFNGGN------------------- 552
            GE N  VS+KL++SYSVS R          + V+GF GG                    
Sbjct: 494 KGETNVGVSEKLLKSYSVSARKSCDGVRSGANIVSGFEGGRSSCDGLFHGSINSVEAGRN 544

Query: 553 --DLKLNGSRWRDD-FTLRRNRSV-RYSPNNFDNGLLRFYLTPLRSHSRGKPGKSR 556
             D  +NG   + +   L+RN +V   S  N +  + RFYL+P++SH   K GKSR
Sbjct: 554 SCDGLVNGIEGKQNHHLLQRNANVGTCSQENLEKSMFRFYLSPVKSHKTSKSGKSR 544

BLAST of CmoCh20G003970 vs. TAIR 10
Match: AT2G38070.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 241.5 bits (615), Expect = 1.6e-63
Identity = 217/618 (35.11%), Postives = 304/618 (49.19%), Query Frame = 0

Query: 10  HRLS-SCHRHPTKPVTGFCASCLRERLAGIDLDTQQE----SPLQKHLSSL--------- 69
           HR S SC RHP +  TGFC SCL +RL+ +D+  +      S  +K  SS          
Sbjct: 32  HRPSTSCDRHPDERFTGFCPSCLFDRLSVLDITGKNNNAVASSSKKPPSSSAALKAIFKP 91

Query: 70  ---------ELRRSKSFSAAKRDA-SIGRMQVQHRRSCDVRSGNSLSDLFCREDKPRCRN 129
                    ELRR+KSFSA+K +A S+G  + Q RRSCDVR  N+L  LF  + +   + 
Sbjct: 92  SSSSGSFFPELRRTKSFSASKAEAFSLGAFEPQ-RRSCDVRVRNTLWSLFHEDAEHNSQT 151

Query: 130 QQ------VEIESENLG-----------FELHEVVANERQFRASGGTIGPALGTIDDFAG 189
           ++       EI+ E +             E+     NE+  +    T       ID+   
Sbjct: 152 KEGLSVNCSEIDLERINSIVKSPVFEEETEIESEQDNEKDIKFE--TFKEPRSVIDEIVE 211

Query: 190 EEAEFKTVK-EFIDLEFRRK---KNAGRDLREIAGSVWEAASVFSKKLGKWRKKQK-RKN 249
           EE E +T K E   +EF  +   K   RD +EIAGS W AASVFSKKL KWR+KQK +K+
Sbjct: 212 EEEEEETKKVEDFTMEFNPQTTAKKTNRDFKEIAGSFWSAASVFSKKLQKWRQKQKLKKH 271

Query: 250 LGNNCNVDAVTTEAIKSRVLEIGETRSEVGDYGLGRRSCDTDPRFSADTGRMSL------ 309
              N    +      K+   ++ +T+SE+ +YG GRRSCDTDPRFS D GR SL      
Sbjct: 272 RTGNLGAGSSALPVEKAIGRQLRDTQSEIAEYGYGRRSCDTDPRFSIDAGRFSLDAGRVS 331

Query: 310 -DDSRYSFDEARASWDGYLIGKTYP--KITPMVSVLEEAKFPGTEFEKDN--PSDEAEGS 369
            DD RYSF+E RASWDGYLIG+     ++  M+SV+E++         D   P +++   
Sbjct: 332 VDDPRYSFEEPRASWDGYLIGRAAAPMRMPSMLSVVEDSPVRNHVHRSDTHIPVEKSPQV 391

Query: 370 PKNVGDKI-PGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRK---GASADFDDLKLISNAK 429
            + V D+I PGGSAQT++YY+DS SS RRRKS DRSSS RK      A+ D+LKL  + +
Sbjct: 392 SEAVIDEIVPGGSAQTREYYLDS-SSSRRRKSLDRSSSTRKLSASVMAEIDELKLTQDRE 451

Query: 430 VSPATTELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFKKVH 489
                             KDL  S S + RD +    +   +  V +  G      K+  
Sbjct: 452 A-----------------KDL-VSHSNSLRD-DCCSVENNYEMGVRENVGTIECNKKRTK 511

Query: 490 RWRKVLSVLGMMHKRSGESKSDDE--ESCDRPIAESWEKLRRVANGEANSCVSQKLIRS- 549
           + R   ++ G++H+++G    ++E     DR  + SW       N E  +    K+IRS 
Sbjct: 512 KSRWSWNIFGLLHRKNGNKYEEEERRSGVDRTFSGSW-------NVEPRNGFDPKMIRSN 571

Query: 550 YSVSCRDRSKVAGFNGGNDLKLNGSRWRDDFTLRRNR-----SVRYSPNNFDNGLLRFYL 559
            SVS R     +G  GG               L+RN      S +   +  +NG+L+FYL
Sbjct: 572 SSVSWRS----SGTTGGG--------------LQRNSVDGYISGKKKVSKAENGMLKFYL 601

BLAST of CmoCh20G003970 vs. TAIR 10
Match: AT3G09070.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 222.2 bits (565), Expect = 1.0e-57
Identity = 222/668 (33.23%), Postives = 305/668 (45.66%), Query Frame = 0

Query: 10  HRLS-SCHRHPTKPVTGFCASCLRERLAGID------LDTQQESPLQKHLSSL------- 69
           HRLS SC+RHP +  TGFC SCL ERL+ +D        +  + P     ++L       
Sbjct: 25  HRLSTSCNRHPEERFTGFCPSCLCERLSVLDQTNNGGSSSSSKKPPTISAAALKALFKPS 84

Query: 70  ----------------------ELRRSKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSD 129
                                 ELRR+KSFSA+K +     +    RRSCDVR  +SL +
Sbjct: 85  GNNGVGGVNTNGNGRVKPGFFPELRRTKSFSASKNNEGFSGVFEPQRRSCDVRLRSSLWN 144

Query: 130 LFCRED---------------KPRCR---------NQQVEIESENLGFELHE----VVAN 189
           LF +++               +PR           N + E ES++   E  E    V A 
Sbjct: 145 LFSQDEQRNLPSNVTGGEIDVEPRKSSVAEPVLEVNDEGEAESDDEELEEEEEEDYVEAG 204

Query: 190 ERQ-FRASGGTIGPALGTIDDFAGE------------EAEFKTVKEFIDLEFRRKKNAGR 249
           + +    SG  +      I +   E            E E K +K++IDL+ + KK +  
Sbjct: 205 DFEILNDSGELMREKSDEIVEVREEIEEAVKPTKGLSEEELKPIKDYIDLDSQTKKPS-- 264

Query: 250 DLREIAGSVWEAASVFSKKLGKWRKKQKRKNL--GNNCNVDAVTTEAIKSRVLEIGETRS 309
               +  S W AASVFSKKL KWR+ QK K    G +    +      K    ++ +T+S
Sbjct: 265 ----VRRSFWSAASVFSKKLQKWRQNQKMKKRRNGGDHRPGSARLPVEKPIGRQLRDTQS 324

Query: 310 EVGDYGLGRRSCDTDP--------------RFSADTGRMSLDDSRYSFDEARASWDGYLI 369
           E+ DYG GRRSCDTDP              RFS D GR+SLDD RYSFDE RASWDG LI
Sbjct: 325 EIADYGYGRRSCDTDPRFSLDAGRFSLDAGRFSVDIGRISLDDPRYSFDEPRASWDGSLI 384

Query: 370 GKTY-------PKITPMVSVLEEAKFP----GTEFEKDNPSDEAEGSPKNVGDK------ 429
           G+T        P    M+SV+E+A  P     T  +   P +E    P  V         
Sbjct: 385 GRTMFPPAARAPPPPSMLSVVEDAPPPVHRHVTRADMQFPVEEPAPPPPVVNQTNGVSDP 444

Query: 430 --IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSH-RKGAS---ADFDDLKLISNAKVSPAT 489
             IPGGS QT+DYY D  SS RRRKS DRSSS  RK A+   AD D+ KL     VS A 
Sbjct: 445 VIIPGGSIQTRDYYTD--SSSRRRKSLDRSSSSMRKTAAAVVADMDEPKL----SVSSAI 504

Query: 490 TELFYGAKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFKKVHRWRKV 549
           +   Y   +    +D N    +   +G      +   D  +++        KK  RW K 
Sbjct: 505 SIDAYSGSL----RDNNNYAVETADNGSFREPAMMIGDRKVNS----NDNNKKSRRWGK- 564

BLAST of CmoCh20G003970 vs. TAIR 10
Match: AT5G01170.1 (Protein of unknown function (DUF740) )

HSP 1 Score: 194.5 bits (493), Expect = 2.3e-49
Identity = 193/593 (32.55%), Postives = 272/593 (45.87%), Query Frame = 0

Query: 10  HRLS-SCHRHPTKPVTGFCASCLRERLAGIDLD------TQQESPLQKHLSSL------- 69
           HRLS SC  HP +  +GFC SCL +RL+ +D +      +    P      SL       
Sbjct: 23  HRLSTSCDLHPEERFSGFCPSCLCDRLSVLDHNAAPPPSSSSRKPPSISAVSLKALFKPS 82

Query: 70  -------------------ELRRSKSFSAAKRDASIGRMQVQHRRSCDVRSGNSLSDLFC 129
                              ELRR+KSFSA   +   G  + Q RRSCDVR  +   +L  
Sbjct: 83  SSGTNNSNGNGRVRPGFFPELRRTKSFSAKNNEGFSGGFEPQ-RRSCDVRLRDDERNLPI 142

Query: 130 REDKPRCRNQQVEIESENLGFELHEVVANERQFRASGGTIGPALGTI-----DDFAGEEA 189
            E     + ++   ES      L      E +     G   P  G I      +   EE 
Sbjct: 143 NEAASVDKIEEEARESSVSEIVLEVTEEAEIEEDEENGEKDP--GEIVEEKSSEIGEEEE 202

Query: 190 EFKTVKEFIDLEFRRKKNAGRDLREIAGSVWEAASVFSKKLGKWRKKQKRKNLGNNCNVD 249
           E K +K+++DL  + KK +   +++ AGS + AASVFSKKL KW++KQK K   N     
Sbjct: 203 ELKPMKDYMDLYSQTKKPS---VKDFAGSFFSAASVFSKKLQKWKQKQKVKKPRNGVGG- 262

Query: 250 AVTTEAIKSRVLEIGETRSEVGDYGLGRRSCDTDP-------RFSADTGRMSLDDSRYSF 309
                         G  +SE+   G+GRRS DTDP       RFS D GR+S+DDSRYS 
Sbjct: 263 --------------GRPQSEI---GVGRRSSDTDPRFSLDAGRFSVDIGRISMDDSRYSL 322

Query: 310 DEARASWDGYLIGKTYPKITP----MVSVLEEAKFPGTEFE-KDNPSDEAEGSPKNVGDK 369
           DE RASWDG+LIG+T     P    M+SV+E A    ++ +   +PS +      +    
Sbjct: 323 DEPRASWDGHLIGRTTAARVPLPPSMLSVVENAPLNRSDMQIPSSPSIKPISGDSDPIII 382

Query: 370 IPGGSAQTKDYYMDSLSSLRRRKSFDRSSSHRKGASADFDDLKLISNAKVSPATTELFYG 429
           IPGGS QT+DYY    SS RRRKS DRS+S RK    + +D+K +SN+  +         
Sbjct: 383 IPGGSNQTRDYYTGPPSS-RRRKSLDRSNSIRK-IVTELEDVKSVSNSTTT--------- 442

Query: 430 AKVLLTEKDLNKSRSKATRDGELSGTDVTSKDSVLDAAGIDRKTFKKVHRWRKVLSVLGM 489
               +    +  + +K  ++G+                       KK  RW K  S+LG 
Sbjct: 443 ----IDSNSMETAENKGNQNGD-----------------------KKSRRWGK-WSILGF 502

Query: 490 MHKRSGESKSDDEES-------CDRPIAESWEKLRRVANGEANSCVSQKLIRSYS-VSCR 545
           ++++  + + +D  S        +R ++ESW ++R   NGE       K+ RS S VS R
Sbjct: 503 IYRKGKDDEEEDRYSRSNSAGMVERSLSESWPEMR---NGEGG---GPKMRRSNSNVSWR 525

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9SS801.4e-5633.23Protein OCTOPUS OS=Arabidopsis thaliana OX=3702 GN=OPS PE=1 SV=1[more]
Q9LFB93.2e-4832.55Protein OCTOPUS-like OS=Arabidopsis thaliana OX=3702 GN=OPSL1 PE=2 SV=1[more]
Match NameE-valueIdentityDescription
A0A6J1FY880.0e+00100.00UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita moschata OX=3662 GN=L... [more]
A0A6J1JH660.0e+0097.71UPF0503 protein At3g09070, chloroplastic-like OS=Cucurbita maxima OX=3661 GN=LOC... [more]
A0A5A7V3J12.0e-27485.19UPF0503 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold571G0045... [more]
A0A1S3CKL02.0e-27485.19UPF0503 protein At3g09070, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103502039... [more]
A0A0A0KBP21.9e-26984.17Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G187980 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
XP_022943770.10.0e+00100.00UPF0503 protein At3g09070, chloroplastic-like [Cucurbita moschata][more]
KAG6570672.10.0e+0099.12Protein OCTOPUS, partial [Cucurbita argyrosperma subsp. sororia] >KAG7010518.1 U... [more]
XP_022986829.10.0e+0097.71UPF0503 protein At3g09070, chloroplastic-like [Cucurbita maxima][more]
XP_023512723.10.0e+0097.00UPF0503 protein At3g09070, chloroplastic-like [Cucurbita pepo subsp. pepo][more]
XP_038901013.12.7e-27887.11protein OCTOPUS [Benincasa hispida][more]
Match NameE-valueIdentityDescription
AT5G58930.11.3e-6836.09Protein of unknown function (DUF740) [more]
AT3G46990.13.9e-6536.07Protein of unknown function (DUF740) [more]
AT2G38070.11.6e-6335.11Protein of unknown function (DUF740) [more]
AT3G09070.11.0e-5733.23Protein of unknown function (DUF740) [more]
AT5G01170.12.3e-4932.55Protein of unknown function (DUF740) [more]
InterPro
Analysis Name: InterPro Annotations of Cucurbita moschata (Rifu) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR008004Protein OCTOPUS-likePFAMPF05340DUF740coord: 51..111
e-value: 1.5E-8
score: 33.8
coord: 7..45
e-value: 3.5E-10
score: 39.2
coord: 148..547
e-value: 7.2E-120
score: 401.5
IPR008004Protein OCTOPUS-likePANTHERPTHR31659PROTEIN: UPF0503-LIKE PROTEIN, PUTATIVE (DUF740)-RELATEDcoord: 3..566
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 297..314
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 545..567
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 297..328
NoneNo IPR availablePANTHERPTHR31659:SF0EMB|CAB61945.1coord: 3..566

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmoCh20G003970.1CmoCh20G003970.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005886 plasma membrane