CmaCh16G002970 (gene) Cucurbita maxima (Rimu)

NameCmaCh16G002970
Typegene
OrganismCucurbita maxima (Cucurbita maxima (Rimu))
DescriptionCoffea canephora DH200=94 genomic scaffold, scaffold_6
LocationCma_Chr16 : 1393626 .. 1395471 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GGGACCGACCTCACAGTAGCTCTGCTGTATCCGAAAACAGCCAGCTATGGCTTCCAGCCTGTTTCATCTTCACATCCTCAATACTCCCTTATTATCTTCAATCCCCTTCTTCTCTCTTCCCATTCACTTCTCGCCGGAGAACTCTCTCTCTTCACTCTAGCTTCTCCTCTTCTCCCGGCGTTCTCCGCCGGCACTGCCTAAATCGACTTCCTCATGGATCCCTGCCCTTTTCTTCGGGTTCTAGTCGGAAACTTAGCCCTCAAGTTTCCGGTCGCCGCCAAACCGTCTTTCTCCGGCGTGCATCCATCGAGTTCTCCGTGCTTTTGTAAAATTAAACTGAGCGATTTTCCGACGCAGTTCGCTACTGTTCCTCTCGTCGTCGACGGCGAAACTTCCGGCGCGAATAGTTCTTCTTCTGTACTTGCGGCCTGCTTCAGCCTCAATAAATCTCAGATTGAGAAGCTTGTTTCTAAGCGGAAAGATTTGTTAGTGAAGATTGAAGTCTACACCGGCCGCCTTGGTCCGGTTACTTGCGGCGGCGACCTCTTCGGAAGCTCCGCTAAGTTACTCGGCCGAATCGTTGTGCCGGTGACTGGTTCGAGTTTGTCAGAAACTAAACCGTGCGTGTTCCACAACGGATGGACCGGAATCCGTGGAGGCACAAAAGGTTACTCATCGGCTCAATTGCACTTGACGGTTCGCGCCGAGCAGGACCCGAGGTTCGTGTTTCGGTTCGACGGTGAACCGGAGTGCAGTCCTCAGGTTTTTCAGGTGCAAGGAAGTGTACAGCAACCGGTTTTTACTTGCAAATTCGGTTTTAGAAACGAACGTGATTGGGATCGTTCAAGGTGCGTTCACGAATCTTTCGTGATCTTCTGTTTGTTTCCCGAGAAAATGCAAGAAACGTGCGAAGAAAATTGACGAACAGAGAGAGAAAATTAATTCGTTCGTTTTTCTCTTACTGAATCTCGATTCGCTAAATCTTAATTGCAGGTCGTCAATTACTGAGCAAAGTAGCACCTCGAGGAGTTGGTTACCGAAGATCGGATCCGAGAGGGACCAATCGGCGAAAGAACGAAAAGGATGGTCCATAACGATCCACGATCTTTCCGGATCGCCGGTCGCCGCCGCGTCCATGGTGACACCATTCGTACCGTCGCCAGGTTCGCACCGTGTAAGCCGTTCAAATCCCGGCGCCTGGCTAATTCTCCGCCCGGTCGACGGTAGCTGGCACCCGTGGGGCCGCCTCGAGGCATGGCGGGAGAGCGGCGGCTCAGATTCAATCGGCTACCGATTCGAGCTCCTCCCGGCGACCTCCGCCGCCGCTACGTTAGCGAACTCCACCATAAGCTCGAGCAGCGGCGGGAAGTTCACGATCGATAAGACCGGCAGCGCGTCGCCAGTGATCAGCCCTAACGAAAGCTTCGACCTCGGGTCTGGATCAGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGACGGCGTATCAGTACAAAGGATTTGTGATGTCGACGAAGGTGGAAGGGATGAAGAAGAAGAATAGGAGGGCAGAGGTGGAAGTGGCGGTACAGCACGTGACTTGCACGGAGGACGCAGCGGTGTTTGTGGCGTTGGCGGCGGCGGTGGACCTGAGTATGGACGCCTGCAGGCTGTTCTCTCAGAAGCTAAGGAAGGAGCTGAGGCAATGAAAGGGCTTATTGTCGTTTCGCGTGAATGAGCTTGTCGTTTTGGAATTTCAGACATTTTTGAATTTTTATATATTCATTTTATTTTTCAATAGTTGAGTTGTATTTCTTGTACAGAGGGAGAGATCAATAGATGCACGATTCTTTTTCCACTTTTTG

mRNA sequence

GGGACCGACCTCACAGTAGCTCTGCTGTATCCGAAAACAGCCAGCTATGGCTTCCAGCCTGTTTCATCTTCACATCCTCAATACTCCCTTATTATCTTCAATCCCCTTCTTCTCTCTTCCCATTCACTTCTCGCCGGAGAACTCTCTCTCTTCACTCTAGCTTCTCCTCTTCTCCCGGCGTTCTCCGCCGGCACTGCCTAAATCGACTTCCTCATGGATCCCTGCCCTTTTCTTCGGGTTCTAGTCGGAAACTTAGCCCTCAAGTTTCCGGTCGCCGCCAAACCGTCTTTCTCCGGCGTGCATCCATCGAGTTCTCCGTGCTTTTGTAAAATTAAACTGAGCGATTTTCCGACGCAGTTCGCTACTGTTCCTCTCGTCGTCGACGGCGAAACTTCCGGCGCGAATAGTTCTTCTTCTGTACTTGCGGCCTGCTTCAGCCTCAATAAATCTCAGATTGAGAAGCTTGTTTCTAAGCGGAAAGATTTGTTAGTGAAGATTGAAGTCTACACCGGCCGCCTTGGTCCGGTTACTTGCGGCGGCGACCTCTTCGGAAGCTCCGCTAAGTTACTCGGCCGAATCGTTGTGCCGGTGACTGGTTCGAGTTTGTCAGAAACTAAACCGTGCGTGTTCCACAACGGATGGACCGGAATCCGTGGAGGCACAAAAGGTTACTCATCGGCTCAATTGCACTTGACGGTTCGCGCCGAGCAGGACCCGAGGTTCGTGTTTCGGTTCGACGGTGAACCGGAGTGCAGTCCTCAGGTTTTTCAGGTGCAAGGAAGTGTACAGCAACCGGTTTTTACTTGCAAATTCGGTTTTAGAAACGAACGTGATTGGGATCGTTCAAGGTCGTCAATTACTGAGCAAAGTAGCACCTCGAGGAGTTGGTTACCGAAGATCGGATCCGAGAGGGACCAATCGGCGAAAGAACGAAAAGGATGGTCCATAACGATCCACGATCTTTCCGGATCGCCGGTCGCCGCCGCGTCCATGGTGACACCATTCGTACCGTCGCCAGGTTCGCACCGTGTAAGCCGTTCAAATCCCGGCGCCTGGCTAATTCTCCGCCCGGTCGACGGTAGCTGGCACCCGTGGGGCCGCCTCGAGGCATGGCGGGAGAGCGGCGGCTCAGATTCAATCGGCTACCGATTCGAGCTCCTCCCGGCGACCTCCGCCGCCGCTACGTTAGCGAACTCCACCATAAGCTCGAGCAGCGGCGGGAAGTTCACGATCGATAAGACCGGCAGCGCGTCGCCAGTGATCAGCCCTAACGAAAGCTTCGACCTCGGGTCTGGATCAGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGACGGCGTATCAGTACAAAGGATTTGTGATGTCGACGAAGGTGGAAGGGATGAAGAAGAAGAATAGGAGGGCAGAGGTGGAAGTGGCGGTACAGCACGTGACTTGCACGGAGGACGCAGCGGTGTTTGTGGCGTTGGCGGCGGCGGTGGACCTGAGTATGGACGCCTGCAGGCTGTTCTCTCAGAAGCTAAGGAAGGAGCTGAGGCAATGAAAGGGCTTATTGTCGTTTCGCGTGAATGAGCTTGTCGTTTTGGAATTTCAGACATTTTTGAATTTTTATATATTCATTTTATTTTTCAATAGTTGAGTTGTATTTCTTGTACAGAGGGAGAGATCAATAGATGCACGATTCTTTTTCCACTTTTTG

Coding sequence (CDS)

ATGGATCCCTGCCCTTTTCTTCGGGTTCTAGTCGGAAACTTAGCCCTCAAGTTTCCGGTCGCCGCCAAACCGTCTTTCTCCGGCGTGCATCCATCGAGTTCTCCGTGCTTTTGTAAAATTAAACTGAGCGATTTTCCGACGCAGTTCGCTACTGTTCCTCTCGTCGTCGACGGCGAAACTTCCGGCGCGAATAGTTCTTCTTCTGTACTTGCGGCCTGCTTCAGCCTCAATAAATCTCAGATTGAGAAGCTTGTTTCTAAGCGGAAAGATTTGTTAGTGAAGATTGAAGTCTACACCGGCCGCCTTGGTCCGGTTACTTGCGGCGGCGACCTCTTCGGAAGCTCCGCTAAGTTACTCGGCCGAATCGTTGTGCCGGTGACTGGTTCGAGTTTGTCAGAAACTAAACCGTGCGTGTTCCACAACGGATGGACCGGAATCCGTGGAGGCACAAAAGGTTACTCATCGGCTCAATTGCACTTGACGGTTCGCGCCGAGCAGGACCCGAGGTTCGTGTTTCGGTTCGACGGTGAACCGGAGTGCAGTCCTCAGGTTTTTCAGGTGCAAGGAAGTGTACAGCAACCGGTTTTTACTTGCAAATTCGGTTTTAGAAACGAACGTGATTGGGATCGTTCAAGGTCGTCAATTACTGAGCAAAGTAGCACCTCGAGGAGTTGGTTACCGAAGATCGGATCCGAGAGGGACCAATCGGCGAAAGAACGAAAAGGATGGTCCATAACGATCCACGATCTTTCCGGATCGCCGGTCGCCGCCGCGTCCATGGTGACACCATTCGTACCGTCGCCAGGTTCGCACCGTGTAAGCCGTTCAAATCCCGGCGCCTGGCTAATTCTCCGCCCGGTCGACGGTAGCTGGCACCCGTGGGGCCGCCTCGAGGCATGGCGGGAGAGCGGCGGCTCAGATTCAATCGGCTACCGATTCGAGCTCCTCCCGGCGACCTCCGCCGCCGCTACGTTAGCGAACTCCACCATAAGCTCGAGCAGCGGCGGGAAGTTCACGATCGATAAGACCGGCAGCGCGTCGCCAGTGATCAGCCCTAACGAAAGCTTCGACCTCGGGTCTGGATCAGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGACGGCGTATCAGTACAAAGGATTTGTGATGTCGACGAAGGTGGAAGGGATGAAGAAGAAGAATAGGAGGGCAGAGGTGGAAGTGGCGGTACAGCACGTGACTTGCACGGAGGACGCAGCGGTGTTTGTGGCGTTGGCGGCGGCGGTGGACCTGAGTATGGACGCCTGCAGGCTGTTCTCTCAGAAGCTAAGGAAGGAGCTGAGGCAATGA

Protein sequence

MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGETSGANSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNESFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
BLAST of CmaCh16G002970 vs. TrEMBL
Match: A0A0A0KY09_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000030 PE=4 SV=1)

HSP 1 Score: 788.5 bits (2035), Expect = 4.3e-225
Identity = 403/459 (87.80%), Postives = 419/459 (91.29%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPFLR+LVGNLALKFPVAA+PSFS VHPS+SPC+CKIKL+DFPTQF T+PL+VDGET
Sbjct: 1   MDPCPFLRILVGNLALKFPVAARPSFSAVHPSTSPCYCKIKLNDFPTQFVTIPLLVDGET 60

Query: 61  SGA---------NSSSSV-------LAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGP 120
           SGA         +SSSSV       ++A FSLNKSQIEKLV KRKD  VKIEVYTGRLGP
Sbjct: 61  SGAATTSSTSSSSSSSSVSTQSHSSISASFSLNKSQIEKLV-KRKDPSVKIEVYTGRLGP 120

Query: 121 VTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRA 180
            +C GD+FGSSAKLLGRI VPVTGS LSETKPCVF NGWTGI  G KGYSSAQLHLTVR+
Sbjct: 121 ASCSGDVFGSSAKLLGRITVPVTGSGLSETKPCVFQNGWTGIGEGKKGYSSAQLHLTVRS 180

Query: 181 EQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRS 240
           E DPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS+S
Sbjct: 181 EPDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSKS 240

Query: 241 WLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300
           WLPKI SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
Sbjct: 241 WLPKIRSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300

Query: 301 RPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTG 360
           RPVDGSW PWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISS SGGKFTID TG
Sbjct: 301 RPVDGSWRPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSGSGGKFTIDMTG 360

Query: 361 SASPVISPNESFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAV 420
           SASP ISPN SFDLGSG+GSRPGSGDFGYLT YQYKGFVMST VEGMKKK+RR EVEVAV
Sbjct: 361 SASPAISPNGSFDLGSGTGSRPGSGDFGYLTGYQYKGFVMSTMVEGMKKKSRRPEVEVAV 420

Query: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
Sbjct: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 458

BLAST of CmaCh16G002970 vs. TrEMBL
Match: W9QZR1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003734 PE=4 SV=1)

HSP 1 Score: 513.5 bits (1321), Expect = 2.6e-142
Identity = 293/459 (63.83%), Postives = 347/459 (75.60%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSG-VHPSSSPCFCKIKLSDFPTQFATVPLVVDGE 60
           MDPCPF+R+L+G+LALK PVA+KPSFSG VHPS+SPCFCKIKL +FP QFA +PL  D  
Sbjct: 1   MDPCPFVRILIGDLALKLPVASKPSFSGTVHPSASPCFCKIKLKNFPHQFAAIPLNRD-- 60

Query: 61  TSGANSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKLL 120
              ANS S  LAACFSL+K+Q E L +K + L  KI+VYTGR G  TCG +    ++KLL
Sbjct: 61  ---ANSRS--LAACFSLDKAQFESLAAKPQCL--KIKVYTGRRGS-TCGLN----ASKLL 120

Query: 121 GRIVVPVTGSSLSETKPCVFHNGWTGI----RGGTKGYSSAQLHLTVRAEQDPRFVFRFD 180
           G++ VP+    ++E++P VF NGW  I       +   SS+QL L VRAE DPRFVF+FD
Sbjct: 121 GKVSVPLD-LRVAESRPYVFQNGWVSIGKKDNKESLNLSSSQLRLCVRAEPDPRFVFQFD 180

Query: 181 GEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSW-LPKIGSERD 240
           GEPECSPQVFQVQGSV+QPVFTCKF FR+  D     SS+TE + TSRSW +P +  +++
Sbjct: 181 GEPECSPQVFQVQGSVKQPVFTCKFDFRSSSDL--KNSSVTEPN-TSRSWFVPSLKIQKE 240

Query: 241 QS-AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHP 300
           Q   KERKGWS+TIHDLSGSPVA ASMVTPFV SPGS RVSRSNPGAWLILRP +G+W P
Sbjct: 241 QKYTKERKGWSVTIHDLSGSPVAVASMVTPFVASPGSDRVSRSNPGAWLILRPGEGTWKP 300

Query: 301 WGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGS--ASPVIS 360
           WGRLEAWRE GG+DS+GYRFELL   +  ATLA S +S+++GGKF+ID T S  +SP IS
Sbjct: 301 WGRLEAWRERGGTDSVGYRFELLGDDATPATLACSAVSAAAGGKFSIDVTSSIVSSPAIS 360

Query: 361 PNESFDLGSGSGSRPGS-------GDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAV 420
           P  S DLGSGSGSRPGS        DFG       +GFVMS+ VEG+ KK+ + EVEV V
Sbjct: 361 PQSSIDLGSGSGSRPGSRAGSGSGSDFG--VGLSNRGFVMSSTVEGVGKKS-KPEVEVGV 420

Query: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           QHVTC+EDAA FVALAAA+DLSMDACRLFSQKL KELRQ
Sbjct: 421 QHVTCSEDAAAFVALAAAMDLSMDACRLFSQKLPKELRQ 438

BLAST of CmaCh16G002970 vs. TrEMBL
Match: A0A067EE54_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012226mg PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 4.5e-142
Identity = 296/472 (62.71%), Postives = 348/472 (73.73%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFP-VAAKPSF-SGVHPSSSPCFCKIKLSDFPTQFATVPLVVDG 60
           MDPCPF+R+LVGNLALKFP V +KPSF S +HPSSS C+CKIKL  FP + ATVPLV D 
Sbjct: 1   MDPCPFVRILVGNLALKFPTVTSKPSFLSRIHPSSSSCYCKIKLKSFPDEIATVPLVQDE 60

Query: 61  ETSGANSSSSVLAACFSLNKSQIEKLVSKRKD-------LLVKIEVYTGRLGPVTCGGDL 120
            T    + S  LAACF+LNK+QI+K++ K K        + ++++VYTG  G ++C    
Sbjct: 61  TTPANGNLSHSLAACFNLNKAQIDKILEKSKSPKSNSGVISLRVDVYTGSNG-MSCV--- 120

Query: 121 FGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFV 180
             ++ KLLGR+ VP+     +E++P V HNGW GI    KG S AQL+LTV++E DPRFV
Sbjct: 121 --TTDKLLGRVSVPLDLRG-AESRPSVIHNGWAGIGENKKG-SQAQLYLTVKSEPDPRFV 180

Query: 181 FRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDR---SRSSITEQSSTSRSWLPK 240
           F+FDGEPECSPQVFQVQGSV+Q VFTCKFGFRN  + DR   SR+S+TE +ST RSWL  
Sbjct: 181 FQFDGEPECSPQVFQVQGSVKQAVFTCKFGFRNSNN-DRNLVSRTSMTE-NSTPRSWLSA 240

Query: 241 IGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVD 300
            GSE+DQS+KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS RVSRSNPGAWLILRP +
Sbjct: 241 FGSEKDQSSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSDRVSRSNPGAWLILRPGN 300

Query: 301 GSWHPWGRLEAWRESGGSDSIGYRFELLPAT----SAAATLANSTISSSSGGKFTIDKTG 360
            +W PWGRLEAWRE G SD +GYRF+LL  T    S++ T+AN+ ISS+ GGKFTID   
Sbjct: 301 CTWKPWGRLEAWREPGNSDLLGYRFDLLHDTISSNSSSTTVANANISSTKGGKFTIDMAS 360

Query: 361 SAS--PVISPNESFDLGSGS----GSRPGSG---DFGYLTA----YQYKGFVMSTKVEGM 420
           S S  PV SP  S D GSGS    GSRPGSG   DF +        Q +GFVMS  VEG 
Sbjct: 361 SVSTTPVHSPQSSCDFGSGSWSGPGSRPGSGSGSDFAFCCTGPPILQSRGFVMSATVEGG 420

Query: 421 KKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            K + + EVEV VQHVTCTEDAA FVALAAA+DLS+DAC LFS KLRKELRQ
Sbjct: 421 GKCS-KPEVEVGVQHVTCTEDAAAFVALAAAMDLSVDACTLFSHKLRKELRQ 461

BLAST of CmaCh16G002970 vs. TrEMBL
Match: V4S6F0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028366mg PE=4 SV=1)

HSP 1 Score: 511.5 bits (1316), Expect = 1.0e-141
Identity = 295/472 (62.50%), Postives = 348/472 (73.73%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFP-VAAKPSF-SGVHPSSSPCFCKIKLSDFPTQFATVPLVVDG 60
           MDPCPF+R+LVGNLALKFP V +KPSF S +HPSSS C+CKIKL  FP + ATVPLV D 
Sbjct: 1   MDPCPFVRILVGNLALKFPTVTSKPSFLSRIHPSSSSCYCKIKLKSFPDEIATVPLVQDE 60

Query: 61  ETSGANSSSSVLAACFSLNKSQIEKLVSKRKD-------LLVKIEVYTGRLGPVTCGGDL 120
            T    + S  LAACF+LNK+QI+K++ K K        + ++++VYTG  G ++C    
Sbjct: 61  TTPANGNLSHSLAACFNLNKAQIDKILEKSKSSKSNNGVISLRVDVYTGSNG-MSCV--- 120

Query: 121 FGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFV 180
             ++ KLLGR+ VP+     +E++P V HNGW GI    KG S AQL+LTV++E DPRFV
Sbjct: 121 --TTDKLLGRVSVPLDMRG-AESRPSVIHNGWAGIGENKKG-SQAQLYLTVKSEPDPRFV 180

Query: 181 FRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDR---SRSSITEQSSTSRSWLPK 240
           F+FDGEPECSPQVFQVQGSV+Q VFTCKFGFRN  + DR   SR+S+TE +ST RSWL  
Sbjct: 181 FQFDGEPECSPQVFQVQGSVKQAVFTCKFGFRNSNN-DRNLVSRTSMTE-NSTPRSWLSA 240

Query: 241 IGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVD 300
            GSE+DQS+KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS RVSRSNPGAWLILRP +
Sbjct: 241 FGSEKDQSSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSDRVSRSNPGAWLILRPGN 300

Query: 301 GSWHPWGRLEAWRESGGSDSIGYRFELLPAT----SAAATLANSTISSSSGGKFTIDKTG 360
            +W PWGRLEAWRE G SD +GYRF+LL  T    S++ T+AN+ ISS+ GGKFTID   
Sbjct: 301 CTWKPWGRLEAWREPGNSDLLGYRFDLLHDTISSNSSSTTVANANISSTKGGKFTIDMAS 360

Query: 361 SAS--PVISPNESFDLGSGS----GSRPGSG---DFGYLTA----YQYKGFVMSTKVEGM 420
           S S  PV SP  S D GSGS    GSRPGSG   DF +        Q +GFVMS  VEG 
Sbjct: 361 SVSTTPVHSPQSSCDFGSGSWSGPGSRPGSGSGSDFAFCCTGPPILQSRGFVMSATVEGG 420

Query: 421 KKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            K + + EVEV VQHVTCTEDAA FVALAAA+DLS+DAC LFS KLRKELR+
Sbjct: 421 GKCS-KPEVEVGVQHVTCTEDAAAFVALAAAMDLSVDACTLFSHKLRKELRR 461

BLAST of CmaCh16G002970 vs. TrEMBL
Match: A0A061DYV5_THECC (Gb:AAC34331.1 OS=Theobroma cacao GN=TCM_006326 PE=4 SV=1)

HSP 1 Score: 499.6 bits (1285), Expect = 3.9e-138
Identity = 282/466 (60.52%), Postives = 340/466 (72.96%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPF+R+LVGNLALKFPV+ KPS S +HPS+S C+CKIKL +FP Q AT+P +   E 
Sbjct: 1   MDPCPFVRILVGNLALKFPVSTKPSLSRIHPSTSSCYCKIKLKNFPHQVATIPFIQSQED 60

Query: 61  SGANSSSSV-----LAACFSLNKSQIEKLVSK-RKDLLVKIEVYTGRLGPVTCGGDLFGS 120
           S  +SSSS      LAACFSL+KSQI+++VS+      + IEVY    G  +CG     +
Sbjct: 61  SSTSSSSSSSFQKSLAACFSLSKSQIDRIVSRGSSSYKLSIEVYADPDGS-SCGL----T 120

Query: 121 SAKLLGRIVVPVTGSSLSETKPCVFHNGWTGI--RGGTKGYSSAQLHLTVRAEQDPRFVF 180
             KLLG++ VP+     +E++P V HNGW  I      K  SSAQL LTVR E DPRFVF
Sbjct: 121 YGKLLGKVSVPLDLRG-AESRPSVVHNGWIAIGRNRSNKNGSSAQLCLTVRTEPDPRFVF 180

Query: 181 RFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSE 240
           +F GEPECSPQVFQVQG ++Q VFTCKFGFRN  D +    S   +S+T+R+WLP + +E
Sbjct: 181 QFGGEPECSPQVFQVQGGLKQAVFTCKFGFRNTSDRNLGSRSSLPESNTTRNWLPSLKTE 240

Query: 241 RDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWH 300
           ++QS+KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS RVSRSNPGAWLILRP  G+W 
Sbjct: 241 KEQSSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSDRVSRSNPGAWLILRPGCGTWK 300

Query: 301 PWGRLEAWRESGGSDSIGYRFEL-----LPATSAAATLANSTISSSSGGKFTIDKTG--S 360
           PWGRLEAWRE G +D++GYRF+L     + ATS  ATLA+S +S+  GGKFT+D T   +
Sbjct: 301 PWGRLEAWREPGFTDALGYRFDLFHDDYIAATSTTATLASSILSTKLGGKFTMDMTTNVA 360

Query: 361 ASPVISPNESFDLGSGSGSRPGSG---DFGYLTAYQ----YK-GFVMSTKVEGMKKKNRR 420
           A+P  SP  S D   GSGSRPGSG   DFG+  +      Y+ GFVMS+ VEG  K + +
Sbjct: 361 ATPSTSPQSSCDF--GSGSRPGSGSGSDFGFAASISPQSLYRGGFVMSSTVEGAGKCS-K 420

Query: 421 AEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            EVEV VQHVTCTEDAAVFVALAAA+DLS+DACR FSQKLRKELRQ
Sbjct: 421 PEVEVGVQHVTCTEDAAVFVALAAAMDLSVDACRSFSQKLRKELRQ 457

BLAST of CmaCh16G002970 vs. TAIR10
Match: AT1G10020.1 (AT1G10020.1 Protein of unknown function (DUF1005))

HSP 1 Score: 460.3 bits (1183), Expect = 1.3e-129
Identity = 260/464 (56.03%), Postives = 323/464 (69.61%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPF+R+ +GNLALK P+AAK + S VHPSSSPCFCKIKL +FP Q A +P +    T
Sbjct: 1   MDPCPFIRLTIGNLALKVPLAAKTTSSVVHPSSSPCFCKIKLKNFPPQTAAIPYIPLETT 60

Query: 61  SGANSSSSVLAACFSLNKSQIEKLVSKR---KDLLVKIEVYTGRLGPVTCGGDLFGSSAK 120
                 +  LAA F L+ S I++L S+        +KI +YTGR G   CG      S +
Sbjct: 61  QFPEIQT--LAATFHLSSSDIQRLASRSIFTSKPCLKILIYTGRAG-AACGVH----SGR 120

Query: 121 LLGRIVVPVTGSSLSETKPCVFHNGWTGI-RGGTKGYSSAQLHLTVRAEQDPRFVFRFDG 180
           LL ++ VP+  S  +++KPCVFHNGW  + +G  K  SSAQ HL V+AE DPRFVF+FDG
Sbjct: 121 LLAKVSVPLDLSG-TQSKPCVFHNGWISVGKGAGKSSSSAQFHLNVKAEPDPRFVFQFDG 180

Query: 181 EPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQS 240
           EPECSPQV Q+QG+++QPVFTCKF  R+  D  +   S+  ++S SRSWL   GSER++ 
Sbjct: 181 EPECSPQVVQIQGNIRQPVFTCKFSCRHTGDRTQRSRSLPTETSVSRSWLNSFGSERERP 240

Query: 241 AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWGR 300
            KERKGWSIT+HDLSGSPVA AS+VTPFV SPG+ RVSRSNPG+WLILRP D +W PWGR
Sbjct: 241 GKERKGWSITVHDLSGSPVAMASIVTPFVASPGTDRVSRSNPGSWLILRPGDCTWRPWGR 300

Query: 301 LEAWRESGG-SDSIGYRFELLP--ATSAAATLANSTISSSSGGKFTID---KTGSASPVI 360
           LEAWRE GG +D +GYRFEL+P  ++ A   LA STISS  GGKF+I+      S+SP  
Sbjct: 301 LEAWRERGGATDGLGYRFELIPDGSSGAGIVLAESTISSHRGGKFSIELGSSPSSSSPTS 360

Query: 361 SPNES-----FDLGSGSGSRP------GSGDFGY-LTAYQ-YKGFVMSTKVEGMKKKNRR 420
             N S        GSG G+ P      GSGD+GY L  +  YKGFVMS  VEG  K ++ 
Sbjct: 361 VVNRSRSRRGGSSGSGGGASPANSPRGGSGDYGYGLWPWNVYKGFVMSASVEGEGKCSKP 420

Query: 421 AEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKEL 442
             VEV+VQHV+C EDAA +VAL+AA+DLSMDACRLF+Q++RKEL
Sbjct: 421 C-VEVSVQHVSCMEDAAAYVALSAAIDLSMDACRLFNQRMRKEL 455

BLAST of CmaCh16G002970 vs. TAIR10
Match: AT3G19680.1 (AT3G19680.1 Protein of unknown function (DUF1005))

HSP 1 Score: 449.1 bits (1154), Expect = 3.1e-126
Identity = 259/488 (53.07%), Postives = 332/488 (68.03%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAK-------PSFSGVHPSSSPCFCKIKLSDFPTQFATVP 60
           MDPC F+R++VGNLA++FP ++        PS SG++P++  C+CKI+  +FP +  +VP
Sbjct: 1   MDPCSFVRIIVGNLAVRFPSSSSSSSSSSGPSVSGINPTAPNCYCKIRFKNFPREIVSVP 60

Query: 61  LVV----DGETSGANSSS-SVLAACFSLNKSQIEKLVSKRKDLLVKIEVYT-------GR 120
           ++     + ET  ++S + S +AACFSL+K+QIE  + K K  ++ +E Y+         
Sbjct: 61  VMFRTESESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKFSVLSVEAYSRGNSDGDDG 120

Query: 121 LGPVTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGI---RGGTKGYSSAQL 180
           +   +CG  L  +  KLLGR  V +   S +ETK  + HNGW  +   +  +K  S  +L
Sbjct: 121 VSGASCG--LATAGEKLLGRFEVSLDLKS-AETKSFLAHNGWVALPSKKTKSKTGSDPEL 180

Query: 181 HLTVRAEQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRS---RSSI 240
           H++VR E DPRFVF+FDGEPECSPQVFQVQG+ +Q VFTCKFG RN    DR+    SS+
Sbjct: 181 HVSVRVEPDPRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSSSM 240

Query: 241 TEQSSTSRSWLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSR 300
             + S++RS +  + SE++Q +KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS+RV+R
Sbjct: 241 MSEISSTRSCISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRVTR 300

Query: 301 SNPGAWLILRPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSG 360
           S+PGAWLILRP   +W PWGRLEAWRE+G SD++GYRFEL     A A  A+S+IS  +G
Sbjct: 301 SSPGAWLILRPDGCTWKPWGRLEAWREAGYSDTLGYRFELFQDGIATAVSASSSISLKNG 360

Query: 361 GKFTIDKTGSAS-----PVISPNESFDLGSGS------GSRPGSG---DFGYL------T 420
           G F ID TG  S     P  SP  S+DLGSGS       SRPGSG   DFGYL       
Sbjct: 361 GSFVIDVTGGTSTTASTPTTSPQGSWDLGSGSSAGSRPASRPGSGSGSDFGYLLPQHPSA 420

Query: 421 AYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQ 444
           A Q +GFVMS  VEG+ K++ + EVEV V HVTCTEDAA  VALAAAVDLS+DACRLFS 
Sbjct: 421 AAQNRGFVMSATVEGVGKRS-KPEVEVGVTHVTCTEDAAAHVALAAAVDLSLDACRLFSH 480

BLAST of CmaCh16G002970 vs. TAIR10
Match: AT1G50040.1 (AT1G50040.1 Protein of unknown function (DUF1005))

HSP 1 Score: 412.1 bits (1058), Expect = 4.2e-115
Identity = 247/472 (52.33%), Postives = 308/472 (65.25%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFP----------VAAKPSFSGVHPSSSPCFCKIKLSDFPTQFA 60
           MDPC F+R++VGNLA++FP           ++ PS S V  SS  C+CKIK   FP Q  
Sbjct: 1   MDPCSFVRIIVGNLAVRFPRSPSSSSSSSSSSGPSVSDV--SSGNCYCKIKFKSFPRQIV 60

Query: 61  TVPLVV----DGETSGANSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVT 120
           +VP+++    + E+   + + S +AACFSL+KSQIE  + K K  ++ +EVY+ R    +
Sbjct: 61  SVPVLLRTESESESRCCSGNVSTVAACFSLSKSQIETSLKKAKWSVLSVEVYSRR--SAS 120

Query: 121 CGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFHNGW----TGIRGGTKGYSSAQLHLTV 180
           C G +  S  KL+GR  V +     +E+K C+ HNGW    T  +   K  S  +LH++V
Sbjct: 121 C-GFVAASGEKLIGRFQVTL-DLKAAESKTCLAHNGWVDLGTKSKNNKKSGSDPELHVSV 180

Query: 181 RAEQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS 240
           R E D RFVF+FDGEPECSPQVFQVQG+ +Q VFTCKFGFRN  D + S S         
Sbjct: 181 RVEPDTRFVFQFDGEPECSPQVFQVQGNAKQAVFTCKFGFRNSGDRNLSLS--------- 240

Query: 241 RSWLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWL 300
              L  + S ++Q +KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS+RVSRS+PGAWL
Sbjct: 241 ---LSSVTSGKEQFSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSNRVSRSSPGAWL 300

Query: 301 ILRPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDK 360
           ILRP   +W PW RL+AWRE G SD +GYRFEL     A A  A+S+IS+  GG F ID 
Sbjct: 301 ILRPDGYTWKPWVRLQAWREPGVSDVLGYRFELYKDGIAVAVSASSSISTKLGGSFIIDG 360

Query: 361 TGSASPVI---SPNESFDLGSGSGSR-----PGSGD---FGYLTAYQYKGFVMSTKVEGM 420
           + S +      S   SFDL S S  R      GSG    F    A Q  GFVMST+V+G+
Sbjct: 361 STSTTTTASWSSSEGSFDLSSWSSIRSSRTDSGSGSDFRFSLSQAQQNLGFVMSTRVQGV 420

Query: 421 KKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           +K++ + +VEV V+HVTCTEDAA  VALAAAVDLSMDACRLFSQKLR ELRQ
Sbjct: 421 EKQS-KPKVEVGVKHVTCTEDAAAHVALAAAVDLSMDACRLFSQKLRNELRQ 453

BLAST of CmaCh16G002970 vs. TAIR10
Match: AT4G29310.1 (AT4G29310.1 Protein of unknown function (DUF1005))

HSP 1 Score: 360.5 bits (924), Expect = 1.4e-99
Identity = 223/452 (49.34%), Postives = 286/452 (63.27%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSG--VHPSSSPCFCKIKLSDFPTQFATVPLVVDG 60
           MDPCPF+R+ + +LAL+ P  A     G  VHPSS+PC+CK+++  FP+Q A +PL    
Sbjct: 1   MDPCPFVRLTIDSLALRLPETATNKQIGGEVHPSSTPCYCKLRIKHFPSQKALLPLSSFS 60

Query: 61  ETSGANSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKL 120
           + S    SS+  A  F L+   I ++  K+  L  ++ VY GR G  TCG     +S KL
Sbjct: 61  DASSPPESSTS-APGFHLDADAIRRISGKKISL--RVSVYAGRTGH-TCGV----ASGKL 120

Query: 121 LGRIVVPVT-GSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFVFRFDGE 180
           LG++ V V   ++LS T    FHNGW  + GG     SA+LHL V AE DPRFVF+F GE
Sbjct: 121 LGKVEVAVDLAAALSRT--VAFHNGWKKL-GGDGDKPSARLHLLVCAEPDPRFVFQFGGE 180

Query: 181 PECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSE---RD 240
           PECSP V+Q+Q +++QPVF+CKF   ++R+  RSRS  +  + +SR W+ +  S      
Sbjct: 181 PECSPVVYQIQDNLKQPVFSCKFS--SDRN-GRSRSLPSGFTYSSRGWITRTLSGDQWEK 240

Query: 241 QSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSW 300
           + A+ERKGW ITIHDLSGSPVAAASM+TPFV SPGS RVSRSNPGAWLILRP      SW
Sbjct: 241 KQARERKGWMITIHDLSGSPVAAASMITPFVASPGSDRVSRSNPGAWLILRPHGTCVSSW 300

Query: 301 HPWGRLEAWRESGGSDSIGYRFELL--PATSAAATLANSTISSSSGGKFTIDKTGSASPV 360
            PWGRLEAWRE G  D +GY+FEL+   +TS    +A  T+S+  GGKF+ID+       
Sbjct: 301 KPWGRLEAWRERGAIDGLGYKFELVRDNSTSTGIPIAEGTMSTKQGGKFSIDRR------ 360

Query: 361 ISPNESFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTC 420
                     SG G  P         +   KGFVM + VEG  K ++   V V  QHVTC
Sbjct: 361 ---------VSGQGESPA-------ISSPVKGFVMGSSVEGEGKVSKPV-VHVGAQHVTC 415

Query: 421 TEDAAVFVALAAAVDLSMDACRLFSQKLRKEL 442
             DAA+FVAL+AAVDLS+DAC+LFS+KLRKEL
Sbjct: 421 MADAALFVALSAAVDLSVDACQLFSRKLRKEL 415

BLAST of CmaCh16G002970 vs. TAIR10
Match: AT5G17640.1 (AT5G17640.1 Protein of unknown function (DUF1005))

HSP 1 Score: 271.2 bits (692), Expect = 1.2e-72
Identity = 181/460 (39.35%), Postives = 259/460 (56.30%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPS---SSPCFCKIKLSDFPTQFATVPLVVD 60
           MDP  F+R+ VG+LAL+ P     S S  +     SS C C+IKL  FP Q  ++PL+  
Sbjct: 1   MDPQAFIRLSVGSLALRIPKVLINSTSKSNEKKNFSSQCSCEIKLRGFPVQTTSIPLMPS 60

Query: 61  GETSGANSSSSVLAACFSLNKSQIEKLVSK----RKDLLVKIEVYTGRLGPVTCGGDLFG 120
            + +  + S   ++  F L +S +  L++          ++I V+TG+   + CG    G
Sbjct: 61  LDAAPDHHS---ISTSFYLEESDLRALLTPGCFYSPHAHLEISVFTGKKS-LNCG---VG 120

Query: 121 SSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFVFR 180
              + +G   + V G    E KP +  NGW  I G TK   +A+LHL V+ + DPR+VF+
Sbjct: 121 GKRQQIGMFKLEV-GPEWGEGKPMILFNGWISI-GKTKRDGAAELHLKVKLDPDPRYVFQ 180

Query: 181 FDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPK-IGSE 240
           F+     SPQ+ Q++GSV+QP+F+CKF          SR  +++    +  W     G+E
Sbjct: 181 FEDVTTLSPQIVQLRGSVKQPIFSCKF----------SRDRVSQVDPLNGYWSSSGDGTE 240

Query: 241 RDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDG 300
            +   +ERKGW + IHDLSGS VAAA + TPFVPS G   V++SNPGAWL++RP      
Sbjct: 241 LESERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGAWLVVRPDPSRPN 300

Query: 301 SWHPWGRLEAWRESGGSDSIGYRFELLP--ATSAAATLANSTISSSSGGKFTIDK----- 360
           SW PWG+LEAWRE G  DS+  RF LL          ++   IS+  GG+F ID      
Sbjct: 301 SWQPWGKLEAWRERGIRDSVCCRFHLLSNGLEVGDVLMSEILISAEKGGEFLIDTDKQML 360

Query: 361 TGSASPVISPNESFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEV 420
           T +A+P+ SP  S D  SG G     G           GFVMS++V+G + K+ +  V++
Sbjct: 361 TVAATPIPSPQSSGDF-SGLGQCVSGG-----------GFVMSSRVQG-EGKSSKPVVQL 420

Query: 421 AVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELR 443
           A++HVTC EDAA+F+ALAAAVDLS+ AC+ F +  R+  R
Sbjct: 421 AMRHVTCVEDAAIFMALAAAVDLSILACKPFRRTSRRRFR 428

BLAST of CmaCh16G002970 vs. NCBI nr
Match: gi|659108870|ref|XP_008454429.1| (PREDICTED: uncharacterized protein LOC103494838 [Cucumis melo])

HSP 1 Score: 799.3 bits (2063), Expect = 3.5e-228
Identity = 406/453 (89.62%), Postives = 417/453 (92.05%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPFLR+LVGNLALKFPVAAKPSFSGVHPS+SPCFCKIKL+DFPTQF T+PL+VDGE 
Sbjct: 1   MDPCPFLRILVGNLALKFPVAAKPSFSGVHPSTSPCFCKIKLNDFPTQFVTIPLLVDGEI 60

Query: 61  SGANSSSSV----------LAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGD 120
           SGA SSSS           LAACFSLNKSQIEKLV KRKD  VKIEVYTGRLGP TC GD
Sbjct: 61  SGAASSSSSSSVSSQSHSSLAACFSLNKSQIEKLV-KRKDASVKIEVYTGRLGPATCSGD 120

Query: 121 LFGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRF 180
           +FGSSAKLLGRI VPVTGS LSETKPCVF NGWTGI  G KGYSSAQLHLTVR+E DPRF
Sbjct: 121 VFGSSAKLLGRITVPVTGSGLSETKPCVFQNGWTGIGEGKKGYSSAQLHLTVRSEPDPRF 180

Query: 181 VFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIG 240
           VFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS+SWLPKI 
Sbjct: 181 VFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSKSWLPKIR 240

Query: 241 SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS 300
           SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Sbjct: 241 SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS 300

Query: 301 WHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVI 360
           W PWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISS SGG+FTID TGSASP I
Sbjct: 301 WRPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSGSGGRFTIDMTGSASPAI 360

Query: 361 SPNESFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCT 420
           SPN SFDLGSG+GSRPGSGDFGYLT YQYKGFVMST VEGMKKK+RR EVEV VQHVTCT
Sbjct: 361 SPNGSFDLGSGTGSRPGSGDFGYLTGYQYKGFVMSTMVEGMKKKSRRPEVEVGVQHVTCT 420

Query: 421 EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
Sbjct: 421 EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 452

BLAST of CmaCh16G002970 vs. NCBI nr
Match: gi|778689105|ref|XP_004150270.2| (PREDICTED: uncharacterized protein LOC101221491 [Cucumis sativus])

HSP 1 Score: 788.5 bits (2035), Expect = 6.1e-225
Identity = 403/459 (87.80%), Postives = 419/459 (91.29%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPFLR+LVGNLALKFPVAA+PSFS VHPS+SPC+CKIKL+DFPTQF T+PL+VDGET
Sbjct: 1   MDPCPFLRILVGNLALKFPVAARPSFSAVHPSTSPCYCKIKLNDFPTQFVTIPLLVDGET 60

Query: 61  SGA---------NSSSSV-------LAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGP 120
           SGA         +SSSSV       ++A FSLNKSQIEKLV KRKD  VKIEVYTGRLGP
Sbjct: 61  SGAATTSSTSSSSSSSSVSTQSHSSISASFSLNKSQIEKLV-KRKDPSVKIEVYTGRLGP 120

Query: 121 VTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRA 180
            +C GD+FGSSAKLLGRI VPVTGS LSETKPCVF NGWTGI  G KGYSSAQLHLTVR+
Sbjct: 121 ASCSGDVFGSSAKLLGRITVPVTGSGLSETKPCVFQNGWTGIGEGKKGYSSAQLHLTVRS 180

Query: 181 EQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRS 240
           E DPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS+S
Sbjct: 181 EPDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSKS 240

Query: 241 WLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300
           WLPKI SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
Sbjct: 241 WLPKIRSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300

Query: 301 RPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTG 360
           RPVDGSW PWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISS SGGKFTID TG
Sbjct: 301 RPVDGSWRPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSGSGGKFTIDMTG 360

Query: 361 SASPVISPNESFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAV 420
           SASP ISPN SFDLGSG+GSRPGSGDFGYLT YQYKGFVMST VEGMKKK+RR EVEVAV
Sbjct: 361 SASPAISPNGSFDLGSGTGSRPGSGDFGYLTGYQYKGFVMSTMVEGMKKKSRRPEVEVAV 420

Query: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
Sbjct: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 458

BLAST of CmaCh16G002970 vs. NCBI nr
Match: gi|1009178292|ref|XP_015870445.1| (PREDICTED: uncharacterized protein LOC107407651 [Ziziphus jujuba])

HSP 1 Score: 556.6 bits (1433), Expect = 3.9e-155
Identity = 301/455 (66.15%), Postives = 353/455 (77.58%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPF+R+L+G+LALKFP+A+KPSFSG+HPSSSPCFCKIKL +FP Q AT+PL+     
Sbjct: 1   MDPCPFVRILIGDLALKFPIASKPSFSGIHPSSSPCFCKIKLKNFPAQLATIPLIPIDSR 60

Query: 61  SGANS--SSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKL 120
           SG ++  SS  LAACF+LNK+ IEKL  K+  L  KI V+TGR G  TCG +    +A+L
Sbjct: 61  SGTSTDTSSHTLAACFNLNKTHIEKLAGKQTCL--KISVFTGRRG-TTCGFN----AARL 120

Query: 121 LGRIVVPVTGSSLSETKPC-VFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFVFRFDGE 180
           LGR++VP+  SS +ET+P  VF NGW GI    KG SSAQLHL VRAE DPRFVF+FDGE
Sbjct: 121 LGRVMVPLDLSS-AETRPAFVFQNGWVGIGENKKG-SSAQLHLNVRAEPDPRFVFQFDGE 180

Query: 181 PECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQSA 240
           PECSPQVFQVQG+V+QPVFTCKFGFRN  D  +SRS    + ST R+W+P + +++D   
Sbjct: 181 PECSPQVFQVQGNVKQPVFTCKFGFRNNSDL-KSRSM--SEPSTPRNWIPSLRTQKDHCT 240

Query: 241 KERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWGRL 300
           KERKGWSITIHDLSGSPVA ASMVTPFV SPGSH VSRSNPGAWLILRP +G+W PWGRL
Sbjct: 241 KERKGWSITIHDLSGSPVAVASMVTPFVASPGSHLVSRSNPGAWLILRPGEGTWKPWGRL 300

Query: 301 EAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNESFD 360
           EAWRE GGSDS+GY+FELL  T+A+ TLANS +S++SGGKF ID T + SPV SP+ S D
Sbjct: 301 EAWRERGGSDSVGYKFELLSDTAASTTLANSVVSATSGGKFVIDVTSNVSPVNSPHSSCD 360

Query: 361 LGSG----SGSRPGSGD-----FGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVT 420
            G G    SGSR GSG      FG    Y Y+GFVMS+ VEG+ K + + EVEV VQHVT
Sbjct: 361 FGGGMGSVSGSRSGSGSGSDFGFGIPAHYSYRGFVMSSTVEGVGKCS-KPEVEVGVQHVT 420

Query: 421 CTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           C EDAA FVALAAA+DLSMDACRLFSQKL KELRQ
Sbjct: 421 CAEDAAAFVALAAAMDLSMDACRLFSQKLPKELRQ 442

BLAST of CmaCh16G002970 vs. NCBI nr
Match: gi|657962698|ref|XP_008372951.1| (PREDICTED: uncharacterized protein LOC103436305 [Malus domestica])

HSP 1 Score: 549.3 bits (1414), Expect = 6.2e-153
Identity = 293/453 (64.68%), Postives = 353/453 (77.92%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVV-DGE 60
           MDPCPF+R+LVG+L LKFP+A++PS + VHPSSSPCFCKIKLS+FP Q +TVPL+  DG+
Sbjct: 49  MDPCPFVRILVGDLTLKFPMASRPSSATVHPSSSPCFCKIKLSNFPHQVSTVPLIANDGQ 108

Query: 61  TSGANSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKLL 120
            +   + +  LAACF+LNK+QIE L SKR   ++KI VYTGR+G  TCG +    SAKLL
Sbjct: 109 AAQTATHNHSLAACFNLNKTQIETLSSKRS--ILKIAVYTGRVG-ATCGLN----SAKLL 168

Query: 121 GRIVVPVTGSSLSETKPCVFHNGWTGIRGGTK----GYSSAQLHLTVRAEQDPRFVFRFD 180
           GR+ VP++   ++E++P V+ NGW  I G  K    G SSA+L+L+VRAE DPRF+F+FD
Sbjct: 169 GRVNVPLSELGVAESRPVVYQNGWIAIGGKKKSNGNGSSSAELYLSVRAEPDPRFIFQFD 228

Query: 181 GEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQ 240
           GEPECSPQVFQVQG+V+QPVFTCKFGFRN    D    S++ Q  T R+WLP  G+ ++Q
Sbjct: 229 GEPECSPQVFQVQGNVKQPVFTCKFGFRN----DLQSRSMSSQPGTPRNWLPFGGTHKEQ 288

Query: 241 SAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWG 300
            AKERKGWSITIHDLSGSPVAAASMVTPFV SPGSHRVSRSNPGAWLILRP +G+W PWG
Sbjct: 289 XAKERKGWSITIHDLSGSPVAAASMVTPFVASPGSHRVSRSNPGAWLILRPNEGTWQPWG 348

Query: 301 RLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNES 360
           RLEAW E GGSD++GYRFEL       +TLANST+ + +GGKF+ID T S +P  SP+ S
Sbjct: 349 RLEAWLERGGSDNVGYRFEL-----QNSTLANSTLGAKNGGKFSIDLTSSLTPANSPHSS 408

Query: 361 FDLGSGSGSRPGSGD-----FGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCT 420
           FDLGSGS SRPGSG      FG L +   +GFVMS+ VEG+ K + + EVEV VQHVTCT
Sbjct: 409 FDLGSGSSSRPGSGSGSDFGFGLLPSLVQRGFVMSSTVEGVGKCS-KPEVEVGVQHVTCT 468

Query: 421 EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           EDAA +VALAAA+DLSMDACR FSQKLRKELRQ
Sbjct: 469 EDAAAYVALAAAMDLSMDACRPFSQKLRKELRQ 484

BLAST of CmaCh16G002970 vs. NCBI nr
Match: gi|470111644|ref|XP_004292055.1| (PREDICTED: uncharacterized protein LOC101308741 [Fragaria vesca subsp. vesca])

HSP 1 Score: 538.1 bits (1385), Expect = 1.4e-149
Identity = 294/454 (64.76%), Postives = 346/454 (76.21%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPVAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVD--- 60
           MDPCPF+R+L+G+L LKFP  +KPSFS VHPSSSPCFCKIKL++FP QF+ VPLV+    
Sbjct: 1   MDPCPFVRILIGDLTLKFPSVSKPSFSTVHPSSSPCFCKIKLTNFPFQFSAVPLVLPSSA 60

Query: 61  GETSGANSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAK 120
           G     NS +  L ACF+L+K QIE L SK+  L   I +YTGR G  TCG +    SAK
Sbjct: 61  GAQPDPNSRAHSLNACFNLSKPQIEALASKKPSL--SISIYTGRRG-ATCGLN----SAK 120

Query: 121 LLGRIVVPVTGSSLSETKPCVFHNGWTGIRGGTKGYSSAQLHLTVRAEQDPRFVFRFDGE 180
           LLGR+ VP+   + +ET+P V+ NGW GI G   G   +Q  L+VRAE DPRFVF+FDGE
Sbjct: 121 LLGRVTVPLAELAAAETRPVVYQNGWIGIGGKKNGSGQSQFFLSVRAEPDPRFVFKFDGE 180

Query: 181 PECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWL-PKIGS-ERDQ 240
           PECSPQVFQVQG+V+QPVFTCKFGFRN  D  +SRS    +  T R+WL P +GS +++Q
Sbjct: 181 PECSPQVFQVQGNVKQPVFTCKFGFRNASDM-QSRSM--SEQGTPRNWLVPFMGSNQKEQ 240

Query: 241 SAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWG 300
           SAKERKGWS+TIHDLSGSPVAAASMVTPFV SPGS RVSRSNPGAWLILRP DG+W PWG
Sbjct: 241 SAKERKGWSLTIHDLSGSPVAAASMVTPFVASPGSQRVSRSNPGAWLILRPEDGTWKPWG 300

Query: 301 RLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNES 360
           RLEAW E GGSD++GY+FELL     +  LANST+S+S+GGKFTID T S +PV SP+ S
Sbjct: 301 RLEAWLERGGSDTVGYKFELL-----STILANSTVSASNGGKFTIDLTSSLTPVNSPHSS 360

Query: 361 FDLGSGSG-SRPGSGD-----FGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTC 420
           FD GSGSG SRPGSG      FG +     +GFVMS+ VEG+ K + + EVEV VQHVTC
Sbjct: 361 FDFGSGSGSSRPGSGSGSDFGFGLIPQLLQRGFVMSSTVEGIGKCS-KPEVEVGVQHVTC 420

Query: 421 TEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           TEDAA +VALAAA+DLSMDACR FSQKLRKELRQ
Sbjct: 421 TEDAAAYVALAAAMDLSMDACRPFSQKLRKELRQ 438

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A0A0KY09_CUCSA4.3e-22587.80Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000030 PE=4 SV=1[more]
W9QZR1_9ROSA2.6e-14263.83Uncharacterized protein OS=Morus notabilis GN=L484_003734 PE=4 SV=1[more]
A0A067EE54_CITSI4.5e-14262.71Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012226mg PE=4 SV=1[more]
V4S6F0_9ROSI1.0e-14162.50Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028366mg PE=4 SV=1[more]
A0A061DYV5_THECC3.9e-13860.52Gb:AAC34331.1 OS=Theobroma cacao GN=TCM_006326 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G10020.11.3e-12956.03 Protein of unknown function (DUF1005)[more]
AT3G19680.13.1e-12653.07 Protein of unknown function (DUF1005)[more]
AT1G50040.14.2e-11552.33 Protein of unknown function (DUF1005)[more]
AT4G29310.11.4e-9949.34 Protein of unknown function (DUF1005)[more]
AT5G17640.11.2e-7239.35 Protein of unknown function (DUF1005)[more]
Match NameE-valueIdentityDescription
gi|659108870|ref|XP_008454429.1|3.5e-22889.62PREDICTED: uncharacterized protein LOC103494838 [Cucumis melo][more]
gi|778689105|ref|XP_004150270.2|6.1e-22587.80PREDICTED: uncharacterized protein LOC101221491 [Cucumis sativus][more]
gi|1009178292|ref|XP_015870445.1|3.9e-15566.15PREDICTED: uncharacterized protein LOC107407651 [Ziziphus jujuba][more]
gi|657962698|ref|XP_008372951.1|6.2e-15364.68PREDICTED: uncharacterized protein LOC103436305 [Malus domestica][more]
gi|470111644|ref|XP_004292055.1|1.4e-14964.76PREDICTED: uncharacterized protein LOC101308741 [Fragaria vesca subsp. vesca][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR010410DUF1005
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CmaCh16G002970.1CmaCh16G002970.1mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR010410Protein of unknown function DUF1005PFAMPF06219DUF1005coord: 1..442
score: 3.8E
NoneNo IPR availablePANTHERPTHR31317FAMILY NOT NAMEDcoord: 2..443
score: 1.2E
NoneNo IPR availablePANTHERPTHR31317:SF3F2J10.8 PROTEIN-RELATEDcoord: 2..443
score: 1.2E

The following gene(s) are paralogous to this gene:
GeneParalogueOrganismBlock
CmaCh16G002970CmaCh04G004230Cucurbita maxima (Rimu)cmacmaB351
The following block(s) are covering this gene:
GeneOrganismBlock
CmaCh16G002970Cucurbita maxima (Rimu)cmacmaB135
CmaCh16G002970Cucumber (Gy14) v1cgycmaB0623
CmaCh16G002970Cucurbita moschata (Rifu)cmacmoB318
CmaCh16G002970Cucumber (Chinese Long) v2cmacuB348
CmaCh16G002970Melon (DHL92) v3.6.1cmamedB331
CmaCh16G002970Cucumber (Chinese Long) v3cmacucB0414