CmoCh16G003180 (gene) Cucurbita moschata (Rifu)

Name: CmoCh16G003180
Type: gene
Organism: Cucurbita moschata (Rifu)
Description: Nuclear factor 1 A-type isoform 2
Location: Cmo_Chr16 : 1453793 .. 1455681 (-)
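
The location field can be split into chromosome, coordinates and strand, and the span of the feature computed from it. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string such as 'Cmo_Chr16 : 1453793 .. 1455681 (-)' into its parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive.
    return end - start + 1

chrom, start, end, strand = parse_location("Cmo_Chr16 : 1453793 .. 1455681 (-)")
print(chrom, strand, span_length(start, end))
```

Under that assumption the feature spans 1889 bases on the minus strand of Cmo_Chr16.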
The following sequences are available for this feature:

Gene sequence (with intron)

CGAAAACAGCCAGCTATGGCTTCCAGCCTGTTTCATCTTCACATCCTCAATACTCCCTTATTATCTTCAATCCCCTTCTTCTCTCTTCCCATTCACTTCTCGCCGGAGAACTCTCTCTCTTCCCTCTAGCTTCTCCTCTTCTCCCGGCGTTCTCCGCCGGCACTGCCTAAATCGACTTCCTCATGGATCCCTGTCCTTTTCTTCGGGTTCTCGTCGGGAACTTAGCCCTGAAGTTTCCGATCGCCGCCAAACCATCTTTCTCCGGCGTGCATCCATCGAGTTCTCCGTGCTTTTGTAAAATTAAACTGAGCGATTTTCCGACGCAGTTCGCTACTGTTCCTCTCGTCGTCGACGGCGAAACTTCCGGCGTGAATAGTTCTTCTTCTGTACTCGCGGCCTGCTTCAGCCTCAATAAATCTCAGATTGAGAAGCTTGTTTCTAAGCGGAAAGATTTGTTAGTGAAGATTGAAGTCTACACCGGCCGCCTTGGTCCGGTTACTTGCGGCGGCGACCTCTTCGGAAGCTCCGCTAAGTTACTCGGCCGAATCGTTGTGCCGGTGACTGGTTCGAGTTTGTCAGAAACTAAACCGTGCGTGTTCCAGAACGGATGGACCGGAATCCGTGGAGGCAGAAAAGGTTACTCATCGGCTCAATTGCACTTGACGGTTCGCGCCGAGCAGGACCCGAGATTCGTGTTTCGGTTCGACGGTGAACCGGAGTGCAGTCCTCAGGTTTTTCAGGTGCAAGGAAGTGTACAGCAACCGGTTTTTACTTGCAAATTCGGTTTCAGAAACGAACGTGATTGGGATCGTTCAAGGTGCGTTCACGAATCTTTCGTGATCTTCTGTTTGTTTCCCGAGAAAATGCAAGAAACGTGCGAAGAAAATCGACGAACAGAGAGAGAAAATTAATTCGTTTGTTTTTCTGTTACTGAATCTCGATTCGCTAAATCTTAATTGCAGGTCGTCAATTACTGAGCAAAGTAGCACCTCGAGGAGTTGGTTACCGAAGATCGGATCCGAGAGGGACCAATCGGCGAAAGAACGAAAAGGATGGTCCATAACGATCCACGATCTGTCCGGATCGCCGGTCGCCGCCGCTTCCATGGTGACACCATTCGTACCGTCGCCAGGTTCGCACCGTGTAAGCCGTTCAAATCCCGGCGCCTGGCTAATTCTCCGCCCGGTCGACGGTAGCTGGCACCCGTGGGGCCGCCTCGAGGCATGGCGGGAGAGCGGCGGCTCAGATTCAATCGGCTACCGATTCGAGCTCCTCCCCGCGACCTCCGCCGCCGCTACGTTAGCGAACTCCACCATAAGCTCGAGCAGCGGCGGGAAGTTCACGATCGATAAGACCGGCAGCGCGTCGCCAGTGATCAGCCCTAACGGAAGCTTCGATCTCGGGTCTGGATCAGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGACGGCGTATCAGTACAAAGGATTTGTGATGTCGACGAAGGTGGAAGGGATGAAGAAGAAGAATAGGAGGGCAGAGGTGGAAGTGGCGGTTCAGCACGTGACATGCACGGAGGACGCCGCGGTGTTTGTGGCGTTGGCGGCGGCGGTGGACCTGAGTATGGACGCCTGCAGGCTGTTCTCTCAGAAGCTAAGGAAGGAGCTGAGGCAATGAAAAGGCTTATTGTCGTTTCGCTTGAATGAGCTTGTCGTTTTGGAATTTCAGACATTTTTGAATTTTTATATATTCATTTTATTTTTATTTTATTTTTCAATAGTTTGAGTTGTATTTTTTGTACGGAGGAAGAGATCAATAGATGCACGATTCTTATATTATATTATTTATTTATTTTTGTAATATTATAAAGCAAACCCATGAGAAGAATTGGAGGATTAAGAATCCTA

mRNA sequence

CGAAAACAGCCAGCTATGGCTTCCAGCCTGTTTCATCTTCACATCCTCAATACTCCCTTATTATCTTCAATCCCCTTCTTCTCTCTTCCCATTCACTTCTCGCCGGAGAACTCTCTCTCTTCCCTCTAGCTTCTCCTCTTCTCCCGGCGTTCTCCGCCGGCACTGCCTAAATCGACTTCCTCATGGATCCCTGTCCTTTTCTTCGGGTTCTCGTCGGGAACTTAGCCCTGAAGTTTCCGATCGCCGCCAAACCATCTTTCTCCGGCGTGCATCCATCGAGTTCTCCGTGCTTTTGTAAAATTAAACTGAGCGATTTTCCGACGCAGTTCGCTACTGTTCCTCTCGTCGTCGACGGCGAAACTTCCGGCGTGAATAGTTCTTCTTCTGTACTCGCGGCCTGCTTCAGCCTCAATAAATCTCAGATTGAGAAGCTTGTTTCTAAGCGGAAAGATTTGTTAGTGAAGATTGAAGTCTACACCGGCCGCCTTGGTCCGGTTACTTGCGGCGGCGACCTCTTCGGAAGCTCCGCTAAGTTACTCGGCCGAATCGTTGTGCCGGTGACTGGTTCGAGTTTGTCAGAAACTAAACCGTGCGTGTTCCAGAACGGATGGACCGGAATCCGTGGAGGCAGAAAAGGTTACTCATCGGCTCAATTGCACTTGACGGTTCGCGCCGAGCAGGACCCGAGATTCGTGTTTCGGTTCGACGGTGAACCGGAGTGCAGTCCTCAGGTTTTTCAGGTGCAAGGAAGTGTACAGCAACCGGTTTTTACTTGCAAATTCGGTTTCAGAAACGAACGTGATTGGGATCGTTCAAGGTCGTCAATTACTGAGCAAAGTAGCACCTCGAGGAGTTGGTTACCGAAGATCGGATCCGAGAGGGACCAATCGGCGAAAGAACGAAAAGGATGGTCCATAACGATCCACGATCTGTCCGGATCGCCGGTCGCCGCCGCTTCCATGGTGACACCATTCGTACCGTCGCCAGGTTCGCACCGTGTAAGCCGTTCAAATCCCGGCGCCTGGCTAATTCTCCGCCCGGTCGACGGTAGCTGGCACCCGTGGGGCCGCCTCGAGGCATGGCGGGAGAGCGGCGGCTCAGATTCAATCGGCTACCGATTCGAGCTCCTCCCCGCGACCTCCGCCGCCGCTACGTTAGCGAACTCCACCATAAGCTCGAGCAGCGGCGGGAAGTTCACGATCGATAAGACCGGCAGCGCGTCGCCAGTGATCAGCCCTAACGGAAGCTTCGATCTCGGGTCTGGATCAGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGACGGCGTATCAGTACAAAGGATTTGTGATGTCGACGAAGGTGGAAGGGATGAAGAAGAAGAATAGGAGGGCAGAGGTGGAAGTGGCGGTTCAGCACGTGACATGCACGGAGGACGCCGCGGTGTTTGTGGCGTTGGCGGCGGCGGTGGACCTGAGTATGGACGCCTGCAGGCTGTTCTCTCAGAAGCTAAGGAAGGAGCTGAGGCAATGAAAAGGCTTATTGTCGTTTCGCTTGAATGAGCTTGTCGTTTTGGAATTTCAGACATTTTTGAATTTTTATATATTCATTTTATTTTTATTTTATTTTTCAATAGTTTGAGTTGTATTTTTTGTACGGAGGAAGAGATCAATAGATGCACGATTCTTATATTATATTATTTATTTATTTTTGTAATATTATAAAGCAAACCCATGAGAAGAATTGGAGGATTAAGAATCCTA
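
The gene and mRNA sequences above appear to differ by one internal stretch, which would be the intron. A minimal sketch of how such a stretch can be located by comparing the two strings follows. It assumes exactly one intron and otherwise identical sequences; the function name and the toy sequences are hypothetical and are not taken from this record.

```python
def find_single_intron(gene, mrna):
    """Return (start, end) of the one segment of `gene` missing from `mrna`.

    Coordinates are 0-based and half-open. Returns None if the two
    sequences cannot be explained by removing a single segment.
    """
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")
    intron_len = len(gene) - len(mrna)
    # Length of the longest common prefix.
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1
    # Try the rightmost placement first, then shift left if needed.
    for start in range(prefix, -1, -1):
        if gene[:start] == mrna[:start] and gene[start + intron_len:] == mrna[start:]:
            return start, start + intron_len
    return None

# Toy example: two 9-base exons separated by a 12-base GT...AG intron.
toy_gene = "ATGGCCAAA" + "GTAAGTTTTCAG" + "CATTACTGA"
toy_mrna = "ATGGCCAAA" + "CATTACTGA"
start, end = find_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

When the last bases of an exon match the last bases of the intron, the boundary is ambiguous from string comparison alone; this sketch reports the rightmost placement consistent with the data, so splice-site motifs should be checked separately.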

Coding sequence (CDS)

ATGGATCCCTGTCCTTTTCTTCGGGTTCTCGTCGGGAACTTAGCCCTGAAGTTTCCGATCGCCGCCAAACCATCTTTCTCCGGCGTGCATCCATCGAGTTCTCCGTGCTTTTGTAAAATTAAACTGAGCGATTTTCCGACGCAGTTCGCTACTGTTCCTCTCGTCGTCGACGGCGAAACTTCCGGCGTGAATAGTTCTTCTTCTGTACTCGCGGCCTGCTTCAGCCTCAATAAATCTCAGATTGAGAAGCTTGTTTCTAAGCGGAAAGATTTGTTAGTGAAGATTGAAGTCTACACCGGCCGCCTTGGTCCGGTTACTTGCGGCGGCGACCTCTTCGGAAGCTCCGCTAAGTTACTCGGCCGAATCGTTGTGCCGGTGACTGGTTCGAGTTTGTCAGAAACTAAACCGTGCGTGTTCCAGAACGGATGGACCGGAATCCGTGGAGGCAGAAAAGGTTACTCATCGGCTCAATTGCACTTGACGGTTCGCGCCGAGCAGGACCCGAGATTCGTGTTTCGGTTCGACGGTGAACCGGAGTGCAGTCCTCAGGTTTTTCAGGTGCAAGGAAGTGTACAGCAACCGGTTTTTACTTGCAAATTCGGTTTCAGAAACGAACGTGATTGGGATCGTTCAAGGTCGTCAATTACTGAGCAAAGTAGCACCTCGAGGAGTTGGTTACCGAAGATCGGATCCGAGAGGGACCAATCGGCGAAAGAACGAAAAGGATGGTCCATAACGATCCACGATCTGTCCGGATCGCCGGTCGCCGCCGCTTCCATGGTGACACCATTCGTACCGTCGCCAGGTTCGCACCGTGTAAGCCGTTCAAATCCCGGCGCCTGGCTAATTCTCCGCCCGGTCGACGGTAGCTGGCACCCGTGGGGCCGCCTCGAGGCATGGCGGGAGAGCGGCGGCTCAGATTCAATCGGCTACCGATTCGAGCTCCTCCCCGCGACCTCCGCCGCCGCTACGTTAGCGAACTCCACCATAAGCTCGAGCAGCGGCGGGAAGTTCACGATCGATAAGACCGGCAGCGCGTCGCCAGTGATCAGCCCTAACGGAAGCTTCGATCTCGGGTCTGGATCAGGATCTCGACCCGGATCCGGGGATTTCGGGTACTTGACGGCGTATCAGTACAAAGGATTTGTGATGTCGACGAAGGTGGAAGGGATGAAGAAGAAGAATAGGAGGGCAGAGGTGGAAGTGGCGGTTCAGCACGTGACATGCACGGAGGACGCCGCGGTGTTTGTGGCGTTGGCGGCGGCGGTGGACCTGAGTATGGACGCCTGCAGGCTGTTCTCTCAGAAGCTAAGGAAGGAGCTGAGGCAATGA
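
The coding sequence can be translated to protein to check it against the query shown in the BLAST alignments. The sketch below is a plain-Python translation using the standard genetic code; the function name is hypothetical, and only the first 24 bases of the CDS above are used as input.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        try:
            index = (16 * BASES.index(codon[0])
                     + 4 * BASES.index(codon[1])
                     + BASES.index(codon[2]))
        except ValueError:
            protein.append("X")  # codon contains a non-ACGT character
            continue
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)

# First eight codons of the CDS listed above.
print(translate("ATGGATCCCTGTCCTTTTCTTCGG"))
```

The output, MDPCPFLR, matches the first eight residues of the query in the alignments. Any trailing bases that do not form a complete codon are ignored by this sketch.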
BLAST of CmoCh16G003180 vs. TrEMBL
Match: A0A0A0KY09_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000030 PE=4 SV=1)

HSP 1 Score: 792.0 bits (2044), Expect = 3.8e-226
Identity = 403/459 (87.80%), Positives = 421/459 (91.72%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPFLR+LVGNLALKFP+AA+PSFS VHPS+SPC+CKIKL+DFPTQF T+PL+VDGET
Sbjct: 1   MDPCPFLRILVGNLALKFPVAARPSFSAVHPSTSPCYCKIKLNDFPTQFVTIPLLVDGET 60

Query: 61  SGV---------NSSSSV-------LAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGP 120
           SG          +SSSSV       ++A FSLNKSQIEKLV KRKD  VKIEVYTGRLGP
Sbjct: 61  SGAATTSSTSSSSSSSSVSTQSHSSISASFSLNKSQIEKLV-KRKDPSVKIEVYTGRLGP 120

Query: 121 VTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRA 180
            +C GD+FGSSAKLLGRI VPVTGS LSETKPCVFQNGWTGI  G+KGYSSAQLHLTVR+
Sbjct: 121 ASCSGDVFGSSAKLLGRITVPVTGSGLSETKPCVFQNGWTGIGEGKKGYSSAQLHLTVRS 180

Query: 181 EQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRS 240
           E DPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS+S
Sbjct: 181 EPDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSKS 240

Query: 241 WLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300
           WLPKI SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
Sbjct: 241 WLPKIRSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300

Query: 301 RPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTG 360
           RPVDGSW PWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISS SGGKFTID TG
Sbjct: 301 RPVDGSWRPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSGSGGKFTIDMTG 360

Query: 361 SASPVISPNGSFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAV 420
           SASP ISPNGSFDLGSG+GSRPGSGDFGYLT YQYKGFVMST VEGMKKK+RR EVEVAV
Sbjct: 361 SASPAISPNGSFDLGSGTGSRPGSGDFGYLTGYQYKGFVMSTMVEGMKKKSRRPEVEVAV 420

Query: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
Sbjct: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 458
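
The percentages in each HSP header are the identical or positive column counts divided by the alignment length, and the counts themselves can be read off the middle line of each alignment block. The sketch below illustrates both steps. The function names are hypothetical, and it assumes the usual BLAST midline convention: a letter marks an identical pair, `+` a conservative substitution, and a space anything else.

```python
def percent(matches, aligned_length):
    """Format a match count as a percentage with two decimals."""
    return f"{100 * matches / aligned_length:.2f}%"

def count_midline(midline):
    """Return (identities, positives) for one midline fragment."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives

# Header values of the first TrEMBL hit.
print(percent(403, 459), percent(421, 459))

# A short fragment from the start of the first midline.
print(count_midline("MDPCPFLR+LVGNLALKFP"))
```

Summing the per-block counts over a whole HSP should reproduce the header totals, provided the midlines are copied with their spaces intact.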

BLAST of CmoCh16G003180 vs. TrEMBL
Match: W9QZR1_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_003734 PE=4 SV=1)

HSP 1 Score: 514.2 bits (1323), Expect = 1.5e-142
Identity = 292/461 (63.34%), Positives = 349/461 (75.70%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSG-VHPSSSPCFCKIKLSDFPTQFATVPLVVDGE 60
           MDPCPF+R+L+G+LALK P+A+KPSFSG VHPS+SPCFCKIKL +FP QFA +PL  D  
Sbjct: 1   MDPCPFVRILIGDLALKLPVASKPSFSGTVHPSASPCFCKIKLKNFPHQFAAIPLNRD-- 60

Query: 61  TSGVNSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKLL 120
                ++S  LAACFSL+K+Q E L +K + L  KI+VYTGR G  TCG +    ++KLL
Sbjct: 61  -----ANSRSLAACFSLDKAQFESLAAKPQCL--KIKVYTGRRGS-TCGLN----ASKLL 120

Query: 121 GRIVVPVTGSSLSETKPCVFQNGWTGIRGGRK------GYSSAQLHLTVRAEQDPRFVFR 180
           G++ VP+    ++E++P VFQNGW  I  G+K        SS+QL L VRAE DPRFVF+
Sbjct: 121 GKVSVPLD-LRVAESRPYVFQNGWVSI--GKKDNKESLNLSSSQLRLCVRAEPDPRFVFQ 180

Query: 181 FDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSW-LPKIGSE 240
           FDGEPECSPQVFQVQGSV+QPVFTCKF FR+  D     SS+TE + TSRSW +P +  +
Sbjct: 181 FDGEPECSPQVFQVQGSVKQPVFTCKFDFRSSSDL--KNSSVTEPN-TSRSWFVPSLKIQ 240

Query: 241 RDQS-AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSW 300
           ++Q   KERKGWS+TIHDLSGSPVA ASMVTPFV SPGS RVSRSNPGAWLILRP +G+W
Sbjct: 241 KEQKYTKERKGWSVTIHDLSGSPVAVASMVTPFVASPGSDRVSRSNPGAWLILRPGEGTW 300

Query: 301 HPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGS--ASPV 360
            PWGRLEAWRE GG+DS+GYRFELL   +  ATLA S +S+++GGKF+ID T S  +SP 
Sbjct: 301 KPWGRLEAWRERGGTDSVGYRFELLGDDATPATLACSAVSAAAGGKFSIDVTSSIVSSPA 360

Query: 361 ISPNGSFDLGSGSGSRPGS-------GDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEV 420
           ISP  S DLGSGSGSRPGS        DFG       +GFVMS+ VEG+ KK+ + EVEV
Sbjct: 361 ISPQSSIDLGSGSGSRPGSRAGSGSGSDFG--VGLSNRGFVMSSTVEGVGKKS-KPEVEV 420

Query: 421 AVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            VQHVTC+EDAA FVALAAA+DLSMDACRLFSQKL KELRQ
Sbjct: 421 GVQHVTCSEDAAAFVALAAAMDLSMDACRLFSQKLPKELRQ 438

BLAST of CmoCh16G003180 vs. TrEMBL
Match: A0A067EE54_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012226mg PE=4 SV=1)

HSP 1 Score: 509.2 bits (1310), Expect = 5.0e-141
Identity = 294/472 (62.29%), Positives = 348/472 (73.73%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFP-IAAKPSF-SGVHPSSSPCFCKIKLSDFPTQFATVPLVVDG 60
           MDPCPF+R+LVGNLALKFP + +KPSF S +HPSSS C+CKIKL  FP + ATVPLV D 
Sbjct: 1   MDPCPFVRILVGNLALKFPTVTSKPSFLSRIHPSSSSCYCKIKLKSFPDEIATVPLVQDE 60

Query: 61  ETSGVNSSSSVLAACFSLNKSQIEKLVSKRKD-------LLVKIEVYTGRLGPVTCGGDL 120
            T    + S  LAACF+LNK+QI+K++ K K        + ++++VYTG  G ++C    
Sbjct: 61  TTPANGNLSHSLAACFNLNKAQIDKILEKSKSPKSNSGVISLRVDVYTGSNG-MSCV--- 120

Query: 121 FGSSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRFV 180
             ++ KLLGR+ VP+     +E++P V  NGW GI   +KG S AQL+LTV++E DPRFV
Sbjct: 121 --TTDKLLGRVSVPLDLRG-AESRPSVIHNGWAGIGENKKG-SQAQLYLTVKSEPDPRFV 180

Query: 181 FRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDR---SRSSITEQSSTSRSWLPK 240
           F+FDGEPECSPQVFQVQGSV+Q VFTCKFGFRN  + DR   SR+S+TE +ST RSWL  
Sbjct: 181 FQFDGEPECSPQVFQVQGSVKQAVFTCKFGFRNSNN-DRNLVSRTSMTE-NSTPRSWLSA 240

Query: 241 IGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVD 300
            GSE+DQS+KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS RVSRSNPGAWLILRP +
Sbjct: 241 FGSEKDQSSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSDRVSRSNPGAWLILRPGN 300

Query: 301 GSWHPWGRLEAWRESGGSDSIGYRFELLPAT----SAAATLANSTISSSSGGKFTIDKTG 360
            +W PWGRLEAWRE G SD +GYRF+LL  T    S++ T+AN+ ISS+ GGKFTID   
Sbjct: 301 CTWKPWGRLEAWREPGNSDLLGYRFDLLHDTISSNSSSTTVANANISSTKGGKFTIDMAS 360

Query: 361 SAS--PVISPNGSFDLGSGS----GSRPGSG---DFGYLTA----YQYKGFVMSTKVEGM 420
           S S  PV SP  S D GSGS    GSRPGSG   DF +        Q +GFVMS  VEG 
Sbjct: 361 SVSTTPVHSPQSSCDFGSGSWSGPGSRPGSGSGSDFAFCCTGPPILQSRGFVMSATVEGG 420

Query: 421 KKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            K + + EVEV VQHVTCTEDAA FVALAAA+DLS+DAC LFS KLRKELRQ
Sbjct: 421 GKCS-KPEVEVGVQHVTCTEDAAAFVALAAAMDLSVDACTLFSHKLRKELRQ 461

BLAST of CmoCh16G003180 vs. TrEMBL
Match: V4S6F0_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028366mg PE=4 SV=1)

HSP 1 Score: 508.1 bits (1307), Expect = 1.1e-140
Identity = 293/472 (62.08%), Positives = 348/472 (73.73%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFP-IAAKPSF-SGVHPSSSPCFCKIKLSDFPTQFATVPLVVDG 60
           MDPCPF+R+LVGNLALKFP + +KPSF S +HPSSS C+CKIKL  FP + ATVPLV D 
Sbjct: 1   MDPCPFVRILVGNLALKFPTVTSKPSFLSRIHPSSSSCYCKIKLKSFPDEIATVPLVQDE 60

Query: 61  ETSGVNSSSSVLAACFSLNKSQIEKLVSKRKD-------LLVKIEVYTGRLGPVTCGGDL 120
            T    + S  LAACF+LNK+QI+K++ K K        + ++++VYTG  G ++C    
Sbjct: 61  TTPANGNLSHSLAACFNLNKAQIDKILEKSKSSKSNNGVISLRVDVYTGSNG-MSCV--- 120

Query: 121 FGSSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRFV 180
             ++ KLLGR+ VP+     +E++P V  NGW GI   +KG S AQL+LTV++E DPRFV
Sbjct: 121 --TTDKLLGRVSVPLDMRG-AESRPSVIHNGWAGIGENKKG-SQAQLYLTVKSEPDPRFV 180

Query: 181 FRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDR---SRSSITEQSSTSRSWLPK 240
           F+FDGEPECSPQVFQVQGSV+Q VFTCKFGFRN  + DR   SR+S+TE +ST RSWL  
Sbjct: 181 FQFDGEPECSPQVFQVQGSVKQAVFTCKFGFRNSNN-DRNLVSRTSMTE-NSTPRSWLSA 240

Query: 241 IGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVD 300
            GSE+DQS+KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS RVSRSNPGAWLILRP +
Sbjct: 241 FGSEKDQSSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSDRVSRSNPGAWLILRPGN 300

Query: 301 GSWHPWGRLEAWRESGGSDSIGYRFELLPAT----SAAATLANSTISSSSGGKFTIDKTG 360
            +W PWGRLEAWRE G SD +GYRF+LL  T    S++ T+AN+ ISS+ GGKFTID   
Sbjct: 301 CTWKPWGRLEAWREPGNSDLLGYRFDLLHDTISSNSSSTTVANANISSTKGGKFTIDMAS 360

Query: 361 SAS--PVISPNGSFDLGSGS----GSRPGSG---DFGYLTA----YQYKGFVMSTKVEGM 420
           S S  PV SP  S D GSGS    GSRPGSG   DF +        Q +GFVMS  VEG 
Sbjct: 361 SVSTTPVHSPQSSCDFGSGSWSGPGSRPGSGSGSDFAFCCTGPPILQSRGFVMSATVEGG 420

Query: 421 KKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            K + + EVEV VQHVTCTEDAA FVALAAA+DLS+DAC LFS KLRKELR+
Sbjct: 421 GKCS-KPEVEVGVQHVTCTEDAAAFVALAAAMDLSVDACTLFSHKLRKELRR 461

BLAST of CmoCh16G003180 vs. TrEMBL
Match: A0A061DYV5_THECC (Gb:AAC34331.1 OS=Theobroma cacao GN=TCM_006326 PE=4 SV=1)

HSP 1 Score: 496.5 bits (1277), Expect = 3.3e-137
Identity = 281/466 (60.30%), Positives = 340/466 (72.96%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPF+R+LVGNLALKFP++ KPS S +HPS+S C+CKIKL +FP Q AT+P +   E 
Sbjct: 1   MDPCPFVRILVGNLALKFPVSTKPSLSRIHPSTSSCYCKIKLKNFPHQVATIPFIQSQED 60

Query: 61  SGVNSSSSV-----LAACFSLNKSQIEKLVSK-RKDLLVKIEVYTGRLGPVTCGGDLFGS 120
           S  +SSSS      LAACFSL+KSQI+++VS+      + IEVY    G  +CG     +
Sbjct: 61  SSTSSSSSSSFQKSLAACFSLSKSQIDRIVSRGSSSYKLSIEVYADPDGS-SCGL----T 120

Query: 121 SAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGR--KGYSSAQLHLTVRAEQDPRFVF 180
             KLLG++ VP+     +E++P V  NGW  I   R  K  SSAQL LTVR E DPRFVF
Sbjct: 121 YGKLLGKVSVPLDLRG-AESRPSVVHNGWIAIGRNRSNKNGSSAQLCLTVRTEPDPRFVF 180

Query: 181 RFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSE 240
           +F GEPECSPQVFQVQG ++Q VFTCKFGFRN  D +    S   +S+T+R+WLP + +E
Sbjct: 181 QFGGEPECSPQVFQVQGGLKQAVFTCKFGFRNTSDRNLGSRSSLPESNTTRNWLPSLKTE 240

Query: 241 RDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWH 300
           ++QS+KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS RVSRSNPGAWLILRP  G+W 
Sbjct: 241 KEQSSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSDRVSRSNPGAWLILRPGCGTWK 300

Query: 301 PWGRLEAWRESGGSDSIGYRFEL-----LPATSAAATLANSTISSSSGGKFTIDKTG--S 360
           PWGRLEAWRE G +D++GYRF+L     + ATS  ATLA+S +S+  GGKFT+D T   +
Sbjct: 301 PWGRLEAWREPGFTDALGYRFDLFHDDYIAATSTTATLASSILSTKLGGKFTMDMTTNVA 360

Query: 361 ASPVISPNGSFDLGSGSGSRPGSG---DFGYLTAYQ----YK-GFVMSTKVEGMKKKNRR 420
           A+P  SP  S D   GSGSRPGSG   DFG+  +      Y+ GFVMS+ VEG  K + +
Sbjct: 361 ATPSTSPQSSCDF--GSGSRPGSGSGSDFGFAASISPQSLYRGGFVMSSTVEGAGKCS-K 420

Query: 421 AEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
            EVEV VQHVTCTEDAAVFVALAAA+DLS+DACR FSQKLRKELRQ
Sbjct: 421 PEVEVGVQHVTCTEDAAVFVALAAAMDLSVDACRSFSQKLRKELRQ 457

BLAST of CmoCh16G003180 vs. TAIR10
Match: AT1G10020.1 (AT1G10020.1 Protein of unknown function (DUF1005))

HSP 1 Score: 455.7 bits (1171), Expect = 3.3e-128
Identity = 259/464 (55.82%), Positives = 322/464 (69.40%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPF+R+ +GNLALK P+AAK + S VHPSSSPCFCKIKL +FP Q A +P +    T
Sbjct: 1   MDPCPFIRLTIGNLALKVPLAAKTTSSVVHPSSSPCFCKIKLKNFPPQTAAIPYIPLETT 60

Query: 61  SGVNSSSSVLAACFSLNKSQIEKLVSKR---KDLLVKIEVYTGRLGPVTCGGDLFGSSAK 120
                 +  LAA F L+ S I++L S+        +KI +YTGR G   CG      S +
Sbjct: 61  QFPEIQT--LAATFHLSSSDIQRLASRSIFTSKPCLKILIYTGRAG-AACGVH----SGR 120

Query: 121 LLGRIVVPVTGSSLSETKPCVFQNGWTGI-RGGRKGYSSAQLHLTVRAEQDPRFVFRFDG 180
           LL ++ VP+  S  +++KPCVF NGW  + +G  K  SSAQ HL V+AE DPRFVF+FDG
Sbjct: 121 LLAKVSVPLDLSG-TQSKPCVFHNGWISVGKGAGKSSSSAQFHLNVKAEPDPRFVFQFDG 180

Query: 181 EPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQS 240
           EPECSPQV Q+QG+++QPVFTCKF  R+  D  +   S+  ++S SRSWL   GSER++ 
Sbjct: 181 EPECSPQVVQIQGNIRQPVFTCKFSCRHTGDRTQRSRSLPTETSVSRSWLNSFGSERERP 240

Query: 241 AKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWGR 300
            KERKGWSIT+HDLSGSPVA AS+VTPFV SPG+ RVSRSNPG+WLILRP D +W PWGR
Sbjct: 241 GKERKGWSITVHDLSGSPVAMASIVTPFVASPGTDRVSRSNPGSWLILRPGDCTWRPWGR 300

Query: 301 LEAWRESGG-SDSIGYRFELLP--ATSAAATLANSTISSSSGGKFTID---KTGSASPVI 360
           LEAWRE GG +D +GYRFEL+P  ++ A   LA STISS  GGKF+I+      S+SP  
Sbjct: 301 LEAWRERGGATDGLGYRFELIPDGSSGAGIVLAESTISSHRGGKFSIELGSSPSSSSPTS 360

Query: 361 SPNGS-----FDLGSGSGSRP------GSGDFGY-LTAYQ-YKGFVMSTKVEGMKKKNRR 420
             N S        GSG G+ P      GSGD+GY L  +  YKGFVMS  VEG  K ++ 
Sbjct: 361 VVNRSRSRRGGSSGSGGGASPANSPRGGSGDYGYGLWPWNVYKGFVMSASVEGEGKCSKP 420

Query: 421 AEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKEL 442
             VEV+VQHV+C EDAA +VAL+AA+DLSMDACRLF+Q++RKEL
Sbjct: 421 C-VEVSVQHVSCMEDAAAYVALSAAIDLSMDACRLFNQRMRKEL 455

BLAST of CmoCh16G003180 vs. TAIR10
Match: AT3G19680.1 (AT3G19680.1 Protein of unknown function (DUF1005))

HSP 1 Score: 447.2 bits (1149), Expect = 1.2e-125
Identity = 258/488 (52.87%), Positives = 327/488 (67.01%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAK-------PSFSGVHPSSSPCFCKIKLSDFPTQFATVP 60
           MDPC F+R++VGNLA++FP ++        PS SG++P++  C+CKI+  +FP +  +VP
Sbjct: 1   MDPCSFVRIIVGNLAVRFPSSSSSSSSSSGPSVSGINPTAPNCYCKIRFKNFPREIVSVP 60

Query: 61  LVVDGETSGVNSSSS-----VLAACFSLNKSQIEKLVSKRKDLLVKIEVYT-------GR 120
           ++   E+      SS      +AACFSL+K+QIE  + K K  ++ +E Y+         
Sbjct: 61  VMFRTESESETRCSSSGNVSTVAACFSLSKAQIEASLKKPKFSVLSVEAYSRGNSDGDDG 120

Query: 121 LGPVTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGR---KGYSSAQL 180
           +   +CG  L  +  KLLGR  V +   S +ETK  +  NGW  +   +   K  S  +L
Sbjct: 121 VSGASCG--LATAGEKLLGRFEVSLDLKS-AETKSFLAHNGWVALPSKKTKSKTGSDPEL 180

Query: 181 HLTVRAEQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRS---RSSI 240
           H++VR E DPRFVF+FDGEPECSPQVFQVQG+ +Q VFTCKFG RN    DR+    SS+
Sbjct: 181 HVSVRVEPDPRFVFQFDGEPECSPQVFQVQGNTKQAVFTCKFGSRNSNSGDRNLLHSSSM 240

Query: 241 TEQSSTSRSWLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSR 300
             + S++RS +  + SE++Q +KERKGWSIT+HDLSGSPVA ASMVTPFVPSPGS+RV+R
Sbjct: 241 MSEISSTRSCISSMNSEKEQPSKERKGWSITVHDLSGSPVAMASMVTPFVPSPGSNRVTR 300

Query: 301 SNPGAWLILRPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSG 360
           S+PGAWLILRP   +W PWGRLEAWRE+G SD++GYRFEL     A A  A+S+IS  +G
Sbjct: 301 SSPGAWLILRPDGCTWKPWGRLEAWREAGYSDTLGYRFELFQDGIATAVSASSSISLKNG 360

Query: 361 GKFTIDKTGSAS-----PVISPNGSFDLGSGS------GSRPGSG---DFGYL------T 420
           G F ID TG  S     P  SP GS+DLGSGS       SRPGSG   DFGYL       
Sbjct: 361 GSFVIDVTGGTSTTASTPTTSPQGSWDLGSGSSAGSRPASRPGSGSGSDFGYLLPQHPSA 420

Query: 421 AYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQ 444
           A Q +GFVMS  VEG+ K++ + EVEV V HVTCTEDAA  VALAAAVDLS+DACRLFS 
Sbjct: 421 AAQNRGFVMSATVEGVGKRS-KPEVEVGVTHVTCTEDAAAHVALAAAVDLSLDACRLFSH 480

BLAST of CmoCh16G003180 vs. TAIR10
Match: AT1G50040.1 (AT1G50040.1 Protein of unknown function (DUF1005))

HSP 1 Score: 411.8 bits (1057), Expect = 5.5e-115
Identity = 247/472 (52.33%), Positives = 309/472 (65.47%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFP----------IAAKPSFSGVHPSSSPCFCKIKLSDFPTQFA 60
           MDPC F+R++VGNLA++FP           ++ PS S V  SS  C+CKIK   FP Q  
Sbjct: 1   MDPCSFVRIIVGNLAVRFPRSPSSSSSSSSSSGPSVSDV--SSGNCYCKIKFKSFPRQIV 60

Query: 61  TVPLVV----DGETSGVNSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVT 120
           +VP+++    + E+   + + S +AACFSL+KSQIE  + K K  ++ +EVY+ R    +
Sbjct: 61  SVPVLLRTESESESRCCSGNVSTVAACFSLSKSQIETSLKKAKWSVLSVEVYSRR--SAS 120

Query: 121 CGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFQNGW----TGIRGGRKGYSSAQLHLTV 180
           C G +  S  KL+GR  V +     +E+K C+  NGW    T  +  +K  S  +LH++V
Sbjct: 121 C-GFVAASGEKLIGRFQVTL-DLKAAESKTCLAHNGWVDLGTKSKNNKKSGSDPELHVSV 180

Query: 181 RAEQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS 240
           R E D RFVF+FDGEPECSPQVFQVQG+ +Q VFTCKFGFRN  D + S S         
Sbjct: 181 RVEPDTRFVFQFDGEPECSPQVFQVQGNAKQAVFTCKFGFRNSGDRNLSLS--------- 240

Query: 241 RSWLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWL 300
              L  + S ++Q +KERKGWSITIHDLSGSPVA ASMVTPFVPSPGS+RVSRS+PGAWL
Sbjct: 241 ---LSSVTSGKEQFSKERKGWSITIHDLSGSPVAMASMVTPFVPSPGSNRVSRSSPGAWL 300

Query: 301 ILRPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDK 360
           ILRP   +W PW RL+AWRE G SD +GYRFEL     A A  A+S+IS+  GG F ID 
Sbjct: 301 ILRPDGYTWKPWVRLQAWREPGVSDVLGYRFELYKDGIAVAVSASSSISTKLGGSFIIDG 360

Query: 361 TGSASPVI---SPNGSFDLGSGSGSR-----PGSGD---FGYLTAYQYKGFVMSTKVEGM 420
           + S +      S  GSFDL S S  R      GSG    F    A Q  GFVMST+V+G+
Sbjct: 361 STSTTTTASWSSSEGSFDLSSWSSIRSSRTDSGSGSDFRFSLSQAQQNLGFVMSTRVQGV 420

Query: 421 KKKNRRAEVEVAVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           +K++ + +VEV V+HVTCTEDAA  VALAAAVDLSMDACRLFSQKLR ELRQ
Sbjct: 421 EKQS-KPKVEVGVKHVTCTEDAAAHVALAAAVDLSMDACRLFSQKLRNELRQ 453

BLAST of CmoCh16G003180 vs. TAIR10
Match: AT4G29310.1 (AT4G29310.1 Protein of unknown function (DUF1005))

HSP 1 Score: 356.3 bits (913), Expect = 2.7e-98
Identity = 222/452 (49.12%), Positives = 285/452 (63.05%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSG--VHPSSSPCFCKIKLSDFPTQFATVPLVVDG 60
           MDPCPF+R+ + +LAL+ P  A     G  VHPSS+PC+CK+++  FP+Q A +PL    
Sbjct: 1   MDPCPFVRLTIDSLALRLPETATNKQIGGEVHPSSTPCYCKLRIKHFPSQKALLPLSSFS 60

Query: 61  ETSGVNSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKL 120
           + S    SS+  A  F L+   I ++  K+  L  ++ VY GR G  TCG     +S KL
Sbjct: 61  DASSPPESSTS-APGFHLDADAIRRISGKKISL--RVSVYAGRTGH-TCGV----ASGKL 120

Query: 121 LGRIVVPVT-GSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRFVFRFDGE 180
           LG++ V V   ++LS T    F NGW  + GG     SA+LHL V AE DPRFVF+F GE
Sbjct: 121 LGKVEVAVDLAAALSRT--VAFHNGWKKL-GGDGDKPSARLHLLVCAEPDPRFVFQFGGE 180

Query: 181 PECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSE---RD 240
           PECSP V+Q+Q +++QPVF+CKF   ++R+  RSRS  +  + +SR W+ +  S      
Sbjct: 181 PECSPVVYQIQDNLKQPVFSCKFS--SDRN-GRSRSLPSGFTYSSRGWITRTLSGDQWEK 240

Query: 241 QSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDGSW 300
           + A+ERKGW ITIHDLSGSPVAAASM+TPFV SPGS RVSRSNPGAWLILRP      SW
Sbjct: 241 KQARERKGWMITIHDLSGSPVAAASMITPFVASPGSDRVSRSNPGAWLILRPHGTCVSSW 300

Query: 301 HPWGRLEAWRESGGSDSIGYRFELL--PATSAAATLANSTISSSSGGKFTIDKTGSASPV 360
            PWGRLEAWRE G  D +GY+FEL+   +TS    +A  T+S+  GGKF+ID+       
Sbjct: 301 KPWGRLEAWRERGAIDGLGYKFELVRDNSTSTGIPIAEGTMSTKQGGKFSIDRR------ 360

Query: 361 ISPNGSFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTC 420
                     SG G  P         +   KGFVM + VEG  K ++   V V  QHVTC
Sbjct: 361 ---------VSGQGESPA-------ISSPVKGFVMGSSVEGEGKVSKPV-VHVGAQHVTC 415

Query: 421 TEDAAVFVALAAAVDLSMDACRLFSQKLRKEL 442
             DAA+FVAL+AAVDLS+DAC+LFS+KLRKEL
Sbjct: 421 MADAALFVALSAAVDLSVDACQLFSRKLRKEL 415

BLAST of CmoCh16G003180 vs. TAIR10
Match: AT5G17640.1 (AT5G17640.1 Protein of unknown function (DUF1005))

HSP 1 Score: 266.5 bits (680), Expect = 2.8e-71
Identity = 180/460 (39.13%), Positives = 258/460 (56.09%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPS---SSPCFCKIKLSDFPTQFATVPLVVD 60
           MDP  F+R+ VG+LAL+ P     S S  +     SS C C+IKL  FP Q  ++PL+  
Sbjct: 1   MDPQAFIRLSVGSLALRIPKVLINSTSKSNEKKNFSSQCSCEIKLRGFPVQTTSIPLMPS 60

Query: 61  GETSGVNSSSSVLAACFSLNKSQIEKLVSK----RKDLLVKIEVYTGRLGPVTCGGDLFG 120
            + +  + S   ++  F L +S +  L++          ++I V+TG+   + CG    G
Sbjct: 61  LDAAPDHHS---ISTSFYLEESDLRALLTPGCFYSPHAHLEISVFTGKKS-LNCG---VG 120

Query: 121 SSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRFVFR 180
              + +G   + V G    E KP +  NGW  I G  K   +A+LHL V+ + DPR+VF+
Sbjct: 121 GKRQQIGMFKLEV-GPEWGEGKPMILFNGWISI-GKTKRDGAAELHLKVKLDPDPRYVFQ 180

Query: 181 FDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPK-IGSE 240
           F+     SPQ+ Q++GSV+QP+F+CKF          SR  +++    +  W     G+E
Sbjct: 181 FEDVTTLSPQIVQLRGSVKQPIFSCKF----------SRDRVSQVDPLNGYWSSSGDGTE 240

Query: 241 RDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRP---VDG 300
            +   +ERKGW + IHDLSGS VAAA + TPFVPS G   V++SNPGAWL++RP      
Sbjct: 241 LESERRERKGWKVKIHDLSGSAVAAAFITTPFVPSTGCDWVAKSNPGAWLVVRPDPSRPN 300

Query: 301 SWHPWGRLEAWRESGGSDSIGYRFELLP--ATSAAATLANSTISSSSGGKFTIDK----- 360
           SW PWG+LEAWRE G  DS+  RF LL          ++   IS+  GG+F ID      
Sbjct: 301 SWQPWGKLEAWRERGIRDSVCCRFHLLSNGLEVGDVLMSEILISAEKGGEFLIDTDKQML 360

Query: 361 TGSASPVISPNGSFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEV 420
           T +A+P+ SP  S D  SG G     G           GFVMS++V+G + K+ +  V++
Sbjct: 361 TVAATPIPSPQSSGDF-SGLGQCVSGG-----------GFVMSSRVQG-EGKSSKPVVQL 420

Query: 421 AVQHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELR 443
           A++HVTC EDAA+F+ALAAAVDLS+ AC+ F +  R+  R
Sbjct: 421 AMRHVTCVEDAAIFMALAAAVDLSILACKPFRRTSRRRFR 428

BLAST of CmoCh16G003180 vs. NCBI nr
Match: gi|659108870|ref|XP_008454429.1| (PREDICTED: uncharacterized protein LOC103494838 [Cucumis melo])

HSP 1 Score: 802.7 bits (2072), Expect = 3.1e-229
Identity = 406/453 (89.62%), Positives = 419/453 (92.49%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPFLR+LVGNLALKFP+AAKPSFSGVHPS+SPCFCKIKL+DFPTQF T+PL+VDGE 
Sbjct: 1   MDPCPFLRILVGNLALKFPVAAKPSFSGVHPSTSPCFCKIKLNDFPTQFVTIPLLVDGEI 60

Query: 61  SGVNSSSSV----------LAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGD 120
           SG  SSSS           LAACFSLNKSQIEKLV KRKD  VKIEVYTGRLGP TC GD
Sbjct: 61  SGAASSSSSSSVSSQSHSSLAACFSLNKSQIEKLV-KRKDASVKIEVYTGRLGPATCSGD 120

Query: 121 LFGSSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRF 180
           +FGSSAKLLGRI VPVTGS LSETKPCVFQNGWTGI  G+KGYSSAQLHLTVR+E DPRF
Sbjct: 121 VFGSSAKLLGRITVPVTGSGLSETKPCVFQNGWTGIGEGKKGYSSAQLHLTVRSEPDPRF 180

Query: 181 VFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIG 240
           VFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS+SWLPKI 
Sbjct: 181 VFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSKSWLPKIR 240

Query: 241 SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS 300
           SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS
Sbjct: 241 SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGS 300

Query: 301 WHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVI 360
           W PWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISS SGG+FTID TGSASP I
Sbjct: 301 WRPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSGSGGRFTIDMTGSASPAI 360

Query: 361 SPNGSFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCT 420
           SPNGSFDLGSG+GSRPGSGDFGYLT YQYKGFVMST VEGMKKK+RR EVEV VQHVTCT
Sbjct: 361 SPNGSFDLGSGTGSRPGSGDFGYLTGYQYKGFVMSTMVEGMKKKSRRPEVEVGVQHVTCT 420

Query: 421 EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
Sbjct: 421 EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 452

BLAST of CmoCh16G003180 vs. NCBI nr
Match: gi|778689105|ref|XP_004150270.2| (PREDICTED: uncharacterized protein LOC101221491 [Cucumis sativus])

HSP 1 Score: 792.0 bits (2044), Expect = 5.5e-226
Identity = 403/459 (87.80%), Positives = 421/459 (91.72%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPFLR+LVGNLALKFP+AA+PSFS VHPS+SPC+CKIKL+DFPTQF T+PL+VDGET
Sbjct: 1   MDPCPFLRILVGNLALKFPVAARPSFSAVHPSTSPCYCKIKLNDFPTQFVTIPLLVDGET 60

Query: 61  SGV---------NSSSSV-------LAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGP 120
           SG          +SSSSV       ++A FSLNKSQIEKLV KRKD  VKIEVYTGRLGP
Sbjct: 61  SGAATTSSTSSSSSSSSVSTQSHSSISASFSLNKSQIEKLV-KRKDPSVKIEVYTGRLGP 120

Query: 121 VTCGGDLFGSSAKLLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRA 180
            +C GD+FGSSAKLLGRI VPVTGS LSETKPCVFQNGWTGI  G+KGYSSAQLHLTVR+
Sbjct: 121 ASCSGDVFGSSAKLLGRITVPVTGSGLSETKPCVFQNGWTGIGEGKKGYSSAQLHLTVRS 180

Query: 181 EQDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRS 240
           E DPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTS+S
Sbjct: 181 EPDPRFVFRFDGEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSKS 240

Query: 241 WLPKIGSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300
           WLPKI SERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL
Sbjct: 241 WLPKIRSERDQSAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLIL 300

Query: 301 RPVDGSWHPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTG 360
           RPVDGSW PWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISS SGGKFTID TG
Sbjct: 301 RPVDGSWRPWGRLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSGSGGKFTIDMTG 360

Query: 361 SASPVISPNGSFDLGSGSGSRPGSGDFGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAV 420
           SASP ISPNGSFDLGSG+GSRPGSGDFGYLT YQYKGFVMST VEGMKKK+RR EVEVAV
Sbjct: 361 SASPAISPNGSFDLGSGTGSRPGSGDFGYLTGYQYKGFVMSTMVEGMKKKSRRPEVEVAV 420

Query: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ
Sbjct: 421 QHVTCTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 458

BLAST of CmoCh16G003180 vs. NCBI nr
Match: gi|1009178292|ref|XP_015870445.1| (PREDICTED: uncharacterized protein LOC107407651 [Ziziphus jujuba])

HSP 1 Score: 559.3 bits (1440), Expect = 6.0e-156
Identity = 303/455 (66.59%), Positives = 355/455 (78.02%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVDGET 60
           MDPCPF+R+L+G+LALKFPIA+KPSFSG+HPSSSPCFCKIKL +FP Q AT+PL+     
Sbjct: 1   MDPCPFVRILIGDLALKFPIASKPSFSGIHPSSSPCFCKIKLKNFPAQLATIPLIPIDSR 60

Query: 61  SGVNS--SSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKL 120
           SG ++  SS  LAACF+LNK+ IEKL  K+  L  KI V+TGR G  TCG +    +A+L
Sbjct: 61  SGTSTDTSSHTLAACFNLNKTHIEKLAGKQTCL--KISVFTGRRG-TTCGFN----AARL 120

Query: 121 LGRIVVPVTGSSLSETKPC-VFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRFVFRFDGE 180
           LGR++VP+  SS +ET+P  VFQNGW GI   +KG SSAQLHL VRAE DPRFVF+FDGE
Sbjct: 121 LGRVMVPLDLSS-AETRPAFVFQNGWVGIGENKKG-SSAQLHLNVRAEPDPRFVFQFDGE 180

Query: 181 PECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQSA 240
           PECSPQVFQVQG+V+QPVFTCKFGFRN  D  +SRS    + ST R+W+P + +++D   
Sbjct: 181 PECSPQVFQVQGNVKQPVFTCKFGFRNNSDL-KSRS--MSEPSTPRNWIPSLRTQKDHCT 240

Query: 241 KERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWGRL 300
           KERKGWSITIHDLSGSPVA ASMVTPFV SPGSH VSRSNPGAWLILRP +G+W PWGRL
Sbjct: 241 KERKGWSITIHDLSGSPVAVASMVTPFVASPGSHLVSRSNPGAWLILRPGEGTWKPWGRL 300

Query: 301 EAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNGSFD 360
           EAWRE GGSDS+GY+FELL  T+A+ TLANS +S++SGGKF ID T + SPV SP+ S D
Sbjct: 301 EAWRERGGSDSVGYKFELLSDTAASTTLANSVVSATSGGKFVIDVTSNVSPVNSPHSSCD 360

Query: 361 LGSG----SGSRPGSGD-----FGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVT 420
            G G    SGSR GSG      FG    Y Y+GFVMS+ VEG+ K + + EVEV VQHVT
Sbjct: 361 FGGGMGSVSGSRSGSGSGSDFGFGIPAHYSYRGFVMSSTVEGVGKCS-KPEVEVGVQHVT 420

Query: 421 CTEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           C EDAA FVALAAA+DLSMDACRLFSQKL KELRQ
Sbjct: 421 CAEDAAAFVALAAAMDLSMDACRLFSQKLPKELRQ 442

BLAST of CmoCh16G003180 vs. NCBI nr
Match: gi|657962698|ref|XP_008372951.1| (PREDICTED: uncharacterized protein LOC103436305 [Malus domestica])

HSP 1 Score: 551.6 bits (1420), Expect = 1.3e-153
Identity = 294/453 (64.90%), Positives = 355/453 (78.37%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVV-DGE 60
           MDPCPF+R+LVG+L LKFP+A++PS + VHPSSSPCFCKIKLS+FP Q +TVPL+  DG+
Sbjct: 49  MDPCPFVRILVGDLTLKFPMASRPSSATVHPSSSPCFCKIKLSNFPHQVSTVPLIANDGQ 108

Query: 61  TSGVNSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAKLL 120
            +   + +  LAACF+LNK+QIE L SKR   ++KI VYTGR+G  TCG +    SAKLL
Sbjct: 109 AAQTATHNHSLAACFNLNKTQIETLSSKRS--ILKIAVYTGRVG-ATCGLN----SAKLL 168

Query: 121 GRIVVPVTGSSLSETKPCVFQNGWTGIRGGRK----GYSSAQLHLTVRAEQDPRFVFRFD 180
           GR+ VP++   ++E++P V+QNGW  I G +K    G SSA+L+L+VRAE DPRF+F+FD
Sbjct: 169 GRVNVPLSELGVAESRPVVYQNGWIAIGGKKKSNGNGSSSAELYLSVRAEPDPRFIFQFD 228

Query: 181 GEPECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWLPKIGSERDQ 240
           GEPECSPQVFQVQG+V+QPVFTCKFGFRN    D    S++ Q  T R+WLP  G+ ++Q
Sbjct: 229 GEPECSPQVFQVQGNVKQPVFTCKFGFRN----DLQSRSMSSQPGTPRNWLPFGGTHKEQ 288

Query: 241 SAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWG 300
            AKERKGWSITIHDLSGSPVAAASMVTPFV SPGSHRVSRSNPGAWLILRP +G+W PWG
Sbjct: 289 XAKERKGWSITIHDLSGSPVAAASMVTPFVASPGSHRVSRSNPGAWLILRPNEGTWQPWG 348

Query: 301 RLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNGS 360
           RLEAW E GGSD++GYRFEL       +TLANST+ + +GGKF+ID T S +P  SP+ S
Sbjct: 349 RLEAWLERGGSDNVGYRFEL-----QNSTLANSTLGAKNGGKFSIDLTSSLTPANSPHSS 408

Query: 361 FDLGSGSGSRPGSGD-----FGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTCT 420
           FDLGSGS SRPGSG      FG L +   +GFVMS+ VEG+ K + + EVEV VQHVTCT
Sbjct: 409 FDLGSGSSSRPGSGSGSDFGFGLLPSLVQRGFVMSSTVEGVGKCS-KPEVEVGVQHVTCT 468

Query: 421 EDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           EDAA +VALAAA+DLSMDACR FSQKLRKELRQ
Sbjct: 469 EDAAAYVALAAAMDLSMDACRPFSQKLRKELRQ 484

BLAST of CmoCh16G003180 vs. NCBI nr
Match: gi|470111644|ref|XP_004292055.1| (PREDICTED: uncharacterized protein LOC101308741 [Fragaria vesca subsp. vesca])

HSP 1 Score: 540.0 bits (1390), Expect = 3.8e-150
Identity = 295/454 (64.98%), Positives = 348/454 (76.65%), Query Frame = 1

Query: 1   MDPCPFLRVLVGNLALKFPIAAKPSFSGVHPSSSPCFCKIKLSDFPTQFATVPLVVD--- 60
           MDPCPF+R+L+G+L LKFP  +KPSFS VHPSSSPCFCKIKL++FP QF+ VPLV+    
Sbjct: 1   MDPCPFVRILIGDLTLKFPSVSKPSFSTVHPSSSPCFCKIKLTNFPFQFSAVPLVLPSSA 60

Query: 61  GETSGVNSSSSVLAACFSLNKSQIEKLVSKRKDLLVKIEVYTGRLGPVTCGGDLFGSSAK 120
           G     NS +  L ACF+L+K QIE L SK+  L   I +YTGR G  TCG +    SAK
Sbjct: 61  GAQPDPNSRAHSLNACFNLSKPQIEALASKKPSL--SISIYTGRRG-ATCGLN----SAK 120

Query: 121 LLGRIVVPVTGSSLSETKPCVFQNGWTGIRGGRKGYSSAQLHLTVRAEQDPRFVFRFDGE 180
           LLGR+ VP+   + +ET+P V+QNGW GI G + G   +Q  L+VRAE DPRFVF+FDGE
Sbjct: 121 LLGRVTVPLAELAAAETRPVVYQNGWIGIGGKKNGSGQSQFFLSVRAEPDPRFVFKFDGE 180

Query: 181 PECSPQVFQVQGSVQQPVFTCKFGFRNERDWDRSRSSITEQSSTSRSWL-PKIGS-ERDQ 240
           PECSPQVFQVQG+V+QPVFTCKFGFRN  D  +SRS    +  T R+WL P +GS +++Q
Sbjct: 181 PECSPQVFQVQGNVKQPVFTCKFGFRNASDM-QSRSM--SEQGTPRNWLVPFMGSNQKEQ 240

Query: 241 SAKERKGWSITIHDLSGSPVAAASMVTPFVPSPGSHRVSRSNPGAWLILRPVDGSWHPWG 300
           SAKERKGWS+TIHDLSGSPVAAASMVTPFV SPGS RVSRSNPGAWLILRP DG+W PWG
Sbjct: 241 SAKERKGWSLTIHDLSGSPVAAASMVTPFVASPGSQRVSRSNPGAWLILRPEDGTWKPWG 300

Query: 301 RLEAWRESGGSDSIGYRFELLPATSAAATLANSTISSSSGGKFTIDKTGSASPVISPNGS 360
           RLEAW E GGSD++GY+FELL     +  LANST+S+S+GGKFTID T S +PV SP+ S
Sbjct: 301 RLEAWLERGGSDTVGYKFELL-----STILANSTVSASNGGKFTIDLTSSLTPVNSPHSS 360

Query: 361 FDLGSGSG-SRPGSGD-----FGYLTAYQYKGFVMSTKVEGMKKKNRRAEVEVAVQHVTC 420
           FD GSGSG SRPGSG      FG +     +GFVMS+ VEG+ K + + EVEV VQHVTC
Sbjct: 361 FDFGSGSGSSRPGSGSGSDFGFGLIPQLLQRGFVMSSTVEGIGKCS-KPEVEVGVQHVTC 420

Query: 421 TEDAAVFVALAAAVDLSMDACRLFSQKLRKELRQ 444
           TEDAA +VALAAA+DLSMDACR FSQKLRKELRQ
Sbjct: 421 TEDAAAYVALAAAMDLSMDACRPFSQKLRKELRQ 438

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
A0A0A0KY09_CUCSA  3.8e-226  87.80     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G000030 PE=4 SV=1
W9QZR1_9ROSA      1.5e-142  63.34     Uncharacterized protein OS=Morus notabilis GN=L484_003734 PE=4 SV=1
A0A067EE54_CITSI  5.0e-141  62.29     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g012226mg PE=4 SV=1
V4S6F0_9ROSI      1.1e-140  62.08     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10028366mg PE=4 SV=1
A0A061DYV5_THECC  3.3e-137  60.30     Gb:AAC34331.1 OS=Theobroma cacao GN=TCM_006326 PE=4 SV=1
Match Name   E-value   Identity  Description
AT1G10020.1  3.3e-128  55.82     Protein of unknown function (DUF1005)
AT3G19680.1  1.2e-125  52.87     Protein of unknown function (DUF1005)
AT1G50040.1  5.5e-115  52.33     Protein of unknown function (DUF1005)
AT4G29310.1  2.7e-98   49.12     Protein of unknown function (DUF1005)
AT5G17640.1  2.8e-71   39.13     Protein of unknown function (DUF1005)
Match Name                         E-value   Identity  Description
gi|659108870|ref|XP_008454429.1|   3.1e-229  89.62     PREDICTED: uncharacterized protein LOC103494838 [Cucumis melo]
gi|778689105|ref|XP_004150270.2|   5.5e-226  87.80     PREDICTED: uncharacterized protein LOC101221491 [Cucumis sativus]
gi|1009178292|ref|XP_015870445.1|  6.0e-156  66.59     PREDICTED: uncharacterized protein LOC107407651 [Ziziphus jujuba]
gi|657962698|ref|XP_008372951.1|   1.3e-153  64.90     PREDICTED: uncharacterized protein LOC103436305 [Malus domestica]
gi|470111644|ref|XP_004292055.1|   3.8e-150  64.98     PREDICTED: uncharacterized protein LOC101308741 [Fragaria vesca subsp. vesca]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR010410  DUF1005
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh16G003180.1  CmoCh16G003180.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description                      Source   Source Term    Source Description       Alignment
IPR010410  Protein of unknown function DUF1005  PFAM     PF06219        DUF1005                  coord: 1..442; score: 1.4E
None       No IPR available                     PANTHER  PTHR31317      FAMILY NOT NAMED         coord: 2..443; score: 7.0E
None       No IPR available                     PANTHER  PTHR31317:SF3  F2J10.8 PROTEIN-RELATED  coord: 2..443; score: 7.0E

The following gene(s) are paralogous to this gene:
Gene            Paralogue       Organism                   Block
CmoCh16G003180  CmoCh04G004560  Cucurbita moschata (Rifu)  cmocmoB286
The following block(s) are covering this gene:
Gene            Organism                    Block
CmoCh16G003180  Cucumber (Chinese Long) v2  cmocuB338
CmoCh16G003180  Cucurbita pepo (Zucchini)   cmocpeB321
CmoCh16G003180  Melon (DHL92) v3.6.1        cmomedB317
CmoCh16G003180  Cucumber (Chinese Long) v3  cmocucB0409
CmoCh16G003180  Cucurbita moschata (Rifu)   cmocmoB096
CmoCh16G003180  Cucumber (Gy14) v1          cgycmoB0625
CmoCh16G003180  Cucurbita maxima (Rimu)     cmacmoB133