CmaCh03G008740 (gene) Cucurbita maxima (Rimu)

Name: CmaCh03G008740
Type: gene
Organism: Cucurbita maxima (Cucurbita maxima (Rimu))
Description: Coffea canephora DH200=94 genomic scaffold, scaffold_8
Location: Cma_Chr03 : 6462479 .. 6463606 (+)
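
As a rough illustration of how the Location field can be read programmatically, the sketch below splits it into sequence ID, start, end, and strand, then computes the span length. The function names `parse_location` and `span_length` are hypothetical, and the length formula assumes the coordinates are 1-based and inclusive (as in GFF-style annotations), which this page does not state explicitly.

```python
import re

def parse_location(loc: str):
    """Split a string like 'Cma_Chr03 : 6462479 .. 6463606 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cma_Chr03 : 6462479 .. 6463606 (+)")
print(seqid, start, end, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span works out to 1128 bases on the plus strand of Cma_Chr03.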
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGCGGCCACCGCCACCGCCACCGACAAGCCCTCCCGGCCAAGACTCGCATGTTTCTCCTTCGCTGCCTATGCCAAAGCCGTCATCGACCATCTCAAATCCCTTCAAATTCCGGTCCTCCCCGGCCTCTCGGACACTGAATTTACATCCTTCGAGTCCACCTTCCAATTCTCCTTCCCACCGGACCTCCGTTCAATCCTCCAAGAGGGTCTCCCAATTGGCTCTGGTTTCCCCAATTGGAGATCCTCCTCCATTCAACAGCTTCACATTCTAATCAATCTCCCCCGGCTTTGCCTCCTCAAGGAAATTTCTCAGAGAAAATTCTGGTGTCAATCTTGGGGTACTCAACCCAATGACCCAAATGACGCCGTTGCTCTAGCCCAGCAACTCTTGGACAGAGCCCCTGTTCTCGTCCCCATTTACAAAAACTGTTACATCCCTTCCGAGCCGAACATGGCCGGGAACCCCGTATTTCACCTCGACGGTGGGGAGATTCGTGTTTCCAGCTTCGATTTAGCCGGATTCTTTCAAACCCATGAATATTCTCAACTTAGTGAGGCTGAACCAGACCGGTCAATGATCGACTCGCCGGCTTGGGCGGCGACGGAGGCCAGAGCGGTGGATTTCTGGACGGAAGTGGCTTCGGGAAAAAATATGGCGGGGCGTGGGGTAACACTAGGATGGTGGAATGAAGGGGAATTTGAAATGGGGTTGGACGGATGCCTTGAGGACGTATTGTGGAAGTTGCGACAGGGCGGGTGGAGGGAAGAAGATATCAGAGATATGATGATGATGGACGGCCATGATCGGAGCTTGGAACAGAGTCGGGCGACCATCGAGAAACTCAAAGTATCAGTATGTGAGATTTTACTGAGTGGGGGATGGAGCAGGGACGATGTGATGTACTCTCTGGGTCTCGAAGACAATTCCAATTCCAATTCCGCCATTGTTATTCCTGAAGAAGAAGAAGAAGAAGAATCAACGTTTGAAATCAATCTTCATCATCATCGCCCAACCAAAGTCCCCCAAGTGGAATGCAAGAAAAAGGCCAAATCCCGCAGCTCCACCAACCACCATAACATGTCCCCATTTTACTTTGCTCCACATCGAAATTTAATCTTGTAA

mRNA sequence

ATGGCCGCGGCCACCGCCACCGCCACCGACAAGCCCTCCCGGCCAAGACTCGCATGTTTCTCCTTCGCTGCCTATGCCAAAGCCGTCATCGACCATCTCAAATCCCTTCAAATTCCGGTCCTCCCCGGCCTCTCGGACACTGAATTTACATCCTTCGAGTCCACCTTCCAATTCTCCTTCCCACCGGACCTCCGTTCAATCCTCCAAGAGGGTCTCCCAATTGGCTCTGGTTTCCCCAATTGGAGATCCTCCTCCATTCAACAGCTTCACATTCTAATCAATCTCCCCCGGCTTTGCCTCCTCAAGGAAATTTCTCAGAGAAAATTCTGGTGTCAATCTTGGGGTACTCAACCCAATGACCCAAATGACGCCGTTGCTCTAGCCCAGCAACTCTTGGACAGAGCCCCTGTTCTCGTCCCCATTTACAAAAACTGTTACATCCCTTCCGAGCCGAACATGGCCGGGAACCCCGTATTTCACCTCGACGGTGGGGAGATTCGTGTTTCCAGCTTCGATTTAGCCGGATTCTTTCAAACCCATGAATATTCTCAACTTAGTGAGGCTGAACCAGACCGGTCAATGATCGACTCGCCGGCTTGGGCGGCGACGGAGGCCAGAGCGGTGGATTTCTGGACGGAAGTGGCTTCGGGAAAAAATATGGCGGGGCGTGGGGTAACACTAGGATGGTGGAATGAAGGGGAATTTGAAATGGGGTTGGACGGATGCCTTGAGGACGTATTGTGGAAGTTGCGACAGGGCGGGTGGAGGGAAGAAGATATCAGAGATATGATGATGATGGACGGCCATGATCGGAGCTTGGAACAGAGTCGGGCGACCATCGAGAAACTCAAAGTATCAGTATGTGAGATTTTACTGAGTGGGGGATGGAGCAGGGACGATGTGATGTACTCTCTGGGTCTCGAAGACAATTCCAATTCCAATTCCGCCATTGTTATTCCTGAAGAAGAAGAAGAAGAAGAATCAACGTTTGAAATCAATCTTCATCATCATCGCCCAACCAAAGTCCCCCAAGTGGAATGCAAGAAAAAGGCCAAATCCCGCAGCTCCACCAACCACCATAACATGTCCCCATTTTACTTTGCTCCACATCGAAATTTAATCTTGTAA

Coding sequence (CDS)

ATGGCCGCGGCCACCGCCACCGCCACCGACAAGCCCTCCCGGCCAAGACTCGCATGTTTCTCCTTCGCTGCCTATGCCAAAGCCGTCATCGACCATCTCAAATCCCTTCAAATTCCGGTCCTCCCCGGCCTCTCGGACACTGAATTTACATCCTTCGAGTCCACCTTCCAATTCTCCTTCCCACCGGACCTCCGTTCAATCCTCCAAGAGGGTCTCCCAATTGGCTCTGGTTTCCCCAATTGGAGATCCTCCTCCATTCAACAGCTTCACATTCTAATCAATCTCCCCCGGCTTTGCCTCCTCAAGGAAATTTCTCAGAGAAAATTCTGGTGTCAATCTTGGGGTACTCAACCCAATGACCCAAATGACGCCGTTGCTCTAGCCCAGCAACTCTTGGACAGAGCCCCTGTTCTCGTCCCCATTTACAAAAACTGTTACATCCCTTCCGAGCCGAACATGGCCGGGAACCCCGTATTTCACCTCGACGGTGGGGAGATTCGTGTTTCCAGCTTCGATTTAGCCGGATTCTTTCAAACCCATGAATATTCTCAACTTAGTGAGGCTGAACCAGACCGGTCAATGATCGACTCGCCGGCTTGGGCGGCGACGGAGGCCAGAGCGGTGGATTTCTGGACGGAAGTGGCTTCGGGAAAAAATATGGCGGGGCGTGGGGTAACACTAGGATGGTGGAATGAAGGGGAATTTGAAATGGGGTTGGACGGATGCCTTGAGGACGTATTGTGGAAGTTGCGACAGGGCGGGTGGAGGGAAGAAGATATCAGAGATATGATGATGATGGACGGCCATGATCGGAGCTTGGAACAGAGTCGGGCGACCATCGAGAAACTCAAAGTATCAGTATGTGAGATTTTACTGAGTGGGGGATGGAGCAGGGACGATGTGATGTACTCTCTGGGTCTCGAAGACAATTCCAATTCCAATTCCGCCATTGTTATTCCTGAAGAAGAAGAAGAAGAAGAATCAACGTTTGAAATCAATCTTCATCATCATCGCCCAACCAAAGTCCCCCAAGTGGAATGCAAGAAAAAGGCCAAATCCCGCAGCTCCACCAACCACCATAACATGTCCCCATTTTACTTTGCTCCACATCGAAATTTAATCTTGTAA

Protein sequence

MAAATATATDKPSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYSQLSEAEPDRSMIDSPAWAATEARAVDFWTEVASGKNMAGRGVTLGWWNEGEFEMGLDGCLEDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATIEKLKVSVCEILLSGGWSRDDVMYSLGLEDNSNSNSAIVIPEEEEEEESTFEINLHHHRPTKVPQVECKKKAKSRSSTNHHNMSPFYFAPHRNLIL
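
The relationship between the coding sequence and the protein sequence above can be checked with a small translation routine. The following is a minimal sketch, assuming the standard genetic code; the helper name `translate` is hypothetical, and only the first 42 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First 14 codons of the CDS listed above.
cds_prefix = "ATGGCCGCGGCCACCGCCACCGCCACCGACAAGCCCTCCCGG"
print(translate(cds_prefix))  # MAAATATATDKPSR
```

The printed peptide matches the first 14 residues of the protein sequence listed above; passing the full CDS string to `translate` would extend the same check to the whole protein.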
BLAST of CmaCh03G008740 vs. TrEMBL
Match: A0A0A0KJT1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G497030 PE=4 SV=1)

HSP 1 Score: 604.0 bits (1556), Expect = 1.3e-169
Identity = 301/373 (80.70%), Positives = 324/373 (86.86%), Query Frame = 1

Query: 4   ATATATDKPSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPPD 63
           ATAT T  P RP+LACFSFAAYAK VIDHLKSLQIPV PGLSD EFTS ESTF+FSFPPD
Sbjct: 2   ATATVTVNPPRPKLACFSFAAYAKTVIDHLKSLQIPVHPGLSDPEFTSVESTFRFSFPPD 61

Query: 64  LRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPND 123
           LRSILQEGLPIGSGFPNWRSSS QQLHILINLP+ CLLKEISQRKFWCQSWG QP+D ND
Sbjct: 62  LRSILQEGLPIGSGFPNWRSSSTQQLHILINLPKFCLLKEISQRKFWCQSWGAQPDDTND 121

Query: 124 AVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYS 183
           AVALA+Q LDRAPVLVPIYKN YIPS PNMAGNPVFHLD GEIRVSSFDLAGFFQTHEYS
Sbjct: 122 AVALAKQFLDRAPVLVPIYKNWYIPSAPNMAGNPVFHLDDGEIRVSSFDLAGFFQTHEYS 181

Query: 184 QLSEAEPDRSMIDSPAWAATEARAVDFWTEVASGKNMAGRGVTLGWWNEGEFEMGLDGCL 243
           QL +AE DR +IDSPAWAATEARAV+FWTEVAS K   GR VT GWWNEGEFEMGLDGCL
Sbjct: 182 QLGKAETDRLVIDSPAWAATEARAVEFWTEVASRKKATGREVTEGWWNEGEFEMGLDGCL 241

Query: 244 EDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATIEKLKVSVCEILLSGGWSRDDVMY 303
           EDV WKLR+GGWREED+RDMMMMD HDRSLEQ+ AT+EKL+VSVCEILLSGGWSRDDV+Y
Sbjct: 242 EDVFWKLREGGWREEDVRDMMMMDRHDRSLEQNEATMEKLRVSVCEILLSGGWSRDDVVY 301

Query: 304 SLGLEDNSNSNSAIVIPEEEEEEESTFEINLHH-HRPTKVPQVECKKKAKSRSSTNHHNM 363
           SL LE     +SA VIPEEE    STFEINLHH H P ++PQVE K K ++ ++T+H  M
Sbjct: 302 SLDLE----GHSASVIPEEE----STFEINLHHQHLPIRIPQVERKIKPRN-TTTSHLKM 361

Query: 364 SPFYFAPHRNLIL 376
            PF+FAPHRNLIL
Sbjct: 362 PPFFFAPHRNLIL 365
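
The percentages on each HSP summary line follow directly from the reported counts. The sketch below recomputes them for the first hit, assuming (as in standard BLAST reports) that the denominator is the alignment length; the function name `hsp_percentages` and the regular expression are illustrative, not part of any BLAST tool.

```python
import re

# Accepts both "Positives" and the "Postives" spelling seen in some reports.
HSP_STATS = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)")

def hsp_percentages(line: str):
    """Return (percent identity, percent positives) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"no HSP statistics found in: {line!r}")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

line = "Identity = 301/373 (80.70%), Positives = 324/373 (86.86%), Query Frame = 1"
print(hsp_percentages(line))  # (80.7, 86.86)
```

The recomputed values agree with the 80.70% identity and 86.86% positives reported for the A0A0A0KJT1_CUCSA hit.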

BLAST of CmaCh03G008740 vs. TrEMBL
Match: A0A061DHC3_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_000880 PE=4 SV=1)

HSP 1 Score: 354.0 bits (907), Expect = 2.3e-94
Identity = 176/338 (52.07%), Positives = 231/338 (68.34%), Query Frame = 1

Query: 3   AATATATDK------------PSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFT 62
           AATAT T              P R +L CFSFAAY+K++IDHLKSL IP+LPGL+D EF+
Sbjct: 2   AATATTTQTQPPNPLMKAPKGPPRSKLVCFSFAAYSKSLIDHLKSLDIPILPGLTDQEFS 61

Query: 63  SFESTFQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFW 122
           S EST  F+FPPDLRSILQEGLP+   FPNWRSSS QQL+IL+NLP L L K I+   FW
Sbjct: 62  SVESTLHFTFPPDLRSILQEGLPVDPSFPNWRSSSPQQLNILLNLPLLSLSKNITLHNFW 121

Query: 123 CQSWGTQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSS 182
             SWG +P++ N+A+AL + LL +AP+LVPIY+NCYIPS PNMAGNPVF++DG E+R+ S
Sbjct: 122 SDSWGPKPSNSNEALALVKSLLQKAPLLVPIYRNCYIPSTPNMAGNPVFYVDGDEVRILS 181

Query: 183 FDLAGFFQTHE-------YSQLSEAEPDRSMIDSPAWAATEARAVDFWTEVA-SGKNMAG 242
           FD+  FFQ  E       +   +  + +    + PAWAAT AR +DFWT+VA  G+ +  
Sbjct: 182 FDITRFFQEVEFLRRGGVFKPFTRKKSNSVNNNVPAWAATTARRIDFWTDVAEKGRRVVA 241

Query: 243 RGVTLGWWNEGEFE--MGLDGCLEDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATI 302
           RGVT GWW+ GE E  +GL GCLE+V WKLR+GGWREE++R+MMM+DG D++  + ++  
Sbjct: 242 RGVTRGWWSRGEVEEDLGLRGCLEEVFWKLREGGWREEEVREMMMIDGCDQNENKEKSGT 301

Query: 303 EKLKVS---------VCEILLSGGWSRDDVMYSLGLED 310
             +            +  +LL  GW+ +DV+Y+L L D
Sbjct: 302 RLVMDGGDAAWHVRVLSVVLLRAGWASEDVVYALDLHD 339

BLAST of CmaCh03G008740 vs. TrEMBL
Match: A0A0D2QF47_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_003G034100 PE=4 SV=1)

HSP 1 Score: 350.9 bits (899), Expect = 1.9e-93
Identity = 182/345 (52.75%), Positives = 234/345 (67.83%), Query Frame = 1

Query: 2   AAATATATDK------------PSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEF 61
           A ATATAT              P R +LACFSFAAY+K +I HL++L+IP+LPGL+D EF
Sbjct: 3   ATATATATTAQTHRPNPLLKAPPPRSKLACFSFAAYSKTLIHHLQTLEIPILPGLTDHEF 62

Query: 62  TSFESTFQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKF 121
           TS ES F F+FPPDLRSILQEGLP+   FPNWRSSS QQL++L+NLP L L K I+   F
Sbjct: 63  TSIESAFHFTFPPDLRSILQEGLPVDPSFPNWRSSSPQQLNVLLNLPLLSLSKNITLHNF 122

Query: 122 WCQSWGTQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVS 181
           W  SWGT+P++PN+A+ L ++L   APVL+PIY+NCYIPS PNMAGNPVF++DG E+R+ 
Sbjct: 123 WSPSWGTKPSNPNEALGLVKRLFITAPVLIPIYRNCYIPSTPNMAGNPVFYVDGEEVRIL 182

Query: 182 SFDLAGFFQTHEYSQLSEA-----EPDRSMIDS--PAWAATEARAVDFWTEVA-SGKNMA 241
           SFD+  FFQ  E+ +            R+ +D+  PAWAA  AR ++FWT+VA  G+ + 
Sbjct: 183 SFDVNRFFQEVEFLRRGGVFKPFKRKKRNGVDNKVPAWAAKAARRIEFWTDVAEKGRRVV 242

Query: 242 GRGVTLG-WWNEGEFEMGLDGCLEDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATI 301
            RGVT G WW + E E  L GCLE+V W+LR GGWREE+++DMMMMDG D+S  + +   
Sbjct: 243 ARGVTTGWWWRKEEEEFRLGGCLEEVFWRLRDGGWREEEVKDMMMMDGCDQSQIKPKNGT 302

Query: 302 EKL----------KVSVCEILLSGGWSRDDVMYSLGLEDNSNSNS 316
             L          +VS   +LL GGWSR+DV+YSL L+D     S
Sbjct: 303 RPLIDGDDAAWHTRVS-SVVLLRGGWSREDVVYSLDLDDIDGDES 346

BLAST of CmaCh03G008740 vs. TrEMBL
Match: V4SKM9_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025875mg PE=4 SV=1)

HSP 1 Score: 345.5 bits (885), Expect = 8.1e-92
Identity = 175/332 (52.71%), Positives = 232/332 (69.88%), Query Frame = 1

Query: 1   MAAATATAT----DKPSRP-RLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFEST 60
           MAA T T T     KP R  +L CFS+AAYAK +IDHLKSL IP+LPGL+D EF+  EST
Sbjct: 1   MAATTTTETMISRAKPPRTTKLVCFSYAAYAKNLIDHLKSLNIPILPGLNDAEFSDIEST 60

Query: 61  FQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWG 120
           F F+FPPDLRSIL+EGLP G  FPNW SSS QQL IL+NLP L L K +S   FW  SWG
Sbjct: 61  FNFTFPPDLRSILREGLPAGPAFPNWLSSSHQQLRILVNLPVLSLSKNVSLNNFWSVSWG 120

Query: 121 TQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAG 180
            +P + NDA++L ++LLD+AP+LVPIY+NCY+PS PNMAGNPVF++D  E+RV SFDLAG
Sbjct: 121 QRPQNNNDALSLIKKLLDKAPLLVPIYRNCYVPSTPNMAGNPVFYIDTEEVRVLSFDLAG 180

Query: 181 FFQTHEYSQLSEAEPDRSMIDSPAWAATEARAVDFWTEVAS-GKNMAGRGVTLG-WWNEG 240
           FF+  +  +  +A     ++D PAWAA E R ++FWT+VA  G+ +  RG + G WW  G
Sbjct: 181 FFK--QVDEFVKAGGGGGVLDMPAWAAKEPRTIEFWTDVAERGRRVLARGGSRGRWWRAG 240

Query: 241 --EFEMGLDGCLEDVLWKLRQGGWREEDIRDMMM-MDGHDRSLEQ--------SRATIEK 300
                +GL+ C+++V W+LR GGWREE++R+MMM +DGHD    +         R ++E+
Sbjct: 241 CENTRVGLECCMDEVFWRLRDGGWREEEVREMMMVVDGHDDPTSEVQLVGDSTGRGSVER 300

Query: 301 LKVSVCEILLSGGWSRDDVMYSLGLEDNSNSN 315
               +  +LL  GWS++DV+YSL L+D+ + N
Sbjct: 301 HVRLLSLVLLRAGWSKEDVVYSLNLQDHGSFN 330

BLAST of CmaCh03G008740 vs. TrEMBL
Match: A0A067KAU5_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12599 PE=4 SV=1)

HSP 1 Score: 345.1 bits (884), Expect = 1.1e-91
Identity = 175/318 (55.03%), Positives = 223/318 (70.13%), Query Frame = 1

Query: 16  RLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPPDLRSILQEGLPIG 75
           +L CFSFAAYAK +IDHLKSL IP+LPGL+D+EF S EST++FSFPPDLRSILQEGLPIG
Sbjct: 4   KLVCFSFAAYAKTLIDHLKSLNIPILPGLTDSEFKSIESTYRFSFPPDLRSILQEGLPIG 63

Query: 76  SGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPNDAVALAQQL-LDR 135
             FPNWRSSS QQ+ IL+NLP L L K ISQ  FW  SWG +P D N A+ +A++L  D+
Sbjct: 64  PHFPNWRSSSPQQIQILLNLPILNLSKNISQNNFWVDSWGDKPEDSNKALDMAKRLFFDK 123

Query: 136 APVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYSQLS--EAEP-- 195
           APVLVPIY NCYIPS PN AGNPVF +D G +RV SFDLA FFQ  E+ Q+     +P  
Sbjct: 124 APVLVPIYGNCYIPSTPNDAGNPVFCVDNGGVRVLSFDLARFFQEVEFLQIGVHSIKPVN 183

Query: 196 ----DRSMIDSPAWAATEARAVDFWTEVA-SGKNMAGRGVTLGWWNEGEFEMG--LDGCL 255
               ++  I+ P WAAT AR +DFWT+VA +G+ +  R  T GWW  G  +    +  CL
Sbjct: 184 FPRIEKVAINVPVWAATTARKIDFWTDVAENGRKVVAREDTHGWWTGGNLDCSWEMGDCL 243

Query: 256 EDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQ------SRATIEKLKVSVCEILLSGGWS 315
           E+V W+LR GGW+EE++R+MMMMDG D  +++       +  +      +  +LL  GWS
Sbjct: 244 EEVFWRLRDGGWKEEEVREMMMMDGCDEGMKKGCGPKLGKEDVVWHVRLMSLVLLRAGWS 303

BLAST of CmaCh03G008740 vs. TAIR10
Match: AT2G22790.1 (AT2G22790.1 unknown protein)

HSP 1 Score: 201.8 bits (512), Expect = 7.3e-52
Identity = 120/324 (37.04%), Positives = 178/324 (54.94%), Query Frame = 1

Query: 9   TDKPSRPRLACFSFAAYAKAVIDHLKSLQ-IPVLPGLSDTEFTSFESTFQFSFPPDLRSI 68
           T  P R      S   Y K +++H KS     V PGL++ E ++ ES+  FSFP DLRSI
Sbjct: 16  TTDPIRSSSVNPSSPVYYKTIVNHFKSQTGNHVSPGLTNQEISAVESSHGFSFPLDLRSI 75

Query: 69  LQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPNDAVAL 128
           LQ GLP+G+ FPNWR+ S +   +L   P L L + + +  FW  SWG +P +  +A++L
Sbjct: 76  LQTGLPVGTNFPNWRTGSNRNNLLL---PLLNLSQHVVRNGFWVDSWGIRPGNDAEALSL 135

Query: 129 AQQLLDRAPVLVPIYKNCYIPSE-PNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYSQLS 188
            ++L++ APVLVP+Y + Y+PS  PN+AGNPVF +DG  +R  S D+ GF +    S+  
Sbjct: 136 VKKLIEIAPVLVPVYGDFYVPSTTPNLAGNPVFQIDGDGVRELSCDVVGFLKGIGRSETP 195

Query: 189 EAEPDRSMIDSPAWAATEARAVDFWTEVASG-KNMAGRGVTLGWWNEGEFEMGLDGCLED 248
             +  R             R V+FW++VA G + +  R  T  WW+   FE GL  CL+D
Sbjct: 196 TEDRRRRR---------RPRRVEFWSDVAEGWRFVVARDYTRDWWSALGFE-GLTACLDD 255

Query: 249 VLWKLRQGGWREEDIRDMMMMDGHDRS--LEQSRATIEKLKVSVCEILLSGGWSRDDVMY 308
             WKLR+ GW E+D+RDMMMMD  D++  ++Q   T  +                 DV+Y
Sbjct: 256 AFWKLREAGWTEDDVRDMMMMDSVDQNTCIQQQTQTQSR-----------------DVVY 308

Query: 309 SLGLEDNSNSNSAIVIPEEEEEEE 328
           + G ++  N        E+E+ ++
Sbjct: 316 AFG-DEGMNERDRDTCTEDEDHQK 308

BLAST of CmaCh03G008740 vs. TAIR10
Match: AT5G67020.1 (AT5G67020.1 unknown protein)

HSP 1 Score: 180.3 bits (456), Expect = 2.3e-45
Identity = 111/335 (33.13%), Positives = 175/335 (52.24%), Query Frame = 1

Query: 12  PSRP--RLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPPDLRSILQ 71
           PS P  R +  SF+ +A  VI+HLK+  I + PGLSDTEF   E+ F F+FPPDLR IL 
Sbjct: 29  PSTPTIRNSLQSFSPFADKVINHLKNSGIKIQPGLSDTEFARVEAEFGFTFPPDLRVILS 88

Query: 72  EGLPIGSGFPNWRSSSIQ-QLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPNDAVALA 131
            GL +G+GFP+WRS   +  L  +I+LP   +  +I++   WC+SWG +P DP  A+ +A
Sbjct: 89  AGLSVGAGFPDWRSPGARLHLRAMIDLPVAAVSFQIAKNSLWCKSWGLKPPDPEKALRVA 148

Query: 132 QQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYSQLSEA 191
           +  L RAP+L+PI+ +CYIP  P++AGNPVF +D   I     DL+ FF+     + SE 
Sbjct: 149 RNALKRAPLLIPIFDHCYIPCNPSLAGNPVFFIDETRIFCCGSDLSEFFERESAFRSSEF 208

Query: 192 EP-----DRSMIDSPAWA----------------ATEARAVDFWTEVASGKNMAGRGVTL 251
            P      RS+ +  A +                A ++R V+FW++ A  +       T 
Sbjct: 209 FPRILTKQRSVSEKSAGSSSNFSRRSLDLGRANGAGKSRWVEFWSDAAVDRCRRNSASTS 268

Query: 252 GWWNEG------EFEMGLDGCLEDVLWKLRQGGWREEDIRDMMMMDGH-----DRSLEQS 309
              +        E    ++  +  +   LR+GGW E DI +++ +        +  +  +
Sbjct: 269 SSSSSSPDLPKTETPKWVNQYVNRIGSVLRRGGWSESDIDEIIHVSASGFFEGEMVIIDN 328

BLAST of CmaCh03G008740 vs. TAIR10
Match: AT3G50340.1 (AT3G50340.1 unknown protein)

HSP 1 Score: 176.0 bits (445), Expect = 4.3e-44
Identity = 109/351 (31.05%), Positives = 183/351 (52.14%), Query Frame = 1

Query: 3   AATATATDKPSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPP 62
           +A A A   P+  R +  SF++ A  VI HL + +I V PGL+D+EF   E+ F F+FPP
Sbjct: 23  SARAAAPTTPT-VRNSLVSFSSLADQVISHLHTSRIQVQPGLTDSEFARAEAEFAFAFPP 82

Query: 63  DLRSILQEGLPIGSGFPNWRSSSIQ-QLHILINLPRLCLLKEISQRKFWCQSWGTQPNDP 122
           DLR++L  GLP+G+GFP+WRS   +  L  +I+LP   +  +I++   W +SWG +P+DP
Sbjct: 83  DLRAVLTAGLPVGAGFPDWRSPGARLHLRAMIDLPIAAVSFQIARNTLWSKSWGLRPSDP 142

Query: 123 NDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHE 182
             A+ +A+  L RAP+++PI+ +CYIP  P++AGNPVF++D   I     DL+ FF+   
Sbjct: 143 EKALRVARNALKRAPLMIPIFDHCYIPCNPSLAGNPVFYIDETRIFCCGSDLSDFFERES 202

Query: 183 YSQLSEAEP-----DRSMIDSPAWAATEA--------------------RAVDFWTEVAS 242
             + S+  P      RS+ +  A +++ +                    R V+FW++ A 
Sbjct: 203 VFRGSDTCPVVLTKQRSVSEKSAGSSSSSSSNFSRMSLDSGRVHGSSTPRWVEFWSDAAV 262

Query: 243 GKNMAGRGVTLGWWNEGEFEMGLD-------GCLEDVLWK----LRQGGWREEDIRDMMM 302
            +       ++   +    E  LD         ++D + +    LR GGW E D+ D++ 
Sbjct: 263 DRRRRNSASSMSSSHSSSPERYLDLPRSETPKWVDDYVNRIGSVLRGGGWSESDVDDIVH 322

Query: 303 MDGH-----DRSLEQSRATIEKLKVSV---CEILLSGGWSRDDVMYSLGLE 309
           +        +  +  ++A ++ L +      E L   GWS ++V  +LG +
Sbjct: 323 VSASGFFEGEMVILDNQAVLDALLLKAGRFSESLRKAGWSSEEVSDALGFD 372

BLAST of CmaCh03G008740 vs. NCBI nr
Match: gi|659081030|ref|XP_008441112.1| (PREDICTED: uncharacterized protein LOC103485339 [Cucumis melo])

HSP 1 Score: 611.7 bits (1576), Expect = 8.7e-172
Identity = 302/373 (80.97%), Positives = 326/373 (87.40%), Query Frame = 1

Query: 6   ATATDKPSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPPDLR 65
           ATAT KP RP+LACFSFAAYAK VIDHLKSLQIPVLPGLSD EFTS ESTF+FSFPPDLR
Sbjct: 2   ATATVKPPRPKLACFSFAAYAKTVIDHLKSLQIPVLPGLSDPEFTSVESTFRFSFPPDLR 61

Query: 66  SILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPNDAV 125
           SILQEGLPIGSGFPNWRSSSIQQLHILINLP+ CLLKEISQRKFWCQSWG QP+D NDAV
Sbjct: 62  SILQEGLPIGSGFPNWRSSSIQQLHILINLPKFCLLKEISQRKFWCQSWGAQPDDSNDAV 121

Query: 126 ALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYSQL 185
           ALA+Q LDRAPVLVPIYKN YIPS PNMAGNPVFHLDGGEIRVSSFDLAGFFQ HEYSQL
Sbjct: 122 ALAKQFLDRAPVLVPIYKNWYIPSAPNMAGNPVFHLDGGEIRVSSFDLAGFFQAHEYSQL 181

Query: 186 SEAEPDRSMIDSPAWAATEARAVDFWTEVASGKNMAGRGVTLGWWNEGEFEMGLDGCLED 245
            +AEPD  +IDSPAWAATEARAV+FWTEVAS K    R VT GWWNEGEFEMGLDGCLED
Sbjct: 182 GKAEPDCLVIDSPAWAATEARAVEFWTEVASRKKAKAREVTEGWWNEGEFEMGLDGCLED 241

Query: 246 VLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATIEKLKVSVCEILLSGGWSRDDVMYSL 305
           V WKLR+GGWRE+D+RDMMMMD HDRSLEQ+  T+EKL+VSV EILLSGGWSRDDV+YSL
Sbjct: 242 VFWKLREGGWREDDVRDMMMMDRHDRSLEQNETTMEKLRVSVGEILLSGGWSRDDVVYSL 301

Query: 306 GLEDNSNSNSAIVIPEEEEEEESTFEINL---HHHRPTKVPQVECKKKAKSRSSTNHHNM 365
            LE     NSAIVIP    +EESTFEINL   HHH+P ++PQVE KKK ++ ++TNH  M
Sbjct: 302 DLE----CNSAIVIP----DEESTFEINLHHYHHHQPIRIPQVERKKKPRNTTTTNHLKM 361

Query: 366 SPFYFAPHRNLIL 376
            PF+FAPHRNLIL
Sbjct: 362 PPFFFAPHRNLIL 366

BLAST of CmaCh03G008740 vs. NCBI nr
Match: gi|449451563|ref|XP_004143531.1| (PREDICTED: uncharacterized protein LOC101204059 [Cucumis sativus])

HSP 1 Score: 604.0 bits (1556), Expect = 1.8e-169
Identity = 301/373 (80.70%), Positives = 324/373 (86.86%), Query Frame = 1

Query: 4   ATATATDKPSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFESTFQFSFPPD 63
           ATAT T  P RP+LACFSFAAYAK VIDHLKSLQIPV PGLSD EFTS ESTF+FSFPPD
Sbjct: 2   ATATVTVNPPRPKLACFSFAAYAKTVIDHLKSLQIPVHPGLSDPEFTSVESTFRFSFPPD 61

Query: 64  LRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWGTQPNDPND 123
           LRSILQEGLPIGSGFPNWRSSS QQLHILINLP+ CLLKEISQRKFWCQSWG QP+D ND
Sbjct: 62  LRSILQEGLPIGSGFPNWRSSSTQQLHILINLPKFCLLKEISQRKFWCQSWGAQPDDTND 121

Query: 124 AVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAGFFQTHEYS 183
           AVALA+Q LDRAPVLVPIYKN YIPS PNMAGNPVFHLD GEIRVSSFDLAGFFQTHEYS
Sbjct: 122 AVALAKQFLDRAPVLVPIYKNWYIPSAPNMAGNPVFHLDDGEIRVSSFDLAGFFQTHEYS 181

Query: 184 QLSEAEPDRSMIDSPAWAATEARAVDFWTEVASGKNMAGRGVTLGWWNEGEFEMGLDGCL 243
           QL +AE DR +IDSPAWAATEARAV+FWTEVAS K   GR VT GWWNEGEFEMGLDGCL
Sbjct: 182 QLGKAETDRLVIDSPAWAATEARAVEFWTEVASRKKATGREVTEGWWNEGEFEMGLDGCL 241

Query: 244 EDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATIEKLKVSVCEILLSGGWSRDDVMY 303
           EDV WKLR+GGWREED+RDMMMMD HDRSLEQ+ AT+EKL+VSVCEILLSGGWSRDDV+Y
Sbjct: 242 EDVFWKLREGGWREEDVRDMMMMDRHDRSLEQNEATMEKLRVSVCEILLSGGWSRDDVVY 301

Query: 304 SLGLEDNSNSNSAIVIPEEEEEEESTFEINLHH-HRPTKVPQVECKKKAKSRSSTNHHNM 363
           SL LE     +SA VIPEEE    STFEINLHH H P ++PQVE K K ++ ++T+H  M
Sbjct: 302 SLDLE----GHSASVIPEEE----STFEINLHHQHLPIRIPQVERKIKPRN-TTTSHLKM 361

Query: 364 SPFYFAPHRNLIL 376
            PF+FAPHRNLIL
Sbjct: 362 PPFFFAPHRNLIL 365

BLAST of CmaCh03G008740 vs. NCBI nr
Match: gi|590706170|ref|XP_007047648.1| (Uncharacterized protein TCM_000880 [Theobroma cacao])

HSP 1 Score: 354.0 bits (907), Expect = 3.3e-94
Identity = 176/338 (52.07%), Positives = 231/338 (68.34%), Query Frame = 1

Query: 3   AATATATDK------------PSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFT 62
           AATAT T              P R +L CFSFAAY+K++IDHLKSL IP+LPGL+D EF+
Sbjct: 2   AATATTTQTQPPNPLMKAPKGPPRSKLVCFSFAAYSKSLIDHLKSLDIPILPGLTDQEFS 61

Query: 63  SFESTFQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFW 122
           S EST  F+FPPDLRSILQEGLP+   FPNWRSSS QQL+IL+NLP L L K I+   FW
Sbjct: 62  SVESTLHFTFPPDLRSILQEGLPVDPSFPNWRSSSPQQLNILLNLPLLSLSKNITLHNFW 121

Query: 123 CQSWGTQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSS 182
             SWG +P++ N+A+AL + LL +AP+LVPIY+NCYIPS PNMAGNPVF++DG E+R+ S
Sbjct: 122 SDSWGPKPSNSNEALALVKSLLQKAPLLVPIYRNCYIPSTPNMAGNPVFYVDGDEVRILS 181

Query: 183 FDLAGFFQTHE-------YSQLSEAEPDRSMIDSPAWAATEARAVDFWTEVA-SGKNMAG 242
           FD+  FFQ  E       +   +  + +    + PAWAAT AR +DFWT+VA  G+ +  
Sbjct: 182 FDITRFFQEVEFLRRGGVFKPFTRKKSNSVNNNVPAWAATTARRIDFWTDVAEKGRRVVA 241

Query: 243 RGVTLGWWNEGEFE--MGLDGCLEDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATI 302
           RGVT GWW+ GE E  +GL GCLE+V WKLR+GGWREE++R+MMM+DG D++  + ++  
Sbjct: 242 RGVTRGWWSRGEVEEDLGLRGCLEEVFWKLREGGWREEEVREMMMIDGCDQNENKEKSGT 301

Query: 303 EKLKVS---------VCEILLSGGWSRDDVMYSLGLED 310
             +            +  +LL  GW+ +DV+Y+L L D
Sbjct: 302 RLVMDGGDAAWHVRVLSVVLLRAGWASEDVVYALDLHD 339

BLAST of CmaCh03G008740 vs. NCBI nr
Match: gi|823139670|ref|XP_012469700.1| (PREDICTED: uncharacterized protein LOC105787717 [Gossypium raimondii])

HSP 1 Score: 350.9 bits (899), Expect = 2.8e-93
Identity = 182/345 (52.75%), Positives = 234/345 (67.83%), Query Frame = 1

Query: 2   AAATATATDK------------PSRPRLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEF 61
           A ATATAT              P R +LACFSFAAY+K +I HL++L+IP+LPGL+D EF
Sbjct: 3   ATATATATTAQTHRPNPLLKAPPPRSKLACFSFAAYSKTLIHHLQTLEIPILPGLTDHEF 62

Query: 62  TSFESTFQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKF 121
           TS ES F F+FPPDLRSILQEGLP+   FPNWRSSS QQL++L+NLP L L K I+   F
Sbjct: 63  TSIESAFHFTFPPDLRSILQEGLPVDPSFPNWRSSSPQQLNVLLNLPLLSLSKNITLHNF 122

Query: 122 WCQSWGTQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVS 181
           W  SWGT+P++PN+A+ L ++L   APVL+PIY+NCYIPS PNMAGNPVF++DG E+R+ 
Sbjct: 123 WSPSWGTKPSNPNEALGLVKRLFITAPVLIPIYRNCYIPSTPNMAGNPVFYVDGEEVRIL 182

Query: 182 SFDLAGFFQTHEYSQLSEA-----EPDRSMIDS--PAWAATEARAVDFWTEVA-SGKNMA 241
           SFD+  FFQ  E+ +            R+ +D+  PAWAA  AR ++FWT+VA  G+ + 
Sbjct: 183 SFDVNRFFQEVEFLRRGGVFKPFKRKKRNGVDNKVPAWAAKAARRIEFWTDVAEKGRRVV 242

Query: 242 GRGVTLG-WWNEGEFEMGLDGCLEDVLWKLRQGGWREEDIRDMMMMDGHDRSLEQSRATI 301
            RGVT G WW + E E  L GCLE+V W+LR GGWREE+++DMMMMDG D+S  + +   
Sbjct: 243 ARGVTTGWWWRKEEEEFRLGGCLEEVFWRLRDGGWREEEVKDMMMMDGCDQSQIKPKNGT 302

Query: 302 EKL----------KVSVCEILLSGGWSRDDVMYSLGLEDNSNSNS 316
             L          +VS   +LL GGWSR+DV+YSL L+D     S
Sbjct: 303 RPLIDGDDAAWHTRVS-SVVLLRGGWSREDVVYSLDLDDIDGDES 346

BLAST of CmaCh03G008740 vs. NCBI nr
Match: gi|567867095|ref|XP_006426170.1| (hypothetical protein CICLE_v10025875mg [Citrus clementina])

HSP 1 Score: 345.5 bits (885), Expect = 1.2e-91
Identity = 175/332 (52.71%), Positives = 232/332 (69.88%), Query Frame = 1

Query: 1   MAAATATAT----DKPSRP-RLACFSFAAYAKAVIDHLKSLQIPVLPGLSDTEFTSFEST 60
           MAA T T T     KP R  +L CFS+AAYAK +IDHLKSL IP+LPGL+D EF+  EST
Sbjct: 1   MAATTTTETMISRAKPPRTTKLVCFSYAAYAKNLIDHLKSLNIPILPGLNDAEFSDIEST 60

Query: 61  FQFSFPPDLRSILQEGLPIGSGFPNWRSSSIQQLHILINLPRLCLLKEISQRKFWCQSWG 120
           F F+FPPDLRSIL+EGLP G  FPNW SSS QQL IL+NLP L L K +S   FW  SWG
Sbjct: 61  FNFTFPPDLRSILREGLPAGPAFPNWLSSSHQQLRILVNLPVLSLSKNVSLNNFWSVSWG 120

Query: 121 TQPNDPNDAVALAQQLLDRAPVLVPIYKNCYIPSEPNMAGNPVFHLDGGEIRVSSFDLAG 180
            +P + NDA++L ++LLD+AP+LVPIY+NCY+PS PNMAGNPVF++D  E+RV SFDLAG
Sbjct: 121 QRPQNNNDALSLIKKLLDKAPLLVPIYRNCYVPSTPNMAGNPVFYIDTEEVRVLSFDLAG 180

Query: 181 FFQTHEYSQLSEAEPDRSMIDSPAWAATEARAVDFWTEVAS-GKNMAGRGVTLG-WWNEG 240
           FF+  +  +  +A     ++D PAWAA E R ++FWT+VA  G+ +  RG + G WW  G
Sbjct: 181 FFK--QVDEFVKAGGGGGVLDMPAWAAKEPRTIEFWTDVAERGRRVLARGGSRGRWWRAG 240

Query: 241 --EFEMGLDGCLEDVLWKLRQGGWREEDIRDMMM-MDGHDRSLEQ--------SRATIEK 300
                +GL+ C+++V W+LR GGWREE++R+MMM +DGHD    +         R ++E+
Sbjct: 241 CENTRVGLECCMDEVFWRLRDGGWREEEVREMMMVVDGHDDPTSEVQLVGDSTGRGSVER 300

Query: 301 LKVSVCEILLSGGWSRDDVMYSLGLEDNSNSN 315
               +  +LL  GWS++DV+YSL L+D+ + N
Sbjct: 301 HVRLLSLVLLRAGWSKEDVVYSLNLQDHGSFN 330

The following BLAST results are available for this feature:

TrEMBL
Match Name                        E-value    Identity (%)  Description
A0A0A0KJT1_CUCSA                  1.3e-169   80.70         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G497030 PE=4 SV=1
A0A061DHC3_THECC                  2.3e-94    52.07         Uncharacterized protein OS=Theobroma cacao GN=TCM_000880 PE=4 SV=1
A0A0D2QF47_GOSRA                  1.9e-93    52.75         Uncharacterized protein OS=Gossypium raimondii GN=B456_003G034100 PE=4 SV=1
V4SKM9_9ROSI                      8.1e-92    52.71         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10025875mg PE=4 SV=1
A0A067KAU5_JATCU                  1.1e-91    55.03         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_12599 PE=4 SV=1

TAIR10
Match Name                        E-value    Identity (%)  Description
AT2G22790.1                       7.3e-52    37.04         unknown protein
AT5G67020.1                       2.3e-45    33.13         unknown protein
AT3G50340.1                       4.3e-44    31.05         unknown protein

NCBI nr
Match Name                        E-value    Identity (%)  Description
gi|659081030|ref|XP_008441112.1|  8.7e-172   80.97         PREDICTED: uncharacterized protein LOC103485339 [Cucumis melo]
gi|449451563|ref|XP_004143531.1|  1.8e-169   80.70         PREDICTED: uncharacterized protein LOC101204059 [Cucumis sativus]
gi|590706170|ref|XP_007047648.1|  3.3e-94    52.07         Uncharacterized protein TCM_000880 [Theobroma cacao]
gi|823139670|ref|XP_012469700.1|  2.8e-93    52.75         PREDICTED: uncharacterized protein LOC105787717 [Gossypium raimondii]
gi|567867095|ref|XP_006426170.1|  1.2e-91    52.71         hypothetical protein CICLE_v10025875mg [Citrus clementina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmaCh03G008740.1  CmaCh03G008740.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita maxima
Date Performed: 2017-05-20
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR32011      FAMILY NOT NAMED     coord: 11..341; score: 1.2E
None      No IPR available  PANTHER  PTHR32011:SF3  SUBFAMILY NOT NAMED  coord: 11..341; score: 1.2E