CmoCh04G004140 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G004140
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: LOCATED IN: chloroplast; EXPRESSED IN: 14 plant structures; EXPRESSED DURING: 8 growth stages;
Location: Cmo_Chr04 : 2046440 .. 2047699 (+)
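As a quick consistency check, the span implied by the Location field can be computed directly. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common genome-browser convention, but not stated on this page); the function name `feature_length` is illustrative only.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
length = feature_length(2046440, 2047699)

# Split the span into whole codons plus any leftover bases.
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # 1260 420 0
```

Under that assumption the span is 1260 nt, which divides evenly into 420 codons.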
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGACTTCCACCGGAATCCTTCCGTCAACAACCGCCATTCCCCCTCCCCCACCTCCTCTGTCTCCTCCACCACCGTCCCACACAACCCCTCCGCCACCACCGCCACTGCCGATAACGACCCTATGCACTCATGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCTCTTTCCTCCATCCTTCCCCCTCATTGCGACTCGTTTTTTCTCTCCTCTCTCGCCGATTCCGACCGGCCGGCCCTCTCTCTTTTGTCCTCTCACGATGCTTACTCCGTTATCTCCTCTGCTCTCTCCTCCTCCGTCTCTGGATCTGGTTCTGACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCCTTTCTTCCACTTCTTTCCTCTTTGTATCTCTCTCGCGTTCATTCTACTTCCTCCGATTCCCCTTCTCCTCCTTCTCTCGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCTCGGGCTGGGAAGCCTGTTCTTGTCGCGATTCCTGATCTTTCGCAGCCTTCTCTTTACCATTCTCCTCTGAATAAGCCCAATTCTGTTGCCCAAGCTCAATTCAGGCCATCCGTTGGAGTTCTTTGCCCTTCGCTTGAACCACAGAACGCGGTGAAGTCAACCAAAAGAGCTTGTATCATTGGCGTCGCTCTCGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAACTCTGTCGCTCTGCGGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAATTTGATAAAGAAGATGGTTTGGATATTGATGGGTTTTCGGAGAAAAGGGCTTTGGAGTATGGGGATGAAATAGAGGATGTTTCAGTAAAAATGGGTAACCTACAAGTTGAGACGTGTGGGAACAATTCCGATGATTCAGAACCTAAGGGGTTCAGAATTCCGCTTCCATGGGAGCTTTTGCAGCCATTACTTAGAATTTTAGGACATTGTTTATTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCGTTGCTGTAAGGTGTTTATATGCAAGGGCATCTCATGATTTAGTACCGCAGGTGATATTGGCAACTCGGAGTCTTATTCAGCTTGACAACAGAACTCGAGCGGCTGCAAAGGCTGCAACAGCAACAAATGCTTACACACCCAGCAAAGATAAGAAATCAGAAATCTTATTGGTCTCAAAATAA

mRNA sequence

ATGGACTTCCACCGGAATCCTTCCGTCAACAACCGCCATTCCCCCTCCCCCACCTCCTCTGTCTCCTCCACCACCGTCCCACACAACCCCTCCGCCACCACCGCCACTGCCGATAACGACCCTATGCACTCATGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCTCTTTCCTCCATCCTTCCCCCTCATTGCGACTCGTTTTTTCTCTCCTCTCTCGCCGATTCCGACCGGCCGGCCCTCTCTCTTTTGTCCTCTCACGATGCTTACTCCGTTATCTCCTCTGCTCTCTCCTCCTCCGTCTCTGGATCTGGTTCTGACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCCTTTCTTCCACTTCTTTCCTCTTTGTATCTCTCTCGCGTTCATTCTACTTCCTCCGATTCCCCTTCTCCTCCTTCTCTCGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCTCGGGCTGGGAAGCCTGTTCTTGTCGCGATTCCTGATCTTTCGCAGCCTTCTCTTTACCATTCTCCTCTGAATAAGCCCAATTCTGTTGCCCAAGCTCAATTCAGGCCATCCGTTGGAGTTCTTTGCCCTTCGCTTGAACCACAGAACGCGGTGAAGTCAACCAAAAGAGCTTGTATCATTGGCGTCGCTCTCGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAACTCTGTCGCTCTGCGGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAATTTGATAAAGAAGATGGTTTGGATATTGATGGGTTTTCGGAGAAAAGGGCTTTGGAGTATGGGGATGAAATAGAGGATGTTTCAGTAAAAATGGGTAACCTACAAGTTGAGACGTGTGGGAACAATTCCGATGATTCAGAACCTAAGGGGTTCAGAATTCCGCTTCCATGGGAGCTTTTGCAGCCATTACTTAGAATTTTAGGACATTGTTTATTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCGTTGCTGTAAGGTGTTTATATGCAAGGGCATCTCATGATTTAGTACCGCAGGTGATATTGGCAACTCGGAGTCTTATTCAGCTTGACAACAGAACTCGAGCGGCTGCAAAGGCTGCAACAGCAACAAATGCTTACACACCCAGCAAAGATAAGAAATCAGAAATCTTATTGGTCTCAAAATAA

Coding sequence (CDS)

ATGGACTTCCACCGGAATCCTTCCGTCAACAACCGCCATTCCCCCTCCCCCACCTCCTCTGTCTCCTCCACCACCGTCCCACACAACCCCTCCGCCACCACCGCCACTGCCGATAACGACCCTATGCACTCATGGTGGGAGTCCGTTTCCAAAGCCCGCTCTCGCATCCACGCTCTTTCCTCCATCCTTCCCCCTCATTGCGACTCGTTTTTTCTCTCCTCTCTCGCCGATTCCGACCGGCCGGCCCTCTCTCTTTTGTCCTCTCACGATGCTTACTCCGTTATCTCCTCTGCTCTCTCCTCCTCCGTCTCTGGATCTGGTTCTGACCCTCTCTGCCACTGGCTTTACGATACTTTTCTCTCTTCCGATCCCCATCTCCGCCTTGTTGTTCTTTCCTTTCTTCCACTTCTTTCCTCTTTGTATCTCTCTCGCGTTCATTCTACTTCCTCCGATTCCCCTTCTCCTCCTTCTCTCGCCGGCTTTGAGGCTGTGCTTCTCGCGCTTTATTCCTCTGAGGTTAAGTCTCGGGCTGGGAAGCCTGTTCTTGTCGCGATTCCTGATCTTTCGCAGCCTTCTCTTTACCATTCTCCTCTGAATAAGCCCAATTCTGTTGCCCAAGCTCAATTCAGGCCATCCGTTGGAGTTCTTTGCCCTTCGCTTGAACCACAGAACGCGGTGAAGTCAACCAAAAGAGCTTGTATCATTGGCGTCGCTCTCGATTGCTATTACAAGCAGATCTCGCAGATGCCGAGCTGGTCGAAGCTTGAACTCTGTCGCTCTGCGGCGTCGTGGGCTGGGCAAGATTGTTGCTGCAAGAGAGAATTTGATAAAGAAGATGGTTTGGATATTGATGGGTTTTCGGAGAAAAGGGCTTTGGAGTATGGGGATGAAATAGAGGATGTTTCAGTAAAAATGGGTAACCTACAAGTTGAGACGTGTGGGAACAATTCCGATGATTCAGAACCTAAGGGGTTCAGAATTCCGCTTCCATGGGAGCTTTTGCAGCCATTACTTAGAATTTTAGGACATTGTTTATTGGCTCCTTTGAATTCACAAGATGTTAAGGATGCAGCTTCCGTTGCTGTAAGGTGTTTATATGCAAGGGCATCTCATGATTTAGTACCGCAGGTGATATTGGCAACTCGGAGTCTTATTCAGCTTGACAACAGAACTCGAGCGGCTGCAAAGGCTGCAACAGCAACAAATGCTTACACACCCAGCAAAGATAAGAAATCAGAAATCTTATTGGTCTCAAAATAA
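The protein that appears as the query in the BLAST alignments can be re-derived from the CDS by translating it in frame 1. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 nt of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a nucleotide sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 nt of the CDS listed above.
prefix = "ATGGACTTCCACCGGAATCCTTCCGTCAAC"
print(translate(prefix))  # MDFHRNPSVN
```

The result matches the first ten residues of the query shown in the BLAST alignments (`MDFHRNPSVN...`). The stop codon is rendered as `*`.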
BLAST of CmoCh04G004140 vs. Swiss-Prot
Match: F126B_HUMAN (Protein FAM126B OS=Homo sapiens GN=FAM126B PE=1 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 4.7e-09
Identity = 49/158 (31.01%), Positives = 75/158 (47.47%), Query Frame = 1

Query: 109 DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLAL 168
           +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +
Sbjct: 54  EPVCHQLFELYRSSEVRLKRFTLQFLPELMWVYLRLTVSRDRQSNGC-----IEALLLGI 113

Query: 169 YSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQAQ----FRPSVGVLCPSLEP 228
           Y+ E+  + G  K +   IP LS+PS+YH P +   S+A  +        + V+   L P
Sbjct: 114 YNLEIADKDGNNKVLSFTIPSLSKPSIYHEP-STIGSMALTEGALCQHDLIRVVYSDLHP 173

Query: 229 Q-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR 260
           Q     +  R  ++   + CY   I  MP+ S   LCR
Sbjct: 174 QRETFTAQNRFEVLSFLMLCYNSAIVYMPASSYQSLCR 205

BLAST of CmoCh04G004140 vs. Swiss-Prot
Match: F126B_PONAB (Protein FAM126B OS=Pongo abelii GN=FAM126B PE=2 SV=1)

HSP 1 Score: 63.9 bits (154), Expect = 4.7e-09
Identity = 49/158 (31.01%), Positives = 75/158 (47.47%), Query Frame = 1

Query: 109 DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLAL 168
           +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +
Sbjct: 54  EPVCHQLFELYRSSEVRLKRFTLQFLPELMWVYLRLTVSRDRQSNGC-----IEALLLGI 113

Query: 169 YSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQAQ----FRPSVGVLCPSLEP 228
           Y+ E+  + G  K +   IP LS+PS+YH P +   S+A  +        + V+   L P
Sbjct: 114 YNLEIADKDGNNKVLSFTIPSLSKPSIYHEP-STIGSMALTEGALCQHDLIRVVYSDLHP 173

Query: 229 Q-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR 260
           Q     +  R  ++   + CY   I  MP+ S   LCR
Sbjct: 174 QRETFTAQNRFEVLSFLMLCYNSAIVYMPASSYQSLCR 205

BLAST of CmoCh04G004140 vs. Swiss-Prot
Match: F126B_MOUSE (Protein FAM126B OS=Mus musculus GN=Fam126b PE=1 SV=1)

HSP 1 Score: 63.5 bits (153), Expect = 6.1e-09
Identity = 49/158 (31.01%), Positives = 75/158 (47.47%), Query Frame = 1

Query: 109 DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLAL 168
           +P+CH L++ + SS+  L+   L FLP L  +YL    S    S         EA+LL +
Sbjct: 54  EPVCHQLFELYRSSEVRLKRFTLQFLPELIWVYLRLTVSRDRQSNGC-----IEALLLGI 113

Query: 169 YSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSVAQAQ----FRPSVGVLCPSLEP 228
           Y+ E+  + G  K +   IP LS+PS+YH P +   S+A  +        + V+   L P
Sbjct: 114 YNLEIADKDGNNKVLSFTIPSLSKPSIYHEP-STIGSMALTEGALCQHDLIRVVYSDLHP 173

Query: 229 Q-NAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR 260
           Q     +  R  ++   + CY   I  MP+ S   LCR
Sbjct: 174 QRETFTAQNRFEVLSFLMLCYNSAIVYMPASSYQSLCR 205

BLAST of CmoCh04G004140 vs. Swiss-Prot
Match: HYCCI_MOUSE (Hyccin OS=Mus musculus GN=Fam126a PE=1 SV=3)

HSP 1 Score: 57.4 bits (137), Expect = 4.4e-07
Identity = 48/157 (30.57%), Positives = 75/157 (47.77%), Query Frame = 1

Query: 109 DPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLAL 168
           +P+CH L++ + S +  L    L FLP L   YL+   S S D  S   +   EA+LL +
Sbjct: 54  EPVCHQLFEFYRSGEEQLLRFTLQFLPELMWCYLAV--SASRDVHSSGCI---EALLLGV 113

Query: 169 YSSEVKSRAG--KPVLVAIPDLSQPSLYHSPLNKPNSV----AQAQFRPSVGVLCPSLEP 228
           Y+ E+  + G  K +   IP LS+PS+YH P +  +      A +Q   S  V       
Sbjct: 114 YNLEIVDKHGHSKVLSFTIPSLSKPSVYHEPSSIGSMALTESALSQHGLSKVVYSGPHPQ 173

Query: 229 QNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCR 260
           +  + +  R  ++   L CY   ++ MPS S   LC+
Sbjct: 174 REMLTAQNRFEVLTFLLLCYNAALTYMPSVSLQSLCQ 205

BLAST of CmoCh04G004140 vs. Swiss-Prot
Match: HYCCI_CHICK (Hyccin OS=Gallus gallus GN=FAM126A PE=2 SV=2)

HSP 1 Score: 57.4 bits (137), Expect = 4.4e-07
Identity = 58/194 (29.90%), Positives = 91/194 (46.91%), Query Frame = 1

Query: 72  LSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVL 131
           +SS A + +   +L+SS   Y VI    S  +     +P+CH L++ + S +  L    L
Sbjct: 24  ISSYATNLKDKTALISS--LYKVIQEPQSELL-----EPVCHQLFEFYRSGEEQLLRFTL 83

Query: 132 SFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAG--KPVLVAIPDLS 191
            FLP L   YL+   S S D  S   +   EA+LL +Y+ E+  + G  K +   IP LS
Sbjct: 84  QFLPELMWCYLAV--SASRDLQSSGCI---EALLLGVYNLEIVDKEGHSKVLSFTIPSLS 143

Query: 192 QPSLYHSPLNKPNSV----AQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQ 251
           +PS+YH P +  +      A +Q   S  V       +  + +  R  ++   L CY   
Sbjct: 144 KPSVYHEPSSIGSMALTEGALSQHGLSRVVYSGPHPQREMLTAQNRFEVLTFLLLCYNAA 203

Query: 252 ISQMPSWSKLELCR 260
           +S MP+ S   LC+
Sbjct: 204 LSYMPAISLQSLCQ 205

BLAST of CmoCh04G004140 vs. TrEMBL
Match: A0A0A0KZ90_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G107390 PE=4 SV=1)

HSP 1 Score: 725.7 bits (1872), Expect = 3.2e-206
Identity = 378/425 (88.94%), Positives = 394/425 (92.71%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSP-TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHAL 60
           MDFHRNPS+NNRHS SP +SS SSTT  HNP+A TA+AD DPMHSWWESVSKARSRIHAL
Sbjct: 1   MDFHRNPSINNRHSSSPSSSSASSTTALHNPTA-TASADTDPMHSWWESVSKARSRIHAL 60

Query: 61  SSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTF 120
           SSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSVISSALSSS+SGSGSDPLCHWLYDTF
Sbjct: 61  SSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSALSSSLSGSGSDPLCHWLYDTF 120

Query: 121 LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGK 180
           LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGK
Sbjct: 121 LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGK 180

Query: 181 PVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVAL 240
           PV+V+IPDLSQPSLYHSP+NKPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVAL
Sbjct: 181 PVVVSIPDLSQPSLYHSPMNKPNSGAQAQVRPSVGVLSPSLEPQNAVKSTKRACIVGVAL 240

Query: 241 DCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIE 300
           DCYYKQISQMPSWSKLE CRSAASWAGQDCCC REFDKEDG D+ GFSEKRALEY DEIE
Sbjct: 241 DCYYKQISQMPSWSKLEFCRSAASWAGQDCCCTREFDKEDGFDVGGFSEKRALEYTDEIE 300

Query: 301 DVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAAS 360
           D S +MG LQ+E CGNNS+DSEPKG RIPLPWELLQP+LRILGHCLLAPLNSQDVKD AS
Sbjct: 301 DASEEMGRLQIEKCGNNSNDSEPKGSRIPLPWELLQPVLRILGHCLLAPLNSQDVKDEAS 360

Query: 361 VAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA-----TNAYTPSKDKKSEI 420
           VAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAA A     +NA TPSKDKK EI
Sbjct: 361 VAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAAAAAANSSSNANTPSKDKKPEI 420

BLAST of CmoCh04G004140 vs. TrEMBL
Match: W9QVH9_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_017230 PE=4 SV=1)

HSP 1 Score: 515.8 bits (1327), Expect = 5.0e-143
Identity = 296/447 (66.22%), Positives = 337/447 (75.39%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSP-TSSVSS--TTVPH-NP------SATTATADNDPMHSWWESVS 60
           MDFH +      +SPS  TSS+S+  TT P+ NP      +A  A   +DPMHSWWES+S
Sbjct: 1   MDFHHHHQTLASNSPSSSTSSISAAATTYPNGNPISSAAATAAAAAPSSDPMHSWWESIS 60

Query: 61  KARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDP 120
           KARSRIH+LSSILP    S  LSSLADSDRPALSLLSS  AYS +SSALS   SGSGSDP
Sbjct: 61  KARSRIHSLSSILPDPSLSLSLSSLADSDRPALSLLSSSLAYSALSSALSDPHSGSGSDP 120

Query: 121 LCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYS 180
           LCHWLYDTFLSSDPHLRLVV SFLPL+S LYLSR+HS S+DSPS PSLAGFEAVLLA+Y+
Sbjct: 121 LCHWLYDTFLSSDPHLRLVVFSFLPLISGLYLSRIHSLSTDSPSLPSLAGFEAVLLAIYA 180

Query: 181 SEVKSRAGKPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTK 240
           +E KSRAGKPVLV++PDLSQPSLYH+P  K      A  + SVGVL P LEPQ AVKSTK
Sbjct: 181 AETKSRAGKPVLVSVPDLSQPSLYHTPRQK-----IAGSKNSVGVLSPPLEPQIAVKSTK 240

Query: 241 RACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKED---GLDIDGFS 300
           RACI+GVALDCYYKQISQMP+WSKL+ CR AASWAGQ+C C+++FD +D    +     S
Sbjct: 241 RACIVGVALDCYYKQISQMPAWSKLDFCRFAASWAGQECTCEKQFDDDDDDESVVSSRVS 300

Query: 301 EKRALEYGDEIEDVSVKMGNLQVETCGNNSDDS------EPKGFRIPLPWELLQPLLRIL 360
           E R LE GD+ +DV   +  L++E  G  S  S      + KG RIPLPWELLQP+LRIL
Sbjct: 301 EIRYLENGDQTDDVVDDIAQLRIENGGGRSGSSSGGENLDSKGSRIPLPWELLQPVLRIL 360

Query: 361 GHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA- 420
           GHCLLAPLNSQDVKD A+VAVR LYARASHDLVPQ ILATRSLIQL+ R+RA  +AA A 
Sbjct: 361 GHCLLAPLNSQDVKDMAAVAVRRLYARASHDLVPQAILATRSLIQLEKRSRADVRAAAAA 420

BLAST of CmoCh04G004140 vs. TrEMBL
Match: F6H1N7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0055g01090 PE=4 SV=1)

HSP 1 Score: 505.0 bits (1299), Expect = 8.9e-140
Identity = 281/410 (68.54%), Positives = 316/410 (77.07%), Query Frame = 1

Query: 14  SPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLS 73
           SPS +SSV       +P+A +   D DPMHSWWES+SKARSRIH LS+ILP    S  LS
Sbjct: 11  SPSSSSSV-------DPNANSNPNDQDPMHSWWESISKARSRIHVLSTILPSPSLSLSLS 70

Query: 74  SLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSF 133
           SLADSDRPA SLL S DAY  +SS+LS   SGSGSDPLC WLY+TF SSDP LRLVVLSF
Sbjct: 71  SLADSDRPARSLLQSPDAYDALSSSLSCPRSGSGSDPLCQWLYETFQSSDPDLRLVVLSF 130

Query: 134 LPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSL 193
           +PLLS +YLSRV +         SLAGFEAVLLA Y+SE+K+RAGKPV ++IPDLSQPSL
Sbjct: 131 VPLLSGIYLSRVATAD-------SLAGFEAVLLAFYASELKARAGKPVSISIPDLSQPSL 190

Query: 194 YHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWS 253
           YH+P +KP SVA A  R SVG++ P LEPQ  VKSTKRACI+GVALDCYYKQISQMPSWS
Sbjct: 191 YHTPRSKPTSVAPA--RQSVGLVSPPLEPQIEVKSTKRACIVGVALDCYYKQISQMPSWS 250

Query: 254 KLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIEDVSVKMGNLQVETC 313
           KL+LC+ AA+WAGQ+C CK EFD  +  +IDGFSE R L+ GDEIE    ++G L +   
Sbjct: 251 KLDLCQFAAAWAGQNCPCKAEFDVNENAEIDGFSEVRVLDEGDEIERCVEEVGKLGIV-- 310

Query: 314 GNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDL 373
             N  +S  KG RIPLPWELLQP LRI+GHCLLAPLNSQDVKDAASVAVRCLYARASHDL
Sbjct: 311 -ENRSNSGFKGVRIPLPWELLQPTLRIIGHCLLAPLNSQDVKDAASVAVRCLYARASHDL 370

Query: 374 VPQVILATRSLIQLDNRTRAAAKAATATNA----YTPSKDKKSEILLVSK 420
           VPQ ILATRSLIQLD R R AAKAA A NA     TPSK KK E+LLVSK
Sbjct: 371 VPQAILATRSLIQLDKRAREAAKAAAAANAASNSNTPSKAKKPEVLLVSK 401

BLAST of CmoCh04G004140 vs. TrEMBL
Match: A0A061E3B9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_007521 PE=4 SV=1)

HSP 1 Score: 493.8 bits (1270), Expect = 2.0e-136
Identity = 287/449 (63.92%), Positives = 328/449 (73.05%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALS 60
           MDF  NPS N   SPS +SS SS+T  H P +TT   DNDPMHSWWESVSK RSRI +LS
Sbjct: 1   MDFPHNPSTN---SPSSSSSTSSSTAHHPPPSTTT--DNDPMHSWWESVSKQRSRILSLS 60

Query: 61  SILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFL 120
           S+LP   D   LSSLADSDRPALSLLSS  AYS+ISSALSS  SGSGSDPLC WLY+TF 
Sbjct: 61  SLLPS--DGLTLSSLADSDRPALSLLSSPAAYSLISSALSSPSSGSGSDPLCQWLYETFQ 120

Query: 121 SSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKP 180
           SSDPHLRL+VLSFLPLLS +YLSR+H  SSDS S PSLAGFEAVLLA+YSSE KSR+GKP
Sbjct: 121 SSDPHLRLLVLSFLPLLSGIYLSRIH--SSDSSSLPSLAGFEAVLLAVYSSEAKSRSGKP 180

Query: 181 VLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALD 240
           +LV IPDLSQPSLYH+P NKP      + R SVGVL P LEP  AVKSTKRA I+G ALD
Sbjct: 181 LLVQIPDLSQPSLYHTPRNKP---VNDRSRQSVGVLSPPLEPHLAVKSTKRAIIVGTALD 240

Query: 241 CYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFD---------------------KED 300
           CYYKQ+SQMP+WSKLE C+ AA+WAGQDC C+ +FD                     +ED
Sbjct: 241 CYYKQVSQMPAWSKLEFCKFAAAWAGQDCPCRTKFDADDHDHNENGNGNSNGHDRFFRED 300

Query: 301 GLDIDGFSEKRALEYGDE---IEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQP 360
               +G   +   +  DE   I+DV V+M NL +     ++++ E KG RIPLPWELL+P
Sbjct: 301 SRFSNGTRNRDDDDVDDEDDVIKDVVVEMDNLGINK--EDAENLEKKGVRIPLPWELLRP 360

Query: 361 LLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK 420
           ++ ILGHCL  P NSQDVKDAASVA+RCLYARASHDL PQ ILA +SLI+LD   RAAAK
Sbjct: 361 VVTILGHCLFGPSNSQDVKDAASVAIRCLYARASHDLAPQAILALQSLIRLDKSARAAAK 420

BLAST of CmoCh04G004140 vs. TrEMBL
Match: A0A067LQE6_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06562 PE=4 SV=1)

HSP 1 Score: 489.2 bits (1258), Expect = 5.0e-135
Identity = 279/431 (64.73%), Positives = 323/431 (74.94%), Query Frame = 1

Query: 10  NNRHSPSPTSSVSSTTVPHNPSAT------TATADNDPMHSWWESVSKARSRIHALSSIL 69
           N++++ SP+SS +ST     P AT      T T   DPM SWWESVSKAR+RI +LSS+L
Sbjct: 5   NDQNNLSPSSSTTSTPSAFRPIATVAAATTTTTTTTDPMQSWWESVSKARARILSLSSLL 64

Query: 70  PPH-CDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTFLSS 129
           P     SF LSSLADSDRPALS LSS +AY++ SSALSS  SGSGSDPLC WLY+T+LSS
Sbjct: 65  PSDPSSSFSLSSLADSDRPALSFLSSFEAYTLFSSALSSPSSGSGSDPLCQWLYETYLSS 124

Query: 130 DPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVL 189
           DPHLRL+VL+FLPLL  LYLSR+HS  S++ S PSLAGFEAVLLA+YSSE KSRAGKPVL
Sbjct: 125 DPHLRLIVLAFLPLLLGLYLSRIHS--SETTSTPSLAGFEAVLLAIYSSEAKSRAGKPVL 184

Query: 190 VAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCY 249
           V +PDLSQPSLYHSP NK NS      + SVGVL P LEPQ AVKSTKR  I+GV LDCY
Sbjct: 185 VQVPDLSQPSLYHSPRNKQNSHGLNSSKQSVGVLSPPLEPQIAVKSTKRPVIVGVTLDCY 244

Query: 250 YKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDG--------FSEKRAL-- 309
           +KQISQMPSWSK+ELC+ A+ WAGQDC CK +FD +  + I+         F E R L  
Sbjct: 245 FKQISQMPSWSKVELCKYASDWAGQDCACKDKFDVDKEIAIENGNGRSGGYFLEDRNLSN 304

Query: 310 ---EYGDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPL 369
                G EI+DV  +M  L +E   + + DSE +G RIPLPWE+LQPLLRILGHCLL P+
Sbjct: 305 GYGNNGHEIDDVVEEMEKLGIER--DVTVDSESRGVRIPLPWEILQPLLRILGHCLLGPM 364

Query: 370 NSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAK-AATATNAYTPSK 420
           N +DVKDA+SVAVR LYAR SHDL PQV+LATRSLIQLD R+R AAK AA A NA TPSK
Sbjct: 365 NPEDVKDASSVAVRRLYARGSHDLAPQVLLATRSLIQLDKRSREAAKAAAAAANANTPSK 424

BLAST of CmoCh04G004140 vs. TAIR10
Match: AT5G64090.1 (AT5G64090.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 445.3 bits (1144), Expect = 4.2e-125
Identity = 265/458 (57.86%), Positives = 318/458 (69.43%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSPTSSVSSTTVPHN------PSAT---------TATADNDPMHSW 60
           MDF   PS     SPSP+SS SS+T PH       P+AT         +A AD DPMHSW
Sbjct: 1   MDFSVKPSGG---SPSPSSSTSSST-PHRFKSVTTPTATAAAVSGFSPSAAADRDPMHSW 60

Query: 61  WESVSKARSRIHALSSILPPHCDSFF-------LSSLADSDRPALSLLSSHDAYSVISSA 120
           WESVSK RSRI +LSS+L    DS F       +SSLADSDRPALSLLSS  AYS+IS++
Sbjct: 61  WESVSKQRSRILSLSSLLSG--DSHFEDGDVTPISSLADSDRPALSLLSSRAAYSLISNS 120

Query: 121 LSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSL 180
           L +  SGSGSDPLC WLY+T+LSSDP LRLVVLSF PLL  +YLSR+HS  SDS S PSL
Sbjct: 121 LCNPASGSGSDPLCQWLYETYLSSDPPLRLVVLSFFPLLVGMYLSRIHS--SDSTSLPSL 180

Query: 181 AGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCP 240
           +GFEAVLLA+Y++EVK+RAGKP+LV IPDLSQPSLYH+P N  +    +    SVGVL P
Sbjct: 181 SGFEAVLLAIYAAEVKARAGKPILVHIPDLSQPSLYHTPRNGVDKSRDSNPTASVGVLSP 240

Query: 241 SLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKE 300
            LEPQ AVKSTKRA I+GV L CY+K+ISQMP+WSKLE C+ +ASWAGQDC CK + D++
Sbjct: 241 QLEPQIAVKSTKRASIVGVGLQCYFKEISQMPAWSKLEFCKFSASWAGQDCDCKEKIDED 300

Query: 301 DGLDI---DGF--------SEKRALEYGDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRI 360
           +   +   +GF        S  R+LE  ++ + ++++    Q+ +  N       +G RI
Sbjct: 301 EDKVLALTNGFGDSSSFNGSSGRSLEIEEDFDRLAIRENEEQLSS--NGGGGGVGRGVRI 360

Query: 361 PLPWELLQPLLRILGHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQL 420
           PLPWEL QP LRILGHCLL+PLN++DVKDAAS AVR LYARASHDL PQ ILATRSL+ L
Sbjct: 361 PLPWELFQPTLRILGHCLLSPLNTEDVKDAASNAVRSLYARASHDLNPQAILATRSLVNL 420

BLAST of CmoCh04G004140 vs. TAIR10
Match: AT5G21050.1 (AT5G21050.1 LOCATED IN: chloroplast)

HSP 1 Score: 208.4 bits (529), Expect = 8.7e-54
Identity = 149/375 (39.73%), Positives = 216/375 (57.60%), Query Frame = 1

Query: 18  TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHALSSILPPHCDSFFLSSLAD 77
           + S SS   P +P A T  ++    ++  ES +K ++ I +LS+I+             +
Sbjct: 2   SDSSSSHDSPPSP-AITGDSETTVTNNESESNTKCQTAIQSLSTIV------------TN 61

Query: 78  SDRPA-LSLLSSHDAYSV-ISSALSSSVSGSGSDPLCHWLYDTFLSSDPHLRLVVLSFLP 137
           ++ P+ +++L   +A S  ISS L    SG+G + LC WLYDTF S++P L+L+VL F+P
Sbjct: 62  TNIPSTITILLDDEAVSTAISSLLLRPDSGAGDNNLCRWLYDTFQSAEPSLQLLVLRFVP 121

Query: 138 LLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGKPVLVAIPDLSQPSLYH 197
           L++ LYLSRV       P     AGFEAVLLALY+ E  SRAG+ + V IPDLS PS+YH
Sbjct: 122 LIAGLYLSRV-------PLRQPQAGFEAVLLALYAHETTSRAGQAITVNIPDLSYPSIYH 181

Query: 198 SP--LNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVALDCYYKQISQMPSWS 257
               L + N+        ++ V+  +L+P   V+ST+RA I+GVAL+ YY +IS+MP  S
Sbjct: 182 ESKGLTRNNNSTCL----NIAVISSTLDPHGTVRSTRRARIVGVALELYYSKISKMPRES 241

Query: 258 KLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIEDVSVKMGNLQVETC 317
           KL  C S   WAGQ+     E ++     I   S+    +   E E+V++          
Sbjct: 242 KLNFCESCEKWAGQN----GETEQSSRAVIPTLSD----DSWREEENVAI---------- 301

Query: 318 GNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLA-PLNSQDVKDAASVAVRCLYARASHD 377
                 SE    RIPLPWELLQP+LRILGHCLL   +  +++ +AA+ A + LY R+ HD
Sbjct: 302 ---GGRSERDSGRIPLPWELLQPILRILGHCLLGLKMEDRELSEAANKACQSLYLRSLHD 331

Query: 378 LVPQVILATRSLIQL 388
           + P+ ILAT SL++L
Sbjct: 362 INPKAILATGSLLRL 331

BLAST of CmoCh04G004140 vs. NCBI nr
Match: gi|449438789|ref|XP_004137170.1| (PREDICTED: uncharacterized protein LOC101215901 [Cucumis sativus])

HSP 1 Score: 725.7 bits (1872), Expect = 4.6e-206
Identity = 378/425 (88.94%), Positives = 394/425 (92.71%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSP-TSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHAL 60
           MDFHRNPS+NNRHS SP +SS SSTT  HNP+A TA+AD DPMHSWWESVSKARSRIHAL
Sbjct: 1   MDFHRNPSINNRHSSSPSSSSASSTTALHNPTA-TASADTDPMHSWWESVSKARSRIHAL 60

Query: 61  SSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTF 120
           SSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSVISSALSSS+SGSGSDPLCHWLYDTF
Sbjct: 61  SSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSALSSSLSGSGSDPLCHWLYDTF 120

Query: 121 LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGK 180
           LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVKSRAGK
Sbjct: 121 LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVKSRAGK 180

Query: 181 PVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACIIGVAL 240
           PV+V+IPDLSQPSLYHSP+NKPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI+GVAL
Sbjct: 181 PVVVSIPDLSQPSLYHSPMNKPNSGAQAQVRPSVGVLSPSLEPQNAVKSTKRACIVGVAL 240

Query: 241 DCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEYGDEIE 300
           DCYYKQISQMPSWSKLE CRSAASWAGQDCCC REFDKEDG D+ GFSEKRALEY DEIE
Sbjct: 241 DCYYKQISQMPSWSKLEFCRSAASWAGQDCCCTREFDKEDGFDVGGFSEKRALEYTDEIE 300

Query: 301 DVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDVKDAAS 360
           D S +MG LQ+E CGNNS+DSEPKG RIPLPWELLQP+LRILGHCLLAPLNSQDVKD AS
Sbjct: 301 DASEEMGRLQIEKCGNNSNDSEPKGSRIPLPWELLQPVLRILGHCLLAPLNSQDVKDEAS 360

Query: 361 VAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA-----TNAYTPSKDKKSEI 420
           VAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAA A     +NA TPSKDKK EI
Sbjct: 361 VAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAAAAAANSSSNANTPSKDKKPEI 420

BLAST of CmoCh04G004140 vs. NCBI nr
Match: gi|659111243|ref|XP_008455651.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103495769 [Cucumis melo])

HSP 1 Score: 716.8 bits (1849), Expect = 2.1e-203
Identity = 378/428 (88.32%), Positives = 393/428 (91.82%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSPTSS-VSSTTVPH--NPSAT---TATADNDPMHSWWESVSKARS 60
           MDFHRNPS+NNRHS SP+SS  SSTT P   NP+AT   +A+AD DPMHSWWESVSKARS
Sbjct: 1   MDFHRNPSINNRHSSSPSSSSASSTTRPFXTNPTATASASASADTDPMHSWWESVSKARS 60

Query: 61  RIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHW 120
           RIHALSSILPPH DSFFLSS+ADSDRPALSLLSSHDAYSVISSALSSS SGSGSDPLCHW
Sbjct: 61  RIHALSSILPPHSDSFFLSSVADSDRPALSLLSSHDAYSVISSALSSSHSGSGSDPLCHW 120

Query: 121 LYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVK 180
           LYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPS PSLAGFEAVLLALYSSEVK
Sbjct: 121 LYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSLPSLAGFEAVLLALYSSEVK 180

Query: 181 SRAGKPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTKRACI 240
           SRAGKPV+V+IPDLSQPSLYHSPLNKPNS AQAQ RPSVGVL PSLEPQNAVKSTKRACI
Sbjct: 181 SRAGKPVVVSIPDLSQPSLYHSPLNKPNSGAQAQARPSVGVLSPSLEPQNAVKSTKRACI 240

Query: 241 IGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGLDIDGFSEKRALEY 300
           +GVALDCYYKQISQMPSWSKL  CRSAASWAGQDCCC REFDKEDGLD+ GFSEKRALEY
Sbjct: 241 VGVALDCYYKQISQMPSWSKLAFCRSAASWAGQDCCCTREFDKEDGLDVGGFSEKRALEY 300

Query: 301 GDEIEDVSVKMGNLQVETCGNNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDV 360
            DEIED S +MG LQ+E CGNNS+DSEPKG RIPLPWELLQP+LRILGHCLL PLNSQDV
Sbjct: 301 TDEIEDASEEMGRLQIEKCGNNSNDSEPKGSRIPLPWELLQPILRILGHCLLTPLNSQDV 360

Query: 361 KDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA---TNAYTPSKDKK 420
           KD ASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAA A   +NA TPSKDKK
Sbjct: 361 KDEASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAAAANSSSNANTPSKDKK 420

BLAST of CmoCh04G004140 vs. NCBI nr
Match: gi|1009150292|ref|XP_015892942.1| (PREDICTED: uncharacterized protein LOC107427110 [Ziziphus jujuba])

HSP 1 Score: 534.6 bits (1376), Expect = 1.5e-148
Identity = 297/434 (68.43%), Positives = 342/434 (78.80%), Query Frame = 1

Query: 1   MDFHRNPSVNN-RHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHAL 60
           MDFH+N   N+   S S TSS+++T  P+   A  A+  +DPMHSWWES+SKARSRIH+L
Sbjct: 1   MDFHQNHLANSPSSSSSSTSSMTTTPNPNGTGAVAASTTDDPMHSWWESISKARSRIHSL 60

Query: 61  SSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTF 120
           SSIL P   S  LSSLADSDRPALSLLSS DAY+ + S+LSS +SGSGSDPLCHWLYDTF
Sbjct: 61  SSILDPSV-SPSLSSLADSDRPALSLLSSPDAYAAVCSSLSSPLSGSGSDPLCHWLYDTF 120

Query: 121 LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGK 180
           LSSDPHLRLVV SF+PLLS  YLSR+HS SSDSPS PSLAGFEAVLLALYS+E K+RAGK
Sbjct: 121 LSSDPHLRLVVHSFIPLLSGTYLSRIHSLSSDSPSLPSLAGFEAVLLALYSAETKARAGK 180

Query: 181 PVLVAIPDLSQPSLYHSPLNKPNSVA---QAQFRPSVGVLCPSLEPQNAVKSTKRACIIG 240
           P++V++PDLSQPSLYH+PLNKPNS +     Q RPSVGVL P LEPQ AVKSTKRACI+G
Sbjct: 181 PLVVSVPDLSQPSLYHAPLNKPNSQSPTHATQSRPSVGVLSPPLEPQIAVKSTKRACIVG 240

Query: 241 VALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGL-DIDGFSEKRALEYG 300
           VALD YYKQISQMP+WSK + C  AASWAGQDC C+ + D +DG  +I GFSE R+LE G
Sbjct: 241 VALDSYYKQISQMPAWSKHDFCLFAASWAGQDCSCQHQLDGDDGQPEIAGFSEIRSLENG 300

Query: 301 DEIEDVSVKMGNLQVETCG-NNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDV 360
            +I+D   +M +L+++  G  +S +   KG RIPLPWELLQP LRILGHCLLAPLNSQDV
Sbjct: 301 KQIDDAVEEMAHLRIQQNGCGSSANGVSKGSRIPLPWELLQPALRILGHCLLAPLNSQDV 360

Query: 361 KDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA---------TNAYT 420
           K+AA VAVR LYARASHDLVPQ ILATRSLIQL  R R+AA AA A         +N+ T
Sbjct: 361 KEAAGVAVRRLYARASHDLVPQAILATRSLIQLHKRARSAAMAAAAAAAATANSSSNSNT 420

BLAST of CmoCh04G004140 vs. NCBI nr
Match: gi|1009150294|ref|XP_015892943.1| (PREDICTED: uncharacterized protein LOC107427111 [Ziziphus jujuba])

HSP 1 Score: 534.3 bits (1375), Expect = 2.0e-148
Identity = 297/434 (68.43%), Positives = 342/434 (78.80%), Query Frame = 1

Query: 1   MDFHRNPSVNN-RHSPSPTSSVSSTTVPHNPSATTATADNDPMHSWWESVSKARSRIHAL 60
           MDF +NP  N+   S S TSS+++T  P+   A  A+  +DPMHSWWES+SKARSRIH+L
Sbjct: 1   MDFPQNPLANSPSSSSSSTSSMTTTPNPNGTGAVAASTTDDPMHSWWESISKARSRIHSL 60

Query: 61  SSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDPLCHWLYDTF 120
           SSIL P   S  LSSLADSDRPALSLLSS DAY+ + S+LSS +SGSGSDPLCHWLYDTF
Sbjct: 61  SSILDPSV-SPSLSSLADSDRPALSLLSSPDAYAAVCSSLSSPLSGSGSDPLCHWLYDTF 120

Query: 121 LSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYSSEVKSRAGK 180
           LSSDPHLRLVV SF+PLLS  YLSR+HS SSDSPS PSLAGFEAVLLALYS+E K+RAGK
Sbjct: 121 LSSDPHLRLVVHSFIPLLSGTYLSRIHSLSSDSPSLPSLAGFEAVLLALYSAETKARAGK 180

Query: 181 PVLVAIPDLSQPSLYHSPLNKPNSVA---QAQFRPSVGVLCPSLEPQNAVKSTKRACIIG 240
           P++V++PDLSQPSLYH+PLNKPNS +     Q RPSVGVL P LEPQ AVKSTKRACI+G
Sbjct: 181 PLVVSVPDLSQPSLYHAPLNKPNSQSPTHATQSRPSVGVLSPPLEPQIAVKSTKRACIVG 240

Query: 241 VALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKEDGL-DIDGFSEKRALEYG 300
           VALD YYKQISQMP+WSK + C  AASWAGQDC C+ + D +DG  +I GFSE R+LE G
Sbjct: 241 VALDSYYKQISQMPAWSKHDFCLFAASWAGQDCSCQHQLDGDDGQPEIAGFSEIRSLENG 300

Query: 301 DEIEDVSVKMGNLQVETCG-NNSDDSEPKGFRIPLPWELLQPLLRILGHCLLAPLNSQDV 360
            +I+D   +M +L+++  G  +S +   KG RIPLPWELLQP LRILGHCLLAPLNSQDV
Sbjct: 301 KQIDDAVEEMAHLRIQQNGCGSSANGVSKGSRIPLPWELLQPALRILGHCLLAPLNSQDV 360

Query: 361 KDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA---------TNAYT 420
           K+AA VAVR LYARASHDLVPQ ILATRSLIQL  R R+AA AA A         +N+ T
Sbjct: 361 KEAAGVAVRRLYARASHDLVPQAILATRSLIQLHKRARSAAMAAAAAAAATANSSSNSNT 420

BLAST of CmoCh04G004140 vs. NCBI nr
Match: gi|703090815|ref|XP_010094185.1| (hypothetical protein L484_017230 [Morus notabilis])

HSP 1 Score: 515.8 bits (1327), Expect = 7.2e-143
Identity = 296/447 (66.22%), Positives = 337/447 (75.39%), Query Frame = 1

Query: 1   MDFHRNPSVNNRHSPSP-TSSVSS--TTVPH-NP------SATTATADNDPMHSWWESVS 60
           MDFH +      +SPS  TSS+S+  TT P+ NP      +A  A   +DPMHSWWES+S
Sbjct: 1   MDFHHHHQTLASNSPSSSTSSISAAATTYPNGNPISSAAATAAAAAPSSDPMHSWWESIS 60

Query: 61  KARSRIHALSSILPPHCDSFFLSSLADSDRPALSLLSSHDAYSVISSALSSSVSGSGSDP 120
           KARSRIH+LSSILP    S  LSSLADSDRPALSLLSS  AYS +SSALS   SGSGSDP
Sbjct: 61  KARSRIHSLSSILPDPSLSLSLSSLADSDRPALSLLSSSLAYSALSSALSDPHSGSGSDP 120

Query: 121 LCHWLYDTFLSSDPHLRLVVLSFLPLLSSLYLSRVHSTSSDSPSPPSLAGFEAVLLALYS 180
           LCHWLYDTFLSSDPHLRLVV SFLPL+S LYLSR+HS S+DSPS PSLAGFEAVLLA+Y+
Sbjct: 121 LCHWLYDTFLSSDPHLRLVVFSFLPLISGLYLSRIHSLSTDSPSLPSLAGFEAVLLAIYA 180

Query: 181 SEVKSRAGKPVLVAIPDLSQPSLYHSPLNKPNSVAQAQFRPSVGVLCPSLEPQNAVKSTK 240
           +E KSRAGKPVLV++PDLSQPSLYH+P  K      A  + SVGVL P LEPQ AVKSTK
Sbjct: 181 AETKSRAGKPVLVSVPDLSQPSLYHTPRQK-----IAGSKNSVGVLSPPLEPQIAVKSTK 240

Query: 241 RACIIGVALDCYYKQISQMPSWSKLELCRSAASWAGQDCCCKREFDKED---GLDIDGFS 300
           RACI+GVALDCYYKQISQMP+WSKL+ CR AASWAGQ+C C+++FD +D    +     S
Sbjct: 241 RACIVGVALDCYYKQISQMPAWSKLDFCRFAASWAGQECTCEKQFDDDDDDESVVSSRVS 300

Query: 301 EKRALEYGDEIEDVSVKMGNLQVETCGNNSDDS------EPKGFRIPLPWELLQPLLRIL 360
           E R LE GD+ +DV   +  L++E  G  S  S      + KG RIPLPWELLQP+LRIL
Sbjct: 301 EIRYLENGDQTDDVVDDIAQLRIENGGGRSGSSSGGENLDSKGSRIPLPWELLQPVLRIL 360

Query: 361 GHCLLAPLNSQDVKDAASVAVRCLYARASHDLVPQVILATRSLIQLDNRTRAAAKAATA- 420
           GHCLLAPLNSQDVKD A+VAVR LYARASHDLVPQ ILATRSLIQL+ R+RA  +AA A 
Sbjct: 361 GHCLLAPLNSQDVKDMAAVAVRRLYARASHDLVPQAILATRSLIQLEKRSRADVRAAAAA 420

The following BLAST results are available for this feature:
Swiss-Prot
Match Name   E-value  Identity (%)  Description
F126B_HUMAN  4.7e-09  31.01         Protein FAM126B OS=Homo sapiens GN=FAM126B PE=1 SV=1
F126B_PONAB  4.7e-09  31.01         Protein FAM126B OS=Pongo abelii GN=FAM126B PE=2 SV=1
F126B_MOUSE  6.1e-09  31.01         Protein FAM126B OS=Mus musculus GN=Fam126b PE=1 SV=1
HYCCI_MOUSE  4.4e-07  30.57         Hyccin OS=Mus musculus GN=Fam126a PE=1 SV=3
HYCCI_CHICK  4.4e-07  29.90         Hyccin OS=Gallus gallus GN=FAM126A PE=2 SV=2

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KZ90_CUCSA  3.2e-206  88.94         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G107390 PE=4 SV=1
W9QVH9_9ROSA      5.0e-143  66.22         Uncharacterized protein OS=Morus notabilis GN=L484_017230 PE=4 SV=1
F6H1N7_VITVI      8.9e-140  68.54         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_12s0055g01090 PE=4 SV=...
A0A061E3B9_THECC  2.0e-136  63.92         Uncharacterized protein OS=Theobroma cacao GN=TCM_007521 PE=4 SV=1
A0A067LQE6_JATCU  5.0e-135  64.73         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_06562 PE=4 SV=1

TAIR10
Match Name   E-value   Identity (%)  Description
AT5G64090.1  4.2e-125  57.86         FUNCTIONS IN: molecular_function unknown
AT5G21050.1  8.7e-54   39.73         LOCATED IN: chloroplast

NCBI nr
Match Name                         E-value   Identity (%)  Description
gi|449438789|ref|XP_004137170.1|   4.6e-206  88.94         PREDICTED: uncharacterized protein LOC101215901 [Cucumis sativus]
gi|659111243|ref|XP_008455651.1|   2.1e-203  88.32         PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103495769 [Cucumis me...
gi|1009150292|ref|XP_015892942.1|  1.5e-148  68.43         PREDICTED: uncharacterized protein LOC107427110 [Ziziphus jujuba]
gi|1009150294|ref|XP_015892943.1|  2.0e-148  68.43         PREDICTED: uncharacterized protein LOC107427111 [Ziziphus jujuba]
gi|703090815|ref|XP_010094185.1|   7.2e-143  66.22         hypothetical protein L484_017230 [Morus notabilis]
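The identity percentages in the summary tables can be re-derived from the counts in each HSP header, for example 49/158 for F126B_HUMAN. The sketch below parses one such header line and recomputes both percentages. The regular expression and the function name `check_hsp_percentages` are illustrative assumptions about the line layout seen in this record, not part of any BLAST tool; the pattern accepts "Positives" as well as the "Postives" spelling that some report exports use.

```python
import re

# Pattern for lines such as:
#   Identity = 49/158 (31.01%), Positives = 75/158 (47.47%), Query Frame = 1
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_percentages(line: str) -> dict:
    """Recompute identity/positives percentages from the raw counts."""
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        "reported": (float(ident_pct), float(pos_pct)),
    }


example = "Identity = 49/158 (31.01%), Positives = 75/158 (47.47%), Query Frame = 1"
print(check_hsp_percentages(example))
```

For the F126B_HUMAN hit the recomputed values agree with the reported 31.01% identity and 47.47% positives.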
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR018619  Hyccin
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0006468      protein phosphorylation
biological_process  GO:0048544      recognition of pollen
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0005524      ATP binding
molecular_function  GO:0005515      protein binding
molecular_function  GO:0004672      protein kinase activity

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CmoCh04G004140.1  CmoCh04G004140.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term   IPR Description   Source   Source Term    Source Description   Alignment
IPR018619  Hyccin            PFAM     PF09790        Hyccin               coord: 77..387, score: 1.9
None       No IPR available  PANTHER  PTHR31220      FAMILY NOT NAMED     coord: 309..419, score: 1.3E-220; coord: 42..290, score: 1.3E
None       No IPR available  PANTHER  PTHR31220:SF2  SUBFAMILY NOT NAMED  coord: 309..419, score: 1.3E-220; coord: 42..290, score: 1.3E

The following gene(s) are paralogous to this gene:

None