CmoCh04G031220 (gene) Cucurbita moschata (Rifu)

Name: CmoCh04G031220
Type: gene
Organism: Cucurbita moschata (Cucurbita moschata (Rifu))
Description: Plant/F7F23-4 protein
Location: Cmo_Chr04 : 21668766 .. 21671951 (+)
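
As a quick sanity check on the location above, the length of the gene span can be computed from the two coordinates. This is a minimal sketch that assumes the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state explicitly); the function name `span_length` is illustrative only.

```python
def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field: Cmo_Chr04 : 21668766 .. 21671951 (+)
gene_length = span_length(21668766, 21671951)
print(gene_length)  # 3186
```

Under that assumption the gene sequence (with introns) should be 3186 bases long.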
The following sequences are available for this feature:

Gene sequence (with intron)

AAAATCCCATGCGGGGCCAGAGAGGAAGCGAAGGTGTGTCGTGTGTGTTTGGTTCAGTTCTAGCTTCTGCTTCCACATTCGATGATGGAGCTCCATTGCGCATTTCTCCACACATCATTCTCTTTCCCTATCAGAGATAGAATTCCTGCTCATGGAGACGCCTCCGCTGCCGTCGCCTGCTCTCCTTCTTCGTCCTCGCTTTCAAGAATTCCAGCTCGAAATTTTTCTATGGGTTCTAACACTATAGGTAACGAACTCTGAACTTTGTTAAAGTTTCCGACGAGTTGCGAAGATAAGACCATGAGATTGCTATCTTTGTTCATTATATTGTTGAATTATTCACGACTTAGAAAATTGTCATCAGATTTTCTCGAGCACAAAATTGGATTTGGAAATTTGGCCTTAAATTTTGTTTTGTCTGTCTTGTGCGTGTATAGCTGTGCATTGGTCTTGATGTCGAATATCTATTGTTTCAAATTATTGATTCCTGATAGTTCGTGTCCCAACATCTTATCAAGCTTCATATTGACTTGGCAGTCACAAAGTGAATGCGAAGGCAGTATGAATTGTTTTGAATTGTAGAAATTTGTTACTTTGCTAGTTTATCTTGTATTAGAAAATATGATCGGGTGAAGTATTAATAATTTATATCTTGCTAGTTTAAATTGTAACAACGTTTTTTTTTATCAACATAGAGTGAAATATTAATAATTTTGTTGAATACCAGAGTTCCGCTCATTGGTTTCTCGAGTTAGACTGATGAAATCATCCTGTTCTGCCGTTGTCAGAGGAGAGAGTGCAGTACCTAGTGAATGCAGTTCAGAAACCATTGATTCTTCGAATTCAACTCCAACTCGCAATGGGCCAGTGGGAAATGTTCCAAATGCAAAAGATGGCGTGGAATGCTTGGATCAGCATAAAATGACCAAAGTGTGTGACAAGCTTATTGACGTCTTTCTGGTCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAGTATCCGACCCCATTTCTTTAGAAGGTGCCAAGATCGAGCTGCCTCCGAGGATGATCCTGGGATGCAGCATAAGCTACTCCGGCTAGGAAGAAAGCTGAAAGAGGTCAGTCTTGGGATCTTAGAAAAAGGGTAAAAAAACTCTTGTTTAAATGACCTTGTTTTGATATTTAAAAAAAAAAAAAAAATCCGCAAATAAGACGAGTGTAAGCTAACATTTCGTATGCCAACAAGGTCAATCATCAGTTCTATTGGTGTAGAGCCACAAGGTGTGTTCCTCGGGGAACAATCAAAATGAAGTGTTTCGTACTTGATTATTTTATGATCACATAGTTGCAATATGTGAGTTGTTTATCATGTTTTTTTTCCCCTTTAAAAATTTTCACAGATTGATGAAGATGTGCAGAGACACAACGAATTTCTTGAGGTGCTCAGAGGAGCAGCACCATCAGAACTTGGTGAAATTATTTCGAGGCGTCGTAAAGATTTTACAAAAGAATTCTTTGTTCATCTTCACACAGTGACTGAATCTTATTATGATGATCCAACTGAGCAAGATGGTATTACTGCATACTTTTTAGTCTGCAAAATACTTGTGAACTTTTCTTGTTGATAGTAGTGAATAAATAGCTGCCTAAGTGGTTGGGGGAAGCCTAATTCAGGGAATGATCCGCAACCTAAGCGTAGCTTGATTGTTTAAGGTTTTTATACATTTATAATTAGTTCGTCTCCCTACAAAAATTGGAATCTTCTCTTTATTTTTAAGTTTTTGAATGTCGTCTTCTTTCGTTTGCAAAATGTGTTCTTTTTTTCTATAATATATGGTATTGATATGCTTGAAGTGATATTTTTTTCGCTTATATTTCAATTGATAAGCTCTGGCGAAGCTTGGGAACTCCTGCCTGGCTGCTGTACAAGCATATGATGCCGCAACTGAAAACATTGAAGCACTAGATGCCGCAGAGTTGAAATTCCAGGATA
TCATCAATTCTCCAAATTTAGATGCCGCTTGCAGAAAGATAGACAGTCTGGCGGAGAAAAACCAGCTTGATTCTGCATTAGTATTGATGATTACAAAAGCTTGGTCTGCTGCAAAGGAATCGAACATGATGAAAGACGAGGTAATGCATTTGCTACCCTTGCTGTTTGCCTTTGATACAAAAATGTGGTTTCATGGTCTTATTTTCCGACCATTTTAGAAATTCCAATGACTGTGCGAAGTTTGTAATTACTGAAACTAACGCTTTCAAATAGTGGTTTCTGTTTGAACGAAACTACGATAGCTTAAGATACCGAAGAGAGAAAAATACCAATTTGGTCCCATGATTTCATTAGTTTCTATATTGAGTCGTATGAGCATTTAGCAAATGAGGGCACGTTTAATCGATGTCCTGTTGTTTCATATTTACTTTTCGATGGTGAAAGTGTTGGTGTTACTTTTATTGGCGCAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAGTGCCAACAGAAATCAGGATTCTGAAGCATCTTCTTACAATCGAGGATCCTGAGGAGAAGCTAAGTGCCCTGAAGGATGCATTTACCCCTGGCGAAGAACTTCAAGGACAAGATGTAGACTGCTTATACACGTATGCTTTCTCACCTCATTGTTAAGAATATATTGAAATATTTAGCATGCCATAATCTATGAGCGTAAACCCAATATGATAGCAGATTAGGGGGTTCAAAGCTTCCATGTTCCAACCCTTACAGTGCTATTTTCTTATCTATTATACTCAAAGTTTCTTATCTATTATACTCAAAGTTTCTTATCTATTAAAACAGTGATTTTAGTGAAAACGATTTGTTTGTAAGTAATTCATTGAGTTCGTTGGCTACGGAGGCGTTCCAACCATTTGATTTTGAACGTGACAAAAGTGCTAAATATAGTTTAAATTTATTATCAGTGATTTAACACTCCCTGCCATCGTTTTTTTTTTTCTTAAAACACTTTTAGGACTCCAGAGAAGCTTCACGCGTGGATGAAGACAGTGTTGGATGCTTATCATTTCAGCCGGGAAGGCACCCTCATAAAGGAAGCCAGAGACCTTATGAACTCACAGGTCATCGTAAAACTTGAGGAATTGAAACATCTCGTTGAGAAAAATTTTATGTGA

mRNA sequence

AAAATCCCATGCGGGGCCAGAGAGGAAGCGAAGGTGTGTCGTGTGTGTTTGGTTCAGTTCTAGCTTCTGCTTCCACATTCGATGATGGAGCTCCATTGCGCATTTCTCCACACATCATTCTCTTTCCCTATCAGAGATAGAATTCCTGCTCATGGAGACGCCTCCGCTGCCGTCGCCTGCTCTCCTTCTTCGTCCTCGCTTTCAAGAATTCCAGCTCGAAATTTTTCTATGGGTTCTAACACTATAGAGTTCCGCTCATTGGTTTCTCGAGTTAGACTGATGAAATCATCCTGTTCTGCCGTTGTCAGAGGAGAGAGTGCAGTACCTAGTGAATGCAGTTCAGAAACCATTGATTCTTCGAATTCAACTCCAACTCGCAATGGGCCAGTGGGAAATGTTCCAAATGCAAAAGATGGCGTGGAATGCTTGGATCAGCATAAAATGACCAAAGTGTGTGACAAGCTTATTGACGTCTTTCTGGTCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAGTATCCGACCCCATTTCTTTAGAAGGTGCCAAGATCGAGCTGCCTCCGAGGATGATCCTGGGATGCAGCATAAGCTACTCCGGCTAGGAAGAAAGCTGAAAGAGATTGATGAAGATGTGCAGAGACACAACGAATTTCTTGAGGTGCTCAGAGGAGCAGCACCATCAGAACTTGGTGAAATTATTTCGAGGCGTCGTAAAGATTTTACAAAAGAATTCTTTGTTCATCTTCACACAGTGACTGAATCTTATTATGATGATCCAACTGAGCAAGATGCTCTGGCGAAGCTTGGGAACTCCTGCCTGGCTGCTGTACAAGCATATGATGCCGCAACTGAAAACATTGAAGCACTAGATGCCGCAGAGTTGAAATTCCAGGATATCATCAATTCTCCAAATTTAGATGCCGCTTGCAGAAAGATAGACAGTCTGGCGGAGAAAAACCAGCTTGATTCTGCATTAGTATTGATGATTACAAAAGCTTGGTCTGCTGCAAAGGAATCGAACATGATGAAAGACGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAGTGCCAACAGAAATCAGGATTCTGAAGCATCTTCTTACAATCGAGGATCCTGAGGAGAAGCTAAGTGCCCTGAAGGATGCATTTACCCCTGGCGAAGAACTTCAAGGACAAGATGTAGACTGCTTATACACGACTCCAGAGAAGCTTCACGCGTGGATGAAGACAGTGTTGGATGCTTATCATTTCAGCCGGGAAGGCACCCTCATAAAGGAAGCCAGAGACCTTATGAACTCACAGGTCATCGTAAAACTTGAGGAATTGAAACATCTCGTTGAGAAAAATTTTATGTGA

Coding sequence (CDS)

ATGATGGAGCTCCATTGCGCATTTCTCCACACATCATTCTCTTTCCCTATCAGAGATAGAATTCCTGCTCATGGAGACGCCTCCGCTGCCGTCGCCTGCTCTCCTTCTTCGTCCTCGCTTTCAAGAATTCCAGCTCGAAATTTTTCTATGGGTTCTAACACTATAGAGTTCCGCTCATTGGTTTCTCGAGTTAGACTGATGAAATCATCCTGTTCTGCCGTTGTCAGAGGAGAGAGTGCAGTACCTAGTGAATGCAGTTCAGAAACCATTGATTCTTCGAATTCAACTCCAACTCGCAATGGGCCAGTGGGAAATGTTCCAAATGCAAAAGATGGCGTGGAATGCTTGGATCAGCATAAAATGACCAAAGTGTGTGACAAGCTTATTGACGTCTTTCTGGTCGACAAGCCAACTCCAACAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAGTATCCGACCCCATTTCTTTAGAAGGTGCCAAGATCGAGCTGCCTCCGAGGATGATCCTGGGATGCAGCATAAGCTACTCCGGCTAGGAAGAAAGCTGAAAGAGATTGATGAAGATGTGCAGAGACACAACGAATTTCTTGAGGTGCTCAGAGGAGCAGCACCATCAGAACTTGGTGAAATTATTTCGAGGCGTCGTAAAGATTTTACAAAAGAATTCTTTGTTCATCTTCACACAGTGACTGAATCTTATTATGATGATCCAACTGAGCAAGATGCTCTGGCGAAGCTTGGGAACTCCTGCCTGGCTGCTGTACAAGCATATGATGCCGCAACTGAAAACATTGAAGCACTAGATGCCGCAGAGTTGAAATTCCAGGATATCATCAATTCTCCAAATTTAGATGCCGCTTGCAGAAAGATAGACAGTCTGGCGGAGAAAAACCAGCTTGATTCTGCATTAGTATTGATGATTACAAAAGCTTGGTCTGCTGCAAAGGAATCGAACATGATGAAAGACGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGCGGTAACCTTCAAAGATTAGTGCCAACAGAAATCAGGATTCTGAAGCATCTTCTTACAATCGAGGATCCTGAGGAGAAGCTAAGTGCCCTGAAGGATGCATTTACCCCTGGCGAAGAACTTCAAGGACAAGATGTAGACTGCTTATACACGACTCCAGAGAAGCTTCACGCGTGGATGAAGACAGTGTTGGATGCTTATCATTTCAGCCGGGAAGGCACCCTCATAAAGGAAGCCAGAGACCTTATGAACTCACAGGTCATCGTAAAACTTGAGGAATTGAAACATCTCGTTGAGAAAAATTTTATGTGA
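
The protein used as the BLAST query below can be recovered by translating the coding sequence. The following is a minimal sketch that assumes the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only short fragments of the CDS above are translated here.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


print(translate("ATGATGGAGCTCCATTGC"))  # first 18 bases of the CDS: MMELHC
print(translate("AATTTTATGTGA"))        # last 12 bases of the CDS: NFM*
```

The first fragment matches the start of the query in the alignments (MELHC from position 2), and the last one matches its end (NFM followed by the stop codon).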
BLAST of CmoCh04G031220 vs. Swiss-Prot
Match: Y4920_ARATH (Uncharacterized protein At4g37920, chloroplastic OS=Arabidopsis thaliana GN=At4g37920 PE=1 SV=2)

HSP 1 Score: 298.5 bits (763), Expect = 1.2e-79
Identity = 151/360 (41.94%), Positives = 241/360 (66.94%), Query Frame = 1

Query: 87  SETIDSSNSTPTRNGPVG-NVPNAKDG---VECLDQHKMTKVCDKLIDVFLVDKPTPTDW 146
           S TI  +  T T NG     V ++ +    VE  + + M + CDK+ID+FL +KP    W
Sbjct: 45  SSTITFATDTVTYNGTTSAEVKSSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQW 104

Query: 147 RRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLLRLGRKLKEIDEDVQRHNEFLEV 206
           +  +    EW+    +F++RC+ RA +E DP ++ KL+ L  K+K+ID+++++HN+ L+ 
Sbjct: 105 KTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKE 164

Query: 207 LRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDDPTEQDALAKLGNSCLAAVQAYD 266
           ++   P+++  I ++RR+DFT EFF ++  ++E+  D   ++DA+A+L   CL+AV AYD
Sbjct: 165 IQ-ENPTDINAIAAKRRRDFTGEFFRYVTLLSETL-DGLEDRDAVARLATRCLSAVSAYD 224

Query: 267 AATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESN 326
              E++E LD A+ KF+DI+NSP++D+AC KI SLA+  +LDS+L+L+I  A++AAKES 
Sbjct: 225 NTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQ 284

Query: 327 MMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTIEDPEEKLSALKDAFTPGEELQG 386
            + +E KDI+YHLY     +L+ + P EI++LK+LL I DPEE+ SAL  AF+PG++ + 
Sbjct: 285 TVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEA 344

Query: 387 QDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARDLMNSQVIVKLEELKHLVEKNFM 443
           +D   LYTTP++LH W+K +LDAYH ++E T IKEA+ +    VI +L  LK  +E  ++
Sbjct: 345 KDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYL 402
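
The percentages reported for each HSP follow directly from the match counts and the alignment length. A minimal sketch, using the counts from the Swiss-Prot hit above; the helper name `percent` is illustrative:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage with 2 decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Y4920_ARATH hit: 151 identical and 241 positive (similar) columns out of 360
print(percent(151, 360))  # 41.94
print(percent(241, 360))  # 66.94
```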

BLAST of CmoCh04G031220 vs. TrEMBL
Match: A0A0A0LMH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G403700 PE=4 SV=1)

HSP 1 Score: 691.8 bits (1784), Expect = 5.4e-196
Identity = 355/441 (80.50%), Positives = 387/441 (87.76%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIRDRIPAHGDASAAVACSPSSSSLSRIPARNFSMGSNTIEFRSLV 61
           MELH A LHTSFSF IR    AHGDASAA  CSPS  SLSRI  RNFS+GS +  F SLV
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAA--CSPSLPSLSRITIRNFSLGSKSRGFPSLV 60

Query: 62  SRVRLMKSSCSAVVRGESAVPSECSSETIDSSNSTPTRNGPVGNVPNAKDGVECLDQHKM 121
              R  KSS SA VRG  AVPS+C+SET+D  N  P+ + PV +V NAKD VE LDQHKM
Sbjct: 61  CHDRPKKSSFSAFVRGVKAVPSDCNSETLDLLN--PSSDEPVRDVQNAKDSVENLDQHKM 120

Query: 122 TKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLLR 181
           TKVCDKLI+VF++DKPTP DWRRLIAFSKEWD+IRPHFF RCQDRAASEDDPGM+HKLLR
Sbjct: 121 TKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR 180

Query: 182 LGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDDP 241
            GRKLKEIDEDVQRHNE LEV+R  +PSELGEIISRRRKDFTKEFFVHLHTV +SYYDDP
Sbjct: 181 FGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDP 240

Query: 242 TEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEKN 301
            +Q+ALAKLGNSCLAAVQ YDAATENIEAL+AAELKFQDIINSP +DAACRKID+LAEKN
Sbjct: 241 AKQNALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKN 300

Query: 302 QLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTIE 361
           QLDSALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRL+P EIRILK+LLTI 
Sbjct: 301 QLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTIN 360

Query: 362 DPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARDL 421
           DPEEKLSALKDAFTPGEEL+GQDVDCLYTTPE+LH W+KTV+DAYHFSREGTL++EARDL
Sbjct: 361 DPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDL 420

Query: 422 MNSQVIVKLEELKHLVEKNFM 443
           MN Q+IVKLEELK L+EK FM
Sbjct: 421 MNPQLIVKLEELKGLIEKKFM 437

BLAST of CmoCh04G031220 vs. TrEMBL
Match: E0CRE9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g04580 PE=4 SV=1)

HSP 1 Score: 573.2 bits (1476), Expect = 2.8e-160
Identity = 288/410 (70.24%), Positives = 340/410 (82.93%), Query Frame = 1

Query: 37  SSSLSRIPARNFSMGSNTIEFRSLVSRVRLMKSSCSAVVRGESAVPSECSSETID-SSNS 96
           S S  R   RN          R LV+ +RL  SS +++V   +AVPS  + E +  SS+S
Sbjct: 20  SLSFIRNSRRNLCFTRKVEGRRLLVNPIRLQHSSVASIVGDTTAVPSRSTGEPLSFSSSS 79

Query: 97  TPTRNGPV---GNVPNAKDGVECLDQHKMTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEW 156
            P  +  V   G   + KD  ECLD HKM +VCDKLI+VF+VDKPTPTDWRRL+AFSKEW
Sbjct: 80  DPAVDKDVIGYGEKRDGKDDPECLDNHKMIRVCDKLIEVFMVDKPTPTDWRRLLAFSKEW 139

Query: 157 DSIRPHFFRRCQDRAASEDDPGMQHKLLRLGRKLKEIDEDVQRHNEFLEVLRGAAPSELG 216
            +IRPHF+RRCQDRA SE DPG +H LLRLGRKLKEIDEDV+RHNE LEV++G  P+++ 
Sbjct: 140 SNIRPHFYRRCQDRADSEGDPGKKHSLLRLGRKLKEIDEDVKRHNELLEVIKGTPPADIS 199

Query: 217 EIISRRRKDFTKEFFVHLHTVTESYYDDPTEQDALAKLGNSCLAAVQAYDAATENIEALD 276
            ++++RRKDFTKEFFVHLHTV ESY+D+PTEQ+ALAKLGN CLAAVQ YD A+E+IEAL+
Sbjct: 200 AVVAKRRKDFTKEFFVHLHTVAESYHDNPTEQNALAKLGNMCLAAVQTYDTASESIEALN 259

Query: 277 AAELKFQDIINSPNLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDIL 336
           AAELKFQDI+NSP+LD ACRKIDSLAEKNQLDSALVLMITKAWSAAKESNM KDEVKD+L
Sbjct: 260 AAELKFQDILNSPSLDVACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMTKDEVKDVL 319

Query: 337 YHLYVTASGNLQRLVPTEIRILKHLLTIEDPEEKLSALKDAFTPGEELQGQDVDCLYTTP 396
           +HLY TA GNLQRL+P EIRILK+LLTIEDPEEK+SALKDAFTPG+E++G+DVDCLYTTP
Sbjct: 320 FHLYTTARGNLQRLMPKEIRILKYLLTIEDPEEKMSALKDAFTPGDEIEGKDVDCLYTTP 379

Query: 397 EKLHAWMKTVLDAYHFSREGTLIKEARDLMNSQVIVKLEELKHLVEKNFM 443
           EKLH WM+TV+DA+HFSREGTLI+EARDLMN ++I KLEELK LV+ NFM
Sbjct: 380 EKLHTWMQTVVDAFHFSREGTLIREARDLMNPKIIQKLEELKKLVQDNFM 429

BLAST of CmoCh04G031220 vs. TrEMBL
Match: W9RSH2_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_021538 PE=4 SV=1)

HSP 1 Score: 557.4 bits (1435), Expect = 1.6e-155
Identity = 277/373 (74.26%), Positives = 323/373 (86.60%), Query Frame = 1

Query: 72  SAVVRGESAVPSECSSETIDSSN--STPTRNGPVGNVPNAKDGVECLDQHKMTKVCDKLI 131
           +AVV   +A PS  S E +  S+  S   ++G  G   + +D V  LD  KM KVCDKLI
Sbjct: 72  AAVVGDTAAAPSNYSKERVSDSDDPSLSEKSGTNGENRDERDNVGSLDDQKMNKVCDKLI 131

Query: 132 DVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLLRLGRKLKEI 191
            VF+VDKPTPTDWRRL+AFSKEWD+IRPHF++RCQ+RA SEDDPGM+HKLLRLGRKLKEI
Sbjct: 132 GVFMVDKPTPTDWRRLLAFSKEWDNIRPHFYKRCQERADSEDDPGMKHKLLRLGRKLKEI 191

Query: 192 DEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDDPTEQDALAK 251
           DEDVQRHNE LEV++GA PSE+ EI++RRRKDFTKEFFVHLHTV ESYYD+PTEQ+ALAK
Sbjct: 192 DEDVQRHNELLEVIKGA-PSEISEIVARRRKDFTKEFFVHLHTVAESYYDNPTEQNALAK 251

Query: 252 LGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEKNQLDSALVL 311
           LGN+CLAAVQAYDAA+E+ EAL+ AELK QDII+SP+LDAACRKID+LAEKNQLDSALVL
Sbjct: 252 LGNTCLAAVQAYDAASESAEALNTAELKLQDIISSPSLDAACRKIDNLAEKNQLDSALVL 311

Query: 312 MITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTIEDPEEKLSA 371
           M+TKAWSAAKESNM+KDEVKD+LYHLY+TA GNLQRL+P EIRILK++LTI DP+E+LS 
Sbjct: 312 MLTKAWSAAKESNMVKDEVKDVLYHLYLTARGNLQRLMPKEIRILKYILTIVDPDERLSV 371

Query: 372 LKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARDLMNSQVIVK 431
           L DAFTPGEEL+G+DVDCLYTTPEKLH W+K ++DAYH S EGTLI+EARDLMN ++I K
Sbjct: 372 LNDAFTPGEELEGKDVDCLYTTPEKLHTWIKIIVDAYHSSSEGTLIREARDLMNPKIIQK 431

Query: 432 LEELKHLVEKNFM 443
           LEELK LVE  FM
Sbjct: 432 LEELKTLVENKFM 443

BLAST of CmoCh04G031220 vs. TrEMBL
Match: A0A0D2TUM3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G241300 PE=4 SV=1)

HSP 1 Score: 554.3 bits (1427), Expect = 1.3e-154
Identity = 297/450 (66.00%), Positives = 355/450 (78.89%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIRDRIPAHGDASAAVACSPS--SSSLSRIPARNFSMGSNTIEFRS 61
           MEL CA       FP+R +        +++A  PS  SS+ SRI      + S  + + +
Sbjct: 1   MELACASFQACSIFPVRLKSSKATKFESSLALLPSCKSSNSSRIRC----LSSKFLSYPA 60

Query: 62  L-VSRVR-LMKSSCSAVVRGESAVPSECSSETIDSSNSTPTR--NGPV---GNVPNAKDG 121
           L ++RV    + S +AVV  ++AVP+ C  E I  S+S  +   N  V   G     K  
Sbjct: 61  LCINRVSGQQRFSLAAVVGDKTAVPNNCDEEKISDSDSAGSSLINDEVTRDGENDGDKGS 120

Query: 122 VECLDQHKMTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDD 181
           VE +D  KM +VCDKLI+VFLVDKPTPTDWRRL+AFSKEW++IRPHFF+RCQ+RA  E D
Sbjct: 121 VEGMDSVKMIRVCDKLIEVFLVDKPTPTDWRRLLAFSKEWNNIRPHFFQRCQERADVEGD 180

Query: 182 PGMQHKLLRLGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHT 241
           PGM+HKLLRLGRKLKEIDED+QRHNE LEV++G+ PSE+ E+++RRRKDFTKEFFVH+HT
Sbjct: 181 PGMKHKLLRLGRKLKEIDEDIQRHNELLEVIKGS-PSEISEMVARRRKDFTKEFFVHIHT 240

Query: 242 VTESYYDDPTEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACR 301
           V ESYYD+PTEQ+ALAKLGN+CLAAVQAYD A EN+EAL+AAELKFQDIINSP+LD ACR
Sbjct: 241 VAESYYDNPTEQNALAKLGNTCLAAVQAYDTAAENVEALNAAELKFQDIINSPSLDVACR 300

Query: 302 KIDSLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIR 361
           KIDSLAEKNQLDSALVLMITKAWSAAKESNM KDEVKDILYHLY+TA GNLQRL+P EIR
Sbjct: 301 KIDSLAEKNQLDSALVLMITKAWSAAKESNMTKDEVKDILYHLYMTARGNLQRLLPKEIR 360

Query: 362 ILKHLLTIEDPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREG 421
           I+K+LLTIEDPEE+L AL DAFTPGEEL+G D+D LYTTPEKLH  M+ V+DAY+FS EG
Sbjct: 361 IVKYLLTIEDPEERLCALNDAFTPGEELEGSDMDNLYTTPEKLHTMMRAVVDAYNFSHEG 420

Query: 422 TLIKEARDLMNSQVIVKLEELKHLVEKNFM 443
           TL++EARDLMN ++I KL EL  +VEKNFM
Sbjct: 421 TLLREARDLMNPKIIEKLGELIKIVEKNFM 445

BLAST of CmoCh04G031220 vs. TrEMBL
Match: A0A061FGT6_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_034989 PE=4 SV=1)

HSP 1 Score: 549.7 bits (1415), Expect = 3.3e-153
Identity = 297/447 (66.44%), Positives = 359/447 (80.31%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIR---DRIPAHGDASAAVACSPSSSSLSRIPARNFSMGSNTIEFR 61
           MEL  A        P+R    +  ++G +SA+++ + +S+S SRI   +    S      
Sbjct: 1   MELASASRQACCLLPVRLKSSKATSYGSSSASLSSTKNSNS-SRIRCLSSKFSSCPA--- 60

Query: 62  SLVSRVRLMKSSCSAVVRGESAVPSECSSETI---DSSNSTPTRNGPVGNVPNAKDGVEC 121
            +V++VRL +   +AVV  ++AVP++C  E +   DS +S+P  N  V    + K  VE 
Sbjct: 61  LIVNQVRLRRP-LAAVVGDKTAVPNDCDGEQVAGLDSFDSSPI-NDDVNE--DEKGSVEG 120

Query: 122 LDQHKMTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGM 181
           LD  KM +VCDKLI+VF+VDKPTPTDWRRL+AFSKEW SIRPHFF+RCQDRA  E DPGM
Sbjct: 121 LDNSKMIRVCDKLIEVFMVDKPTPTDWRRLLAFSKEWSSIRPHFFKRCQDRADGEADPGM 180

Query: 182 QHKLLRLGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTE 241
           +HKLLRLGRKLKEIDEDVQRHNE LEV++GA PSE+ EI++RRRKDFTKEFFVHLHTV E
Sbjct: 181 KHKLLRLGRKLKEIDEDVQRHNELLEVVKGA-PSEVSEIVARRRKDFTKEFFVHLHTVAE 240

Query: 242 SYYDDPTEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKID 301
           S YD+PTEQ+ALAKLGN+CLAAVQAYD ATE+IEA++AAELKFQDIINSP+LD AC+KID
Sbjct: 241 SCYDNPTEQNALAKLGNTCLAAVQAYDTATESIEAINAAELKFQDIINSPSLDVACQKID 300

Query: 302 SLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILK 361
           SLA KNQLDSAL+LMITKAWSAAKESNM KDEVKDILYHLY+TA GNLQRL+P EIRI+K
Sbjct: 301 SLAAKNQLDSALMLMITKAWSAAKESNMTKDEVKDILYHLYMTARGNLQRLLPKEIRIVK 360

Query: 362 HLLTIEDPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLI 421
           +LL IEDPEE+L AL DAFTPGEEL+G+DVD LYTTPEKLH  M+ V+DAY+FS+EGTL+
Sbjct: 361 YLLAIEDPEERLCALNDAFTPGEELEGKDVDNLYTTPEKLHTLMRAVVDAYNFSQEGTLL 420

Query: 422 KEARDLMNSQVIVKLEELKHLVEKNFM 443
           +EARDLMN ++I KLEEL  +VE+NFM
Sbjct: 421 REARDLMNPKIIEKLEELIKVVERNFM 438

BLAST of CmoCh04G031220 vs. TAIR10
Match: AT1G36320.1 (AT1G36320.1 unknown protein)

HSP 1 Score: 512.3 bits (1318), Expect = 3.0e-145
Identity = 246/335 (73.43%), Positives = 298/335 (88.96%), Query Frame = 1

Query: 110 KDGVE--CLDQHKMTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRA 169
           KDG E   +D  +M KVCDKLI+VF+VDKPTP+DWRRL+AFSKEWDSIRPHF++RCQ+RA
Sbjct: 80  KDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLAFSKEWDSIRPHFYKRCQERA 139

Query: 170 ASEDDPGMQHKLLRLGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFF 229
            SED+P M+HK+ RL RKLKE+DED+QRHNE L V++   P+E+GE+++RRRKDFT EFF
Sbjct: 140 DSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTPPAEIGELVARRRKDFTNEFF 199

Query: 230 VHLHTVTESYYDDPTEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNL 289
            HLHTV ESYYD+P EQ+ALA LG   +AAVQAYD +TE+I+AL+AAE+K QDIINSP+L
Sbjct: 200 EHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTESIDALNAAEMKLQDIINSPSL 259

Query: 290 DAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLV 349
           DAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMK+EVKDILYHLYVTA GNLQRL+
Sbjct: 260 DAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEEVKDILYHLYVTARGNLQRLM 319

Query: 350 PTEIRILKHLLTIEDPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYH 409
           P E+RILK+LL+IEDP+E++SAL+DAFTPG+EL+G DVD LYTTPE L + MKTVL+AYH
Sbjct: 320 PKEVRILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDYLYTTPEHLQSLMKTVLEAYH 379

Query: 410 FSREGTLIKEARDLMNSQVIVKLEELKHLVEKNFM 443
           FSREG+L+KEA+DLM+ ++I K+E+LK LVEK +M
Sbjct: 380 FSREGSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414

BLAST of CmoCh04G031220 vs. TAIR10
Match: AT4G37920.1 (AT4G37920.1 unknown protein)

HSP 1 Score: 298.5 bits (763), Expect = 6.7e-81
Identity = 151/360 (41.94%), Positives = 241/360 (66.94%), Query Frame = 1

Query: 87  SETIDSSNSTPTRNGPVG-NVPNAKDG---VECLDQHKMTKVCDKLIDVFLVDKPTPTDW 146
           S TI  +  T T NG     V ++ +    VE  + + M + CDK+ID+FL +KP    W
Sbjct: 45  SSTITFATDTVTYNGTTSAEVKSSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQW 104

Query: 147 RRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLLRLGRKLKEIDEDVQRHNEFLEV 206
           +  +    EW+    +F++RC+ RA +E DP ++ KL+ L  K+K+ID+++++HN+ L+ 
Sbjct: 105 KTYLVLRDEWNKYSVNFYKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKE 164

Query: 207 LRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDDPTEQDALAKLGNSCLAAVQAYD 266
           ++   P+++  I ++RR+DFT EFF ++  ++E+  D   ++DA+A+L   CL+AV AYD
Sbjct: 165 IQ-ENPTDINAIAAKRRRDFTGEFFRYVTLLSETL-DGLEDRDAVARLATRCLSAVSAYD 224

Query: 267 AATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESN 326
              E++E LD A+ KF+DI+NSP++D+AC KI SLA+  +LDS+L+L+I  A++AAKES 
Sbjct: 225 NTLESVETLDTAQAKFEDILNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQ 284

Query: 327 MMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTIEDPEEKLSALKDAFTPGEELQG 386
            + +E KDI+YHLY     +L+ + P EI++LK+LL I DPEE+ SAL  AF+PG++ + 
Sbjct: 285 TVTNEAKDIMYHLYKATKSSLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEA 344

Query: 387 QDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARDLMNSQVIVKLEELKHLVEKNFM 443
           +D   LYTTP++LH W+K +LDAYH ++E T IKEA+ +    VI +L  LK  +E  ++
Sbjct: 345 KDPKALYTTPKELHKWIKIMLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYL 402

BLAST of CmoCh04G031220 vs. NCBI nr
Match: gi|659081270|ref|XP_008441243.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 712.6 bits (1838), Expect = 4.2e-202
Identity = 368/441 (83.45%), Positives = 395/441 (89.57%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIRDRIPAHGDASAAVACSPSSSSLSRIPARNFSMGSNTIEFRSLV 61
           MELH A L TSFSF IR +  A GDASAA  CSPSS SLSRI  RNFS+GS +  F SL+
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAA--CSPSSPSLSRITVRNFSLGSKSRGFPSLL 60

Query: 62  SRVRLMKSSCSAVVRGESAVPSECSSETIDSSNSTPTRNGPVGNVPNAKDGVECLDQHKM 121
            R R  KSS S  VRG SAVPS+C+SET+DS N +P  +  V +V NAKD VE LDQHKM
Sbjct: 61  CRDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVES--VRDVQNAKDSVESLDQHKM 120

Query: 122 TKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLLR 181
           TKVCDKLI+VF++DKPTPTDWRRLIAFSKEWD+IRPHFF RCQDRAASEDDPGM+HKLLR
Sbjct: 121 TKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR 180

Query: 182 LGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDDP 241
           LGRKLKEIDEDVQRHNE LEV+R  APSELGEIISRRRKDFTKEFFVHLHTV ESYYDDP
Sbjct: 181 LGRKLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDP 240

Query: 242 TEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEKN 301
            EQ+ALAKLGNSCLAAVQ YDAATENIEALDAAELKFQDIINSP LDAACRKID+LAEKN
Sbjct: 241 AEQNALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKN 300

Query: 302 QLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTIE 361
           QLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRL+P EIRILK+LLTI+
Sbjct: 301 QLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIK 360

Query: 362 DPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARDL 421
           DPEEKLSALKDAFTPGEE++GQDVDCLYTTP+KLHAW+KTV+DAYHFSREGTLIKEARDL
Sbjct: 361 DPEEKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDL 420

Query: 422 MNSQVIVKLEELKHLVEKNFM 443
           MN QVIVKLEELKHL+EK FM
Sbjct: 421 MNPQVIVKLEELKHLLEKKFM 437

BLAST of CmoCh04G031220 vs. NCBI nr
Match: gi|659081268|ref|XP_008441242.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 708.8 bits (1828), Expect = 6.1e-201
Identity = 368/442 (83.26%), Positives = 395/442 (89.37%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIRDRIPAHGDASAAVACSPSSSSLSRIPARNFSMGSNT-IEFRSL 61
           MELH A L TSFSF IR +  A GDASAA  CSPSS SLSRI  RNFS+GS +   F SL
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAA--CSPSSPSLSRITVRNFSLGSKSRAGFPSL 60

Query: 62  VSRVRLMKSSCSAVVRGESAVPSECSSETIDSSNSTPTRNGPVGNVPNAKDGVECLDQHK 121
           + R R  KSS S  VRG SAVPS+C+SET+DS N +P  +  V +V NAKD VE LDQHK
Sbjct: 61  LCRDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVES--VRDVQNAKDSVESLDQHK 120

Query: 122 MTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLL 181
           MTKVCDKLI+VF++DKPTPTDWRRLIAFSKEWD+IRPHFF RCQDRAASEDDPGM+HKLL
Sbjct: 121 MTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLL 180

Query: 182 RLGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDD 241
           RLGRKLKEIDEDVQRHNE LEV+R  APSELGEIISRRRKDFTKEFFVHLHTV ESYYDD
Sbjct: 181 RLGRKLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDD 240

Query: 242 PTEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEK 301
           P EQ+ALAKLGNSCLAAVQ YDAATENIEALDAAELKFQDIINSP LDAACRKID+LAEK
Sbjct: 241 PAEQNALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEK 300

Query: 302 NQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTI 361
           NQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRL+P EIRILK+LLTI
Sbjct: 301 NQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTI 360

Query: 362 EDPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARD 421
           +DPEEKLSALKDAFTPGEE++GQDVDCLYTTP+KLHAW+KTV+DAYHFSREGTLIKEARD
Sbjct: 361 KDPEEKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARD 420

Query: 422 LMNSQVIVKLEELKHLVEKNFM 443
           LMN QVIVKLEELKHL+EK FM
Sbjct: 421 LMNPQVIVKLEELKHLLEKKFM 438

BLAST of CmoCh04G031220 vs. NCBI nr
Match: gi|449441744|ref|XP_004138642.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 691.8 bits (1784), Expect = 7.8e-196
Identity = 355/441 (80.50%), Positives = 387/441 (87.76%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIRDRIPAHGDASAAVACSPSSSSLSRIPARNFSMGSNTIEFRSLV 61
           MELH A LHTSFSF IR    AHGDASAA  CSPS  SLSRI  RNFS+GS +  F SLV
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAA--CSPSLPSLSRITIRNFSLGSKSRGFPSLV 60

Query: 62  SRVRLMKSSCSAVVRGESAVPSECSSETIDSSNSTPTRNGPVGNVPNAKDGVECLDQHKM 121
              R  KSS SA VRG  AVPS+C+SET+D  N  P+ + PV +V NAKD VE LDQHKM
Sbjct: 61  CHDRPKKSSFSAFVRGVKAVPSDCNSETLDLLN--PSSDEPVRDVQNAKDSVENLDQHKM 120

Query: 122 TKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLLR 181
           TKVCDKLI+VF++DKPTP DWRRLIAFSKEWD+IRPHFF RCQDRAASEDDPGM+HKLLR
Sbjct: 121 TKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR 180

Query: 182 LGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDDP 241
            GRKLKEIDEDVQRHNE LEV+R  +PSELGEIISRRRKDFTKEFFVHLHTV +SYYDDP
Sbjct: 181 FGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDP 240

Query: 242 TEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEKN 301
            +Q+ALAKLGNSCLAAVQ YDAATENIEAL+AAELKFQDIINSP +DAACRKID+LAEKN
Sbjct: 241 AKQNALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKN 300

Query: 302 QLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTIE 361
           QLDSALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRL+P EIRILK+LLTI 
Sbjct: 301 QLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTIN 360

Query: 362 DPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARDL 421
           DPEEKLSALKDAFTPGEEL+GQDVDCLYTTPE+LH W+KTV+DAYHFSREGTL++EARDL
Sbjct: 361 DPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDL 420

Query: 422 MNSQVIVKLEELKHLVEKNFM 443
           MN Q+IVKLEELK L+EK FM
Sbjct: 421 MNPQLIVKLEELKGLIEKKFM 437

BLAST of CmoCh04G031220 vs. NCBI nr
Match: gi|778673035|ref|XP_011649916.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 688.0 bits (1774), Expect = 1.1e-194
Identity = 355/442 (80.32%), Positives = 387/442 (87.56%), Query Frame = 1

Query: 2   MELHCAFLHTSFSFPIRDRIPAHGDASAAVACSPSSSSLSRIPARNFSMGSNT-IEFRSL 61
           MELH A LHTSFSF IR    AHGDASAA  CSPS  SLSRI  RNFS+GS +   F SL
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAA--CSPSLPSLSRITIRNFSLGSKSRAGFPSL 60

Query: 62  VSRVRLMKSSCSAVVRGESAVPSECSSETIDSSNSTPTRNGPVGNVPNAKDGVECLDQHK 121
           V   R  KSS SA VRG  AVPS+C+SET+D  N  P+ + PV +V NAKD VE LDQHK
Sbjct: 61  VCHDRPKKSSFSAFVRGVKAVPSDCNSETLDLLN--PSSDEPVRDVQNAKDSVENLDQHK 120

Query: 122 MTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEWDSIRPHFFRRCQDRAASEDDPGMQHKLL 181
           MTKVCDKLI+VF++DKPTP DWRRLIAFSKEWD+IRPHFF RCQDRAASEDDPGM+HKLL
Sbjct: 121 MTKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLL 180

Query: 182 RLGRKLKEIDEDVQRHNEFLEVLRGAAPSELGEIISRRRKDFTKEFFVHLHTVTESYYDD 241
           R GRKLKEIDEDVQRHNE LEV+R  +PSELGEIISRRRKDFTKEFFVHLHTV +SYYDD
Sbjct: 181 RFGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDD 240

Query: 242 PTEQDALAKLGNSCLAAVQAYDAATENIEALDAAELKFQDIINSPNLDAACRKIDSLAEK 301
           P +Q+ALAKLGNSCLAAVQ YDAATENIEAL+AAELKFQDIINSP +DAACRKID+LAEK
Sbjct: 241 PAKQNALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEK 300

Query: 302 NQLDSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLVPTEIRILKHLLTI 361
           NQLDSALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRL+P EIRILK+LLTI
Sbjct: 301 NQLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTI 360

Query: 362 EDPEEKLSALKDAFTPGEELQGQDVDCLYTTPEKLHAWMKTVLDAYHFSREGTLIKEARD 421
            DPEEKLSALKDAFTPGEEL+GQDVDCLYTTPE+LH W+KTV+DAYHFSREGTL++EARD
Sbjct: 361 NDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARD 420

Query: 422 LMNSQVIVKLEELKHLVEKNFM 443
           LMN Q+IVKLEELK L+EK FM
Sbjct: 421 LMNPQLIVKLEELKGLIEKKFM 438

BLAST of CmoCh04G031220 vs. NCBI nr
Match: gi|225459407|ref|XP_002285817.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic [Vitis vinifera])

HSP 1 Score: 573.2 bits (1476), Expect = 4.0e-160
Identity = 288/410 (70.24%), Positives = 340/410 (82.93%), Query Frame = 1

Query: 37  SSSLSRIPARNFSMGSNTIEFRSLVSRVRLMKSSCSAVVRGESAVPSECSSETID-SSNS 96
           S S  R   RN          R LV+ +RL  SS +++V   +AVPS  + E +  SS+S
Sbjct: 20  SLSFIRNSRRNLCFTRKVEGRRLLVNPIRLQHSSVASIVGDTTAVPSRSTGEPLSFSSSS 79

Query: 97  TPTRNGPV---GNVPNAKDGVECLDQHKMTKVCDKLIDVFLVDKPTPTDWRRLIAFSKEW 156
            P  +  V   G   + KD  ECLD HKM +VCDKLI+VF+VDKPTPTDWRRL+AFSKEW
Sbjct: 80  DPAVDKDVIGYGEKRDGKDDPECLDNHKMIRVCDKLIEVFMVDKPTPTDWRRLLAFSKEW 139

Query: 157 DSIRPHFFRRCQDRAASEDDPGMQHKLLRLGRKLKEIDEDVQRHNEFLEVLRGAAPSELG 216
            +IRPHF+RRCQDRA SE DPG +H LLRLGRKLKEIDEDV+RHNE LEV++G  P+++ 
Sbjct: 140 SNIRPHFYRRCQDRADSEGDPGKKHSLLRLGRKLKEIDEDVKRHNELLEVIKGTPPADIS 199

Query: 217 EIISRRRKDFTKEFFVHLHTVTESYYDDPTEQDALAKLGNSCLAAVQAYDAATENIEALD 276
            ++++RRKDFTKEFFVHLHTV ESY+D+PTEQ+ALAKLGN CLAAVQ YD A+E+IEAL+
Sbjct: 200 AVVAKRRKDFTKEFFVHLHTVAESYHDNPTEQNALAKLGNMCLAAVQTYDTASESIEALN 259

Query: 277 AAELKFQDIINSPNLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKDEVKDIL 336
           AAELKFQDI+NSP+LD ACRKIDSLAEKNQLDSALVLMITKAWSAAKESNM KDEVKD+L
Sbjct: 260 AAELKFQDILNSPSLDVACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMTKDEVKDVL 319

Query: 337 YHLYVTASGNLQRLVPTEIRILKHLLTIEDPEEKLSALKDAFTPGEELQGQDVDCLYTTP 396
           +HLY TA GNLQRL+P EIRILK+LLTIEDPEEK+SALKDAFTPG+E++G+DVDCLYTTP
Sbjct: 320 FHLYTTARGNLQRLMPKEIRILKYLLTIEDPEEKMSALKDAFTPGDEIEGKDVDCLYTTP 379

Query: 397 EKLHAWMKTVLDAYHFSREGTLIKEARDLMNSQVIVKLEELKHLVEKNFM 443
           EKLH WM+TV+DA+HFSREGTLI+EARDLMN ++I KLEELK LV+ NFM
Sbjct: 380 EKLHTWMQTVVDAFHFSREGTLIREARDLMNPKIIQKLEELKKLVQDNFM 429

The following BLAST results are available for this feature:

Match Name    E-value   Identity  Description
Y4920_ARATH   1.2e-79   41.94     Uncharacterized protein At4g37920, chloroplastic OS=Arabidopsis thaliana GN=At4g...

Match Name         E-value    Identity  Description
A0A0A0LMH6_CUCSA   5.4e-196   80.50     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G403700 PE=4 SV=1
E0CRE9_VITVI       2.8e-160   70.24     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g04580 PE=4 SV=...
W9RSH2_9ROSA       1.6e-155   74.26     Uncharacterized protein OS=Morus notabilis GN=L484_021538 PE=4 SV=1
A0A0D2TUM3_GOSRA   1.3e-154   66.00     Uncharacterized protein OS=Gossypium raimondii GN=B456_009G241300 PE=4 SV=1
A0A061FGT6_THECC   3.3e-153   66.44     Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_034989 PE=4 SV=1

Match Name    E-value    Identity  Description
AT1G36320.1   3.0e-145   73.43     unknown protein
AT4G37920.1   6.7e-81    41.94     unknown protein

Match Name                         E-value    Identity  Description
gi|659081270|ref|XP_008441243.1|   4.2e-202   83.45     PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis ...
gi|659081268|ref|XP_008441242.1|   6.1e-201   83.26     PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ...
gi|449441744|ref|XP_004138642.1|   7.8e-196   80.50     PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis ...
gi|778673035|ref|XP_011649916.1|   1.1e-194   80.32     PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ...
gi|225459407|ref|XP_002285817.1|   4.0e-160   70.24     PREDICTED: uncharacterized protein At4g37920, chloroplastic [Vitis vinifera]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0009073      aromatic amino acid family biosynthetic process
biological_process  GO:0016226      iron-sulfur cluster assembly
biological_process  GO:0009058      biosynthetic process
biological_process  GO:0009987      cellular process
biological_process  GO:0010508      positive regulation of autophagy
biological_process  GO:0006144      purine nucleobase metabolic process
biological_process  GO:0006206      pyrimidine nucleobase metabolic process
biological_process  GO:0006351      transcription, DNA-templated
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005730      nucleolus
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003899      DNA-directed RNA polymerase activity

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
CmoCh04G031220.1   CmoCh04G031220.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita moschata
Date Performed: 2017-05-19
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  PANTHER  PTHR31755      FAMILY NOT NAMED     coord: 30..442, score: 1.0E
None      No IPR available  PANTHER  PTHR31755:SF1  SUBFAMILY NOT NAMED  coord: 30..442, score: 1.0E