Cla011898 (gene) Watermelon (97103) v1

NameCla011898
Typegene
OrganismCitrullus. lanatus (Watermelon (97103) v1)
DescriptionEndoribonuclease E-like protein (AHRD V1 **-- Q656E2_ORYSJ)
LocationChr11 : 3033324 .. 3037100 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAACTCCATTGCTCAACTCTCCAGACCTCATTATCCTTCTCTGTCAGAGGAAAAGTTCTTGCTCATGGAGACGCCTCTGCGGCCTGCTCTCCCTCCTCGTCTTCGTTTTCAAGAATTACAGCTCGAAACTTCTCTTTGGGTCCAAAAAGTAGAGGTAAGGTAGCTAACTCTGAACTTTGTTAAGTTTCCGATGAGTCGCTAAGGATGAACCATGCGATTGATATCTCTGTAGTATTGCTCAATGGTTTCAATTTCAGACGGTTGTTAAAAGGGGAACGTGGAGAAAAAGGATAAAGATTTTGATTCTGTGCTCTTTCCGCGTTCTGTTCATAATACGGCTGAACTATTCACGACTTGGAAAAATGGTCATTAAGTATTGTTTAAACCGAGACGTGGAAATTCGGCCACAGAAATTTTCTCTCTATTTACATATGTGTGTGTGTCTGTGTCTGTGTGTGTATAGCTATGCATTGGTCTAGATGCTGAATATCTCTTGTTTGAATTATTGATTCCTTATAGTTCGTGTTTCAAGCTTCATATTGACTTTAGCTTACCAGTCACAAAGTGAAGGTAGTATGAATAGTCTAAAATTGTAAAAACTTAAGAAAGTTTATCACAATGTTCTTTATTAAAGTAGAGTGAAGTATTAGTAATTTTGTTGAATAGCAGGGTTCCCTTCACTGGTATGTCGAGTTAGACTGAAGAACTCATCCTTTTCTGCCGTTGTCAGAGGGGAGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTCGTCGAATCCCAATCCCAATGAGCTGGTAAGAGATGTTCAAAATGCAAAAGACAGTGTTGAAAGCTTGGACCAGCATAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACCCCAACAGATTGGAGACGGTTAATCGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGTCTCGGAAGATGATCCTGGGATGAGGCATAAGCTACTCAAGCTTGGAAGAAAGTTGAAAGAGGTATGTCTTGAGAAAGGCAGGCCTTGTTTCTATAAAATAATTTATTGAGCAAATAAGATGAGTATAAACTAACACTTCGTATGCAAATAGGTTATTAGGTCTATGATTGTAAAAGCCCCAGAGGATGTGTTACTGAGGGGGTTAAGATGAATTGTTGACAATGACATGATTATTTTATTGATCAGATAATAGTTGCAGTATGTGAGTTTTCAATCGACCTTTTCTTTTCCCTTCTACAAAATTTTCACAGATTGATGAAGATGTGCAGAGACACAATGAACTTCTTGAAGTGGTCAGAGCAGTGGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTACATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGGTAATACTGCTTACTTTCCAGTTGGATAAATAGTAATGAATAAATAGCTTCCCATGGGGCTGAAGAAGTGTAATTTAAGAAGTAAATATGCGGGTAGAAAAATACCTTTTTGGTCTCCAAATTTTTAGTATAGTTTTGATTTGTTCATTAAGTTTTAAAATGTTACATTGAGTTTTGAGTTTAATTTCCATTTGATCCTTCGAAGTTAAAATGTCATCTTGAGATGACCAAATGGAAAATATACTCAAAACTCAGGGCAAAGGGATAATTTCTCCTATATTGTTTTAGGCTATTCAAAGGATGAGTGGGCCCTTGAAATTGTTCCTTATCTTAAAATCTACCTTAAGTAGCTAGATTGGTTAAGACATTATGTGTGTTTGTCATATGTTCAAATCTTTACCTCCACATTCTTTAATTTTTTTTTCCTTTTGTTAATGATGTCTTCTTTCTTTAGCAAACGTCTTTTTTCTTATATGCATATATAAGATTTGGTATTGATGGACGAACTGATATTTTCCCCTTATATTTCAACTGATAAGCTTTGGCAAAGCTTGGGAATTCCTGCCTTGCTGCTGTACAGACATTCGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATCAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAGCATGATGAAAGATGAGGTAATGCGTTTGCTTGCTACCCTTGCGACCATTGATGCAAATGTGTGGTTGTATGGTCCTATTTCTATGGTCTAGTAGGTTGGCAAGGCATCATCCGACCATTCAAGAAAATTCAAATGATTGTGTTAAGTCAGTAATTACGGAAACTTCCAATAACACTTCAAATACCATTATCTGTTTGAATGAAATGAGTACAGCTTAGGGCATCAGAGTAAAAACAATACCAATTATTTCCCTCGAGTTTATGTGTTTCTATGTTGGGTCATGCGAGCAAATGGAGAATGTACCTCTAATTGATGTGCTATTGTTGCGTAATGACTTTCTGATGGTGTAAGCGTCGGTGTTGTTATCTCTTTGTTAAGCGTTGGTGTTGTTTTCTCTTTGTTAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGTGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTAAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCATTCACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGTATGCTTTCGAACCTTATTGATATATTGAAATACTAATATGCCATAGTCTATTAGCTTGAACCAAAATTAGTGATTTAGTATGATCTCAGATCTTCTGTGTTCAAACTCTTCTGGTTCTATTTTCTATAATGTTGATAGTCTACTTGTTCAATACTCAATATTGAGCCAATTTTTGAGCCTACAAATGGGATGTTAAGTATATATTCAAAGAATTAGATATACGTTTAGATATAACCGAACTTATTAGCGGTGAGTTAGCACTCATTGTCGTCATGCCTTCTCATCAAACAAGAAAGAAAGCCTTGGTCCCGTCAGATCTCTCGAACCTTCAATGTCCATCTAGTTAAGAGTATGGATATCACCCTGACTTTACATTTGCGCAAAGCATACCACAATTACTTGAATGTTGTATGGAAAATGACCTTTAATTAATAATATTGCTTTGTCTGTTGTGCTCTATTCTTGGTATATTTTTTTTAACATCCGTGAATGTTCGGGTTAGCTCACGTGCACTTCGACTTATCTCATGAGACAACCTGCTTGACCCTAAAACATTTGGTTGCCAAGGGAACTCGTAGGATATTAATTCTTGGATAGGTGGCCATCATGGATTAAACTAGTGACATTTTAACTATTTATTGAGATTATGTCTCCTATTGTACCACTAGGTTAATACAGGATGGTTTATACTTGGTATCACTTTAGGCTTCTCCTTCTTTTTTAAAAAAAGATAATCTTATTAGGTTCTTTTTCTACTTCCCACTTTTAGGACACCAGAGAAGCTTCATACATGGATAAAGACAGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAAGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAGGAATTGAAGCATCTCCTTGAGAAAAAATTCATGTGA

mRNA sequence

ATGGAACTCCATTGCTCAACTCTCCAGACCTCATTATCCTTCTCTGTCAGAGGAAAAGTTCTTGCTCATGGAGACGCCTCTGCGGCCTGCTCTCCCTCCTCGTCTTCGTTTTCAAGAATTACAGCTCGAAACTTCTCTTTGGGTCCAAAAAGTAGAGGGTTCCCTTCACTGGTATGTCGAGTTAGACTGAAGAACTCATCCTTTTCTGCCGTTGTCAGAGGGGAGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTCGTCGAATCCCAATCCCAATGAGCTGGTAAGAGATGTTCAAAATGCAAAAGACAGTGTTGAAAGCTTGGACCAGCATAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACCCCAACAGATTGGAGACGGTTAATCGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGTCTCGGAAGATGATCCTGGGATGAGGCATAAGCTACTCAAGCTTGGAAGAAAGTTGAAAGAGATTGATGAAGATGTGCAGAGACACAATGAACTTCTTGAAGTGGTCAGAGCAGTGGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTACATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGCTTTGGCAAAGCTTGGGAATTCCTGCCTTGCTGCTGTACAGACATTCGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATCAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAGCATGATGAAAGATGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGTGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTAAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCATTCACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACACCAGAGAAGCTTCATACATGGATAAAGACAGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAAGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAGGAATTGAAGCATCTCCTTGAGAAAAAATTCATGTGA

Coding sequence (CDS)

ATGGAACTCCATTGCTCAACTCTCCAGACCTCATTATCCTTCTCTGTCAGAGGAAAAGTTCTTGCTCATGGAGACGCCTCTGCGGCCTGCTCTCCCTCCTCGTCTTCGTTTTCAAGAATTACAGCTCGAAACTTCTCTTTGGGTCCAAAAAGTAGAGGGTTCCCTTCACTGGTATGTCGAGTTAGACTGAAGAACTCATCCTTTTCTGCCGTTGTCAGAGGGGAGAGTGCAGTACCTAGTGATTGTAATTCAGAAACTCTTGATTCGTCGAATCCCAATCCCAATGAGCTGGTAAGAGATGTTCAAAATGCAAAAGACAGTGTTGAAAGCTTGGACCAGCATAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACCCCAACAGATTGGAGACGGTTAATCGCTTTCAGCAAGGAATGGGACAATATCCGTCCCCATTTCTTTAGGAGGTGCCAAGATCGAGCTGTCTCGGAAGATGATCCTGGGATGAGGCATAAGCTACTCAAGCTTGGAAGAAAGTTGAAAGAGATTGATGAAGATGTGCAGAGACACAATGAACTTCTTGAAGTGGTCAGAGCAGTGGCACCATCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACAAAAGAATTCTTTGTACATCTTCACACGGTGGCTGAATCTTATTATGATGATCCGGCTGAGCAAAATGCTTTGGCAAAGCTTGGGAATTCCTGCCTTGCTGCTGTACAGACATTCGATGCTGCAACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAGGATATCATCAATTCTCCAACTTTAGATGCTGCTTGCAGAAAGATAGACAATTTGGCGGAGAAAAACCAACTTGATTCTGCATTAGTATTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCGAGCATGATGAAAGATGAGGTGAAAGACATACTATACCACTTGTACGTGACTGCAAGTGGTAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATTCTAAAGTATCTTCTCACAATTAAGGATCCTGAGGAAAAGCTAAGTGCTCTGAAGGATGCATTCACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACACCAGAGAAGCTTCATACATGGATAAAGACAGTGGTAGATGCTTATCATTTCAGCCGGGAAGGCACTCTCATTAAAGAAGCCAGAGACCTTATGAATCCACAGGTCATTGTTAAACTTGAGGAATTGAAGCATCTCCTTGAGAAAAAATTCATGTGA

Protein sequence

MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSLVCRVRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDVQNAKDSVESLDQHKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKLLKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKKFM
BLAST of Cla011898 vs. Swiss-Prot
Match: Y4920_ARATH (Uncharacterized protein At4g37920, chloroplastic OS=Arabidopsis thaliana GN=At4g37920 PE=1 SV=2)

HSP 1 Score: 301.2 bits (770), Expect = 1.8e-80
Identity = 156/402 (38.81%), Postives = 255/402 (63.43%), Query Frame = 1

Query: 45  FSLGPKSRGFPSLVCRVRLKNSS-----FSAVVRGESAVPSDCN----SETLDSSNPNPN 104
           FS   K   FP        KNS      FSA + G   +         ++T+  +     
Sbjct: 11  FSSADKLLSFPP-------KNSQTHHLPFSAFINGGRKIRKSSTITFATDTVTYNGTTSA 70

Query: 105 ELVRDVQNAKDSVESLDQHKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF 164
           E+   V++  + VE  + + M + CDK+I++F+ +KP    W+  +    EW+    +F+
Sbjct: 71  EVKSSVEDPME-VEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFY 130

Query: 165 RRCQDRAVSEDDPGMRHKLLKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRK 224
           +RC+ RA +E DP ++ KL+ L  K+K+ID+++++HN+LL+ ++   P+++  I ++RR+
Sbjct: 131 KRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQE-NPTDINAIAAKRRR 190

Query: 225 DFTKEFFVHLHTVAESYYDDPAEQNALAKLGNSCLAAVQTFDAATENIEALNAAELKFQD 284
           DFT EFF ++  ++E+  D   +++A+A+L   CL+AV  +D   E++E L+ A+ KF+D
Sbjct: 191 DFTGEFFRYVTLLSETL-DGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFED 250

Query: 285 IINSPTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESSMMKDEVKDILYHLYVTAS 344
           I+NSP++D+AC KI +LA+  +LDS+L+L+I  A++AAKES  + +E KDI+YHLY    
Sbjct: 251 ILNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATK 310

Query: 345 GNLQRLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIK 404
            +L+ + PKEI++LKYLL I DPEE+ SAL  AF+PG++ E +D   LYTTP++LH WIK
Sbjct: 311 SSLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIK 370

Query: 405 TVVDAYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKKFM 438
            ++DAYH ++E T IKEA+ +  P VI +L  LK  +E +++
Sbjct: 371 IMLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYL 402

BLAST of Cla011898 vs. TrEMBL
Match: A0A0A0LMH6_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G403700 PE=4 SV=1)

HSP 1 Score: 775.0 bits (2000), Expect = 4.8e-221
Identity = 386/437 (88.33%), Postives = 406/437 (92.91%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSLVCR 60
           MELH +TL TS SFS+R   LAHGDASAACSPS  S SRIT RNFSLG KSRGFPSLVC 
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRGFPSLVCH 60

Query: 61  VRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDVQNAKDSVESLDQHKMTKVC 120
            R K SSFSA VRG  AVPSDCNSETLD  NP+ +E VRDVQNAKDSVE+LDQHKMTKVC
Sbjct: 61  DRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKLLKLGRK 180
           DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFF RCQDRA SEDDPGM+HKLL+ GRK
Sbjct: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
           LKEIDEDVQRHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240

Query: 241 ALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
           ALAKLGNSCLAAVQT+DAATENIEALNAAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
           ALVLMITKAWSAAKES+MMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
           KLSALKDAFTPGEELEGQDVDCLYTTPE+LHTW+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420

Query: 421 VIVKLEELKHLLEKKFM 438
           +IVKLEELK L+EKKFM
Sbjct: 421 LIVKLEELKGLIEKKFM 437

BLAST of Cla011898 vs. TrEMBL
Match: E0CRE9_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g04580 PE=4 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 1.2e-166
Identity = 300/443 (67.72%), Postives = 355/443 (80.14%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSLVCR 60
           MEL  +TLQ   SF V    L              SF R + RN     K  G   LV  
Sbjct: 1   MELASTTLQARSSFQVIAPSL--------------SFIRNSRRNLCFTRKVEGRRLLVNP 60

Query: 61  VRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDV------QNAKDSVESLDQH 120
           +RL++SS +++V   +AVPS    E L  S+ +   + +DV      ++ KD  E LD H
Sbjct: 61  IRLQHSSVASIVGDTTAVPSRSTGEPLSFSSSSDPAVDKDVIGYGEKRDGKDDPECLDNH 120

Query: 121 KMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKL 180
           KM +VCDKLIEVFM+DKPTPTDWRRL+AFSKEW NIRPHF+RRCQDRA SE DPG +H L
Sbjct: 121 KMIRVCDKLIEVFMVDKPTPTDWRRLLAFSKEWSNIRPHFYRRCQDRADSEGDPGKKHSL 180

Query: 181 LKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYD 240
           L+LGRKLKEIDEDV+RHNELLEV++   P+++  ++++RRKDFTKEFFVHLHTVAESY+D
Sbjct: 181 LRLGRKLKEIDEDVKRHNELLEVIKGTPPADISAVVAKRRKDFTKEFFVHLHTVAESYHD 240

Query: 241 DPAEQNALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAE 300
           +P EQNALAKLGN CLAAVQT+D A+E+IEALNAAELKFQDI+NSP+LD ACRKID+LAE
Sbjct: 241 NPTEQNALAKLGNMCLAAVQTYDTASESIEALNAAELKFQDILNSPSLDVACRKIDSLAE 300

Query: 301 KNQLDSALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLT 360
           KNQLDSALVLMITKAWSAAKES+M KDEVKD+L+HLY TA GNLQRLMPKEIRILKYLLT
Sbjct: 301 KNQLDSALVLMITKAWSAAKESNMTKDEVKDVLFHLYTTARGNLQRLMPKEIRILKYLLT 360

Query: 361 IKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEAR 420
           I+DPEEK+SALKDAFTPG+E+EG+DVDCLYTTPEKLHTW++TVVDA+HFSREGTLI+EAR
Sbjct: 361 IEDPEEKMSALKDAFTPGDEIEGKDVDCLYTTPEKLHTWMQTVVDAFHFSREGTLIREAR 420

Query: 421 DLMNPQVIVKLEELKHLLEKKFM 438
           DLMNP++I KLEELK L++  FM
Sbjct: 421 DLMNPKIIQKLEELKKLVQDNFM 429

BLAST of Cla011898 vs. TrEMBL
Match: A0A059C054_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F04381 PE=4 SV=1)

HSP 1 Score: 572.0 bits (1473), Expect = 6.2e-160
Identity = 287/408 (70.34%), Postives = 335/408 (82.11%), Query Frame = 1

Query: 32  PSSSSFSRITARNFSLGPKSRGFPSLVCRVRLKNSSFSAVVRGESAVPSDCNSETLDSSN 91
           PS +S    T   FS      G   L   VR +N   +A+    + VP       +  S 
Sbjct: 70  PSLASSRGATESFFS------GSQMLADPVRCQNFFLAAMADDTTVVPERSTLHDMPESE 129

Query: 92  PNPNELVRDVQNAKDSVESL--DQHKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDN 151
            + N       N KD+   L  D +KM +VCDKLIEVF++DKPTPTDWRRL+AFSKEW +
Sbjct: 130 DSDNNPAATDGNEKDNSSDLGLDDNKMLRVCDKLIEVFLVDKPTPTDWRRLLAFSKEWSD 189

Query: 152 IRPHFFRRCQDRAVSEDDPGMRHKLLKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEI 211
           IRPHFFRRCQ+RA +ED+PGM+HKLL+LGRKLKE+DEDVQRH+ELLEV+R  APSE+ E+
Sbjct: 190 IRPHFFRRCQERADNEDNPGMKHKLLRLGRKLKEVDEDVQRHDELLEVIRT-APSEINEV 249

Query: 212 ISRRRKDFTKEFFVHLHTVAESYYDDPAEQNALAKLGNSCLAAVQTFDAATENIEALNAA 271
           ++RRRKDFTKEFFVH+HTVAESYYD+P+EQNA+AKLGN CLA VQ +D ATE+IEALN A
Sbjct: 250 VARRRKDFTKEFFVHVHTVAESYYDNPSEQNAVAKLGNMCLAVVQAYDTATESIEALNTA 309

Query: 272 ELKFQDIINSPTLDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESSMMKDEVKDILYH 331
           ELKFQDIINSP+LDAAC+KIDNLA KN+LDSALVLMITKAWSAAKES+MMKDEVKDILYH
Sbjct: 310 ELKFQDIINSPSLDAACKKIDNLAAKNELDSALVLMITKAWSAAKESNMMKDEVKDILYH 369

Query: 332 LYVTASGNLQRLMPKEIRILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEK 391
           LYV+A GNLQRLMPKEIRILKYLLTI+DPE+K SALKDAFTPGEELEG D+DCLYTTPEK
Sbjct: 370 LYVSARGNLQRLMPKEIRILKYLLTIEDPEQKASALKDAFTPGEELEGNDIDCLYTTPEK 429

Query: 392 LHTWIKTVVDAYHFSREGTLIKEARDLMNPQVIVKLEELKHLLEKKFM 438
           LHTWI+TV+DAYHFSREGTLIKEARDLMNP++I KL+ELK L+E ++M
Sbjct: 430 LHTWIRTVIDAYHFSREGTLIKEARDLMNPKIIEKLKELKKLVEDQYM 470

BLAST of Cla011898 vs. TrEMBL
Match: A0A0D2TUM3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G241300 PE=4 SV=1)

HSP 1 Score: 570.9 bits (1470), Expect = 1.4e-159
Identity = 300/451 (66.52%), Postives = 360/451 (79.82%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKV--LAHGDASAACSPS--SSSFSRITARNFSLGPKSRGFPS 60
           MEL C++ Q    F VR K       ++S A  PS  SS+ SRI      L  K   +P+
Sbjct: 1   MELACASFQACSIFPVRLKSSKATKFESSLALLPSCKSSNSSRIRC----LSSKFLSYPA 60

Query: 61  LVCRVRLKNS---SFSAVVRGESAVPSDCNSETLDSSNPNPNELV-----RDVQNAKD-- 120
           L C  R+      S +AVV  ++AVP++C+ E +  S+   + L+     RD +N  D  
Sbjct: 61  L-CINRVSGQQRFSLAAVVGDKTAVPNNCDEEKISDSDSAGSSLINDEVTRDGENDGDKG 120

Query: 121 SVESLDQHKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSED 180
           SVE +D  KM +VCDKLIEVF++DKPTPTDWRRL+AFSKEW+NIRPHFF+RCQ+RA  E 
Sbjct: 121 SVEGMDSVKMIRVCDKLIEVFLVDKPTPTDWRRLLAFSKEWNNIRPHFFQRCQERADVEG 180

Query: 181 DPGMRHKLLKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLH 240
           DPGM+HKLL+LGRKLKEIDED+QRHNELLEV++  +PSE+ E+++RRRKDFTKEFFVH+H
Sbjct: 181 DPGMKHKLLRLGRKLKEIDEDIQRHNELLEVIKG-SPSEISEMVARRRKDFTKEFFVHIH 240

Query: 241 TVAESYYDDPAEQNALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAAC 300
           TVAESYYD+P EQNALAKLGN+CLAAVQ +D A EN+EALNAAELKFQDIINSP+LD AC
Sbjct: 241 TVAESYYDNPTEQNALAKLGNTCLAAVQAYDTAAENVEALNAAELKFQDIINSPSLDVAC 300

Query: 301 RKIDNLAEKNQLDSALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEI 360
           RKID+LAEKNQLDSALVLMITKAWSAAKES+M KDEVKDILYHLY+TA GNLQRL+PKEI
Sbjct: 301 RKIDSLAEKNQLDSALVLMITKAWSAAKESNMTKDEVKDILYHLYMTARGNLQRLLPKEI 360

Query: 361 RILKYLLTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSRE 420
           RI+KYLLTI+DPEE+L AL DAFTPGEELEG D+D LYTTPEKLHT ++ VVDAY+FS E
Sbjct: 361 RIVKYLLTIEDPEERLCALNDAFTPGEELEGSDMDNLYTTPEKLHTMMRAVVDAYNFSHE 420

Query: 421 GTLIKEARDLMNPQVIVKLEELKHLLEKKFM 438
           GTL++EARDLMNP++I KL EL  ++EK FM
Sbjct: 421 GTLLREARDLMNPKIIEKLGELIKIVEKNFM 445

BLAST of Cla011898 vs. TrEMBL
Match: A0A061FGT6_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_034989 PE=4 SV=1)

HSP 1 Score: 568.5 bits (1464), Expect = 6.8e-159
Identity = 301/445 (67.64%), Postives = 362/445 (81.35%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVR---GKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSL 60
           MEL  ++ Q      VR    K  ++G +SA+ S + +S S   +R   L  K    P+L
Sbjct: 1   MELASASRQACCLLPVRLKSSKATSYGSSSASLSSTKNSNS---SRIRCLSSKFSSCPAL 60

Query: 61  VC-RVRLKNSSFSAVVRGESAVPSDCNSET---LDSSNPNP-NELVRDVQNAKDSVESLD 120
           +  +VRL+    +AVV  ++AVP+DC+ E    LDS + +P N+ V +  + K SVE LD
Sbjct: 61  IVNQVRLRRP-LAAVVGDKTAVPNDCDGEQVAGLDSFDSSPINDDVNE--DEKGSVEGLD 120

Query: 121 QHKMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRH 180
             KM +VCDKLIEVFM+DKPTPTDWRRL+AFSKEW +IRPHFF+RCQDRA  E DPGM+H
Sbjct: 121 NSKMIRVCDKLIEVFMVDKPTPTDWRRLLAFSKEWSSIRPHFFKRCQDRADGEADPGMKH 180

Query: 181 KLLKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESY 240
           KLL+LGRKLKEIDEDVQRHNELLEVV+  APSE+ EI++RRRKDFTKEFFVHLHTVAES 
Sbjct: 181 KLLRLGRKLKEIDEDVQRHNELLEVVKG-APSEVSEIVARRRKDFTKEFFVHLHTVAESC 240

Query: 241 YDDPAEQNALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNL 300
           YD+P EQNALAKLGN+CLAAVQ +D ATE+IEA+NAAELKFQDIINSP+LD AC+KID+L
Sbjct: 241 YDNPTEQNALAKLGNTCLAAVQAYDTATESIEAINAAELKFQDIINSPSLDVACQKIDSL 300

Query: 301 AEKNQLDSALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYL 360
           A KNQLDSAL+LMITKAWSAAKES+M KDEVKDILYHLY+TA GNLQRL+PKEIRI+KYL
Sbjct: 301 AAKNQLDSALMLMITKAWSAAKESNMTKDEVKDILYHLYMTARGNLQRLLPKEIRIVKYL 360

Query: 361 LTIKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKE 420
           L I+DPEE+L AL DAFTPGEELEG+DVD LYTTPEKLHT ++ VVDAY+FS+EGTL++E
Sbjct: 361 LAIEDPEERLCALNDAFTPGEELEGKDVDNLYTTPEKLHTLMRAVVDAYNFSQEGTLLRE 420

Query: 421 ARDLMNPQVIVKLEELKHLLEKKFM 438
           ARDLMNP++I KLEEL  ++E+ FM
Sbjct: 421 ARDLMNPKIIEKLEELIKVVERNFM 438

BLAST of Cla011898 vs. NCBI nr
Match: gi|659081270|ref|XP_008441243.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 804.7 bits (2077), Expect = 8.1e-230
Identity = 405/437 (92.68%), Postives = 416/437 (95.19%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSLVCR 60
           MELH +TLQTS SFS+RGK LA GDASAACSPSS S SRIT RNFSLG KSRGFPSL+CR
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRGFPSLLCR 60

Query: 61  VRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDVQNAKDSVESLDQHKMTKVC 120
            R K SSFS  VRG SAVPSDCNSETLDS NP+P E VRDVQNAKDSVESLDQHKMTKVC
Sbjct: 61  DRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKLLKLGRK 180
           DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF RCQDRA SEDDPGM+HKLL+LGRK
Sbjct: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
           LKEIDEDVQRHNELLEVVRA APSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240

Query: 241 ALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
           ALAKLGNSCLAAVQT+DAATENIEAL+AAELKFQDIINSPTLDAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
           ALVLMITKAWSAAKES+MMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
           KLSALKDAFTPGEE+EGQDVDCLYTTP+KLH WIKTVVDAYHFSREGTLIKEARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNPQ 420

Query: 421 VIVKLEELKHLLEKKFM 438
           VIVKLEELKHLLEKKFM
Sbjct: 421 VIVKLEELKHLLEKKFM 437

BLAST of Cla011898 vs. NCBI nr
Match: gi|659081268|ref|XP_008441242.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 800.0 bits (2065), Expect = 2.0e-228
Identity = 405/438 (92.47%), Postives = 416/438 (94.98%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSR-GFPSLVC 60
           MELH +TLQTS SFS+RGK LA GDASAACSPSS S SRIT RNFSLG KSR GFPSL+C
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRAGFPSLLC 60

Query: 61  RVRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDVQNAKDSVESLDQHKMTKV 120
           R R K SSFS  VRG SAVPSDCNSETLDS NP+P E VRDVQNAKDSVESLDQHKMTKV
Sbjct: 61  RDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKLLKLGR 180
           CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFF RCQDRA SEDDPGM+HKLL+LGR
Sbjct: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGR 180

Query: 181 KLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
           KLKEIDEDVQRHNELLEVVRA APSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240

Query: 241 NALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
           NALAKLGNSCLAAVQT+DAATENIEAL+AAELKFQDIINSPTLDAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
           SALVLMITKAWSAAKES+MMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNP 420
           EKLSALKDAFTPGEE+EGQDVDCLYTTP+KLH WIKTVVDAYHFSREGTLIKEARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNP 420

Query: 421 QVIVKLEELKHLLEKKFM 438
           QVIVKLEELKHLLEKKFM
Sbjct: 421 QVIVKLEELKHLLEKKFM 438

BLAST of Cla011898 vs. NCBI nr
Match: gi|449441744|ref|XP_004138642.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis sativus])

HSP 1 Score: 775.0 bits (2000), Expect = 6.9e-221
Identity = 386/437 (88.33%), Postives = 406/437 (92.91%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSLVCR 60
           MELH +TL TS SFS+R   LAHGDASAACSPS  S SRIT RNFSLG KSRGFPSLVC 
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRGFPSLVCH 60

Query: 61  VRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDVQNAKDSVESLDQHKMTKVC 120
            R K SSFSA VRG  AVPSDCNSETLD  NP+ +E VRDVQNAKDSVE+LDQHKMTKVC
Sbjct: 61  DRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKLLKLGRK 180
           DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFF RCQDRA SEDDPGM+HKLL+ GRK
Sbjct: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240
           LKEIDEDVQRHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240

Query: 241 ALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300
           ALAKLGNSCLAAVQT+DAATENIEALNAAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360
           ALVLMITKAWSAAKES+MMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420
           KLSALKDAFTPGEELEGQDVDCLYTTPE+LHTW+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420

Query: 421 VIVKLEELKHLLEKKFM 438
           +IVKLEELK L+EKKFM
Sbjct: 421 LIVKLEELKGLIEKKFM 437

BLAST of Cla011898 vs. NCBI nr
Match: gi|778673035|ref|XP_011649916.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis sativus])

HSP 1 Score: 770.4 bits (1988), Expect = 1.7e-219
Identity = 386/438 (88.13%), Postives = 406/438 (92.69%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSR-GFPSLVC 60
           MELH +TL TS SFS+R   LAHGDASAACSPS  S SRIT RNFSLG KSR GFPSLVC
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRAGFPSLVC 60

Query: 61  RVRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDVQNAKDSVESLDQHKMTKV 120
             R K SSFSA VRG  AVPSDCNSETLD  NP+ +E VRDVQNAKDSVE+LDQHKMTKV
Sbjct: 61  HDRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKLLKLGR 180
           CDKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFF RCQDRA SEDDPGM+HKLL+ GR
Sbjct: 121 CDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGR 180

Query: 181 KLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240
           KLKEIDEDVQRHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+Q
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQ 240

Query: 241 NALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300
           NALAKLGNSCLAAVQT+DAATENIEALNAAELKFQDIINSPT+DAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360
           SALVLMITKAWSAAKES+MMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNP 420
           EKLSALKDAFTPGEELEGQDVDCLYTTPE+LHTW+KTVVDAYHFSREGTL++EARDLMNP
Sbjct: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNP 420

Query: 421 QVIVKLEELKHLLEKKFM 438
           Q+IVKLEELK L+EKKFM
Sbjct: 421 QLIVKLEELKGLIEKKFM 438

BLAST of Cla011898 vs. NCBI nr
Match: gi|225459407|ref|XP_002285817.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic [Vitis vinifera])

HSP 1 Score: 594.3 bits (1531), Expect = 1.7e-166
Identity = 300/443 (67.72%), Postives = 355/443 (80.14%), Query Frame = 1

Query: 1   MELHCSTLQTSLSFSVRGKVLAHGDASAACSPSSSSFSRITARNFSLGPKSRGFPSLVCR 60
           MEL  +TLQ   SF V    L              SF R + RN     K  G   LV  
Sbjct: 1   MELASTTLQARSSFQVIAPSL--------------SFIRNSRRNLCFTRKVEGRRLLVNP 60

Query: 61  VRLKNSSFSAVVRGESAVPSDCNSETLDSSNPNPNELVRDV------QNAKDSVESLDQH 120
           +RL++SS +++V   +AVPS    E L  S+ +   + +DV      ++ KD  E LD H
Sbjct: 61  IRLQHSSVASIVGDTTAVPSRSTGEPLSFSSSSDPAVDKDVIGYGEKRDGKDDPECLDNH 120

Query: 121 KMTKVCDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAVSEDDPGMRHKL 180
           KM +VCDKLIEVFM+DKPTPTDWRRL+AFSKEW NIRPHF+RRCQDRA SE DPG +H L
Sbjct: 121 KMIRVCDKLIEVFMVDKPTPTDWRRLLAFSKEWSNIRPHFYRRCQDRADSEGDPGKKHSL 180

Query: 181 LKLGRKLKEIDEDVQRHNELLEVVRAVAPSELGEIISRRRKDFTKEFFVHLHTVAESYYD 240
           L+LGRKLKEIDEDV+RHNELLEV++   P+++  ++++RRKDFTKEFFVHLHTVAESY+D
Sbjct: 181 LRLGRKLKEIDEDVKRHNELLEVIKGTPPADISAVVAKRRKDFTKEFFVHLHTVAESYHD 240

Query: 241 DPAEQNALAKLGNSCLAAVQTFDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAE 300
           +P EQNALAKLGN CLAAVQT+D A+E+IEALNAAELKFQDI+NSP+LD ACRKID+LAE
Sbjct: 241 NPTEQNALAKLGNMCLAAVQTYDTASESIEALNAAELKFQDILNSPSLDVACRKIDSLAE 300

Query: 301 KNQLDSALVLMITKAWSAAKESSMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLT 360
           KNQLDSALVLMITKAWSAAKES+M KDEVKD+L+HLY TA GNLQRLMPKEIRILKYLLT
Sbjct: 301 KNQLDSALVLMITKAWSAAKESNMTKDEVKDVLFHLYTTARGNLQRLMPKEIRILKYLLT 360

Query: 361 IKDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEAR 420
           I+DPEEK+SALKDAFTPG+E+EG+DVDCLYTTPEKLHTW++TVVDA+HFSREGTLI+EAR
Sbjct: 361 IEDPEEKMSALKDAFTPGDEIEGKDVDCLYTTPEKLHTWMQTVVDAFHFSREGTLIREAR 420

Query: 421 DLMNPQVIVKLEELKHLLEKKFM 438
           DLMNP++I KLEELK L++  FM
Sbjct: 421 DLMNPKIIQKLEELKKLVQDNFM 429

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Y4920_ARATH1.8e-8038.81Uncharacterized protein At4g37920, chloroplastic OS=Arabidopsis thaliana GN=At4g... [more]
Match NameE-valueIdentityDescription
A0A0A0LMH6_CUCSA4.8e-22188.33Uncharacterized protein OS=Cucumis sativus GN=Csa_2G403700 PE=4 SV=1[more]
E0CRE9_VITVI1.2e-16667.72Putative uncharacterized protein OS=Vitis vinifera GN=VIT_18s0001g04580 PE=4 SV=... [more]
A0A059C054_EUCGR6.2e-16070.34Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_F04381 PE=4 SV=1[more]
A0A0D2TUM3_GOSRA1.4e-15966.52Uncharacterized protein OS=Gossypium raimondii GN=B456_009G241300 PE=4 SV=1[more]
A0A061FGT6_THECC6.8e-15967.64Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_034989 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
gi|659081270|ref|XP_008441243.1|8.1e-23092.68PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis ... [more]
gi|659081268|ref|XP_008441242.1|2.0e-22892.47PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ... [more]
gi|449441744|ref|XP_004138642.1|6.9e-22188.33PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis ... [more]
gi|778673035|ref|XP_011649916.1|1.7e-21988.13PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ... [more]
gi|225459407|ref|XP_002285817.1|1.7e-16667.72PREDICTED: uncharacterized protein At4g37920, chloroplastic [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0009073 aromatic amino acid family biosynthetic process
biological_process GO:0016226 iron-sulfur cluster assembly
biological_process GO:0009058 biosynthetic process
biological_process GO:0009987 cellular process
biological_process GO:0010508 positive regulation of autophagy
biological_process GO:0006144 purine nucleobase metabolic process
biological_process GO:0006206 pyrimidine nucleobase metabolic process
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005730 nucleolus
cellular_component GO:0009507 chloroplast
cellular_component GO:0009941 chloroplast envelope
cellular_component GO:0009535 chloroplast thylakoid membrane
molecular_function GO:0003674 molecular_function
molecular_function GO:0003899 DNA-directed RNA polymerase activity
This gene is associated with the following unigenes:
Unigene NameAnalysis NameSequence type in Unigene
WMU09828watermelon unigene v2 vs TrEMBLtranscribed_cluster
WMU28485watermelon EST collection version 2.0transcribed_cluster
WMU64957watermelon EST collection version 2.0transcribed_cluster
WMU75881watermelon EST collection version 2.0transcribed_cluster
WMU75938watermelon EST collection version 2.0transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cla011898Cla011898.1mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature NameUnique NameType
WMU28485WMU28485transcribed_cluster
WMU75938WMU75938transcribed_cluster
WMU75881WMU75881transcribed_cluster
WMU09828WMU09828transcribed_cluster
WMU64957WMU64957transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePANTHERPTHR31755FAMILY NOT NAMEDcoord: 33..437
score: 1.5E
NoneNo IPR availablePANTHERPTHR31755:SF1SUBFAMILY NOT NAMEDcoord: 33..437
score: 1.5E