ClCG03G016270 (gene) Watermelon (Charleston Gray)

Name: ClCG03G016270
Type: gene
Organism: Citrullus lanatus (Watermelon (Charleston Gray))
Description: Late embryogenesis abundant protein D-34, putative
Location: CG_Chr03 : 31339484 .. 31341751 (+)
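The location string above can be split into its parts with a few lines of Python. This is a minimal sketch: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("CG_Chr03 : 31339484 .. 31341751 (+)")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
length = end - start + 1
print(seqid, start, end, strand, length)
```

Under that assumption the gene spans 2268 bases on the plus strand of CG_Chr03.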
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGCCAACAACAGCCCCGGAGGGCCGGCGGTGACCAGCCGGAGGAGCCGATCACGTATGGTGACGTTTTTCCTCATATTGAGGGCAGCCTTGCTCACAAGCCGGTAACGCCGGAGGATGCTGCAGCTGTGCAGACCGCTGAGACCGCCTTGCTCGGGAAGACACTGCATGGTGGTGCCGCTGCAGCGATACAATCTGCTGCTGCGAAAAATGAGATGGCTGGCGTTATTGGTAGTGGCAACGACGACATTGGTAATATCGTTGCCGACGACGTAAGCATAACTCAGACGGAGATGCCAGGGAGGAGGGTTATTACAGAGTCAGTCGGTGGACAGGTAATATTATTACAACCAATATTTAAAATGTTGATGTCGACGGAAATATTGAAATCCTAATCTTACAAAAACTCCAATAGTAATATTCATAAAATGTTGAGTTCAATAGATATTTCTTTTAAAGTTATAAAAGCAAAAAGATAAAAAAGATATAAAATAGTAAATATTATATTATATTTATATTAATGGTATTTGGTTGATTATTTTATATGTATGAAAATTTCGATTCACTCAAGAGTCGATATCGACCTATAAAACATGAAAATATTATAGAAATATTGACATTTTAGACGAAAATTTAATAGCATAGGTGTAGCTATCTCATGCCTCTTATTGTGAGATAGGAAATAATTTTCATTGGTAATTAAGGTTTTTTTCTTATCATTATTATTATTCGAAATTACATAAATGGCATAAAATATATGCGTTTGTTAGGTTGACGTGATTGAAATATTATATTCAACTACATCATTTTAAACTTAAGATTCAGTCTCAAATATAATGATGATGAGATTCAAAGTATTTGCATGCAATTTTGCTTTCAAAATTGATAGGCCATATTAATAGCAGGTCTATTATGGATAGATTTTGTTATATTTGCAAATTTTTAAAAATATTGATACACTCAATTATTATCCTTAAAAGTGCTATAGTAAATGTATAATAGGTAAAAAAAGATATGTGGACAAAATAAGAAGATCAAACAAGAATATAAAACTAAATAAAGATAACTAGAAATACATAAAATTAGGAAAATATCAAACTATATACAAATATAGAAAAACTTCCATGTGAGTCTTTTAATTTGAAAGAGTACATTTTTAATTACCGTTGAGTTATATTCTCTAGTATAGCAATATTGTAACTTTATATACTGATAAATTGGACCTTTTTCAAATTAATAAAAGGCTGTGACCTCTAGTATAGCAATATTGTAACTTTATATACTGATAAATTGGACCTTTTTCAAATTAATAAAAGGCTGTGACCCAACATAGCGAGAGGGTTCCAATTGCACCATTGTCGACATTGAATCCTCACGAAGAGGGAGGAGGAGGAATAACAATCGGCGAAGCATTGGAAGCAACCGCTTTAGCAGCGGGGGAGAAGCCAGTGGAATGGAGTGATGCCGCCGCAATCCAAGCGGCAGAAGTGAGGGCAACTGGGCAGATGAATATAGCACCGGGAGGTGTCGCCGCCACCGCACAATCGGCCGCCACTATAAACGCTCGAATCACACAAGATGAGGACAAGACCAAGCTCGCGGATGTTCTAAAGGTACATAATTCCTATATAATTTGGCCATTGACTTAATAAAAATTTGACGTAAACATTACACAACAGACATAATGTTGATAATATTGATGAATTTGAAATCTATACCTTTCAATTTTGTGTCTAATAATTTTTTATATTTTAGAAAATCTAATTTAAATATTTATTTGTCTTGCAAGCTCTAATTGCATATGAGATTAATTAGTTTTATAGGATGTAATTTGTAATTATTGATTTGTAGTTTAAATTAAAATTTTAATTTTTAAACTTTGACATGTTTATAAAATTACAATTTTTTTTTTTTTTTTGGTTGTCTTAAACCCATTCCCTTTTAAAATTACAAGAATTTAATGGCACTACAAAGAATTTTTATTGGATAGTTTTACGAC
GATCATATAGTTTATGATATATTATATTACATATGGTTTTATTTTTTAAAAAAGAATTCCATTCATAACAAAAGATACTATGTTTTGTTTTGTTTGGCGTTAGGATGCTCGATCAAAGCTGTCGGCAGATAGACCGGCGACGCGACGAGATGCAGAGGGGGTGGCCGGAGCTGAGATGCGAAATGACCCCTTTCTCACCACGCATCCTACCGGTGTTGCCGCGTCCGTTGCCGCCGCCGCGAGGCTCAACCAGAATAACAACAAGTAG

mRNA sequence

ATGAGCCAACAACAGCCCCGGAGGGCCGGCGGTGACCAGCCGGAGGAGCCGATCACGTATGGTGACGTTTTTCCTCATATTGAGGGCAGCCTTGCTCACAAGCCGGTAACGCCGGAGGATGCTGCAGCTGTGCAGACCGCTGAGACCGCCTTGCTCGGGAAGACACTGCATGGTGGTGCCGCTGCAGCGATACAATCTGCTGCTGCGAAAAATGAGATGGCTGGCGTTATTGGTAGTGGCAACGACGACATTGGTAATATCGTTGCCGACGACGTAAGCATAACTCAGACGGAGATGCCAGGGAGGAGGGTTATTACAGAGTCAGTCGGTGGACAGGCTGTGACCCAACATAGCGAGAGGGTTCCAATTGCACCATTGTCGACATTGAATCCTCACGAAGAGGGAGGAGGAGGAATAACAATCGGCGAAGCATTGGAAGCAACCGCTTTAGCAGCGGGGGAGAAGCCAGTGGAATGGAGTGATGCCGCCGCAATCCAAGCGGCAGAAGTGAGGGCAACTGGGCAGATGAATATAGCACCGGGAGGTGTCGCCGCCACCGCACAATCGGCCGCCACTATAAACGCTCGAATCACACAAGATGAGGACAAGACCAAGCTCGCGGATGTTCTAAAGGATGCTCGATCAAAGCTGTCGGCAGATAGACCGGCGACGCGACGAGATGCAGAGGGGGTGGCCGGAGCTGAGATGCGAAATGACCCCTTTCTCACCACGCATCCTACCGGTGTTGCCGCGTCCGTTGCCGCCGCCGCGAGGCTCAACCAGAATAACAACAAGTAG

Coding sequence (CDS)

ATGAGCCAACAACAGCCCCGGAGGGCCGGCGGTGACCAGCCGGAGGAGCCGATCACGTATGGTGACGTTTTTCCTCATATTGAGGGCAGCCTTGCTCACAAGCCGGTAACGCCGGAGGATGCTGCAGCTGTGCAGACCGCTGAGACCGCCTTGCTCGGGAAGACACTGCATGGTGGTGCCGCTGCAGCGATACAATCTGCTGCTGCGAAAAATGAGATGGCTGGCGTTATTGGTAGTGGCAACGACGACATTGGTAATATCGTTGCCGACGACGTAAGCATAACTCAGACGGAGATGCCAGGGAGGAGGGTTATTACAGAGTCAGTCGGTGGACAGGCTGTGACCCAACATAGCGAGAGGGTTCCAATTGCACCATTGTCGACATTGAATCCTCACGAAGAGGGAGGAGGAGGAATAACAATCGGCGAAGCATTGGAAGCAACCGCTTTAGCAGCGGGGGAGAAGCCAGTGGAATGGAGTGATGCCGCCGCAATCCAAGCGGCAGAAGTGAGGGCAACTGGGCAGATGAATATAGCACCGGGAGGTGTCGCCGCCACCGCACAATCGGCCGCCACTATAAACGCTCGAATCACACAAGATGAGGACAAGACCAAGCTCGCGGATGTTCTAAAGGATGCTCGATCAAAGCTGTCGGCAGATAGACCGGCGACGCGACGAGATGCAGAGGGGGTGGCCGGAGCTGAGATGCGAAATGACCCCTTTCTCACCACGCATCCTACCGGTGTTGCCGCGTCCGTTGCCGCCGCCGCGAGGCTCAACCAGAATAACAACAAGTAG

Protein sequence

MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGAAAAIQSAAAKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAARLNQNNNK
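The relationship between the coding sequence and the protein sequence can be checked by translating codons with the standard genetic code. The sketch below is illustrative only: `translate` is a hypothetical helper, and for brevity it is applied to the first 30 and the last 21 bases of the CDS rather than to the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAGCCAACAACAGCCCCGGAGGGCCGGC"  # first 30 bases of the CDS
cds_end = "AACCAGAATAACAACAAGTAG"            # last 21 bases of the CDS
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MSQQQPRRAG and NQNNNK, matching the first ten and the last six residues of the protein sequence listed above.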
BLAST of ClCG03G016270 vs. Swiss-Prot
Match: LEA34_GOSHI (Late embryogenesis abundant protein D-34 OS=Gossypium hirsutum PE=4 SV=1)

HSP 1 Score: 264.2 bits (674), Expect = 1.5e-69
Identity = 150/269 (55.76%), Positives = 190/269 (70.63%), Query Frame = 1

Query: 1   MSQQQPRR----AGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTL 60
           MSQ QPRR    AG  + +EPI YGDVF ++ G LA+KP+ P+DAA +QTAET +LG+T 
Sbjct: 1   MSQGQPRRPQQPAGQGENQEPIKYGDVF-NVSGELANKPIAPQDAAMMQTAETQVLGQTQ 60

Query: 61  HGGAAAAIQSAAAKNEMAGVIGSGNDDIGNIVADD-VSITQTEMPGRRVITESVGGQAVT 120
            GG AA +Q+AA +NE  GV+G  ++DI +I  +  V++ +T++ GRR+ITE+V GQ V 
Sbjct: 61  KGGTAAVMQAAATRNEQVGVVG--HNDITDIAGEQGVTLAETDVAGRRIITEAVAGQVVG 120

Query: 121 QHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQ 180
           Q+   V   P+ T          ITIGEALEATA  AG+KPV+ SDAAA+QAAEVRATG 
Sbjct: 121 QY---VQATPVMTSQVGVVLQNAITIGEALEATAKTAGDKPVDQSDAAAVQAAEVRATGS 180

Query: 181 MNIAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAE 240
             I PGG+AATAQSAA  NA + +DE+K KL  VL  A +KL AD+  TR+DAEGV  AE
Sbjct: 181 NVIIPGGLAATAQSAAAHNATLDRDEEKIKLNQVLTGATAKLPADKAVTRQDAEGVVSAE 240

Query: 241 MRNDPFLTTHPTGVAASVAAAARLNQNNN 265
           +RN+P + THP GVAAS+AAAARLN+N N
Sbjct: 241 LRNNPNVATHPGGVAASMAAAARLNENVN 263

BLAST of ClCG03G016270 vs. Swiss-Prot
Match: LEA31_ARATH (Late embryogenesis abundant protein 31 OS=Arabidopsis thaliana GN=RAB28 PE=1 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 5.3e-67
Identity = 145/263 (55.13%), Positives = 178/263 (67.68%), Query Frame = 1

Query: 3   QQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGAAA 62
           ++QP+R     P+EP+TYGDVF  + G LA KP+ PEDA  +Q AET + G T  GGAAA
Sbjct: 4   EEQPKR-----PQEPVTYGDVF-EVSGELADKPIAPEDANMMQAAETRVFGHTQKGGAAA 63

Query: 63  AIQSAAAKNEMAGVIGSGNDDIGNIVAD-DVSITQTEMPGRRVITESVGGQAVTQHSERV 122
            +QSAA  N+  G +  G  D  ++ A+  V++ QT++PG RV TE VGGQ V Q+ E  
Sbjct: 64  VMQSAATANKRGGFVHPG--DTTDLAAERGVTVAQTDVPGARVTTEFVGGQVVGQYVEPR 123

Query: 123 PIAPLSTLNPHEEG---GGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNI 182
           P+A  + +     G      ITIGEALEAT   AG KPV+ SDAAAIQAAEVRA G   I
Sbjct: 124 PVATAAAMEAEVVGLSLQSAITIGEALEATVQTAGNKPVDQSDAAAIQAAEVRACGTNVI 183

Query: 183 APGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRN 242
           APGG+AA+AQSAA  NA I +DEDK KL DVL  A  KL+AD+  TR+DAEGV  AE+RN
Sbjct: 184 APGGIAASAQSAANHNATIDRDEDKIKLIDVLAGATGKLAADKAVTRQDAEGVVSAELRN 243

Query: 243 DPFLTTHPTGVAASVAAAARLNQ 262
           +P L+THP GVAAS+ AAARLN+
Sbjct: 244 NPNLSTHPGGVAASITAAARLNE 258

BLAST of ClCG03G016270 vs. Swiss-Prot
Match: LEA32_ARATH (Late embryogenesis abundant protein 32 OS=Arabidopsis thaliana GN=ECP31 PE=2 SV=1)

HSP 1 Score: 233.4 bits (594), Expect = 2.8e-60
Identity = 142/262 (54.20%), Positives = 165/262 (62.98%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGA 60
           MSQ+QPRR     P EP+ YGDVF  + G LA KP+ PEDA  +Q+AET + G T  GG 
Sbjct: 1   MSQEQPRR-----PREPVKYGDVF-EVSGELADKPIAPEDAKMMQSAETHVFGHTQKGGP 60

Query: 61  AAAIQSAAAKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSER 120
           AA +QSAA  N   G +    DD   +VA+  +  +  +P   V TE VGGQ V QH E 
Sbjct: 61  AAVMQSAATTNIRGGFVHP--DDKTELVAERGATVEQTVPAATVTTEFVGGQVVGQHVE- 120

Query: 121 VPIAPLSTLNPHEEG-GGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIA 180
            P   ++     EE     ITIGEALEAT   AG KPV+ SDAAAIQAAE+RA+G   IA
Sbjct: 121 -PRRVVAAARTDEEALQSTITIGEALEATVKTAGNKPVDQSDAAAIQAAEMRASGTNVIA 180

Query: 181 PGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRND 240
             GVAA+AQSAA  NA + +DE K KL DVL  A  KLSADR  TR DAEGV  AEMRN+
Sbjct: 181 LAGVAASAQSAADHNATVDRDERKIKLRDVLTGAAGKLSADRAVTREDAEGVVSAEMRNN 240

Query: 241 PFLTTHPTGVAASVAAAARLNQ 262
           P L THP GVAAS+  AARLN+
Sbjct: 241 PKLCTHPGGVAASLTVAARLNE 252

BLAST of ClCG03G016270 vs. Swiss-Prot
Match: LEA47_ARATH (Late embryogenesis abundant protein 47 OS=Arabidopsis thaliana GN=At5g27980 PE=2 SV=1)

HSP 1 Score: 191.8 bits (486), Expect = 9.5e-48
Identity = 108/169 (63.91%), Positives = 125/169 (73.96%), Query Frame = 1

Query: 94  ITQTEMPGRRVITESVGGQAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAG 153
           I   E   + V+ E+ G QA  + +++  +A     NP +  G  ITIGEALEA  L AG
Sbjct: 29  IKAAEDKEKGVVAEASGEQAEGEVNQKKVVA-----NPLKSEGT-ITIGEALEAAVLTAG 88

Query: 154 EKPVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARITQDEDKTKLADVLKDA 213
            KPVEWSDAAAIQAAEVRATG+ NI PGGVAA+AQSAAT+NARI  D+ KT LADVL  A
Sbjct: 89  NKPVEWSDAAAIQAAEVRATGRTNIMPGGVAASAQSAATLNARIGSDDTKTTLADVLTGA 148

Query: 214 RSKLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAARLNQN 263
            SKL +D+ ATR+DAEGV GAEMRNDP LTT+PTGVAASVAAAAR+NQ+
Sbjct: 149 SSKLPSDKAATRKDAEGVTGAEMRNDPHLTTYPTGVAASVAAAARINQS 191

BLAST of ClCG03G016270 vs. Swiss-Prot
Match: LEA3_ARATH (Late embryogenesis abundant protein 3 OS=Arabidopsis thaliana GN=At1g03120 PE=3 SV=1)

HSP 1 Score: 133.7 bits (335), Expect = 3.1e-30
Identity = 69/123 (56.10%), Positives = 85/123 (69.11%), Query Frame = 1

Query: 139 ITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARIT 198
           +TIGEALEATAL+ G+KPV+  DAAAIQAAE RATG+    PGG+A  AQ+AAT N +  
Sbjct: 58  VTIGEALEATALSLGDKPVDRRDAAAIQAAETRATGESKGRPGGLAVAAQAAATTNEQTV 117

Query: 199 QDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAAR 258
            +EDK  +AD+L DA  +L  D+  T  DAE V GAE+R+   + T P GVA S++A AR
Sbjct: 118 SEEDKVNIADILTDAAERLPGDKVVTSEDAEAVVGAELRSSSEMKTTPGGVADSMSAGAR 177

Query: 259 LNQ 262
           LNQ
Sbjct: 178 LNQ 180


HSP 2 Score: 49.3 bits (116), Expect = 7.6e-05
Identity = 39/110 (35.45%), Positives = 52/110 (47.27%), Query Frame = 1

Query: 16  EPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLG--KTLHGGAAAAIQSAAAKNEM 75
           + +T G+       SL  KPV   DAAA+Q AET   G  K   GG A A Q+AA  NE 
Sbjct: 56  DTVTIGEALEATALSLGDKPVDRRDAAAIQAAETRATGESKGRPGGLAVAAQAAATTNEQ 115

Query: 76  AGVIGSGNDDIGNIVADDVSITQTEMPGRRVIT----ESVGGQAVTQHSE 120
                S  D +   +AD ++     +PG +V+T    E+V G  +   SE
Sbjct: 116 T---VSEEDKVN--IADILTDAAERLPGDKVVTSEDAEAVVGAELRSSSE 160


HSP 3 Score: 39.7 bits (91), Expect = 6.0e-02
Identity = 56/215 (26.05%), Positives = 82/215 (38.14%), Query Frame = 1

Query: 2   SQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKP----VTPEDAAAVQ-TAETALLGKTL 61
           S Q+PR     +P +   YG VF      +A K       P+   A   + +T  +G+ L
Sbjct: 7   SPQRPRDQDNTRPHDQ--YGIVFSVSGDDVARKQGDSFSQPDPTVATMGSVDTVTIGEAL 66

Query: 62  HGGA------------AAAIQSAA--AKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGR 121
              A            AAAIQ+A   A  E  G  G      G  VA   + T  E    
Sbjct: 67  EATALSLGDKPVDRRDAAAIQAAETRATGESKGRPG------GLAVAAQAAATTNE---- 126

Query: 122 RVITESVGGQAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDA 181
                    Q V++  ++V IA + T                 +A     G+K V   DA
Sbjct: 127 ---------QTVSEE-DKVNIADILT-----------------DAAERLPGDKVVTSEDA 182

Query: 182 AAIQAAEVRATGQMNIAPGGVAATAQSAATINARI 198
            A+  AE+R++ +M   PGGVA +  + A +N ++
Sbjct: 187 EAVVGAELRSSSEMKTTPGGVADSMSAGARLNQQL 182

BLAST of ClCG03G016270 vs. TrEMBL
Match: A0A0A0M1P9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G574780 PE=4 SV=1)

HSP 1 Score: 396.4 bits (1017), Expect = 2.8e-107
Identity = 208/264 (78.79%), Positives = 229/264 (86.74%), Query Frame = 1

Query: 1   MSQQQPRR-AGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGG 60
           MSQQQPR+ A  DQ EEPI YGDVFPH+EG LA+KPVTPEDAAA+Q AET LLGKTLHGG
Sbjct: 1   MSQQQPRKPACCDQLEEPIKYGDVFPHVEGDLANKPVTPEDAAALQAAETVLLGKTLHGG 60

Query: 61  AAAAIQSAAAKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSE 120
           AAA IQSAAAKNE AG++G G D    IVA+DV IT T++ G +        + VT+H E
Sbjct: 61  AAATIQSAAAKNERAGLVGRGKDVGDQIVAEDV-ITNTDLVGAQ--------EVVTEHRE 120

Query: 121 RVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIA 180
           RVPI PLSTLNPHEEGGGGITIGEALEATAL  GEK VEWSDAAAIQAAEVRATG+MNIA
Sbjct: 121 RVPIGPLSTLNPHEEGGGGITIGEALEATALTVGEKIVEWSDAAAIQAAEVRATGRMNIA 180

Query: 181 PGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRND 240
           PGG+AATAQSAAT+NAR+TQDEDKTKLADVLKDAR+KLSAD+PATRRDAEGV GAEMRND
Sbjct: 181 PGGIAATAQSAATMNARVTQDEDKTKLADVLKDARTKLSADKPATRRDAEGVTGAEMRND 240

Query: 241 PFLTTHPTGVAASVAAAARLNQNN 264
           P+LTTHPTGVAAS+AAAARLNQ+N
Sbjct: 241 PYLTTHPTGVAASIAAAARLNQSN 255

BLAST of ClCG03G016270 vs. TrEMBL
Match: M5W349_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020371mg PE=4 SV=1)

HSP 1 Score: 333.6 bits (854), Expect = 2.2e-88
Identity = 179/267 (67.04%), Positives = 214/267 (80.15%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEG-SLAHKPVTPEDAAAVQTAETALLGKTLHGG 60
           MSQ+QPR+   +  +E +TYGDVFP ++G  LA K V P+DAA +Q  E A+LGKT+ GG
Sbjct: 1   MSQEQPRKP--EDQKEAVTYGDVFPGVQGVELADKLVAPKDAAIMQAEENAVLGKTIKGG 60

Query: 61  AAAAIQSAAAKNEMAGVIGSGNDDIGNIVADD--VSITQTEMPGRRVITESVGGQAVTQH 120
           AAA +Q+AA +NE AGV+G  +D   NIV  D  VS+ + E+PGRR+ITES+ GQAV Q+
Sbjct: 61  AAATLQTAARQNEKAGVVGP-DDMNANIVTGDEGVSVKEAELPGRRIITESIAGQAVGQY 120

Query: 121 SERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMN 180
           S+R P+A  +T+      GG ITIGEALEATA+ AG+KPVEWSDAAAIQAAEVRATG+ N
Sbjct: 121 SQRAPLAAPNTIQAGG-AGGQITIGEALEATAMTAGQKPVEWSDAAAIQAAEVRATGRTN 180

Query: 181 IAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMR 240
           I PGGVAA AQSAAT+NAR T+DE+KTKLAD+L DA SKL AD+PATRRDAEGV GAEMR
Sbjct: 181 IVPGGVAAAAQSAATLNARATKDEEKTKLADILADATSKLPADKPATRRDAEGVTGAEMR 240

Query: 241 NDPFLTTHPTGVAASVAAAARLNQNNN 265
           NDPFLTTHPTGVAASVAAAARLNQ N+
Sbjct: 241 NDPFLTTHPTGVAASVAAAARLNQTNS 263

BLAST of ClCG03G016270 vs. TrEMBL
Match: U5GGI0_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s04950g PE=4 SV=1)

HSP 1 Score: 321.2 bits (822), Expect = 1.2e-84
Identity = 174/265 (65.66%), Positives = 206/265 (77.74%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGA 60
           MSQ+Q +R     P+EPI YGDVF  +EG LA KPV P DAA +QTAE AL+G+   GGA
Sbjct: 1   MSQRQQQR-----PQEPIKYGDVFS-VEGELAEKPVAPRDAAMMQTAENALMGQIQRGGA 60

Query: 61  AAAIQSAAAKNEMAGVIGSGN-DDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSE 120
           A+ +QSAA +NE AG +G  + +D+ N     VS+T+TEMPGRR+ITE++GGQ V    +
Sbjct: 61  ASMMQSAAMRNERAGFVGHSDVNDVANY--QGVSVTETEMPGRRIITEAIGGQIVRDFDQ 120

Query: 121 RVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIA 180
           R P+   S     ++   GITIGEALEATAL+ G+KPVEWSDAAAIQAAEVRATG+  I 
Sbjct: 121 RAPLVQ-SPPPLFQQVDAGITIGEALEATALSCGQKPVEWSDAAAIQAAEVRATGRTTIT 180

Query: 181 PGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRND 240
           PGGVAA AQSAATINAR+T+DEDKTKL+DVL DA SKL AD+PATR+DAEGV GAEMRND
Sbjct: 181 PGGVAAAAQSAATINARMTKDEDKTKLSDVLADATSKLPADKPATRKDAEGVTGAEMRND 240

Query: 241 PFLTTHPTGVAASVAAAARLNQNNN 265
           PFLTT+P GVAASVAAAARLNQ NN
Sbjct: 241 PFLTTNPAGVAASVAAAARLNQQNN 256

BLAST of ClCG03G016270 vs. TrEMBL
Match: V4T9S2_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10003643mg PE=4 SV=1)

HSP 1 Score: 316.6 bits (810), Expect = 2.8e-83
Identity = 172/266 (64.66%), Positives = 205/266 (77.07%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGA 60
           MSQ+QPRR        PI YGDVF  +EG +A   V P DAA +QTAE A+LG+   G A
Sbjct: 1   MSQEQPRR--------PIKYGDVFS-VEGEIAEMAVAPRDAALMQTAENAMLGQIQKGAA 60

Query: 61  AAAIQSAAAKNEMAGVIGSGNDDIGNIVADD-VSITQTEMPGRRVITESVGGQAVTQHSE 120
           A+ +QSAA +NE  G +G  ++D+ ++ A   VSIT+T +PGRR+ITE +GGQ V Q+S+
Sbjct: 61  ASMMQSAAERNEKGGFVG--HEDMTDVAAGQGVSITETNLPGRRIITEEIGGQVVGQYSQ 120

Query: 121 RVPIAPLSTLNPHEE--GGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMN 180
             P+  L+  + H E  GGGGITIGEALEATAL AG+KPVEWSDAAAIQAAEVRATG++N
Sbjct: 121 PSPLQSLAPPSSHGEVKGGGGITIGEALEATALTAGKKPVEWSDAAAIQAAEVRATGRIN 180

Query: 181 IAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMR 240
           I PGGVAA AQSAATINAR T+DEDKTKLAD+L DA +KL AD+  TR+DAEGVA AEMR
Sbjct: 181 ITPGGVAAAAQSAATINARTTRDEDKTKLADILTDATAKLPADKQVTRKDAEGVAAAEMR 240

Query: 241 NDPFLTTHPTGVAASVAAAARLNQNN 264
           NDP LTTHP GVAASVAAAARLNQ+N
Sbjct: 241 NDPMLTTHPAGVAASVAAAARLNQSN 255

BLAST of ClCG03G016270 vs. TrEMBL
Match: A0A059D3Y3_EUCGR (Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B01734 PE=4 SV=1)

HSP 1 Score: 316.2 bits (809), Expect = 3.7e-83
Identity = 171/268 (63.81%), Positives = 208/268 (77.61%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGA 60
           MSQ+QPRR     PE+ I YGD+F  ++G LA KPVTP DAA +QTAE  + G+T  G A
Sbjct: 1   MSQEQPRRT----PEDAIRYGDIFA-VDGELAEKPVTPRDAAMMQTAENEMFGQTQRGHA 60

Query: 61  AAAIQSAAAKNEMAGVIGSGNDDIGNIVADD-VSITQTEMPGRRVITESVGGQAVTQHSE 120
           AAA+QSAAA+NE AG++G  + D+  + + D V+IT+T++PGRRVITESVGGQ V+Q S 
Sbjct: 61  AAAMQSAAARNERAGLVG--HSDVTEVASKDGVTITETDLPGRRVITESVGGQVVSQFSR 120

Query: 121 RVPIAPLST-----LNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATG 180
           R PI  ++      ++  ++G G ITIGEALEA AL AG+KPVEWSDAAAIQAAEVRATG
Sbjct: 121 RSPIPTMTPSSIYQIDAGKDGIGSITIGEALEAAALTAGQKPVEWSDAAAIQAAEVRATG 180

Query: 181 QMNIAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGA 240
           +  I PGGVAA AQ+AAT+NAR  + EDKT LAD+L DA SKL +D+PA+RRDAEGVAGA
Sbjct: 181 RTTIVPGGVAAAAQAAATLNARTPRAEDKTTLADILGDATSKLPSDKPASRRDAEGVAGA 240

Query: 241 EMRNDPFLTTHPTGVAASVAAAARLNQN 263
           EMRNDP L THP G+AASVAAAARLNQN
Sbjct: 241 EMRNDPRLNTHPAGIAASVAAAARLNQN 261

BLAST of ClCG03G016270 vs. TAIR10
Match: AT3G22490.1 (AT3G22490.1 Seed maturation protein)

HSP 1 Score: 255.8 bits (652), Expect = 3.0e-68
Identity = 145/263 (55.13%), Positives = 178/263 (67.68%), Query Frame = 1

Query: 3   QQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGAAA 62
           ++QP+R     P+EP+TYGDVF  + G LA KP+ PEDA  +Q AET + G T  GGAAA
Sbjct: 4   EEQPKR-----PQEPVTYGDVF-EVSGELADKPIAPEDANMMQAAETRVFGHTQKGGAAA 63

Query: 63  AIQSAAAKNEMAGVIGSGNDDIGNIVAD-DVSITQTEMPGRRVITESVGGQAVTQHSERV 122
            +QSAA  N+  G +  G  D  ++ A+  V++ QT++PG RV TE VGGQ V Q+ E  
Sbjct: 64  VMQSAATANKRGGFVHPG--DTTDLAAERGVTVAQTDVPGARVTTEFVGGQVVGQYVEPR 123

Query: 123 PIAPLSTLNPHEEG---GGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNI 182
           P+A  + +     G      ITIGEALEAT   AG KPV+ SDAAAIQAAEVRA G   I
Sbjct: 124 PVATAAAMEAEVVGLSLQSAITIGEALEATVQTAGNKPVDQSDAAAIQAAEVRACGTNVI 183

Query: 183 APGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRN 242
           APGG+AA+AQSAA  NA I +DEDK KL DVL  A  KL+AD+  TR+DAEGV  AE+RN
Sbjct: 184 APGGIAASAQSAANHNATIDRDEDKIKLIDVLAGATGKLAADKAVTRQDAEGVVSAELRN 243

Query: 243 DPFLTTHPTGVAASVAAAARLNQ 262
           +P L+THP GVAAS+ AAARLN+
Sbjct: 244 NPNLSTHPGGVAASITAAARLNE 258

BLAST of ClCG03G016270 vs. TAIR10
Match: AT3G22500.1 (AT3G22500.1 Seed maturation protein)

HSP 1 Score: 233.4 bits (594), Expect = 1.6e-61
Identity = 142/262 (54.20%), Positives = 165/262 (62.98%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGA 60
           MSQ+QPRR     P EP+ YGDVF  + G LA KP+ PEDA  +Q+AET + G T  GG 
Sbjct: 1   MSQEQPRR-----PREPVKYGDVF-EVSGELADKPIAPEDAKMMQSAETHVFGHTQKGGP 60

Query: 61  AAAIQSAAAKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSER 120
           AA +QSAA  N   G +    DD   +VA+  +  +  +P   V TE VGGQ V QH E 
Sbjct: 61  AAVMQSAATTNIRGGFVHP--DDKTELVAERGATVEQTVPAATVTTEFVGGQVVGQHVE- 120

Query: 121 VPIAPLSTLNPHEEG-GGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIA 180
            P   ++     EE     ITIGEALEAT   AG KPV+ SDAAAIQAAE+RA+G   IA
Sbjct: 121 -PRRVVAAARTDEEALQSTITIGEALEATVKTAGNKPVDQSDAAAIQAAEMRASGTNVIA 180

Query: 181 PGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRND 240
             GVAA+AQSAA  NA + +DE K KL DVL  A  KLSADR  TR DAEGV  AEMRN+
Sbjct: 181 LAGVAASAQSAADHNATVDRDERKIKLRDVLTGAAGKLSADRAVTREDAEGVVSAEMRNN 240

Query: 241 PFLTTHPTGVAASVAAAARLNQ 262
           P L THP GVAAS+  AARLN+
Sbjct: 241 PKLCTHPGGVAASLTVAARLNE 252

BLAST of ClCG03G016270 vs. TAIR10
Match: AT5G27980.1 (AT5G27980.1 Seed maturation protein)

HSP 1 Score: 191.8 bits (486), Expect = 5.3e-49
Identity = 108/169 (63.91%), Positives = 125/169 (73.96%), Query Frame = 1

Query: 94  ITQTEMPGRRVITESVGGQAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAG 153
           I   E   + V+ E+ G QA  + +++  +A     NP +  G  ITIGEALEA  L AG
Sbjct: 29  IKAAEDKEKGVVAEASGEQAEGEVNQKKVVA-----NPLKSEGT-ITIGEALEAAVLTAG 88

Query: 154 EKPVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARITQDEDKTKLADVLKDA 213
            KPVEWSDAAAIQAAEVRATG+ NI PGGVAA+AQSAAT+NARI  D+ KT LADVL  A
Sbjct: 89  NKPVEWSDAAAIQAAEVRATGRTNIMPGGVAASAQSAATLNARIGSDDTKTTLADVLTGA 148

Query: 214 RSKLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAARLNQN 263
            SKL +D+ ATR+DAEGV GAEMRNDP LTT+PTGVAASVAAAAR+NQ+
Sbjct: 149 SSKLPSDKAATRKDAEGVTGAEMRNDPHLTTYPTGVAASVAAAARINQS 191

BLAST of ClCG03G016270 vs. TAIR10
Match: AT1G03120.1 (AT1G03120.1 responsive to abscisic acid 28)

HSP 1 Score: 133.7 bits (335), Expect = 1.7e-31
Identity = 69/123 (56.10%), Positives = 85/123 (69.11%), Query Frame = 1

Query: 139 ITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARIT 198
           +TIGEALEATAL+ G+KPV+  DAAAIQAAE RATG+    PGG+A  AQ+AAT N +  
Sbjct: 58  VTIGEALEATALSLGDKPVDRRDAAAIQAAETRATGESKGRPGGLAVAAQAAATTNEQTV 117

Query: 199 QDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAAR 258
            +EDK  +AD+L DA  +L  D+  T  DAE V GAE+R+   + T P GVA S++A AR
Sbjct: 118 SEEDKVNIADILTDAAERLPGDKVVTSEDAEAVVGAELRSSSEMKTTPGGVADSMSAGAR 177

Query: 259 LNQ 262
           LNQ
Sbjct: 178 LNQ 180


HSP 2 Score: 49.3 bits (116), Expect = 4.3e-06
Identity = 39/110 (35.45%), Positives = 52/110 (47.27%), Query Frame = 1

Query: 16  EPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLG--KTLHGGAAAAIQSAAAKNEM 75
           + +T G+       SL  KPV   DAAA+Q AET   G  K   GG A A Q+AA  NE 
Sbjct: 56  DTVTIGEALEATALSLGDKPVDRRDAAAIQAAETRATGESKGRPGGLAVAAQAAATTNEQ 115

Query: 76  AGVIGSGNDDIGNIVADDVSITQTEMPGRRVIT----ESVGGQAVTQHSE 120
                S  D +   +AD ++     +PG +V+T    E+V G  +   SE
Sbjct: 116 T---VSEEDKVN--IADILTDAAERLPGDKVVTSEDAEAVVGAELRSSSE 160


HSP 3 Score: 39.7 bits (91), Expect = 3.4e-03
Identity = 56/215 (26.05%), Positives = 82/215 (38.14%), Query Frame = 1

Query: 2   SQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKP----VTPEDAAAVQ-TAETALLGKTL 61
           S Q+PR     +P +   YG VF      +A K       P+   A   + +T  +G+ L
Sbjct: 7   SPQRPRDQDNTRPHDQ--YGIVFSVSGDDVARKQGDSFSQPDPTVATMGSVDTVTIGEAL 66

Query: 62  HGGA------------AAAIQSAA--AKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGR 121
              A            AAAIQ+A   A  E  G  G      G  VA   + T  E    
Sbjct: 67  EATALSLGDKPVDRRDAAAIQAAETRATGESKGRPG------GLAVAAQAAATTNE---- 126

Query: 122 RVITESVGGQAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDA 181
                    Q V++  ++V IA + T                 +A     G+K V   DA
Sbjct: 127 ---------QTVSEE-DKVNIADILT-----------------DAAERLPGDKVVTSEDA 182

Query: 182 AAIQAAEVRATGQMNIAPGGVAATAQSAATINARI 198
            A+  AE+R++ +M   PGGVA +  + A +N ++
Sbjct: 187 EAVVGAELRSSSEMKTTPGGVADSMSAGARLNQQL 182

BLAST of ClCG03G016270 vs. TAIR10
Match: AT5G53260.1 (AT5G53260.1 Seed maturation protein)

HSP 1 Score: 102.1 bits (253), Expect = 5.6e-22
Identity = 63/173 (36.42%), Positives = 88/173 (50.87%), Query Frame = 1

Query: 90  DDVSITQTEMPGRRVITESVGGQAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATA 149
           D  S+T   +     +++S  G      +E +  A  + +      G   T+ EAL+A +
Sbjct: 6   DSASVTNISVEEHFSVSQSSPGGQFVGPTEEISTAAEALI------GRSTTLTEALKAAS 65

Query: 150 LAAGEKPVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARITQDEDKTKLADV 209
           +  G KPVE +D AAI+  E RA G    + GGV A A  A   N +I +D +KT L DV
Sbjct: 66  MNVGHKPVETTDVAAIKEVETRAIGGDIESEGGVTAVASKAVARNQKIGKDNEKTNLGDV 125

Query: 210 LKDARSKLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAARLNQN 263
           + +   K++ DR  T  DAE V  AE+ + PF    P GVA SVAAA +LN +
Sbjct: 126 IAEIDVKVTRDREVTSEDAEAVIRAELNHSPFNNIIPGGVAESVAAAYKLNHD 172

BLAST of ClCG03G016270 vs. NCBI nr
Match: gi|449436038|ref|XP_004135801.1| (PREDICTED: late embryogenesis abundant protein D-34 [Cucumis sativus])

HSP 1 Score: 396.4 bits (1017), Expect = 4.0e-107
Identity = 208/264 (78.79%), Positives = 229/264 (86.74%), Query Frame = 1

Query: 1   MSQQQPRR-AGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGG 60
           MSQQQPR+ A  DQ EEPI YGDVFPH+EG LA+KPVTPEDAAA+Q AET LLGKTLHGG
Sbjct: 1   MSQQQPRKPACCDQLEEPIKYGDVFPHVEGDLANKPVTPEDAAALQAAETVLLGKTLHGG 60

Query: 61  AAAAIQSAAAKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSE 120
           AAA IQSAAAKNE AG++G G D    IVA+DV IT T++ G +        + VT+H E
Sbjct: 61  AAATIQSAAAKNERAGLVGRGKDVGDQIVAEDV-ITNTDLVGAQ--------EVVTEHRE 120

Query: 121 RVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIA 180
           RVPI PLSTLNPHEEGGGGITIGEALEATAL  GEK VEWSDAAAIQAAEVRATG+MNIA
Sbjct: 121 RVPIGPLSTLNPHEEGGGGITIGEALEATALTVGEKIVEWSDAAAIQAAEVRATGRMNIA 180

Query: 181 PGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRND 240
           PGG+AATAQSAAT+NAR+TQDEDKTKLADVLKDAR+KLSAD+PATRRDAEGV GAEMRND
Sbjct: 181 PGGIAATAQSAATMNARVTQDEDKTKLADVLKDARTKLSADKPATRRDAEGVTGAEMRND 240

Query: 241 PFLTTHPTGVAASVAAAARLNQNN 264
           P+LTTHPTGVAAS+AAAARLNQ+N
Sbjct: 241 PYLTTHPTGVAASIAAAARLNQSN 255

BLAST of ClCG03G016270 vs. NCBI nr
Match: gi|595833014|ref|XP_007206587.1| (hypothetical protein PRUPE_ppa020371mg [Prunus persica])

HSP 1 Score: 333.6 bits (854), Expect = 3.2e-88
Identity = 179/267 (67.04%), Positives = 214/267 (80.15%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEG-SLAHKPVTPEDAAAVQTAETALLGKTLHGG 60
           MSQ+QPR+   +  +E +TYGDVFP ++G  LA K V P+DAA +Q  E A+LGKT+ GG
Sbjct: 1   MSQEQPRKP--EDQKEAVTYGDVFPGVQGVELADKLVAPKDAAIMQAEENAVLGKTIKGG 60

Query: 61  AAAAIQSAAAKNEMAGVIGSGNDDIGNIVADD--VSITQTEMPGRRVITESVGGQAVTQH 120
           AAA +Q+AA +NE AGV+G  +D   NIV  D  VS+ + E+PGRR+ITES+ GQAV Q+
Sbjct: 61  AAATLQTAARQNEKAGVVGP-DDMNANIVTGDEGVSVKEAELPGRRIITESIAGQAVGQY 120

Query: 121 SERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMN 180
           S+R P+A  +T+      GG ITIGEALEATA+ AG+KPVEWSDAAAIQAAEVRATG+ N
Sbjct: 121 SQRAPLAAPNTIQAGG-AGGQITIGEALEATAMTAGQKPVEWSDAAAIQAAEVRATGRTN 180

Query: 181 IAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMR 240
           I PGGVAA AQSAAT+NAR T+DE+KTKLAD+L DA SKL AD+PATRRDAEGV GAEMR
Sbjct: 181 IVPGGVAAAAQSAATLNARATKDEEKTKLADILADATSKLPADKPATRRDAEGVTGAEMR 240

Query: 241 NDPFLTTHPTGVAASVAAAARLNQNNN 265
           NDPFLTTHPTGVAASVAAAARLNQ N+
Sbjct: 241 NDPFLTTHPTGVAASVAAAARLNQTNS 263

BLAST of ClCG03G016270 vs. NCBI nr
Match: gi|1009117694|ref|XP_015875454.1| (PREDICTED: late embryogenesis abundant protein D-34 [Ziziphus jujuba])

HSP 1 Score: 328.9 bits (842), Expect = 7.9e-87
Identity = 178/265 (67.17%), Positives = 205/265 (77.36%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITY-GDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGG 60
           MSQ+QP R    Q  +PI   GDVF   +  LA KPV P+DA  +Q AETA LG T  G 
Sbjct: 1   MSQEQPLRP---QDNKPIKIDGDVFSDAKEELAQKPVVPKDAEIMQAAETAFLGHTAKGD 60

Query: 61  AAAAIQSAAAKNEMAGVIGSGNDDIGNIVADDVSITQTEMPGRRVITESVGGQAVTQHSE 120
           AAAA+QSAA KNE AG++G  +DD  NI   D+SIT+T++PGRR+ITE+VGGQ V   S+
Sbjct: 61  AAAAMQSAATKNENAGLVG--HDDKSNIADKDISITETDLPGRRIITEAVGGQVVGLFSQ 120

Query: 121 RVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMNIA 180
           RVP+ P ST        GGITIGEALEATA+ AG+KPVEWSDAAAIQAAEVRATG+ NI 
Sbjct: 121 RVPLPPPSTAIQQGANSGGITIGEALEATAMTAGQKPVEWSDAAAIQAAEVRATGRTNIV 180

Query: 181 PGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMRND 240
           PGGVAA AQSAAT+NAR T+DEDKTKLA+VL  A S+L +D+PATRRDAEGV  AEMRND
Sbjct: 181 PGGVAAAAQSAATLNARATRDEDKTKLAEVLAGAASRLPSDKPATRRDAEGVTSAEMRND 240

Query: 241 PFLTTHPTGVAASVAAAARLNQNNN 265
           P+LTTHPTGVAASVAAAARLNQ NN
Sbjct: 241 PYLTTHPTGVAASVAAAARLNQINN 260

BLAST of ClCG03G016270 vs. NCBI nr
Match: gi|645222467|ref|XP_008218178.1| (PREDICTED: late embryogenesis abundant protein D-34 isoform X2 [Prunus mume])

HSP 1 Score: 327.8 bits (839), Expect = 1.8e-86
Identity = 177/267 (66.29%), Positives = 212/267 (79.40%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGSLAHKPVTPEDAAAVQTAETALLGKTLHGGA 60
           MSQ+QPR+   +  +E + YGDVFP   G LA K V P+DAA +Q AE A+LG+T+ GGA
Sbjct: 1   MSQEQPRKP--EDQKEAVKYGDVFPG--GELADKVVAPKDAAIMQAAENAVLGETIKGGA 60

Query: 61  AAAIQSAAAKNEMAGVIGSGNDDIGNIVADD---VSITQTEMPGRRVITESVGGQAVTQH 120
           AA +Q+AA +NE AGV+G  +D   NIV  D   VS+ + ++PGRR+ITES+ GQAV Q+
Sbjct: 61  AATLQAAARQNEKAGVVGP-DDMNANIVTGDEGGVSVKEADLPGRRIITESIAGQAVGQY 120

Query: 121 SERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEKPVEWSDAAAIQAAEVRATGQMN 180
           S+R P+A  +T+     GG  ITIGEALEATA+ AG+KPVEWSDAAAIQAAEVRATG+ N
Sbjct: 121 SQRAPLAAPNTIQAGGPGGQ-ITIGEALEATAMTAGQKPVEWSDAAAIQAAEVRATGRTN 180

Query: 181 IAPGGVAATAQSAATINARITQDEDKTKLADVLKDARSKLSADRPATRRDAEGVAGAEMR 240
           I PGGVAA AQSAAT+NAR T+DE+KTKLAD+L DA SKL AD+PATRRDAEGV GAEMR
Sbjct: 181 IVPGGVAAAAQSAATLNARATKDEEKTKLADILADATSKLPADKPATRRDAEGVTGAEMR 240

Query: 241 NDPFLTTHPTGVAASVAAAARLNQNNN 265
           NDPFLTTHPTGVAASVAAAARLNQ N+
Sbjct: 241 NDPFLTTHPTGVAASVAAAARLNQTNS 261

BLAST of ClCG03G016270 vs. NCBI nr
Match: gi|694376007|ref|XP_009364553.1| (PREDICTED: late embryogenesis abundant protein D-34-like [Pyrus x bretschneideri])

HSP 1 Score: 323.9 bits (829), Expect = 2.5e-85
Identity = 175/289 (60.55%), Positives = 217/289 (75.09%), Query Frame = 1

Query: 1   MSQQQPRRAGGDQPEEPITYGDVFPHIEGS-LAHKPVTPEDAAAVQTAETALLGKTLHGG 60
           MSQ+QPR+   +  +EP+TYGDVFP ++G+ LA K VTP+DAA +Q AE A+LG+T+ GG
Sbjct: 1   MSQEQPRKP--EDQKEPVTYGDVFPGVQGAQLADKVVTPKDAAMIQAAENAVLGQTVKGG 60

Query: 61  AAAAIQSAAAKNEMAGVIGSGN--DDIGNIVADDVSITQTEMPGRRVITESVGG------ 120
           AAA IQSAA +NE A V+GS +   D+G      VS+ + ++PGRR+ITES+GG      
Sbjct: 61  AAAIIQSAATQNEKAAVVGSSDVKADVGG-DGGGVSVKEADLPGRRIITESIGGKGSDSN 120

Query: 121 ----------------QAVTQHSERVPIAPLSTLNPHEEGGGGITIGEALEATALAAGEK 180
                           + V Q+S+R P+AP +T++P       ITIGEALEATA+ AG+K
Sbjct: 121 QMHVDQEKQRKIHELVEVVGQYSQRAPLAPPNTMHPVGGSAVQITIGEALEATAMTAGQK 180

Query: 181 PVEWSDAAAIQAAEVRATGQMNIAPGGVAATAQSAATINARITQDEDKTKLADVLKDARS 240
           PVEWSDAAAIQAAEVRATG+ NI PGGVA  AQSAAT+NAR T+DE+KTKLAD+L +A S
Sbjct: 181 PVEWSDAAAIQAAEVRATGRTNIVPGGVAGAAQSAATLNARATKDEEKTKLADILANATS 240

Query: 241 KLSADRPATRRDAEGVAGAEMRNDPFLTTHPTGVAASVAAAARLNQNNN 265
           KL AD+PATRRDAEGV GAEMRNDP+LTTHPTGVAASVAAAARLN+NN+
Sbjct: 241 KLPADKPATRRDAEGVTGAEMRNDPYLTTHPTGVAASVAAAARLNENNS 286

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value   Identity (%)  Description
LEA34_GOSHI   1.5e-69   55.76         Late embryogenesis abundant protein D-34 OS=Gossypium hirsutum PE=4 SV=1
LEA31_ARATH   5.3e-67   55.13         Late embryogenesis abundant protein 31 OS=Arabidopsis thaliana GN=RAB28 PE=1 SV=...
LEA32_ARATH   2.8e-60   54.20         Late embryogenesis abundant protein 32 OS=Arabidopsis thaliana GN=ECP31 PE=2 SV=...
LEA47_ARATH   9.5e-48   63.91         Late embryogenesis abundant protein 47 OS=Arabidopsis thaliana GN=At5g27980 PE=2...
LEA3_ARATH    3.1e-30   56.10         Late embryogenesis abundant protein 3 OS=Arabidopsis thaliana GN=At1g03120 PE=3 ...

vs. TrEMBL
Match Name        E-value    Identity (%)  Description
A0A0A0M1P9_CUCSA  2.8e-107   78.79         Uncharacterized protein OS=Cucumis sativus GN=Csa_1G574780 PE=4 SV=1
M5W349_PRUPE      2.2e-88    67.04         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa020371mg PE=4 SV=1
U5GGI0_POPTR      1.2e-84    65.66         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0005s04950g PE=4 SV=1
V4T9S2_9ROSI      2.8e-83    64.66         Uncharacterized protein OS=Citrus clementina GN=CICLE_v10003643mg PE=4 SV=1
A0A059D3Y3_EUCGR  3.7e-83    63.81         Uncharacterized protein OS=Eucalyptus grandis GN=EUGRSUZ_B01734 PE=4 SV=1

vs. TAIR10
Match Name   E-value   Identity (%)  Description
AT3G22490.1  3.0e-68   55.13         Seed maturation protein
AT3G22500.1  1.6e-61   54.20         Seed maturation protein
AT5G27980.1  5.3e-49   63.91         Seed maturation protein
AT1G03120.1  1.7e-31   56.10         responsive to abscisic acid 28
AT5G53260.1  5.6e-22   36.42         Seed maturation protein

vs. NCBI nr
Match Name                         E-value    Identity (%)  Description
gi|449436038|ref|XP_004135801.1|   4.0e-107   78.79         PREDICTED: late embryogenesis abundant protein D-34 [Cucumis sativus]
gi|595833014|ref|XP_007206587.1|   3.2e-88    67.04         hypothetical protein PRUPE_ppa020371mg [Prunus persica]
gi|1009117694|ref|XP_015875454.1|  7.9e-87    67.17         PREDICTED: late embryogenesis abundant protein D-34 [Ziziphus jujuba]
gi|645222467|ref|XP_008218178.1|   1.8e-86    66.29         PREDICTED: late embryogenesis abundant protein D-34 isoform X2 [Prunus mume]
gi|694376007|ref|XP_009364553.1|   2.5e-85    60.55         PREDICTED: late embryogenesis abundant protein D-34-like [Pyrus x bretschneideri...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR007011  SMP
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
biological_process GO:0007155 cell adhesion
cellular_component GO:0005829 cytosol
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
cellular_component GO:0005618 cell wall
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
ClCG03G016270.1  ClCG03G016270.1  mRNA


Analysis Name: InterPro Annotations of watermelon (Charleston Gray)
Date Performed: 2016-09-28
IPR Term   IPR Description          Source   Source Term    Source Description                  Alignment
IPR007011  Seed maturation protein  PFAM     PF04927        SMP                                 coord: 18..72, score: 8.7E-18; coord: 139..196, score: 1.5E-22; coord: 204..262, score: 6.9
None       No IPR available         PANTHER  PTHR31174      SEED MATURATION FAMILY PROTEIN      coord: 2..265, score: 3.1E
None       No IPR available         PANTHER  PTHR31174:SF2  EMBRYONIC ABUNDANT PROTEIN-RELATED  coord: 2..265, score: 3.1E