Cla003103.1 (mRNA) Watermelon (97103) v1

Name: Cla003103
Type: mRNA
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Homeobox-leucine zipper protein (AHRD V1 ***- Q9FXN8_ZINEL); contains InterPro domain(s) IPR001356 Homeobox
Location: Chr2 : 14793346 .. 14794695 (-)
Sequence length: 993
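
The Location field can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

# Pattern for strings such as "Chr2 : 14793346 .. 14794695 (-)".
LOCATION = re.compile(r"\s*([^\s:]+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")

def parse_location(text):
    """Split a location string into sequence id, start, end, strand and span."""
    match = LOCATION.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    return {
        "seqid": match.group(1),
        "start": start,
        "end": end,
        "strand": match.group(4),
        # Assumption: both endpoints are included in the feature.
        "span": end - start + 1,
    }

loc = parse_location("Chr2 : 14793346 .. 14794695 (-)")
print(loc["seqid"], loc["strand"], loc["span"])
```

Under that assumption the feature spans 1350 bp on the minus strand of Chr2. With a transcript length of 993, the remaining 357 bp would be intronic, provided the location covers exactly the gene sequence listed for this feature.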
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAGACCTGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGGTTTAAATCTTCTTTTCAAACAGCAAAGCAAATCAAAAAACCAATTTCCCTTTCCCCCTTATTTTTTTTGTTTTTGTTTTTGTACTTGTAATTTTTAGACATACCCATGTGGGATGTGGGGTTTCTTTTTCTCTGTCTCCTCCTACTTTCAAACTTCAGTTTACAATGGTCAATCAATCAATCAGCCTTTTTCCTTTCTGGGTTTTGCAAGATTCCACGTATTTTTCAGTAATTCTTTTCCAATTTTGTGTACAGATCATGAACAGAGTCCGAGAAACAAGAACAGTAACCATGTTTACGGCACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGTTTCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTCTAGAGAAGAATTTCGAAGTCGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAATCTCAAACTCAGTTATGAAACTCTCCAAAATGACAATCAAGCTCTCCTCAAACAGGTAAAATTACAAACCCCAAACAAAATCTCCCCCTTTTTATTTTTTGTCTTTTTAAGTATCAGAAATTTGGTAATAATGGAGACTGTAAAATTGTGTTTGCAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAAATGGTGGTGGCGACCGATTCTGAAAATGCTCTGATCGAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAATCCCAAGACTTCAATCACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGCGCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCATTTCTTCACCCGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCCTCCGCCGCCGTGAAACTCAACTACTTGCAGTTTCAGAAGGGGTATCAACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGGGCTGA

mRNA sequence

ATGAAGAGACCTGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGATCATGAACAGAGTCCGAGAAACAAGAACAGTAACCATGTTTACGGCACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGTTTCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTCTAGAGAAGAATTTCGAAGTCGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAATCTCAAACTCAGTTATGAAACTCTCCAAAATGACAATCAAGCTCTCCTCAAACAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAAATGGTGGTGGCGACCGATTCTGAAAATGCTCTGATCGAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAATCCCAAGACTTCAATCACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGCGCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCATTTCTTCACCCGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCCTCCGCCGCCGTGAAACTCAACTACTTGCAGTTTCAGAAGGGGTATCAACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGGGCTGA

Coding sequence (CDS)

ATGAAGAGACCTGCAGTCAGCTCAGATTCCTTGGGTGCACTCATCTCCATTTGCCCAACTTCAGATCATGAACAGAGTCCGAGAAACAAGAACAGTAACCATGTTTACGGCACAGAATTCCAGTCTATGCTGGATGGATTTGAGGAAGAAGGGTGCGTTGAAGAATCGGGGCATGTTTCAGAGAAGAAAAGGCGACTTAGTGTGGAGCAAGTGAAGGCTCTAGAGAAGAATTTCGAAGTCGAAAACAAGCTCGAGCCAGAAAGGAAAGTGAAGCTTGCTCGAGAACTTGGATTACAGCCTCGACAAGTGGCTGTTTGGTTTCAAAATCGTCGAGCCAGATGGAAAACCAAGCAATTAGAAAGAGACTATGGCGTTCTCAAAACAAATTATGAGAATCTCAAACTCAGTTATGAAACTCTCCAAAATGACAATCAAGCTCTCCTCAAACAGATTCGGGAACTGAAATCAAAGCTTCAAGAAGATAACTCAGAGAGCAATCTTTCGGTGGAGGAAGAAATGGTGGTGGCGACCGATTCTGAAAATGCTCTGATCGAACAAACTAAGCCGGAAATTGGTGATCAGTTCTCTGTTCCGCCGGCGAGTGAATCCCAAGACTTCAATCACGAGAGCTTCAACAACAATGGCGGAGAAGGGGAAGAGGCAGCAATAGAAGAAGTGTCATTGTTCGCCGATTTCAAAGATGGGTCATCTGATAGCGATTCGAGCGCAATTTTAAACGAAGATTACAGCCCGACGGCGGGCATTTCTTCACCCGGGGTGCTGCAGAATCACCAGCAGTACCATTTCATGACGGGAGCGGTATCTCCGGCGCCCTCCGCCGCCGTGAAACTCAACTACTTGCAGTTTCAGAAGGGGTATCAACAACAAACCCAGATGTTTCCAAAAATGGAGGAGCATAATTTCTTCAGTGGAGAGGAGGCTTGTAACTTCTTCTCCGATGAGCAAGCTCCGACTCTGCACTGGTGGGGCTGA

Protein sequence

MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWWG
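
The protein sequence can be reproduced from the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code; `translate` and `CODON_TABLE` are illustrative names, and only short fragments from the two ends of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)

# First six codons and last six codons of the CDS listed above.
print(translate("ATGAAGAGACCTGCAGTC"))
print(translate("CTGCACTGGTGGGGCTGA"))
```

The 993-nt CDS corresponds to 331 codons: 330 residues followed by the terminal TGA stop codon, which is not represented in the protein sequence.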
BLAST of Cla003103 vs. Swiss-Prot
Match: ATHB6_ARATH (Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV=1)

HSP 1 Score: 262.7 bits (670), Expect = 5.4e-69
Identity = 165/341 (48.39%), Positives = 211/341 (61.88%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISICPT-SDHEQSPRNKNSNHVYGTEFQSMLDGFEEE--GCVEESG 60
           MKR + SSDS+G LIS+CPT S  EQSPR        G EFQSML+G+EEE    VEE G
Sbjct: 2   MKRLS-SSDSVGGLISLCPTTSTDEQSPRRYG-----GREFQSMLEGYEEEEEAIVEERG 61

Query: 61  HV--SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWK 120
           HV  SEKKRRLS+ QVKALEKNFE+ENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWK
Sbjct: 62  HVGLSEKKRRLSINQVKALEKNFELENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWK 121

Query: 121 TKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL-----QEDNSESNLSVE 180
           TKQLE+DYGVLKT Y++L+ ++++L+ DN++LL++I +LK+KL     +E+  E+N +V 
Sbjct: 122 TKQLEKDYGVLKTQYDSLRHNFDSLRRDNESLLQEISKLKTKLNGGGGEEEEEENNAAVT 181

Query: 181 EEMVVATDSENALIEQTKPEIGDQFSVPP--ASESQDFNHESFNNNGGEGEEAAIEEVSL 240
            E  ++   E   + +   +I +  S PP     S   N+ SF +        A    S 
Sbjct: 182 TESDISVKEEEVSLPE---KITEAPSSPPQFLEHSDGLNYRSFTDLRDLLPLKA--AASS 241

Query: 241 FADFKDGSSDSDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQ 300
           FA     S  SDSSA+LNE+ S    +++P  +                       N+ Q
Sbjct: 242 FAAAAGSSDSSDSSALLNEESSSNVTVAAPVTVPGG--------------------NFFQ 301

Query: 301 FQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
           F K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Sbjct: 302 FVK--MEQTE-----DHEDFLSGEEACEFFSDEQPPSLHWY 304
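
The percentages in each HSP header are the ratio of matching columns to alignment length. As a rough sketch, the fractions can be parsed and recomputed as a consistency check; `check_fractions` is a hypothetical helper and the regular expression is an assumption about this page's layout, not a general BLAST parser.

```python
import re

# Matches items of the form "Label = 165/341 (48.39%)".
FRACTION = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def check_fractions(line):
    """Return {label: (matches, length, reported_pct, recomputed_pct)}."""
    results = {}
    for label, matches, length, reported in FRACTION.findall(line):
        recomputed = round(100 * int(matches) / int(length), 2)
        results[label] = (int(matches), int(length), float(reported), recomputed)
    return results

stats = check_fractions(
    "Identity = 165/341 (48.39%), Positives = 211/341 (61.88%), Query Frame = 1"
)
for label, (matches, length, reported, recomputed) in stats.items():
    print(label, matches, length, reported, recomputed)
```

The denominator (341 here) is the alignment length including gap columns, so it can exceed the length of either sequence.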

BLAST of Cla003103 vs. Swiss-Prot
Match: ATB16_ARATH (Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 SV=2)

HSP 1 Score: 229.6 bits (584), Expect = 5.1e-59
Identity = 160/353 (45.33%), Positives = 207/353 (58.64%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCV--EESGH 60
           MKR + SSDS+  LIS   TS  EQSPR       YG+ +QSML+G++E+  +  E SG+
Sbjct: 1   MKRLS-SSDSMCGLIS---TSTDEQSPRG------YGSNYQSMLEGYDEDATLIEEYSGN 60

Query: 61  -----VSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRAR 120
                +SEKKRRL V+QVKALEKNFE+ENKLEPERK KLA+ELGLQPRQVAVWFQNRRAR
Sbjct: 61  HHHMGLSEKKRRLKVDQVKALEKNFELENKLEPERKTKLAQELGLQPRQVAVWFQNRRAR 120

Query: 121 WKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKL--QEDNSESNL---S 180
           WKTKQLE+DYGVLK  Y++L+ ++++L+ DN +LL++I ++K+K+  +EDN+ +      
Sbjct: 121 WKTKQLEKDYGVLKGQYDSLRHNFDSLRRDNDSLLQEISKIKAKVNGEEDNNNNKAITEG 180

Query: 181 VEEEMVVATDSENALIEQTKPEIGDQFSVPPASESQDFNHES-FNNNGGEGEEAAIEEVS 240
           V+EE V  TDS                   P+S  Q   H S FN               
Sbjct: 181 VKEEEVHKTDS------------------IPSSPLQFLEHSSGFNYRRS----------- 240

Query: 241 LFADFKD----------GSSDS-DSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSP 300
            F D +D          GSSDS DSSA+LN++ S   G  +P V         +TG    
Sbjct: 241 -FTDLRDLLPNSTVVEAGSSDSCDSSAVLNDETSSDNGRLTPPVT--------VTGG--- 287

Query: 301 APSAAVKLNYLQFQKGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
                   ++LQF K   +QT+     +  +F SGEEAC FFSDEQ P+LHW+
Sbjct: 301 --------SFLQFVK--TEQTE-----DHEDFLSGEEACGFFSDEQPPSLHWY 287

BLAST of Cla003103 vs. Swiss-Prot
Match: ATHB5_ARATH (Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV=1)

HSP 1 Score: 220.7 bits (561), Expect = 2.4e-56
Identity = 146/339 (43.07%), Positives = 189/339 (55.75%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISIC-PTSDHEQSPRNKNSNHVYGT--EFQSMLDGFEEEGCVEESG 60
           MKR   SSDSL   + I   T+D + SPR   +  +Y    ++  M D  E++G +E+ G
Sbjct: 1   MKRSRGSSDSLSGFLPIRHSTTDKQISPRPTTTGFLYSGAGDYSQMFDALEDDGSLEDLG 60

Query: 61  HV-------SEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNR 120
            V       +EKKRRL VEQVKALEKNFE++NKLEPERKVKLA+ELGLQPRQVA+WFQNR
Sbjct: 61  GVGHASSTAAEKKRRLGVEQVKALEKNFEIDNKLEPERKVKLAQELGLQPRQVAIWFQNR 120

Query: 121 RARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVE 180
           RARWKTKQLERDYGVLK+N++ LK + ++LQ DN +LL QI+ELK+KL   N E    +E
Sbjct: 121 RARWKTKQLERDYGVLKSNFDALKRNRDSLQRDNDSLLGQIKELKAKL---NVEGVKGIE 180

Query: 181 EE--MVVATDSENALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSL 240
           E   +     +++ +      E+  +   PP     D              E A E  S+
Sbjct: 181 ENGALKAVEANQSVMANNEVLELSHRSPSPPPHIPTD----------APTSELAFEMFSI 240

Query: 241 F---ADFKDGSSD-SDSSAILNEDYSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKL 300
           F    +F+D  +D SDSSA+LNE+YSP                      V  A + A   
Sbjct: 241 FPRTENFRDDPADSSDSSAVLNEEYSP--------------------NTVEAAGAVAATT 300

Query: 301 NYLQFQKGYQQQTQMFPKMEEH-NFFSGEEACNFFSDEQ 323
             +     + Q    F KMEEH + FSGEEAC  F+D +
Sbjct: 301 VEMSTMGCFSQ----FVKMEEHEDLFSGEEACKLFADNE 302

BLAST of Cla003103 vs. Swiss-Prot
Match: HOX4_ORYSJ (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 1.2e-39
Identity = 84/141 (59.57%), Positives = 106/141 (75.18%), Query Frame = 1

Query: 46  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPR 105
           G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPR
Sbjct: 31  GMEAEGDVEEEMMACGGGGEKKRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPR 90

Query: 106 QVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQED 165
           QVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+ D  ALL +I+ELK+KL ++
Sbjct: 91  QVAVWFQNRRARWKTKQLERDYAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDE 150

Query: 166 NSESNLSVEEEMVVATDSENA 183
            + ++ +  +E   A+D   A
Sbjct: 151 EAAASFTSVKEEPAASDGPPA 171

BLAST of Cla003103 vs. Swiss-Prot
Match: HOX4_ORYSI (Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 1.2e-39
Identity = 84/141 (59.57%), Positives = 106/141 (75.18%), Query Frame = 1

Query: 46  GFEEEGCVEES----GHVSEKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPR 105
           G E EG VEE     G   EKKRRLSVEQV+ALE++FEVENKLEPERK +LAR+LGLQPR
Sbjct: 31  GMEAEGDVEEEMMACGGGGEKKRRLSVEQVRALERSFEVENKLEPERKARLARDLGLQPR 90

Query: 106 QVAVWFQNRRARWKTKQLERDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQED 165
           QVAVWFQNRRARWKTKQLERDY  L+ +Y++L+L ++ L+ D  ALL +I+ELK+KL ++
Sbjct: 91  QVAVWFQNRRARWKTKQLERDYAALRHSYDSLRLDHDALRRDKDALLAEIKELKAKLGDE 150

Query: 166 NSESNLSVEEEMVVATDSENA 183
            + ++ +  +E   A+D   A
Sbjct: 151 EAAASFTSVKEEPAASDGPPA 171

BLAST of Cla003103 vs. TrEMBL
Match: M5W009_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1)

HSP 1 Score: 401.7 bits (1031), Expect = 8.3e-109
Identity = 222/338 (65.68%), Positives = 258/338 (76.33%), Query Frame = 1

Query: 7   SSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRL 66
           SSDSLGA+ISICP++  EQSPRN   NHVY  +FQSMLDG +EEGCVEE GHVSEKKRRL
Sbjct: 6   SSDSLGAMISICPSTAEEQSPRN---NHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKRRL 65

Query: 67  SVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 126
           SVEQVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLERD+GVL
Sbjct: 66  SVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFGVL 125

Query: 127 KTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSES-NLSVEEEMVVATDSENALI- 186
           K NY++LKL+Y+ LQ++N+AL+K+I++LKSKLQE+N+ES NLSV+EE +VA D  N  + 
Sbjct: 126 KANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYKVV 185

Query: 187 --EQTK----PEIGDQFSVPPASESQDFNHESFN--NNGGEGEEAAIEEVSLFADFKDGS 246
             E +K    P +G   S  PA+ES++ N ESFN  NNG  G    +E VSLF DFKDGS
Sbjct: 186 DHELSKSPPPPPLG---SSVPATESKELNFESFNNTNNGAVG----LEAVSLFPDFKDGS 245

Query: 247 SDSDSSAILNEDYSPTAGISSPGVLQNHQ------QYHFMTGAVSPAPSAAVKLNYLQFQ 306
           SDSDSSAILNED SP   ISS G+LQNHQ               S +  ++  +N  QFQ
Sbjct: 246 SDSDSSAILNEDNSPNLTISSSGMLQNHQLMKSPASTSLKFNCCSSSSPSSSSMNCFQFQ 305

Query: 307 KGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHW 329
           K Y  Q   F K+EEHNFFS EEAC+FFSDEQAPTL W
Sbjct: 306 KTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of Cla003103 vs. TrEMBL
Match: A0A0A0KGQ3_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499720 PE=4 SV=1)

HSP 1 Score: 401.4 bits (1030), Expect = 1.1e-108
Identity = 219/333 (65.77%), Positives = 255/333 (76.58%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVS 60
           MKR   SSDSLGAL+S+CPTS+ EQSPRN   +HVYG EFQSMLDG +EEG +EE  HV 
Sbjct: 1   MKRHG-SSDSLGALMSVCPTSE-EQSPRN---SHVYGREFQSMLDGLDEEGSIEEHCHVG 60

Query: 61  EKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE 120
           EKKRRLSV+QVKALEK FE+ENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLE
Sbjct: 61  EKKRRLSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLE 120

Query: 121 RDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVEEEMVVATDSE 180
           RDYG+LK NYE+LK S++TLQ DN ALLK+I+ELKSKL+E+ +ESNLSV+EE+ V ++S+
Sbjct: 121 RDYGLLKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFV-SESD 180

Query: 181 NALIEQTKPEIG-DQFSVPPASE-SQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSD 240
           N LIEQT   +  D  S+P AS+ S DFN+ESF   G +  +    EVSLF DFKDGSSD
Sbjct: 181 NLLIEQTTNHLPVDHISLPVASDHSDDFNYESFRTVGADDGDDQRVEVSLFTDFKDGSSD 240

Query: 241 SDSSAILNEDYSPTAGISS--PGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQ 300
           SDSSAILNED SP A +SS   G+LQ+H Q            S A  LN   FQK     
Sbjct: 241 SDSSAILNEDNSPNAVVSSATAGMLQSHHQI---------LSSPATSLNCYPFQKAAYNN 300

Query: 301 TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
            Q F K+EEHNFFSGEE CN FSDEQAP++HW+
Sbjct: 301 AQQFVKIEEHNFFSGEETCNLFSDEQAPSMHWY 318

BLAST of Cla003103 vs. TrEMBL
Match: M5WIS1_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1)

HSP 1 Score: 398.7 bits (1023), Expect = 7.1e-108
Identity = 222/338 (65.68%), Positives = 259/338 (76.63%), Query Frame = 1

Query: 7   SSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRL 66
           SSDSLGA+ISICP+++ EQSPRN   NHVY  +FQSMLDG +EEGCVEE GHVSEKKRRL
Sbjct: 6   SSDSLGAMISICPSTE-EQSPRN---NHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKRRL 65

Query: 67  SVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 126
           SVEQVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLERD+GVL
Sbjct: 66  SVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFGVL 125

Query: 127 KTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSES-NLSVEEEMVVATDSENALI- 186
           K NY++LKL+Y+ LQ++N+AL+K+I++LKSKLQE+N+ES NLSV+EE +VA D  N  + 
Sbjct: 126 KANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYKVV 185

Query: 187 --EQTK----PEIGDQFSVPPASESQDFNHESFN--NNGGEGEEAAIEEVSLFADFKDGS 246
             E +K    P +G   S  PA+ES++ N ESFN  NNG  G    +E VSLF DFKDGS
Sbjct: 186 DHELSKSPPPPPLG---SSVPATESKELNFESFNNTNNGAVG----LEAVSLFPDFKDGS 245

Query: 247 SDSDSSAILNEDYSPTAGISSPGVLQNHQ------QYHFMTGAVSPAPSAAVKLNYLQFQ 306
           SDSDSSAILNED SP   ISS G+LQNHQ               S +  ++  +N  QFQ
Sbjct: 246 SDSDSSAILNEDNSPNLTISSSGMLQNHQLMKSPASTSLKFNCCSSSSPSSSSMNCFQFQ 305

Query: 307 KGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHW 329
           K Y  Q   F K+EEHNFFS EEAC+FFSDEQAPTL W
Sbjct: 306 KTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 329

BLAST of Cla003103 vs. TrEMBL
Match: A0A067KD47_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18664 PE=4 SV=1)

HSP 1 Score: 396.7 bits (1018), Expect = 2.7e-107
Identity = 224/334 (67.07%), Positives = 253/334 (75.75%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVS 60
           MKR + SSDSLGALISICPTSD E SPRN  SNHVYG EFQSMLDG +EE CVEE+GHVS
Sbjct: 1   MKRLS-SSDSLGALISICPTSD-EHSPRN--SNHVYGREFQSMLDGLDEEACVEEAGHVS 60

Query: 61  EKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE 120
           EKKRRLSV+QVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLE
Sbjct: 61  EKKRRLSVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLE 120

Query: 121 RDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVEEEMVVATDSE 180
           RDYGVLK NYE LK++Y+ LQ+DN+ALLK+IRELK+KL EDN+ESN+SV+EE+++A   E
Sbjct: 121 RDYGVLKANYETLKVNYDALQHDNEALLKEIRELKAKLDEDNAESNVSVKEEIIIAETDE 180

Query: 181 NALIEQTKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSD 240
                  +P I    +    SE++D N+ESFN N        I  VSLF DFKDGSSDSD
Sbjct: 181 KG---SEEPPILTSIA---GSETKDMNYESFNINSSNSNN-GILAVSLFPDFKDGSSDSD 240

Query: 241 SSAILNED-----YSPTAGISSPGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQ 300
           SSAILNED      SP   ISS GV Q+H Q   M     P+ S++      QF K    
Sbjct: 241 SSAILNEDNNNSNNSPNPAISSSGVPQSHNQ--LMMSPSRPSSSSSP----FQFIKTGSY 300

Query: 301 QTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
           QTQ F KMEEHNFFS EEACNFFSDEQAP+L W+
Sbjct: 301 QTQ-FVKMEEHNFFSSEEACNFFSDEQAPSLQWY 316

BLAST of Cla003103 vs. TrEMBL
Match: A0A061DJ94_THECC (Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_001039 PE=4 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 1.6e-104
Identity = 212/333 (63.66%), Positives = 258/333 (77.48%), Query Frame = 1

Query: 7   SSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRL 66
           SSDSLGAL+SICPT+D E SPRN   NH+Y  EFQSMLDG +EEGCVEESGHV+EKKRRL
Sbjct: 6   SSDSLGALMSICPTTD-EHSPRN---NHIYSREFQSMLDGLDEEGCVEESGHVAEKKRRL 65

Query: 67  SVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 126
           SV+QVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLERDYG+L
Sbjct: 66  SVDQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDYGLL 125

Query: 127 KTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVEEEMVVATDSENALIEQ 186
           KT+YE LK++Y+TLQ+DN+ALLK+IRELK+KL  +++ESNLSV+EE++V  +++N  +EQ
Sbjct: 126 KTSYETLKVNYDTLQHDNEALLKEIRELKAKLNGESTESNLSVKEEVIV-HETDNKTLEQ 185

Query: 187 TKPEIGDQFSVPPASESQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSDSDSSAILN 246
           ++P      S+  +SE  + N+ESFNN+ G          +LF D KDGSSDSDSSAILN
Sbjct: 186 SEPP--PVSSLVTSSEPAELNYESFNNSIGS------VGATLFPDLKDGSSDSDSSAILN 245

Query: 247 ED---YSP-TAGISSPGVLQNHQQYHFMTGAV------SPAPSAAVKLNYLQFQKGYQQQ 306
           ED    SP  A ISS GVLQ+ QQ+  M+         + + S+   +N  QF K   Q 
Sbjct: 246 EDNNNCSPNNAAISSSGVLQS-QQHLLMSPTTTSSLNFNSSSSSPSSMNCFQFSKSTYQP 305

Query: 307 TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
           +  + KMEEHNFFS +EACNFFSDEQAP+LHW+
Sbjct: 306 SHQYVKMEEHNFFSADEACNFFSDEQAPSLHWY 324

BLAST of Cla003103 vs. NCBI nr
Match: gi|595826046|ref|XP_007205507.1| (hypothetical protein PRUPE_ppa008318mg [Prunus persica])

HSP 1 Score: 401.7 bits (1031), Expect = 1.2e-108
Identity = 222/338 (65.68%), Positives = 258/338 (76.33%), Query Frame = 1

Query: 7   SSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRL 66
           SSDSLGA+ISICP++  EQSPRN   NHVY  +FQSMLDG +EEGCVEE GHVSEKKRRL
Sbjct: 6   SSDSLGAMISICPSTAEEQSPRN---NHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKRRL 65

Query: 67  SVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 126
           SVEQVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLERD+GVL
Sbjct: 66  SVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFGVL 125

Query: 127 KTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSES-NLSVEEEMVVATDSENALI- 186
           K NY++LKL+Y+ LQ++N+AL+K+I++LKSKLQE+N+ES NLSV+EE +VA D  N  + 
Sbjct: 126 KANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYKVV 185

Query: 187 --EQTK----PEIGDQFSVPPASESQDFNHESFN--NNGGEGEEAAIEEVSLFADFKDGS 246
             E +K    P +G   S  PA+ES++ N ESFN  NNG  G    +E VSLF DFKDGS
Sbjct: 186 DHELSKSPPPPPLG---SSVPATESKELNFESFNNTNNGAVG----LEAVSLFPDFKDGS 245

Query: 247 SDSDSSAILNEDYSPTAGISSPGVLQNHQ------QYHFMTGAVSPAPSAAVKLNYLQFQ 306
           SDSDSSAILNED SP   ISS G+LQNHQ               S +  ++  +N  QFQ
Sbjct: 246 SDSDSSAILNEDNSPNLTISSSGMLQNHQLMKSPASTSLKFNCCSSSSPSSSSMNCFQFQ 305

Query: 307 KGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHW 329
           K Y  Q   F K+EEHNFFS EEAC+FFSDEQAPTL W
Sbjct: 306 KTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of Cla003103 vs. NCBI nr
Match: gi|449451407|ref|XP_004143453.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6 [Cucumis sativus])

HSP 1 Score: 401.4 bits (1030), Expect = 1.6e-108
Identity = 219/333 (65.77%), Positives = 255/333 (76.58%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVS 60
           MKR   SSDSLGAL+S+CPTS+ EQSPRN   +HVYG EFQSMLDG +EEG +EE  HV 
Sbjct: 1   MKRHG-SSDSLGALMSVCPTSE-EQSPRN---SHVYGREFQSMLDGLDEEGSIEEHCHVG 60

Query: 61  EKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE 120
           EKKRRLSV+QVKALEK FE+ENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLE
Sbjct: 61  EKKRRLSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLE 120

Query: 121 RDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVEEEMVVATDSE 180
           RDYG+LK NYE+LK S++TLQ DN ALLK+I+ELKSKL+E+ +ESNLSV+EE+ V ++S+
Sbjct: 121 RDYGLLKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFV-SESD 180

Query: 181 NALIEQTKPEIG-DQFSVPPASE-SQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSD 240
           N LIEQT   +  D  S+P AS+ S DFN+ESF   G +  +    EVSLF DFKDGSSD
Sbjct: 181 NLLIEQTTNHLPVDHISLPVASDHSDDFNYESFRTVGADDGDDQRVEVSLFTDFKDGSSD 240

Query: 241 SDSSAILNEDYSPTAGISS--PGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQ 300
           SDSSAILNED SP A +SS   G+LQ+H Q            S A  LN   FQK     
Sbjct: 241 SDSSAILNEDNSPNAVVSSATAGMLQSHHQI---------LSSPATSLNCYPFQKAAYNN 300

Query: 301 TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
            Q F K+EEHNFFSGEE CN FSDEQAP++HW+
Sbjct: 301 AQQFVKIEEHNFFSGEETCNLFSDEQAPSMHWY 318

BLAST of Cla003103 vs. NCBI nr
Match: gi|659080027|ref|XP_008440572.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo])

HSP 1 Score: 399.4 bits (1025), Expect = 5.9e-108
Identity = 218/333 (65.47%), Positives = 255/333 (76.58%), Query Frame = 1

Query: 1   MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVS 60
           MKR   SSDSLGAL+S+CPTS+ EQSPRN   +HVYG EFQSMLDG +EEG +EE  HV 
Sbjct: 1   MKRHG-SSDSLGALMSVCPTSE-EQSPRN---SHVYGREFQSMLDGLDEEGSIEEHCHVG 60

Query: 61  EKKRRLSVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE 120
           EKKRRLSV+QVKALEK FE+ENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLE
Sbjct: 61  EKKRRLSVDQVKALEKTFEIENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLE 120

Query: 121 RDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSESNLSVEEEMVVATDSE 180
           RDYG+LK NYE+LK S++TLQ DN ALLK+I+ELKSKL+E+ +ESNLSV+EE+ V ++S+
Sbjct: 121 RDYGLLKANYESLKRSFDTLQQDNDALLKEIKELKSKLEEEKTESNLSVKEEIFV-SESD 180

Query: 181 NALIEQTKPEIG-DQFSVPPASE-SQDFNHESFNNNGGEGEEAAIEEVSLFADFKDGSSD 240
           N LIEQT   +  D  S+P AS+ S DF++ESF   G +  +    EVSLF DFKDGSSD
Sbjct: 181 NLLIEQTTNHLPVDHISLPVASDHSDDFDYESFRTVGADDGDDQRVEVSLFPDFKDGSSD 240

Query: 241 SDSSAILNEDYSPTAGISS--PGVLQNHQQYHFMTGAVSPAPSAAVKLNYLQFQKGYQQQ 300
           SDSSAILNED SP A +SS   G+LQ+H Q            S A  LN   FQK     
Sbjct: 241 SDSSAILNEDNSPNAVVSSATAGMLQSHHQI---------LSSPATSLNCFPFQKATYNN 300

Query: 301 TQMFPKMEEHNFFSGEEACNFFSDEQAPTLHWW 330
            Q F K+EEHNFFSGEE CN FSDEQAP++HW+
Sbjct: 301 AQQFVKIEEHNFFSGEETCNLFSDEQAPSMHWY 318

BLAST of Cla003103 vs. NCBI nr
Match: gi|645219318|ref|XP_008235150.1| (PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Prunus mume])

HSP 1 Score: 399.1 bits (1024), Expect = 7.8e-108
Identity = 221/336 (65.77%), Positives = 259/336 (77.08%), Query Frame = 1

Query: 7   SSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRL 66
           SSDSLGA+ISICP++  E SPRN   NHVY  +F SMLDG +EEGCVEE GHVSEKKRRL
Sbjct: 6   SSDSLGAMISICPSTAEEHSPRN---NHVYRRDFHSMLDGLDEEGCVEEGGHVSEKKRRL 65

Query: 67  SVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 126
           SVEQVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLERD+GVL
Sbjct: 66  SVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFGVL 125

Query: 127 KTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSES-NLSVEEEMVVATDSENALI- 186
           K NY++LKL+Y++LQ++N+AL+K+I++LKSKLQE+N+ES NLSV+EE +VA D  N  + 
Sbjct: 126 KANYDSLKLNYDSLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYKVV 185

Query: 187 --EQTK-PEIGDQFSVPPASESQDFNHESFN--NNGGEGEEAAIEEVSLFADFKDGSSDS 246
             E++K P      S  PA+ES++ N ESFN  NNG  G    +E VSLF DFKDGSSDS
Sbjct: 186 DHEKSKSPPPPPPGSSVPATESKELNFESFNNTNNGAVG----LEAVSLFPDFKDGSSDS 245

Query: 247 DSSAILNEDYSPTAGISSPGVLQNHQQYHFMTG-------AVSPAPSAAVKLNYLQFQKG 306
           DSSAILNED SP   ISS G+LQNHQ              + S +PS++  +N  QFQK 
Sbjct: 246 DSSAILNEDNSPNLTISSSGMLQNHQLMKSPASTSLKFNCSSSSSPSSS-SMNCFQFQKT 305

Query: 307 YQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHW 329
           Y  Q   F K+EEHNFFS EEAC+FFSDEQAPTL W
Sbjct: 306 YHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 330

BLAST of Cla003103 vs. NCBI nr
Match: gi|595826040|ref|XP_007205506.1| (hypothetical protein PRUPE_ppa008318mg [Prunus persica])

HSP 1 Score: 398.7 bits (1023), Expect = 1.0e-107
Identity = 222/338 (65.68%), Positives = 259/338 (76.63%), Query Frame = 1

Query: 7   SSDSLGALISICPTSDHEQSPRNKNSNHVYGTEFQSMLDGFEEEGCVEESGHVSEKKRRL 66
           SSDSLGA+ISICP+++ EQSPRN   NHVY  +FQSMLDG +EEGCVEE GHVSEKKRRL
Sbjct: 6   SSDSLGAMISICPSTE-EQSPRN---NHVYRRDFQSMLDGLDEEGCVEEGGHVSEKKRRL 65

Query: 67  SVEQVKALEKNFEVENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLERDYGVL 126
           SVEQVKALEKNFEVENKLEPERKVKLA+ELGLQPRQVAVWFQNRRARWKTKQLERD+GVL
Sbjct: 66  SVEQVKALEKNFEVENKLEPERKVKLAQELGLQPRQVAVWFQNRRARWKTKQLERDFGVL 125

Query: 127 KTNYENLKLSYETLQNDNQALLKQIRELKSKLQEDNSES-NLSVEEEMVVATDSENALI- 186
           K NY++LKL+Y+ LQ++N+AL+K+I++LKSKLQE+N+ES NLSV+EE +VA D  N  + 
Sbjct: 126 KANYDSLKLNYDNLQHENEALVKEIKQLKSKLQEENTESNNLSVKEEQMVAKDQSNYKVV 185

Query: 187 --EQTK----PEIGDQFSVPPASESQDFNHESFN--NNGGEGEEAAIEEVSLFADFKDGS 246
             E +K    P +G   S  PA+ES++ N ESFN  NNG  G    +E VSLF DFKDGS
Sbjct: 186 DHELSKSPPPPPLG---SSVPATESKELNFESFNNTNNGAVG----LEAVSLFPDFKDGS 245

Query: 247 SDSDSSAILNEDYSPTAGISSPGVLQNHQ------QYHFMTGAVSPAPSAAVKLNYLQFQ 306
           SDSDSSAILNED SP   ISS G+LQNHQ               S +  ++  +N  QFQ
Sbjct: 246 SDSDSSAILNEDNSPNLTISSSGMLQNHQLMKSPASTSLKFNCCSSSSPSSSSMNCFQFQ 305

Query: 307 KGYQQQTQMFPKMEEHNFFSGEEACNFFSDEQAPTLHW 329
           K Y  Q   F K+EEHNFFS EEAC+FFSDEQAPTL W
Sbjct: 306 KTYHPQ---FVKIEEHNFFSSEEACSFFSDEQAPTLQW 329

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
ATHB6_ARATH  5.4e-69   48.39         Homeobox-leucine zipper protein ATHB-6 OS=Arabidopsis thaliana GN=ATHB-6 PE=1 SV...
ATB16_ARATH  5.1e-59   45.33         Homeobox-leucine zipper protein ATHB-16 OS=Arabidopsis thaliana GN=ATHB-16 PE=2 ...
ATHB5_ARATH  2.4e-56   43.07         Homeobox-leucine zipper protein ATHB-5 OS=Arabidopsis thaliana GN=ATHB-5 PE=1 SV...
HOX4_ORYSJ   1.2e-39   59.57         Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. japonica GN=HOX4 PE=...
HOX4_ORYSI   1.2e-39   59.57         Homeobox-leucine zipper protein HOX4 OS=Oryza sativa subsp. indica GN=HOX4 PE=1 ...

Match Name        E-value   Identity (%)  Description
M5W009_PRUPE      8.3e-109  65.68         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1
A0A0A0KGQ3_CUCSA  1.1e-108  65.77         Uncharacterized protein OS=Cucumis sativus GN=Csa_6G499720 PE=4 SV=1
M5WIS1_PRUPE      7.1e-108  65.68         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa008318mg PE=4 SV=1
A0A067KD47_JATCU  2.7e-107  67.07         Uncharacterized protein OS=Jatropha curcas GN=JCGZ_18664 PE=4 SV=1
A0A061DJ94_THECC  1.6e-104  63.66         Alanine--glyoxylate aminotransferase 2 isoform 1 OS=Theobroma cacao GN=TCM_00103...

Match Name                        E-value   Identity (%)  Description
gi|595826046|ref|XP_007205507.1|  1.2e-108  65.68         hypothetical protein PRUPE_ppa008318mg [Prunus persica]
gi|449451407|ref|XP_004143453.1|  1.6e-108  65.77         PREDICTED: homeobox-leucine zipper protein ATHB-6 [Cucumis sativus]
gi|659080027|ref|XP_008440572.1|  5.9e-108  65.47         PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Cucumis melo]
gi|645219318|ref|XP_008235150.1|  7.8e-108  65.77         PREDICTED: homeobox-leucine zipper protein ATHB-6-like [Prunus mume]
gi|595826040|ref|XP_007205506.1|  1.0e-107  65.68         hypothetical protein PRUPE_ppa008318mg [Prunus persica]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term        Definition
IPR000047   HTH_motif
IPR001356   Homeobox_dom
IPR003106   Leu_zip_homeo
IPR009057   Homeobox-like_sf
IPR017970   Homeobox_CS
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding
GO:0043565  sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005667      transcription factor complex
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0016740      transferase activity
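
For downstream use, the GO assignments can be grouped by category. This is a minimal sketch that assumes each row is whitespace-separated, with the category and accession as the first two fields and the term name as the remainder; `group_go_terms` is a hypothetical helper, not a database API.

```python
from collections import defaultdict

GO_ROWS = """\
biological_process GO:0006355 regulation of transcription, DNA-templated
cellular_component GO:0005667 transcription factor complex
molecular_function GO:0003677 DNA binding
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0016740 transferase activity
"""

def group_go_terms(text):
    """Group (accession, name) pairs by GO category."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        # The term name may contain spaces, so split only twice.
        category, accession, name = line.split(None, 2)
        grouped[category].append((accession, name))
    return dict(grouped)

go_terms = group_go_terms(GO_ROWS)
for category, terms in go_terms.items():
    print(category, len(terms))
```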

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Cla003103     Cla003103    gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name  Unique Name          Type
Cla003103     Cla003103.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name       Type
Cla003103.1.cds3  Cla003103.1.cds3  CDS
Cla003103.1.cds2  Cla003103.1.cds2  CDS
Cla003103.1.cds1  Cla003103.1.cds1  CDS


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term   IPR Description                      Source   Source Term       Source Description                               Alignment
IPR000047  Helix-turn-helix motif               PRINTS   PR00031           HTHREPRESSR                                      coord: 88..97, score: 1.1E-5; coord: 97..113, score: 1.
IPR001356  Homeobox domain                      PFAM     PF00046           Homeobox                                         coord: 62..115, score: 5.6
IPR001356  Homeobox domain                      SMART    SM00389           HOX_1                                            coord: 60..121, score: 1.0
IPR001356  Homeobox domain                      PROFILE  PS50071           HOMEOBOX_2                                       coord: 57..117, score: 17
IPR003106  Leucine zipper, homeobox-associated  PFAM     PF02183           HALZ                                             coord: 117..159, score: 5.1
IPR009057  Homeodomain-like                     GENE3D   G3DSA:1.10.10.60  -                                                coord: 64..124, score: 8.4
IPR009057  Homeodomain-like                     unknown  SSF46689          Homeodomain-like                                 coord: 47..119, score: 1.21
IPR017970  Homeobox, conserved site             PROSITE  PS00027           HOMEOBOX_1                                       coord: 92..115, scor
None       No IPR available                     unknown  Coil              Coil                                             coord: 130..164, scor
None       No IPR available                     PANTHER  PTHR24326         FAMILY NOT NAMED                                 coord: 26..185, score: 2.6
None       No IPR available                     PANTHER  PTHR24326:SF196   HOMEOBOX-LEUCINE ZIPPER PROTEIN ATHB-16-RELATED  coord: 26..185, score: 2.6
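
The coord values can be used to pull the matching region out of the protein sequence. The sketch below assumes they are 1-based, inclusive residue positions on the protein listed for this feature (the page does not state this); `slice_region` is a hypothetical helper, and only the first 160 residues of the protein are embedded to keep the example short.

```python
# First 160 residues of the Cla003103 protein sequence, copied from the record.
PROTEIN_N_TERM = (
    "MKRPAVSSDSLGALISICPTSDHEQSPRNKNSNHVYGTEF"
    "QSMLDGFEEEGCVEESGHVSEKKRRLSVEQVKALEKNFEV"
    "ENKLEPERKVKLARELGLQPRQVAVWFQNRRARWKTKQLE"
    "RDYGVLKTNYENLKLSYETLQNDNQALLKQIRELKSKLQE"
)

# Selected hits from the InterPro table: (start, end) as listed under coord.
DOMAINS = {
    "PF00046 Homeobox": (62, 115),
    "PS00027 HOMEOBOX_1": (92, 115),
    "PF02183 HALZ": (117, 159),
}

def slice_region(sequence, start, end):
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"coordinates {start}..{end} outside sequence")
    return sequence[start - 1:end]

regions = {name: slice_region(PROTEIN_N_TERM, *coords)
           for name, coords in DOMAINS.items()}
for name, residues in regions.items():
    print(name, len(residues), residues)
```

Read this way, the Pfam homeobox hit ends at the WFQNRRARWK stretch that is conserved across the BLAST alignments, and the HALZ hit starts two residues after it.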