Cla97C05G109140 (gene) Watermelon (97103) v2

Name: Cla97C05G109140
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: trihelix transcription factor ASIL2-like
Location: Cla97Chr05 : 35686536 .. 35688481 (+)
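
The location field can be turned into a span length with a small helper. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on the page); `parse_location` is a hypothetical name, not part of any database API.

```python
import re

# Pattern for "<seqid> : <start> .. <end> (<strand>)" as displayed in this record.
LOCATION_RE = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Parse a location string and compute the span length.

    Assumption: start and end are 1-based and inclusive, so the
    length is end - start + 1.
    """
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }


loc = parse_location("Cla97Chr05 : 35686536 .. 35688481 (+)")
print(loc["length"])  # 1946
```

Under that assumption the gene spans 1946 bp on the plus strand of Cla97Chr05.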
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATGAAAGGCAGTCCCTCATCCCCACCTACCTCACACACCTCCCCTTCTCTTCTATTTAACCACCACCACCACCACCACCACCAGCTTCCCCCTGCCGCCGCCGAAGACAACCCATCTCCCAAAAAAACCCCTGTTTCCACGGGGGGAGGCGGAGACCGGCTAAAACGAGATGAATGGAGCGAAGGCGCGGTGTCCAGCCTCCTAGAAGCCTACGAATCAAAATGGGTACTACGAAACAGAGCAAAATTGAAAGGGCATGATTGGGAAGATGTGGCCCGCCATGTCTCTTCAAGAGCTAATTTTACCAAATCTCCCAAAACCCAAACTCAGTGTAAGAATAAAATCGAGTCCATGAAAAAAAGGTACCGTTCTGAATCTGCCTCCGCCGCTTCTTCCTGGCCTTTGTACCACCGTCTTCATCTCTTGCTCCGGGGAAACGCTACGCTCACTCCACCACCGCCGCCATCCTCTCACTCTCCACCACCAGTTATTCTCCTCGATCCTCCGCCTCCGCCTCCGCCTCCACCTCCACCTCCACCTCCACCGCCGTTTCTTCCGCCCCAAAACTCCCATGGATCCAATGGTGTTGATAGGATTAATCCTAAGGTATCAACTTCTTCTTTAATTCCCCCCCCCCCCCCCCCCACCTTTTTTTTTTCTAATTTATTATGGATCCAACACTTTTATTGGATTTCGCTTTGCTCCCCTTTACTCTTTCTTTTTAGAAAAAAATTATCTTCCCCTTTATGGATTTTAATTATAAATTGCATAGTGCCATATGTTTGTATGGTTTTTGTAGATATAGTTTAAATATAGTCTTTGTTGTGGGAAAAGAAGCTTTTTCTTCACACCTTATTAGAATTCAATTCTTCCAAATTTCTTTATACTTACCTTAAGTATATTTATTCAAACTTTTTTAGTTTTAGTTGTTAATTATTTGAATCTTTTTTCTTTTACTTTTTAAATATAAAGTTTATAAATAAACAATACTTCTATCCATCCGGACTAAAATTCTGAACTTAAAAAGAAAAAAAAAATCTCGTTTGGTGATCAATTTATTTATTTTCCAAAATTAAAATAATAAACACTGTTTCCATCTATAACCTTTTTTGTTTTGTTCTTAACTTTAATTTTGAAAACTAAAAAAAATATATAGATTTTAAAAGTTTTTATTTTTAAGATTTTGACTAAAAATTAAGTGTGTATTAAAAAAAAAAAAAAAAAAAAAAAAAAAGATAAGAAAGAAAGAGTGAGAAGTACAGTATGGAATTTAGAAGAAAAGTTGTGTAATGTGATTGCCTCTTTAATAACAACAGTTAGATGATTCGAAAATGAGTTGCACACCTTCTCATAACCAATGGATAAAAATAAAAGATATTTTTGGAGTTTGTGTGTCTCATACATCCAAAGTATAAAAGTAGGAACTAGATTTTGTTGAATAGTTTTTTTGTGACTTTGTTCTAGATAAAAATGAAAGTAGGGAAATTAAATGATTGTAGGAAGATGGAGTTGATAATGGAAGAGGAGATGAATCAGATGAATTATCAGAGAAGAATAAGAAGATGGTAACAGAGACTGACAGTAGTACACCGGCAATAGTATATAGTGAAAAAGAAAAGGTAGCAATGAGGCCAAAACAACAAACAAAAATGAAAAACAACAAGAAGAAAAAGGGGACGAGGCTGTTGACAACAGAGGATTCGTTGGAACAGATCGCCGGGAGTATACGGTGGTTGGCCGAGGTCGTGGTGCGATCGGAACAAGCCAGAATGGAGATGATAAAGGATATAGAAAAGATGAGAGCTGAAGCAGAGGCTAAAAGAGGGGAAATGGATCTCAAAAGAACACAAATCATTGCAAATACCCAATTGGAGATTGCTAAGCTCTTTGCATCTTCTACCAAACCTCTTGATTCTTCACTAAGGATTGGTAGAACTTAA

mRNA sequence

ATGGAGATGAAAGGCAGTCCCTCATCCCCACCTACCTCACACACCTCCCCTTCTCTTCTATTTAACCACCACCACCACCACCACCACCAGCTTCCCCCTGCCGCCGCCGAAGACAACCCATCTCCCAAAAAAACCCCTGTTTCCACGGGGGGAGGCGGAGACCGGCTAAAACGAGATGAATGGAGCGAAGGCGCGGTGTCCAGCCTCCTAGAAGCCTACGAATCAAAATGGGTACTACGAAACAGAGCAAAATTGAAAGGGCATGATTGGGAAGATGTGGCCCGCCATGTCTCTTCAAGAGCTAATTTTACCAAATCTCCCAAAACCCAAACTCAGTGTAAGAATAAAATCGAGTCCATGAAAAAAAGGTACCGTTCTGAATCTGCCTCCGCCGCTTCTTCCTGGCCTTTGTACCACCGTCTTCATCTCTTGCTCCGGGGAAACGCTACGCTCACTCCACCACCGCCGCCATCCTCTCACTCTCCACCACCAGTTATTCTCCTCGATCCTCCGCCTCCGCCTCCGCCTCCACCTCCACCTCCACCTCCACCGCCGTTTCTTCCGCCCCAAAACTCCCATGGATCCAATGGTGTTGATAGGATTAATCCTAAGGAAGATGGAGTTGATAATGGAAGAGGAGATGAATCAGATGAATTATCAGAGAAGAATAAGAAGATGGTAACAGAGACTGACAGTAGTACACCGGCAATAGTATATAGTGAAAAAGAAAAGGTAGCAATGAGGCCAAAACAACAAACAAAAATGAAAAACAACAAGAAGAAAAAGGGGACGAGGCTGTTGACAACAGAGGATTCGTTGGAACAGATCGCCGGGAGTATACGGTGGTTGGCCGAGGTCGTGGTGCGATCGGAACAAGCCAGAATGGAGATGATAAAGGATATAGAAAAGATGAGAGCTGAAGCAGAGGCTAAAAGAGGGGAAATGGATCTCAAAAGAACACAAATCATTGCAAATACCCAATTGGAGATTGCTAAGCTCTTTGCATCTTCTACCAAACCTCTTGATTCTTCACTAAGGATTGGTAGAACTTAA

Coding sequence (CDS)

ATGGAGATGAAAGGCAGTCCCTCATCCCCACCTACCTCACACACCTCCCCTTCTCTTCTATTTAACCACCACCACCACCACCACCACCAGCTTCCCCCTGCCGCCGCCGAAGACAACCCATCTCCCAAAAAAACCCCTGTTTCCACGGGGGGAGGCGGAGACCGGCTAAAACGAGATGAATGGAGCGAAGGCGCGGTGTCCAGCCTCCTAGAAGCCTACGAATCAAAATGGGTACTACGAAACAGAGCAAAATTGAAAGGGCATGATTGGGAAGATGTGGCCCGCCATGTCTCTTCAAGAGCTAATTTTACCAAATCTCCCAAAACCCAAACTCAGTGTAAGAATAAAATCGAGTCCATGAAAAAAAGGTACCGTTCTGAATCTGCCTCCGCCGCTTCTTCCTGGCCTTTGTACCACCGTCTTCATCTCTTGCTCCGGGGAAACGCTACGCTCACTCCACCACCGCCGCCATCCTCTCACTCTCCACCACCAGTTATTCTCCTCGATCCTCCGCCTCCGCCTCCGCCTCCACCTCCACCTCCACCTCCACCGCCGTTTCTTCCGCCCCAAAACTCCCATGGATCCAATGGTGTTGATAGGATTAATCCTAAGGAAGATGGAGTTGATAATGGAAGAGGAGATGAATCAGATGAATTATCAGAGAAGAATAAGAAGATGGTAACAGAGACTGACAGTAGTACACCGGCAATAGTATATAGTGAAAAAGAAAAGGTAGCAATGAGGCCAAAACAACAAACAAAAATGAAAAACAACAAGAAGAAAAAGGGGACGAGGCTGTTGACAACAGAGGATTCGTTGGAACAGATCGCCGGGAGTATACGGTGGTTGGCCGAGGTCGTGGTGCGATCGGAACAAGCCAGAATGGAGATGATAAAGGATATAGAAAAGATGAGAGCTGAAGCAGAGGCTAAAAGAGGGGAAATGGATCTCAAAAGAACACAAATCATTGCAAATACCCAATTGGAGATTGCTAAGCTCTTTGCATCTTCTACCAAACCTCTTGATTCTTCACTAAGGATTGGTAGAACTTAA

Protein sequence

MEMKGSPSSPPTSHTSPSLLFNHHHHHHHQLPPAAAEDNPSPKKTPVSTGGGGDRLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIESMKKRYRSESASAASSWPLYHRLHLLLRGNATLTPPPPPSSHSPPPVILLDPPPPPPPPPPPPPPPPFLPPQNSHGSNGVDRINPKEDGVDNGRGDESDELSEKNKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLDSSLRIGRT
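
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first and last codons of the CDS are used as test input to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 12 codons and last 6 codons of the CDS listed above.
cds_start = "ATGGAGATGAAAGGCAGTCCCTCATCCCCACCTACC"
cds_end = "AGGATTGGTAGAACTTAA"

print(translate(cds_start))  # MEMKGSPSSPPT
print(translate(cds_end))    # RIGRT
```

Both fragments agree with the start (MEMKGSPSSPPT) and end (RIGRT) of the protein sequence in this record.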
BLAST of Cla97C05G109140 vs. NCBI nr
Match: XP_022988920.1 (trihelix transcription factor ASIL1-like isoform X1 [Cucurbita maxima])

HSP 1 Score: 419.9 bits (1078), Expect = 8.8e-114
Identity = 260/296 (87.84%), Positives = 275/296 (92.91%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVS+LLEAYESKWVLRNRAKLKGHDWEDVARHVSSRA+FTKSPKTQTQCK
Sbjct: 221 RLKRDEWSEGAVSTLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRADFTKSPKTQTQCK 280

Query: 115 NKIESMKKRYRSESASAASSWPLYHRLHLLLRGNATXXXXXXXXXXXXXXXXXXXXXXXX 174
           NKIESMKKRYRSESASAASSWPLYHRLHLLLRG           XXXXXXXXXXXXXXXX
Sbjct: 281 NKIESMKKRYRSESASAASSWPLYHRLHLLLRG-----------XXXXXXXXXXXXXXXX 340

Query: 175 XXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSEKNKKMVTETDSST 234
           XXXXXXXXXXXX    NSHGSNGVDRINPKEDGVDNGR +ESDELSE++KKMV ETDSST
Sbjct: 341 XXXXXXXXXXXXLPAQNSHGSNGVDRINPKEDGVDNGRRNESDELSERSKKMVIETDSST 400

Query: 235 PAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRWLAEVVVRSEQAR 294
           PAIVYS+K+KV+MRPKQQTKMKN+KKK  +  L++EDSLEQIAGSIRWLA+VVVRSEQAR
Sbjct: 401 PAIVYSDKDKVSMRPKQQTKMKNSKKKNKSTRLSSEDSLEQIAGSIRWLAKVVVRSEQAR 460

Query: 295 MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLDSSLRIGRT 351
           MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFAS+TKP+DSSLRIGRT
Sbjct: 461 MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASATKPIDSSLRIGRT 505

BLAST of Cla97C05G109140 vs. NCBI nr
Match: XP_023526851.1 (trihelix transcription factor ASIL1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 419.5 bits (1077), Expect = 1.1e-113
Identity = 261/296 (88.18%), Positives = 272/296 (91.89%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVS+LLEAYESKWVLRNRAKLKGHDWEDVARHVSSRA+FTKSPKTQTQCK
Sbjct: 54  RLKRDEWSEGAVSTLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRADFTKSPKTQTQCK 113

Query: 115 NKIESMKKRYRSESASAASSWPLYHRLHLLLRGNATXXXXXXXXXXXXXXXXXXXXXXXX 174
           NKIESMKKRYRSESASAASSWPLYHRLHLLLRG           XXXXXXXXXXXXXXXX
Sbjct: 114 NKIESMKKRYRSESASAASSWPLYHRLHLLLRG-----------XXXXXXXXXXXXXXXX 173

Query: 175 XXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSEKNKKMVTETDSST 234
           XXXXXXXXXXX     NS GSNGVDRINPKEDGVDNGR DESDELSEK+KKMV ETDSST
Sbjct: 174 XXXXXXXXXXXFLPAQNSLGSNGVDRINPKEDGVDNGRRDESDELSEKSKKMVIETDSST 233

Query: 235 PAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRWLAEVVVRSEQAR 294
           PAIVYS+K+KV+MRPKQ TKMKN+KKK  +  L+TEDSLEQIAGSIRWLAEVVVRSEQAR
Sbjct: 234 PAIVYSDKDKVSMRPKQPTKMKNSKKKNKSTRLSTEDSLEQIAGSIRWLAEVVVRSEQAR 293

Query: 295 MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLDSSLRIGRT 351
           MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFAS+TKP+DSSLRIGRT
Sbjct: 294 MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASATKPIDSSLRIGRT 338

BLAST of Cla97C05G109140 vs. NCBI nr
Match: XP_022957512.1 (trihelix transcription factor ASIL2 [Cucurbita moschata])

HSP 1 Score: 416.4 bits (1069), Expect = 9.7e-113
Identity = 260/296 (87.84%), Positives = 270/296 (91.22%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVS+LLEAYESKWVLRNRAKLKGHDWEDVARHVSSRA+FTKSPKTQTQCK
Sbjct: 55  RLKRDEWSEGAVSTLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRADFTKSPKTQTQCK 114

Query: 115 NKIESMKKRYRSESASAASSWPLYHRLHLLLRGNATXXXXXXXXXXXXXXXXXXXXXXXX 174
           NKIESMKKRYRSESASAASSWPLYHRLHLLLRG           XXXXXXXXXXXXXXXX
Sbjct: 115 NKIESMKKRYRSESASAASSWPLYHRLHLLLRG-----------XXXXXXXXXXXXXXXX 174

Query: 175 XXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSEKNKKMVTETDSST 234
           XXXXXXXXXXXX    NSHGSNG DRINPKEDGVDNGR DESDELSEK+KKMV ETDSST
Sbjct: 175 XXXXXXXXXXXXLPAQNSHGSNGGDRINPKEDGVDNGRRDESDELSEKSKKMVIETDSST 234

Query: 235 PAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRWLAEVVVRSEQAR 294
           PAIVYS+K+KV+MRPKQQTKMKN KKK  +  L+ EDSLEQIAGSIRWLAEVVVRSEQAR
Sbjct: 235 PAIVYSDKDKVSMRPKQQTKMKNTKKKNKSTRLSAEDSLEQIAGSIRWLAEVVVRSEQAR 294

Query: 295 MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLDSSLRIGRT 351
           MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFAS+T P+DSS RIGRT
Sbjct: 295 MEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASATNPIDSSPRIGRT 339

BLAST of Cla97C05G109140 vs. NCBI nr
Match: XP_010067722.1 (PREDICTED: trihelix transcription factor ASIL2 [Eucalyptus grandis] >KCW65902.1 hypothetical protein EUGRSUZ_G03225 [Eucalyptus grandis])

HSP 1 Score: 284.3 bits (726), Expect = 5.7e-73
Identity = 219/307 (71.34%), Positives = 241/307 (78.50%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVSSLLEAYE+KWVLRNRAKLKGHDWEDVARHVSSRAN TKSPKTQTQCK
Sbjct: 46  RLKRDEWSEGAVSSLLEAYETKWVLRNRAKLKGHDWEDVARHVSSRANSTKSPKTQTQCK 105

Query: 115 NKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNA-----------TXXXXXXXXXXXX 174
           NKIESMKKRYR+ESA+A  SSWPLY RL +LLRG A            XXXXXXXXXXXX
Sbjct: 106 NKIESMKKRYRTESATADTSSWPLYPRLDMLLRGTAPPQLPLAPPXXXXXXXXXXXXXXX 165

Query: 175 XXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSEK 234
           XXXXXXXXXXXXXXXXXXXXXXXXXX  NSHGSNGVD++ PKEDG        SD++S+K
Sbjct: 166 XXXXXXXXXXXXXXXXXXXXXXXXXXAQNSHGSNGVDKL-PKEDGAGT---KLSDQVSDK 225

Query: 235 NKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRW 294
           N     ETDSSTPA+ YS+K+K +   K +T M     K              IA SIRW
Sbjct: 226 NP---VETDSSTPAL-YSDKQKSSRSKKHKTVM-----KMXXXXXXXXREEYGIAESIRW 285

Query: 295 LAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLD 350
           LAEV+VRSEQ+RME +K++EKMR EAEA+RGEMDLKRT+IIANTQLEIAKLFA S K +D
Sbjct: 286 LAEVIVRSEQSRMETMKELEKMRIEAEARRGEMDLKRTEIIANTQLEIAKLFAGSNKGID 339

BLAST of Cla97C05G109140 vs. NCBI nr
Match: XP_022966869.1 (uncharacterized protein LOC111466444 [Cucurbita maxima])

HSP 1 Score: 283.5 bits (724), Expect = 9.8e-73
Identity = 209/302 (69.21%), Positives = 228/302 (75.50%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAV++LLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK
Sbjct: 43  RLKRDEWSEGAVATLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 102

Query: 115 NKIESMKKRYRSESASAAS------SWPLYHRLHLLLRGNATXXXXXXXXXXXXXXXXXX 174
           NKIESMKKRYRSESASA        SWPLYHRL L         XXXXXXXXXXXXXXXX
Sbjct: 103 NKIESMKKRYRSESASAVDAXXXXXSWPLYHRLDL---------XXXXXXXXXXXXXXXX 162

Query: 175 XXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGR-GDESDELSEKNKKMV 234
           XXXXXXXXXXXXXXXXX     N  GSNGVD I PKEDGVD  R  D+ ++   K+ K+V
Sbjct: 163 XXXXXXXXXXXXXXXXXFTATLNCLGSNGVDGIIPKEDGVDETRVSDKEEKNKNKSNKVV 222

Query: 235 TETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRWLAEVV 294
            ETDSSTPA+ YS+ EK+     +                     L++IA SIRWLAEVV
Sbjct: 223 LETDSSTPAMPYSDNEKLR---SKXXXXXXXXXXXXXXXXXXXXXLDEIASSIRWLAEVV 282

Query: 295 VRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLDSSLRI 350
            RSEQ RME ++D+E+MRAEAEAKRGEMDLKRT+IIANTQLEIAKLFA+  K  DSS RI
Sbjct: 283 TRSEQTRMETMRDMERMRAEAEAKRGEMDLKRTEIIANTQLEIAKLFAAGGKAADSSSRI 332

BLAST of Cla97C05G109140 vs. TrEMBL
Match: tr|A0A059BI69|A0A059BI69_EUCGR (Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_G03225 PE=4 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 3.8e-73
Identity = 219/307 (71.34%), Positives = 241/307 (78.50%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVSSLLEAYE+KWVLRNRAKLKGHDWEDVARHVSSRAN TKSPKTQTQCK
Sbjct: 46  RLKRDEWSEGAVSSLLEAYETKWVLRNRAKLKGHDWEDVARHVSSRANSTKSPKTQTQCK 105

Query: 115 NKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNA-----------TXXXXXXXXXXXX 174
           NKIESMKKRYR+ESA+A  SSWPLY RL +LLRG A            XXXXXXXXXXXX
Sbjct: 106 NKIESMKKRYRTESATADTSSWPLYPRLDMLLRGTAPPQLPLAPPXXXXXXXXXXXXXXX 165

Query: 175 XXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSEK 234
           XXXXXXXXXXXXXXXXXXXXXXXXXX  NSHGSNGVD++ PKEDG        SD++S+K
Sbjct: 166 XXXXXXXXXXXXXXXXXXXXXXXXXXAQNSHGSNGVDKL-PKEDGAGT---KLSDQVSDK 225

Query: 235 NKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRW 294
           N     ETDSSTPA+ YS+K+K +   K +T M     K              IA SIRW
Sbjct: 226 NP---VETDSSTPAL-YSDKQKSSRSKKHKTVM-----KMXXXXXXXXREEYGIAESIRW 285

Query: 295 LAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPLD 350
           LAEV+VRSEQ+RME +K++EKMR EAEA+RGEMDLKRT+IIANTQLEIAKLFA S K +D
Sbjct: 286 LAEVIVRSEQSRMETMKELEKMRIEAEARRGEMDLKRTEIIANTQLEIAKLFAGSNKGID 339

BLAST of Cla97C05G109140 vs. TrEMBL
Match: tr|B9GLZ4|B9GLZ4_POPTR (Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_001G028400v3 PE=4 SV=2)

HSP 1 Score: 281.6 bits (719), Expect = 2.5e-72
Identity = 224/316 (70.89%), Positives = 247/316 (78.16%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVS+LLEAYESKW+LRNRAKLKGHDWEDVARHVSSRAN TKSPKTQTQCK
Sbjct: 52  RLKRDEWSEGAVSTLLEAYESKWILRNRAKLKGHDWEDVARHVSSRANCTKSPKTQTQCK 111

Query: 115 NKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNA--------------------TXXX 174
           NKIESMKKRYRSESA+A ASSWPLY RL LLLRGN+                     XXX
Sbjct: 112 NKIESMKKRYRSESATADASSWPLYPRLDLLLRGNSXXXXXXXXXXXXXXXXXXXXXXXX 171

Query: 175 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRG 234
           XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDR   KEDGVD    
Sbjct: 172 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDR-GQKEDGVDT--- 231

Query: 235 DESDELSEKNKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSL 294
             S+ +S+KN   V  TDSSTPA+ YS+K+K      ++ KM+  ++K G R        
Sbjct: 232 KLSNHVSDKNAMEV--TDSSTPAL-YSDKKKTR---SKKLKMRKERRKWGKR------EE 291

Query: 295 EQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKL 350
            +IA SIRWLAEVVVRSEQARM+ ++++EKMR EAEAKRGEMDLKRT+IIA TQLEIAKL
Sbjct: 292 WEIADSIRWLAEVVVRSEQARMDTMREVEKMRIEAEAKRGEMDLKRTEIIAKTQLEIAKL 351

BLAST of Cla97C05G109140 vs. TrEMBL
Match: tr|A0A1J7IPA2|A0A1J7IPA2_LUPAN (Uncharacterized protein OS=Lupinus angustifolius OX=3871 GN=TanjilG_19222 PE=4 SV=1)

HSP 1 Score: 275.8 bits (704), Expect = 1.4e-70
Identity = 214/310 (69.03%), Positives = 242/310 (78.06%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVS+LLEAYE+KWVLRNRAKLKG DWEDVA++VSSRAN TKSPKTQTQCK
Sbjct: 37  RLKRDEWSEGAVSTLLEAYEAKWVLRNRAKLKGQDWEDVAKYVSSRANSTKSPKTQTQCK 96

Query: 115 NKIESMKKRYRSESA-SAASSWPLYHRLHLLLRGNA-------------TXXXXXXXXXX 174
           NKIESMKKRYRSESA S AS+WPLY RL LLLRG                XXXXXXXXXX
Sbjct: 97  NKIESMKKRYRSESATSDASTWPLYSRLDLLLRGTGPVSSXXXXXXXXXXXXXXXXXXXX 156

Query: 175 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELS 234
           XXXXXXXXXXXXXXXXXXXXXXXXXXX   NSHGSNGVDR+  KEDG+       SD++S
Sbjct: 157 XXXXXXXXXXXXXXXXXXXXXXXXXXXTAQNSHGSNGVDRL-AKEDGLGT---KSSDQVS 216

Query: 235 EKNKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSI 294
            KN     +TDSSTPA+ YSEK+ V    K++ KM +NK+++   +         IA S+
Sbjct: 217 NKN---TLDTDSSTPAL-YSEKDNVRFN-KKKMKMDSNKRQRKEHM--------DIAESL 276

Query: 295 RWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKP 351
           RWLAEVVVRSEQ RM+ +K+IE+MR EAEAKR EMDLKRT+IIANTQLEIAK+FAS  K 
Sbjct: 277 RWLAEVVVRSEQTRMDTMKEIERMRVEAEAKRSEMDLKRTEIIANTQLEIAKIFASVNKG 329

BLAST of Cla97C05G109140 vs. TrEMBL
Match: tr|A0A061FSU0|A0A061FSU0_THECC (Sequence-specific DNA binding transcription factors OS=Theobroma cacao OX=3641 GN=TCM_045048 PE=4 SV=1)

HSP 1 Score: 272.7 bits (696), Expect = 1.1e-69
Identity = 204/318 (64.15%), Positives = 226/318 (71.07%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVSSLLEAYE+KWVLRNRAKLKGHDWEDVAR+VS+RAN TKSPKTQTQCK
Sbjct: 52  RLKRDEWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARYVSARANCTKSPKTQTQCK 111

Query: 115 NKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNA---------------------TXX 174
           NKIESMKKRYRSESA+A  SSWPLY RL LLLRG+A                      XX
Sbjct: 112 NKIESMKKRYRSESATADGSSWPLYPRLDLLLRGSAPPPXQPPLQLQPPSAVPQAXXXXX 171

Query: 175 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGR 234
           XXXXXXXXXXX           XXXXXXXXXXXX    NSHGSNGVDRI PKEDG     
Sbjct: 172 XXXXXXXXXXXMVVVLQHQQPPXXXXXXXXXXXXGTAQNSHGSNGVDRI-PKEDGAGT-- 231

Query: 235 GDESDELSEKNKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDS 294
              SD LS+   K+  ETDSSTPA+ YS+KEK+                           
Sbjct: 232 -KLSDHLSD---KVAMETDSSTPAL-YSDKEKL---------XXXXXXXXXXXXXXXXXX 291

Query: 295 LEQIAGSIRWLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAK 351
             +IA SIRWLAEVV++SEQARME +++IEKMR EAEAKRGEMDLKRT+I+ANTQLEIA+
Sbjct: 292 XWEIAESIRWLAEVVLKSEQARMETMREIEKMRVEAEAKRGEMDLKRTEILANTQLEIAR 351

BLAST of Cla97C05G109140 vs. TrEMBL
Match: tr|A0A2N9HTC7|A0A2N9HTC7_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS45179 PE=4 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 5.7e-69
Identity = 211/309 (68.28%), Positives = 233/309 (75.40%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVSSLLEAYE+KWVLRNRAKLKGHDWEDVARHVS RAN TKSPKTQTQCK
Sbjct: 47  RLKRDEWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARHVSLRANSTKSPKTQTQCK 106

Query: 115 NKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNA------------TXXXXXXXXXXX 174
           NKIESMKKRYRSESA+A ASSWPLY RL LLLRG+              XXXXXXXXXXX
Sbjct: 107 NKIESMKKRYRSESATADASSWPLYPRLDLLLRGSGPVXXXXXXXXXXXXXXXXXXXXXX 166

Query: 175 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSE 234
           XXXXXXXXXXXXXXXXXXXXXXXXX    NSHGSNGVDR+  KEDGV       SD++S+
Sbjct: 167 XXXXXXXXXXXXXXXXXXXXXXXXXGATQNSHGSNGVDRL-AKEDGVGT---KLSDQVSD 226

Query: 235 KNKKMVTETDSSTPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIR 294
           KN     ETDSSTPA+  S+K+K+                             +IA SIR
Sbjct: 227 KNN---LETDSSTPALYNSDKDKLXXXXXXXXXXXXXXL--------------EIAESIR 286

Query: 295 WLAEVVVRSEQARMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASSTKPL 351
           WLAEVVVRSEQ RM+ +++IE+MR EAEAKRGEMDLKRT+I+ANTQLEIA+LFA   K +
Sbjct: 287 WLAEVVVRSEQTRMDTMREIERMRVEAEAKRGEMDLKRTEILANTQLEIARLFAGIGKGV 334

BLAST of Cla97C05G109140 vs. Swiss-Prot
Match: sp|Q9LJG8|ASIL2_ARATH (Trihelix transcription factor ASIL2 OS=Arabidopsis thaliana OX=3702 GN=ASIL2 PE=1 SV=1)

HSP 1 Score: 68.9 bits (167), Expect = 1.2e-10
Identity = 36/96 (37.50%), Positives = 57/96 (59.38%), Query Frame = 0

Query: 59  DEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIE 118
           D WSE A + L++A+  +++  +R  LK   W++VA  VSSR ++ K PKT  QCKN+I+
Sbjct: 82  DCWSEAATAVLIDAWGERYLELSRGNLKQKHWKEVAEIVSSREDYGKIPKTDIQCKNRID 141

Query: 119 SMKKRYRSESASAA-----SSWPLYHRLHLLLRGNA 150
           ++KK+Y+ E    A     S W  + +L  L+   A
Sbjct: 142 TVKKKYKQEKVRIANGGGRSRWVFFDKLDRLIGSTA 177

BLAST of Cla97C05G109140 vs. Swiss-Prot
Match: sp|Q9SYG2|ASIL1_ARATH (Trihelix transcription factor ASIL1 OS=Arabidopsis thaliana OX=3702 GN=ASIL1 PE=1 SV=1)

HSP 1 Score: 61.6 bits (148), Expect = 2.0e-08
Identity = 34/97 (35.05%), Positives = 54/97 (55.67%), Query Frame = 0

Query: 59  DEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIE 118
           D WSE A   L+EA+  ++    +  LK   W++VA  + +++   K PKT  QCKN+I+
Sbjct: 92  DCWSEEATKVLIEAWGDRFSEPGKGTLKQQHWKEVA-EIVNKSRQCKYPKTDIQCKNRID 151

Query: 119 SMKKRYRSESASAA-----SSWPLYHRLHLLLRGNAT 151
           ++KK+Y+ E A  A     S W  + +L  L+ G  T
Sbjct: 152 TVKKKYKQEKAKIASGDGPSKWVFFKKLESLIGGTTT 187

BLAST of Cla97C05G109140 vs. TAIR10
Match: AT3G54390.1 (sequence-specific DNA binding transcription factors)

HSP 1 Score: 244.2 bits (622), Expect = 1.2e-64
Identity = 161/301 (53.49%), Positives = 187/301 (62.13%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           RLKRDEWSEGAVS+LLEAYESKWVLRNRAKLKG DWEDVA+HVSSRA  TKSPKTQTQCK
Sbjct: 32  RLKRDEWSEGAVSTLLEAYESKWVLRNRAKLKGQDWEDVAKHVSSRATHTKSPKTQTQCK 91

Query: 115 NKIESMKKRYRSESASA-ASSWPLYHRLHLLLRGNATXXXXXXXXXXXXXXXXXXXXXXX 174
           NKIESMKKRYRSESA+A  SSWPLY RL  LLRG                          
Sbjct: 92  NKIESMKKRYRSESATADGSSWPLYPRLDHLLRGTQPQPQPQAVLPLNCSVPLLLLEPPL 151

Query: 175 XXXXXXXXXXXXXXXXXNSHGSNGVDRINPKEDGVDNGRGDESDELSEKNKKMVTETDSS 234
                             S+GSNGV +I PKEDG              +NK         
Sbjct: 152 PAVAHPPQI---------SYGSNGVGKI-PKEDG-----------FKPENK--------- 211

Query: 235 TPAIVYSEKEKVAMRPKQQTKMKNNKKKKGTRLLTTEDSLEQIAGSIRWLAEVVVRSEQA 294
                          P  +TK++  K K+       ++  E+IAGSIRWLAEVV+RSE+A
Sbjct: 212 --XXXXXXXXXXXXXPVVKTKVRGKKVKR-----RYKEEKEEIAGSIRWLAEVVMRSERA 271

Query: 295 RMEMIKDIEKMRAEAEAKRGEMDLKRTQIIANTQLEIAKLFASS-----TKPLDSSLRIG 350
           RME +K+IE+MRAEAEAKRGE+DLKRT+I+ANTQLEIA++FA++      K +DSSLRIG
Sbjct: 272 RMETMKEIERMRAEAEAKRGELDLKRTEIMANTQLEIARIFAAAASSGQNKGVDSSLRIG 295

BLAST of Cla97C05G109140 vs. TAIR10
Match: AT3G10030.1 (aspartate/glutamate/uridylate kinase family protein)

HSP 1 Score: 79.0 bits (193), Expect = 6.6e-15
Identity = 41/93 (44.09%), Positives = 58/93 (62.37%), Query Frame = 0

Query: 55  RLKRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK 114
           R  R+EWS+ A++ LL+AY  K+   NR  L+G DWE+VA  VS R    K  K+  QCK
Sbjct: 155 RKDREEWSDAAIACLLDAYSDKFTQLNRGNLRGRDWEEVASSVSERCE--KLSKSVEQCK 214

Query: 115 NKIESMKKRYR------SESASAASSWPLYHRL 142
           NKI+++KKRY+      S   +AAS WP + ++
Sbjct: 215 NKIDNLKKRYKLERHRMSSGGTAASHWPWFKKM 245

BLAST of Cla97C05G109140 vs. TAIR10
Match: AT5G05550.2 (sequence-specific DNA binding transcription factors)

HSP 1 Score: 75.9 bits (185), Expect = 5.6e-14
Identity = 36/90 (40.00%), Positives = 61/90 (67.78%), Query Frame = 0

Query: 57  KRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNK 116
           + D WSE A ++L+EA+ +++V  N   L+ +DW+DVA  V+SR       KT  QCKN+
Sbjct: 20  REDWWSEEATATLVEAWGNRYVKLNHGNLRQNDWKDVADAVNSRHGDNSRKKTDLQCKNR 79

Query: 117 IESMKKRYRSESAS-AASSWPLYHRLHLLL 146
           ++++KK+Y++E A  + S+W  Y+RL +L+
Sbjct: 80  VDTLKKKYKTEKAKLSPSTWRFYNRLDVLI 109

BLAST of Cla97C05G109140 vs. TAIR10
Match: AT3G11100.1 (sequence-specific DNA binding transcription factors)

HSP 1 Score: 75.1 bits (183), Expect = 9.6e-14
Identity = 36/89 (40.45%), Positives = 60/89 (67.42%), Query Frame = 0

Query: 57  KRDEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNK 116
           + D WSE A ++L+EA+  ++V  NR  L+ +DW++VA  V+S ++    PKT  QCKN+
Sbjct: 18  REDWWSEDATATLIEAWGDRYVNLNRGNLRQNDWKEVADAVNS-SHGNGRPKTDVQCKNR 77

Query: 117 IESMKKRYRSESASAASSWPLYHRLHLLL 146
           I+++KK+Y++E A   S+W  + RL  L+
Sbjct: 78  IDTLKKKYKTEKAKPLSNWCFFDRLDFLI 105

BLAST of Cla97C05G109140 vs. TAIR10
Match: AT3G14180.1 (sequence-specific DNA binding transcription factors)

HSP 1 Score: 68.9 bits (167), Expect = 6.9e-12
Identity = 36/96 (37.50%), Positives = 57/96 (59.38%), Query Frame = 0

Query: 59  DEWSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCKNKIE 118
           D WSE A + L++A+  +++  +R  LK   W++VA  VSSR ++ K PKT  QCKN+I+
Sbjct: 82  DCWSEAATAVLIDAWGERYLELSRGNLKQKHWKEVAEIVSSREDYGKIPKTDIQCKNRID 141

Query: 119 SMKKRYRSESASAA-----SSWPLYHRLHLLLRGNA 150
           ++KK+Y+ E    A     S W  + +L  L+   A
Sbjct: 142 TVKKKYKQEKVRIANGGGRSRWVFFDKLDRLIGSTA 177

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity  Description
XP_022988920.1  8.8e-114  87.84     trihelix transcription factor ASIL1-like isoform X1 [Cucurbita maxima]
XP_023526851.1  1.1e-113  88.18     trihelix transcription factor ASIL1 [Cucurbita pepo subsp. pepo]
XP_022957512.1  9.7e-113  87.84     trihelix transcription factor ASIL2 [Cucurbita moschata]
XP_010067722.1  5.7e-73   71.34     PREDICTED: trihelix transcription factor ASIL2 [Eucalyptus grandis] >KCW65902.1 ...
XP_022966869.1  9.8e-73   69.21     uncharacterized protein LOC111466444 [Cucurbita maxima]

TrEMBL
Match Name                      E-value  Identity  Description
tr|A0A059BI69|A0A059BI69_EUCGR  3.8e-73  71.34     Uncharacterized protein OS=Eucalyptus grandis OX=71139 GN=EUGRSUZ_G03225 PE=4 SV...
tr|B9GLZ4|B9GLZ4_POPTR          2.5e-72  70.89     Uncharacterized protein OS=Populus trichocarpa OX=3694 GN=POPTR_001G028400v3 PE=...
tr|A0A1J7IPA2|A0A1J7IPA2_LUPAN  1.4e-70  69.03     Uncharacterized protein OS=Lupinus angustifolius OX=3871 GN=TanjilG_19222 PE=4 S...
tr|A0A061FSU0|A0A061FSU0_THECC  1.1e-69  64.15     Sequence-specific DNA binding transcription factors OS=Theobroma cacao OX=3641 G...
tr|A0A2N9HTC7|A0A2N9HTC7_FAGSY  5.7e-69  68.28     Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS45179 PE=4 SV=1

Swiss-Prot
Match Name             E-value  Identity  Description
sp|Q9LJG8|ASIL2_ARATH  1.2e-10  37.50     Trihelix transcription factor ASIL2 OS=Arabidopsis thaliana OX=3702 GN=ASIL2 PE=...
sp|Q9SYG2|ASIL1_ARATH  2.0e-08  35.05     Trihelix transcription factor ASIL1 OS=Arabidopsis thaliana OX=3702 GN=ASIL1 PE=...

TAIR10
Match Name   E-value  Identity  Description
AT3G54390.1  1.2e-64  53.49     sequence-specific DNA binding transcription factors
AT3G10030.1  6.6e-15  44.09     aspartate/glutamate/uridylate kinase family protein
AT5G05550.2  5.6e-14  40.00     sequence-specific DNA binding transcription factors
AT3G11100.1  9.6e-14  40.45     sequence-specific DNA binding transcription factors
AT3G14180.1  6.9e-12  37.50     sequence-specific DNA binding transcription factors

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
biological_process  GO:0009793      embryo development ending in seed dormancy
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0010431      seed maturation
biological_process  GO:0006351      transcription, DNA-templated
cellular_component  GO:0005575      cellular_component
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003674      molecular_function
molecular_function  GO:0003677      DNA binding
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C05G109140.1  Cla97C05G109140.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term  IPR Description   Source       Source Term       Source Description   Alignment
None      No IPR available  COILS        Coil              Coil                 coord: 294..321
None      No IPR available  PFAM         PF13837           Myb_DNA-bind_4       coord: 58..142; e-value: 2.7E-19; score: 69.3
None      No IPR available  GENE3D       G3DSA:1.10.10.60                       coord: 61..126; e-value: 3.1E-5; score: 25.8
None      No IPR available  MOBIDB_LITE  mobidb-lite       disorder_prediction  coord: 147..266
None      No IPR available  MOBIDB_LITE  mobidb-lite       disorder_prediction  coord: 199..228
None      No IPR available  MOBIDB_LITE  mobidb-lite       disorder_prediction  coord: 150..190
None      No IPR available  PANTHER      PTHR31307:SF7     SUBFAMILY NOT NAMED  coord: 34..349
None      No IPR available  PANTHER      PTHR31307         FAMILY NOT NAMED     coord: 34..349
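
The alignment coordinates above can be mapped back onto the protein sequence to pull out the annotated region. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (not stated on the page); `slice_feature` is a hypothetical helper, and only the first 150 residues of the protein from this record are embedded to keep the block short.

```python
def slice_feature(protein, start, end):
    """Return residues start..end of a protein.

    Assumption: coordinates are 1-based and inclusive, so they map to
    the Python slice [start - 1:end].
    """
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"coordinates {start}..{end} outside 1..{len(protein)}")
    return protein[start - 1:end]


# First 150 residues of the protein sequence given in this record.
PROTEIN_PREFIX = (
    "MEMKGSPSSPPTSHTSPSLLFNHHHHHHHQLPPAAAEDNPSPKKTPVSTGGGGDRLKRDE"
    "WSEGAVSSLLEAYESKWVLRNRAKLKGHDWEDVARHVSSRANFTKSPKTQTQCK"
    "NKIESMKKRYRSESASAASSWPLYHRLHLLLRGNAT"
)

# Pfam PF13837 (Myb_DNA-bind_4) is reported at coord: 58..142.
myb_domain = slice_feature(PROTEIN_PREFIX, 58, 142)
print(len(myb_domain))  # 85
print(myb_domain[:7])   # RDEWSEG
```

Under the 1-based assumption, the Pfam hit covers 85 residues beginning at RDEWSEG, which falls inside the region aligned in the BLAST hits of this record (query positions from 55 onward).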