Cucsa.114700 (gene) Cucumber (Gy14) v1

Name: Cucsa.114700
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Trihelix transcription factor GT-3a-like protein
Location: scaffold00968 : 62780 .. 64838 (-)
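The location string above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the trailing sign gives the strand. The names `parse_location` and `span_length` are hypothetical helpers introduced for illustration only.

```python
import re

# Pattern for strings such as "scaffold00968 : 62780 .. 64838 (-)".
LOCATION = re.compile(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text):
    """Split a location string into (scaffold, start, end, strand)."""
    match = LOCATION.search(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end, strand = match.groups()
    return scaffold, int(start), int(end), strand


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


scaffold, start, end, strand = parse_location("scaffold00968 : 62780 .. 64838 (-)")
print(scaffold, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 2059 positions on the minus strand of scaffold00968.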
The following sequences are available for this feature:

Gene sequence (with intron)

CATTAGAATTGTTCTTCTCCTGCCTCCCCGGTGGTGTCACCCTTCTTCACTCAACCCCACCCCAGACCCCTGCAATACAATCAACCATCACCTCCACCGCGATGGCGGCTACTCCTCACCAACACCAATGGAGCGAGGAGGAGACGAGGGAGTTCATTCGTATTCGAGCCGACCTAGAGAAGGACCTCGCCGCGGTTTCCATCGGAGAAGCTCCGGCGGCGAAGAAGAAAACTTTGTGGGAGATGGCGAGTGTTAGGATGCGAGAGAAAGGGTTTTGGAGGACCGCCGATCAGTGCAAATGCAAGTGGAAGAATCTCCTCAGCCGCTACAAGGTTGGTCCACTTTCTAATTCGGTATTAATTAAGAAAATTTTCGAATCTGTAATTTTAGATTGGTTTAGTTTCTTTCTCCAAGTGGGCTCCTCTGAATTTGGACTCTACTTGCTTTCTTCTATTGATTCTCCTGCCATTTGATGGATGGGGTTTCAATTCGCCATTTCTGATTGCAGATTCTTGAAGTTCAGCTTGAGAAAGAACAGGACAAGAAGTCTTGTTGTCTTTGTTCAAAATAGACTTGTTTTTCACACTTCAAACTTCAAACAGATATATTCTATTTCCATACTTGAACTCGTAACTTTGAGATTATTATTCTCAAACTCAATCAAACATATGTTTGATTTTGATTTGCAAGAAGAAAGGGTGGTGTTAAAGGAGGGGAGGAATTTACTAGACTAGGGCATTAACTCTAAAAACTAACCTTCTAAATTGTATGGCACTTGAAAACTTGGTGAAACAGATTTGGCTTTGATAAGGAATAGCTTTGTAGATTTTTTCCATGGATAAAAGTGTTCTTTAGCTTCCTGTAATTGCCGGTAAGTGGAAGCTCTTAAAAAAGATAGATGGAACTTTAGGATATTACTACATCAAGACAAAATTAAATATCTGTTGATTACATGGAAAAGAACATCGTTCTACTTTATTTCCTAGGTTACATACTTGTCAAAACTTCCATTTCCAGTCGTTTGATCCTCAATCTTCTTGGTTGGTTATTCCCCAGGAATTTAGGCACCTAAGTGATAAGATTAGAGTGTTACTTCTATTGAATCTAGAACGTAGACTGTTCTTAAGTTCTCTCTTTTAAATCTTTCTTGCATAATCTAATGGACGGTCATCTCCATTCTAATGAACAGATCTAATAGAGGAAGCTTCTTATGCACTTGCAAGATGACTCATGCCTCCATAGATAACATTAAACAGTGTTCCTATGTTTCTAATGTTACAACATTTGTGCAGGGGAAGGAGACATCTCATAAGGAGTATGGATGGCAATGCCCATTTTTTGAAGAAATCCATGCAGTTTTTACAGAAAGAGGAAAAGCTATGCACCGACTGCTCCTCGAACCTGAAGCATGTTCTATTTCAACAAAGAAAAGGGGGAGGGAAAGAAGTTTAGAAGAACATTCAGACCTCAAAGAACTCAATGAAGACGAAAATGAGGAGGAGGTGACTTTCACTCAAAGCAACTCACAGAAGAGAAAGGCTGCAAGAAAACTCCCAGCAAAGTCTTTGGGAGCAACTGATTCTAAAAGTTCTAGTAGCTCAACTAGTAATGAAATTCAAGAAATGTTGAAGGGCTTCTTTCAATGGCAGCAGAGGATGGAGATGGAATGGAGGGAAATAGTTGAGAGACATTACAACAACCGTCGAATGTTTGAGCAGGAATGGCGTGAGTCAATGGAGAAGCTTGAGAGGGAGAGGTTAATGGCTGAGCAAGCTTGGAGGGAAAGGGAAGAACAGAGAAAGGAAAGGCAAGATATCCGAGCTGAAGGAATGAATGCCCTCTTAACAACCCTTTTAAACAAGCTCAACCACGAAAATAATTTATGAGATAATGCGACGACAAGGCAGTCCTTCAGAGAAATAGTTAAAATAATATGGTACGGTTGAAGTTGAGGTTTCTTTTTCTTTTCCCATTGGGGTGACATATACATAAATTTATTT
TTCACATTTTTTTCTTGGGTATGATTGTATGTATAAAAAGTCCATATTCTTCAAGGAAT

mRNA sequence

CATTAGAATTGTTCTTCTCCTGCCTCCCCGGTGGTGTCACCCTTCTTCACTCAACCCCACCCCAGACCCCTGCAATACAATCAACCATCACCTCCACCGCGATGGCGGCTACTCCTCACCAACACCAATGGAGCGAGGAGGAGACGAGGGAGTTCATTCGTATTCGAGCCGACCTAGAGAAGGACCTCGCCGCGGTTTCCATCGGAGAAGCTCCGGCGGCGAAGAAGAAAACTTTGTGGGAGATGGCGAGTGTTAGGATGCGAGAGAAAGGGTTTTGGAGGACCGCCGATCAGTGCAAATGCAAGTGGAAGAATCTCCTCAGCCGCTACAAGGGGAAGGAGACATCTCATAAGGAGTATGGATGGCAATGCCCATTTTTTGAAGAAATCCATGCAGTTTTTACAGAAAGAGGAAAAGCTATGCACCGACTGCTCCTCGAACCTGAAGCATGTTCTATTTCAACAAAGAAAAGGGGGAGGGAAAGAAGTTTAGAAGAACATTCAGACCTCAAAGAACTCAATGAAGACGAAAATGAGGAGGAGGTGACTTTCACTCAAAGCAACTCACAGAAGAGAAAGGCTGCAAGAAAACTCCCAGCAAAGTCTTTGGGAGCAACTGATTCTAAAAGTTCTAGTAGCTCAACTAGTAATGAAATTCAAGAAATGTTGAAGGGCTTCTTTCAATGGCAGCAGAGGATGGAGATGGAATGGAGGGAAATAGTTGAGAGACATTACAACAACCGTCGAATGTTTGAGCAGGAATGGCGTGAGTCAATGGAGAAGCTTGAGAGGGAGAGGTTAATGGCTGAGCAAGCTTGGAGGGAAAGGGAAGAACAGAGAAAGGAAAGGCAAGATATCCGAGCTGAAGGAATGAATGCCCTCTTAACAACCCTTTTAAACAAGCTCAACCACGAAAATAATTTATGAGATAATGCGACGACAAGGCAGTCCTTCAGAGAAATAGTTAAAATAATATGGTACGGTTGAAGTTGAGGTTTCTTTTTCTTTTCCCATTGGGGTGACATATACATAAATTTATTTTTCACATTTTTTTCTTGGGTATGATTGTATGTATAAAAAGTCCATATTCTTCAAGGAAT

Coding sequence (CDS)

ATGGCGGCTACTCCTCACCAACACCAATGGAGCGAGGAGGAGACGAGGGAGTTCATTCGTATTCGAGCCGACCTAGAGAAGGACCTCGCCGCGGTTTCCATCGGAGAAGCTCCGGCGGCGAAGAAGAAAACTTTGTGGGAGATGGCGAGTGTTAGGATGCGAGAGAAAGGGTTTTGGAGGACCGCCGATCAGTGCAAATGCAAGTGGAAGAATCTCCTCAGCCGCTACAAGGGGAAGGAGACATCTCATAAGGAGTATGGATGGCAATGCCCATTTTTTGAAGAAATCCATGCAGTTTTTACAGAAAGAGGAAAAGCTATGCACCGACTGCTCCTCGAACCTGAAGCATGTTCTATTTCAACAAAGAAAAGGGGGAGGGAAAGAAGTTTAGAAGAACATTCAGACCTCAAAGAACTCAATGAAGACGAAAATGAGGAGGAGGTGACTTTCACTCAAAGCAACTCACAGAAGAGAAAGGCTGCAAGAAAACTCCCAGCAAAGTCTTTGGGAGCAACTGATTCTAAAAGTTCTAGTAGCTCAACTAGTAATGAAATTCAAGAAATGTTGAAGGGCTTCTTTCAATGGCAGCAGAGGATGGAGATGGAATGGAGGGAAATAGTTGAGAGACATTACAACAACCGTCGAATGTTTGAGCAGGAATGGCGTGAGTCAATGGAGAAGCTTGAGAGGGAGAGGTTAATGGCTGAGCAAGCTTGGAGGGAAAGGGAAGAACAGAGAAAGGAAAGGCAAGATATCCGAGCTGAAGGAATGAATGCCCTCTTAACAACCCTTTTAAACAAGCTCAACCACGAAAATAATTTATGA

Protein sequence

MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWREREEQRKERQDIRAEGMNALLTTLLNKLNHENNL*
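The protein sequence corresponds to a codon-by-codon translation of the coding sequence. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1); the record does not say which table or tool produced the protein. The function name `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used here as input.

```python
from itertools import product

# Standard genetic code (assumed), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 11 codons and last 4 codons of the CDS shown above.
print(translate("ATGGCGGCTACTCCTCACCAACACCAATGGAGC"))  # MAATPHQHQWS
print(translate("AATAATTTATGA"))                        # NNL*
```

Both fragments agree with the start (MAATPHQHQWS) and end (NNL*) of the protein sequence listed above.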
BLAST of Cucsa.114700 vs. Swiss-Prot
Match: TGT3B_ARATH (Trihelix transcription factor GT-3b OS=Arabidopsis thaliana GN=GT-3B PE=1 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 3.3e-43
Identity = 108/267 (40.45%), Positives = 153/267 (57.30%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCK 68
           QWS EET+E I IR +L++             + K LWE+ S +MR+K F R+ +QCKCK
Sbjct: 41  QWSVEETKELIGIRGELDQTFMETK-------RNKLLWEVISNKMRDKSFPRSPEQCKCK 100

Query: 69  WKNLLSRYKGKETSHKEYG-WQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGRE 128
           WKNL++R+KG ET   E    Q PF++++  +FT R + M  L  E E     T    R+
Sbjct: 101 WKNLVTRFKGCETMEAETARQQFPFYDDMQNIFTTRMQRM--LWAESEGGGGGTSGAARK 160

Query: 129 RSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTSNEIQE 188
           R  E  SD     E+EN  E     SN  K    +K  AK        S+SS+++N ++E
Sbjct: 161 R--EYSSD----EEEENVNEELVDVSNDPKILNPKKNIAKK---RKGGSNSSNSNNGVRE 220

Query: 189 MLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWREREEQRK 248
           +L+ F + Q RME EWRE  E     R   E+EWR  ME+LE+ERL  E+ WR+REEQR+
Sbjct: 221 VLEEFMRHQVRMESEWREGWEAREKERAEKEEEWRRKMEELEKERLAMERMWRDREEQRR 280

Query: 249 ERQDIRAEGMNALLTTLLNKLNHENNL 275
            R+++RAE  ++L+  LL KL  + +L
Sbjct: 281 SREEMRAEKRDSLINALLAKLTRDGSL 289

BLAST of Cucsa.114700 vs. Swiss-Prot
Match: TGT3A_ARATH (Trihelix transcription factor GT-3a OS=Arabidopsis thaliana GN=GT-3A PE=1 SV=1)

HSP 1 Score: 165.2 bits (417), Expect = 9.8e-40
Identity = 98/280 (35.00%), Positives = 155/280 (55.36%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCK 68
           QWS EET+E + IR +L++             + K LWE+ + +M +KGF R+A+QCK K
Sbjct: 51  QWSIEETKELLAIREELDQTFMETK-------RNKLLWEVVAAKMADKGFVRSAEQCKSK 110

Query: 69  WKNLLSRYKGKETSHKE-YGWQCPFFEEIHAVFTERGKAMHRLLLEP--EACSISTKKRG 128
           WKNL++RYK  ET+  +    Q PF+ EI ++F  R   M R+L     E  + S +K  
Sbjct: 111 WKNLVTRYKACETTEPDAIRQQFPFYNEIQSIFEAR---MQRMLWSEATEPSTSSKRKHH 170

Query: 129 RERSLEEHSDLKELNEDENEE------------EVTFTQSNSQKRKAARKLPAKSLGATD 188
           +  S +E  ++ E N+D NEE            EV  T +++  RK A+K    + G   
Sbjct: 171 QFSSDDEEEEVDEPNQDINEELLSLVETQKRETEVITTSTSTNPRKRAKKGKGVASG--- 230

Query: 189 SKSSSSSTSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERL 248
             + + +  N ++++L+ F +   +ME EWR+  E     R   E+EWR  M +LE ER 
Sbjct: 231 --TKAETAGNTLKDILEEFMRQTVKMEKEWRDAWEMKEIEREKREKEWRRRMAELEEERA 290

Query: 249 MAEQAWREREEQRKERQDIRAEGMNALLTTLLNKLNHENN 274
             E+ W EREE+R+ R++ RA+  ++L+  LLN+LN ++N
Sbjct: 291 ATERRWMEREEERRLREEARAQKRDSLIDALLNRLNRDHN 315

BLAST of Cucsa.114700 vs. Swiss-Prot
Match: TGT2_ARATH (Trihelix transcription factor GT-2 OS=Arabidopsis thaliana GN=GT-2 PE=2 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 2.4e-09
Identity = 47/163 (28.83%), Positives = 74/163 (45.40%), Query Frame = 1

Query: 2   AATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRT 61
           + +P   +W + E    IRIR +LE +             K  LWE  S  MR  G+ R+
Sbjct: 390 SVSPSSSRWPKTEVEALIRIRKNLEANYQE-------NGTKGPLWEEISAGMRRLGYNRS 449

Query: 62  ADQCKCKWKNLLSRYKGKETSHKEY---GWQCPFFEEIHAVFTERGKA------------ 121
           A +CK KW+N+   +K  + S+K+       CP+F ++ A++ ER K+            
Sbjct: 450 AKRCKEKWENINKYFKKVKESNKKRPLDSKTCPYFHQLEALYNERNKSGAMPLPLPLMVT 509

Query: 122 -MHRLLLEPEA-CSISTKKRGRERSLEEHSDLKELNEDENEEE 148
              +LLL  E      T +R +    E+  +  E  EDE +EE
Sbjct: 510 PQRQLLLSQETQTEFETDQREKVGDKEDEEE-GESEEDEYDEE 544


HSP 2 Score: 55.8 bits (133), Expect = 8.4e-07
Identity = 55/280 (19.64%), Positives = 117/280 (41.79%), Query Frame = 1

Query: 8   HQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKC 67
           ++W   ET   +RIR++++K     ++       K  LWE  S +M E G+ R++ +CK 
Sbjct: 40  NRWPRPETLALLRIRSEMDKAFRDSTL-------KAPLWEEISRKMMELGYKRSSKKCKE 99

Query: 68  KWKNLLSRYKGKETSH--KEYGWQCPFFEEIHAVFT----------ERGKAMHRLLLEPE 127
           K++N+   +K  +     K  G    FFEE+ A  T          +  K+   +   P 
Sbjct: 100 KFENVYKYHKRTKEGRTGKSEGKTYRFFEELEAFETLSSYQPEPESQPAKSSAVITNAPA 159

Query: 128 ACSISTKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAA-------------- 187
             S+         S E+ S   + +   + + +T   +   K+ ++              
Sbjct: 160 TSSLIPWISSSNPSTEKSSSPLKHHHQVSVQPITTNPTFLAKQPSSTTPFPFYSSNNTTT 219

Query: 188 -----------RKLPAKSLGATDSKSSSSSTSNEIQEMLKGFFQWQQRMEMEWREIVERH 247
                        + + +L ++ + SS++S   E    +K   + ++  +  + ++ +  
Sbjct: 220 VSQPPISNDLMNNVSSLNLFSSSTSSSTASDEEEDHHQVKSSRKKRKYWKGLFTKLTKEL 279

Query: 248 YNNRRMFEQEWRESMEKLERERLMAEQAWREREEQRKERQ 251
              +   ++ + E++E  E+ER+  E+AWR +E  R  R+
Sbjct: 280 MEKQEKMQKRFLETLEYREKERISREEAWRVQEIGRINRE 312

BLAST of Cucsa.114700 vs. Swiss-Prot
Match: TGT4_ARATH (Trihelix transcription factor GT-4 OS=Arabidopsis thaliana GN=GT-4 PE=2 SV=1)

HSP 1 Score: 57.8 bits (138), Expect = 2.2e-07
Identity = 32/101 (31.68%), Positives = 51/101 (50.50%), Query Frame = 1

Query: 10  WSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCKW 69
           W+++ETR  I +R +++            +   K LWE  S +MREKGF R+   C  KW
Sbjct: 55  WAQDETRTLISLRREMDNLFNT-------SKSNKHLWEQISKKMREKGFDRSPSMCTDKW 114

Query: 70  KNLLSRYKGKETSHKEY-----GWQCPFFEEIHAVFTERGK 106
           +N+L  +K K   H++        +  ++ EI  +F ER K
Sbjct: 115 RNILKEFK-KAKQHEDKATSGGSTKMSYYNEIEDIFRERKK 147

BLAST of Cucsa.114700 vs. Swiss-Prot
Match: TGT1_ARATH (Trihelix transcription factor GT-1 OS=Arabidopsis thaliana GN=GT-1 PE=1 SV=1)

HSP 1 Score: 54.3 bits (129), Expect = 2.4e-06
Identity = 32/99 (32.32%), Positives = 48/99 (48.48%), Query Frame = 1

Query: 10  WSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCKW 69
           W ++ETR  I  R  ++            +   K LWE  S +MREKGF R+   C  KW
Sbjct: 87  WVQDETRSLIMFRRGMDGLFNT-------SKSNKHLWEQISSKMREKGFDRSPTMCTDKW 146

Query: 70  KNLLSRYKGKETSHKEYG---WQCPFFEEIHAVFTERGK 106
           +NLL  +  K+  H + G    +  +++EI  +  ER K
Sbjct: 147 RNLLKEF--KKAKHHDRGNGSAKMSYYKEIEDILRERSK 176

BLAST of Cucsa.114700 vs. TrEMBL
Match: A0A0A0KV13_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055350 PE=4 SV=1)

HSP 1 Score: 559.7 bits (1441), Expect = 2.0e-156
Identity = 274/274 (100.00%), Positives = 274/274 (100.00%), Query Frame = 1

Query: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60
           MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR
Sbjct: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60

Query: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120
           TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS
Sbjct: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120

Query: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS 180
           TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS
Sbjct: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS 180

Query: 181 TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR 240
           TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR
Sbjct: 181 TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR 240

Query: 241 EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL
Sbjct: 241 EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 274

BLAST of Cucsa.114700 vs. TrEMBL
Match: U5GMR3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s01210g PE=4 SV=1)

HSP 1 Score: 300.4 bits (768), Expect = 2.2e-78
Identity = 160/272 (58.82%), Positives = 200/272 (73.53%), Query Frame = 1

Query: 7   QHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCK 66
           Q QW ++ET+EFI IRA+LEKD            + KTLWE+ SV+MREKG+ RT +QCK
Sbjct: 39  QPQWGQQETKEFIGIRAELEKDFTVTK-------RNKTLWEIVSVKMREKGYRRTPEQCK 98

Query: 67  CKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGR 126
           CKWKNL++RYKGKETS  E G QCPFFEE+HAVFTER K M RLLLE EA S  ++K+ +
Sbjct: 99  CKWKNLVNRYKGKETSDPETGRQCPFFEELHAVFTERAKNMQRLLLESEAGSTQSRKKMK 158

Query: 127 ----ERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTS 186
               +RS +E S+ ++ +ED++EEE    +SNS+KRK  + +  KS  A      SSST 
Sbjct: 159 RTSGDRSSDEFSEEEDEDEDDSEEEKP-VRSNSRKRKVEKIIAEKSPRA------SSSTV 218

Query: 187 NEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWRER 246
             IQEMLK F Q QQ+MEM+WRE++ER  + R+MFEQEWR+SMEKLERERLM EQAWRER
Sbjct: 219 GGIQEMLKEFLQQQQKMEMQWREMMERRSHERQMFEQEWRQSMEKLERERLMIEQAWRER 278

Query: 247 EEQRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           EEQR+ R++ RAE  +ALLTTLLNKL  ENN+
Sbjct: 279 EEQRRIREESRAERRDALLTTLLNKLIRENNI 296

BLAST of Cucsa.114700 vs. TrEMBL
Match: A0A061G7N3_THECC (Homeodomain-like superfamily protein OS=Theobroma cacao GN=TCM_016743 PE=4 SV=1)

HSP 1 Score: 300.1 bits (767), Expect = 2.8e-78
Identity = 162/270 (60.00%), Positives = 196/270 (72.59%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCK 68
           QW  EETRE I IR +LE+D  A       A + KTLWE+ S RMR++G+ RT DQCKCK
Sbjct: 24  QWGPEETRELILIRGELERDFTA-------AKRNKTLWEIVSARMRDRGYIRTPDQCKCK 83

Query: 69  WKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGR-- 128
           WKNLL+RYKGKETS  E G Q PFFEE+HAVFTER K M RLLLE EA S   KKR R  
Sbjct: 84  WKNLLNRYKGKETSDPENGRQFPFFEELHAVFTERAKNMQRLLLESEAGSTQAKKRMRRI 143

Query: 129 --ERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTSNE 188
             +RS +E S+ ++ +EDE+EEE      +S+KRKA R +  KS       SS+SST   
Sbjct: 144 SADRSSDEFSEEEDDDEDESEEERHARSISSRKRKADRVVLDKSPRPNSGTSSTSSTG-- 203

Query: 189 IQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWREREE 248
           +QEML+ FFQ QQRMEM+WRE++ER    R++FEQEWR+SMEKLERERLM EQAWREREE
Sbjct: 204 LQEMLREFFQQQQRMEMQWREMMERRARERQLFEQEWRQSMEKLERERLMVEQAWREREE 263

Query: 249 QRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           QR+ R++ RAE  +ALLTTLLNKL ++NNL
Sbjct: 264 QRRLREESRAERRDALLTTLLNKLINDNNL 284

BLAST of Cucsa.114700 vs. TrEMBL
Match: A0A151RUB7_CAJCA (Zinc finger and SCAN domain-containing protein 29 OS=Cajanus cajan GN=KK1_032304 PE=4 SV=1)

HSP 1 Score: 297.0 bits (759), Expect = 2.4e-77
Identity = 156/274 (56.93%), Positives = 201/274 (73.36%), Query Frame = 1

Query: 5   PHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQ 64
           P Q QWS++ETREFI IRA+LEKD  A       + + KTLWE+ S +MRE+GF R+ +Q
Sbjct: 12  PGQPQWSQQETREFIAIRAELEKDFTA-------SKRNKTLWEVVSSKMRERGFRRSPEQ 71

Query: 65  CKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKR 124
           CKCKWKNL++RYKGKETS  E+G QCPFFEE+HAVFT+R   M RLLLE E  S  TKK 
Sbjct: 72  CKCKWKNLVNRYKGKETSDPEHGRQCPFFEELHAVFTQRAHNMQRLLLESETRSAQTKKG 131

Query: 125 GRERSLEEHSDLKELNEDENE-----EEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSS 184
            +  S++  S+  EL+ED++E     EE   ++SN++KRK  +    KS  A +  +  S
Sbjct: 132 VKRSSVDRSSE--ELSEDDDEVEYDSEEEKPSRSNTRKRKVDKVGMEKSSRANNPSNVVS 191

Query: 185 STSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAW 244
           ++++ IQEMLK FFQ Q RMEM+WRE++ER    R++FEQEWR+SMEKLERERLM EQAW
Sbjct: 192 NSTSSIQEMLKEFFQHQLRMEMQWREMMERRAQERQLFEQEWRQSMEKLERERLMIEQAW 251

Query: 245 REREEQRKERQDIRAEGMNALLTTLLNKLNHENN 274
           REREEQR+ R++ RAE  +ALLTTLLNKL +E+N
Sbjct: 252 REREEQRRMREESRAERRDALLTTLLNKLINESN 276

BLAST of Cucsa.114700 vs. TrEMBL
Match: K7K3Z3_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_01G150500 PE=4 SV=1)

HSP 1 Score: 296.2 bits (757), Expect = 4.1e-77
Identity = 160/274 (58.39%), Positives = 203/274 (74.09%), Query Frame = 1

Query: 5   PHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQ 64
           P Q QWS++ETREFI IRA+LE+D  A       + + KTLWE+ S +MRE+GF R+ +Q
Sbjct: 45  PAQPQWSQQETREFIAIRAELERDFTA-------SKRNKTLWEVVSAKMRERGFRRSPEQ 104

Query: 65  CKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKR 124
           CKCKWKNL++RYKGKETS  E+G QCPFFEE+HAVFT+R   M RLLLE E  S  TKK 
Sbjct: 105 CKCKWKNLVNRYKGKETSDPEHGKQCPFFEELHAVFTQRAHNMQRLLLESETRSAQTKK- 164

Query: 125 GRERSLEEHSDLKELNEDENE-----EEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSS 184
           G +RS  + S  +EL+ED+NE     EE   ++SN++KRK  +    KS  A++  S+S+
Sbjct: 165 GVKRSSGDRSS-EELSEDDNEVEYDSEEEKPSRSNTRKRKVDKVGVEKSSRASNP-SNSA 224

Query: 185 STSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAW 244
           S S  IQEMLK FFQ Q  MEM+WRE++ER  + R++FEQEWR+SMEKLERERLM EQAW
Sbjct: 225 SNSTSIQEMLKEFFQHQLSMEMQWREMMERRAHERQLFEQEWRQSMEKLERERLMIEQAW 284

Query: 245 REREEQRKERQDIRAEGMNALLTTLLNKLNHENN 274
           REREEQR+ R++ RAE  +ALLTTLLNKL +E+N
Sbjct: 285 REREEQRRMREESRAERRDALLTTLLNKLINESN 308

BLAST of Cucsa.114700 vs. TAIR10
Match: AT2G38250.1 (AT2G38250.1 Homeodomain-like superfamily protein)

HSP 1 Score: 176.8 bits (447), Expect = 1.8e-44
Identity = 108/267 (40.45%), Positives = 153/267 (57.30%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCK 68
           QWS EET+E I IR +L++             + K LWE+ S +MR+K F R+ +QCKCK
Sbjct: 41  QWSVEETKELIGIRGELDQTFMETK-------RNKLLWEVISNKMRDKSFPRSPEQCKCK 100

Query: 69  WKNLLSRYKGKETSHKEYG-WQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGRE 128
           WKNL++R+KG ET   E    Q PF++++  +FT R + M  L  E E     T    R+
Sbjct: 101 WKNLVTRFKGCETMEAETARQQFPFYDDMQNIFTTRMQRM--LWAESEGGGGGTSGAARK 160

Query: 129 RSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTSNEIQE 188
           R  E  SD     E+EN  E     SN  K    +K  AK        S+SS+++N ++E
Sbjct: 161 R--EYSSD----EEEENVNEELVDVSNDPKILNPKKNIAKK---RKGGSNSSNSNNGVRE 220

Query: 189 MLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWREREEQRK 248
           +L+ F + Q RME EWRE  E     R   E+EWR  ME+LE+ERL  E+ WR+REEQR+
Sbjct: 221 VLEEFMRHQVRMESEWREGWEAREKERAEKEEEWRRKMEELEKERLAMERMWRDREEQRR 280

Query: 249 ERQDIRAEGMNALLTTLLNKLNHENNL 275
            R+++RAE  ++L+  LL KL  + +L
Sbjct: 281 SREEMRAEKRDSLINALLAKLTRDGSL 289

BLAST of Cucsa.114700 vs. TAIR10
Match: AT5G01380.1 (AT5G01380.1 Homeodomain-like superfamily protein)

HSP 1 Score: 165.2 bits (417), Expect = 5.5e-41
Identity = 98/280 (35.00%), Positives = 155/280 (55.36%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCK 68
           QWS EET+E + IR +L++             + K LWE+ + +M +KGF R+A+QCK K
Sbjct: 51  QWSIEETKELLAIREELDQTFMETK-------RNKLLWEVVAAKMADKGFVRSAEQCKSK 110

Query: 69  WKNLLSRYKGKETSHKE-YGWQCPFFEEIHAVFTERGKAMHRLLLEP--EACSISTKKRG 128
           WKNL++RYK  ET+  +    Q PF+ EI ++F  R   M R+L     E  + S +K  
Sbjct: 111 WKNLVTRYKACETTEPDAIRQQFPFYNEIQSIFEAR---MQRMLWSEATEPSTSSKRKHH 170

Query: 129 RERSLEEHSDLKELNEDENEE------------EVTFTQSNSQKRKAARKLPAKSLGATD 188
           +  S +E  ++ E N+D NEE            EV  T +++  RK A+K    + G   
Sbjct: 171 QFSSDDEEEEVDEPNQDINEELLSLVETQKRETEVITTSTSTNPRKRAKKGKGVASG--- 230

Query: 189 SKSSSSSTSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERL 248
             + + +  N ++++L+ F +   +ME EWR+  E     R   E+EWR  M +LE ER 
Sbjct: 231 --TKAETAGNTLKDILEEFMRQTVKMEKEWRDAWEMKEIEREKREKEWRRRMAELEEERA 290

Query: 249 MAEQAWREREEQRKERQDIRAEGMNALLTTLLNKLNHENN 274
             E+ W EREE+R+ R++ RA+  ++L+  LLN+LN ++N
Sbjct: 291 ATERRWMEREEERRLREEARAQKRDSLIDALLNRLNRDHN 315

BLAST of Cucsa.114700 vs. TAIR10
Match: AT1G76880.1 (AT1G76880.1 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 67.8 bits (164), Expect = 1.2e-11
Identity = 59/259 (22.78%), Positives = 115/259 (44.40%), Query Frame = 1

Query: 8   HQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKC 67
           ++W  +ET   ++IR+D+        I    A+ K  LWE  S +M E G+ R A +CK 
Sbjct: 60  NRWPRQETLALLKIRSDM-------GIAFRDASVKGPLWEEVSRKMAEHGYIRNAKKCKE 119

Query: 68  KWKNLLSRYKGKETSH--KEYGWQCPFFEEIHAVFTERGKAMH----RLLLEPEACSIST 127
           K++N+   +K  +     K  G    FF+++ A+ ++   ++H    +  L P+      
Sbjct: 120 KFENVYKYHKRTKEGRTGKSEGKTYRFFDQLEALESQSTTSLHHHQQQTPLRPQ------ 179

Query: 128 KKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPA------KSLGATDSK 187
           +      +   +S +            T   S+         +P+        L    + 
Sbjct: 180 QNNNNNNNNNNNSSIFSTPPPVTTVMPTLPSSSIPPYTQQINVPSFPNISGDFLSDNSTS 239

Query: 188 SSSSSTSNEIQEMLKGFFQWQQRMEMEWREIVERHY----NNRRMFEQEWRESMEKLERE 247
           SSSS +++   EM  G    +++ + +W+   ER      + +   ++++ E++EK E E
Sbjct: 240 SSSSYSTSSDMEMGGGTATTRKKRKRKWKVFFERLMKQVVDKQEELQRKFLEAVEKREHE 299

Query: 248 RLMAEQAWREREEQRKERQ 251
           RL+ E++WR +E  R  R+
Sbjct: 300 RLVREESWRVQEIARINRE 305


HSP 2 Score: 56.6 bits (135), Expect = 2.8e-08
Identity = 33/107 (30.84%), Positives = 52/107 (48.60%), Query Frame = 1

Query: 2   AATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRT 61
           AA+    +W + E    I++R +L+               K  LWE  S  MR  GF R 
Sbjct: 401 AASASSSRWPKVEIEALIKLRTNLDSKYQE-------NGPKGPLWEEISAGMRRLGFNRN 460

Query: 62  ADQCKCKWKNLLSRYKGKETSHK---EYGWQCPFFEEIHAVFTERGK 106
           + +CK KW+N+   +K  + S+K   E    CP+F ++ A++ ER K
Sbjct: 461 SKRCKEKWENINKYFKKVKESNKKRPEDSKTCPYFHQLDALYRERNK 500

BLAST of Cucsa.114700 vs. TAIR10
Match: AT1G76890.2 (AT1G76890.2 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 64.3 bits (155), Expect = 1.3e-10
Identity = 47/163 (28.83%), Positives = 74/163 (45.40%), Query Frame = 1

Query: 2   AATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRT 61
           + +P   +W + E    IRIR +LE +             K  LWE  S  MR  G+ R+
Sbjct: 390 SVSPSSSRWPKTEVEALIRIRKNLEANYQE-------NGTKGPLWEEISAGMRRLGYNRS 449

Query: 62  ADQCKCKWKNLLSRYKGKETSHKEY---GWQCPFFEEIHAVFTERGKA------------ 121
           A +CK KW+N+   +K  + S+K+       CP+F ++ A++ ER K+            
Sbjct: 450 AKRCKEKWENINKYFKKVKESNKKRPLDSKTCPYFHQLEALYNERNKSGAMPLPLPLMVT 509

Query: 122 -MHRLLLEPEA-CSISTKKRGRERSLEEHSDLKELNEDENEEE 148
              +LLL  E      T +R +    E+  +  E  EDE +EE
Sbjct: 510 PQRQLLLSQETQTEFETDQREKVGDKEDEEE-GESEEDEYDEE 544


HSP 2 Score: 55.8 bits (133), Expect = 4.7e-08
Identity = 55/280 (19.64%), Positives = 117/280 (41.79%), Query Frame = 1

Query: 8   HQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKC 67
           ++W   ET   +RIR++++K     ++       K  LWE  S +M E G+ R++ +CK 
Sbjct: 40  NRWPRPETLALLRIRSEMDKAFRDSTL-------KAPLWEEISRKMMELGYKRSSKKCKE 99

Query: 68  KWKNLLSRYKGKETSH--KEYGWQCPFFEEIHAVFT----------ERGKAMHRLLLEPE 127
           K++N+   +K  +     K  G    FFEE+ A  T          +  K+   +   P 
Sbjct: 100 KFENVYKYHKRTKEGRTGKSEGKTYRFFEELEAFETLSSYQPEPESQPAKSSAVITNAPA 159

Query: 128 ACSISTKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAA-------------- 187
             S+         S E+ S   + +   + + +T   +   K+ ++              
Sbjct: 160 TSSLIPWISSSNPSTEKSSSPLKHHHQVSVQPITTNPTFLAKQPSSTTPFPFYSSNNTTT 219

Query: 188 -----------RKLPAKSLGATDSKSSSSSTSNEIQEMLKGFFQWQQRMEMEWREIVERH 247
                        + + +L ++ + SS++S   E    +K   + ++  +  + ++ +  
Sbjct: 220 VSQPPISNDLMNNVSSLNLFSSSTSSSTASDEEEDHHQVKSSRKKRKYWKGLFTKLTKEL 279

Query: 248 YNNRRMFEQEWRESMEKLERERLMAEQAWREREEQRKERQ 251
              +   ++ + E++E  E+ER+  E+AWR +E  R  R+
Sbjct: 280 MEKQEKMQKRFLETLEYREKERISREEAWRVQEIGRINRE 312

BLAST of Cucsa.114700 vs. TAIR10
Match: AT3G10000.1 (AT3G10000.1 Homeodomain-like superfamily protein)

HSP 1 Score: 59.3 bits (142), Expect = 4.3e-09
Identity = 57/264 (21.59%), Positives = 116/264 (43.94%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLW-EMASVRMREKGFWRTADQCKC 68
           +W  +ET   + +R+ L+            A +K  LW E++ +   E G+ R+  +C+ 
Sbjct: 88  RWPRQETLMLLEVRSRLDHKFKE-------ANQKGPLWDEVSRIMSEEHGYTRSGKKCRE 147

Query: 69  KWKNLLSRYKGKE---TSHKEYGWQCPFFEEIHAVFTERGKAM----HRLLLEPEACSIS 128
           K++NL   YK  +   +  ++ G    FF ++ A++ E   ++    +   +   A   +
Sbjct: 148 KFENLYKYYKKTKEGKSGRRQDGKNYRFFRQLEAIYGESKDSVSCYNNTQFIMTNALHSN 207

Query: 129 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS 188
            +       +  H +   L  + N +  + + SN+    +   L + S G   +K     
Sbjct: 208 FRASNIHNIVPHHQN--PLMTNTNTQSQSLSISNNFNSSSDLDLTSSSEGNETTKREGMH 267

Query: 189 TSNEIQEMLKGFFQWQQRMEMEWRE----IVERHYNNRRMFEQEWRESMEKLERERLMAE 248
              +I+E +    +     +  W E    IVE   + R + E+EWR    ++E ER+  E
Sbjct: 268 WKEKIKEFIGVHMERLIEKQDFWLEKLMKIVEDKEHQRMLREEEWR----RIEAERIDKE 327

Query: 249 QAWREREEQRKERQDIRAEGMNAL 261
           +++  +E +R E +D+    +NAL
Sbjct: 328 RSFWTKERERIEARDVAV--INAL 336

BLAST of Cucsa.114700 vs. NCBI nr
Match: gi|449462507|ref|XP_004148982.1| (PREDICTED: trihelix transcription factor GT-3b-like [Cucumis sativus])

HSP 1 Score: 559.7 bits (1441), Expect = 2.9e-156
Identity = 274/274 (100.00%), Positives = 274/274 (100.00%), Query Frame = 1

Query: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60
           MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR
Sbjct: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60

Query: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120
           TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS
Sbjct: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120

Query: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS 180
           TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS
Sbjct: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS 180

Query: 181 TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR 240
           TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR
Sbjct: 181 TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR 240

Query: 241 EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL
Sbjct: 241 EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 274

BLAST of Cucsa.114700 vs. NCBI nr
Match: gi|659102022|ref|XP_008451911.1| (PREDICTED: trihelix transcription factor GT-3b-like [Cucumis melo])

HSP 1 Score: 534.6 bits (1376), Expect = 9.9e-149
Identity = 263/274 (95.99%), Positives = 264/274 (96.35%), Query Frame = 1

Query: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60
           MAAT HQHQWSEEETREFIRIRADLEKDL AVS GEAPAAKKKTLWEMASVRMREKGFWR
Sbjct: 6   MAATLHQHQWSEEETREFIRIRADLEKDLTAVSTGEAPAAKKKTLWEMASVRMREKGFWR 65

Query: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120
           TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS
Sbjct: 66  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 125

Query: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSS 180
           TKKRGRERSLEEHSDLKELNEDE EEEVT TQ NSQKRKAARKLPAKSLGATDSKSSSSS
Sbjct: 126 TKKRGRERSLEEHSDLKELNEDETEEEVTLTQRNSQKRKAARKLPAKSLGATDSKSSSSS 185

Query: 181 TSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWR 240
            S EIQEMLKGF QWQQRMEMEWREIVERHYNNRRM EQEWRESMEKLERERLMAEQAWR
Sbjct: 186 ISYEIQEMLKGFLQWQQRMEMEWREIVERHYNNRRMLEQEWRESMEKLERERLMAEQAWR 245

Query: 241 EREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           EREEQRKE+QDIRAEGMNALLTTLLNKLNHENNL
Sbjct: 246 EREEQRKEKQDIRAEGMNALLTTLLNKLNHENNL 279

BLAST of Cucsa.114700 vs. NCBI nr
Match: gi|566146525|ref|XP_006368276.1| (hypothetical protein POPTR_0001s01210g [Populus trichocarpa])

HSP 1 Score: 300.4 bits (768), Expect = 3.1e-78
Identity = 160/272 (58.82%), Positives = 200/272 (73.53%), Query Frame = 1

Query: 7   QHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCK 66
           Q QW ++ET+EFI IRA+LEKD            + KTLWE+ SV+MREKG+ RT +QCK
Sbjct: 39  QPQWGQQETKEFIGIRAELEKDFTVTK-------RNKTLWEIVSVKMREKGYRRTPEQCK 98

Query: 67  CKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGR 126
           CKWKNL++RYKGKETS  E G QCPFFEE+HAVFTER K M RLLLE EA S  ++K+ +
Sbjct: 99  CKWKNLVNRYKGKETSDPETGRQCPFFEELHAVFTERAKNMQRLLLESEAGSTQSRKKMK 158

Query: 127 ----ERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTS 186
               +RS +E S+ ++ +ED++EEE    +SNS+KRK  + +  KS  A      SSST 
Sbjct: 159 RTSGDRSSDEFSEEEDEDEDDSEEEKP-VRSNSRKRKVEKIIAEKSPRA------SSSTV 218

Query: 187 NEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWRER 246
             IQEMLK F Q QQ+MEM+WRE++ER  + R+MFEQEWR+SMEKLERERLM EQAWRER
Sbjct: 219 GGIQEMLKEFLQQQQKMEMQWREMMERRSHERQMFEQEWRQSMEKLERERLMIEQAWRER 278

Query: 247 EEQRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           EEQR+ R++ RAE  +ALLTTLLNKL  ENN+
Sbjct: 279 EEQRRIREESRAERRDALLTTLLNKLIRENNI 296

BLAST of Cucsa.114700 vs. NCBI nr
Match: gi|590680697|ref|XP_007040932.1| (Homeodomain-like superfamily protein [Theobroma cacao])

HSP 1 Score: 300.1 bits (767), Expect = 4.1e-78
Identity = 162/270 (60.00%), Positives = 196/270 (72.59%), Query Frame = 1

Query: 9   QWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCKCK 68
           QW  EETRE I IR +LE+D  A       A + KTLWE+ S RMR++G+ RT DQCKCK
Sbjct: 24  QWGPEETRELILIRGELERDFTA-------AKRNKTLWEIVSARMRDRGYIRTPDQCKCK 83

Query: 69  WKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGR-- 128
           WKNLL+RYKGKETS  E G Q PFFEE+HAVFTER K M RLLLE EA S   KKR R  
Sbjct: 84  WKNLLNRYKGKETSDPENGRQFPFFEELHAVFTERAKNMQRLLLESEAGSTQAKKRMRRI 143

Query: 129 --ERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTSNE 188
             +RS +E S+ ++ +EDE+EEE      +S+KRKA R +  KS       SS+SST   
Sbjct: 144 SADRSSDEFSEEEDDDEDESEEERHARSISSRKRKADRVVLDKSPRPNSGTSSTSSTG-- 203

Query: 189 IQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWREREE 248
           +QEML+ FFQ QQRMEM+WRE++ER    R++FEQEWR+SMEKLERERLM EQAWREREE
Sbjct: 204 LQEMLREFFQQQQRMEMQWREMMERRARERQLFEQEWRQSMEKLERERLMVEQAWREREE 263

Query: 249 QRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           QR+ R++ RAE  +ALLTTLLNKL ++NNL
Sbjct: 264 QRRLREESRAERRDALLTTLLNKLINDNNL 284

BLAST of Cucsa.114700 vs. NCBI nr
Match: gi|743808955|ref|XP_011018396.1| (PREDICTED: trihelix transcription factor GT-3b-like [Populus euphratica])

HSP 1 Score: 298.5 bits (763), Expect = 1.2e-77
Identity = 159/272 (58.46%), Positives = 199/272 (73.16%), Query Frame = 1

Query: 7   QHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWRTADQCK 66
           Q QW ++ET+EFI IRA+LEKD            + KTLWE+ S +MREKG+ RT +QCK
Sbjct: 39  QPQWGQQETKEFIGIRAELEKDFTVTK-------RNKTLWEIVSAKMREKGYRRTPEQCK 98

Query: 67  CKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSISTKKRGR 126
           CKWKNL++RYKGKETS  E G QCPFFEE+HAVFTER K M RLLLE EA S  ++K+ +
Sbjct: 99  CKWKNLVNRYKGKETSDPETGRQCPFFEELHAVFTERAKNMQRLLLESEAGSTQSRKKMK 158

Query: 127 ----ERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKAARKLPAKSLGATDSKSSSSSTS 186
               +RS +E S+ ++ +ED++EEE    +SNS+KRK  + +  KS  A      SSST 
Sbjct: 159 RTSGDRSSDEFSEEEDEDEDDSEEEKP-VRSNSRKRKVEKIIAEKSPRA------SSSTV 218

Query: 187 NEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAWRER 246
             IQEMLK F Q QQ+MEM+WRE++ER  + R+MFEQEWR+SMEKLERERLM EQAWRER
Sbjct: 219 GGIQEMLKEFLQQQQKMEMQWREMMERRSHERQMFEQEWRQSMEKLERERLMIEQAWRER 278

Query: 247 EEQRKERQDIRAEGMNALLTTLLNKLNHENNL 275
           EEQR+ R++ RAE  +ALLTTLLNKL  ENN+
Sbjct: 279 EEQRRIREESRAERRDALLTTLLNKLIRENNV 296

The following BLAST results are available for this feature:

BLAST of Cucsa.114700 vs. Swiss-Prot
Match Name   E-value  Identity  Description
TGT3B_ARATH  3.3e-43  40.45     Trihelix transcription factor GT-3b OS=Arabidopsis thaliana GN=GT-3B PE=1 SV=1
TGT3A_ARATH  9.8e-40  35.00     Trihelix transcription factor GT-3a OS=Arabidopsis thaliana GN=GT-3A PE=1 SV=1
TGT2_ARATH   2.4e-09  28.83     Trihelix transcription factor GT-2 OS=Arabidopsis thaliana GN=GT-2 PE=2 SV=1
TGT4_ARATH   2.2e-07  31.68     Trihelix transcription factor GT-4 OS=Arabidopsis thaliana GN=GT-4 PE=2 SV=1
TGT1_ARATH   2.4e-06  32.32     Trihelix transcription factor GT-1 OS=Arabidopsis thaliana GN=GT-1 PE=1 SV=1

BLAST of Cucsa.114700 vs. TrEMBL
Match Name        E-value   Identity  Description
A0A0A0KV13_CUCSA  2.0e-156  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055350 PE=4 SV=1
U5GMR3_POPTR      2.2e-78   58.82     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s01210g PE=4 SV=1
A0A061G7N3_THECC  2.8e-78   60.00     Homeodomain-like superfamily protein OS=Theobroma cacao GN=TCM_016743 PE=4 SV=1
A0A151RUB7_CAJCA  2.4e-77   56.93     Zinc finger and SCAN domain-containing protein 29 OS=Cajanus cajan GN=KK1_032304...
K7K3Z3_SOYBN      4.1e-77   58.39     Uncharacterized protein OS=Glycine max GN=GLYMA_01G150500 PE=4 SV=1

BLAST of Cucsa.114700 vs. TAIR10
Match Name   E-value  Identity  Description
AT2G38250.1  1.8e-44  40.45     Homeodomain-like superfamily protein
AT5G01380.1  5.5e-41  35.00     Homeodomain-like superfamily protein
AT1G76880.1  1.2e-11  22.78     Duplicated homeodomain-like superfamily protein
AT1G76890.2  1.3e-10  28.83     Duplicated homeodomain-like superfamily protein
AT3G10000.1  4.3e-09  21.59     Homeodomain-like superfamily protein

BLAST of Cucsa.114700 vs. NCBI nr
Match Name                        E-value   Identity  Description
gi|449462507|ref|XP_004148982.1|  2.9e-156  100.00    PREDICTED: trihelix transcription factor GT-3b-like [Cucumis sativus]
gi|659102022|ref|XP_008451911.1|  9.9e-149  95.99     PREDICTED: trihelix transcription factor GT-3b-like [Cucumis melo]
gi|566146525|ref|XP_006368276.1|  3.1e-78   58.82     hypothetical protein POPTR_0001s01210g [Populus trichocarpa]
gi|590680697|ref|XP_007040932.1|  4.1e-78   60.00     Homeodomain-like superfamily protein [Theobroma cacao]
gi|743808955|ref|XP_011018396.1|  1.2e-77   58.46     PREDICTED: trihelix transcription factor GT-3b-like [Populus euphratica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR017877   Myb-like_dom
IPR027775   C2H2- zinc finger protein family
Vocabulary: Molecular Function
Term        Definition
GO:0003700  transcription factor activity, sequence-specific DNA binding
Vocabulary: Biological Process
Term        Definition
GO:0006351  transcription, DNA-templated
Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006355 regulation of transcription, DNA-templated
biological_process GO:0006351 transcription, DNA-templated
biological_process GO:0008150 biological_process
cellular_component GO:0005634 nucleus
cellular_component GO:0005667 transcription factor complex
cellular_component GO:0005575 cellular_component
molecular_function GO:0043565 sequence-specific DNA binding
molecular_function GO:0003700 transcription factor activity, sequence-specific DNA binding
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.114700.1  Cucsa.114700.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description                   Source   Source Term      Source Description                             Alignment
IPR017877  Myb-like domain                   PROFILE  PS50090          MYB_LIKE                                       coord: 10..73, score: 6
IPR027775  C2H2- zinc finger protein family  PANTHER  PTHR10032        ZINC FINGER PROTEIN WITH KRAB AND SCAN DOMAINS  coord: 9..250, score: 1.1
None       No IPR available                  unknown  Coil             Coil                                           coord: 218..245, score: -; coord: 126..146
None       No IPR available                  PANTHER  PTHR10032:SF204  PROTEIN Y5F2A.4                                coord: 9..250, score: 1.1
None       No IPR available                  PFAM     PF13837          Myb_DNA-bind_4                                 coord: 8..98, score: 1.6