Cp4.1LG01g01360 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g01360
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Trihelix transcription factor GT-3a-like protein
Location: Cp4.1LG01 : 3024161 .. 3028506 (+)
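
The location string can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for this kind of record, but not stated on this page); `parse_location` and `span_length` are hypothetical helper names.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome / linkage group name
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'Cp4.1LG01 : 3024161 .. 3028506 (+)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(m.group(1), int(m.group(2)), int(m.group(3)), m.group(4))


def span_length(loc: Location) -> int:
    """Length of the genomic span under the inclusive-coordinate assumption."""
    return loc.end - loc.start + 1


loc = parse_location("Cp4.1LG01 : 3024161 .. 3028506 (+)")
print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

Under that assumption the gene spans 4346 bases on the plus strand.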
The following sequences are available for this feature:

Gene sequence (with intron)

AAAGGAAGAAAAACAATGTCTTATCAAGTGATCAAAAGTCATAGGTGGACGAGTAAATTAAAAGATCGCCAAATTTCAACGATTCCTTTCCTTCTTTAAGAAGCAAGCCGGAAAAGGCCGATAATGGAATGAGGGCCTTCGTCAATCACAAATTTCCACCATTAAATTTCTCATCGGAGTTCGCCGATGTCACCGCCGTCCACTGTTTTCACTGAAGCCGAAGCTAGAGCCCTGCAATAGTATCAACCATGACCTCGAGCGCGATGGCGGCGACGGCTCAGCAACACCAGTGGAGCGAGGAGGAGACGAGGGAGTTCATCCGGATTCGAGCCGACCTAGAGAGGGACCTGACGGCGGTTTCCACCGGAGAAGGTGCGGCGGTGAAGAAGAAGACACTGTGGGACATGGCGAGTGCAAGGATGAGAGAGCGAGGGTTTTGGAGGACCGCCTATCAGTGCAAATGCAAGTGGAAGAATCTCCTCAGTCGCTACAAGGTTCGTCCAGTTTCCATTCCGTATTCATTTGAGGATTTTTTTTTTTTTAATCAGTCTGTTCTCCGTTATGTTTCTTTCAAGTGGGCTCCTCTGAATCTGGACTGTTCTTGCTTTGATATTTTGATTCTCATGGATTTCGATTGGGTTCAATTTTGCCATTTATGGTTGCAAATTCTTAAAGCTAAATCTGAGAAAGGTAGTAAAATGCTGATAATGTCTCGCAACATGCGGACGAGAAAATGACAAGATGTCATAAAGTCTTTATTCAAAATAGACACGTTTTGACCTTTCAAAACTCTAGCTGCCTGTAATTGCCTGTAAGAGGAGGTTTGGAGAAAATGGTTAGAATTGAAGGGTATTGCTAAGAGGTAGCTACCATTGTTTTTGGAGATTTGTCTCGCAAAATGGCTTTCTTCATTTCTAGGAGTACATATCAACTCAGACACGGTGGCAAAATTAGATATCTGTTGACTACTTGTACAAGATACATCATTCTACTTAATTTCCTAGGTGTATGTTTTGTCAAAGTGTGGTAATGCTTCCATTTCCAGTCTTGTTTCTGTCCAATCTTCTTGTCGAAGTGATAAGAGATTCTTGACTGTGCCTCCTCTAGAATCTAGAAAATCTGGCTGTTCTTAAGTTTTCTCTTATAGATCTTTCTTACACAATCTCTTAGTTACCTGTGAATGGTCAGCCTTTCAATTCTAATCTCTTAGTTACCCCAATGGATGAAAAGCCGAAGTTGATTCCAAAAGGCTTTCTGGTTAACATTTAATGGAATTTGGTTCTCAGATTTCATTCTGAACAAAGACCTGTCCACGAACCATAGCCAAAGACTGCATTGCGCTGCATTATCTTATCACCTTACCATAATGAATGTGTGGCCAGCAAGAAGTTGAAGGAGATAGCCACAGAATTATGGAAAAAACTAGAAGGCAGTGATACCTTCTCCTATATTTCTGATAGGCTGCATCATAGAGGGGATAAGCAGACTTCAGGCTTAGGTTTATTTAACTTGATATTGTTACAAGTGTTAAGTTACCAAGAGCTACATGCACCAGACAAAGAACTTGACTTTGTTTTGACACTTCAGCTTCGTGAAGATGCGGGATTCACATGATGGGAAGAGGGGTAGAAGTTAGAATTAGTAAGAAGGTGATTGAAGAGGGATTCCACATCGGTTGGAGAGGAGAACAAAGCATTCTTTATAAAACCTTGAGGGGAAGCCCGAAAGGGAAATTCCAAAGAGGACTATCTGCTAGCGGTGGCTTTGGCCGTTACAAATGGTATCAGAGTCAGACACCAGGCGATTTGCCAACGAGGAGGTTGAGCCCCGAAGGGGGGTGGGCATGAGGCGGTGTGCCAGTAAGGACACTGGGCCTCGAAGGGGGTAGATTGGGGAGTCTCACATTGTTTGAAGAAGGGAATGAGTGCCAGCAAGGATACTGAGTCTCGAAGGGGGGTGGATTGTCAGATCCCACATTGGTTGAGGAGGATAACAAAGCA
TTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAAACGTGTTTTAAAAACTTTGAGGGGAAGCTCGAAAGGGAAAACCCAAAAAGGACAATATCTGCTAGCAATGGGCTTGGACTGTTACAAATGGTATCAGAGCCAGAAATCGGGCGGTGTGCCATTGAGGACGTTGAGCTTCGAAGGAGTGTGGATTGTGAGATCCCACATCGAATGGAGAGGAGAACGAAGCGTTCTTTATAAGGGTGTGGAAACCTCTCCCTAGCAGAAGCGTTTTAAAAACCTTGAGGGGAAGCTCAGAAGGAAAAATCCAAAAATGACAATATCTGCTAGCAGTGAGCTTGAGTCGTTACATTAACATGTGGAAAAAAGTTCAAAGAAAACAATATCTGCTAGAAGTGAGCTTGGGCCGTTACATTAACATGTGGAAAAAAGCTCAAAGAGGACAATATCTGCTAGCAGTGATCTTGGGCCGTTACATTAACACGTGAAAAAAAACCAGATATATGTCATTCTAACTCTTGTTTTATCCATGAGATCAATGAGCTCCTATGATAAAAACAAGCGCAGTTAATTCAGGAAGTTTCTCGTTCACTAAAACCCATCTTAACTGAGCTTCCTGCTCTCTCTTTAAAGCACCTTCTTAAACTAAAGTTTCCAAACAAGAACTTGCAGTGCCATTACAGTTCAAACATCGAGTGTGGTCTACATTGCGAGTGAAAAATTAACCTGAACCCTTTCTTTATGTGGGAGGTTGGTCATAAGTGGCGAAACATTTAATTGCTTCCCATATATGCAGTAATTCTCGTTCATTAGTATTTGTAGAAAAGACATCTTTTACTTTTAGATCCGTTTCATGTAATTCCGACTAAACTCTTCATAGAATTGTAGTAATAACATAGCCTAGAATTAGTTTGACTTGTCAAGTCCAATTTTGGCCCCAAAACTCAATGAGCTAAAGATACCCCTAGAAGTTGAGGCAGTTGGCATTGTTTATGTTGTTTAATCACCGTCATATGCTTCTAATTTTATAATGGTTGTACAGGGGAAGGAGACAACCCATAAAGAGTTTGGCTGGCAATGCCCATTTTTTGAGGAAATCCGTGCAGTATTTGCCGTAAGGGAGAAAGCTATGCACCGATTGCTCCTTGAACCTGAAGCAGGTTCTAGTGCAACAAAGAAAAGAGGGAGGGAGAGAAGTTTAGAAGAATATTCAGATCTCAAAGAACTTGATGAAGAAGAAAGTGAGGAGGAGATCCCAACTCAAAGCAACTCACAGAAGGGAAAGGCTATAGGAACTCAGTACCCAGCAAAGTCTTTAAGAACGGCTGGTTCTAAACGTTCAAGTAGCTCGGTTAGTAATGAAATCCTAGAGTTGTTGAAGGGCTTCTTTCAGTGGCAGCAGAGGATGGAAATGGAATGGAGGGAAATACTTGAGAGACATTACAACAACCGACGAATGTTGGAGCAGGAATGGCGTGACTCGATGGAGAAGCTCGAGAGGGAGAGGTTAATGACTGAGAAAGCTTGGAGGGAAAGAGAAGAACAGAGAAAGAAAAGACAAGATATCCGTGCTCAAGGAATGGATGCTCTCTTAACAGACCTTTTAAACAAGCTCAACCGCGAAAATAATTTATGAGACGATGAGATAAAAAACGGTACTGATTCCCGAGAATGAGATATAGTTGAAGCAAAAAAGGAAGAATGTAGTTACAGCAACACTATTTTTGAAGGCATGGTAGATATAAAAAATTCAGATTTGTGGCTGATAGTACATATCACATATGCAATTTTTCACAAGTGTCTAGCCTGATTGTATGCATACAGAGATTGAACAGTTTTCTCCCAGGAATTTATAGCTCTTGGAAGAAAAAAGAATACATTGTTATGTACATTATCAGTATTGAAGAAAACCTATGAAGTTTAGCTTGAATTAAGAATCTTATTACTATGATCACCTTCCTATTCTCTTCTGTGATGATTAGCCTATTCATTTGTATCATTTTA
TGCTGGAAACAAGGCCATTTCTTTCAAGGATTCCATATTTTCTAAGGCACAGAGAAAATTAGACTGGGAATGTGGAATAAGCATGCTAGAGTTTGGTTTATTTTACAAGTGACGGAACAAATCATCATTCTTTCTAGTACATTTTAAGAACAATAAATATGAAGACAATACCAGCAACTTTCATTGATCTGAACATAAGAGAACTTCGAAATGCCTTCAATATGTAGTGGCACACATACTTCAACGTGCAAGCGACGATTCAGCCCGATTCTGATGGTGGCTTTTCCTTTGGATGTTTGGAGGCATAATGATCGCCCAATTGATTATGATTTGCTAGTTGTACCTG

mRNA sequence

AAAGGAAGAAAAACAATGTCTTATCAAGTGATCAAAAGTCATAGGTGGACGAGTAAATTAAAAGATCGCCAAATTTCAACGATTCCTTTCCTTCTTTAAGAAGCAAGCCGGAAAAGGCCGATAATGGAATGAGGGCCTTCGTCAATCACAAATTTCCACCATTAAATTTCTCATCGGAGTTCGCCGATGTCACCGCCGTCCACTGTTTTCACTGAAGCCGAAGCTAGAGCCCTGCAATAGTATCAACCATGACCTCGAGCGCGATGGCGGCGACGGCTCAGCAACACCAGTGGAGCGAGGAGGAGACGAGGGAGTTCATCCGGATTCGAGCCGACCTAGAGAGGGACCTGACGGCGGTTTCCACCGGAGAAGGTGCGGCGGTGAAGAAGAAGACACTGTGGGACATGGCGAGTGCAAGGATGAGAGAGCGAGGGTTTTGGAGGACCGCCTATCAGTGCAAATGCAAGTGGAAGAATCTCCTCAGTCGCTACAAGGGGAAGGAGACAACCCATAAAGAGTTTGGCTGGCAATGCCCATTTTTTGAGGAAATCCGTGCAGTATTTGCCGTAAGGGAGAAAGCTATGCACCGATTGCTCCTTGAACCTGAAGCAGGTTCTAGTGCAACAAAGAAAAGAGGGAGGGAGAGAAGTTTAGAAGAATATTCAGATCTCAAAGAACTTGATGAAGAAGAAAGTGAGGAGGAGATCCCAACTCAAAGCAACTCACAGAAGGGAAAGGCTATAGGAACTCAGTACCCAGCAAAGTCTTTAAGAACGGCTGGTTCTAAACGTTCAAGTAGCTCGGTTAGTAATGAAATCCTAGAGTTGTTGAAGGGCTTCTTTCAGTGGCAGCAGAGGATGGAAATGGAATGGAGGGAAATACTTGAGAGACATTACAACAACCGACGAATGTTGGAGCAGGAATGGCGTGACTCGATGGAGAAGCTCGAGAGGGAGAGGTTAATGACTGAGAAAGCTTGGAGGGAAAGAGAAGAACAGAGAAAGAAAAGACAAGATATCCGTGCTCAAGGAATGGATGCTCTCTTAACAGACCTTTTAAACAAGCTCAACCGCGAAAATAATTTATGAGACGATGAGATAAAAAACGGTACTGATTCCCGAGAATGAGATATAGTTGAAGCAAAAAAGGAAGAATGTAGTTACAGCAACACTATTTTTGAAGGCATGGTAGATATAAAAAATTCAGATTTGTGGCTGATAGTACATATCACATATGCAATTTTTCACAAGTGTCTAGCCTGATTGTATGCATACAGAGATTGAACAGTTTTCTCCCAGGAATTTATAGCTCTTGGAAGAAAAAAGAATACATTGTTATGTACATTATCAGTATTGAAGAAAACCTATGAAGTTTAGCTTGAATTAAGAATCTTATTACTATGATCACCTTCCTATTCTCTTCTGTGATGATTAGCCTATTCATTTGTATCATTTTATGCTGGAAACAAGGCCATTTCTTTCAAGGATTCCATATTTTCTAAGGCACAGAGAAAATTAGACTGGGAATGTGGAATAAGCATGCTAGAGTTTGGTTTATTTTACAAGTGACGGAACAAATCATCATTCTTTCTAGTACATTTTAAGAACAATAAATATGAAGACAATACCAGCAACTTTCATTGATCTGAACATAAGAGAACTTCGAAATGCCTTCAATATGTAGTGGCACACATACTTCAACGTGCAAGCGACGATTCAGCCCGATTCTGATGGTGGCTTTTCCTTTGGATGTTTGGAGGCATAATGATCGCCCAATTGATTATGATTTGCTAGTTGTACCTG
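
The mRNA sequence appears to be the gene sequence with intronic bases removed. The sketch below shows one way to locate the intron by comparing the two strings. It assumes exactly one intron (not stated on this page) and prefers a canonical GT...AG boundary when several cut points reproduce the mRNA; `find_single_intron` is a hypothetical helper, and the toy sequences stand in for the full sequences above.

```python
from typing import Optional, Tuple


def find_single_intron(gene: str, mrna: str) -> Optional[Tuple[int, int]]:
    """Return the intron as a 0-based half-open interval (start, end) in gene.

    Assumes the mRNA equals the gene with one contiguous block removed.
    Returns None if no single deletion reproduces the mRNA.
    """
    intron_len = len(gene) - len(mrna)
    if intron_len <= 0:
        return None
    # Every cut point p for which removing gene[p:p+intron_len] gives the mRNA.
    candidates = [
        p for p in range(len(mrna) + 1)
        if gene[:p] + gene[p + intron_len:] == mrna
    ]
    if not candidates:
        return None
    # Repeated bases at the junction make the cut point ambiguous;
    # prefer the candidate whose intron starts with GT and ends with AG.
    for p in candidates:
        intron = gene[p:p + intron_len]
        if intron.startswith("GT") and intron.endswith("AG"):
            return (p, p + intron_len)
    return (candidates[0], candidates[0] + intron_len)


# Toy example: exon "ATGCAAG", intron "GTTCGTACAG", exon "GGGAAGG".
toy_gene = "ATGCAAG" + "GTTCGTACAG" + "GGGAAGG"
toy_mrna = "ATGCAAG" + "GGGAAGG"
start, end = find_single_intron(toy_gene, toy_mrna)
print(start, end, toy_gene[start:end])
```

The comparison is quadratic in sequence length, which is acceptable for a gene of a few kilobases; a spliced aligner would be the usual tool for anything larger or for multi-intron genes.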

Coding sequence (CDS)

ATGACCTCGAGCGCGATGGCGGCGACGGCTCAGCAACACCAGTGGAGCGAGGAGGAGACGAGGGAGTTCATCCGGATTCGAGCCGACCTAGAGAGGGACCTGACGGCGGTTTCCACCGGAGAAGGTGCGGCGGTGAAGAAGAAGACACTGTGGGACATGGCGAGTGCAAGGATGAGAGAGCGAGGGTTTTGGAGGACCGCCTATCAGTGCAAATGCAAGTGGAAGAATCTCCTCAGTCGCTACAAGGGGAAGGAGACAACCCATAAAGAGTTTGGCTGGCAATGCCCATTTTTTGAGGAAATCCGTGCAGTATTTGCCGTAAGGGAGAAAGCTATGCACCGATTGCTCCTTGAACCTGAAGCAGGTTCTAGTGCAACAAAGAAAAGAGGGAGGGAGAGAAGTTTAGAAGAATATTCAGATCTCAAAGAACTTGATGAAGAAGAAAGTGAGGAGGAGATCCCAACTCAAAGCAACTCACAGAAGGGAAAGGCTATAGGAACTCAGTACCCAGCAAAGTCTTTAAGAACGGCTGGTTCTAAACGTTCAAGTAGCTCGGTTAGTAATGAAATCCTAGAGTTGTTGAAGGGCTTCTTTCAGTGGCAGCAGAGGATGGAAATGGAATGGAGGGAAATACTTGAGAGACATTACAACAACCGACGAATGTTGGAGCAGGAATGGCGTGACTCGATGGAGAAGCTCGAGAGGGAGAGGTTAATGACTGAGAAAGCTTGGAGGGAAAGAGAAGAACAGAGAAAGAAAAGACAAGATATCCGTGCTCAAGGAATGGATGCTCTCTTAACAGACCTTTTAAACAAGCTCAACCGCGAAAATAATTTATGA

Protein sequence

MTSSAMAATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRGRERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWREREEQRKKRQDIRAQGMDALLTDLLNKLNRENNL
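
The protein sequence can be reproduced from the coding sequence by translation. The sketch below assumes the standard genetic code, which is expected for a nuclear plant gene but is not stated on this page; `translate` is a hypothetical helper, and the two test fragments are the first and last codons of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Codons with unexpected characters are reported as 'X'; trailing bases
    that do not form a full codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last seven codons of the CDS above.
print(translate("ATGACCTCGAGCGCGATGGCG"))
print(translate("AACCGCGAAAATAATTTATGA"))
```

The two fragments translate to the start (MTSSAMA) and end (NRENNL) of the protein sequence shown above.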
BLAST of Cp4.1LG01g01360 vs. Swiss-Prot
Match: TGT3B_ARATH (Trihelix transcription factor GT-3b OS=Arabidopsis thaliana GN=GT-3B PE=1 SV=1)

HSP 1 Score: 150.6 bits (379), Expect = 2.5e-35
Identity = 109/278 (39.21%), Positives = 159/278 (57.19%), Query Frame = 1

Query: 3   SSAMAATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERG 62
           +S +A   +  QWS EET+E I IR +L++             + K LW++ S +MR++ 
Sbjct: 30  ASPVAVGDRFPQWSVEETKELIGIRGELDQTFMETK-------RNKLLWEVISNKMRDKS 89

Query: 63  FWRTAYQCKCKWKNLLSRYKGKETTHKEFG-WQCPFFEEIRAVFAVREKAMHRLLLEPEA 122
           F R+  QCKCKWKNL++R+KG ET   E    Q PF+++++ +F  R + M  L  E E 
Sbjct: 90  FPRSPEQCKCKWKNLVTRFKGCETMEAETARQQFPFYDDMQNIFTTRMQRM--LWAESEG 149

Query: 123 GSSATKKRGRERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKR 182
           G   T    R+R   EYS  +E  EE   EE+   SN    K +  +      R  GS  
Sbjct: 150 GGGGTSGAARKR---EYSSDEE--EENVNEELVDVSNDP--KILNPKKNIAKKRKGGS-- 209

Query: 183 SSSSVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTE 242
           +SS+ +N + E+L+ F + Q RME EWRE  E     R   E+EWR  ME+LE+ERL  E
Sbjct: 210 NSSNSNNGVREVLEEFMRHQVRMESEWREGWEAREKERAEKEEEWRRKMEELEKERLAME 269

Query: 243 KAWREREEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           + WR+REEQR+ R+++RA+  D+L+  LL KL R+ +L
Sbjct: 270 RMWRDREEQRRSREEMRAEKRDSLINALLAKLTRDGSL 289
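
The percentages in an HSP summary line follow directly from the reported counts (matches divided by alignment length). A minimal sketch for recomputing them is shown below; `parse_hsp_stats` is a hypothetical helper, not part of any BLAST tool, and it accepts whatever label precedes each `n/m` pair.

```python
import re
from typing import Dict, Tuple


def parse_hsp_stats(line: str) -> Dict[str, Tuple[int, int, float]]:
    """Extract 'Label = n/m' pairs and recompute each percentage.

    Returns {label: (count, alignment_length, percent)} with the percentage
    rounded to two decimals, as in the report.
    """
    stats = {}
    for label, num, den in re.findall(r"(\w+)\s*=\s*(\d+)/(\d+)", line):
        n, m = int(num), int(den)
        stats[label] = (n, m, round(100 * n / m, 2))
    return stats


line = "Identity = 109/278 (39.21%), Positives = 159/278 (57.19%), Query Frame = 1"
for label, (n, m, pct) in parse_hsp_stats(line).items():
    print(f"{label}: {n}/{m} = {pct}%")
```

For the GT-3b hit above this gives 39.21% identity and 57.19% positives over an alignment length of 278, in agreement with the reported values.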

BLAST of Cp4.1LG01g01360 vs. Swiss-Prot
Match: TGT3A_ARATH (Trihelix transcription factor GT-3a OS=Arabidopsis thaliana GN=GT-3A PE=1 SV=1)

HSP 1 Score: 136.7 bits (343), Expect = 3.8e-31
Identity = 95/276 (34.42%), Positives = 152/276 (55.07%), Query Frame = 1

Query: 14  QWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCK 73
           QWS EET+E + IR +L++             + K LW++ +A+M ++GF R+A QCK K
Sbjct: 51  QWSIEETKELLAIREELDQTFMETK-------RNKLLWEVVAAKMADKGFVRSAEQCKSK 110

Query: 74  WKNLLSRYKGKETTHKE-FGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRGRE 133
           WKNL++RYK  ETT  +    Q PF+ EI+++F  R + M    L  EA   +T  + + 
Sbjct: 111 WKNLVTRYKACETTEPDAIRQQFPFYNEIQSIFEARMQRM----LWSEATEPSTSSKRKH 170

Query: 134 RSLEEYSDLKELDE--EESEEEIPTQSNSQKGK--------AIGTQYPAKSLRTAGSKRS 193
                  + +E+DE  ++  EE+ +   +QK +        +   +  AK  +   S   
Sbjct: 171 HQFSSDDEEEEVDEPNQDINEELLSLVETQKRETEVITTSTSTNPRKRAKKGKGVASGTK 230

Query: 194 SSSVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEK 253
           + +  N + ++L+ F +   +ME EWR+  E     R   E+EWR  M +LE ER  TE+
Sbjct: 231 AETAGNTLKDILEEFMRQTVKMEKEWRDAWEMKEIEREKREKEWRRRMAELEEERAATER 290

Query: 254 AWREREEQRKKRQDIRAQGMDALLTDLLNKLNRENN 279
            W EREE+R+ R++ RAQ  D+L+  LLN+LNR++N
Sbjct: 291 RWMEREEERRLREEARAQKRDSLIDALLNRLNRDHN 315

BLAST of Cp4.1LG01g01360 vs. TrEMBL
Match: A0A0A0KV13_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055350 PE=4 SV=1)

HSP 1 Score: 418.3 bits (1074), Expect = 7.3e-114
Identity = 223/275 (81.09%), Positives = 242/275 (88.00%), Query Frame = 1

Query: 6   MAATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWR 65
           MAAT  QHQWSEEETREFIRIRADLE+DL AVS GE  A KKKTLW+MAS RMRE+GFWR
Sbjct: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60

Query: 66  TAYQCKCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSA 125
           TA QCKCKWKNLLSRYKGKET+HKE+GWQCPFFEEI AVF  R KAMHRLLLEPEA S +
Sbjct: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120

Query: 126 TKKRGRERSLEEYSDLKELDEEESEEEIP-TQSNSQKGKAIGTQYPAKSLRTAGSKRSSS 185
           TKKRGRERSLEE+SDLKEL+E+E+EEE+  TQSNSQK KA   + PAKSL    SK SSS
Sbjct: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKA-ARKLPAKSLGATDSKSSSS 180

Query: 186 SVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAW 245
           S SNEI E+LKGFFQWQQRMEMEWREI+ERHYNNRRM EQEWR+SMEKLERERLM E+AW
Sbjct: 181 STSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAW 240

Query: 246 REREEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           REREEQRK+RQDIRA+GM+ALLT LLNKLN ENNL
Sbjct: 241 REREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 274

BLAST of Cp4.1LG01g01360 vs. TrEMBL
Match: U5GMR3_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s01210g PE=4 SV=1)

HSP 1 Score: 261.9 bits (668), Expect = 8.7e-67
Identity = 150/273 (54.95%), Positives = 196/273 (71.79%), Query Frame = 1

Query: 11  QQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQC 70
           QQ QW ++ET+EFI IRA+LE+D T          + KTLW++ S +MRE+G+ RT  QC
Sbjct: 38  QQPQWGQQETKEFIGIRAELEKDFTVTK-------RNKTLWEIVSVKMREKGYRRTPEQC 97

Query: 71  KCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRG 130
           KCKWKNL++RYKGKET+  E G QCPFFEE+ AVF  R K M RLLLE EAGS+ ++K+ 
Sbjct: 98  KCKWKNLVNRYKGKETSDPETGRQCPFFEELHAVFTERAKNMQRLLLESEAGSTQSRKKM 157

Query: 131 R----ERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSV 190
           +    +RS +E+S+ ++ DE++SEEE P +SNS+K K        + +    S R+SSS 
Sbjct: 158 KRTSGDRSSDEFSEEEDEDEDDSEEEKPVRSNSRKRKV-------EKIIAEKSPRASSST 217

Query: 191 SNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWRE 250
              I E+LK F Q QQ+MEM+WRE++ER  + R+M EQEWR SMEKLERERLM E+AWRE
Sbjct: 218 VGGIQEMLKEFLQQQQKMEMQWREMMERRSHERQMFEQEWRQSMEKLERERLMIEQAWRE 277

Query: 251 REEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           REEQR+ R++ RA+  DALLT LLNKL RENN+
Sbjct: 278 REEQRRIREESRAERRDALLTTLLNKLIRENNI 296

BLAST of Cp4.1LG01g01360 vs. TrEMBL
Match: A0A061G7N3_THECC (Homeodomain-like superfamily protein OS=Theobroma cacao GN=TCM_016743 PE=4 SV=1)

HSP 1 Score: 251.1 bits (640), Expect = 1.5e-63
Identity = 152/270 (56.30%), Positives = 189/270 (70.00%), Query Frame = 1

Query: 14  QWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCK 73
           QW  EETRE I IR +LERD TA       A + KTLW++ SARMR+RG+ RT  QCKCK
Sbjct: 24  QWGPEETRELILIRGELERDFTA-------AKRNKTLWEIVSARMRDRGYIRTPDQCKCK 83

Query: 74  WKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRGR-- 133
           WKNLL+RYKGKET+  E G Q PFFEE+ AVF  R K M RLLLE EAGS+  KKR R  
Sbjct: 84  WKNLLNRYKGKETSDPENGRQFPFFEELHAVFTERAKNMQRLLLESEAGSTQAKKRMRRI 143

Query: 134 --ERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSVSNE 193
             +RS +E+S+ ++ DE+ESEEE   +S S + +        KS R   +  +SS+ S  
Sbjct: 144 SADRSSDEFSEEEDDDEDESEEERHARSISSRKRKADRVVLDKSPRP--NSGTSSTSSTG 203

Query: 194 ILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWREREE 253
           + E+L+ FFQ QQRMEM+WRE++ER    R++ EQEWR SMEKLERERLM E+AWREREE
Sbjct: 204 LQEMLREFFQQQQRMEMQWREMMERRARERQLFEQEWRQSMEKLERERLMVEQAWREREE 263

Query: 254 QRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           QR+ R++ RA+  DALLT LLNKL  +NNL
Sbjct: 264 QRRLREESRAERRDALLTTLLNKLINDNNL 284

BLAST of Cp4.1LG01g01360 vs. TrEMBL
Match: K7K3Z3_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_01G150500 PE=4 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 7.6e-63
Identity = 153/273 (56.04%), Positives = 192/273 (70.33%), Query Frame = 1

Query: 12  QHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCK 71
           Q QWS++ETREFI IRA+LERD TA       + + KTLW++ SA+MRERGF R+  QCK
Sbjct: 47  QPQWSQQETREFIAIRAELERDFTA-------SKRNKTLWEVVSAKMRERGFRRSPEQCK 106

Query: 72  CKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKK--- 131
           CKWKNL++RYKGKET+  E G QCPFFEE+ AVF  R   M RLLLE E  S+ TKK   
Sbjct: 107 CKWKNLVNRYKGKETSDPEHGKQCPFFEELHAVFTQRAHNMQRLLLESETRSAQTKKGVK 166

Query: 132 -RGRERSLEEYSDLKELDEEESEEEIPTQSNSQKGKA--IGTQYPAKSLRTAGSKRSSSS 191
               +RS EE S+     E +SEEE P++SN++K K   +G +   KS R A +  +S+S
Sbjct: 167 RSSGDRSSEELSEDDNEVEYDSEEEKPSRSNTRKRKVDKVGVE---KSSR-ASNPSNSAS 226

Query: 192 VSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWR 251
            S  I E+LK FFQ Q  MEM+WRE++ER  + R++ EQEWR SMEKLERERLM E+AWR
Sbjct: 227 NSTSIQEMLKEFFQHQLSMEMQWREMMERRAHERQLFEQEWRQSMEKLERERLMIEQAWR 286

Query: 252 EREEQRKKRQDIRAQGMDALLTDLLNKLNRENN 279
           EREEQR+ R++ RA+  DALLT LLNKL  E+N
Sbjct: 287 EREEQRRMREESRAERRDALLTTLLNKLINESN 308

BLAST of Cp4.1LG01g01360 vs. TrEMBL
Match: V7CJ09_PHAVU (Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G126400g PE=4 SV=1)

HSP 1 Score: 248.1 bits (632), Expect = 1.3e-62
Identity = 147/270 (54.44%), Positives = 189/270 (70.00%), Query Frame = 1

Query: 12  QHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCK 71
           Q QWS++ETREFI IRA+LERD TA       + + KTLW++ S++MRERGF R+  QCK
Sbjct: 46  QPQWSQQETREFIAIRAELERDFTA-------SKRNKTLWEVVSSKMRERGFRRSPEQCK 105

Query: 72  CKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRGR 131
           CKWKNL++RYKGKET+  E   QCPFFEE+ AVF  R   M RLLLE E  S+ TKK  +
Sbjct: 106 CKWKNLVNRYKGKETSDPEHSRQCPFFEELHAVFTQRAHNMQRLLLESETRSAQTKKGVK 165

Query: 132 ----ERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSVS 191
               +RS EE S+ ++  E +SEEE P++SN++K K         S R   S  + SS +
Sbjct: 166 RSSGDRSSEELSEDEDEVEYDSEEEKPSRSNTRKKKVDKVGIDKSSSRAYNSSHAVSSST 225

Query: 192 NEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWRER 251
           + I E+LK FFQ Q RMEM+WRE++ER  + R++ EQEWR SMEKLERERL+ E+AWRER
Sbjct: 226 SSIQEMLKEFFQHQLRMEMKWREMMERRAHERQLFEQEWRQSMEKLERERLIIEQAWRER 285

Query: 252 EEQRKKRQDIRAQGMDALLTDLLNKLNREN 278
           EEQR+ R++ RA+  DALLT LLNKL  E+
Sbjct: 286 EEQRRMREESRAEKRDALLTTLLNKLINES 308

BLAST of Cp4.1LG01g01360 vs. TAIR10
Match: AT2G38250.1 (AT2G38250.1 Homeodomain-like superfamily protein)

HSP 1 Score: 150.6 bits (379), Expect = 1.4e-36
Identity = 109/278 (39.21%), Positives = 159/278 (57.19%), Query Frame = 1

Query: 3   SSAMAATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERG 62
           +S +A   +  QWS EET+E I IR +L++             + K LW++ S +MR++ 
Sbjct: 30  ASPVAVGDRFPQWSVEETKELIGIRGELDQTFMETK-------RNKLLWEVISNKMRDKS 89

Query: 63  FWRTAYQCKCKWKNLLSRYKGKETTHKEFG-WQCPFFEEIRAVFAVREKAMHRLLLEPEA 122
           F R+  QCKCKWKNL++R+KG ET   E    Q PF+++++ +F  R + M  L  E E 
Sbjct: 90  FPRSPEQCKCKWKNLVTRFKGCETMEAETARQQFPFYDDMQNIFTTRMQRM--LWAESEG 149

Query: 123 GSSATKKRGRERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKR 182
           G   T    R+R   EYS  +E  EE   EE+   SN    K +  +      R  GS  
Sbjct: 150 GGGGTSGAARKR---EYSSDEE--EENVNEELVDVSNDP--KILNPKKNIAKKRKGGS-- 209

Query: 183 SSSSVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTE 242
           +SS+ +N + E+L+ F + Q RME EWRE  E     R   E+EWR  ME+LE+ERL  E
Sbjct: 210 NSSNSNNGVREVLEEFMRHQVRMESEWREGWEAREKERAEKEEEWRRKMEELEKERLAME 269

Query: 243 KAWREREEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           + WR+REEQR+ R+++RA+  D+L+  LL KL R+ +L
Sbjct: 270 RMWRDREEQRRSREEMRAEKRDSLINALLAKLTRDGSL 289

BLAST of Cp4.1LG01g01360 vs. TAIR10
Match: AT5G01380.1 (AT5G01380.1 Homeodomain-like superfamily protein)

HSP 1 Score: 136.7 bits (343), Expect = 2.1e-32
Identity = 95/276 (34.42%), Positives = 152/276 (55.07%), Query Frame = 1

Query: 14  QWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCK 73
           QWS EET+E + IR +L++             + K LW++ +A+M ++GF R+A QCK K
Sbjct: 51  QWSIEETKELLAIREELDQTFMETK-------RNKLLWEVVAAKMADKGFVRSAEQCKSK 110

Query: 74  WKNLLSRYKGKETTHKE-FGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRGRE 133
           WKNL++RYK  ETT  +    Q PF+ EI+++F  R + M    L  EA   +T  + + 
Sbjct: 111 WKNLVTRYKACETTEPDAIRQQFPFYNEIQSIFEARMQRM----LWSEATEPSTSSKRKH 170

Query: 134 RSLEEYSDLKELDE--EESEEEIPTQSNSQKGK--------AIGTQYPAKSLRTAGSKRS 193
                  + +E+DE  ++  EE+ +   +QK +        +   +  AK  +   S   
Sbjct: 171 HQFSSDDEEEEVDEPNQDINEELLSLVETQKRETEVITTSTSTNPRKRAKKGKGVASGTK 230

Query: 194 SSSVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEK 253
           + +  N + ++L+ F +   +ME EWR+  E     R   E+EWR  M +LE ER  TE+
Sbjct: 231 AETAGNTLKDILEEFMRQTVKMEKEWRDAWEMKEIEREKREKEWRRRMAELEEERAATER 290

Query: 254 AWREREEQRKKRQDIRAQGMDALLTDLLNKLNRENN 279
            W EREE+R+ R++ RAQ  D+L+  LLN+LNR++N
Sbjct: 291 RWMEREEERRLREEARAQKRDSLIDALLNRLNRDHN 315

BLAST of Cp4.1LG01g01360 vs. TAIR10
Match: AT5G47660.1 (AT5G47660.1 Homeodomain-like superfamily protein)

HSP 1 Score: 52.4 bits (124), Expect = 5.3e-07
Identity = 26/69 (37.68%), Positives = 40/69 (57.97%), Query Frame = 1

Query: 14  QWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCK 73
           +W +EE +  I  R+D+E         E   + K  +WD  SARM+ERG+ R+A +CK K
Sbjct: 303 RWPQEEVQALISSRSDVE---------EKTGINKGAIWDEISARMKERGYERSAKKCKEK 362

Query: 74  WKNLLSRYK 83
           W+N+   Y+
Sbjct: 363 WENMNKYYR 362

BLAST of Cp4.1LG01g01360 vs. TAIR10
Match: AT3G25990.1 (AT3G25990.1 Homeodomain-like superfamily protein)

HSP 1 Score: 50.4 bits (119), Expect = 2.0e-06
Identity = 29/101 (28.71%), Positives = 51/101 (50.50%), Query Frame = 1

Query: 15  WSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCKW 74
           W+++ETR  I +R +++       +        K LW+  S +MRE+GF R+   C  KW
Sbjct: 55  WAQDETRTLISLRREMDNLFNTSKS-------NKHLWEQISKKMREKGFDRSPSMCTDKW 114

Query: 75  KNLLSRYKGKETTHKE-----FGWQCPFFEEIRAVFAVREK 111
           +N+L  +K K   H++        +  ++ EI  +F  R+K
Sbjct: 115 RNILKEFK-KAKQHEDKATSGGSTKMSYYNEIEDIFRERKK 147

BLAST of Cp4.1LG01g01360 vs. TAIR10
Match: AT1G76880.1 (AT1G76880.1 Duplicated homeodomain-like superfamily protein)

HSP 1 Score: 50.4 bits (119), Expect = 2.0e-06
Identity = 32/107 (29.91%), Positives = 53/107 (49.53%), Query Frame = 1

Query: 7   AATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRT 66
           AA+A   +W + E    I++R +L+               K  LW+  SA MR  GF R 
Sbjct: 401 AASASSSRWPKVEIEALIKLRTNLDSKYQENGP-------KGPLWEEISAGMRRLGFNRN 460

Query: 67  AYQCKCKWKNLLSRYKGKETTHK---EFGWQCPFFEEIRAVFAVREK 111
           + +CK KW+N+   +K  + ++K   E    CP+F ++ A++  R K
Sbjct: 461 SKRCKEKWENINKYFKKVKESNKKRPEDSKTCPYFHQLDALYRERNK 500

BLAST of Cp4.1LG01g01360 vs. NCBI nr
Match: gi|659102022|ref|XP_008451911.1| (PREDICTED: trihelix transcription factor GT-3b-like [Cucumis melo])

HSP 1 Score: 423.3 bits (1087), Expect = 3.2e-115
Identity = 226/280 (80.71%), Positives = 248/280 (88.57%), Query Frame = 1

Query: 1   MTSSAMAATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRE 60
           MTS+AMAAT  QHQWSEEETREFIRIRADLE+DLTAVSTGE  A KKKTLW+MAS RMRE
Sbjct: 1   MTSTAMAATLHQHQWSEEETREFIRIRADLEKDLTAVSTGEAPAAKKKTLWEMASVRMRE 60

Query: 61  RGFWRTAYQCKCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPE 120
           +GFWRTA QCKCKWKNLLSRYKGKET+HKE+GWQCPFFEEI AVF  R KAMHRLLLEPE
Sbjct: 61  KGFWRTADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPE 120

Query: 121 AGSSATKKRGRERSLEEYSDLKELDEEESEEEIP-TQSNSQKGKAIGTQYPAKSLRTAGS 180
           A S +TKKRGRERSLEE+SDLKEL+E+E+EEE+  TQ NSQK KA   + PAKSL    S
Sbjct: 121 ACSISTKKRGRERSLEEHSDLKELNEDETEEEVTLTQRNSQKRKA-ARKLPAKSLGATDS 180

Query: 181 KRSSSSVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLM 240
           K SSSS+S EI E+LKGF QWQQRMEMEWREI+ERHYNNRRMLEQEWR+SMEKLERERLM
Sbjct: 181 KSSSSSISYEIQEMLKGFLQWQQRMEMEWREIVERHYNNRRMLEQEWRESMEKLERERLM 240

Query: 241 TEKAWREREEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
            E+AWREREEQRK++QDIRA+GM+ALLT LLNKLN ENNL
Sbjct: 241 AEQAWREREEQRKEKQDIRAEGMNALLTTLLNKLNHENNL 279

BLAST of Cp4.1LG01g01360 vs. NCBI nr
Match: gi|449462507|ref|XP_004148982.1| (PREDICTED: trihelix transcription factor GT-3b-like [Cucumis sativus])

HSP 1 Score: 418.3 bits (1074), Expect = 1.0e-113
Identity = 223/275 (81.09%), Positives = 242/275 (88.00%), Query Frame = 1

Query: 6   MAATAQQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWR 65
           MAAT  QHQWSEEETREFIRIRADLE+DL AVS GE  A KKKTLW+MAS RMRE+GFWR
Sbjct: 1   MAATPHQHQWSEEETREFIRIRADLEKDLAAVSIGEAPAAKKKTLWEMASVRMREKGFWR 60

Query: 66  TAYQCKCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSA 125
           TA QCKCKWKNLLSRYKGKET+HKE+GWQCPFFEEI AVF  R KAMHRLLLEPEA S +
Sbjct: 61  TADQCKCKWKNLLSRYKGKETSHKEYGWQCPFFEEIHAVFTERGKAMHRLLLEPEACSIS 120

Query: 126 TKKRGRERSLEEYSDLKELDEEESEEEIP-TQSNSQKGKAIGTQYPAKSLRTAGSKRSSS 185
           TKKRGRERSLEE+SDLKEL+E+E+EEE+  TQSNSQK KA   + PAKSL    SK SSS
Sbjct: 121 TKKRGRERSLEEHSDLKELNEDENEEEVTFTQSNSQKRKA-ARKLPAKSLGATDSKSSSS 180

Query: 186 SVSNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAW 245
           S SNEI E+LKGFFQWQQRMEMEWREI+ERHYNNRRM EQEWR+SMEKLERERLM E+AW
Sbjct: 181 STSNEIQEMLKGFFQWQQRMEMEWREIVERHYNNRRMFEQEWRESMEKLERERLMAEQAW 240

Query: 246 REREEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           REREEQRK+RQDIRA+GM+ALLT LLNKLN ENNL
Sbjct: 241 REREEQRKERQDIRAEGMNALLTTLLNKLNHENNL 274

BLAST of Cp4.1LG01g01360 vs. NCBI nr
Match: gi|743808955|ref|XP_011018396.1| (PREDICTED: trihelix transcription factor GT-3b-like [Populus euphratica])

HSP 1 Score: 263.1 bits (671), Expect = 5.6e-67
Identity = 151/273 (55.31%), Positives = 197/273 (72.16%), Query Frame = 1

Query: 11  QQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQC 70
           QQ QW ++ET+EFI IRA+LE+D T          + KTLW++ SA+MRE+G+ RT  QC
Sbjct: 38  QQPQWGQQETKEFIGIRAELEKDFTVTK-------RNKTLWEIVSAKMREKGYRRTPEQC 97

Query: 71  KCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRG 130
           KCKWKNL++RYKGKET+  E G QCPFFEE+ AVF  R K M RLLLE EAGS+ ++K+ 
Sbjct: 98  KCKWKNLVNRYKGKETSDPETGRQCPFFEELHAVFTERAKNMQRLLLESEAGSTQSRKKM 157

Query: 131 R----ERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSV 190
           +    +RS +E+S+ ++ DE++SEEE P +SNS+K K        + +    S R+SSS 
Sbjct: 158 KRTSGDRSSDEFSEEEDEDEDDSEEEKPVRSNSRKRKV-------EKIIAEKSPRASSST 217

Query: 191 SNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWRE 250
              I E+LK F Q QQ+MEM+WRE++ER  + R+M EQEWR SMEKLERERLM E+AWRE
Sbjct: 218 VGGIQEMLKEFLQQQQKMEMQWREMMERRSHERQMFEQEWRQSMEKLERERLMIEQAWRE 277

Query: 251 REEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           REEQR+ R++ RA+  DALLT LLNKL RENN+
Sbjct: 278 REEQRRIREESRAERRDALLTTLLNKLIRENNV 296

BLAST of Cp4.1LG01g01360 vs. NCBI nr
Match: gi|566146525|ref|XP_006368276.1| (hypothetical protein POPTR_0001s01210g [Populus trichocarpa])

HSP 1 Score: 261.9 bits (668), Expect = 1.3e-66
Identity = 150/273 (54.95%), Positives = 196/273 (71.79%), Query Frame = 1

Query: 11  QQHQWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQC 70
           QQ QW ++ET+EFI IRA+LE+D T          + KTLW++ S +MRE+G+ RT  QC
Sbjct: 38  QQPQWGQQETKEFIGIRAELEKDFTVTK-------RNKTLWEIVSVKMREKGYRRTPEQC 97

Query: 71  KCKWKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRG 130
           KCKWKNL++RYKGKET+  E G QCPFFEE+ AVF  R K M RLLLE EAGS+ ++K+ 
Sbjct: 98  KCKWKNLVNRYKGKETSDPETGRQCPFFEELHAVFTERAKNMQRLLLESEAGSTQSRKKM 157

Query: 131 R----ERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSV 190
           +    +RS +E+S+ ++ DE++SEEE P +SNS+K K        + +    S R+SSS 
Sbjct: 158 KRTSGDRSSDEFSEEEDEDEDDSEEEKPVRSNSRKRKV-------EKIIAEKSPRASSST 217

Query: 191 SNEILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWRE 250
              I E+LK F Q QQ+MEM+WRE++ER  + R+M EQEWR SMEKLERERLM E+AWRE
Sbjct: 218 VGGIQEMLKEFLQQQQKMEMQWREMMERRSHERQMFEQEWRQSMEKLERERLMIEQAWRE 277

Query: 251 REEQRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           REEQR+ R++ RA+  DALLT LLNKL RENN+
Sbjct: 278 REEQRRIREESRAERRDALLTTLLNKLIRENNI 296

BLAST of Cp4.1LG01g01360 vs. NCBI nr
Match: gi|590680697|ref|XP_007040932.1| (Homeodomain-like superfamily protein [Theobroma cacao])

HSP 1 Score: 251.1 bits (640), Expect = 2.2e-63
Identity = 152/270 (56.30%), Positives = 189/270 (70.00%), Query Frame = 1

Query: 14  QWSEEETREFIRIRADLERDLTAVSTGEGAAVKKKTLWDMASARMRERGFWRTAYQCKCK 73
           QW  EETRE I IR +LERD TA       A + KTLW++ SARMR+RG+ RT  QCKCK
Sbjct: 24  QWGPEETRELILIRGELERDFTA-------AKRNKTLWEIVSARMRDRGYIRTPDQCKCK 83

Query: 74  WKNLLSRYKGKETTHKEFGWQCPFFEEIRAVFAVREKAMHRLLLEPEAGSSATKKRGR-- 133
           WKNLL+RYKGKET+  E G Q PFFEE+ AVF  R K M RLLLE EAGS+  KKR R  
Sbjct: 84  WKNLLNRYKGKETSDPENGRQFPFFEELHAVFTERAKNMQRLLLESEAGSTQAKKRMRRI 143

Query: 134 --ERSLEEYSDLKELDEEESEEEIPTQSNSQKGKAIGTQYPAKSLRTAGSKRSSSSVSNE 193
             +RS +E+S+ ++ DE+ESEEE   +S S + +        KS R   +  +SS+ S  
Sbjct: 144 SADRSSDEFSEEEDDDEDESEEERHARSISSRKRKADRVVLDKSPRP--NSGTSSTSSTG 203

Query: 194 ILELLKGFFQWQQRMEMEWREILERHYNNRRMLEQEWRDSMEKLERERLMTEKAWREREE 253
           + E+L+ FFQ QQRMEM+WRE++ER    R++ EQEWR SMEKLERERLM E+AWREREE
Sbjct: 204 LQEMLREFFQQQQRMEMQWREMMERRARERQLFEQEWRQSMEKLERERLMVEQAWREREE 263

Query: 254 QRKKRQDIRAQGMDALLTDLLNKLNRENNL 280
           QR+ R++ RA+  DALLT LLNKL  +NNL
Sbjct: 264 QRRLREESRAERRDALLTTLLNKLINDNNL 284

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
TGT3B_ARATH  2.5e-35  39.21         Trihelix transcription factor GT-3b OS=Arabidopsis thaliana GN=GT-3B PE=1 SV=1
TGT3A_ARATH  3.8e-31  34.42         Trihelix transcription factor GT-3a OS=Arabidopsis thaliana GN=GT-3A PE=1 SV=1

TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0KV13_CUCSA  7.3e-114  81.09         Uncharacterized protein OS=Cucumis sativus GN=Csa_4G055350 PE=4 SV=1
U5GMR3_POPTR      8.7e-67   54.95         Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0001s01210g PE=4 SV=1
A0A061G7N3_THECC  1.5e-63   56.30         Homeodomain-like superfamily protein OS=Theobroma cacao GN=TCM_016743 PE=4 SV=1
K7K3Z3_SOYBN      7.6e-63   56.04         Uncharacterized protein OS=Glycine max GN=GLYMA_01G150500 PE=4 SV=1
V7CJ09_PHAVU      1.3e-62   54.44         Uncharacterized protein OS=Phaseolus vulgaris GN=PHAVU_002G126400g PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT2G38250.1  1.4e-36  39.21         Homeodomain-like superfamily protein
AT5G01380.1  2.1e-32  34.42         Homeodomain-like superfamily protein
AT5G47660.1  5.3e-07  37.68         Homeodomain-like superfamily protein
AT3G25990.1  2.0e-06  28.71         Homeodomain-like superfamily protein
AT1G76880.1  2.0e-06  29.91         Duplicated homeodomain-like superfamily protein

NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|659102022|ref|XP_008451911.1|  3.2e-115  80.71         PREDICTED: trihelix transcription factor GT-3b-like [Cucumis melo]
gi|449462507|ref|XP_004148982.1|  1.0e-113  81.09         PREDICTED: trihelix transcription factor GT-3b-like [Cucumis sativus]
gi|743808955|ref|XP_011018396.1|  5.6e-67   55.31         PREDICTED: trihelix transcription factor GT-3b-like [Populus euphratica]
gi|566146525|ref|XP_006368276.1|  1.3e-66   54.95         hypothetical protein POPTR_0001s01210g [Populus trichocarpa]
gi|590680697|ref|XP_007040932.1|  2.2e-63   56.30         Homeodomain-like superfamily protein [Theobroma cacao]
The following terms have been associated with this gene:

Vocabulary: Biological Process
Term        Definition
GO:0006351  transcription, DNA-templated
GO:0006355  regulation of transcription, DNA-templated

Vocabulary: Molecular Function
Term        Definition
GO:0043565  sequence-specific DNA binding
GO:0003700  transcription factor activity, sequence-specific DNA binding

Vocabulary: Cellular Component
Term        Definition
GO:0005634  nucleus

Vocabulary: INTERPRO
Term       Definition
IPR027775  C2H2- zinc finger protein family
IPR027759  Trihelix_TF_GT3
IPR017877  Myb-like_dom
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
biological_process  GO:0006351      transcription, DNA-templated
biological_process  GO:0006357      regulation of transcription from RNA polymerase II promoter
biological_process  GO:0008150      biological_process
cellular_component  GO:0005634      nucleus
cellular_component  GO:0005667      transcription factor complex
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003700      transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0000981      RNA polymerase II transcription factor activity, sequence-specific DNA binding
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG01g01360.1  Cp4.1LG01g01360.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description                    Source   Source Term      Source Description                              Alignment
IPR017877  Myb-like domain                    PROFILE  PS50090          MYB_LIKE                                        coord: 7..78; score: 6
IPR027759  Trihelix transcription factor GT3  PANTHER  PTHR10032:SF215  TRIHELIX TRANSCRIPTION FACTOR GT-3A-RELATED     coord: 14..279; score: 2.9
IPR027775  C2H2- zinc finger protein family   PANTHER  PTHR10032        ZINC FINGER PROTEIN WITH KRAB AND SCAN DOMAINS  coord: 14..279; score: 2.9
None       No IPR available                   PFAM     PF13837          Myb_DNA-bind_4                                  coord: 13..103; score: 1.3
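
The `coord` values can be used to cut the matched region out of the protein sequence. The sketch below assumes the coordinates are 1-based, inclusive residue positions on the protein (consistent with the alignment numbering on this page, but not stated explicitly); `domain_subsequence` is a hypothetical helper, and `PROTEIN_PREFIX` holds only the first 80 residues of the protein sequence listed above.

```python
# First 80 residues of the protein sequence from this record.
PROTEIN_PREFIX = (
    "MTSSAMAATAQQHQWSEEETREFIRIRADLERDLTAVSTG"
    "EGAAVKKKTLWDMASARMRERGFWRTAYQCKCKWKNLLSR"
)


def domain_subsequence(protein: str, coord: str) -> str:
    """Return the residues covered by a 'start..end' coordinate string.

    Coordinates are treated as 1-based and inclusive (an assumption).
    """
    start_text, end_text = coord.split("..")
    start, end = int(start_text), int(end_text)
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"coordinates {coord!r} fall outside the sequence")
    return protein[start - 1:end]


# Myb-like domain (PS50090) reported at coord 7..78.
myb_like = domain_subsequence(PROTEIN_PREFIX, "7..78")
print(len(myb_like), myb_like)
```

The same call with the full protein sequence would extract the other reported regions (14..279 and 13..103).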