Csa6G031440 (gene) Cucumber (Chinese Long) v2

Name: Csa6G031440
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Myb transcription factor; contains IPR009057 (Homeodomain-like)
Location: Chr6 : 2541807 .. 2544144 (+)
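The Location field can be split into chromosome, coordinates and strand. The sketch below is only an illustration: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr6 : 2541807 .. 2544144 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is end - start + 1.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError("unrecognised location string: %r" % text)
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr6 : 2541807 .. 2544144 (+)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 2338 bp on the plus strand of Chr6.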
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TTTGTTTTTTAGTTTAATTATTATGATGATGGAAAAAGTTGGACACCAAGTTTATTCCTTCCAATGATTCCCTTCTACATCCCTTATTCTCTCTCATCTCCAATCCCCAATTCCCAATCCCTAATTCCATCTCAATAACACATACTCCTACACAATCATGCTTTACTCCGACAAAATGCAACAAATTGCAGCCAAAATGGGTTTCACACTCTCCGATTTCGCCGACACTTTGGAACAAGAACGCCGCAAAGTCCTCATGTTTCAGCGCGAGCTCCCTCTCTGTTTGCACCTTGTTTCCCATGGTGAACATCTCATCCCATCTCTCCCCTTCCTTTTCCACTCTATATTTTCTTTTCTGATTTCGATTTTTTTCGCAGCCATTGATTGTTGTAGGCAGCAGCTATCGGGGACGACTACGGAGAATCGTCAATCCGAGTGTTCTGAGCAGACTTCCAGTGATATGGGTCCGGTTCTTGAGGAATTTATTCCCATTAACAGAAATGGGGTTTCCGATTTTGAAAAAACGGAGAAGAATAACAAAAACCATGACTCTGATTTAAATAATTTGAATTTGGCTCCGTCTGATTGGCTCAGATCTGCTCAGCTTTGGAATCAGACTTCAGATCCTCCTCCCTTGAATCAGGTAAAAAAGGAAGTTCCCCCCTTTCCATGGATTTCAAAATTATTCTCCTCCGATGTTTAGGGGTTTATTTGCTTATGTTCTGCAGGACCTGCCAGAGAACACGCCGGTCGTTGAGGTCAATAGAAATGGCGGCGCTTTCCGGCCGTTTCAGAAGGAGAAAACCGGTGGCGGCGGTGGCGGGGGAGGGGCATCGTCGTCTTCACCTCCGGCTCCGGCTGCGGAAACGAGCTCCACGACGGAAACGGGTTCAGGGGGAAGTAGTCGGCGGGAAGAGAAGGAAGCGCAGAATCAGAGGAAACAGAGACGTTGCTGGTCGCCGGAGCTTCACCGGCGGTTCCTTCATGCGCTTCAGCAGCTCGGAGGGTCCCATGGTAGGGAAGAACGAAGTAATTAAATTTATTATTTTGTATTTCTTTAATTTAATTTAATTTGAAGTTATTGATTATTATTCTGTGAATTGAAATGAAATGCAGTGGCGACGCCGAAGCAAATAAGAGAATTGATGAAGGTGGATGGTCTCACCAACGATGAAGTGAAAAGTCACCTGCAGGTTTGTTCTTCTTCCTTTTTGTTTCAAATTAAGTTATACTGTTTTTTTAAAAGGAAATTGTCTCTTAAAAATGAAATATATATATATATGATTCAAATAAATTTTCAGTAATCAGCGATTTTAATGGGAATTCTTCTCTAAATTACTTTAGATAGAAGATCAAATCATTCTAGAAAGTGTAAAGTAAGATAAAAATAAATAGAATGCTTGGAATGAGTTTTAATTTTAACTTATCATTTATTAAACCAAAAGAAAAATGTCAGTAAAGTATGTTCATGACAGCTAATAAATATTCAACTTTTATTCTATTTATTAGAGGAATTTGAAATCTCTGCTTTTTTATTTTATTATTCGGCTTCCTCTTAACTTTTTCTTCAATCCTAAGTTTTTTTTTTTCTCCCTTTTATCAATTAAAGAATGTGATGCCAAATTTATATTTATAAAAATAATGGGAATAATAATAAAGAATCAGGAAGGTTTTAAAATACAAGTGAACTTATTCATAAACCTAACATTTCATATTTGGTCTCTTATCATTAACTCCATATTTTTGTTCAATAAGCCTATGCCCTAATAGTTTTATCAATTTGAAGTTTGTCGTCATTTGGCATTTTTACTTCAAAATAAAATTGAAAATTTAGAGTTTTGTTCAACATGTTTGCTAACTAAGACACAAAAGTTATTTTTCTTTTCTATATTCAAGAATTATTTTGTGAAAATATATTTCTTATAAATAAGAAAGAATATTGTAAAAGATTATTGGGATGATTCATTCTGTAAAACATTATTTTTGTTCAGAAGTATCGT
CTGCACACGAGACGACCAACTCCGACGATCCACAACAACGAGGGCGGCCATGCACCGCAGTTCCTAGTTGTTGGCGGCATATGGGTACCTGCAGCCGAGTACGCCGCTGTGTCCACCACCACTTCATCTGGAGAAGTGGTCAGTGCCGCTACCACCAACGGAATTTATGCACCGGTTGTGGCAGCTGCAGCGCCGCAGCCATTAGTTAGTACAGTTCAAAAGCCCAAGCCCAAGCCCAAGCCCAAGATTATTCCTTCCTCCGCCGTGGAATGTAATTCTCCGACTACATCTTCGTCTACTCATACGTCGTCAGTTTCACCAGCTTCTTCTTGAGCCTC

mRNA sequence

ATGCTTTACTCCGACAAAATGCAACAAATTGCAGCCAAAATGGGTTTCACACTCTCCGATTTCGCCGACACTTTGGAACAAGAACGCCGCAAAGTCCTCATGTTTCAGCGCGAGCTCCCTCTCTGTTTGCACCTTGTTTCCCATGCCATTGATTGTTGTAGGCAGCAGCTATCGGGGACGACTACGGAGAATCGTCAATCCGAGTGTTCTGAGCAGACTTCCAGTGATATGGGTCCGGTTCTTGAGGAATTTATTCCCATTAACAGAAATGGGGTTTCCGATTTTGAAAAAACGGAGAAGAATAACAAAAACCATGACTCTGATTTAAATAATTTGAATTTGGCTCCGTCTGATTGGCTCAGATCTGCTCAGCTTTGGAATCAGACTTCAGATCCTCCTCCCTTGAATCAGGACCTGCCAGAGAACACGCCGGTCGTTGAGGTCAATAGAAATGGCGGCGCTTTCCGGCCGTTTCAGAAGGAGAAAACCGGTGGCGGCGGTGGCGGGGGAGGGGCATCGTCGTCTTCACCTCCGGCTCCGGCTGCGGAAACGAGCTCCACGACGGAAACGGGTTCAGGGGGAAGTAGTCGGCGGGAAGAGAAGGAAGCGCAGAATCAGAGGAAACAGAGACGTTGCTGGTCGCCGGAGCTTCACCGGCGGTTCCTTCATGCGCTTCAGCAGCTCGGAGGGTCCCATGTGGCGACGCCGAAGCAAATAAGAGAATTGATGAAGGTGGATGGTCTCACCAACGATGAAGTGAAAAGTCACCTGCAGAAGTATCGTCTGCACACGAGACGACCAACTCCGACGATCCACAACAACGAGGGCGGCCATGCACCGCAGTTCCTAGTTGTTGGCGGCATATGGGTACCTGCAGCCGAGTACGCCGCTGTGTCCACCACCACTTCATCTGGAGAAGTGGTCAGTGCCGCTACCACCAACGGAATTTATGCACCGGTTGTGGCAGCTGCAGCGCCGCAGCCATTAGTTAGTACAGTTCAAAAGCCCAAGCCCAAGCCCAAGCCCAAGATTATTCCTTCCTCCGCCGTGGAATGTAATTCTCCGACTACATCTTCGTCTACTCATACGTCGTCAGTTTCACCAGCTTCTTCTTGA

Coding sequence (CDS)

ATGCTTTACTCCGACAAAATGCAACAAATTGCAGCCAAAATGGGTTTCACACTCTCCGATTTCGCCGACACTTTGGAACAAGAACGCCGCAAAGTCCTCATGTTTCAGCGCGAGCTCCCTCTCTGTTTGCACCTTGTTTCCCATGCCATTGATTGTTGTAGGCAGCAGCTATCGGGGACGACTACGGAGAATCGTCAATCCGAGTGTTCTGAGCAGACTTCCAGTGATATGGGTCCGGTTCTTGAGGAATTTATTCCCATTAACAGAAATGGGGTTTCCGATTTTGAAAAAACGGAGAAGAATAACAAAAACCATGACTCTGATTTAAATAATTTGAATTTGGCTCCGTCTGATTGGCTCAGATCTGCTCAGCTTTGGAATCAGACTTCAGATCCTCCTCCCTTGAATCAGGACCTGCCAGAGAACACGCCGGTCGTTGAGGTCAATAGAAATGGCGGCGCTTTCCGGCCGTTTCAGAAGGAGAAAACCGGTGGCGGCGGTGGCGGGGGAGGGGCATCGTCGTCTTCACCTCCGGCTCCGGCTGCGGAAACGAGCTCCACGACGGAAACGGGTTCAGGGGGAAGTAGTCGGCGGGAAGAGAAGGAAGCGCAGAATCAGAGGAAACAGAGACGTTGCTGGTCGCCGGAGCTTCACCGGCGGTTCCTTCATGCGCTTCAGCAGCTCGGAGGGTCCCATGTGGCGACGCCGAAGCAAATAAGAGAATTGATGAAGGTGGATGGTCTCACCAACGATGAAGTGAAAAGTCACCTGCAGAAGTATCGTCTGCACACGAGACGACCAACTCCGACGATCCACAACAACGAGGGCGGCCATGCACCGCAGTTCCTAGTTGTTGGCGGCATATGGGTACCTGCAGCCGAGTACGCCGCTGTGTCCACCACCACTTCATCTGGAGAAGTGGTCAGTGCCGCTACCACCAACGGAATTTATGCACCGGTTGTGGCAGCTGCAGCGCCGCAGCCATTAGTTAGTACAGTTCAAAAGCCCAAGCCCAAGCCCAAGCCCAAGATTATTCCTTCCTCCGCCGTGGAATGTAATTCTCCGACTACATCTTCGTCTACTCATACGTCGTCAGTTTCACCAGCTTCTTCTTGA

Protein sequence

MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSSTHTSSVSPASS*
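As a quick consistency check, the CDS can be translated with the standard genetic code and compared with the protein sequence above. The sketch below is a minimal illustration, not the pipeline that produced this record; `translate` is a hypothetical helper, and only the first 21 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# with bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGCTTTACTCCGACAAAATG"   # first 7 codons of the CDS above
cds_end = "CCAGCTTCTTCTTGA"           # last 5 codons, including the stop

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MLYSDKM` and `PASS*`, which agree with the first and last residues of the protein sequence listed above.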
BLAST of Csa6G031440 vs. Swiss-Prot
Match: EFM_ARATH (Myb family transcription factor EFM OS=Arabidopsis thaliana GN=EFM PE=1 SV=2)

HSP 1 Score: 207.6 bits (527), Expect = 2.3e-52
Identity = 147/386 (38.08%), Positives = 190/386 (49.22%), Query Frame = 1

Query: 18  LSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTENRQSECSEQTSSDM 77
           L D    LEQER K+  F+RELPLC+ L+++A++  +QQL      +  +  S  T    
Sbjct: 36  LEDLLSRLEQERLKIDAFKRELPLCMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTR--- 95

Query: 78  GPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWLRSAQLWNQTSDPPP-LN 137
            PVLEEFIP+        EKT  NNK             S+W+ +AQLW+Q+   P  ++
Sbjct: 96  -PVLEEFIPLRNQP----EKT--NNKG------------SNWMTTAQLWSQSETKPKNID 155

Query: 138 QDLPENTPVVEVN------------RNG-GAFRPFQKEKT-------------------G 197
               ++ P  E+N            RNG GAF PF KE++                    
Sbjct: 156 STTDQSLPKDEINSSPKLGHFDAKQRNGSGAFLPFSKEQSLPELALSTEVKRVSPTNEHT 215

Query: 198 GGGGGGGASSSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHA 257
            G  G   S  +        ++     +G SS      +Q+ RK RRCWSP+LHRRF+ A
Sbjct: 216 NGQDGNDESMINNDNNYNNNNNNNSNSNGVSSTT----SQSNRKARRCWSPDLHRRFVQA 275

Query: 258 LQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLV 317
           LQ LGGS VATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP+P+     GG  P  +V
Sbjct: 276 LQMLGGSQVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPSPS-PQTSGGPGPHLVV 335

Query: 318 VGGIWVPAAEYAAV--STTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKP 369
           +GGIWVP  EY +    T T     V    TN    P     + Q   +T   P+P    
Sbjct: 336 LGGIWVP-PEYTSAHGGTPTLYHHQVHHHHTNTAGPPPPHFCSSQEFYTTPPPPQPLHHH 393
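The percentages in each HSP header are the matched-column counts divided by the alignment length. The sketch below recomputes them from a header line; `hsp_percentages` is a hypothetical helper written for this illustration and relies only on the `n/m (p%)` pattern visible in the headers.

```python
import re

def hsp_percentages(line):
    """Return (count, alignment_length, reported_pct, recomputed_pct)
    for every 'n/m (p%)' group found in an HSP header line."""
    results = []
    for num, den, reported in re.findall(r"(\d+)/(\d+)\s*\(([\d.]+)%\)", line):
        recomputed = round(100 * int(num) / int(den), 2)
        results.append((int(num), int(den), float(reported), recomputed))
    return results

header = "Identity = 147/386 (38.08%), Positives = 190/386 (49.22%), Query Frame = 1"
for count, length, reported, recomputed in hsp_percentages(header):
    print(count, length, reported, recomputed)
```

For the EFM_ARATH hit, 147 identical and 190 similar positions over 386 aligned columns reproduce the reported 38.08% and 49.22%.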

BLAST of Csa6G031440 vs. Swiss-Prot
Match: GLK2_ORYSJ (Probable transcription factor GLK2 OS=Oryza sativa subsp. japonica GN=GLK2 PE=2 SV=1)

HSP 1 Score: 86.3 bits (212), Expect = 7.9e-16
Identity = 75/250 (30.00%), Positives = 101/250 (40.40%), Query Frame = 1

Query: 130 SDPPPL------NQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAE 189
           S PPP       +++   + P  +  +NGG         T     G   S S    P+AE
Sbjct: 135 SPPPPRGKKKKDDEERSSSLPEEKDAKNGGGDEVLSAVTTEDSSAGAAKSCS----PSAE 194

Query: 190 TSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELM 249
             S  +  S  SS    K +  +RK +  W+PELHRRF+ A++QL G   A P +I ELM
Sbjct: 195 GHSKRKPSSSSSSAAAGKNSHGKRKVKVDWTPELHRRFVQAVEQL-GIDKAVPSRILELM 254

Query: 250 KVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVSTTTS 309
            ++ LT   + SHLQKYR H +             A  +     ++  AA  AAV+    
Sbjct: 255 GIECLTRHNIASHLQKYRSHRKHLMA-----REAEAASWTQKRQMYTAAAAAAAVA--AG 314

Query: 310 SGEVVSAATTNGIYAPVV--AAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSST 369
            G    AA      AP V      P P  + +  P P P P   P   V    PT     
Sbjct: 315 GGPRKDAAAATAAVAPWVMPTIGFPPPHAAAMVPPPPHPPPFCRPPLHV-WGHPTAGVEP 371

Query: 370 HTSSVSPASS 372
            T++  P  S
Sbjct: 375 TTAAAPPPPS 371

BLAST of Csa6G031440 vs. Swiss-Prot
Match: ARR1_ARATH (Two-component response regulator ARR1 OS=Arabidopsis thaliana GN=ARR1 PE=1 SV=2)

HSP 1 Score: 81.6 bits (200), Expect = 1.9e-14
Identity = 50/114 (43.86%), Positives = 61/114 (53.51%), Query Frame = 1

Query: 166 GGGGGGASSSSPPAPAAETSSTTETGSGGSSRREEKE-------------AQNQRKQRRC 225
           GGGGG A S    A    +SS  E  +  SS R+ K+             A N +K R  
Sbjct: 182 GGGGGAAVSGGEDAVDDNSSSVNEGNNWRSSSRKRKDEEGEEQGDDKDEDASNLKKPRVV 241

Query: 226 WSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRR 267
           WS ELH++F+ A+ QLG    A PK+I ELM V GLT + V SHLQKYR++ RR
Sbjct: 242 WSVELHQQFVAAVNQLGVEK-AVPKKILELMNVPGLTRENVASHLQKYRIYLRR 294

BLAST of Csa6G031440 vs. Swiss-Prot
Match: PHR1_ORYSJ (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 PE=2 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 4.3e-14
Identity = 45/95 (47.37%), Positives = 55/95 (57.89%), Query Frame = 1

Query: 173 SSSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSH 232
           S++S PA    TSS +      +S        +  KQR  W+PELH  F+HA+ +LGGS 
Sbjct: 181 SAASQPAFNQSTSSHSGDICPVTSPPPNNSNASASKQRMRWTPELHESFVHAVNKLGGSE 240

Query: 233 VATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP 268
            ATPK + +LMKVDGLT   VKSHLQKYR    +P
Sbjct: 241 KATPKGVLKLMKVDGLTIYHVKSHLQKYRTARYKP 275

BLAST of Csa6G031440 vs. Swiss-Prot
Match: PHR1_ORYSI (Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE=3 SV=1)

HSP 1 Score: 80.5 bits (197), Expect = 4.3e-14
Identity = 45/95 (47.37%), Positives = 55/95 (57.89%), Query Frame = 1

Query: 173 SSSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSH 232
           S++S PA    TSS +      +S        +  KQR  W+PELH  F+HA+ +LGGS 
Sbjct: 181 SAASQPAFNQSTSSHSGDICPVTSPPPNNSNASASKQRMRWTPELHESFVHAVNKLGGSE 240

Query: 233 VATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP 268
            ATPK + +LMKVDGLT   VKSHLQKYR    +P
Sbjct: 241 KATPKGVLKLMKVDGLTIYHVKSHLQKYRTARYKP 275

BLAST of Csa6G031440 vs. TrEMBL
Match: A0A0A0KAT4_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_6G031440 PE=4 SV=1)

HSP 1 Score: 752.7 bits (1942), Expect = 2.2e-214
Identity = 371/371 (100.00%), Positives = 371/371 (100.00%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT
Sbjct: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60

Query: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL 120
           TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL
Sbjct: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL 120

Query: 121 RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP 180
           RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP
Sbjct: 121 RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP 180

Query: 181 AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR 240
           AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR
Sbjct: 181 AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR 240

Query: 241 ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST 300
           ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST
Sbjct: 241 ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST 300

Query: 301 TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSS 360
           TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSS
Sbjct: 301 TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSS 360

Query: 361 THTSSVSPASS 372
           THTSSVSPASS
Sbjct: 361 THTSSVSPASS 371

BLAST of Csa6G031440 vs. TrEMBL
Match: D7T987_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g03110 PE=4 SV=1)

HSP 1 Score: 402.1 bits (1032), Expect = 7.2e-109
Identity = 216/360 (60.00%), Positives = 253/360 (70.28%), Query Frame = 1

Query: 20  DFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTE--NRQSECSEQTSSDM 79
           D+ + LE+ERRK+ +FQRELPLCL LVS AI+ CRQQ+SGTT E  + QSECSEQTSSD 
Sbjct: 6   DYIEALEEERRKIQVFQRELPLCLELVSQAIESCRQQMSGTTQEYFHGQSECSEQTSSD- 65

Query: 80  GPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAP-SDWLRSAQLWNQTSDPPPLN 139
           GPVLEEFIPI +    D ++ + +  N + D NN      SDWLRS QLWNQT DPP + 
Sbjct: 66  GPVLEEFIPIKKTS-DDEDEQQSHQPNDNKDKNNDKSGKKSDWLRSVQLWNQTPDPP-VK 125

Query: 140 QDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAETSSTTETGSGGSS 199
           +D P+  P +EV +NGGAF PF+++K  G        ++   AP+A TSST ET +G SS
Sbjct: 126 EDTPKKIPSMEVKKNGGAFHPFKRDKAVG--------TNPTSAPSAATSSTAETATGCSS 185

Query: 200 --RREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVK 259
             R+EEKE Q+QRK RRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVK
Sbjct: 186 GSRKEEKEGQSQRKARRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVK 245

Query: 260 SHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVSTTTSSGEVVSAATTN 319
           SHLQKYRLHTRRP P I +N    APQF+VVGGIWVP  EY AV+ TTSSGE     T N
Sbjct: 246 SHLQKYRLHTRRPNPAIQHNGNPQAPQFVVVGGIWVPPPEYTAVAATTSSGEATGVTTAN 305

Query: 320 GIYAPVVAAAAPQPLVSTVQKPKPKPKPK------IIPSSAVECNSPTTSSSTHTSSVSP 369
           GIYAPV +     P  ST ++   KPK              V+ NSP TSSSTHT++ SP
Sbjct: 306 GIYAPVASVPPSHPQGSTQRQQPMKPKKSQSEERGSHSEGGVQSNSPATSSSTHTTTTSP 354

BLAST of Csa6G031440 vs. TrEMBL
Match: M5XS96_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007222mg PE=4 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 8.2e-105
Identity = 218/378 (57.67%), Positives = 256/378 (67.72%), Query Frame = 1

Query: 11  AAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTE--NRQSE 70
           A+++GF   D+   LE+ER K+ +FQRELPLCL LV+ AI+ C+QQLS TTT+  + QSE
Sbjct: 3   ASRLGFR--DYVKALEEERHKIQVFQRELPLCLELVTQAIERCKQQLSDTTTDYMHGQSE 62

Query: 71  CSEQTSSDMGPVLEEFIPINRNGVSDFEKTE----KNNKNHDSDLNNLNLAPSDWLRSAQ 130
           CSEQTSS+ G V EEFIP+ R   SD +  E    +  K +D D  N +   SDWLRSAQ
Sbjct: 63  CSEQTSSE-GHVFEEFIPLKRTSSSDSDDDEVQESQEPKTNDKDKTNGDKIKSDWLRSAQ 122

Query: 131 LWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAET 190
           LWN T D PPL  +LP    V+EV RNGGAF+PFQ+EK+    G      +  PA A  T
Sbjct: 123 LWNTTPD-PPLKDELPRKALVMEVKRNGGAFQPFQREKS---VGKTNRPVAKVPASAPAT 182

Query: 191 SSTTET---GSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRE 250
           SSTT+T   GSG S ++EEK+ Q QRKQRR WSPELHRRFLHALQQLGGSH ATPKQIRE
Sbjct: 183 SSTTDTVSGGSGESHKKEEKDGQGQRKQRRNWSPELHRRFLHALQQLGGSHAATPKQIRE 242

Query: 251 LMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGG----HAPQFLVVGGIWVPAAEYAA 310
           LMKVDGLTNDEVKSHLQKYRLHTRRPTPT+HNN        APQFLVVGGIWVP  +YAA
Sbjct: 243 LMKVDGLTNDEVKSHLQKYRLHTRRPTPTMHNNNNSDNNTQAPQFLVVGGIWVPPQDYAA 302

Query: 311 VSTTTSSGEVVSAATTNGIYAPVV---AAAAPQPLVSTVQKPKPKPKPKIIPSSAV---- 366
           V+ TT+SGE    A  NGIYAPV    +   P    S +Q+P+PK          V    
Sbjct: 303 VAATTASGEATRVAAANGIYAPVATSPSTVTPVSPPSLMQRPRPKRPESSHSDERVSHSE 362

BLAST of Csa6G031440 vs. TrEMBL
Match: B9HWL5_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s13900g PE=4 SV=1)

HSP 1 Score: 387.5 bits (994), Expect = 1.8e-104
Identity = 215/401 (53.62%), Positives = 263/401 (65.59%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           M +++KMQ+          ++ + LE+ERRK+ +F+RELPLCL LV+ AI+ C+++LSGT
Sbjct: 1   MDFAEKMQRC--------HEYVEALEEERRKIQVFERELPLCLELVTQAIEACKRELSGT 60

Query: 61  TTENR---QSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKN---------------- 120
           T ++    QSECSEQTSS+ GPVLEEFIPI R   SD E+ + N                
Sbjct: 61  TEDHNMHGQSECSEQTSSE-GPVLEEFIPIKRTHSSDDEENDNNHDDDDHQEQQSQNDNK 120

Query: 121 -NKNHDSDLNNLNLAPSDWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNG--GAFRPF 180
            NK++ S  NN +   SDWLRS QLWNQ+ DPP   QDLP    V EV RNG  GAF+PF
Sbjct: 121 RNKSNSSISNNDHKKKSDWLRSVQLWNQSPDPPQ-KQDLPRKAAVTEVKRNGAGGAFQPF 180

Query: 181 QKEKTGGGGGGGGASSSSPPAPAAETSS----TTETGSGGSSRREEKEAQNQRKQRRCWS 240
            +EK+ G       S + P  PA+ TSS     T    GG +++E+KE  NQRKQRRCWS
Sbjct: 181 HREKSVGKSSNQAISKAPPSVPASATSSIAGAVTGGTGGGGNKKEDKEKGNQRKQRRCWS 240

Query: 241 PELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNN 300
           PELHRRFLH+LQQLGGSH ATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP+PTIH N
Sbjct: 241 PELHRRFLHSLQQLGGSHAATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPSPTIHTN 300

Query: 301 EGGHAPQFLVVGGIWVPAAEYAAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQ 360
               APQF+VVGGIWVP  EYAAV+ TT++GE  + +  NGIYAP+   AAP P V   +
Sbjct: 301 SSQQAPQFVVVGGIWVPPTEYAAVAATTTAGETSTISAANGIYAPI---AAPPPAVPQNR 360

Query: 361 KPKPKPKPKI-------IPSSAVECNSPTTSSSTHTSSVSP 369
           + K     +            A   NSP TSSSTHT++ SP
Sbjct: 361 QHKQSEHSQSEGRGSHGERGGAHSNNSPATSSSTHTTTTSP 388

BLAST of Csa6G031440 vs. TrEMBL
Match: B9ST36_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_0353920 PE=4 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 4.1e-104
Identity = 218/409 (53.30%), Positives = 263/409 (64.30%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           M Y++KMQ+          ++ + LE+E+RK+ +FQRELPLCL LV+ AI+ C+++LSGT
Sbjct: 1   MDYAEKMQRC--------HEYVEALEEEKRKIQVFQRELPLCLELVTQAIEACKRELSGT 60

Query: 61  TTE--NRQSECSEQTSSDMGP---------VLEEFIPINR--------NGVSDFEKTEKN 120
           TTE  + QSECSEQT+S  G          VLEEFIPI R        N   D  + EK 
Sbjct: 61  TTEYMHGQSECSEQTTSTDGTANGTGTRSLVLEEFIPIKRINSSSHNDNDNDDDNENEKE 120

Query: 121 NKNHDS---------------DLNNLNLAPSDWLRSAQLWNQTS-DPPPLNQDLPENTPV 180
           + + D                D+NN     SDWLRS QLWNQ+S D  P  +DLP    V
Sbjct: 121 DNDDDEEEEDQDSHKPNKSIRDINNDQKKKSDWLRSVQLWNQSSPDSEPPKEDLPRKAAV 180

Query: 181 VEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAETSSTTETGSG------GSSRRE 240
            EV RNGGAF+PF KEK        G + + P  PA+ TSS+ ETG+G      G++R+E
Sbjct: 181 TEVKRNGGAFQPFHKEK--------GIAKTPPSVPASATSSSAETGTGGGTSGAGNNRKE 240

Query: 241 EKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQK 300
           +K+ Q QRKQRRCWSPELHRRFLHALQQLGGSH ATPKQIRELMKVDGLTNDEVKSHLQK
Sbjct: 241 DKDGQAQRKQRRCWSPELHRRFLHALQQLGGSHAATPKQIRELMKVDGLTNDEVKSHLQK 300

Query: 301 YRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVSTTTSSGEVVSAATTNGIYAP 360
           YRLHTRRP+PTIHNN    APQF+VVGGIWVP  EYAAV+ TT+S E V+ A  NGIYAP
Sbjct: 301 YRLHTRRPSPTIHNNSNPQAPQFVVVGGIWVPPPEYAAVAATTASMETVTTAAANGIYAP 360

Query: 361 VVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSSTHTSSVSP 369
           V A     P     Q    + + +   S +   NSP TSSSTHT++ SP
Sbjct: 361 VAAPLGTIPKQQRAQSQHLQSERR--GSHSERSNSPATSSSTHTTTNSP 391

BLAST of Csa6G031440 vs. TAIR10
Match: AT1G68670.1 (AT1G68670.1 myb-like transcription factor family protein)

HSP 1 Score: 293.1 bits (749), Expect = 2.4e-79
Identity = 179/388 (46.13%), Positives = 229/388 (59.02%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           M Y+ KMQ+          ++ + LE+E++K+ +FQRELPLCL LV+ AI+ CR++LSGT
Sbjct: 5   MDYAKKMQKC--------HEYVEALEEEQKKIQVFQRELPLCLELVTQAIEACRKELSGT 64

Query: 61  TTENRQSECSEQTSSDMG-PVLEEFIPINRNG--VSDFEKTEKNNKNHDSDLNNLNLAPS 120
           TT   + +CSEQT+S  G PV EEFIPI +      + ++ E+ +  H+S    +N   S
Sbjct: 65  TTTTSE-QCSEQTTSVCGGPVFEEFIPIKKISSLCEEVQEEEEEDGEHESSPELVNNKKS 124

Query: 121 DWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSP 180
           DWLRS QLWN + D  P  + + +   VVEV    GAF+PFQK             +SS 
Sbjct: 125 DWLRSVQLWNHSPDLNPKEERVAKKAKVVEVKPKSGAFQPFQKRVLETDLQPAVKVASS- 184

Query: 181 PAPAAETSSTTETGSGGSS---------RREEKEAQNQ--RKQRRCWSPELHRRFLHALQ 240
             PA  TSSTTET  G S          R E++++Q+   RKQRRCWSPELHRRFL+ALQ
Sbjct: 185 -MPATTTSSTTETCGGKSDLIKAGDEERRIEQQQSQSHTHRKQRRCWSPELHRRFLNALQ 244

Query: 241 QLGGSHVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPT---IHNNEGGHAPQFL 300
           QLGGSHVATPKQIR+ MKVDGLTNDEVKSHLQKYRLHTRRP  T     +      PQF+
Sbjct: 245 QLGGSHVATPKQIRDHMKVDGLTNDEVKSHLQKYRLHTRRPAATSVAAQSTGNQQQPQFV 304

Query: 301 VVGGIWVPAA-EYAAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKP 360
           VVGGIWVP++ ++   S   + G         G+YAPV  A +P+               
Sbjct: 305 VVGGIWVPSSQDFPPPSDVANKG---------GVYAPVAVAQSPK--------------- 354

Query: 361 KIIPSSAVECNSPTTSSSTHTSSVSPAS 371
               S    CNSP  SSST+T++ +P S
Sbjct: 365 ---RSLERSCNSPAASSSTNTNTSTPVS 354

BLAST of Csa6G031440 vs. TAIR10
Match: AT1G25550.1 (AT1G25550.1 myb-like transcription factor family protein)

HSP 1 Score: 292.7 bits (748), Expect = 3.1e-79
Identity = 182/373 (48.79%), Positives = 223/373 (59.79%), Query Frame = 1

Query: 20  DFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTE-NRQSECSEQTSSDMG 79
           ++ + LE+E++K+ +FQRELPLCL LV+ AI+ CR++LS ++     QSECSE+T+S+ G
Sbjct: 20  EYVEALEEEQKKIQVFQRELPLCLELVTQAIESCRKELSESSEHVGGQSECSERTTSECG 79

Query: 80  -PVLEEFIPINRNGVS--------DFEKTEK-NNKNHDSDLNNLNLAPSDWLRSAQLWNQ 139
             V EEF+PI  +  S        + EKTE   N+N+D D        SDWLRS QLWNQ
Sbjct: 80  GAVFEEFMPIKWSSASSDETDKDEEAEKTEMMTNENNDGDKKK-----SDWLRSVQLWNQ 139

Query: 140 TSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSP------PAPAA 199
           + DP P N+       V+EV R+ GAF+PFQKEK         A+ S P      P    
Sbjct: 140 SPDPQPNNK----KPMVIEVKRSAGAFQPFQKEKPK-------AADSQPLIKAITPTSTT 199

Query: 200 ETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIREL 259
            TSST ET  GG    E+K++ + RKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR+L
Sbjct: 200 TTSSTAETVGGGKEFEEQKQSHSNRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRDL 259

Query: 260 MKVDGLTNDEVKSHLQKYRLHTRRP-TPTIHNNEGGHAP---QFLVVGGIWVPAAEYAAV 319
           MKVDGLTNDEVKSHLQKYRLHTRRP TP +    GG  P   QF+V+ GIWVP+ +    
Sbjct: 260 MKVDGLTNDEVKSHLQKYRLHTRRPATPVVRT--GGENPQQRQFMVMEGIWVPSHD---- 319

Query: 320 STTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTS 371
                        T N +YAPV    A QP  S+    +          S   C SP TS
Sbjct: 320 ------------TTNNRVYAPV----ATQPPQSSTSGER----------SNRGCKSPATS 344

BLAST of Csa6G031440 vs. TAIR10
Match: AT3G25790.1 (AT3G25790.1 myb-like transcription factor family protein)

HSP 1 Score: 250.8 bits (639), Expect = 1.4e-66
Identity = 146/368 (39.67%), Positives = 209/368 (56.79%), Query Frame = 1

Query: 20  DFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTENR--QSECSEQTSSDM 79
           ++ + LE+ERRK+ +FQRELPLC+ LV+ AI+  ++++SGT+T+N   QSECSEQT+ + 
Sbjct: 20  EYIEALEEERRKINVFQRELPLCVELVTQAIEAYKREISGTSTDNLYGQSECSEQTTGEC 79

Query: 80  GPVLEEFIPINRNGVS------DFEKTEKNNKNHDSDLN--NLNLAPSDWLRSAQLWNQT 139
           G +L+ FIPI  +  S      D +  ++ +++H++D++  + N+  S+WL+S QLWNQ+
Sbjct: 80  GRILDLFIPIKHSSTSIEEEVDDKDDDDEEHQSHETDIDFDDKNMK-SEWLKSVQLWNQS 139

Query: 140 SDPPPLN-QDLPEN-----TPVVEVN-----RNGGAFRPFQKEKTGGGGGGGGASSSSPP 199
                 N QD  +        ++++N     +N     P      G GGGGG        
Sbjct: 140 DAVVSNNRQDRSQEKTETLVELIKINDEAAKKNNNIKSPVTTSDGGSGGGGG-------- 199

Query: 200 APAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQ 259
                                    + QRK RRCWS ELHRRFL+AL+QLGG HVATPKQ
Sbjct: 200 ------------------------RRGQRKNRRCWSQELHRRFLNALKQLGGPHVATPKQ 259

Query: 260 IRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAV 319
           IR++MKVDGLTNDEVKSHLQKYRLH RRP+ T  NN       F+VVGGIWVP   +   
Sbjct: 260 IRDIMKVDGLTNDEVKSHLQKYRLHARRPSQTTPNNRNSQTQHFVVVGGIWVPQTNH--- 319

Query: 320 STTTSSGEVVSAATTNGIYAPVVAAAAPQ-PLVSTVQKPKPKPKPKIIPSSAVECNSPTT 366
            +T ++   V++  T GIY P+V++   + P  S   +   + + +   +    C+SP  
Sbjct: 320 -STANAVNAVASGETTGIYGPMVSSLPSEWPRHSNFGRKISEDRSRCSNNGFFRCSSPAM 350

BLAST of Csa6G031440 vs. TAIR10
Match: AT2G03500.1 (AT2G03500.1 Homeodomain-like superfamily protein)

HSP 1 Score: 207.6 bits (527), Expect = 1.3e-53
Identity = 147/386 (38.08%), Positives = 190/386 (49.22%), Query Frame = 1

Query: 18  LSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTENRQSECSEQTSSDM 77
           L D    LEQER K+  F+RELPLC+ L+++A++  +QQL      +  +  S  T    
Sbjct: 36  LEDLLSRLEQERLKIDAFKRELPLCMQLLNNAVEVYKQQLEAYRANSNNNNQSVGTR--- 95

Query: 78  GPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWLRSAQLWNQTSDPPP-LN 137
            PVLEEFIP+        EKT  NNK             S+W+ +AQLW+Q+   P  ++
Sbjct: 96  -PVLEEFIPLRNQP----EKT--NNKG------------SNWMTTAQLWSQSETKPKNID 155

Query: 138 QDLPENTPVVEVN------------RNG-GAFRPFQKEKT-------------------G 197
               ++ P  E+N            RNG GAF PF KE++                    
Sbjct: 156 STTDQSLPKDEINSSPKLGHFDAKQRNGSGAFLPFSKEQSLPELALSTEVKRVSPTNEHT 215

Query: 198 GGGGGGGASSSSPPAPAAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHA 257
            G  G   S  +        ++     +G SS      +Q+ RK RRCWSP+LHRRF+ A
Sbjct: 216 NGQDGNDESMINNDNNYNNNNNNNSNSNGVSSTT----SQSNRKARRCWSPDLHRRFVQA 275

Query: 258 LQQLGGSHVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLV 317
           LQ LGGS VATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP+P+     GG  P  +V
Sbjct: 276 LQMLGGSQVATPKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPSPS-PQTSGGPGPHLVV 335

Query: 318 VGGIWVPAAEYAAV--STTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKP 369
           +GGIWVP  EY +    T T     V    TN    P     + Q   +T   P+P    
Sbjct: 336 LGGIWVP-PEYTSAHGGTPTLYHHQVHHHHTNTAGPPPPHFCSSQEFYTTPPPPQPLHHH 393

BLAST of Csa6G031440 vs. TAIR10
Match: AT1G13300.1 (AT1G13300.1 myb-like transcription factor family protein)

HSP 1 Score: 174.5 bits (441), Expect = 1.2e-43
Identity = 93/179 (51.96%), Positives = 105/179 (58.66%), Query Frame = 1

Query: 191 GSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTN 250
           G  G  R  EK+    RKQRRCWS +LHRRFL+ALQ LGG HVATPKQIRE MKVDGLTN
Sbjct: 164 GGEGRKREAEKDGGGGRKQRRCWSSQLHRRFLNALQHLGGPHVATPKQIREFMKVDGLTN 223

Query: 251 DEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVSTTTSSGEVVSA 310
           DEVKSHLQKYRLHTRRP  T+ NN       F+VVGG+WVP ++Y +   TT      S 
Sbjct: 224 DEVKSHLQKYRLHTRRPRQTVPNNGNSQTQHFVVVGGLWVPQSDY-STGKTTGGATTSST 283

Query: 311 ATTNGIYAPVVAAAAPQPLVSTVQKPK---PKPKPKIIPSSAVECNSPTTSSSTHTSSV 367
            TT GIY  + A   PQ    +  +P     +          V C+SP  SSST    V
Sbjct: 284 TTTTGIYGTMAAPPPPQWPSHSNYRPSIIVDEGSGSHSEGVVVRCSSPAMSSSTRNHYV 341


HSP 2 Score: 120.2 bits (300), Expect = 2.8e-27
Identity = 70/160 (43.75%), Positives = 97/160 (60.62%), Query Frame = 1

Query: 21  FADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTENR--QSECSEQTSSDMG 80
           + + LE+ERRK+ +FQRELPLCL LV+ AI+ C+++L   TTEN   Q ECSEQT+ + G
Sbjct: 20  YIEALEEERRKIHVFQRELPLCLDLVTQAIEACKRELPEMTTENMYGQPECSEQTTGECG 79

Query: 81  PVLEEFIPINRNGVSDFEKTEK---NNKNHDSDLNNLNL-APSDWLRSAQLWNQTSDP-P 140
           PVLE+F+ I  +  S+ E+ E+    + NHD D ++ +    SDWL+S QLWNQ   P  
Sbjct: 80  PVLEQFLTIKDSSTSNEEEDEEFDDEHGNHDPDNDSEDKNTKSDWLKSVQLWNQPDHPLL 139

Query: 141 PLNQDLPENTPVVEVNR------NGGAFRPFQKEKTGGGG 168
           P  + L + T   + +       NGG  R  + EK GGGG
Sbjct: 140 PKEERLQQETMTRDESMRKDPMVNGGEGRKREAEKDGGGG 179

BLAST of Csa6G031440 vs. NCBI nr
Match: gi|778710070|ref|XP_011656513.1| (PREDICTED: probable transcription factor GLK2 [Cucumis sativus])

HSP 1 Score: 752.7 bits (1942), Expect = 3.1e-214
Identity = 371/371 (100.00%), Positives = 371/371 (100.00%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT
Sbjct: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60

Query: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL 120
           TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL
Sbjct: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL 120

Query: 121 RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP 180
           RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP
Sbjct: 121 RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP 180

Query: 181 AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR 240
           AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR
Sbjct: 181 AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR 240

Query: 241 ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST 300
           ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST
Sbjct: 241 ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST 300

Query: 301 TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSS 360
           TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSS
Sbjct: 301 TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSSAVECNSPTTSSS 360

Query: 361 THTSSVSPASS 372
           THTSSVSPASS
Sbjct: 361 THTSSVSPASS 371

BLAST of Csa6G031440 vs. NCBI nr
Match: gi|659089682|ref|XP_008445641.1| (PREDICTED: probable transcription factor GLK2 [Cucumis melo])

HSP 1 Score: 712.6 bits (1838), Expect = 3.6e-202
Identity = 358/375 (95.47%), Positives = 361/375 (96.27%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           M+YSDKMQ+IAAKMGFTLSDFADTLEQERRKVLMFQRELPLCL LVSHAIDCCRQQLSGT
Sbjct: 1   MVYSDKMQEIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLQLVSHAIDCCRQQLSGT 60

Query: 61  TTENRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAPSDWL 120
           TTENRQSECSEQTSSD+GPVLEEFIPINRNGVSDFEKTEK NKN D DLNNLNLAPSDWL
Sbjct: 61  TTENRQSECSEQTSSDIGPVLEEFIPINRNGVSDFEKTEKINKNDDPDLNNLNLAPSDWL 120

Query: 121 RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAP 180
           RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGG GGGGGASSSSPPAP
Sbjct: 121 RSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGCGGGGGASSSSPPAP 180

Query: 181 AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR 240
           AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR
Sbjct: 181 AAETSSTTETGSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIR 240

Query: 241 ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVST 300
           ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNE GH PQFLVVGGIWVPAAEYAAVST
Sbjct: 241 ELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNESGHTPQFLVVGGIWVPAAEYAAVST 300

Query: 301 TTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPKIIPSS----AVECNSPT 360
           TTSSGEVVSAATTNGIYAPVVAAAAPQPL STVQ  KPKPKPKIIPSS    AVECNSPT
Sbjct: 301 TTSSGEVVSAATTNGIYAPVVAAAAPQPLASTVQ--KPKPKPKIIPSSAAVAAVECNSPT 360

Query: 361 TSSSTHTSSVSPASS 372
           TSSSTHTSSVSPASS
Sbjct: 361 TSSSTHTSSVSPASS 373

BLAST of Csa6G031440 vs. NCBI nr
Match: gi|359472981|ref|XP_003631224.1| (PREDICTED: probable transcription factor GLK2 [Vitis vinifera])

HSP 1 Score: 405.6 bits (1041), Expect = 9.3e-110
Identity = 222/379 (58.58%), Positives = 261/379 (68.87%), Query Frame = 1

Query: 1   MLYSDKMQQIAAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGT 60
           M +SDKMQ+          D+ + LE+ERRK+ +FQRELPLCL LVS AI+ CRQQ+SGT
Sbjct: 1   MDFSDKMQRC--------HDYIEALEEERRKIQVFQRELPLCLELVSQAIESCRQQMSGT 60

Query: 61  TTE--NRQSECSEQTSSDMGPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAP-S 120
           T E  + QSECSEQTSSD GPVLEEFIPI +    D ++ + +  N + D NN      S
Sbjct: 61  TQEYFHGQSECSEQTSSD-GPVLEEFIPIKKTS-DDEDEQQSHQPNDNKDKNNDKSGKKS 120

Query: 121 DWLRSAQLWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSP 180
           DWLRS QLWNQT DPP + +D P+  P +EV +NGGAF PF+++K  G        ++  
Sbjct: 121 DWLRSVQLWNQTPDPP-VKEDTPKKIPSMEVKKNGGAFHPFKRDKAVG--------TNPT 180

Query: 181 PAPAAETSSTTETGSGGSS--RREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVAT 240
            AP+A TSST ET +G SS  R+EEKE Q+QRK RRCWSPELHRRFLHALQQLGGSHVAT
Sbjct: 181 SAPSAATSSTAETATGCSSGSRKEEKEGQSQRKARRCWSPELHRRFLHALQQLGGSHVAT 240

Query: 241 PKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEY 300
           PKQIRELMKVDGLTNDEVKSHLQKYRLHTRRP P I +N    APQF+VVGGIWVP  EY
Sbjct: 241 PKQIRELMKVDGLTNDEVKSHLQKYRLHTRRPNPAIQHNGNPQAPQFVVVGGIWVPPPEY 300

Query: 301 AAVSTTTSSGEVVSAATTNGIYAPVVAAAAPQPLVSTVQKPKPKPKPK------IIPSSA 360
            AV+ TTSSGE     T NGIYAPV +     P  ST ++   KPK              
Sbjct: 301 TAVAATTSSGEATGVTTANGIYAPVASVPPSHPQGSTQRQQPMKPKKSQSEERGSHSEGG 360

Query: 361 VECNSPTTSSSTHTSSVSP 369
           V+ NSP TSSSTHT++ SP
Sbjct: 361 VQSNSPATSSSTHTTTTSP 360

BLAST of Csa6G031440 vs. NCBI nr
Match: gi|297737857|emb|CBI27058.3| (unnamed protein product [Vitis vinifera])

HSP 1 Score: 402.1 bits (1032), Expect = 1.0e-108
Identity = 216/360 (60.00%), Positives = 253/360 (70.28%), Query Frame = 1

Query: 20  DFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTE--NRQSECSEQTSSDM 79
           D+ + LE+ERRK+ +FQRELPLCL LVS AI+ CRQQ+SGTT E  + QSECSEQTSSD 
Sbjct: 6   DYIEALEEERRKIQVFQRELPLCLELVSQAIESCRQQMSGTTQEYFHGQSECSEQTSSD- 65

Query: 80  GPVLEEFIPINRNGVSDFEKTEKNNKNHDSDLNNLNLAP-SDWLRSAQLWNQTSDPPPLN 139
           GPVLEEFIPI +    D ++ + +  N + D NN      SDWLRS QLWNQT DPP + 
Sbjct: 66  GPVLEEFIPIKKTS-DDEDEQQSHQPNDNKDKNNDKSGKKSDWLRSVQLWNQTPDPP-VK 125

Query: 140 QDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAETSSTTETGSGGSS 199
           +D P+  P +EV +NGGAF PF+++K  G        ++   AP+A TSST ET +G SS
Sbjct: 126 EDTPKKIPSMEVKKNGGAFHPFKRDKAVG--------TNPTSAPSAATSSTAETATGCSS 185

Query: 200 --RREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVK 259
             R+EEKE Q+QRK RRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVK
Sbjct: 186 GSRKEEKEGQSQRKARRCWSPELHRRFLHALQQLGGSHVATPKQIRELMKVDGLTNDEVK 245

Query: 260 SHLQKYRLHTRRPTPTIHNNEGGHAPQFLVVGGIWVPAAEYAAVSTTTSSGEVVSAATTN 319
           SHLQKYRLHTRRP P I +N    APQF+VVGGIWVP  EY AV+ TTSSGE     T N
Sbjct: 246 SHLQKYRLHTRRPNPAIQHNGNPQAPQFVVVGGIWVPPPEYTAVAATTSSGEATGVTTAN 305

Query: 320 GIYAPVVAAAAPQPLVSTVQKPKPKPKPK------IIPSSAVECNSPTTSSSTHTSSVSP 369
           GIYAPV +     P  ST ++   KPK              V+ NSP TSSSTHT++ SP
Sbjct: 306 GIYAPVASVPPSHPQGSTQRQQPMKPKKSQSEERGSHSEGGVQSNSPATSSSTHTTTTSP 354

BLAST of Csa6G031440 vs. NCBI nr
Match: gi|596176608|ref|XP_007223224.1| (hypothetical protein PRUPE_ppa007222mg [Prunus persica])

HSP 1 Score: 388.7 bits (997), Expect = 1.2e-104
Identity = 218/378 (57.67%), Positives = 256/378 (67.72%), Query Frame = 1

Query: 11  AAKMGFTLSDFADTLEQERRKVLMFQRELPLCLHLVSHAIDCCRQQLSGTTTE--NRQSE 70
           A+++GF   D+   LE+ER K+ +FQRELPLCL LV+ AI+ C+QQLS TTT+  + QSE
Sbjct: 3   ASRLGFR--DYVKALEEERHKIQVFQRELPLCLELVTQAIERCKQQLSDTTTDYMHGQSE 62

Query: 71  CSEQTSSDMGPVLEEFIPINRNGVSDFEKTE----KNNKNHDSDLNNLNLAPSDWLRSAQ 130
           CSEQTSS+ G V EEFIP+ R   SD +  E    +  K +D D  N +   SDWLRSAQ
Sbjct: 63  CSEQTSSE-GHVFEEFIPLKRTSSSDSDDDEVQESQEPKTNDKDKTNGDKIKSDWLRSAQ 122

Query: 131 LWNQTSDPPPLNQDLPENTPVVEVNRNGGAFRPFQKEKTGGGGGGGGASSSSPPAPAAET 190
           LWN T D PPL  +LP    V+EV RNGGAF+PFQ+EK+    G      +  PA A  T
Sbjct: 123 LWNTTPD-PPLKDELPRKALVMEVKRNGGAFQPFQREKS---VGKTNRPVAKVPASAPAT 182

Query: 191 SSTTET---GSGGSSRREEKEAQNQRKQRRCWSPELHRRFLHALQQLGGSHVATPKQIRE 250
           SSTT+T   GSG S ++EEK+ Q QRKQRR WSPELHRRFLHALQQLGGSH ATPKQIRE
Sbjct: 183 SSTTDTVSGGSGESHKKEEKDGQGQRKQRRNWSPELHRRFLHALQQLGGSHAATPKQIRE 242

Query: 251 LMKVDGLTNDEVKSHLQKYRLHTRRPTPTIHNNEGG----HAPQFLVVGGIWVPAAEYAA 310
           LMKVDGLTNDEVKSHLQKYRLHTRRPTPT+HNN        APQFLVVGGIWVP  +YAA
Sbjct: 243 LMKVDGLTNDEVKSHLQKYRLHTRRPTPTMHNNNNSDNNTQAPQFLVVGGIWVPPQDYAA 302

Query: 311 VSTTTSSGEVVSAATTNGIYAPVV---AAAAPQPLVSTVQKPKPKPKPKIIPSSAV---- 366
           V+ TT+SGE    A  NGIYAPV    +   P    S +Q+P+PK          V    
Sbjct: 303 VAATTASGEATRVAAANGIYAPVATSPSTVTPVSPPSLMQRPRPKRPESSHSDERVSHSE 362

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
EFM_ARATH     2.3e-52    38.08       Myb family transcription factor EFM OS=Arabidopsis thaliana GN=EFM PE=1 SV=2
GLK2_ORYSJ    7.9e-16    30.00       Probable transcription factor GLK2 OS=Oryza sativa subsp. japonica GN=GLK2 PE=2 ...
ARR1_ARATH    1.9e-14    43.86       Two-component response regulator ARR1 OS=Arabidopsis thaliana GN=ARR1 PE=1 SV=2
PHR1_ORYSJ    4.3e-14    47.37       Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. japonica GN=PHR1 ...
PHR1_ORYSI    4.3e-14    47.37       Protein PHOSPHATE STARVATION RESPONSE 1 OS=Oryza sativa subsp. indica GN=PHR1 PE...

Match Name          E-value     Identity    Description
A0A0A0KAT4_CUCSA    2.2e-214    100.00      Uncharacterized protein OS=Cucumis sativus GN=Csa_6G031440 PE=4 SV=1
D7T987_VITVI        7.2e-109    60.00       Putative uncharacterized protein OS=Vitis vinifera GN=VIT_01s0011g03110 PE=4 SV=...
M5XS96_PRUPE        8.2e-105    57.67       Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007222mg PE=4 SV=1
B9HWL5_POPTR        1.8e-104    53.62       Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0010s13900g PE=4 SV=1
B9ST36_RICCO        4.1e-104    53.30       DNA binding protein, putative OS=Ricinus communis GN=RCOM_0353920 PE=4 SV=1

Match Name     E-value    Identity    Description
AT1G68670.1    2.4e-79    46.13       myb-like transcription factor family protein
AT1G25550.1    3.1e-79    48.79       myb-like transcription factor family protein
AT3G25790.1    1.4e-66    39.67       myb-like transcription factor family protein
AT2G03500.1    1.3e-53    38.08       Homeodomain-like superfamily protein
AT1G13300.1    1.2e-43    51.96       myb-like transcription factor family protein

Match Name                          E-value     Identity    Description
gi|778710070|ref|XP_011656513.1|    3.1e-214    100.00      PREDICTED: probable transcription factor GLK2 [Cucumis sativus]
gi|659089682|ref|XP_008445641.1|    3.6e-202    95.47       PREDICTED: probable transcription factor GLK2 [Cucumis melo]
gi|359472981|ref|XP_003631224.1|    9.3e-110    58.58       PREDICTED: probable transcription factor GLK2 [Vitis vinifera]
gi|297737857|emb|CBI27058.3|        1.0e-108    60.00       unnamed protein product [Vitis vinifera]
gi|596176608|ref|XP_007223224.1|    1.2e-104    57.67       hypothetical protein PRUPE_ppa007222mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR001005    SANT/Myb
IPR006447    Myb_dom_plants
IPR009057    Homeobox-like_sf
IPR017930    Myb_dom
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0008150        biological_process
biological_process    GO:0006355        regulation of transcription, DNA-templated
cellular_component    GO:0005575        cellular_component
cellular_component    GO:0005634        nucleus
molecular_function    GO:0003677        DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Csa6G031440.1    Csa6G031440.1    mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term     IPR Description       Source      Source Term         Source Description                                    Alignment
IPR001005    SANT/Myb domain       PFAM        PF00249             Myb_DNA-binding                                       coord: 210..261; score: 1.
IPR006447    Myb domain, plants    TIGRFAMs    TIGR01557           TIGR01557                                             coord: 208..263; score: 2.8
IPR009057    Homeodomain-like      GENE3D      G3DSA:1.10.10.60    -                                                     coord: 205..266; score: 7.9
IPR009057    Homeodomain-like      unknown     SSF46689            Homeodomain-like                                      coord: 206..266; score: 1.74
IPR017930    Myb domain            PROFILE     PS51294             HTH_MYB                                               coord: 205..265; score: 1
None         No IPR available      unknown     Coil                Coil                                                  coord: 95..115; scor
None         No IPR available      PANTHER     PTHR31003           MYB FAMILY TRANSCRIPTION FACTOR                       coord: 1..371; score: 7.7E
None         No IPR available      PANTHER     PTHR31003:SF4       GENOMIC DNA, CHROMOSOME 3, TAC CLONE:K13N2-RELATED    coord: 1..371; score: 7.7E