Cp4.1LG02g10080 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG02g10080
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: At5g37260-like protein
Location: Cp4.1LG02 : 9896761 .. 9900990 (+)
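
The location string can be turned into numeric coordinates and a gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into a dict.

    Assumes 1-based, inclusive coordinates, so length = end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = m.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Cp4.1LG02 : 9896761 .. 9900990 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # Cp4.1LG02 + 4230
```

Under that assumption the gene spans 4230 bp on the forward strand of Cp4.1LG02.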
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TAATTTGGGATTTATCCATTCACCATTCAATTTCTTCCACAAATATATTATCTTAAATATCTGTGGAAATTGAAAAAGGGTTGGGGGTGGGGGCGGGAGAGTGACGCGTGTACGTGGCAAAGAAAATCCTCAGCCACAAGATTGAGAACCACTTTCTGGAGGATTCCATGTGGAATTAATATGTGCGTCTGACGTGTCACCCCCTCCCTCTCCCTTCCCCCTTCGCTTTTTCTTCTTCCTCTACTTCTTCTTTCTTCATTATGCCCTCTCTCTCAAATCTCATCTCTCTCTCCCTTCCTCTTCTTCTTCCTCCAATCTCTCCATTTCTCTGCTTCTGCTTCTGCTTCTGCTTCTTCTTCTTCTTCTTCTTCCTCCTTTCTCTGCAATCGCTTTACCAGGTTTGTTTTCTCTACTCTGCGTCTCTTTCTCCATTGTTTTCTTTCTTTTCTTCTTCGATGTGTTCTTCTGGTGTTTACTAGGTTTCTGGAGAAGTGAAAAGTTCGAAAACTTTATGAATTCCAGAATTTTTTATTCTGATGATTCGTTAAGCTTCTGCTGAGCCTTTTCAATTTTAATTTCAGGTCCTTGTTTTGGCGGACTGAGACTATACTATACTTCCTCCCCGGTTTTCTTGCCACTGCTGATGGCTATCCAGGTAAATCCTTCACACCGGTTTTGTTCACGTTTTCTGTACCTCTTAACTTTTCTTGGAAATTTTTTTCTCTCAAACTTGATTGTCTCTGTTTGGATTCGAGTACTGAATTCCTGATTTTCAGTACTCAATTCTCACTTCTTGGGTTTTCTTTAGTAACCTTGTTCTTGTTGTTTACTGAGTAATTGACACAAAAACTTGTCCTTCTACTGATTTGGTTGGTTCACCTATAGGCCATAGATCCAATTTACAAGGACTCCTCCTTTTGTTGGTTGAGTACTCCTACGGACTTTGCTTTTGTTTTATCGCTTAATATATTGAGCTATGTTGTATAGTTACCAGGGCTATGATGTAACCATTTATGAAATGGGTTGCTGTTAATTGATGATGGGATTCTACAGCTCCGGTTGGTTTCTTGTCGTCTCTAACTTTCTTATTTCAACATGGAATGCAGGAAAAGAATGAGGGTGCGCTGTCAAACAGCTCAATTGGAGCTAACAATTGCCTTTCTAGTGATGGGACACAACTGGATCCACTGATGCGCGTTAGCTCCCTATGTTCCTATGGGAATGAAAGTTCATTGAAGGTAACCTAATTAACTTCTGCAATTTCTCAATTCTCCTATTCTGGAAGGATTTGGGTTTGAGTTCAGGCACTCTAATGTATCCACAGGTTAGGAAGCCCTACACTATTTCAAAACAAAGGGAAAAATGGACAGAGGAAGAGCATCAGAGGTTCCTTGAAGCTCTGAAACTCTATGGTCGTGGCTGGCGTCAGATCAAAGGTAAGGCTGTGTGAAATTAGTTATCTAAATCTGGTGTTTGCAATACCCAACCCATCATTTAGTGCGCTACCTTGATCTTCCTGCAGAACATGTAGGCACCAAAACAGCTGTTCAGATCCGAAGCCATGCTCAGAAATTTTTCTCTAAGGTCAAAAAATGCGTTTTGTTTTAGTAACCCCCCAAACTAAGAATCATTACCAAACAGAATATAATTTAGGATGATAGTGGATTTTTCACTAAAATAGATCGTCACCAAACCCTTTACGCTATTTTCTTACACACTTTATGACCAAACATTCCCACACTCAACTTTTATTTTGTCTGTGTTTCAGGTAGTGCGAGAGCCTAGTGGCAGCAATGACAGCTCCATTAACCCAATTGAGATTCCTCCACCTCGACCAAAAAGGAAACCACTGCATCCTTACCCTCGTAAAGCAGTCGATTCTCTTAAGGCAATTTCAGTTGCAAGAGAACCTGAGAGGTCTCCATCTCCAAACCTATCAATTGCCGAAAAAGAGACCCAATCACCCACCTCCGTATTGACTGCATTCAGTTCGGATGATC
AAATTTCTACGGTTTCGGTGCAGCATAATAGATGTTCATCACCTATTTCACAAGCTGTTGACATACAGTCAACTAGATTGCCCTCTGTTAGGAAAGGGGAGATGTTATTGCTCGAATCATCCTCCGAGCGGTTCCCAGAAGACTTTCTTACTCTGGTAATTTGAAATGTCCTTTTAATCTTCCACTTCATTTTCCTTTATGCACTTAACTTGACATCTGGGATCTGACTCTTTTTCTTTTTCTCTCTTTCCTAAACATTCAGAAATCCAAGCCAGGATCAGCATCTAAGAAATTAGACAACAAGTTGCATTCTCCTGTTAAAAGCATAAAGCTTTTTGGAAGAACAGTAATGGTTACTGATGACAAAAAACCATCCTTACATGATTTTGAAGTAACTAAATCGTCGGTACTTGATGGTGAGAGTAAGAATGAGTGTGGAGTGTATGCCGAGAAGCCTGTTCAGATGCTACCTTCAAAACATATGGATGTAAATTTATCTCTTCGGATGGATAACAATGGTGATTGGAATATGTCACCCGGTGGAGCACCTACTAACAACACCGCACTGAATCAGGACAATAGTGTCCTTTATGTTGAGGCGATTGCTAATGCTCCTCAAACTTGTTGGTCTTTGTATCAAAGTGTACCATATTTTTACCTTGCTCCACCTGATCAAACTAACCGAGGTATGGAAGAAAGGATGCAAAATGACAATTCTGTAGAAAGTTCGTGTGTGGATTCATGTTCTGGCTCCTCGAGTAAGGATAAGAATGAAAACCAGAGCCCGGAATTCGAATGTCAAGACCCTTGTCTGGTAGGAAGAGGTAATAGAAATCAAAGTAAGAAGGGGTTTGTGCCTTACAAGAGATGCTTGGCTCAGAGAGATGCAAGCTCTTCGTTCATTGTTTCAGAAGAGAGAGAAGGTCGGAGAGTTCGAGTTTGCTTATAGCCTCTGCAAAATTATAGTGGAACTCGATCAGCAACAGATTTAAAGTCGATAAAAGGAAATGACCAAGATGCTTTGACATGGGTTTCTGAGTTCTTTTTCTCTCTTTAGCTTGAGATGAAAAAAATGGAGGCCAAACTTGTAGCATATAAAATCTGAAGTAGCAGGAGGGGCGAAGCCTATAAGGGGTTGCCCAACAACTTCTATCCTGTCTATTCCTGGGTATTTGAGTGACAATTAATCAATCTGTAAAACAAAACGGTGTTGCTCTATGACTTCCTAACATATTAGCAGATCTTCTTAACATATTTGTAAATTTTGATCCTTCAGCGAAGAGAGTGAACCTATCTTTGAAGAACCTTGATTTAGCTCCCTGGTGCAAATAGTCTAATTTGAAGGCATAAAAAATCGAGCAGTCTAACATTCGATCTTCCTGAAAGTTAAGAATGAAGGTACTTCAGGGGCAAAAGAATCCACCATCAAAGCATCCATTCAGTGGCAAGCAATCTTAAACTAATGTATATAAAATCTTTAATTTAACTTAAAAGTATATTATTGTTGATTGGAATTAAAGAAATGAATTGACAAATTCAACTTTTATGCATAACATACACAAATGTTCGCATTTCCAAGCATATAATACACAATCAAAGTACTGAAAAAATCTCTTTCTTTAAGGATGAAGAAAAAACTTTGCAGGTACCTATGATTACAGGGGGGACGATCCTCAATCTCCAAGAGGAATCATTAACTTGTTCTTGGGTAGACTTTTCTAAAAGGAGATTTAGAAGGGACAGTGGGCCTCTCCTTGTATCGAATTATGTCCCACATTCCCACGAGTCCATGTCTAGTACCCCTAAAAGTCCTCTCTGGTATCCCTAAGGCCATCTGAATATTATTCATCATTTCAGCATTCTTGTCCCCTTTTACAACCTGTGATCTTTGACCATATCTCATGTCTCAGAAGTTCATATTTGGTTAGGTGATCCACTTGACCAATCTCCAATATTAATATCAATTAATTTCAAGAGATAATATGACTCCTGAAACACAC
TGACTATATATCATACGGACAAGATAAAGCCTACTCACATCAATAACAATAACGCAAAGCCTGTCCATATATATTATATATGTTTTCCTAAGATAAAGTACAGCACTTCGAAAAATGAATACCAGGGAAACAATGTTCTCTATGGGCTAGAGTTTTCTTTAAGTCCTTGGCGAAGCTCCCATTTGGACTTTGCAACTTCCACAAATATTACCCTGCCATCCAGAAACTAA

mRNA sequence

TAATTTGGGATTTATCCATTCACCATTCAATTTCTTCCACAAATATATTATCTTAAATATCTGTGGAAATTGAAAAAGGGTTGGGGGTGGGGGCGGGAGAGTGACGCGTGTACGTGGCAAAGAAAATCCTCAGCCACAAGATTGAGAACCACTTTCTGGAGGATTCCATGTGGAATTAATATGTGCGTCTGACGTGTCACCCCCTCCCTCTCCCTTCCCCCTTCGCTTTTTCTTCTTCCTCTACTTCTTCTTTCTTCATTATGCCCTCTCTCTCAAATCTCATCTCTCTCTCCCTTCCTCTTCTTCTTCCTCCAATCTCTCCATTTCTCTGCTTCTGCTTCTGCTTCTGCTTCTTCTTCTTCTTCTTCTTCCTCCTTTCTCTGCAATCGCTTTACCAGGTCCTTGTTTTGGCGGACTGAGACTATACTATACTTCCTCCCCGGTTTTCTTGCCACTGCTGATGGCTATCCAGGAAAAGAATGAGGGTGCGCTGTCAAACAGCTCAATTGGAGCTAACAATTGCCTTTCTAGTGATGGGACACAACTGGATCCACTGATGCGCGTTAGCTCCCTATGTTCCTATGGGAATGAAAGTTCATTGAAGGTTAGGAAGCCCTACACTATTTCAAAACAAAGGGAAAAATGGACAGAGGAAGAGCATCAGAGGTTCCTTGAAGCTCTGAAACTCTATGGTCGTGGCTGGCGTCAGATCAAAGAACATGTAGGCACCAAAACAGCTGTTCAGATCCGAAGCCATGCTCAGAAATTTTTCTCTAAGGTAGTGCGAGAGCCTAGTGGCAGCAATGACAGCTCCATTAACCCAATTGAGATTCCTCCACCTCGACCAAAAAGGAAACCACTGCATCCTTACCCTCGTAAAGCAGTCGATTCTCTTAAGGCAATTTCAGTTGCAAGAGAACCTGAGAGGTCTCCATCTCCAAACCTATCAATTGCCGAAAAAGAGACCCAATCACCCACCTCCGTATTGACTGCATTCAGTTCGGATGATCAAATTTCTACGGTTTCGGTGCAGCATAATAGATGTTCATCACCTATTTCACAAGCTGTTGACATACAGTCAACTAGATTGCCCTCTGTTAGGAAAGGGGAGATGTTATTGCTCGAATCATCCTCCGAGCGGTTCCCAGAAGACTTTCTTACTCTGAAATCCAAGCCAGGATCAGCATCTAAGAAATTAGACAACAAGTTGCATTCTCCTGTTAAAAGCATAAAGCTTTTTGGAAGAACAGTAATGGTTACTGATGACAAAAAACCATCCTTACATGATTTTGAAGTAACTAAATCGTCGGTACTTGATGGTGAGAGTAAGAATGAGTGTGGAGTGTATGCCGAGAAGCCTGTTCAGATGCTACCTTCAAAACATATGGATGTAAATTTATCTCTTCGGATGGATAACAATGGTGATTGGAATATGTCACCCGGTGGAGCACCTACTAACAACACCGCACTGAATCAGGACAATAGTGTCCTTTATGTTGAGGCGATTGCTAATGCTCCTCAAACTTGTTGGTCTTTGTATCAAAGTGTACCATATTTTTACCTTGCTCCACCTGATCAAACTAACCGAGGTATGGAAGAAAGGATGCAAAATGACAATTCTGTAGAAAGTTCGTGTGTGGATTCATGTTCTGGCTCCTCGAGTAAGGATAAGAATGAAAACCAGAGCCCGGAATTCGAATGTCAAGACCCTTGTCTGGTAGGAAGAGGTAATAGAAATCAAAGTAAGAAGGGGTTTGTGCCTTACAAGAGATGCTTGGCTCAGAGAGATGCAAGCTCTTCGTTCATTGTTTCAGAAGAGAGAGAAGGTCGGAGAGTTCGAGTTTGCTTATAGCCTCTGCAAAATTATAGTGGAACTCGATCAGCAACAGATTTAAAGTCGATAAAAGGAAATGACCAAGATGCTTTGACATGGGTTTCTGAGTTCTTTTTCTCTCTTTAGCTTGAGATGAAAAAAATGGAGGCCAAACTTGTAGCATATA
AAATCTGAAGTAGCAGGAGGGGCGAAGCCTATAAGGGGTTGCCCAACAACTTCTATCCTGTCTATTCCTGGGTATTTGAGTGACAATTAATCAATCTGTAAAACAAAACGGTGTTGCTCTATGACTTCCTAACATATTAGCAGATCTTCTTAACATATTTGTAAATTTTGATCCTTCAGCGAAGAGAGTGAACCTATCTTTGAAGAACCTTGATTTAGCTCCCTGGTGCAAATAGTCTAATTTGAAGGCATAAAAAATCGAGCAGTCTAACATTCGATCTTCCTGAAAGTTAAGAATGAAGGTACTTCAGGGGCAAAAGAATCCACCATCAAAGCATCCATTCAGTGGCAAGCAATCTTAAACTAATGTATATAAAATCTTTAATTTAACTTAAAAGTATATTATTGTTGATTGGAATTAAAGAAATGAATTGACAAATTCAACTTTTATGCATAACATACACAAATGTTCGCATTTCCAAGCATATAATACACAATCAAAGTACTGAAAAAATCTCTTTCTTTAAGGATGAAGAAAAAACTTTGCAGGTACCTATGATTACAGGGGGGACGATCCTCAATCTCCAAGAGGAATCATTAACTTGTTCTTGGGTAGACTTTTCTAAAAGGAGATTTAGAAGGGACAGTGGGCCTCTCCTTGTATCGAATTATGTCCCACATTCCCACGAGTCCATGTCTAGTACCCCTAAAAGTCCTCTCTGGTATCCCTAAGGCCATCTGAATATTATTCATCATTTCAGCATTCTTGTCCCCTTTTACAACCTGTGATCTTTGACCATATCTCATGTCTCAGAAGTTCATATTTGGTTAGGTGATCCACTTGACCAATCTCCAATATTAATATCAATTAATTTCAAGAGATAATATGACTCCTGAAACACACTGACTATATATCATACGGACAAGATAAAGCCTACTCACATCAATAACAATAACGCAAAGCCTGTCCATATATATTATATATGTTTTCCTAAGATAAAGTACAGCACTTCGAAAAATGAATACCAGGGAAACAATGTTCTCTATGGGCTAGAGTTTTCTTTAAGTCCTTGGCGAAGCTCCCATTTGGACTTTGCAACTTCCACAAATATTACCCTGCCATCCAGAAACTAA

Coding sequence (CDS)

ATGGCTATCCAGGAAAAGAATGAGGGTGCGCTGTCAAACAGCTCAATTGGAGCTAACAATTGCCTTTCTAGTGATGGGACACAACTGGATCCACTGATGCGCGTTAGCTCCCTATGTTCCTATGGGAATGAAAGTTCATTGAAGGTTAGGAAGCCCTACACTATTTCAAAACAAAGGGAAAAATGGACAGAGGAAGAGCATCAGAGGTTCCTTGAAGCTCTGAAACTCTATGGTCGTGGCTGGCGTCAGATCAAAGAACATGTAGGCACCAAAACAGCTGTTCAGATCCGAAGCCATGCTCAGAAATTTTTCTCTAAGGTAGTGCGAGAGCCTAGTGGCAGCAATGACAGCTCCATTAACCCAATTGAGATTCCTCCACCTCGACCAAAAAGGAAACCACTGCATCCTTACCCTCGTAAAGCAGTCGATTCTCTTAAGGCAATTTCAGTTGCAAGAGAACCTGAGAGGTCTCCATCTCCAAACCTATCAATTGCCGAAAAAGAGACCCAATCACCCACCTCCGTATTGACTGCATTCAGTTCGGATGATCAAATTTCTACGGTTTCGGTGCAGCATAATAGATGTTCATCACCTATTTCACAAGCTGTTGACATACAGTCAACTAGATTGCCCTCTGTTAGGAAAGGGGAGATGTTATTGCTCGAATCATCCTCCGAGCGGTTCCCAGAAGACTTTCTTACTCTGAAATCCAAGCCAGGATCAGCATCTAAGAAATTAGACAACAAGTTGCATTCTCCTGTTAAAAGCATAAAGCTTTTTGGAAGAACAGTAATGGTTACTGATGACAAAAAACCATCCTTACATGATTTTGAAGTAACTAAATCGTCGGTACTTGATGGTGAGAGTAAGAATGAGTGTGGAGTGTATGCCGAGAAGCCTGTTCAGATGCTACCTTCAAAACATATGGATGTAAATTTATCTCTTCGGATGGATAACAATGGTGATTGGAATATGTCACCCGGTGGAGCACCTACTAACAACACCGCACTGAATCAGGACAATAGTGTCCTTTATGTTGAGGCGATTGCTAATGCTCCTCAAACTTGTTGGTCTTTGTATCAAAGTGTACCATATTTTTACCTTGCTCCACCTGATCAAACTAACCGAGGTATGGAAGAAAGGATGCAAAATGACAATTCTGTAGAAAGTTCGTGTGTGGATTCATGTTCTGGCTCCTCGAGTAAGGATAAGAATGAAAACCAGAGCCCGGAATTCGAATGTCAAGACCCTTGTCTGGTAGGAAGAGGTAATAGAAATCAAAGTAAGAAGGGGTTTGTGCCTTACAAGAGATGCTTGGCTCAGAGAGATGCAAGCTCTTCGTTCATTGTTTCAGAAGAGAGAGAAGGTCGGAGAGTTCGAGTTTGCTTATAG

Protein sequence

MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFSSDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGEMLLLESSSERFPEDFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSLHDFEVTKSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQDNSVLYVEAIANAPQTCWSLYQSVPYFYLAPPDQTNRGMEERMQNDNSVESSCVDSCSGSSSKDKNENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEEREGRRVRVCL
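
The protein sequence is the conceptual translation of the CDS. As a minimal sketch, assuming the standard genetic code, the snippet below translates the first and last few codons of the CDS shown above and reproduces the start (`MAIQEKNEG`) and end (`RVCL`) of the protein. The helper `translate` is illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# nuclear standard code applies to this gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_head = "ATGGCTATCCAGGAAAAGAATGAGGGT"  # first 9 codons of the CDS
cds_tail = "CGAGTTTGCTTATAG"              # last 5 codons, ending in the TAG stop

print(translate(cds_head))  # MAIQEKNEG
print(translate(cds_tail))  # RVCL
```

Passing the full CDS to `translate` should yield the full protein sequence, provided the sequence is supplied as one unbroken string.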
BLAST of Cp4.1LG02g10080 vs. Swiss-Prot
Match: RVE7_ARATH (Protein REVEILLE 7 OS=Arabidopsis thaliana GN=RVE7 PE=2 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 5.0e-52
Identity = 132/277 (47.65%), Positives = 174/277 (62.82%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           MA ++++E   SN   G+  C S++G  ++P        S+  E+ +KVRKPYT++KQRE
Sbjct: 27  MAAEDRSEELSSNVENGS--CNSNEG--INP-----ETSSHWIENVVKVRKPYTVTKQRE 86

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KW+EEEH RFLEA+KLYGRGWRQI+EH+GTKTAVQIRSHAQKFFSK+ +E    ++ S+ 
Sbjct: 87  KWSEEEHDRFLEAIKLYGRGWRQIQEHIGTKTAVQIRSHAQKFFSKMAQEADSRSEGSVK 146

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
            I IPPPRPKRKP HPYPRK+              +SP PNLS  EK T+SPTSVL++F 
Sbjct: 147 AIVIPPPRPKRKPAHPYPRKSPVPY---------TQSPPPNLSAMEKGTKSPTSVLSSFG 206

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGEMLLLESSSERFPEDFLTLKSKPG 240
           S+DQ+       NRCSSP S   DIQS    S+ K       +S + F +D     S  G
Sbjct: 207 SEDQV-------NRCSSPNSCTSDIQSIGATSIDKKNN--YTTSKQPFKDD-----SDIG 261

Query: 241 SASKKLDNKLHSPVKSIKLFGRTVMVTDDK-KPSLHD 277
           S          +P+ SI LFG+ V+V ++  KPS ++
Sbjct: 267 S----------TPISSITLFGKIVLVAEESHKPSSYN 261

BLAST of Cp4.1LG02g10080 vs. Swiss-Prot
Match: RVE1_ARATH (Protein REVEILLE 1 OS=Arabidopsis thaliana GN=RVE1 PE=2 SV=1)

HSP 1 Score: 193.4 bits (490), Expect = 5.7e-48
Identity = 123/269 (45.72%), Positives = 163/269 (60.59%), Query Frame = 1

Query: 42  GNESSLKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQ 101
           GN+ + KVRKPYTI+K+RE+WT+EEH++F+EALKLYGR WR+I+EHVG+KTAVQIRSHAQ
Sbjct: 38  GNDYAPKVRKPYTITKERERWTDEEHKKFVEALKLYGRAWRRIEEHVGSKTAVQIRSHAQ 97

Query: 102 KFFSKVVREPSGSNDSSINPIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPN 161
           KFFSKV RE +G + SS+ PI IPPPRPKRKP HPYPRK  +       A +  RS SP 
Sbjct: 98  KFFSKVAREATGGDGSSVEPIVIPPPRPKRKPAHPYPRKFGNE------ADQTSRSVSP- 157

Query: 162 LSIAEKETQSPTSVLTAFSSDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGEMLLL 221
              +E++TQSPTSVL+   S+   S  S   NR  SP+S A    +              
Sbjct: 158 ---SERDTQSPTSVLSTVGSEALCSLDSSSPNRSLSPVSSASPPAAL------------- 217

Query: 222 ESSSERFPEDFLTLKSK--PGSASKKLDNKLHSPVK-SIKLFGRTVMVTDDKKPSLHDFE 281
            +++   PE+  TLK +  P       ++ +  P K S+KLFG+TV+V+D    S     
Sbjct: 218 -TTTANAPEELETLKLELFPSERLLNRESSIKEPTKQSLKLFGKTVLVSDSGMSS----S 266

Query: 282 VTKSSVLDGESKNECGVYAEKPVQMLPSK 308
           +T S+            Y + P+Q LP K
Sbjct: 278 LTTST------------YCKSPIQPLPRK 266

BLAST of Cp4.1LG02g10080 vs. Swiss-Prot
Match: RVE7L_ARATH (Protein REVEILLE 7-like OS=Arabidopsis thaliana GN=RVE7L PE=3 SV=1)

HSP 1 Score: 185.3 bits (469), Expect = 1.5e-45
Identity = 110/226 (48.67%), Positives = 148/226 (65.49%), Query Frame = 1

Query: 3   IQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQREKW 62
           +Q+++E   SN   G+  C S++G  ++P        S+  E+ +KVRKPYT++KQREKW
Sbjct: 18  LQDRSEELSSNVENGS--CNSNEG--INP-----ETSSHWIENVVKVRKPYTVTKQREKW 77

Query: 63  TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPI 122
           +EEEH RFLEA+KLYGRGWRQI+EH+GTKTAVQIRSHAQKFFSK+ +E    ++ S+  I
Sbjct: 78  SEEEHDRFLEAIKLYGRGWRQIQEHIGTKTAVQIRSHAQKFFSKMAQEADSRSEGSVKAI 137

Query: 123 EIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFSSD 182
            IPPPRPKRKP HPYPRK+              +SP PNLS  EK T+SPTSVL++F S+
Sbjct: 138 VIPPPRPKRKPAHPYPRKSPVPY---------TQSPPPNLSAMEKGTKSPTSVLSSFGSE 197

Query: 183 DQISTVSVQHNRCSSPISQAVDIQSTRLPSVRK-GEMLLLESSSER 228
           DQ +     +     P     DI ST + S+   G+++L+   S +
Sbjct: 198 DQNN-----YTTSKQPFKDDSDIGSTPISSITLFGKIVLVAEESHK 220

BLAST of Cp4.1LG02g10080 vs. Swiss-Prot
Match: RVE2_ARATH (Protein REVEILLE 2 OS=Arabidopsis thaliana GN=RVE2 PE=2 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 9.1e-38
Identity = 119/266 (44.74%), Positives = 157/266 (59.02%), Query Frame = 1

Query: 34  RVSSLCSYGNESS-----LKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHV 93
           R  SLCS    SS     LK RKPYTI+KQREKWTE EH++F+EALKLYGR WR+I+EHV
Sbjct: 6   RCESLCSDELISSSDAFYLKTRKPYTITKQREKWTEAEHEKFVEALKLYGRAWRRIEEHV 65

Query: 94  GTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPIEIPPPRPKRKPLHPYPRKAVDSLKAI 153
           GTKTAVQIRSHAQKFF+KV R+   S++S    IEIPPPRPKRKP+HPYPRK V     I
Sbjct: 66  GTKTAVQIRSHAQKFFTKVARDFGVSSES----IEIPPPRPKRKPMHPYPRKLV-----I 125

Query: 154 SVAREPERSP-SPNLSIAEKETQSPTSVLTAFSSDDQISTVSVQHNRCSSPISQAVDIQS 213
             A+E   +  + +  I +++ +SPTSVL+A  SD   S  S   N  S+ +S   + +S
Sbjct: 126 PDAKEMVYAELTGSKLIQDEDNRSPTSVLSAHGSDGLGSIGSNSPNSSSAELSSHTE-ES 185

Query: 214 TRLPSVRKGEMLLLESSSERFPEDFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRT---- 273
             L +  K  + L   +      D+ +  S   S   K   KL+S  +S++    T    
Sbjct: 186 LSLEAETKQSLKLFGKTF--VVGDYNSSMSCDDSEDGK--KKLYSETQSLQCSSSTSENA 245

Query: 274 ---VMVTDDKKPSLHDFEVTKSSVLD 287
              V+V++ K+     F   KSSV +
Sbjct: 246 ETEVVVSEFKRSERSAFSQLKSSVTE 257

BLAST of Cp4.1LG02g10080 vs. Swiss-Prot
Match: CCA1_ARATH (Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 1.4e-30
Identity = 69/103 (66.99%), Positives = 80/103 (77.67%), Query Frame = 1

Query: 40  SYGNESSLKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSH 99
           S G +  +K RKPYTI+KQRE+WTEEEH RF+EAL+LYGR W++I+EHV TKTAVQIRSH
Sbjct: 5   SSGEDLVIKTRKPYTITKQRERWTEEEHNRFIEALRLYGRAWQKIEEHVATKTAVQIRSH 64

Query: 100 AQKFFSKVVR--EPSGSNDSSINPIEIPPPRPKRKPLHPYPRK 141
           AQKFFSKV +  E  G        I IPPPRPKRKP +PYPRK
Sbjct: 65  AQKFFSKVEKEAEAKGVAMGQALDIAIPPPRPKRKPNNPYPRK 107

BLAST of Cp4.1LG02g10080 vs. TrEMBL
Match: A0A0A0LSL2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G032440 PE=4 SV=1)

HSP 1 Score: 710.3 bits (1832), Expect = 1.5e-201
Identity = 377/478 (78.87%), Positives = 406/478 (84.94%), Query Frame = 1

Query: 3   IQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQREKW 62
           + EKNEG LSN SI ANN LS+DG QLDPLMRVSSL SYGNES+LKVRKPYTISKQREKW
Sbjct: 333 VAEKNEGTLSNGSIAANNGLSNDGAQLDPLMRVSSLSSYGNESALKVRKPYTISKQREKW 392

Query: 63  TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPI 122
           TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRE SGSN+SSINPI
Sbjct: 393 TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRESSGSNESSINPI 452

Query: 123 EIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFSSD 182
           EIPPPRPKRKPLHPYPRKAVDSLKAISVARE ERSPSPNLS+AEKET SPTSVLTAFSSD
Sbjct: 453 EIPPPRPKRKPLHPYPRKAVDSLKAISVARESERSPSPNLSLAEKETHSPTSVLTAFSSD 512

Query: 183 DQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGE------------MLLLESSSERFPE 242
           DQIS VS QHNRC SPISQAVD+Q TR   VRKGE            ML LESSSERFPE
Sbjct: 513 DQISAVSEQHNRCPSPISQAVDMQPTRSSPVRKGELYLQSIVGEEKGMLSLESSSERFPE 572

Query: 243 DFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSLHDFEVTKSSVLDGESK 302
           +FLTLK KPGSASKK+DNKLHSPVKSIKLFGRTVMVT+DK+PS  DFEVT++   + +SK
Sbjct: 573 EFLTLKFKPGSASKKVDNKLHSPVKSIKLFGRTVMVTNDKQPSPLDFEVTETLTFEDDSK 632

Query: 303 NECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQDNSVLYVEAIA 362
           +EC VYAE  V+ML SKHMDV+L+L MDNNGD NMSPGGAP+  T   Q+ SV YV+A+ 
Sbjct: 633 SECKVYAENSVEMLTSKHMDVSLALGMDNNGDLNMSPGGAPSL-TLGKQNRSVPYVKALP 692

Query: 363 NAPQTCWSLYQSVPYFYLAPPDQTNRG------MEERMQNDNSVESSCVDSCSGSSSKDK 422
           NA QTCWSLYQ+VPYFYLAP DQT+ G      MEER+QNDNS ESS  DSCSGS  KD+
Sbjct: 693 NASQTCWSLYQTVPYFYLAPSDQTSTGTSTDHIMEERIQNDNSQESSFADSCSGSPRKDQ 752

Query: 423 NENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEEREGRRVRVC 463
           NE QSPE ECQ+PCLVGRGN N+SKKGFVPYKRCLAQRD SS+ IVSEERE RR RVC
Sbjct: 753 NETQSPEVECQEPCLVGRGNANESKKGFVPYKRCLAQRDTSSALIVSEERESRRARVC 809

BLAST of Cp4.1LG02g10080 vs. TrEMBL
Match: A0A061FVK6_THECC (Homeodomain-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=TCM_013400 PE=4 SV=1)

HSP 1 Score: 370.5 bits (950), Expect = 2.9e-99
Identity = 227/488 (46.52%), Positives = 303/488 (62.09%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           MA Q++ EG  S +++ +  C S+   Q + L +   L ++ ++ + KVRKPYTI+KQRE
Sbjct: 1   MATQDQVEGTTSPNALKSGICCSNSSPQCETLTQFQELYTFKHDHTPKVRKPYTITKQRE 60

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KWTEEEHQ+FLEAL+LYGRGWRQI+EHVGTKTAVQIRSHAQKFFSKVVRE +G  + SIN
Sbjct: 61  KWTEEEHQKFLEALRLYGRGWRQIEEHVGTKTAVQIRSHAQKFFSKVVRESNGGFEGSIN 120

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
           PIEIPPPRPKRKP+HPYPRK+VDSLK IS + EPERSPSP+  + E++ +SPTSVL+A +
Sbjct: 121 PIEIPPPRPKRKPVHPYPRKSVDSLKGISPSSEPERSPSPSQFVREQDNKSPTSVLSALT 180

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKG-EMLLLESSSERFP---------- 240
           SD   S  S Q N CSSP S   ++QS     V K  +     SS+E             
Sbjct: 181 SDAMGSAASEQQNGCSSPTSCTTNMQSINTSPVEKDIDYATSNSSAEEEKASLSSVKVFG 240

Query: 241 ----EDFLTLKSKP---GSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSL--HDFEVT 300
               ED L +K      GS   K D K+  P  SIKLFG+TV V D +KPS+   +F+  
Sbjct: 241 HSAVEDVLPMKLNADFKGSVGAKGDAKMVVPFTSIKLFGKTVQVKDSRKPSMDAENFKSP 300

Query: 301 KSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQD 360
            S    G+   E     +  VQ LPS H+D  LSL   N  DW++ P  A  +       
Sbjct: 301 TSKTAQGDIDAE----GDMLVQALPSTHLDTRLSLGTVNE-DWSVVPSQANLSPYMEIHP 360

Query: 361 NSVLYVEAIANAPQTCWSLYQSVPYFYLAPPDQTNRG--MEERMQNDNSV-ESSCVDSCS 420
           + + +VE+ ++AP   W+ YQ +P++Y+   +QT     +EER++    + E S   S +
Sbjct: 361 DKLDHVESTSDAPLPWWTFYQGLPFYYITSFNQTQTDSCVEERVKQKEILNERSSTGSNT 420

Query: 421 GSSSKDKNENQSP---EFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEER 463
           GS S+ +N  +S    + +CQ PC  G+    +  +GFVPYKRCLA+RD SSS ++SEER
Sbjct: 421 GSVSQAENREKSSYSVDSQCQRPCPEGKTTLQKCSRGFVPYKRCLAERDMSSSVVMSEER 480

BLAST of Cp4.1LG02g10080 vs. TrEMBL
Match: B9I4X5_POPTR (EARLY-PHYTOCHROME-RESPONSIVE1 family protein OS=Populus trichocarpa GN=POPTR_0012s03530g PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 1.8e-85
Identity = 217/491 (44.20%), Positives = 280/491 (57.03%), Query Frame = 1

Query: 2   AIQEKNEGALSNS-----SIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTIS 61
           A +E+ EG   NS       G N+C  S+       +R+  L S+G+++  KVRKPYTI+
Sbjct: 4   ASKEQMEGTNLNSFGKACDFGTNSCEQSETD-----IRMQELYSFGSDNVPKVRKPYTIT 63

Query: 62  KQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSND 121
           KQREKWT+EEHQRFLEALKLYGRGWR+I+EHVGTKTAVQIRSHAQK+FSKVVREP G N+
Sbjct: 64  KQREKWTDEEHQRFLEALKLYGRGWRRIQEHVGTKTAVQIRSHAQKYFSKVVREPGGINE 123

Query: 122 SSINPIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVL 181
           SS+ PIEIPPPRPKRKP HPYPRK V+ L+    + + ERSPSPN S++EKE QSPTSVL
Sbjct: 124 SSLKPIEIPPPRPKRKPAHPYPRKPVNVLEVTGASSQLERSPSPNSSVSEKENQSPTSVL 183

Query: 182 TAFSSDDQISTVSVQHNRCSSPISQAVDIQSTRL-PSVRKGEMLLLESSSER-------- 241
           +A +SD   S +S   N CSSP S   ++ S  L PS ++ E     SS E         
Sbjct: 184 SALASDTFGSALSEPCNACSSPTSCTTEMHSISLSPSAKETEHGTSNSSGEEKGNLSLVQ 243

Query: 242 ----FPEDFLTLKSKPGSASKKLDNKLHSPVK-----SIKLFGRTVMVTDDKKPSLHDFE 301
                 E+FL+   K    SK      H   K     SIKLFG TV + D +K S    E
Sbjct: 244 MSLSLLENFLSEVKKFELGSKNTVCAEHDAAKKASSASIKLFGMTVKIVDSQKESPPGAE 303

Query: 302 VTKSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALN 361
           +    V+  E+ +      EKP   L  K  D  LSL M N+   N+ P  A   +    
Sbjct: 304 IV-LPVISNENHDNVDADKEKPAHTLQRKQSDTELSLGMANSNQ-NLWPSPASVFHCTEM 363

Query: 362 QDNSVLYVEAIANAPQTCWSLYQSVPYFYLAPPDQTNRG-----MEERMQNDNSV-ESSC 421
           Q ++  Y    ++ P   W+L Q VP+ YL   D T+       +EER +    + E SC
Sbjct: 364 QGDNANYFATNSSIP--WWTLCQGVPFLYLTSNDHTSAQKPIPCVEERFEEKEILNERSC 423

Query: 422 VDS--CSGSSSKDKNENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIV 462
             S   S    ++   N   + +C  P + G  +  +S +GFVPYKRCL +RD  S+ I+
Sbjct: 424 TSSNVFSVGDLENGERNLDVDSQCGQPSVEGTSSLQKSTRGFVPYKRCLGERDVKSTVII 483

BLAST of Cp4.1LG02g10080 vs. TrEMBL
Match: V4TXF8_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020130mg PE=4 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 2.9e-83
Identity = 208/448 (46.43%), Positives = 270/448 (60.27%), Query Frame = 1

Query: 36  SSLCSYGNESSLKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQ 95
           +SL S+ N+S  KVRKPYTI+KQREKWTEEEHQRFL+ALK+YGRGWRQI+EHVGTKTAVQ
Sbjct: 3   NSLYSFENDSLPKVRKPYTITKQREKWTEEEHQRFLDALKMYGRGWRQIEEHVGTKTAVQ 62

Query: 96  IRSHAQKFFSKVVREPSGSNDSSINPIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPE 155
           IRSHAQKFFSKVVRE +GS++SSI PIEIPPPRPKRKP+HPYPRK+VDSLKA SV+ + E
Sbjct: 63  IRSHAQKFFSKVVRESNGSSESSIMPIEIPPPRPKRKPVHPYPRKSVDSLKATSVSNQQE 122

Query: 156 RSPSPNLSIAEKETQSPTSVLTAFSSDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRK 215
              S N  +++K+ QSPTSVL++F+SD      S Q N CSSP S   ++ S  L  + K
Sbjct: 123 NFTSSNALVSDKDRQSPTSVLSSFNSDTLGCAASDQQNGCSSPTSCTTEMHSVNLLPIEK 182

Query: 216 GEMLLLESSSERFPEDFLTLKSKPGSASK--------------KLDNKLHSPVKSIKLFG 275
            E   + S S    E   TL +   ++S               K D        SIKLFG
Sbjct: 183 -ENEYVTSISFPKEEKISTLPAHLSASSNVEELASVSKDSVYPKGDAAAAPSCTSIKLFG 242

Query: 276 RTVMVTDDKKPSLHDFEVTKSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNG 335
           RTV+V+D  KP     +  KS +     +N   V  E  VQ  PSKH+D +L L M ++ 
Sbjct: 243 RTVLVSDSWKPYSLGADSYKSPISKSSQEN-LDVDKENFVQSAPSKHLDTHLLLGMVSS- 302

Query: 336 DWNMSPGGAPTNNTALNQDNSVLYVEAIANAPQTCWSLYQSVPYFYLAP--PDQTNRGME 395
           + N S    P       Q       EA  N      SLY+  PYF+L P   +Q    +E
Sbjct: 303 NCNPSSHIGPVFQNMELQKKRTNVAEASHNIYLLWDSLYRGAPYFHLMPIGENQAATPLE 362

Query: 396 ERMQNDNSV-ESSCVDSCSGSSSK----DKNENQSPEFECQDPCLVGRGNRNQSKKGFVP 455
             +++   + E SC  S +GS S+    +K+ + + + +C   C + + + +   +GFVP
Sbjct: 363 FSVKDKEILNERSCSGSSAGSVSELENWEKSSDVAVDSQCPQVCPLSQASPSNCMRGFVP 422

Query: 456 YKRCLAQRDASSSFIVSEEREGRRVRVC 463
           YKRCLA+ +  SS IVSEERE +R RVC
Sbjct: 423 YKRCLAESEIRSSVIVSEERERQRARVC 447

BLAST of Cp4.1LG02g10080 vs. TrEMBL
Match: A0A061FWD0_THECC (Homeodomain-like superfamily protein, putative isoform 2 OS=Theobroma cacao GN=TCM_013400 PE=4 SV=1)

HSP 1 Score: 317.4 bits (812), Expect = 2.9e-83
Identity = 206/488 (42.21%), Positives = 283/488 (57.99%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           MA Q++ EG  S +++ +  C S+   Q + L +   L ++ ++ + KVRKPYTI+KQRE
Sbjct: 1   MATQDQVEGTTSPNALKSGICCSNSSPQCETLTQFQELYTFKHDHTPKVRKPYTITKQRE 60

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KWTEEEHQ+FLEAL+LYGRGWRQI+                    +VVRE +G  + SIN
Sbjct: 61  KWTEEEHQKFLEALRLYGRGWRQIEGF------------------QVVRESNGGFEGSIN 120

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
           PIEIPPPRPKRKP+HPYPRK+VDSLK IS + EPERSPSP+  + E++ +SPTSVL+A +
Sbjct: 121 PIEIPPPRPKRKPVHPYPRKSVDSLKGISPSSEPERSPSPSQFVREQDNKSPTSVLSALT 180

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKG-EMLLLESSSERFP---------- 240
           SD   S  S Q N CSSP S   ++QS     V K  +     SS+E             
Sbjct: 181 SDAMGSAASEQQNGCSSPTSCTTNMQSINTSPVEKDIDYATSNSSAEEEKASLSSVKVFG 240

Query: 241 ----EDFLTLKSKP---GSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSL--HDFEVT 300
               ED L +K      GS   K D K+  P  SIKLFG+TV V D +KPS+   +F+  
Sbjct: 241 HSAVEDVLPMKLNADFKGSVGAKGDAKMVVPFTSIKLFGKTVQVKDSRKPSMDAENFKSP 300

Query: 301 KSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQD 360
            S    G+   E     +  VQ LPS H+D  LSL   N  DW++ P  A  +       
Sbjct: 301 TSKTAQGDIDAE----GDMLVQALPSTHLDTRLSLGTVNE-DWSVVPSQANLSPYMEIHP 360

Query: 361 NSVLYVEAIANAPQTCWSLYQSVPYFYLAPPDQTNRG--MEERMQNDNSV-ESSCVDSCS 420
           + + +VE+ ++AP   W+ YQ +P++Y+   +QT     +EER++    + E S   S +
Sbjct: 361 DKLDHVESTSDAPLPWWTFYQGLPFYYITSFNQTQTDSCVEERVKQKEILNERSSTGSNT 420

Query: 421 GSSSKDKNENQSP---EFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEER 463
           GS S+ +N  +S    + +CQ PC  G+    +  +GFVPYKRCLA+RD SSS ++SEER
Sbjct: 421 GSVSQAENREKSSYSVDSQCQRPCPEGKTTLQKCSRGFVPYKRCLAERDMSSSVVMSEER 465

BLAST of Cp4.1LG02g10080 vs. TAIR10
Match: AT1G18330.2 (AT1G18330.2 Homeodomain-like superfamily protein)

HSP 1 Score: 206.8 bits (525), Expect = 2.8e-53
Identity = 132/277 (47.65%), Positives = 174/277 (62.82%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           MA ++++E   SN   G+  C S++G  ++P        S+  E+ +KVRKPYT++KQRE
Sbjct: 27  MAAEDRSEELSSNVENGS--CNSNEG--INP-----ETSSHWIENVVKVRKPYTVTKQRE 86

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KW+EEEH RFLEA+KLYGRGWRQI+EH+GTKTAVQIRSHAQKFFSK+ +E    ++ S+ 
Sbjct: 87  KWSEEEHDRFLEAIKLYGRGWRQIQEHIGTKTAVQIRSHAQKFFSKMAQEADSRSEGSVK 146

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
            I IPPPRPKRKP HPYPRK+              +SP PNLS  EK T+SPTSVL++F 
Sbjct: 147 AIVIPPPRPKRKPAHPYPRKSPVPY---------TQSPPPNLSAMEKGTKSPTSVLSSFG 206

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGEMLLLESSSERFPEDFLTLKSKPG 240
           S+DQ+       NRCSSP S   DIQS    S+ K       +S + F +D     S  G
Sbjct: 207 SEDQV-------NRCSSPNSCTSDIQSIGATSIDKKNN--YTTSKQPFKDD-----SDIG 261

Query: 241 SASKKLDNKLHSPVKSIKLFGRTVMVTDDK-KPSLHD 277
           S          +P+ SI LFG+ V+V ++  KPS ++
Sbjct: 267 S----------TPISSITLFGKIVLVAEESHKPSSYN 261

BLAST of Cp4.1LG02g10080 vs. TAIR10
Match: AT5G17300.1 (AT5G17300.1 Homeodomain-like superfamily protein)

HSP 1 Score: 193.4 bits (490), Expect = 3.2e-49
Identity = 123/269 (45.72%), Positives = 163/269 (60.59%), Query Frame = 1

Query: 42  GNESSLKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQ 101
           GN+ + KVRKPYTI+K+RE+WT+EEH++F+EALKLYGR WR+I+EHVG+KTAVQIRSHAQ
Sbjct: 38  GNDYAPKVRKPYTITKERERWTDEEHKKFVEALKLYGRAWRRIEEHVGSKTAVQIRSHAQ 97

Query: 102 KFFSKVVREPSGSNDSSINPIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPN 161
           KFFSKV RE +G + SS+ PI IPPPRPKRKP HPYPRK  +       A +  RS SP 
Sbjct: 98  KFFSKVAREATGGDGSSVEPIVIPPPRPKRKPAHPYPRKFGNE------ADQTSRSVSP- 157

Query: 162 LSIAEKETQSPTSVLTAFSSDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGEMLLL 221
              +E++TQSPTSVL+   S+   S  S   NR  SP+S A    +              
Sbjct: 158 ---SERDTQSPTSVLSTVGSEALCSLDSSSPNRSLSPVSSASPPAAL------------- 217

Query: 222 ESSSERFPEDFLTLKSK--PGSASKKLDNKLHSPVK-SIKLFGRTVMVTDDKKPSLHDFE 281
            +++   PE+  TLK +  P       ++ +  P K S+KLFG+TV+V+D    S     
Sbjct: 218 -TTTANAPEELETLKLELFPSERLLNRESSIKEPTKQSLKLFGKTVLVSDSGMSS----S 266

Query: 282 VTKSSVLDGESKNECGVYAEKPVQMLPSK 308
           +T S+            Y + P+Q LP K
Sbjct: 278 LTTST------------YCKSPIQPLPRK 266

BLAST of Cp4.1LG02g10080 vs. TAIR10
Match: AT3G10113.1 (AT3G10113.1 Homeodomain-like superfamily protein)

HSP 1 Score: 185.3 bits (469), Expect = 8.7e-47
Identity = 110/226 (48.67%), Positives = 148/226 (65.49%), Query Frame = 1

Query: 3   IQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQREKW 62
           +Q+++E   SN   G+  C S++G  ++P        S+  E+ +KVRKPYT++KQREKW
Sbjct: 18  LQDRSEELSSNVENGS--CNSNEG--INP-----ETSSHWIENVVKVRKPYTVTKQREKW 77

Query: 63  TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPI 122
           +EEEH RFLEA+KLYGRGWRQI+EH+GTKTAVQIRSHAQKFFSK+ +E    ++ S+  I
Sbjct: 78  SEEEHDRFLEAIKLYGRGWRQIQEHIGTKTAVQIRSHAQKFFSKMAQEADSRSEGSVKAI 137

Query: 123 EIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFSSD 182
            IPPPRPKRKP HPYPRK+              +SP PNLS  EK T+SPTSVL++F S+
Sbjct: 138 VIPPPRPKRKPAHPYPRKSPVPY---------TQSPPPNLSAMEKGTKSPTSVLSSFGSE 197

Query: 183 DQISTVSVQHNRCSSPISQAVDIQSTRLPSVRK-GEMLLLESSSER 228
           DQ +     +     P     DI ST + S+   G+++L+   S +
Sbjct: 198 DQNN-----YTTSKQPFKDDSDIGSTPISSITLFGKIVLVAEESHK 220

BLAST of Cp4.1LG02g10080 vs. TAIR10
Match: AT5G37260.1 (AT5G37260.1 Homeodomain-like superfamily protein)

HSP 1 Score: 159.5 bits (402), Expect = 5.1e-39
Identity = 119/266 (44.74%), Positives = 157/266 (59.02%), Query Frame = 1

Query: 34  RVSSLCSYGNESS-----LKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHV 93
           R  SLCS    SS     LK RKPYTI+KQREKWTE EH++F+EALKLYGR WR+I+EHV
Sbjct: 6   RCESLCSDELISSSDAFYLKTRKPYTITKQREKWTEAEHEKFVEALKLYGRAWRRIEEHV 65

Query: 94  GTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPIEIPPPRPKRKPLHPYPRKAVDSLKAI 153
           GTKTAVQIRSHAQKFF+KV R+   S++S    IEIPPPRPKRKP+HPYPRK V     I
Sbjct: 66  GTKTAVQIRSHAQKFFTKVARDFGVSSES----IEIPPPRPKRKPMHPYPRKLV-----I 125

Query: 154 SVAREPERSP-SPNLSIAEKETQSPTSVLTAFSSDDQISTVSVQHNRCSSPISQAVDIQS 213
             A+E   +  + +  I +++ +SPTSVL+A  SD   S  S   N  S+ +S   + +S
Sbjct: 126 PDAKEMVYAELTGSKLIQDEDNRSPTSVLSAHGSDGLGSIGSNSPNSSSAELSSHTE-ES 185

Query: 214 TRLPSVRKGEMLLLESSSERFPEDFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRT---- 273
             L +  K  + L   +      D+ +  S   S   K   KL+S  +S++    T    
Sbjct: 186 LSLEAETKQSLKLFGKTF--VVGDYNSSMSCDDSEDGK--KKLYSETQSLQCSSSTSENA 245

Query: 274 ---VMVTDDKKPSLHDFEVTKSSVLD 287
              V+V++ K+     F   KSSV +
Sbjct: 246 ETEVVVSEFKRSERSAFSQLKSSVTE 257

BLAST of Cp4.1LG02g10080 vs. TAIR10
Match: AT2G46830.1 (AT2G46830.1 circadian clock associated 1)

HSP 1 Score: 135.6 bits (340), Expect = 7.9e-32
Identity = 69/103 (66.99%), Positives = 80/103 (77.67%), Query Frame = 1

Query: 40  SYGNESSLKVRKPYTISKQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSH 99
           S G +  +K RKPYTI+KQRE+WTEEEH RF+EAL+LYGR W++I+EHV TKTAVQIRSH
Sbjct: 5   SSGEDLVIKTRKPYTITKQRERWTEEEHNRFIEALRLYGRAWQKIEEHVATKTAVQIRSH 64

Query: 100 AQKFFSKVVR--EPSGSNDSSINPIEIPPPRPKRKPLHPYPRK 141
           AQKFFSKV +  E  G        I IPPPRPKRKP +PYPRK
Sbjct: 65  AQKFFSKVEKEAEAKGVAMGQALDIAIPPPRPKRKPNNPYPRK 107
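
The five TAIR10 hits have the same bit scores as the corresponding Swiss-Prot hits but smaller E-values. For a fixed bit score S, the BLAST E-value is approximately m·n·2^(−S), where m·n is the effective search space, so the ratio of two E-values for the same alignment should roughly equal the ratio of the two search spaces. The sketch below checks that the ratio is consistent across hits. The pairing of entries by identical bit score is taken from this report; the database sizes themselves are not given here, so reading the ratio as a database-size effect is an assumption.

```python
# (label, bit score, Swiss-Prot E-value, TAIR10 E-value), copied from the report
pairs = [
    ("RVE7_ARATH / AT1G18330.2", 206.8, 5.0e-52, 2.8e-53),
    ("RVE1_ARATH / AT5G17300.1", 193.4, 5.7e-48, 3.2e-49),
    ("RVE7L_ARATH / AT3G10113.1", 185.3, 1.5e-45, 8.7e-47),
    ("RVE2_ARATH / AT5G37260.1", 159.5, 9.1e-38, 5.1e-39),
    ("CCA1_ARATH / AT2G46830.1", 135.6, 1.4e-30, 7.9e-32),
]

# Ratio of E-values for the same alignment in the two databases.
ratios = [e_swissprot / e_tair for _, _, e_swissprot, e_tair in pairs]

for (label, bits, _, _), ratio in zip(pairs, ratios):
    print(f"{label}: {bits} bits, E-value ratio = {ratio:.1f}")
```

All five ratios fall between 17 and 18, which is what a constant difference in effective search space would produce; the small spread is consistent with the two-digit rounding of the reported E-values.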

BLAST of Cp4.1LG02g10080 vs. NCBI nr
Match: gi|659066338|ref|XP_008439595.1| (PREDICTED: protein REVEILLE 2-like [Cucumis melo])

HSP 1 Score: 728.0 bits (1878), Expect = 1.0e-206
Identity = 384/482 (79.67%), Positives = 410/482 (85.06%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           M +QEKNEG LSN SI ANNCLS+DG QLDPLMRVSSL SYGNE++LKVRKPYTISKQRE
Sbjct: 1   MGVQEKNEGTLSNGSIAANNCLSNDGAQLDPLMRVSSLSSYGNENALKVRKPYTISKQRE 60

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRE SGSN+SSIN
Sbjct: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRESSGSNESSIN 120

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
           PIEIPPPRPKRKPLHPYPRKAVDSLKAISVARE ERSPSPNLS+AEKET SPTSVLTAFS
Sbjct: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVARESERSPSPNLSLAEKETHSPTSVLTAFS 180

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGE--------------MLLLESSSE 240
           SDDQIS VS QHNRC SPISQAVD+Q TRL  VRKGE              ML LES+S 
Sbjct: 181 SDDQISAVSEQHNRCPSPISQAVDMQPTRLSPVRKGELYLPSKSNVGEEKGMLSLESTSG 240

Query: 241 RFPEDFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSLHDFEVTKSSVLD 300
           +FPEDFLTLK KPGSASKK+DNKLHSPVKSIKLFGRTVMVT+DK+PSL DFEVT++   +
Sbjct: 241 QFPEDFLTLKFKPGSASKKVDNKLHSPVKSIKLFGRTVMVTNDKQPSLLDFEVTETLTFE 300

Query: 301 GESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQDNSVLYV 360
           G+SK EC V AE  V+MLPSKHMDV+L+L MDNNGD NM PGGAPT  T  NQD SV YV
Sbjct: 301 GDSKRECKVSAENSVEMLPSKHMDVSLALGMDNNGDLNMPPGGAPT-LTLGNQDKSVPYV 360

Query: 361 EAIANAPQTCWSLYQSVPYFYLAPPDQTNRG------MEERMQNDNSVESSCVDSCSGSS 420
           +A  NAPQTCWSLYQ+VPYFYLAP DQT+ G      MEER+QNDNS ESS  DSCSGS 
Sbjct: 361 KAFPNAPQTCWSLYQNVPYFYLAPSDQTSTGTSTDHIMEERIQNDNSQESSFADSCSGSP 420

Query: 421 SKDKNENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEEREGRRVR 463
            KDKNE QSPE ECQ+PCLVGRGN N+SKKGFVPYKRCLAQRD SS+ IVSEERE RR R
Sbjct: 421 RKDKNETQSPEVECQEPCLVGRGNANESKKGFVPYKRCLAQRDTSSALIVSEERESRRAR 480

BLAST of Cp4.1LG02g10080 vs. NCBI nr
Match: gi|778656780|ref|XP_011649662.1| (PREDICTED: protein REVEILLE 7-like [Cucumis sativus])

HSP 1 Score: 714.5 bits (1843), Expect = 1.2e-202
Identity = 379/480 (78.96%), Positives = 408/480 (85.00%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           M +QEKNEG LSN SI ANN LS+DG QLDPLMRVSSL SYGNES+LKVRKPYTISKQRE
Sbjct: 1   MGVQEKNEGTLSNGSIAANNGLSNDGAQLDPLMRVSSLSSYGNESALKVRKPYTISKQRE 60

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRE SGSN+SSIN
Sbjct: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRESSGSNESSIN 120

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
           PIEIPPPRPKRKPLHPYPRKAVDSLKAISVARE ERSPSPNLS+AEKET SPTSVLTAFS
Sbjct: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVARESERSPSPNLSLAEKETHSPTSVLTAFS 180

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGE------------MLLLESSSERF 240
           SDDQIS VS QHNRC SPISQAVD+Q TR   VRKGE            ML LESSSERF
Sbjct: 181 SDDQISAVSEQHNRCPSPISQAVDMQPTRSSPVRKGELYLQSIVGEEKGMLSLESSSERF 240

Query: 241 PEDFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSLHDFEVTKSSVLDGE 300
           PE+FLTLK KPGSASKK+DNKLHSPVKSIKLFGRTVMVT+DK+PS  DFEVT++   + +
Sbjct: 241 PEEFLTLKFKPGSASKKVDNKLHSPVKSIKLFGRTVMVTNDKQPSPLDFEVTETLTFEDD 300

Query: 301 SKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQDNSVLYVEA 360
           SK+EC VYAE  V+ML SKHMDV+L+L MDNNGD NMSPGGAP+  T   Q+ SV YV+A
Sbjct: 301 SKSECKVYAENSVEMLTSKHMDVSLALGMDNNGDLNMSPGGAPSL-TLGKQNRSVPYVKA 360

Query: 361 IANAPQTCWSLYQSVPYFYLAPPDQTNRG------MEERMQNDNSVESSCVDSCSGSSSK 420
           + NA QTCWSLYQ+VPYFYLAP DQT+ G      MEER+QNDNS ESS  DSCSGS  K
Sbjct: 361 LPNASQTCWSLYQTVPYFYLAPSDQTSTGTSTDHIMEERIQNDNSQESSFADSCSGSPRK 420

Query: 421 DKNENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEEREGRRVRVC 463
           D+NE QSPE ECQ+PCLVGRGN N+SKKGFVPYKRCLAQRD SS+ IVSEERE RR RVC
Sbjct: 421 DQNETQSPEVECQEPCLVGRGNANESKKGFVPYKRCLAQRDTSSALIVSEERESRRARVC 479

BLAST of Cp4.1LG02g10080 vs. NCBI nr
Match: gi|700208878|gb|KGN63974.1| (hypothetical protein Csa_1G032440 [Cucumis sativus])

HSP 1 Score: 710.3 bits (1832), Expect = 2.2e-201
Identity = 377/478 (78.87%), Positives = 406/478 (84.94%), Query Frame = 1

Query: 3   IQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQREKW 62
           + EKNEG LSN SI ANN LS+DG QLDPLMRVSSL SYGNES+LKVRKPYTISKQREKW
Sbjct: 333 VAEKNEGTLSNGSIAANNGLSNDGAQLDPLMRVSSLSSYGNESALKVRKPYTISKQREKW 392

Query: 63  TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSINPI 122
           TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRE SGSN+SSINPI
Sbjct: 393 TEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVRESSGSNESSINPI 452

Query: 123 EIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFSSD 182
           EIPPPRPKRKPLHPYPRKAVDSLKAISVARE ERSPSPNLS+AEKET SPTSVLTAFSSD
Sbjct: 453 EIPPPRPKRKPLHPYPRKAVDSLKAISVARESERSPSPNLSLAEKETHSPTSVLTAFSSD 512

Query: 183 DQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKGE------------MLLLESSSERFPE 242
           DQIS VS QHNRC SPISQAVD+Q TR   VRKGE            ML LESSSERFPE
Sbjct: 513 DQISAVSEQHNRCPSPISQAVDMQPTRSSPVRKGELYLQSIVGEEKGMLSLESSSERFPE 572

Query: 243 DFLTLKSKPGSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSLHDFEVTKSSVLDGESK 302
           +FLTLK KPGSASKK+DNKLHSPVKSIKLFGRTVMVT+DK+PS  DFEVT++   + +SK
Sbjct: 573 EFLTLKFKPGSASKKVDNKLHSPVKSIKLFGRTVMVTNDKQPSPLDFEVTETLTFEDDSK 632

Query: 303 NECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQDNSVLYVEAIA 362
           +EC VYAE  V+ML SKHMDV+L+L MDNNGD NMSPGGAP+  T   Q+ SV YV+A+ 
Sbjct: 633 SECKVYAENSVEMLTSKHMDVSLALGMDNNGDLNMSPGGAPSL-TLGKQNRSVPYVKALP 692

Query: 363 NAPQTCWSLYQSVPYFYLAPPDQTNRG------MEERMQNDNSVESSCVDSCSGSSSKDK 422
           NA QTCWSLYQ+VPYFYLAP DQT+ G      MEER+QNDNS ESS  DSCSGS  KD+
Sbjct: 693 NASQTCWSLYQTVPYFYLAPSDQTSTGTSTDHIMEERIQNDNSQESSFADSCSGSPRKDQ 752

Query: 423 NENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEEREGRRVRVC 463
           NE QSPE ECQ+PCLVGRGN N+SKKGFVPYKRCLAQRD SS+ IVSEERE RR RVC
Sbjct: 753 NETQSPEVECQEPCLVGRGNANESKKGFVPYKRCLAQRDTSSALIVSEERESRRARVC 809

BLAST of Cp4.1LG02g10080 vs. NCBI nr
Match: gi|590666823|ref|XP_007037069.1| (Homeodomain-like superfamily protein, putative isoform 1 [Theobroma cacao])

HSP 1 Score: 370.5 bits (950), Expect = 4.1e-99
Identity = 227/488 (46.52%), Positives = 303/488 (62.09%), Query Frame = 1

Query: 1   MAIQEKNEGALSNSSIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTISKQRE 60
           MA Q++ EG  S +++ +  C S+   Q + L +   L ++ ++ + KVRKPYTI+KQRE
Sbjct: 1   MATQDQVEGTTSPNALKSGICCSNSSPQCETLTQFQELYTFKHDHTPKVRKPYTITKQRE 60

Query: 61  KWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSNDSSIN 120
           KWTEEEHQ+FLEAL+LYGRGWRQI+EHVGTKTAVQIRSHAQKFFSKVVRE +G  + SIN
Sbjct: 61  KWTEEEHQKFLEALRLYGRGWRQIEEHVGTKTAVQIRSHAQKFFSKVVRESNGGFEGSIN 120

Query: 121 PIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVLTAFS 180
           PIEIPPPRPKRKP+HPYPRK+VDSLK IS + EPERSPSP+  + E++ +SPTSVL+A +
Sbjct: 121 PIEIPPPRPKRKPVHPYPRKSVDSLKGISPSSEPERSPSPSQFVREQDNKSPTSVLSALT 180

Query: 181 SDDQISTVSVQHNRCSSPISQAVDIQSTRLPSVRKG-EMLLLESSSERFP---------- 240
           SD   S  S Q N CSSP S   ++QS     V K  +     SS+E             
Sbjct: 181 SDAMGSAASEQQNGCSSPTSCTTNMQSINTSPVEKDIDYATSNSSAEEEKASLSSVKVFG 240

Query: 241 ----EDFLTLKSKP---GSASKKLDNKLHSPVKSIKLFGRTVMVTDDKKPSL--HDFEVT 300
               ED L +K      GS   K D K+  P  SIKLFG+TV V D +KPS+   +F+  
Sbjct: 241 HSAVEDVLPMKLNADFKGSVGAKGDAKMVVPFTSIKLFGKTVQVKDSRKPSMDAENFKSP 300

Query: 301 KSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALNQD 360
            S    G+   E     +  VQ LPS H+D  LSL   N  DW++ P  A  +       
Sbjct: 301 TSKTAQGDIDAE----GDMLVQALPSTHLDTRLSLGTVNE-DWSVVPSQANLSPYMEIHP 360

Query: 361 NSVLYVEAIANAPQTCWSLYQSVPYFYLAPPDQTNRG--MEERMQNDNSV-ESSCVDSCS 420
           + + +VE+ ++AP   W+ YQ +P++Y+   +QT     +EER++    + E S   S +
Sbjct: 361 DKLDHVESTSDAPLPWWTFYQGLPFYYITSFNQTQTDSCVEERVKQKEILNERSSTGSNT 420

Query: 421 GSSSKDKNENQSP---EFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIVSEER 463
           GS S+ +N  +S    + +CQ PC  G+    +  +GFVPYKRCLA+RD SSS ++SEER
Sbjct: 421 GSVSQAENREKSSYSVDSQCQRPCPEGKTTLQKCSRGFVPYKRCLAERDMSSSVVMSEER 480

BLAST of Cp4.1LG02g10080 vs. NCBI nr
Match: gi|224118068|ref|XP_002317724.1| (EARLY-PHYTOCHROME-RESPONSIVE1 family protein [Populus trichocarpa])

HSP 1 Score: 324.7 bits (831), Expect = 2.6e-85
Identity = 217/491 (44.20%), Positives = 280/491 (57.03%), Query Frame = 1

Query: 2   AIQEKNEGALSNS-----SIGANNCLSSDGTQLDPLMRVSSLCSYGNESSLKVRKPYTIS 61
           A +E+ EG   NS       G N+C  S+       +R+  L S+G+++  KVRKPYTI+
Sbjct: 4   ASKEQMEGTNLNSFGKACDFGTNSCEQSETD-----IRMQELYSFGSDNVPKVRKPYTIT 63

Query: 62  KQREKWTEEEHQRFLEALKLYGRGWRQIKEHVGTKTAVQIRSHAQKFFSKVVREPSGSND 121
           KQREKWT+EEHQRFLEALKLYGRGWR+I+EHVGTKTAVQIRSHAQK+FSKVVREP G N+
Sbjct: 64  KQREKWTDEEHQRFLEALKLYGRGWRRIQEHVGTKTAVQIRSHAQKYFSKVVREPGGINE 123

Query: 122 SSINPIEIPPPRPKRKPLHPYPRKAVDSLKAISVAREPERSPSPNLSIAEKETQSPTSVL 181
           SS+ PIEIPPPRPKRKP HPYPRK V+ L+    + + ERSPSPN S++EKE QSPTSVL
Sbjct: 124 SSLKPIEIPPPRPKRKPAHPYPRKPVNVLEVTGASSQLERSPSPNSSVSEKENQSPTSVL 183

Query: 182 TAFSSDDQISTVSVQHNRCSSPISQAVDIQSTRL-PSVRKGEMLLLESSSER-------- 241
           +A +SD   S +S   N CSSP S   ++ S  L PS ++ E     SS E         
Sbjct: 184 SALASDTFGSALSEPCNACSSPTSCTTEMHSISLSPSAKETEHGTSNSSGEEKGNLSLVQ 243

Query: 242 ----FPEDFLTLKSKPGSASKKLDNKLHSPVK-----SIKLFGRTVMVTDDKKPSLHDFE 301
                 E+FL+   K    SK      H   K     SIKLFG TV + D +K S    E
Sbjct: 244 MSLSLLENFLSEVKKFELGSKNTVCAEHDAAKKASSASIKLFGMTVKIVDSQKESPPGAE 303

Query: 302 VTKSSVLDGESKNECGVYAEKPVQMLPSKHMDVNLSLRMDNNGDWNMSPGGAPTNNTALN 361
           +    V+  E+ +      EKP   L  K  D  LSL M N+   N+ P  A   +    
Sbjct: 304 IV-LPVISNENHDNVDADKEKPAHTLQRKQSDTELSLGMANSNQ-NLWPSPASVFHCTEM 363

Query: 362 QDNSVLYVEAIANAPQTCWSLYQSVPYFYLAPPDQTNRG-----MEERMQNDNSV-ESSC 421
           Q ++  Y    ++ P   W+L Q VP+ YL   D T+       +EER +    + E SC
Sbjct: 364 QGDNANYFATNSSIP--WWTLCQGVPFLYLTSNDHTSAQKPIPCVEERFEEKEILNERSC 423

Query: 422 VDS--CSGSSSKDKNENQSPEFECQDPCLVGRGNRNQSKKGFVPYKRCLAQRDASSSFIV 462
             S   S    ++   N   + +C  P + G  +  +S +GFVPYKRCL +RD  S+ I+
Sbjct: 424 TSSNVFSVGDLENGERNLDVDSQCGQPSVEGTSSLQKSTRGFVPYKRCLGERDVKSTVII 483
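
The "Score ... Expect" lines above follow a fixed pattern, so the hits can be compared programmatically. The following is a rough sketch only: the regular expression and the function name `parse_score_line` are assumptions written for this page's text layout, not an official BLAST parser, and the three header lines are copied from the NCBI nr hits above.

```python
import re

# Pattern for lines such as "HSP 1 Score: 324.7 bits (831), Expect = 2.6e-85".
SCORE_LINE = re.compile(r"Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")


def parse_score_line(line: str) -> dict:
    """Extract bit score, raw score and E-value from one HSP header line."""
    match = SCORE_LINE.search(line)
    if match is None:
        raise ValueError(f"not an HSP score line: {line!r}")
    bits, raw, expect = match.groups()
    return {"bit_score": float(bits), "raw_score": int(raw), "evalue": float(expect)}


# Header lines copied from three of the NCBI nr hits in this record.
hits = {
    "XP_002317724.1": "HSP 1 Score: 324.7 bits (831), Expect = 2.6e-85",
    "XP_008439595.1": "HSP 1 Score: 728.0 bits (1878), Expect = 1.0e-206",
    "XP_007037069.1": "HSP 1 Score: 370.5 bits (950), Expect = 4.1e-99",
}
parsed = {name: parse_score_line(line) for name, line in hits.items()}

# A smaller E-value means a more significant hit, so sort ascending.
ranked = sorted(parsed, key=lambda name: parsed[name]["evalue"])
print(ranked)
```

Sorting by E-value puts the Cucumis melo hit first, ahead of the Theobroma cacao and Populus trichocarpa hits, which matches the order in which the hits are listed on this page.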

The following BLAST results are available for this feature:
Match Name     E-value    Identity   Description
RVE7_ARATH     5.0e-52    47.65      Protein REVEILLE 7 OS=Arabidopsis thaliana GN=RVE7 PE=2 SV=1
RVE1_ARATH     5.7e-48    45.72      Protein REVEILLE 1 OS=Arabidopsis thaliana GN=RVE1 PE=2 SV=1
RVE7L_ARATH    1.5e-45    48.67      Protein REVEILLE 7-like OS=Arabidopsis thaliana GN=RVE7L PE=3 SV=1
RVE2_ARATH     9.1e-38    44.74      Protein REVEILLE 2 OS=Arabidopsis thaliana GN=RVE2 PE=2 SV=1
CCA1_ARATH     1.4e-30    66.99      Protein CCA1 OS=Arabidopsis thaliana GN=CCA1 PE=1 SV=1
Match Name          E-value     Identity   Description
A0A0A0LSL2_CUCSA    1.5e-201    78.87      Uncharacterized protein OS=Cucumis sativus GN=Csa_1G032440 PE=4 SV=1
A0A061FVK6_THECC    2.9e-99     46.52      Homeodomain-like superfamily protein, putative isoform 1 OS=Theobroma cacao GN=T...
B9I4X5_POPTR        1.8e-85     44.20      EARLY-PHYTOCHROME-RESPONSIVE1 family protein OS=Populus trichocarpa GN=POPTR_001...
V4TXF8_9ROSI        2.9e-83     46.43      Uncharacterized protein OS=Citrus clementina GN=CICLE_v10020130mg PE=4 SV=1
A0A061FWD0_THECC    2.9e-83     42.21      Homeodomain-like superfamily protein, putative isoform 2 OS=Theobroma cacao GN=T...
Match Name     E-value    Identity   Description
AT1G18330.2    2.8e-53    47.65      Homeodomain-like superfamily protein
AT5G17300.1    3.2e-49    45.72      Homeodomain-like superfamily protein
AT3G10113.1    8.7e-47    48.67      Homeodomain-like superfamily protein
AT5G37260.1    5.1e-39    44.74      Homeodomain-like superfamily protein
AT2G46830.1    7.9e-32    66.99      circadian clock associated 1
Match Name                          E-value     Identity   Description
gi|659066338|ref|XP_008439595.1|    1.0e-206    79.67      PREDICTED: protein REVEILLE 2-like [Cucumis melo]
gi|778656780|ref|XP_011649662.1|    1.2e-202    78.96      PREDICTED: protein REVEILLE 7-like [Cucumis sativus]
gi|700208878|gb|KGN63974.1|         2.2e-201    78.87      hypothetical protein Csa_1G032440 [Cucumis sativus]
gi|590666823|ref|XP_007037069.1|    4.1e-99     46.52      Homeodomain-like superfamily protein, putative isoform 1 [Theobroma cacao]
gi|224118068|ref|XP_002317724.1|    2.6e-85     44.20      EARLY-PHYTOCHROME-RESPONSIVE1 family protein [Populus trichocarpa]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0003677    DNA binding
Vocabulary: INTERPRO
Term         Definition
IPR017930    Myb_dom
IPR009057    Homeobox-like_sf
IPR006447    Myb_dom_plants
IPR001005    SANT/Myb
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession   Term Name
biological_process    GO:0008150       biological_process
biological_process    GO:0042967       acyl-carrier-protein biosynthetic process
biological_process    GO:0016573       histone acetylation
biological_process    GO:0006355       regulation of transcription, DNA-templated
cellular_component    GO:0005575       cellular_component
cellular_component    GO:0000123       histone acetyltransferase complex
cellular_component    GO:0005667       transcription factor complex
cellular_component    GO:0005634       nucleus
molecular_function    GO:0003677       DNA binding
molecular_function    GO:0004402       histone acetyltransferase activity
molecular_function    GO:0005515       protein binding
molecular_function    GO:0003712       transcription cofactor activity
molecular_function    GO:0008270       zinc ion binding
molecular_function    GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name         Unique Name          Type
Cp4.1LG02g10080.1    Cp4.1LG02g10080.1    mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term     IPR Description        Source      Source Term         Source Description           Alignment
IPR001005    SANT/Myb domain        PFAM        PF00249             Myb_DNA-binding              coord: 59..102, score: 7.6
IPR001005    SANT/Myb domain        SMART       SM00717             sant                         coord: 58..106, score: 9.0
IPR006447    Myb domain, plants     TIGRFAMs    TIGR01557           TIGR01557                    coord: 57..106, score: 3.5
IPR009057    Homeodomain-like       GENE3D      G3DSA:1.10.10.60    -                            coord: 59..99, score: 1.
IPR009057    Homeodomain-like       unknown     SSF46689            Homeodomain-like             coord: 53..107, score: 2.47
IPR017930    Myb domain             PROFILE     PS51294             HTH_MYB                      coord: 54..108, score: 21
None         No IPR available       PANTHER     PTHR12802           SWI/SNF COMPLEX-RELATED      coord: 40..225, score: 4.2
None         No IPR available       PANTHER     PTHR12802:SF60      PROTEIN REVEILLE 1-RELATED   coord: 40..225, score: 4.2
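
The six domain-level hits above (PFAM, SMART, TIGRFAMs, GENE3D, SSF46689 and PROFILE) all describe roughly the same Myb/SANT region of the protein. As an illustrative sketch, assuming the coordinates are 1-based inclusive residue positions and leaving out the family-level PANTHER hits, the outer envelope and the shared core of those predictions can be computed as follows (the function names are hypothetical):

```python
# (source, start, end) copied from the "coord:" values of the domain-level rows.
domain_hits = [
    ("PFAM PF00249", 59, 102),
    ("SMART SM00717", 58, 106),
    ("TIGRFAMs TIGR01557", 57, 106),
    ("GENE3D G3DSA:1.10.10.60", 59, 99),
    ("SSF46689", 53, 107),
    ("PROFILE PS51294", 54, 108),
]


def outer_envelope(hits):
    """Smallest span that contains every hit (a true union only if all hits overlap)."""
    return min(start for _, start, _ in hits), max(end for _, _, end in hits)


def shared_core(hits):
    """Span covered by every hit, or None if the hits have no common region."""
    start = max(s for _, s, _ in hits)
    end = min(e for _, _, e in hits)
    return (start, end) if start <= end else None


print(outer_envelope(domain_hits))  # (53, 108)
print(shared_core(domain_hits))     # (59, 99)
```

On these numbers every method agrees on residues 59..99, and no prediction extends beyond 53..108.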