Cp4.1LG14g03640 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG14g03640
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Sequence-specific DNA-binding transcription factor
Location: Cp4.1LG14 : 2107933 .. 2112154 (-)
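
A minimal sketch of how the location string above could be split into its parts and turned into a span length. The function name `parse_location` and the regular expression are illustrative assumptions, not part of the database, and the coordinates are assumed to be 1-based and inclusive.

```python
import re


def parse_location(text):
    """Split a location such as 'Cp4.1LG14 : 2107933 .. 2112154 (-)' into fields.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),        # linkage group / scaffold name
        "start": start,
        "end": end,
        "strand": m.group(4),       # '+' or '-'
        "length": end - start + 1,  # inclusive span length
    }


loc = parse_location("Cp4.1LG14 : 2107933 .. 2112154 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption, the span length is simply `end - start + 1`; the strand symbol indicates that the gene is annotated on the reverse strand.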
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
TACATTCATTTATATCCGCTCAAAGATTTCCGAAAATAACGAATTTCATTTCTGTAGACGGGATCTTCAATTCTGCTCTCTTTTTCCCGGCACTTTTTTTTTCTTCTTCTCAGTTTCTCTCTCTGCTTCTGCGATAGCGACGCCGAAATCAGAGAAACCACAGCGGAAGGTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACGGCTCCCGAGGTTTTTCTCCGCTTCTTCCTCTTCCTGTTCTGTATTTTTCTTATTTTGTCTCTTTTTTTTTTTTTTTTTTTGCTGTGAATGATGCGTCTGTGATGCATTTTCGTTGCTGGAGTTGAAATTTTTTGTGGATTACTTCAATTTCTATGTCTATTGGCTTGATTGATTGATTATTCTTTTTGAAATTGTGCTTACTGATTATTGTGCTTGCTTTCTTTTGAACTAATAACTGTTCCTTTGTTTTGCAACTTCTATTGTTCTTTCTAACATTTTTTCCTGTTTCTTATTCCACTGGTAATTGTGTTCTATCCCTTCGAATTACGATTTCTCGTCCCCAAATTCCTAATCTGTAATTCGTTTGCTCTAATTGCACAGAATCGATGCTGTGGTTAGTGTAGGTGTAGGTAGCTACATTTAGGTAAGAACTGCAGCGAGAGGGGACTGTGGGGCTTACTGGATAACGATAATTTGGAATTAGAGTGCCATTTTTAATGAAGTGGCTGGGAGTTAAACTTTTGGAGCATAGTTGTTGATAAAATTTGTTCCAGACCCTAACGAAATAGATCTATCCATCTAAATTTTTGTGGAAATTGTGTTGACCTTCTGCAGTTCAATCTGCAACAAGTTAATTTAGTACATTCTTGATAAGGAATTGAATTCTTTTCAATAAAATAAAATTTTCTTTTCTTGCTTTAAGGCTCCTATTATAGTTGTTTCTAGGCATTAAAGTCTTACAGGGTACACTTCCTCGAAACTTCCGGTGCAGTAATGTTAAATTCTTATTTTGATCAAGGTTGCGGAGATGGACGCTATATTGCAAGCACACAATAATACCATGCCAGCTCGGGAAGTTCTCGTTGCCCTTGCTGAGAAGTTCAGGTGGGCATTGAGCAATAGGCTTACTGTTGTTTTTGTTAAATTTGAATTGATCTTCTTTTGTTTTTTCTATAAGACAAGATTGAATAGTGAAAAGAATAATTTATTTATGTTCTTATAGTGAATCGGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAACAAGTATAGTGACTTCTGTTCTGTACTCGACCTTGATATGATGATAGGTAGTGGTGTTCTGCTGATTATTATTGTTATTTCTTAGGTTTGGAATTGGTTCCAGAATAGACGATATGCTATAAGAGCGAAGACAACCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATCGAGTCAACTCCTGTGAGAAATGTGCCTCAAACCACAGTTGTTCCTGCTCCTGCACCAGTAGGTATTAAAAAAATTTGTTATGTTTTTGAAGTTTTTAAATATTTTCGTTTGCTTATCCTGTTCTTTCTTTTTTATTTCATCTTTATTATGAGCTTATCACTAAAAGATCTATTCTAGTAAGGTATATGTGAGGCTTGGGGTTTGTGGCCCTTAGTCTTCATTTTAATTTCTACGTCCTTAATGAAATTACCATGTAAAAACAAATTTCAAATTGAGCTATGGTAGCTCTTTTGGTCCCTTTGCCCTTCTCAAGGCTTTTCGTCTAGCTCTCTTTTGCACCCGCTGAACATTCCCTCATTCCCCCAATACCCCCGCTTTTTCCTTGATTTGGAAGGTTAAGATTTTGAGGGTTATATTCTTTGTGTAGCATGTCTTACACGAGAGAGTTAAGGCTATAGTCCATGTGTAGAGGGACTCTACCTTCTTTCGAATCTCAAGCATTAGAAATCCATGAGGTATTGTTTTTTCTTTTAAATCATGGGTTGACCGTGTT
TATGTTCTGTGAAGAACTATAATTTTTAGAATGGCATGGTGTGGGTAGCTATCAAGCACGGATGCTTCAGTTTGGATAGCATGTGAACGAATTGTATGTGTCCGAACACTTATTGAACATTTCAACACACGTTGGATGCTTGAGAGTACATGTCACTGTCGAAAAATTGTTGGACACGTGAACTAAATTGCTCATTTATCTATAAGAGAACAATAATAAACTTGAGAGGAAACACTAAACTTGTTTTTTTATGCATATAAGTGGATAAGCATCATTGATTCTAGATTTGCTATTTTAAGATGATTCATATTAAAACATGCTAAGTTTGGTTTTGTATGTTTATTTTTTCATTCAATTTCTGTACCAAATTCCAGGATCTGCAAAGAGTGCTTCAGAAAATCCATCGTTGGAGTTCGAAGCTAAATCTGGGAGAGATGGTGCATGGTCAGTATAGCCTGTCTACAAATATGGGGATTTTGGTGTGAATTAGCTTAGCCGGTCTCCATGTTTTATTATGATAAACTTCTTCATGATACACGTTTTCATAATACAAGAAATTAATAATTGATCTTTTAACTGCTCGTATTTTCAGGTATGACGTTGCTACCTTTCTATCCCATAGATCTGTGGAAAGCGGTGACCTGGTAATTTTGCGCTCGTTCCTTCATTATGTTGCCCTTCTTTCCTCTTCAGTCTATTTTCTGTGCGTTTTTATTAAAAACGAACATGTAAGTATTAAGCCAATTAATTCTTGAACGACACCATTCTTCGTTTATAGGAGAACATTACAATATGTGGTCATGTGATCACACTCCAAAGGATAACGAAGGCCTAATGTCAAGGGCTATTCGTTAGTGTGGTCGTGGATTCAAAACCTTTTGAATCACATTCTCAATCCTATGAGTTCTCTTAGTATTTTATGATGTATGGTTCGGCGTGGTCGCAAACTCAAACAAAAGTTCGAATTATGGGAATTGTAACATTGTCTATAAAGAAACCTGATGTTGTGATTTCTTTGCTTTAATCAGACTGCCTGTATCACCTAATAAACGTTGTTTGTCAGGAAGTACTAGTCAGATTTTCTGGTTTTGGATCGGAGGAGGACGAGTGGGTTAATATCCGAAGGAACATTAGACCTCGTTCTTTTCCTTGTGAATCATCGGAATGCGTGGCAGTTCTTCCGGGTGATCTCATCTTATGCTTTCAGGTAAAAGTGAGGCCTTTCCATCACCTTCTTGCAACCAATTCAGGATAATCTCTTTTACTAGGCCTGTCTATATTGCGCATATAATTTACGACTACCTTGTAAAACGACGCCCTTAAGTGGTGCACACAACGATCTTTTAACGTTCTCACAGCACACCCTATCAGTTATAGAGAAACCTTTTTTATGAGTAATTTTGTGATCTTCAAAATATTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGACGTTCGAGGCTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAGTCTGAGGTGTGTATCTTCTTTAGTTTCTTTTGATCATCAGCATTGTGAGCTGTTTGGGATTATTGATTTGTCTTGCTCTAAATTATTGTTAGGAAATCGTCCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCACTGAGCCCTCAAAGTCTGGCTTGGATTCTGTACTGCTCAGCGGCCAGAGGATAAATTTTGAGGCAACACAAAAGCCAAATGCCAATATAAACGTCCATGCCCAAACTAATACTCAGGAAGGAAGGAGTACTGAAACTAACAGTGCTCCAACCACACTCAACTCTGGTAATTCTGCAGCTAGCTCTGCATTCTCGAGTGGTATCGTGACGTCGAACTCTGTTTCTGGATTGTCGGCTGACAATGTGTCTGATGGGAAGTTACTTAGCTGACTATGAAAACGAATT
TCTCACTCAGTCTAATTTTAACTGAACGTATCAATTTAAAATTTTGCCTGACTCGTTTATTTAGGATGAGTAAATACGTAGCGAAGTCTGTTTTTTGCCACATGTTTCGAAGTTTTAGGTTCGAATCCACTTGTTGTGAATGCTGAATGTTCTCGACGGATAAAAAATGCAGGAGAACGCCTCGAGGCTAACAGGTCAGAGGCATCATCTCTCTTGTTTTAC

mRNA sequence

TACATTCATTTATATCCGCTCAAAGATTTCCGAAAATAACGAATTTCATTTCTGTAGACGGGATCTTCAATTCTGCTCTCTTTTTCCCGGCACTTTTTTTTTCTTCTTCTCAGTTTCTCTCTCTGCTTCTGCGATAGCGACGCCGAAATCAGAGAAACCACAGCGGAAGGTTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACGGCTCCCGAGGTTGCGGAGATGGACGCTATATTGCAAGCACACAATAATACCATGCCAGCTCGGGAAGTTCTCGTTGCCCTTGCTGAGAAGTTCAGTGAATCGGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAACAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTATAAGAGCGAAGACAACCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATCGAGTCAACTCCTGTGAGAAATGTGCCTCAAACCACAGTTGTTCCTGCTCCTGCACCAGTAGGATCTGCAAAGAGTGCTTCAGAAAATCCATCGTTGGAGTTCGAAGCTAAATCTGGGAGAGATGGTGCATGGTATGACGTTGCTACCTTTCTATCCCATAGATCTGTGGAAAGCGGTGACCTGGAAGTACTAGTCAGATTTTCTGGTTTTGGATCGGAGGAGGACGAGTGGGTTAATATCCGAAGGAACATTAGACCTCGTTCTTTTCCTTGTGAATCATCGGAATGCGTGGCAGTTCTTCCGGGTGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGACGTTCGAGGCTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAGTCTGAGGAAATCGTCCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCACTGAGCCCTCAAAGTCTGGCTTGGATTCTGTACTGCTCAGCGGCCAGAGGATAAATTTTGAGGCAACACAAAAGCCAAATGCCAATATAAACGTCCATGCCCAAACTAATACTCAGGAAGGAAGGAGTACTGAAACTAACAGTGCTCCAACCACACTCAACTCTGGTAATTCTGCAGCTAGCTCTGCATTCTCGAGTGGTATCGTGACGTCGAACTCTGTTTCTGGATTGTCGGCTGACAATGTGTCTGATGGGAAGTTACTTAGCTGACTATGAAAACGAATTTCTCACTCAGTCTAATTTTAACTGAACGTATCAATTTAAAATTTTGCCTGACTCGTTTATTTAGGATGAGTAAATACGTAGCGAAGTCTGTTTTTTGCCACATGTTTCGAAGTTTTAGGTTCGAATCCACTTGTTGTGAATGCTGAATGTTCTCGACGGATAAAAAATGCAGGAGAACGCCTCGAGGCTAACAGGTCAGAGGCATCATCTCTCTTGTTTTAC

Coding sequence (CDS)

ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACGGCTCCCGAGGTTGCGGAGATGGACGCTATATTGCAAGCACACAATAATACCATGCCAGCTCGGGAAGTTCTCGTTGCCCTTGCTGAGAAGTTCAGTGAATCGGTAGAACGGAAAGGGAAGATTGCTGTGCAAATGAAACAAGTTTGGAATTGGTTCCAGAATAGACGATATGCTATAAGAGCGAAGACAACCAAGGCTCCTGGAAAGTTAGCTGTCTCTCCAATTGTCCAAATCGAGTCAACTCCTGTGAGAAATGTGCCTCAAACCACAGTTGTTCCTGCTCCTGCACCAGTAGGATCTGCAAAGAGTGCTTCAGAAAATCCATCGTTGGAGTTCGAAGCTAAATCTGGGAGAGATGGTGCATGGTATGACGTTGCTACCTTTCTATCCCATAGATCTGTGGAAAGCGGTGACCTGGAAGTACTAGTCAGATTTTCTGGTTTTGGATCGGAGGAGGACGAGTGGGTTAATATCCGAAGGAACATTAGACCTCGTTCTTTTCCTTGTGAATCATCGGAATGCGTGGCAGTTCTTCCGGGTGATCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGAAGACATGACGTTCGAGGCTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAGTCTGAGGAAATCGTCCAGTTGAGAAAGATTTGTCGTCGGCCCGAGACTGATTACAGGTTGCAACAGCTTCATGCTGTAAATGAAGCAGCATCCACTGAGCCCTCAAAGTCTGGCTTGGATTCTGTACTGCTCAGCGGCCAGAGGATAAATTTTGAGGCAACACAAAAGCCAAATGCCAATATAAACGTCCATGCCCAAACTAATACTCAGGAAGGAAGGAGTACTGAAACTAACAGTGCTCCAACCACACTCAACTCTGGTAATTCTGCAGCTAGCTCTGCATTCTCGAGTGGTATCGTGACGTCGAACTCTGTTTCTGGATTGTCGGCTGACAATGTGTCTGATGGGAAGTTACTTAGCTGA

Protein sequence

MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKPNANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS
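
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. This assumes the standard genetic code; the helper name `translate` and the compact codon-table encoding are illustrative choices, and only the first ten codons of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons ordered by T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGGGTCGGCCTCCCAGCAATGGAGGCCCT"
print(translate(cds_start))
```

Passing the full CDS string instead of `cds_start` applies the same check to the whole protein.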
BLAST of Cp4.1LG14g03640 vs. Swiss-Prot
Match: SHH2_ARATH (Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana GN=SHH2 PE=2 SV=1)

HSP 1 Score: 373.2 bits (957), Expect = 3.1e-102
Identity = 206/353 (58.36%), Positives = 247/353 (69.97%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ 
Sbjct: 1   MGRPPSNGGPAFRFILPEVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQF 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVRNVPQTTVVP----------- 120
           KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R+V Q   VP           
Sbjct: 61  KQIWNWFQNRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPG 120

Query: 121 -APAPVGSA-----KSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSG 180
             PAP GS      +S S+N  LEFEAKS RDGAWYDV  FL+HR++E GD EV VRF+G
Sbjct: 121 MTPAPSGSLVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAG 180

Query: 181 FGSEEDEWVNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRR 240
           F  EEDEW+N+++++R RS PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRR
Sbjct: 181 FEVEEDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRR 240

Query: 241 HDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQQLH-AVNEAA-STEPSKSGLD 300
           HDVRGCRCRFLVRY HDQSEEIV LRKICRRPETDYRLQQLH AVN+ A S +     LD
Sbjct: 241 HDVRGCRCRFLVRYSHDQSEEIVPLRKICRRPETDYRLQQLHNAVNDLANSNQHQIPALD 300

Query: 301 SVLLSGQRINFEATQKPNANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASS 334
           +   +   +       P A + + A     E +    ++ P TL   +S A++
Sbjct: 301 AAAKTPLSL-------PGATVPIVA----PESKDPSLSATPATLVQPSSNAAT 342

BLAST of Cp4.1LG14g03640 vs. Swiss-Prot
Match: SHH1_ARATH (Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana GN=SHH1 PE=1 SV=1)

HSP 1 Score: 184.9 bits (468), Expect = 1.6e-45
Identity = 108/249 (43.37%), Positives = 149/249 (59.84%), Query Frame = 1

Query: 14  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RY 73
           FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++
Sbjct: 14  FTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKH 73

Query: 74  AIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTTVVPAPAPVGSAKS-ASENPSLE 133
             + K+   P     SP +QI      S+   N    T V     V + K  AS+   L 
Sbjct: 74  QSQPKSKTLP-----SPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKASDLADLA 133

Query: 134 FEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCES 193
           FEAKS RD AWYDV++FL++R + +G+LEV VRFSGF +  DEWVN++ ++R RS P E 
Sbjct: 134 FEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEP 193

Query: 194 SECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQ 253
           SEC  V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + 
Sbjct: 194 SECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLG 253

Query: 254 LRKICRRPE 256
           L +ICRRPE
Sbjct: 254 LERICRRPE 257

BLAST of Cp4.1LG14g03640 vs. TrEMBL
Match: A0A0A0LC67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236580 PE=4 SV=1)

HSP 1 Score: 601.7 bits (1550), Expect = 6.0e-169
Identity = 321/372 (86.29%), Positives = 332/372 (89.25%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEM+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSAS 120
           K      QNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRNVPQT VVPAPAPVGSAK A 
Sbjct: 61  K------QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD EVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           S PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKP--- 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQRINFE +Q P   
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 ---------NANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGL 360
                    N +IN HAQT+TQE R+TETN+APTT NS N A SSAFSSGIVT N+VS  
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVT-NTVSAG 360

BLAST of Cp4.1LG14g03640 vs. TrEMBL
Match: M5WG16_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007389mg PE=4 SV=1)

HSP 1 Score: 452.6 bits (1163), Expect = 4.5e-124
Identity = 245/369 (66.40%), Positives = 282/369 (76.42%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT  EV+EM+AILQ HNNTMPAREVLVALA+KFSES ERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTQSEVSEMEAILQQHNNTMPAREVLVALADKFSESAERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQ-----TTVVPAPAPVGS 120
           KQVWNWFQNRRYAIRAK++K  GKL VSP+ + +S PVRNVPQ        + AP+  GS
Sbjct: 61  KQVWNWFQNRRYAIRAKSSKVLGKLNVSPMSRDDSNPVRNVPQGPQPIAAPIHAPSAQGS 120

Query: 121 AKSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRR 180
            K ASEN   EFEAKSGRDGAWYDVA FLSHR +E+GD EVLVRF+GFG EEDEWVN+R+
Sbjct: 121 GKGASENSIFEFEAKSGRDGAWYDVANFLSHRYLETGDPEVLVRFAGFGPEEDEWVNVRK 180

Query: 181 NIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVR 240
           ++R RS PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVR
Sbjct: 181 HVRQRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVR 240

Query: 241 YDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSK-----SGLDSVLLSGQRIN 300
           Y HDQSEEIV LRK+CRRPETDYRLQQLHAVNEAAS E          + S  +  ++ N
Sbjct: 241 YVHDQSEEIVPLRKVCRRPETDYRLQQLHAVNEAASAEQKSMDHFMGSVTSAEMMQKQQN 300

Query: 301 FEATQKP---NANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSG 357
            +A   P   +AN ++  Q+ T E + +E ++  ++ NS     S+  +SG  T   V G
Sbjct: 301 TDAASAPPVLHANASLATQSTTPEFKGSEVSTVISSGNSNFPPGSAVITSGTATV-VVPG 360

BLAST of Cp4.1LG14g03640 vs. TrEMBL
Match: G7LDQ2_MEDTR (Sequence-specific DNA-binding transcription factor OS=Medicago truncatula GN=MTR_8g061040 PE=4 SV=1)

HSP 1 Score: 450.3 bits (1157), Expect = 2.2e-123
Identity = 244/375 (65.07%), Positives = 284/375 (75.73%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT PEV EM+AIL  HNN MPAR+VL ALA+KFSES +RKGKI VQM
Sbjct: 1   MGRPPSNGGPAFRFTQPEVTEMEAILSEHNNAMPARDVLQALADKFSESPDRKGKITVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGS----A 120
           KQVWNWFQN+RYAIRAK++K P KL ++P+ + + TP R + Q T  P PAP  S    A
Sbjct: 61  KQVWNWFQNKRYAIRAKSSKTPAKLNITPMPRTDLTPGRIMTQPTASPIPAPSASVQTTA 120

Query: 121 KSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRN 180
           K+A EN  +EFEAKSGRDGAWYDVATFLS+R +ES D EVLVRF+GFGSEEDEW+N+R+N
Sbjct: 121 KAAPENSVMEFEAKSGRDGAWYDVATFLSYRHLESSDPEVLVRFAGFGSEEDEWINVRKN 180

Query: 181 IRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRY 240
           +RPRS PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVRY
Sbjct: 181 VRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVRY 240

Query: 241 DHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLD-SVLLSGQRIN--FEA 300
           DHDQSEEIV LRKICRRPETDYRL QLHAVN+AA T+  K  LD    + G R+    E 
Sbjct: 241 DHDQSEEIVPLRKICRRPETDYRLHQLHAVNDAAPTDQQKIALDHPANVHGARVTNPSEM 300

Query: 301 TQKPNANINVH-----AQTNTQ---EGRSTETNSAPT--TLNSGNSAA-SSAFSSGIVTS 358
            QK     N+H      QTN     +  + +   A T   + +GNS   SSA  +GI+ +
Sbjct: 301 VQKQQQIANIHIVTPVLQTNVSIPPQSMNVDPMKAETKADVQAGNSVTPSSAAFTGIIAT 360

BLAST of Cp4.1LG14g03640 vs. TrEMBL
Match: W9RI10_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012170 PE=4 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 1.1e-122
Identity = 241/372 (64.78%), Positives = 276/372 (74.19%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPP NGGPAFRFTA EVAEM+AILQ HNNTMPARE+LV LA+KFSESVERKGKI VQM
Sbjct: 1   MGRPPGNGGPAFRFTASEVAEMEAILQEHNNTMPAREILVDLADKFSESVERKGKIMVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSAS 120
           KQVWNWFQNRRYAIRAK ++  G L+VS + + + TPVRNVPQ    P PAP G+ + AS
Sbjct: 61  KQVWNWFQNRRYAIRAKLSRNLGMLSVSSMPRDDPTPVRNVPQAITAPIPAPSGTGRGAS 120

Query: 121 ENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           EN  +EFEAKSGRDGAWYDVA F SHR +ESGD EVLVRF GFG E+DEWVNIR+++R R
Sbjct: 121 ENSIMEFEAKSGRDGAWYDVANFFSHRYLESGDPEVLVRFVGFGPEDDEWVNIRKHVRQR 180

Query: 181 SFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           S PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSG--QRINFEATQK-- 300
           SEEIV LRK+CRRPETDYRLQQL+AVNEAAS E  KS  D+    G   RI+ E T K  
Sbjct: 241 SEEIVPLRKVCRRPETDYRLQQLYAVNEAASAEQQKSSTDNFGGGGFRARISAETTPKLQ 300

Query: 301 --------PNANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSG- 358
                   P  +      T        +       +++GNS   +A  +GI++ +  S  
Sbjct: 301 HADAALVAPALHATAALATKASILEPKKVEIVNVVVDAGNSNNVTASGNGIMSGSPASNK 360

BLAST of Cp4.1LG14g03640 vs. TrEMBL
Match: A0A0R0H5Q7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_12G158000 PE=4 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 1.6e-121
Identity = 231/342 (67.54%), Positives = 263/342 (76.90%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT PEVAEM+AILQ HNN MP+R+VL  LAEKFSES +RKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTQPEVAEMEAILQEHNNAMPSRDVLTTLAEKFSESQDRKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE--STPVRNVPQTTVVPAPAPVGSA-- 120
           KQVWNWFQN+RYAIRAK++K PGKL ++P+ + +  STP+R++PQ     AP P  SA  
Sbjct: 61  KQVWNWFQNKRYAIRAKSSKTPGKLNITPMPRDDYNSTPIRSMPQQPTA-APIPAASATV 120

Query: 121 ----KSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVN 180
               K+  EN  LEFEAKSGRDGAWYDVATFLSHR +E+ D EVLVRF+GFG EEDEW+N
Sbjct: 121 PTAVKATPENSVLEFEAKSGRDGAWYDVATFLSHRYLETSDPEVLVRFAGFGPEEDEWIN 180

Query: 181 IRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRF 240
           IR+++RPRS PCESSECV V+PGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRF
Sbjct: 181 IRKHVRPRSLPCESSECVVVIPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRF 240

Query: 241 LVRYDHDQSEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSGQRINFE 300
           LVRYDHDQSEEIV LRKICRRPETDYRLQQLHAVNEAA  +  K+G+D            
Sbjct: 241 LVRYDHDQSEEIVPLRKICRRPETDYRLQQLHAVNEAAPMDQQKTGMD------------ 300

Query: 301 ATQKPNANINVHAQTNTQEG------RSTETNSAPTTLNSGN 329
               P AN+N    T T+        R+T T + P  L + N
Sbjct: 301 ----PAANVNAVRATTTETAANVNAVRATTTETVPKQLIAAN 325

BLAST of Cp4.1LG14g03640 vs. TAIR10
Match: AT3G18380.2 (AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding)

HSP 1 Score: 368.6 bits (945), Expect = 4.3e-102
Identity = 206/354 (58.19%), Positives = 247/354 (69.77%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF  PEV EM+AIL  HN  MP R +L ALA+KFSES ERKGK+ VQ 
Sbjct: 1   MGRPPSNGGPAFRFILPEVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQF 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIE-STPVRNVPQTTVVP----------- 120
           KQ+WNWFQNRRYA+RA+  KAPGKL VS + +++    +R+V Q   VP           
Sbjct: 61  KQIWNWFQNRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPG 120

Query: 121 -APAPVGSA-----KSASENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSG 180
             PAP GS      +S S+N  LEFEAKS RDGAWYDV  FL+HR++E GD EV VRF+G
Sbjct: 121 MTPAPSGSLVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAG 180

Query: 181 FGSEEDEWVNIRRNIRPRSFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRR 240
           F  EEDEW+N+++++R RS PCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRR
Sbjct: 181 FEVEEDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRR 240

Query: 241 HDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDYRLQQLH-AVNEAA-STEPSKSGL 300
           HDVRGCRCRFLVRY HDQSE EIV LRKICRRPETDYRLQQLH AVN+ A S +     L
Sbjct: 241 HDVRGCRCRFLVRYSHDQSEQEIVPLRKICRRPETDYRLQQLHNAVNDLANSNQHQIPAL 300

Query: 301 DSVLLSGQRINFEATQKPNANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASS 334
           D+   +   +       P A + + A     E +    ++ P TL   +S A++
Sbjct: 301 DAAAKTPLSL-------PGATVPIVA----PESKDPSLSATPATLVQPSSNAAT 343

BLAST of Cp4.1LG14g03640 vs. TAIR10
Match: AT1G15215.2 (AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors;sequence-specific DNA binding (TAIR:AT3G18380.1))

HSP 1 Score: 184.9 bits (468), Expect = 8.9e-47
Identity = 108/249 (43.37%), Positives = 149/249 (59.84%), Query Frame = 1

Query: 14  FTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNR-RY 73
           FT  E+ +M+ + +   +    ++    +A  FS SV R GK ++  KQV  WFQ + ++
Sbjct: 14  FTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKH 73

Query: 74  AIRAKTTKAPGKLAVSPIVQIE-----STPVRNVPQTTVVPAPAPVGSAKS-ASENPSLE 133
             + K+   P     SP +QI      S+   N    T V     V + K  AS+   L 
Sbjct: 74  QSQPKSKTLP-----SPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKASDLADLA 133

Query: 134 FEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCES 193
           FEAKS RD AWYDV++FL++R + +G+LEV VRFSGF +  DEWVN++ ++R RS P E 
Sbjct: 134 FEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEP 193

Query: 194 SECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQ 253
           SEC  V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + 
Sbjct: 194 SECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLG 253

Query: 254 LRKICRRPE 256
           L +ICRRPE
Sbjct: 254 LERICRRPE 257

BLAST of Cp4.1LG14g03640 vs. NCBI nr
Match: gi|778680368|ref|XP_011651298.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 624.8 bits (1610), Expect = 9.5e-176
Identity = 327/372 (87.90%), Positives = 338/372 (90.86%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEM+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSAS 120
           KQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRNVPQT VVPAPAPVGSAK A 
Sbjct: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD EVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           S PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKP--- 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQRINFE +Q P   
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 ---------NANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGL 360
                    N +IN HAQT+TQE R+TETN+APTT NS N A SSAFSSGIVT N+VS  
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVT-NTVSAG 360

BLAST of Cp4.1LG14g03640 vs. NCBI nr
Match: gi|659111991|ref|XP_008456010.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo])

HSP 1 Score: 615.1 bits (1585), Expect = 7.5e-173
Identity = 326/383 (85.12%), Positives = 337/383 (87.99%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEM+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSAS 120
           KQVWNWFQNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRNVPQT VVPAP PVG+AKSA 
Sbjct: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAP 120

Query: 121 ENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD EVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           S PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKP--- 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQRINFE  Q P   
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK 300

Query: 301 ---------NANINVHAQTNTQEGRSTETN-----------SAPTTLNSGNSAASSAFSS 360
                    N +IN HAQT+TQE R+TETN           +APTT NS N A SSAFSS
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSS 360

BLAST of Cp4.1LG14g03640 vs. NCBI nr
Match: gi|700202508|gb|KGN57641.1| (hypothetical protein Csa_3G236580 [Cucumis sativus])

HSP 1 Score: 601.7 bits (1550), Expect = 8.6e-169
Identity = 321/372 (86.29%), Positives = 332/372 (89.25%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEM+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKAPGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSAS 120
           K      QNRRYAIRAKT+KAPGKLAVSP+VQIESTPVRNVPQT VVPAPAPVGSAK A 
Sbjct: 61  K------QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPSLEFEAKSGRDGAWYDVATFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPR 180
           ENP  EFEAKSGRDGAWYDVATFLSHRSVESGD EVLVRFSGFGSEEDEWVNIRRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SFPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           S PCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKP--- 300
           SEEIVQLRKICRRPETDYRLQQLHAVNEAAS EPSKSG+DSVLLSGQRINFE +Q P   
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 ---------NANINVHAQTNTQEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGL 360
                    N +IN HAQT+TQE R+TETN+APTT NS N A SSAFSSGIVT N+VS  
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSAFSSGIVT-NTVSAG 360

BLAST of Cp4.1LG14g03640 vs. NCBI nr
Match: gi|778680371|ref|XP_011651299.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 584.3 bits (1505), Expect = 1.4e-163
Identity = 307/351 (87.46%), Positives = 318/351 (90.60%), Query Frame = 1

Query: 22  MDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKA 81
           M+AILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KA
Sbjct: 1   MEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 60

Query: 82  PGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVA 141
           PGKLAVSP+VQIESTPVRNVPQT VVPAPAPVGSAK A ENP  EFEAKSGRDGAWYDVA
Sbjct: 61  PGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA 120

Query: 142 TFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLPGDLILCF 201
           TFLSHRSVESGD EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSECVAVLPGDLILCF
Sbjct: 121 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 180

Query: 202 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 261
           QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ
Sbjct: 181 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 240

Query: 262 QLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKP------------NANINVHAQTNT 321
           QLHAVNEAAS EPSKSG+DSVLLSGQRINFE +Q P            N +IN HAQT+T
Sbjct: 241 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTST 300

Query: 322 QEGRSTETNSAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKLLS 361
           QE R+TETN+APTT NS N A SSAFSSGIVT N+VS  SADNVSDGKLLS
Sbjct: 301 QEARNTETNTAPTTFNSANLAGSSAFSSGIVT-NTVSAGSADNVSDGKLLS 350

BLAST of Cp4.1LG14g03640 vs. NCBI nr
Match: gi|659111993|ref|XP_008456011.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo])

HSP 1 Score: 574.7 bits (1480), Expect = 1.1e-160
Identity = 306/362 (84.53%), Positives = 317/362 (87.57%), Query Frame = 1

Query: 22  MDAILQAHNNTMPAREVLVALAEKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKA 81
           M+ ILQ HNNTMPAREVLVALA+KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+KA
Sbjct: 1   METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 60

Query: 82  PGKLAVSPIVQIESTPVRNVPQTTVVPAPAPVGSAKSASENPSLEFEAKSGRDGAWYDVA 141
           PGKLAVSP+VQIESTPVRNVPQT VVPAP PVG+AKSA ENP  EFEAKSGRDGAWYDVA
Sbjct: 61  PGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVA 120

Query: 142 TFLSHRSVESGDLEVLVRFSGFGSEEDEWVNIRRNIRPRSFPCESSECVAVLPGDLILCF 201
           TFLSHRSVESGD EVLVRFSGFGSEEDEWVNIRRNIRPRS PCESSECVAVLPGDLILCF
Sbjct: 121 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 180

Query: 202 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 261
           QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ
Sbjct: 181 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 240

Query: 262 QLHAVNEAASTEPSKSGLDSVLLSGQRINFEATQKP------------NANINVHAQTNT 321
           QLHAVNEAAS EPSKSG+DSVLLSGQRINFE  Q P            N +IN HAQT+T
Sbjct: 241 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST 300

Query: 322 QEGRSTETN-----------SAPTTLNSGNSAASSAFSSGIVTSNSVSGLSADNVSDGKL 361
           QE R+TETN           +APTT NS N A SSAFSSGIVT N+VSG SADNVSDGKL
Sbjct: 301 QEARNTETNTAPITFSSGNHNAPTTFNSANLAGSSAFSSGIVT-NTVSGGSADNVSDGKL 360

The following BLAST results are available for this feature:

Database: Swiss-Prot
Match Name  E-value   Identity (%)  Description
SHH2_ARATH  3.1e-102  58.36         Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana GN=SHH2 PE=2 SV=1
SHH1_ARATH  1.6e-45   43.37         Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana GN=SHH1 PE=1 SV=1

Database: TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LC67_CUCSA  6.0e-169  86.29         Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236580 PE=4 SV=1
M5WG16_PRUPE      4.5e-124  66.40         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007389mg PE=4 SV=1
G7LDQ2_MEDTR      2.2e-123  65.07         Sequence-specific DNA-binding transcription factor OS=Medicago truncatula GN=MTR...
W9RI10_9ROSA      1.1e-122  64.78         Uncharacterized protein OS=Morus notabilis GN=L484_012170 PE=4 SV=1
A0A0R0H5Q7_SOYBN  1.6e-121  67.54         Uncharacterized protein OS=Glycine max GN=GLYMA_12G158000 PE=4 SV=1

Database: TAIR10
Match Name   E-value   Identity (%)  Description
AT3G18380.2  4.3e-102  58.19         sequence-specific DNA binding transcription factors;sequence-specifi...
AT1G15215.2  8.9e-47   43.37         BEST Arabidopsis thaliana protein match is: sequence-specific DNA bi...

Database: NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|778680368|ref|XP_011651298.1|  9.5e-176  87.90         PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis sativus]
gi|659111991|ref|XP_008456010.1|  7.5e-173  85.12         PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo]
gi|700202508|gb|KGN57641.1|       8.6e-169  86.29         hypothetical protein Csa_3G236580 [Cucumis sativus]
gi|778680371|ref|XP_011651299.1|  1.4e-163  87.46         PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis sativus]
gi|659111993|ref|XP_008456011.1|  1.1e-160  84.53         PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo]

The following terms have been associated with this gene:

Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0003682  chromatin binding

Vocabulary: INTERPRO
Term       Definition
IPR009057  Homeobox-like_sf
IPR001356  Homeobox_dom

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0000785      chromatin
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003682      chromatin binding
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cp4.1LG14g03640.1  Cp4.1LG14g03640.1  mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term   IPR Description   Source   Source Term       Source Description                     Alignment
IPR001356  Homeobox domain   SMART    SM00389           HOX_1                                  coord: 7..81; score: 0.
IPR001356  Homeobox domain   PROFILE  PS50071           HOMEOBOX_2                             coord: 13..77; score: 9
IPR009057  Homeodomain-like  GENE3D   G3DSA:1.10.10.60                                         coord: 13..80; score: 8.
IPR009057  Homeodomain-like  unknown  SSF46689          Homeodomain-like                       coord: 4..76; score: 9.84
None       No IPR available  PANTHER  PTHR33827         FAMILY NOT NAMED                       coord: 2..311; score: 2.3E
None       No IPR available  PANTHER  PTHR33827:SF3     PROTEIN SAWADEE HOMEODOMAIN HOMOLOG 2  coord: 2..311; score: 2.3E