Cp4.1LG01g01690.1 (mRNA) Cucurbita pepo (Zucchini)

NameCp4.1LG01g01690.1
TypemRNA
OrganismCucurbita pepo (Cucurbita pepo (Zucchini))
DescriptionSequence-specific DNA-binding transcription factor
LocationCp4.1LG01 : 2887891 .. 2893113 (-)
Sequence length1702
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGGTTTTAGTTCTTTTTTGAGGTACACAATTTTGATATTTGGTACGGAATCACCTGGGCATTTTGTTTTGCCCCCCAAAACCATGAATTTTGGGCTTAAAGCTTCAATTTCTCTCTCTATCATCATCATCATATATGTGTATATATATATGTATCGATGCACATTTATTTATATAGTCGCTCAAAGTTTTCCCGAAATAACGAATTTCTGAAGACGGGATCTTCAATTCTGAATTTTTTCTTGGTACTTTTTTCTCAGTTTCTCTCTTTGATTCTTTGATAGTGACGGCGAAATCAGTGAAACCACACAGGGAAGCTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACTGCTCCCGAGGTTTTTCTCCGCTTCTTCTTCTTCTACTTCCTCCGCTTTTGTTCGTTTTCTTTATCTGAATGATGCGTTTGTGATGCATGTTGATTTTGGTGCTGGAAATTTTTCTTAGCTCAATTTCTGTGTTTATTGGCTTGATTGTTTGTTGATTCTTGTGGTTGTTTTGTTCGTTTCATGGAGGGTTCTAATCGTGTTCTTCTGAACTGATTGACTGTTCTTCTGCTTTCTTTTTTGATTATTTTCCATTCCATTTCTTGTTCTACTGGTAATTGTGCTTTTCCCCTTTTGAATTTTGATGTCTGTTTCCTAATTACTGCATAGAGTTGATGCTGTGGTTAGTTTAGGTAGCTGCATGTAAATAAGAACTGTTGAAAGGGGGCTTGTGGGGCTTGTTGGATAGCGATAATTTGGAAGAAGAGTGTCCTTTTCGAGCTTGAGTTTTGACAAAATTTGTATAGATCTAATTCAATCTGCTAGAAGTTCATTAGTTTTGCTGCTGGATGCTGTTTATTCTACAAGGAATTGATAATGACTTTAATCAAATGAAGTTTTTATTGCTTGCTCTAAGGCTATTACTATAGTTCTTTCTTGACTTAGTACTCAGATAGGGTACGCTTCCTCGAAGCTGCTGGTGCTGTAACGTTCGGTTCTTATTTTGATTAAGGTTGCGGAGATGGAGGCTATACTGCAAGGACACAATAATACCATGCCGTCTCGGGAAGTTCTGGTTTCTCTTGCTGGGAAGTTCAGGTGGACATTCTACATTGCTTACTGTGACTTTTATATGATTTGAATTGCTCTTTTGGTGTAGAGAATAATTTGTTTATGTTGTTTGTAGTGAATCGGTCGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTATGGTGGCCGTACTTGACGTTTATATGATAGGTAATGGTGATTTACTAATTATTGTTGTTATTTTGTAGGTTTGGAATTGGTTCCAGAATAGGCGATATGCTATCAGAGCAAAGACAACAAAGGTTCCCGGAAAGTTGGCTGGCTCTCCAATTGTCCAAGTCGAGTCAACACCCCCGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCCACACTAGTAGGTATTAAAAAAATCGCTATGTTTTATCAATGTTTAAATATTTTCATTTTCTTATCCTTTTTTTTTCTCTTATCATATGTTTGTTATTAGCTTATTGTTACGAAATCTATCGTAGTAGTAGAATGGCATCTGCTAAACGCCTCTGTCTGGGCTAACATCATCATTTCAAAACTTTTCCTCTTTTCGTCGCTAGTCAGGTGAGGAGGTATATCTGGAAGCATGGGAGCTTTTATGGCCTTTAATCTTCATTTTAAGTACTGTTCTTATGAGATTTCTATGAAAAAACCAATTGCAAAGTGGCTGCTTGCAAGTTTCCTACGTTCAAAAGATCCTACTCTGTACAAAAAGAAGTTAGATGTTTTTTATAATGCCAGGGTTTATCGCACAAGTATTGTTTTTTCAATTGGAACGTATTGTTTATGGTATTCGAGCAAAAAGTTTGTATGGTGTAACAACCCAAGCCCATTGCTAGTAGATATTATCTGTTTTGGCCTGTTATGTATTGTTGTCAGCCTCACGGTTTTAAAGCGCGTCTACTAGGGAGGTTTCCATACCCTTATAAGGAATGTTTCATTTCCCTCTCCAACCAATGTGGGATCTCACAATCCACCCCTACCCCTAGGGGCTCAGCATCCTCACTAGCACACCGTCCGGTGTCTAGCTCTGATACCATTTGTAACATCTCAAATCTACCGCTAGTAGGTAGTGTCCGCTTTGACCCATTATGTATCATCGTTAGCCTCATGGTTTTAAATACATCTACTAGGGAGAGATTTCAACCCTCTTATAAGGAATGGTTTGTTCCCCTCTCCAACCAAGGTGAGATCTCACAATCTACTCCCCTTGGTGGCTCGGCGTCTTTGTTGGCACACCGCCTGGTGTCTGTGTCTAATATCATTTGTTGCCCAAGCCCACCACTAGTAGATATTGTTCTTGGCTCGTTACGTATCGCCGTTAACTTCACGGTTTTAAAATGTATCTACTAGGATGATGTTTCTATACCCTTATAAGGAATGCTTCGTTCCCATTTCCAACCAATGTGAGATCTTAGTCTTGATATACCTTATTCCATTATATTGTTGTAGTTAACTTAAATCTCCAAATATGTTATAAAAAGTATGAGAAATCGATGATAAATTGTTTTTCTGTAAATCATGGGCTGACCATGTTTATGGCCTGTGAAGAACCATAATCTTTAAAATGACATGGTGTGGAAAGATGTGAACCACAGATACATTAGTTTGGATAGCATGTGATTCTAGATTTGCTTTTTAAAGATGATTCATATTTAAACATGTTGGTATCAACTTCTTTTAATTTTTCATTCAATTTCTGTACCAAATTCCAGGACCCCTAAAGCGTGCTTCAGAAAATCCACTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTTAGTATAGCTTGTCTTCTGCTTTAGGATGATAAACTTCTTCACGACAGGCGTAGATGATTGATTCTTTTAACTGTTCATGATATGTTTCAGGTATGACGTTGCTACCTTTTTATCCCATAGATCTGTGGAAAGTGGTGACCCGGTAATTCTGCACTCATTCCCTTGCACGCACGTTATTCTTCAACCTATTTTCTGATCAGTTATTCATGCTCGAATTATTTTGCTTTATGAAAACCGAACACGGAAGTATTAAGCCAAATAGTTCTTGTACGGCCGCTCTCTCCATTTATAGAAAAACATTACAGTTTGTGGTCTCTACTGTCTTGTACTGTGTTTGTGGGTTCAAGATCTTTTGGATCACATTCTCAATCCTATGAGTTTCTAAGTATTTTATGAGTTCTCGAAATATTTTGACTTGTGAGAGTTAATATTGCCCTGGATTTTTCTCCACCCAATCCTTTTCTCCTCACTACCTCTGAGTTTGCAACATCATATTGAATGTTAACCAACAACACCTTGTGTAACCGCCCAAGCCCACCGCTAGCGGATATGGTCCTTTTTGGGCTTTCCCTCAAGGTTTTTAAAACGCGTCTGCTAGGGAGAGGTTTCCACACCCTTATAAAGATGACTTCGTTCTCTTCCCCAACAAATGTGGAATCTCACACATTATTTGGCCAATATTTTTGTGTTTCTCTAAAATGCTCCAAGATATGTAGTTGATTGTAACTCAAAAGTTCATCTCCGATTCGTTTTTTCCGGTAGAACAAATGTTGGGGTAGTAGGGGTTTGATCCTTGAACTTCAAAGGAACGAGTAGATGCCTTAACCATTGGGCTATGCTTATGATTTGTAAAGATATCACATGCTTCGTTTTGTTTGCGTTCATCTGAGTCCCTGCATTGCCTAAAAATTGTTGTTTGCCAGGAAGTACTAGTTAGATTTGCTGGTTTTGGATCAGATGAGGATGAGTGGGTTAATGTCAGAAGGAACATTAGACCTCGTTCGCTACCTTGTGAATCTTCAGAATGTGTGGCAGTTCTTCCAGGCGACCTCATCTTATGCTTTCAGGTAAAAACTTTTATTTTCCCTTGAAGCGAGGCCTTTCCATCACCTTCTTGCTATCGATTCAGGATAATCTCGTTTACTAGATCTTTCGATATTGGACATAATTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGACGACATGATGTACGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAATCTGAGGTGCGTATCTTGTTCATGAACGCAATACTGCATGAATTGTGATATGTTCATAAACGTCTAACTTAATCATGCACTCAACAAATGCTGTTTAGGATTCTATAATTGACTTGGCTTGCTCTAAATGAATGTTAGGAAATTGTTCAGTTGAGAAAGATTTGTCGTCGGCCTGAGACCGACCACAGGTTGCTACAGCTTCATGCTGTAAACGAAGCAGCATCGATGGAGCCCTCACGATCTGGCATGGATTCTGTACTGCTCAGTGGTCAGACGATAAGTTTCGAGGCAACCCAAAAGCTACTCAACAAGGATGCAACCATCGTTATACCAAATGCAAATGCAAATATAAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACAATAACTAACAGCGCCCCAGGCGTATTCAACGCTGGTAATCACGCAGGTAGCTCTGCTTCTCGAGCGGTATCATGACCAGCTCTGTGTCTGGTATGTCAGCAGACAATGTGTCTGAATGGGGAAGTTAGTTGATTATGGAAAAACTTCCTTCCATCAGTCTAATCTTAACCGAACCTATCAATTTAAAATTTTGCCCGACTCGTTTTTTTAGGATGAGTAAATACACAGTGAAGTCTGTTTTTGCCCCATGTTTCAAAGTGATAGCTACCCATCCACTTGCTGTGAATGCTGAACGTTCTCGACCGATGAAACATGCAGGAGCCACACCTCGATCGTAGTATCGAACAGGTCAGAGGCATCATCTCTCTTGTTTTACTTTTTTCTTCTCTGCTTAGCCAAGCCAAACCAAACCAAACCAAACCATGGCTCTACACTGATCATATATCTTACTGCTTAGTTTTCCCATAGAGCTTTGCTGATTACCAAATGTTACTAACGATATCGATGCACTCGACAAAGGGTTCTCTTCGAATCGTTAAATGCCATGTGAAAAACTGGACTTGTTTGTGTACCTTTGGCTGAAGGGCAGTCAACTTAATGCATTACAGGAACCTTTAGATATACGTTCTTTCTGTTCT

mRNA sequence

TGGGTTTTAGTTCTTTTTTGAGGTACACAATTTTGATATTTGTGACGGCGAAATCAGTGAAACCACACAGGGAAGCTTAGCTTATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACTGCTCCCGAGGTTGCGGAGATGGAGGCTATACTGCAAGGACACAATAATACCATGCCGTCTCGGGAAGTTCTGGTTTCTCTTGCTGGGAAGTTCAGTGAATCGGTCGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGGCGATATGCTATCAGAGCAAAGACAACAAAGGTTCCCGGAAAGTTGGCTGGCTCTCCAATTGTCCAAGTCGAGTCAACACCCCCGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCCACACTAGTAGGACCCCTAAAGCGTGCTTCAGAAAATCCACTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCCCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTTAGATTTGCTGGTTTTGGATCAGATGAGGATGAGTGGGTTAATGTCAGAAGGAACATTAGACCTCGTTCGCTACCTTGTGAATCTTCAGAATGTGTGGCAGTTCTTCCAGGCGACCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGACGACATGATGTACGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGTCGTCGGCCTGAGACCGACCACAGGTTGCTACAGCTTCATGCTGTAAACGAAGCAGCATCGATGGAGCCCTCACGATCTGGCATGGATTCTGTACTGCTCAGTGGTCAGACGATAAGTTTCGAGGCAACCCAAAAGCTACTCAACAAGGATGCAACCATCGTTATACCAAATGCAAATGCAAATATAAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACAATAACTAACAGCGCCCCAGGCGTATTCAACGCTGGTAATCACGCAGGTAGCTCTGCTTCTCGAGCGGTATCATGACCAGCTCTGTGTCTGGTATGTCAGCAGACAATGTGTCTGAATGGGGAAGTTAGTTGATTATGGAAAAACTTCCTTCCATCAGTCTAATCTTAACCGAACCTATCAATTTAAAATTTTGCCCGACTCGTTTTTTTAGGATGAGTAAATACACAGTGAAGTCTGTTTTTGCCCCATGTTTCAAAGTGATAGCTACCCATCCACTTGCTGTGAATGCTGAACGTTCTCGACCGATGAAACATGCAGGAGCCACACCTCGATCGTAGTATCGAACAGGTCAGAGGCATCATCTCTCTTGTTTTACTTTTTTCTTCTCTGCTTAGCCAAGCCAAACCAAACCAAACCAAACCATGGCTCTACACTGATCATATATCTTACTGCTTAGTTTTCCCATAGAGCTTTGCTGATTACCAAATGTTACTAACGATATCGATGCACTCGACAAAGGGTTCTCTTCGAATCGTTAAATGCCATGTGAAAAACTGGACTTGTTTGTGTACCTTTGGCTGAAGGGCAGTCAACTTAATGCATTACAGGAACCTTTAGATATACGTTCTTTCTGTTCT

Coding sequence (CDS)

ATGGGTCGGCCTCCCAGCAATGGAGGCCCTGCCTTCCGTTTCACTGCTCCCGAGGTTGCGGAGATGGAGGCTATACTGCAAGGACACAATAATACCATGCCGTCTCGGGAAGTTCTGGTTTCTCTTGCTGGGAAGTTCAGTGAATCGGTCGAACGGAAAGGGAAGATTGCTGTGCAAATGAAGCAAGTTTGGAATTGGTTCCAGAATAGGCGATATGCTATCAGAGCAAAGACAACAAAGGTTCCCGGAAAGTTGGCTGGCTCTCCAATTGTCCAAGTCGAGTCAACACCCCCGAGAAATGTGCCTCAAACCATAGTTGTTCCTGCTCCCACACTAGTAGGACCCCTAAAGCGTGCTTCAGAAAATCCACTGTCGGAATTTGAAGCTAAATCTGGGAGGGATGGTGCATGGTATGACGTTGCTACCTTTTTATCCCATAGATCTGTGGAAAGTGGTGACCCGGAAGTACTAGTTAGATTTGCTGGTTTTGGATCAGATGAGGATGAGTGGGTTAATGTCAGAAGGAACATTAGACCTCGTTCGCTACCTTGTGAATCTTCAGAATGTGTGGCAGTTCTTCCAGGCGACCTCATCTTATGCTTTCAGGAGGGTAAAGAGCAGGCACTTTACTTTGATGCCCATGTGCTTGATACACAAAGAAGACGACATGATGTACGAGGTTGTCGCTGCAGGTTTTTGGTCCGTTATGATCACGATCAATCTGAGGAAATTGTTCAGTTGAGAAAGATTTGTCGTCGGCCTGAGACCGACCACAGGTTGCTACAGCTTCATGCTGTAAACGAAGCAGCATCGATGGAGCCCTCACGATCTGGCATGGATTCTGTACTGCTCAGTGGTCAGACGATAAGTTTCGAGGCAACCCAAAAGCTACTCAACAAGGATGCAACCATCGTTATACCAAATGCAAATGCAAATATAAATGTCCATGCCCAAACTAGTACTCAGGAAGCAAGGAATACAATAACTAACAGCGCCCCAGGCGTATTCAACGCTGGTAATCACGCAGGTAGCTCTGCTTCTCGAGCGGTATCATGA

Protein sequence

MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRASENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNKDATIVIPNANANINVHAQTSTQEARNTITNSAPGVFNAGNHAGSSASRAVS
BLAST of Cp4.1LG01g01690.1 vs. Swiss-Prot
Match: SHH2_ARATH (Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana GN=SHH2 PE=2 SV=1)

HSP 1 Score: 378.3 bits (970), Expect = 9.4e-104
Identity = 200/340 (58.82%), Postives = 241/340 (70.88%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF  PEV EMEAIL  HN  MP R +L +LA KFSES ERKGK+ VQ 
Sbjct: 1   MGRPPSNGGPAFRFILPEVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQF 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVEST-------PPRNVPQTI-------- 120
           KQ+WNWFQNRRYA+RA+  K PGKL  S + +++          P +VP+T         
Sbjct: 61  KQIWNWFQNRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPG 120

Query: 121 VVPAPT---LVGPLKRASENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAG 180
           + PAP+   + G ++  S+N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRFAG
Sbjct: 121 MTPAPSGSLVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAG 180

Query: 181 FGSDEDEWVNVRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRR 240
           F  +EDEW+NV++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRR
Sbjct: 181 FEVEEDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRR 240

Query: 241 HDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDHRLLQLH-AVNEAASMEPSR----- 300
           HDVRGCRCRFLVRY HDQSEEIV LRKICRRPETD+RL QLH AVN+ A+    +     
Sbjct: 241 HDVRGCRCRFLVRYSHDQSEEIVPLRKICRRPETDYRLQQLHNAVNDLANSNQHQIPALD 300

Query: 301 -SGMDSVLLSGQTISFEATQ----KLLNKDATIVIPNANA 312
            +    + L G T+   A +     L    AT+V P++NA
Sbjct: 301 AAAKTPLSLPGATVPIVAPESKDPSLSATPATLVQPSSNA 340

BLAST of Cp4.1LG01g01690.1 vs. Swiss-Prot
Match: SHH1_ARATH (Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana GN=SHH1 PE=1 SV=1)

HSP 1 Score: 183.7 bits (465), Expect = 3.4e-45
Identity = 106/249 (42.57%), Postives = 150/249 (60.24%), Query Frame = 1

Query: 14  FTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQMKQVWNWFQNR-RY 73
           FT  E+ +ME + +   +    ++   ++A  FS SV R GK ++  KQV  WFQ + ++
Sbjct: 14  FTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKH 73

Query: 74  AIRAKTTKVPGKLAGSPIVQVE-----STPPRNVPQTIVVPAPTLVGPLK-RASENPLSE 133
             + K+  +P     SP +Q+      S+   N      V   T V   K +AS+     
Sbjct: 74  QSQPKSKTLP-----SPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKASDLADLA 133

Query: 134 FEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPRSLPCES 193
           FEAKS RD AWYDV++FL++R + +G+ EV VRF+GF +  DEWVNV+ ++R RS+P E 
Sbjct: 134 FEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEP 193

Query: 194 SECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQ 253
           SEC  V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + 
Sbjct: 194 SECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLG 253

Query: 254 LRKICRRPE 256
           L +ICRRPE
Sbjct: 254 LERICRRPE 257

BLAST of Cp4.1LG01g01690.1 vs. TrEMBL
Match: A0A0A0LC67_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236580 PE=4 SV=1)

HSP 1 Score: 585.5 bits (1508), Expect = 4.3e-164
Identity = 297/346 (85.84%), Postives = 316/346 (91.33%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEMEAILQGHNNTMP+REVLV+LA KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRAS 120
           K      QNRRYAIRAKT+K PGKLA SP+VQ+ESTP RNVPQT+VVPAP  VG  K A 
Sbjct: 61  K------QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRF+GFGS+EDEWVN+RRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNK 300
           SEEIVQLRKICRRPETD+RL QLHAVNEAAS+EPS+SGMDSVLLSGQ I+FE +Q  L+K
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 DATIVIPNANANINVHAQTSTQEARNTITNSAPGVFNAGNHAGSSA 347
           DA +VIPNAN +IN HAQTSTQEARNT TN+AP  FN+ N AGSSA
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSA 340

BLAST of Cp4.1LG01g01690.1 vs. TrEMBL
Match: M5WG16_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007389mg PE=4 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 1.8e-125
Identity = 246/359 (68.52%), Postives = 278/359 (77.44%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT  EV+EMEAILQ HNNTMP+REVLV+LA KFSES ERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTQSEVSEMEAILQQHNNTMPAREVLVALADKFSESAERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQ---TIVVP--APTLVGP 120
           KQVWNWFQNRRYAIRAK++KV GKL  SP+ + +S P RNVPQ    I  P  AP+  G 
Sbjct: 61  KQVWNWFQNRRYAIRAKSSKVLGKLNVSPMSRDDSNPVRNVPQGPQPIAAPIHAPSAQGS 120

Query: 121 LKRASENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRR 180
            K ASEN + EFEAKSGRDGAWYDVA FLSHR +E+GDPEVLVRFAGFG +EDEWVNVR+
Sbjct: 121 GKGASENSIFEFEAKSGRDGAWYDVANFLSHRYLETGDPEVLVRFAGFGPEEDEWVNVRK 180

Query: 181 NIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVR 240
           ++R RSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVR
Sbjct: 181 HVRQRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVR 240

Query: 241 YDHDQSEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQ 300
           Y HDQSEEIV LRK+CRRPETD+RL QLHAVNEAAS E  +  MD  +  G   S E  Q
Sbjct: 241 YVHDQSEEIVPLRKVCRRPETDYRLQQLHAVNEAASAE--QKSMDHFM--GSVTSAEMMQ 300

Query: 301 KLLNKDATIVIPNANANINVHAQTSTQEAR----NTITNSAPGVFNAGNHAGSSASRAV 351
           K  N DA    P  +AN ++  Q++T E +    +T+ +S    F  G+   +S +  V
Sbjct: 301 KQQNTDAASAPPVLHANASLATQSTTPEFKGSEVSTVISSGNSNFPPGSAVITSGTATV 355

BLAST of Cp4.1LG01g01690.1 vs. TrEMBL
Match: W9RI10_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_012170 PE=4 SV=1)

HSP 1 Score: 454.5 bits (1168), Expect = 1.2e-124
Identity = 239/350 (68.29%), Postives = 269/350 (76.86%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPP NGGPAFRFTA EVAEMEAILQ HNNTMP+RE+LV LA KFSESVERKGKI VQM
Sbjct: 1   MGRPPGNGGPAFRFTASEVAEMEAILQEHNNTMPAREILVDLADKFSESVERKGKIMVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRAS 120
           KQVWNWFQNRRYAIRAK ++  G L+ S + + + TP RNVPQ I  P P   G  + AS
Sbjct: 61  KQVWNWFQNRRYAIRAKLSRNLGMLSVSSMPRDDPTPVRNVPQAITAPIPAPSGTGRGAS 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPR 180
           EN + EFEAKSGRDGAWYDVA F SHR +ESGDPEVLVRF GFG ++DEWVN+R+++R R
Sbjct: 121 ENSIMEFEAKSGRDGAWYDVANFFSHRYLESGDPEVLVRFVGFGPEDDEWVNIRKHVRQR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSG--QTISFEATQKLL 300
           SEEIV LRK+CRRPETD+RL QL+AVNEAAS E  +S  D+    G    IS E T KL 
Sbjct: 241 SEEIVPLRKVCRRPETDYRLQQLYAVNEAASAEQQKSSTDNFGGGGFRARISAETTPKLQ 300

Query: 301 NKDATIVIPNANANINVHAQTSTQEARNT-ITNSAPGVFNAGNHAGSSAS 348
           + DA +V P  +A   +  + S  E +   I N    V +AGN    +AS
Sbjct: 301 HADAALVAPALHATAALATKASILEPKKVEIVNV---VVDAGNSNNVTAS 347

BLAST of Cp4.1LG01g01690.1 vs. TrEMBL
Match: A0A0B2PMB5_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_048524 PE=4 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 2.7e-121
Identity = 224/330 (67.88%), Postives = 258/330 (78.18%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT PEVAEMEAILQ HNN MPSR+VL +LA KFSES +RKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTQPEVAEMEAILQEHNNAMPSRDVLTTLAEKFSESQDRKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVE--STPPRNVPQ---TIVVPAPTLVGP 120
           KQVWNWFQN+RYAIRAK++K PGKL  +P+ + +  STP R++PQ      +PA +   P
Sbjct: 61  KQVWNWFQNKRYAIRAKSSKTPGKLNITPMPRDDYNSTPIRSMPQQPTAAPIPAASATVP 120

Query: 121 --LKRASENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNV 180
             +K   EN + EFEAKSGRDGAWYDVATFLSHR +E+ DPEVLVRFAGFG +EDEW+N+
Sbjct: 121 TAVKATPENSVLEFEAKSGRDGAWYDVATFLSHRYLETSDPEVLVRFAGFGPEEDEWINI 180

Query: 181 RRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFL 240
           R+++RPRSLPCESSECV V+PGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFL
Sbjct: 181 RKHVRPRSLPCESSECVVVIPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFL 240

Query: 241 VRYDHDQSEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSV----LLSGQTI 300
           VRYDHDQSEEIV LRKICRRPETD+RL QLHAVNEAA M+  ++GMD       +   T 
Sbjct: 241 VRYDHDQSEEIVPLRKICRRPETDYRLQQLHAVNEAAPMDQQKTGMDPAANVNAVRATTT 300

Query: 301 SFEATQKLLNKDATIVIPNANANINVHAQT 320
              A    +    T  +P      N+H +T
Sbjct: 301 ETAANVNAVRATTTETVPKQLIAANIHMET 330

BLAST of Cp4.1LG01g01690.1 vs. TrEMBL
Match: A0A0R0H5Q7_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_12G158000 PE=4 SV=1)

HSP 1 Score: 443.4 bits (1139), Expect = 2.7e-121
Identity = 224/330 (67.88%), Postives = 258/330 (78.18%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFT PEVAEMEAILQ HNN MPSR+VL +LA KFSES +RKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTQPEVAEMEAILQEHNNAMPSRDVLTTLAEKFSESQDRKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVE--STPPRNVPQ---TIVVPAPTLVGP 120
           KQVWNWFQN+RYAIRAK++K PGKL  +P+ + +  STP R++PQ      +PA +   P
Sbjct: 61  KQVWNWFQNKRYAIRAKSSKTPGKLNITPMPRDDYNSTPIRSMPQQPTAAPIPAASATVP 120

Query: 121 --LKRASENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNV 180
             +K   EN + EFEAKSGRDGAWYDVATFLSHR +E+ DPEVLVRFAGFG +EDEW+N+
Sbjct: 121 TAVKATPENSVLEFEAKSGRDGAWYDVATFLSHRYLETSDPEVLVRFAGFGPEEDEWINI 180

Query: 181 RRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFL 240
           R+++RPRSLPCESSECV V+PGDLILCFQEGKEQALYFDAHVLD QRRRHDVRGCRCRFL
Sbjct: 181 RKHVRPRSLPCESSECVVVIPGDLILCFQEGKEQALYFDAHVLDAQRRRHDVRGCRCRFL 240

Query: 241 VRYDHDQSEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSV----LLSGQTI 300
           VRYDHDQSEEIV LRKICRRPETD+RL QLHAVNEAA M+  ++GMD       +   T 
Sbjct: 241 VRYDHDQSEEIVPLRKICRRPETDYRLQQLHAVNEAAPMDQQKTGMDPAANVNAVRATTT 300

Query: 301 SFEATQKLLNKDATIVIPNANANINVHAQT 320
              A    +    T  +P      N+H +T
Sbjct: 301 ETAANVNAVRATTTETVPKQLIAANIHMET 330

BLAST of Cp4.1LG01g01690.1 vs. TAIR10
Match: AT3G18380.2 (AT3G18380.2 sequence-specific DNA binding transcription factors;sequence-specific DNA binding)

HSP 1 Score: 373.6 bits (958), Expect = 1.3e-103
Identity = 200/341 (58.65%), Postives = 241/341 (70.67%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRF  PEV EMEAIL  HN  MP R +L +LA KFSES ERKGK+ VQ 
Sbjct: 1   MGRPPSNGGPAFRFILPEVTEMEAILLQHNTAMPGRHILEALADKFSESPERKGKVVVQF 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVEST-------PPRNVPQTI-------- 120
           KQ+WNWFQNRRYA+RA+  K PGKL  S + +++          P +VP+T         
Sbjct: 61  KQIWNWFQNRRYALRARGNKAPGKLNVSSMPRMDLPNQMRSVIQPLSVPKTTHMTGNLPG 120

Query: 121 VVPAPT---LVGPLKRASENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAG 180
           + PAP+   + G ++  S+N   EFEAKS RDGAWYDV  FL+HR++E GDPEV VRFAG
Sbjct: 121 MTPAPSGSLVPGVMRSGSDNSYLEFEAKSARDGAWYDVQAFLAHRNLEIGDPEVQVRFAG 180

Query: 181 FGSDEDEWVNVRRNIRPRSLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRR 240
           F  +EDEW+NV++++R RSLPCE+SECVAVL GDL+LCFQEGK+QALYFDA VLD QRRR
Sbjct: 181 FEVEEDEWINVKKHVRQRSLPCEASECVAVLAGDLVLCFQEGKDQALYFDAIVLDAQRRR 240

Query: 241 HDVRGCRCRFLVRYDHDQSE-EIVQLRKICRRPETDHRLLQLH-AVNEAASMEPSR---- 300
           HDVRGCRCRFLVRY HDQSE EIV LRKICRRPETD+RL QLH AVN+ A+    +    
Sbjct: 241 HDVRGCRCRFLVRYSHDQSEQEIVPLRKICRRPETDYRLQQLHNAVNDLANSNQHQIPAL 300

Query: 301 --SGMDSVLLSGQTISFEATQ----KLLNKDATIVIPNANA 312
             +    + L G T+   A +     L    AT+V P++NA
Sbjct: 301 DAAAKTPLSLPGATVPIVAPESKDPSLSATPATLVQPSSNA 341

BLAST of Cp4.1LG01g01690.1 vs. TAIR10
Match: AT1G15215.2 (AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors;sequence-specific DNA binding (TAIR:AT3G18380.1))

HSP 1 Score: 183.7 bits (465), Expect = 1.9e-46
Identity = 106/249 (42.57%), Postives = 150/249 (60.24%), Query Frame = 1

Query: 14  FTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQMKQVWNWFQNR-RY 73
           FT  E+ +ME + +   +    ++   ++A  FS SV R GK ++  KQV  WFQ + ++
Sbjct: 14  FTLSEIVDMENLYKELGDQSLHKDFCQTVASTFSCSVNRNGKSSITWKQVQIWFQEKLKH 73

Query: 74  AIRAKTTKVPGKLAGSPIVQVE-----STPPRNVPQTIVVPAPTLVGPLK-RASENPLSE 133
             + K+  +P     SP +Q+      S+   N      V   T V   K +AS+     
Sbjct: 74  QSQPKSKTLP-----SPPLQIHDLSNPSSYASNASNATFVGNSTFVQTRKGKASDLADLA 133

Query: 134 FEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPRSLPCES 193
           FEAKS RD AWYDV++FL++R + +G+ EV VRF+GF +  DEWVNV+ ++R RS+P E 
Sbjct: 134 FEAKSARDYAWYDVSSFLTYRVLRTGELEVRVRFSGFDNRHDEWVNVKTSVRERSIPVEP 193

Query: 194 SECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQ 253
           SEC  V  GDL+LCFQE ++QALY D HVL+ +R  HD   C C FLVRY+ D +EE + 
Sbjct: 194 SECGRVNVGDLLLCFQEREDQALYCDGHVLNIKRGIHDHARCNCVFLVRYELDNTEESLG 253

Query: 254 LRKICRRPE 256
           L +ICRRPE
Sbjct: 254 LERICRRPE 257

BLAST of Cp4.1LG01g01690.1 vs. NCBI nr
Match: gi|778680368|ref|XP_011651298.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis sativus])

HSP 1 Score: 608.6 bits (1568), Expect = 6.9e-171
Identity = 303/346 (87.57%), Postives = 322/346 (93.06%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEMEAILQGHNNTMP+REVLV+LA KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRAS 120
           KQVWNWFQNRRYAIRAKT+K PGKLA SP+VQ+ESTP RNVPQT+VVPAP  VG  K A 
Sbjct: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRF+GFGS+EDEWVN+RRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNK 300
           SEEIVQLRKICRRPETD+RL QLHAVNEAAS+EPS+SGMDSVLLSGQ I+FE +Q  L+K
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 DATIVIPNANANINVHAQTSTQEARNTITNSAPGVFNAGNHAGSSA 347
           DA +VIPNAN +IN HAQTSTQEARNT TN+AP  FN+ N AGSSA
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSA 346

BLAST of Cp4.1LG01g01690.1 vs. NCBI nr
Match: gi|659111991|ref|XP_008456010.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo])

HSP 1 Score: 605.5 bits (1560), Expect = 5.8e-170
Identity = 299/341 (87.68%), Postives = 318/341 (93.26%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEME ILQGHNNTMP+REVLV+LA KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMETILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRAS 120
           KQVWNWFQNRRYAIRAKT+K PGKLA SP+VQ+ESTP RNVPQT+VVPAPT VG  K A 
Sbjct: 61  KQVWNWFQNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRF+GFGS+EDEWVN+RRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNK 300
           SEEIVQLRKICRRPETD+RL QLHAVNEAAS+EPS+SGMDSVLLSGQ I+FE  Q  L+K
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSK 300

Query: 301 DATIVIPNANANINVHAQTSTQEARNTITNSAPGVFNAGNH 342
           DA +VIPNAN +IN HAQTSTQEARNT TN+AP  F++GNH
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPITFSSGNH 341

BLAST of Cp4.1LG01g01690.1 vs. NCBI nr
Match: gi|700202508|gb|KGN57641.1| (hypothetical protein Csa_3G236580 [Cucumis sativus])

HSP 1 Score: 585.5 bits (1508), Expect = 6.2e-164
Identity = 297/346 (85.84%), Postives = 316/346 (91.33%), Query Frame = 1

Query: 1   MGRPPSNGGPAFRFTAPEVAEMEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQM 60
           MGRPPSNGGPAFRFTA EVAEMEAILQGHNNTMP+REVLV+LA KFSESVERKGKIAVQM
Sbjct: 1   MGRPPSNGGPAFRFTASEVAEMEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQM 60

Query: 61  KQVWNWFQNRRYAIRAKTTKVPGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRAS 120
           K      QNRRYAIRAKT+K PGKLA SP+VQ+ESTP RNVPQT+VVPAP  VG  K A 
Sbjct: 61  K------QNRRYAIRAKTSKAPGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAP 120

Query: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPR 180
           ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRF+GFGS+EDEWVN+RRNIRPR
Sbjct: 121 ENPLSEFEAKSGRDGAWYDVATFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPR 180

Query: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240
           SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ
Sbjct: 181 SLPCESSECVAVLPGDLILCFQEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQ 240

Query: 241 SEEIVQLRKICRRPETDHRLLQLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNK 300
           SEEIVQLRKICRRPETD+RL QLHAVNEAAS+EPS+SGMDSVLLSGQ I+FE +Q  L+K
Sbjct: 241 SEEIVQLRKICRRPETDYRLQQLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSK 300

Query: 301 DATIVIPNANANINVHAQTSTQEARNTITNSAPGVFNAGNHAGSSA 347
           DA +VIPNAN +IN HAQTSTQEARNT TN+AP  FN+ N AGSSA
Sbjct: 301 DAALVIPNANPHINAHAQTSTQEARNTETNTAPTTFNSANLAGSSA 340

BLAST of Cp4.1LG01g01690.1 vs. NCBI nr
Match: gi|778680371|ref|XP_011651299.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis sativus])

HSP 1 Score: 568.2 bits (1463), Expect = 1.0e-158
Identity = 283/325 (87.08%), Postives = 302/325 (92.92%), Query Frame = 1

Query: 22  MEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKV 81
           MEAILQGHNNTMP+REVLV+LA KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+K 
Sbjct: 1   MEAILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 60

Query: 82  PGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRASENPLSEFEAKSGRDGAWYDVA 141
           PGKLA SP+VQ+ESTP RNVPQT+VVPAP  VG  K A ENPLSEFEAKSGRDGAWYDVA
Sbjct: 61  PGKLAVSPVVQIESTPVRNVPQTVVVPAPAPVGSAKGAPENPLSEFEAKSGRDGAWYDVA 120

Query: 142 TFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPRSLPCESSECVAVLPGDLILCF 201
           TFLSHRSVESGDPEVLVRF+GFGS+EDEWVN+RRNIRPRSLPCESSECVAVLPGDLILCF
Sbjct: 121 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 180

Query: 202 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDHRLL 261
           QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETD+RL 
Sbjct: 181 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 240

Query: 262 QLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNKDATIVIPNANANINVHAQTST 321
           QLHAVNEAAS+EPS+SGMDSVLLSGQ I+FE +Q  L+KDA +VIPNAN +IN HAQTST
Sbjct: 241 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETSQNPLSKDAALVIPNANPHINAHAQTST 300

Query: 322 QEARNTITNSAPGVFNAGNHAGSSA 347
           QEARNT TN+AP  FN+ N AGSSA
Sbjct: 301 QEARNTETNTAPTTFNSANLAGSSA 325

BLAST of Cp4.1LG01g01690.1 vs. NCBI nr
Match: gi|659111993|ref|XP_008456011.1| (PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo])

HSP 1 Score: 565.1 bits (1455), Expect = 8.7e-158
Identity = 279/320 (87.19%), Postives = 298/320 (93.12%), Query Frame = 1

Query: 22  MEAILQGHNNTMPSREVLVSLAGKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTTKV 81
           ME ILQGHNNTMP+REVLV+LA KFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKT+K 
Sbjct: 1   METILQGHNNTMPAREVLVALADKFSESVERKGKIAVQMKQVWNWFQNRRYAIRAKTSKA 60

Query: 82  PGKLAGSPIVQVESTPPRNVPQTIVVPAPTLVGPLKRASENPLSEFEAKSGRDGAWYDVA 141
           PGKLA SP+VQ+ESTP RNVPQT+VVPAPT VG  K A ENPLSEFEAKSGRDGAWYDVA
Sbjct: 61  PGKLAVSPVVQIESTPVRNVPQTVVVPAPTPVGTAKSAPENPLSEFEAKSGRDGAWYDVA 120

Query: 142 TFLSHRSVESGDPEVLVRFAGFGSDEDEWVNVRRNIRPRSLPCESSECVAVLPGDLILCF 201
           TFLSHRSVESGDPEVLVRF+GFGS+EDEWVN+RRNIRPRSLPCESSECVAVLPGDLILCF
Sbjct: 121 TFLSHRSVESGDPEVLVRFSGFGSEEDEWVNIRRNIRPRSLPCESSECVAVLPGDLILCF 180

Query: 202 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDHRLL 261
           QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETD+RL 
Sbjct: 181 QEGKEQALYFDAHVLDTQRRRHDVRGCRCRFLVRYDHDQSEEIVQLRKICRRPETDYRLQ 240

Query: 262 QLHAVNEAASMEPSRSGMDSVLLSGQTISFEATQKLLNKDATIVIPNANANINVHAQTST 321
           QLHAVNEAAS+EPS+SGMDSVLLSGQ I+FE  Q  L+KDA +VIPNAN +IN HAQTST
Sbjct: 241 QLHAVNEAASIEPSKSGMDSVLLSGQRINFETPQNPLSKDAALVIPNANPHINAHAQTST 300

Query: 322 QEARNTITNSAPGVFNAGNH 342
           QEARNT TN+AP  F++GNH
Sbjct: 301 QEARNTETNTAPITFSSGNH 320

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
SHH2_ARATH9.4e-10458.82Protein SAWADEE HOMEODOMAIN HOMOLOG 2 OS=Arabidopsis thaliana GN=SHH2 PE=2 SV=1[more]
SHH1_ARATH3.4e-4542.57Protein SAWADEE HOMEODOMAIN HOMOLOG 1 OS=Arabidopsis thaliana GN=SHH1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A0A0LC67_CUCSA4.3e-16485.84Uncharacterized protein OS=Cucumis sativus GN=Csa_3G236580 PE=4 SV=1[more]
M5WG16_PRUPE1.8e-12568.52Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa007389mg PE=4 SV=1[more]
W9RI10_9ROSA1.2e-12468.29Uncharacterized protein OS=Morus notabilis GN=L484_012170 PE=4 SV=1[more]
A0A0B2PMB5_GLYSO2.7e-12167.88Uncharacterized protein OS=Glycine soja GN=glysoja_048524 PE=4 SV=1[more]
A0A0R0H5Q7_SOYBN2.7e-12167.88Uncharacterized protein OS=Glycine max GN=GLYMA_12G158000 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT3G18380.21.3e-10358.65 sequence-specific DNA binding transcription factors;sequence-specifi... [more]
AT1G15215.21.9e-4642.57 BEST Arabidopsis thaliana protein match is: sequence-specific DNA bi... [more]
Match NameE-valueIdentityDescription
gi|778680368|ref|XP_011651298.1|6.9e-17187.57PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis sativus][more]
gi|659111991|ref|XP_008456010.1|5.8e-17087.68PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X1 [Cucumis melo][more]
gi|700202508|gb|KGN57641.1|6.2e-16485.84hypothetical protein Csa_3G236580 [Cucumis sativus][more]
gi|778680371|ref|XP_011651299.1|1.0e-15887.08PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis sativus][more]
gi|659111993|ref|XP_008456011.1|8.7e-15887.19PREDICTED: protein SAWADEE HOMEODOMAIN HOMOLOG 2 isoform X2 [Cucumis melo][more]
The following terms have been associated with this mRNA:
Vocabulary: Molecular Function
TermDefinition
GO:0003677DNA binding
GO:0003682chromatin binding
Vocabulary: INTERPRO
TermDefinition
IPR009057Homeobox-like_sf
IPR001356Homeobox_dom
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0000785 chromatin
cellular_component GO:0005634 nucleus
molecular_function GO:0003682 chromatin binding
molecular_function GO:0003677 DNA binding

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Cp4.1LG01g01690Cp4.1LG01g01690gene


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Cp4.1LG01g01690.1Cp4.1LG01g01690.1-proteinpolypeptide


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG01g01690.1:five_prime_utr:001Cp4.1LG01g01690.1:five_prime_utr:001five_prime_UTR
Cp4.1LG01g01690.1:five_prime_utr:002Cp4.1LG01g01690.1:five_prime_utr:002five_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG01g01690.1:cds:001Cp4.1LG01g01690.1:cds:001CDS
Cp4.1LG01g01690.1:cds:002Cp4.1LG01g01690.1:cds:002CDS
Cp4.1LG01g01690.1:cds:003Cp4.1LG01g01690.1:cds:003CDS
Cp4.1LG01g01690.1:cds:004Cp4.1LG01g01690.1:cds:004CDS
Cp4.1LG01g01690.1:cds:005Cp4.1LG01g01690.1:cds:005CDS
Cp4.1LG01g01690.1:cds:006Cp4.1LG01g01690.1:cds:006CDS
Cp4.1LG01g01690.1:cds:007Cp4.1LG01g01690.1:cds:007CDS
Cp4.1LG01g01690.1:cds:008Cp4.1LG01g01690.1:cds:008CDS
Cp4.1LG01g01690.1:cds:009Cp4.1LG01g01690.1:cds:009CDS


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature NameUnique NameType
Cp4.1LG01g01690.1:three_prime_utr:001Cp4.1LG01g01690.1:three_prime_utr:001three_prime_UTR


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001356Homeobox domainPFAMPF00046Homeoboxcoord: 13..74
score: 1.
IPR001356Homeobox domainSMARTSM00389HOX_1coord: 7..81
score: 0.
IPR001356Homeobox domainPROFILEPS50071HOMEOBOX_2coord: 13..77
score: 9
IPR009057Homeodomain-likeGENE3DG3DSA:1.10.10.60coord: 13..78
score: 3.
IPR009057Homeodomain-likeunknownSSF46689Homeodomain-likecoord: 4..75
score: 3.68
NoneNo IPR availablePANTHERPTHR33827FAMILY NOT NAMEDcoord: 2..332
score: 4.4E
NoneNo IPR availablePANTHERPTHR33827:SF3PROTEIN SAWADEE HOMEODOMAIN HOMOLOG 2coord: 2..332
score: 4.4E