CSPI03G18040 (gene) Wild cucumber (PI 183967)

Name: CSPI03G18040
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Homeobox protein HAT3.1, putative
Location: Chr3 : 13727168 .. 13730941 (+)
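The Location field can be split into chromosome, start, end and strand to get the genomic span of the gene. The snippet below is a minimal sketch; the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a string such as 'Chr3 : 13727168 .. 13730941 (+)'."""
    m = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates.
        "length": end - start + 1,
    }

loc = parse_location("Chr3 : 13727168 .. 13730941 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr3 + 3774
```

Under that assumption the gene spans 3774 bp on the forward strand of Chr3.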
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCGAAGGTTGGCCCACTGTGTAGATCGACGCCTTCTGAATCTAGTGGAGTAGTTTTTCTATGGTGTCCACGACGACGGTGGCAGAGGTGGTGGCGGCGTCGGTAACATTTTTTTTGGTCAGGGTGTTTCCTAAATTTTTTTCTAGGGTTTTGTGGTTGCTTTGCTCTGATACCATATTGAAAGCAATAAACGCGAAACCAAGGCTTGCGTGGAAACCCGAGAACCAGGAGAAAAACCACGATGTTTTTAGTTTTATTATTTTCTCTGATAATTACAATAGTACAAATGAGGGAACTTAAATAGGATGTAAAGAGCAAAGAAGAAAAGGAAAAGAAATATTTATGGTAAGTTTTCCATAAATATTTTTTCCATAGATACAAATTCTAACAAAAAGATTAAAATAATAATATGATAAAATAGGAGCATTTTAAAAATAGCAAAATAAATTAAAATATTTACAACCTATAGCAAAATTTTGGATTTTATCAATGATAAAAACTGATAGACTTCTATCACTAACTATCAATGTCACTGATAGAAGCATATTAGTGGCTATCAATGTCTATTATTGATATAATCTAAAAAAAATTTGCTTTATGTTTAAATATTTTGTCAAATTTGCTATTTTTGACAATTCCCCGATAAAATAACCCCTTAGACATACTTTTTAAAATTCATAGTTTAGATAGATTTGGAAACTTAAGTAGTTAAATAGACTTGGAAAAGTCAAAAGTGTAATTTACCTTTAATCAAATCTAATTTTTGATGCGAAAGTTTTTAGTTTTTACATTAGATACCAACCAATATTTCCTGTTTGGTCGCTACGAGTTGCATTATATTATCATTTAGGGCATTTTTCTCATAAACTATTATTTATCAAAATAAACCTAATATTCTTCATAGTGAAGAGTAGGGAAATTTGTTTTCCATGCTTTTTGGTAAGTACAGAAGTACGTTCTATGAATTTTTCCTTGTTGTAAAATGTTGAATCAGTAGTAAGTCATTTCTCATTCTTTTATCCTTTTGTAAATAATTGTTTTCTTACTTCTATTGTTTGTTCTTTGTCAAGAAATAGCAAAAGATAAATAAAATCAAGTGGCCTTATCTCCATCAAGATGTGTTTATGTAGCTTTAGTACATCATGTTTGTATGTTGTGCTTTAAATGCTTGATAGGAGAAAGTCTAGATATTGAACAGGGTTCTTGCTTAGTTTTGGACTAAATGAGTTAACATTTCTATTCATTTAAATTTTTTTCCTATTACATTATTGAAGGAGACATACGGGAACGTTCCTACTGACTCAAGCGATGACACCTACGGGAGTACTTTGGACTCGAGTGATGACAGAGGCTGGGATAGTGGTACAAGGAAGAGAGGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGATCTAATGATGATTTGACCAATGTAAAAACTAAACGCAGTTATAAGAGGAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAATTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCCTCTGTTAAGAAAAGCACATCATCATCAAATAGAAGACTCAGTCAACCTGCATTGGAGGTAACTTCTCCTTTTATGTTTGTGTGTATATATATTTTTTCCATTGGCTGGGATTTCCTGTTCCATTCATTGTCTTATTTGTCATGTATGTTCTTTTTCCTCTTTTCTCCGGTGGGGTTGGGAGATAAGGGTTTACTCAAAGATCTCCCATTAGTCCCCTTGCATTTAAGAATGCATTGTGATGCGTAGAGACTACATCAAAGGAGACAAATATTGGAATTACTGACAGCCATACGATCTAATATATTTGCTGTTTCAAAAAATATATCTATATAGAAAGAAACTCTTGCTAAAATAAATGTGTCATTCTAAAATTCTGAAGCCAGTCAGGTGGTCCGAATAATTGCTGAGAAACTCAAAAACAGTTTTCTTCTGTTCTTCTGTCTTTCAGTTTTCA
TCTCGCTTCCATGAAATTTGAAACGAAATTCAGACTGTCCTTCAAGTTTCATCTCGCTTCCATGAAATTTGAAATGAAATCTGAATCCTGTATATGTTTCACATTCTCATTTAGCTAGTTATATGTAAGAAATTGAATTGTGACATGATGATTAATTTAAATTAAGATGTTATTCCTTCTTGTAACTGGAACAAAACTTTTCTTTGTTGTAGCTAATAGTGAGCATTTTAGCTCTCTTTATGCCCTTGGGTCTTCTCTTTGCAAACGGCAGTCTCAATTTTGAGCAACTATACTCTTGATTTCAACTTAGATGCTTTATTAAAATTCTATAGGTTATCAGCTTGCGTGCGTTGTTGGTTAGATTAGCAATCAGCACCCTTCCTGATCTTTGAGTTTGGACTACTTATTGTGAATATCAGTTTCCTTGCACTTGTATAGTTCACTTGGTTTTTAAAACAAACATGCCTTAGATTTTGAAACGTTAAAATGATTTGCACGCGAAACTAATCAGTTTTGAGTGAGAGGACCAAACAGGTAAGATGTAAAAGAGCCATTGAGATCTTTGTTTCTTGGTTGTGAGAAACAAAATGTATTATTAACTTCGATATATCAAGTTTTTTTATGCAGGTTCAACATTTTACTGAGAAGAAAAGATTACTATTTATTAATTTAATGCCATGCCTTGTTCCTAATATTGTCGCAAATGTTCCAGAGACTTCTTGCATCATTCCAAGAAAATGAGTATCCTAAACGAGCTACAAAGCAGAGTTTAGCACAAGAACTAGGCCTTGGTCTGAAGCAGGTTTGCATTGGGTTTCTTTGAAAGTTTTAGCTCAAATTTTATTCTATTGAAGGCTCAATAATCTACTCCATTTTCAGGTTAGCAAATGGTTTGAGAACACGCGATGGAGCACACGCCATCCCTCAAGCAGTGGTAAGAAAGCAAAAAGTTCCTCAAGAATGAGCATTTATTTATCACAGGCAAGTGGAGAACTATCCAAGAACGAGCCAGAATCTGCAACATGTTTCAGAGATACTGATAGCAATGGTGCTCGACATCAAGACTTACCAATGGCAAATAGTGTTGTGGCTTCATGTCAGAGTGGGGATACAGGGGATAAGAAATTGTCGTCTCGGAAAACTAAAAGAGCAGACTCTTCAGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATAACACGGCATCACATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATGCAAACAGCAGATAGGTTTAAGACAAGGAGGAGGAGATCCATTTAGGTTCTGAAGTATTGATGCGTTCACAGGAATTTATTCGACTATTTACCCTCTCCCAAGAAACACACTAGATCAGATGCGGGGATTGTTTGAAGGGAGGTAATTATGAGAAATCCCAATTGTTTGTAGGTCTTTGTGATCATATTGAAATTGTTCCAGATCTAATCCCTCACGCATAAATGCTTAGTCTTAGTTACTTAAATCTAGAGTTTGTTGTTAGATATTTATTTGCATTTTCAAGTGAATGTTCTCATCAGACCCTTCTCTTGATATGTGTTAAACAATCAATACGGACATCCATTTAGTTGGAGAGATCCTTCAGTTAAACAAGCCTTGTCCAGTATGAGGTGACATGAATTTATATCAGCTGGTAGTAGTCCATTTCTTGAATCTTCTGCAGGGCCATCTTTGACTTTTGCAAGCCATGCCAACTTGAACGCCCCTGGGAAGGAAGCCATGC

mRNA sequence

ATGGCGAAGGTTGGCCCACTGTGTAGATCGACGCCTTCTGAATCTAGTGGAGTAGTTTTTCTATGGTGTCCACGACGACGGTGGCAGAGGTGGTGGCGGCGTCGGGTTTTGTGGTTGCTTTGCTCTGATACCATATTGAAAGCAATAAACGCGAAACCAAGGCTTGCGTGGAAACCCGAGAACCAGGAGAAAAACCACGATGAGACATACGGGAACGTTCCTACTGACTCAAGCGATGACACCTACGGGAGTACTTTGGACTCGAGTGATGACAGAGGCTGGGATAGTGGTACAAGGAAGAGAGGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGATCTAATGATGATTTGACCAATGTAAAAACTAAACGCAGTTATAAGAGGAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAATTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCCTCTGTTAAGAAAAGCACATCATCATCAAATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTCTTGCATCATTCCAAGAAAATGAGTATCCTAAACGAGCTACAAAGCAGAGTTTAGCACAAGAACTAGGCCTTGGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACGCGATGGAGCACACGCCATCCCTCAAGCAGTGGTAAGAAAGCAAAAAGTTCCTCAAGAATGAGCATTTATTTATCACAGGCAAGTGGAGAACTATCCAAGAACGAGCCAGAATCTGCAACATGTTTCAGAGATACTGATAGCAATGGTGCTCGACATCAAGACTTACCAATGGCAAATAGTGTTGTGGCTTCATGTCAGAGTGGGGATACAGGGGATAAGAAATTGTCGTCTCGGAAAACTAAAAGAGCAGACTCTTCAGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATAACACGGCATCACATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATGCAAACAGCAGATAGGTTTAAGACAAGGAGGAGGAGATCCATTTAG

Coding sequence (CDS)

ATGGCGAAGGTTGGCCCACTGTGTAGATCGACGCCTTCTGAATCTAGTGGAGTAGTTTTTCTATGGTGTCCACGACGACGGTGGCAGAGGTGGTGGCGGCGTCGGGTTTTGTGGTTGCTTTGCTCTGATACCATATTGAAAGCAATAAACGCGAAACCAAGGCTTGCGTGGAAACCCGAGAACCAGGAGAAAAACCACGATGAGACATACGGGAACGTTCCTACTGACTCAAGCGATGACACCTACGGGAGTACTTTGGACTCGAGTGATGACAGAGGCTGGGATAGTGGTACAAGGAAGAGAGGTCCTAAAACTCTGGTTCTTGCATTGTCAAACAATGGATCTAATGATGATTTGACCAATGTAAAAACTAAACGCAGTTATAAGAGGAGAACTCGTCAAAAGCCAGGTGCTATAAATGTGAATAATTCTGTGACTGAAACTCCTGTAGACACTGCAAAATCTAGTTCCTCTGTTAAGAAAAGCACATCATCATCAAATAGAAGACTCAGTCAACCTGCATTGGAGAGACTTCTTGCATCATTCCAAGAAAATGAGTATCCTAAACGAGCTACAAAGCAGAGTTTAGCACAAGAACTAGGCCTTGGTCTGAAGCAGGTTAGCAAATGGTTTGAGAACACGCGATGGAGCACACGCCATCCCTCAAGCAGTGGTAAGAAAGCAAAAAGTTCCTCAAGAATGAGCATTTATTTATCACAGGCAAGTGGAGAACTATCCAAGAACGAGCCAGAATCTGCAACATGTTTCAGAGATACTGATAGCAATGGTGCTCGACATCAAGACTTACCAATGGCAAATAGTGTTGTGGCTTCATGTCAGAGTGGGGATACAGGGGATAAGAAATTGTCGTCTCGGAAAACTAAAAGAGCAGACTCTTCAGCCACAAAATCCAGAAAACGGAAGGGCAGGTCAGATAACACGGCATCACATTCAAAAGACAGGGAGGGATCACCAAGGCCTCCTGCCAAGTCACCTAAAGTTAATGAAATGCAAACAGCAGATAGGTTTAAGACAAGGAGGAGGAGATCCATTTAG
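The protein used as the BLAST query below can be derived from this coding sequence by codon-wise translation. The following is a minimal sketch assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and the two test strings are the first 21 and last 15 bases of the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated with base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCGAAGGTTGGCCCACTG"))  # MAKVGPL
print(translate("AGGAGATCCATTTAG"))        # RRSI
```

The C-terminal residues obtained this way agree with the end of the query sequence shown in the alignments (…RRRSI).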
BLAST of CSPI03G18040 vs. Swiss-Prot
Match: HAT31_ARATH (Homeobox protein HAT3.1 OS=Arabidopsis thaliana GN=HAT3.1 PE=2 SV=3)

HSP 1 Score: 72.8 bits (177), Expect = 8.5e-12
Identity = 57/157 (36.31%), Positives = 88/157 (56.05%), Query Frame = 1

Query: 63  EKNHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV 122
           +K +DE Y NVPT SSDD      D +   G +    +    T+ L  S+N  +     +
Sbjct: 526 KKLYDEEYDNVPTSSSDD---DDWDKTARMGKEDSESEDEGDTVPLKQSSNAEDHTSKKL 585

Query: 123 --KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLA 182
             K+KR+ K+ T + P          E P +    S  ++KS+SS+ ++ + P  +RL  
Sbjct: 586 IRKSKRADKKDTLEMP---------QEGPGENG-GSGEIEKSSSSACKQ-TDPKTQRLYI 645

Query: 183 SFQENEYPKRATKQSLAQELGLGLKQVSKWFENTRWS 218
           SFQEN+YP +ATK+SLA+EL + +KQV+ WF++ RWS
Sbjct: 646 SFQENQYPDKATKESLAKELQMTVKQVNNWFKHRRWS 668

BLAST of CSPI03G18040 vs. Swiss-Prot
Match: PRH_PETCR (Pathogenesis-related homeodomain protein OS=Petroselinum crispum GN=PRH PE=2 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.5e-11
Identity = 58/160 (36.25%), Positives = 79/160 (49.38%), Query Frame = 1

Query: 66  HDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTK 125
           + E YGN  +DSSD+ Y  T  SS D+           K          S D   + K +
Sbjct: 849 NQEEYGNTSSDSSDEDYMVT--SSPDKN-------NSDKEATAMERGRESGDLELDQKAR 908

Query: 126 RSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQEN 185
            S   R   K  A+   +S      + + +  +  KSTS +     + A +RLL SF+EN
Sbjct: 909 ESTHNRRYIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSKTLH--GEHATQRLLQSFKEN 968

Query: 186 EYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSG 226
           +YP+RA K+SLA EL L ++QVS WF N RWS RH S  G
Sbjct: 969 QYPQRAVKESLAAELALSVRQVSNWFNNRRWSFRHSSRIG 997

BLAST of CSPI03G18040 vs. TrEMBL
Match: A0A0A0L6Y1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G199010 PE=4 SV=1)

HSP 1 Score: 527.3 bits (1357), Expect = 1.4e-146
Identity = 284/284 (100.00%), Positives = 284/284 (100.00%), Query Frame = 1

Query: 68  ETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRS 127
           ETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRS
Sbjct: 3   ETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRS 62

Query: 128 YKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEY 187
           YKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEY
Sbjct: 63  YKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEY 122

Query: 188 PKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSK 247
           PKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSK
Sbjct: 123 PKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSK 182

Query: 248 NEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKR 307
           NEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKR
Sbjct: 183 NEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKR 242

Query: 308 KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 352
           KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
Sbjct: 243 KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 286

BLAST of CSPI03G18040 vs. TrEMBL
Match: A0A067L7L0_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16279 PE=4 SV=1)

HSP 1 Score: 122.1 bits (305), Expect = 1.4e-24
Identity = 115/309 (37.22%), Positives = 156/309 (50.49%), Query Frame = 1

Query: 63   EKNHDETYGNVPTDSSDD---------------TYGSTL-DSSDDRGW--DSGTRKRGPK 122
            +K +DETYGN  +DSSDD               TYGST  DSSDD  +  D   RKR   
Sbjct: 705  KKLYDETYGNASSDSSDDEDFTDDVEPRKRRKETYGSTSSDSSDDEDFIDDVEPRKRRRS 764

Query: 123  TLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTS 182
            T V   S N +N  ++    + +  +R RQK    N + S T+     + SSSS K   S
Sbjct: 765  TEVGQASVN-ANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEGASPSSSSGKPVKS 824

Query: 183  SSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRH-PSS 242
            S  RRL +   + L  SF+EN+YP RA K+SLA+ELG+  +QVSKWFENTRWS  H PS+
Sbjct: 825  SGYRRLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVSKWFENTRWSFNHPPST 884

Query: 243  SGKKAKSSSRMSIYLSQASGELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGD 302
                 + +++    L + + EL   EPE     R+T SNGA+ ++ P  +        GD
Sbjct: 885  DASTVRKTTKEDSQLPKTNTELCTPEPEKIC--RNTTSNGAQSEESPKVDDATGGSYIGD 944

Query: 303  TGDKKLSSRKTKRADSSATKSRKRKGRSDNTASHSKDREGS-PRPPAKSPKVNEMQTADR 352
            T D K+ S+++ +  S    SRKRK  SD          G   + P   PK  E     R
Sbjct: 945  TRDTKMGSQESCKQKSKTPDSRKRKHISDPRTLDPYSTIGEMEKIPVNLPKSQEKPAGGR 1004

BLAST of CSPI03G18040 vs. TrEMBL
Match: W9R947_9ROSA (Homeobox protein OS=Morus notabilis GN=L484_011492 PE=4 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 5.0e-19
Identity = 99/291 (34.02%), Positives = 141/291 (48.45%), Query Frame = 1

Query: 66   HDETYGNVPTDSSDDTYGSTLDSSDDRGWDSG-TRKRGPKTLVLALSNNGSNDDLTNVKT 125
            HDETYG++P+DSSDD   +   +   R   +G      P      + N  + D   N   
Sbjct: 753  HDETYGHLPSDSSDDEDWTDYAAPRKRKRTTGQVSSVSPNENASIIKNQTTTDAANNDLE 812

Query: 126  KRSY--KRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF 185
               Y  +RR+RQ     + NN   +    + KS S+ ++   S+NRRL +   +RL  SF
Sbjct: 813  DNEYVPRRRSRQNSVVTDENNIPNKLLQGSPKSGSTGRRRELSTNRRLGEAVTQRLYQSF 872

Query: 186  QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKK---AKSSSRMSIYLS 245
            +EN+Y  RATK+SLAQELGL   QVSKWFEN RWS RH  SS KK   ++ +S+ S    
Sbjct: 873  KENQYLDRATKESLAQELGLTSYQVSKWFENARWSYRH--SSSKKPGISEHASKESTLSP 932

Query: 246  QASGELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADS 305
            Q + +L   E E  T   ++  NGA + +LP   + +    SGD GD K+     + +  
Sbjct: 933  QTNKKLF--ETELNTSITNSTCNGALNNELPRTGNAMPESCSGDVGDGKVEMPTKESSGQ 992

Query: 306  SATKSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRS 351
            ++T    RK            R      P   P V+  +T  R + R R S
Sbjct: 993  TSTTPGSRK----------TGRPWKVERPETPPVVDTHETGGRKRGRHRES 1029

BLAST of CSPI03G18040 vs. TrEMBL
Match: F6H698_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0091g00290 PE=4 SV=1)

HSP 1 Score: 103.2 bits (256), Expect = 6.5e-19
Identity = 93/265 (35.09%), Positives = 127/265 (47.92%), Query Frame = 1

Query: 63  EKNHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTL---VLALSNNGSN--- 122
           +K HDE YGNV +DSSDD             W      R  K L   V ++S NG+    
Sbjct: 568 KKLHDEAYGNVSSDSSDD-----------EDWTENVIPRKRKNLSGNVASVSPNGNTSIT 627

Query: 123 DDLTNVKTKR--------SYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNR 182
           ++ TN K  +        + KRRTRQK    + NNS+ E+  D+    S+ +KS  SS +
Sbjct: 628 ENGTNTKDIKHDLEAAGCTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYK 687

Query: 183 RLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRH-PSSSGKK 242
           +L +   ERL  SFQEN+YP RA K+ LA+ELG+  +QVSKWFEN RWS RH P      
Sbjct: 688 KLGEAVTERLYKSFQENQYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASA 747

Query: 243 AKSSSRMSIYLSQASGELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDK 302
            KS+ +       AS   +  +PE     R++  NG   ++ P A            G  
Sbjct: 748 GKSAVK-----KDASTSQTDQKPEQEVVLRESSHNGVGKKESPKA------------GAS 804

Query: 303 KLSSRKTKRADSSATKSRKRKGRSD 313
           K+   K   A  SA K      ++D
Sbjct: 808 KVDRSKEANAGKSAVKKDASTSQTD 804

BLAST of CSPI03G18040 vs. TrEMBL
Match: A0A061E032_THECC (Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 OS=Theobroma cacao GN=TCM_007171 PE=4 SV=1)

HSP 1 Score: 94.4 bits (233), Expect = 3.0e-16
Identity = 75/202 (37.13%), Positives = 110/202 (54.46%), Query Frame = 1

Query: 66  HDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDD---LTNV 125
           +DETYGNVP+ SSDD   S + +   R   +      P+   +++S   S  D       
Sbjct: 751 YDETYGNVPSSSSDDEDWSDITAPRKRNKCTAEVASAPENGNVSVSRTVSVSDGLKQNPE 810

Query: 126 KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF 185
           +T+   +R+TRQ     + ++S  E   +T+ S SS KK+ SS+ +RL +   +RL  SF
Sbjct: 811 ETEHKPRRKTRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYKSF 870

Query: 186 QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWS-TRHPSSSGKKAKSSSRMSIYLSQA 245
           +EN+YP RATKQSLA+EL +  +QVSKWF+N RWS    PSS    A ++S   I     
Sbjct: 871 KENQYPDRATKQSLAKELDMTFQQVSKWFDNARWSFNNSPSSHETIANNASEKDI----- 930

Query: 246 SGELSKNEPESATCFRDTDSNG 264
           +  L   E   +   RD D++G
Sbjct: 931 TSSLPNKEVTGSGNVRDGDNSG 947

BLAST of CSPI03G18040 vs. TAIR10
Match: AT3G19510.1 (AT3G19510.1 Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain)

HSP 1 Score: 72.8 bits (177), Expect = 4.8e-13
Identity = 57/157 (36.31%), Positives = 88/157 (56.05%), Query Frame = 1

Query: 63  EKNHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV 122
           +K +DE Y NVPT SSDD      D +   G +    +    T+ L  S+N  +     +
Sbjct: 526 KKLYDEEYDNVPTSSSDD---DDWDKTARMGKEDSESEDEGDTVPLKQSSNAEDHTSKKL 585

Query: 123 --KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLA 182
             K+KR+ K+ T + P          E P +    S  ++KS+SS+ ++ + P  +RL  
Sbjct: 586 IRKSKRADKKDTLEMP---------QEGPGENG-GSGEIEKSSSSACKQ-TDPKTQRLYI 645

Query: 183 SFQENEYPKRATKQSLAQELGLGLKQVSKWFENTRWS 218
           SFQEN+YP +ATK+SLA+EL + +KQV+ WF++ RWS
Sbjct: 646 SFQENQYPDKATKESLAKELQMTVKQVNNWFKHRRWS 668

BLAST of CSPI03G18040 vs. NCBI nr
Match: gi|778679986|ref|XP_011651230.1| (PREDICTED: homeobox protein HOX1A [Cucumis sativus])

HSP 1 Score: 532.7 bits (1371), Expect = 4.8e-148
Identity = 287/289 (99.31%), Positives = 288/289 (99.65%), Query Frame = 1

Query: 63   EKNHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV 122
            +K HDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV
Sbjct: 751  KKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV 810

Query: 123  KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF 182
            KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF
Sbjct: 811  KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF 870

Query: 183  QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQAS 242
            QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQAS
Sbjct: 871  QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQAS 930

Query: 243  GELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSAT 302
            GELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSAT
Sbjct: 931  GELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSAT 990

Query: 303  KSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 352
            KSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
Sbjct: 991  KSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 1039

BLAST of CSPI03G18040 vs. NCBI nr
Match: gi|700202355|gb|KGN57488.1| (hypothetical protein Csa_3G199010 [Cucumis sativus])

HSP 1 Score: 527.3 bits (1357), Expect = 2.0e-146
Identity = 284/284 (100.00%), Positives = 284/284 (100.00%), Query Frame = 1

Query: 68  ETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRS 127
           ETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRS
Sbjct: 3   ETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNVKTKRS 62

Query: 128 YKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEY 187
           YKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEY
Sbjct: 63  YKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEY 122

Query: 188 PKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSK 247
           PKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSK
Sbjct: 123 PKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGELSK 182

Query: 248 NEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKR 307
           NEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKR
Sbjct: 183 NEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKR 242

Query: 308 KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 352
           KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI
Sbjct: 243 KGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 286

BLAST of CSPI03G18040 vs. NCBI nr
Match: gi|659112348|ref|XP_008456177.1| (PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Cucumis melo])

HSP 1 Score: 499.6 bits (1285), Expect = 4.5e-138
Identity = 270/289 (93.43%), Positives = 280/289 (96.89%), Query Frame = 1

Query: 63   EKNHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV 122
            +K HDETYGNVPT+SSDDTYGSTLDSSDDRG DSGTRKRGPKTLVLALSNNGSNDDLTNV
Sbjct: 774  KKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNV 833

Query: 123  KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF 182
            KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSV++ TSSSNRRLSQPALERL ASF
Sbjct: 834  KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASF 893

Query: 183  QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQAS 242
            QENEYPKRATK+SLAQELGL LKQVSKWFENTRWSTRHPSS GKKAKSSSRMSI+LSQAS
Sbjct: 894  QENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQAS 953

Query: 243  GELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSAT 302
            GELSKNE ESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKL++RKTKR +SSAT
Sbjct: 954  GELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSAT 1013

Query: 303  KSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 352
            KSRKRKGRSDNTAS+SKDREGSPRPPAKSPKVNE QTADRFKTRRRRSI
Sbjct: 1014 KSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI 1062

BLAST of CSPI03G18040 vs. NCBI nr
Match: gi|659112354|ref|XP_008456180.1| (PREDICTED: homeobox protein HAT3.1 isoform X2 [Cucumis melo])

HSP 1 Score: 499.6 bits (1285), Expect = 4.5e-138
Identity = 270/289 (93.43%), Positives = 280/289 (96.89%), Query Frame = 1

Query: 63   EKNHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTRKRGPKTLVLALSNNGSNDDLTNV 122
            +K HDETYGNVPT+SSDDTYGSTLDSSDDRG DSGTRKRGPKTLVLALSNNGSNDDLTNV
Sbjct: 716  KKLHDETYGNVPTESSDDTYGSTLDSSDDRGCDSGTRKRGPKTLVLALSNNGSNDDLTNV 775

Query: 123  KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASF 182
            KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSV++ TSSSNRRLSQPALERL ASF
Sbjct: 776  KTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVRQCTSSSNRRLSQPALERLFASF 835

Query: 183  QENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQAS 242
            QENEYPKRATK+SLAQELGL LKQVSKWFENTRWSTRHPSS GKKAKSSSRMSI+LSQAS
Sbjct: 836  QENEYPKRATKESLAQELGLNLKQVSKWFENTRWSTRHPSSGGKKAKSSSRMSIHLSQAS 895

Query: 243  GELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSAT 302
            GELSKNE ESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKL++RKTKR +SSAT
Sbjct: 896  GELSKNEQESATCFRDTDSNGARHQDLPMANSVVASCQSGDTGDKKLTTRKTKRGESSAT 955

Query: 303  KSRKRKGRSDNTASHSKDREGSPRPPAKSPKVNEMQTADRFKTRRRRSI 352
            KSRKRKGRSDNTAS+SKDREGSPRPPAKSPKVNE QTADRFKTRRRRSI
Sbjct: 956  KSRKRKGRSDNTASNSKDREGSPRPPAKSPKVNETQTADRFKTRRRRSI 1004

BLAST of CSPI03G18040 vs. NCBI nr
Match: gi|643738525|gb|KDP44446.1| (hypothetical protein JCGZ_16279 [Jatropha curcas])

HSP 1 Score: 122.1 bits (305), Expect = 2.0e-24
Identity = 115/309 (37.22%), Positives = 156/309 (50.49%), Query Frame = 1

Query: 63   EKNHDETYGNVPTDSSDD---------------TYGSTL-DSSDDRGW--DSGTRKRGPK 122
            +K +DETYGN  +DSSDD               TYGST  DSSDD  +  D   RKR   
Sbjct: 705  KKLYDETYGNASSDSSDDEDFTDDVEPRKRRKETYGSTSSDSSDDEDFIDDVEPRKRRRS 764

Query: 123  TLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKSSSSVKKSTS 182
            T V   S N +N  ++    + +  +R RQK    N + S T+     + SSSS K   S
Sbjct: 765  TEVGQASVN-ANAFVSKTAKQDTTPKRHRQKSKFANTSTSSTKGHEGASPSSSSGKPVKS 824

Query: 183  SSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSKWFENTRWSTRH-PSS 242
            S  RRL +   + L  SF+EN+YP RA K+SLA+ELG+  +QVSKWFENTRWS  H PS+
Sbjct: 825  SGYRRLGETVTQGLYKSFKENQYPDRAKKESLAKELGITFQQVSKWFENTRWSFNHPPST 884

Query: 243  SGKKAKSSSRMSIYLSQASGELSKNEPESATCFRDTDSNGARHQDLPMANSVVASCQSGD 302
                 + +++    L + + EL   EPE     R+T SNGA+ ++ P  +        GD
Sbjct: 885  DASTVRKTTKEDSQLPKTNTELCTPEPEKIC--RNTTSNGAQSEESPKVDDATGGSYIGD 944

Query: 303  TGDKKLSSRKTKRADSSATKSRKRKGRSDNTASHSKDREGS-PRPPAKSPKVNEMQTADR 352
            T D K+ S+++ +  S    SRKRK  SD          G   + P   PK  E     R
Sbjct: 945  TRDTKMGSQESCKQKSKTPDSRKRKHISDPRTLDPYSTIGEMEKIPVNLPKSQEKPAGGR 1004

The following BLAST results are available for this feature:
Match Name   E-value  Identity  Description
HAT31_ARATH  8.5e-12  36.31     Homeobox protein HAT3.1 OS=Arabidopsis thaliana GN=HAT3.1 PE=2 SV=3
PRH_PETCR    1.5e-11  36.25     Pathogenesis-related homeodomain protein OS=Petroselinum crispum GN=PRH PE=2 SV=...

Match Name        E-value   Identity  Description
A0A0A0L6Y1_CUCSA  1.4e-146  100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_3G199010 PE=4 SV=1
A0A067L7L0_JATCU  1.4e-24   37.22     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_16279 PE=4 SV=1
W9R947_9ROSA      5.0e-19   34.02     Homeobox protein OS=Morus notabilis GN=L484_011492 PE=4 SV=1
F6H698_VITVI      6.5e-19   35.09     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_03s0091g00290 PE=4 SV=...
A0A061E032_THECC  3.0e-16   37.13     Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative is...

Match Name   E-value  Identity  Description
AT3G19510.1  4.8e-13  36.31     Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain

Match Name                        E-value   Identity  Description
gi|778679986|ref|XP_011651230.1|  4.8e-148  99.31     PREDICTED: homeobox protein HOX1A [Cucumis sativus]
gi|700202355|gb|KGN57488.1|       2.0e-146  100.00    hypothetical protein Csa_3G199010 [Cucumis sativus]
gi|659112348|ref|XP_008456177.1|  4.5e-138  93.43     PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Cucumis melo]
gi|659112354|ref|XP_008456180.1|  4.5e-138  93.43     PREDICTED: homeobox protein HAT3.1 isoform X2 [Cucumis melo]
gi|643738525|gb|KDP44446.1|       2.0e-24   37.22     hypothetical protein JCGZ_16279 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR001356  Homeobox_dom
IPR009057  Homeobox-like_sf
IPR017970  Homeobox_CS

Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO:0043565  sequence-specific DNA binding

Vocabulary: Biological Process
Term        Definition
GO:0006355  regulation of transcription, DNA-templated

GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006355      regulation of transcription, DNA-templated
cellular_component  GO:0005634      nucleus
molecular_function  GO:0043565      sequence-specific DNA binding
molecular_function  GO:0003677      DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI03G18040.1  CSPI03G18040.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description           Source   Source Term       Source Description                  Alignment
IPR001356  Homeobox domain           PFAM     PF00046           Homeobox                            coord: 169..219; score: 4.6
IPR001356  Homeobox domain           SMART    SM00389           HOX_1                               coord: 164..225; score: 1.3
IPR001356  Homeobox domain           PROFILE  PS50071           HOMEOBOX_2                          coord: 161..221; score: 13
IPR009057  Homeodomain-like          GENE3D   G3DSA:1.10.10.60                                      coord: 155..215; score: 1.0
IPR009057  Homeodomain-like          unknown  SSF46689          Homeodomain-like                    coord: 156..221; score: 5.99
IPR017970  Homeobox, conserved site  PROSITE  PS00027           HOMEOBOX_1                          coord: 196..219; score: [truncated]
None       No IPR available          PANTHER  PTHR12628         POLYCOMB-LIKE TRANSCRIPTION FACTOR  coord: 66..236; score: 8.2
None       No IPR available          PANTHER  PTHR12628:SF13    HOMEOBOX PROTEIN HAT3.1             coord: 66..236; score: 8.2

The following gene(s) are orthologous to this gene:
Gene          Orthologue   Organism                    Block
CSPI03G18040  Csa3G199010  Cucumber (Chinese Long) v2  cpicuB120

The following gene(s) are paralogous to this gene:

None