Cp4.1LG20g00080 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG20g00080
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Zinc finger family protein
Location: Cp4.1LG20 : 40531 .. 43233 (-)
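The location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the helper names `parse_location` and `feature_length` are hypothetical and introduced only for illustration.

```python
import re


def parse_location(text):
    """Split a 'seqid : start .. end (strand)' string into its four parts."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def feature_length(start, end):
    # Assumption: both coordinates are 1-based and inclusive,
    # so the span covers end - start + 1 bases.
    return end - start + 1


seqid, start, end, strand = parse_location("Cp4.1LG20 : 40531 .. 43233 (-)")
print(seqid, strand, feature_length(start, end))
```

Under that assumption the gene spans 2703 bases on the minus strand of Cp4.1LG20.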
The following sequences are available for this feature:

Gene sequence (with intron)

TTTCCACACATTTTAATTAGTCCCTCCCATTCACCCGCCCACCCTTTTCTTCTTCCGCTTTTTAATTCTAACGGGTGCGGGAGAAGGGCAGCAGAGTAGGCGTATTGTATTGTTTAGGAGTTTGGGAGTGATCGTAATGAAGTTTGTGTTGTTTTTGTTTTTGTTTTTGTTTTTGTTTGTTTGTTTGTGGGGGTGTGTTTGCCGCTGAAGATGAGGATTATTAGAAGAAAACAAGGAGAGGGAAGGTGGTAGCCCCATCCTCCTTCCCCTTCCCCTTCGACAAAACATATTATAAAACAACCCTCACATCTCATATCTATCTATTCCTTCCTTCCTATTTACACACAACTGAAGAGATATCATTTGGGTCTTCAAAACTTTGGCATGATAGCCAATACACCCGCCTCGGCCTCCCTCTCTCTTCCTTCTTCGGAGCCTTACTCGTGCTTAGAAAATGGAAGCAGCAACAGCAACAAGAGGAAAAGAAGGCCTGCGGGCACCCCAGGTAACCCAATTTCTGATTTTAAGATTTTAGGTTTAAGGAAGTAAAGTTTGTGAGTGAGTAGATGGATGGTAATGTGTGTGTGTGTGTGTGTGCGCGCGCGCAGATCCTGATGCAGAGGTGGTGTCTCTCTCCCCCAAAACACTTCTAGAATCGGACCGGTACGTGTGCGAGATCTGCAACCAGGGGTTCCAGAGGGACCAGAACCTACAGATGCACAGACGGCGGCATAAGGTGCCTTGGAAGCTGTTGAAGAGGGAGACTCCTGTTGTGCGGAAAAGGGTTTTCGTTTGCCCGGAGCCCAGCTGTCTGCACCACGACCCTTGCCACGCCCTCGGCGATCTGGTTGGCATCAAGAAGCACTTCCGAAGAAAGCACAGCAACCACAAGCAGTGGGTTTGTGAAAAGTGCTCCAAGGCCTACGCCGTTCAGTCTGATTACAAGGCCCATCTCAAAACCTGTGGCACCCGTGGCCACTCTTGCGATTGCGGCCGCGTTTTCTCTAGGTACTCATCATCTTTTCTTCTGCACCCTTTCTCCCTTGTTTCTCGCAAATCCAATCTTTTTCATCCCCTTTTAAACATTATTTAATTCTATGTCAACGTTTCCCAGATTTCCTCCTCACACCAAATACTATCAATTCAATTCATGTTTAGCATTAGAAGTGATGAACCATGTGTTTAGTTAAATACTGGGTTTTAGGCGGCATAAATAAATATATACACGTAGTTGGATTAATTAATTGAGGTGATAGGGTGGAGAGCTTTATAGAGCACCAAGATGCTTGCAATATGGGGCATCTCCGCCAAGAATCCCAAACCCTCCACCCCGCGTGTTTGTCCCGAACGGCTTCCAGTCCAAGCCCACCCAGCGACGCTAACTTCACGGCGCCACCCACTGCCATCTTCTTAGCCCCCACCGACGACTCCATCATCAACAACAACAACAACAACAATAATAATAGCCATCACAATCTAGACCTCAAACTCTCAACTTCTACATCAAATGAAGGTTCTTCCACGCAACTACAGCTTTCGATCGGGTCGTGTGATTTTGGGGACAAGAAGAAAAGGGTGCAGTTGTTGGATGATGAGAAAGGTGGGGGCCGGGGGCCGGGGGGTGTTAGAGAAGAAGCGCGGGAAGAGCTAAGAGAGGCAATGGCGGAGAAGGCGTACGCGGAGGAGGCAAGGCAACAGGCGAAGCGGCAGATAGAAATGGCAGAGGAAGAATTCGTGAACGCAAAGAGGATGAGGCAACAGGCACAGGCGGAGTTAGATAAGGCTACAGCTCTTAAACAAGCGGCAATAAAACAGATAAACTCAACCCTTCTTGAGATCACTTGTCAAGCATGCCAGAAGCAATTTCAAGCCAAGCCCACAACAACAACTACTACTACTGCACCCGCTGACCATGACACTTCCGTAGCATTCAGTTACGTATCCTCAGTCATTACAACCGAAGGTGAAGTAGAGAAAGATCAAGCAACCCTTATACCTAA
TTAACCCTTCCCTTCCCTACCCATCTCATATATTATTATATTATAGTATATGGATCATTTTTTTTTTCATGGTCTCCTTGATAAATAATCTTCCTTTATTTTTATTGTTTATGTTTGTAGTGAATAAATATGCAGAAAGAGTGCTGTGATTGTAGGTCGAGGGATGGAGTGTTGTACCCTTGGGCAGTTGATGCTTTGTAATTTTTTTCCCCCACCGATAGTTTGCTAATTAATTATTGCAGACTTCAATATAGAATGGGTAAGCATTTTTTGTTTGCTTTGTATTAATTTTTTTTATAAAAGAAAAAAGAAAAAAGAAAAAAGAAGTACAAAATTGATCTCATAATAGGTGATTTTGGATTTAGTGTTTGGAGTATATTGCAGGTGCAGAGGGCATTGGGTGTTGGTCATGTCACTGCGTTGAAGGGATACGACTCTCTTTCTTGGGCATTTAAGAACGGCACAGACCAATATTTATAACTGAATATCATTACATTGCATCAGTAAAAGTAAGGAGGAGAGAGAAAGAGATAGGAAAGTGTGTGTGTGTGTGTGTTTTTCTCTCTTCTCTAGGAGGACAGAGTGAGAGATAAATGTACAGACGCCCTATTTTTCAGTGCCACTCTGGTAAAGGGGAGTTGATGGCATAATCAGCCTGCACTCCTCCCTCTCAACTCAACTACCAGACACCATTTCTCTCTCC

mRNA sequence

TTTCCACACATTTTAATTAGTCCCTCCCATTCACCCGCCCACCCTTTTCTTCTTCCGCTTTTTAATTCTAACGGGTGCGGGAGAAGGGCAGCAGAGTAGGCGTATTGTATTGTTTAGGAGTTTGGGAGTGATCGTAATGAAGTTTGTGTTGTTTTTGTTTTTGTTTTTGTTTTTGTTTGTTTGTTTGTGGGGGTGTGTTTGCCGCTGAAGATGAGGATTATTAGAAGAAAACAAGGAGAGGGAAGGTGGTAGCCCCATCCTCCTTCCCCTTCCCCTTCGACAAAACATATTATAAAACAACCCTCACATCTCATATCTATCTATTCCTTCCTTCCTATTTACACACAACTGAAGAGATATCATTTGGGTCTTCAAAACTTTGGCATGATAGCCAATACACCCGCCTCGGCCTCCCTCTCTCTTCCTTCTTCGGAGCCTTACTCGTGCTTAGAAAATGGAAGCAGCAACAGCAACAAGAGGAAAAGAAGGCCTGCGGGCACCCCAGATCCTGATGCAGAGGTGGTGTCTCTCTCCCCCAAAACACTTCTAGAATCGGACCGGTACGTGTGCGAGATCTGCAACCAGGGGTTCCAGAGGGACCAGAACCTACAGATGCACAGACGGCGGCATAAGGTGCCTTGGAAGCTGTTGAAGAGGGAGACTCCTGTTGTGCGGAAAAGGGTTTTCGTTTGCCCGGAGCCCAGCTGTCTGCACCACGACCCTTGCCACGCCCTCGGCGATCTGGTTGGCATCAAGAAGCACTTCCGAAGAAAGCACAGCAACCACAAGCAGTGGGTTTGTGAAAAGTGCTCCAAGGCCTACGCCGTTCAGTCTGATTACAAGGCCCATCTCAAAACCTGTGGCACCCGTGGCCACTCTTGCGATTGCGGCCGCGTTTTCTCTAGGGTGGAGAGCTTTATAGAGCACCAAGATGCTTGCAATATGGGGCATCTCCGCCAAGAATCCCAAACCCTCCACCCCGCGTGTTTGTCCCGAACGGCTTCCAGTCCAAGCCCACCCAGCGACGCTAACTTCACGGCGCCACCCACTGCCATCTTCTTAGCCCCCACCGACGACTCCATCATCAACAACAACAACAACAACAATAATAATAGCCATCACAATCTAGACCTCAAACTCTCAACTTCTACATCAAATGAAGGTTCTTCCACGCAACTACAGCTTTCGATCGGGTCGTGTGATTTTGGGGACAAGAAGAAAAGGGTGCAGTTGTTGGATGATGAGAAAGGTGGGGGCCGGGGGCCGGGGGGTGTTAGAGAAGAAGCGCGGGAAGAGCTAAGAGAGGCAATGGCGGAGAAGGCGTACGCGGAGGAGGCAAGGCAACAGGCGAAGCGGCAGATAGAAATGGCAGAGGAAGAATTCGTGAACGCAAAGAGGATGAGGCAACAGGCACAGGCGGAGTTAGATAAGGCTACAGCTCTTAAACAAGCGGCAATAAAACAGATAAACTCAACCCTTCTTGAGATCACTTGTCAAGCATGCCAGAAGCAATTTCAAGCCAAGCCCACAACAACAACTACTACTACTGCACCCGCTGACCATGACACTTCCGTAGCATTCAGTTACGTATCCTCAGTCATTACAACCGAAGTGAATAAATATGCAGAAAGAGTGCTGTGATTGTAGGTCGAGGGATGGAGTGTTGTACCCTTGGGCAGTTGATGCTTTGTAATTTTTTTCCCCCACCGATAGTTTGCTAATTAATTATTGCAGACTTCAATATAGAATGGGTGCAGAGGGCATTGGGTGTTGGTCATGTCACTGCGTTGAAGGGATACGACTCTCTTTCTTGGGCATTTAAGAACGGCACAGACCAATATTTATAACTGAATATCATTACATTGCATCAGTAAAAGTAAGGAGGAGAGAGAAAGAGATAGGAAAGTGTGTGTGTGTGTGTGTTTTTCTCTCTTCTCTAGGAGGACAGAGTGAGAGATAAATGTACAGACGCCCTATTTTTCAGTGCCACTCTGGTAAAG
GGGAGTTGATGGCATAATCAGCCTGCACTCCTCCCTCTCAACTCAACTACCAGACACCATTTCTCTCTCC

Coding sequence (CDS)

ATGATAGCCAATACACCCGCCTCGGCCTCCCTCTCTCTTCCTTCTTCGGAGCCTTACTCGTGCTTAGAAAATGGAAGCAGCAACAGCAACAAGAGGAAAAGAAGGCCTGCGGGCACCCCAGATCCTGATGCAGAGGTGGTGTCTCTCTCCCCCAAAACACTTCTAGAATCGGACCGGTACGTGTGCGAGATCTGCAACCAGGGGTTCCAGAGGGACCAGAACCTACAGATGCACAGACGGCGGCATAAGGTGCCTTGGAAGCTGTTGAAGAGGGAGACTCCTGTTGTGCGGAAAAGGGTTTTCGTTTGCCCGGAGCCCAGCTGTCTGCACCACGACCCTTGCCACGCCCTCGGCGATCTGGTTGGCATCAAGAAGCACTTCCGAAGAAAGCACAGCAACCACAAGCAGTGGGTTTGTGAAAAGTGCTCCAAGGCCTACGCCGTTCAGTCTGATTACAAGGCCCATCTCAAAACCTGTGGCACCCGTGGCCACTCTTGCGATTGCGGCCGCGTTTTCTCTAGGGTGGAGAGCTTTATAGAGCACCAAGATGCTTGCAATATGGGGCATCTCCGCCAAGAATCCCAAACCCTCCACCCCGCGTGTTTGTCCCGAACGGCTTCCAGTCCAAGCCCACCCAGCGACGCTAACTTCACGGCGCCACCCACTGCCATCTTCTTAGCCCCCACCGACGACTCCATCATCAACAACAACAACAACAACAATAATAATAGCCATCACAATCTAGACCTCAAACTCTCAACTTCTACATCAAATGAAGGTTCTTCCACGCAACTACAGCTTTCGATCGGGTCGTGTGATTTTGGGGACAAGAAGAAAAGGGTGCAGTTGTTGGATGATGAGAAAGGTGGGGGCCGGGGGCCGGGGGGTGTTAGAGAAGAAGCGCGGGAAGAGCTAAGAGAGGCAATGGCGGAGAAGGCGTACGCGGAGGAGGCAAGGCAACAGGCGAAGCGGCAGATAGAAATGGCAGAGGAAGAATTCGTGAACGCAAAGAGGATGAGGCAACAGGCACAGGCGGAGTTAGATAAGGCTACAGCTCTTAAACAAGCGGCAATAAAACAGATAAACTCAACCCTTCTTGAGATCACTTGTCAAGCATGCCAGAAGCAATTTCAAGCCAAGCCCACAACAACAACTACTACTACTGCACCCGCTGACCATGACACTTCCGTAGCATTCAGTTACGTATCCTCAGTCATTACAACCGAAGTGAATAAATATGCAGAAAGAGTGCTGTGA

Protein sequence

MIANTPASASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANFTAPPTAIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTSTSNEGSSTQLQLSIGSCDFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDTSVAFSYVSSVITTEVNKYAERVL
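The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch using the standard genetic code only; the function name `translate` and the variable names are illustrative, and only the first 30 bases of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
cds_start = "ATGATAGCCAATACACCCGCCTCGGCCTCC"
print(translate(cds_start))
```

The output, MIANTPASAS, matches the first ten residues of the protein sequence. Passing the full CDS string should reproduce the whole protein.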
BLAST of Cp4.1LG20g00080 vs. Swiss-Prot
Match: IDD15_ARATH (Protein SHOOT GRAVITROPISM 5 OS=Arabidopsis thaliana GN=SGR5 PE=1 SV=1)

HSP 1 Score: 388.7 bits (997), Expect = 8.3e-107
Identity = 235/442 (53.17%), Positives = 292/442 (66.06%), Query Frame = 1

Query: 4   NTPASASLSLPSSEPY-SCLENGSSNSN----KRKRRPAGTPDPDAEVVSLSPKTLLESD 63
           NT     +S  SS+P+ S  ENG + +N    KRKRRPAGTPDPDAEVVSLSP+TLLESD
Sbjct: 12  NTNTCCVVSSSSSDPFLSSSENGVTTTNTSTQKRKRRPAGTPDPDAEVVSLSPRTLLESD 71

Query: 64  RYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPV-VRKRVFVCPEPSCLHHDPCHAL 123
           RY+CEICNQGFQRDQNLQMHRRRHKVPWKLLKR+  + V+KRV+VCPEP+CLHH+PCHAL
Sbjct: 72  RYICEICNQGFQRDQNLQMHRRRHKVPWKLLKRDNNIEVKKRVYVCPEPTCLHHNPCHAL 131

Query: 124 GDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVES 183
           GDLVGIKKHFRRKHSNHKQWVCE+CSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVES
Sbjct: 132 GDLVGIKKHFRRKHSNHKQWVCERCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVES 191

Query: 184 FIEHQDACNMGHLRQE------SQTLHPACLSRTASSPS-PPSDANFTAPPTAIFLAPTD 243
           FIEHQD C+   + +E      +    PAC SRTAS+ S P S+ N+          P +
Sbjct: 192 FIEHQDNCSARRVHREPPRPPQTAVTVPACSSRTASTVSTPSSETNYGGTVAVTTPQPLE 251

Query: 244 ---------DSIINNNNNNNNNSHHNLDLKLSTSTSNEG----------------SSTQL 303
                     SI+ N++NN N     L L  + + + E                  +T L
Sbjct: 252 GRPIHQRISSSILTNSSNNLNLELQLLPLSSNQNPNQENQQQKVKEPSHHHNHNHDTTNL 311

Query: 304 QLSIGSCDFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQ 363
            LSI              + +              + + ++ AM EKAYAEEA+++AKRQ
Sbjct: 312 NLSIAPSSSYQHYNNFDRIKEIMA-----------SEQIMKIAMKEKAYAEEAKREAKRQ 371

Query: 364 IEMAEEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTT 408
            E+AE EF NAK++RQ+AQAEL++A  LK+ ++K+I+ST++++TCQ C+ QFQA      
Sbjct: 372 REIAENEFANAKKIRQKAQAELERAKFLKEQSMKKISSTIMQVTCQTCKGQFQA----VA 431
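The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the counts as a consistency check; the regular expression and the function name `check_hsp_line` are assumptions made for this example and are not part of any BLAST tool.

```python
import re

# Matches fields of the form "Label = matches/length (percent%)".
HSP_FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")


def check_hsp_line(line):
    """Return {label: (recomputed_percent, reported_percent)} for one line."""
    results = {}
    for label, matches, length, reported in HSP_FIELD.findall(line):
        recomputed = round(100.0 * int(matches) / int(length), 2)
        results[label] = (recomputed, float(reported))
    return results


line = "Identity = 235/442 (53.17%), Positives = 292/442 (66.06%), Query Frame = 1"
for label, (recomputed, reported) in check_hsp_line(line).items():
    print(label, recomputed, reported)
```

For the IDD15_ARATH hit, 235 of 442 aligned positions gives 53.17% identity and 292 of 442 gives 66.06% positives, in agreement with the reported values.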

BLAST of Cp4.1LG20g00080 vs. Swiss-Prot
Match: IDD14_ARATH (Protein indeterminate-domain 14 OS=Arabidopsis thaliana GN=IDD14 PE=1 SV=1)

HSP 1 Score: 364.0 bits (933), Expect = 2.2e-99
Identity = 233/446 (52.24%), Positives = 278/446 (62.33%), Query Frame = 1

Query: 4   NTPASASLS--LPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRYV 63
           N P S+S S  LP         NG++ + KRKRRPAGTPDP+AEVVSLSP+TLLESDRYV
Sbjct: 19  NPPPSSSSSDLLPDG-------NGTAVTQKRKRRPAGTPDPEAEVVSLSPRTLLESDRYV 78

Query: 64  CEICNQGFQRDQNLQMHRRRHKVPWKLLKRET-PVVRKRVFVCPEPSCLHHDPCHALGDL 123
           CEICNQGFQRDQNLQMHRRRHKVPWKLLKRET   VRKRV+VCPEP+CLHH+PCHALGDL
Sbjct: 79  CEICNQGFQRDQNLQMHRRRHKVPWKLLKRETNEEVRKRVYVCPEPTCLHHNPCHALGDL 138

Query: 124 VGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 183
           VGIKKHFRRKHSNHKQW+CE+CSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE
Sbjct: 139 VGIKKHFRRKHSNHKQWICERCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 198

Query: 184 HQDACNM-----GHLRQESQTLHPACLSRTASS-----------------------PSPP 243
           HQD C +      + R   Q  H    ++TAS+                        SPP
Sbjct: 199 HQDTCTVRRSQPSNHRLHEQQQHTTNATQTASTAENNENGDLSIGPILPGHPLQRRQSPP 258

Query: 244 SDANFTAPPTAIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTSTSNEGSSTQLQLSIGSC 303
           S+     P T ++   T+ SI               +L+L  S  N    T L LSIG+ 
Sbjct: 259 SEQQ---PSTLLYPFVTNGSI---------------ELQLLPS-RNCADETSLSLSIGTM 318

Query: 304 DFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEE 363
           D       V+    EKG                 E   E+   EEAR++ KRQIE+AE E
Sbjct: 319 D-QKTMSEVEKKSYEKG-----------------ETSLER---EEARRETKRQIEIAELE 378

Query: 364 FVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTT--TTTTAP 417
           F  AKR+RQ A+AEL KA   ++ A ++I++T+++ITC  C++ FQA          T  
Sbjct: 379 FAEAKRIRQHARAELHKAHLFREEASRRISATMMQITCHNCKQHFQAPAALVPPPPQTHC 416

BLAST of Cp4.1LG20g00080 vs. Swiss-Prot
Match: IDD16_ARATH (Protein indeterminate-domain 16 OS=Arabidopsis thaliana GN=IDD16 PE=2 SV=1)

HSP 1 Score: 344.7 bits (883), Expect = 1.4e-93
Identity = 208/369 (56.37%), Positives = 247/369 (66.94%), Query Frame = 1

Query: 41  DPDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETP--VVRK 100
           DPDAEVVSLSP+TLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR+     VRK
Sbjct: 20  DPDAEVVSLSPRTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRDKKDEEVRK 79

Query: 101 RVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKT 160
           RV+VCPEP+CLHHDPCHALGDLVGIKKHFRRKHS HKQWVCE+CSK YAVQSDYKAHLKT
Sbjct: 80  RVYVCPEPTCLHHDPCHALGDLVGIKKHFRRKHSVHKQWVCERCSKGYAVQSDYKAHLKT 139

Query: 161 CGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANFT 220
           CG+RGHSCDCGRVFSRVESFIEHQD C    +RQ   T H      T    +P    +  
Sbjct: 140 CGSRGHSCDCGRVFSRVESFIEHQDTCT---IRQPQPTNHRHLQQHTMGLDAPSRTTS-- 199

Query: 221 APPTAIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTSTSNEGSSTQLQLSIGSCDFGDKK 280
              TA F  P    +        +N H         ++S    S +LQLSIG       +
Sbjct: 200 ---TASF-GPLLHGLPLLRPPRPSNQHSPAFAYPFNASSAPFESLELQLSIGMA-----R 259

Query: 281 KRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVNAKR 340
              Q   +EK   R     +E A EE R+       AEE RQ+AKRQIEMAE++F  AKR
Sbjct: 260 TSAQARHNEK---RETSLTKERANEEARK-------AEETRQEAKRQIEMAEKDFEKAKR 319

Query: 341 MRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDTSVA 400
           +R++A+ EL+KA  +++ AIK+IN+T++EITC +C++ FQ   T   +T       +S+ 
Sbjct: 320 IREEAKTELEKAHVVREEAIKRINATMMEITCHSCKQLFQLPVTADEST-------SSLV 357

Query: 401 FSYVSSVIT 408
            SYVSS  T
Sbjct: 380 MSYVSSATT 357

BLAST of Cp4.1LG20g00080 vs. Swiss-Prot
Match: IDD3_ARATH (Zinc finger protein MAGPIE OS=Arabidopsis thaliana GN=MGP PE=1 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 5.5e-58
Identity = 105/170 (61.76%), Positives = 132/170 (77.65%), Query Frame = 1

Query: 31  KRKRRPAGTPDPDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLK 90
           K+KR   G PDP+AEV++LSPKTL+ ++R++CEIC +GFQRDQNLQ+HRR H +PWKL +
Sbjct: 41  KKKRNLPGNPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLPWKLKQ 100

Query: 91  RETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQS 150
           R +  VRKRV+VCPE SC+HH P  ALGDL GIKKHF RKH   K+W CEKC+K YAVQS
Sbjct: 101 RTSKEVRKRVYVCPEKSCVHHHPTRALGDLTGIKKHFCRKH-GEKKWKCEKCAKRYAVQS 160

Query: 151 DYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRQESQTLHPA 201
           D+KAH KTCGTR + CDCG +FSR +SFI H+  C+   L +E+  L+ A
Sbjct: 161 DWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDA--LAEETARLNAA 207

BLAST of Cp4.1LG20g00080 vs. Swiss-Prot
Match: IDD2_ARATH (Protein indeterminate-domain 2 OS=Arabidopsis thaliana GN=IDD2 PE=2 SV=1)

HSP 1 Score: 226.5 bits (576), Expect = 5.5e-58
Identity = 112/209 (53.59%), Positives = 148/209 (70.81%), Query Frame = 1

Query: 9   ASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRYVCEICNQG 68
           AS+S+ S+   + L N   ++ K+KR   G PDP++EV++LSPKTLL ++R+VCEICN+G
Sbjct: 15  ASVSISSTGNQNPLPN---STGKKKRNLPGMPDPESEVIALSPKTLLATNRFVCEICNKG 74

Query: 69  FQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFR 128
           FQRDQNLQ+HRR H +PWKL ++    V+K+V+VCPE SC+HHDP  ALGDL GIKKHF 
Sbjct: 75  FQRDQNLQLHRRGHNLPWKLRQKSNKEVKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFC 134

Query: 129 RKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDAC--- 188
           RKH   K+W C+KCSK YAVQSD+KAH K CGT+ + CDCG +FSR +SFI H+  C   
Sbjct: 135 RKH-GEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYKCDCGTLFSRRDSFITHRAFCDAL 194

Query: 189 ---NMGHLRQESQTLHPACLSRTASSPSP 212
              N      +S+  +P  L+R    P+P
Sbjct: 195 AEENARSHHSQSKKQNPEILTRKNPVPNP 219

BLAST of Cp4.1LG20g00080 vs. TrEMBL
Match: A0A0A0LPV5_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G029620 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 1.4e-174
Identity = 347/449 (77.28%), Positives = 370/449 (82.41%), Query Frame = 1

Query: 1   MIANTPASASLSLP-SSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60
           MI NT  SAS  LP SSEPYSCLENG++N+NKRKRRPAGTPDPDAEVVSLSPKTLLESDR
Sbjct: 1   MIGNTATSASPLLPNSSEPYSCLENGNNNNNKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60

Query: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD 120
           YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRE+PVVRKRVFVCPEP+CLHHDPCHALGD
Sbjct: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRESPVVRKRVFVCPEPTCLHHDPCHALGD 120

Query: 121 LVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180
           LVGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI
Sbjct: 121 LVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180

Query: 181 EHQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANFTAPPT----------------- 240
           EHQDACNMGHLRQESQ + PACLSRTASSPSP SD NF++ P                  
Sbjct: 181 EHQDACNMGHLRQESQ-VQPACLSRTASSPSPSSDTNFSSTPAPSSNWHALVTPPLTLKP 240

Query: 241 --AIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTST-----------SNEGSSTQLQLSI 300
             AIFL PT DS      NNNNNS HNLDLKLST++           + +GSST+L+LS+
Sbjct: 241 VDAIFLTPTGDS------NNNNNSDHNLDLKLSTASNGVEGRNNYNNNKKGSSTKLELSM 300

Query: 301 GSCDFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMA 360
           GS DF D+KK++  LDD      G G VREEAREELR AMAEKAYAEEAR+QAKRQIEMA
Sbjct: 301 GSFDFEDEKKKMLKLDD-----GGAGDVREEAREELRVAMAEKAYAEEARKQAKRQIEMA 360

Query: 361 EEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAK-------PT 410
           EEEF NAKRMRQQAQAELDKATALKQAAIKQINST+LEITCQACQKQFQAK        T
Sbjct: 361 EEEFGNAKRMRQQAQAELDKATALKQAAIKQINSTILEITCQACQKQFQAKTKTKTKTTT 420

BLAST of Cp4.1LG20g00080 vs. TrEMBL
Match: A0A061GQZ7_THECC (C2H2-like zinc finger protein OS=Theobroma cacao GN=TCM_039880 PE=4 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 2.3e-148
Identity = 296/432 (68.52%), Positives = 339/432 (78.47%), Query Frame = 1

Query: 1   MIANTPASASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60
           M+ N   S   S+PSSEP+SC+E G++N NKRKRRPAGTPDPDAEVVSLSPKTLLESDRY
Sbjct: 1   MLTNNSCS---SVPSSEPFSCIEIGNNN-NKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60

Query: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL 120
           VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL
Sbjct: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL 120

Query: 121 VGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180
           VGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE
Sbjct: 121 VGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180

Query: 181 HQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANF-TAPPTAIFLAPTDDSII---NN 240
           HQDAC+MGH+R ESQ L PACLSRTASSPSP SD NF TAP  ++ LA T D++      
Sbjct: 181 HQDACHMGHIRPESQALQPACLSRTASSPSPSSDTNFSTAPWPSLVLAKTTDTMFLSPTK 240

Query: 241 NNNNNNNSHHNLDLKLSTST----------SNEGSSTQLQLSIGSCDFGDKKKRVQLLDD 300
           +N+  N  +HNL+L+L T++          +++  STQLQLSIGS D G+K +      +
Sbjct: 241 DNSPKNAHYHNLELQLLTTSNPTELSVSPKTDDKHSTQLQLSIGSSDIGEKIESTVTCTN 300

Query: 301 EKGGGRGP-----------GGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVN 360
           +    + P             ++E+ARE+LR AMAEKA+AEE RQQAKRQIE+AE+EF N
Sbjct: 301 KDASKKSPHQESEKPTFVASRLKEQAREQLRLAMAEKAFAEEVRQQAKRQIELAEQEFAN 360

Query: 361 AKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDT 408
           AKR+RQQAQAELDKA ALK  AIKQINST+L+ITC AC++QFQA+       T P   + 
Sbjct: 361 AKRIRQQAQAELDKAQALKDHAIKQINSTILQITCHACKQQFQAR-------TPP--EEN 419

BLAST of Cp4.1LG20g00080 vs. TrEMBL
Match: U5FLX0_POPTR (Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s05180g PE=4 SV=1)

HSP 1 Score: 519.6 bits (1337), Expect = 3.5e-144
Identity = 300/450 (66.67%), Positives = 345/450 (76.67%), Query Frame = 1

Query: 1   MIANTPASASLSLPSS-EPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60
           M+AN+  S   SLPS+ EP+ CLENG+SN NKRKRRPAGTPDPDAEVVSLSPKTLLESDR
Sbjct: 1   MLANSSPS---SLPSNPEPFFCLENGNSN-NKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60

Query: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD 120
           YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD
Sbjct: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD 120

Query: 121 LVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180
           LVGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI
Sbjct: 121 LVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180

Query: 181 EHQDACNMGHLRQESQTLHP-ACLSRTASSPSPPSDANF-TAP--------PT-----AI 240
           EHQD+CNMG LR ESQ+L P ACLSRTASSPSP SD NF TAP        PT     A+
Sbjct: 181 EHQDSCNMGRLRSESQSLQPAACLSRTASSPSPSSDNNFSTAPWPPLIISRPTTTSDHAM 240

Query: 241 FLAPTD-DSIINNNNNNNNNSHH-NLDLKLSTSTSN-----------EGSSTQLQLSIGS 300
           F +PT    +++  +++ + +H+ NL+L+LST++ N           +  STQLQLSIGS
Sbjct: 241 FFSPTTATDVVDKTDSSKSAAHYQNLELQLSTTSRNPPEVSVSPKRDDNHSTQLQLSIGS 300

Query: 301 CDFGDKKKRVQLLDDEKGGGR--------------GPGGVREEAREELREAMAEKAYAEE 360
            D  D+ +      ++   G+              G   ++E+ARE+LR AMAEK YAEE
Sbjct: 301 SDVSDRNESNITYTNKDHAGKSFPRESNNSPKPELGASRLKEQAREQLRMAMAEKIYAEE 360

Query: 361 ARQQAKRQIEMAEEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQF 408
           ARQQAKRQIE+AE+EF NAKR+RQQAQAEL KA AL+Q AIKQINST+L+ITC AC+++F
Sbjct: 361 ARQQAKRQIELAEQEFANAKRIRQQAQAELGKAQALRQHAIKQINSTILQITCHACKQKF 420

BLAST of Cp4.1LG20g00080 vs. TrEMBL
Match: A0A0B0N813_GOSAR (Zinc finger MAGPIE-like protein OS=Gossypium arboreum GN=F383_34553 PE=4 SV=1)

HSP 1 Score: 518.8 bits (1335), Expect = 5.9e-144
Identity = 289/430 (67.21%), Positives = 330/430 (76.74%), Query Frame = 1

Query: 1   MIANTPASASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60
           M+AN   S   SLPSSEP+SCLENG++N NKRKRRPAGTPDPDAEVVSLSPKTLLESDRY
Sbjct: 1   MLANNSCS---SLPSSEPFSCLENGTTNINKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60

Query: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL 120
           VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEP+CLHHDPCHALGDL
Sbjct: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPTCLHHDPCHALGDL 120

Query: 121 VGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180
           VGIKKHFRRKHSNHKQWVC+KCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE
Sbjct: 121 VGIKKHFRRKHSNHKQWVCDKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180

Query: 181 HQDACNMGH-LRQESQTLHPACLSRTASSPSPPSDANFTAPP------------TAIFLA 240
           HQDAC+MG  +R ESQ +  ACLSRTASSPSP +D +F+  P             A+FL+
Sbjct: 181 HQDACHMGRVIRPESQGVQLACLSRTASSPSPSNDTHFSTAPCPWPNLALPKTKDAMFLS 240

Query: 241 PTDDSIINNNNNNNNNSHHNLDLKLSTSTS----------NEGSSTQLQLSIGSCDFGDK 300
           PT D     +N+  N   HNL+L+L T+++          ++  ST LQLSIGS D GDK
Sbjct: 241 PTKD-----HNSPKNAHSHNLELQLLTTSNPSEVSVSPKKHDNHSTHLQLSIGSSDIGDK 300

Query: 301 KKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVNAK 360
            +   +    K      G   ++A + L+ AM EK YAEEARQ+AKRQIE+AE+EF  AK
Sbjct: 301 VEYSTVTCTHKDASNKSGHHEKQAPQPLKLAMEEKTYAEEARQEAKRQIEIAEQEFAKAK 360

Query: 361 RMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDTSV 408
           R+RQQAQAEL+KA ALK  AIKQINST+L+ITCQACQ QFQA+       T P   + S+
Sbjct: 361 RIRQQAQAELEKAQALKDHAIKQINSTILQITCQACQHQFQAR-------TPP--EENSL 413

BLAST of Cp4.1LG20g00080 vs. TrEMBL
Match: F6GSS5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g01870 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 1.3e-143
Identity = 289/428 (67.52%), Positives = 333/428 (77.80%), Query Frame = 1

Query: 15  SSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRYVCEICNQGFQRDQN 74
           S+EP++ LENGS++  KRKRRPAGTPDPDAEVVSLSPKTLLESDRY+CEICNQGFQRDQN
Sbjct: 14  SAEPFASLENGSNS--KRKRRPAGTPDPDAEVVSLSPKTLLESDRYICEICNQGFQRDQN 73

Query: 75  LQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNH 134
           LQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNH
Sbjct: 74  LQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNH 133

Query: 135 KQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRQES 194
           KQWVCEKC+K YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDACNMGHLR ES
Sbjct: 134 KQWVCEKCNKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRPES 193

Query: 195 QTLHP-ACLSRTASSPSPPSDANFTAPPTAIFLA--PTDDSIINNNNNNNNNS------H 254
           Q L P ACLSRTASSPSP S+ NF+ PP +  +   P D   + ++ +NNNN+      +
Sbjct: 194 QLLQPAACLSRTASSPSPSSETNFSVPPWSGLMTPRPVDSIFLTSDGDNNNNNPPKKAHY 253

Query: 255 HNLDLKLSTS----------TSNEGSSTQLQLSIGSCDFGDKKKR--VQLLDDEKGGGR- 314
           HNL+L+L T+           ++E  STQLQLSIGS DF +K +   + L++ E      
Sbjct: 254 HNLELQLLTTPNPLVALASPKADENHSTQLQLSIGSSDFNEKNESSIINLINKEYSAPAR 313

Query: 315 -------------GPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVNAKRM 374
                        G   ++EEARE+LR AM EK YAEEARQQAKRQIE+A++EF +AKR+
Sbjct: 314 CPRECNTSEKATFGAARLKEEAREQLRLAMEEKVYAEEARQQAKRQIELADKEFTHAKRI 373

Query: 375 RQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDTSVAF 408
           RQQAQAELDKA ALK+ A KQINST+L+ITC AC++QF+   T T    AP D + S+  
Sbjct: 374 RQQAQAELDKAQALKEHARKQINSTILQITCHACKQQFR---TRTAGNVAPPD-ENSLVL 433

BLAST of Cp4.1LG20g00080 vs. TAIR10
Match: AT2G01940.3 (AT2G01940.3 C2H2-like zinc finger protein)

HSP 1 Score: 375.9 bits (964), Expect = 3.1e-104
Identity = 232/443 (52.37%), Positives = 289/443 (65.24%), Query Frame = 1

Query: 4   NTPASASLSLPSSEPY-SCLENGSSNSN----KRKRRPAGTPDPDAEVVSLSPKTLLESD 63
           NT     +S  SS+P+ S  ENG + +N    KRKRRPAGTPDPDAEVVSLSP+TLLESD
Sbjct: 12  NTNTCCVVSSSSSDPFLSSSENGVTTTNTSTQKRKRRPAGTPDPDAEVVSLSPRTLLESD 71

Query: 64  RYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPV-VRKRVFVCPEPSCLHHDPCHAL 123
           RY+CEICNQGFQRDQNLQMHRRRHKVPWKLLKR+  + V+KRV+VCPEP+CLHH+PCHAL
Sbjct: 72  RYICEICNQGFQRDQNLQMHRRRHKVPWKLLKRDNNIEVKKRVYVCPEPTCLHHNPCHAL 131

Query: 124 GDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFS-RVE 183
           GDLVGIKKHFRRKHSNHKQWVCE+CSK YAVQSDYKAHLKTCGTRGHSCDCG   S RVE
Sbjct: 132 GDLVGIKKHFRRKHSNHKQWVCERCSKGYAVQSDYKAHLKTCGTRGHSCDCGFFSSFRVE 191

Query: 184 SFIEHQDACNMGHLRQE------SQTLHPACLSRTASSPS-PPSDANFTAPPTAIFLAPT 243
           SFIEHQD C+   + +E      +    PAC SRTAS+ S P S+ N+          P 
Sbjct: 192 SFIEHQDNCSARRVHREPPRPPQTAVTVPACSSRTASTVSTPSSETNYGGTVAVTTPQPL 251

Query: 244 D---------DSIINNNNNNNNNSHHNLDLKLSTSTSNEG----------------SSTQ 303
           +          SI+ N++NN N     L L  + + + E                  +T 
Sbjct: 252 EGRPIHQRISSSILTNSSNNLNLELQLLPLSSNQNPNQENQQQKVKEPSHHHNHNHDTTN 311

Query: 304 LQLSIGSCDFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKR 363
           L LSI              + +              + + ++ AM EKAYAEEA+++AKR
Sbjct: 312 LNLSIAPSSSYQHYNNFDRIKEIMA-----------SEQIMKIAMKEKAYAEEAKREAKR 371

Query: 364 QIEMAEEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTT 408
           Q E+AE EF NAK++RQ+AQAEL++A  LK+ ++K+I+ST++++TCQ C+ QFQA     
Sbjct: 372 QREIAENEFANAKKIRQKAQAELERAKFLKEQSMKKISSTIMQVTCQTCKGQFQA----V 431

BLAST of Cp4.1LG20g00080 vs. TAIR10
Match: AT1G68130.1 (AT1G68130.1 indeterminate(ID)-domain 14)

HSP 1 Score: 364.0 bits (933), Expect = 1.2e-100
Identity = 233/446 (52.24%), Positives = 278/446 (62.33%), Query Frame = 1

Query: 4   NTPASASLS--LPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRYV 63
           N P S+S S  LP         NG++ + KRKRRPAGTPDP+AEVVSLSP+TLLESDRYV
Sbjct: 19  NPPPSSSSSDLLPDG-------NGTAVTQKRKRRPAGTPDPEAEVVSLSPRTLLESDRYV 78

Query: 64  CEICNQGFQRDQNLQMHRRRHKVPWKLLKRET-PVVRKRVFVCPEPSCLHHDPCHALGDL 123
           CEICNQGFQRDQNLQMHRRRHKVPWKLLKRET   VRKRV+VCPEP+CLHH+PCHALGDL
Sbjct: 79  CEICNQGFQRDQNLQMHRRRHKVPWKLLKRETNEEVRKRVYVCPEPTCLHHNPCHALGDL 138

Query: 124 VGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 183
           VGIKKHFRRKHSNHKQW+CE+CSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE
Sbjct: 139 VGIKKHFRRKHSNHKQWICERCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 198

Query: 184 HQDACNM-----GHLRQESQTLHPACLSRTASS-----------------------PSPP 243
           HQD C +      + R   Q  H    ++TAS+                        SPP
Sbjct: 199 HQDTCTVRRSQPSNHRLHEQQQHTTNATQTASTAENNENGDLSIGPILPGHPLQRRQSPP 258

Query: 244 SDANFTAPPTAIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTSTSNEGSSTQLQLSIGSC 303
           S+     P T ++   T+ SI               +L+L  S  N    T L LSIG+ 
Sbjct: 259 SEQQ---PSTLLYPFVTNGSI---------------ELQLLPS-RNCADETSLSLSIGTM 318

Query: 304 DFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEE 363
           D       V+    EKG                 E   E+   EEAR++ KRQIE+AE E
Sbjct: 319 D-QKTMSEVEKKSYEKG-----------------ETSLER---EEARRETKRQIEIAELE 378

Query: 364 FVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTT--TTTTAP 417
           F  AKR+RQ A+AEL KA   ++ A ++I++T+++ITC  C++ FQA          T  
Sbjct: 379 FAEAKRIRQHARAELHKAHLFREEASRRISATMMQITCHNCKQHFQAPAALVPPPPQTHC 416

BLAST of Cp4.1LG20g00080 vs. TAIR10
Match: AT1G25250.1 (AT1G25250.1 indeterminate(ID)-domain 16)

HSP 1 Score: 344.7 bits (883), Expect = 7.8e-95
Identity = 208/369 (56.37%), Positives = 247/369 (66.94%), Query Frame = 1

Query: 41  DPDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETP--VVRK 100
           DPDAEVVSLSP+TLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKR+     VRK
Sbjct: 20  DPDAEVVSLSPRTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRDKKDEEVRK 79

Query: 101 RVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKT 160
           RV+VCPEP+CLHHDPCHALGDLVGIKKHFRRKHS HKQWVCE+CSK YAVQSDYKAHLKT
Sbjct: 80  RVYVCPEPTCLHHDPCHALGDLVGIKKHFRRKHSVHKQWVCERCSKGYAVQSDYKAHLKT 139

Query: 161 CGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANFT 220
           CG+RGHSCDCGRVFSRVESFIEHQD C    +RQ   T H      T    +P    +  
Sbjct: 140 CGSRGHSCDCGRVFSRVESFIEHQDTCT---IRQPQPTNHRHLQQHTMGLDAPSRTTS-- 199

Query: 221 APPTAIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTSTSNEGSSTQLQLSIGSCDFGDKK 280
              TA F  P    +        +N H         ++S    S +LQLSIG       +
Sbjct: 200 ---TASF-GPLLHGLPLLRPPRPSNQHSPAFAYPFNASSAPFESLELQLSIGMA-----R 259

Query: 281 KRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVNAKR 340
              Q   +EK   R     +E A EE R+       AEE RQ+AKRQIEMAE++F  AKR
Sbjct: 260 TSAQARHNEK---RETSLTKERANEEARK-------AEETRQEAKRQIEMAEKDFEKAKR 319

Query: 341 MRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDTSVA 400
           +R++A+ EL+KA  +++ AIK+IN+T++EITC +C++ FQ   T   +T       +S+ 
Sbjct: 320 IREEAKTELEKAHVVREEAIKRINATMMEITCHSCKQLFQLPVTADEST-------SSLV 357

Query: 401 FSYVSSVIT 408
            SYVSS  T
Sbjct: 380 MSYVSSATT 357

BLAST of Cp4.1LG20g00080 vs. TAIR10
Match: AT1G03840.1 (AT1G03840.1 C2H2 and C2HC zinc fingers superfamily protein)

HSP 1 Score: 226.5 bits (576), Expect = 3.1e-59
Identity = 105/170 (61.76%), Positives = 132/170 (77.65%), Query Frame = 1

Query: 31  KRKRRPAGTPDPDAEVVSLSPKTLLESDRYVCEICNQGFQRDQNLQMHRRRHKVPWKLLK 90
           K+KR   G PDP+AEV++LSPKTL+ ++R++CEIC +GFQRDQNLQ+HRR H +PWKL +
Sbjct: 41  KKKRNLPGNPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLPWKLKQ 100

Query: 91  RETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQS 150
           R +  VRKRV+VCPE SC+HH P  ALGDL GIKKHF RKH   K+W CEKC+K YAVQS
Sbjct: 101 RTSKEVRKRVYVCPEKSCVHHHPTRALGDLTGIKKHFCRKH-GEKKWKCEKCAKRYAVQS 160

Query: 151 DYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDACNMGHLRQESQTLHPA 201
           D+KAH KTCGTR + CDCG +FSR +SFI H+  C+   L +E+  L+ A
Sbjct: 161 DWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDA--LAEETARLNAA 207

BLAST of Cp4.1LG20g00080 vs. TAIR10
Match: AT3G50700.1 (AT3G50700.1 indeterminate(ID)-domain 2)

HSP 1 Score: 226.5 bits (576), Expect = 3.1e-59
Identity = 112/209 (53.59%), Positives = 148/209 (70.81%), Query Frame = 1

Query: 9   ASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRYVCEICNQG 68
           AS+S+ S+   + L N   ++ K+KR   G PDP++EV++LSPKTLL ++R+VCEICN+G
Sbjct: 15  ASVSISSTGNQNPLPN---STGKKKRNLPGMPDPESEVIALSPKTLLATNRFVCEICNKG 74

Query: 69  FQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDLVGIKKHFR 128
           FQRDQNLQ+HRR H +PWKL ++    V+K+V+VCPE SC+HHDP  ALGDL GIKKHF 
Sbjct: 75  FQRDQNLQLHRRGHNLPWKLRQKSNKEVKKKVYVCPEVSCVHHDPSRALGDLTGIKKHFC 134

Query: 129 RKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIEHQDAC--- 188
           RKH   K+W C+KCSK YAVQSD+KAH K CGT+ + CDCG +FSR +SFI H+  C   
Sbjct: 135 RKH-GEKKWKCDKCSKKYAVQSDWKAHSKICGTKEYKCDCGTLFSRRDSFITHRAFCDAL 194

Query: 189 ---NMGHLRQESQTLHPACLSRTASSPSP 212
              N      +S+  +P  L+R    P+P
Sbjct: 195 AEENARSHHSQSKKQNPEILTRKNPVPNP 219

BLAST of Cp4.1LG20g00080 vs. NCBI nr
Match: gi|659066251|ref|XP_008452902.1| (PREDICTED: protein SHOOT GRAVITROPISM 5-like [Cucumis melo])

HSP 1 Score: 620.9 bits (1600), Expect = 1.6e-174
Identity = 352/451 (78.05%), Positives = 372/451 (82.48%), Query Frame = 1

Query: 1   MIANTPASASLSLP-SSEPYSCLENGSSN-SNKRKRRPAGTPDPDAEVVSLSPKTLLESD 60
           MI NT  SAS  LP SSEPYSCLENG++N +NKRKRRPAGTPDPDAEVVSLSPKTLLESD
Sbjct: 1   MIGNTATSASPLLPNSSEPYSCLENGNNNINNKRKRRPAGTPDPDAEVVSLSPKTLLESD 60

Query: 61  RYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALG 120
           RYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRE+PVVRKRVFVCPEP+CLHHDPCHALG
Sbjct: 61  RYVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRESPVVRKRVFVCPEPTCLHHDPCHALG 120

Query: 121 DLVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESF 180
           DLVGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESF
Sbjct: 121 DLVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESF 180

Query: 181 IEHQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANF--------------TAPPT-- 240
           IEHQDACNMGHLRQE Q + PACLSRTASSPSP SD NF              TAPPT  
Sbjct: 181 IEHQDACNMGHLRQEPQ-VQPACLSRTASSPSPSSDTNFSSTPAPSNNWPALVTAPPTLK 240

Query: 241 ---AIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTST----------SNEGSSTQLQLSI 300
              AIFL PT DS      NNNNN+ HNLDLKLST++          SN+GSST+L+LS+
Sbjct: 241 PVDAIFLTPTGDS------NNNNNNDHNLDLKLSTASNGVKGRNNNNSNKGSSTKLELSM 300

Query: 301 GSCDFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMA 360
           GS DF D+KK++  LDD      G G VREEAREELR AMAEKAYAEEAR+QAKRQIEMA
Sbjct: 301 GSFDFEDEKKKMLKLDD-----GGAGDVREEAREELRVAMAEKAYAEEARKQAKRQIEMA 360

Query: 361 EEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAK--------- 410
           EEEF NAKRMRQQAQAELDKATALKQAAIKQINST+LEITCQACQKQFQAK         
Sbjct: 361 EEEFGNAKRMRQQAQAELDKATALKQAAIKQINSTILEITCQACQKQFQAKAKTKTKTTT 420

BLAST of Cp4.1LG20g00080 vs. NCBI nr
Match: gi|778656670|ref|XP_011649488.1| (PREDICTED: protein SHOOT GRAVITROPISM 5 [Cucumis sativus])

HSP 1 Score: 620.5 bits (1599), Expect = 2.1e-174
Identity = 347/449 (77.28%), Positives = 370/449 (82.41%), Query Frame = 1

Query: 1   MIANTPASASLSLP-SSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60
           MI NT  SAS  LP SSEPYSCLENG++N+NKRKRRPAGTPDPDAEVVSLSPKTLLESDR
Sbjct: 1   MIGNTATSASPLLPNSSEPYSCLENGNNNNNKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60

Query: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD 120
           YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRE+PVVRKRVFVCPEP+CLHHDPCHALGD
Sbjct: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRESPVVRKRVFVCPEPTCLHHDPCHALGD 120

Query: 121 LVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180
           LVGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI
Sbjct: 121 LVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180

Query: 181 EHQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANFTAPPT----------------- 240
           EHQDACNMGHLRQESQ + PACLSRTASSPSP SD NF++ P                  
Sbjct: 181 EHQDACNMGHLRQESQ-VQPACLSRTASSPSPSSDTNFSSTPAPSSNWHALVTPPLTLKP 240

Query: 241 --AIFLAPTDDSIINNNNNNNNNSHHNLDLKLSTST-----------SNEGSSTQLQLSI 300
             AIFL PT DS      NNNNNS HNLDLKLST++           + +GSST+L+LS+
Sbjct: 241 VDAIFLTPTGDS------NNNNNSDHNLDLKLSTASNGVEGRNNYNNNKKGSSTKLELSM 300

Query: 301 GSCDFGDKKKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMA 360
           GS DF D+KK++  LDD      G G VREEAREELR AMAEKAYAEEAR+QAKRQIEMA
Sbjct: 301 GSFDFEDEKKKMLKLDD-----GGAGDVREEAREELRVAMAEKAYAEEARKQAKRQIEMA 360

Query: 361 EEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAK-------PT 410
           EEEF NAKRMRQQAQAELDKATALKQAAIKQINST+LEITCQACQKQFQAK        T
Sbjct: 361 EEEFGNAKRMRQQAQAELDKATALKQAAIKQINSTILEITCQACQKQFQAKTKTKTKTTT 420

BLAST of Cp4.1LG20g00080 vs. NCBI nr
Match: gi|590582445|ref|XP_007014626.1| (C2H2-like zinc finger protein [Theobroma cacao])

HSP 1 Score: 533.5 bits (1373), Expect = 3.3e-148
Identity = 296/432 (68.52%), Positives = 339/432 (78.47%), Query Frame = 1

Query: 1   MIANTPASASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60
           M+ N   S   S+PSSEP+SC+E G++N NKRKRRPAGTPDPDAEVVSLSPKTLLESDRY
Sbjct: 1   MLTNNSCS---SVPSSEPFSCIEIGNNN-NKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60

Query: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL 120
           VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL
Sbjct: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL 120

Query: 121 VGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180
           VGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE
Sbjct: 121 VGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180

Query: 181 HQDACNMGHLRQESQTLHPACLSRTASSPSPPSDANF-TAPPTAIFLAPTDDSII---NN 240
           HQDAC+MGH+R ESQ L PACLSRTASSPSP SD NF TAP  ++ LA T D++      
Sbjct: 181 HQDACHMGHIRPESQALQPACLSRTASSPSPSSDTNFSTAPWPSLVLAKTTDTMFLSPTK 240

Query: 241 NNNNNNNSHHNLDLKLSTST----------SNEGSSTQLQLSIGSCDFGDKKKRVQLLDD 300
           +N+  N  +HNL+L+L T++          +++  STQLQLSIGS D G+K +      +
Sbjct: 241 DNSPKNAHYHNLELQLLTTSNPTELSVSPKTDDKHSTQLQLSIGSSDIGEKIESTVTCTN 300

Query: 301 EKGGGRGP-----------GGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVN 360
           +    + P             ++E+ARE+LR AMAEKA+AEE RQQAKRQIE+AE+EF N
Sbjct: 301 KDASKKSPHQESEKPTFVASRLKEQAREQLRLAMAEKAFAEEVRQQAKRQIELAEQEFAN 360

Query: 361 AKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDT 408
           AKR+RQQAQAELDKA ALK  AIKQINST+L+ITC AC++QFQA+       T P   + 
Sbjct: 361 AKRIRQQAQAELDKAQALKDHAIKQINSTILQITCHACKQQFQAR-------TPP--EEN 419

BLAST of Cp4.1LG20g00080 vs. NCBI nr
Match: gi|566205919|ref|XP_006374223.1| (hypothetical protein POPTR_0015s05180g [Populus trichocarpa])

HSP 1 Score: 519.6 bits (1337), Expect = 5.0e-144
Identity = 300/450 (66.67%), Positives = 345/450 (76.67%), Query Frame = 1

Query: 1   MIANTPASASLSLPSS-EPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60
           M+AN+  S   SLPS+ EP+ CLENG+SN NKRKRRPAGTPDPDAEVVSLSPKTLLESDR
Sbjct: 1   MLANSSPS---SLPSNPEPFFCLENGNSN-NKRKRRPAGTPDPDAEVVSLSPKTLLESDR 60

Query: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD 120
           YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD
Sbjct: 61  YVCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGD 120

Query: 121 LVGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180
           LVGIKKHFRRKHSNHKQWVCEKCSK YAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI
Sbjct: 121 LVGIKKHFRRKHSNHKQWVCEKCSKGYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFI 180

Query: 181 EHQDACNMGHLRQESQTLHP-ACLSRTASSPSPPSDANF-TAP--------PT-----AI 240
           EHQD+CNMG LR ESQ+L P ACLSRTASSPSP SD NF TAP        PT     A+
Sbjct: 181 EHQDSCNMGRLRSESQSLQPAACLSRTASSPSPSSDNNFSTAPWPPLIISRPTTTSDHAM 240

Query: 241 FLAPTD-DSIINNNNNNNNNSHH-NLDLKLSTSTSN-----------EGSSTQLQLSIGS 300
           F +PT    +++  +++ + +H+ NL+L+LST++ N           +  STQLQLSIGS
Sbjct: 241 FFSPTTATDVVDKTDSSKSAAHYQNLELQLSTTSRNPPEVSVSPKRDDNHSTQLQLSIGS 300

Query: 301 CDFGDKKKRVQLLDDEKGGGR--------------GPGGVREEAREELREAMAEKAYAEE 360
            D  D+ +      ++   G+              G   ++E+ARE+LR AMAEK YAEE
Sbjct: 301 SDVSDRNESNITYTNKDHAGKSFPRESNNSPKPELGASRLKEQAREQLRMAMAEKIYAEE 360

Query: 361 ARQQAKRQIEMAEEEFVNAKRMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQF 408
           ARQQAKRQIE+AE+EF NAKR+RQQAQAEL KA AL+Q AIKQINST+L+ITC AC+++F
Sbjct: 361 ARQQAKRQIELAEQEFANAKRIRQQAQAELGKAQALRQHAIKQINSTILQITCHACKQKF 420

BLAST of Cp4.1LG20g00080 vs. NCBI nr
Match: gi|728827540|gb|KHG07236.1| (Zinc finger MAGPIE -like protein [Gossypium arboreum])

HSP 1 Score: 518.8 bits (1335), Expect = 8.5e-144
Identity = 289/430 (67.21%), Positives = 330/430 (76.74%), Query Frame = 1

Query: 1   MIANTPASASLSLPSSEPYSCLENGSSNSNKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60
           M+AN   S   SLPSSEP+SCLENG++N NKRKRRPAGTPDPDAEVVSLSPKTLLESDRY
Sbjct: 1   MLANNSCS---SLPSSEPFSCLENGTTNINKRKRRPAGTPDPDAEVVSLSPKTLLESDRY 60

Query: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPSCLHHDPCHALGDL 120
           VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEP+CLHHDPCHALGDL
Sbjct: 61  VCEICNQGFQRDQNLQMHRRRHKVPWKLLKRETPVVRKRVFVCPEPTCLHHDPCHALGDL 120

Query: 121 VGIKKHFRRKHSNHKQWVCEKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180
           VGIKKHFRRKHSNHKQWVC+KCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE
Sbjct: 121 VGIKKHFRRKHSNHKQWVCDKCSKAYAVQSDYKAHLKTCGTRGHSCDCGRVFSRVESFIE 180

Query: 181 HQDACNMGH-LRQESQTLHPACLSRTASSPSPPSDANFTAPP------------TAIFLA 240
           HQDAC+MG  +R ESQ +  ACLSRTASSPSP +D +F+  P             A+FL+
Sbjct: 181 HQDACHMGRVIRPESQGVQLACLSRTASSPSPSNDTHFSTAPCPWPNLALPKTKDAMFLS 240

Query: 241 PTDDSIINNNNNNNNNSHHNLDLKLSTSTS----------NEGSSTQLQLSIGSCDFGDK 300
           PT D     +N+  N   HNL+L+L T+++          ++  ST LQLSIGS D GDK
Sbjct: 241 PTKD-----HNSPKNAHSHNLELQLLTTSNPSEVSVSPKKHDNHSTHLQLSIGSSDIGDK 300

Query: 301 KKRVQLLDDEKGGGRGPGGVREEAREELREAMAEKAYAEEARQQAKRQIEMAEEEFVNAK 360
            +   +    K      G   ++A + L+ AM EK YAEEARQ+AKRQIE+AE+EF  AK
Sbjct: 301 VEYSTVTCTHKDASNKSGHHEKQAPQPLKLAMEEKTYAEEARQEAKRQIEIAEQEFAKAK 360

Query: 361 RMRQQAQAELDKATALKQAAIKQINSTLLEITCQACQKQFQAKPTTTTTTTAPADHDTSV 408
           R+RQQAQAEL+KA ALK  AIKQINST+L+ITCQACQ QFQA+       T P   + S+
Sbjct: 361 RIRQQAQAELEKAQALKDHAIKQINSTILQITCQACQHQFQAR-------TPP--EENSL 413

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
IDD15_ARATH   8.3e-107   53.17     Protein SHOOT GRAVITROPISM 5 OS=Arabidopsis thaliana GN=SGR5 PE=1 SV=1
IDD14_ARATH   2.2e-99    52.24     Protein indeterminate-domain 14 OS=Arabidopsis thaliana GN=IDD14 PE=1 SV=1
IDD16_ARATH   1.4e-93    56.37     Protein indeterminate-domain 16 OS=Arabidopsis thaliana GN=IDD16 PE=2 SV=1
IDD3_ARATH    5.5e-58    61.76     Zinc finger protein MAGPIE OS=Arabidopsis thaliana GN=MGP PE=1 SV=1
IDD2_ARATH    5.5e-58    53.59     Protein indeterminate-domain 2 OS=Arabidopsis thaliana GN=IDD2 PE=2 SV=1
Match Name         E-value    Identity  Description
A0A0A0LPV5_CUCSA   1.4e-174   77.28     Uncharacterized protein OS=Cucumis sativus GN=Csa_1G029620 PE=4 SV=1
A0A061GQZ7_THECC   2.3e-148   68.52     C2H2-like zinc finger protein OS=Theobroma cacao GN=TCM_039880 PE=4 SV=1
U5FLX0_POPTR       3.5e-144   66.67     Uncharacterized protein OS=Populus trichocarpa GN=POPTR_0015s05180g PE=4 SV=1
A0A0B0N813_GOSAR   5.9e-144   67.21     Zinc finger MAGPIE-like protein OS=Gossypium arboreum GN=F383_34553 PE=4 SV=1
F6GSS5_VITVI       1.3e-143   67.52     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_17s0000g01870 PE=4 SV=...
Match Name    E-value    Identity  Description
AT2G01940.3   3.1e-104   52.37     C2H2-like zinc finger protein
AT1G68130.1   1.2e-100   52.24     indeterminate(ID)-domain 14
AT1G25250.1   7.8e-95    56.37     indeterminate(ID)-domain 16
AT1G03840.1   3.1e-59    61.76     C2H2 and C2HC zinc fingers superfamily protein
AT3G50700.1   3.1e-59    53.59     indeterminate(ID)-domain 2
Match Name                         E-value    Identity  Description
gi|659066251|ref|XP_008452902.1|   1.6e-174   78.05     PREDICTED: protein SHOOT GRAVITROPISM 5-like [Cucumis melo]
gi|778656670|ref|XP_011649488.1|   2.1e-174   77.28     PREDICTED: protein SHOOT GRAVITROPISM 5 [Cucumis sativus]
gi|590582445|ref|XP_007014626.1|   3.3e-148   68.52     C2H2-like zinc finger protein [Theobroma cacao]
gi|566205919|ref|XP_006374223.1|   5.0e-144   66.67     hypothetical protein POPTR_0015s05180g [Populus trichocarpa]
gi|728827540|gb|KHG07236.1|        8.5e-144   67.21     Zinc finger MAGPIE -like protein [Gossypium arboreum]
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term         Definition
GO:0003676   nucleic acid binding
GO:0046872   metal ion binding
Vocabulary: INTERPRO
Term        Definition
IPR019786   Zinc_finger_PHD-type_CS
IPR015880   Zinc finger, C2H2-like
IPR013087   Znf_C2H2_type
IPR007087   Zinc finger, C2H2
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
biological_process   GO:0006355       regulation of transcription, DNA-templated
biological_process   GO:0009630       gravitropism
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0046872       metal ion binding
molecular_function   GO:0003676       nucleic acid binding
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG20g00080.1   Cp4.1LG20g00080.1   mRNA


Analysis Name: InterPro Annotations of Cucurbita pepo
Date Performed: 2017-12-02
IPR Term    IPR Description                                      Source    Source Term         Source Description                    Alignment
IPR007087   Zinc finger, C2H2                                    PROSITE   PS00028             ZINC_FINGER_C2H2_1                    coord: 62..82, scor
IPR007087   Zinc finger, C2H2                                    PROFILE   PS50157             ZINC_FINGER_C2H2_2                    coord: 60..82, score: 1
IPR013087   Zinc finger C2H2-type/integrase DNA-binding domain   GENE3D    G3DSA:3.30.160.60   -                                     coord: 128..158, score: 7.5E-5; coord: 59..82, score: 1.
IPR015880   Zinc finger, C2H2-like                               SMART     SM00355             c2h2final6                            coord: 101..131, score: 210.0; coord: 60..82, score: 0.0052; coord: 137..164, score:
IPR019786   Zinc finger, PHD-type, conserved site                PROSITE   PS01359             ZF_PHD_1                              coord: 62..142, scor
None        No IPR available                                     unknown   Coil                Coil                                  coord: 298..360, scor
None        No IPR available                                     PANTHER   PTHR10593           SERINE/THREONINE-PROTEIN KINASE RIO   coord: 1..262, score: 1.4E
None        No IPR available                                     PANTHER   PTHR10593:SF43      SHOOT GRAVITROPISM5-LIKE PROTEIN      coord: 1..262, score: 1.4E
None        No IPR available                                     unknown   SSF57667            beta-beta-alpha zinc fingers          coord: 59..82, score: 1.93E-5; coord: 132..157, score: 1.9

The following gene(s) are paralogous to this gene:
Gene              Paralogue         Organism                    Block
Cp4.1LG20g00080   Cp4.1LG02g09930   Cucurbita pepo (Zucchini)   cpecpeB433
The following block(s) are covering this gene:

None