Tan0020478 (gene) Snake gourd v1

Overview
NameTan0020478
Typegene
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionC2H2-like zinc finger protein
LocationLG05: 7583683 .. 7585694 (+)
RNA-Seq ExpressionTan0020478
SyntenyTan0020478
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CCTATGTGTCATCTTCCACATTTCTTCTCTCTCTCTCTCAATCTTCCCTTCTCCTTCCCCAAAACCAAACCCTTTTCTCTTCTCTCCACATTTTTATTTACTTCCTCCATTCCTCCTGAAACCTACTAAATCTACAGAGAGAAAGAGAGAGATCAAACTCTTGCTTTTATATATACATATATATACTTATATTTACTCTTTTGGTAGCTATTTAGAGATTGAGTCAAACTGGGTTGAACTAGGCTGATCCAACCAAGCTTTAGAATATATATGATTGAGAAAATGGCAGATGATGAATTTTCAAACTGTTTTCTCCAAATCCCTCTTGCCGGATCCAACTCTTCTCTCACCAAGAAGAAGAGAAACCTCCCCGGAACCCCAGGTAAATTTCTCTTCTCGGTTTCGCTTCTTCTGATAAATTACTCGATGTAACTGTCACTCATTAGAAAAACTTTCTCCTATTTTGTGACGACAGATCCCGAAGCAGAAGTGATAGCGTTGTCGCCGAAGACACTGATGGCGACAAATAGGTTCCTGTGCGAAATATGCGGGAAGGGGTTTCAAAGAGACCAAAACTTGCAGCTGCATAGGCGAGGGCATAACCTTCCATGGAAGCTGAAGCAAAGAAGCAGCAAAGAGCCAAGGAAGAGAGTGTATGTGTGCCCTGAGAAGAGCTGCGTGCACCATCATCCATCGAGAGCACTGGGAGACCTCACAGGGATCAAGAAGCACTTTTGTAGAAAGCACGGAGAGAAGAAGTGGAAGTGTGAGAAATGCTCAAAGAGATACGCTGTTCAGTCTGATTGGAAAGCACACTCCAAAACTTGTGGCACTAGAGAGTACAAATGTGACTGTGGAACTCTATTTTCGAGGTATTGATTATTCATTATTAAGCTTTCAACTTTAGATTAATTTATGATGAGAATTTGGAAATGGGTATGAATAATATTTATTCCAAATTTGAATTAATTGGAAAAAGTTTGAAACTTTAATGATACAAACAGGAGAGACAGCTTCATCACTCACAGAGCCTTCTGCGATGCATTAGCTGAAGAAACAGCAAGAGTAAACGCAGCCACAACAATAACCGCCGGCGCCAGCAATTTGAATTACAATTTCATGGCTGAATCCCCATTCATGGCCCAACATTTCCCTTCAATTTTCAAGCCAATCTCAATGAACGAAGCAACAATATTATTCAATCATCAACAACAAACAACCAGCTGCAATGGCCTGATCTCGTTGAGAACCAACAACAACAATAATCTACAACCACTTCCTCATATTAATTCAGGCCTAATGTACTGTGACCCTTTTGTTAATTCTTGCCCAACACCAGCACCAGCACCAGCACCAGCACCAGCTGATTATAATAATCTGAATTGGGTATTTGGAACAAAGGTTAATAATTCTGACCATGAAAATCAAGATTCAAGCCAAAATCAGCCATTGATGAGTGGTGGTGTTTCTTCACTGTACAGCCACGAATTACAGCAAATGAATCAAACCCACATGGCGAATATGTCAGCCACTGCTTTGCTGCAAAAAGCTTCTCAAATTGGGCCAAATTCCAGCTCCGGCCCATCCATACTCCAAGAGGGATTTGCATTCAAATGCAGTGGCAGTACAGTTCAAAATGGGAGTGAATTTTCTAATTCTGGAGCTGCAATCCCAATTATGGCGACGGTTGGCAGTTTTGTTGTTGAGAATGAGAATGAGAATGAGATGTACACTGCAAAACGCCGTCGTACTCAGAGTGAATTAGAGGGAAGTGGAAGTGGAACAACAGGAGGAGGGCAGACGAGGGATTTTCTTGGAGTTGGTGCCAACACTATTTGCCACACCTCCTCAATCAATGGATGGATTTAATTAGATGCCAAATTTAATTTCATGTTTTTTTTTTTTTTTTAATTTTTAAATATGTATTTTGTTTTTCTTTACGCAAACAAAAGTTGTTGACAATAATAAAGTGAATCAATCAAAAGAGAGATGGGAGTGAGAATGAGATTCAA

mRNA sequence

CCTATGTGTCATCTTCCACATTTCTTCTCTCTCTCTCTCAATCTTCCCTTCTCCTTCCCCAAAACCAAACCCTTTTCTCTTCTCTCCACATTTTTATTTACTTCCTCCATTCCTCCTGAAACCTACTAAATCTACAGAGAGAAAGAGAGAGATCAAACTCTTGCTTTTATATATACATATATATACTTATATTTACTCTTTTGGTAGCTATTTAGAGATTGAGTCAAACTGGGTTGAACTAGGCTGATCCAACCAAGCTTTAGAATATATATGATTGAGAAAATGGCAGATGATGAATTTTCAAACTGTTTTCTCCAAATCCCTCTTGCCGGATCCAACTCTTCTCTCACCAAGAAGAAGAGAAACCTCCCCGGAACCCCAGATCCCGAAGCAGAAGTGATAGCGTTGTCGCCGAAGACACTGATGGCGACAAATAGGTTCCTGTGCGAAATATGCGGGAAGGGGTTTCAAAGAGACCAAAACTTGCAGCTGCATAGGCGAGGGCATAACCTTCCATGGAAGCTGAAGCAAAGAAGCAGCAAAGAGCCAAGGAAGAGAGTGTATGTGTGCCCTGAGAAGAGCTGCGTGCACCATCATCCATCGAGAGCACTGGGAGACCTCACAGGGATCAAGAAGCACTTTTGTAGAAAGCACGGAGAGAAGAAGTGGAAGTGTGAGAAATGCTCAAAGAGATACGCTGTTCAGTCTGATTGGAAAGCACACTCCAAAACTTGTGGCACTAGAGAGTACAAATGTGACTGTGGAACTCTATTTTCGAGGAGAGACAGCTTCATCACTCACAGAGCCTTCTGCGATGCATTAGCTGAAGAAACAGCAAGAGTAAACGCAGCCACAACAATAACCGCCGGCGCCAGCAATTTGAATTACAATTTCATGGCTGAATCCCCATTCATGGCCCAACATTTCCCTTCAATTTTCAAGCCAATCTCAATGAACGAAGCAACAATATTATTCAATCATCAACAACAAACAACCAGCTGCAATGGCCTGATCTCGTTGAGAACCAACAACAACAATAATCTACAACCACTTCCTCATATTAATTCAGGCCTAATGTACTGTGACCCTTTTGTTAATTCTTGCCCAACACCAGCACCAGCACCAGCACCAGCACCAGCTGATTATAATAATCTGAATTGGGTATTTGGAACAAAGGTTAATAATTCTGACCATGAAAATCAAGATTCAAGCCAAAATCAGCCATTGATGAGTGGTGGTGTTTCTTCACTGTACAGCCACGAATTACAGCAAATGAATCAAACCCACATGGCGAATATGTCAGCCACTGCTTTGCTGCAAAAAGCTTCTCAAATTGGGCCAAATTCCAGCTCCGGCCCATCCATACTCCAAGAGGGATTTGCATTCAAATGCAGTGGCAGTACAGTTCAAAATGGGAGTGAATTTTCTAATTCTGGAGCTGCAATCCCAATTATGGCGACGGTTGGCAGTTTTGTTGTTGAGAATGAGAATGAGAATGAGATGTACACTGCAAAACGCCGTCGTACTCAGAGTGAATTAGAGGGAAGTGGAAGTGGAACAACAGGAGGAGGGCAGACGAGGGATTTTCTTGGAGTTGGTGCCAACACTATTTGCCACACCTCCTCAATCAATGGATGGATTTAATTAGATGCCAAATTTAATTTCATGTTTTTTTTTTTTTTTTAATTTTTAAATATGTATTTTGTTTTTCTTTACGCAAACAAAAGTTGTTGACAATAATAAAGTGAATCAATCAAAAGAGAGATGGGAGTGAGAATGAGATTCAA

Coding sequence (CDS)

ATGATTGAGAAAATGGCAGATGATGAATTTTCAAACTGTTTTCTCCAAATCCCTCTTGCCGGATCCAACTCTTCTCTCACCAAGAAGAAGAGAAACCTCCCCGGAACCCCAGATCCCGAAGCAGAAGTGATAGCGTTGTCGCCGAAGACACTGATGGCGACAAATAGGTTCCTGTGCGAAATATGCGGGAAGGGGTTTCAAAGAGACCAAAACTTGCAGCTGCATAGGCGAGGGCATAACCTTCCATGGAAGCTGAAGCAAAGAAGCAGCAAAGAGCCAAGGAAGAGAGTGTATGTGTGCCCTGAGAAGAGCTGCGTGCACCATCATCCATCGAGAGCACTGGGAGACCTCACAGGGATCAAGAAGCACTTTTGTAGAAAGCACGGAGAGAAGAAGTGGAAGTGTGAGAAATGCTCAAAGAGATACGCTGTTCAGTCTGATTGGAAAGCACACTCCAAAACTTGTGGCACTAGAGAGTACAAATGTGACTGTGGAACTCTATTTTCGAGGAGAGACAGCTTCATCACTCACAGAGCCTTCTGCGATGCATTAGCTGAAGAAACAGCAAGAGTAAACGCAGCCACAACAATAACCGCCGGCGCCAGCAATTTGAATTACAATTTCATGGCTGAATCCCCATTCATGGCCCAACATTTCCCTTCAATTTTCAAGCCAATCTCAATGAACGAAGCAACAATATTATTCAATCATCAACAACAAACAACCAGCTGCAATGGCCTGATCTCGTTGAGAACCAACAACAACAATAATCTACAACCACTTCCTCATATTAATTCAGGCCTAATGTACTGTGACCCTTTTGTTAATTCTTGCCCAACACCAGCACCAGCACCAGCACCAGCACCAGCTGATTATAATAATCTGAATTGGGTATTTGGAACAAAGGTTAATAATTCTGACCATGAAAATCAAGATTCAAGCCAAAATCAGCCATTGATGAGTGGTGGTGTTTCTTCACTGTACAGCCACGAATTACAGCAAATGAATCAAACCCACATGGCGAATATGTCAGCCACTGCTTTGCTGCAAAAAGCTTCTCAAATTGGGCCAAATTCCAGCTCCGGCCCATCCATACTCCAAGAGGGATTTGCATTCAAATGCAGTGGCAGTACAGTTCAAAATGGGAGTGAATTTTCTAATTCTGGAGCTGCAATCCCAATTATGGCGACGGTTGGCAGTTTTGTTGTTGAGAATGAGAATGAGAATGAGATGTACACTGCAAAACGCCGTCGTACTCAGAGTGAATTAGAGGGAAGTGGAAGTGGAACAACAGGAGGAGGGCAGACGAGGGATTTTCTTGGAGTTGGTGCCAACACTATTTGCCACACCTCCTCAATCAATGGATGGATTTAA

Protein sequence

MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTITAGASNLNYNFMAESPFMAQHFPSIFKPISMNEATILFNHQQQTTSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLNWVFGTKVNNSDHENQDSSQNQPLMSGGVSSLYSHELQQMNQTHMANMSATALLQKASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENENEMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI
Homology
BLAST of Tan0020478 vs. ExPASy Swiss-Prot
Match: Q9ZWA6 (Zinc finger protein MAGPIE OS=Arabidopsis thaliana OX=3702 GN=MGP PE=1 SV=1)

HSP 1 Score: 414.5 bits (1064), Expect = 1.6e-114
Identity = 264/500 (52.80%), Postives = 306/500 (61.20%), Query Frame = 0

Query: 23  NSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 82
           N  L KKKRNLPG PDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP
Sbjct: 36  NPPLVKKKRNLPGNPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 95

Query: 83  WKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKRY 142
           WKLKQR+SKE RKRVYVCPEKSCVHHHP+RALGDLTGIKKHFCRKHGEKKWKCEKC+KRY
Sbjct: 96  WKLKQRTSKEVRKRVYVCPEKSCVHHHPTRALGDLTGIKKHFCRKHGEKKWKCEKCAKRY 155

Query: 143 AVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTI----- 202
           AVQSDWKAHSKTCGTREY+CDCGT+FSRRDSFITHRAFCDALAEETAR+NAA+ +     
Sbjct: 156 AVQSDWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDALAEETARLNAASHLKSFAA 215

Query: 203 TAGASNLNYNFMAES----------------PFMAQHFPSIFKPISMNEATILFNHQQQT 262
           TAG SNLNY+++  +                P   QH      PI+ N     F+HQ   
Sbjct: 216 TAG-SNLNYHYLMGTLIPSPSLPQPPSFPFGPPQPQHHHHHQFPITTNN----FDHQDVM 275

Query: 263 TSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLNWVFGT 322
              + L SL +  N N      I   +             AP P     DY   NWVFG 
Sbjct: 276 KPASTL-SLWSGGNINHHQQVTIEDRM-------------APQPHSPQEDY---NWVFGN 335

Query: 323 KVNNSD----------HEN-----QDSSQNQPLMSGGVSSLYSHELQQMNQ------THM 382
             N+ +          H+N     Q         S  V SL+S  + Q+ Q        +
Sbjct: 336 ANNHGELITTSDSLITHDNNINIVQSKENANGATSLSVPSLFS-SVDQITQDANAASVAV 395

Query: 383 ANMSATALLQKASQIGPNSSSGP--------SILQEGFAFKCSGSTVQNGSE--FSNSGA 442
           ANMSATALLQKA+Q+G  SS+ P        S   + FA K +      GS+  F++ G+
Sbjct: 396 ANMSATALLQKAAQMGATSSTSPTTTITTDQSAYLQSFASKSNQIVEDGGSDRFFASFGS 455

Query: 443 -AIPIMAT------------VGSFVVENENENEMYTAKRRRTQSELEGSGSGTTGGGQTR 458
            ++ +M+              G  VV    E + Y  KRRR   ++  +G    GGGQTR
Sbjct: 456 NSVELMSNNNNGLHEIGNPRNGVTVVSGMGELQNYPWKRRRV--DIGNAG----GGGQTR 506

BLAST of Tan0020478 vs. ExPASy Swiss-Prot
Match: Q9FFH3 (Zinc finger protein NUTCRACKER OS=Arabidopsis thaliana OX=3702 GN=NUC PE=1 SV=1)

HSP 1 Score: 392.5 bits (1007), Expect = 6.5e-108
Identity = 245/481 (50.94%), Postives = 281/481 (58.42%), Query Frame = 0

Query: 23  NSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 82
           N  L KKKRNLPG PDPEAEVIALSP TLMATNRFLCE+CGKGFQRDQNLQLHRRGHNLP
Sbjct: 32  NPPLVKKKRNLPGNPDPEAEVIALSPTTLMATNRFLCEVCGKGFQRDQNLQLHRRGHNLP 91

Query: 83  WKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKRY 142
           WKLKQR+SKE RKRVYVCPEK+CVHHH SRALGDLTGIKKHFCRKHGEKKW CEKC+KRY
Sbjct: 92  WKLKQRTSKEVRKRVYVCPEKTCVHHHSSRALGDLTGIKKHFCRKHGEKKWTCEKCAKRY 151

Query: 143 AVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTITA--- 202
           AVQSDWKAHSKTCGTREY+CDCGT+FSRRDSFITHRAFCDALAEETA++NA + +     
Sbjct: 152 AVQSDWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDALAEETAKINAVSHLNGLAA 211

Query: 203 ----GASNLNYNFMAESPFMAQHFPSI--FKPISMNEATILFNHQQQTTSCNGLISLRTN 262
               G+ NLNY ++     M    P +  F P           H Q  TS     SL   
Sbjct: 212 AGAPGSVNLNYQYL-----MGTFIPPLQPFVPQPQTNPNHHHQHFQPPTSS----SLSLW 271

Query: 263 NNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLNWVFGTKV-------NNS 322
              ++ P                        P P P DY   +WVFG          NN+
Sbjct: 272 MGQDIAP------------------------PQPQP-DY---DWVFGNAKAASACIDNNN 331

Query: 323 DH-----ENQDSSQNQPLMSGGVSSLYSHELQQMNQTHMANMSATALLQKASQIGPNS-- 382
            H     +N ++S          S   S + Q  N     NMSATALLQKA++IG  S  
Sbjct: 332 THDEQITQNANASLTTTTTLSAPSLFSSDQPQNANANSNVNMSATALLQKAAEIGATSTT 391

Query: 383 ---SSGPSILQEGFAFKCSGSTV--QNGSEF-----SNSGAAIPIMA------------- 442
              ++ PS   + F  K +  T    +G +F     SN+   +   +             
Sbjct: 392 TAATNDPSTFLQSFPLKSTDQTTSYDSGEKFFALFGSNNNIGLMSRSHDHQEIENARNDV 451

Query: 443 TVGSFVVENENENEMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGW 458
           TV S + E +N    Y  KRRR        G    GGGQTRDFLGVG  T+CH SSINGW
Sbjct: 452 TVASALDELQN----YPWKRRRVD-----GGGEVGGGGQTRDFLGVGVQTLCHPSSINGW 466

BLAST of Tan0020478 vs. ExPASy Swiss-Prot
Match: Q700D2 (Zinc finger protein JACKDAW OS=Arabidopsis thaliana OX=3702 GN=JKD PE=1 SV=1)

HSP 1 Score: 331.3 bits (848), Expect = 1.8e-89
Identity = 214/459 (46.62%), Postives = 264/459 (57.52%), Query Frame = 0

Query: 18  PLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRR 77
           P A  NSS  KKKRN PGTPDP+A+VIALSP TLMATNRF+CEIC KGFQRDQNLQLHRR
Sbjct: 43  PNAKPNSSSAKKKRNQPGTPDPDADVIALSPTTLMATNRFVCEICNKGFQRDQNLQLHRR 102

Query: 78  GHNLPWKLKQRSSKEP-RKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCE 137
           GHNLPWKLKQRS +E  +K+VY+CP K+CVHH  SRALGDLTGIKKH+ RKHGEKKWKCE
Sbjct: 103 GHNLPWKLKQRSKQEVIKKKVYICPIKTCVHHDASRALGDLTGIKKHYSRKHGEKKWKCE 162

Query: 138 KCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATT 197
           KCSK+YAVQSDWKAH+KTCGTREYKCDCGTLFSR+DSFITHRAFCDAL EE AR+++ + 
Sbjct: 163 KCSKKYAVQSDWKAHAKTCGTREYKCDCGTLFSRKDSFITHRAFCDALTEEGARMSSLSN 222

Query: 198 ITAGASNLNYNFMAESPFM------------AQHFPSIFKPISMNEATILFNHQQQTTSC 257
                S  N NF  ES  M              H P I   IS  +  + F H       
Sbjct: 223 NNPVISTTNLNFGNESNVMNNPNLPHGFVHRGVHHPDINAAIS--QFGLGFGHDLSAMHA 282

Query: 258 NGL---ISLRTNNNNNLQP-----LPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLN 317
            GL   + + +  N++L P     LP  +    +  P  ++ P+                
Sbjct: 283 QGLSEMVQMASTGNHHLFPSSSSSLPDFSGHHQFQIPMTSTNPS---------------- 342

Query: 318 WVFGTKVNNSDHENQDSSQNQPLMSGGVSSLYSHELQQMNQTHMANMSATALLQKASQIG 377
                  +++  +   S Q+Q L     S L+S   +      ++ MSATALLQKA+Q+G
Sbjct: 343 --LTLSSSSTSQQTSASLQHQTLKDSSFSPLFSSSSENKQNKPLSPMSATALLQKAAQMG 402

Query: 378 ---PNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSF--VVENENENEM 437
               NSS+ PS     FA     S+    S    S + + I   + +F   V  EN N  
Sbjct: 403 STRSNSSTAPSF----FAGPTMTSSSATASPPPRSSSPMMIQQQLNNFNTNVLRENHNRA 462

Query: 438 YTAKRRRTQSELEGS--GSGTTG------GGQTRDFLGV 443
                  + S ++ +   S  +G       G TRDFLGV
Sbjct: 463 PPPLSGVSTSSVDNNPFQSNRSGLNPAQQMGLTRDFLGV 477

BLAST of Tan0020478 vs. ExPASy Swiss-Prot
Match: Q944L3 (Zinc finger protein BALDIBIS OS=Arabidopsis thaliana OX=3702 GN=BIB PE=1 SV=1)

HSP 1 Score: 320.9 bits (821), Expect = 2.4e-86
Identity = 190/384 (49.48%), Postives = 239/384 (62.24%), Query Frame = 0

Query: 22  SNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNL 81
           ++S+  K+KRNLPG PDP+AEVIALSP +LM TNRF+CE+C KGF+RDQNLQLHRRGHNL
Sbjct: 33  TSSNSAKRKRNLPGNPDPDAEVIALSPNSLMTTNRFICEVCNKGFKRDQNLQLHRRGHNL 92

Query: 82  PWKLKQRSSKEP-RKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSK 141
           PWKLKQR++KE  +K+VY+CPEK+CVHH P+RALGDLTGIKKHF RKHGEKKWKC+KCSK
Sbjct: 93  PWKLKQRTNKEQVKKKVYICPEKTCVHHDPARALGDLTGIKKHFSRKHGEKKWKCDKCSK 152

Query: 142 RYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTITAG 201
           +YAV SDWKAHSK CGT+EY+CDCGTLFSR+DSFITHRAFCDALAEE+AR  +     A 
Sbjct: 153 KYAVMSDWKAHSKICGTKEYRCDCGTLFSRKDSFITHRAFCDALAEESARFVSVPPAPAY 212

Query: 202 ASNLNYNFMAESPFMAQHFPSIFKPISMNEATILFNHQQQ----TTSCNGLISLRTNNNN 261
            +N                      + +N   I  NHQQ+    T+S        TN NN
Sbjct: 213 LNNA-------------------LDVEVNHGNINQNHQQRQLNTTSSQLDQPGFNTNRNN 272

Query: 262 NL---QPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNL---------NWVFGTKVN 321
                Q LP         + F +S    +P+P  A     NL          W+     N
Sbjct: 273 IAFLGQTLP--------TNVFASS---SSPSPRSASDSLQNLWHLQGQSSHQWLLNENNN 332

Query: 322 NSDH-------ENQDSSQNQPLMSGGVSSLYSHELQ------QMNQTHMANMSATALLQK 376
           N+++       +NQ+  + + ++S G  SL+S E +        N   +A+MSATALLQK
Sbjct: 333 NNNNILQRGISKNQEEHEMKNVISNG--SLFSSEARNNTNNYNQNGGQIASMSATALLQK 384

BLAST of Tan0020478 vs. ExPASy Swiss-Prot
Match: Q8GYC1 (Protein indeterminate-domain 4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=IDD4 PE=1 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 5.4e-86
Identity = 205/468 (43.80%), Postives = 270/468 (57.69%), Query Frame = 0

Query: 22  SNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNL 81
           S++   KK+RN PG P+P+AEV+ALSPKTLMATNRF+C++C KGFQR+QNLQLHRRGHNL
Sbjct: 48  SSAPPPKKRRNQPGNPNPDAEVVALSPKTLMATNRFICDVCNKGFQREQNLQLHRRGHNL 107

Query: 82  PWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKR 141
           PWKLKQ+S+KE +++VY+CPE +CVHH PSRALGDLTGIKKH+ RKHGEKKWKCEKCSKR
Sbjct: 108 PWKLKQKSTKEVKRKVYLCPEPTCVHHDPSRALGDLTGIKKHYYRKHGEKKWKCEKCSKR 167

Query: 142 YAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETAR--VNAATTITA 201
           YAVQSDWKAHSKTCGT+EY+CDCGT+FSRRDS+ITHRAFCDAL +ETAR    + T++TA
Sbjct: 168 YAVQSDWKAHSKTCGTKEYRCDCGTIFSRRDSYITHRAFCDALIQETARNPTVSFTSMTA 227

Query: 202 GASNLN----YNFMAESPFMAQHFPSI-----FKPISMNEATILFNHQQ----------- 261
            +S +     Y  +     ++ H  S      F P+      I  +  +           
Sbjct: 228 ASSGVGSGGIYGRLGGGSALSHHHLSDHPNFGFNPLVGYNLNIASSDNRRDFIPQSSNPN 287

Query: 262 ---QTTSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLN 321
              Q+ S  G+++   NNNN      H   GL+  DP                   +N+N
Sbjct: 288 FLIQSASSQGMLNTTPNNNNQSFMNQH---GLIQFDP------------------VDNIN 347

Query: 322 WVFGTKVNNSDHENQDSSQNQPLMSGGVSSLYSHEL----QQMNQTHMANMSATALLQKA 381
            +  +  NNS        +N       + SLYS ++    ++ N    +N+SATALLQKA
Sbjct: 348 -LKSSGTNNSFFNLGFFQENTKNSETSLPSLYSTDVLVHHREENLNAGSNVSATALLQKA 407

Query: 382 SQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENENEMY 441
           +Q+G  +S+ PS L  G A   + S+V   + F             G  ++EN+N   + 
Sbjct: 408 TQMGSVTSNDPSALFRGLASSSNSSSV-IANHFG------------GGRIMENDNNGNL- 467

Query: 442 TAKRRRTQSELEGSGSGTTGGG-----------------QTRDFLGVG 444
              +    S    +G G +GG                   T DFLGVG
Sbjct: 468 ---QGLMNSLAAVNGGGGSGGSIFDVQFGDNGNMSGSDKLTLDFLGVG 476

BLAST of Tan0020478 vs. NCBI nr
Match: XP_022970999.1 (zinc finger protein NUTCRACKER-like [Cucurbita maxima])

HSP 1 Score: 655.6 bits (1690), Expect = 3.1e-184
Identity = 355/465 (76.34%), Postives = 368/465 (79.14%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF N FLQIPLA SN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 57  MMEKMVNDEFPNSFLQIPLARSNPFSPKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 116

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 117 ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 176

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 177 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 236

Query: 181 CDALAEETARVNAATTIT---AGASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTIT   A A+N N NFMA   E PFM    PSIF           
Sbjct: 237 CDALAEETARVNAATTITAAMASAANFNCNFMAGVTEPPFM----PSIF----------- 296

Query: 241 FNHQQQTTSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNN 300
                   SCNGL S RTN NNNL P+P INSGLMYCDP +NSC       AP PADY N
Sbjct: 297 --------SCNGL-SSRTNTNNNLHPIPQINSGLMYCDPLLNSC------HAPPPADY-N 356

Query: 301 LNWVFGTKVNNSDHENQDSSQNQPLMS--GGVSSLYSHELQQMNQTHMANMSATALLQKA 360
           LNWVFGTK         DS QN PLMS  GGVSSLYSH+LQQ+NQTHMANMSATALLQKA
Sbjct: 357 LNWVFGTK-------GHDSVQNPPLMSGGGGVSSLYSHQLQQVNQTHMANMSATALLQKA 416

Query: 361 SQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENENEMY 420
           ++IGPNSSS P   QEGF FKCSG TVQNGSEFSNS   IPIMA          NENEMY
Sbjct: 417 AEIGPNSSSDPPFFQEGFVFKCSGGTVQNGSEFSNSVTQIPIMA----------NENEMY 473

Query: 421 TAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           TAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 477 TAKRRRTQSEFMGSGSGLTGGGQTRDFLGVGANTICHTSSINGWI 473

BLAST of Tan0020478 vs. NCBI nr
Match: KAG6604885.1 (Zinc finger protein NUTCRACKER, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 655.2 bits (1689), Expect = 4.1e-184
Identity = 356/468 (76.07%), Postives = 367/468 (78.42%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF N FLQIPLAGSN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 7   MMEKMVNDEFPNSFLQIPLAGSNPFSAKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 66

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 67  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 126

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 127 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 186

Query: 181 CDALAEETARVNAATTITA---GASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTITA    A N N NFMA   E PFM    PSIF           
Sbjct: 187 CDALAEETARVNAATTITAAMTAAGNFNCNFMAGVTEPPFM----PSIF----------- 246

Query: 241 FNHQQQTTSCNGLISLRTNNNN-NLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYN 300
                   SCNGL S   NNNN NL PLP INSGLMYCDP +NSC       AP PADY 
Sbjct: 247 --------SCNGLSSRTNNNNNSNLHPLPQINSGLMYCDPLLNSC------HAPPPADY- 306

Query: 301 NLNWVFGTKVNNSDHENQDSSQNQPLMS----GGVSSLYSHELQQMNQTHMANMSATALL 360
           NLNWVFGTK NN       S QN PLMS    GGVSSLYSH+LQQ+NQTHMANMSATALL
Sbjct: 307 NLNWVFGTKGNN-------SVQNPPLMSGGGGGGVSSLYSHQLQQVNQTHMANMSATALL 366

Query: 361 QKASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENEN 420
           QKA++IG NSSS P   QEGF FKCSG TVQNGSEFSNS   IPIMA          NEN
Sbjct: 367 QKAAEIGANSSSDPPFFQEGFVFKCSGGTVQNGSEFSNSATQIPIMA----------NEN 426

Query: 421 EMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           EMYTAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 427 EMYTAKRRRTQSEFMGSGSGFTGGGQTRDFLGVGANTICHTSSINGWI 427

BLAST of Tan0020478 vs. NCBI nr
Match: KAG7026968.1 (Zinc finger protein NUTCRACKER [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 654.8 bits (1688), Expect = 5.4e-184
Identity = 356/469 (75.91%), Postives = 367/469 (78.25%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF N FLQIPLAGSN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 7   MMEKMVNDEFPNSFLQIPLAGSNPFSAKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 66

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 67  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 126

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 127 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 186

Query: 181 CDALAEETARVNAATTITA---GASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTITA    A N N NFMA   E PFM    PSIF           
Sbjct: 187 CDALAEETARVNAATTITAAMTAAGNFNCNFMAGVTEPPFM----PSIF----------- 246

Query: 241 FNHQQQTTSCNGLISLRTNNNN--NLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADY 300
                   SCNGL S   NNNN  NL PLP INSGLMYCDP +NSC       AP PADY
Sbjct: 247 --------SCNGLSSRTNNNNNNSNLHPLPQINSGLMYCDPLLNSC------HAPPPADY 306

Query: 301 NNLNWVFGTKVNNSDHENQDSSQNQPLMS----GGVSSLYSHELQQMNQTHMANMSATAL 360
            NLNWVFGTK NN       S QN PLMS    GGVSSLYSH+LQQ+NQTHMANMSATAL
Sbjct: 307 -NLNWVFGTKGNN-------SVQNPPLMSGGGGGGVSSLYSHQLQQVNQTHMANMSATAL 366

Query: 361 LQKASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENE 420
           LQKA++IG NSSS P   QEGF FKCSG TVQNGSEFSNS   IPIMA          NE
Sbjct: 367 LQKAAEIGANSSSDPPFFQEGFVFKCSGGTVQNGSEFSNSATQIPIMA----------NE 426

Query: 421 NEMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           NEMYTAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 427 NEMYTAKRRRTQSEFMGSGSGFTGGGQTRDFLGVGANTICHTSSINGWI 428

BLAST of Tan0020478 vs. NCBI nr
Match: XP_023534067.1 (zinc finger protein NUTCRACKER-like [Cucurbita pepo subsp. pepo])

HSP 1 Score: 650.6 bits (1677), Expect = 1.0e-182
Identity = 354/467 (75.80%), Postives = 366/467 (78.37%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF+N FLQIPLAGSN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 7   MMEKMVNDEFTNSFLQIPLAGSNPFSAKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 66

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 67  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 126

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 127 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 186

Query: 181 CDALAEETARVNAATTITA---GASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTITA    A N N NFMA   E PFM    PSIF           
Sbjct: 187 CDALAEETARVNAATTITAAMTAAGNFNCNFMAGITEPPFM----PSIF----------- 246

Query: 241 FNHQQQTTSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNN 300
                   SCNGL S    NNNNL PLP INSGLMYCDP +NSC       AP PADY N
Sbjct: 247 --------SCNGLSS--RTNNNNLHPLPQINSGLMYCDPLLNSC------HAPPPADY-N 306

Query: 301 LNWVFGTKVNNSDHENQDSSQNQPLMS----GGVSSLYSHELQQMNQTHMANMSATALLQ 360
           LNWVFGTK NN       S QN PLMS    GGVSSLYSH+LQQ+NQTHMANMSATALLQ
Sbjct: 307 LNWVFGTKGNN-------SVQNPPLMSGGGGGGVSSLYSHQLQQVNQTHMANMSATALLQ 366

Query: 361 KASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENENE 420
           KA++IG NSSS P   QEGF FKCSG TVQNGSEFS S   IPIMA          NENE
Sbjct: 367 KAAEIGANSSSDPPFFQEGFVFKCSGGTVQNGSEFSTSVTQIPIMA----------NENE 424

Query: 421 MYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           MYTAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 427 MYTAKRRRTQSEFMGSGSGFTGGGQTRDFLGVGANTICHTSSINGWI 424

BLAST of Tan0020478 vs. NCBI nr
Match: XP_022947082.1 (zinc finger protein NUTCRACKER-like [Cucurbita moschata])

HSP 1 Score: 647.5 bits (1669), Expect = 8.6e-182
Identity = 353/468 (75.43%), Postives = 365/468 (77.99%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF N FLQIPLA SN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 7   MMEKMVNDEFPNSFLQIPLARSNPFSAKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 66

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 67  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 126

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 127 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 186

Query: 181 CDALAEETARVNAATTITA---GASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTITA    A N N NFMA   E PFM    PSIF           
Sbjct: 187 CDALAEETARVNAATTITAAMTAAGNFNCNFMAGITEPPFM----PSIF----------- 246

Query: 241 FNHQQQTTSCNGLISLRTNNNN-NLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYN 300
                   SCNGL S   NNNN NL PLP INSGLMYCDP +NSC       AP PADY 
Sbjct: 247 --------SCNGLSSRTNNNNNSNLHPLPQINSGLMYCDPLLNSC------HAPPPADY- 306

Query: 301 NLNWVFGTKVNNSDHENQDSSQNQPLMS----GGVSSLYSHELQQMNQTHMANMSATALL 360
           NLNWVFGTK NN       S QN PLMS    GGVSSLYSH+LQQ+NQTHMANMSATALL
Sbjct: 307 NLNWVFGTKGNN-------SVQNPPLMSGSGGGGVSSLYSHQLQQVNQTHMANMSATALL 366

Query: 361 QKASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENEN 420
           QKA++IG NSSS P   QEGF  KCSG TVQNG+EFSNS   IPIMA          NEN
Sbjct: 367 QKAAEIGANSSSDPPFFQEGFVLKCSGGTVQNGNEFSNSVTQIPIMA----------NEN 426

Query: 421 EMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           EMYTAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 427 EMYTAKRRRTQSEFMGSGSGFTGGGQTRDFLGVGANTICHTSSINGWI 427

BLAST of Tan0020478 vs. ExPASy TrEMBL
Match: A0A6J1I7B4 (zinc finger protein NUTCRACKER-like OS=Cucurbita maxima OX=3661 GN=LOC111469801 PE=4 SV=1)

HSP 1 Score: 655.6 bits (1690), Expect = 1.5e-184
Identity = 355/465 (76.34%), Postives = 368/465 (79.14%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF N FLQIPLA SN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 57  MMEKMVNDEFPNSFLQIPLARSNPFSPKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 116

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 117 ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 176

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 177 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 236

Query: 181 CDALAEETARVNAATTIT---AGASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTIT   A A+N N NFMA   E PFM    PSIF           
Sbjct: 237 CDALAEETARVNAATTITAAMASAANFNCNFMAGVTEPPFM----PSIF----------- 296

Query: 241 FNHQQQTTSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNN 300
                   SCNGL S RTN NNNL P+P INSGLMYCDP +NSC       AP PADY N
Sbjct: 297 --------SCNGL-SSRTNTNNNLHPIPQINSGLMYCDPLLNSC------HAPPPADY-N 356

Query: 301 LNWVFGTKVNNSDHENQDSSQNQPLMS--GGVSSLYSHELQQMNQTHMANMSATALLQKA 360
           LNWVFGTK         DS QN PLMS  GGVSSLYSH+LQQ+NQTHMANMSATALLQKA
Sbjct: 357 LNWVFGTK-------GHDSVQNPPLMSGGGGVSSLYSHQLQQVNQTHMANMSATALLQKA 416

Query: 361 SQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENENEMY 420
           ++IGPNSSS P   QEGF FKCSG TVQNGSEFSNS   IPIMA          NENEMY
Sbjct: 417 AEIGPNSSSDPPFFQEGFVFKCSGGTVQNGSEFSNSVTQIPIMA----------NENEMY 473

Query: 421 TAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           TAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 477 TAKRRRTQSEFMGSGSGLTGGGQTRDFLGVGANTICHTSSINGWI 473

BLAST of Tan0020478 vs. ExPASy TrEMBL
Match: A0A6J1G5G1 (zinc finger protein NUTCRACKER-like OS=Cucurbita moschata OX=3662 GN=LOC111451062 PE=4 SV=1)

HSP 1 Score: 647.5 bits (1669), Expect = 4.1e-182
Identity = 353/468 (75.43%), Postives = 365/468 (77.99%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM +DEF N FLQIPLA SN    KKKRN PGTPDP+AEVIALSPKTLMA NRF+CE
Sbjct: 7   MMEKMVNDEFPNSFLQIPLARSNPFSAKKKRNHPGTPDPDAEVIALSPKTLMAMNRFVCE 66

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE RKRVYVCPE SCVHHHPSRALGDLTGI
Sbjct: 67  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEVRKRVYVCPETSCVHHHPSRALGDLTGI 126

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSD KAHSKTCGT+EYKC CGTLFSRRDSFITHRAF
Sbjct: 127 KKHFCRKHGEKKWKCEKCSKRYAVQSDCKAHSKTCGTKEYKCGCGTLFSRRDSFITHRAF 186

Query: 181 CDALAEETARVNAATTITA---GASNLNYNFMA---ESPFMAQHFPSIFKPISMNEATIL 240
           CDALAEETARVNAATTITA    A N N NFMA   E PFM    PSIF           
Sbjct: 187 CDALAEETARVNAATTITAAMTAAGNFNCNFMAGITEPPFM----PSIF----------- 246

Query: 241 FNHQQQTTSCNGLISLRTNNNN-NLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYN 300
                   SCNGL S   NNNN NL PLP INSGLMYCDP +NSC       AP PADY 
Sbjct: 247 --------SCNGLSSRTNNNNNSNLHPLPQINSGLMYCDPLLNSC------HAPPPADY- 306

Query: 301 NLNWVFGTKVNNSDHENQDSSQNQPLMS----GGVSSLYSHELQQMNQTHMANMSATALL 360
           NLNWVFGTK NN       S QN PLMS    GGVSSLYSH+LQQ+NQTHMANMSATALL
Sbjct: 307 NLNWVFGTKGNN-------SVQNPPLMSGSGGGGVSSLYSHQLQQVNQTHMANMSATALL 366

Query: 361 QKASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSFVVENENEN 420
           QKA++IG NSSS P   QEGF  KCSG TVQNG+EFSNS   IPIMA          NEN
Sbjct: 367 QKAAEIGANSSSDPPFFQEGFVLKCSGGTVQNGNEFSNSVTQIPIMA----------NEN 426

Query: 421 EMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGWI 458
           EMYTAKRRRTQSE  GSGSG TGGGQTRDFLGVGANTICHTSSINGWI
Sbjct: 427 EMYTAKRRRTQSEFMGSGSGFTGGGQTRDFLGVGANTICHTSSINGWI 427

BLAST of Tan0020478 vs. ExPASy TrEMBL
Match: A0A6J1CGM4 (zinc finger protein NUTCRACKER OS=Momordica charantia OX=3673 GN=LOC111011056 PE=4 SV=1)

HSP 1 Score: 536.2 bits (1380), Expect = 1.3e-148
Identity = 317/490 (64.69%), Postives = 352/490 (71.84%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+EKM D+EFSNCFLQIP AGSN SL KKKRNLPG PDPEAEV+ALSPKTLMATNRF+CE
Sbjct: 1   MMEKMDDEEFSNCFLQIPAAGSNPSLAKKKRNLPGNPDPEAEVVALSPKTLMATNRFVCE 60

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI
Sbjct: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKW+C+KCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF
Sbjct: 121 KKHFCRKHGEKKWRCDKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180

Query: 181 CDALAEETARVNAATTITAGASNLNYNFM------AESPFMAQ---HFP--SIFKP-ISM 240
           CDALAEETARVN A T  + A+N N+N+         + FM Q   +FP  SIFKP IS 
Sbjct: 181 CDALAEETARVNYAATSISNAANNNFNYQFMGGGTESALFMPQPQHYFPAASIFKPIISG 240

Query: 241 NEATILFNHQQQTTSCNGLISLRTNNNNNL-QPLPHINSGLMYCDPFVNSCPTPAPAPAP 300
           NE               G  SLR  NN NL   +   +SGL+YC        T    P  
Sbjct: 241 NE---------------GRTSLRNANNLNLGGSMVSSSSGLIYC--------TDPVLPPA 300

Query: 301 APADYNNLNWVFGTKV-------NNSDHENQDSSQNQPLM---SGGVSSLYSHE-LQQMN 360
           A  +Y+NLNWVFG K        N+ D + +D  +N+ L+    GG SS+Y H+ LQQ+N
Sbjct: 301 AGDNYSNLNWVFGAKPFSQNNNNNHGDEDGEDLDENEKLVRATGGGASSMYRHQLLQQLN 360

Query: 361 QTHMANMSATALLQKASQIGPNS--SSGPSI--LQEGFAFKCS--GSTVQNGSEFSNSGA 420
            + MANMSATALLQKA+QIG  S  +S PS+  LQEG+ FKCS    T   G  FS +G 
Sbjct: 361 HSQMANMSATALLQKAAQIGATSTTTSDPSLFQLQEGYVFKCSSGSGTAAAGCGFS-AGT 420

Query: 421 AIPIMATVGSFVVENENENEMYTAKRRRT-QSELE-GSGSGTTGGGQTRDFLGVGAN-TI 458
             PIM       VE E    MY AKRRRT Q E E G+G+GT  GGQTRDFLGVGA+ TI
Sbjct: 421 VNPIME------VELEG---MYPAKRRRTDQCEQEPGTGTGTGTGGQTRDFLGVGAHTTI 457

BLAST of Tan0020478 vs. ExPASy TrEMBL
Match: A0A7N2MD98 (C2H2-type domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1)

HSP 1 Score: 514.6 bits (1324), Expect = 4.2e-142
Identity = 296/482 (61.41%), Postives = 344/482 (71.37%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           M+E MA++   N F++ P+AGSN   +KKKRN PG PDPEAEVIALSPKTLMATNRFLCE
Sbjct: 1   MLENMAEEALPNGFVENPIAGSNPPASKKKRNQPGNPDPEAEVIALSPKTLMATNRFLCE 60

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQR+SKE RKRVYVCPEKSCVHHH SRALGDLTGI
Sbjct: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRTSKEIRKRVYVCPEKSCVHHHASRALGDLTGI 120

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKC+KRYAVQSDWKAHSKTCGTREYKCDCGT+FSRRDSFITHRAF
Sbjct: 121 KKHFCRKHGEKKWKCEKCAKRYAVQSDWKAHSKTCGTREYKCDCGTIFSRRDSFITHRAF 180

Query: 181 CDALAEETARVNAATTIT-AGASNLNYNFMAE--SPFMAQHFPSIFKPISMNEATILFNH 240
           CDALAEETAR+NAA+ ++ A A+N+NY+FM     P MAQHF SIFKPIS NE  I    
Sbjct: 181 CDALAEETARLNAASNMSNAVANNINYHFMPTPLGPSMAQHFSSIFKPISSNEEAI---- 240

Query: 241 QQQTT----------SCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAP 300
             QTT          S  G  S+  NN + +  L  ++SG ++ DP V SC        P
Sbjct: 241 -NQTTRGLSLWMGQGSSQGQESIGNNNLHGIHQLGSVSSGTIFGDPLV-SCSN------P 300

Query: 301 APADYNNLNWVFGTKVNNSDHENQDSSQNQPLMSG--------GVSSLYSHELQQMNQTH 360
            P+DY  LNWVFGTK+++S+ E    S + PL            V SLYS +  Q +QT 
Sbjct: 301 PPSDY-QLNWVFGTKLSSSNAEELTVSNSLPLTDVKEAGTQLLSVPSLYSTQ-HQSHQTP 360

Query: 361 MANMSATALLQKASQIGPNSSSGPSILQEGFAFKCSGSTVQNGSEFSN--SGAAIPIMAT 420
            ANMSATALLQKA+QIG  +++ PS L   F  KCS S V++G++      G + PI++T
Sbjct: 361 SANMSATALLQKAAQIGV-TTTDPSFL-GSFGLKCSDSQVRDGNKLCGVLYGTSNPILST 420

Query: 421 VGSFVVENENEN--EMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSING 458
                VEN   +  +++ AKRR T +E  G      GGGQTRDFLGVG  TICH SSING
Sbjct: 421 NIGNDVENSAGDLLQVHPAKRRHTMNEESG------GGGQTRDFLGVGVQTICHPSSING 460

BLAST of Tan0020478 vs. ExPASy TrEMBL
Match: A0A0A0LQA1 (C2H2-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G409480 PE=4 SV=1)

HSP 1 Score: 514.2 bits (1323), Expect = 5.5e-142
Identity = 301/479 (62.84%), Postives = 332/479 (69.31%), Query Frame = 0

Query: 1   MIEKMADDEFSNCFLQIPLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCE 60
           MIEKMADDEFSNCFLQIPL GSN SL KKKRNLPGTPDPEAEVIALSPKTL+ATNRF+CE
Sbjct: 1   MIEKMADDEFSNCFLQIPLTGSNPSLLKKKRNLPGTPDPEAEVIALSPKTLLATNRFICE 60

Query: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGI 120
           ICGKGFQRDQNLQLHRRGHNLPWKLKQRS+KE +KRVYVCPEKSCVHHHPSRALGDLTGI
Sbjct: 61  ICGKGFQRDQNLQLHRRGHNLPWKLKQRSNKEAKKRVYVCPEKSCVHHHPSRALGDLTGI 120

Query: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180
           KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF
Sbjct: 121 KKHFCRKHGEKKWKCEKCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAF 180

Query: 181 CDALAEETARVNAATTITAGASNLNYNFM--------AESPFMAQHFPSIFKPISMNEAT 240
           CDALAEETARV A TTI    SNLNYN M            FM QHF S  KP++M    
Sbjct: 181 CDALAEETARVKAGTTI----SNLNYNLMGGWRDHDETAGIFMTQHFGSSMKPVTM---- 240

Query: 241 ILFNHQQQTTSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADY 300
                +  + S   +  +  NN          + G MY +                    
Sbjct: 241 -----KMSSNSVQMIGGMMMNN----------SGGGMYGE-------------------- 300

Query: 301 NNLNWVFGTKVNNSDHENQDSSQNQPLM---SGGVSSLYSHELQQMNQTHMANMSATALL 360
              + V+G +V      N   ++NQ LM    G V SLYSHE QQ+N+T M NMSATALL
Sbjct: 301 ---DSVWGNQVQMG---NYYYNENQGLMVNNGGRVCSLYSHEFQQVNETQMGNMSATALL 360

Query: 361 QKASQIGPNSSSGPSILQEGFAFKCS-------GSTVQNGSEFSNSGAAIPIMATVGSFV 420
           QKA++IG  SS+  + +    A   S       G    NGSEF N+    PI+      V
Sbjct: 361 QKAAEIGATSSASSNTVTRSAAPSLSLLQIQQQGFLFNNGSEFCNTNNN-PIV------V 420

Query: 421 VENENENEMYTAKRRRTQSELE-GSGSGT--TGGGQTRDFLGVGANTICHTS-SINGWI 458
           VEN N +EMYTAKRRR+QSE E G+G+GT  TG G+TRDFLGVGA TICH S SINGWI
Sbjct: 421 VEN-NGSEMYTAKRRRSQSEFECGNGNGTTGTGTGETRDFLGVGAKTICHASTSINGWI 422

BLAST of Tan0020478 vs. TAIR 10
Match: AT1G03840.1 (C2H2 and C2HC zinc fingers superfamily protein )

HSP 1 Score: 414.5 bits (1064), Expect = 1.1e-115
Identity = 264/500 (52.80%), Postives = 306/500 (61.20%), Query Frame = 0

Query: 23  NSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 82
           N  L KKKRNLPG PDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP
Sbjct: 36  NPPLVKKKRNLPGNPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 95

Query: 83  WKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKRY 142
           WKLKQR+SKE RKRVYVCPEKSCVHHHP+RALGDLTGIKKHFCRKHGEKKWKCEKC+KRY
Sbjct: 96  WKLKQRTSKEVRKRVYVCPEKSCVHHHPTRALGDLTGIKKHFCRKHGEKKWKCEKCAKRY 155

Query: 143 AVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTI----- 202
           AVQSDWKAHSKTCGTREY+CDCGT+FSRRDSFITHRAFCDALAEETAR+NAA+ +     
Sbjct: 156 AVQSDWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDALAEETARLNAASHLKSFAA 215

Query: 203 TAGASNLNYNFMAES----------------PFMAQHFPSIFKPISMNEATILFNHQQQT 262
           TAG SNLNY+++  +                P   QH      PI+ N     F+HQ   
Sbjct: 216 TAG-SNLNYHYLMGTLIPSPSLPQPPSFPFGPPQPQHHHHHQFPITTNN----FDHQDVM 275

Query: 263 TSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLNWVFGT 322
              + L SL +  N N      I   +             AP P     DY   NWVFG 
Sbjct: 276 KPASTL-SLWSGGNINHHQQVTIEDRM-------------APQPHSPQEDY---NWVFGN 335

Query: 323 KVNNSD----------HEN-----QDSSQNQPLMSGGVSSLYSHELQQMNQ------THM 382
             N+ +          H+N     Q         S  V SL+S  + Q+ Q        +
Sbjct: 336 ANNHGELITTSDSLITHDNNINIVQSKENANGATSLSVPSLFS-SVDQITQDANAASVAV 395

Query: 383 ANMSATALLQKASQIGPNSSSGP--------SILQEGFAFKCSGSTVQNGSE--FSNSGA 442
           ANMSATALLQKA+Q+G  SS+ P        S   + FA K +      GS+  F++ G+
Sbjct: 396 ANMSATALLQKAAQMGATSSTSPTTTITTDQSAYLQSFASKSNQIVEDGGSDRFFASFGS 455

Query: 443 -AIPIMAT------------VGSFVVENENENEMYTAKRRRTQSELEGSGSGTTGGGQTR 458
            ++ +M+              G  VV    E + Y  KRRR   ++  +G    GGGQTR
Sbjct: 456 NSVELMSNNNNGLHEIGNPRNGVTVVSGMGELQNYPWKRRRV--DIGNAG----GGGQTR 506

BLAST of Tan0020478 vs. TAIR 10
Match: AT1G03840.2 (C2H2 and C2HC zinc fingers superfamily protein )

HSP 1 Score: 405.2 bits (1040), Expect = 6.9e-113
Identity = 262/500 (52.40%), Postives = 305/500 (61.00%), Query Frame = 0

Query: 23  NSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 82
           N  L KKKRNLPG  +PEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP
Sbjct: 36  NPPLVKKKRNLPG--NPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 95

Query: 83  WKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKRY 142
           WKLKQR+SKE RKRVYVCPEKSCVHHHP+RALGDLTGIKKHFCRKHGEKKWKCEKC+KRY
Sbjct: 96  WKLKQRTSKEVRKRVYVCPEKSCVHHHPTRALGDLTGIKKHFCRKHGEKKWKCEKCAKRY 155

Query: 143 AVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTI----- 202
           AVQSDWKAHSKTCGTREY+CDCGT+FSRRDSFITHRAFCDALAEETAR+NAA+ +     
Sbjct: 156 AVQSDWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDALAEETARLNAASHLKSFAA 215

Query: 203 TAGASNLNYNFMAES----------------PFMAQHFPSIFKPISMNEATILFNHQQQT 262
           TAG SNLNY+++  +                P   QH      PI+ N     F+HQ   
Sbjct: 216 TAG-SNLNYHYLMGTLIPSPSLPQPPSFPFGPPQPQHHHHHQFPITTNN----FDHQDVM 275

Query: 263 TSCNGLISLRTNNNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLNWVFGT 322
              + L SL +  N N      I   +             AP P     DY   NWVFG 
Sbjct: 276 KPASTL-SLWSGGNINHHQQVTIEDRM-------------APQPHSPQEDY---NWVFGN 335

Query: 323 KVNNSD----------HEN-----QDSSQNQPLMSGGVSSLYSHELQQMNQ------THM 382
             N+ +          H+N     Q         S  V SL+S  + Q+ Q        +
Sbjct: 336 ANNHGELITTSDSLITHDNNINIVQSKENANGATSLSVPSLFS-SVDQITQDANAASVAV 395

Query: 383 ANMSATALLQKASQIGPNSSSGP--------SILQEGFAFKCSGSTVQNGSE--FSNSGA 442
           ANMSATALLQKA+Q+G  SS+ P        S   + FA K +      GS+  F++ G+
Sbjct: 396 ANMSATALLQKAAQMGATSSTSPTTTITTDQSAYLQSFASKSNQIVEDGGSDRFFASFGS 455

Query: 443 -AIPIMAT------------VGSFVVENENENEMYTAKRRRTQSELEGSGSGTTGGGQTR 458
            ++ +M+              G  VV    E + Y  KRRR   ++  +G    GGGQTR
Sbjct: 456 NSVELMSNNNNGLHEIGNPRNGVTVVSGMGELQNYPWKRRRV--DIGNAG----GGGQTR 504

BLAST of Tan0020478 vs. TAIR 10
Match: AT5G44160.1 (C2H2-like zinc finger protein )

HSP 1 Score: 392.5 bits (1007), Expect = 4.6e-109
Identity = 245/481 (50.94%), Postives = 281/481 (58.42%), Query Frame = 0

Query: 23  NSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNLP 82
           N  L KKKRNLPG PDPEAEVIALSP TLMATNRFLCE+CGKGFQRDQNLQLHRRGHNLP
Sbjct: 32  NPPLVKKKRNLPGNPDPEAEVIALSPTTLMATNRFLCEVCGKGFQRDQNLQLHRRGHNLP 91

Query: 83  WKLKQRSSKEPRKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSKRY 142
           WKLKQR+SKE RKRVYVCPEK+CVHHH SRALGDLTGIKKHFCRKHGEKKW CEKC+KRY
Sbjct: 92  WKLKQRTSKEVRKRVYVCPEKTCVHHHSSRALGDLTGIKKHFCRKHGEKKWTCEKCAKRY 151

Query: 143 AVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTITA--- 202
           AVQSDWKAHSKTCGTREY+CDCGT+FSRRDSFITHRAFCDALAEETA++NA + +     
Sbjct: 152 AVQSDWKAHSKTCGTREYRCDCGTIFSRRDSFITHRAFCDALAEETAKINAVSHLNGLAA 211

Query: 203 ----GASNLNYNFMAESPFMAQHFPSI--FKPISMNEATILFNHQQQTTSCNGLISLRTN 262
               G+ NLNY ++     M    P +  F P           H Q  TS     SL   
Sbjct: 212 AGAPGSVNLNYQYL-----MGTFIPPLQPFVPQPQTNPNHHHQHFQPPTSS----SLSLW 271

Query: 263 NNNNLQPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLNWVFGTKV-------NNS 322
              ++ P                        P P P DY   +WVFG          NN+
Sbjct: 272 MGQDIAP------------------------PQPQP-DY---DWVFGNAKAASACIDNNN 331

Query: 323 DH-----ENQDSSQNQPLMSGGVSSLYSHELQQMNQTHMANMSATALLQKASQIGPNS-- 382
            H     +N ++S          S   S + Q  N     NMSATALLQKA++IG  S  
Sbjct: 332 THDEQITQNANASLTTTTTLSAPSLFSSDQPQNANANSNVNMSATALLQKAAEIGATSTT 391

Query: 383 ---SSGPSILQEGFAFKCSGSTV--QNGSEF-----SNSGAAIPIMA------------- 442
              ++ PS   + F  K +  T    +G +F     SN+   +   +             
Sbjct: 392 TAATNDPSTFLQSFPLKSTDQTTSYDSGEKFFALFGSNNNIGLMSRSHDHQEIENARNDV 451

Query: 443 TVGSFVVENENENEMYTAKRRRTQSELEGSGSGTTGGGQTRDFLGVGANTICHTSSINGW 458
           TV S + E +N    Y  KRRR        G    GGGQTRDFLGVG  T+CH SSINGW
Sbjct: 452 TVASALDELQN----YPWKRRRVD-----GGGEVGGGGQTRDFLGVGVQTLCHPSSINGW 466

BLAST of Tan0020478 vs. TAIR 10
Match: AT5G03150.1 (C2H2-like zinc finger protein )

HSP 1 Score: 331.3 bits (848), Expect = 1.3e-90
Identity = 214/459 (46.62%), Postives = 264/459 (57.52%), Query Frame = 0

Query: 18  PLAGSNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRR 77
           P A  NSS  KKKRN PGTPDP+A+VIALSP TLMATNRF+CEIC KGFQRDQNLQLHRR
Sbjct: 43  PNAKPNSSSAKKKRNQPGTPDPDADVIALSPTTLMATNRFVCEICNKGFQRDQNLQLHRR 102

Query: 78  GHNLPWKLKQRSSKEP-RKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCE 137
           GHNLPWKLKQRS +E  +K+VY+CP K+CVHH  SRALGDLTGIKKH+ RKHGEKKWKCE
Sbjct: 103 GHNLPWKLKQRSKQEVIKKKVYICPIKTCVHHDASRALGDLTGIKKHYSRKHGEKKWKCE 162

Query: 138 KCSKRYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATT 197
           KCSK+YAVQSDWKAH+KTCGTREYKCDCGTLFSR+DSFITHRAFCDAL EE AR+++ + 
Sbjct: 163 KCSKKYAVQSDWKAHAKTCGTREYKCDCGTLFSRKDSFITHRAFCDALTEEGARMSSLSN 222

Query: 198 ITAGASNLNYNFMAESPFM------------AQHFPSIFKPISMNEATILFNHQQQTTSC 257
                S  N NF  ES  M              H P I   IS  +  + F H       
Sbjct: 223 NNPVISTTNLNFGNESNVMNNPNLPHGFVHRGVHHPDINAAIS--QFGLGFGHDLSAMHA 282

Query: 258 NGL---ISLRTNNNNNLQP-----LPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNLN 317
            GL   + + +  N++L P     LP  +    +  P  ++ P+                
Sbjct: 283 QGLSEMVQMASTGNHHLFPSSSSSLPDFSGHHQFQIPMTSTNPS---------------- 342

Query: 318 WVFGTKVNNSDHENQDSSQNQPLMSGGVSSLYSHELQQMNQTHMANMSATALLQKASQIG 377
                  +++  +   S Q+Q L     S L+S   +      ++ MSATALLQKA+Q+G
Sbjct: 343 --LTLSSSSTSQQTSASLQHQTLKDSSFSPLFSSSSENKQNKPLSPMSATALLQKAAQMG 402

Query: 378 ---PNSSSGPSILQEGFAFKCSGSTVQNGSEFSNSGAAIPIMATVGSF--VVENENENEM 437
               NSS+ PS     FA     S+    S    S + + I   + +F   V  EN N  
Sbjct: 403 STRSNSSTAPSF----FAGPTMTSSSATASPPPRSSSPMMIQQQLNNFNTNVLRENHNRA 462

Query: 438 YTAKRRRTQSELEGS--GSGTTG------GGQTRDFLGV 443
                  + S ++ +   S  +G       G TRDFLGV
Sbjct: 463 PPPLSGVSTSSVDNNPFQSNRSGLNPAQQMGLTRDFLGV 477

BLAST of Tan0020478 vs. TAIR 10
Match: AT3G45260.1 (C2H2-like zinc finger protein )

HSP 1 Score: 320.9 bits (821), Expect = 1.7e-87
Identity = 190/384 (49.48%), Postives = 239/384 (62.24%), Query Frame = 0

Query: 22  SNSSLTKKKRNLPGTPDPEAEVIALSPKTLMATNRFLCEICGKGFQRDQNLQLHRRGHNL 81
           ++S+  K+KRNLPG PDP+AEVIALSP +LM TNRF+CE+C KGF+RDQNLQLHRRGHNL
Sbjct: 33  TSSNSAKRKRNLPGNPDPDAEVIALSPNSLMTTNRFICEVCNKGFKRDQNLQLHRRGHNL 92

Query: 82  PWKLKQRSSKEP-RKRVYVCPEKSCVHHHPSRALGDLTGIKKHFCRKHGEKKWKCEKCSK 141
           PWKLKQR++KE  +K+VY+CPEK+CVHH P+RALGDLTGIKKHF RKHGEKKWKC+KCSK
Sbjct: 93  PWKLKQRTNKEQVKKKVYICPEKTCVHHDPARALGDLTGIKKHFSRKHGEKKWKCDKCSK 152

Query: 142 RYAVQSDWKAHSKTCGTREYKCDCGTLFSRRDSFITHRAFCDALAEETARVNAATTITAG 201
           +YAV SDWKAHSK CGT+EY+CDCGTLFSR+DSFITHRAFCDALAEE+AR  +     A 
Sbjct: 153 KYAVMSDWKAHSKICGTKEYRCDCGTLFSRKDSFITHRAFCDALAEESARFVSVPPAPAY 212

Query: 202 ASNLNYNFMAESPFMAQHFPSIFKPISMNEATILFNHQQQ----TTSCNGLISLRTNNNN 261
            +N                      + +N   I  NHQQ+    T+S        TN NN
Sbjct: 213 LNNA-------------------LDVEVNHGNINQNHQQRQLNTTSSQLDQPGFNTNRNN 272

Query: 262 NL---QPLPHINSGLMYCDPFVNSCPTPAPAPAPAPADYNNL---------NWVFGTKVN 321
                Q LP         + F +S    +P+P  A     NL          W+     N
Sbjct: 273 IAFLGQTLP--------TNVFASS---SSPSPRSASDSLQNLWHLQGQSSHQWLLNENNN 332

Query: 322 NSDH-------ENQDSSQNQPLMSGGVSSLYSHELQ------QMNQTHMANMSATALLQK 376
           N+++       +NQ+  + + ++S G  SL+S E +        N   +A+MSATALLQK
Sbjct: 333 NNNNILQRGISKNQEEHEMKNVISNG--SLFSSEARNNTNNYNQNGGQIASMSATALLQK 384

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q9ZWA61.6e-11452.80Zinc finger protein MAGPIE OS=Arabidopsis thaliana OX=3702 GN=MGP PE=1 SV=1[more]
Q9FFH36.5e-10850.94Zinc finger protein NUTCRACKER OS=Arabidopsis thaliana OX=3702 GN=NUC PE=1 SV=1[more]
Q700D21.8e-8946.62Zinc finger protein JACKDAW OS=Arabidopsis thaliana OX=3702 GN=JKD PE=1 SV=1[more]
Q944L32.4e-8649.48Zinc finger protein BALDIBIS OS=Arabidopsis thaliana OX=3702 GN=BIB PE=1 SV=1[more]
Q8GYC15.4e-8643.80Protein indeterminate-domain 4, chloroplastic OS=Arabidopsis thaliana OX=3702 GN... [more]
Match NameE-valueIdentityDescription
XP_022970999.13.1e-18476.34zinc finger protein NUTCRACKER-like [Cucurbita maxima][more]
KAG6604885.14.1e-18476.07Zinc finger protein NUTCRACKER, partial [Cucurbita argyrosperma subsp. sororia][more]
KAG7026968.15.4e-18475.91Zinc finger protein NUTCRACKER [Cucurbita argyrosperma subsp. argyrosperma][more]
XP_023534067.11.0e-18275.80zinc finger protein NUTCRACKER-like [Cucurbita pepo subsp. pepo][more]
XP_022947082.18.6e-18275.43zinc finger protein NUTCRACKER-like [Cucurbita moschata][more]
Match NameE-valueIdentityDescription
A0A6J1I7B41.5e-18476.34zinc finger protein NUTCRACKER-like OS=Cucurbita maxima OX=3661 GN=LOC111469801 ... [more]
A0A6J1G5G14.1e-18275.43zinc finger protein NUTCRACKER-like OS=Cucurbita moschata OX=3662 GN=LOC11145106... [more]
A0A6J1CGM41.3e-14864.69zinc finger protein NUTCRACKER OS=Momordica charantia OX=3673 GN=LOC111011056 PE... [more]
A0A7N2MD984.2e-14261.41C2H2-type domain-containing protein OS=Quercus lobata OX=97700 PE=4 SV=1[more]
A0A0A0LQA15.5e-14262.84C2H2-type domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_2G409480 P... [more]
Match NameE-valueIdentityDescription
AT1G03840.11.1e-11552.80C2H2 and C2HC zinc fingers superfamily protein [more]
AT1G03840.26.9e-11352.40C2H2 and C2HC zinc fingers superfamily protein [more]
AT5G44160.14.6e-10950.94C2H2-like zinc finger protein [more]
AT5G03150.11.3e-9046.62C2H2-like zinc finger protein [more]
AT3G45260.11.7e-8749.48C2H2-like zinc finger protein [more]
InterPro
Analysis Name: InterPro Annotations of Snake gourd (anguina) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013087Zinc finger C2H2-typeSMARTSM00355c2h2final6coord: 133..153
e-value: 130.0
score: 3.4
coord: 57..79
e-value: 0.076
score: 22.1
IPR013087Zinc finger C2H2-typePROSITEPS00028ZINC_FINGER_C2H2_1coord: 59..79
IPR013087Zinc finger C2H2-typePROSITEPS50157ZINC_FINGER_C2H2_2coord: 57..79
score: 11.489214
NoneNo IPR availableGENE3D3.30.160.60Classic Zinc Fingercoord: 53..80
e-value: 6.0E-6
score: 28.2
NoneNo IPR availableGENE3D3.30.160.60Classic Zinc Fingercoord: 92..181
e-value: 1.5E-5
score: 27.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 303..323
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 415..435
NoneNo IPR availablePANTHERPTHR10593:SF148ZINC FINGER PROTEIN MAGPIEcoord: 24..451
NoneNo IPR availablePANTHERPTHR10593SERINE/THREONINE-PROTEIN KINASE RIOcoord: 24..451
IPR036236Zinc finger C2H2 superfamilySUPERFAMILY57667beta-beta-alpha zinc fingerscoord: 56..153

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Tan0020478.1Tan0020478.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
cellular_component GO:0005634 nucleus
molecular_function GO:0003700 DNA-binding transcription factor activity