Csa1G629020 (gene) Cucumber (Chinese Long) v2

Name: Csa1G629020
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: DNA binding protein, putative; contains IPR014476 (Predicted AT-hook DNA-binding)
Location: Chr1 : 24809836 .. 24811791 (+)
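
The genomic span of the gene follows directly from the location field. The sketch below is illustrative only: it assumes the coordinates are inclusive and that the field keeps the `Chr : start .. end (strand)` layout shown in this record, and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Chr1 : 24809836 .. 24811791 (+)' into (chrom, start, end, strand)."""
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr1 : 24809836 .. 24811791 (+)")
span = end - start + 1  # assumes both endpoints are included
print(chrom, strand, span)  # Chr1 + 1956
```

Under the inclusive-coordinate assumption, the gene spans 1956 bp on the plus strand of Chr1.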
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
ATTCCCAATTTTTCCATTTTCCAACATTCATTTTCTCTCTTCAACTTTGACCACGGGAGTTTCCGATGGCGGACTACGGCGGAGCTATTTCCCTCTCCCACCAACCTCCCACCTCCTCTTCCTCCTCCGACGACCACAGCCCTCCAACAAGACCTCAGACAAAACCCAGCAGAACCCCTACCTCCGGCGGTGCCGCCTCGTCAGTAGACACCTCAACCATGAAAAAGCCACGTGGAAGGCCACCTGGATCAAAGAACAAACCAAAGCCTCCAATAGTGATAACCAAAGAGAATGAATCAAGCATGAAGCCCGTTGTCATCGAGATCTCGGCTGGAAACGACGTCGTTGATACTCTCCTTCATTTTGCAAGAAAACGACACGTCGGCCTCACTGTCCTTAGCGGCTCTGGCTCCGTTTCAAATGTTACGTTACGACATCCAATGTCCCATTCGACATCTTTATCACTTCATGGACCTTTCAGCCTTGTTTCCCTCTCTGGCTCTTTTCTTGCTAATACCACGCCGTTTTCCTCTAAACCTCATTCTCTTTCTCCTTCTCCTTCTCCTTCTCCTTCTTCCTCTTTTGGAATTTGTCTTGCTGGTGCTCAAGGTCAGGTGAGCTTTAACCCCATTAAATCCGATACGTTATGGTGAATTTTTAATTATTCTCTCTCTCTTTTTTTTAGGTTTATTGTTCCTGTAGTTTAGAGAAAACTAAAGTTTAGTCCCATAATTTGGTAAAGTGTCTTAAATAGTTTCGATTTTTTATTCTAACTCTAAACAATATTATCAACTGTAGAACTATATGAACTAAAATTTAAATTTCTCTCAAATAATATGTAATTTACCTATAATTTTGTGAGTGAATGATGTTGGAGTTTAGTTATACTAGCAAGTTGAACACTTTTGTTTAAAGTTTAGTAAGTAATTATTTAATATTAAAAGAAGTTATTATGAATAACAAAAAAGAAATTATTTTAACACTTGACAAAAATAAACTTAAATGATTACAAATAATATGATAGATTTTGATGTATTTATAAAATATATATATATATATTTTACCATATTAAATGGATGAATAAGACCTTCCAATGTATTGTGTCTATTAGATTTATAAGAATGGTTCCTTTTGATCAAGATTTTTTTTTTTTAGTTTAGCATTTAAACTTTGTTGTTGCTTACCTTTCAATCTGAAAAGGTAAAACACATTTGAAAAAAAAAGCAAAAGTTTGAAACTACTTTCTTTTAGATAATTGCATTACATTTTAAAGCTTAGACATGATAGTTGTCACTATTTATGTTTTAGTTTTTAGTTTTTAATAAATAAGTATATAAACCATATGACTCAGTACCTAAGAGTTTCTTGTTCATGTAATGTACGTTTTGAAAGTGTTTCAAAAAATAAAATCTTTGTTTAGGGAAGATGAAAATTGTTCAAAATATTCAGCAAAGCACAATTTTGAAAAATAAAAAGGTTAAAAGTTAGGAATTTAAATTGTTATTAGTTAGGGTAAATTTAAACTTTCCATTCATATTATTCTTGACAAGGTGTTTGGAGGGATAGTTGGCGGGAAAGTCACGGCAGCAAGTCTGGTGGTGGTGGTTGCAGCGACATTTATAAATCCAGTGTTCCATCGACTGCCAAGCGAGACAACGGAGGGGGAGGACGACAGGGTTGATATGGCAAAACCCACCATCAACGCTACCGATGAGTCCCCCGTCACAGCGACTACAACAAGCTCGGCGACTCCAATGACGGTTTGCGTATACAATGCTCCATCGCCTCCGGATCATGCAATGCCGTGGGTGCCGAGTTCTCGATCATCTTACTGAAATCTCTATTAGAATAGAACATAACATACACATATTTGGTGATAGTGATTATTGTCAACTAAGAACTGATATATAACAAAAATATGTGATGGTATTTTTGTTGGGAGGCTTAATTAATTGG

mRNA sequence

ATGGCGGACTACGGCGGAGCTATTTCCCTCTCCCACCAACCTCCCACCTCCTCTTCCTCCTCCGACGACCACAGCCCTCCAACAAGACCTCAGACAAAACCCAGCAGAACCCCTACCTCCGGCGGTGCCGCCTCGTCAGTAGACACCTCAACCATGAAAAAGCCACGTGGAAGGCCACCTGGATCAAAGAACAAACCAAAGCCTCCAATAGTGATAACCAAAGAGAATGAATCAAGCATGAAGCCCGTTGTCATCGAGATCTCGGCTGGAAACGACGTCGTTGATACTCTCCTTCATTTTGCAAGAAAACGACACGTCGGCCTCACTGTCCTTAGCGGCTCTGGCTCCGTTTCAAATGTTACGTTACGACATCCAATGTCCCATTCGACATCTTTATCACTTCATGGACCTTTCAGCCTTGTTTCCCTCTCTGGCTCTTTTCTTGCTAATACCACGCCGTTTTCCTCTAAACCTCATTCTCTTTCTCCTTCTCCTTCTCCTTCTCCTTCTTCCTCTTTTGGAATTTGTCTTGCTGGTGCTCAAGGTCAGGTGTTTGGAGGGATAGTTGGCGGGAAAGTCACGGCAGCAAGTCTGGTGGTGGTGGTTGCAGCGACATTTATAAATCCAGTGTTCCATCGACTGCCAAGCGAGACAACGGAGGGGGAGGACGACAGGGTTGATATGGCAAAACCCACCATCAACGCTACCGATGAGTCCCCCGTCACAGCGACTACAACAAGCTCGGCGACTCCAATGACGGTTTGCGTATACAATGCTCCATCGCCTCCGGATCATGCAATGCCGTGGGTGCCGAGTTCTCGATCATCTTACTGA

Coding sequence (CDS)

ATGGCGGACTACGGCGGAGCTATTTCCCTCTCCCACCAACCTCCCACCTCCTCTTCCTCCTCCGACGACCACAGCCCTCCAACAAGACCTCAGACAAAACCCAGCAGAACCCCTACCTCCGGCGGTGCCGCCTCGTCAGTAGACACCTCAACCATGAAAAAGCCACGTGGAAGGCCACCTGGATCAAAGAACAAACCAAAGCCTCCAATAGTGATAACCAAAGAGAATGAATCAAGCATGAAGCCCGTTGTCATCGAGATCTCGGCTGGAAACGACGTCGTTGATACTCTCCTTCATTTTGCAAGAAAACGACACGTCGGCCTCACTGTCCTTAGCGGCTCTGGCTCCGTTTCAAATGTTACGTTACGACATCCAATGTCCCATTCGACATCTTTATCACTTCATGGACCTTTCAGCCTTGTTTCCCTCTCTGGCTCTTTTCTTGCTAATACCACGCCGTTTTCCTCTAAACCTCATTCTCTTTCTCCTTCTCCTTCTCCTTCTCCTTCTTCCTCTTTTGGAATTTGTCTTGCTGGTGCTCAAGGTCAGGTGTTTGGAGGGATAGTTGGCGGGAAAGTCACGGCAGCAAGTCTGGTGGTGGTGGTTGCAGCGACATTTATAAATCCAGTGTTCCATCGACTGCCAAGCGAGACAACGGAGGGGGAGGACGACAGGGTTGATATGGCAAAACCCACCATCAACGCTACCGATGAGTCCCCCGTCACAGCGACTACAACAAGCTCGGCGACTCCAATGACGGTTTGCGTATACAATGCTCCATCGCCTCCGGATCATGCAATGCCGTGGGTGCCGAGTTCTCGATCATCTTACTGA

Protein sequence

MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESPVTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY*
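
The protein sequence above can be reproduced from the CDS by codon-wise translation. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first and last few codons of the CDS are used as inputs.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

print(translate("ATGGCGGACTACGGCGGAGCT"))  # first 7 codons of the CDS: MADYGGA
print(translate("TCATCTTACTGA"))           # last 4 codons of the CDS: SSY*
```

Both fragments agree with the start (`MADYGGA`) and end (`SSY*`) of the listed protein sequence.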
BLAST of Csa1G629020 vs. Swiss-Prot
Match: AHL17_ARATH (AT-hook motif nuclear-localized protein 17 OS=Arabidopsis thaliana GN=AHL17 PE=2 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 2.0e-40
Identity = 98/272 (36.03%), Positives = 146/272 (53.68%), Query Frame = 1

Query: 10  LSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPPGSKNKPKPP 69
           L H           HS  +      + TPT   ++  V    +++PRGRPPGSKNKPKPP
Sbjct: 17  LPHHQQQQQQQQQQHSLTSHFHLSSTVTPTVDDSSIEV----VRRPRGRPPGSKNKPKPP 76

Query: 70  IVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNVTLRH--PMS 129
           + +T++ +  M P ++E+ +GNDVV+ +  F R++ +G+ VLSGSGSV+NVTLR   P +
Sbjct: 77  VFVTRDTDPPMSPYILEVPSGNDVVEAINRFCRRKSIGVCVLSGSGSVANVTLRQPSPAA 136

Query: 130 HSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGAQGQVFGG 189
             ++++ HG F L+S+S +FL         P     S SP  S+ F + LAG QGQ+ GG
Sbjct: 137 LGSTITFHGKFDLLSVSATFL---------PPPPRTSLSPPVSNFFTVSLAGPQGQIIGG 196

Query: 190 IVGGKVTAASLVVVVAATFINPVFHRLPSETTE----GEDDRVDMAKPTINATDESPVTA 249
            V G + +A  V V+AA+F NP +HRLP+E  +    G  +R   + P     +ES   A
Sbjct: 197 FVAGPLISAGTVYVIAASFNNPSYHRLPAEEEQKHSAGTGEREGQSPPVSGGGEESGQMA 256

Query: 250 TTTSSATPMTVCVYNAPSPPDHAMPWVPSSRS 276
              S      V +Y+        + W P++R+
Sbjct: 257 --GSGGESCGVSMYSCHMGGSDVI-WAPTARA 272
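
The percentages on each HSP statistics line are the match counts divided by the alignment length. A small sketch that recomputes them from the statistics line of the HSP above; `hsp_percentages` is a hypothetical helper and the regular expression assumes the `label = matches/length` layout used in this record.

```python
import re

def hsp_percentages(stats_line):
    """Return {label: (matches, length, percent)} for each 'label = a/b' field."""
    out = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", stats_line):
        out[label] = (int(num), int(den), round(100 * int(num) / int(den), 2))
    return out

line = "Identity = 98/272 (36.03%), Positives = 146/272 (53.68%), Query Frame = 1"
stats = hsp_percentages(line)
print(stats["Identity"])   # (98, 272, 36.03)
print(stats["Positives"])  # (146, 272, 53.68)
```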

BLAST of Csa1G629020 vs. Swiss-Prot
Match: AHL15_ARATH (AT-hook motif nuclear-localized protein 15 OS=Arabidopsis thaliana GN=AHL15 PE=2 SV=1)

HSP 1 Score: 159.1 bits (401), Expect = 7.1e-38
Identity = 103/275 (37.45%), Positives = 146/275 (53.09%), Query Frame = 1

Query: 9   SLSHQPPTSSSSSD--DH--------SPPTRPQTKP---SRTPTSGGAASSVDTSTMKKP 68
           S ++ PPT + S    DH        SP T+ Q++    SR         S   ST ++P
Sbjct: 31  SNNNNPPTMTRSDPRLDHDFTTNNSGSPNTQTQSQEEQNSRDEQPAVEPGSGSGSTGRRP 90

Query: 69  RGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSG 128
           RGRPPGSKNKPK P+V+TKE+ +S++  V+EI+ G DV ++L  FAR+R  G++VLSGSG
Sbjct: 91  RGRPPGSKNKPKSPVVVTKESPNSLQSHVLEIATGADVAESLNAFARRRGRGVSVLSGSG 150

Query: 129 SVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGI 188
            V+NVTLR P +    +SL G F ++S+ G+FL               S SP+ ++   I
Sbjct: 151 LVTNVTLRQPAASGGVVSLRGQFEILSMCGAFLPT-------------SGSPAAAAGLTI 210

Query: 189 CLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINA 248
            LAGAQGQV GG V G + A+  V+V+AATF N  + RLP E  + ++  + +       
Sbjct: 211 YLAGAQGQVVGGGVAGPLIASGPVIVIAATFCNATYERLPIEEEQQQEQPLQLEDGKKQK 270

Query: 249 TDESPVTATTTSSATPMTVCVYNAPS---PPDHAM 268
            +     +    +   M   +YN P    P  H M
Sbjct: 271 EENDDNESGNNGNEGSMQPPMYNMPPNFIPNGHQM 292

BLAST of Csa1G629020 vs. Swiss-Prot
Match: AHL22_ARATH (AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1 SV=1)

HSP 1 Score: 158.7 bits (400), Expect = 9.3e-38
Identity = 88/206 (42.72%), Positives = 127/206 (61.65%), Query Frame = 1

Query: 22  DDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPPGSKNKPKPPIVITKENESSMK 81
           ++HS   + Q+ P     SGG     D    ++PRGRP GSKNKPKPPI+IT+++ +++K
Sbjct: 59  NEHSSAGKDQSTPGSGGESGGGGGG-DNHITRRPRGRPAGSKNKPKPPIIITRDSANALK 118

Query: 82  PVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNVTLRHPMS----HSTSLSLHGP 141
             V+E++ G DV++++  FAR+R  G+ VLSG+G+V+NVT+R P S     S+ ++LHG 
Sbjct: 119 SHVMEVANGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGR 178

Query: 142 FSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAAS 201
           F ++SLSGSFL              P P+P  +S   I LAG QGQV GG V G + A+ 
Sbjct: 179 FEILSLSGSFL--------------PPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASG 238

Query: 202 LVVVVAATFINPVFHRLPSETTEGED 224
            VV++AA+F N  + RLP E  + E+
Sbjct: 239 PVVIMAASFGNAAYERLPLEEDDQEE 249

BLAST of Csa1G629020 vs. Swiss-Prot
Match: AHL23_ARATH (AT-hook motif nuclear-localized protein 23 OS=Arabidopsis thaliana GN=AHL23 PE=1 SV=1)

HSP 1 Score: 155.6 bits (392), Expect = 7.9e-37
Identity = 78/168 (46.43%), Positives = 112/168 (66.67%), Query Frame = 1

Query: 53  KKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLS 112
           ++PRGRPPGSKNKPKPP++IT+E+ ++++  ++E++ G DV D +  +AR+R  G+ VLS
Sbjct: 82  RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVTNGCDVFDCVATYARRRQRGICVLS 141

Query: 113 GSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSS 172
           GSG+V+NV++R P +    ++L G F ++SLSGSFL              P P+P  ++S
Sbjct: 142 GSGTVTNVSIRQPSAAGAVVTLQGTFEILSLSGSFL--------------PPPAPPGATS 201

Query: 173 FGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTE 221
             I LAG QGQV GG V G++TAA  V+V+AA+F N  + RLP E  E
Sbjct: 202 LTIFLAGGQGQVVGGSVVGELTAAGPVIVIAASFTNVAYERLPLEEDE 235

BLAST of Csa1G629020 vs. Swiss-Prot
Match: AHL24_ARATH (AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2 SV=1)

HSP 1 Score: 155.2 bits (391), Expect = 1.0e-36
Identity = 83/183 (45.36%), Positives = 115/183 (62.84%), Query Frame = 1

Query: 41  GGAASSVDTSTMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHF 100
           GG  S  D    ++PRGRP GSKNKPKPPI+IT+++ ++++  V+EI  G D+V+++  F
Sbjct: 93  GGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIGDGCDLVESVATF 152

Query: 101 ARKRHVGLTVLSGSGSVSNVTLRHPMSH---STSLSLHGPFSLVSLSGSFLANTTPFSSK 160
           AR+R  G+ V+SG+G+V+NVT+R P SH    + +SLHG F ++SLSGSFL         
Sbjct: 153 ARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFL--------- 212

Query: 161 PHSLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSE 220
                P P+P  ++   + LAG QGQV GG V G +  A  VVV+AA+F N  + RLP E
Sbjct: 213 -----PPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLE 261

BLAST of Csa1G629020 vs. TrEMBL
Match: A0A0A0LXI2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G629020 PE=4 SV=1)

HSP 1 Score: 555.4 bits (1430), Expect = 3.8e-155
Identity = 277/277 (100.00%), Positives = 277/277 (100.00%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP 60
           MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP
Sbjct: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP 60

Query: 61  GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV 120
           GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV
Sbjct: 61  GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV 120

Query: 121 TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA 180
           TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA
Sbjct: 121 TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA 180

Query: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240
           QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP
Sbjct: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240

Query: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 278
           VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY
Sbjct: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 277

BLAST of Csa1G629020 vs. TrEMBL
Match: A0A061GSR7_THECC (AT-hook DNA-binding family protein OS=Theobroma cacao GN=TCM_040931 PE=4 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 6.4e-70
Identity = 159/322 (49.38%), Positives = 204/322 (63.35%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTST--------- 60
           MADY GAISLS Q  TS   S +HSP + P    + +   GG++ S   S          
Sbjct: 93  MADYSGAISLS-QAHTSEDDSSEHSPRSVPTLSTAASGGGGGSSKSKTPSNKIITLDHHH 152

Query: 61  ---------------MKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDT 120
                           +KPRGRPPGSKNKPKPPIVIT++ +S+MKPV++EISAG+D++D+
Sbjct: 153 HHHHHHQTPSSSENTARKPRGRPPGSKNKPKPPIVITRDCDSAMKPVILEISAGSDIIDS 212

Query: 121 LLHFARKRHVGLTVLSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSS 180
           +++FAR+ HVG++++S +GSVSNVTLRHP+SH+ +LSLHGPF L+SL GSF+ ++T  SS
Sbjct: 213 IINFARRNHVGVSIISATGSVSNVTLRHPVSHAPALSLHGPFGLLSLCGSFIGSSTVSSS 272

Query: 181 K--PHSLSPSPSPSPSS-------SFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFI 240
              P S S S SPSPSS       SFG+ LAGAQGQVFGGIVGGKV AA+ V+VVAATFI
Sbjct: 273 NKAPQSSSSSTSPSPSSLSSPLSCSFGVTLAGAQGQVFGGIVGGKVMAATQVIVVAATFI 332

Query: 241 NPVFHRLPSETTEGEDDRVDMAKPTINA-------TDESPVTATTTSSATPMTVCVYNAP 275
           NP  HRLP E     +DR    KP +++          + V AT + S+  M++ VY   
Sbjct: 333 NPALHRLPCE--GDNEDRHQETKPGVHSNVGGGGGATAAAVGATESCSSAGMSMSVYGVA 392

BLAST of Csa1G629020 vs. TrEMBL
Match: W9T373_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_000140 PE=4 SV=1)

HSP 1 Score: 270.4 bits (690), Expect = 2.4e-69
Identity = 159/304 (52.30%), Positives = 200/304 (65.79%), Query Frame = 1

Query: 1   MADYGGA-ISLSH------QPPTSSSSSDDHSPPTRPQTKPSRTPTSGG---AASSVDTS 60
           MADY GA +SLS        P +   ++D    P   Q K S  P       ++SS  + 
Sbjct: 1   MADYAGAAMSLSQGGGRDLSPTSDDQNNDSDQSPRSLQAKTSSKPRRMALPLSSSSPLSE 60

Query: 61  TMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTV 120
           T +KPRGRPPGSKNKPKPPIVITK+++S+MKPVV+EISAG+DVVD+++ FAR+R VG+T+
Sbjct: 61  TTRKPRGRPPGSKNKPKPPIVITKDSDSAMKPVVLEISAGSDVVDSVIQFARRRRVGITL 120

Query: 121 LSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFL----ANTT--PFSSKPHSLSPS 180
           LSGSGSVSNVTLRHPMSH+ +LSLHGPF+L+SLSGS +     NTT  P  +   S S S
Sbjct: 121 LSGSGSVSNVTLRHPMSHAPALSLHGPFTLLSLSGSVVGPTNTNTTSSPTETTTSSSSSS 180

Query: 181 PSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGED- 240
           P+ S + SFGICLAGAQGQVFGGIVGGKV AAS V VVA TF+NP FHRLP +  +  + 
Sbjct: 181 PTTSSNPSFGICLAGAQGQVFGGIVGGKVIAASAVAVVATTFVNPSFHRLPGDNQDNVEG 240

Query: 241 ----DRVDMAKPTINATDESPVTATTTS-----SATPMTVCVYNAPSPPDHAMP-WVPSS 278
               D  ++    +  T E+  T+T+++     S TP+        SP  H MP W  SS
Sbjct: 241 THNHDHQEIKPCVVGGTHETTYTSTSSACTHVVSPTPINC---QLSSPDHHVMPSWGHSS 300

BLAST of Csa1G629020 vs. TrEMBL
Match: A0A0D2LYC4_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_001G133900 PE=4 SV=1)

HSP 1 Score: 268.1 bits (684), Expect = 1.2e-68
Identity = 155/303 (51.16%), Positives = 194/303 (64.03%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDT----------- 60
           MADYG AISLS Q  TS   S +HSP + P     R   SGG  S   T           
Sbjct: 1   MADYGVAISLS-QAHTSDGDSSEHSPRSVP-----RLSASGGGGSKSKTPSNKIVTLDYH 60

Query: 61  --------STMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFA 120
                   +T +KPRGRPPGSKNKPKPPIVIT+++ S+MKPV++EISAG+D++D ++ FA
Sbjct: 61  HRTPSSSDNTGRKPRGRPPGSKNKPKPPIVITRDSNSTMKPVILEISAGSDIIDAIISFA 120

Query: 121 RKRHVGLTVLSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPH-- 180
           R   VG++++S +GSVSNVTL HP+SH+ +LSLHGPFSL+SLSGSF+A++T  S+K    
Sbjct: 121 RTHSVGVSIISATGSVSNVTLCHPVSHAPALSLHGPFSLLSLSGSFIASSTLSSNKTSQS 180

Query: 181 ---SLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPS 240
              S SPSPS S S SFG+ LAGAQGQVFGG VGGKV AA+LV+V AATF+NP FH LP 
Sbjct: 181 SSLSTSPSPSLSSSGSFGVTLAGAQGQVFGGKVGGKVMAATLVIVAAATFVNPEFHMLPG 240

Query: 241 ETTEGEDDRVDMAKPTINATDESPVTATTTSSATPMTVCVYNAPSP-----PDHAMPWVP 275
           E      D    +KP+ +       T + TS+   M V    +P+P     P   MPW P
Sbjct: 241 E--GDNKDHNQESKPSTHGCVAGGATESCTSTGLSMPVYGVASPTPLNCQIPPDVMPWGP 295

BLAST of Csa1G629020 vs. TrEMBL
Match: A0A0D2QB68_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_009G075100 PE=4 SV=1)

HSP 1 Score: 263.8 bits (673), Expect = 2.3e-67
Identity = 154/295 (52.20%), Positives = 196/295 (66.44%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSP---PTRPQ-TKPSRTPTSGGAASSVDTSTMKKPR 60
           MADY GAISLS Q   S   S +HSP   PT P  +  S+TPT+    +       +KPR
Sbjct: 1   MADYSGAISLS-QAHVSDDDSSEHSPRSVPTSPAFSSKSKTPTNM-IVTLDHHHHQRKPR 60

Query: 61  GRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGS 120
           GRPPGSKNKPKPPIVIT++  S+MKPV+++ISAG+D++D ++ FAR  HVG+ +++ +GS
Sbjct: 61  GRPPGSKNKPKPPIVITRDTGSAMKPVILDISAGSDIIDAIITFARSNHVGICIINVTGS 120

Query: 121 VSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLS--PSPSPSPSS--- 180
           VSNVTLRHP+S + +LSLHGPF L+SLSGS++A+TT  SS   S S  PS SPSPSS   
Sbjct: 121 VSNVTLRHPVSQAPALSLHGPFGLLSLSGSYIASTTISSSTETSQSSQPSSSPSPSSLSP 180

Query: 181 ----SFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVD 240
               SFGI LAGAQGQVFGG++GGKV AA+ V+VVAATF+NP FHRLP +  + ED   +
Sbjct: 181 NLSCSFGITLAGAQGQVFGGMIGGKVIAATQVIVVAATFVNPAFHRLPCK-GDNEDTHQE 240

Query: 241 MAKPTINATDESPVTATTTSSATPMTVCVYNAPSP--------PDHAMPWVPSSR 275
                          AT + S+T M+  VY++P P        PD  MPW P SR
Sbjct: 241 TKHCIHGNVGGGASGATESCSSTGMSTAVYSSPCPTPLNCQISPD-VMPWGPPSR 291

BLAST of Csa1G629020 vs. TAIR10
Match: AT5G49700.1 (AT5G49700.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 167.5 bits (423), Expect = 1.1e-41
Identity = 98/272 (36.03%), Positives = 146/272 (53.68%), Query Frame = 1

Query: 10  LSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPPGSKNKPKPP 69
           L H           HS  +      + TPT   ++  V    +++PRGRPPGSKNKPKPP
Sbjct: 17  LPHHQQQQQQQQQQHSLTSHFHLSSTVTPTVDDSSIEV----VRRPRGRPPGSKNKPKPP 76

Query: 70  IVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNVTLRH--PMS 129
           + +T++ +  M P ++E+ +GNDVV+ +  F R++ +G+ VLSGSGSV+NVTLR   P +
Sbjct: 77  VFVTRDTDPPMSPYILEVPSGNDVVEAINRFCRRKSIGVCVLSGSGSVANVTLRQPSPAA 136

Query: 130 HSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGAQGQVFGG 189
             ++++ HG F L+S+S +FL         P     S SP  S+ F + LAG QGQ+ GG
Sbjct: 137 LGSTITFHGKFDLLSVSATFL---------PPPPRTSLSPPVSNFFTVSLAGPQGQIIGG 196

Query: 190 IVGGKVTAASLVVVVAATFINPVFHRLPSETTE----GEDDRVDMAKPTINATDESPVTA 249
            V G + +A  V V+AA+F NP +HRLP+E  +    G  +R   + P     +ES   A
Sbjct: 197 FVAGPLISAGTVYVIAASFNNPSYHRLPAEEEQKHSAGTGEREGQSPPVSGGGEESGQMA 256

Query: 250 TTTSSATPMTVCVYNAPSPPDHAMPWVPSSRS 276
              S      V +Y+        + W P++R+
Sbjct: 257 --GSGGESCGVSMYSCHMGGSDVI-WAPTARA 272

BLAST of Csa1G629020 vs. TAIR10
Match: AT3G55560.1 (AT3G55560.1 AT-hook protein of GA feedback 2)

HSP 1 Score: 159.1 bits (401), Expect = 4.0e-39
Identity = 103/275 (37.45%), Positives = 146/275 (53.09%), Query Frame = 1

Query: 9   SLSHQPPTSSSSSD--DH--------SPPTRPQTKP---SRTPTSGGAASSVDTSTMKKP 68
           S ++ PPT + S    DH        SP T+ Q++    SR         S   ST ++P
Sbjct: 31  SNNNNPPTMTRSDPRLDHDFTTNNSGSPNTQTQSQEEQNSRDEQPAVEPGSGSGSTGRRP 90

Query: 69  RGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSG 128
           RGRPPGSKNKPK P+V+TKE+ +S++  V+EI+ G DV ++L  FAR+R  G++VLSGSG
Sbjct: 91  RGRPPGSKNKPKSPVVVTKESPNSLQSHVLEIATGADVAESLNAFARRRGRGVSVLSGSG 150

Query: 129 SVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGI 188
            V+NVTLR P +    +SL G F ++S+ G+FL               S SP+ ++   I
Sbjct: 151 LVTNVTLRQPAASGGVVSLRGQFEILSMCGAFLPT-------------SGSPAAAAGLTI 210

Query: 189 CLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINA 248
            LAGAQGQV GG V G + A+  V+V+AATF N  + RLP E  + ++  + +       
Sbjct: 211 YLAGAQGQVVGGGVAGPLIASGPVIVIAATFCNATYERLPIEEEQQQEQPLQLEDGKKQK 270

Query: 249 TDESPVTATTTSSATPMTVCVYNAPS---PPDHAM 268
            +     +    +   M   +YN P    P  H M
Sbjct: 271 EENDDNESGNNGNEGSMQPPMYNMPPNFIPNGHQM 292

BLAST of Csa1G629020 vs. TAIR10
Match: AT2G45430.1 (AT2G45430.1 AT-hook motif nuclear-localized protein 22)

HSP 1 Score: 158.7 bits (400), Expect = 5.2e-39
Identity = 88/206 (42.72%), Positives = 127/206 (61.65%), Query Frame = 1

Query: 22  DDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPPGSKNKPKPPIVITKENESSMK 81
           ++HS   + Q+ P     SGG     D    ++PRGRP GSKNKPKPPI+IT+++ +++K
Sbjct: 59  NEHSSAGKDQSTPGSGGESGGGGGG-DNHITRRPRGRPAGSKNKPKPPIIITRDSANALK 118

Query: 82  PVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNVTLRHPMS----HSTSLSLHGP 141
             V+E++ G DV++++  FAR+R  G+ VLSG+G+V+NVT+R P S     S+ ++LHG 
Sbjct: 119 SHVMEVANGCDVMESVTVFARRRQRGICVLSGNGAVTNVTIRQPASVPGGGSSVVNLHGR 178

Query: 142 FSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAAS 201
           F ++SLSGSFL              P P+P  +S   I LAG QGQV GG V G + A+ 
Sbjct: 179 FEILSLSGSFL--------------PPPAPPAASGLTIYLAGGQGQVVGGSVVGPLMASG 238

Query: 202 LVVVVAATFINPVFHRLPSETTEGED 224
            VV++AA+F N  + RLP E  + E+
Sbjct: 239 PVVIMAASFGNAAYERLPLEEDDQEE 249

BLAST of Csa1G629020 vs. TAIR10
Match: AT4G17800.1 (AT4G17800.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 155.6 bits (392), Expect = 4.4e-38
Identity = 78/168 (46.43%), Positives = 112/168 (66.67%), Query Frame = 1

Query: 53  KKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLS 112
           ++PRGRPPGSKNKPKPP++IT+E+ ++++  ++E++ G DV D +  +AR+R  G+ VLS
Sbjct: 82  RRPRGRPPGSKNKPKPPVIITRESANTLRAHILEVTNGCDVFDCVATYARRRQRGICVLS 141

Query: 113 GSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSS 172
           GSG+V+NV++R P +    ++L G F ++SLSGSFL              P P+P  ++S
Sbjct: 142 GSGTVTNVSIRQPSAAGAVVTLQGTFEILSLSGSFL--------------PPPAPPGATS 201

Query: 173 FGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTE 221
             I LAG QGQV GG V G++TAA  V+V+AA+F N  + RLP E  E
Sbjct: 202 LTIFLAGGQGQVVGGSVVGELTAAGPVIVIAASFTNVAYERLPLEEDE 235

BLAST of Csa1G629020 vs. TAIR10
Match: AT4G22810.1 (AT4G22810.1 Predicted AT-hook DNA-binding family protein)

HSP 1 Score: 155.2 bits (391), Expect = 5.8e-38
Identity = 83/183 (45.36%), Positives = 115/183 (62.84%), Query Frame = 1

Query: 41  GGAASSVDTSTMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHF 100
           GG  S  D    ++PRGRP GSKNKPKPPI+IT+++ ++++  V+EI  G D+V+++  F
Sbjct: 93  GGGGSGGDHQMTRRPRGRPAGSKNKPKPPIIITRDSANALRTHVMEIGDGCDLVESVATF 152

Query: 101 ARKRHVGLTVLSGSGSVSNVTLRHPMSH---STSLSLHGPFSLVSLSGSFLANTTPFSSK 160
           AR+R  G+ V+SG+G+V+NVT+R P SH    + +SLHG F ++SLSGSFL         
Sbjct: 153 ARRRQRGVCVMSGTGNVTNVTIRQPGSHPSPGSVVSLHGRFEILSLSGSFL--------- 212

Query: 161 PHSLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSE 220
                P P+P  ++   + LAG QGQV GG V G +  A  VVV+AA+F N  + RLP E
Sbjct: 213 -----PPPAPPTATGLSVYLAGGQGQVVGGSVVGPLLCAGPVVVMAASFSNAAYERLPLE 261

BLAST of Csa1G629020 vs. NCBI nr
Match: gi|449442723|ref|XP_004139130.1| (PREDICTED: AT-hook motif nuclear-localized protein 17-like [Cucumis sativus])

HSP 1 Score: 555.4 bits (1430), Expect = 5.5e-155
Identity = 277/277 (100.00%), Positives = 277/277 (100.00%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP 60
           MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP
Sbjct: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP 60

Query: 61  GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV 120
           GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV
Sbjct: 61  GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV 120

Query: 121 TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA 180
           TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA
Sbjct: 121 TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA 180

Query: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240
           QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP
Sbjct: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240

Query: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 278
           VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY
Sbjct: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 277

BLAST of Csa1G629020 vs. NCBI nr
Match: gi|659098900|ref|XP_008450342.1| (PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo])

HSP 1 Score: 529.3 bits (1362), Expect = 4.2e-147
Identity = 265/277 (95.67%), Positives = 270/277 (97.47%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP 60
           MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKP++TP SG AASS DTSTMKKPRGRPP
Sbjct: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPNKTPISG-AASSADTSTMKKPRGRPP 60

Query: 61  GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV 120
           GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV
Sbjct: 61  GSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTVLSGSGSVSNV 120

Query: 121 TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA 180
           TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA
Sbjct: 121 TLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSSKPHSLSPSPSPSPSSSFGICLAGA 180

Query: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGEDDRVDMAKPTINATDESP 240
           QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSET   +D+ +DMAKPTINATDESP
Sbjct: 181 QGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETAAEDDEGIDMAKPTINATDESP 240

Query: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 278
           VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY
Sbjct: 241 VTATTTSSATPMTVCVYNAPSPPDHAMPWVPSSRSSY 276

BLAST of Csa1G629020 vs. NCBI nr
Match: gi|1009114395|ref|XP_015873664.1| (PREDICTED: AT-hook motif nuclear-localized protein 17-like [Ziziphus jujuba])

HSP 1 Score: 289.3 bits (739), Expect = 7.3e-75
Identity = 170/312 (54.49%), Positives = 208/312 (66.67%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSD---DHSPPTRP---------------QTKPSRTPTSGG 60
           MADYGGAISLS     S +S D   D+SP + P                +KP +   S G
Sbjct: 1   MADYGGAISLSQGRDLSHTSDDSDSDNSPRSVPILSCGAAGAAGGGSSSSKPRKQGGSAG 60

Query: 61  AASSVDTST--MKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHF 120
           A + + +S    KKPRGRPPGSKNKPKPPIVITK+++ +MKPV++EISAG+DVV++++ F
Sbjct: 61  AVADIGSSLEISKKPRGRPPGSKNKPKPPIVITKDSDLAMKPVILEISAGSDVVESVMQF 120

Query: 121 ARKRHVGLTVLSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTT-------- 180
           AR+RHVG+TVLSGSGSVSNVTLRHP+SH+ +LSLHGPFSL+SL GSF+ +++        
Sbjct: 121 ARRRHVGITVLSGSGSVSNVTLRHPVSHAPALSLHGPFSLLSLCGSFVGSSSITSPSSSS 180

Query: 181 -PFSSKPHSLSPSPSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVF 240
            P SS   S SPS S S  SSFGICLAGAQGQVFGGIVGGKV AASLVVVVAATF++P F
Sbjct: 181 KPSSSSSSSSSPSSSLSSCSSFGICLAGAQGQVFGGIVGGKVIAASLVVVVAATFLSPSF 240

Query: 241 HRLPSETTEGEDDRVDMAKPTINA-TDESPVTATTTSSATPMTVCVYNAPSP-------- 275
           HRLP+E  E E+      KP  N+ T         +   T M++ VYN  SP        
Sbjct: 241 HRLPNEGDEAEE-----TKPIRNSNTSGGGANEGCSGPCTGMSMSVYNVASPNPINCQIS 300

BLAST of Csa1G629020 vs. NCBI nr
Match: gi|590584830|ref|XP_007015286.1| (AT-hook DNA-binding family protein [Theobroma cacao])

HSP 1 Score: 272.3 bits (695), Expect = 9.2e-70
Identity = 159/322 (49.38%), Positives = 204/322 (63.35%), Query Frame = 1

Query: 1   MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTST--------- 60
           MADY GAISLS Q  TS   S +HSP + P    + +   GG++ S   S          
Sbjct: 93  MADYSGAISLS-QAHTSEDDSSEHSPRSVPTLSTAASGGGGGSSKSKTPSNKIITLDHHH 152

Query: 61  ---------------MKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDT 120
                           +KPRGRPPGSKNKPKPPIVIT++ +S+MKPV++EISAG+D++D+
Sbjct: 153 HHHHHHQTPSSSENTARKPRGRPPGSKNKPKPPIVITRDCDSAMKPVILEISAGSDIIDS 212

Query: 121 LLHFARKRHVGLTVLSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFLANTTPFSS 180
           +++FAR+ HVG++++S +GSVSNVTLRHP+SH+ +LSLHGPF L+SL GSF+ ++T  SS
Sbjct: 213 IINFARRNHVGVSIISATGSVSNVTLRHPVSHAPALSLHGPFGLLSLCGSFIGSSTVSSS 272

Query: 181 K--PHSLSPSPSPSPSS-------SFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFI 240
              P S S S SPSPSS       SFG+ LAGAQGQVFGGIVGGKV AA+ V+VVAATFI
Sbjct: 273 NKAPQSSSSSTSPSPSSLSSPLSCSFGVTLAGAQGQVFGGIVGGKVMAATQVIVVAATFI 332

Query: 241 NPVFHRLPSETTEGEDDRVDMAKPTINA-------TDESPVTATTTSSATPMTVCVYNAP 275
           NP  HRLP E     +DR    KP +++          + V AT + S+  M++ VY   
Sbjct: 333 NPALHRLPCE--GDNEDRHQETKPGVHSNVGGGGGATAAAVGATESCSSAGMSMSVYGVA 392

BLAST of Csa1G629020 vs. NCBI nr
Match: gi|703162764|ref|XP_010113138.1| (hypothetical protein L484_000140 [Morus notabilis])

HSP 1 Score: 270.4 bits (690), Expect = 3.5e-69
Identity = 159/304 (52.30%), Positives = 200/304 (65.79%), Query Frame = 1

Query: 1   MADYGGA-ISLSH------QPPTSSSSSDDHSPPTRPQTKPSRTPTSGG---AASSVDTS 60
           MADY GA +SLS        P +   ++D    P   Q K S  P       ++SS  + 
Sbjct: 1   MADYAGAAMSLSQGGGRDLSPTSDDQNNDSDQSPRSLQAKTSSKPRRMALPLSSSSPLSE 60

Query: 61  TMKKPRGRPPGSKNKPKPPIVITKENESSMKPVVIEISAGNDVVDTLLHFARKRHVGLTV 120
           T +KPRGRPPGSKNKPKPPIVITK+++S+MKPVV+EISAG+DVVD+++ FAR+R VG+T+
Sbjct: 61  TTRKPRGRPPGSKNKPKPPIVITKDSDSAMKPVVLEISAGSDVVDSVIQFARRRRVGITL 120

Query: 121 LSGSGSVSNVTLRHPMSHSTSLSLHGPFSLVSLSGSFL----ANTT--PFSSKPHSLSPS 180
           LSGSGSVSNVTLRHPMSH+ +LSLHGPF+L+SLSGS +     NTT  P  +   S S S
Sbjct: 121 LSGSGSVSNVTLRHPMSHAPALSLHGPFTLLSLSGSVVGPTNTNTTSSPTETTTSSSSSS 180

Query: 181 PSPSPSSSFGICLAGAQGQVFGGIVGGKVTAASLVVVVAATFINPVFHRLPSETTEGED- 240
           P+ S + SFGICLAGAQGQVFGGIVGGKV AAS V VVA TF+NP FHRLP +  +  + 
Sbjct: 181 PTTSSNPSFGICLAGAQGQVFGGIVGGKVIAASAVAVVATTFVNPSFHRLPGDNQDNVEG 240

Query: 241 ----DRVDMAKPTINATDESPVTATTTS-----SATPMTVCVYNAPSPPDHAMP-WVPSS 278
               D  ++    +  T E+  T+T+++     S TP+        SP  H MP W  SS
Sbjct: 241 THNHDHQEIKPCVVGGTHETTYTSTSSACTHVVSPTPINC---QLSSPDHHVMPSWGHSS 300

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name	E-value	Identity	Description
AHL17_ARATH	2.0e-40	36.03	AT-hook motif nuclear-localized protein 17 OS=Arabidopsis thaliana GN=AHL17 PE=2...
AHL15_ARATH	7.1e-38	37.45	AT-hook motif nuclear-localized protein 15 OS=Arabidopsis thaliana GN=AHL15 PE=2...
AHL22_ARATH	9.3e-38	42.72	AT-hook motif nuclear-localized protein 22 OS=Arabidopsis thaliana GN=AHL22 PE=1...
AHL23_ARATH	7.9e-37	46.43	AT-hook motif nuclear-localized protein 23 OS=Arabidopsis thaliana GN=AHL23 PE=1...
AHL24_ARATH	1.0e-36	45.36	AT-hook motif nuclear-localized protein 24 OS=Arabidopsis thaliana GN=AHL24 PE=2...

vs. TrEMBL
Match Name	E-value	Identity	Description
A0A0A0LXI2_CUCSA	3.8e-155	100.00	Uncharacterized protein OS=Cucumis sativus GN=Csa_1G629020 PE=4 SV=1
A0A061GSR7_THECC	6.4e-70	49.38	AT-hook DNA-binding family protein OS=Theobroma cacao GN=TCM_040931 PE=4 SV=1
W9T373_9ROSA	2.4e-69	52.30	Uncharacterized protein OS=Morus notabilis GN=L484_000140 PE=4 SV=1
A0A0D2LYC4_GOSRA	1.2e-68	51.16	Uncharacterized protein OS=Gossypium raimondii GN=B456_001G133900 PE=4 SV=1
A0A0D2QB68_GOSRA	2.3e-67	52.20	Uncharacterized protein OS=Gossypium raimondii GN=B456_009G075100 PE=4 SV=1

vs. TAIR10
Match Name	E-value	Identity	Description
AT5G49700.1	1.1e-41	36.03	Predicted AT-hook DNA-binding family protein
AT3G55560.1	4.0e-39	37.45	AT-hook protein of GA feedback 2
AT2G45430.1	5.2e-39	42.72	AT-hook motif nuclear-localized protein 22
AT4G17800.1	4.4e-38	46.43	Predicted AT-hook DNA-binding family protein
AT4G22810.1	5.8e-38	45.36	Predicted AT-hook DNA-binding family protein

vs. NCBI nr
Match Name	E-value	Identity	Description
gi|449442723|ref|XP_004139130.1|	5.5e-155	100.00	PREDICTED: AT-hook motif nuclear-localized protein 17-like [Cucumis sativus]
gi|659098900|ref|XP_008450342.1|	4.2e-147	95.67	PREDICTED: putative DNA-binding protein ESCAROLA [Cucumis melo]
gi|1009114395|ref|XP_015873664.1|	7.3e-75	54.49	PREDICTED: AT-hook motif nuclear-localized protein 17-like [Ziziphus jujuba]
gi|590584830|ref|XP_007015286.1|	9.2e-70	49.38	AT-hook DNA-binding family protein [Theobroma cacao]
gi|703162764|ref|XP_010113138.1|	3.5e-69	52.30	hypothetical protein L484_000140 [Morus notabilis]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR005175	PPC_dom
IPR014476	AT-hook_nuclear
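
The AT-hook annotation can be spot-checked against the protein sequence in this record. The sketch below assumes the commonly described AT-hook core, Arg-Gly-Arg-Pro (`RGRP`), as the search pattern; this is a simplification rather than the InterPro signature itself, and `find_motif` is a hypothetical helper.

```python
import re

# First 60 residues of the protein sequence listed in this record.
N_TERM = "MADYGGAISLSHQPPTSSSSSDDHSPPTRPQTKPSRTPTSGGAASSVDTSTMKKPRGRPP"

def find_motif(seq, pattern="RGRP"):
    """Return 1-based start positions of every (possibly overlapping) match."""
    return [m.start() + 1 for m in re.finditer(f"(?={pattern})", seq)]

print(find_motif(N_TERM))  # [56]
```

The core motif starts at residue 56, inside the `RGRPPGSKNKPKPP` stretch that is conserved in the BLAST alignments above.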
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0008150	biological_process
biological_process	GO:0006355	regulation of transcription, DNA-templated
cellular_component	GO:0005575	cellular_component
cellular_component	GO:0005634	nucleus
molecular_function	GO:0003674	molecular_function
molecular_function	GO:0003680	AT DNA binding
molecular_function	GO:0003700	transcription factor activity, sequence-specific DNA binding
This gene is associated with the following unigenes:
Unigene Name	Analysis Name	Sequence type in Unigene
CU126477	cucumber EST collection version 3.0	transcribed_cluster
CU127780	cucumber EST collection version 3.0	transcribed_cluster
CU147384	cucumber EST collection version 3.0	transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
Csa1G629020.1	Csa1G629020.1	mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name	Unique Name	Type
CU126477	CU126477	transcribed_cluster
CU147384	CU147384	transcribed_cluster
CU127780	CU127780	transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR005175	PPC domain	PFAM	PF03479	DUF296	coord: 81..207; score: 1.6
IPR005175	PPC domain	PROFILE	PS51742	PPC	coord: 77..227; score: (not given)
IPR014476	AT-hook motif nuclear-localised protein	PIR	PIRSF016021	ESCAROLA	coord: 6..272; score: 1.2
None	No IPR available	GENE3D	G3DSA:3.30.1330.80	(none)	coord: 80..220; score: 3.7
None	No IPR available	PANTHER	PTHR31100	FAMILY NOT NAMED	coord: 8..277; score: 1.0
None	No IPR available	unknown	SSF117856	AF0104/ALDC/Ptd012-like	coord: 78..220; score: 1.41
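
The `coord:` fields can be turned into domain lengths and subsequences. This is a sketch only: it assumes the coordinates are 1-based, inclusive residue positions on the protein, and `domain_span` and `slice_domain` are hypothetical helper names.

```python
import re

def domain_span(alignment):
    """Parse 'coord: 81..207' into (start, end, length), assuming inclusive ends."""
    m = re.search(r"coord:\s*(\d+)\.\.(\d+)", alignment)
    if m is None:
        raise ValueError(f"no coordinates found in {alignment!r}")
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1

def slice_domain(protein, start, end):
    """Return residues start..end (1-based, inclusive) of a protein string."""
    return protein[start - 1:end]

print(domain_span("coord: 81..207"))      # (81, 207, 127)
print(slice_domain("ABCDEFGHIJ", 3, 5))   # CDE
```

Applied to the PFAM hit, the PPC/DUF296 domain would cover 127 residues; passing the full protein sequence from this record to `slice_domain` would return the corresponding segment.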