Csa4G047920 (gene) Cucumber (Chinese Long) v2

Name: Csa4G047920
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: DNA binding protein, putative; contains IPR001606 (ARID/BRIGHT DNA-binding domain)
Location: Chr4 : 3770368 .. 3775455 (+)
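
The location field can be read programmatically. The sketch below is an illustration only: the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Parse a string like 'Chr4 : 3770368 .. 3775455 (+)'.

    Returns (chromosome, start, end, strand). Hypothetical helper,
    not part of any database API.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr4 : 3770368 .. 3775455 (+)")

# Assumption: coordinates are 1-based and inclusive, so the span
# covered by the feature is end - start + 1 bases.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 5088 bases on the plus strand of Chr4.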
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
CTGCCCCCCACTTCCACCATTTTTAAACTCATCAAAGTTGCATCCTTCTCCATTTGACCGTCCCAATTTACTCACCCTTTCTCTCTTGATTCCATCTTCCATCTTCAACCTTACAATCTGCGTCTTCATAACATGTTGGAGTCCCAGATTCCCAAAATGAGGTAGACCCTTTTGAGTTTTCTTTGCAAAGGGTTTTATCTATTATTTGCCCTAATCTTGTTGGTTTATTTGTTTCATATGTATCTTTAACGAGAAAAGAAAACAACAATACGAACCCCTGTACTGCGATTTTTTTTTCTTTTTTATTTTAAAGTGAGGGTTTGAGTTCATGGGGAGATGGCCTATTTCATCCAATGATTCCATTCTAGATTGCAACAAAGATGTTGATCCTAATCCCAGTTATGGCTATTGCATTGCCCCGGATTGTTTGGTAGAAGGAAGTTGTGCTAATGTTGATCATGATGATTGCAAAGCCACGATTAGATGCTATTTTGAAAAAGTTCTTTGGGTTTTTCTAAAGGAAACGTGTCGTAGAGGATTTATTAGACCAGTGCCGGCGTTACTAGGTGAAGGGGAATCTTTGGATTTGTTTGAACTGTTCATGGTTGTTAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTTGTTGTGGAATTAGGTTTGGATCTTGGGCTTTCGGCTTCAGTGAAATTGATTTATTTCAAGTACTTAAGTGACCTTGAGAAATGGCTTATGGTGAGACGTGGAGGCACAAAATTGGAAAATGGGAACTCCGATTGTTATTACTACAGGAAAAACTTTCCATGTTTGGCAGAACTGGAGGCAAAGATTAAGGATATATTATATGGTGTGCTGAGACAAAAGAGCATATATGATGAACGATCTGGATTCAAATCCAACAAACCAAATGGGAACGTTAATGTTGCTGAAACTGCAGCAGAGAAGGAAATAAAATCCCCTAAAATAGAGAAGAAAGAACATGATCTTCACGAGGATGTCACACCAATTCAACAAAATTGTACTGAGACACCTCGGGACAATGGCAAAACCAATCAAATCCATGTTATTGGAGATTGTAGAAGTTCGGATGCTGTTAATGTTGAAACTGAAACAGACTCTCATGGGAGCTCTCGAGAATCTTTATTTCGAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGCAAATCCATCAAATGGTACAGTACCGGGGTCATCCAAGTGGAAAGCGTATGCTAGCGAGGATGCATTATGGCTTCAAGTTATCAAGGCAAAGGATGCTCTTTTAAATAGGAAGGATGTTGACAAAACCGCTGAGAAACGTCTGTTAATACAGGTAGATTTCACTCACTTATTCTGCTTGTCAGCCAGCACTCTCTCTCCCTCCCTCTTCTATCTTCCTCTTATTGCTTGACCGTTAGACATATACACATGGATTCTTGCTTTGGACTGCTATTATGTACAAAAACTGACCTCTGCAGGCAATACGATCTTTGGCTTTCTTTTTCAGGATAAATGTCTCCTGATTATAATCAACGCCCTTTTCATTCTTATGTAGAATGGAAAAAGTTACCTCCTGCTATTAAAACCTCAGATCTCTATCATCATAAGCAAGCAGCATACAACTTCTAAGTCACCCTATCTTGTGTCCTACGCGATAACACGATGGACACAGTGAGGCAAACATAAATCTTTGTCATTGACGATGGGTCTACTAGCATAGTTTTTTTTTTTTTTTTTTATCACGGATACTTCAATGACAATAGTGCAAGGTTGTTCCTGGTTTATGGGTTCTGATGTTCTGTTTCGCTTCACCTCTTAACTCACTATTAGGAGTTCACATCTCTCAGTTTTCAAGCAAGTGTGCCTGTAAAGAAAGAGTTCTCTCCACCGCCAAAAGAATCTGTATATATTTATTATATTTCTTTTGGGGGGTCTTAGTGTTTTCCAAATGGAAGTTGTTATTCATAAGTTATAAA
GTCGAGTGTCAAGTTGTGCATGTTGCCTATGCTTTCGGTGGGGGGAGCTCCAAGGCTGCTCATAGGTTTTCATACTTATAGGAAGGATGAGTTCTTCAAAGAATAAGTTTAGTGTAAATGTATCTGGAATTCTAGGACTAGGAAATTGAAAGTAGATTTTACATAGATTCGCAGAGATCAGAAAGATTGAAAGGGCGAGAGTTTAATCTAAGAATTCTGATGATTGAATATGACTTAAGTGATAGTTGGAGGAGCTGCGTATTAGTGAGAGCTCTTACTTGGAGAAAAATTCCAAAGTCTAGGGGTTTAGGAGGGCGAGGTTAATACTAATAGCACCTAGGAGGAATAGCAACCTAATGATTGTGAATAAGGCTGCTTTCACTAAGGAGGAGGATATGTAAAGTGGGATTATTGGTATATTTATAATAACAAGGGCTGTCTCAATCTTCTGATTTCTTTTTCATGGACCAAGATTTTTGTTGTGGGGGGGAGGGGATACTCTTATATGTATAGCAACTACATTTACCTTCTGCCTGTAGGATCCTAGGATTAATGGCTGTGATTGTGATTAGTAGGATGGGGATCTTTCTCACAGCCTCTGATATTTGTCTTCCTTCATGCAACTTTGTTTCCAGATATCAAGGGTTGTTAGACAGAGGACTTAGCTCTCCTTTTCCTTCTTTTTCCTGCTGATGCTCTTGATCATCTATAGAAAGGGTTGAGAGGACTTGTCTATTATTTTTAGATGAGTGGAGGCTTGATTTCAAACTCATCTGCCATACATTTGATTGTCCATGTTCTTATAGGAAGAGATTCACCAAGTGAAGAAAGCCTTTCTGGGAGGAGGAGTCTTGCCCTTACCCAATTGATGTTTTCTTAATGCCCTTTTTTCACACACACACACACACATTTAGAGCCTCTCTTGCTTCTCATTGTGGAGGACAAATTGTGTTATGAGGAATCTATTGTAGTAATGGATGAAGATTGTTTTGATTCTCACCTGGTTTGTTGGATTTGGGTGGTGTGCCTTTTTGCTTTAGTTGTTTTAGGTATCAGAAACTTGGATGCTAAGAACATTGTTTTGCTGGATAAGTGGGTTTTAGAGATTCTAAACGGAGGTTGCTTTGCGTTTGGTGTAAAATTGTTACGGGCAAGAGCGGTCTTTAGGTTAATGGCTGATCCTTCATTCTTGTTAGGTAGTACACACCATAGTCCTTGAAAGTTCATTTTCTTGGGCTTTTGTTCTCGTTGTCCATGGACAATGGAGAGTATGTCCTTTAGGAGTAGAGGTGATCTAGTAGGTAGGAACCCTCAATTTATTGGAACAAATCATTTTAGTGATATAATAAAATATATGTACATATAGGGACAAAAGGAGAGAAAAGCCTCAACCCACCACTGAGGAGGTCACAAGAAAGTTCCATCCTTAGGCAAACTAAAAAATGTTGAATAAAAATCAATGTAGTGAAGTTTCCACACGTGTACACATGTATGATACTAGGTAGAAATATCATTGTTTTGCATCGGGCTTTTCATGTATCTTTTGTTTTTCTTTTGAAAATTTGGGGAACATTGTGAGCTTGTTTTTACATATATATGACAGAAATGTTCTAACGCTTGGGCACGACAAATAAACTTGGAGAAATATGTTCAACTTGTTTGTTAGACTAGTTCTTTTTTGCAGCCATGTCTCTATTAATTTGGTTTTCGTACATTCTTTGTTTGTTATGACATATATAAGTTTATGAACCATGTTCATACAATTTTAATTATATCTCTAAATCTTTTTATCCATATTACTTCGTTCCAAAATTGCCAAGGTGGCTGTTGTAGTGTTGTATTTATGTTGAGTGTAGGTGGTATATATGCTTGCTCCTTAGGATGGCAAGCATAACTTTTAGTTACGAAGAAAGAAGCTTATCTATCAATGGTTTCTTTGTTTAAACAGAATTAAATGAACTACTAAGTTTGAGGAAAAAAGATTTTGGTTTAAACTTGCA
GAAGAAAGTAAGGATGCATCCATGCATTTATGAGGATAATATTGATGATAACCATCACCTCTCTACAGAAAGGATCTGTTGCAGCAGAAGATCTAATGCTTTGTCCAAATCTGAATCGGTCGCATGTAATAATTCATGTCCACCCGTCCAAAGTAATCAGATTGGCAGTCTAACAACAGAAATTGGGAAGGGACTCAAGAATCAAGCACTCTTGAATGGTGATTTAGCATCTGAAATGGAAGACAATCAGGCAAATGAAGATTCTGTCGAGAAGCCTGTTCCTGTAGGTGCTTCATTTCAAGCAGTATTACCTGAATGGACTGGTAATATTTCCGATAGCGACTCTAAATGGTTAGGGACACGGTCGTGGCCTTCTCAACACGAAAATAACAAGTCTGTAAGCGATAGAAATCCCATCAGCAGAGGGAGACTGGATCCTTGTAGTTGCCAATTTCCAGGTTCGGTTGAATGTTATAGATTTCATATTGCAGAAGCAAGGATGAGATTAAAGCTCGAACTTGGTTTGACATTCTATGATTGGAGATTTCATCAAATGGGGGAGGAAATATCTCTGCAGTGGACTGCTGAAGAGGAAAATAGATTTAAAGAGTTGGCAATATCCAGTTTTAATAATCAAAATCAGTGCTTTTGGAACCATTCCTTGAAGTGGTTCCCAATGAAATCAAGGAAAAACTTGATAAGCTACTACTTCAACGTGTTTCTTTTACGGCAGAGAAGCTATCAGAATCGTGTTACTCCAAATGACATTGATAGCGATGGTGAAGATGTAGAGTTTGGTTGCATCAGTGGTGATTTTGGGGCTAAGGCAATGGAAGTTTTAGGCTCAAAATTTGTAGAATGTTCTGAAAATAAACAGTTCATAGGTATTTGAAGGCAACCGGCAGTTTGAAGGAAAGGAGAAAAGAGAAGAATAAGTGTTAAACTCGAACCAATGCCTGCATGAGCGTAATTTTTCTCAACATTCTGCTGTAGAATGTGAAGTCATCCAAAAAAGAGGGGAAATAAACCCAATTTTGAAGAGATGTTTTTAGCTACACAACACCATTTTTGTTTTGGGGGGGGGGGGG

mRNA sequence

ATGGGGAGATGGCCTATTTCATCCAATGATTCCATTCTAGATTGCAACAAAGATGTTGATCCTAATCCCAGTTATGGCTATTGCATTGCCCCGGATTGTTTGGTAGAAGGAAGTTGTGCTAATGTTGATCATGATGATTGCAAAGCCACGATTAGATGCTATTTTGAAAAAGTTCTTTGGGTTTTTCTAAAGGAAACGTGTCGTAGAGGATTTATTAGACCAGTGCCGGCGTTACTAGGTGAAGGGGAATCTTTGGATTTGTTTGAACTGTTCATGGTTGTTAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTTGTTGTGGAATTAGGTTTGGATCTTGGGCTTTCGGCTTCAGTGAAATTGATTTATTTCAAGTACTTAAGTGACCTTGAGAAATGGCTTATGGTGAGACGTGGAGGCACAAAATTGGAAAATGGGAACTCCGATTGTTATTACTACAGGAAAAACTTTCCATGTTTGGCAGAACTGGAGGCAAAGATTAAGGATATATTATATGGTGTGCTGAGACAAAAGAGCATATATGATGAACGATCTGGATTCAAATCCAACAAACCAAATGGGAACGTTAATGTTGCTGAAACTGCAGCAGAGAAGGAAATAAAATCCCCTAAAATAGAGAAGAAAGAACATGATCTTCACGAGGATGTCACACCAATTCAACAAAATTGTACTGAGACACCTCGGGACAATGGCAAAACCAATCAAATCCATGTTATTGGAGATTGTAGAAGTTCGGATGCTGTTAATGTTGAAACTGAAACAGACTCTCATGGGAGCTCTCGAGAATCTTTATTTCGAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGCAAATCCATCAAATGGTACAGTACCGGGGTCATCCAAGTGGAAAGCGTATGCTAGCGAGGATGCATTATGGCTTCAAGTTATCAAGGCAAAGGATGCTCTTTTAAATAGGAAGGATGTTGACAAAACCGCTGAGAAACGTCTGTTAATACAGAAGAAAGTAAGGATGCATCCATGCATTTATGAGGATAATATTGATGATAACCATCACCTCTCTACAGAAAGGATCTGTTGCAGCAGAAGATCTAATGCTTTGTCCAAATCTGAATCGGTCGCATGTAATAATTCATGTCCACCCGTCCAAAGTAATCAGATTGGCAGTCTAACAACAGAAATTGGGAAGGGACTCAAGAATCAAGCACTCTTGAATGGTGATTTAGCATCTGAAATGGAAGACAATCAGGCAAATGAAGATTCTGTCGAGAAGCCTGTTCCTGTAGGTGCTTCATTTCAAGCAGTATTACCTGAATGGACTGGTAATATTTCCGATAGCGACTCTAAATGGTTAGGGACACGGTCGTGGCCTTCTCAACACGAAAATAACAAGTCTGTAAGCGATAGAAATCCCATCAGCAGAGGGAGACTGGATCCTTGTAGTTGCCAATTTCCAGGTTCGGTTGAATGTTATAGATTTCATATTGCAGAAGCAAGGATGAGATTAAAGCTCGAACTTGGTTTGACATTCTATGATTGGAGATTTCATCAAATGGGGGAGGAAATATCTCTGCAGTGGACTGCTGAAGAGGAAAATAGATTTAAAGAGTTGGCAATATCCAGTTTTAATAATCAAAATCAGTGCTTTTGGAACCATTCCTTGAAGTGGTTCCCAATGAAATCAAGGAAAAACTTGATAAGCTACTACTTCAACGTGTTTCTTTTACGGCAGAGAAGCTATCAGAATCGTGTTACTCCAAATGACATTGATAGCGATGGTGAAGATGTAGAGTTTGGTTGCATCAGTGGTGATTTTGGGGCTAAGGCAATGGAAGTTTTAGGCTCAAAATTTGTAGAATGTTCTGAAAATAAACAGTTCATAGGTATTTGA

Coding sequence (CDS)

ATGGGGAGATGGCCTATTTCATCCAATGATTCCATTCTAGATTGCAACAAAGATGTTGATCCTAATCCCAGTTATGGCTATTGCATTGCCCCGGATTGTTTGGTAGAAGGAAGTTGTGCTAATGTTGATCATGATGATTGCAAAGCCACGATTAGATGCTATTTTGAAAAAGTTCTTTGGGTTTTTCTAAAGGAAACGTGTCGTAGAGGATTTATTAGACCAGTGCCGGCGTTACTAGGTGAAGGGGAATCTTTGGATTTGTTTGAACTGTTCATGGTTGTTAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTTGTTGTGGAATTAGGTTTGGATCTTGGGCTTTCGGCTTCAGTGAAATTGATTTATTTCAAGTACTTAAGTGACCTTGAGAAATGGCTTATGGTGAGACGTGGAGGCACAAAATTGGAAAATGGGAACTCCGATTGTTATTACTACAGGAAAAACTTTCCATGTTTGGCAGAACTGGAGGCAAAGATTAAGGATATATTATATGGTGTGCTGAGACAAAAGAGCATATATGATGAACGATCTGGATTCAAATCCAACAAACCAAATGGGAACGTTAATGTTGCTGAAACTGCAGCAGAGAAGGAAATAAAATCCCCTAAAATAGAGAAGAAAGAACATGATCTTCACGAGGATGTCACACCAATTCAACAAAATTGTACTGAGACACCTCGGGACAATGGCAAAACCAATCAAATCCATGTTATTGGAGATTGTAGAAGTTCGGATGCTGTTAATGTTGAAACTGAAACAGACTCTCATGGGAGCTCTCGAGAATCTTTATTTCGAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGCAAATCCATCAAATGGTACAGTACCGGGGTCATCCAAGTGGAAAGCGTATGCTAGCGAGGATGCATTATGGCTTCAAGTTATCAAGGCAAAGGATGCTCTTTTAAATAGGAAGGATGTTGACAAAACCGCTGAGAAACGTCTGTTAATACAGAAGAAAGTAAGGATGCATCCATGCATTTATGAGGATAATATTGATGATAACCATCACCTCTCTACAGAAAGGATCTGTTGCAGCAGAAGATCTAATGCTTTGTCCAAATCTGAATCGGTCGCATGTAATAATTCATGTCCACCCGTCCAAAGTAATCAGATTGGCAGTCTAACAACAGAAATTGGGAAGGGACTCAAGAATCAAGCACTCTTGAATGGTGATTTAGCATCTGAAATGGAAGACAATCAGGCAAATGAAGATTCTGTCGAGAAGCCTGTTCCTGTAGGTGCTTCATTTCAAGCAGTATTACCTGAATGGACTGGTAATATTTCCGATAGCGACTCTAAATGGTTAGGGACACGGTCGTGGCCTTCTCAACACGAAAATAACAAGTCTGTAAGCGATAGAAATCCCATCAGCAGAGGGAGACTGGATCCTTGTAGTTGCCAATTTCCAGGTTCGGTTGAATGTTATAGATTTCATATTGCAGAAGCAAGGATGAGATTAAAGCTCGAACTTGGTTTGACATTCTATGATTGGAGATTTCATCAAATGGGGGAGGAAATATCTCTGCAGTGGACTGCTGAAGAGGAAAATAGATTTAAAGAGTTGGCAATATCCAGTTTTAATAATCAAAATCAGTGCTTTTGGAACCATTCCTTGAAGTGGTTCCCAATGAAATCAAGGAAAAACTTGATAAGCTACTACTTCAACGTGTTTCTTTTACGGCAGAGAAGCTATCAGAATCGTGTTACTCCAAATGACATTGATAGCGATGGTGAAGATGTAGAGTTTGGTTGCATCAGTGGTGATTTTGGGGCTAAGGCAATGGAAGTTTTAGGCTCAAAATTTGTAGAATGTTCTGAAAATAAACAGTTCATAGGTATTTGA

Protein sequence

MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDLGLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI*
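
The protein sequence is the translation of the CDS listed above it. As a minimal sketch, assuming the standard nuclear genetic code (the record does not name a translation table), a CDS can be translated with a small codon lookup. The function name `translate` is illustrative; the two test strings are the first 24 and last 9 bases of the CDS.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence; stop codons are written as '*'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGGGGAGATGGCCTATTTCATCC"))  # start of the CDS
print(translate("GGTATTTGA"))                 # end of the CDS, including the stop
```

The first call yields `MGRWPISS` and the second `GI*`, matching the start and end of the protein sequence, including the terminal `*` that marks the stop codon.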
BLAST of Csa4G047920 vs. Swiss-Prot
Match: ARID2_ARATH (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana GN=ARID2 PE=2 SV=1)

HSP 1 Score: 325.1 bits (832), Expect = 1.7e-87
Identity = 218/620 (35.16%), Positives = 319/620 (51.45%), Query Frame = 1

Query: 38  SCANVDH-----DDCKATIRCYFEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFM 97
           SC+ VD      D+C+  +R  F++ L VFL+E    G I+P+PA++G+G+++DLF+LF+
Sbjct: 8   SCSYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDGKNVDLFKLFV 67

Query: 98  VVRDKGGYQVVSEKELWSSVVVELGLDLGLSASVKLIYFKYLSDLEKWLMVRRGGTKLEN 157
           +VR++ G+  VS K LW  V  +LG D  L  S+ LIY KYL+ +EKW +        +N
Sbjct: 68  LVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDN 127

Query: 158 GNSD---CYYYRKNFPCLAELEAKIKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETA 217
            +S+   CY                            +++  +GFKS   NG        
Sbjct: 128 KDSEKKGCY-------------------------SGMLHELGNGFKSLLDNGK------- 187

Query: 218 AEKEIKSPKIEKKEHDLHEDVTPIQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETET- 277
                     +K+   +      ++++C+E  R   +  +           +V +  ET 
Sbjct: 188 ---------CQKRNRAVAFGCNHMEESCSEFDRSRKRFRESDDDDKGVGLSSVVIREETV 247

Query: 278 ---------DSHGSSRESLFRMLKWVRKTAKHPANPSNGTVPGSSKWKAYASEDALWLQV 337
                    D     R+ L  MLKW+   A  P +P+ G +P SSKWK Y + +  WLQV
Sbjct: 248 VCAVEEGLSDFSLEKRDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQY-NGNKCWLQV 307

Query: 338 IKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDNHHLSTERICCSRRSNALSK 397
            +AK++LL ++D    AE R       R H  I+  ++ ++   S  R+  S R   LSK
Sbjct: 308 ARAKNSLLVQRD---NAELRYRYH-PFRGHQNIHHPSMYEDDRKSIGRLRYSIRPPNLSK 367

Query: 398 SESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLASEMEDNQANEDSV-EKPVPV 457
             S +C N    V  ++  S +T+  K     +   G  A      + N+  +  + + V
Sbjct: 368 HCSSSCCNGSSLVSLSK--SRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKV 427

Query: 458 GASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSD---RNPISRGRLDPCSCQFP 517
           G   QA + EWT +  DSDSKWLGTR WP   EN++++      + + +GR D CSC+  
Sbjct: 428 GHQHQAQVDEWTESGVDSDSKWLGTRIWPP--ENSEALDQTLGNDLVGKGRPDSCSCELS 487

Query: 518 GSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEEENRFKELAISSFNNQ 577
           G VEC R HIAE RM LK ELG  F+ WRF+QMGEE+ L+WT EEE RFK++ I+     
Sbjct: 488 GFVECTRLHIAEKRMELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA----D 547

Query: 578 NQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGEDVEFGCISGDF 636
            Q FW ++ K FP K R+ L+SYYFNVFL+ +R YQNRVTP  IDSD E   FG + G F
Sbjct: 548 PQSFWTNAAKNFPKKKREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGA-FGSVGGSF 569
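
The percentages on each HSP summary line follow from the counts next to them: matched positions divided by alignment length. A minimal sketch that recomputes them from a summary line is given below. The function name `hsp_percentages` is hypothetical, and the pattern is written to accept both "Positives" and the "Postives" spelling that appears in some reports.

```python
import re

def hsp_percentages(stat_line):
    """Recompute identity/positives percentages from a BLAST HSP summary line."""
    result = {}
    # 'Posi?tives' accepts both the correct and the misspelled label.
    for label, matched, total in re.findall(
        r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)", stat_line
    ):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(total), 2)
    return result

line = "Identity = 218/620 (35.16%), Positives = 319/620 (51.45%), Query Frame = 1"
print(hsp_percentages(line))
```

For the ARID2_ARATH hit this reproduces the reported values: 218/620 gives 35.16% identity and 319/620 gives 51.45% positives.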

BLAST of Csa4G047920 vs. Swiss-Prot
Match: ARID1_ARATH (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana GN=ARID1 PE=2 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 1.3e-37
Identity = 83/182 (45.60%), Positives = 112/182 (61.54%), Query Frame = 1

Query: 428 NEDSVEKPVP-VGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKS--VSDRNPISR 487
           + D  ++P   VG+ FQA +PEWTG   +SDSKWLGTR WP   E  K+  + +R+ I +
Sbjct: 351 SSDEEDRPCALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANLLIERDRIGK 410

Query: 488 GRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEEENRF 547
           GR DPC C  PGS+EC +FHI   R +LKLELG  FY W F  MGE     WT  E  + 
Sbjct: 411 GRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQYWTDLELKKI 470

Query: 548 KELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGE 607
           K L +SS  + +  F + +    P KSR  ++SY++NV LL+ R+ Q+R+TP+DIDSD +
Sbjct: 471 KSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRITPHDIDSDTD 530


HSP 2 Score: 120.2 bits (300), Expect = 8.4e-26
Identity = 104/318 (32.70%), Positives = 143/318 (44.97%), Query Frame = 1

Query: 55  FEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVV 114
           F  +L  FL E C      P+PA+ GEG ++DLF LF+ V  KGG+  VSE   W  VV 
Sbjct: 49  FRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQ 108

Query: 115 ELGLDLGLSASVKLIYFKYLSDLEKWL-MVRRGGTKLEN----GNSDCYYYRKNFPCLAE 174
           E GL+   SAS KLIY KYL    +WL  V  G T + +    G SD    R N   L+E
Sbjct: 109 ESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVELSGISDALVARLN-GFLSE 168

Query: 175 LEAKIKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHED 234
           ++ K              Y+ R G  +          E  AE +    K  K+ +D H  
Sbjct: 169 VKKK--------------YELRKGRPAK---------ELGAELKWFISKT-KRRYDKHHV 228

Query: 235 VTPIQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGS----SRESLFRMLKWV 294
                 N      D  K  Q   + + R    + +E+ T    S     RE     LKW+
Sbjct: 229 GKESASN------DAVKEFQGSKLAERRLEQIMILESVTQECSSPGKRKRECPLETLKWL 288

Query: 295 RKTAKHPANPSNGTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKK 354
              AK P +PS G VP  S+W +Y SE+  W Q++  +    +R + D   EK    QK 
Sbjct: 289 SDVAKDPCDPSLGIVPDRSEWVSYGSEEP-WKQLLLFR---ASRTNNDSACEKTW--QKV 329

Query: 355 VRMHPCIYEDNIDDNHHL 364
            +MHPC+Y+D+   +++L
Sbjct: 349 QKMHPCLYDDSAGASYNL 329

BLAST of Csa4G047920 vs. TrEMBL
Match: A0A0A0KZM1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047920 PE=4 SV=1)

HSP 1 Score: 1323.1 bits (3423), Expect = 0.0e+00
Identity = 639/639 (100.00%), Positives = 639/639 (100.00%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60
           MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60

Query: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120
           VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL
Sbjct: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180
           GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG
Sbjct: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180

Query: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240
           VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET
Sbjct: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300
           PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV
Sbjct: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300

Query: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN 360
           PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN
Sbjct: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN 360

Query: 361 HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS 420
           HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS
Sbjct: 361 HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS 420

Query: 421 EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN 480
           EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN
Sbjct: 421 EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN 480

Query: 481 PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE 540
           PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE
Sbjct: 481 PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE 540

Query: 541 ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID 600
           ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID
Sbjct: 541 ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID 600

Query: 601 SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 640
           SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI
Sbjct: 601 SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 639

BLAST of Csa4G047920 vs. TrEMBL
Match: M5W8M5_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026661mg PE=4 SV=1)

HSP 1 Score: 468.0 bits (1203), Expect = 1.8e-128
Identity = 281/638 (44.04%), Positives = 373/638 (58.46%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60
           M  W   +  S+LDC +  D     G CI  D  V       D DD +  +RC F++VL 
Sbjct: 1   MAGWSSLTPGSVLDCVETNDAYQKNGSCIGSDIDVRDG-VECDEDDDEVRLRCTFDQVLS 60

Query: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120
           VF+KE   RG +RP+PA++ + + +DLF+LF +VRD+GGY  VS+  LWS V  ELGLD 
Sbjct: 61  VFVKEIGDRGVVRPIPAVIDDRQPVDLFKLFCLVRDRGGYDWVSKNSLWSFVAKELGLDG 120

Query: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180
           G +ASVKLIYFKYL++LEKW           NG S  Y     F   +ELE + +D+L  
Sbjct: 121 GATASVKLIYFKYLNELEKWFRESCKSRSSGNGQSGLY---GEFQLSSELEREFRDLLLD 180

Query: 181 VLRQKSIYDERSGFKSNKPNGNV--NVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCT 240
              QK   D    F+S++ NG +  N+++T     + +   + K+ D  E V    QN  
Sbjct: 181 GPEQKGKGDGPVQFESDE-NGKIEFNLSDTKDAYGMHAGADQCKDDD-EEKVCNDDQN-- 240

Query: 241 ETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNG 300
                          G   S D++N + E D     RESL  ML WV + AK P +PS G
Sbjct: 241 ---------------GVLISLDSLN-KKENDRK-RKRESLSGMLNWVVQIAKQPNDPSIG 300

Query: 301 TVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNID 360
            +PG + W+ +  ++  W QVI+A++ALL R++VD   E+ LL QKK++ HP +YEDN+ 
Sbjct: 301 VIPGPTNWREHKGDEC-WFQVIRAREALLLRRNVDSKTEESLL-QKKLKTHPLLYEDNVV 360

Query: 361 DNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDL 420
             H  S+ER+ CS R     KS S  C +SC   QSN I     E+    K QA    DL
Sbjct: 361 AGHQ-SSERLRCSERFPNSVKSRSCPCCSSCSVPQSNLISPRKKELDNNSKEQAPEEVDL 420

Query: 421 ASEMEDNQANEDSV-EKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVS 480
            +       + D+  EK V VG  FQA +PEWTG  S+SD KWLGTR WP Q E + S+ 
Sbjct: 421 LATNTMVCPSVDAPHEKHVSVGTLFQADVPEWTGVASESDIKWLGTRVWPLQCEEDSSLH 480

Query: 481 DRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540
           + +   +GR D C CQ PGSV C RFHIAEARM+LK ELG  FY WRF +MGEE+SLQWT
Sbjct: 481 EADLTGKGRPDLCGCQLPGSVVCIRFHIAEARMKLKRELGSLFYRWRFDRMGEEVSLQWT 540

Query: 541 AEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600
           AEEE RFK+L  S+    +  FWN + +WF  K+R+NL+SYYFNVFL++ RSYQNRVTP 
Sbjct: 541 AEEEKRFKDLVKSN----SPSFWNRASRWFRKKTRENLVSYYFNVFLVQSRSYQNRVTPK 600

Query: 601 DIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQ 636
           +IDSD ++ EFG  S  F   A+EV  + F  CS+N+Q
Sbjct: 601 NIDSDDDETEFGSFSNGFRHDAVEV-SANFEACSQNQQ 605

BLAST of Csa4G047920 vs. TrEMBL
Match: B9RCE7_RICCO (DNA binding protein, putative OS=Ricinus communis GN=RCOM_1687390 PE=4 SV=1)

HSP 1 Score: 439.9 bits (1130), Expect = 5.4e-120
Identity = 273/676 (40.38%), Positives = 381/676 (56.36%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCN------KDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCY 60
           M  W + +N S LDC       KD    P   + +     VE S      DD +  +RC 
Sbjct: 1   MAGWSMLTNGSSLDCADVTSGFKDSTCRPDVNHAVKDHNAVEES-----DDDHEVKLRCL 60

Query: 61  FEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVV 120
           F++VL VF  E   RG  RP+PALLG G+SLDLF+LF VVR +GG+ +V+    WS VV 
Sbjct: 61  FDQVLSVFANEAAARGSFRPIPALLGGGKSLDLFKLFRVVRKRGGFDLVN--GFWSFVVK 120

Query: 121 ELGLDLGLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLA-ELEAK 180
           ELGLDL  SASVKL+YFKYL +LE+WL       +L  GN  C    K F CL+ ELEA+
Sbjct: 121 ELGLDLAASASVKLVYFKYLYELERWLRGSNSSRRL--GNGQCRPGGK-FNCLSMELEAE 180

Query: 181 IKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPI 240
            + +    L   S   +   +K    N  +NV ++    +I  P   K  H      T  
Sbjct: 181 FRKL----LSNGSKKGKDGKYKKKSKNFGINVVKS----KIGLPD-TKDVHSAGSRHTDD 240

Query: 241 QQNCTE-TPRDNGKTNQI-------------HVIGDCRSSDAVNVETETDSHGS------ 300
            +N  E T +  GK+  I             H++   R     N +  TD+ G       
Sbjct: 241 DENFQEYTGKCKGKSADICAHPPPTPALAEEHLV---RRVKPYNEKCSTDNDGDDVVILD 300

Query: 301 -------------SRESLFRMLKWVRKTAKHPANPSNGTVPGSSKWKAYASEDALWLQVI 360
                         RESL RML WV + AK P +PS G +P  SK K     + LW Q I
Sbjct: 301 PSIGEKLFSPRKRKRESLSRMLNWVIQAAKSPDHPSIGNIPPLSKCKDNKGNE-LWAQAI 360

Query: 361 KAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDNHHLSTERICCSRRSNALSKS 420
           +A+DAL+ R+ V+   E R L+Q   ++HP +Y+D I  +   S+ER+ CS R  AL K 
Sbjct: 361 RARDALVRRRQVNSGCE-RSLLQNHQKIHPSMYKDAIPPSDP-SSERVRCSERLPALVKP 420

Query: 421 ESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLASEMEDNQANED-SVEKPVPVG 480
            S +C NSC   +S  I    TE+    K + L+  DL++      ++ D  + + V VG
Sbjct: 421 RSCSCCNSCSAPKSQLISPPKTELENAPKAKVLMAEDLSAATATLSSSGDIHIHRHVSVG 480

Query: 481 ASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRNPISRGRLDPCSCQFPGSVE 540
             FQA +PEWTG +S+S+SKWLGT++WP +   + ++   + I +GR + C C+ PGSVE
Sbjct: 481 RRFQAEVPEWTGLVSESESKWLGTQAWPLEFGEHNAMVQEDTIGKGRPESCGCELPGSVE 540

Query: 541 CYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEEENRFKELAISSFNNQNQCF 600
           C RFHIAE R++LK+ELG  FY W+F  MGEEI+L+WTAEEE RFK++   +  + ++ F
Sbjct: 541 CVRFHIAENRIKLKIELGSVFYHWKFDCMGEEIALRWTAEEEKRFKDVVRFNLPSLDKFF 600

Query: 601 WNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGEDVEFGCISGDFGAKA 636
           W++S K+F  K+++ L+SYYFNVFL+++RSYQNRVTP  IDSD ++ EFG +S  +G +A
Sbjct: 601 WDNSRKYFRRKTKEELVSYYFNVFLVQRRSYQNRVTPKHIDSDDDESEFGSLSDTYGQQA 651

BLAST of Csa4G047920 vs. TrEMBL
Match: V4THD3_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019313mg PE=4 SV=1)

HSP 1 Score: 435.6 bits (1119), Expect = 1.0e-118
Identity = 265/645 (41.09%), Positives = 363/645 (56.28%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSY-GYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVL 60
           M  W I +N S LDC K +    S  G C   D  ++   +  D    +  ++C F+KVL
Sbjct: 1   MAGWSILTNGSALDCGKTIGSVQSNDGCCPEADNHMKDDDSVEDSGGYEDELKCLFDKVL 60

Query: 61  WVFLKE-TCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGL 120
              LKE + R+G IRP+PA+LG+G SLDLF+LF  VR++GG+ +VS+  LW  V+ +LGL
Sbjct: 61  ETVLKEGSDRKGSIRPIPAMLGDGRSLDLFKLFCAVRERGGFCMVSKNGLWGFVLEDLGL 120

Query: 121 DLGLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDIL 180
           D G+SASVKL+Y +YL +LEKWLM   G + L  GN  C +   +     E+E + + +L
Sbjct: 121 DFGVSASVKLVYARYLGELEKWLM---GTSGLSLGNGGCGFGGNSGLLPLEIETRFRGLL 180

Query: 181 YGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDL------HEDVTP 240
               ++K I D+R      K NGN    E           IEK E DL      HE    
Sbjct: 181 MNWSKKK-IKDDRLALLEYKKNGNHVDME-----------IEKTELDLLDTKNRHERCKC 240

Query: 241 IQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHP 300
           + + C+    DN + N  +    C    ++  + E       RESL  ML WV + AK+P
Sbjct: 241 LGKKCS----DNNRKNYDNDDKLCNDDPSIT-QKEYCYRKRKRESLSGMLNWVIQIAKYP 300

Query: 301 ANPSNGTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCI 360
            +P  G +P  SKWK    ++ LWLQ I+A+DALL RK V+    + L  Q   +MHP +
Sbjct: 301 DDPLIGVIPEPSKWKNNEDKE-LWLQAIRARDALLQRKCVNSNIHQSLF-QNGQKMHPSM 360

Query: 361 YEDNIDDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQA 420
           YED + +  H STER+  S R   + KS   +C +SC    +        E+  G K + 
Sbjct: 361 YED-VTNQRHWSTERLRSSERLPTIMKSRVCSCCSSCSATDNKLTSPHNAELETGPKGKT 420

Query: 421 LLN-GDLASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHE 480
            +     A  +    + ++  EK V VG  FQA +PEWTG + +SDSKWLGTR  P    
Sbjct: 421 PMTVTSSAMNIAVRSSGDEPQEKHVSVGPLFQASVPEWTGVVLESDSKWLGTRICPLVDG 480

Query: 481 NNKSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEE 540
            + SV + NP  RGR D C C+ PGSVEC RFHIAE RM+LKLELG  F+ WRF +MGEE
Sbjct: 481 EHNSVVEMNPCGRGRQDSCGCRLPGSVECIRFHIAENRMKLKLELGPVFFHWRFDRMGEE 540

Query: 541 ISLQWTAEEENRFKELAISSFNN-QNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSY 600
           +SL WT EEE RF+++ I  FN   +  FW  + K F  K R++ +SYYFNVFL+ +RSY
Sbjct: 541 VSLGWTVEEEKRFRDMVI--FNRFLSAGFWGSACKSFLGKKREDFVSYYFNVFLVSRRSY 600

Query: 601 QNRVTPNDIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQ 636
           QN VTP DI+SD ++ EFG +S  FG  A+ V G   + C++N Q
Sbjct: 601 QNHVTPRDINSDDDESEFGSVSDSFGNAAVTVHGFDKLTCAQNNQ 620

BLAST of Csa4G047920 vs. TrEMBL
Match: A0A067G9K4_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006921mg PE=4 SV=1)

HSP 1 Score: 435.3 bits (1118), Expect = 1.3e-118
Identity = 264/645 (40.93%), Positives = 362/645 (56.12%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSY-GYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVL 60
           M  W I +N S LDC K +    S  G C   D  ++   +  D    +  ++C F+KVL
Sbjct: 1   MAGWSILTNGSALDCGKTIGSVQSNDGCCPEADNYMKDDDSVEDSGGYEDELKCLFDKVL 60

Query: 61  WVFLKE-TCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGL 120
              LKE + R+G IRP+PA+LG+G SLDLF+LF  VR++GG+ +VS+  LW  V+ +LGL
Sbjct: 61  ETVLKEGSDRKGSIRPIPAMLGDGRSLDLFKLFCAVRERGGFCMVSKNGLWGFVLEDLGL 120

Query: 121 DLGLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDIL 180
           D G+SASVKL+Y +YL +LEKWLM   G + L  GN  C +   +     E+E + + +L
Sbjct: 121 DFGVSASVKLVYARYLGELEKWLM---GTSGLSLGNGGCGFGGNSGLLPLEIETRFRGLL 180

Query: 181 YGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDL------HEDVTP 240
               ++K I D+R      K NGN    E           IEK E DL      HE    
Sbjct: 181 MNWSKKK-IKDDRLALLEYKKNGNHVDME-----------IEKTELDLLDTKNRHERCKC 240

Query: 241 IQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHP 300
           + + C+    DN + N  +    C    ++  + E       RESL  ML WV + AK+P
Sbjct: 241 LGKKCS----DNNRKNYDNDDKLCNDDPSIT-QKEYCYRKRKRESLSGMLNWVIQIAKYP 300

Query: 301 ANPSNGTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCI 360
            +P  G +P  SKWK    ++ LWL  I+A+DALL RK V+    + L  Q   +MHP +
Sbjct: 301 DDPLIGVIPEPSKWKNNEDKE-LWLHAIRARDALLQRKHVNSNIHQSLF-QNGQKMHPSM 360

Query: 361 YEDNIDDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQA 420
           YED + +  H STER+  S R   + KS   +C +SC    +        E+  G K + 
Sbjct: 361 YED-VTNQRHWSTERLRSSERLPTIMKSRVCSCCSSCSATDNKLTSPHNAELETGPKGKT 420

Query: 421 LLN-GDLASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHE 480
            +     A  +    + ++  EK V VG  FQA +PEWTG + +SDSKWLGTR  P    
Sbjct: 421 PMTVTSSAMNIAVRSSGDEPQEKHVSVGPLFQASVPEWTGVVLESDSKWLGTRICPLVDG 480

Query: 481 NNKSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEE 540
            + SV + NP  RGR D C C+ PGSVEC RFHIAE RM+LKLELG  F+ WRF +MGEE
Sbjct: 481 EHNSVVEMNPCGRGRQDSCGCRLPGSVECIRFHIAENRMKLKLELGPVFFHWRFDRMGEE 540

Query: 541 ISLQWTAEEENRFKELAISSFNN-QNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSY 600
           +SL WT EEE RF+++ I  FN   +  FW  + K F  K R++ +SYYFNVFL+ +RSY
Sbjct: 541 VSLGWTVEEEKRFRDMVI--FNRFLSAGFWGSACKSFLGKKREDFVSYYFNVFLVSRRSY 600

Query: 601 QNRVTPNDIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQ 636
           QN VTP DI+SD ++ EFG +S  FG  A+ V G   + C++N Q
Sbjct: 601 QNHVTPRDINSDDDESEFGSVSDSFGNAAVTVHGFDKLTCAQNNQ 620

BLAST of Csa4G047920 vs. TAIR10
Match: AT4G11400.1 (AT4G11400.1 ARID/BRIGHT DNA-binding domain;ELM2 domain protein)

HSP 1 Score: 325.1 bits (832), Expect = 9.7e-89
Identity = 218/620 (35.16%), Positives = 319/620 (51.45%), Query Frame = 1

Query: 38  SCANVDH-----DDCKATIRCYFEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFM 97
           SC+ VD      D+C+  +R  F++ L VFL+E    G I+P+PA++G+G+++DLF+LF+
Sbjct: 8   SCSYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDGKNVDLFKLFV 67

Query: 98  VVRDKGGYQVVSEKELWSSVVVELGLDLGLSASVKLIYFKYLSDLEKWLMVRRGGTKLEN 157
           +VR++ G+  VS K LW  V  +LG D  L  S+ LIY KYL+ +EKW +        +N
Sbjct: 68  LVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDN 127

Query: 158 GNSD---CYYYRKNFPCLAELEAKIKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETA 217
            +S+   CY                            +++  +GFKS   NG        
Sbjct: 128 KDSEKKGCY-------------------------SGMLHELGNGFKSLLDNGK------- 187

Query: 218 AEKEIKSPKIEKKEHDLHEDVTPIQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETET- 277
                     +K+   +      ++++C+E  R   +  +           +V +  ET 
Sbjct: 188 ---------CQKRNRAVAFGCNHMEESCSEFDRSRKRFRESDDDDKGVGLSSVVIREETV 247

Query: 278 ---------DSHGSSRESLFRMLKWVRKTAKHPANPSNGTVPGSSKWKAYASEDALWLQV 337
                    D     R+ L  MLKW+   A  P +P+ G +P SSKWK Y + +  WLQV
Sbjct: 248 VCAVEEGLSDFSLEKRDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQY-NGNKCWLQV 307

Query: 338 IKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDNHHLSTERICCSRRSNALSK 397
            +AK++LL ++D    AE R       R H  I+  ++ ++   S  R+  S R   LSK
Sbjct: 308 ARAKNSLLVQRD---NAELRYRYH-PFRGHQNIHHPSMYEDDRKSIGRLRYSIRPPNLSK 367

Query: 398 SESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLASEMEDNQANEDSV-EKPVPV 457
             S +C N    V  ++  S +T+  K     +   G  A      + N+  +  + + V
Sbjct: 368 HCSSSCCNGSSLVSLSK--SRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKV 427

Query: 458 GASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSD---RNPISRGRLDPCSCQFP 517
           G   QA + EWT +  DSDSKWLGTR WP   EN++++      + + +GR D CSC+  
Sbjct: 428 GHQHQAQVDEWTESGVDSDSKWLGTRIWPP--ENSEALDQTLGNDLVGKGRPDSCSCELS 487

Query: 518 GSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEEENRFKELAISSFNNQ 577
           G VEC R HIAE RM LK ELG  F+ WRF+QMGEE+ L+WT EEE RFK++ I+     
Sbjct: 488 GFVECTRLHIAEKRMELKRELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA----D 547

Query: 578 NQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGEDVEFGCISGDF 636
            Q FW ++ K FP K R+ L+SYYFNVFL+ +R YQNRVTP  IDSD E   FG + G F
Sbjct: 548 PQSFWTNAAKNFPKKKREELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGA-FGSVGGSF 569
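
The AT4G11400.1 alignment has the same bit score (325.1) as the ARID2_ARATH hit from the Swiss-Prot search, yet a smaller E-value. One plausible reading, assuming standard Karlin-Altschul statistics where E = m·n·2^(−S′) for bit score S′ and effective search space m·n, is that the two databases differ in size: at a fixed query and bit score, the ratio of E-values approximates the ratio of effective search spaces. The sketch below computes that ratio for the three HSPs reported against both databases. The variable names are illustrative, and the result is only approximate because the reported scores and E-values are rounded.

```python
# (bit score) -> (E-value vs. Swiss-Prot, E-value vs. TAIR10), as reported in this record.
paired_evalues = {
    325.1: (1.7e-87, 9.7e-89),
    159.5: (1.3e-37, 7.1e-39),
    120.2: (8.4e-26, 4.8e-27),
}

# Assumption: E = m * n * 2**(-S'), so at equal S' the E-value ratio
# estimates the ratio of effective search spaces (Swiss-Prot / TAIR10).
ratios = {bits: sp / tair for bits, (sp, tair) in paired_evalues.items()}

for bits, ratio in ratios.items():
    print(f"{bits} bits: E-value ratio ~ {ratio:.1f}")
```

All three ratios come out between roughly 17 and 19, which is consistent with a single size factor separating the two searches.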

BLAST of Csa4G047920 vs. TAIR10
Match: AT2G46040.1 (AT2G46040.1 ARID/BRIGHT DNA-binding domain;ELM2 domain protein)

HSP 1 Score: 159.5 bits (402), Expect = 7.1e-39
Identity = 83/182 (45.60%), Positives = 112/182 (61.54%), Query Frame = 1

Query: 428 NEDSVEKPVP-VGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKS--VSDRNPISR 487
           + D  ++P   VG+ FQA +PEWTG   +SDSKWLGTR WP   E  K+  + +R+ I +
Sbjct: 351 SSDEEDRPCALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANLLIERDRIGK 410

Query: 488 GRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEEENRF 547
           GR DPC C  PGS+EC +FHI   R +LKLELG  FY W F  MGE     WT  E  + 
Sbjct: 411 GRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQYWTDLELKKI 470

Query: 548 KELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGE 607
           K L +SS  + +  F + +    P KSR  ++SY++NV LL+ R+ Q+R+TP+DIDSD +
Sbjct: 471 KSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRITPHDIDSDTD 530


HSP 2 Score: 120.2 bits (300), Expect = 4.8e-27
Identity = 104/318 (32.70%), Positives = 143/318 (44.97%), Query Frame = 1

Query: 55  FEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVV 114
           F  +L  FL E C      P+PA+ GEG ++DLF LF+ V  KGG+  VSE   W  VV 
Sbjct: 49  FRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQ 108

Query: 115 ELGLDLGLSASVKLIYFKYLSDLEKWL-MVRRGGTKLEN----GNSDCYYYRKNFPCLAE 174
           E GL+   SAS KLIY KYL    +WL  V  G T + +    G SD    R N   L+E
Sbjct: 109 ESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVELSGISDALVARLN-GFLSE 168

Query: 175 LEAKIKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHED 234
           ++ K              Y+ R G  +          E  AE +    K  K+ +D H  
Sbjct: 169 VKKK--------------YELRKGRPAK---------ELGAELKWFISKT-KRRYDKHHV 228

Query: 235 VTPIQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGS----SRESLFRMLKWV 294
                 N      D  K  Q   + + R    + +E+ T    S     RE     LKW+
Sbjct: 229 GKESASN------DAVKEFQGSKLAERRLEQIMILESVTQECSSPGKRKRECPLETLKWL 288

Query: 295 RKTAKHPANPSNGTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKK 354
              AK P +PS G VP  S+W +Y SE+  W Q++  +    +R + D   EK    QK 
Sbjct: 289 SDVAKDPCDPSLGIVPDRSEWVSYGSEEP-WKQLLLFR---ASRTNNDSACEKTW--QKV 329

Query: 355 VRMHPCIYEDNIDDNHHL 364
            +MHPC+Y+D+   +++L
Sbjct: 349 QKMHPCLYDDSAGASYNL 329

BLAST of Csa4G047920 vs. TAIR10
Match: AT5G04110.1 (AT5G04110.1 DNA GYRASE B3)

HSP 1 Score: 127.5 bits (319), Expect = 3.0e-29
Identity = 70/180 (38.89%), Positives = 100/180 (55.56%), Query Frame = 1

Query: 436 VPVGASFQAVLPEWT---------GNISDSDS-KWLGTRSWPSQHENNKSVSDRNPISRG 495
           +P+G  FQA +P W          G+  DS++ +WLGT  WP+ +   K+V  +  +  G
Sbjct: 361 IPIGPRFQAEIPVWIAPTKKGKFYGSPGDSNTLRWLGTGVWPT-YSLKKTVHSKK-VGEG 420

Query: 496 RLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQ-WTAEEENRF 555
           R D CSC  P S  C + H  EA+  L+ E+   F  W F QMGEEI L+ WTA+EE RF
Sbjct: 421 RSDSCSCASPRSTNCIKRHKKEAQELLEKEINRAFSTWEFDQMGEEIVLKSWTAKEERRF 480

Query: 556 KELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDIDSDGE 605
           + L   +  + +  FW  +   FP KS+K+L+SYY+NVFL+++         N+IDSD +
Sbjct: 481 EALVKKNPLSSSDGFWEFASNAFPQKSKKDLLSYYYNVFLIKRMRLLKSSAANNIDSDDD 538

BLAST of Csa4G047920 vs. TAIR10
Match: AT1G26580.1 (AT1G26580.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 101.7 bits (252), Expect = 1.8e-21
Identity = 73/211 (34.60%), Positives = 101/211 (47.87%), Query Frame = 1

Query: 429 EDSVEKPVPVGASFQAVLPEW----TGNISDSD-------------SKWLGTRSWPSQHE 488
           +   +K VP+G   QA +PEW    TGNI  S               K  GT   P    
Sbjct: 128 DQRAKKQVPIGPGHQAEIPEWEGSQTGNIETSGMSVQNHISGCADGEKLFGTSVIPMPGL 187

Query: 489 NNKSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGL-TFYDWRFHQMGE 548
              +  D + + +GR   C C+   SV C   HI EAR  L    G  TF +    +MGE
Sbjct: 188 TTVAHID-DIVGKGRKF-CVCRDRDSVRCVCQHIKEAREELVKTFGNETFKELGLCEMGE 247

Query: 549 EISLQWTAEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSY 608
           + +L+W+ E+   F E+  S+     Q FW H    F  +++K ++S+YFNVF+LR+R+ 
Sbjct: 248 KGALKWSDEDAQLFHEVVYSNPVTLGQNFWRHLEAAFCSRTQKEIVSFYFNVFVLRRRAI 307

Query: 609 QNRVTPNDIDSDGEDVEFGCISGDFGAKAME 622
           QNR    DIDSD +D   GC  G  G + +E
Sbjct: 308 QNRAFILDIDSD-DDEWHGCYGGSSGTRYVE 335

BLAST of Csa4G047920 vs. TAIR10
Match: AT1G13880.1 (AT1G13880.1 ELM2 domain-containing protein)

HSP 1 Score: 98.2 bits (243), Expect = 1.9e-20
Identity = 66/210 (31.43%), Positives = 106/210 (50.48%), Query Frame = 1

Query: 418 LASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGN-ISDSDSKWLGTRSWPSQHENNKSV 477
           + S  ED      S  K VP+G+ +QA +PE      +D   + +G   +  +    K V
Sbjct: 110 VCSGEEDGYWCPISPRKTVPIGSDYQADIPECVKEEANDQSGQGVG---YDEEQVTGKCV 169

Query: 478 -------SDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLT-FYDWRFHQM 537
                  ++   I +GR + C C   GS+ C + HI E R  L   +G     D    +M
Sbjct: 170 IPMPDCETEVCKIGKGRKE-CICLDKGSIRCVQQHIMENREDLFATIGYDRCLDIGLCEM 229

Query: 538 GEEISLQWTAEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQR 597
           GEE++ + T +EE+ F E+  S+  + ++ FW H    FP ++ K ++SYYFNVF+LR+R
Sbjct: 230 GEEVAARLTEDEEDLFHEIVYSNPVSMDRDFWKHLKSAFPSRTMKEIVSYYFNVFILRRR 289

Query: 598 SYQNRVTPNDIDSDGEDVEFGCISGDFGAK 619
           + QNR    D+DSD ++ +    +  +GA+
Sbjct: 290 AIQNRSKSLDVDSDDDEWQVEYDNTFYGAE 315

BLAST of Csa4G047920 vs. NCBI nr
Match: gi|778690826|ref|XP_004146560.2| (PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis sativus])

HSP 1 Score: 1323.1 bits (3423), Expect = 0.0e+00
Identity = 639/639 (100.00%), Positives = 639/639 (100.00%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60
           MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60

Query: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120
           VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL
Sbjct: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180
           GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG
Sbjct: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180

Query: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240
           VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET
Sbjct: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300
           PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV
Sbjct: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300

Query: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN 360
           PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN
Sbjct: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN 360

Query: 361 HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS 420
           HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS
Sbjct: 361 HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS 420

Query: 421 EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN 480
           EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN
Sbjct: 421 EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN 480

Query: 481 PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE 540
           PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE
Sbjct: 481 PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE 540

Query: 541 ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID 600
           ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID
Sbjct: 541 ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID 600

Query: 601 SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 640
           SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI
Sbjct: 601 SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 639

BLAST of Csa4G047920 vs. NCBI nr
Match: gi|659102274|ref|XP_008452043.1| (PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo])

HSP 1 Score: 1247.3 bits (3226), Expect = 0.0e+00
Identity = 605/639 (94.68%), Positives = 616/639 (96.40%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60
           MGRWPISSNDSILDCNKDVDPNPS GYCIAPDCLVEGS ANVDHDDCKATIRCYFEK+LW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSNGYCIAPDCLVEGSRANVDHDDCKATIRCYFEKILW 60

Query: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120
           VFLKE CRRGFIRPVPALLGEG SLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL
Sbjct: 61  VFLKEICRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180
           GLSASVKLIYFKYLS+LEKWLMVRRGGTKLENGNSD YYYRK+FPCLAELEAKIKD+LYG
Sbjct: 121 GLSASVKLIYFKYLSELEKWLMVRRGGTKLENGNSDYYYYRKSFPCLAELEAKIKDMLYG 180

Query: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240
           VLRQKSIYDER GFKSNKPNGNVNVAETAAEKEIK PKIEKKEHDLHEDVTPIQQNCTET
Sbjct: 181 VLRQKSIYDERPGFKSNKPNGNVNVAETAAEKEIKFPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300
           PR NG+TNQIHVIGDCRS DAVNVETETDSHG SRESL RMLKWVRKTAKHPANPSNGTV
Sbjct: 241 PRVNGETNQIHVIGDCRSLDAVNVETETDSHGRSRESLLRMLKWVRKTAKHPANPSNGTV 300

Query: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN 360
           P SSKWKAYAS+DALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN
Sbjct: 301 PESSKWKAYASDDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNIDDN 360

Query: 361 HHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDLAS 420
           HHLSTERICCSRRSNAL+KSE VA NNSCPPV+SNQIGSLTTEIGKGLKNQALLNGDLAS
Sbjct: 361 HHLSTERICCSRRSNALAKSELVASNNSCPPVRSNQIGSLTTEIGKGLKNQALLNGDLAS 420

Query: 421 EMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVSDRN 480
           EMEDNQANEDSVEKPVPVGA FQA +PEWTGNISDSDSKWLGTR WPSQHENNKSVS+RN
Sbjct: 421 EMEDNQANEDSVEKPVPVGALFQAAIPEWTGNISDSDSKWLGTRLWPSQHENNKSVSNRN 480

Query: 481 PISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE 540
           PI RGRLD CSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE
Sbjct: 481 PIGRGRLDSCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWTAEE 540

Query: 541 ENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID 600
           E RFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID
Sbjct: 541 EKRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPNDID 600

Query: 601 SDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 640
           SD EDVEFGCISGDFGAKAME+LGSK VECSENKQFI I
Sbjct: 601 SDDEDVEFGCISGDFGAKAMEILGSKSVECSENKQFIDI 639

BLAST of Csa4G047920 vs. NCBI nr
Match: gi|470129814|ref|XP_004300803.1| (PREDICTED: AT-rich interactive domain-containing protein 2-like [Fragaria vesca subsp. vesca])

HSP 1 Score: 470.7 bits (1210), Expect = 4.1e-129
Identity = 288/656 (43.90%), Positives = 373/656 (56.86%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAP---------DCLVEGS--------CANVD 60
           M  W + +N S+LD  +  D NP  G  +           +C  +G+        C N D
Sbjct: 1   MAEWSLLTNGSVLDLAETSDANPINGGSVGSGIEFVKDSVECDHKGTDIVKDGVECVNGD 60

Query: 61  HDDCKATIRCYFEKVLWVFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVV 120
            DD K  +RC F++VL VF+KE   RG +RPVPA+ G+ + +DLF+L+ VVRDKGGY +V
Sbjct: 61  DDDYKGRLRCTFDQVLSVFIKEIGDRGVVRPVPAIFGDRQHVDLFKLYCVVRDKGGYDLV 120

Query: 121 SEKELWSSVVVELGLDLGLSA-SVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRK 180
           S++ LWS V  ELGLD G +A SVKLIYFKYL++LE W      G  L NG S  Y    
Sbjct: 121 SKRRLWSHVSKELGLDNGATAASVKLIYFKYLNELEIWFRESCTGRSLGNGESGRY---G 180

Query: 181 NFPCLA-ELEAKIKDILYGVLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEK 240
            F  ++ ELE + + +L     QK   D     +S++ NG                KIE 
Sbjct: 181 TFHLMSVELETEFRGLLLDGTEQKDNGDGLVHLESDR-NG----------------KIEY 240

Query: 241 KEHDLHEDVTPIQQNCTETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRM 300
              D  +  T      T    D+ K       G       V+ + E +     RESL  M
Sbjct: 241 GLSDTKDACTMHSGTGTGNGHDDEKACHDDQNGTFVLPSRVDSK-ENERKRKRRESLSGM 300

Query: 301 LKWVRKTAKHPANPSNGTVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLL 360
           L WV +TA    + S G +P  +KWK +   +  W Q IKA++AL+ R+D++   E+ LL
Sbjct: 301 LNWVIQTATQLGDVSIGEIPEPAKWKKHQGNE-FWFQAIKAREALMVRRDINPKTEE-LL 360

Query: 361 IQKKVRMHPCIYEDNIDDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLT 420
            QKK+RMHP +YEDNI   H L +ER+ C  R    +KS S  C NS PP QSN I    
Sbjct: 361 QQKKLRMHPLMYEDNIAAGH-LFSERLRCRGRLPHSTKSRSCTCCNSSPPTQSNLISPSM 420

Query: 421 TEIGKGLKNQALLNGDLASEMED--NQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSK 480
            E     K Q  +  DLAS          ++  EK V +G  FQA +PEWTG  S+SD K
Sbjct: 421 EEP----KEQEPMEVDLASPHTSVIPSDQDEFREKYVSIGPLFQADVPEWTGEASESDDK 480

Query: 481 WLGTRSWPSQHENNKSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLT 540
           WLGTR WP + E N S+  ++ I RGR   C C+ PGSV C RFHIAEARM+LK ELG  
Sbjct: 481 WLGTRVWPLECEENASLVKKDTIGRGRPHFCGCRLPGSVTCLRFHIAEARMKLKKELGSL 540

Query: 541 FYDWRFHQMGEEISLQWTAEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYY 600
           FY WRF +MGEEI LQWTAEEE RFK L  S +      FWN + KWFP K+R+NL+SYY
Sbjct: 541 FYHWRFDRMGEEICLQWTAEEEKRFKILVQSKY----PFFWNSASKWFPRKTRENLVSYY 600

Query: 601 FNVFLLRQRSYQNRVTPNDIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQ 636
           FNVFL+R+RSYQNR TP +IDSD ++ +FG +S  FG +A++V GS FV CS+NKQ
Sbjct: 601 FNVFLVRRRSYQNRATPKNIDSDDDETDFGSLSEGFGHEAVKVAGSSFVACSQNKQ 624

BLAST of Csa4G047920 vs. NCBI nr
Match: gi|595845222|ref|XP_007208888.1| (hypothetical protein PRUPE_ppa026661mg [Prunus persica])

HSP 1 Score: 468.0 bits (1203), Expect = 2.6e-128
Identity = 281/638 (44.04%), Positives = 373/638 (58.46%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60
           M  W   +  S+LDC +  D     G CI  D  V       D DD +  +RC F++VL 
Sbjct: 1   MAGWSSLTPGSVLDCVETNDAYQKNGSCIGSDIDVRDG-VECDEDDDEVRLRCTFDQVLS 60

Query: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120
           VF+KE   RG +RP+PA++ + + +DLF+LF +VRD+GGY  VS+  LWS V  ELGLD 
Sbjct: 61  VFVKEIGDRGVVRPIPAVIDDRQPVDLFKLFCLVRDRGGYDWVSKNSLWSFVAKELGLDG 120

Query: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180
           G +ASVKLIYFKYL++LEKW           NG S  Y     F   +ELE + +D+L  
Sbjct: 121 GATASVKLIYFKYLNELEKWFRESCKSRSSGNGQSGLY---GEFQLSSELEREFRDLLLD 180

Query: 181 VLRQKSIYDERSGFKSNKPNGNV--NVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCT 240
              QK   D    F+S++ NG +  N+++T     + +   + K+ D  E V    QN  
Sbjct: 181 GPEQKGKGDGPVQFESDE-NGKIEFNLSDTKDAYGMHAGADQCKDDD-EEKVCNDDQN-- 240

Query: 241 ETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNG 300
                          G   S D++N + E D     RESL  ML WV + AK P +PS G
Sbjct: 241 ---------------GVLISLDSLN-KKENDRK-RKRESLSGMLNWVVQIAKQPNDPSIG 300

Query: 301 TVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNID 360
            +PG + W+ +  ++  W QVI+A++ALL R++VD   E+ LL QKK++ HP +YEDN+ 
Sbjct: 301 VIPGPTNWREHKGDEC-WFQVIRAREALLLRRNVDSKTEESLL-QKKLKTHPLLYEDNVV 360

Query: 361 DNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDL 420
             H  S+ER+ CS R     KS S  C +SC   QSN I     E+    K QA    DL
Sbjct: 361 AGHQ-SSERLRCSERFPNSVKSRSCPCCSSCSVPQSNLISPRKKELDNNSKEQAPEEVDL 420

Query: 421 ASEMEDNQANEDSV-EKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVS 480
            +       + D+  EK V VG  FQA +PEWTG  S+SD KWLGTR WP Q E + S+ 
Sbjct: 421 LATNTMVCPSVDAPHEKHVSVGTLFQADVPEWTGVASESDIKWLGTRVWPLQCEEDSSLH 480

Query: 481 DRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540
           + +   +GR D C CQ PGSV C RFHIAEARM+LK ELG  FY WRF +MGEE+SLQWT
Sbjct: 481 EADLTGKGRPDLCGCQLPGSVVCIRFHIAEARMKLKRELGSLFYRWRFDRMGEEVSLQWT 540

Query: 541 AEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600
           AEEE RFK+L  S+    +  FWN + +WF  K+R+NL+SYYFNVFL++ RSYQNRVTP 
Sbjct: 541 AEEEKRFKDLVKSN----SPSFWNRASRWFRKKTRENLVSYYFNVFLVQSRSYQNRVTPK 600

Query: 601 DIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQ 636
           +IDSD ++ EFG  S  F   A+EV  + F  CS+N+Q
Sbjct: 601 NIDSDDDETEFGSFSNGFRHDAVEV-SANFEACSQNQQ 605

BLAST of Csa4G047920 vs. NCBI nr
Match: gi|645267129|ref|XP_008238928.1| (PREDICTED: AT-rich interactive domain-containing protein 2 [Prunus mume])

HSP 1 Score: 463.8 bits (1192), Expect = 5.0e-127
Identity = 280/638 (43.89%), Positives = 374/638 (58.62%), Query Frame = 1

Query: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60
           M  W   +  S+LDC +  D     G CI  D  V       D DD +  +RC F++VL 
Sbjct: 1   MAGWSSLTPGSVLDCVETNDAYQKNGSCIGSDIDVRDG-VECDEDDDEVRLRCTFDQVLS 60

Query: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120
           VF+KE   RG  RP+PA++ + + +DLF+LF +VRD+GGY  VS+  LWS V  ELGLD 
Sbjct: 61  VFVKEIGDRGVARPIPAVIDDRQPVDLFKLFCLVRDRGGYDWVSKNSLWSFVAKELGLDG 120

Query: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180
           G +ASVKLIYFKYL++LEKW   R       +GN     Y +     +ELE + +D+L  
Sbjct: 121 GATASVKLIYFKYLNELEKWF--RESCKSRSSGNGQSGLYGEFQLLSSELEREFRDLLLD 180

Query: 181 VLRQKSIYDERSGFKSNKPNGNV--NVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCT 240
              QK   D    F+S++ NG +  N+++T     + +   + K+ D  E V    QN  
Sbjct: 181 GPEQKGKGDGPVQFESDE-NGKIEFNLSDTKDAYGMHAGADQCKDDD-DEKVCNDDQN-- 240

Query: 241 ETPRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNG 300
                          G   S D++N + E D     RESL  ML WV + AK P +PS G
Sbjct: 241 ---------------GVLISLDSLN-KKENDRK-RKRESLSGMLNWVVQIAKQPNDPSIG 300

Query: 301 TVPGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQKKVRMHPCIYEDNID 360
            +PG + WK +  ++  W QVI+A++ALL R++VD   E+ LL QKK++ HP +YEDNI 
Sbjct: 301 VIPGPTNWKEHKGDEC-WFQVIRAREALLLRRNVDSKTEESLL-QKKLKTHPLLYEDNIV 360

Query: 361 DNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGLKNQALLNGDL 420
             H  S+ER+ CS R     KS S  C +SC   QSN I     E+    K QA    DL
Sbjct: 361 AGHQ-SSERLRCSERFPNSVKSRSCPCCSSCSVPQSNLISPRKKELDNISKEQAPAEVDL 420

Query: 421 ASEMEDNQANEDSV-EKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENNKSVS 480
            +       + D+  EK V VG  FQA +P+WTG  S+SD KWLGTR WP Q E +  + 
Sbjct: 421 LTTNTMVCPSVDAPHEKHVSVGTLFQAEVPDWTGVASESDIKWLGTRVWPLQCEEDSFLH 480

Query: 481 DRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEISLQWT 540
           + +   +GR D C C+ PGSV C RFHIAEARM+LK ELG  FY W+F +MGEE+SLQWT
Sbjct: 481 ETDLTGKGRPDLCGCRLPGSVLCIRFHIAEARMKLKRELGSLFYRWQFDRMGEEVSLQWT 540

Query: 541 AEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNRVTPN 600
           AEEE RFK+L  S+    +  FWN + +WF  K+R+NL+SYYFNVFL++ RSYQNRVTP 
Sbjct: 541 AEEEKRFKDLVKSN----SPSFWNRASRWFRKKTRENLVSYYFNVFLVQSRSYQNRVTPK 600

Query: 601 DIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQ 636
           +IDSD ++ EFG  S  FG  A+EV  + FV CS+N+Q
Sbjct: 601 NIDSDDDETEFGSFSNGFGHDAVEV-SANFVACSQNQQ 606
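
Each HSP header above reports identity and positives as a count over the alignment length, followed by the percentage. The sketch below shows how those percentages follow from the counts. It is only an illustration: the function name `parse_hsp_stats` and the regular expression are assumptions made for this example, not part of any BLAST tool, and the pattern accepts both "Positives" and the "Postives" spelling that some exports of this page use.

```python
import re

# Hypothetical pattern for the HSP summary lines as they appear on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length, _, positives, pos_length, _ = match.groups()
    identical, length = int(identical), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identical": identical,
        "positives": positives,
        "length": length,
        # Percent identity = identical residues / alignment length * 100
        "identity_pct": round(100.0 * identical / length, 2),
        # Percent positives = identical or conservative matches / length * 100
        "positives_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    # HSP 1 of the AT2G46040.1 hit, copied from the alignment above.
    line = "Identity = 83/182 (45.60%), Postives = 112/182 (61.54%), Query Frame = 1"
    print(parse_hsp_stats(line))
```

For the AT2G46040.1 hit, 83 identical residues over 182 aligned positions gives 45.60% identity, and 112 of 182 gives 61.54% positives, matching the values printed in the header.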

The following BLAST results are available for this feature:
Match Name    E-value    Identity  Description
ARID2_ARATH   1.7e-87    35.16     AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana GN=ARID2...
ARID1_ARATH   1.3e-37    45.60     AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana GN=ARID1...

Match Name         E-value    Identity  Description
A0A0A0KZM1_CUCSA   0.0e+00    100.00    Uncharacterized protein OS=Cucumis sativus GN=Csa_4G047920 PE=4 SV=1
M5W8M5_PRUPE       1.8e-128   44.04     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa026661mg PE=4 SV=1
B9RCE7_RICCO       5.4e-120   40.38     DNA binding protein, putative OS=Ricinus communis GN=RCOM_1687390 PE=4 SV=1
V4THD3_9ROSI       1.0e-118   41.09     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10019313mg PE=4 SV=1
A0A067G9K4_CITSI   1.3e-118   40.93     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g006921mg PE=4 SV=1

Match Name    E-value    Identity  Description
AT4G11400.1   9.7e-89    35.16     ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT2G46040.1   7.1e-39    45.60     ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT5G04110.1   3.0e-29    38.89     DNA GYRASE B3
AT1G26580.1   1.8e-21    34.60     FUNCTIONS IN: molecular_function unknown
AT1G13880.1   1.9e-20    31.43     ELM2 domain-containing protein

Match Name                         E-value    Identity  Description
gi|778690826|ref|XP_004146560.2|   0.0e+00    100.00    PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis sativus]
gi|659102274|ref|XP_008452043.1|   0.0e+00    94.68     PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo]
gi|470129814|ref|XP_004300803.1|   4.1e-129   43.90     PREDICTED: AT-rich interactive domain-containing protein 2-like [Fragaria vesca ...
gi|595845222|ref|XP_007208888.1|   2.6e-128   44.04     hypothetical protein PRUPE_ppa026661mg [Prunus persica]
gi|645267129|ref|XP_008238928.1|   5.0e-127   43.89     PREDICTED: AT-rich interactive domain-containing protein 2 [Prunus mume]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR001606   ARID_dom
Vocabulary: Molecular Function
Term        Definition
GO:0003677  DNA binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
cellular_component GO:0005634 nucleus
molecular_function GO:0003677 DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Csa4G047920.1   Csa4G047920.1   mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description          Source   Source Term        Source Description                               Alignment
IPR001606  ARID DNA-binding domain  GENE3D   G3DSA:1.10.150.60                                                   coord: 60..142, score: 7.0
IPR001606  ARID DNA-binding domain  PFAM     PF01388            ARID                                             coord: 66..138, score: 1.
IPR001606  ARID DNA-binding domain  SMART    SM00501            bright_3                                         coord: 49..143, score: 4.7
IPR001606  ARID DNA-binding domain  PROFILE  PS51011            ARID                                             coord: 49..142, score: 17
IPR001606  ARID DNA-binding domain  unknown  SSF46774           ARID-like                                        coord: 52..145, score: 6.02
None       No IPR available         PANTHER  PTHR22970          FAMILY NOT NAMED                                 coord: 204..620, score: 8.9E
None       No IPR available         PANTHER  PTHR22970:SF23     AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 1  coord: 204..620, score: 8.9E
None       No IPR available         SMART    SM01014            ARID_2                                           coord: 46..138, score: 4.
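
The InterPro annotations above place the ARID domain at slightly different coordinates depending on the source database. The sketch below is one way to summarise those calls, as the widest span covered by any source and the core span shared by all of them. The dictionary keys and the function name `summarise_spans` are labels chosen for this illustration only. The coordinates are copied from the annotations above, and the PANTHER family hits (204..620) are left out because they are not ARID-domain calls.

```python
# ARID-domain coordinates (protein residues) copied from the InterPro
# annotations on this page. The keys are illustrative labels, not identifiers
# used by InterPro itself.
ARID_CALLS = {
    "GENE3D G3DSA:1.10.150.60": (60, 142),
    "PFAM PF01388": (66, 138),
    "SMART SM00501": (49, 143),
    "PROFILE PS51011": (49, 142),
    "SSF46774": (52, 145),
    "SMART SM01014": (46, 138),
}


def summarise_spans(calls):
    """Return (union_span, core_span) for a dict of (start, end) intervals.

    union_span is the widest region covered by any source. core_span is the
    region covered by every source, or None if the intervals share no residue.
    """
    starts = [start for start, _ in calls.values()]
    ends = [end for _, end in calls.values()]
    union_span = (min(starts), max(ends))
    core_start, core_end = max(starts), min(ends)
    core_span = (core_start, core_end) if core_start <= core_end else None
    return union_span, core_span


if __name__ == "__main__":
    union_span, core_span = summarise_spans(ARID_CALLS)
    print("widest span:", union_span)
    print("shared core:", core_span)
```

With the coordinates listed here, the widest span is residues 46..145 and the core shared by all six calls is 66..138, which coincides with the PFAM call.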

The following gene(s) are paralogous to this gene:
Gene          Paralogue     Organism                    Block
Csa4G047920   Csa3G121790   Cucumber (Chinese Long) v2  cucuB093