ClCG07G008230 (gene) Watermelon (Charleston Gray) v2.5

Overview
Name: ClCG07G008230
Type: gene
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: AT-rich interactive domain-containing protein 2
Location: CG_Chr07: 22001348 .. 22006866 (-)
RNA-Seq Expression: ClCG07G008230
Synteny: ClCG07G008230
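
As a quick sanity check on the Location field, the sketch below parses a coordinate string of this form and computes the feature length. It assumes the coordinates are 1-based and inclusive, which is a common convention for genome annotations but is not stated on this page. The function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID: START .. END (STRAND)' into a dict.

    Assumption: coordinates are 1-based and inclusive, so the
    feature length is END - START + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("CG_Chr07: 22001348 .. 22006866 (-)")
print(loc["seqid"], loc["strand"], loc["length"])  # CG_Chr07 - 5519
```

Under that assumption the gene spans 5519 bp on the minus strand of CG_Chr07.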
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CTTATTTATTGCAAATTCCCAAAATTTAACAAGGTGCATCTAAGGAGAGAAGGGAAATCTTGGGAAGCCACCTGCGTGCTCTGTTTTCCCTACTTGGTTTTCGTTTTGTTGAGCTGCCTCCCACTTCCACCATTTTTAAACTCATCAAAGTTCTAACCTTCATATATTTGACTGTCCCCTTTTTGTCTTGATTCCATCTTCAACAGTGCAATCTGCGTCTTCATAACGGGTTGCAGACAAAATGGAGTCCAAGATTCCCAAAATGAGGTAGACCCTTTTGAGTTTTCTTTGCAAAGGGTTTTGTCTATCATATGCCCTAATCTTGTAGGTCTTCGTTTTTGTTTCATATGTATTTTTAACGAGAAAGAAAAGGACAATAGGGAACCCTCAGCTGCGATTTTTGCTTTTTAATTTCAAGTGGGGTTGAGTTCATGGGGAGATGGCCTATTTCATCCAATGCTTCCATTTTAGATTGCAACAAAGATGTTGATCCTAGTCCCAGTAATGGCTGTTGCATTGCCCCGGCTTGTTCGGTAGAGGGAAGTTATGCGAATGTTGACTATGATGATTACAAAGCCACAATTAGATGTTATTTTGAGAAAATTCTTTGGGTTTTTCTGAAGGAAATTGGTCGTAGAGGACTTATTAGGCCAGTGCCGGCGTTACTAGGTGAAGGGGGATCTTTGGATTTGTTTGAACTCTTCATGGTAGTAAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTGGTTTTGGAATTAGGTTTGGATCTTGGGCTTTCGGCTTCAGTGAAGTTGATTTATTTGAAGTATTTAAATGACGTAGAGAAATGGCTTATGGTGAGATGTGGAGGCACAAAACTGGAAAATGGGAACTCTGATTATCACTACAGGAAAAGCTTTCCATTTTTGTCAGAACTGGAGGTGAAGATTAAGGGTATGTTATTTGGTGTGCTGAGACAAAAGAGCATATATGATGAATGTTCTAGATTCAAATCTAACAAACCAAATGAGAACGTTAATGTTGCTGCCGCTACAGTGGAGAAGGAAATAAAGAAGAAAGAACACGATCTTCATGGGGATGTTACACCAATTCAACAAAATTGTATCGATACACCTCGGGATAATGGCGAAACCGATCAAATCCATGTTATTGAAGATTGTAGAATTTTGGATGCTGTTAATGTTGAAACTGACATAGACTCTCATGGGAGATATCGAAAATCGTTATTACAAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGTAAATCCATTAAATGGTACAATACCAGGGGCATCCAAGTGGAAAGCGTATGCTAGCGATGATGCATTATGGCTTCAAGTTATCAGGGCAAAGAATGCTCTTTTAACTAGGAAGGATGTTGACAAAAATGCTGAGAAACGTCTGTTAATACAGGTAGATTTCGCCCACTTATTCTGCTTGTCAGCCAGTTATCTCTCTCCCCCTTCCCCTTTTTCCTTGATCAATAGACATACATACTTGGGTTCTTGCTTTGGACTGCAATTATGTACAAAAAACTGACCTATGCAGGTGTTACGATCTTGGTTTTCTTTTTAGGATAGATGTCTCAAAACTGATCTATGCAGGTATTACAATCTTTGGCCTTCTTTTTCAGGATAGATGTCTCCTGATTATAATCAGTGCCCTTTCCAATCTTATGTGAATGGGAACAGTTGCCTCCCGCTCTATTAGAACCTCGGGTCTCCATCTTCGCAAGCAGCATACGACTTTTACGTCACCTTGTCTTGTGTCCCACATGATTACACGATGGACACAGTGAGGAAAACTTAGATTTTTGTCATTGGTTCGACTAGGATAGTCTTTAATCAGGGATGCTTAAATGAAAGAGCAATGTTGTTCCTTTTTTATGGGTTCTGGCGTTCTGTTTCTCAACACCTTTTAACTCACTATTAGGAGTTCACATCTCTCACAAGTAAGTGCGACTGTAAAGAA
AGAGCTCTCGCCACCACCAAAAGGATGTGTATATATTTATTATATTTCTTTTGGGAGAATAAGGTCTTAGTGTTTTCCAAACGGAAGTTGTTATTCATTAGTTATGAAGTCAAGTGTCAAGTTGTGCAGGGTAGTCCATGCTCATAGGGTTTCAAACTTATGGGAAGGATGAGTTCTTCAAGGAATAAGTTTACTGTATGGATATAAATGTGTCTGGGAGTTCTAGGTTTAGGAAATCGAAAGTGGATTTTACATACAGTAAGATCGAAAGGACGAGAGTTTCATCAAGAATTCTGAGGATTGAATAAGGCTTAAGAGCTGGTTGGAGGAGCTGCATATAAGTGAGAGCTCTAGCTGGAAGAAAAATTCCAAAGTCCAGGGGATTTAGGAAGGTGATGCTAATAGCACCTTAGGGAGGAATAGCAACCTAGTGATTGTGAATAAGACTGCTTTCACTAAGGAGGATATGTAAAGTGAGATTATTAGTATTTTTTTGTAATATCGAGGAGGGCTGTCTCAATTTTCTGATTTCTTTTTCATGGGCCAGGGATTTTGTTGTGAGGGGAGGGGATATCCTTATATGTATAGAAACTACATTTACCTTGGGCCTACAGTATCATTTTCAACATTGACTTTTGGTAAAAGGACTCAATAGGATCCTAGGATGAAGGGCTCCTGTTGTCTTGTGGTCATTAGGACAAGGATCTTTCTCGCATCCTCCAATATTTGTGTCTTCCTCCATGCAACATTGTTTTTAGATATCAAGTGTTGTTAGACAGAGGACTTAAATCTCCTTTTCCTTCTTTTACCTGTCGATGCTCTTAATCATCTACTAGAAAGGGTTCAGAGGACTTGTCTATCAATTTTAGATAGGTGGAGGCATGAGTTCAAACTCATCTGCTGTACATTGATTCTCCATGTTGTCGTGGAAGAGATCGACCAAGTGGAGAAACCCTTTCTGGAAGGAGGAGGACTTACCCTTAGCCAATTGATGTTTTCTTAATCCCCTTTTTTCGCGTATATATATTTAGAGCTTCTCTTGCTTCTCATCCTCGAGGACAATATGTGTTATGAGGAATCTACTGTAGTAATGGATGAAGGATCATTTTGGTTCTCACCTGGTTAGTTGGAATAGGGTGTGGTGTGCCTTTTTGCTTTAGTTGTGATAGGTATCAGAAACTTGGATTCCAAGAACATTGTTTTTCTGGATAAGTGGGTTTTAGAGATTCTAATTGGAGGTGACTTTGCGTTTGGCGTAAAATTATTATGGGCAAGAGCAGTCTTTAGGTTAATGACTAATCCTTCATTCTTCTTAGAGGTAGTACACATCGTAGTCCTTGGAAGTTCATTTTCTTGGGCTTTTGTCCTCGTCCTTGACGATGTAGATTGTGTTTTTAGGAGCAGAGGTGGTCTAGTAGGAACCGTCAATCTACTTTTTCCCCATCCTTTTCACCCTTTTTCTTCACAAAATTGGTCTTTTCATTTTTCATTTTATTTTAGAAAGTTGGTCTAGTAGGAGCCCTCAGTCTACTCTTTCTTGACAAAATATATATACATATAGGAAAAATAAAAGGAGAGAAGCCTCAACCCACAACTAAGGAGGTCACAAGAAAGCTCCCTACTTGGGAAAAATATAAAATGTTGAATAGAAATCAATTTAATGGAGTTTCCCCATGTGTACACATGTATGATGCTAGATAGAAATACCATTGTTTTGCATTGAGCTTTTCATATATCTTCTATTTTTCTTTTGAAAAATTGGGAAACGTGAGCTTGTTTTGACATATTTATGACAAAGATATGTTCAAACGCTTGGACACAATAAATAAACCTGGGGAAAAATGTCCAATTTGTTTGTTACTTGTTAGACTTGTTCTTTGCAGCCATATGTCTCTATTAACTTGGTTTTGTACGTCCTTTGTTTGTTATGACAAAGGTTGCTTTCACTTATACATATATGCAGATAACATATAGAAGTTTATGAACCATGTTCATACAAT
TCAATATATCTCTAAACCTTTTTATCTATTACTTCATTTTCCAAAATGCCAACGTGGCTGTTCTAGTGTCATATTTATGTTGAGTGTAGGCAGTGCCTATATGCTTGTACCTTAGGTTGGGAAGCATAAGTTTTAGCTAGGAAGACAGAAGCTTATCTATCAATGGTTTCATTGTTTGAACAAAATTAAATGAACTACTGAGTTTGAGAAAAAAAAGATTTTGGTTTAAACTTGCAGAAGAAAGTAAGGATGCATCCATCCATTTATGAGGATAATATTGATAAACATCACCTTTCTACAGAAAGGATCTGTTGCAGCAAAAGATCTAATGCTTTGGTCGAATCCGTCTTCGTCACGCATAATAATTCATGTCCAATCGTTCAAAGTAATTGTATTAGTAGTCTAACAACAGAAATTGGGAAGGGACTCAAGAAGAATCGAGCACTTTTGAATGGTGATTTTCCATCGGAAACGGAAGACAATCAGCCAAATGAAGATTCTGTTGATAAGCCAGTTCCCGTGGGTGCTTTATTTCAAGCAGTGGTTCCTGAATGGACTGGTAATATTTCCGATAGCGACTCTAAATGGCTAGGGACAAAGTCGTGGCCTTCTCCACACGGAAATAGTAATTCCGTAAGTGATAGAAATCCCATTGGTAGAGGGAGACTGGATTCATGTAGATGTCAATTTCCAGGTTCTGTTGAATGTTTTAGATTTCACATTGCGGAAGCGAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTACGATTGGAGATTTCATCATATGGGGGAGGAAATATCTCTGCAGTGGACTGCCGAAGAGGAAAAGAGATTTAAGGAGTTGGCAACATCAAGTTTTAACAATCAGAGCCAGTGCTTTTGGAGTTATTCCTTGAAGTGGTTCCCATTGAAATCAAAGAAAAATTTGATAAGCTATTACTTCAATGTGTTTCTTTTACGGCAGAGAAGCTATCAGAATCGTGTGACTCCAAATAGCATTGATAGCGATGATGAAGATGTAGAGTTTGGTTGCATCAGTGGTGACTTTGGGGCTAAGGCAATGGAAATTTTAGGCTCAAAGTCTGTAGAATGTTCTGAAAATAGACAATTCACAGATGTGGAGTAGAATCGATGGAGGCAACAGGCAAGTTTGAAGAAAGGGAGAAAATTGAAGAATGAAGGTAATTTTTCTCAACATGCTGCTGCAGCCTGCAGCAGAAAGAAACCCAATTTTGGCGAGAACACGTTTTAGCTACACAACACCATTTTTGTTGGGGGAATCTGTATGCTATCAGCTGGTGAGACGAAATATGAGGACCAAAGAGAGCCAGCGGTACATTTGCTTATTTTGAATTTGTGTATGTGTGGAACTCAATCTGATACTTCTTAGTTTTGTATTTGGAGATTGTTGTAATCATTTGAAGGGGTTTCGAGAGAATTTTACTGAAGATTTAAAAGCCAACCTGGGCATAGCTCAACCGGTTAAAGACATGCCTTCAACCAAAAAGATTAGA

mRNA sequence

CTTATTTATTGCAAATTCCCAAAATTTAACAAGGTGCATCTAAGGAGAGAAGGGAAATCTTGGGAAGCCACCTGCGTGCTCTGTTTTCCCTACTTGGTTTTCGTTTTGTTGAGCTGCCTCCCACTTCCACCATTTTTAAACTCATCAAAGTTCTAACCTTCATATATTTGACTGTCCCCTTTTTGTCTTGATTCCATCTTCAACAGTGCAATCTGCGTCTTCATAACGGGTTGCAGACAAAATGGAGTCCAAGATTCCCAAAATGAGTGGGGTTGAGTTCATGGGGAGATGGCCTATTTCATCCAATGCTTCCATTTTAGATTGCAACAAAGATGTTGATCCTAGTCCCAGTAATGGCTGTTGCATTGCCCCGGCTTGTTCGGTAGAGGGAAGTTATGCGAATGTTGACTATGATGATTACAAAGCCACAATTAGATGTTATTTTGAGAAAATTCTTTGGGTTTTTCTGAAGGAAATTGGTCGTAGAGGACTTATTAGGCCAGTGCCGGCGTTACTAGGTGAAGGGGGATCTTTGGATTTGTTTGAACTCTTCATGGTAGTAAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTGGTTTTGGAATTAGGTTTGGATCTTGGGCTTTCGGCTTCAGTGAAGTTGATTTATTTGAAGTATTTAAATGACGTAGAGAAATGGCTTATGGTGAGATGTGGAGGCACAAAACTGGAAAATGGGAACTCTGATTATCACTACAGGAAAAGCTTTCCATTTTTGTCAGAACTGGAGGTGAAGATTAAGGGTATGTTATTTGGTGTGCTGAGACAAAAGAGCATATATGATGAATGTTCTAGATTCAAATCTAACAAACCAAATGAGAACGTTAATGTTGCTGCCGCTACAGTGGAGAAGGAAATAAAGAAGAAAGAACACGATCTTCATGGGGATGTTACACCAATTCAACAAAATTGTATCGATACACCTCGGGATAATGGCGAAACCGATCAAATCCATGTTATTGAAGATTGTAGAATTTTGGATGCTGTTAATGTTGAAACTGACATAGACTCTCATGGGAGATATCGAAAATCGTTATTACAAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGTAAATCCATTAAATGGTACAATACCAGGGGCATCCAAGTGGAAAGCGTATGCTAGCGATGATGCATTATGGCTTCAAGTTATCAGGGCAAAGAATGCTCTTTTAACTAGGAAGGATGTTGACAAAAATGCTGAGAAACGTCTGTTAATACAGACAGAGGACTTAAATCTCCTTTTCCTTCTTTTACCTGTCGATGCTCTTAATCATCTACTAGAAAGGGTTCAGAGGACTTGTCTATCAATTTTAGATAGGTGGAGGCATGAGTTCAAACTCATCTGCTGTACATTGATTCTCCATGTTGTCGTGGAAGAGATCGACCAAGTGGAGAAACCCTTTCTGGAAGGAGGAGGACTTACCCTTAGCCAATTGATAAACTTGGATTCCAAGAACATTGTTTTTCTGGATAAGTGGGTTTTAGAGATTCTAATTGGAGGTGACTTTGCGTTTGGCGTAAAATTATTATGGGCAAGAGCAGTCTTTAGGTTAATGACTAATCCTTCATTCTTCTTAGAGGTAGTACACATCGTAGTCCTTGGAAGTTCATTTTCTTGGGCTTTTGTCCTCGTCCTTGACGATGTAGATTGTGTTTTTAGGAGCAGAGGTGGTCTAAAAGTTGGTCTAGTAGGAGCCCTCAGTCTACTCTTTCTTGACAAAATATATATACATATAGGAAAAATAAAAGGAGAGAAGCCTCAACCCACAACTAAGGAGAAGAAAGTAAGGATGCATCCATCCATTTATGAGGATAATATTGATAAACATCACCTTTCTACAGAAAGGATCTGTTGCAGCAAAAGATCTAATGCTTTGGTCGAATCCGTCTTCGTCACGCATAATAATTCATGTCCAATCGTTCAAAGTA
ATTGTATTAGTAGTCTAACAACAGAAATTGGGAAGGGACTCAAGAAGAATCGAGCACTTTTGAATGGTGATTTTCCATCGGAAACGGAAGACAATCAGCCAAATGAAGATTCTGTTGATAAGCCAGTTCCCGTGGGTGCTTTATTTCAAGCAGTGGTTCCTGAATGGACTGGTAATATTTCCGATAGCGACTCTAAATGGCTAGGGACAAAGTCGTGGCCTTCTCCACACGGAAATAGTAATTCCGTAAGTGATAGAAATCCCATTGGTAGAGGGAGACTGGATTCATGTAGATGTCAATTTCCAGGTTCTGTTGAATGTTTTAGATTTCACATTGCGGAAGCGAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTACGATTGGAGATTTCATCATATGGGGGAGGAAATATCTCTGCAGTGGACTGCCGAAGAGGAAAAGAGATTTAAGGAGTTGGCAACATCAAGTTTTAACAATCAGAGCCAGTGCTTTTGGAGTTATTCCTTGAAGTGGTTCCCATTGAAATCAAAGAAAAATTTGATAAGCTATTACTTCAATGTGTTTCTTTTACGGCAGAGAAGCTATCAGAATCGTGTGACTCCAAATAGCATTGATAGCGATGATGAAGATGTAGAGTTTGGTTGCATCAGTGGTGACTTTGGGGCTAAGGCAATGGAAATTTTAGGCTCAAAGTCTGTAGAATGTTCTGAAAATAGACAATTCACAGATGTGGAGTAGAATCGATGGAGGCAACAGGCAAGTTTGAAGAAAGGGAGAAAATTGAAGAATGAAGGTAATTTTTCTCAACATGCTGCTGCAGCCTGCAGCAGAAAGAAACCCAATTTTGGCGAGAACACGTTTTAGCTACACAACACCATTTTTGTTGGGGGAATCTGTATGCTATCAGCTGGTGAGACGAAATATGAGGACCAAAGAGAGCCAGCGGTACATTTGCTTATTTTGAATTTGTGTATGTGTGGAACTCAATCTGATACTTCTTAGTTTTGTATTTGGAGATTGTTGTAATCATTTGAAGGGGTTTCGAGAGAATTTTACTGAAGATTTAAAAGCCAACCTGGGCATAGCTCAACCGGTTAAAGACATGCCTTCAACCAAAAAGATTAGA

Coding sequence (CDS)

ATGGAGTCCAAGATTCCCAAAATGAGTGGGGTTGAGTTCATGGGGAGATGGCCTATTTCATCCAATGCTTCCATTTTAGATTGCAACAAAGATGTTGATCCTAGTCCCAGTAATGGCTGTTGCATTGCCCCGGCTTGTTCGGTAGAGGGAAGTTATGCGAATGTTGACTATGATGATTACAAAGCCACAATTAGATGTTATTTTGAGAAAATTCTTTGGGTTTTTCTGAAGGAAATTGGTCGTAGAGGACTTATTAGGCCAGTGCCGGCGTTACTAGGTGAAGGGGGATCTTTGGATTTGTTTGAACTCTTCATGGTAGTAAGAGATAAAGGAGGTTATCAAGTGGTTTCAGAAAAGGAACTATGGTCTTCAGTGGTTTTGGAATTAGGTTTGGATCTTGGGCTTTCGGCTTCAGTGAAGTTGATTTATTTGAAGTATTTAAATGACGTAGAGAAATGGCTTATGGTGAGATGTGGAGGCACAAAACTGGAAAATGGGAACTCTGATTATCACTACAGGAAAAGCTTTCCATTTTTGTCAGAACTGGAGGTGAAGATTAAGGGTATGTTATTTGGTGTGCTGAGACAAAAGAGCATATATGATGAATGTTCTAGATTCAAATCTAACAAACCAAATGAGAACGTTAATGTTGCTGCCGCTACAGTGGAGAAGGAAATAAAGAAGAAAGAACACGATCTTCATGGGGATGTTACACCAATTCAACAAAATTGTATCGATACACCTCGGGATAATGGCGAAACCGATCAAATCCATGTTATTGAAGATTGTAGAATTTTGGATGCTGTTAATGTTGAAACTGACATAGACTCTCATGGGAGATATCGAAAATCGTTATTACAAATGCTGAAGTGGGTGAGAAAGACTGCAAAGCATCCTGTAAATCCATTAAATGGTACAATACCAGGGGCATCCAAGTGGAAAGCGTATGCTAGCGATGATGCATTATGGCTTCAAGTTATCAGGGCAAAGAATGCTCTTTTAACTAGGAAGGATGTTGACAAAAATGCTGAGAAACGTCTGTTAATACAGACAGAGGACTTAAATCTCCTTTTCCTTCTTTTACCTGTCGATGCTCTTAATCATCTACTAGAAAGGGTTCAGAGGACTTGTCTATCAATTTTAGATAGGTGGAGGCATGAGTTCAAACTCATCTGCTGTACATTGATTCTCCATGTTGTCGTGGAAGAGATCGACCAAGTGGAGAAACCCTTTCTGGAAGGAGGAGGACTTACCCTTAGCCAATTGATAAACTTGGATTCCAAGAACATTGTTTTTCTGGATAAGTGGGTTTTAGAGATTCTAATTGGAGGTGACTTTGCGTTTGGCGTAAAATTATTATGGGCAAGAGCAGTCTTTAGGTTAATGACTAATCCTTCATTCTTCTTAGAGGTAGTACACATCGTAGTCCTTGGAAGTTCATTTTCTTGGGCTTTTGTCCTCGTCCTTGACGATGTAGATTGTGTTTTTAGGAGCAGAGGTGGTCTAAAAGTTGGTCTAGTAGGAGCCCTCAGTCTACTCTTTCTTGACAAAATATATATACATATAGGAAAAATAAAAGGAGAGAAGCCTCAACCCACAACTAAGGAGAAGAAAGTAAGGATGCATCCATCCATTTATGAGGATAATATTGATAAACATCACCTTTCTACAGAAAGGATCTGTTGCAGCAAAAGATCTAATGCTTTGGTCGAATCCGTCTTCGTCACGCATAATAATTCATGTCCAATCGTTCAAAGTAATTGTATTAGTAGTCTAACAACAGAAATTGGGAAGGGACTCAAGAAGAATCGAGCACTTTTGAATGGTGATTTTCCATCGGAAACGGAAGACAATCAGCCAAATGAAGATTCTGTTGATAAGCCAGTTCCCGTGGGTGCTTTATTTCAAGCAGTGGTTCCTGAATGGACTGGTAATATTTCCGATAGCGACTCTAAATGGCTAGGGACAAAGTCGTGGCCTTCTCCACACGGAAATAGTAA
TTCCGTAAGTGATAGAAATCCCATTGGTAGAGGGAGACTGGATTCATGTAGATGTCAATTTCCAGGTTCTGTTGAATGTTTTAGATTTCACATTGCGGAAGCGAGGATGAGATTAAAGCTTGAACTTGGTTTGACATTCTACGATTGGAGATTTCATCATATGGGGGAGGAAATATCTCTGCAGTGGACTGCCGAAGAGGAAAAGAGATTTAAGGAGTTGGCAACATCAAGTTTTAACAATCAGAGCCAGTGCTTTTGGAGTTATTCCTTGAAGTGGTTCCCATTGAAATCAAAGAAAAATTTGATAAGCTATTACTTCAATGTGTTTCTTTTACGGCAGAGAAGCTATCAGAATCGTGTGACTCCAAATAGCATTGATAGCGATGATGAAGATGTAGAGTTTGGTTGCATCAGTGGTGACTTTGGGGCTAAGGCAATGGAAATTTTAGGCTCAAAGTCTGTAGAATGTTCTGAAAATAGACAATTCACAGATGTGGAGTAG
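
Because the CDS is a contiguous stretch of the spliced mRNA, the UTR lengths can be estimated by locating the CDS inside the mRNA sequence. The following is a minimal sketch under that assumption. The helper name `utr_lengths` is hypothetical, and the example uses a toy sequence rather than the full sequences above.

```python
def clean(seq):
    """Remove whitespace and line wraps, and normalize to upper case."""
    return "".join(seq.split()).upper()

def utr_lengths(mrna, cds):
    """Return (five_prime_utr_len, three_prime_utr_len).

    Assumes the CDS occurs exactly as one contiguous substring of the mRNA.
    """
    mrna, cds = clean(mrna), clean(cds)
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found in mRNA")
    return start, len(mrna) - start - len(cds)

# Toy example: 3 nt of 5' UTR, a 9 nt CDS, 2 nt of 3' UTR.
print(utr_lengths("ggg ATGAAATAG\ncc", "ATGAAATAG"))  # (3, 2)
```

Pasting the full mRNA and CDS strings from this page into the same function would give the UTR lengths for this transcript. The `clean` step matters because the long sequences are wrapped across several lines.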

Protein sequence

MESKIPKMSGVEFMGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILWVFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIKGMLFGVLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIKKKEHDLHGDVTPIQQNCIDTPRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTIPGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALNHLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDSKNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVLVLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYEDNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRALLNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNSNSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRVTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE
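
The protein sequence should correspond to a codon-by-codon translation of the CDS. The sketch below illustrates this with the standard genetic code, which is an assumption here (the page does not name the translation table). It is applied only to the first and last few codons of the CDS shown above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (not handled in this sketch).
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGAGTCCAAGATTCCCAAAATGAGT"))  # MESKIPKMS (start of the protein)
print(translate("GATGTGGAGTAG"))                 # DVE (end of the protein, before the stop)
```

Both fragments agree with the start (MESKIPKMS...) and end (...TDVE) of the protein sequence listed above.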
Homology
BLAST of ClCG07G008230 vs. NCBI nr
Match: XP_038893741.1 (AT-rich interactive domain-containing protein 2 [Benincasa hispida])

HSP 1 Score: 1038.9 bits (2685), Expect = 2.4e-299
Identity = 549/832 (65.99%), Positives = 584/832 (70.19%), Query Frame = 0

Query: 8   MSGVEFMGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCY 67
           MSG+EFMGRWPISSNASI+DCNKDVDP+PSNGCCIAP C VEGSYANV+YDD KATIRCY
Sbjct: 1   MSGIEFMGRWPISSNASIVDCNKDVDPNPSNGCCIAPDCLVEGSYANVNYDDCKATIRCY 60

Query: 68  FEKILWVFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVL 127
           FEKILWVFLKEIGRRG IRPV ALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVL
Sbjct: 61  FEKILWVFLKEIGRRGSIRPVAALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVL 120

Query: 128 ELGLDLGLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIK 187
           ELGLDLGLSASVKLIY KYL+D+EKWLMVR GGTKLENGNSDYHYRKSFPFLSELE K+K
Sbjct: 121 ELGLDLGLSASVKLIYSKYLSDLEKWLMVRSGGTKLENGNSDYHYRKSFPFLSELEAKVK 180

Query: 188 GMLFGVLRQKSIYDECSRFKSNKPNENVNVAAATVEKEI-----KKKEHDLHGDVTPIQQ 247
            ML         YDECS FKSNKPN NVNVA A +EKEI     KK+EHDLHGDVTPIQQ
Sbjct: 181 CML---------YDECSGFKSNKPNGNVNVATAALEKEIKFPKLKKEEHDLHGDVTPIQQ 240

Query: 248 NCIDTPRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNP 307
           NC +TPRDNGETDQIHVIEDCR L AVN+ET++D+HGRYR+SLL+MLKW RKTAKHP NP
Sbjct: 241 NCTETPRDNGETDQIHVIEDCRSLAAVNIETELDTHGRYRESLLRMLKWARKTAKHPGNP 300

Query: 308 LNGTIPGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLP 367
            N T+PGASKWKAYASDDALWLQVIRAK+ALLTRKDVD+ AEKRLLIQ            
Sbjct: 301 SNCTVPGASKWKAYASDDALWLQVIRAKDALLTRKDVDRIAEKRLLIQ------------ 360

Query: 368 VDALNHLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQL 427
                                                                       
Sbjct: 361 ------------------------------------------------------------ 420

Query: 428 INLDSKNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFS 487
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 488 WAFVLVLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMH 547
                                                                 KK RMH
Sbjct: 481 ------------------------------------------------------KKTRMH 540

Query: 548 PSIYEDNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKK 607
           PSIYEDNID H LSTERICCSK+SNA         NNS P +QSNCISSLTTEIGKGL +
Sbjct: 541 PSIYEDNIDNHQLSTERICCSKKSNA------SACNNSHPTIQSNCISSLTTEIGKGL-E 600

Query: 608 NRALLNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSP 667
           N+AL NGD PS+ EDNQPNEDSV+KPVP GALFQAV+PEWTGNISDSDSKWLGT+SWPS 
Sbjct: 601 NQALSNGDLPSKMEDNQPNEDSVEKPVPTGALFQAVIPEWTGNISDSDSKWLGTQSWPSQ 630

Query: 668 HGNSNS-VSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHM 727
           HGN NS VSD+NPIG+GR DSC CQFPGSVECFRFHIAEARM LKLELGLTFYDWRFHHM
Sbjct: 661 HGNINSVVSDKNPIGKGRPDSCSCQFPGSVECFRFHIAEARMGLKLELGLTFYDWRFHHM 630

Query: 728 GEEISLQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQR 787
           GEEISLQWTAEEEKRFKELA SSFNNQS+CFW+YSLKWFP+KS+KNLISYYFNVFLLRQR
Sbjct: 721 GEEISLQWTAEEEKRFKELAVSSFNNQSRCFWNYSLKWFPMKSRKNLISYYFNVFLLRQR 630

Query: 788 SYQNRVTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE 834
           SYQNR TPNSIDSDDED+EFGCISGDFGAKAMEILGSKSVEC+ENRQFTDVE
Sbjct: 781 SYQNRATPNSIDSDDEDLEFGCISGDFGAKAMEILGSKSVECAENRQFTDVE 630
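
The percentages on each HSP summary line are the ratio of matching columns to alignment length. In BLAST output that denominator normally counts gap columns, so it can differ from the query length. The sketch below recomputes the percentages from the fractions reported for the first hit. The function name `hsp_fractions` is hypothetical.

```python
import re

def hsp_fractions(line):
    """Map each 'Label = a/b' pair to (matches, alignment_length, percent)."""
    stats = {}
    for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(num), int(den)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats

stats = hsp_fractions("Identity = 549/832 (65.99%), Positives = 584/832 (70.19%)")
for label, (matches, length, percent) in stats.items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

The recomputed values (65.99% and 70.19%) match the percentages printed in the summary line for XP_038893741.1.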

BLAST of ClCG07G008230 vs. NCBI nr
Match: XP_008452043.1 (PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo])

HSP 1 Score: 1013.4 bits (2619), Expect = 1.1e-291
Identity = 540/826 (65.38%), Positives = 575/826 (69.61%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRWPISSN SILDCNKDVDP+PSNG CIAP C VEGS ANVD+DD KATIRCYFEKILW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSNGYCIAPDCLVEGSRANVDHDDCKATIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEI RRG IRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVV+ELGLDL
Sbjct: 61  VFLKEICRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSD-YHYRKSFPFLSELEVKIKGMLFG 193
           GLSASVKLIY KYL+++EKWLMVR GGTKLENGNSD Y+YRKSFP L+ELE KIK ML+G
Sbjct: 121 GLSASVKLIYFKYLSELEKWLMVRRGGTKLENGNSDYYYYRKSFPCLAELEAKIKDMLYG 180

Query: 194 VLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIK-----KKEHDLHGDVTPIQQNCIDT 253
           VLRQKSIYDE   FKSNKPN NVNVA    EKEIK     KKEHDLH DVTPIQQNC +T
Sbjct: 181 VLRQKSIYDERPGFKSNKPNGNVNVAETAAEKEIKFPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
           PR NGET+QIHVI DCR LDAVNVET+ DSHGR R+SLL+MLKWVRKTAKHP NP NGT+
Sbjct: 241 PRVNGETNQIHVIGDCRSLDAVNVETETDSHGRSRESLLRMLKWVRKTAKHPANPSNGTV 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           P +SKWKAYASDDALWLQVI+AK+ALL RKDVDK AEKRLLIQ                 
Sbjct: 301 PESSKWKAYASDDALWLQVIKAKDALLNRKDVDKTAEKRLLIQ----------------- 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
                                                                       
Sbjct: 361 ------------------------------------------------------------ 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                                            KKVRMHP IYE
Sbjct: 481 -------------------------------------------------KKVRMHPCIYE 540

Query: 554 DNI-DKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRAL 613
           DNI D HHLSTERICCS+RSNAL +S  V  NNSCP V+SN I SLTTEIGKGL KN+AL
Sbjct: 541 DNIDDNHHLSTERICCSRRSNALAKSELVASNNSCPPVRSNQIGSLTTEIGKGL-KNQAL 600

Query: 614 LNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNS 673
           LNGD  SE EDNQ NEDSV+KPVPVGALFQA +PEWTGNISDSDSKWLGT+ WPS H N+
Sbjct: 601 LNGDLASEMEDNQANEDSVEKPVPVGALFQAAIPEWTGNISDSDSKWLGTRLWPSQHENN 639

Query: 674 NSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEIS 733
            SVS+RNPIGRGRLDSC CQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEIS
Sbjct: 661 KSVSNRNPIGRGRLDSCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEIS 639

Query: 734 LQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNR 793
           LQWTAEEEKRFKELA SSFNNQ+QCFW++SLKWFP+KS+KNLISYYFNVFLLRQRSYQNR
Sbjct: 721 LQWTAEEEKRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNR 639

Query: 794 VTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV 833
           VTPN IDSDDEDVEFGCISGDFGAKAMEILGSKSVECSEN+QF D+
Sbjct: 781 VTPNDIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENKQFIDI 639

BLAST of ClCG07G008230 vs. NCBI nr
Match: XP_004146560.2 (AT-rich interactive domain-containing protein 2 [Cucumis sativus] >KGN53331.1 hypothetical protein Csa_015265 [Cucumis sativus])

HSP 1 Score: 990.7 bits (2560), Expect = 7.5e-285
Identity = 529/826 (64.04%), Positives = 567/826 (68.64%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRWPISSN SILDCNKDVDP+PS G CIAP C VEGS ANVD+DD KATIRCYFEK+LW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKE  RRG IRPVPALLGEG SLDLFELFMVVRDKGGYQVVSEKELWSSVV+ELGLDL
Sbjct: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSD-YHYRKSFPFLSELEVKIKGMLFG 193
           GLSASVKLIY KYL+D+EKWLMVR GGTKLENGNSD Y+YRK+FP L+ELE KIK +L+G
Sbjct: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180

Query: 194 VLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIK-----KKEHDLHGDVTPIQQNCIDT 253
           VLRQKSIYDE S FKSNKPN NVNVA    EKEIK     KKEHDLH DVTPIQQNC +T
Sbjct: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
           PRDNG+T+QIHVI DCR  DAVNVET+ DSHG  R+SL +MLKWVRKTAKHP NP NGT+
Sbjct: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           PG+SKWKAYAS+DALWLQVI+AK+ALL RKDVDK AEKRLLIQ                 
Sbjct: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQ----------------- 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
                                                                       
Sbjct: 361 ------------------------------------------------------------ 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                                            KKVRMHP IYE
Sbjct: 481 -------------------------------------------------KKVRMHPCIYE 540

Query: 554 DNI-DKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRAL 613
           DNI D HHLSTERICCS+RSNAL +S  V  NNSCP VQSN I SLTTEIGKGL KN+AL
Sbjct: 541 DNIDDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGL-KNQAL 600

Query: 614 LNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNS 673
           LNGD  SE EDNQ NEDSV+KPVPVGA FQAV+PEWTGNISDSDSKWLGT+SWPS H N+
Sbjct: 601 LNGDLASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENN 639

Query: 674 NSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEIS 733
            SVSDRNPI RGRLD C CQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEIS
Sbjct: 661 KSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEIS 639

Query: 734 LQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNR 793
           LQWTAEEE RFKELA SSFNNQ+QCFW++SLKWFP+KS+KNLISYYFNVFLLRQRSYQNR
Sbjct: 721 LQWTAEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNR 639

Query: 794 VTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV 833
           VTPN IDSD EDVEFGCISGDFGAKAME+LGSK VECSEN+QF  +
Sbjct: 781 VTPNDIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 639

BLAST of ClCG07G008230 vs. NCBI nr
Match: XP_022931395.1 (AT-rich interactive domain-containing protein 2-like isoform X1 [Cucurbita moschata])

HSP 1 Score: 958.7 bits (2477), Expect = 3.2e-275
Identity = 518/826 (62.71%), Positives = 577/826 (69.85%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRW +SSNASILDCNKDVDP+PSNGCCIA  C VE SY NVDYDD KA IRCYFEKILW
Sbjct: 1   MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEIGRRG +RP+PAL+GEGG+LDLFELF+VVRDKGG QVVSEK+LWSSVV+ELGLDL
Sbjct: 61  VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIKGMLFGV 193
           GLSASVKLIY KYL+D+EKWLMVRCG TKLENG+SDY Y+KS PFLSEL  KI GML+GV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180

Query: 194 LRQKSIYDECSRFKSNKPNENVNV-AAATVEK-----EIKKKEHDLHGDVTPIQQNCIDT 253
            RQ SIYDEC  FKSNK N NVNV AAA VEK     EIKKKEHDLHGDVT IQQ+C   
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCT-- 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
                ET  IHVIED + LDAVNVE +I+S G+YR+SLL+MLKWVRKTAKHP +PLNGTI
Sbjct: 241 -----ETHPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTI 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           PG S+WK Y+SDDALWLQVIRAK+ALL RK VDK AEKRLLIQ   L  L     +++ +
Sbjct: 301 PGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQV--LRSLAFFFRIESSD 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
           H     QR  LS ++R                                            
Sbjct: 361 H----NQRPFLSYIER-------------------------------------------E 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
             ++ L+ WV                            S  +   H   +          
Sbjct: 421 SYLLLLEPWV----------------------------SIIISKQHTTTV---------- 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                +LS +  D +                  KKV+MHPSIYE
Sbjct: 481 ---------------------SLSCVLCDNM---------------MDTKKVKMHPSIYE 540

Query: 554 DNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRALL 613
           DNID HHLSTERI CSKRS AL ESV    +NSCP V+SNCISSLTTE+GKGL KN+A+L
Sbjct: 541 DNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGL-KNQAVL 600

Query: 614 NGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNSN 673
           NGD PSE ED+ PNEDS ++ VPVGA+ QA +PEWTGN SDSDSKWLGT+SWP  H NSN
Sbjct: 601 NGDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSN 660

Query: 674 SVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISL 733
           SV DR  IGRGR DSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISL
Sbjct: 661 SVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISL 695

Query: 734 QWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRV 793
           QWT EEEKRFKELA S FNN ++CFW YSL+WFP+KS+KNLISYYFNVFLLR RSYQNRV
Sbjct: 721 QWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRV 695

Query: 794 TPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE 834
           TPNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TDVE
Sbjct: 781 TPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 695

BLAST of ClCG07G008230 vs. NCBI nr
Match: XP_022931396.1 (AT-rich interactive domain-containing protein 2-like isoform X2 [Cucurbita moschata])

HSP 1 Score: 936.8 bits (2420), Expect = 1.3e-268
Identity = 510/826 (61.74%), Positives = 568/826 (68.77%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRW +SSNASILDCNKDVDP+PSNGCCIA  C VE SY NVDYDD KA IRCYFEKILW
Sbjct: 1   MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEIGRRG +RP+PAL+GEGG+LDLFELF+VVRDKGG QVVSEK+LWSSVV+ELGLDL
Sbjct: 61  VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIKGMLFGV 193
           GLSASVKLIY KYL+D+EKWLMVRCG TKLENG+SDY Y+KS PFLSEL  KI GML+GV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180

Query: 194 LRQKSIYDECSRFKSNKPNENVNV-AAATVEK-----EIKKKEHDLHGDVTPIQQNCIDT 253
            RQ SIYDEC  FKSNK N NVNV AAA VEK     EIKKKEHDLHGDVT IQQ+C   
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCT-- 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
                ET  IHVI           E +I+S G+YR+SLL+MLKWVRKTAKHP +PLNGTI
Sbjct: 241 -----ETHPIHVI-----------EAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTI 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           PG S+WK Y+SDDALWLQVIRAK+ALL RK VDK AEKRLLIQ   L  L     +++ +
Sbjct: 301 PGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQV--LRSLAFFFRIESSD 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
           H     QR  LS ++R                                            
Sbjct: 361 H----NQRPFLSYIER-------------------------------------------E 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
             ++ L+ WV                            S  +   H   +          
Sbjct: 421 SYLLLLEPWV----------------------------SIIISKQHTTTV---------- 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                +LS +  D +                  KKV+MHPSIYE
Sbjct: 481 ---------------------SLSCVLCDNM---------------MDTKKVKMHPSIYE 540

Query: 554 DNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRALL 613
           DNID HHLSTERI CSKRS AL ESV    +NSCP V+SNCISSLTTE+GKGL KN+A+L
Sbjct: 541 DNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGL-KNQAVL 600

Query: 614 NGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNSN 673
           NGD PSE ED+ PNEDS ++ VPVGA+ QA +PEWTGN SDSDSKWLGT+SWP  H NSN
Sbjct: 601 NGDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSN 660

Query: 674 SVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISL 733
           SV DR  IGRGR DSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISL
Sbjct: 661 SVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISL 684

Query: 734 QWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRV 793
           QWT EEEKRFKELA S FNN ++CFW YSL+WFP+KS+KNLISYYFNVFLLR RSYQNRV
Sbjct: 721 QWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRV 684

Query: 794 TPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE 834
           TPNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TDVE
Sbjct: 781 TPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 684

BLAST of ClCG07G008230 vs. ExPASy Swiss-Prot
Match: Q9LDD4 (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=ARID2 PE=1 SV=1)

HSP 1 Score: 266.9 bits (681), Expect = 7.5e-70
Identity = 232/785 (29.55%), Positives = 325/785 (41.40%), Query Frame = 0

Query: 51  SYANVD---YDDYKATIRCYFEKILWVFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVV 110
           SY +V+    D+ +  +R  F++ L VFL+E    G I+P+PA++G+G ++DLF+LF++V
Sbjct: 10  SYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDGKNVDLFKLFVLV 69

Query: 111 RDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGN 170
           R++ G+  VS K LW  V  +LG D  L  S+ LIYLKYLN +EKW +        +N +
Sbjct: 70  REREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDNKD 129

Query: 171 SDYHYRKSFPFLSELEVKIKGMLFGVLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIK 230
           S+     S   L EL    K +L     QK   +    F  N   E+ +    +  K  +
Sbjct: 130 SEKKGCYS-GMLHELGNGFKSLLDNGKCQKR--NRAVAFGCNHMEESCSEFDRS-RKRFR 189

Query: 231 KKEHDLHGDVTPIQQNCIDTPRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQ 290
           + + D                 D G      VI +  ++ AV  E   D     R  L  
Sbjct: 190 ESDDD-----------------DKGVGLSSVVIREETVVCAVE-EGLSDFSLEKRDDLPG 249

Query: 291 MLKWVRKTAKHPVNPLNGTIPGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRL 350
           MLKW+   A  P +P  G IP +SKWK Y + +  WLQV RAKN+LL ++D   NAE   
Sbjct: 250 MLKWLALVATSPHDPAIGVIPHSSKWKQY-NGNKCWLQVARAKNSLLVQRD---NAEL-- 309

Query: 351 LIQTEDLNLLFLLLPVDALNHLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQV 410
                                              R+R+                     
Sbjct: 310 -----------------------------------RYRYH-------------------- 369

Query: 411 EKPFLEGGGLTLSQLINLDSKNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSF 470
             PF     +    +   D K+I                                     
Sbjct: 370 --PFRGHQNIHHPSMYEDDRKSI------------------------------------- 429

Query: 471 FLEVVHIVVLGSSFSWAFVLVLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKG 530
                                                                       
Sbjct: 430 ------------------------------------------------------------ 489

Query: 531 EKPQPTTKEKKVRMHPSIYEDNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSN 590
                       R+  SI   N+ KH  S+   CC+  S   +     T      I+ S 
Sbjct: 490 -----------GRLRYSIRPPNLSKHCSSS---CCNGSSLVSLSKSRSTKCRKLTIIASE 549

Query: 591 CISSLTTEIGKGLKKNRALLNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNIS 650
             + LT    +  K+N+A    + P              + + VG   QA V EWT +  
Sbjct: 550 -RAGLTAGTSRARKRNKA----EIPR-------------RCIKVGHQHQAQVDEWTESGV 572

Query: 651 DSDSKWLGTKSWPSPHGNS-NSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLK 710
           DSDSKWLGT+ WP  +  + +     + +G+GR DSC C+  G VEC R HIAE RM LK
Sbjct: 610 DSDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRMELK 572

Query: 711 LELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKK 770
            ELG  F+ WRF+ MGEE+ L+WT EEEKRFK++  +      Q FW+ + K FP K ++
Sbjct: 670 RELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA----DPQSFWTNAAKNFPKKKRE 572

Query: 771 NLISYYFNVFLLRQRSYQNRVTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSEN 830
            L+SYYFNVFL+ +R YQNRVTP SIDSDDE   FG + G FG  A+   GS  + C++N
Sbjct: 730 ELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGA-FGSVGGSFGRDAVTSSGSDVMICAQN 572

Query: 831 RQFTD 832
           RQ  D
Sbjct: 790 RQCED 572

BLAST of ClCG07G008230 vs. ExPASy Swiss-Prot
Match: Q84JT7 (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=ARID1 PE=2 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 1.6e-40
Identity = 184/739 (24.90%), Positives = 262/739 (35.45%), Query Frame = 0

Query: 68  FEKILWVFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVL 127
           F  +L  FL E        P+PA+ GEG ++DLF LF+ V  KGG+  VSE   W  VV 
Sbjct: 49  FRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQ 108

Query: 128 ELGLDLGLSASVKLIYLKYLNDVEKWL-MVRCGGTKLE----NGNSDYHYRKSFPFLSEL 187
           E GL+   SAS KLIY+KYL+   +WL  V  G T +     +G SD    +   FLSE+
Sbjct: 109 ESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVELSGISDALVARLNGFLSEV 168

Query: 188 EVKIKGMLFGVLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIKKKEHDLHGDVTPIQQ 247
           + K +                   +  +P + +         + K++    H        
Sbjct: 169 KKKYE------------------LRKGRPAKELGAELKWFISKTKRRYDKHHVGKESASN 228

Query: 248 NCIDTPRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNP 307
           + +      G       +E   IL++V  E       R R+  L+ LKW+   AK P +P
Sbjct: 229 DAV--KEFQGSKLAERRLEQIMILESVTQECSSPGK-RKRECPLETLKWLSDVAKDPCDP 288

Query: 308 LNGTIPGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLP 367
             G +P  S+W +Y S++  W Q+                                    
Sbjct: 289 SLGIVPDRSEWVSYGSEEP-WKQL------------------------------------ 348

Query: 368 VDALNHLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQL 427
                 LL R  RT                                              
Sbjct: 349 ------LLFRASRT---------------------------------------------- 408

Query: 428 INLDSKNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFS 487
            N DS                                                       
Sbjct: 409 -NNDSA------------------------------------------------------ 468

Query: 488 WAFVLVLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMH 547
                      C                                 EK    T +K  +MH
Sbjct: 469 -----------C---------------------------------EK----TWQKVQKMH 528

Query: 548 PSIYEDNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKK 607
           P +Y+D+    +   ER+                                     +  K+
Sbjct: 529 PCLYDDSAGASYNLRERLSY-----------------------------------EDYKR 531

Query: 608 NRALLNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWP-- 667
            +     D  S  E+++P          VG+ FQA VPEWTG   +SDSKWLGT+ WP  
Sbjct: 589 GKTGNGSDIGSSDEEDRP-------CALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLT 531

Query: 668 SPHGNSNSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHH 727
                +N + +R+ IG+GR D C C  PGS+EC +FHI   R +LKLELG  FY W F  
Sbjct: 649 KEQTKANLLIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDV 531

Query: 728 MGEEISLQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQ 787
           MGE     WT  E K+ K L  SS  + S  F   +    P KS+  ++SY++NV LL+ 
Sbjct: 709 MGECTLQYWTDLELKKIKSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQY 531

Query: 788 RSYQNRVTPNSIDSDDEDV 800
           R+ Q+R+TP+ IDSD + +
Sbjct: 769 RASQSRITPHDIDSDTDQI 531
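The percentages in each HSP summary follow directly from the printed counts (identical or positive columns divided by alignment length). The sketch below shows one way to parse a summary as it is printed on this page and recompute them. The function name `parse_hsp` and the regular expression are illustrative assumptions, not part of any BLAST toolkit; the pattern accepts both the "Positives" spelling and the "Postives" spelling seen in some exports.

```python
import re

# Hypothetical helper: matches the two-line HSP summary in the layout used here.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/\d+"
)


def parse_hsp(text):
    """Return score, E-value and recomputed percentages for one HSP summary."""
    m = HSP_RE.search(text)
    if m is None:
        raise ValueError("no HSP summary found")
    length = int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": float(m["evalue"]),
        # Percentages are recomputed from the counts rather than read back.
        "identity_pct": round(100 * int(m["ident"]) / length, 2),
        "positives_pct": round(100 * int(m["pos"]) / length, 2),
    }


if __name__ == "__main__":
    summary = (
        "HSP 1 Score: 169.5 bits (428), Expect = 1.6e-40\n"
        "Identity = 184/739 (24.90%), Positives = 262/739 (35.45%), Query Frame = 0"
    )
    print(parse_hsp(summary))
```

Recomputing from the counts gives a quick consistency check on the printed percentages for the Q84JT7 hit above.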

BLAST of ClCG07G008230 vs. ExPASy TrEMBL
Match: A0A1S3BSW2 (AT-rich interactive domain-containing protein 2 OS=Cucumis melo OX=3656 GN=LOC103493169 PE=4 SV=1)

HSP 1 Score: 1013.4 bits (2619), Expect = 5.2e-292
Identity = 540/826 (65.38%), Positives = 575/826 (69.61%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRWPISSN SILDCNKDVDP+PSNG CIAP C VEGS ANVD+DD KATIRCYFEKILW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSNGYCIAPDCLVEGSRANVDHDDCKATIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEI RRG IRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVV+ELGLDL
Sbjct: 61  VFLKEICRRGFIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSD-YHYRKSFPFLSELEVKIKGMLFG 193
           GLSASVKLIY KYL+++EKWLMVR GGTKLENGNSD Y+YRKSFP L+ELE KIK ML+G
Sbjct: 121 GLSASVKLIYFKYLSELEKWLMVRRGGTKLENGNSDYYYYRKSFPCLAELEAKIKDMLYG 180

Query: 194 VLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIK-----KKEHDLHGDVTPIQQNCIDT 253
           VLRQKSIYDE   FKSNKPN NVNVA    EKEIK     KKEHDLH DVTPIQQNC +T
Sbjct: 181 VLRQKSIYDERPGFKSNKPNGNVNVAETAAEKEIKFPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
           PR NGET+QIHVI DCR LDAVNVET+ DSHGR R+SLL+MLKWVRKTAKHP NP NGT+
Sbjct: 241 PRVNGETNQIHVIGDCRSLDAVNVETETDSHGRSRESLLRMLKWVRKTAKHPANPSNGTV 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           P +SKWKAYASDDALWLQVI+AK+ALL RKDVDK AEKRLLIQ                 
Sbjct: 301 PESSKWKAYASDDALWLQVIKAKDALLNRKDVDKTAEKRLLIQ----------------- 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
                                                                       
Sbjct: 361 ------------------------------------------------------------ 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                                            KKVRMHP IYE
Sbjct: 481 -------------------------------------------------KKVRMHPCIYE 540

Query: 554 DNI-DKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRAL 613
           DNI D HHLSTERICCS+RSNAL +S  V  NNSCP V+SN I SLTTEIGKGL KN+AL
Sbjct: 541 DNIDDNHHLSTERICCSRRSNALAKSELVASNNSCPPVRSNQIGSLTTEIGKGL-KNQAL 600

Query: 614 LNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNS 673
           LNGD  SE EDNQ NEDSV+KPVPVGALFQA +PEWTGNISDSDSKWLGT+ WPS H N+
Sbjct: 601 LNGDLASEMEDNQANEDSVEKPVPVGALFQAAIPEWTGNISDSDSKWLGTRLWPSQHENN 639

Query: 674 NSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEIS 733
            SVS+RNPIGRGRLDSC CQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEIS
Sbjct: 661 KSVSNRNPIGRGRLDSCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEIS 639

Query: 734 LQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNR 793
           LQWTAEEEKRFKELA SSFNNQ+QCFW++SLKWFP+KS+KNLISYYFNVFLLRQRSYQNR
Sbjct: 721 LQWTAEEEKRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNR 639

Query: 794 VTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV 833
           VTPN IDSDDEDVEFGCISGDFGAKAMEILGSKSVECSEN+QF D+
Sbjct: 781 VTPNDIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENKQFIDI 639

BLAST of ClCG07G008230 vs. ExPASy TrEMBL
Match: A0A0A0KZM1 (ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G047920 PE=4 SV=1)

HSP 1 Score: 990.7 bits (2560), Expect = 3.6e-285
Identity = 529/826 (64.04%), Positives = 567/826 (68.64%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRWPISSN SILDCNKDVDP+PS G CIAP C VEGS ANVD+DD KATIRCYFEK+LW
Sbjct: 1   MGRWPISSNDSILDCNKDVDPNPSYGYCIAPDCLVEGSCANVDHDDCKATIRCYFEKVLW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKE  RRG IRPVPALLGEG SLDLFELFMVVRDKGGYQVVSEKELWSSVV+ELGLDL
Sbjct: 61  VFLKETCRRGFIRPVPALLGEGESLDLFELFMVVRDKGGYQVVSEKELWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSD-YHYRKSFPFLSELEVKIKGMLFG 193
           GLSASVKLIY KYL+D+EKWLMVR GGTKLENGNSD Y+YRK+FP L+ELE KIK +L+G
Sbjct: 121 GLSASVKLIYFKYLSDLEKWLMVRRGGTKLENGNSDCYYYRKNFPCLAELEAKIKDILYG 180

Query: 194 VLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIK-----KKEHDLHGDVTPIQQNCIDT 253
           VLRQKSIYDE S FKSNKPN NVNVA    EKEIK     KKEHDLH DVTPIQQNC +T
Sbjct: 181 VLRQKSIYDERSGFKSNKPNGNVNVAETAAEKEIKSPKIEKKEHDLHEDVTPIQQNCTET 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
           PRDNG+T+QIHVI DCR  DAVNVET+ DSHG  R+SL +MLKWVRKTAKHP NP NGT+
Sbjct: 241 PRDNGKTNQIHVIGDCRSSDAVNVETETDSHGSSRESLFRMLKWVRKTAKHPANPSNGTV 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           PG+SKWKAYAS+DALWLQVI+AK+ALL RKDVDK AEKRLLIQ                 
Sbjct: 301 PGSSKWKAYASEDALWLQVIKAKDALLNRKDVDKTAEKRLLIQ----------------- 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
                                                                       
Sbjct: 361 ------------------------------------------------------------ 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                                            KKVRMHP IYE
Sbjct: 481 -------------------------------------------------KKVRMHPCIYE 540

Query: 554 DNI-DKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRAL 613
           DNI D HHLSTERICCS+RSNAL +S  V  NNSCP VQSN I SLTTEIGKGL KN+AL
Sbjct: 541 DNIDDNHHLSTERICCSRRSNALSKSESVACNNSCPPVQSNQIGSLTTEIGKGL-KNQAL 600

Query: 614 LNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNS 673
           LNGD  SE EDNQ NEDSV+KPVPVGA FQAV+PEWTGNISDSDSKWLGT+SWPS H N+
Sbjct: 601 LNGDLASEMEDNQANEDSVEKPVPVGASFQAVLPEWTGNISDSDSKWLGTRSWPSQHENN 639

Query: 674 NSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEIS 733
            SVSDRNPI RGRLD C CQFPGSVEC+RFHIAEARMRLKLELGLTFYDWRFH MGEEIS
Sbjct: 661 KSVSDRNPISRGRLDPCSCQFPGSVECYRFHIAEARMRLKLELGLTFYDWRFHQMGEEIS 639

Query: 734 LQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNR 793
           LQWTAEEE RFKELA SSFNNQ+QCFW++SLKWFP+KS+KNLISYYFNVFLLRQRSYQNR
Sbjct: 721 LQWTAEEENRFKELAISSFNNQNQCFWNHSLKWFPMKSRKNLISYYFNVFLLRQRSYQNR 639

Query: 794 VTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDV 833
           VTPN IDSD EDVEFGCISGDFGAKAME+LGSK VECSEN+QF  +
Sbjct: 781 VTPNDIDSDGEDVEFGCISGDFGAKAMEVLGSKFVECSENKQFIGI 639

BLAST of ClCG07G008230 vs. ExPASy TrEMBL
Match: A0A6J1EZB1 (AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111437591 PE=4 SV=1)

HSP 1 Score: 958.7 bits (2477), Expect = 1.5e-275
Identity = 518/826 (62.71%), Positives = 577/826 (69.85%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRW +SSNASILDCNKDVDP+PSNGCCIA  C VE SY NVDYDD KA IRCYFEKILW
Sbjct: 1   MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEIGRRG +RP+PAL+GEGG+LDLFELF+VVRDKGG QVVSEK+LWSSVV+ELGLDL
Sbjct: 61  VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIKGMLFGV 193
           GLSASVKLIY KYL+D+EKWLMVRCG TKLENG+SDY Y+KS PFLSEL  KI GML+GV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180

Query: 194 LRQKSIYDECSRFKSNKPNENVNV-AAATVEK-----EIKKKEHDLHGDVTPIQQNCIDT 253
            RQ SIYDEC  FKSNK N NVNV AAA VEK     EIKKKEHDLHGDVT IQQ+C   
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCT-- 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
                ET  IHVIED + LDAVNVE +I+S G+YR+SLL+MLKWVRKTAKHP +PLNGTI
Sbjct: 241 -----ETHPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTI 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           PG S+WK Y+SDDALWLQVIRAK+ALL RK VDK AEKRLLIQ   L  L     +++ +
Sbjct: 301 PGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQV--LRSLAFFFRIESSD 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
           H     QR  LS ++R                                            
Sbjct: 361 H----NQRPFLSYIER-------------------------------------------E 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
             ++ L+ WV                            S  +   H   +          
Sbjct: 421 SYLLLLEPWV----------------------------SIIISKQHTTTV---------- 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                +LS +  D +                  KKV+MHPSIYE
Sbjct: 481 ---------------------SLSCVLCDNM---------------MDTKKVKMHPSIYE 540

Query: 554 DNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRALL 613
           DNID HHLSTERI CSKRS AL ESV    +NSCP V+SNCISSLTTE+GKGL KN+A+L
Sbjct: 541 DNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGL-KNQAVL 600

Query: 614 NGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNSN 673
           NGD PSE ED+ PNEDS ++ VPVGA+ QA +PEWTGN SDSDSKWLGT+SWP  H NSN
Sbjct: 601 NGDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSN 660

Query: 674 SVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISL 733
           SV DR  IGRGR DSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISL
Sbjct: 661 SVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISL 695

Query: 734 QWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRV 793
           QWT EEEKRFKELA S FNN ++CFW YSL+WFP+KS+KNLISYYFNVFLLR RSYQNRV
Sbjct: 721 QWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRV 695

Query: 794 TPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE 834
           TPNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TDVE
Sbjct: 781 TPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 695

BLAST of ClCG07G008230 vs. ExPASy TrEMBL
Match: A0A6J1ETJ6 (AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111437591 PE=4 SV=1)

HSP 1 Score: 936.8 bits (2420), Expect = 6.2e-269
Identity = 510/826 (61.74%), Positives = 568/826 (68.77%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRW +SSNASILDCNKDVDP+PSNGCCIA  C VE SY NVDYDD KA IRCYFEKILW
Sbjct: 1   MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVERSYENVDYDDCKARIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEIGRRG +RP+PAL+GEGG+LDLFELF+VVRDKGG QVVSEK+LWSSVV+ELGLDL
Sbjct: 61  VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIKGMLFGV 193
           GLSASVKLIY KYL+D+EKWLMVRCG TKLENG+SDY Y+KS PFLSEL  KI GML+GV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180

Query: 194 LRQKSIYDECSRFKSNKPNENVNV-AAATVEK-----EIKKKEHDLHGDVTPIQQNCIDT 253
            RQ SIYDEC  FKSNK N NVNV AAA VEK     EIKKKEHDLHGDVT IQQ+C   
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAAVEKEIKFPEIKKKEHDLHGDVTSIQQDCT-- 240

Query: 254 PRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTI 313
                ET  IHVI           E +I+S G+YR+SLL+MLKWVRKTAKHP +PLNGTI
Sbjct: 241 -----ETHPIHVI-----------EAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTI 300

Query: 314 PGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALN 373
           PG S+WK Y+SDDALWLQVIRAK+ALL RK VDK AEKRLLIQ   L  L     +++ +
Sbjct: 301 PGTSRWKGYSSDDALWLQVIRAKDALLIRKGVDKIAEKRLLIQV--LRSLAFFFRIESSD 360

Query: 374 HLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDS 433
           H     QR  LS ++R                                            
Sbjct: 361 H----NQRPFLSYIER-------------------------------------------E 420

Query: 434 KNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVL 493
             ++ L+ WV                            S  +   H   +          
Sbjct: 421 SYLLLLEPWV----------------------------SIIISKQHTTTV---------- 480

Query: 494 VLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYE 553
                                +LS +  D +                  KKV+MHPSIYE
Sbjct: 481 ---------------------SLSCVLCDNM---------------MDTKKVKMHPSIYE 540

Query: 554 DNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKKNRALL 613
           DNID HHLSTERI CSKRS AL ESV    +NSCP V+SNCISSLTTE+GKGL KN+A+L
Sbjct: 541 DNIDNHHLSTERISCSKRSKALTESVLAPCSNSCPTVRSNCISSLTTEVGKGL-KNQAVL 600

Query: 614 NGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNSN 673
           NGD PSE ED+ PNEDS ++ VPVGA+ QA +PEWTGN SDSDSKWLGT+SWP  H NSN
Sbjct: 601 NGDIPSEMEDDHPNEDSAEETVPVGAVCQADLPEWTGNNSDSDSKWLGTRSWPLQHRNSN 660

Query: 674 SVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISL 733
           SV DR  IGRGR DSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISL
Sbjct: 661 SVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISL 684

Query: 734 QWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRV 793
           QWT EEEKRFKELA S FNN ++CFW YSL+WFP+KS+KNLISYYFNVFLLR RSYQNRV
Sbjct: 721 QWTVEEEKRFKELAMSGFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRV 684

Query: 794 TPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE 834
           TPNSIDSDDED EFG +SG FG KAMEILGS S+ECS NRQ TDVE
Sbjct: 781 TPNSIDSDDEDFEFGRVSGGFGDKAMEILGSNSLECSINRQVTDVE 684

BLAST of ClCG07G008230 vs. ExPASy TrEMBL
Match: A0A6J1J644 (AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111482924 PE=4 SV=1)

HSP 1 Score: 936.0 bits (2418), Expect = 1.1e-268
Identity = 505/826 (61.14%), Positives = 551/826 (66.71%), Query Frame = 0

Query: 14  MGRWPISSNASILDCNKDVDPSPSNGCCIAPACSVEGSYANVDYDDYKATIRCYFEKILW 73
           MGRW +SSNASILDCNKDVDP+PSNGCCIA  C VEG+YANVDYDD KA IRCYFEKILW
Sbjct: 1   MGRWHVSSNASILDCNKDVDPNPSNGCCIASDCLVEGTYANVDYDDCKARIRCYFEKILW 60

Query: 74  VFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVLELGLDL 133
           VFLKEIGRRG +RP+PAL+GEGG+LDLFELF+VVRDKGG QVVSEK+LWSSVV+ELGLDL
Sbjct: 61  VFLKEIGRRGFVRPLPALIGEGGALDLFELFLVVRDKGGSQVVSEKKLWSSVVVELGLDL 120

Query: 134 GLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGNSDYHYRKSFPFLSELEVKIKGMLFGV 193
           GLSASVKLIY KYL+D+EKWLMVRCG TKLENG+SDY Y+KS PFLSEL  KI GML+GV
Sbjct: 121 GLSASVKLIYSKYLSDLEKWLMVRCGDTKLENGSSDYCYKKSSPFLSELGAKINGMLYGV 180

Query: 194 LRQKSIYDECSRFKSNKPNENVNVAAATVEK-----EIKKKEHDLHGDVTPIQQNCIDTP 253
            RQ SIYDEC  FKSNK N NVNVAAA VEK     EIKKKEHDLHGDVTPIQQ+C    
Sbjct: 181 PRQNSIYDECFGFKSNKQNGNVNVAAAAVEKEIKFSEIKKKEHDLHGDVTPIQQDCT--- 240

Query: 254 RDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNPLNGTIP 313
               ET  IHVIED + LDAVNVE +I+S G+YR+SLL+MLKWVRKTAKHP +PLNGTI 
Sbjct: 241 ----ETHPIHVIEDGQSLDAVNVEAEIESLGKYRESLLRMLKWVRKTAKHPEDPLNGTIL 300

Query: 314 GASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLPVDALNH 373
           GAS+WK Y+SDDALWLQVI AK+ALL RK VDK AEKRLLIQ                  
Sbjct: 301 GASRWKGYSSDDALWLQVISAKDALLIRKGVDKIAEKRLLIQ------------------ 360

Query: 374 LLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQLINLDSK 433
                                                                       
Sbjct: 361 ------------------------------------------------------------ 420

Query: 434 NIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFSWAFVLV 493
                                                                       
Sbjct: 421 ------------------------------------------------------------ 480

Query: 494 LDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMHPSIYED 553
                                                           KKV+MHPSIYED
Sbjct: 481 ------------------------------------------------KKVKMHPSIYED 540

Query: 554 NIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCI-SSLTTEIGKGLKKNRALL 613
           NID H LSTERI CSKR  A  ESVF T +NSCP V+SNCI SSLTTE+GKGL KN+A+L
Sbjct: 541 NIDNHRLSTERISCSKRFKASTESVFATCSNSCPTVRSNCISSSLTTEVGKGL-KNQAVL 600

Query: 614 NGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWPSPHGNSN 673
           NGD PSE ED+ PNEDS ++ VPVGAL QA +PEWTGN SDSDSKWLGT+ WP  H NSN
Sbjct: 601 NGDIPSEMEDDHPNEDSAEETVPVGALCQADLPEWTGNNSDSDSKWLGTRLWPLQHRNSN 632

Query: 674 SVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISL 733
           SV DR  IGRGR DSC CQFPGSVECFRFHIAEARMRLKLELG TF+ WRFH MGEEISL
Sbjct: 661 SVRDRRAIGRGRPDSCGCQFPGSVECFRFHIAEARMRLKLELGSTFFAWRFHQMGEEISL 632

Query: 734 QWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRV 793
           QWTAEEEKRFKELA SSFNN ++CFW YSL+WFP+KS+KNLISYYFNVFLLR RSYQNRV
Sbjct: 721 QWTAEEEKRFKELAMSSFNNHNRCFWDYSLRWFPMKSRKNLISYYFNVFLLRLRSYQNRV 632

Query: 794 TPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSENRQFTDVE 834
           TPNSIDSDDED EFGC+SG FG KAME+LGSKS+ECS NRQ TDVE
Sbjct: 781 TPNSIDSDDEDFEFGCVSGGFGDKAMEVLGSKSLECSINRQVTDVE 632

BLAST of ClCG07G008230 vs. TAIR 10
Match: AT4G11400.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein )

HSP 1 Score: 266.9 bits (681), Expect = 5.3e-71
Identity = 232/785 (29.55%), Positives = 325/785 (41.40%), Query Frame = 0

Query: 51  SYANVD---YDDYKATIRCYFEKILWVFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVV 110
           SY +V+    D+ +  +R  F++ L VFL+E    G I+P+PA++G+G ++DLF+LF++V
Sbjct: 10  SYVDVEIKYVDECEERLRRLFDQALLVFLEE---EGSIKPLPAVIGDGKNVDLFKLFVLV 69

Query: 111 RDKGGYQVVSEKELWSSVVLELGLDLGLSASVKLIYLKYLNDVEKWLMVRCGGTKLENGN 170
           R++ G+  VS K LW  V  +LG D  L  S+ LIYLKYLN +EKW +        +N +
Sbjct: 70  REREGFDTVSRKRLWEVVAEKLGFDCSLVPSLILIYLKYLNRMEKWAVEESRIVNWDNKD 129

Query: 171 SDYHYRKSFPFLSELEVKIKGMLFGVLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIK 230
           S+     S   L EL    K +L     QK   +    F  N   E+ +    +  K  +
Sbjct: 130 SEKKGCYS-GMLHELGNGFKSLLDNGKCQKR--NRAVAFGCNHMEESCSEFDRS-RKRFR 189

Query: 231 KKEHDLHGDVTPIQQNCIDTPRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQ 290
           + + D                 D G      VI +  ++ AV  E   D     R  L  
Sbjct: 190 ESDDD-----------------DKGVGLSSVVIREETVVCAVE-EGLSDFSLEKRDDLPG 249

Query: 291 MLKWVRKTAKHPVNPLNGTIPGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRL 350
           MLKW+   A  P +P  G IP +SKWK Y + +  WLQV RAKN+LL ++D   NAE   
Sbjct: 250 MLKWLALVATSPHDPAIGVIPHSSKWKQY-NGNKCWLQVARAKNSLLVQRD---NAEL-- 309

Query: 351 LIQTEDLNLLFLLLPVDALNHLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQV 410
                                              R+R+                     
Sbjct: 310 -----------------------------------RYRYH-------------------- 369

Query: 411 EKPFLEGGGLTLSQLINLDSKNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSF 470
             PF     +    +   D K+I                                     
Sbjct: 370 --PFRGHQNIHHPSMYEDDRKSI------------------------------------- 429

Query: 471 FLEVVHIVVLGSSFSWAFVLVLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKG 530
                                                                       
Sbjct: 430 ------------------------------------------------------------ 489

Query: 531 EKPQPTTKEKKVRMHPSIYEDNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSN 590
                       R+  SI   N+ KH  S+   CC+  S   +     T      I+ S 
Sbjct: 490 -----------GRLRYSIRPPNLSKHCSSS---CCNGSSLVSLSKSRSTKCRKLTIIASE 549

Query: 591 CISSLTTEIGKGLKKNRALLNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNIS 650
             + LT    +  K+N+A    + P              + + VG   QA V EWT +  
Sbjct: 550 -RAGLTAGTSRARKRNKA----EIPR-------------RCIKVGHQHQAQVDEWTESGV 572

Query: 651 DSDSKWLGTKSWPSPHGNS-NSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLK 710
           DSDSKWLGT+ WP  +  + +     + +G+GR DSC C+  G VEC R HIAE RM LK
Sbjct: 610 DSDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRMELK 572

Query: 711 LELGLTFYDWRFHHMGEEISLQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKK 770
            ELG  F+ WRF+ MGEE+ L+WT EEEKRFK++  +      Q FW+ + K FP K ++
Sbjct: 670 RELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIA----DPQSFWTNAAKNFPKKKRE 572

Query: 771 NLISYYFNVFLLRQRSYQNRVTPNSIDSDDEDVEFGCISGDFGAKAMEILGSKSVECSEN 830
            L+SYYFNVFL+ +R YQNRVTP SIDSDDE   FG + G FG  A+   GS  + C++N
Sbjct: 730 ELVSYYFNVFLINRRRYQNRVTPKSIDSDDEGA-FGSVGGSFGRDAVTSSGSDVMICAQN 572

Query: 831 RQFTD 832
           RQ  D
Sbjct: 790 RQCED 572

BLAST of ClCG07G008230 vs. TAIR 10
Match: AT2G46040.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein )

HSP 1 Score: 169.5 bits (428), Expect = 1.2e-41
Identity = 184/739 (24.90%), Positives = 262/739 (35.45%), Query Frame = 0

Query: 68  FEKILWVFLKEIGRRGLIRPVPALLGEGGSLDLFELFMVVRDKGGYQVVSEKELWSSVVL 127
           F  +L  FL E        P+PA+ GEG ++DLF LF+ V  KGG+  VSE   W  VV 
Sbjct: 49  FRPLLDSFLAEFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQ 108

Query: 128 ELGLDLGLSASVKLIYLKYLNDVEKWL-MVRCGGTKLE----NGNSDYHYRKSFPFLSEL 187
           E GL+   SAS KLIY+KYL+   +WL  V  G T +     +G SD    +   FLSE+
Sbjct: 109 ESGLESYDSASAKLIYVKYLDAFGRWLNRVVAGDTDVSSVELSGISDALVARLNGFLSEV 168

Query: 188 EVKIKGMLFGVLRQKSIYDECSRFKSNKPNENVNVAAATVEKEIKKKEHDLHGDVTPIQQ 247
           + K +                   +  +P + +         + K++    H        
Sbjct: 169 KKKYE------------------LRKGRPAKELGAELKWFISKTKRRYDKHHVGKESASN 228

Query: 248 NCIDTPRDNGETDQIHVIEDCRILDAVNVETDIDSHGRYRKSLLQMLKWVRKTAKHPVNP 307
           + +      G       +E   IL++V  E       R R+  L+ LKW+   AK P +P
Sbjct: 229 DAV--KEFQGSKLAERRLEQIMILESVTQECSSPGK-RKRECPLETLKWLSDVAKDPCDP 288

Query: 308 LNGTIPGASKWKAYASDDALWLQVIRAKNALLTRKDVDKNAEKRLLIQTEDLNLLFLLLP 367
             G +P  S+W +Y S++  W Q+                                    
Sbjct: 289 SLGIVPDRSEWVSYGSEEP-WKQL------------------------------------ 348

Query: 368 VDALNHLLERVQRTCLSILDRWRHEFKLICCTLILHVVVEEIDQVEKPFLEGGGLTLSQL 427
                 LL R  RT                                              
Sbjct: 349 ------LLFRASRT---------------------------------------------- 408

Query: 428 INLDSKNIVFLDKWVLEILIGGDFAFGVKLLWARAVFRLMTNPSFFLEVVHIVVLGSSFS 487
            N DS                                                       
Sbjct: 409 -NNDSA------------------------------------------------------ 468

Query: 488 WAFVLVLDDVDCVFRSRGGLKVGLVGALSLLFLDKIYIHIGKIKGEKPQPTTKEKKVRMH 547
                      C                                 EK    T +K  +MH
Sbjct: 469 -----------C---------------------------------EK----TWQKVQKMH 528

Query: 548 PSIYEDNIDKHHLSTERICCSKRSNALVESVFVTHNNSCPIVQSNCISSLTTEIGKGLKK 607
           P +Y+D+    +   ER+                                     +  K+
Sbjct: 529 PCLYDDSAGASYNLRERLSY-----------------------------------EDYKR 531

Query: 608 NRALLNGDFPSETEDNQPNEDSVDKPVPVGALFQAVVPEWTGNISDSDSKWLGTKSWP-- 667
            +     D  S  E+++P          VG+ FQA VPEWTG   +SDSKWLGT+ WP  
Sbjct: 589 GKTGNGSDIGSSDEEDRP-------CALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLT 531

Query: 668 SPHGNSNSVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHH 727
                +N + +R+ IG+GR D C C  PGS+EC +FHI   R +LKLELG  FY W F  
Sbjct: 649 KEQTKANLLIERDRIGKGRQDPCGCHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDV 531

Query: 728 MGEEISLQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQ 787
           MGE     WT  E K+ K L  SS  + S  F   +    P KS+  ++SY++NV LL+ 
Sbjct: 709 MGECTLQYWTDLELKKIKSL-MSSPPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQY 531

Query: 788 RSYQNRVTPNSIDSDDEDV 800
           R+ Q+R+TP+ IDSD + +
Sbjct: 769 RASQSRITPHDIDSDTDQI 531

BLAST of ClCG07G008230 vs. TAIR 10
Match: AT5G04110.1 (DNA GYRASE B3 )

HSP 1 Score: 131.7 bits (330), Expect = 2.7e-30
Identity = 72/180 (40.00%), Positives = 104/180 (57.78%), Query Frame = 0

Query: 629 VPVGALFQAVVPEWT---------GNISDSDS-KWLGTKSWPSPHGNSNSVSDRNPIGRG 688
           +P+G  FQA +P W          G+  DS++ +WLGT  WP+ +    +V  +  +G G
Sbjct: 361 IPIGPRFQAEIPVWIAPTKKGKFYGSPGDSNTLRWLGTGVWPT-YSLKKTVHSKK-VGEG 420

Query: 689 RLDSCRCQFPGSVECFRFHIAEARMRLKLELGLTFYDWRFHHMGEEISLQ-WTAEEEKRF 748
           R DSC C  P S  C + H  EA+  L+ E+   F  W F  MGEEI L+ WTA+EE+RF
Sbjct: 421 RSDSCSCASPRSTNCIKRHKKEAQELLEKEINRAFSTWEFDQMGEEIVLKSWTAKEERRF 480

Query: 749 KELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRVTPNSIDSDDE 798
           + L   +  + S  FW ++   FP KSKK+L+SYY+NVFL+++         N+IDSDD+
Sbjct: 481 EALVKKNPLSSSDGFWEFASNAFPQKSKKDLLSYYYNVFLIKRMRLLKSSAANNIDSDDD 538

BLAST of ClCG07G008230 vs. TAIR 10
Match: AT1G26580.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; BEST Arabidopsis thaliana protein match is: ELM2 domain-containing protein (TAIR:AT2G03470.1); Has 161 Blast hits to 161 proteins in 17 species: Archae - 0; Bacteria - 0; Metazoa - 1; Fungi - 4; Plants - 156; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 99.4 bits (246), Expect = 1.5e-20
Identity = 73/206 (35.44%), Positives = 103/206 (50.00%), Query Frame = 0

Query: 627 KPVPVGALFQAVVPEW----TGNISDS-------------DSKWLGTKSWPSPHGNSNSV 686
           K VP+G   QA +PEW    TGNI  S               K  GT   P P G +   
Sbjct: 133 KQVPIGPGHQAEIPEWEGSQTGNIETSGMSVQNHISGCADGEKLFGTSVIPMP-GLTTVA 192

Query: 687 SDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELG-LTFYDWRFHHMGEEISLQ 746
              + +G+GR   C C+   SV C   HI EAR  L    G  TF +     MGE+ +L+
Sbjct: 193 HIDDIVGKGR-KFCVCRDRDSVRCVCQHIKEAREELVKTFGNETFKELGLCEMGEKGALK 252

Query: 747 WTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNRVT 806
           W+ E+ + F E+  S+     Q FW +    F  +++K ++S+YFNVF+LR+R+ QNR  
Sbjct: 253 WSDEDAQLFHEVVYSNPVTLGQNFWRHLEAAFCSRTQKEIVSFYFNVFVLRRRAIQNRAF 312

Query: 807 PNSIDSDDEDVEFGCISGDFGAKAME 815
              IDSDD++   GC  G  G + +E
Sbjct: 313 ILDIDSDDDEWH-GCYGGSSGTRYVE 335

BLAST of ClCG07G008230 vs. TAIR 10
Match: AT2G03470.2 (ELM2 domain-containing protein )

HSP 1 Score: 92.4 bits (228), Expect = 1.8e-18
Identity = 54/132 (40.91%), Positives = 74/132 (56.06%), Query Frame = 0

Query: 668 SVSDRNPIGRGRLDSCRCQFPGSVECFRFHIAEARMRLKLELGL-TFYDWRFHHMGEEIS 727
           S SD    G+GR + C C   GS+ C R HI EAR  L   +G   F +     MGEE++
Sbjct: 165 SDSDLCGTGQGRKE-CLCLDKGSIRCVRRHIIEARESLVETIGYERFMELGLCEMGEEVA 224

Query: 728 LQWTAEEEKRFKELATSSFNNQSQCFWSYSLKWFPLKSKKNLISYYFNVFLLRQRSYQNR 787
             WT EEE  F ++  S+  +  + FW      FP ++ K L+SYYFNVF+LR+R  QNR
Sbjct: 225 SLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRTMKELVSYYFNVFILRRRGIQNR 284

Query: 788 VTPNSIDSDDED 799
                +DSDD++
Sbjct: 285 FKALDVDSDDDE 295
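In the alignment blocks above, the middle row repeats the residue where query and subject are identical and shows `+` where the substitution is scored as conservative. A minimal sketch of how the identity and positive counts could be tallied from one row of such a block is given here. The function name `alignment_stats` is hypothetical, and the code assumes the three strings have already been stripped of the `Query:`/`Sbjct:` labels and coordinates and are of equal length.

```python
def alignment_stats(query, midline, subject):
    """Count columns, identities and positives for one alignment row.

    Assumes '-' marks a gap and that the midline uses '+' for a
    conservative substitution, as in the blocks on this page.
    """
    if not (len(query) == len(midline) == len(subject)):
        raise ValueError("rows must have equal length")
    # An identity is a column where both residues match and neither is a gap.
    identities = sum(q == s and q != "-" for q, s in zip(query, subject))
    # Positives are identities plus the '+' columns of the midline.
    positives = identities + midline.count("+")
    return {"columns": len(query), "identities": identities, "positives": positives}


if __name__ == "__main__":
    # Final row of the AT2G03470.2 alignment (query 788..799, subject 285..295).
    stats = alignment_stats("VTPNSIDSDDED", "     +DSDD++", "FKALDVDSDDDE")
    print(stats)
```

Summing these per-row counts over every row of an HSP should reproduce the totals printed in its "Identity" and "Positives" summary line.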

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038893741.1 | 2.4e-299 | 65.99 | AT-rich interactive domain-containing protein 2 [Benincasa hispida]
XP_008452043.1 | 1.1e-291 | 65.38 | PREDICTED: AT-rich interactive domain-containing protein 2 [Cucumis melo]
XP_004146560.2 | 7.5e-285 | 64.04 | AT-rich interactive domain-containing protein 2 [Cucumis sativus] >KGN53331.1 hy...
XP_022931395.1 | 3.2e-275 | 62.71 | AT-rich interactive domain-containing protein 2-like isoform X1 [Cucurbita mosch...
XP_022931396.1 | 1.3e-268 | 61.74 | AT-rich interactive domain-containing protein 2-like isoform X2 [Cucurbita mosch...

Match Name | E-value | Identity (%) | Description
Q9LDD4 | 7.5e-70 | 29.55 | AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 ...
Q84JT7 | 1.6e-40 | 24.90 | AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 ...

Match Name | E-value | Identity (%) | Description
A0A1S3BSW2 | 5.2e-292 | 65.38 | AT-rich interactive domain-containing protein 2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A0A0KZM1 | 3.6e-285 | 64.04 | ARID domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G047920 PE=4 S...
A0A6J1EZB1 | 1.5e-275 | 62.71 | AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita mos...
A0A6J1ETJ6 | 6.2e-269 | 61.74 | AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita mos...
A0A6J1J644 | 1.1e-268 | 61.14 | AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita max...

Match Name | E-value | Identity (%) | Description
AT4G11400.1 | 5.3e-71 | 29.55 | ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT2G46040.1 | 1.2e-41 | 24.90 | ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT5G04110.1 | 2.7e-30 | 40.00 | DNA GYRASE B3
AT1G26580.1 | 1.5e-20 | 35.44 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow...
AT2G03470.2 | 1.8e-18 | 40.91 | ELM2 domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Watermelon (Charleston Gray) v2.5
Date Performed: 2022-01-31
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 212..232
None | No IPR available | SMART | SM01014 | ARID_2 | coord: 51..151; e-value: 3.4E-7; score: 39.9
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 607..627
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 653..673
None | No IPR available | PANTHER | PTHR46410 | AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2 | coord: 531..832
None | No IPR available | PANTHER | PTHR46410:SF2 | AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2 | coord: 531..832
None | No IPR available | PANTHER | PTHR46410 | AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2 | coord: 14..345
None | No IPR available | PANTHER | PTHR46410:SF2 | AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2 | coord: 14..345
None | No IPR available | CDD | cd16100 | ARID | coord: 68..151; e-value: 5.80986E-14; score: 65.842
IPR001606 | ARID DNA-binding domain | SMART | SM00501 | bright_3 | coord: 62..156; e-value: 1.9E-11; score: 54.1
IPR001606 | ARID DNA-binding domain | PFAM | PF01388 | ARID | coord: 81..148; e-value: 1.7E-8; score: 35.0
IPR001606 | ARID DNA-binding domain | PROSITE | PS51011 | ARID | coord: 62..155; score: 17.816027
IPR036431 | ARID DNA-binding domain superfamily | GENE3D | 1.10.150.60 | - | coord: 55..159; e-value: 1.2E-12; score: 49.3
IPR036431 | ARID DNA-binding domain superfamily | SUPERFAMILY | 46774 | ARID-like | coord: 65..155
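Several member databases place the ARID domain at slightly different coordinates. As an illustration only, the following sketch summarizes those hits as an outer span (covered by at least one source) and a core span (covered by every source). The coordinates are copied from the ARID-related rows above; the name `span_summary` is a hypothetical helper, and treating the outer span as one contiguous region assumes the hits all overlap, which is the case here.

```python
# Protein coordinates (start, end) of the ARID-related hits listed above.
arid_hits = {
    "SMART SM01014": (51, 151),
    "CDD cd16100": (68, 151),
    "SMART SM00501": (62, 156),
    "PFAM PF01388": (81, 148),
    "PROSITE PS51011": (62, 155),
    "GENE3D 1.10.150.60": (55, 159),
    "SUPERFAMILY 46774": (65, 155),
}


def span_summary(hits):
    """Return (outer, core) spans for a dict of name -> (start, end).

    outer: smallest start to largest end.
    core:  largest start to smallest end, or None if no position is shared.
    """
    starts = [start for start, _ in hits.values()]
    ends = [end for _, end in hits.values()]
    outer = (min(starts), max(ends))
    core = (max(starts), min(ends))
    if core[0] > core[1]:
        core = None
    return outer, core


if __name__ == "__main__":
    outer, core = span_summary(arid_hits)
    print("outer span:", outer)
    print("core span:", core)
```

The core span is bounded by the narrowest prediction, which in this table is the Pfam PF01388 hit.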

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
ClCG07G008230.2 | ClCG07G008230.2 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003677 | DNA binding