HG10010514 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10010514
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: AT-rich interactive domain-containing protein 1-like
Location: Chr06: 22866253 .. 22870646 (-)
RNA-Seq Expression: HG10010514
Synteny: HG10010514
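
The Location field above can be unpacked programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF3), which this page does not state explicitly; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates (GFF3-style).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr06: 22866253 .. 22870646 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("Chr06: 22866253 .. 22870646 (-)")
print(loc.chrom, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 4394 bases on the minus strand of Chr06.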
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTGTCGGATGAATTCATGAATTGTGTTGCCTTTGACGAGAATTCAATTTCAGATTTTTCCAAAAGAATACTCGATATGAAGAGTGATAGGATGAAGAGTGGGGATGTATTTCCTCGCAGAAGCAAGAAACTCAAGGATATTGATAATTGCAACATTGAAAATGAGGATATAAAAATTATTGATCCACCTTTCATCAAGGAAACGAATGTTTTCCAGGAGAGTAAGCAGGAACCAGTGTTGGGACTGCTTGATTGGCTTAAGGGTATTGCAAGAAACCCCTGTGATCCTTCAATATGTTCATTACCAGAAAAGTCAAAATGGAAATCATATGGAAATGAAGAGATTTGGAAACAAGTTTTGCTGGTTCGAGAGGAAATGTTTGTGAAAAGACAAGTTGATTCAAGCAGCGAACAATCTTTTGTGCAGGTACTTTACATAAATGCTCAAGTTGTTTGCATTTATATCTTTACTCATTTAAAATTGCCAATTCACATGACAGGGTAATAAAAAACTGTCTATATCTGTGCATGTATGAAGCTTGTCTCATGAAAAAGCTGATTGTGGTAGCCTTTAGGCTGCTTAAGTCTTTTATTGTTTTTAATATCATTGGTCCATCTTGAAATTATTTTCAAACTATGCATTTTGATTAGTATATATTTACATCTTATATAATGTTCTGGTCACTGTACTGAAGATCCTCATTGTGAAAGCCAATATGGAGCTAAGACTTAATCTGAGAATGTATCATTAAATTACGGCAGTAATTTTTTTTTTATCTGATCTATAGTATCTATCGTGTTAGTTGTTAATGTTATGAGCCTCTTTAGTTTTTAGGAACTTCTGCATTGTCCTTTCTTTATGTTGTTTTAAAGGTTCCATGACCTTGTGTGTATAACTAAAGAATCCTTATCAATTCTGAATTTTATTTGCCTTCTTAAGAAGATACTCGGAGTATTAGCTGTTTAAAATTGTGGTTCCTTTATTAAGAAGAAAATTAAATTATTTAAGACATCGATTTCATTTAGCCTGGTTCTGTTGTGGCCAATAGCTTGAAGCTTCAGCCGCTACTTTTGGTTTTTGATCTGCTTGTAGTCACTAGTTGTTTGAAAATGTTCTTAAAACTATCAAAAGTGCCTTGATAACATTTTTTTAGGTTGGGTGATTATAGATTCATGAATGATTTTAGTACTCTGATAGGAATTAGGAAGTACTTTACTAGAAACACCAAGCAAACATTACTAATTTAGAATTATTTTACCCTAGGAAATGTAGAATAGTAGCTACAATGCGATATTTACAGATTTTATCAGCTATCCAAAATTGTAAAGTTAACGCCCAATATTAACCTAGTGAGTGAATTTTAATTTTATTATTTTTTGTATTCATCTTAAACTTGATTACATATTTTTTTTATTTGATAAGAATAACACCTTTTATTGAGAAAAAAAATGAAAGAATACATGGAGGGATTCTCTTACAAGAAGGGACCCAACTATACAAGATCATACTAATAGAATAATTACAAAAAATCTTCAAAACTGAAGCTCACTGGAAAACATGAAAGTGAACAAGGGCCCATAACTCTCTATAATCTCTCTCAACACCCCCAAACACCCTATTGTTCCGCTTGCTCCATAGAACCCAAATAACTGCACAAACACGCAAGCCACAAAAACCGACCTCTCTCCCCAAACGGTGGATTGAGGAGGAACTCCTCGAACATATCCATGACAGCTCTATGACGAACATACGACATACCAAACGTCTAGAAAAAAGAATCACAAACATAGCTCGCGAACTCACAACTACATGTTGAAATTAGTACCTCAAATTTTAATTAGAAAATTTTCCACAATGTAACTGATAAATTATTGGTTAAATTGCAAATTTGGTCCTTATGGTTTGAAGAAAGTTAGTATTTAGTTTTTGTAGTTTAAAAACTTAGTCATCCTTATGGTTTGATAAAACCTCATAAATAGTTCTTATGGTTTGATAAA
ACCTAATAAATAGTTCTTATTGTTTGATAAAACCTAGGGACTATTTATGACAGTTCTATCAAACCATAGAGACAAATTCTAACCTTTAAACTATAGAGACTAAAATTTTATTTTTTTCCAAACCATTAGGACCAAATTTGTAATCTGACCTAAATTATATAAAATTCAACCAATAACAACCATAGGGACAAATTCTTTACCTTTAAACCATAGGGACTAAAATTTTACTTTTTTCAAACTATAGGGACCAAATTTGTAAATTGACCTAAATTATATAAAATTCAACCTCTAGACAACCATATAGATAAATTCTAACTTTTAAATCGTGGAGACTAAAATTTAACCTTCTCCAAATTATAGGGACTAAATTTGTAATTTAACTTAAATTATATATAAAGAAAGTTCAGCCTGTAAACAATCATAGAGACGCAATCTTAATTTTTCTTTCGTTCCTTAAGCACAAGAAATTATTTTGTACTTATAGCCTAGGTCAATTTCTCCATAATGGAGGCGGTGCTTCATTAGTTTCGGTTGGAGGGGGGTTGCTCCCCCCCCCCCCCCCCCCCCAGCTGTCTCTTTTTGTTTGCTTAATGTATTTTTGTTTTTTTTTAAAAAAAAAAAAGAAAGAAAAAACAACTGTAATCTAGACTAAGAAATATTTCTATCCAACCATTAAAATATCTAAGTACAGCCTCAAAATTGAGCGAATTTAAAAGTCCCCATAACTCAGTTAAATAGCAACCAAAATCATGAAAACTAATACCAACTAAAAATACTCTTTCAATCATAGGAGCTACAAAAATCATTAAAGAAAAATTGATCATTTTATCTTTTCTTAATTTGATTCCAACCATTGAAGAACTCGAGATTCAAAATTTGGAGAATGTCCATGAATCTTGCAATTGAAAACGTGTACTATTTTACCATAAATCATAAATTGATAATAAAATTATATTTTACCCTGCTCTATTTTGGTTTTTGAACTAAGAAAATTTTGTTCTTTTTTTCCTTGCTAATTTTGGCCATGTATTTAACTTAAATCAGGCTATACTTCTAGTCCTAATACTGAATTTGATACCTGTTCTACTATAAAATATGAATACATTTAGTGTCATCTGAGAGGTGTATTACAGCATAGAAATTAAACAACGATAAAGAAAGATAGTTATGCAAAGTCTGAGGACCAAAACAAATTATTTGACAACAAGACAATCAAAGCAAATGATTATTGAAGTTTAGGGATCAAAATAAGATTAAAATCTTTTTTCTTTTTTCTCTAGAATTTAGAACTTATATTAGTTCCTTTATTGATGTTGAGTTTAAATGATTTGTTATTACATCAGTATGTTTGAATATTTATTATACTGTGTATACTGTATAGAAACGAATTAAATTCAAAGGAAAGGAAGAATAAATGAAACAAAAAGTTAACATTTCTATATAGGAAAAATTGATTAATGGTAAATGTGTAATTTTTTTTTTTAAAAAGGATTTAACAGATCTAAACAACATAATGAGATCATTATTTTTCCATCTACTTATTCTCCTTCCCAGTTGACCACATCATTTCTATACTTCAGCTTAAAAGTCTTAATTTAATTATTATCTGGTTCACTTCATCTTCTCTATTTTTTGCAGAGAAATCAGAGGATGCATCCTTGTATGTACGACGATGATACGGTTCCAATTTACAATCTTAGGAAGAGATTAAGCCTTGACAAGAAGGATCTTTCTCAAGAACCTGTTTCTAAAACAAGTGATTCCTCACCTACAGATTCATCAGATGATTACAAGCCCGTTCCTTTGGGGTCAGATTATCAAGCTCAAGTACCGGAATGGAATGGTGTGATATCCGAGAGTGATTTAAAGTGGTTGGGAACTCAAGAATGGCCCTTGAAGAAAGGAAGGAATAGATATCTAGTTGAAAGGGATCCCATTGGAAAAGGAAGGCGAGATCCTTGTGGGTGCTTGGATGCCAGTTCGGTTGGCTGTGTCAAATT
TCACATTGCTGAGAAAAGGCACAGATTGAAAATTGAGTTGGGCGACGCATTCCTCCGATGGAGATTTGATAAGATGGGGGAAGATGTTACATTTGCTTGGACAGTGGAGGATGAGAAAAAGTTCGAGGACATAGTGTCGTCGAACCCTCCATCCCTCGGGATATCTTATTGGGAAGATATCATTGAGTCATTTCCTTCTAGGAGCAAGGCAGATCTTGTTAGCTACTACTATAATGTCTTTCTTTTGCGTCGCAGAGGACACCAAAATCGGGTTACGCCAGATGAAATCGATAGTGATGAGGAATCAGATTCCGGAATTGCAACCAATGGATTTGGAAACGAAGTGCATAATTCACCTGGCTCCATTTTCTACTCTCCTAAGAAGCCACGATAA

mRNA sequence

ATGTTGTCGGATGAATTCATGAATTGTGTTGCCTTTGACGAGAATTCAATTTCAGATTTTTCCAAAAGAATACTCGATATGAAGAGTGATAGGATGAAGAGTGGGGATGTATTTCCTCGCAGAAGCAAGAAACTCAAGGATATTGATAATTGCAACATTGAAAATGAGGATATAAAAATTATTGATCCACCTTTCATCAAGGAAACGAATGTTTTCCAGGAGAGTAAGCAGGAACCAGTGTTGGGACTGCTTGATTGGCTTAAGGGTATTGCAAGAAACCCCTGTGATCCTTCAATATGTTCATTACCAGAAAAGTCAAAATGGAAATCATATGGAAATGAAGAGATTTGGAAACAAGTTTTGCTGGTTCGAGAGGAAATGTTTGTGAAAAGACAAGTTGATTCAAGCAGCGAACAATCTTTTGTGCAGAGAAATCAGAGGATGCATCCTTGTATGTACGACGATGATACGGTTCCAATTTACAATCTTAGGAAGAGATTAAGCCTTGACAAGAAGGATCTTTCTCAAGAACCTGTTTCTAAAACAAGTGATTCCTCACCTACAGATTCATCAGATGATTACAAGCCCGTTCCTTTGGGGTCAGATTATCAAGCTCAAGTACCGGAATGGAATGGTGTGATATCCGAGAGTGATTTAAAGTGGTTGGGAACTCAAGAATGGCCCTTGAAGAAAGGAAGGAATAGATATCTAGTTGAAAGGGATCCCATTGGAAAAGGAAGGCGAGATCCTTGTGGGTGCTTGGATGCCAGTTCGGTTGGCTGTGTCAAATTTCACATTGCTGAGAAAAGGCACAGATTGAAAATTGAGTTGGGCGACGCATTCCTCCGATGGAGATTTGATAAGATGGGGGAAGATGTTACATTTGCTTGGACAGTGGAGGATGAGAAAAAGTTCGAGGACATAGTGTCGTCGAACCCTCCATCCCTCGGGATATCTTATTGGGAAGATATCATTGAGTCATTTCCTTCTAGGAGCAAGGCAGATCTTGTTAGCTACTACTATAATGTCTTTCTTTTGCGTCGCAGAGGACACCAAAATCGGGTTACGCCAGATGAAATCGATAGTGATGAGGAATCAGATTCCGGAATTGCAACCAATGGATTTGGAAACGAAGTGCATAATTCACCTGGCTCCATTTTCTACTCTCCTAAGAAGCCACGATAA

Coding sequence (CDS)

ATGTTGTCGGATGAATTCATGAATTGTGTTGCCTTTGACGAGAATTCAATTTCAGATTTTTCCAAAAGAATACTCGATATGAAGAGTGATAGGATGAAGAGTGGGGATGTATTTCCTCGCAGAAGCAAGAAACTCAAGGATATTGATAATTGCAACATTGAAAATGAGGATATAAAAATTATTGATCCACCTTTCATCAAGGAAACGAATGTTTTCCAGGAGAGTAAGCAGGAACCAGTGTTGGGACTGCTTGATTGGCTTAAGGGTATTGCAAGAAACCCCTGTGATCCTTCAATATGTTCATTACCAGAAAAGTCAAAATGGAAATCATATGGAAATGAAGAGATTTGGAAACAAGTTTTGCTGGTTCGAGAGGAAATGTTTGTGAAAAGACAAGTTGATTCAAGCAGCGAACAATCTTTTGTGCAGAGAAATCAGAGGATGCATCCTTGTATGTACGACGATGATACGGTTCCAATTTACAATCTTAGGAAGAGATTAAGCCTTGACAAGAAGGATCTTTCTCAAGAACCTGTTTCTAAAACAAGTGATTCCTCACCTACAGATTCATCAGATGATTACAAGCCCGTTCCTTTGGGGTCAGATTATCAAGCTCAAGTACCGGAATGGAATGGTGTGATATCCGAGAGTGATTTAAAGTGGTTGGGAACTCAAGAATGGCCCTTGAAGAAAGGAAGGAATAGATATCTAGTTGAAAGGGATCCCATTGGAAAAGGAAGGCGAGATCCTTGTGGGTGCTTGGATGCCAGTTCGGTTGGCTGTGTCAAATTTCACATTGCTGAGAAAAGGCACAGATTGAAAATTGAGTTGGGCGACGCATTCCTCCGATGGAGATTTGATAAGATGGGGGAAGATGTTACATTTGCTTGGACAGTGGAGGATGAGAAAAAGTTCGAGGACATAGTGTCGTCGAACCCTCCATCCCTCGGGATATCTTATTGGGAAGATATCATTGAGTCATTTCCTTCTAGGAGCAAGGCAGATCTTGTTAGCTACTACTATAATGTCTTTCTTTTGCGTCGCAGAGGACACCAAAATCGGGTTACGCCAGATGAAATCGATAGTGATGAGGAATCAGATTCCGGAATTGCAACCAATGGATTTGGAAACGAAGTGCATAATTCACCTGGCTCCATTTTCTACTCTCCTAAGAAGCCACGATAA

Protein sequence

MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKIIDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQVLLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVSKTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVERDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEIDSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR
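
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and the two inputs are the first 18 and last 15 bases of the CDS listed above.

```python
# Standard genetic code, ordered by first, second, third base over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTGTCGGATGAATTC"))  # first six codons of the CDS
print(translate("AAGAAGCCACGATAA"))     # last five codons, ending in TAA
```

The two fragments translate to MLSDEF and KKPR, matching the start and end of the protein sequence shown above.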
Homology
BLAST of HG10010514 vs. NCBI nr
Match: XP_038875899.1 (AT-rich interactive domain-containing protein 1-like [Benincasa hispida])

HSP 1 Score: 781.9 bits (2018), Expect = 2.5e-222
Identity = 376/394 (95.43%), Positives = 387/394 (98.22%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMNCV+FDENSISDFSKRIL+MKSDRMKSGDVFPRRSKKLKDID+CN ENEDIKI
Sbjct: 1   MLSDEFMNCVSFDENSISDFSKRILNMKSDRMKSGDVFPRRSKKLKDIDHCNTENEDIKI 60

Query: 61  IDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQV 120
           IDPPFIKETNV QESKQEP+LGLLDWLKGIARNPCDPSICSLPEKSKWKSYG EEIWKQV
Sbjct: 61  IDPPFIKETNVAQESKQEPMLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGKEEIWKQV 120

Query: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVS 180
           LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLS DKKDLSQEPVS
Sbjct: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSFDKKDLSQEPVS 180

Query: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240
           K SDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLK+GR+RYLVER
Sbjct: 181 KISDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKRGRHRYLVER 240

Query: 241 DPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVE 300
           DPIGKGRRD CGCLDA+SVGCVKFH+AEKRHRLK+ELG+ FL+WRFDKMGEDVTFAWTVE
Sbjct: 241 DPIGKGRRDSCGCLDANSVGCVKFHVAEKRHRLKLELGNTFLQWRFDKMGEDVTFAWTVE 300

Query: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360
           DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI
Sbjct: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360

Query: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR
Sbjct: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 394
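
The percentages in each HSP summary line are simply the matched count divided by the alignment length. A minimal sketch of recomputing them from the summary line of the hit above follows; `parse_hsp_stats` is a hypothetical helper, and the regular expression accepts any label spelling in front of the fraction.

```python
import re


def parse_hsp_stats(line: str) -> dict:
    """Return {label: percentage} for every 'Label = n/total' pair in the line."""
    stats = {}
    for label, matched, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[label] = round(100 * int(matched) / int(total), 2)
    return stats


line = "Identity = 376/394 (95.43%), Positives = 387/394 (98.22%), Query Frame = 0"
print(parse_hsp_stats(line))
```

For this hit the recomputed values agree with the reported 95.43% identity and 98.22% positives.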

BLAST of HG10010514 vs. NCBI nr
Match: XP_004138586.1 (AT-rich interactive domain-containing protein 1 [Cucumis sativus] >XP_011656344.1 AT-rich interactive domain-containing protein 1 [Cucumis sativus] >KGN45671.1 hypothetical protein Csa_005371 [Cucumis sativus])

HSP 1 Score: 743.4 bits (1918), Expect = 9.9e-211
Identity = 351/395 (88.86%), Positives = 381/395 (96.46%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMNC+AFD+NSISDFSKRI DMKSDRMKSGDVFPRRSKK KDI++CN EN D++I
Sbjct: 1   MLSDEFMNCIAFDDNSISDFSKRIFDMKSDRMKSGDVFPRRSKKFKDIEHCNTENGDVQI 60

Query: 61  IDPPFIKETNVFQE-SKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQ 120
           IDPPF+KETNV QE SKQEP+LGLLDWLKGIARNPCDPSI SLPEKSKWKSYGNEEIWKQ
Sbjct: 61  IDPPFVKETNVVQEKSKQEPMLGLLDWLKGIARNPCDPSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPV 180
           VL+VREEMF+KRQVDSSSEQSF+Q+NQ+MHPCMYDDDT PIYNLRKRLSLDKKDLSQEPV
Sbjct: 121 VLVVREEMFLKRQVDSSSEQSFMQKNQKMHPCMYDDDTAPIYNLRKRLSLDKKDLSQEPV 180

Query: 181 SKTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVE 240
           SK SDSSPTDS DDYKPVPLGSDYQA+VPEWNGVIS+SDLKWLGTQ+WPLKKGRNRYLVE
Sbjct: 181 SKASDSSPTDSLDDYKPVPLGSDYQARVPEWNGVISKSDLKWLGTQDWPLKKGRNRYLVE 240

Query: 241 RDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTV 300
           RDPIG+GRRDPCGC+D +SVGCV+FH++EKRH+LK+ELGDAFL+WRFDKMGE+VTFAWTV
Sbjct: 241 RDPIGRGRRDPCGCMDPNSVGCVQFHVSEKRHKLKLELGDAFLQWRFDKMGEEVTFAWTV 300

Query: 301 EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE 360
           +DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+E
Sbjct: 301 DDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVGYYYNVFLLRRRGHQNRVTPNE 360

Query: 361 IDSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           I+SDEES+SG ATNGFGNEVHNS GSIFYSPKKPR
Sbjct: 361 INSDEESESGTATNGFGNEVHNSSGSIFYSPKKPR 395

BLAST of HG10010514 vs. NCBI nr
Match: XP_008458204.1 (PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Cucumis melo] >XP_008458205.1 PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Cucumis melo] >XP_016902234.1 PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Cucumis melo])

HSP 1 Score: 737.3 bits (1902), Expect = 7.1e-209
Identity = 359/395 (90.89%), Positives = 377/395 (95.44%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMNCVAFDENSISDFSKRILDMKSDRMK+GD FPRRSKKLKDI + NIENED++I
Sbjct: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKTGDAFPRRSKKLKDIAHRNIENEDVQI 60

Query: 61  IDPPFIKETNVFQE-SKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQ 120
           IDPP IKETNV QE SKQEP+LGLLDWLK IARNPCD SI SLPEKSKWKSYGNEEIWKQ
Sbjct: 61  IDPPLIKETNVVQEKSKQEPMLGLLDWLKVIARNPCDSSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPV 180
           VL+VREEMFVKRQVDSSSEQS VQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKD+SQEPV
Sbjct: 121 VLVVREEMFVKRQVDSSSEQSSVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDVSQEPV 180

Query: 181 SKTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVE 240
           SK +DSSPTDS DDYKPV LGSDYQA+VPEWNGVISESDLKWLGTQ+WPLKKGRNRYLVE
Sbjct: 181 SKANDSSPTDSLDDYKPVLLGSDYQARVPEWNGVISESDLKWLGTQDWPLKKGRNRYLVE 240

Query: 241 RDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTV 300
           RDPIGKGRRDPCGC+DA+SVGCV+FH++EKR RLK+ELGDAFLRWRFD+MGEDVT AWTV
Sbjct: 241 RDPIGKGRRDPCGCMDANSVGCVQFHVSEKRQRLKLELGDAFLRWRFDEMGEDVTLAWTV 300

Query: 301 EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE 360
           EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE
Sbjct: 301 EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE 360

Query: 361 IDSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           IDSDEES+SG AT  FGNEVHNSPGSIFYSPKKPR
Sbjct: 361 IDSDEESESGTATIRFGNEVHNSPGSIFYSPKKPR 395

BLAST of HG10010514 vs. NCBI nr
Match: KAG7013748.1 (AT-rich interactive domain-containing protein 1, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 723.8 bits (1867), Expect = 8.1e-205
Identity = 346/394 (87.82%), Positives = 370/394 (93.91%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMN VAFDENSISDFSKRIL MKSD+MKSGDVFPRRSKKLKDI++CN ENEDIKI
Sbjct: 1   MLSDEFMNYVAFDENSISDFSKRILAMKSDKMKSGDVFPRRSKKLKDINHCNNENEDIKI 60

Query: 61  IDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQV 120
           ID   +KE N   E K+EP+LGLL+WLKGIARNPCDPS+CSLPEKSKWKSYGNEEIWKQV
Sbjct: 61  IDLRLVKEPNFTPERKEEPMLGLLNWLKGIARNPCDPSVCSLPEKSKWKSYGNEEIWKQV 120

Query: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVS 180
           LLVREEMFVKRQVDSSSEQS VQRNQRMHPCMYDDDTVPIYNLRKRLS +KKDLSQEPVS
Sbjct: 121 LLVREEMFVKRQVDSSSEQSLVQRNQRMHPCMYDDDTVPIYNLRKRLSFEKKDLSQEPVS 180

Query: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240
           KTSD SPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER
Sbjct: 181 KTSDFSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240

Query: 241 DPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVE 300
           DPIG+GRRDPCGC DA+SVGCVKFH+ EKRH++K+ELG+AFL+WRFDKMGEDVTF+WT +
Sbjct: 241 DPIGRGRRDPCGCFDANSVGCVKFHVTEKRHKVKLELGNAFLQWRFDKMGEDVTFSWTAD 300

Query: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360
           DEKKFEDIV SNPPSLGIS+W++IIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+EI
Sbjct: 301 DEKKFEDIVKSNPPSLGISFWDEIIESFPSRSKADLVCYYYNVFLLRRRGHQNRVTPNEI 360

Query: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           DSDEES+SGI TNG  NEVHNS  SIFYSPKKPR
Sbjct: 361 DSDEESESGIVTNGLRNEVHNSSDSIFYSPKKPR 394

BLAST of HG10010514 vs. NCBI nr
Match: XP_023547658.1 (AT-rich interactive domain-containing protein 1 [Cucurbita pepo subsp. pepo] >XP_023547659.1 AT-rich interactive domain-containing protein 1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 721.8 bits (1862), Expect = 3.1e-204
Identity = 344/394 (87.31%), Positives = 371/394 (94.16%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLS+EFMN VAFDENSISDFSKRIL MKSD+MKSGDVFPRRSKKLKDI++CN E EDIKI
Sbjct: 1   MLSEEFMNYVAFDENSISDFSKRILAMKSDKMKSGDVFPRRSKKLKDINHCNNETEDIKI 60

Query: 61  IDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQV 120
           ID   +KE N   E K+EP+LGLL+WLKGIARNPCDPS+CSLPEKSKWKSYGNEEIWKQV
Sbjct: 61  IDLRLVKEPNFTPERKEEPMLGLLNWLKGIARNPCDPSVCSLPEKSKWKSYGNEEIWKQV 120

Query: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVS 180
           LLVREEMFVKRQVDSSSEQS VQRNQRMHPCMYDDDTVPIYNLRKRLS +KKDLSQEPVS
Sbjct: 121 LLVREEMFVKRQVDSSSEQSLVQRNQRMHPCMYDDDTVPIYNLRKRLSFEKKDLSQEPVS 180

Query: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240
           KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWP+KKGRNRYLVER
Sbjct: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPMKKGRNRYLVER 240

Query: 241 DPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVE 300
           DPIG+GRRDPCGC DA+SVGCVKFH+ EKRH++K+ELG+AFL+WRFDKMGEDVTF+WT +
Sbjct: 241 DPIGRGRRDPCGCFDANSVGCVKFHVTEKRHKVKLELGNAFLQWRFDKMGEDVTFSWTAD 300

Query: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360
           DEKKFEDIV+SNPPSLGIS+W++IIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+EI
Sbjct: 301 DEKKFEDIVTSNPPSLGISFWDEIIESFPSRSKADLVCYYYNVFLLRRRGHQNRVTPNEI 360

Query: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           DSDEES+SGI TNG  NEVHNS  SIFYSPKKPR
Sbjct: 361 DSDEESESGIVTNGLRNEVHNSSDSIFYSPKKPR 394

BLAST of HG10010514 vs. ExPASy Swiss-Prot
Match: Q84JT7 (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=ARID1 PE=2 SV=1)

HSP 1 Score: 262.3 bits (669), Expect = 8.8e-69
Identity = 139/293 (47.44%), Positives = 190/293 (64.85%), Query Frame = 0

Query: 76  KQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQVLLVREEMFVKRQVDS 135
           K+E  L  L WL  +A++PCDPS+  +P++S+W SYG+EE WKQ+LL R     +   DS
Sbjct: 246 KRECPLETLKWLSDVAKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFRAS---RTNNDS 305

Query: 136 SSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVSKTSDSSPTDSSDDYK 195
           + E+++ Q+ Q+MHPC+YDD     YNLR+RLS +  D  +      SD   +D  D  +
Sbjct: 306 ACEKTW-QKVQKMHPCLYDDSAGASYNLRERLSYE--DYKRGKTGNGSDIGSSDEED--R 365

Query: 196 PVPL-GSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNR--YLVERDPIGKGRRDPCG 255
           P  L GS +QA+VPEW G+  ESD KWLGT+ WPL K + +   L+ERD IGKGR+DPCG
Sbjct: 366 PCALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANLLIERDRIGKGRQDPCG 425

Query: 256 CLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSN 315
           C +  S+ CVKFHI  KR +LK+ELG AF  W FD MGE     WT  + KK + ++SS 
Sbjct: 426 CHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQYWTDLELKKIKSLMSS- 485

Query: 316 PPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEIDSDEE 366
           PPSL  ++        PS+S+  +VSY+YNV LL+ R  Q+R+TP +IDSD +
Sbjct: 486 PPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRITPHDIDSDTD 529

BLAST of HG10010514 vs. ExPASy Swiss-Prot
Match: Q9LDD4 (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=ARID2 PE=1 SV=1)

HSP 1 Score: 209.9 bits (533), Expect = 5.2e-53
Identity = 124/350 (35.43%), Positives = 178/350 (50.86%), Query Frame = 0

Query: 72  FQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQVLLVREEMFVKR 131
           F   K++ + G+L WL  +A +P DP+I  +P  SKWK Y   + W QV   +  + V+R
Sbjct: 214 FSLEKRDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKCWLQVARAKNSLLVQR 273

Query: 132 QVDSSSEQSFVQRNQR--MHPCMYDDDTVPIYNLR--------------------KRLSL 191
                  +    R  +   HP MY+DD   I  LR                      +SL
Sbjct: 274 DNAELRYRYHPFRGHQNIHHPSMYEDDRKSIGRLRYSIRPPNLSKHCSSSCCNGSSLVSL 333

Query: 192 DKKD---------LSQEPVSKTSDSSPTDSSD----DYKPVPLGSDYQAQVPEWNGVISE 251
            K           ++ E    T+ +S     +      + + +G  +QAQV EW     +
Sbjct: 334 SKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKVGHQHQAQVDEWTESGVD 393

Query: 252 SDLKWLGTQEWPLKKGRN-RYLVERDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKI 311
           SD KWLGT+ WP +        +  D +GKGR D C C  +  V C + HIAEKR  LK 
Sbjct: 394 SDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRMELKR 453

Query: 312 ELGDAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKAD 371
           ELGD F  WRF++MGE+V   WT E+EK+F+D++ ++P     S+W +  ++FP + + +
Sbjct: 454 ELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIADPQ----SFWTNAAKNFPKKKREE 513

Query: 372 LVSYYYNVFLLRRRGHQNRVTPDEIDSDEESDSGIATNGFGNEVHNSPGS 386
           LVSYY+NVFL+ RR +QNRVTP  IDSD+E   G     FG +   S GS
Sbjct: 514 LVSYYFNVFLINRRRYQNRVTPKSIDSDDEGAFGSVGGSFGRDAVTSSGS 559

BLAST of HG10010514 vs. ExPASy TrEMBL
Match: A0A0A0KD11 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G004620 PE=4 SV=1)

HSP 1 Score: 743.4 bits (1918), Expect = 4.8e-211
Identity = 351/395 (88.86%), Positives = 381/395 (96.46%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMNC+AFD+NSISDFSKRI DMKSDRMKSGDVFPRRSKK KDI++CN EN D++I
Sbjct: 1   MLSDEFMNCIAFDDNSISDFSKRIFDMKSDRMKSGDVFPRRSKKFKDIEHCNTENGDVQI 60

Query: 61  IDPPFIKETNVFQE-SKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQ 120
           IDPPF+KETNV QE SKQEP+LGLLDWLKGIARNPCDPSI SLPEKSKWKSYGNEEIWKQ
Sbjct: 61  IDPPFVKETNVVQEKSKQEPMLGLLDWLKGIARNPCDPSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPV 180
           VL+VREEMF+KRQVDSSSEQSF+Q+NQ+MHPCMYDDDT PIYNLRKRLSLDKKDLSQEPV
Sbjct: 121 VLVVREEMFLKRQVDSSSEQSFMQKNQKMHPCMYDDDTAPIYNLRKRLSLDKKDLSQEPV 180

Query: 181 SKTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVE 240
           SK SDSSPTDS DDYKPVPLGSDYQA+VPEWNGVIS+SDLKWLGTQ+WPLKKGRNRYLVE
Sbjct: 181 SKASDSSPTDSLDDYKPVPLGSDYQARVPEWNGVISKSDLKWLGTQDWPLKKGRNRYLVE 240

Query: 241 RDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTV 300
           RDPIG+GRRDPCGC+D +SVGCV+FH++EKRH+LK+ELGDAFL+WRFDKMGE+VTFAWTV
Sbjct: 241 RDPIGRGRRDPCGCMDPNSVGCVQFHVSEKRHKLKLELGDAFLQWRFDKMGEEVTFAWTV 300

Query: 301 EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE 360
           +DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+E
Sbjct: 301 DDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVGYYYNVFLLRRRGHQNRVTPNE 360

Query: 361 IDSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           I+SDEES+SG ATNGFGNEVHNS GSIFYSPKKPR
Sbjct: 361 INSDEESESGTATNGFGNEVHNSSGSIFYSPKKPR 395

BLAST of HG10010514 vs. ExPASy TrEMBL
Match: A0A1S3C8K0 (AT-rich interactive domain-containing protein 1-like isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497705 PE=4 SV=1)

HSP 1 Score: 737.3 bits (1902), Expect = 3.4e-209
Identity = 359/395 (90.89%), Positives = 377/395 (95.44%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMNCVAFDENSISDFSKRILDMKSDRMK+GD FPRRSKKLKDI + NIENED++I
Sbjct: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKTGDAFPRRSKKLKDIAHRNIENEDVQI 60

Query: 61  IDPPFIKETNVFQE-SKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQ 120
           IDPP IKETNV QE SKQEP+LGLLDWLK IARNPCD SI SLPEKSKWKSYGNEEIWKQ
Sbjct: 61  IDPPLIKETNVVQEKSKQEPMLGLLDWLKVIARNPCDSSISSLPEKSKWKSYGNEEIWKQ 120

Query: 121 VLLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPV 180
           VL+VREEMFVKRQVDSSSEQS VQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKD+SQEPV
Sbjct: 121 VLVVREEMFVKRQVDSSSEQSSVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDVSQEPV 180

Query: 181 SKTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVE 240
           SK +DSSPTDS DDYKPV LGSDYQA+VPEWNGVISESDLKWLGTQ+WPLKKGRNRYLVE
Sbjct: 181 SKANDSSPTDSLDDYKPVLLGSDYQARVPEWNGVISESDLKWLGTQDWPLKKGRNRYLVE 240

Query: 241 RDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTV 300
           RDPIGKGRRDPCGC+DA+SVGCV+FH++EKR RLK+ELGDAFLRWRFD+MGEDVT AWTV
Sbjct: 241 RDPIGKGRRDPCGCMDANSVGCVQFHVSEKRQRLKLELGDAFLRWRFDEMGEDVTLAWTV 300

Query: 301 EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE 360
           EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE
Sbjct: 301 EDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDE 360

Query: 361 IDSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           IDSDEES+SG AT  FGNEVHNSPGSIFYSPKKPR
Sbjct: 361 IDSDEESESGTATIRFGNEVHNSPGSIFYSPKKPR 395

BLAST of HG10010514 vs. ExPASy TrEMBL
Match: A0A6J1H398 (AT-rich interactive domain-containing protein 1 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111460098 PE=4 SV=1)

HSP 1 Score: 720.7 bits (1859), Expect = 3.3e-204
Identity = 345/394 (87.56%), Positives = 370/394 (93.91%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMN VAFDENSISDFSKRIL MKSD+MKSGDVFPRRSKKLKDI++CN E+EDIKI
Sbjct: 1   MLSDEFMNYVAFDENSISDFSKRILAMKSDKMKSGDVFPRRSKKLKDINHCNNESEDIKI 60

Query: 61  IDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQV 120
           ID   +KE N   E K+E +LGLL+WLKGIARNPCDPS+CSLPEKSKWKSYGNEEIWKQV
Sbjct: 61  IDLRLVKEPNFTPERKEERMLGLLNWLKGIARNPCDPSVCSLPEKSKWKSYGNEEIWKQV 120

Query: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVS 180
           LLVREEMFVKRQVDSSSEQS VQRNQRMHPCMYDDD VPIYNLRKRLS +KKDLSQEPVS
Sbjct: 121 LLVREEMFVKRQVDSSSEQSLVQRNQRMHPCMYDDDMVPIYNLRKRLSFEKKDLSQEPVS 180

Query: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240
           KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER
Sbjct: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240

Query: 241 DPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVE 300
           DPIG+GRRDPCGC DA+SVGCVKFH+ EKRH++K+ELGDAFL+WRFDKMGEDVTF+WT +
Sbjct: 241 DPIGRGRRDPCGCFDANSVGCVKFHVTEKRHKVKLELGDAFLQWRFDKMGEDVTFSWTAD 300

Query: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360
           DEKKFEDIV+SNPPSLGIS+W++IIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+EI
Sbjct: 301 DEKKFEDIVTSNPPSLGISFWDEIIESFPSRSKADLVCYYYNVFLLRRRGHQNRVTPNEI 360

Query: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           DSDEES+SGI TNG  NEVHNS  SIFYSPKKPR
Sbjct: 361 DSDEESESGIVTNGLRNEVHNSSDSIFYSPKKPR 394

BLAST of HG10010514 vs. ExPASy TrEMBL
Match: A0A6J1H4K9 (AT-rich interactive domain-containing protein 1 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111460098 PE=4 SV=1)

HSP 1 Score: 720.7 bits (1859), Expect = 3.3e-204
Identity = 345/394 (87.56%), Positives = 370/394 (93.91%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMN VAFDENSISDFSKRIL MKSD+MKSGDVFPRRSKKLKDI++CN E+EDIKI
Sbjct: 1   MLSDEFMNYVAFDENSISDFSKRILAMKSDKMKSGDVFPRRSKKLKDINHCNNESEDIKI 60

Query: 61  IDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQV 120
           ID   +KE N   E K+E +LGLL+WLKGIARNPCDPS+CSLPEKSKWKSYGNEEIWKQV
Sbjct: 61  IDLRLVKEPNFTPERKEERMLGLLNWLKGIARNPCDPSVCSLPEKSKWKSYGNEEIWKQV 120

Query: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVS 180
           LLVREEMFVKRQVDSSSEQS VQRNQRMHPCMYDDD VPIYNLRKRLS +KKDLSQEPVS
Sbjct: 121 LLVREEMFVKRQVDSSSEQSLVQRNQRMHPCMYDDDMVPIYNLRKRLSFEKKDLSQEPVS 180

Query: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240
           KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER
Sbjct: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240

Query: 241 DPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVE 300
           DPIG+GRRDPCGC DA+SVGCVKFH+ EKRH++K+ELGDAFL+WRFDKMGEDVTF+WT +
Sbjct: 241 DPIGRGRRDPCGCFDANSVGCVKFHVTEKRHKVKLELGDAFLQWRFDKMGEDVTFSWTAD 300

Query: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360
           DEKKFEDIV+SNPPSLGIS+W++IIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+EI
Sbjct: 301 DEKKFEDIVTSNPPSLGISFWDEIIESFPSRSKADLVCYYYNVFLLRRRGHQNRVTPNEI 360

Query: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           DSDEES+SGI TNG  NEVHNS  SIFYSPKKPR
Sbjct: 361 DSDEESESGIVTNGLRNEVHNSSDSIFYSPKKPR 394

BLAST of HG10010514 vs. ExPASy TrEMBL
Match: A0A6J1L5C2 (AT-rich interactive domain-containing protein 1-like OS=Cucurbita maxima OX=3661 GN=LOC111499270 PE=4 SV=1)

HSP 1 Score: 716.8 bits (1849), Expect = 4.8e-203
Identity = 342/394 (86.80%), Positives = 370/394 (93.91%), Query Frame = 0

Query: 1   MLSDEFMNCVAFDENSISDFSKRILDMKSDRMKSGDVFPRRSKKLKDIDNCNIENEDIKI 60
           MLSDEFMN VAFDENSISDFSKRIL MKSD+MKSGDVFPRRSKKLKDI++CN ENEDIKI
Sbjct: 1   MLSDEFMNYVAFDENSISDFSKRILAMKSDKMKSGDVFPRRSKKLKDINHCNNENEDIKI 60

Query: 61  IDPPFIKETNVFQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQV 120
           ID   +KE N   E K+EP+LGLL+WLKGIARNPCDPS+CSLPEKSKWKSYGNEEIWKQV
Sbjct: 61  IDLRLVKEPNFTPERKEEPLLGLLNWLKGIARNPCDPSVCSLPEKSKWKSYGNEEIWKQV 120

Query: 121 LLVREEMFVKRQVDSSSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVS 180
           LLVRE+MFVKRQVDSSSEQS VQRNQRMHPCMYD+DTVPIYNLRKRLS +KKDLSQEPVS
Sbjct: 121 LLVREKMFVKRQVDSSSEQSLVQRNQRMHPCMYDNDTVPIYNLRKRLSFEKKDLSQEPVS 180

Query: 181 KTSDSSPTDSSDDYKPVPLGSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNRYLVER 240
           KTSDSSPTDSSDDYKPVPLGSDYQ QVPEWNGVISESDLKWLGTQ+WPLKK RNRYLVER
Sbjct: 181 KTSDSSPTDSSDDYKPVPLGSDYQTQVPEWNGVISESDLKWLGTQDWPLKKVRNRYLVER 240

Query: 241 DPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVE 300
           DPIG+GRRDPCGC DA+SVGCVKFH+ EKRH++K+ELGDAFL+WRFDKMGEDVTF+WT +
Sbjct: 241 DPIGRGRRDPCGCFDANSVGCVKFHVTEKRHKVKLELGDAFLQWRFDKMGEDVTFSWTAD 300

Query: 301 DEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEI 360
           DEKKFEDIV+SNPPSLGIS+W++IIESFPSRSKADLV YYYNVFLLRRRGHQNRVTP+EI
Sbjct: 301 DEKKFEDIVTSNPPSLGISFWDEIIESFPSRSKADLVCYYYNVFLLRRRGHQNRVTPNEI 360

Query: 361 DSDEESDSGIATNGFGNEVHNSPGSIFYSPKKPR 395
           DSDEES+SGI TNG  N+VHNS  SIFYSPKKPR
Sbjct: 361 DSDEESESGIVTNGLLNQVHNSSDSIFYSPKKPR 394

BLAST of HG10010514 vs. TAIR 10
Match: AT2G46040.1 (ARID/BRIGHT DNA-binding domain; ELM2 domain protein)

HSP 1 Score: 262.3 bits (669), Expect = 6.2e-70
Identity = 139/293 (47.44%), Positives = 190/293 (64.85%), Query Frame = 0

Query: 76  KQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQVLLVREEMFVKRQVDS 135
           K+E  L  L WL  +A++PCDPS+  +P++S+W SYG+EE WKQ+LL R     +   DS
Sbjct: 246 KRECPLETLKWLSDVAKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFRAS---RTNNDS 305

Query: 136 SSEQSFVQRNQRMHPCMYDDDTVPIYNLRKRLSLDKKDLSQEPVSKTSDSSPTDSSDDYK 195
           + E+++ Q+ Q+MHPC+YDD     YNLR+RLS +  D  +      SD   +D  D  +
Sbjct: 306 ACEKTW-QKVQKMHPCLYDDSAGASYNLRERLSYE--DYKRGKTGNGSDIGSSDEED--R 365

Query: 196 PVPL-GSDYQAQVPEWNGVISESDLKWLGTQEWPLKKGRNR--YLVERDPIGKGRRDPCG 255
           P  L GS +QA+VPEW G+  ESD KWLGT+ WPL K + +   L+ERD IGKGR+DPCG
Sbjct: 366 PCALVGSKFQAKVPEWTGITPESDSKWLGTRIWPLTKEQTKANLLIERDRIGKGRQDPCG 425

Query: 256 CLDASSVGCVKFHIAEKRHRLKIELGDAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSN 315
           C +  S+ CVKFHI  KR +LK+ELG AF  W FD MGE     WT  + KK + ++SS 
Sbjct: 426 CHNPGSIECVKFHITAKRDKLKLELGPAFYMWCFDVMGECTLQYWTDLELKKIKSLMSS- 485

Query: 316 PPSLGISYWEDIIESFPSRSKADLVSYYYNVFLLRRRGHQNRVTPDEIDSDEE 366
           PPSL  ++        PS+S+  +VSY+YNV LL+ R  Q+R+TP +IDSD +
Sbjct: 486 PPSLSPAFIHQAKMILPSKSRGKIVSYFYNVTLLQYRASQSRITPHDIDSDTD 529

BLAST of HG10010514 vs. TAIR 10
Match: AT4G11400.1 (ARID/BRIGHT DNA-binding domain; ELM2 domain protein)

HSP 1 Score: 209.9 bits (533), Expect = 3.7e-54
Identity = 124/350 (35.43%), Positives = 178/350 (50.86%), Query Frame = 0

Query: 72  FQESKQEPVLGLLDWLKGIARNPCDPSICSLPEKSKWKSYGNEEIWKQVLLVREEMFVKR 131
           F   K++ + G+L WL  +A +P DP+I  +P  SKWK Y   + W QV   +  + V+R
Sbjct: 214 FSLEKRDDLPGMLKWLALVATSPHDPAIGVIPHSSKWKQYNGNKCWLQVARAKNSLLVQR 273

Query: 132 QVDSSSEQSFVQRNQR--MHPCMYDDDTVPIYNLR--------------------KRLSL 191
                  +    R  +   HP MY+DD   I  LR                      +SL
Sbjct: 274 DNAELRYRYHPFRGHQNIHHPSMYEDDRKSIGRLRYSIRPPNLSKHCSSSCCNGSSLVSL 333

Query: 192 DKKD---------LSQEPVSKTSDSSPTDSSD----DYKPVPLGSDYQAQVPEWNGVISE 251
            K           ++ E    T+ +S     +      + + +G  +QAQV EW     +
Sbjct: 334 SKSRSTKCRKLTIIASERAGLTAGTSRARKRNKAEIPRRCIKVGHQHQAQVDEWTESGVD 393

Query: 252 SDLKWLGTQEWPLKKGRN-RYLVERDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKI 311
           SD KWLGT+ WP +        +  D +GKGR D C C  +  V C + HIAEKR  LK 
Sbjct: 394 SDSKWLGTRIWPPENSEALDQTLGNDLVGKGRPDSCSCELSGFVECTRLHIAEKRMELKR 453

Query: 312 ELGDAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKAD 371
           ELGD F  WRF++MGE+V   WT E+EK+F+D++ ++P     S+W +  ++FP + + +
Sbjct: 454 ELGDDFFHWRFNQMGEEVCLRWTEEEEKRFKDMIIADPQ----SFWTNAAKNFPKKKREE 513

Query: 372 LVSYYYNVFLLRRRGHQNRVTPDEIDSDEESDSGIATNGFGNEVHNSPGS 386
           LVSYY+NVFL+ RR +QNRVTP  IDSD+E   G     FG +   S GS
Sbjct: 514 LVSYYFNVFLINRRRYQNRVTPKSIDSDDEGAFGSVGGSFGRDAVTSSGS 559

BLAST of HG10010514 vs. TAIR 10
Match: AT5G04110.1 (DNA GYRASE B3)

HSP 1 Score: 137.5 bits (345), Expect = 2.3e-32
Identity = 77/209 (36.84%), Positives = 113/209 (54.07%), Query Frame = 0

Query: 172 KDLSQEPVSKTSDSSPTDSSDDYKP-VPLGSDYQAQVPEWNGVISE----------SDLK 231
           KD+S    +KTS    T  S+  +P +P+G  +QA++P W     +          + L+
Sbjct: 339 KDVS----NKTSKDVITHGSNKTRPAIPIGPRFQAEIPVWIAPTKKGKFYGSPGDSNTLR 398

Query: 232 WLGTQEWP---LKKGRNRYLVERDPIGKGRRDPCGCLDASSVGCVKFHIAEKRHRLKIEL 291
           WLGT  WP   LKK      V    +G+GR D C C    S  C+K H  E +  L+ E+
Sbjct: 399 WLGTGVWPTYSLKK-----TVHSKKVGEGRSDSCSCASPRSTNCIKRHKKEAQELLEKEI 458

Query: 292 GDAFLRWRFDKMGEDVTF-AWTVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRSKADL 351
             AF  W FD+MGE++   +WT ++E++FE +V  NP S    +WE    +FP +SK DL
Sbjct: 459 NRAFSTWEFDQMGEEIVLKSWTAKEERRFEALVKKNPLSSSDGFWEFASNAFPQKSKKDL 518

Query: 352 VSYYYNVFLLRRRGHQNRVTPDEIDSDEE 366
           +SYYYNVFL++R         + IDSD++
Sbjct: 519 LSYYYNVFLIKRMRLLKSSAANNIDSDDD 538

BLAST of HG10010514 vs. TAIR 10
Match: AT2G03470.1 (ELM2 domain-containing protein )

HSP 1 Score: 120.9 bits (302), Expect = 2.2e-27
Identity = 76/213 (35.68%), Positives = 113/213 (53.05%), Query Frame = 0

Query: 175 SQEPVSKTSDSSPTDSSDDY------------------KPVPLGSDYQAQVPEW--NGVI 234
           SQ  V+  SD S   S  D+                  K V +GS++QA +PE+    ++
Sbjct: 83  SQSGVTTQSDLSHQSSGSDFTWKPVEDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEIL 142

Query: 235 SESDLKWLGTQEWPLKKGRNRYLVERDPIGKGR-RDPCGCLDASSVGCVKFHIAEKRHRL 294
            +S+ +     E  L +     + + D  G G+ R  C CLD  S+ CV+ HI E R  L
Sbjct: 143 DQSEARTKEDLEGKLMRKCVIPMSDSDLCGTGQGRKECLCLDKGSIRCVRRHIIEARESL 202

Query: 295 KIELG-DAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRS 354
              +G + F+     +MGE+V   WT E+E  F  +V SNP S G  +W+ +  +FPSR+
Sbjct: 203 VETIGYERFMELGLCEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRT 262

Query: 355 KADLVSYYYNVFLLRRRGHQNRVTPDEIDSDEE 366
             +LVSYY+NVF+LRRRG QNR    ++DSD++
Sbjct: 263 MKELVSYYFNVFILRRRGIQNRFKALDVDSDDD 295

BLAST of HG10010514 vs. TAIR 10
Match: AT2G03470.2 (ELM2 domain-containing protein )

HSP 1 Score: 120.9 bits (302), Expect = 2.2e-27
Identity = 76/213 (35.68%), Positives = 113/213 (53.05%), Query Frame = 0

Query: 175 SQEPVSKTSDSSPTDSSDDY------------------KPVPLGSDYQAQVPEW--NGVI 234
           SQ  V+  SD S   S  D+                  K V +GS++QA +PE+    ++
Sbjct: 82  SQSGVTTQSDLSHQSSGSDFTWKPVEDVYTCLMNQPPRKQVLVGSNHQADIPEFVKEEIL 141

Query: 235 SESDLKWLGTQEWPLKKGRNRYLVERDPIGKGR-RDPCGCLDASSVGCVKFHIAEKRHRL 294
            +S+ +     E  L +     + + D  G G+ R  C CLD  S+ CV+ HI E R  L
Sbjct: 142 DQSEARTKEDLEGKLMRKCVIPMSDSDLCGTGQGRKECLCLDKGSIRCVRRHIIEARESL 201

Query: 295 KIELG-DAFLRWRFDKMGEDVTFAWTVEDEKKFEDIVSSNPPSLGISYWEDIIESFPSRS 354
              +G + F+     +MGE+V   WT E+E  F  +V SNP S G  +W+ +  +FPSR+
Sbjct: 202 VETIGYERFMELGLCEMGEEVASLWTEEEEDLFHKVVYSNPFSAGRDFWKQLKGTFPSRT 261

Query: 355 KADLVSYYYNVFLLRRRGHQNRVTPDEIDSDEE 366
             +LVSYY+NVF+LRRRG QNR    ++DSD++
Sbjct: 262 MKELVSYYFNVFILRRRGIQNRFKALDVDSDDD 294
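
The percentages in the HSP summary lines above are the identical (or similar) residue counts divided by the alignment length, for example 124/350 = 35.43%. The following is a minimal sketch for re-checking those figures from the text of a summary line. The function name `check_hsp_line` and the returned dictionary keys are hypothetical and not part of this database or of BLAST; the regular expression simply assumes the line layout shown on this page.

```python
import re

# Matches summary lines in the layout used on this page, e.g.
# "Identity = 124/350 (35.43%), Positives = 178/350 (50.86%), Query Frame = 0".
# The second label is matched loosely ("Pos" + letters) so that spelling
# variants of the label are still accepted.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_line(line):
    """Recompute the identity/positives percentages of one HSP summary line.

    Hypothetical helper: returns the recomputed values and whether each one
    agrees with the percentage printed in the line (to two decimals).
    """
    match = HSP_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    computed_ident = 100.0 * int(ident) / int(ident_len)
    computed_pos = 100.0 * int(pos) / int(pos_len)
    return {
        "identity_pct": round(computed_ident, 2),
        "positives_pct": round(computed_pos, 2),
        # Allow half a unit in the last printed decimal place.
        "identity_ok": abs(computed_ident - float(ident_pct)) < 0.005,
        "positives_ok": abs(computed_pos - float(pos_pct)) < 0.005,
    }


if __name__ == "__main__":
    example = "Identity = 77/209 (36.84%), Positives = 113/209 (54.07%), Query Frame = 0"
    print(check_hsp_line(example))
```

Applied to the three distinct summary lines in this section (124/350, 77/209 and 76/213), the recomputed values agree with the printed percentages.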

The following BLAST results are available for this feature:
Match Name	E-value	Identity (%)	Description
XP_038875899.1	2.5e-222	95.43	AT-rich interactive domain-containing protein 1-like [Benincasa hispida]
XP_004138586.1	9.9e-211	88.86	AT-rich interactive domain-containing protein 1 [Cucumis sativus] >XP_011656344....
XP_008458204.1	7.1e-209	90.89	PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Cucu...
KAG7013748.1	8.1e-205	87.82	AT-rich interactive domain-containing protein 1, partial [Cucurbita argyrosperma...
XP_023547658.1	3.1e-204	87.31	AT-rich interactive domain-containing protein 1 [Cucurbita pepo subsp. pepo] >XP...

Match Name	E-value	Identity (%)	Description
Q84JT7	8.8e-69	47.44	AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 ...
Q9LDD4	5.2e-53	35.43	AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 ...

Match Name	E-value	Identity (%)	Description
A0A0A0KD11	4.8e-211	88.86	Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G004620 PE=4 SV=1
A0A1S3C8K0	3.4e-209	90.89	AT-rich interactive domain-containing protein 1-like isoform X1 OS=Cucumis melo ...
A0A6J1H398	3.3e-204	87.56	AT-rich interactive domain-containing protein 1 isoform X1 OS=Cucurbita moschata...
A0A6J1H4K9	3.3e-204	87.56	AT-rich interactive domain-containing protein 1 isoform X2 OS=Cucurbita moschata...
A0A6J1L5C2	4.8e-203	86.80	AT-rich interactive domain-containing protein 1-like OS=Cucurbita maxima OX=3661...

Match Name	E-value	Identity (%)	Description
AT2G46040.1	6.2e-70	47.44	ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT4G11400.1	3.7e-54	35.43	ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT5G04110.1	2.3e-32	36.84	DNA GYRASE B3
AT2G03470.1	2.2e-27	35.68	ELM2 domain-containing protein
AT2G03470.2	2.2e-27	35.68	ELM2 domain-containing protein

InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000949	ELM2 domain	SMART	SM01189	ELM2_2	coord: 197..248; e-value: 0.0019; score: 27.5
IPR017930	Myb domain	PFAM	PF00249	Myb_DNA-binding	coord: 296..343; e-value: 7.5E-6; score: 26.1
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 173..198
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 367..384
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 355..394
None	No IPR available	PANTHER	PTHR46410	AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2	coord: 43..393
None	No IPR available	PANTHER	PTHR46410:SF1	AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 1	coord: 43..393
IPR001005	SANT/Myb domain	CDD	cd00167	SANT	coord: 296..344; e-value: 3.89394E-4; score: 36.0142

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10010514.1	HG10010514.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
cellular_component	GO:0005634	nucleus