Moc05g03940 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc05g03940
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: AT-rich interactive domain-containing protein 2-like isoform X1
Location: chr5: 2724945 .. 2726646 (-)
RNA-Seq Expression: Moc05g03940
Synteny: Moc05g03940
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACTGCAGTCGAGCAAACTTTCGATTTGCTTAAGCTATTTGTTGCTGTACGAGATAAGGGTGGTTATGATGCTGTATCGAGGAATGATTTGTGGGATCTGGTAGCAGAAGATTCTGGTTTAGGTTCGATTATTCCCTCTACTGTGAAAGTGCTATACGTCGAGTGTTTGAATATTTTAGAGAGATGGCTTGAAAAGGTTGTTGAGGATAAAGAATCCTCAAGTAGTTGCAGCAGTAAGGGAGATGGTACGGGTTTTGAGTTCAATGGTTTGTCATCAGATATTCAGTATTTGAAGAAGAATGATGATTTGCATGAAGATACAAAATTGTTGCAGGAGTCAAACTTTTTAGACTGTGATGATACAGTTGTGATTCTGAAGACTGACAGGGACAATGATATTTATGGTTGCGGGGAAACTTTTTGCCAATTAAATAAGAGCGACTTAGACATTCCTGACACAGATGACTTGTACGAAGATGAAGACACAAGCCCCGAATTATCATCCAACGTGAACGACATTCTAATGCCAAGTATTCGGAAGCATGAAAATGTATCAGCAGACGGGGTAGTAGAAAGCAATGTGGAGTTCTCTCATGGCGGTGGAAAGTGCGATGGTGATGATCCTGATAATGAGGGAGGAGTGATTTTAGATTCCACCTCTATTGAGGAACAGAATCTTAGCCACGAGAGAAAGTGTGAATCCCTGTTAGGAATGGTAAGGTGGATCACTGAAATTGCAAAGAATCCATCTTACCCTGTTATCGGGTCATTACCTGAAAGATCAAAGTGGAAGTCACGTGGAAATGAAGAGATCTGGAAGCAAGTTCTGCTGATTCGAGAGGCAATGTTACTGAAAAGACATTCTAATTCTTTTGCGGGACGGTCTATTCTGCAGGTATATTTCTACGAAAACTTGTTCTACAAGTGCAAAAATCATTTACGAGAATATTATTCCATCCTACTAATAATCAACATTCAGTCTGAAACCATTTTGTAGACGAGATAATAATGTGCTGCACTTATAGTGTTGTACACTACCGTACTCGTGGGCTATGTCTAATGCAGCTCTACAAGCACTTGATTACTTTTATACATATGAAAAATGGCTGTGTAACACACTAACACCCAATTGTTTAGTTTTTCAGGATGATAGCACCGTTTCTGTCTTCATGTTTTTATAGTAACTTTTTAGCACATTCAGTTAGTGCCTACTTATGTTTTCTGGATTGTTAAATGAAAATGTATGCATATGATATCATCATGATTCATTTCATCACTTCTAGTTCTTGTTTTTTGTTTGTTTAGCTTTTATGGACAATCTTGTCAACAATACCATCACTTTCTCTGGTTGTTCCTATTTGATTGCAATTAATGGTTCTATATGTGGCCAGAAATATGCAAGCTAATGAAATTGAAATATGGTGAACAAGAAGAAAAGTTATATTAGGTAGTGGCTTTGCAATGAATGAGAATACGTTTTGCTGCAATTCAGCATTTGGCATGCTTTCAGTATGACGATATGATTCACATTGCTAAGTTTTTCAAATACTGCTTAAAAATGGAAAACTTGTATTCAGGTAACTGATCTCTTTAGTGTTTAGCTAATGCATGATAATGTTATTATGTGTTCCCTTTGTAATAATGTCAGAATATAAATCCATACATGTTCAATGATCATCAAGGCTCCAGTTACAATCTAA

mRNA sequence

ATGACTGCAGTCGAGCAAACTTTCGATTTGCTTAAGCTATTTGTTGCTGTACGAGATAAGGGTGGTTATGATGCTGTATCGAGGAATGATTTGTGGGATCTGGTAGCAGAAGATTCTGGTTTAGGTTCGATTATTCCCTCTACTGTGAAAGTGCTATACGTCGAGTGTTTGAATATTTTAGAGAGATGGCTTGAAAAGGTTGTTGAGGATAAAGAATCCTCAAGTAGTTGCAGCAGTAAGGGAGATGGTACGGGTTTTGAGTTCAATGGTTTGTCATCAGATATTCAGTATTTGAAGAAGAATGATGATTTGCATGAAGATACAAAATTGTTGCAGGAGTCAAACTTTTTAGACTGTGATGATACAGTTGTGATTCTGAAGACTGACAGGGACAATGATATTTATGGTTGCGGGGAAACTTTTTGCCAATTAAATAAGAGCGACTTAGACATTCCTGACACAGATGACTTGTACGAAGATGAAGACACAAGCCCCGAATTATCATCCAACGTGAACGACATTCTAATGCCAAGTATTCGGAAGCATGAAAATGTATCAGCAGACGGGGTAGTAGAAAGCAATGTGGAGTTCTCTCATGGCGGTGGAAAGTGCGATGGTGATGATCCTGATAATGAGGGAGGAGTGATTTTAGATTCCACCTCTATTGAGGAACAGAATCTTAGCCACGAGAGAAAGTGTGAATCCCTGTTAGGAATGGTAAGGTGGATCACTGAAATTGCAAAGAATCCATCTTACCCTGTTATCGGGTCATTACCTGAAAGATCAAAGTGGAAGTCACGTGGAAATGAAGAGATCTGGAAGCAAGTTCTGCTGATTCGAGAGGCAATGTTACTGAAAAGACATTCTAATTCTTTTGCGGGACGGTCTATTCTGCAGGCTCCAGTTACAATCTAA

Coding sequence (CDS)

ATGACTGCAGTCGAGCAAACTTTCGATTTGCTTAAGCTATTTGTTGCTGTACGAGATAAGGGTGGTTATGATGCTGTATCGAGGAATGATTTGTGGGATCTGGTAGCAGAAGATTCTGGTTTAGGTTCGATTATTCCCTCTACTGTGAAAGTGCTATACGTCGAGTGTTTGAATATTTTAGAGAGATGGCTTGAAAAGGTTGTTGAGGATAAAGAATCCTCAAGTAGTTGCAGCAGTAAGGGAGATGGTACGGGTTTTGAGTTCAATGGTTTGTCATCAGATATTCAGTATTTGAAGAAGAATGATGATTTGCATGAAGATACAAAATTGTTGCAGGAGTCAAACTTTTTAGACTGTGATGATACAGTTGTGATTCTGAAGACTGACAGGGACAATGATATTTATGGTTGCGGGGAAACTTTTTGCCAATTAAATAAGAGCGACTTAGACATTCCTGACACAGATGACTTGTACGAAGATGAAGACACAAGCCCCGAATTATCATCCAACGTGAACGACATTCTAATGCCAAGTATTCGGAAGCATGAAAATGTATCAGCAGACGGGGTAGTAGAAAGCAATGTGGAGTTCTCTCATGGCGGTGGAAAGTGCGATGGTGATGATCCTGATAATGAGGGAGGAGTGATTTTAGATTCCACCTCTATTGAGGAACAGAATCTTAGCCACGAGAGAAAGTGTGAATCCCTGTTAGGAATGGTAAGGTGGATCACTGAAATTGCAAAGAATCCATCTTACCCTGTTATCGGGTCATTACCTGAAAGATCAAAGTGGAAGTCACGTGGAAATGAAGAGATCTGGAAGCAAGTTCTGCTGATTCGAGAGGCAATGTTACTGAAAAGACATTCTAATTCTTTTGCGGGACGGTCTATTCTGCAGGCTCCAGTTACAATCTAA

Protein sequence

MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNILERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCDDTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQAPVTI
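As a quick consistency check, the coding sequence above can be translated and compared with the listed protein. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1), and the function name `translate` and the two short excerpts (the first 30 and the last 18 nucleotides of the CDS) are chosen here for the example and are not part of the database record.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons ordered
# with bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Hypothetical excerpts copied from the CDS listed above.
cds_start = "ATGACTGCAGTCGAGCAAACTTTCGATTTG"  # first 30 nt
cds_end = "GCTCCAGTTACAATCTAA"                # last 18 nt, including the stop codon

print(translate(cds_start))  # MTAVEQTFDL
print(translate(cds_end))    # APVTI
```

Both results agree with the first ten and last five residues of the protein sequence shown above. Passing the full CDS string to `translate` would allow the same comparison over the whole protein.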
Homology
BLAST of Moc05g03940 vs. NCBI nr
Match: XP_022147337.1 (AT-rich interactive domain-containing protein 1-like [Momordica charantia])

HSP 1 Score: 600.5 bits (1547), Expect = 8.0e-168
Identity = 299/299 (100.00%), Positives = 299/299 (100.00%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL
Sbjct: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD
Sbjct: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR 180
           DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR
Sbjct: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR 180

Query: 181 KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV 240
           KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV
Sbjct: 181 KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV 240

Query: 241 RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQ 300
           RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQ
Sbjct: 241 RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQ 299

BLAST of Moc05g03940 vs. NCBI nr
Match: XP_038878805.1 (AT-rich interactive domain-containing protein 1-like [Benincasa hispida] >XP_038878812.1 AT-rich interactive domain-containing protein 1-like [Benincasa hispida])

HSP 1 Score: 371.7 bits (953), Expect = 6.0e-99
Identity = 201/306 (65.69%), Positives = 229/306 (74.84%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           M + EQTFDL KLFVAVRDKGGYD VSR DLWDLVAE+SGLGSII ST+KVLYVE LN+L
Sbjct: 1   MVSNEQTFDLFKLFVAVRDKGGYDVVSRKDLWDLVAEESGLGSIISSTLKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER LE+VVED++S++SCSS GDGTGF FNG+S D Q+LKKN DLH       ESNFLD D
Sbjct: 61  ERLLERVVEDRDSTNSCSSNGDGTGFGFNGMSPDTQFLKKNRDLH-------ESNFLDYD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPS-- 180
           D +V+LK DR  +I GC ET CQLN S  DI DT+DLYEDED+S EL+SNV +    S  
Sbjct: 121 DMIVVLKIDRHKNIAGCEETLCQLNTSQWDIYDTNDLYEDEDSSLELASNVAENFDDSEK 180

Query: 181 -----IRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
                ++K E    DG VESNVEFSH   KCDG   D++ GVI DS S+EE N+ HE+KC
Sbjct: 181 SHSRNVQKDERAFVDG-VESNVEFSHDSRKCDGS--DSKEGVITDSISVEEINICHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 300
           ES+ GMV WITEIAKNP  PVIG LP+ SKWKS GNEEIWKQVLL REAMLL  H NS+ 
Sbjct: 241 ESMSGMVNWITEIAKNPCNPVIGLLPKSSKWKSSGNEEIWKQVLLTREAMLLNGHINSYF 296
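Each HSP header above gives identity and positives as a count over the alignment length, followed by a percentage. The minimal Python sketch below shows how those percentages can be recomputed from the counts, using the values reported for the Benincasa hispida hit (201/306 and 229/306). The function name `parse_hsp_stats` and the regular expression are assumptions made for this example and do not belong to any BLAST tool; the pattern deliberately tolerates variant spellings of the positives label.

```python
import re

# Matches "Identity = a/b (x%), Pos... = c/d (y%)" in an HSP header line.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract the counts from an HSP header line and recompute the percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, _, positives, pos_aligned, _ = match.groups()
    return {
        "identical": int(identical),
        "positives": int(positives),
        "aligned": int(aligned),
        "identity_pct": round(100 * int(identical) / int(aligned), 2),
        "positives_pct": round(100 * int(positives) / int(pos_aligned), 2),
    }


line = "Identity = 201/306 (65.69%), Positives = 229/306 (74.84%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats["identity_pct"], stats["positives_pct"])  # 65.69 74.84
```

The recomputed values match the percentages printed in the header, which is a simple way to confirm that counts and percentages in a parsed report have not been mixed up between hits.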

BLAST of Moc05g03940 vs. NCBI nr
Match: KAG7019014.1 (AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 360.1 bits (923), Expect = 1.8e-95
Identity = 198/306 (64.71%), Positives = 227/306 (74.18%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++ CSS GD TGF  N    DIQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNCCSSNGDSTGFGLNCSPLDIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT+DLYEDED S ELSSNV++       
Sbjct: 121 DRIVVPKTDRDNYTAGCGETFCQSNKSKPDIHDTNDLYEDEDFSLELSSNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SNSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 300
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLQGHINSYA 293

BLAST of Moc05g03940 vs. NCBI nr
Match: XP_022980870.1 (AT-rich interactive domain-containing protein 2-like isoform X3 [Cucurbita maxima])

HSP 1 Score: 359.4 bits (921), Expect = 3.1e-95
Identity = 197/304 (64.80%), Positives = 226/304 (74.34%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++SCSS GD TGF  N L   IQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNSCSSNGDSTGFGLNCLPLYIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT DLYEDED S EL+SNV++       
Sbjct: 121 DRIVVPKTDRDNHTAGCGETFCQSNKSKPDIHDTSDLYEDEDFSLELASNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SHSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 298
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLEGHINSYA 291

BLAST of Moc05g03940 vs. NCBI nr
Match: XP_022980871.1 (AT-rich interactive domain-containing protein 2-like isoform X4 [Cucurbita maxima] >XP_022980872.1 AT-rich interactive domain-containing protein 2-like isoform X4 [Cucurbita maxima] >XP_022980873.1 AT-rich interactive domain-containing protein 2-like isoform X4 [Cucurbita maxima])

HSP 1 Score: 359.4 bits (921), Expect = 3.1e-95
Identity = 197/304 (64.80%), Positives = 226/304 (74.34%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++SCSS GD TGF  N L   IQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNSCSSNGDSTGFGLNCLPLYIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT DLYEDED S EL+SNV++       
Sbjct: 121 DRIVVPKTDRDNHTAGCGETFCQSNKSKPDIHDTSDLYEDEDFSLELASNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SHSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 298
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLEGHINSYA 291

BLAST of Moc05g03940 vs. ExPASy Swiss-Prot
Match: Q84JT7 (AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 GN=ARID1 PE=2 SV=1)

HSP 1 Score: 95.5 bits (236), Expect = 1.1e-18
Identity = 85/282 (30.14%), Positives = 127/282 (45.04%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT   +T DL  LF+ V  KGG+DAVS N  WD V ++SGL S   ++ K++YV+ L+  
Sbjct: 72  MTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQESGLESYDSASAKLIYVKYLDAF 131

Query: 61  ERWLEKVVEDKESSSSCSSKG--DGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLD 120
            RWL +VV      SS    G  D      NG  S++   KK  +L +     + +  L 
Sbjct: 132 GRWLNRVVAGDTDVSSVELSGISDALVARLNGFLSEV---KKKYELRKG----RPAKELG 191

Query: 121 CDDTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPS 180
            +    I KT R  D +  G+                                       
Sbjct: 192 AELKWFISKTKRRYDKHHVGK--------------------------------------- 251

Query: 181 IRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLG 240
               E+ S D V E        G K    +   E  +IL+S + +E +   +RK E  L 
Sbjct: 252 ----ESASNDAVKEFQ------GSKL--AERRLEQIMILESVT-QECSSPGKRKRECPLE 294

Query: 241 MVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIR 281
            ++W++++AK+P  P +G +P+RS+W S G+EE WKQ+LL R
Sbjct: 312 TLKWLSDVAKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFR 294

BLAST of Moc05g03940 vs. ExPASy Swiss-Prot
Match: Q9LDD4 (AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 GN=ARID2 PE=1 SV=1)

HSP 1 Score: 72.0 bits (175), Expect = 1.3e-11
Identity = 76/287 (26.48%), Positives = 118/287 (41.11%), Query Frame = 0

Query: 9   DLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLG-SIIPSTVKVLYVECLNILERWL--- 68
           DL KLFV VR++ G+D VSR  LW++VAE  G   S++PS + ++Y++ LN +E+W    
Sbjct: 58  DLFKLFVLVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLI-LIYLKYLNRMEKWAVEE 117

Query: 69  EKVV----EDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 128
            ++V    +D E     S      G  F  L  + +  K+N  +      ++ES    C 
Sbjct: 118 SRIVNWDNKDSEKKGCYSGMLHELGNGFKSLLDNGKCQKRNRAVAFGCNHMEES----CS 177

Query: 129 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR 188
           +     K  R++D                          D+D    LSS V       IR
Sbjct: 178 EFDRSRKRFRESD--------------------------DDDKGVGLSSVV-------IR 237

Query: 189 KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV 248
           +   V A  V E   +FS                                 K + L GM+
Sbjct: 238 EETVVCA--VEEGLSDFS-------------------------------LEKRDDLPGML 273

Query: 249 RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKR 288
           +W+  +A +P  P IG +P  SKWK     + W QV   + ++L++R
Sbjct: 298 KWLALVATSPHDPAIGVIPHSSKWKQYNGNKCWLQVARAKNSLLVQR 273

BLAST of Moc05g03940 vs. ExPASy TrEMBL
Match: A0A6J1D240 (AT-rich interactive domain-containing protein 1-like OS=Momordica charantia OX=3673 GN=LOC111016306 PE=4 SV=1)

HSP 1 Score: 600.5 bits (1547), Expect = 3.9e-168
Identity = 299/299 (100.00%), Positives = 299/299 (100.00%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL
Sbjct: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD
Sbjct: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR 180
           DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR
Sbjct: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR 180

Query: 181 KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV 240
           KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV
Sbjct: 181 KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV 240

Query: 241 RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQ 300
           RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQ
Sbjct: 241 RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFAGRSILQ 299

BLAST of Moc05g03940 vs. ExPASy TrEMBL
Match: A0A6J1J0J0 (AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111480131 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.5e-95
Identity = 197/304 (64.80%), Positives = 226/304 (74.34%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++SCSS GD TGF  N L   IQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNSCSSNGDSTGFGLNCLPLYIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT DLYEDED S EL+SNV++       
Sbjct: 121 DRIVVPKTDRDNHTAGCGETFCQSNKSKPDIHDTSDLYEDEDFSLELASNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SHSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 298
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLEGHINSYA 291

BLAST of Moc05g03940 vs. ExPASy TrEMBL
Match: A0A6J1ISG3 (AT-rich interactive domain-containing protein 2-like isoform X4 OS=Cucurbita maxima OX=3661 GN=LOC111480131 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.5e-95
Identity = 197/304 (64.80%), Positives = 226/304 (74.34%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++SCSS GD TGF  N L   IQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNSCSSNGDSTGFGLNCLPLYIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT DLYEDED S EL+SNV++       
Sbjct: 121 DRIVVPKTDRDNHTAGCGETFCQSNKSKPDIHDTSDLYEDEDFSLELASNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SHSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 298
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLEGHINSYA 291

BLAST of Moc05g03940 vs. ExPASy TrEMBL
Match: A0A6J1J0F4 (AT-rich interactive domain-containing protein 2-like isoform X5 OS=Cucurbita maxima OX=3661 GN=LOC111480131 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.5e-95
Identity = 197/304 (64.80%), Positives = 226/304 (74.34%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++SCSS GD TGF  N L   IQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNSCSSNGDSTGFGLNCLPLYIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT DLYEDED S EL+SNV++       
Sbjct: 121 DRIVVPKTDRDNHTAGCGETFCQSNKSKPDIHDTSDLYEDEDFSLELASNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SHSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 298
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLEGHINSYA 291

BLAST of Moc05g03940 vs. ExPASy TrEMBL
Match: A0A6J1J0E9 (AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111480131 PE=4 SV=1)

HSP 1 Score: 359.4 bits (921), Expect = 1.5e-95
Identity = 197/304 (64.80%), Positives = 226/304 (74.34%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT+ EQ FDL KLFVAVRDKGGY+ VSR DLWDLVAE+SGLGSII STVKVLYVE LN+L
Sbjct: 1   MTSDEQNFDLFKLFVAVRDKGGYNVVSRKDLWDLVAEESGLGSIISSTVKVLYVEYLNVL 60

Query: 61  ERWLEKVVEDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 120
           ER+LE+VVED++S++SCSS GD TGF  N L   IQ LKKN+D       LQ+SNF  CD
Sbjct: 61  ERFLERVVEDRDSTNSCSSNGDSTGFGLNCLPLYIQSLKKNND-------LQDSNFSVCD 120

Query: 121 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDIL----- 180
           D +V+ KTDRDN   GCGETFCQ NKS  DI DT DLYEDED S EL+SNV++       
Sbjct: 121 DRIVVPKTDRDNHTAGCGETFCQSNKSKPDIHDTSDLYEDEDFSLELASNVDENFDDIEK 180

Query: 181 --MPSIRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKC 240
               +I+K+EN   DG VESNVEF +   KCDG D DN+ GV     S+EE N SHE+KC
Sbjct: 181 SHSLNIQKYENALVDG-VESNVEFPYDCRKCDGYDSDNKQGV-----SVEEHNFSHEKKC 240

Query: 241 ESLLGMVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKRHSNSFA 298
           ES+LGMV WI EIAKNP  PVIG LPERS+WKS  NEEIWKQVLLIREAM L+ H NS+A
Sbjct: 241 ESMLGMVNWIAEIAKNPCNPVIGLLPERSEWKSSANEEIWKQVLLIREAMFLEGHINSYA 291

BLAST of Moc05g03940 vs. TAIR 10
Match: AT2G46040.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein)

HSP 1 Score: 95.5 bits (236), Expect = 7.8e-20
Identity = 85/282 (30.14%), Positives = 127/282 (45.04%), Query Frame = 0

Query: 1   MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL 60
           MT   +T DL  LF+ V  KGG+DAVS N  WD V ++SGL S   ++ K++YV+ L+  
Sbjct: 72  MTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQESGLESYDSASAKLIYVKYLDAF 131

Query: 61  ERWLEKVVEDKESSSSCSSKG--DGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLD 120
            RWL +VV      SS    G  D      NG  S++   KK  +L +     + +  L 
Sbjct: 132 GRWLNRVVAGDTDVSSVELSGISDALVARLNGFLSEV---KKKYELRKG----RPAKELG 191

Query: 121 CDDTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPS 180
            +    I KT R  D +  G+                                       
Sbjct: 192 AELKWFISKTKRRYDKHHVGK--------------------------------------- 251

Query: 181 IRKHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLG 240
               E+ S D V E        G K    +   E  +IL+S + +E +   +RK E  L 
Sbjct: 252 ----ESASNDAVKEFQ------GSKL--AERRLEQIMILESVT-QECSSPGKRKRECPLE 294

Query: 241 MVRWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIR 281
            ++W++++AK+P  P +G +P+RS+W S G+EE WKQ+LL R
Sbjct: 312 TLKWLSDVAKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFR 294

BLAST of Moc05g03940 vs. TAIR 10
Match: AT4G11400.1 (ARID/BRIGHT DNA-binding domain;ELM2 domain protein)

HSP 1 Score: 72.0 bits (175), Expect = 9.2e-13
Identity = 76/287 (26.48%), Positives = 118/287 (41.11%), Query Frame = 0

Query: 9   DLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLG-SIIPSTVKVLYVECLNILERWL--- 68
           DL KLFV VR++ G+D VSR  LW++VAE  G   S++PS + ++Y++ LN +E+W    
Sbjct: 58  DLFKLFVLVREREGFDTVSRKRLWEVVAEKLGFDCSLVPSLI-LIYLKYLNRMEKWAVEE 117

Query: 69  EKVV----EDKESSSSCSSKGDGTGFEFNGLSSDIQYLKKNDDLHEDTKLLQESNFLDCD 128
            ++V    +D E     S      G  F  L  + +  K+N  +      ++ES    C 
Sbjct: 118 SRIVNWDNKDSEKKGCYSGMLHELGNGFKSLLDNGKCQKRNRAVAFGCNHMEES----CS 177

Query: 129 DTVVILKTDRDNDIYGCGETFCQLNKSDLDIPDTDDLYEDEDTSPELSSNVNDILMPSIR 188
           +     K  R++D                          D+D    LSS V       IR
Sbjct: 178 EFDRSRKRFRESD--------------------------DDDKGVGLSSVV-------IR 237

Query: 189 KHENVSADGVVESNVEFSHGGGKCDGDDPDNEGGVILDSTSIEEQNLSHERKCESLLGMV 248
           +   V A  V E   +FS                                 K + L GM+
Sbjct: 238 EETVVCA--VEEGLSDFS-------------------------------LEKRDDLPGML 273

Query: 249 RWITEIAKNPSYPVIGSLPERSKWKSRGNEEIWKQVLLIREAMLLKR 288
           +W+  +A +P  P IG +P  SKWK     + W QV   + ++L++R
Sbjct: 298 KWLALVATSPHDPAIGVIPHSSKWKQYNGNKCWLQVARAKNSLLVQR 273

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022147337.1  8.0e-168  100.00        AT-rich interactive domain-containing protein 1-like [Momordica charantia]
XP_038878805.1  6.0e-99   65.69         AT-rich interactive domain-containing protein 1-like [Benincasa hispida] >XP_038...
KAG7019014.1    1.8e-95   64.71         AT-rich interactive domain-containing protein 2, partial [Cucurbita argyrosperma...
XP_022980870.1  3.1e-95   64.80         AT-rich interactive domain-containing protein 2-like isoform X3 [Cucurbita maxim...
XP_022980871.1  3.1e-95   64.80         AT-rich interactive domain-containing protein 2-like isoform X4 [Cucurbita maxim...

Match Name      E-value   Identity (%)  Description
Q84JT7          1.1e-18   30.14         AT-rich interactive domain-containing protein 1 OS=Arabidopsis thaliana OX=3702 ...
Q9LDD4          1.3e-11   26.48         AT-rich interactive domain-containing protein 2 OS=Arabidopsis thaliana OX=3702 ...

Match Name      E-value   Identity (%)  Description
A0A6J1D240      3.9e-168  100.00        AT-rich interactive domain-containing protein 1-like OS=Momordica charantia OX=3...
A0A6J1J0J0      1.5e-95   64.80         AT-rich interactive domain-containing protein 2-like isoform X1 OS=Cucurbita max...
A0A6J1ISG3      1.5e-95   64.80         AT-rich interactive domain-containing protein 2-like isoform X4 OS=Cucurbita max...
A0A6J1J0F4      1.5e-95   64.80         AT-rich interactive domain-containing protein 2-like isoform X5 OS=Cucurbita max...
A0A6J1J0E9      1.5e-95   64.80         AT-rich interactive domain-containing protein 2-like isoform X2 OS=Cucurbita max...

Match Name      E-value   Identity (%)  Description
AT2G46040.1     7.8e-20   30.14         ARID/BRIGHT DNA-binding domain;ELM2 domain protein
AT4G11400.1     9.2e-13   26.48         ARID/BRIGHT DNA-binding domain;ELM2 domain protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description                      Source       Source Term    Source Description                               Alignment
IPR001606  ARID DNA-binding domain              PFAM         PF01388        ARID                                             coord: 6..57; e-value: 5.4E-6; score: 27.0
IPR001606  ARID DNA-binding domain              PROSITE      PS51011        ARID                                             coord: 1..65; score: 14.160748
IPR036431  ARID DNA-binding domain superfamily  GENE3D       1.10.150.60    -                                                coord: 2..78; e-value: 4.6E-10; score: 40.9
IPR036431  ARID DNA-binding domain superfamily  SUPERFAMILY  46774          ARID-like                                        coord: 7..70
None       No IPR available                     PANTHER      PTHR46410:SF1  AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 1  coord: 1..299
None       No IPR available                     PANTHER      PTHR46410      AT-RICH INTERACTIVE DOMAIN-CONTAINING PROTEIN 2  coord: 1..299
None       No IPR available                     CDD          cd16100        ARID                                             coord: 6..57; e-value: 2.53783E-10; score: 54.286
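The `coord` values in the InterPro annotations give the span of each match on the protein. A short Python sketch of how such a span can be cut out of the sequence follows, assuming the coordinates are 1-based and inclusive (the usual convention for InterProScan output, but not stated on this page). The helper names `parse_coord` and `extract_region` are hypothetical, and `protein_start` holds only the first 78 residues of the protein listed under Sequences.

```python
import re


def parse_coord(text: str) -> tuple[int, int]:
    """Parse a string such as 'coord: 6..57' into (start, end)."""
    match = re.search(r"(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinate range in {text!r}")
    return int(match.group(1)), int(match.group(2))


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Convert to Python's 0-based, end-exclusive slicing.
    return sequence[start - 1:end]


# First 78 residues of the Moc05g03940 protein (excerpt for illustration).
protein_start = (
    "MTAVEQTFDLLKLFVAVRDKGGYDAVSRNDLWDLVAEDSGLGSIIPSTVKVLYVECLNIL"
    "ERWLEKVVEDKESSSSCS"
)

start, end = parse_coord("coord: 6..57")  # PFAM PF01388 (ARID) match
arid_region = extract_region(protein_start, start, end)
print(len(arid_region))  # 52
print(arid_region)
```

Under the stated assumption, the PFAM ARID match covers 52 residues near the N-terminus, the same region reported by the CDD match with identical coordinates.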

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc05g03940.1  Moc05g03940.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0005634      nucleus
molecular_function  GO:0003677      DNA binding