MS021020 (gene) Bitter gourd (TR) v1

Overview
NameMS021020
Typegene
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionMyb_DNA-bind_3 domain-containing protein
Locationscaffold290: 593174 .. 593611 (+)
RNA-Seq ExpressionMS021020
SyntenyMS021020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
TGGACATCAATCCTCGCCAAGAAACTTGTGAATTTGATAAGTGAACAACTGCAGAAAGGAAGGAAAGAATTGGAATGGAGGCTTATATGCAATGAGTTCTGCAGGAAAACAAAAATGGAATGGAATGTAGAACAACTGAAGAATCAATACGCTGTTATGAGAAAGCAATACTTGATTGTGAAGTCGATATTCTATCGGGACGACTTCTCATGGCGTGGAAGTACTGGGGTTATTGTGACCACAGATGAGAGTACGGCCACATATGTTAGGGTATGTTCTTTTATCTCTAAAACTCATATTATTGTGATAGCTGTGAAGAGGTTATCTCTTTCTGAAACTCTTTATTATTGTGGCGGCTTATTTGTCAAAAAGCAGAGAATAGTGATTTGTCCAACATATGAAGAGTTATGCATAATTTTCTCCAATTCTGATGGTAAG

mRNA sequence

TGGACATCAATCCTCGCCAAGAAACTTGTGAATTTGATAAGTGAACAACTGCAGAAAGGAAGGAAAGAATTGGAATGGAGGCTTATATGCAATGAGTTCTGCAGGAAAACAAAAATGGAATGGAATGTAGAACAACTGAAGAATCAATACGCTGTTATGAGAAAGCAATACTTGATTGTGAAGTCGATATTCTATCGGGACGACTTCTCATGGCGTGGAAGTACTGGGGTTATTGTGACCACAGATGAGAGTACGGCCACATATGTTAGGGTATGTTCTTTTATCTCTAAAACTCATATTATTGTGATAGCTGTGAAGAGGTTATCTCTTTCTGAAACTCTTTATTATTGTGGCGGCTTATTTGTCAAAAAGCAGAGAATAGTGATTTGTCCAACATATGAAGAGTTATGCATAATTTTCTCCAATTCTGATGGTAAG

Coding sequence (CDS)

TGGACATCAATCCTCGCCAAGAAACTTGTGAATTTGATAAGTGAACAACTGCAGAAAGGAAGGAAAGAATTGGAATGGAGGCTTATATGCAATGAGTTCTGCAGGAAAACAAAAATGGAATGGAATGTAGAACAACTGAAGAATCAATACGCTGTTATGAGAAAGCAATACTTGATTGTGAAGTCGATATTCTATCGGGACGACTTCTCATGGCGTGGAAGTACTGGGGTTATTGTGACCACAGATGAGAGTACGGCCACATATGTTAGGGTATGTTCTTTTATCTCTAAAACTCATATTATTGTGATAGCTGTGAAGAGGTTATCTCTTTCTGAAACTCTTTATTATTGTGGCGGCTTATTTGTCAAAAAGCAGAGAATAGTGATTTGTCCAACATATGAAGAGTTATGCATAATTTTCTCCAATTCTGATGGTAAG

Protein sequence

WTSILAKKLVNLISEQLQKGRKELEWRLICNEFCRKTKMEWNVEQLKNQYAVMRKQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETLYYCGGLFVKKQRIVICPTYEELCIIFSNSDGK
Homology
BLAST of MS021020 vs. NCBI nr
Match: XP_022145264.1 (uncharacterized protein LOC111014759 isoform X1 [Momordica charantia])

HSP 1 Score: 129.0 bits (323), Expect = 3.3e-26
Identity = 74/145 (51.03%), Postives = 86/145 (59.31%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRKELEWRLICNEFCRKTKMEWNVEQLKNQYAVMRKQYLIV 60
           WTS +AK LV LI EQ+Q+GRKELEW  IC+EFC ++K+ W+ EQLK+QYAVMRKQYLIV
Sbjct: 23  WTSNIAKILVELIIEQVQRGRKELEWGFICDEFCSRSKLMWDEEQLKHQYAVMRKQYLIV 82

Query: 61  KSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETLYYCGGL 120
           KSIF RDDFSW  STG+IV T                   +     R    E LY     
Sbjct: 83  KSIFDRDDFSWHESTGIIVAT-------------------VATDDARFIEEENLYL---- 133

Query: 121 FVKKQRIVICPTYEELCIIFSNSDG 146
                      TYE+LC IFSNSDG
Sbjct: 143 -----------TYEDLCRIFSNSDG 133

BLAST of MS021020 vs. NCBI nr
Match: XP_022145265.1 (uncharacterized protein LOC111014759 isoform X2 [Momordica charantia])

HSP 1 Score: 129.0 bits (323), Expect = 3.3e-26
Identity = 74/145 (51.03%), Postives = 86/145 (59.31%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRKELEWRLICNEFCRKTKMEWNVEQLKNQYAVMRKQYLIV 60
           WTS +AK LV LI EQ+Q+GRKELEW  IC+EFC ++K+ W+ EQLK+QYAVMRKQYLIV
Sbjct: 23  WTSNIAKILVELIIEQVQRGRKELEWGFICDEFCSRSKLMWDEEQLKHQYAVMRKQYLIV 82

Query: 61  KSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETLYYCGGL 120
           KSIF RDDFSW  STG+IV T                   +     R    E LY     
Sbjct: 83  KSIFDRDDFSWHESTGIIVAT-------------------VATDDARFIEEENLYL---- 133

Query: 121 FVKKQRIVICPTYEELCIIFSNSDG 146
                      TYE+LC IFSNSDG
Sbjct: 143 -----------TYEDLCRIFSNSDG 133

BLAST of MS021020 vs. NCBI nr
Match: XP_035549342.1 (L10-interacting MYB domain-containing protein-like isoform X3 [Juglans regia] >KAF5462973.1 hypothetical protein F2P56_018933 [Juglans regia])

HSP 1 Score: 99.0 bits (245), Expect = 3.6e-17
Identity = 59/147 (40.14%), Postives = 80/147 (54.42%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRK------ELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           WTS L K L +L+ + +QKG +      +  WR IC+EF +KT + W+ EQLKN+YAV+R
Sbjct: 24  WTSSLTKILADLMIDLVQKGNRHGHSFGKKAWRYICDEFYKKTGLNWDKEQLKNRYAVLR 83

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETL 120
           +QY+ VKS+  + DFSW  S G I+  DE+   Y+R                    +ETL
Sbjct: 84  RQYVTVKSLLDQRDFSWDESMGTIIGKDEAWTEYIR----------------GHPDAETL 143

Query: 121 YYCGGLFVKKQRIVICPTYEELCIIFS 142
            Y G           CP Y+ELCIIFS
Sbjct: 144 KYTG-----------CPIYKELCIIFS 143

BLAST of MS021020 vs. NCBI nr
Match: XP_035549340.1 (L10-interacting MYB domain-containing protein-like isoform X1 [Juglans regia] >KAF5462975.1 hypothetical protein F2P56_018934 [Juglans regia])

HSP 1 Score: 99.0 bits (245), Expect = 3.6e-17
Identity = 59/147 (40.14%), Postives = 80/147 (54.42%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRK------ELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           WTS L K L +L+ + +QKG +      +  WR IC+EF +KT + W+ EQLKN+YAV+R
Sbjct: 51  WTSSLTKILADLMIDLVQKGNRHGHSFGKKAWRYICDEFYKKTGLNWDKEQLKNRYAVLR 110

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETL 120
           +QY+ VKS+  + DFSW  S G I+  DE+   Y+R                    +ETL
Sbjct: 111 RQYVTVKSLLDQRDFSWDESMGTIIGKDEAWTEYIR----------------GHPDAETL 170

Query: 121 YYCGGLFVKKQRIVICPTYEELCIIFS 142
            Y G           CP Y+ELCIIFS
Sbjct: 171 KYTG-----------CPIYKELCIIFS 170

BLAST of MS021020 vs. NCBI nr
Match: XP_035549341.1 (L10-interacting MYB domain-containing protein-like isoform X2 [Juglans regia] >KAF5462974.1 hypothetical protein F2P56_018934 [Juglans regia])

HSP 1 Score: 99.0 bits (245), Expect = 3.6e-17
Identity = 59/147 (40.14%), Postives = 80/147 (54.42%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRK------ELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           WTS L K L +L+ + +QKG +      +  WR IC+EF +KT + W+ EQLKN+YAV+R
Sbjct: 32  WTSSLTKILADLMIDLVQKGNRHGHSFGKKAWRYICDEFYKKTGLNWDKEQLKNRYAVLR 91

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETL 120
           +QY+ VKS+  + DFSW  S G I+  DE+   Y+R                    +ETL
Sbjct: 92  RQYVTVKSLLDQRDFSWDESMGTIIGKDEAWTEYIR----------------GHPDAETL 151

Query: 121 YYCGGLFVKKQRIVICPTYEELCIIFS 142
            Y G           CP Y+ELCIIFS
Sbjct: 152 KYTG-----------CPIYKELCIIFS 151

BLAST of MS021020 vs. ExPASy TrEMBL
Match: A0A6J1CUQ3 (uncharacterized protein LOC111014759 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014759 PE=4 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 1.6e-26
Identity = 74/145 (51.03%), Postives = 86/145 (59.31%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRKELEWRLICNEFCRKTKMEWNVEQLKNQYAVMRKQYLIV 60
           WTS +AK LV LI EQ+Q+GRKELEW  IC+EFC ++K+ W+ EQLK+QYAVMRKQYLIV
Sbjct: 23  WTSNIAKILVELIIEQVQRGRKELEWGFICDEFCSRSKLMWDEEQLKHQYAVMRKQYLIV 82

Query: 61  KSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETLYYCGGL 120
           KSIF RDDFSW  STG+IV T                   +     R    E LY     
Sbjct: 83  KSIFDRDDFSWHESTGIIVAT-------------------VATDDARFIEEENLYL---- 133

Query: 121 FVKKQRIVICPTYEELCIIFSNSDG 146
                      TYE+LC IFSNSDG
Sbjct: 143 -----------TYEDLCRIFSNSDG 133

BLAST of MS021020 vs. ExPASy TrEMBL
Match: A0A6J1CTZ5 (uncharacterized protein LOC111014759 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111014759 PE=4 SV=1)

HSP 1 Score: 129.0 bits (323), Expect = 1.6e-26
Identity = 74/145 (51.03%), Postives = 86/145 (59.31%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRKELEWRLICNEFCRKTKMEWNVEQLKNQYAVMRKQYLIV 60
           WTS +AK LV LI EQ+Q+GRKELEW  IC+EFC ++K+ W+ EQLK+QYAVMRKQYLIV
Sbjct: 23  WTSNIAKILVELIIEQVQRGRKELEWGFICDEFCSRSKLMWDEEQLKHQYAVMRKQYLIV 82

Query: 61  KSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETLYYCGGL 120
           KSIF RDDFSW  STG+IV T                   +     R    E LY     
Sbjct: 83  KSIFDRDDFSWHESTGIIVAT-------------------VATDDARFIEEENLYL---- 133

Query: 121 FVKKQRIVICPTYEELCIIFSNSDG 146
                      TYE+LC IFSNSDG
Sbjct: 143 -----------TYEDLCRIFSNSDG 133

BLAST of MS021020 vs. ExPASy TrEMBL
Match: A0A2N9J0Q6 (Myb_DNA-bind_3 domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS59019 PE=4 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.8e-17
Identity = 58/149 (38.93%), Postives = 81/149 (54.36%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRK------ELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           WT+ L K L +L+  Q+QKG +      +  WR IC+EF +KT ++W+ EQLKN+YAV+R
Sbjct: 24  WTTSLTKILADLMINQVQKGNRHKNSFSKKAWRYICDEFYKKTGLKWDKEQLKNRYAVLR 83

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETL 120
           +QY+ VKS+  + DF+W   TG IV  DE+ A Y                +     +E L
Sbjct: 84  RQYITVKSLLDQSDFNWDEFTGSIVAKDEAWAEY----------------IMGHPDAEAL 143

Query: 121 YYCGGLFVKKQRIVICPTYEELCIIFSNS 144
            Y G           CP Y+ELC+IFS S
Sbjct: 144 KYSG-----------CPIYKELCLIFSES 145

BLAST of MS021020 vs. ExPASy TrEMBL
Match: A0A6P9F3R0 (L10-interacting MYB domain-containing protein-like isoform X3 OS=Juglans regia OX=51240 GN=LOC109001681 PE=4 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.8e-17
Identity = 59/147 (40.14%), Postives = 80/147 (54.42%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRK------ELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           WTS L K L +L+ + +QKG +      +  WR IC+EF +KT + W+ EQLKN+YAV+R
Sbjct: 24  WTSSLTKILADLMIDLVQKGNRHGHSFGKKAWRYICDEFYKKTGLNWDKEQLKNRYAVLR 83

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETL 120
           +QY+ VKS+  + DFSW  S G I+  DE+   Y+R                    +ETL
Sbjct: 84  RQYVTVKSLLDQRDFSWDESMGTIIGKDEAWTEYIR----------------GHPDAETL 143

Query: 121 YYCGGLFVKKQRIVICPTYEELCIIFS 142
            Y G           CP Y+ELCIIFS
Sbjct: 144 KYTG-----------CPIYKELCIIFS 143

BLAST of MS021020 vs. ExPASy TrEMBL
Match: A0A6P9ELE7 (L10-interacting MYB domain-containing protein-like isoform X1 OS=Juglans regia OX=51240 GN=LOC109001681 PE=4 SV=1)

HSP 1 Score: 99.0 bits (245), Expect = 1.8e-17
Identity = 59/147 (40.14%), Postives = 80/147 (54.42%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGRK------ELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           WTS L K L +L+ + +QKG +      +  WR IC+EF +KT + W+ EQLKN+YAV+R
Sbjct: 51  WTSSLTKILADLMIDLVQKGNRHGHSFGKKAWRYICDEFYKKTGLNWDKEQLKNRYAVLR 110

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVRVCSFISKTHIIVIAVKRLSLSETL 120
           +QY+ VKS+  + DFSW  S G I+  DE+   Y+R                    +ETL
Sbjct: 111 RQYVTVKSLLDQRDFSWDESMGTIIGKDEAWTEYIR----------------GHPDAETL 170

Query: 121 YYCGGLFVKKQRIVICPTYEELCIIFS 142
            Y G           CP Y+ELCIIFS
Sbjct: 171 KYTG-----------CPIYKELCIIFS 170

BLAST of MS021020 vs. TAIR 10
Match: AT4G02210.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2); Has 791 Blast hits to 465 proteins in 19 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 17; Plants - 748; Viruses - 0; Other Eukaryotes - 26 (source: NCBI BLink). )

HSP 1 Score: 42.7 bits (99), Expect = 2.9e-04
Identity = 22/96 (22.92%), Postives = 48/96 (50.00%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGR------KELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           W   + +  ++L+ +Q ++G       ++  W  + N F  K +  ++V+ LKN+Y  +R
Sbjct: 186 WHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSLR 245

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVR 91
           +Q+  +KSI   D F+W     ++   +     Y++
Sbjct: 246 RQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIK 281

BLAST of MS021020 vs. TAIR 10
Match: AT4G02210.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G24960.2). )

HSP 1 Score: 42.7 bits (99), Expect = 2.9e-04
Identity = 22/96 (22.92%), Postives = 48/96 (50.00%), Query Frame = 0

Query: 1   WTSILAKKLVNLISEQLQKGR------KELEWRLICNEFCRKTKMEWNVEQLKNQYAVMR 60
           W   + +  ++L+ +Q ++G       ++  W  + N F  K +  ++V+ LKN+Y  +R
Sbjct: 186 WHPPMDRYFIDLMLDQARRGNQIEGVFRKQAWTEMVNLFNAKFESNFDVDVLKNRYKSLR 245

Query: 61  KQYLIVKSIFYRDDFSWRGSTGVIVTTDESTATYVR 91
           +Q+  +KSI   D F+W     ++   +     Y++
Sbjct: 246 RQFNAIKSILRSDGFAWDNERQMVTADNNVWQDYIK 281

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022145264.13.3e-2651.03uncharacterized protein LOC111014759 isoform X1 [Momordica charantia][more]
XP_022145265.13.3e-2651.03uncharacterized protein LOC111014759 isoform X2 [Momordica charantia][more]
XP_035549342.13.6e-1740.14L10-interacting MYB domain-containing protein-like isoform X3 [Juglans regia] >K... [more]
XP_035549340.13.6e-1740.14L10-interacting MYB domain-containing protein-like isoform X1 [Juglans regia] >K... [more]
XP_035549341.13.6e-1740.14L10-interacting MYB domain-containing protein-like isoform X2 [Juglans regia] >K... [more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1CUQ31.6e-2651.03uncharacterized protein LOC111014759 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1CTZ51.6e-2651.03uncharacterized protein LOC111014759 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A2N9J0Q61.8e-1738.93Myb_DNA-bind_3 domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCU... [more]
A0A6P9F3R01.8e-1740.14L10-interacting MYB domain-containing protein-like isoform X3 OS=Juglans regia O... [more]
A0A6P9ELE71.8e-1740.14L10-interacting MYB domain-containing protein-like isoform X1 OS=Juglans regia O... [more]
Match NameE-valueIdentityDescription
AT4G02210.12.9e-0422.92unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G02210.22.9e-0422.92unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (TR) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 36..56
NoneNo IPR availablePANTHERPTHR47584FAMILY NOT NAMEDcoord: 1..90
NoneNo IPR availablePANTHERPTHR47584:SF9L10-INTERACTING MYB DOMAIN-CONTAINING PROTEIN-LIKEcoord: 1..90
IPR024752Myb/SANT-like domainPFAMPF12776Myb_DNA-bind_3coord: 1..84
e-value: 1.3E-11
score: 45.4

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MS021020.1MS021020.1mRNA