Moc04g20790 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc04g20790
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Location: chr4: 15125947 .. 15129026 (-)
RNA-Seq Expression: Moc04g20790
Synteny: Moc04g20790
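
The Location field packs chromosome, start, end and strand into one string. The sketch below splits it apart and computes the genomic span. It assumes the coordinates are 1-based and inclusive (the usual GFF convention, not stated on this page), and the names `parse_location` and `span_length` are illustrative only.

```python
import re

# Pattern for strings shaped like "chr4: 15125947 .. 15129026 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")

def parse_location(text):
    """Return (chromosome, start, end, strand) from a Location string."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("chr4: 15125947 .. 15129026 (-)")
print(chrom, strand, span_length(start, end))  # chr4 - 3080
```

Under that assumption the gene spans 3080 bp on the minus strand of chr4.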
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTTTGTGACGTGGGGAGTTACCCCATGTCATCCCTCTTCTTCTTCAAGTGTGTTGGAGAAGAGATGAAGGGGTTACAGAACAGAACCAAGAAGAAACCCATCAATTCTCTCTCAAGAGATCTCTAATCTCTCCCTCTCATTCCAAAGAACTGCTCCCACAAGCACGGTCTCGTACCCAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAACTCACTTAAGAAACGTTCTTCAAAGGTTTGGTTTTTCCCCTGTATTTCCTTCTTTGAATTTCAGTTTTGTATCCCGAAAATCAGCAACGAACATCCGTTTCCGCTCCGGGATTCACATCCCTTCAATTGGTATCAGAGCCAGGTTGCAGTTCTGATTTTTTGGGATCAAAATCGCTGTGTTTGGTGGGTGATTTCATATTTTCGAAATATGGGTTTGTAGGTTGGAAAATTTCCGGTTTTTGGCACTTAGATAGCGTTTTTCTTCGTGATTCGATCATGTAAGAGTCGGATGTTTTGGGCATTATGTTTCTACTTTGTTTCTGTAAGTCCGGGTCGCTCGAAAGTCATTCGTGGCGTCGGATTGGGCAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGAAAGCGCCATGGCACTATGCAGCAGTGCCATGGCGCTGTGGGGATAGCACACAGCGCCACGACGCTGCCCTTAGGCGCCGAGGCGCTGTCCCGGGTGTTCTTCAACGCGTTTCCGTGGCTCCGATTCGTGGTTCAAGGGCGGTTGCAGTCGTTTTCTTAATTTTGTATTATTTCATTTTCTTAGGGTGCCAAAAACCATTCCAACTTTGCTATTTAATTATTTTATGCATTATGAATGTATATTTTAATTAAAATCATATTGCATATTGTATGCCATATAGTTTTAAAATCCCACCATAGGTTACATGCATAATATGTATGTTTGATATAGAGTATGTATGTATGGTCATGCATCATATAGTTATAAATGTTATAATGTATGAATGCATGTCTTATTTTCATAATTAATATAAGTGTTGTATTAATTAGATGAAAATAAAGTTGCATTGAGCATGACATTATTTTGGATAATATAATTGTTAAATTCATGTATGCTTTCAATGCATGTTTATAGTTTTAATTTCTTAAATTTTATAAGAGTTATAAAATGTGGTAGATAGAAATTAAAATCTATAATAAAGAGTTGCATCCAAACATAAGGTCTAAGTTAATTAATTTTAAAGGGGTTTAAAATTAATTAGACCTTAGGTTATATCTTCTAATGTGATTAGGGGGTAGTCTTTTATTTTGTTTTAACTAGGTTTAAAATGAGATAATAAAAGACTAAATATAAAATGTTGTCTATAAGGGAACCCTGTCTAAGGAAGGTTCTGTCAAGGTTGAGGTATTTAAGCTGACCGTAAGGGAACACCTCTACCTGGGAACCGACCTGGGGGTTGAATTAGTCAATATTTTACATACAAGCAATAATGTCGTTGGTTTATTAAAGTGTTTAACGAACTAAAACATTATTGTACAACTTCCGATATAATAGTTATATTGGGCCGACTAAAAATTCACTTAGTTAATTTCACTTAGCCGGGATTTATCTAAGTAATATACCCTAGTCTTAGAATACTAAGTGGGAGCAAAAGAGAATATGTAATATACAACGTATATATTTGATATACAAAGTATATATAACATACTTTACTCTCTCTCTCACGTTCTCTTTTAAATTCACGCTGTGAATTCCATGCTCGGCCTCGTGTCGCCCTGGGCGCGGCCTCTCTACAGAAGGTGTTTGCATGGTTCAATATCGAGGTGAATGGAGATAGTGTTCATAGTGAGTGGGAGAAGGACGTGTGACAACACATCCTACGGTCTCCGCCATTGGTTTGCACCGTGAGGTTTCATACATGACCTACGTGTCGTCCTGGAGCGACCATCCCTACGAAGAGTTCATTGTATGGAAATCAAAATCAAGG
AAAACTCCAGAAATGGATAGGAGTCTCTTAGGTTTTGCTCCAATATTTTCCTTCCCTACGGCAGGATTATTAGGGTGGACCTCTGAGGTCCGAAAATGTTGGGTCACACTTACGAGGAGTTGTTAAGTTAGTTAGCAATTTCCTGACCAAATAAACGATGACTAAAATTTATAGGAATAAGAGTTATTCTGGTGTAAAATTTAGTTGAGAATGTCTCAATTCAGTGAAGGAGTACCTGTTCGCCCTACGGTGGCTGTTGCTCTAAATCACTGAAGCGTCGTTGCAAAACAATTTTGTTAGGGTGCTTAATTACTTTTGCTAAAATTTTGATTGGGTTTAAAAGTCTAATGCATGAAACTAATATAGATTTGTTTTTACTTTCAGCATGTCTACTTCTATTATTGCACTCTTAGCCGCGTAAAGACTTAATGGCAAAAATTACAAACAATGGAAGTCAAACCTAACCACCATTCTCGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAAGTCTACATCTTAGCGAGCATATCTGATGTGCTTGCTAAGAAGCACAAGGACACGGTCACCGCTAAGGAGATCATGGATAATATCCAAGGTTATTTTAATGTAATCGGGACAGTATATATACTGGGTATATATTTGATATACCCCTGATATATGAATGGATAATATCCCGTGAATAACCTAAAGGGTCTATAGTATATAGATAAGGCTGGGTACCTTATCCTGGCAACACTATGGATACGGTCCACTCTGTAAAGGTTACAGACGACTTGAGTTACGAGCACTCGTGAAGGATTGACTCGTCATTAATGGTCATATATCCGTGGACACGAAAAAGTTCTGCAGTGAGAAGAGTGCAACTGCGGGTCTTTACTGGAATGACCGATAGTTAACGAATCTTGATTAACTCAGTCAATGAGTTTGACCGATTAATCTCGCATCGTTGGATTTGATCTGTAGGTCCATTAGGTCCCCTGGCTAGCTCATAA

mRNA sequence

ATGTTTTGTGACGTGGGGAGTTACCCCATGTCATCCCTCTTCTTCTTCAAGTGTGTTGGAGAAGAGATGAAGGGAACTGCTCCCACAAGCACGGTCTCGTACCCAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAACTCACTTAAGAAACGTTCTTCAAAGTCCGGGTCGCTCGAAAGTCATTCGTGGCGTCGGATTGGGCAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGAAAGCGCCATGGCACTATGCAGCAGTGCCATGGCGCTGTGGGGATAGCACACAGCGCCACGACGCTGCCCTTAGGCGCCGAGGCGCTGTCCCGGGTGTTCTTCAACGCGTTTCCGTGGCTCCGATTCGTGTGGGAGAAGGACGTGTGACAACACATCCTACGGTCTCCGCCATTGGTTTGCACCGTGAGGTTTCATACATGACCTACGTGTCGTCCTGGAGCGACCATCCCTACGAAGAGTTCATTGTATGGAAATCAAAATCAAGGAAAACTCCAGAAATGGATAGGAGTCTCTTAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAAGTCTACATCTTAGCGAGCATATCTGATGTGCTTGCTAAGAAGCACAAGGACACGGTCACCGCTAAGGAGATCATGGATAATATCCAAGGTCCATTAGGTCCCCTGGCTAGCTCATAA

Coding sequence (CDS)

ATGTTTTGTGACGTGGGGAGTTACCCCATGTCATCCCTCTTCTTCTTCAAGTGTGTTGGAGAAGAGATGAAGGGAACTGCTCCCACAAGCACGGTCTCGTACCCAGAGGATAGCGAGGAAGATCCGGTGGTGGTGTTCGAGGGGAACTCACTTAAGAAACGTTCTTCAAAGTCCGGGTCGCTCGAAAGTCATTCGTGGCGTCGGATTGGGCAAAAGTTGCAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGAAAGCGCCATGGCACTATGCAGCAGTGCCATGGCGCTGTGGGGATAGCACACAGCGCCACGACGCTGCCCTTAGGCGCCGAGGCGCTGTCCCGGGTGTTCTTCAACGCGTTTCCGTGGCTCCGATTCGTGTGGGAGAAGGACGTGTGACAACACATCCTACGGTCTCCGCCATTGGTTTGCACCGTGAGGTTTCATACATGACCTACGTGTCGTCCTGGAGCGACCATCCCTACGAAGAGTTCATTGTATGGAAATCAAAATCAAGGAAAACTCCAGAAATGGATAGGAGTCTCTTAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAAGTCTACATCTTAGCGAGCATATCTGATGTGCTTGCTAAGAAGCACAAGGACACGGTCACCGCTAAGGAGATCATGGATAATATCCAAGGTCCATTAGGTCCCCTGGCTAGCTCATAA

Protein sequence

MFCDVGSYPMSSLFFFKCVGEEMKGTAPTSTVSYPEDSEEDPVVVFEGNSLKKRSSKSGSLESHSWRRIGQKLQKTAKKTKQTAQKAPWHYAAVPWRCGDSTQRHDAALRRRGAVPGVLQRVSVAPIRVGEGRVTTHPTVSAIGLHREVSYMTYVSSWSDHPYEEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHKDTVTAKEIMDNIQGPLGPLASS
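
The protein sequence follows from the CDS by codon-wise translation. The sketch below is a minimal illustration that assumes the standard nuclear genetic code. The page does not say how the database derived the protein, and `translate` is a hypothetical helper. It is applied here only to the first 30 and last 12 nucleotides of the CDS listed above.

```python
# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored; characters
    other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

print(translate("ATGTTTTGTGACGTGGGGAGTTACCCCATG"))  # MFCDVGSYPM
print(translate("GCTAGCTCATAA"))                    # ASS*
```

Both fragments agree with the start (MFCDVGSYPM) and the end (ASS, followed by the stop codon) of the protein sequence above.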
Homology
BLAST of Moc04g20790 vs. NCBI nr
Match: XP_022154837.1 (uncharacterized protein LOC111022000 [Momordica charantia])

HSP 1 Score: 133.3 bits (334), Expect = 3.1e-27
Identity = 63/77 (81.82%), Positives = 71/77 (92.21%), Query Frame = 0

Query: 183 LLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHKDTVT 242
           ++DDLRFVLQEDCPQAPAPNAT+AVRN YDRWIKANDKAKVYIL+SISDVLAKKH+DTVT
Sbjct: 1   MIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWIKANDKAKVYILSSISDVLAKKHEDTVT 60

Query: 243 AKEIMDNIQGPLGPLAS 260
           AKEIMD++Q   G  +S
Sbjct: 61  AKEIMDSLQSMFGQPSS 77
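
The percentages in an HSP header are the match count divided by the alignment length. The sketch below recomputes them from the fractions reported for this HSP (63/77 identical, 71/77 similar). The helper names `percent` and `parse_identity` are illustrative and not part of any BLAST tool.

```python
import re

def percent(matches, length):
    """Percentage of aligned positions that match, rounded to 2 decimals."""
    return round(100.0 * matches / length, 2)

def parse_identity(line):
    """Extract (matches, alignment_length) from an 'Identity = a/b' field."""
    match = re.search(r"Identity = (\d+)/(\d+)", line)
    if match is None:
        raise ValueError("no Identity field found")
    return int(match.group(1)), int(match.group(2))

identical, length = parse_identity("Identity = 63/77 (81.82%)")
print(percent(identical, length))  # 81.82
print(percent(71, 77))             # 92.21
```

The same arithmetic reproduces the figures in the other HSP headers on this page, for example 64/92 gives 69.57.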

BLAST of Moc04g20790 vs. NCBI nr
Match: XP_022158197.1 (uncharacterized protein LOC111024734 [Momordica charantia])

HSP 1 Score: 127.9 bits (320), Expect = 1.3e-25
Identity = 64/92 (69.57%), Positives = 71/92 (77.17%), Query Frame = 0

Query: 164 EEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKV 223
           E +  WKS           ++DDLRFVLQEDCPQAP  NATVAVRN YDRWIK+NDKAKV
Sbjct: 17  ENYRQWKSNLNTI-----LVIDDLRFVLQEDCPQAPVSNATVAVRNAYDRWIKSNDKAKV 76

Query: 224 YILASISDVLAKKHKDTVTAKEIMDNIQGPLG 256
           YILASISDVLAKKH+DTVT KEIMD++Q   G
Sbjct: 77  YILASISDVLAKKHEDTVTTKEIMDSLQSMFG 103

BLAST of Moc04g20790 vs. NCBI nr
Match: XP_022152352.1 (uncharacterized protein LOC111020095 [Momordica charantia])

HSP 1 Score: 127.5 bits (319), Expect = 1.7e-25
Identity = 65/96 (67.71%), Positives = 74/96 (77.08%), Query Frame = 0

Query: 164 EEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKV 223
           E +  WKS           ++DDL+FVLQEDCPQA APNATVAVR  YDRWIKANDKAKV
Sbjct: 17  ENYKQWKSNLNTI-----LVIDDLKFVLQEDCPQASAPNATVAVRIAYDRWIKANDKAKV 76

Query: 224 YILASISDVLAKKHKDTVTAKEIMDNIQGPLGPLAS 260
           YILASISDVLAKKH+DT+TAKEIMD++Q   G  +S
Sbjct: 77  YILASISDVLAKKHEDTITAKEIMDSLQSMFGQPSS 107

BLAST of Moc04g20790 vs. NCBI nr
Match: XP_022158062.1 (uncharacterized protein LOC111024637 [Momordica charantia])

HSP 1 Score: 125.9 bits (315), Expect = 4.9e-25
Identity = 64/96 (66.67%), Positives = 74/96 (77.08%), Query Frame = 0

Query: 164 EEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKV 223
           E +  WKS           ++DDLRFVLQEDCPQAPAPNATVAVRN+YDRWIKANDKAKV
Sbjct: 17  ENYKQWKSNINTI-----LMIDDLRFVLQEDCPQAPAPNATVAVRNIYDRWIKANDKAKV 76

Query: 224 YILASISDVLAKKHKDTVTAKEIMDNIQGPLGPLAS 260
            ILASISDVLAKKH+++V  KEIMD++Q   G  +S
Sbjct: 77  DILASISDVLAKKHENSVITKEIMDSLQSMFGQPSS 107

BLAST of Moc04g20790 vs. NCBI nr
Match: KAA0055183.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK25777.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 109.4 bits (272), Expect = 4.8e-20
Identity = 61/148 (41.22%), Positives = 85/148 (57.43%), Query Frame = 0

Query: 134 VTTHPTVSAIGLHREVSYMTYVSSWSDHPYEEFIVWKSKSRKTPEMDR------------ 193
           +T HPTVS+IG HRE+  M  +       + E  + + ++ +TP +DR            
Sbjct: 1   MTIHPTVSSIGPHREIPIMRLLVCPGVTCHSEGCIIELETPRTPYLDRLDSSIVASKKLN 60

Query: 194 --------------SLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILA 253
                          ++DDLRFVL E+CPQ+   NA  A R  YDRWIK N+KA+VYILA
Sbjct: 61  GDNYEAWKSNLNTILVVDDLRFVLIEECPQSAVLNANRANRKAYDRWIKVNEKARVYILA 120

Query: 254 SISDVLAKKHKDTVTAKEIMDNIQGPLG 256
           ++SD+LAKKH+   T KEIMD+++G  G
Sbjct: 121 NMSDILAKKHESLATDKEIMDSLKGMFG 148

BLAST of Moc04g20790 vs. ExPASy TrEMBL
Match: A0A6J1DMS3 (uncharacterized protein LOC111022000 OS=Momordica charantia OX=3673 GN=LOC111022000 PE=4 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 1.5e-27
Identity = 63/77 (81.82%), Positives = 71/77 (92.21%), Query Frame = 0

Query: 183 LLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHKDTVT 242
           ++DDLRFVLQEDCPQAPAPNAT+AVRN YDRWIKANDKAKVYIL+SISDVLAKKH+DTVT
Sbjct: 1   MIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWIKANDKAKVYILSSISDVLAKKHEDTVT 60

Query: 243 AKEIMDNIQGPLGPLAS 260
           AKEIMD++Q   G  +S
Sbjct: 61  AKEIMDSLQSMFGQPSS 77

BLAST of Moc04g20790 vs. ExPASy TrEMBL
Match: A0A6J1DWL0 (uncharacterized protein LOC111024734 OS=Momordica charantia OX=3673 GN=LOC111024734 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 6.3e-26
Identity = 64/92 (69.57%), Positives = 71/92 (77.17%), Query Frame = 0

Query: 164 EEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKV 223
           E +  WKS           ++DDLRFVLQEDCPQAP  NATVAVRN YDRWIK+NDKAKV
Sbjct: 17  ENYRQWKSNLNTI-----LVIDDLRFVLQEDCPQAPVSNATVAVRNAYDRWIKSNDKAKV 76

Query: 224 YILASISDVLAKKHKDTVTAKEIMDNIQGPLG 256
           YILASISDVLAKKH+DTVT KEIMD++Q   G
Sbjct: 77  YILASISDVLAKKHEDTVTTKEIMDSLQSMFG 103

BLAST of Moc04g20790 vs. ExPASy TrEMBL
Match: A0A6J1DFZ2 (uncharacterized protein LOC111020095 OS=Momordica charantia OX=3673 GN=LOC111020095 PE=4 SV=1)

HSP 1 Score: 127.5 bits (319), Expect = 8.2e-26
Identity = 65/96 (67.71%), Positives = 74/96 (77.08%), Query Frame = 0

Query: 164 EEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKV 223
           E +  WKS           ++DDL+FVLQEDCPQA APNATVAVR  YDRWIKANDKAKV
Sbjct: 17  ENYKQWKSNLNTI-----LVIDDLKFVLQEDCPQASAPNATVAVRIAYDRWIKANDKAKV 76

Query: 224 YILASISDVLAKKHKDTVTAKEIMDNIQGPLGPLAS 260
           YILASISDVLAKKH+DT+TAKEIMD++Q   G  +S
Sbjct: 77  YILASISDVLAKKHEDTITAKEIMDSLQSMFGQPSS 107

BLAST of Moc04g20790 vs. ExPASy TrEMBL
Match: A0A6J1DW68 (uncharacterized protein LOC111024637 OS=Momordica charantia OX=3673 GN=LOC111024637 PE=4 SV=1)

HSP 1 Score: 125.9 bits (315), Expect = 2.4e-25
Identity = 64/96 (66.67%), Positives = 74/96 (77.08%), Query Frame = 0

Query: 164 EEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKV 223
           E +  WKS           ++DDLRFVLQEDCPQAPAPNATVAVRN+YDRWIKANDKAKV
Sbjct: 17  ENYKQWKSNINTI-----LMIDDLRFVLQEDCPQAPAPNATVAVRNIYDRWIKANDKAKV 76

Query: 224 YILASISDVLAKKHKDTVTAKEIMDNIQGPLGPLAS 260
            ILASISDVLAKKH+++V  KEIMD++Q   G  +S
Sbjct: 77  DILASISDVLAKKHENSVITKEIMDSLQSMFGQPSS 107

BLAST of Moc04g20790 vs. ExPASy TrEMBL
Match: A0A5D3DEL0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold482G00020 PE=4 SV=1)

HSP 1 Score: 109.4 bits (272), Expect = 2.3e-20
Identity = 57/111 (51.35%), Positives = 72/111 (64.86%), Query Frame = 0

Query: 145 LHREVSYMTYVSSWSDHPYEEFIVWKSKSRKTPEMDRSLLDDLRFVLQEDCPQAPAPNAT 204
           LH E   M+Y         +++  WKS           ++DDLRFVL E+CPQ PA NA 
Sbjct: 55  LHPESVCMSYEQVSKKLNDDKYAAWKSNLNTI-----LVVDDLRFVLTEECPQTPASNAN 114

Query: 205 VAVRNVYDRWIKANDKAKVYILASISDVLAKKHKDTVTAKEIMDNIQGPLG 256
            A R  YDRWIKAN+KA+VYILAS+SDVLA+KH+   TAKEIMD+++G  G
Sbjct: 115 RASRKAYDRWIKANEKARVYILASMSDVLARKHESLATAKEIMDSLKGMFG 160

The following BLAST results are available for this feature:
NCBI nr
Match Name        E-value    Identity    Description
XP_022154837.1    3.1e-27    81.82       uncharacterized protein LOC111022000 [Momordica charantia]
XP_022158197.1    1.3e-25    69.57       uncharacterized protein LOC111024734 [Momordica charantia]
XP_022152352.1    1.7e-25    67.71       uncharacterized protein LOC111020095 [Momordica charantia]
XP_022158062.1    4.9e-25    66.67       uncharacterized protein LOC111024637 [Momordica charantia]
KAA0055183.1      4.8e-20    41.22       gag/pol protein [Cucumis melo var. makuwa] >TYK25777.1 gag/pol protein [Cucumis ...

ExPASy TrEMBL
Match Name        E-value    Identity    Description
A0A6J1DMS3        1.5e-27    81.82       uncharacterized protein LOC111022000 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A6J1DWL0        6.3e-26    69.57       uncharacterized protein LOC111024734 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1DFZ2        8.2e-26    67.71       uncharacterized protein LOC111020095 OS=Momordica charantia OX=3673 GN=LOC111020...
A0A6J1DW68        2.4e-25    66.67       uncharacterized protein LOC111024637 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A5D3DEL0        2.3e-20    51.35       Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold482G0002...
Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
Moc04g20790.1    Moc04g20790.1    mRNA