Moc03g21020 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc03g21020
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Enzymatic polyprotein
Location: chr3: 14349547 .. 14352019 (+)
RNA-Seq Expression: Moc03g21020
Synteny: Moc03g21020
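
As a quick sanity check on the Location field, the genomic span can be computed from the two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the GFF3 convention); the page itself does not state this. `parse_location` is a hypothetical helper written for this illustration, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr3: 14349547 .. 14352019 (+)'."""
    m = re.match(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("chr3: 14349547 .. 14352019 (+)")

# Assumption: both endpoints are included, so the length is end - start + 1.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the locus covers 2473 bp on the plus strand of chr3.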
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCGGAAGATGAGCAACTACAAATACCGTATCTATTTTTTGAAGGACTTCCGCATTACATTGCTCAAAGATTCTATCAAGTGGCAGCAGCAAACTCTACAACCAATCAGATCAATTGGTCAGAGTTAACCATTGGGGATATTACTGCTACAGTTCAAGGGATATGCGTTAATCTCTGCCTAGAGAATAAGCATATTACTAAAGTTATCAAAGACCCCGACTACCGTAAGGAATTGGGAACTTTCTGCAAACAATACGGTCTTGACTACAGACCTAAAGATGAGAGAAAGAAAAGAAAGAAATCTTCCAACAAACAACTCTTTAGCAAAAGTAAGTCAAAGGATCCTGAATTTCCTCGGCACAACAGGAGCAAGGGTAAGAGGAATTATTCGAAGAATCATTCTCACAAAACCTCTGTTACTTGCTACAAATGCAATCGCAAAGGACATTATTCTAGCAAATGCCCTTTGAAGGACAAGATCAATTCTCTAACCATAGATGAAGATACACGTCAGTCTCTTCTCTATGCCATTATAAGTGAAGAAGAATATTCTTCTCATTCCGAGTCTTCAACCGATGATGATGGTATCAACCTCATAAATGAATAAGACTCGAGTGAAGAAACCTTCTTCTCTCAAAGTAGCTCCTCTAAGGATAACTTTAGTAGTATGCTTATTCTCTAGGCAGAGATTAACACATATCCCTTGAACTGTGGCAGTAATATCCCCAATGGTTAACTCTGACCAATTGATCTGATTGGTTGTAGAGTTTGCTGCTGCCACTTGATAGAATCTTTGAGCAATGTAATGCGGAAGTCCTTCAACAAATTTTTATTTCTAGATATCTTCTCCGCATGACGTAATAGTATAAAGACATGTTAGGAATGTGTCTTTATACCATTTGTAGTTGCTCATCTTCCGACATCGGAGTCCTAATAGTGCTTCGGTACTGAGGTCGGTATGGATTTTAGTGCTACCAACAAAATTCTTCGTCATTGCATAAACTAATTGATTTACCATGTCCGGTTCTTCAATTTGCATGGCGTTTTGTCCTACTTGTTTGACAATAGTCTTAGTTGCCGTCAAGATTTTTTGTCTATCCTCTTCTATGAGTTGGTTATGCCACCAACTTCTTAGGTTTCCAGAGAGACTTGAGAGTAGGATATGGACAGTTTGTAGAACTAGCGGCCATCATCATTTCTTGAAAAGTATTCAGCATTTGAGCTTTAGAATATCCGTTGATATTCTAGGTAACTATTGATGTTCCGTCATAGGCTCGTTGATCATGGCGGATATCATCCCATCCTAAATCAGGAGGAGATTATTGAGGATAATGATTTTTCATTTTCGTATGCATAGTTACTGGTAGAATTGTTGAAGATGAGGCTTGAGTTGTTGTAGGTATTGTAGCAACTACGTTTATACTCTTGGCAACATTAATCTTCTTAGAAGAGTCTCCTCTATTGATGGACAAAGAAGAAAGTCTTTTGTTGATCTCGAGTAAGAGATCGGAGGGGTCTTCCTTGAGAGTTCCAATCTTGAAACTGTTAGGTTGAGAAATAGGCTGGCAGGGGTCCACCTGTGGGATCTCTGGGTTCTTAGAGATGATAGGAAGAACCGGGTTCTCAATTCTTTCAACCGCTTTAGAAACTTCAAAAAGAATCTTGTTTGAGTAATTGAGTTGATGTTGGATGTTTTTGATTTCACGAACACCAACTTTCTGAACTTTGTCTTCATCTATCGTCTTATAGGACAAAGACACCATTTTTATGGTCGGTATGACTGAGTGAGAAAAGGTAGCTTCTTCTTCGGGAGGGAAGGTGGAAGCTATTTCTTTACCCCTTGCAGTGACCCAAACAGTGCCGTTTCTAATGACTCATTGGGCTTTAAACGTCTCTACGAGAGCTTCATTTCTAGCAAGCTCTTTTTGTTGTAATTTTGAGGTGAAATCTCATATACTCATGACAGGTTTCCTTGTCTCTTTTGGTGCGGTAATCCAC
ATATCGATGTATTTGTCGTACAATTCTTCATAACGTCTTTCAATTTTTGAAATAACGTTTATTAGGTTGAATGTTGATTCAGTTCTCCTTTCCATGTCAGATTGGGTAGGAGAGAGCGATGCTTCTTCATAATGAACTTCAGGGATAGGATGGATAAAGTCTACCGGAGCTCGCATTGATTCAGATCTTTTAAAAGATTTGTTGATAGAGTTTACCGACCTTGTGTCTGAAGATATGCTTGGGCGGCTACTCATGACTTCTCTAATCTTGGGATATCTTGCCTCAGAATTAAATTGTATTTCAACGTTTCCATCTGGAATTCGATGATTGACGCCTCTGTTCTAGTTCTTTTAGCGGGGGATGTTACTCCTTGCAGTCTCCAGATAGGGTTTCTGGAAATTTCATCCCAGCGAAGTGTCTTTGGAATGGTCATCAAAGATTTTTCAACAATGACTTCCATCAGCATGGTGTAA

mRNA sequence

ATGTCGGAAGATGAGCAACTACAAATACCGTATCTATTTTTTGAAGGACTTCCGCATTACATTGCTCAAAGATTCTATCAAGTGGCAGCAGCAAACTCTACAACCAATCAGATCAATTGGTCAGAGTTAACCATTGGGGATATTACTGCTACAGTTCAAGGGATATGCGTTAATCTCTGCCTAGAGAATAAGCATATTACTAAAGTTATCAAAGACCCCGACTACCGTAAGGAATTGGGAACTTTCTGCAAACAATACGGTCTTGACTACAGACCTAAAGATGAGAGAAAGAAAAGAAAGAAATCTTCCAACAAACAACTCTTTAGCAAAAGTAAGTCAAAGGATCCTGAATTTCCTCGGCACAACAGGAGCAAGGGTAAGAGGAATTATTCGAAGAATCATTCTCACAAAACCTCTGTTACTTGCTACAAATGCAATCGCAAAGGACATTATTCTAGCAAATGCCCTTTGAAGGACAAGATCAATTCTCTAACCATAGATGAAGATACACGTCAGTCTCTTCTCTATGCCATTATAAGTGAAGAAGAATATTCTTCTCATTCCGAGTCTTCAACCGATGATGATGTCTTAGTTGCCGTCAAGATTTTTTGTCTATCCTCTTCTATGAGTTGGTTATGCCACCAACTTCTTAGGTTTCCAGAGAGACTTGAGAGTAGGATATGGACAGTTTGTAGAACTAGCGGCCATCATCATTTCTTGAAAAGTATTCAGCATTTGAGCTTTAGAATATCCGTTGATATTCTAGGTATTGTAGCAACTACGTTTATACTCTTGGCAACATTAATCTTCTTAGAAGAGTCTCCTCTATTGATGGACAAAGAAGAAAGTCTTTTGTTGATCTCGAGTAAGAGATCGGAGGGGTCTTCCTTGAGAGTTCCAATCTTGAAACTATATGCTTGGGCGGCTACTCATGACTTCTCTAATCTTGGGATATCTTGCCTCAGAATTAAATTGTATTTCAACGTTTCCATCTGGAATTCGATGATTGACGCCTCTGTTCTAGTTCTTTTAGCGGGGGATGTTACTCCTTGCAGTCTCCAGATAGGGTTTCTGGAAATTTCATCCCAGCGAAGTGTCTTTGGAATGGTCATCAAAGATTTTTCAACAATGACTTCCATCAGCATGGTGTAA

Coding sequence (CDS)

ATGTCGGAAGATGAGCAACTACAAATACCGTATCTATTTTTTGAAGGACTTCCGCATTACATTGCTCAAAGATTCTATCAAGTGGCAGCAGCAAACTCTACAACCAATCAGATCAATTGGTCAGAGTTAACCATTGGGGATATTACTGCTACAGTTCAAGGGATATGCGTTAATCTCTGCCTAGAGAATAAGCATATTACTAAAGTTATCAAAGACCCCGACTACCGTAAGGAATTGGGAACTTTCTGCAAACAATACGGTCTTGACTACAGACCTAAAGATGAGAGAAAGAAAAGAAAGAAATCTTCCAACAAACAACTCTTTAGCAAAAGTAAGTCAAAGGATCCTGAATTTCCTCGGCACAACAGGAGCAAGGGTAAGAGGAATTATTCGAAGAATCATTCTCACAAAACCTCTGTTACTTGCTACAAATGCAATCGCAAAGGACATTATTCTAGCAAATGCCCTTTGAAGGACAAGATCAATTCTCTAACCATAGATGAAGATACACGTCAGTCTCTTCTCTATGCCATTATAAGTGAAGAAGAATATTCTTCTCATTCCGAGTCTTCAACCGATGATGATGTCTTAGTTGCCGTCAAGATTTTTTGTCTATCCTCTTCTATGAGTTGGTTATGCCACCAACTTCTTAGGTTTCCAGAGAGACTTGAGAGTAGGATATGGACAGTTTGTAGAACTAGCGGCCATCATCATTTCTTGAAAAGTATTCAGCATTTGAGCTTTAGAATATCCGTTGATATTCTAGGTATTGTAGCAACTACGTTTATACTCTTGGCAACATTAATCTTCTTAGAAGAGTCTCCTCTATTGATGGACAAAGAAGAAAGTCTTTTGTTGATCTCGAGTAAGAGATCGGAGGGGTCTTCCTTGAGAGTTCCAATCTTGAAACTATATGCTTGGGCGGCTACTCATGACTTCTCTAATCTTGGGATATCTTGCCTCAGAATTAAATTGTATTTCAACGTTTCCATCTGGAATTCGATGATTGACGCCTCTGTTCTAGTTCTTTTAGCGGGGGATGTTACTCCTTGCAGTCTCCAGATAGGGTTTCTGGAAATTTCATCCCAGCGAAGTGTCTTTGGAATGGTCATCAAAGATTTTTCAACAATGACTTCCATCAGCATGGTGTAA

Protein sequence

MSEDEQLQIPYLFFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKDPDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPRHNRSKGKRNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSHSESSTDDDVLVAVKIFCLSSSMSWLCHQLLRFPERLESRIWTVCRTSGHHHFLKSIQHLSFRISVDILGIVATTFILLATLIFLEESPLLMDKEESLLLISSKRSEGSSLRVPILKLYAWAATHDFSNLGISCLRIKLYFNVSIWNSMIDASVLVLLAGDVTPCSLQIGFLEISSQRSVFGMVIKDFSTMTSISMV
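
The relationship between the CDS and the protein sequence above can be illustrated with a small translation sketch. It assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper, and only the first ten codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption:
# the page does not say which translation table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
first_codons = "ATGTCGGAAGATGAGCAACTACAAATACCG"
print(translate(first_codons))
```

The result, MSEDEQLQIP, agrees with the first ten residues of the protein sequence. Passing the full CDS string to `translate` should reproduce the whole polypeptide if the same genetic code applies.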
Homology
BLAST of Moc03g21020 vs. NCBI nr
Match: XP_022151716.1 (uncharacterized protein LOC111019629 [Momordica charantia])

HSP 1 Score: 289.3 bits (739), Expect = 5.0e-74
Identity = 147/188 (78.19%), Positives = 164/188 (87.23%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYIAQ+FYQ    NSTTN+I+W+ELTIGDI AT+Q ICVNLCLENKH  KVIK+
Sbjct: 318 FVEGLPHYIAQKFYQTVVTNSTTNRIDWAELTIGDINATIQQICVNLCLENKHTAKVIKE 377

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPR-----HNRSKGK 132
           PDYRKELGTFCKQYGLD R ++ERKK+KKSSNK+LFSKSKSKD E PR     +NR+KGK
Sbjct: 378 PDYRKELGTFCKQYGLDDRSEEERKKKKKSSNKRLFSKSKSKDSELPRRKRKYYNRNKGK 437

Query: 133 RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH 192
           ++YSKN  HK+SVTCYKCNRKGHYSSKCPLKDKINSLTIDE TR+SLLYAI SEEE SS 
Sbjct: 438 KDYSKNRPHKSSVTCYKCNRKGHYSSKCPLKDKINSLTIDEKTRRSLLYAIRSEEENSSS 497

Query: 193 SESSTDDD 196
           SESSTD+D
Sbjct: 498 SESSTDND 505
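
The percentages in each HSP header are simply the matched column count divided by the alignment length. A minimal sketch that recomputes them from a header line follows; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the `name = matched/length` layout used on this page.

```python
import re

def parse_hsp_stats(line):
    """Return {statistic name: percentage} for every 'name = x/y' pair in an HSP header."""
    stats = {}
    for name, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[name] = round(100 * int(matched) / int(length), 2)
    return stats

line = "Identity = 147/188 (78.19%), Positives = 164/188 (87.23%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

Fields without a fraction, such as `Query Frame = 0`, are skipped because the pattern requires the `x/y` form.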

BLAST of Moc03g21020 vs. NCBI nr
Match: KAA0058433.1 (Enzymatic polyprotein [Cucumis melo var. makuwa] >TYJ96695.1 Enzymatic polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 233.0 bits (593), Expect = 4.2e-57
Identity = 118/187 (63.10%), Positives = 150/187 (80.21%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGL HYI+++FYQ  A NS T QINW ELT GDI+AT+Q IC+ L  ENKH TKVIKD
Sbjct: 154 FVEGLSHYISKKFYQTVATNSVTKQINWVELTYGDISATLQAICIILYTENKHTTKVIKD 213

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPR-----HNRSKGK 132
            DY KEL TFCKQYGL+  PK+E+KK+KKSS+K+LFS+SK+KDPEFPR     +N+ KGK
Sbjct: 214 SDYCKELRTFCKQYGLNQGPKEEKKKKKKSSSKRLFSRSKAKDPEFPRRKRKYYNKIKGK 273

Query: 133 RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH 192
           R+Y      KT+  C+KCNRKGHY+++CPLKDKIN+LT+DE+TRQS+LYAI S+ + SS 
Sbjct: 274 RHYPS----KTNNACFKCNRKGHYANRCPLKDKINALTMDEETRQSILYAIRSDNDTSSE 333

Query: 193 SESSTDD 195
           ++SST++
Sbjct: 334 TDSSTEE 336

BLAST of Moc03g21020 vs. NCBI nr
Match: KAA0057417.1 (Enzymatic polyprotein [Cucumis melo var. makuwa] >TYK30116.1 Enzymatic polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 231.1 bits (588), Expect = 1.6e-56
Identity = 115/190 (60.53%), Positives = 148/190 (77.89%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYI+Q+FYQ    NS   QI+W+ LT GDI++TVQ ICVNLC ENKH TKVIKD
Sbjct: 689 FVEGLPHYISQKFYQTMTENSVNQQIDWANLTYGDISSTVQMICVNLCTENKHTTKVIKD 748

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPR-----HNRSKGK 132
            DYRKELGTFCKQYGL   PK+E+KK+KK S+K+ F +SK KD E PR     +N+ KGK
Sbjct: 749 SDYRKELGTFCKQYGLSQGPKEEKKKKKKYSSKKFFRRSKPKDQESPRRRKHHYNKGKGK 808

Query: 133 RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH 192
           + YS     KT+  C+KCN+KGHY+++CPL+DKIN+LTIDE T+QS+LYAI S+++ SS 
Sbjct: 809 KRYSS----KTNTICFKCNQKGHYANRCPLQDKINALTIDEKTKQSILYAIRSDDDTSSQ 868

Query: 193 SESSTDDDVL 198
           +ESS+++D +
Sbjct: 869 TESSSEEDYI 874

BLAST of Moc03g21020 vs. NCBI nr
Match: KAA0056776.1 (Enzymatic polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 229.2 bits (583), Expect = 6.1e-56
Identity = 117/189 (61.90%), Positives = 149/189 (78.84%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYI+Q+FYQ   ANS   QI+W+ LT GDI++TVQ ICVNLC ENKH TKVIKD
Sbjct: 798 FVEGLPHYISQKFYQTMTANSVNQQIDWANLTYGDISSTVQMICVNLCTENKHTTKVIKD 857

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKS-SNKQLFSKSKSKDPEFPRHNR---SKGKR 132
            DYRKELGTFCKQYGL   PK+E+KK+KK  S+K+ F KSK+KD E PR  R   +KGK 
Sbjct: 858 SDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSKKFFRKSKAKDQESPRRRRRHYNKGKS 917

Query: 133 NYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSHS 192
              K +S KT   C+KCN+KGHY+++CPLKDKIN++TIDE+T+QSLLYAI S+++ +S +
Sbjct: 918 --KKGYSSKTHTICFKCNQKGHYANRCPLKDKINAMTIDEETKQSLLYAIRSDDDTTSQT 977

Query: 193 ESSTDDDVL 198
           ESS+++D +
Sbjct: 978 ESSSEEDYI 984

BLAST of Moc03g21020 vs. NCBI nr
Match: TYJ97599.1 (Enzymatic polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 227.6 bits (579), Expect = 1.8e-55
Identity = 115/187 (61.50%), Positives = 146/187 (78.07%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYI+Q+FYQ   ANS   QI+W+ LT GDI++TVQ ICVNLC ENKH TKVIKD
Sbjct: 798 FVEGLPHYISQKFYQTMTANSVNQQIDWANLTYGDISSTVQMICVNLCTENKHTTKVIKD 857

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKS-SNKQLFSKSKSKDPEFP-RHNRSKGKRNY 132
            DYRKELGTFCKQYGL   PK+E+KK+KK  S+K+ F KSK+KD E P R  R   K   
Sbjct: 858 SDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSKKFFRKSKTKDQESPQRRKRHYNKGKS 917

Query: 133 SKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSHSES 192
            K +S KT   C+KCN+KGHY+++CPLKDKIN++TIDE+T+QSLLYAI S+++ +S +ES
Sbjct: 918 KKGYSSKTHTICFKCNQKGHYANRCPLKDKINAMTIDEETKQSLLYAIRSDDDTTSQTES 977

Query: 193 STDDDVL 198
           S+++D +
Sbjct: 978 SSEEDYI 984

BLAST of Moc03g21020 vs. ExPASy TrEMBL
Match: A0A6J1DFI7 (uncharacterized protein LOC111019629 OS=Momordica charantia OX=3673 GN=LOC111019629 PE=4 SV=1)

HSP 1 Score: 289.3 bits (739), Expect = 2.4e-74
Identity = 147/188 (78.19%), Positives = 164/188 (87.23%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYIAQ+FYQ    NSTTN+I+W+ELTIGDI AT+Q ICVNLCLENKH  KVIK+
Sbjct: 318 FVEGLPHYIAQKFYQTVVTNSTTNRIDWAELTIGDINATIQQICVNLCLENKHTAKVIKE 377

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPR-----HNRSKGK 132
           PDYRKELGTFCKQYGLD R ++ERKK+KKSSNK+LFSKSKSKD E PR     +NR+KGK
Sbjct: 378 PDYRKELGTFCKQYGLDDRSEEERKKKKKSSNKRLFSKSKSKDSELPRRKRKYYNRNKGK 437

Query: 133 RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH 192
           ++YSKN  HK+SVTCYKCNRKGHYSSKCPLKDKINSLTIDE TR+SLLYAI SEEE SS 
Sbjct: 438 KDYSKNRPHKSSVTCYKCNRKGHYSSKCPLKDKINSLTIDEKTRRSLLYAIRSEEENSSS 497

Query: 193 SESSTDDD 196
           SESSTD+D
Sbjct: 498 SESSTDND 505

BLAST of Moc03g21020 vs. ExPASy TrEMBL
Match: A0A5A7URD4 (Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold535G00110 PE=4 SV=1)

HSP 1 Score: 233.0 bits (593), Expect = 2.0e-57
Identity = 118/187 (63.10%), Positives = 150/187 (80.21%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGL HYI+++FYQ  A NS T QINW ELT GDI+AT+Q IC+ L  ENKH TKVIKD
Sbjct: 154 FVEGLSHYISKKFYQTVATNSVTKQINWVELTYGDISATLQAICIILYTENKHTTKVIKD 213

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPR-----HNRSKGK 132
            DY KEL TFCKQYGL+  PK+E+KK+KKSS+K+LFS+SK+KDPEFPR     +N+ KGK
Sbjct: 214 SDYCKELRTFCKQYGLNQGPKEEKKKKKKSSSKRLFSRSKAKDPEFPRRKRKYYNKIKGK 273

Query: 133 RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH 192
           R+Y      KT+  C+KCNRKGHY+++CPLKDKIN+LT+DE+TRQS+LYAI S+ + SS 
Sbjct: 274 RHYPS----KTNNACFKCNRKGHYANRCPLKDKINALTMDEETRQSILYAIRSDNDTSSE 333

Query: 193 SESSTDD 195
           ++SST++
Sbjct: 334 TDSSTEE 336

BLAST of Moc03g21020 vs. ExPASy TrEMBL
Match: A0A5A7URX9 (Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold216G00980 PE=4 SV=1)

HSP 1 Score: 231.1 bits (588), Expect = 7.8e-57
Identity = 115/190 (60.53%), Positives = 148/190 (77.89%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYI+Q+FYQ    NS   QI+W+ LT GDI++TVQ ICVNLC ENKH TKVIKD
Sbjct: 689 FVEGLPHYISQKFYQTMTENSVNQQIDWANLTYGDISSTVQMICVNLCTENKHTTKVIKD 748

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKSSNKQLFSKSKSKDPEFPR-----HNRSKGK 132
            DYRKELGTFCKQYGL   PK+E+KK+KK S+K+ F +SK KD E PR     +N+ KGK
Sbjct: 749 SDYRKELGTFCKQYGLSQGPKEEKKKKKKYSSKKFFRRSKPKDQESPRRRKHHYNKGKGK 808

Query: 133 RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH 192
           + YS     KT+  C+KCN+KGHY+++CPL+DKIN+LTIDE T+QS+LYAI S+++ SS 
Sbjct: 809 KRYSS----KTNTICFKCNQKGHYANRCPLQDKINALTIDEKTKQSILYAIRSDDDTSSQ 868

Query: 193 SESSTDDDVL 198
           +ESS+++D +
Sbjct: 869 TESSSEEDYI 874

BLAST of Moc03g21020 vs. ExPASy TrEMBL
Match: A0A5A7UR29 (Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold486G00660 PE=4 SV=1)

HSP 1 Score: 229.2 bits (583), Expect = 3.0e-56
Identity = 117/189 (61.90%), Positives = 149/189 (78.84%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYI+Q+FYQ   ANS   QI+W+ LT GDI++TVQ ICVNLC ENKH TKVIKD
Sbjct: 798 FVEGLPHYISQKFYQTMTANSVNQQIDWANLTYGDISSTVQMICVNLCTENKHTTKVIKD 857

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKS-SNKQLFSKSKSKDPEFPRHNR---SKGKR 132
            DYRKELGTFCKQYGL   PK+E+KK+KK  S+K+ F KSK+KD E PR  R   +KGK 
Sbjct: 858 SDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSKKFFRKSKAKDQESPRRRRRHYNKGKS 917

Query: 133 NYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSHS 192
              K +S KT   C+KCN+KGHY+++CPLKDKIN++TIDE+T+QSLLYAI S+++ +S +
Sbjct: 918 --KKGYSSKTHTICFKCNQKGHYANRCPLKDKINAMTIDEETKQSLLYAIRSDDDTTSQT 977

Query: 193 ESSTDDDVL 198
           ESS+++D +
Sbjct: 978 ESSSEEDYI 984

BLAST of Moc03g21020 vs. ExPASy TrEMBL
Match: A0A5D3BEY3 (Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold690G00300 PE=4 SV=1)

HSP 1 Score: 227.6 bits (579), Expect = 8.6e-56
Identity = 115/187 (61.50%), Positives = 146/187 (78.07%), Query Frame = 0

Query: 13  FFEGLPHYIAQRFYQVAAANSTTNQINWSELTIGDITATVQGICVNLCLENKHITKVIKD 72
           F EGLPHYI+Q+FYQ   ANS   QI+W+ LT GDI++TVQ ICVNLC ENKH TKVIKD
Sbjct: 798 FVEGLPHYISQKFYQTMTANSVNQQIDWANLTYGDISSTVQMICVNLCTENKHTTKVIKD 857

Query: 73  PDYRKELGTFCKQYGLDYRPKDERKKRKKS-SNKQLFSKSKSKDPEFP-RHNRSKGKRNY 132
            DYRKELGTFCKQYGL   PK+E+KK+KK  S+K+ F KSK+KD E P R  R   K   
Sbjct: 858 SDYRKELGTFCKQYGLSQGPKEEKKKKKKRYSSKKFFRKSKTKDQESPQRRKRHYNKGKS 917

Query: 133 SKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSHSES 192
            K +S KT   C+KCN+KGHY+++CPLKDKIN++TIDE+T+QSLLYAI S+++ +S +ES
Sbjct: 918 KKGYSSKTHTICFKCNQKGHYANRCPLKDKINAMTIDEETKQSLLYAIRSDDDTTSQTES 977

Query: 193 STDDDVL 198
           S+++D +
Sbjct: 978 SSEEDYI 984

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity | Description
XP_022151716.1 | 5.0e-74 | 78.19 | uncharacterized protein LOC111019629 [Momordica charantia]
KAA0058433.1 | 4.2e-57 | 63.10 | Enzymatic polyprotein [Cucumis melo var. makuwa] >TYJ96695.1 Enzymatic polyprote...
KAA0057417.1 | 1.6e-56 | 60.53 | Enzymatic polyprotein [Cucumis melo var. makuwa] >TYK30116.1 Enzymatic polyprote...
KAA0056776.1 | 6.1e-56 | 61.90 | Enzymatic polyprotein [Cucumis melo var. makuwa]
TYJ97599.1 | 1.8e-55 | 61.50 | Enzymatic polyprotein [Cucumis melo var. makuwa]

ExPASy TrEMBL
Match Name | E-value | Identity | Description
A0A6J1DFI7 | 2.4e-74 | 78.19 | uncharacterized protein LOC111019629 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A5A7URD4 | 2.0e-57 | 63.10 | Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold53...
A0A5A7URX9 | 7.8e-57 | 60.53 | Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold21...
A0A5A7UR29 | 3.0e-56 | 61.90 | Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold48...
A0A5D3BEY3 | 8.6e-56 | 61.50 | Enzymatic polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold69...
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001878 | Zinc finger, CCHC-type | SMART | SM00343 | c2hcfinal6 | coord: 141..157; e-value: 0.0067; score: 24.6
IPR001878 | Zinc finger, CCHC-type | PFAM | PF00098 | zf-CCHC | coord: 141..156; e-value: 4.6E-5; score: 23.3
IPR001878 | Zinc finger, CCHC-type | PROSITE | PS50158 | ZF_CCHC | coord: 142..156; score: 10.01534
None | No IPR available | GENE3D | 4.10.60.10 | | coord: 128..215; e-value: 7.0E-6; score: 27.8
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 93..135
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 108..124
None | No IPR available | PANTHER | PTHR33054 | FAMILY NOT NAMED | coord: 13..194
IPR036875 | Zinc finger, CCHC-type superfamily | SUPERFAMILY | 57756 | Retrovirus zinc finger-like domains | coord: 124..160
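
The CCHC-type zinc finger reported above is commonly described by the spacing C-X2-C-X4-H-X4-C. As a rough illustration, the sketch below searches for that spacing in the query fragment covering residues 133..192 (copied from the BLAST alignments). The regular expression is a simplification chosen for this example, not the PROSITE, Pfam, or SMART model, so its boundaries need not coincide with the coordinates in the table.

```python
import re

# Query residues 133..192, as shown in the BLAST alignments.
FRAGMENT = "RNYSKNHSHKTSVTCYKCNRKGHYSSKCPLKDKINSLTIDEDTRQSLLYAIISEEEYSSH"
FRAGMENT_START = 133  # 1-based position of the fragment's first residue

# Simplified CCHC spacing: C-X2-C-X4-H-X4-C (an assumption for illustration).
CCHC = re.compile(r"C.{2}C.{4}H.{4}C")

m = CCHC.search(FRAGMENT)
if m is None:
    raise SystemExit("no CCHC-like spacing found")

# Convert the 0-based match offsets to 1-based, inclusive protein coordinates.
start = FRAGMENT_START + m.start()
end = FRAGMENT_START + m.end() - 1
print(m.group(), start, end)
```

The match, CYKCNRKGHYSSKC at residues 147..160, lies inside the SUPERFAMILY hit (124..160) and overlaps the SMART, Pfam, and PROSITE hits.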

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Moc03g21020.1 | Moc03g21020.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0006508 | proteolysis
molecular_function | GO:0004190 | aspartic-type endopeptidase activity
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0008270 | zinc ion binding