Moc04g05850 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc04g05850
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Keratin, type II cytoskeletal 1-like
Location: chr4: 3987757 .. 3990492 (+)
RNA-Seq Expression: Moc04g05850
Synteny: Moc04g05850
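
The Location field can be turned into a gene span with a few lines of code. The following is a minimal sketch, assuming the `chrom: start .. end (strand)` layout shown above and 1-based, inclusive coordinates (the GFF3 convention, which this record does not state explicitly). The names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval with 1-based, inclusive coordinates (assumed)."""
    chrom: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr4: 3987757 .. 3990492 (+)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return Location(chrom, int(start), int(end), strand)


loc = parse_location("chr4: 3987757 .. 3990492 (+)")
print(loc.chrom, loc.strand, loc.length)  # chr4 + 2736
```

Under these assumptions the gene spans 2736 bp on the forward strand of chr4.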
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAAACTTCGACAAACATGGAAGCTGATTCCACCGACACCGATGCAATGTCCGTCAGTCAATTAATTAATCCAGGAAACAAGATTTCGATCGTGAAGCTTACGGATGATTATTTCCTTTCGTGGAAATTTCAAGTACTCACTACTTTGGAAGGTCACGGATTAGAGTCGTACATTGAAGATGAAATTGATCCTCCTCTGCAATTTATTCAGGTCATTAATGGAACCTCTACTAGTTCGAAATTGAACCCGATTTTTACTAAGTGGAAACGACAGGATAAATTGATTTCATCTTGGTTGCTCGGCTCCATGTCTGAGGATTTTCTTGAACAAATGCTACATTGCAAGTCTGCTAGAGAAATTTGGACCTGTTTATTACAGATTTTTATTTCTCGCAATCTTGCACAAGTAATGAAAATTAAATCGAAGTTGCATAATCTCCAAAAAAGGTAGTTCTTCTCTGAAAGAATATTTCTCGCAAGTTAAGAAGTGTATTGATGCTTTAGCGGCGTTTCAATCTATGATACCAGTGCAAAATTCGGTCCTCAAACCGTTCAAGAGGTTATGGCTCTACTCTTGACACTTGACACAAGAAAATAGGAATGACAGCAAAAAGGTTGTGGTGAATTCGGATGGCACTCCTCCATCGGCTAATCTCCTTACTCATGCTCCTACTATGGGAAAGGAGAATGAGAATCCTATATATTCTAATGTTCCTCAGTATACTTCTGGTTCTAATAATAGGACTCGCGGTCGACATAATAGAGGCGGAAAACAGTGGAATAATTGAGGTCGTGTTCAATGTCAGTTATGTGGTCGGTTTAGACATACATTTTTCCAGTGGCTCCTCTCAAATGTCTGCTTTACTTACTGCCAATCAGGATATGAATTGGTATCCGGATTCAGGTGCTACTAATCATTTGAGGAACAATCTTACCAATCTGTCTATTGGTACAGAATACACTGGAGGAAACCAGGTGCAAGTCGAGAATGGTGCAGGTCTGAAAATTTTTCACTTTGGCCATACATCTTTTCAAACTTCAAATAATCATATCCTTCATCTCAATAACTTACTTCATGTCCCTCATATTACCAAAAACTTGATAAGTGTTAATCAGTTTGTTAGAGATAACTTAGTCTTTTTTGAATTTCTCCTATTTTTTGCTATGTGAAGGACCACCGTACTACCATTCTTCTCCAAGGGACTCTGCATGATGGACTGTACCGGTTTCATCTTTCTCCTCTTTCTTCTGCTCCTATGTCTACCGCTAGCCCTTCTTCTCAAGTCTTGTCTCCAACACCAGCTGCTCTGTTTTCGTCTTCTACCTCCTTGTCCCCTGCTGTGTCTCCTTCTCCTATTGATTTGTGGCATAGGCAACTTGGGTACCCTGCCTTTCCTATTGTTAGAAAAATAATAAATGTGTTCAAACGATTTAAATCTTCTTCTATGGCATTTCATTTTTGTAATGTTTGTGCTATGTGTAAATCTCATGCTCTTCCGTTCCGTTTACTCCTACTGCGGTTTATACTGCTCCATTGCAATTAATTGTCTTTGATTTGTGGGGTCTTGCATTTACACCCTCTAATGGCTTTAAATATTACATTAGTTTTGTTGATGCTTTTTCACGTTATACTTAGATTTATTTTCTTCGATCCAAGTCGGATGCCTTCTCTGTTTTTCTTCAATTTAAAACTATTGGTTGAGAAACATCTGAGATGCTCTATTATTTGTTCTCAATCTGACGGGGGTGGTGAATTTAAGCTCTTTACTTATTTTCTTCAAAAACATGGCATTATTCATCACCTTACATGTCCTCACACGTCTAAACAAAATGGGATTCTTGAGCGCAAGAATCGTCACATTGTTGATACTTGGTTAACTTTGTTATCTCAGTCTTCTATGGATGATGCCTTTTCTACATCTGTCTATCTCATTAACCAGTTACCTTCCACAATCCTTGGTGATGTGATTTCTTTGGAGAAGCTATTATGTCACAAAC
CTAACCATGCTTGGTTAAAGGTTTTTAGATGTCAACGTTTTCCTTGTTTGCGTTCTTTTAATGCTCACAAGCTCGATTTTCGTTCTCAACCTTGCACTTTTAATTTATTGGTTATAGTAACACATATAAAGGCTATAAGTTTGTCGTCTACCGGTAAGGTCTTTGTGTTTCGAAATGTTATTTTTTATGAGCATTCTTTCCCTTTTGCTAACCTTTCTATTCAGTCTATTCAGACTTATTTTTCTCCTCCTTTTTCGAACATCTCTTCGACTGCCTCTACAGATTTCCCTATTCCTCTGTTGTCCAACTCACCATCACCTACTTTAGCTGTTTCTTTCCCCCAATCGCTGCACCTATCCCCTCTGTATCTGGTTCTTCTCCAGCTATTCCTGTATCAGCTGACAATCGTCCGACTTTTGCCGTTCGTACTGTTGCTTGTTCTATACCTGCTGCTGTCAGTTCTATTTCGGCTCCTCGTGTACCGTCTATGCCTCTTGATGGGTCACATCTGGTGCCTAGTGATTCTCCATCTTTGCCTGTTGGTTCTTTGACTGTTGGTACCGAGGCCGTTGTCCCCACTGGTTCTTTAAATGTGTCCTTATCTGATGGTTCTCTCACTGTGCCTGTCCATGAAGTTGGCACTTCGTCTCAGCGGTTTGTTCTGATCTTCCTCCTTCGGCTCCCACTGTTTCGGTTCCCTCTTCTTCTTATGTACAAAATTCTCATTCGATGGTGA

mRNA sequence

ATGGAAACTTCGACAAACATGGAAGCTGATTCCACCGACACCGATGCAATGTCCGTCAGTCAATTAATTAATCCAGGAAACAAGATTTCGATCGTGAAGCTTACGGATGATTATTTCCTTTCGTGGAAATTTCAAGTACTCACTACTTTGGAAGGTCACGGATTAGAGTCGTACATTGAAGATGAAATTGATCCTCCTCTGCAATTTATTCAGGTCATTAATGGAACCTCTACTAGTTCGAAATTGAACCCGATTTTTACTAAGTGGAAACGACAGGATAAATTGATTTCATCTTGGTTGCTCGGCTCCATGTCTGAGGATTTTCTTGAACAAATGCTACATTGCAAGTCTGCTAGAGAAATTTGGACCTGTTTATTACAGATTTTTATTTCTCGCAATCTTGCACAAGTAATGAAAATTAAATCGAAGTTGCATAATCTCCAAAAAAGGAATGACAGCAAAAAGGTTGTGGTGAATTCGGATGGCACTCCTCCATCGGCTAATCTCCTTACTCATGCTCCTACTATGGGAAAGGAGAATGAGAATCCTATATATTCTAATGTTCCTCAGTATACTTCTGGTTCTAATAATAGGACTCGCGGTGCTACTAATCATTTGAGGAACAATCTTACCAATCTGTCTATTGGTACAGAATACACTGGAGGAAACCAGGACCACCGTACTACCATTCTTCTCCAAGGGACTCTGCATGATGGACTGTACCGGTTTCATCTTTCTCCTCTTTCTTCTGCTCCTATGTCTACCGCTAGCCCTTCTTCTCAAGTCTTGTCTCCAACACCAGCTGCTCTGTTTTCGTCTTCTACCTCCTTGTCCCCTGCTGTGTCTCCTTCTCCTATTGATTTGTGGCATAGGCAACTTGGATTTCCCTATTCCTCTGTTGTCCAACTCACCATCACCTACTTTAGCTGTTTCTTTCCCCCAATCGCTGCACCTATCCCCTCTGTATCTGGTTCTTCTCCAGCTATTCCTGTATCAGCTGACAATCGTCCGACTTTTGCCGTTCGTACTGTTGCTTGTTCTATACCTGCTGCTGTCAGTTCTATTTCGGCTCCTCGTGTACCGTCTATGCCTCTTGATGGGTCACATCTGGTGCCTAGTGATTCTCCATCTTTGCCTGTTGGTTCTTTGACTGTTGGTACCGAGGCCGTTGTCCCCACTGGTTCTTTAAATGTGTCCTTATCTGATGGTTCTCTCACTGTGCCTGTCCATGAAGTTGGCACTTCGTCTCAGCGGTTTGTTCTGATCTTCCTCCTTCGGCTCCCACTGTTTCGGTTCCCTCTTCTTCTTATGTACAAAATTCTCATTCGATGGTGA

Coding sequence (CDS)

ATGGAAACTTCGACAAACATGGAAGCTGATTCCACCGACACCGATGCAATGTCCGTCAGTCAATTAATTAATCCAGGAAACAAGATTTCGATCGTGAAGCTTACGGATGATTATTTCCTTTCGTGGAAATTTCAAGTACTCACTACTTTGGAAGGTCACGGATTAGAGTCGTACATTGAAGATGAAATTGATCCTCCTCTGCAATTTATTCAGGTCATTAATGGAACCTCTACTAGTTCGAAATTGAACCCGATTTTTACTAAGTGGAAACGACAGGATAAATTGATTTCATCTTGGTTGCTCGGCTCCATGTCTGAGGATTTTCTTGAACAAATGCTACATTGCAAGTCTGCTAGAGAAATTTGGACCTGTTTATTACAGATTTTTATTTCTCGCAATCTTGCACAAGTAATGAAAATTAAATCGAAGTTGCATAATCTCCAAAAAAGGAATGACAGCAAAAAGGTTGTGGTGAATTCGGATGGCACTCCTCCATCGGCTAATCTCCTTACTCATGCTCCTACTATGGGAAAGGAGAATGAGAATCCTATATATTCTAATGTTCCTCAGTATACTTCTGGTTCTAATAATAGGACTCGCGGTGCTACTAATCATTTGAGGAACAATCTTACCAATCTGTCTATTGGTACAGAATACACTGGAGGAAACCAGGACCACCGTACTACCATTCTTCTCCAAGGGACTCTGCATGATGGACTGTACCGGTTTCATCTTTCTCCTCTTTCTTCTGCTCCTATGTCTACCGCTAGCCCTTCTTCTCAAGTCTTGTCTCCAACACCAGCTGCTCTGTTTTCGTCTTCTACCTCCTTGTCCCCTGCTGTGTCTCCTTCTCCTATTGATTTGTGGCATAGGCAACTTGGATTTCCCTATTCCTCTGTTGTCCAACTCACCATCACCTACTTTAGCTGTTTCTTTCCCCCAATCGCTGCACCTATCCCCTCTGTATCTGGTTCTTCTCCAGCTATTCCTGTATCAGCTGACAATCGTCCGACTTTTGCCGTTCGTACTGTTGCTTGTTCTATACCTGCTGCTGTCAGTTCTATTTCGGCTCCTCGTGTACCGTCTATGCCTCTTGATGGGTCACATCTGGTGCCTAGTGATTCTCCATCTTTGCCTGTTGGTTCTTTGACTGTTGGTACCGAGGCCGTTGTCCCCACTGGTTCTTTAAATGTGTCCTTATCTGATGGTTCTCTCACTGTGCCTGTCCATGAAGTTGGCACTTCGTCTCAGCGGTTTGTTCTGATCTTCCTCCTTCGGCTCCCACTGTTTCGGTTCCCTCTTCTTCTTATGTACAAAATTCTCATTCGATGGTGA

Protein sequence

METSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDEIDPPLQFIQVINGTSTSSKLNPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAREIWTCLLQIFISRNLAQVMKIKSKLHNLQKRNDSKKVVVNSDGTPPSANLLTHAPTMGKENENPIYSNVPQYTSGSNNRTRGATNHLRNNLTNLSIGTEYTGGNQDHRTTILLQGTLHDGLYRFHLSPLSSAPMSTASPSSQVLSPTPAALFSSSTSLSPAVSPSPIDLWHRQLGFPYSSVVQLTITYFSCFFPPIAAPIPSVSGSSPAIPVSADNRPTFAVRTVACSIPAAVSSISAPRVPSMPLDGSHLVPSDSPSLPVGSLTVGTEAVVPTGSLNVSLSDGSLTVPVHEVGTSSQRFVLIFLLRLPLFRFPLLLMYKILIRW
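
The protein sequence is the conceptual translation of the CDS. The sketch below shows one way to reproduce that relationship with the standard genetic code. It is an illustration only: the `translate` helper is hypothetical, the annotation pipeline that produced this record is not documented here, and only the first 30 bases of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the CDS listed above.
cds_start = "ATGGAAACTTCGACAAACATGGAAGCTGAT"
print(translate(cds_start))  # METSTNMEAD
```

The output matches the first ten residues of the protein sequence above. Passing the full CDS string should yield the full protein, since the CDS ends in a TGA stop codon.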
Homology
BLAST of Moc04g05850 vs. NCBI nr
Match: XP_022154487.1 (uncharacterized protein LOC111021757 [Momordica charantia])

HSP 1 Score: 159.8 bits (403), Expect = 5.3e-35
Identity = 81/141 (57.45%), Positives = 102/141 (72.34%), Query Frame = 0

Query: 17  MSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDEIDPPLQFIQVINGT 76
           +  S+ INPG+K+SIV+L DD  L WKFQ+ T L+G+GLESYI+   D P QF+Q     
Sbjct: 16  IQASKTINPGSKVSIVRLNDDNXLLWKFQIRTALQGNGLESYIDSNEDTPAQFVQTTEDE 75

Query: 77  STSSKL--NPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAREIWTCLLQIFISRNL 136
           S+SS L  NP + +W +QDKLIS+WLLGSM+ED L QML CKSAREIWT L  +F SR L
Sbjct: 76  SSSSSLQQNPAYFEWIKQDKLISAWLLGSMNEDILSQMLDCKSAREIWTVLECMFASRTL 135

Query: 137 AQVMKIKSKLHNLQKRNDSKK 156
           A+VM++K KL N +K N S K
Sbjct: 136 ARVMQLKLKLENFKKGNLSLK 156
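
The percentages on each HSP statistics line are the match counts divided by the alignment length, which includes gap columns. As a hedged sketch, the snippet below parses one statistics line in the layout used on this page and recomputes both percentages; the regular expression and the `percent` helper are assumptions made for illustration, not part of BLAST itself.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" layout used here.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Positives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


line = "Identity = 81/141 (57.45%), Positives = 102/141 (72.34%), Query Frame = 0"
m = HSP_STATS.search(line)
if m is None:
    raise ValueError("line does not look like an HSP statistics line")

identical, length, _, positives, _, _ = m.groups()
identity_pct = percent(int(identical), int(length))
positive_pct = percent(int(positives), int(length))
print(identity_pct, positive_pct)  # 57.45 72.34
```

The recomputed values agree with the reported ones: 81 identical and 102 similar positions over 141 alignment columns.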

BLAST of Moc04g05850 vs. NCBI nr
Match: KAA0067213.1 (keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 156.4 bits (394), Expect = 5.8e-34
Identity = 79/152 (51.97%), Positives = 111/152 (73.03%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+    NKIS+VKL+DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSSNKISLVKLSDDNFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSSKL---NPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAR 122
            +PP +++ +  G+S++S     NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+
Sbjct: 64  SEPPSKYL-ISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAK 123

Query: 123 EIWTCLLQIFISRNLAQVMKIKSKLHNLQKRN 152
           EIW  L  IF SR LAQ MK K+KLHN++K +
Sbjct: 124 EIWGTLQGIFSSRYLAQAMKFKNKLHNIKKES 153

BLAST of Moc04g05850 vs. NCBI nr
Match: TYK18917.1 (keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa])

HSP 1 Score: 156.4 bits (394), Expect = 5.8e-34
Identity = 79/152 (51.97%), Positives = 111/152 (73.03%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+    NKIS+VKL+DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSSNKISLVKLSDDNFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSSKL---NPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAR 122
            +PP +++ +  G+S++S     NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+
Sbjct: 64  SEPPSKYL-ISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAK 123

Query: 123 EIWTCLLQIFISRNLAQVMKIKSKLHNLQKRN 152
           EIW  L  IF SR LAQ MK K+KLHN++K +
Sbjct: 124 EIWGTLQGIFSSRYLAQAMKFKNKLHNIKKES 153

BLAST of Moc04g05850 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 156.0 bits (393), Expect = 7.6e-34
Identity = 78/149 (52.35%), Positives = 107/149 (71.81%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+   GNKIS+VKL DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSS--KLNPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSARE 122
            +PP +++     +S S+    NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+E
Sbjct: 64  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 123

Query: 123 IWTCLLQIFISRNLAQVMKIKSKLHNLQK 150
           IW  L  IF SR LAQ M+ K+KLHN++K
Sbjct: 124 IWETLQGIFSSRYLAQAMQFKNKLHNIKK 151

BLAST of Moc04g05850 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 156.0 bits (393), Expect = 7.6e-34
Identity = 78/149 (52.35%), Positives = 107/149 (71.81%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+   GNKIS+VKL DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSS--KLNPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSARE 122
            +PP +++     +S S+    NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+E
Sbjct: 64  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 123

Query: 123 IWTCLLQIFISRNLAQVMKIKSKLHNLQK 150
           IW  L  IF SR LAQ M+ K+KLHN++K
Sbjct: 124 IWETLQGIFSSRYLAQAMQFKNKLHNIKK 151

BLAST of Moc04g05850 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 69.3 bits (168), Expect = 1.2e-10
Identity = 37/123 (30.08%), Positives = 65/123 (52.85%), Query Frame = 0

Query: 27  NKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDEIDPPLQFIQVINGTSTSSKLNPIF 86
           N  ++ KLT   +L W  QV    +G+ L  +++     P   I    GT  + ++NP +
Sbjct: 19  NMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATI----GTDAAPRVNPDY 78

Query: 87  TKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAREIWTCLLQIFISRNLAQVMKIKSKLHN 146
           T+WKRQDKLI S +LG++S      +    +A +IW  L +I+ + +   V +++++L  
Sbjct: 79  TRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQ 137

Query: 147 LQK 150
             K
Sbjct: 139 WTK 137

BLAST of Moc04g05850 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 65.1 bits (157), Expect = 2.3e-09
Identity = 34/115 (29.57%), Positives = 60/115 (52.17%), Query Frame = 0

Query: 27  NKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDEIDPPLQFIQVINGTSTSSKLNPIF 86
           N  ++ KLT   +L W  QV    +G+ L  +++     P   I    GT    ++NP +
Sbjct: 19  NMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATI----GTDAVPRVNPDY 78

Query: 87  TKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAREIWTCLLQIFISRNLAQVMKIK 142
           T+W+RQDKLI S +LG++S      +    +A +IW  L +I+ + +   V +++
Sbjct: 79  TRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLR 129

BLAST of Moc04g05850 vs. ExPASy TrEMBL
Match: A0A6J1DLT9 (uncharacterized protein LOC111021757 OS=Momordica charantia OX=3673 GN=LOC111021757 PE=4 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 2.5e-35
Identity = 81/141 (57.45%), Positives = 102/141 (72.34%), Query Frame = 0

Query: 17  MSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDEIDPPLQFIQVINGT 76
           +  S+ INPG+K+SIV+L DD  L WKFQ+ T L+G+GLESYI+   D P QF+Q     
Sbjct: 16  IQASKTINPGSKVSIVRLNDDNXLLWKFQIRTALQGNGLESYIDSNEDTPAQFVQTTEDE 75

Query: 77  STSSKL--NPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAREIWTCLLQIFISRNL 136
           S+SS L  NP + +W +QDKLIS+WLLGSM+ED L QML CKSAREIWT L  +F SR L
Sbjct: 76  SSSSSLQQNPAYFEWIKQDKLISAWLLGSMNEDILSQMLDCKSAREIWTVLECMFASRTL 135

Query: 137 AQVMKIKSKLHNLQKRNDSKK 156
           A+VM++K KL N +K N S K
Sbjct: 136 ARVMQLKLKLENFKKGNLSLK 156

BLAST of Moc04g05850 vs. ExPASy TrEMBL
Match: A0A5D3D5T2 (Keratin, type II cytoskeletal 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold204G001330 PE=4 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 2.8e-34
Identity = 79/152 (51.97%), Positives = 111/152 (73.03%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+    NKIS+VKL+DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSSNKISLVKLSDDNFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSSKL---NPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAR 122
            +PP +++ +  G+S++S     NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+
Sbjct: 64  SEPPSKYL-ISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAK 123

Query: 123 EIWTCLLQIFISRNLAQVMKIKSKLHNLQKRN 152
           EIW  L  IF SR LAQ MK K+KLHN++K +
Sbjct: 124 EIWGTLQGIFSSRYLAQAMKFKNKLHNIKKES 153

BLAST of Moc04g05850 vs. ExPASy TrEMBL
Match: A0A5A7VGJ8 (Keratin, type II cytoskeletal 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold418G00160 PE=4 SV=1)

HSP 1 Score: 156.4 bits (394), Expect = 2.8e-34
Identity = 79/152 (51.97%), Positives = 111/152 (73.03%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+    NKIS+VKL+DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSSNKISLVKLSDDNFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSSKL---NPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSAR 122
            +PP +++ +  G+S++S     NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+
Sbjct: 64  SEPPSKYL-ISTGSSSASATRTPNPTYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAK 123

Query: 123 EIWTCLLQIFISRNLAQVMKIKSKLHNLQKRN 152
           EIW  L  IF SR LAQ MK K+KLHN++K +
Sbjct: 124 EIWGTLQGIFSSRYLAQAMKFKNKLHNIKKES 153

BLAST of Moc04g05850 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 3.7e-34
Identity = 78/149 (52.35%), Positives = 107/149 (71.81%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+   GNKIS+VKL DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSS--KLNPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSARE 122
            +PP +++     +S S+    NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+E
Sbjct: 64  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 123

Query: 123 IWTCLLQIFISRNLAQVMKIKSKLHNLQK 150
           IW  L  IF SR LAQ M+ K+KLHN++K
Sbjct: 124 IWETLQGIFSSRYLAQAMQFKNKLHNIKK 151

BLAST of Moc04g05850 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 156.0 bits (393), Expect = 3.7e-34
Identity = 78/149 (52.35%), Positives = 107/149 (71.81%), Query Frame = 0

Query: 3   TSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFLSWKFQVLTTLEGHGLESYIEDE 62
           TS+ +  ++T+  +  ++Q+   GNKIS+VKL DD FL WKFQ+LT LE + LE+++E E
Sbjct: 4   TSSLLGVENTEASS-PINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLESE 63

Query: 63  IDPPLQFIQVINGTSTSS--KLNPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSARE 122
            +PP +++     +S S+    NP +  WKRQD+LISSWLLGSMSE+ L QMLHCKSA+E
Sbjct: 64  SEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSAKE 123

Query: 123 IWTCLLQIFISRNLAQVMKIKSKLHNLQK 150
           IW  L  IF SR LAQ M+ K+KLHN++K
Sbjct: 124 IWETLQGIFSSRYLAQAMQFKNKLHNIKK 151

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022154487.1  5.3e-35  57.45         uncharacterized protein LOC111021757 [Momordica charantia]
KAA0067213.1    5.8e-34  51.97         keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]
TYK18917.1      5.8e-34  51.97         keratin, type II cytoskeletal 1-like [Cucumis melo var. makuwa]
KAA0048297.1    7.6e-34  52.35         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
TYK10642.1      7.6e-34  52.35         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...

vs. ExPASy Swiss-Prot
Match Name      E-value  Identity (%)  Description
Q94HW2          1.2e-10  30.08         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94          2.3e-09  29.57         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1DLT9      2.5e-35  57.45         uncharacterized protein LOC111021757 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A5D3D5T2      2.8e-34  51.97         Keratin, type II cytoskeletal 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A5A7VGJ8      2.8e-34  51.97         Keratin, type II cytoskeletal 1-like OS=Cucumis melo var. makuwa OX=1194695 GN=E...
A0A5A7U233      3.7e-34  52.35         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5D3CH97      3.7e-34  52.35         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term     Source Description   Alignment
None      No IPR available  PFAM     PF14223         Retrotran_gag_2      coord: 89..152; e-value: 1.8E-6; score: 27.8
None      No IPR available  PANTHER  PTHR34222       FAMILY NOT NAMED     coord: 39..153
None      No IPR available  PANTHER  PTHR34222:SF48  SUBFAMILY NOT NAMED  coord: 39..153
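
The `coord` values locate each hit on the protein sequence. As a rough sketch, the Pfam Retrotran_gag_2 region (89..152) can be extracted as shown below, assuming the coordinates are 1-based and inclusive (the usual InterProScan convention, not stated on this page). `slice_1based` is a hypothetical helper, and only the first 160 residues of the protein are included to keep the example short.

```python
# First 160 residues of the Moc04g05850 protein sequence (from this record).
PROTEIN_PREFIX = (
    "METSTNMEADSTDTDAMSVSQLINPGNKISIVKLTDDYFL"
    "SWKFQVLTTLEGHGLESYIEDEIDPPLQFIQVINGTSTSS"
    "KLNPIFTKWKRQDKLISSWLLGSMSEDFLEQMLHCKSARE"
    "IWTCLLQIFISRNLAQVMKIKSKLHNLQKRNDSKKVVVNS"
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}..{end} fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return seq[start - 1:end]


domain = slice_1based(PROTEIN_PREFIX, 89, 152)
print(len(domain))  # 64
print(domain)
```

Under that assumption the domain hit covers 64 residues, beginning at the tryptophan of the `WKRQDKLISSWLLGSM` stretch that is also well conserved in the BLAST alignments above.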

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc04g05850.1  Moc04g05850.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0044260      cellular macromolecule metabolic process
biological_process  GO:0034641      cellular nitrogen compound metabolic process
biological_process  GO:0044238      primary metabolic process
molecular_function  GO:1901363      heterocyclic compound binding
molecular_function  GO:0097159      organic cyclic compound binding