Moc01g16790 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc01g16790
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr1: 11212987 .. 11220354 (-)
RNA-Seq ExpressionMoc01g16790
SyntenyMoc01g16790
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATGGATTAGGAGGTTGGGTTCGATTTCCGGGCCGGGCGAGAGGTGAAGGCCATAAGAGAAGATTCCCTGGTAAGGAAGAGAAGAAAAAGGTGTTAGTCATCGAGATCGACCTCTTTCCCGAGGATTTTGATCACTATATATTAACTTTTATTTATTTTAATTAATGTACTCTCAAAACATCTGTTTTGGTCTCTATATTTAAAAAAAAGTGATCATTTTTTTTCCATATTTAAACATGTTAACACAAATTTATACACAATGGAAATCCACAAATACTAATTTATGGTTACATTAGAAAAAAATGAGTGAGCTGATCGAAAAAAAAATGAGATAAAAATGAAAATGAAAATGAAGTGACCAAAATGATCACTTTTAAAAAGAACGGGACCCAAATGAATATTTCGAAAATCTATGGACAATAAAATTTGAAAGTATATTGTTGAGCAAAATTGACGTTTTGAAAATAGAGAGACTAAAATGAACCAAAGCCAAAATTACATGATATGAACCAAAATGTCCTAGTCATGCCACGAAGATTTAAACGAAGATTTTGGAGATTTATCCTATGTCCAAGTTGACAACAAAATGGAGATACATTAAATGCACTCAGACTTCATTTATAAAGAAATCAAGATAAACTCGAGTTACATTTAAGGAAGTCAAGTTGCATTACTGTTTGTTTGATACCTACGATTTTGAAAGATTTCAAAATCATGACGGATTCCAAATGAATTTCGTTGTTTGGTACCACCTAATTTTAAATTATTTGAAATCTTACTGGATTTCCTGTTTTTACGCAAAAAAAAGTTCGGATCAAAACAGTGATTTTGTAGGATTTCATTGAATTTCAAAATAAATTACCTATTTAACTTTGTCATATTTTAATTTCTAAGTAATAAATTTAGATTAAAAGCATGAGTAAAATTTTTATTTTCTTTTCATTTACAACGAACATTATGAACAAAGAAATTTAATTTCACTCATTTTAAATCATTATTATTATTTCAAACAAGATAAAGAATTTAAAATCATTTCTCGGATTTCAAATCACAAAATTTAAAATCATTTCTTGAACTTTCTTGACACCAAACAAAAGGTTAAAGTTGAACTCAAGTTCTATTAAATGAACTCAAGTTGCAATTAAATAATTCAAATACATCTCAAGCTTCATTTAATGCAAGTCAAAGCAGGAATTAAACTTTTAAGCTACTTTTCAACTAGTACCTCATTCTCGGATTTAAACCTCATTTTATTGTATTTTCAATTTTAAACTTTTCATTTGTTAATTTTAGGTTGCTTTTAAGTTAGCGGATATTATGACATTAATGCACTAAACAGTTACAACCTTTCCATTGTCTTTAGGTTACTTCTAAGCCTTGTTTCTTAGGTTATATATAGTCTAGCTTACTCCATTAATTGTAAAAAGCAGATTTAAGTTATCCGAAAAATGAAATTTGAGCCATTGAGCCATGAGTTTCTCATTTGTTTTTCTTGTGTTTGAGAATTGCATGCTAGAATTATTCAAGTGACATTGATCAAGTTAACCTGTGGAGTGATTTGAATCTTGAGTTTAGAAATAAATTTTCTCATAAATTCGTTCTAACTTCTTCCTTGAGTTGAAATCATCTTGATTTGGAGTTTTAGTATCCGTGATCTAGAGTTTGCATAACCATAGGGATTTCTTAGATTCAATTGGCTAACGTTTTCTTTAAGTAAAATAGTTTTTTTTCTTTTTTACAATGATACAACAAAAAATGGAAGGAAATTACATTTGTGCCCCCTAATTTAAGAGGGAAAAATGAAGAAGCTAAGTTTCATAATAAAGCCCCTGAATCACGTGGGAATTTTGCCTGTGTTAGCCAAAATAATATAAAATACTTGTATAGTTGTGAACGTTGTGTTTTCATCTGTAAATTTAAAATAAAAAAAAATATATCATTTTAGCCATTACTCCCAAAATAGAAAAAATAAAAATTTTTGACATCAAGTAGACTTGTGATGTATAACAAATTTGGTATCAAATTGGGGAAGATGTACCTTTATTTATTTTTTTTATAAATACAATATGAATAATTTAACATAGATCAAAATGGAAACTAATCTAAAAAAATATATATTTATAGTATTAAGAACTAAAATGGAAAAAGAAAAAATTTAAAATAAGATTTAAACCTATAAATTAAGTTCCATTCTATACGATATTTTTCTCCATATAATAAAAGTCCCAAAAGTTCCAAAAAGAAATATTTTATATATATATATAATATTTCTTTTTGATGTATGTGGAAAGTTTCGTTCACGGGCATTAAGATGCCAATTACTTTCATTTTTGTCTTTGTTGCCATTTTTGATGAGCGAATAATTGATCCGAAAAAGAGTAATTGGATTCTCCCAATAGCTTAAAAGTGTATCATACCTAAATCTAACTAGAGTTCGCGGTAGTGGAAGAATGTCATCATTTTATGATTTTTTTTTAGTATATGCGGGATAGAGATTTGAACTCACGACATTTTGGTCGGAAATAAATATCTCAGTTGAGTTACATTTAGGGTGGAATATTATTTTAATGAATTAAAATGGCGAGAGAATTTTGCAATAATTGAATAAAAATGAGCTGTTAATTAAAACCCGACCACTTTTTGATTTTGTTTGTAATTACGAAGGAAATTATTCGCATTTTCCATTGGAATCAAGGAGAATTTTCCCCACGTTCATATATGCTATTTTTTTATAACAATGACAATACTCTTTCTTGATTTTGCTTTTCTTGTTTTGTTTTTGAGCAAATAAGAGAGCCAACTAGGGGATGAACAACAAGATTCACTTCTTGTTCTTAATTAAAAACTCAAAGAATAAGATTTCCCATTTCTCCCTCATGCATATGCATTTAATTACGGTGTATACCGAGCGAGATAGAGATACGTTCTGATTTGAACTTATCCTCAACTTGGAAAAATCGATCTCATTTTTTTTTAATAATGTGCACGTGCTCCCTATCTTATGTCAAGACCATATGAGAAAATTTTTTGAATCGTCCTTGTCGAAACAAGAAATTACTCTCTCTCTCTTTCATACACATGTGTTGGAGATGCAATCGGACAAGGACCCTCCTCTCTCATATGCACAAAACATAAGACAAATTTATGAATCGTCTTTGTTGAAACAATAAAATGCTTTCTCTCTCTCATACACATGTGTTAGAGATGCAACCGAGACAAAAGAACATCCACATTGATATAATATTGTCTATTTGAGCATAAACTCTCGATGGTTCTATCTTCACTCTTATACCAGGTGTTAGAGATACAGTCGGACAAGGGACCCTCCAACCTATATTTTCACATGGTATGGTACTATCCACTTTGGGCACCAAAGTCCTCATGACTTTGCTTTGGTTTCACTAAAAAAGACCTTAAATACCAATGAAAATGTTGTCTCTCACATATGTATATATCAATATCCATGAGGATCCTCTCAAACTCAATAGGTAATGAGAGCAGTACTGGACTTATGTTATAAATTCTCGTTAGCCCTTATCTTTCCAATATGAGATCTTTAACAGTATATATATCTTACGCTCTTCCCCTTCACTATTTAGTTGATGTGGAACTTTTATTCGCACTCCCGACAAAGAAGTCATCCCTATCTTAAAAAATGTTTTTGACTTCTTATGTGTGCTCTATCTCGCCTTAAGGAAGACAAAGATATTGTCTCTAAAATCACCCCTATCAACTTTGTTGATGCAAAAATCCACCTAAGGTACGGAGGATGACCTATGAGGTTGGTATTAGCAACTTGGTATAAATTCCTTCAGAGGTACCTGCGAAAACAAGTGGGCCACTCCGACGCCCAAGTTAGTTAGGAATTCGAGCTTAGACAGATCAGCTTCAAACAGATTTTCATACCTTGAATGGTGAAAAGGAGCTATATTTATAGCAGTCAAAGGAGTAGCTATCTATACCATCGGGTATTTCCCTCCTCGCACCACGTCCACCTCTGGGACCTTTGTCCATGCGTTTCACCGGATTTTTGCCTACCCTGAAATGGAAGCTATGGACGTCTTCGTTTTTTACCTAAGTGTCTACCCCGACGTGTTAGGTCAAGACGCTTGATTCCGTTTGAAGAGGTTGGCCCCTCACAGTCTGACTCCCAACAAACTTGAAAAATCACAATTTCTTTACTCAAAAAAAAAAAAGGACAACTAGAATCATAGTCCCCACAACCACAAATTAAATGCTCATGACCAGCTACATATTTTCTAAAACAAAATAATAAGAACTTGACCAATGGAATATATTAACTTTACCTTTTATTAAAATTTAAGAATAAATTTAATAAAATTAGATATTTAATATTAATATTAAATCAACTACGAAAAATATCCATAAAAATAGTCTTATAAGCTTCAAGAATCCTCATGACCAGCCACAAATTTTAAAAATTATAATTTAAAAAATAGCTTAACCATGTTAATTTCACCACCATCTGTTTACATAAAATAAATATGTATTTAAAAATAAACGTAGCTAGAAAGTTTGAAAATTTTCAAAATATATATAATCTAGATCCTCAAACTATAATATAAAACACATGCACAGTATAAAAGTCAGACTTCTACTATATATATATAAAGTAAAAAATAAAAAACCATGATATTTTTTTAAATACATTTAAAATTTATGCTATATAGAGATCTTTTTTCCTACCATTTTTCAAATTTGATAAATGTTGGTTCATCGATATTTCATTACAATAAAAAAGATTAAAATTTTATACATTCTCTCCTTTCTACCCTCGCTACAAAAATCTAAATTTATATGGCCGTCATAGCAAGATGCAACTCCTCCCCACGAAAAAGATTGTAGAGTTTGGGAAAAGCCTCACCTTTTATAGTTTGAATCGTAACACATGAAAAAATAAGATAATAAATTGTGTCTAATAGATCCCTAAATTTTAAAAAGTAGTATCTAGTAAGTCCTTTAACTTTTAATATTGTGTCTTATTATGTACACAAAATCGAAAGCTCAATAGACCTATTAACCACAAGCTTGAAAGTTAAAGGGACCTATTGGACAAAATTTTCAAATTATGTTTAATAGGTGGGGACCTATTAGATACAAAATGAAAAGGTGAAAGGGATCTATTAAACACGTTTTTAAAAATCCCTAAGGTCTAAATACAAAATTGAAAAGTTTCATAAATCTACTAGCTAGACATCTTTCTAAATTCAAGAACGTAATATACTCGATACAACTCTCTTAAAATTTAGGGATCCAATTAGGCACAATTTCAAAAGTTTAGGCATTAGACTTTCTTTTTTTTTTCTTTTTTGTTATGATCCATAATCGATAGTTCTTTAGCTTTTACCTCTCTAACGGGGAGCACTTGTGCTTATCCCAAGACTTGGACCTAAGATACATTAAGAATTTTTGGTATTATGCTCACCAAAAGGTTTGAACGTAAGACCACATTGATGGAATTAATTTAATTAATTCCATTAAATGGGTAACAATTTATGATAAATTAAATGTATTACTATTCATGGGAAAAATAAGATGTCTAATTAGGGCCCATAAACTTCCTATATGTGGTGTGACCCACTGGCTCGTTGATCCTAGGGGATGATGGGAAACCCTAGAATCAACTTATAAATACAGGGTTATGGTCCCCAATATACTATGCTTCTTTAGTCTCCTATTCCCCATCTAATAGAGAGTATTGCCCATCTTTGGTCTAAAGATGCATAGTTGAGGAAGACCCAAACCACACAGATATTGCTACAAGAATTTGGAATCTCTACTCATGGATCCAGGTAATTCAAAATAACTTATAGTTCTTTAGCCTACTCCCATGCATAGATCCTATATTAAAGTTTTTGTTTGCTAACAAGTGGTATCAGAGCTTAGTTTTACATGGTTGTTTGCTAAATATTTAATTGTTGTTTATGAATAAATTTGTTGGTTGTTTCCTTAGTTAAATTTATTATGAAACATCATGAAATCTGAATTATTAGAACTATTTGGATTGATTCAACAATTTTAAATTATTTAATTTGAAAATTGTTTGAATTCATTGTCCATTGATTTGATAAAAAAATAAAGTGGCATGAAGTGCTGCATTTGAAGAATCTGTTATTGTTAAAAAAAAAATAAAAAATTTGTGGCGTTACATGAAGAATCTATTATTGTAAAAAAAACAAACAAATAAAGTTTGCGACACTCCAGAGTGCCACATAAGGTTACTATTTACCCTATCACATAAAGACAATTTTGTTGCAACGGGCTTTTAAGGTTATTTATTTTGTTTTTTTTTTCCTTTTTTGATTAGTATTTTATATATTTATAATCTATTATTAATATTTTGGACATAAAAGGTATGGAGGACTGTTTATTTTTCCCCTTTTAAATGGTGCTAATTAAAGGTTGACTACTATTCAATAAATTTATATTTCATTTATTGGTAGCACAATTTATCTAATTTCTTTGAATGTTTTGATGTGTGAAATATGCTACCAAAGTAGTCTCTATTCTATAAATTGGAACATTTAAAGTTCTTAGATGTAGTGTATTTTTTGTTTATGCGGATTGTGATTGGCCCAAAGGAAGGTTACAATTTGGCCAAACAACAAACACACATGCAATAATAAATATGGATAACATAAGGTTGTGTTCTTGAAGTTGTTAGTCGACCCAAAGGAAGACTATCATCTAAAAGGAATGAGTTATTAATGTTTTATTATTGTACTGAGAGTATCACTAGCAAATTAACATTTTTGTCCAATGACTGAATGTAAATGTTGCGTTAGGTATCTCACTTGTCCTCATAAAGTTTATTTAGAATGTCATGAATATTTATATATATCTGTGTGACATTACTTACTATATTGAGCTTGTGTTCTTTATACAGTTAAGGTTGCTAATTCTGATAATATGTCCACTCAAGTCAACAACAATCCTAGACTGAATGGGGCTAATTTCAAAAACTGGAAAGAAGACATCCAGATAGTACTTGGATGTATGGATTTAGACCTCGCATTAAGGGTAGACCGCCCTACTTCAATTGAGGAAAATCCTAATAAGGTTGAAATTGAGAAGTGGGATAGGTTTAATCGCATGTTTCTAATGATCATGAAGCGCTCAATTCCAGAAACATTTAGAGGCTCTATTATTGAGGGAACGAATGCCAAAGGCTTTCTAAAGGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACGTTGGTAAAGGAAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTTAA

mRNA sequence

ATGAATGGATTAGGAGGTTGGGTTCGATTTCCGGGCCGGGCGAGAGGTGAAGGCCATAAGAGAAGATTCCCTGTTAAGGTTGCTAATTCTGATAATATGTCCACTCAAGTCAACAACAATCCTAGACTGAATGGGGCTAATTTCAAAAACTGGAAAGAAGACATCCAGATAGTACTTGGATGTATGGATTTAGACCTCGCATTAAGGGTAGACCGCCCTACTTCAATTGAGGAAAATCCTAATAAGGTTGAAATTGAGAAGTGGGATAGGTTTAATCGCATGTTTCTAATGATCATGAAGCGCTCAATTCCAGAAACATTTAGAGGCTCTATTATTGAGGGAACGAATGCCAAAGGCTTTCTAAAGGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACGTTGGTAAAGGAAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTTAA

Coding sequence (CDS)

ATGAATGGATTAGGAGGTTGGGTTCGATTTCCGGGCCGGGCGAGAGGTGAAGGCCATAAGAGAAGATTCCCTGTTAAGGTTGCTAATTCTGATAATATGTCCACTCAAGTCAACAACAATCCTAGACTGAATGGGGCTAATTTCAAAAACTGGAAAGAAGACATCCAGATAGTACTTGGATGTATGGATTTAGACCTCGCATTAAGGGTAGACCGCCCTACTTCAATTGAGGAAAATCCTAATAAGGTTGAAATTGAGAAGTGGGATAGGTTTAATCGCATGTTTCTAATGATCATGAAGCGCTCAATTCCAGAAACATTTAGAGGCTCTATTATTGAGGGAACGAATGCCAAAGGCTTTCTAAAGGAAATGGAGCAGTACTTTACCAAAAACGATAAGGCAGAGGCGAGTACCCTTATGGCAAAACTCACCTCTTCAAGATACGTTGGTAAAGGAAACATAAGGGAATACATAATGCAAATGTCAAATGTTGCAACAAAACTTAAGGCACTGAAGTTGGAAGTTTCTTAA

Protein sequence

MNGLGGWVRFPGRARGEGHKRRFPVKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEIEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVS
Homology
BLAST of Moc01g16790 vs. NCBI nr
Match: XP_022152232.1 (uncharacterized protein LOC111020001 [Momordica charantia])

HSP 1 Score: 301.6 bits (771), Expect = 4.4e-78
Identity = 151/151 (100.00%), Postives = 151/151 (100.00%), Query Frame = 0

Query: 26  KVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEI 85
           KVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEI
Sbjct: 5   KVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEI 64

Query: 86  EKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTS 145
           EKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTS
Sbjct: 65  EKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTS 124

Query: 146 SRYVGKGNIREYIMQMSNVATKLKALKLEVS 177
           SRYVGKGNIREYIMQMSNVATKLKALKLEVS
Sbjct: 125 SRYVGKGNIREYIMQMSNVATKLKALKLEVS 155

BLAST of Moc01g16790 vs. NCBI nr
Match: XP_022156979.1 (uncharacterized protein LOC111023808 [Momordica charantia])

HSP 1 Score: 284.3 bits (726), Expect = 7.3e-73
Identity = 143/151 (94.70%), Postives = 146/151 (96.69%), Query Frame = 0

Query: 25  VKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVE 84
           VKV NSDNMSTQVNN PRLNGANFK+WKEDIQIVLGCMDLDLALRVDRPTS EENPNKVE
Sbjct: 4   VKVXNSDNMSTQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVE 63

Query: 85  IEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT 144
           I+KWDR NRM LMIMKRSIPETFRGSI+EGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT
Sbjct: 64  IKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT 123

Query: 145 SSRYVGKGNIREYIMQMSNVATKLKALKLEV 176
           SSRYVGKGNIREYIMQMSNVATKLKALKLEV
Sbjct: 124 SSRYVGKGNIREYIMQMSNVATKLKALKLEV 154

BLAST of Moc01g16790 vs. NCBI nr
Match: XP_022155096.1 (uncharacterized protein LOC111022228 [Momordica charantia])

HSP 1 Score: 259.2 bits (661), Expect = 2.5e-65
Identity = 131/144 (90.97%), Postives = 136/144 (94.44%), Query Frame = 0

Query: 33  MSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEIEKWDRFN 92
           MSTQVNN PRLN ANFK+WKEDIQIVLGCMDLDLALRVDRPTS EENPNKVEIEKWDR N
Sbjct: 1   MSTQVNNIPRLNVANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIEKWDRSN 60

Query: 93  RMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTSSRYVGKG 152
           RM LMIMKRSIPETFRGSI+EGTNAK FLKEM+QYFTKNDKAEASTLM KLTSSRYVGKG
Sbjct: 61  RMCLMIMKRSIPETFRGSIVEGTNAKSFLKEMKQYFTKNDKAEASTLMTKLTSSRYVGKG 120

Query: 153 NIREYIMQMSNVATKLKALKLEVS 177
           NIREY MQMS+VATKLKALKL+VS
Sbjct: 121 NIREYKMQMSSVATKLKALKLKVS 144

BLAST of Moc01g16790 vs. NCBI nr
Match: KAF8413461.1 (hypothetical protein HHK36_001448 [Tetracentron sinense])

HSP 1 Score: 213.4 bits (542), Expect = 1.6e-51
Identity = 104/153 (67.97%), Postives = 129/153 (84.31%), Query Frame = 0

Query: 24  PVKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKV 83
           PV VA + N+S QV+N P L+G NFK WKE ++IVLGCMDLDLALR D+PT+  ENPN+V
Sbjct: 3   PVSVATATNVSAQVSNIPMLSGTNFKVWKEIVEIVLGCMDLDLALRSDQPTATPENPNEV 62

Query: 84  EIEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKL 143
           +IEKWDR NRM LMIMKRSIPE FRGSI E  +AK FL+E++QYF KN+K+EAS L+AKL
Sbjct: 63  KIEKWDRSNRMCLMIMKRSIPEAFRGSITERKSAKKFLEEIQQYFAKNEKSEASNLLAKL 122

Query: 144 TSSRYVGKGNIREYIMQMSNVATKLKALKLEVS 177
            + +Y GKGNIREYIM+MS++A+KLK+LKLE+S
Sbjct: 123 VAMKYKGKGNIREYIMEMSHLASKLKSLKLELS 155

BLAST of Moc01g16790 vs. NCBI nr
Match: KAF8394168.1 (hypothetical protein HHK36_020374 [Tetracentron sinense])

HSP 1 Score: 209.1 bits (531), Expect = 3.0e-50
Identity = 103/156 (66.03%), Postives = 129/156 (82.69%), Query Frame = 0

Query: 21  RRFPVKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENP 80
           RR  V VA + N+S QV+  P L+G NFK WKE ++IVLGCMDLDLALR D+PT+  ENP
Sbjct: 5   RRDMVSVAIATNVSAQVSKIPMLSGTNFKVWKETVEIVLGCMDLDLALRSDQPTATPENP 64

Query: 81  NKVEIEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLM 140
           N+V+IEKWDR NRM LMIMKRSIPE F+GSIIE  +AK FL+E++QYF  N+K+EAS L+
Sbjct: 65  NEVKIEKWDRSNRMCLMIMKRSIPEAFQGSIIESKSAKKFLEEIQQYFANNEKSEASNLL 124

Query: 141 AKLTSSRYVGKGNIREYIMQMSNVATKLKALKLEVS 177
           AKL + +Y GKGNIREYIM+MS++A+KLK+LKLE+S
Sbjct: 125 AKLVAMKYKGKGNIREYIMEMSHLASKLKSLKLELS 160

BLAST of Moc01g16790 vs. ExPASy TrEMBL
Match: A0A6J1DFM1 (uncharacterized protein LOC111020001 OS=Momordica charantia OX=3673 GN=LOC111020001 PE=4 SV=1)

HSP 1 Score: 301.6 bits (771), Expect = 2.1e-78
Identity = 151/151 (100.00%), Postives = 151/151 (100.00%), Query Frame = 0

Query: 26  KVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEI 85
           KVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEI
Sbjct: 5   KVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEI 64

Query: 86  EKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTS 145
           EKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTS
Sbjct: 65  EKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTS 124

Query: 146 SRYVGKGNIREYIMQMSNVATKLKALKLEVS 177
           SRYVGKGNIREYIMQMSNVATKLKALKLEVS
Sbjct: 125 SRYVGKGNIREYIMQMSNVATKLKALKLEVS 155

BLAST of Moc01g16790 vs. ExPASy TrEMBL
Match: A0A6J1DV67 (uncharacterized protein LOC111023808 OS=Momordica charantia OX=3673 GN=LOC111023808 PE=4 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 3.6e-73
Identity = 143/151 (94.70%), Postives = 146/151 (96.69%), Query Frame = 0

Query: 25  VKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVE 84
           VKV NSDNMSTQVNN PRLNGANFK+WKEDIQIVLGCMDLDLALRVDRPTS EENPNKVE
Sbjct: 4   VKVXNSDNMSTQVNNIPRLNGANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVE 63

Query: 85  IEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT 144
           I+KWDR NRM LMIMKRSIPETFRGSI+EGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT
Sbjct: 64  IKKWDRSNRMCLMIMKRSIPETFRGSIVEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT 123

Query: 145 SSRYVGKGNIREYIMQMSNVATKLKALKLEV 176
           SSRYVGKGNIREYIMQMSNVATKLKALKLEV
Sbjct: 124 SSRYVGKGNIREYIMQMSNVATKLKALKLEV 154

BLAST of Moc01g16790 vs. ExPASy TrEMBL
Match: A0A6J1DQP2 (uncharacterized protein LOC111022228 OS=Momordica charantia OX=3673 GN=LOC111022228 PE=4 SV=1)

HSP 1 Score: 259.2 bits (661), Expect = 1.2e-65
Identity = 131/144 (90.97%), Postives = 136/144 (94.44%), Query Frame = 0

Query: 33  MSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEIEKWDRFN 92
           MSTQVNN PRLN ANFK+WKEDIQIVLGCMDLDLALRVDRPTS EENPNKVEIEKWDR N
Sbjct: 1   MSTQVNNIPRLNVANFKDWKEDIQIVLGCMDLDLALRVDRPTSTEENPNKVEIEKWDRSN 60

Query: 93  RMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLTSSRYVGKG 152
           RM LMIMKRSIPETFRGSI+EGTNAK FLKEM+QYFTKNDKAEASTLM KLTSSRYVGKG
Sbjct: 61  RMCLMIMKRSIPETFRGSIVEGTNAKSFLKEMKQYFTKNDKAEASTLMTKLTSSRYVGKG 120

Query: 153 NIREYIMQMSNVATKLKALKLEVS 177
           NIREY MQMS+VATKLKALKL+VS
Sbjct: 121 NIREYKMQMSSVATKLKALKLKVS 144

BLAST of Moc01g16790 vs. ExPASy TrEMBL
Match: A0A151UI88 (Uncharacterized protein (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_050268 PE=4 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 2.1e-49
Identity = 100/152 (65.79%), Postives = 124/152 (81.58%), Query Frame = 0

Query: 25  VKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVE 84
           V VA++ N+S Q+N  P  NG NFK WKE ++I+LGCMDLDLALR ++ T   ENP++ +
Sbjct: 12  VAVASAVNLSAQINCIPMFNGTNFKAWKEAVEIILGCMDLDLALRAEKLTPNPENPDEDK 71

Query: 85  IEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT 144
           +EKW+R NRM LMIMKRS+PE FRGSI E  NAKGFL  +EQYFT N+KA+AS+L+AKL 
Sbjct: 72  VEKWERSNRMCLMIMKRSVPEVFRGSISESQNAKGFLDAIEQYFTSNEKADASSLLAKLI 131

Query: 145 SSRYVGKGNIREYIMQMSNVATKLKALKLEVS 177
           S RY GKGNIREYIM+MSN+A+KLKALKLE+S
Sbjct: 132 SMRYKGKGNIREYIMEMSNLASKLKALKLELS 163

BLAST of Moc01g16790 vs. ExPASy TrEMBL
Match: A0A445LAJ7 (Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_006823 PE=4 SV=1)

HSP 1 Score: 205.3 bits (521), Expect = 2.1e-49
Identity = 101/151 (66.89%), Postives = 122/151 (80.79%), Query Frame = 0

Query: 25  VKVANSDNMSTQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVE 84
           V + +  N++ QVN+ P LNG NFK WKE ++IVLGCMDLDLALR +RP S  E  N+V+
Sbjct: 41  VNITSVANVTAQVNSIPMLNGTNFKVWKEVVEIVLGCMDLDLALRTERPISTPETSNEVK 100

Query: 85  IEKWDRFNRMFLMIMKRSIPETFRGSIIEGTNAKGFLKEMEQYFTKNDKAEASTLMAKLT 144
           IEKWDR NRM LMIMKRSIPE FRGSI EG +AK FL+E+EQYF KN+KAE S L+AKL 
Sbjct: 101 IEKWDRSNRMCLMIMKRSIPEAFRGSISEGQSAKKFLEEIEQYFAKNEKAETSNLLAKLI 160

Query: 145 SSRYVGKGNIREYIMQMSNVATKLKALKLEV 176
           S +Y GKGNIREYIM+M N+A+KLK+LKLE+
Sbjct: 161 SMKYKGKGNIREYIMEMPNLASKLKSLKLEL 191

BLAST of Moc01g16790 vs. TAIR 10
Match: AT5G53670.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: endomembrane system; EXPRESSED IN: sperm cell; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 113.2 bits (282), Expect = 2.1e-25
Identity = 59/138 (42.75%), Postives = 88/138 (63.77%), Query Frame = 0

Query: 35  TQVNNNPRLNGANFKNWKEDIQIVLGCMDLDLALRVDRPTSIEENPNKVEIEKWDRFNRM 94
           + V++ P L+G+NF  WKE + +VL  MDLDL+L  +RP+S +      E++ WDR NR+
Sbjct: 33  SNVDSIPMLSGSNFSEWKEHLLLVLALMDLDLSLMTERPSSPK------ELKHWDRSNRV 92

Query: 95  FLMIMKRSIPETFRGSIIEG-TNAKGFLKEMEQYFTKNDKAEASTLMAKLTSSRYVGKGN 154
            +MIMK  IP+ FRG + +  T AK FL  +E +F KN++AE S + A+ +S  Y+   N
Sbjct: 93  SIMIMKIRIPQGFRGVVPDDVTTAKDFLASLENFFAKNEEAERSRVQAESSSMSYIENEN 152

Query: 155 IREYIMQMSNVATKLKAL 172
           +RE IM+M  +  K K L
Sbjct: 153 VRELIMRMKTLGAKRKRL 164

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022152232.14.4e-78100.00uncharacterized protein LOC111020001 [Momordica charantia][more]
XP_022156979.17.3e-7394.70uncharacterized protein LOC111023808 [Momordica charantia][more]
XP_022155096.12.5e-6590.97uncharacterized protein LOC111022228 [Momordica charantia][more]
KAF8413461.11.6e-5167.97hypothetical protein HHK36_001448 [Tetracentron sinense][more]
KAF8394168.13.0e-5066.03hypothetical protein HHK36_020374 [Tetracentron sinense][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1DFM12.1e-78100.00uncharacterized protein LOC111020001 OS=Momordica charantia OX=3673 GN=LOC111020... [more]
A0A6J1DV673.6e-7394.70uncharacterized protein LOC111023808 OS=Momordica charantia OX=3673 GN=LOC111023... [more]
A0A6J1DQP21.2e-6590.97uncharacterized protein LOC111022228 OS=Momordica charantia OX=3673 GN=LOC111022... [more]
A0A151UI882.1e-4965.79Uncharacterized protein (Fragment) OS=Cajanus cajan OX=3821 GN=KK1_050268 PE=4 S... [more]
A0A445LAJ72.1e-4966.89Uncharacterized protein OS=Glycine soja OX=3848 GN=D0Y65_006823 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G53670.12.1e-2542.75unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 98..175
e-value: 7.1E-7
score: 29.1
NoneNo IPR availablePANTHERPTHR35317:SF3TRANSMEMBRANE PROTEINcoord: 37..176
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 37..176

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc01g16790.1Moc01g16790.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding