Moc03g21740 (gene) Bitter gourd (OHB3-1) v2

Overview
Name: Moc03g21740
Type: gene
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ribonuclease H
Location: chr3: 14951001 .. 14954836 (+)
RNA-Seq Expression: Moc03g21740
Synteny: Moc03g21740
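
The Location field packs the chromosome, start, end and strand into one string. A minimal sketch of how it could be parsed and turned into a gene span is given below. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive (the usual GFF-style convention), which this page does not state.

```python
import re

# Pattern for strings such as "chr3: 14951001 .. 14954836 (+)".
LOCATION_RE = re.compile(r"\s*([^\s:]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into (seqid, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return seqid, int(start), int(end), strand


seqid, start, end, strand = parse_location("chr3: 14951001 .. 14954836 (+)")

# Assumption: 1-based inclusive coordinates, so both end points are counted.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 3836 bp on the plus strand of chr3.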
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCTCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAGGGCCAACCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGTACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAGAAAGAGAGGCTCGTCTCTCTGAAAAGGGCAGTCACCATCCCGCTCCCACAGGAGCTCCAACCAGCAGGCTGAATTCTCTCACAATCCCGCAGAGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGTCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGATGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCCAGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCAGTGGCCGACCTGAGTATCGAAGGGTGGAGAACGGACCTACCAGGAGCCGACCTTACGAGCGCTTCACCCCAACCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGATTCTGGAATGGAAAAACTACTCAAGGATCAGCTCGTGGACGCCACAGCCGGGCACGAACTGCTCACCTTCATGGACGCCTACTCTGGGTACAACCAAATCAAGATGCATGTCTCAGATGAAGGTCATACCGCTTTCATAACAGACCAAGGTCTGTACTGCTACAAGGTCATGCCCTTCGGGTTAAAGAACGCAGGAGCGACCTACCAGAAAATGGTGAACAAAATGTTCGCCAAGCAGATTGGCTGGAATATGGAAGTGTATGTGGACGACATGCTTGTCAAGAGCAAGCAGTCTAAGTCGCATCTCTCCGACCTGGTCGAAGCCTTCGAGGTTCTGAGGGCATATCAAATGAAGCTCAACCCTGCTAAGTATGCCTTTGGAGTCACCTCGGGAAAATTCCTTGGCTTCATGGTAAACAACCGGGGAATCGAGGCCAACCCCGAAAAGATTAAAGCCGTGATCGAGATGGAGGCACCTAAAACGCTGAAGCAGCTTCAGTGCCTCAATGGCAGGATTGCGGCCCTGAACCGGTTTGTTTC
AAGGTCGACAGATAAGTGTCTCCCTTTCTTCAAGGTCTTACGAAAGAAAGGGCCGTTTTAATGGACAGCGGAGTGCGAGCAAGCGTTTCAGCAATTGAAGAGCTACCTCTGTTCGGCACCCATGCTTGCCAAGCCCATGCCGGGGGACAAGCTCCAATTGTACCTAGCAGTGTCTGACAGTGCCGTCAGCTCGGCCCTAATCAGGCAAGAGGAAGCGCGGCAAAACCCGGTCTACTACACAAGCAAGGCTATGACCGAAGCCGAGACTAGATACCCTCAGATGGAGAAGTTGGCTCTCGCTTTGGTCACCTCGGCCCGACGACTTAGACCATACTTCCAAGCCCATACGGTGGTGGTGCTCACTAACTTGCCCCTAAAGAGCATCTTCCATAAGCCAGAAGCTTCTGGTCGCCTAATGAAGTGGGCAATGGAGCTAAGTGAGTACGACATCCAGTTCGAACCCAGAACTGCGTTGAAAAGACAAGCAGCGGTAGATTTCATAGCCGAGCTCACACCACCTTCCGAGCTGAGCGAGTCCGACCTACCGTGGACAGTCTATGTCGACGGATCCTCCAATGAGAAGGGGTGCGGAGCCGGGGTCCTCTTGCTCGGACCAGGGGGTAAGCGATTTGAGTATGCCTTGCGGTTCAGCTTCCGGACTTCTAACAACGAGACAGAGTATGAAGCATTTATTGCCGGCCTGCGAATCGCTCGAGCATTGGGGGCCTCTTGTGTTAAGGTCTGCAGTGACTCTCAGCTGGTTGTGAGCCAGATCAAGGACGAATACCAAGCCAAAGACACCCGAATGGAGAAGTATTTGGGCAAGGTCAGATCATACCTCGCCCAGTTTCGAACTTACGAGGTAAGCCGGATTCCGCGAGCAGAAAATTCTAATGCTGACGCCTTGGCCAAGTTAGCATCGACGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTGTATTGAAATGCCTAACCCCTGAAGAGGGCTTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCAGTCGCTATCAGCCAAAGTGATCCGGCAAGGATACTATTGGCCGACCCTCAACCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGGGCCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCATGGCCATTCGCGCAGTGGGGGGTAGATATCATTGGTCCTTTCCCTTTGGGCAAGGGCCAGACCAAGCTGTAATCCCGGTTGAGATCGGCATGCCGTCTGACAGAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGACCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACTTGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA

mRNA sequence

ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCTCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAGGGCCAACCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGAGCTCCAACCAGCAGGCTGAATTCTCTCACAATCCCGCAGAGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGTCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGATGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCCAGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCACATCGACGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTGTATTGAAATGCCTAACCCCTGAAGAGGGCTTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCAGTCGCTATCAGCCAAAGTGATCCGGCAAGGATACTATTGGCCGACCCTCAACCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGGGCCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCATGGCCATTCGCGCAAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGACCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACT
TGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA

Coding sequence (CDS)

ATGGTTCAACCAGCGAATTCGACCAATACGACGGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCGGCGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCTCTCCGCAGGTCGGCACGGATCACCGCACCTGCCCTACCGCCTGCGCACCCGAGGACGTCCAGGGCCAACCGTGGTCGAGGTGGGACCTCTAAGAAGGGCGCCCAGGGTCCAGCCCCGGCTCCAACAAGCGAGAACTTTGATGCGCTCCAGAGAGAGATGGAGGCAATGCGCACACAAATGCGCTCCATGGAGGCAATGTATAACGAAATGGTGCTGGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGAGTGACGCGCATGGACGTACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGCCGAGGAAGAACGTCCCGAAGACAACGAGAGTGAGGGGAGCTCCAACCAGCAGGCTGAATTCTCTCACAATCCCGCAGAGATAATCACAAGGGAGGAGTTCGACCAGCTGAGGGGGGAGCTCGATGCTCAGGTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGACGATTCACTGAACGACGGCGACTTGGGAGAATCGCCTTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCGAAGTTCAAAGCTCCTACCGTGAAGCCTTATGACGGGACGAAGGACCCCAAGGACTATGTTGAGGTCTTTGAAGGCCTCATGGACTTCCAAGCGGCATCAGACGCAATCAAATGTCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCGATTGTGGTACCGGAGACTGCCAGTCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCACTATGACAAAAAGACAACGACCCATCTCGCCACCATCAGGCAGAAGGAGGGTGAGACGCTGCGGGAGTATGTCACCAGATTCCAGGAGGAACAGTTGAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTCCTCACCGGTCTAGCCGACGAAGCCCTCACGATGAAGCTTGGAGAGGAGGCCCCGGCCACCTTCGCCGAGGTACTCCAGAAGGCGAAGAAAGTCATCGACGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCTGAACGAAAGATCGGCCGGGGCAGAAGTGGAAAAGATGAAAGGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTCTCCACATCGACGTACGAGACCGACCTCGCCAGGTCGGTCCCCGTCGAGATCCTAGATAATCCCTCGATCTTAGAGCCAGATCTGATGGAGATCGGCGCTCCAGAATCCTCATGGATGGACCCGATCACGGACTTCATTAGGGGTAACTCACCACAAGATCCCAAGGAGCGCAGAAAGTTGGCAAGGCGGGCAGCTCGGTTCGTGGTCCGAGATGGTGCATTGTACCGACGTGGCTTTTCCCTGCCTGTATTGAAATGCCTAACCCCTGAAGAGGGCTTGTACGTCCTCAGAGAGATCCACGAAGGAGTGTGCGGCAATCACTCAGGCGCCCAGTCGCTATCAGCCAAAGTGATCCGGCAAGGATACTATTGGCCGACCCTCAACCAGGACGCCAAGAAGTTCGTTAGAACTTGCGACAATTGCCAACGCTACGGGGCCATAATCCACCAACCTCCCGAGCTGCTCACCCCCATCTCGGCCCCATGGCCATTCGCGCAAGTAGAGCATTATGAGCCTACGACGAATGAGGATGGGCTGCTCCTCAACCTCGACTTGTTGGAAGAAAGAAGAGCAATGGCCCAGCTACGCCTGGCGGAATATCAGGGCAGAATGGCCAGACATTACAATGCCCGCGTCCGACCTCGGGCCTTTCAGGTCGGACATCTGGTCTTAAGGAGGGTCCAAACGCATGTGGGTGCCCTTGACCCGACCTGGGAGGGCCCGTTTGAGATCAAGGGCATAGTCCGACT
TGGGACGTACATATTGGCCGATCTGAAAGGAGACGTCCTCGCGCACCCGTGGAACGCGGAGCACCTGAAGCGTTATTATCCTTGA

Protein sequence

MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHPRTSRANRGRGGTSKKGAQGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAAGAGSRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPAEIITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDERADPKSKDKGSFSTSTYETDLARSVPVEILDNPSILEPDLMEIGAPESSWMDPITDFIRGNSPQDPKERRKLARRAARFVVRDGALYRRGFSLPVLKCLTPEEGLYVLREIHEGVCGNHSGAQSLSAKVIRQGYYWPTLNQDAKKFVRTCDNCQRYGAIIHQPPELLTPISAPWPFAQVEHYEPTTNEDGLLLNLDLLEERRAMAQLRLAEYQGRMARHYNARVRPRAFQVGHLVLRRVQTHVGALDPTWEGPFEIKGIVRLGTYILADLKGDVLAHPWNAEHLKRYYP
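
The protein sequence above is the translation of the CDS under the standard genetic code. As a rough illustration, the sketch below translates the first and last few codons of the CDS and compares them with the ends of the protein. The `translate` helper is hypothetical and written for this example only; it assumes the standard code and an in-frame input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 bases and last 15 bases of the CDS shown above.
print(translate("ATGGTTCAACCAGCGAATTCGACC"))  # start of the protein
print(translate("CGTTATTATCCTTGA"))           # end of the protein, then stop
```

The two fragments translate to MVQPANST and RYYP, matching the beginning and end of the protein sequence listed above.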
Homology
BLAST of Moc03g21740 vs. NCBI nr
Match: XP_022159327.1 (uncharacterized protein LOC111025738 [Momordica charantia])

HSP 1 Score: 535.0 bits (1377), Expect = 9.4e-148
Identity = 295/427 (69.09%), Positives = 338/427 (79.16%), Query Frame = 0

Query: 1   MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHP 60
           MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL TEP  RSARIT P L PAHP
Sbjct: 1   MVQPVDSTNTGDRRALVANDGHQREVGAEVVEGQIHEGLGTEPFCRSARITTPDLSPAHP 60

Query: 61  RTSRANRGRGGTSKKGAQGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAAGAG 120
           +  +ANRGRGG S++   G APAP+ ENFDALQ+EMEAMRTQM +ME MYNEMV A GAG
Sbjct: 61  KPFKANRGRGGASRRTTLGAAPAPSRENFDALQKEMEAMRTQMLTMEEMYNEMVQAVGAG 120

Query: 121 SRSENRVT---RMDVREQRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPA---EIITRE 180
           SRSE+R     R D+R+        +  +    + S  +SNQQAE S+NP     +ITRE
Sbjct: 121 SRSEDRAARDERGDLRDHLSRKRSSSLRKGRSPSCSHKNSNQQAESSYNPVVPEGVITRE 180

Query: 181 EFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG 240
           EFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IP KFK PT+KPYDG
Sbjct: 181 EFDQLKSKFDAQVETLKARCEVKGSTFDDGDLGESPFTSDILEALIPSKFKTPTMKPYDG 240

Query: 241 TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFL 300
           +KDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLP RSISTYSQLR+EF 
Sbjct: 241 SKDPKDYVEVFEGLMGFQAATDAIKYRAFQIALTSSARLWYRRLPARSISTYSQLRKEFN 300

Query: 301 AQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE 360
           +QFSSRHY++KT THLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE
Sbjct: 301 SQFSSRHYERKTATHLATIRQKERETLREYVTWFQEEQLKVAHYSDDSALCYFLTDLVDE 360

Query: 361 ALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKD 420
            LT+KLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR E++I + +  +++R A+ KSKD
Sbjct: 361 TLTVKLGEEAPATFAEVLQKAKKVIDGQELFRTKTGRSEKQIDQKKPSQEKRKAESKSKD 420
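
The percentages on each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from one summary line as a sanity check. The `hsp_percentages` helper is hypothetical, and the regular expression also tolerates the misspelt label "Postives" that some exports of these results carry.

```python
import re

# Matches "Identity = 295/427" and "Positives = 338/427" (or "Postives").
HSP_STATS = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(line):
    """Return {'Identity': pct, 'Positives': pct} from a BLAST HSP summary line."""
    stats = {}
    for label, hits, length in HSP_STATS.findall(line):
        key = "Identity" if label == "Identity" else "Positives"
        stats[key] = round(100 * int(hits) / int(length), 2)
    return stats


line = "Identity = 295/427 (69.09%), Positives = 338/427 (79.16%), Query Frame = 0"
print(hsp_percentages(line))
```

For the first hit, 295 identical positions over an alignment length of 427 gives 69.09%, and 338 positive positions gives 79.16%, in agreement with the reported values.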

BLAST of Moc03g21740 vs. NCBI nr
Match: XP_022156542.1 (uncharacterized protein LOC111023421 [Momordica charantia])

HSP 1 Score: 478.8 bits (1231), Expect = 8.0e-131
Identity = 249/263 (94.68%), Positives = 252/263 (95.82%), Query Frame = 0

Query: 170 IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTV 229
           IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTV
Sbjct: 27  IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTV 86

Query: 230 KPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQL 289
           KPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQL
Sbjct: 87  KPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQL 146

Query: 290 RREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT 349
           RREFLAQFSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT
Sbjct: 147 RREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT 206

Query: 350 GLADEALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERAD 409
           GLADEALT+KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERAD
Sbjct: 207 GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERAD 266

Query: 410 PKSKDKGSFSTSTYETDLARSVP 432
           PKSKDKGSFS+   E   A + P
Sbjct: 267 PKSKDKGSFSSGRAEYRRAENGP 289

BLAST of Moc03g21740 vs. NCBI nr
Match: XP_022152033.1 (uncharacterized protein LOC111019842 [Momordica charantia])

HSP 1 Score: 467.6 bits (1202), Expect = 1.8e-127
Identity = 251/289 (86.85%), Positives = 261/289 (90.31%), Query Frame = 0

Query: 135 QRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPAE---IITREEFDQLRGELDAQVEALK 194
           QRGS L   +      + S  SSNQQAE SHNPA    +ITREEFDQLRG+LDAQVEALK
Sbjct: 28  QRGSSLRKGQ----SPSRSHRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALK 87

Query: 195 AKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDF 254
           AKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG++DPKDYVEVFEGLMDF
Sbjct: 88  AKCEQKEGSLNDGDLGESPFTSDVLEAPIPXKFKAPTVKPYDGSRDPKDYVEVFEGLMDF 147

Query: 255 QAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHYDKKTTTHLA 314
           QAASD IKCRAFQIALT SARLWYRRLP RSISTYSQLRREFLAQFSSRHYDK+T THLA
Sbjct: 148 QAASDTIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKETATHLA 207

Query: 315 TIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTMKLGEEAPATFAEV 374
           TIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA T+KLGEEAPATFAEV
Sbjct: 208 TIRQKEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAXTVKLGEEAPATFAEV 267

Query: 375 LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFST 420
           LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERAD KSKDKGSFS+
Sbjct: 268 LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADLKSKDKGSFSS 312

BLAST of Moc03g21740 vs. NCBI nr
Match: XP_022137317.1 (uncharacterized protein LOC111008813 [Momordica charantia])

HSP 1 Score: 463.8 bits (1192), Expect = 2.7e-126
Identity = 244/276 (88.41%), Positives = 253/276 (91.67%), Query Frame = 0

Query: 160 QAEFSHNPAE---IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 219
           +AE S NPA    +ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVL
Sbjct: 3   KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62

Query: 220 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 279
           EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63  EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122

Query: 280 RLPVRSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA 339
           RLP  SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182

Query: 340 HCSDDSAMCYFLTGLADEALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 399
           HCSDDSAMCYFLTGLADEALT+KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242

Query: 400 GRGRSGKD-ERADPKSKDKGSFSTSTYETDLARSVP 432
           GRGRSGKD E ADPKSKDKGSFS+   E   A + P
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGP 278

BLAST of Moc03g21740 vs. NCBI nr
Match: XP_022152854.1 (uncharacterized protein LOC111020479 [Momordica charantia])

HSP 1 Score: 457.6 bits (1176), Expect = 1.9e-124
Identity = 261/432 (60.42%), Positives = 291/432 (67.36%), Query Frame = 0

Query: 1   MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHP 60
           MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP
Sbjct: 1   MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60

Query: 61  RTSRANRGRGGTSKKGAQGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAAGAG 120
           + S+                                                        
Sbjct: 61  KPSK-------------------------------------------------------- 120

Query: 121 SRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPAE--IITREEFDQ 180
                                                   AE S+NP    +ITREEFDQ
Sbjct: 121 ----------------------------------------AESSYNPITPGVITREEFDQ 180

Query: 181 LRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDP 240
           L+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDP
Sbjct: 181 LKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKPYDGSKDP 240

Query: 241 KDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFS 300
           KDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFS
Sbjct: 241 KDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRKEFISQFS 300

Query: 301 SRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTM 360
           SRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT+
Sbjct: 301 SRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGLADETLTV 335

Query: 361 KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKGSF 420
           KL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG  
Sbjct: 361 KLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSKSRDKGP- 335

Query: 421 STSTYETDLARS 430
           S+S+   D  RS
Sbjct: 421 SSSSSRVDYRRS 335

BLAST of Moc03g21740 vs. ExPASy Swiss-Prot
Match: Q5RBK0 (Gypsy retrotransposon integrase-like protein 1 OS=Pongo abelii OX=9601 GN=GIN1 PE=2 SV=1)

HSP 1 Score: 66.6 bits (161), Expect = 1.2e-09
Identity = 39/122 (31.97%), Positives = 58/122 (47.54%), Query Frame = 0

Query: 471 PKERRKLARRAARFVVRDGALYRRGFSLPV--LKCLTPEEGLYVLREIHEGVCGNHSGAQ 530
           P ERR + R A +FV ++  L+  G       L  ++ EE   VLRE HE   G H G  
Sbjct: 30  PSERRGIRRAAKKFVFKEKKLFYVGKDRKQNRLVIVSEEEKKKVLRECHENDSGAHHGI- 89

Query: 531 SLSAKVIRQGYYWPTLNQDAKKFVRTCDNCQ-RYGAIIHQPPELLTPISAPWPFAQVEHY 590
           S +  ++   YYW ++  D K++V  C +CQ     +I  P + L  +  PW    V+  
Sbjct: 90  SRTLTLVESNYYWTSVTNDVKQWVYACQHCQVAKNTVIVAPKQHLLKVENPWSLVTVDLM 149

BLAST of Moc03g21740 vs. ExPASy Swiss-Prot
Match: Q4R6I1 (Gypsy retrotransposon integrase-like protein 1 OS=Macaca fascicularis OX=9541 GN=GIN1 PE=2 SV=1)

HSP 1 Score: 65.9 bits (159), Expect = 2.1e-09
Identity = 39/122 (31.97%), Positives = 57/122 (46.72%), Query Frame = 0

Query: 471 PKERRKLARRAARFVVRDGALYRRGFSLPV--LKCLTPEEGLYVLREIHEGVCGNHSGAQ 530
           P ER  + R A +FV +D  L+  G       L  ++ EE   VLRE HE   G H G  
Sbjct: 30  PSERSGIRRAAKKFVFKDKKLFYVGEDRKQNRLVIVSEEEKKKVLRECHENDSGAHHGI- 89

Query: 531 SLSAKVIRQGYYWPTLNQDAKKFVRTCDNCQ-RYGAIIHQPPELLTPISAPWPFAQVEHY 590
           S +  ++   YYW ++  D K++V  C +CQ     +I  P + L  +  PW    V+  
Sbjct: 90  SRTLTLVESNYYWTSVTNDVKQWVYACQHCQVAKNTVIVAPKQHLLKVENPWSLVTVDLM 149

BLAST of Moc03g21740 vs. ExPASy Swiss-Prot
Match: Q8K259 (Gypsy retrotransposon integrase-like protein 1 OS=Mus musculus OX=10090 GN=Gin1 PE=2 SV=2)

HSP 1 Score: 64.7 bits (156), Expect = 4.7e-09
Identity = 39/122 (31.97%), Positives = 58/122 (47.54%), Query Frame = 0

Query: 471 PKERRKLARRAARFVVRDGALYRRGFSLPV--LKCLTPEEGLYVLREIHEGVCGNHSGAQ 530
           P ER  + R A +FV ++  L+  G       L  ++ EE   VLRE HE   G H G  
Sbjct: 30  PSERSGIRRAAKKFVFKEKKLFYVGKDRKQNRLVVVSEEEKKKVLRECHENGPGVHHGI- 89

Query: 531 SLSAKVIRQGYYWPTLNQDAKKFVRTCDNCQ-RYGAIIHQPPELLTPISAPWPFAQVEHY 590
           S +  ++  GYYW ++  D K++V  C +CQ     +I  P + L  +  PW    V+  
Sbjct: 90  SRTLTLVESGYYWTSVTNDVKQWVYACQHCQVAKNTVIVAPQQHLPMVGNPWSVVTVDLM 149

BLAST of Moc03g21740 vs. ExPASy Swiss-Prot
Match: Q9NXP7 (Gypsy retrotransposon integrase-like protein 1 OS=Homo sapiens OX=9606 GN=GIN1 PE=1 SV=3)

HSP 1 Score: 64.3 bits (155), Expect = 6.2e-09
Identity = 38/122 (31.15%), Positives = 57/122 (46.72%), Query Frame = 0

Query: 471 PKERRKLARRAARFVVRDGALYRRGFSLPV--LKCLTPEEGLYVLREIHEGVCGNHSGAQ 530
           P ER  + R A +FV ++  L+  G       L  ++ EE   VLRE HE   G H G  
Sbjct: 30  PSERSGIRRAAKKFVFKEKKLFYVGKDRKQNRLVIVSEEEKKKVLRECHENDSGAHHGI- 89

Query: 531 SLSAKVIRQGYYWPTLNQDAKKFVRTCDNCQ-RYGAIIHQPPELLTPISAPWPFAQVEHY 590
           S +  ++   YYW ++  D K++V  C +CQ     +I  P + L  +  PW    V+  
Sbjct: 90  SRTLTLVESNYYWTSVTNDVKQWVYACQHCQVAKNTVIVAPKQHLLKVENPWSLVTVDLM 149

BLAST of Moc03g21740 vs. ExPASy Swiss-Prot
Match: Q66H30 (Gypsy retrotransposon integrase-like protein 1 OS=Rattus norvegicus OX=10116 GN=GIN1 PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 8.9e-08
Identity = 37/120 (30.83%), Positives = 57/120 (47.50%), Query Frame = 0

Query: 473 ERRKLARRAARFVVRDGALYRRGFSLPV--LKCLTPEEGLYVLREIHEGVCGNHSGAQSL 532
           ER  + R A +FV ++  L+  G       L  ++ EE   VLRE HE   G H G  S 
Sbjct: 32  ERSGIRRAAKKFVFKEKKLFYVGKDRKQNRLVVVSEEEKKKVLRECHENGPGVHHGI-SR 91

Query: 533 SAKVIRQGYYWPTLNQDAKKFVRTCDNCQ-RYGAIIHQPPELLTPISAPWPFAQVEHYEP 590
           +  ++   YYW ++  D K++V  C +CQ     +I  P + L+ +  PW    V+   P
Sbjct: 92  TLTLVESSYYWTSVTNDVKQWVYACQHCQVAKSTVIVAPQQHLSVVGNPWSVVTVDLMGP 150

BLAST of Moc03g21740 vs. ExPASy TrEMBL
Match: A0A6J1DZJ1 (uncharacterized protein LOC111025738 OS=Momordica charantia OX=3673 GN=LOC111025738 PE=4 SV=1)

HSP 1 Score: 535.0 bits (1377), Expect = 4.6e-148
Identity = 295/427 (69.09%), Positives = 338/427 (79.16%), Query Frame = 0

Query: 1   MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHP 60
           MVQP +STNT DRR L A+D HQREVGA  VEGQ H+GL TEP  RSARIT P L PAHP
Sbjct: 1   MVQPVDSTNTGDRRALVANDGHQREVGAEVVEGQIHEGLGTEPFCRSARITTPDLSPAHP 60

Query: 61  RTSRANRGRGGTSKKGAQGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAAGAG 120
           +  +ANRGRGG S++   G APAP+ ENFDALQ+EMEAMRTQM +ME MYNEMV A GAG
Sbjct: 61  KPFKANRGRGGASRRTTLGAAPAPSRENFDALQKEMEAMRTQMLTMEEMYNEMVQAVGAG 120

Query: 121 SRSENRVT---RMDVREQRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPA---EIITRE 180
           SRSE+R     R D+R+        +  +    + S  +SNQQAE S+NP     +ITRE
Sbjct: 121 SRSEDRAARDERGDLRDHLSRKRSSSLRKGRSPSCSHKNSNQQAESSYNPVVPEGVITRE 180

Query: 181 EFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG 240
           EFDQL+ + DAQVE LKA+CE K  + +DGDLGESPFTSD+LEA IP KFK PT+KPYDG
Sbjct: 181 EFDQLKSKFDAQVETLKARCEVKGSTFDDGDLGESPFTSDILEALIPSKFKTPTMKPYDG 240

Query: 241 TKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFL 300
           +KDPKDYVEVFEGLM FQAA+DAIK RAFQIALT SARLWYRRLP RSISTYSQLR+EF 
Sbjct: 241 SKDPKDYVEVFEGLMGFQAATDAIKYRAFQIALTSSARLWYRRLPARSISTYSQLRKEFN 300

Query: 301 AQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADE 360
           +QFSSRHY++KT THLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLT L DE
Sbjct: 301 SQFSSRHYERKTATHLATIRQKERETLREYVTWFQEEQLKVAHYSDDSALCYFLTDLVDE 360

Query: 361 ALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDER-ADPKSKD 420
            LT+KLGEEAPATFAEVLQKAKKVIDGQEL RTKTGR E++I + +  +++R A+ KSKD
Sbjct: 361 TLTVKLGEEAPATFAEVLQKAKKVIDGQELFRTKTGRSEKQIDQKKPSQEKRKAESKSKD 420

BLAST of Moc03g21740 vs. ExPASy TrEMBL
Match: A0A6J1DS95 (uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023421 PE=4 SV=1)

HSP 1 Score: 478.8 bits (1231), Expect = 3.9e-131
Identity = 249/263 (94.68%), Positives = 252/263 (95.82%), Query Frame = 0

Query: 170 IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTV 229
           IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTV
Sbjct: 27  IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTV 86

Query: 230 KPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQL 289
           KPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLP RSISTYSQL
Sbjct: 87  KPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPXRSISTYSQL 146

Query: 290 RREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT 349
           RREFLAQFSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT
Sbjct: 147 RREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLT 206

Query: 350 GLADEALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERAD 409
           GLADEALT+KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERAD
Sbjct: 207 GLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDVERAD 266

Query: 410 PKSKDKGSFSTSTYETDLARSVP 432
           PKSKDKGSFS+   E   A + P
Sbjct: 267 PKSKDKGSFSSGRAEYRRAENGP 289

BLAST of Moc03g21740 vs. ExPASy TrEMBL
Match: A0A6J1DDS5 (uncharacterized protein LOC111019842 OS=Momordica charantia OX=3673 GN=LOC111019842 PE=4 SV=1)

HSP 1 Score: 467.6 bits (1202), Expect = 8.9e-128
Identity = 251/289 (86.85%), Positives = 261/289 (90.31%), Query Frame = 0

Query: 135 QRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPAE---IITREEFDQLRGELDAQVEALK 194
           QRGS L   +      + S  SSNQQAE SHNPA    +ITREEFDQLRG+LDAQVEALK
Sbjct: 28  QRGSSLRKGQ----SPSRSHRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALK 87

Query: 195 AKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDF 254
           AKCEQK+ SLNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDG++DPKDYVEVFEGLMDF
Sbjct: 88  AKCEQKEGSLNDGDLGESPFTSDVLEAPIPXKFKAPTVKPYDGSRDPKDYVEVFEGLMDF 147

Query: 255 QAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFSSRHYDKKTTTHLA 314
           QAASD IKCRAFQIALT SARLWYRRLP RSISTYSQLRREFLAQFSSRHYDK+T THLA
Sbjct: 148 QAASDTIKCRAFQIALTDSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKETATHLA 207

Query: 315 TIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTMKLGEEAPATFAEV 374
           TIRQKEGETLREYVTRFQEEQLKV HCSDDSAMCYFLTGLADEA T+KLGEEAPATFAEV
Sbjct: 208 TIRQKEGETLREYVTRFQEEQLKVTHCSDDSAMCYFLTGLADEAXTVKLGEEAPATFAEV 267

Query: 375 LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD-ERADPKSKDKGSFST 420
           LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKD ERAD KSKDKGSFS+
Sbjct: 268 LQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDIERADLKSKDKGSFSS 312

BLAST of Moc03g21740 vs. ExPASy TrEMBL
Match: A0A6J1C7X5 (uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008813 PE=4 SV=1)

HSP 1 Score: 463.8 bits (1192), Expect = 1.3e-126
Identity = 244/276 (88.41%), Positives = 253/276 (91.67%), Query Frame = 0

Query: 160 QAEFSHNPAE---IITREEFDQLRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVL 219
           +AE S NPA    +ITREEFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVL
Sbjct: 3   KAESSRNPATPAGVITREEFDQLRGQLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVL 62

Query: 220 EAPIPPKFKAPTVKPYDGTKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYR 279
           EAPIPPKFKAPTVKPYDG+KDPKDYVEVFE LMDFQAASDAIKCRAF+IALTGSARLWYR
Sbjct: 63  EAPIPPKFKAPTVKPYDGSKDPKDYVEVFESLMDFQAASDAIKCRAFRIALTGSARLWYR 122

Query: 280 RLPVRSISTYSQLRREFLAQFSSRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVA 339
           RLP  SISTYSQLRREFLA FSSRHYDKKT THLATIRQKEGETLREYVTRFQEEQLKVA
Sbjct: 123 RLPAXSISTYSQLRREFLAXFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVA 182

Query: 340 HCSDDSAMCYFLTGLADEALTMKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 399
           HCSDDSAMCYFLTGLADEALT+KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI
Sbjct: 183 HCSDDSAMCYFLTGLADEALTVKLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKI 242

Query: 400 GRGRSGKD-ERADPKSKDKGSFSTSTYETDLARSVP 432
           GRGRSGKD E ADPKSKDKGSFS+   E   A + P
Sbjct: 243 GRGRSGKDIENADPKSKDKGSFSSGRAEYRRAENGP 278

BLAST of Moc03g21740 vs. ExPASy TrEMBL
Match: A0A6J1DHB3 (uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020479 PE=4 SV=1)

HSP 1 Score: 457.6 bits (1176), Expect = 9.2e-125
Identity = 261/432 (60.42%), Positives = 291/432 (67.36%), Query Frame = 0

Query: 1   MVQPANSTNTTDRRTLAASDAHQREVGAAAVEGQGHDGLATEPLRRSARITAPALPPAHP 60
           MVQPANSTNT DRR LAA+  HQREVGA  VEGQGH+ L TEPL RSARIT P LPPAHP
Sbjct: 1   MVQPANSTNTADRRALAANHGHQREVGAEVVEGQGHEDLGTEPLCRSARITTPVLPPAHP 60

Query: 61  RTSRANRGRGGTSKKGAQGPAPAPTSENFDALQREMEAMRTQMRSMEAMYNEMVLAAGAG 120
           + S+                                                        
Sbjct: 61  KPSK-------------------------------------------------------- 120

Query: 121 SRSENRVTRMDVREQRGSHLGPAEEERPEDNESEGSSNQQAEFSHNPAE--IITREEFDQ 180
                                                   AE S+NP    +ITREEFDQ
Sbjct: 121 ----------------------------------------AESSYNPITPGVITREEFDQ 180

Query: 181 LRGELDAQVEALKAKCEQKDDSLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGTKDP 240
           L+ + DAQVEALKA+CE+K+ S +DGDLGE  F+SD+LEA IPPKFK PT+KPYDG+KDP
Sbjct: 181 LKSKFDAQVEALKARCEKKESSFDDGDLGELSFSSDILEALIPPKFKTPTMKPYDGSKDP 240

Query: 241 KDYVEVFEGLMDFQAASDAIKCRAFQIALTGSARLWYRRLPVRSISTYSQLRREFLAQFS 300
           KDYVEVFE LMDFQAA+DAIKC AFQIALTGSARLWYRRLP R ISTYSQLR+EF++QFS
Sbjct: 241 KDYVEVFESLMDFQAATDAIKCCAFQIALTGSARLWYRRLPARLISTYSQLRKEFISQFS 300

Query: 301 SRHYDKKTTTHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTGLADEALTM 360
           SRHYD+KT THLATIRQKEGETLREYVTRF EEQLKVAHCSDDSAMCYFLTGLADE LT+
Sbjct: 301 SRHYDRKTPTHLATIRQKEGETLREYVTRFPEEQLKVAHCSDDSAMCYFLTGLADETLTV 335

Query: 361 KLGEEAPATFAEVLQKAKKVIDGQELLRTKTGRPERKIGRGRSGKDE-RADPKSKDKGSF 420
           KL EEAPATFAEVLQK KKVIDGQELLRTKTGRPE+ I +GR+GKD+ +AD KS+DKG  
Sbjct: 361 KLREEAPATFAEVLQKTKKVIDGQELLRTKTGRPEKNIDQGRAGKDKGKADSKSRDKGP- 335

Query: 421 STSTYETDLARS 430
           S+S+   D  RS
Sbjct: 421 SSSSSRVDYRRS 335

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value   Identity (%)  Description
XP_022159327.1  9.4e-148  69.09         uncharacterized protein LOC111025738 [Momordica charantia]
XP_022156542.1  8.0e-131  94.68         uncharacterized protein LOC111023421 [Momordica charantia]
XP_022152033.1  1.8e-127  86.85         uncharacterized protein LOC111019842 [Momordica charantia]
XP_022137317.1  2.7e-126  88.41         uncharacterized protein LOC111008813 [Momordica charantia]
XP_022152854.1  1.9e-124  60.42         uncharacterized protein LOC111020479 [Momordica charantia]

ExPASy Swiss-Prot
Match Name  E-value  Identity (%)  Description
Q5RBK0      1.2e-09  31.97         Gypsy retrotransposon integrase-like protein 1 OS=Pongo abelii OX=9601 GN=GIN1 P...
Q4R6I1      2.1e-09  31.97         Gypsy retrotransposon integrase-like protein 1 OS=Macaca fascicularis OX=9541 GN...
Q8K259      4.7e-09  31.97         Gypsy retrotransposon integrase-like protein 1 OS=Mus musculus OX=10090 GN=Gin1 ...
Q9NXP7      6.2e-09  31.15         Gypsy retrotransposon integrase-like protein 1 OS=Homo sapiens OX=9606 GN=GIN1 P...
Q66H30      8.9e-08  30.83         Gypsy retrotransposon integrase-like protein 1 OS=Rattus norvegicus OX=10116 GN=...

ExPASy TrEMBL
Match Name  E-value   Identity (%)  Description
A0A6J1DZJ1  4.6e-148  69.09         uncharacterized protein LOC111025738 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A6J1DS95  3.9e-131  94.68         uncharacterized protein LOC111023421 OS=Momordica charantia OX=3673 GN=LOC111023...
A0A6J1DDS5  8.9e-128  86.85         uncharacterized protein LOC111019842 OS=Momordica charantia OX=3673 GN=LOC111019...
A0A6J1C7X5  1.3e-126  88.41         uncharacterized protein LOC111008813 OS=Momordica charantia OX=3673 GN=LOC111008...
A0A6J1DHB3  9.2e-125  60.42         uncharacterized protein LOC111020479 OS=Momordica charantia OX=3673 GN=LOC111020...

InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR Term   IPR Description                Source       Source Term  Source Description   Alignment
None       No IPR available               COILS        Coil         Coil                 coord: 173..193
None       No IPR available               COILS        Coil         Coil                 coord: 89..109
None       No IPR available               COILS        Coil         Coil                 coord: 600..620
None       No IPR available               GENE3D       1.10.340.70                       coord: 470..561; e-value: 3.1E-17; score: 64.6
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..88
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 119..167
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 1..16
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 383..420
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 383..413
None       No IPR available               MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 126..146
None       No IPR available               PANTHER      PTHR33223    FAMILY NOT NAMED     coord: 253..378
IPR041588  Integrase zinc-binding domain  PFAM         PF17921      Integrase_H2C2       coord: 512..561; e-value: 5.8E-12; score: 45.5
IPR005162  Retrotransposon gag domain     PFAM         PF03732      Retrotrans_gag       coord: 263..352; e-value: 3.4E-15; score: 56.1
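
Several of the annotated regions above overlap, and it can help to see which signatures sit inside which. The sketch below is an illustration only: it takes four of the coordinate ranges listed above and reports the pairs where one range fully contains another. The `Feature` type and the helper names are hypothetical.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    source: str
    term: str
    start: int  # residue coordinates as listed in the annotation
    end: int


# Coordinates copied from the InterPro annotation of this protein.
FEATURES = [
    Feature("PANTHER", "PTHR33223", 253, 378),
    Feature("PFAM", "PF03732", 263, 352),
    Feature("GENE3D", "1.10.340.70", 470, 561),
    Feature("PFAM", "PF17921", 512, 561),
]


def contains(outer, inner):
    """True if the inner range lies entirely within the outer range."""
    return outer.start <= inner.start and inner.end <= outer.end


def nested_pairs(features):
    """List (outer term, inner term) for every fully nested pair."""
    return [
        (outer.term, inner.term)
        for outer in features
        for inner in features
        if outer is not inner and contains(outer, inner)
    ]


print(nested_pairs(FEATURES))
```

With these coordinates, the Retrotrans_gag Pfam hit (PF03732) falls inside the PANTHER family match, and the Integrase_H2C2 Pfam hit (PF17921) falls inside the Gene3D region.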

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Moc03g21740.1  Moc03g21740.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
biological_process  GO:0090502      RNA phosphodiester bond hydrolysis, endonucleolytic
cellular_component  GO:0030430      host cell cytoplasm
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity