Moc02g14980 (gene) Bitter gourd (OHB3-1) v2

Overview
NameMoc02g14980
Typegene
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Locationchr2: 11124837 .. 11129598 (-)
RNA-Seq ExpressionMoc02g14980
SyntenyMoc02g14980
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCCCCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAACCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTACTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAGTAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAATTAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCATGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGTACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAGGAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGATGAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGATCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACACTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGATAATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTTACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGGTTCAATTCTGCTTTTGTTACACAATCTTATGTTTCTTCTTCCAAAGTGTCGTTGGATGTTTGGCACCAAAGAATAGGACACACATCCAACAAAGTCCTTGCTTCTGTTTTAAGTGTTTGTAATGTAAAATCGACTGGAAATGAAAAACCAAGCTTTTATGATTCGTGTCAATATGGAAAGTCTCATGCTTTGCCTTTCAAACTTTCTACTTCTTGTACCAATAGACCTCTTGAACTGATTCATTGTGACCTTTGGGGACCTTCTCCTGTTGTCTCAACAGCTAGTTTTTGGTTTTATATAAGCTTCGTTACAATTTTTCAAGGTTCACTTACATTTTTCCTCTTAAACATAAAGGAGAAGCCCTTTCAATCTTTATCCAATACAAAAATCTTGTAGAAAATAAATTTGACCTTAAAATCAAGTGTTTACAAAGTGATTGGGGTGGAGAGTTTAGACCATTTGTCACTTATCTAAAGCAACACGGCATTGAGTTTAGACACTCTTGTCCTCATACTAGTGAACAAAATGACATAGTAGAAAGAAAACAGAGACACATAGTGGAAATGAGTCTTACTCTACTTGCTCAAGCCTCAATGCCTTTACGATTTTGGTGGGATGCCTTTTTGTGTGCTGTTTACTTGATAAATAGACTTCCCACTTCCGTCTTACAAAATATGTCACCTTGGGAGAAACTTTCTAACAGAAAACCAGATTATTCTTTCCTAAAAGTGTTTGGTAGTACTTGTTTCCCTTGCTTGAGGTCTTATAAAAAACATAAGTTCCAATTTCATAGTACTAAATGTATATTTTTGGGATATAGTGATCAACATAAAGGATATAAATGTCTCAGCTCAAATGGAAGAATCTATATCTCTAGACACGTCATATTCAATGAGAATGAATTTCCTTTCAAGGCTGGTTTTCTCACAACGTCTCTTTCTAAACAACAATCAAATGAACTTGTCATTACCTATCTTAACTTTCCTGGTACTTCTCCCTTGAACTTGCCAAACACTAGTCCTACCTCCACAGATGTAAGACAGAATATTCTTTCCAGACAAGAAAATCATAATGTTGCAGCACCTTCAGAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGTCAGGTATCTACTTCTCAAAATAACATTGATTTACTTTCTAACCCAGCTACTGAGATATTGTCTGATACTTTGAGTTCTAATTCACCCTCACAACCTACTCATCCTATGCAAACAAGTCCAAAAGTGGAATTTTTAAGCCTAAGAAATACAGTTATGTTGCTACTCACAGTCAAATTTCAGTGAACAAAGAACCCGTGAATGTTGCAGAAGCTTTAAGATCAGAACACTGGAGAGAAGCTATGGAATCTGAGATGGATGCACTGATAAAAAACAAAACTTGGACTCTAATACCACCTTCTCCAAAGTATAACTTGGTTGGGTGCAAATGGGTGTTCAAGGTTAAAGAAAATTCAGATGGAAGTGTTTGTGGTATAAGGCTAGACTTGTTGCAAAGGGGTTTCATCAAATCCAGGTGTTGATTTTAAAGAGACGTTTAGTCCTGTTGTGAAGTCATCAACTATAAGAATCATTTTAACTATCGCAGTGAAAAATGATTGGGTGATTAAACAGTTGGACATCAATAATGCTTTTCTTAATGGCTTCTTGGAAGAAGATGTGTATATGGTACAACCTGAAGGATTTGAAGACCAACAACAACCATATCATGTATGCAAACTTTATAAAGCATTGTATGGTCTTAAACAAGCACCAAGGACTTGGTTTGATCGATTGAAGGCTGTCTTACTACTTGGGGCTTCAACAACTCAAAGGCAGATAATTCTCTGTTCTATCTCATTAAAGTTAGAGTTCAAATATTTATTCTTATCTATGTTGATGATATTCTTGTGACCGGAAATGACAGTAAATTGATTACTCAGTTTGTGAAAGACTTAAATCAACCGTTTGCTTTGAAGGACCTGGGTGATCTTTCTTATTTTCTAGGTATTAAAGTATGGAGAGATCAGTACGGAATCCATTTAAGTCAAGAAAAATATGTCTTGGATCTACTCGCCAGACTTGGTATGGCCAATATTAAATCTTGTCCAACTCTAGCTGTCACAAGCAAGCAATTCTCAGCTACTGAAGGCGTTTTGATGGCAAATTCCACACTTTATAGAAGTGCTATAGGAGGCCTCCAATACTTAACTCACACTAGACCAGATATCTCATATATTGTTAACAAACTAAGTCAATATATGCAGCAACCTACTATGATGCATTGGCAAGGAGTCAAACGGGTTTTAAGGTATCTCAAAGGAAGCTTGAGTCATGGTCTTTTTATTCCTAAATCAGTAAGTCTATTCTTATTTGCCTATACAGATGCTGATTGGGCTTGTAGTGTTGATGATAGAAAATCTATTGCTGCTCATTGTGTCTTTCTTGGACACTCACTAATTTCTTGGTCTTCTAAGAAACAACATGTTGTTGCTAGATCAAGCACTGAATCTAATATCGGTCTTTAGCTCACACTGCAGCAGAAATTTGTTCGATACAATCTCTATTGAATGAAATTCAGCATTGCCCCATGTCCACTCATGTTATATGGTGTGATAATATGAGTGCACTATCACTTGCAGCAAATCCTGTTTTTCATTCGATGACCAAACACGTTGAATTGGATCTCCACTTTGTTCGGGACAAGGTTGTTAAAAAAGAGCTGGATGTTCGCTATGTTCCTTCGGATGAACAAGTAGCCGATGGTTTAACCAAGGCTTTGTTAGAAAACAAATTTTGTATCAGTCGAGGCAAACTCAATGTGCTGCCAGCACCCTCTCGTTTGAGGGGGGATGTTAGAAGATGTGTTGTGACGTCGTTTCATTAAGAGGCTATTTTTGGCATAGTAGTTTGGCTGCTGTTATTTTTCAGTTTTTCTCCTTATTTTCAGTTTTGAGATATCGTGGGTAACTGTTCTATGAACCATGATAGTTCTCTTCCTTCTGTATTTATATTCATCTTCATTGTTGAATGGGATAAGGAAGTTTTCTGAAATTTTATCTTCCGACAACAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTGTCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG

mRNA sequence

ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCCCCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAACCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTACTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAGTAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAATTAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCATGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGTACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAGGAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGATGAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGATCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACACTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGATAATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTTACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGCACCTTCAGAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTGTCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG

Coding sequence (CDS)

ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCCCCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAACCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTACTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAGTAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAATTAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCATGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGTACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAGGAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGATGAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGATCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACACTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGATAATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTTACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGCACCTTCAGAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTGTCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG

Protein sequence

MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQNQNQQRSMSSVQQCLASNNLSLSTSNSTFRTTILFVAKSRAYNWEPKSRPRFTNGWMIDRPRSGVLSDFIPINRDACPLCKLRKSICILRVR
Homology
BLAST of Moc02g14980 vs. NCBI nr
Match: XP_022157748.1 (uncharacterized protein LOC111024384 isoform X1 [Momordica charantia])

HSP 1 Score: 766.1 bits (1977), Expect = 2.1e-217
Identity = 385/386 (99.74%), Postives = 385/386 (99.74%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
           MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60

Query: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
           VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120

Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
           SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180

Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
           SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240

Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
           VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300

Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
           SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360

Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
           ADSGATDHVTSDLSNLNVKSDYNGKG
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 386

BLAST of Moc02g14980 vs. NCBI nr
Match: XP_022157750.1 (uncharacterized protein LOC111024384 isoform X2 [Momordica charantia])

HSP 1 Score: 764.6 bits (1973), Expect = 6.1e-217
Identity = 384/386 (99.48%), Postives = 385/386 (99.74%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
           MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60

Query: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
           VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120

Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
           SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180

Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
           SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240

Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
           VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300

Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
           SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360

Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
           ADSGATDHVTSDLSNLNVKSDYNG+G
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGQG 386

BLAST of Moc02g14980 vs. NCBI nr
Match: TXG55646.1 (hypothetical protein EZV62_020902 [Acer yangbiense])

HSP 1 Score: 366.7 bits (940), Expect = 3.7e-97
Identity = 203/490 (41.43%), Postives = 303/490 (61.84%), Query Frame = 0

Query: 2   TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVL 61
           TT + +++  P    T         S   N S  FG+ L     +KLD +N+ LW+ MV 
Sbjct: 3   TTSQQQSTLAPSSSSTETPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVT 62

Query: 62  AVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NPEYVEWQAVDQALLGWLFGSM 121
            +++G + DG++  T   PP+FL SP T G SD     NPEY +W   DQ L+GWL+ SM
Sbjct: 63  TIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVSDSGSCSNPEYEKWLVNDQLLMGWLYSSM 122

Query: 122 TPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQA 181
           T ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  
Sbjct: 123 TENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTW 182

Query: 182 SESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRL 241
           ++SL +AG+P   N L + +L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +
Sbjct: 183 ADSLAIAGDPYPENLLFANILAGLDSEYMPIVVLIEAREHFTWQEIYDTLLSYDSKLEHI 242

Query: 242 NIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG 301
           N VS A    +S  SA+   +K N+  N     +Q    QG     +        GR RG
Sbjct: 243 NNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGRFRG 302

Query: 302 RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVA 361
           R    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V 
Sbjct: 303 RGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSPSVFVATPETVD 362

Query: 362 EPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGN 421
           + +W ADSGAT+HVT+D  NL++KS+Y G  +L VGNG +L+ISH+G   L +  +T  +
Sbjct: 363 DTTWYADSGATNHVTNDAGNLDLKSNYRGDESLMVGNGKQLDISHVGLKSLPS--LTKHS 422

Query: 422 LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQV 481
           + L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T+  VL G LK+ LYQ+
Sbjct: 423 IILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTRMEVLRGRLKNGLYQL 482

Query: 482 KLPLQTSNQN 486
           ++P   S  N
Sbjct: 483 EIPTTKSAFN 487

BLAST of Moc02g14980 vs. NCBI nr
Match: TXG67243.1 (hypothetical protein EZV62_008518 [Acer yangbiense])

HSP 1 Score: 360.1 bits (923), Expect = 3.4e-95
Identity = 204/501 (40.72%), Postives = 305/501 (60.88%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRG 60
           M+T   + S++ P   ++ A PT     + S   ++ FG+ L     +KLD +N+ LW+ 
Sbjct: 1   MSTTSQQQSTLAPS-SSSTATPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKT 60

Query: 61  MVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ---------VNPEYVEWQAVD 120
           MV  +++G + DG++  T   PP+FL SP T G                NPEY +W   D
Sbjct: 61  MVTTIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVPSPTTPGVSDSGSCSNPEYEKWLVND 120

Query: 121 QALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLK 180
           Q L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  
Sbjct: 121 QLLMGWLYSSMTENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSST 180

Query: 181 MSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFAT 240
           M EYL  MK  ++SL +AG+P   N L +  L+GL++EY+PIV  IE ++  +WQE++ T
Sbjct: 181 MEEYLTQMKTWADSLAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQEIYDT 240

Query: 241 LVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND 300
           L+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +  
Sbjct: 241 LLSYDSKLEHINNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRG 300

Query: 301 AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRN 360
                 GR RGR    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  
Sbjct: 301 GFRGGGGRFRGRGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSP 360

Query: 361 SAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHT 420
           S ++A PE V + +W ADSGATDHVT+D  NL++KSDY G  +L VGNG +L+ISH+G  
Sbjct: 361 SVFVATPETVDDTTWYADSGATDHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLK 420

Query: 421 CLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVL 480
            L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T   VL
Sbjct: 421 SLPS--LTKHSIILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTGMEVL 480

Query: 481 HGVLKDELYQVKLPLQTSNQN 486
            G LK+ LYQ+++P   S  N
Sbjct: 481 RGRLKNGLYQLEIPTTKSAFN 495

BLAST of Moc02g14980 vs. NCBI nr
Match: TXG69253.1 (hypothetical protein EZV62_004188 [Acer yangbiense])

HSP 1 Score: 358.2 bits (918), Expect = 1.3e-94
Identity = 203/501 (40.52%), Postives = 305/501 (60.88%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRG 60
           M+T   + S++ P   ++ A PT     + S   ++ FG+ L     +KLD +N+ LW+ 
Sbjct: 1   MSTTSQQQSTLAPS-SSSTATPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKT 60

Query: 61  MVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ---------VNPEYVEWQAVD 120
           MV  +++G + DG++  T   PP+FL SP T G                NPEY +W   D
Sbjct: 61  MVTTIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVPSPTTPGVSDSGSCSNPEYEKWLVND 120

Query: 121 QALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLK 180
           Q L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  
Sbjct: 121 QLLMGWLYSSMTENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSST 180

Query: 181 MSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFAT 240
           M EYL  MK  ++SL +AG+P   N L +  L+GL++EY+PIV  IE ++  +WQE++ T
Sbjct: 181 MEEYLTQMKTWADSLAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQEIYDT 240

Query: 241 LVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND 300
           L+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +  
Sbjct: 241 LLSYDSKLEHINNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRG 300

Query: 301 AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRN 360
                 GR RGR    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  
Sbjct: 301 GFRGGGGRFRGRGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSP 360

Query: 361 SAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHT 420
           S ++A PE V + +W ADSGAT+HVT+D  NL++KSDY G  +L VGNG +L+ISH+G  
Sbjct: 361 SVFVATPETVDDTTWYADSGATNHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLK 420

Query: 421 CLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVL 480
            L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T   VL
Sbjct: 421 SLPS--LTKHSIILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTGMEVL 480

Query: 481 HGVLKDELYQVKLPLQTSNQN 486
            G LK+ LYQ+++P   S  N
Sbjct: 481 RGRLKNGLYQLEIPTTKSAFN 495

BLAST of Moc02g14980 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 182.2 bits (461), Expect = 1.7e-44
Identity = 139/445 (31.24%), Postives = 216/445 (48.54%), Query Frame = 0

Query: 45  KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQ 104
           KL   NY +W   V A+  G +  G++ G+   P      P T GT    +VNP+Y  W+
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMP------PATIGTDAAPRVNPDYTRWK 84

Query: 105 AVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKN 164
             D+ +   + G+++ S+   V    ++ ++W+ L  +Y   S   + QLR  L+   K 
Sbjct: 85  RQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKG 144

Query: 165 SLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQE 224
           +  + +Y+  +    + L L G+P+  +  +  VL  L  EY P++ QI  KD+  +  E
Sbjct: 145 TKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTE 204

Query: 225 LFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY 284
           +   L+  E+ ++    VS+AT   I   +AN V  +  +  N       +  G     Y
Sbjct: 205 IHERLLNHESKIL---AVSSATVIPI---TANAVSHRNTTTTN------NNNNGNRNNRY 264

Query: 285 NSNDAKNNVR--GRGRGRFSPYRGNNSKP---SCQLCGKYGHIAAVCYKRFDENFNNLSS 344
           ++ +  NN +   +    F P   N SKP    CQ+CG  GH A    KR  +  + LSS
Sbjct: 265 DNRNNNNNSKPWQQSSTNFHP-NNNQSKPYLGKCQICGVQGHSA----KRCSQLQHFLSS 324

Query: 345 SNNN---------RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLT 404
            N+          +  A +A+    +  +WL DSGAT H+TSD +NL++   Y G   + 
Sbjct: 325 VNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVM 384

Query: 405 VGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHP 464
           V +G+ + ISH G T L TK   S  L L NIL+VP I +NL+S+ +L   N   VEF P
Sbjct: 385 VADGSTIPISHTGSTSLSTK---SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFP 443

Query: 465 TCCFVKDKETKKVVLHGVLKDELYQ 475
               VKD  T   +L G  KDELY+
Sbjct: 445 ASFQVKDLNTGVPLLQGKTKDELYE 443

BLAST of Moc02g14980 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 164.5 bits (415), Expect = 3.6e-39
Identity = 125/441 (28.34%), Postives = 204/441 (46.26%), Query Frame = 0

Query: 45  KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQ 104
           KL   NY +W   V A+  G +  G++ G+   P      P T GT    +VNP+Y  W+
Sbjct: 25  KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMP------PATIGTDAVPRVNPDYTRWR 84

Query: 105 AVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKN 164
             D+ +   + G+++ S+   V    ++ ++W+ L  +Y   S   + QLR + +     
Sbjct: 85  RQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRFITR----- 144

Query: 165 SLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQE 224
                          + L L G+P+  +  +  VL  L  +Y P++ QI  KD+  S  E
Sbjct: 145 --------------FDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTE 204

Query: 225 LFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY 284
           +   L+  E+ L+ LN             +AN V  +     N   +++Q+ +G  R   
Sbjct: 205 IHERLINRESKLLALNSAEVVPI------TANVVTHR-----NTNTNRNQNNRGDNRNYN 264

Query: 285 NSNDAKNNVRGRGRGRFSPYRGNNSKPS-----CQLCGKYGHIAAVC--YKRFDENFNNL 344
           N+N+  N+ +    G     R +N +P      CQ+C   GH A  C    +F    N  
Sbjct: 265 NNNNRSNSWQPSSSGS----RSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQ 324

Query: 345 SSSNNN---RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNG 404
            S++     +  A +A+       +WL DSGAT H+TSD +NL+    Y G   + + +G
Sbjct: 325 QSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADG 384

Query: 405 NRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCF 464
           + + I+H G   L   P +S +L L+ +L+VP I +NL+S+ +L   N   VEF P    
Sbjct: 385 STIPITHTGSASL---PTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQ 422

Query: 465 VKDKETKKVVLHGVLKDELYQ 475
           VKD  T   +L G  KDELY+
Sbjct: 445 VKDLNTGVPLLQGKTKDELYE 422

BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match: A0A6J1DU77 (uncharacterized protein LOC111024384 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111024384 PE=4 SV=1)

HSP 1 Score: 766.1 bits (1977), Expect = 1.0e-217
Identity = 385/386 (99.74%), Postives = 385/386 (99.74%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
           MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60

Query: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
           VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120

Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
           SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180

Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
           SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240

Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
           VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300

Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
           SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360

Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
           ADSGATDHVTSDLSNLNVKSDYNGKG
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 386

BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match: A0A6J1DTZ7 (uncharacterized protein LOC111024384 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111024384 PE=4 SV=1)

HSP 1 Score: 764.6 bits (1973), Expect = 2.9e-217
Identity = 384/386 (99.48%), Postives = 385/386 (99.74%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
           MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1   MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60

Query: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
           VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120

Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
           SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180

Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
           SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240

Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
           VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300

Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
           SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360

Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
           ADSGATDHVTSDLSNLNVKSDYNG+G
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGQG 386

BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match: A0A5C7HHE9 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_020902 PE=4 SV=1)

HSP 1 Score: 366.7 bits (940), Expect = 1.8e-97
Identity = 203/490 (41.43%), Postives = 303/490 (61.84%), Query Frame = 0

Query: 2   TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVL 61
           TT + +++  P    T         S   N S  FG+ L     +KLD +N+ LW+ MV 
Sbjct: 3   TTSQQQSTLAPSSSSTETPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVT 62

Query: 62  AVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NPEYVEWQAVDQALLGWLFGSM 121
            +++G + DG++  T   PP+FL SP T G SD     NPEY +W   DQ L+GWL+ SM
Sbjct: 63  TIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVSDSGSCSNPEYEKWLVNDQLLMGWLYSSM 122

Query: 122 TPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQA 181
           T ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  M EYL  MK  
Sbjct: 123 TENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTW 182

Query: 182 SESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRL 241
           ++SL +AG+P   N L + +L+GL++EY+PIV  IE ++  +WQE++ TL+++++ L  +
Sbjct: 183 ADSLAIAGDPYPENLLFANILAGLDSEYMPIVVLIEAREHFTWQEIYDTLLSYDSKLEHI 242

Query: 242 NIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG 301
           N VS A    +S  SA+   +K N+  N     +Q    QG     +        GR RG
Sbjct: 243 NNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGRFRG 302

Query: 302 RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVA 361
           R    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  S ++A PE V 
Sbjct: 303 RGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSPSVFVATPETVD 362

Query: 362 EPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGN 421
           + +W ADSGAT+HVT+D  NL++KS+Y G  +L VGNG +L+ISH+G   L +  +T  +
Sbjct: 363 DTTWYADSGATNHVTNDAGNLDLKSNYRGDESLMVGNGKQLDISHVGLKSLPS--LTKHS 422

Query: 422 LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQV 481
           + L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T+  VL G LK+ LYQ+
Sbjct: 423 IILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTRMEVLRGRLKNGLYQL 482

Query: 482 KLPLQTSNQN 486
           ++P   S  N
Sbjct: 483 EIPTTKSAFN 487

BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match: A0A803PEH4 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 360.5 bits (924), Expect = 1.3e-95
Identity = 217/514 (42.22%), Postives = 307/514 (59.73%), Query Frame = 0

Query: 2   TTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHP-LGTVLTVKLDDKNYSLWRGMVLA 61
           T     NSSV     +N      N + Q   +F  P L    ++KLD  NY+LW+ MV  
Sbjct: 12  TASSPTNSSVAGASSSN----NTNQASQLPNAFAPPTLNQPFSLKLDRNNYTLWKTMVST 71

Query: 62  VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 121
           ++RG +  GY+ GTL  PP+F++  +T+ T      NPEY  W   DQ L+GWL+ SMT 
Sbjct: 72  IVRGHRLHGYLSGTLMCPPEFVMVGDTQVT------NPEYENWIITDQLLMGWLYSSMTE 131

Query: 122 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 181
            IA +V+   S+  + + LE LYGA SK++++  R ++Q T+K S  MSEYL   K  S 
Sbjct: 132 GIATEVMGSHSAANLQRNLESLYGAYSKSKMDDTRTLIQTTRKGSTLMSEYLRQKKNWSN 191

Query: 182 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 241
            L LAG+P    +L++ VL GL+AEYL IV QIE + +T+WQEL   L++F++ + RL  
Sbjct: 192 MLALAGDPYPEAHLVANVLFGLDAEYLSIVVQIEARSNTTWQELQDLLLSFDSKIERLQN 251

Query: 242 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY-NSNDAKNNVRGRGRGR 301
           ++  + +  S      + +K N+ G  +  QSQ+      G + NS    N  RGRGRG 
Sbjct: 252 LTLNSNKATSSSPQANMAAKTNNNGRGRGFQSQNASTNSGGLFSNSRGTSNRFRGRGRG- 311

Query: 302 FSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENF----------NNLSSSNNNRNSAYMA 361
                G+ S+P+CQ+ GKYGH AAVCY RFDE++           N +   NN +SA++A
Sbjct: 312 ----PGSGSRPTCQVYGKYGHTAAVCYNRFDESYMGSDPNNPHNQNKAGQTNNNHSAFVA 371

Query: 362 IPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTK 421
            PE++   +W ADSGA++H+TSD +NL  K DYNGK ++ VGNG++L I+HIG+  L   
Sbjct: 372 TPEVLEFDAWFADSGASNHITSDPANLTQKQDYNGKESVVVGNGSKLRITHIGNGKLN-- 431

Query: 422 PITSGN-LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVL 481
            I SGN L L ++L VPKI +NL+S++KL  DNN  +EF+   C VKDK TKKV+LHGVL
Sbjct: 432 -IESGNYLLLKDMLLVPKIAKNLVSVSKLATDNNVLIEFYSNFCLVKDKVTKKVLLHGVL 491

Query: 482 KDELYQVKLPLQTSNQNQNQQRSMSSVQQCLASN 503
           KDELYQ+  P   S+    Q   +S+    + SN
Sbjct: 492 KDELYQLDSPFTKSSHPYQQSNFLSAFTISVDSN 507

BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match: A0A5C7ID32 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008518 PE=4 SV=1)

HSP 1 Score: 360.1 bits (923), Expect = 1.7e-95
Identity = 204/501 (40.72%), Postives = 305/501 (60.88%), Query Frame = 0

Query: 1   MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRG 60
           M+T   + S++ P   ++ A PT     + S   ++ FG+ L     +KLD +N+ LW+ 
Sbjct: 1   MSTTSQQQSTLAPS-SSSTATPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKT 60

Query: 61  MVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ---------VNPEYVEWQAVD 120
           MV  +++G + DG++  T   PP+FL SP T G                NPEY +W   D
Sbjct: 61  MVTTIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVPSPTTPGVSDSGSCSNPEYEKWLVND 120

Query: 121 QALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLK 180
           Q L+GWL+ SMT ++A  V+   ++  +WKALE+L+GA SK++ N +R  +Q T+K S  
Sbjct: 121 QLLMGWLYSSMTENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSST 180

Query: 181 MSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFAT 240
           M EYL  MK  ++SL +AG+P   N L +  L+GL++EY+PIV  IE ++  +WQE++ T
Sbjct: 181 MEEYLTQMKTWADSLAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQEIYDT 240

Query: 241 LVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND 300
           L+++++ L  +N VS A    +S  SA+   +K N+  N     +Q    QG     +  
Sbjct: 241 LLSYDSKLEHINNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRG 300

Query: 301 AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRN 360
                 GR RGR    R NNS+P+CQ+CGK+GH A+VCY R+D+N+      ++SN N  
Sbjct: 301 GFRGGGGRFRGRGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSP 360

Query: 361 SAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHT 420
           S ++A PE V + +W ADSGATDHVT+D  NL++KSDY G  +L VGNG +L+ISH+G  
Sbjct: 361 SVFVATPETVDDTTWYADSGATDHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLK 420

Query: 421 CLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVL 480
            L +  +T  ++ L  +LHVP+I++NLLS+++L  DN+ F+EFH  CCFVKDK T   VL
Sbjct: 421 SLPS--LTKHSIILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTGMEVL 480

Query: 481 HGVLKDELYQVKLPLQTSNQN 486
            G LK+ LYQ+++P   S  N
Sbjct: 481 RGRLKNGLYQLEIPTTKSAFN 495

BLAST of Moc02g14980 vs. TAIR 10
Match: AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 60.5 bits (145), Expect = 5.3e-09
Identity = 64/269 (23.79%), Postives = 120/269 (44.61%), Query Frame = 0

Query: 42  LTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYV 101
           +T+ L+  NY +WR +   +       G++ G+         +P TE             
Sbjct: 24  VTLDLNKLNYDVWRELFETLCLSFGVLGHIDGSSTP------TPMTE------------K 83

Query: 102 EWQAVDQALLGWLFGSMTPSIACDVVDFR-SSREVWKALEDLYGATSKARINQLRNVLQN 161
            W+  D  +  W++G++T S+   ++    ++R++W +LE+L+    +AR  Q  N L+ 
Sbjct: 84  RWKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRT 143

Query: 162 TKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-T 221
           T  + L + EY   +K  S+ L     P++   L+  +L+GL  +Y  I+  I+ K    
Sbjct: 144 TTIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFP 203

Query: 222 SWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQG 281
           S+ E  + L+  E+ L   +  S +     S  +  +   +Q     +++H + S  G+G
Sbjct: 204 SFTEARSMLLMEESRLSNKSKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGRG 263

Query: 282 RGSYNSNDAKNNVRGRGRGRFSPYRGNNS 309
           R     +  KN   G   GR   Y  NN+
Sbjct: 264 R-----SKKKNRGGGSSDGR---YNNNNN 266

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022157748.12.1e-21799.74uncharacterized protein LOC111024384 isoform X1 [Momordica charantia][more]
XP_022157750.16.1e-21799.48uncharacterized protein LOC111024384 isoform X2 [Momordica charantia][more]
TXG55646.13.7e-9741.43hypothetical protein EZV62_020902 [Acer yangbiense][more]
TXG67243.13.4e-9540.72hypothetical protein EZV62_008518 [Acer yangbiense][more]
TXG69253.11.3e-9440.52hypothetical protein EZV62_004188 [Acer yangbiense][more]
Match NameE-valueIdentityDescription
Q94HW21.7e-4431.24Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT943.6e-3928.34Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Match NameE-valueIdentityDescription
A0A6J1DU771.0e-21799.74uncharacterized protein LOC111024384 isoform X1 OS=Momordica charantia OX=3673 G... [more]
A0A6J1DTZ72.9e-21799.48uncharacterized protein LOC111024384 isoform X2 OS=Momordica charantia OX=3673 G... [more]
A0A5C7HHE91.8e-9741.43Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_020902 PE=4 SV=1[more]
A0A803PEH41.3e-9542.22Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A5C7ID321.7e-9540.72Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008518 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G48050.15.3e-0923.79CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE... [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (OHB3-1) v2
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 103..233
e-value: 2.4E-12
score: 46.8
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 265..305
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 265..291
NoneNo IPR availablePANTHERPTHR34222:SF48SUBFAMILY NOT NAMEDcoord: 48..406
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 48..406

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Moc02g14980.1Moc02g14980.1mRNA