Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCCCCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAACCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTACTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAGTAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAATTAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCATGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGTACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAGGAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGATGAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGATCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACACTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGATAATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTTACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGGTTCAATTCTGCTTTTGTTACACAATCTTATGTTTCTTCTTCCAAAGTGTCGTTGGATGTTTGGCACCAAAGAATAGGACACACATCCAACAAAGTCCTTGCTTCTGTTTTAAGTGTTTGTAATGTAAAATCGACTGGAAATGAAAAACCAAGCTTTTATGATTCGTGTCAATATGGAAAGTCTCATGCTTTGCCTTTCAAACTTTCTACTTCTTGTACCAATAGACCTCTTGAACTGATTCATTGTGACCTTTGGGGACCTTCTCCTGTTGTCTCAACAGCTAGTTTTTGGTTTTATATAAGCTTCGTTACAATTTTTCAAGGTTCACTTACATTTTTCCTCTTAAACATAAAGGAGAAGCCCTTTCAATCTTTATCCAATACAAAAATCTTGTAGAAAATAAATTTGACCTTAAAATCAAGTGTTTACAAAGTGATTGGGGTGGAGAGTTTAGACCATTTGTCACT
TATCTAAAGCAACACGGCATTGAGTTTAGACACTCTTGTCCTCATACTAGTGAACAAAATGACATAGTAGAAAGAAAACAGAGACACATAGTGGAAATGAGTCTTACTCTACTTGCTCAAGCCTCAATGCCTTTACGATTTTGGTGGGATGCCTTTTTGTGTGCTGTTTACTTGATAAATAGACTTCCCACTTCCGTCTTACAAAATATGTCACCTTGGGAGAAACTTTCTAACAGAAAACCAGATTATTCTTTCCTAAAAGTGTTTGGTAGTACTTGTTTCCCTTGCTTGAGGTCTTATAAAAAACATAAGTTCCAATTTCATAGTACTAAATGTATATTTTTGGGATATAGTGATCAACATAAAGGATATAAATGTCTCAGCTCAAATGGAAGAATCTATATCTCTAGACACGTCATATTCAATGAGAATGAATTTCCTTTCAAGGCTGGTTTTCTCACAACGTCTCTTTCTAAACAACAATCAAATGAACTTGTCATTACCTATCTTAACTTTCCTGGTACTTCTCCCTTGAACTTGCCAAACACTAGTCCTACCTCCACAGATGTAAGACAGAATATTCTTTCCAGACAAGAAAATCATAATGTTGCAGCACCTTCAGAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGTCAGGTATCTACTTCTCAAAATAACATTGATTTACTTTCTAACCCAGCTACTGAGATATTGTCTGATACTTTGAGTTCTAATTCACCCTCACAACCTACTCATCCTATGCAAACAAGTCCAAAAGTGGAATTTTTAAGCCTAAGAAATACAGTTATGTTGCTACTCACAGTCAAATTTCAGTGAACAAAGAACCCGTGAATGTTGCAGAAGCTTTAAGATCAGAACACTGGAGAGAAGCTATGGAATCTGAGATGGATGCACTGATAAAAAACAAAACTTGGACTCTAATACCACCTTCTCCAAAGTATAACTTGGTTGGGTGCAAATGGGTGTTCAAGGTTAAAGAAAATTCAGATGGAAGTGTTTGTGGTATAAGGCTAGACTTGTTGCAAAGGGGTTTCATCAAATCCAGGTGTTGATTTTAAAGAGACGTTTAGTCCTGTTGTGAAGTCATCAACTATAAGAATCATTTTAACTATCGCAGTGAAAAATGATTGGGTGATTAAACAGTTGGACATCAATAATGCTTTTCTTAATGGCTTCTTGGAAGAAGATGTGTATATGGTACAACCTGAAGGATTTGAAGACCAACAACAACCATATCATGTATGCAAACTTTATAAAGCATTGTATGGTCTTAAACAAGCACCAAGGACTTGGTTTGATCGATTGAAGGCTGTCTTACTACTTGGGGCTTCAACAACTCAAAGGCAGATAATTCTCTGTTCTATCTCATTAAAGTTAGAGTTCAAATATTTATTCTTATCTATGTTGATGATATTCTTGTGACCGGAAATGACAGTAAATTGATTACTCAGTTTGTGAAAGACTTAAATCAACCGTTTGCTTTGAAGGACCTGGGTGATCTTTCTTATTTTCTAGGTATTAAAGTATGGAGAGATCAGTACGGAATCCATTTAAGTCAAGAAAAATATGTCTTGGATCTACTCGCCAGACTTGGTATGGCCAATATTAAATCTTGTCCAACTCTAGCTGTCACAAGCAAGCAATTCTCAGCTACTGAAGGCGTTTTGATGGCAAATTCCACACTTTATAGAAGTGCTATAGGAGGCCTCCAATACTTAACTCACACTAGACCAGATATCTCATATATTGTTAACAAACTAAGTCAATATATGCAGCAACCTACTATGATGCATTGGCAAGGAGTCAAACGGGTTTTAAGGTATCTCAAAGGAAGCTTGAGTCATGGTCTTTTTATTCCTAAATCAGTAAGTCTATTCTTATTTGCCTATACAGATGCTGATTGGGCTTGTAGTGTTGATGATAGAAAATCTATTGCTGCTCATTGTGTCTTT
CTTGGACACTCACTAATTTCTTGGTCTTCTAAGAAACAACATGTTGTTGCTAGATCAAGCACTGAATCTAATATCGGTCTTTAGCTCACACTGCAGCAGAAATTTGTTCGATACAATCTCTATTGAATGAAATTCAGCATTGCCCCATGTCCACTCATGTTATATGGTGTGATAATATGAGTGCACTATCACTTGCAGCAAATCCTGTTTTTCATTCGATGACCAAACACGTTGAATTGGATCTCCACTTTGTTCGGGACAAGGTTGTTAAAAAAGAGCTGGATGTTCGCTATGTTCCTTCGGATGAACAAGTAGCCGATGGTTTAACCAAGGCTTTGTTAGAAAACAAATTTTGTATCAGTCGAGGCAAACTCAATGTGCTGCCAGCACCCTCTCGTTTGAGGGGGGATGTTAGAAGATGTGTTGTGACGTCGTTTCATTAAGAGGCTATTTTTGGCATAGTAGTTTGGCTGCTGTTATTTTTCAGTTTTTCTCCTTATTTTCAGTTTTGAGATATCGTGGGTAACTGTTCTATGAACCATGATAGTTCTCTTCCTTCTGTATTTATATTCATCTTCATTGTTGAATGGGATAAGGAAGTTTTCTGAAATTTTATCTTCCGACAACAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTGTCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG
mRNA sequence
ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCCCCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAACCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTACTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAGTAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAATTAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCATGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGTACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAGGAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGATGAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGATCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACACTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGATAATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTTACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGCACCTTCAGAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTGTCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG
Coding sequence (CDS)
ATGACGACCGAAGAAACCGAAAATTCATCTGTTCCTCCACAAGTGGTCACAAATGTGGCTGTCCCAACACCAAATCCTTCACCACAATTTAATACCTCCTTTGGTCATCCCCTGGGCACTGTTTTAACAGTAAAGTTGGATGACAAAAATTATTCTCTTTGGAGAGGAATGGTGCTCGCTGTCTTAAGGGGTCAAAAATTTGATGGGTATGTGCTGGGAACCTTGGCCAAACCACCACAGTTTCTTGTCTCACCAGAAACTGAAGGAACTTCAGACCATCTTCAAGTGAATCCTGAATATGTGGAGTGGCAAGCAGTTGATCAAGCTCTACTTGGTTGGCTTTTTGGATCAATGACTCCTTCTATTGCCTGCGATGTCGTTGACTTCAGAAGTTCAAGAGAAGTATGGAAAGCTCTTGAGGATCTCTATGGAGCAACAAGTAAGGCACGCATAAATCAGTTGCGGAATGTTCTTCAAAATACCAAGAAAAACTCTCTGAAGATGTCAGAATATCTTGGACTTATGAAACAAGCCTCTGAAAGTCTCAAATTAGCAGGTGAGCCTGTTGCTTTTAATTATTTAATGTCTTGTGTACTCTCAGGTTTAGAGGCAGAATATCTTCCAATTGTCTGTCAAATTGAAGGGAAAGATTCAACTTCATGGCAAGAGTTGTTTGCTACACTAGTGACGTTTGAAAACACTTTAATGAGGCTAAATATTGTTTCTACCGCTACTGCTGAGGGCATCTCTGATGGGAGTGCTAATTATGTACATTCAAAGCAAAATTCAGTTGGGAATAGACAGTTCCATCAGTCTCAATCAGGACAAGGACAAGGAAGAGGCAGTTACAACTCAAATGATGCTAAAAACAACGTGAGAGGAAGAGGTCGTGGCAGATTCAGTCCTTATAGAGGAAATAACTCTAAACCAAGTTGTCAACTATGTGGCAAATATGGGCATATAGCAGCTGTTTGTTACAAAAGGTTTGATGAAAACTTCAATAATTTGTCTAGCTCCAACAACAACCGTAATTCTGCATATATGGCTATCCCAGAGATTGTTGCTGAACCTAGTTGGTTAGCAGATAGTGGGGCTACAGATCATGTCACTTCAGACCTCTCAAACTTGAATGTTAAGTCTGATTACAATGGTAAAGGTACATTAACTGTTGGTAATGGTAATAGGCTAGAAATTTCACATATTGGGCACACTTGTTTGCAAACCAAACCTATTACTTCTGGCAATTTACAACTCAGCAATATACTTCATGTTCCAAAAATTAAAAGAAACCTCTTGAGTATTGCCAAACTCACTGCTGATAATAATTGTTTTGTTGAATTTCATCCGACTTGTTGTTTTGTGAAGGACAAGGAAACAAAGAAGGTGGTGCTGCACGGAGTTCTCAAAGATGAACTATACCAAGTCAAGTTACCTCTCCAAACCAGCAATCAAAATCAAAACCAGCAGCGTTCAATGTCTTCTGTTCAACAATGTTTAGCTAGCAACAATCTGTCTTTGTCTACTAGCAATAGCACCTTCAGAACCACTATCCTCTTTGTTGCCAAGTCCAGAGCATACAATTGGGAGCCAAAGTCAAGGCCACGTTTTACGAATGGTTGGATGATCGATAGGCCAAGGTCGGGTGTCTTGTCAGATTTCATCCCTATAAATAGGGATGCATGCCCCTTGTGCAAGTTACGCAAATCCATTTGCATTCTGAGAGTTAGATAG
Protein sequence
MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQVKLPLQTSNQNQNQQRSMSSVQQCLASNNLSLSTSNSTFRTTILFVAKSRAYNWEPKSRPRFTNGWMIDRPRSGVLSDFIPINRDACPLCKLRKSICILRVR
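The relationship between the coding sequence and the protein sequence above can be checked with a short script. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed above.
print(translate("ATGACGACCGAAGAAACCGAA"))
```

Applied to the first 21 bases of the CDS, this yields `MTTEETE`, which agrees with the start of the protein sequence; passing the full CDS string should reproduce the whole protein up to the terminal stop codon.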
Homology
BLAST of Moc02g14980 vs. NCBI nr
Match:
XP_022157748.1 (uncharacterized protein LOC111024384 isoform X1 [Momordica charantia])
HSP 1 Score: 766.1 bits (1977), Expect = 2.1e-217
Identity = 385/386 (99.74%), Positives = 385/386 (99.74%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
Query: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
ADSGATDHVTSDLSNLNVKSDYNGKG
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 386
BLAST of Moc02g14980 vs. NCBI nr
Match:
XP_022157750.1 (uncharacterized protein LOC111024384 isoform X2 [Momordica charantia])
HSP 1 Score: 764.6 bits (1973), Expect = 6.1e-217
Identity = 384/386 (99.48%), Positives = 385/386 (99.74%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
Query: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
ADSGATDHVTSDLSNLNVKSDYNG+G
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGQG 386
BLAST of Moc02g14980 vs. NCBI nr
Match:
TXG55646.1 (hypothetical protein EZV62_020902 [Acer yangbiense])
HSP 1 Score: 366.7 bits (940), Expect = 3.7e-97
Identity = 203/490 (41.43%), Positives = 303/490 (61.84%), Query Frame = 0
Query: 2 TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVL 61
TT + +++ P T S N S FG+ L +KLD +N+ LW+ MV
Sbjct: 3 TTSQQQSTLAPSSSSTETPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVT 62
Query: 62 AVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NPEYVEWQAVDQALLGWLFGSM 121
+++G + DG++ T PP+FL SP T G SD NPEY +W DQ L+GWL+ SM
Sbjct: 63 TIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVSDSGSCSNPEYEKWLVNDQLLMGWLYSSM 122
Query: 122 TPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQA 181
T ++A V+ ++ +WKALE+L+GA SK++ N +R +Q T+K S M EYL MK
Sbjct: 123 TENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTW 182
Query: 182 SESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRL 241
++SL +AG+P N L + +L+GL++EY+PIV IE ++ +WQE++ TL+++++ L +
Sbjct: 183 ADSLAIAGDPYPENLLFANILAGLDSEYMPIVVLIEAREHFTWQEIYDTLLSYDSKLEHI 242
Query: 242 NIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG 301
N VS A +S SA+ +K N+ N +Q QG + GR RG
Sbjct: 243 NNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGRFRG 302
Query: 302 RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVA 361
R R NNS+P+CQ+CGK+GH A+VCY R+D+N+ ++SN N S ++A PE V
Sbjct: 303 RGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSPSVFVATPETVD 362
Query: 362 EPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGN 421
+ +W ADSGAT+HVT+D NL++KS+Y G +L VGNG +L+ISH+G L + +T +
Sbjct: 363 DTTWYADSGATNHVTNDAGNLDLKSNYRGDESLMVGNGKQLDISHVGLKSLPS--LTKHS 422
Query: 422 LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQV 481
+ L +LHVP+I++NLLS+++L DN+ F+EFH CCFVKDK T+ VL G LK+ LYQ+
Sbjct: 423 IILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTRMEVLRGRLKNGLYQL 482
Query: 482 KLPLQTSNQN 486
++P S N
Sbjct: 483 EIPTTKSAFN 487
BLAST of Moc02g14980 vs. NCBI nr
Match:
TXG67243.1 (hypothetical protein EZV62_008518 [Acer yangbiense])
HSP 1 Score: 360.1 bits (923), Expect = 3.4e-95
Identity = 204/501 (40.72%), Positives = 305/501 (60.88%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRG 60
M+T + S++ P ++ A PT + S ++ FG+ L +KLD +N+ LW+
Sbjct: 1 MSTTSQQQSTLAPS-SSSTATPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKT 60
Query: 61 MVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ---------VNPEYVEWQAVD 120
MV +++G + DG++ T PP+FL SP T G NPEY +W D
Sbjct: 61 MVTTIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVPSPTTPGVSDSGSCSNPEYEKWLVND 120
Query: 121 QALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLK 180
Q L+GWL+ SMT ++A V+ ++ +WKALE+L+GA SK++ N +R +Q T+K S
Sbjct: 121 QLLMGWLYSSMTENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSST 180
Query: 181 MSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFAT 240
M EYL MK ++SL +AG+P N L + L+GL++EY+PIV IE ++ +WQE++ T
Sbjct: 181 MEEYLTQMKTWADSLAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQEIYDT 240
Query: 241 LVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND 300
L+++++ L +N VS A +S SA+ +K N+ N +Q QG +
Sbjct: 241 LLSYDSKLEHINNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRG 300
Query: 301 AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRN 360
GR RGR R NNS+P+CQ+CGK+GH A+VCY R+D+N+ ++SN N
Sbjct: 301 GFRGGGGRFRGRGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSP 360
Query: 361 SAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHT 420
S ++A PE V + +W ADSGATDHVT+D NL++KSDY G +L VGNG +L+ISH+G
Sbjct: 361 SVFVATPETVDDTTWYADSGATDHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLK 420
Query: 421 CLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVL 480
L + +T ++ L +LHVP+I++NLLS+++L DN+ F+EFH CCFVKDK T VL
Sbjct: 421 SLPS--LTKHSIILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTGMEVL 480
Query: 481 HGVLKDELYQVKLPLQTSNQN 486
G LK+ LYQ+++P S N
Sbjct: 481 RGRLKNGLYQLEIPTTKSAFN 495
BLAST of Moc02g14980 vs. NCBI nr
Match:
TXG69253.1 (hypothetical protein EZV62_004188 [Acer yangbiense])
HSP 1 Score: 358.2 bits (918), Expect = 1.3e-94
Identity = 203/501 (40.52%), Positives = 305/501 (60.88%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRG 60
M+T + S++ P ++ A PT + S ++ FG+ L +KLD +N+ LW+
Sbjct: 1 MSTTSQQQSTLAPS-SSSTATPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKT 60
Query: 61 MVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ---------VNPEYVEWQAVD 120
MV +++G + DG++ T PP+FL SP T G NPEY +W D
Sbjct: 61 MVTTIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVPSPTTPGVSDSGSCSNPEYEKWLVND 120
Query: 121 QALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLK 180
Q L+GWL+ SMT ++A V+ ++ +WKALE+L+GA SK++ N +R +Q T+K S
Sbjct: 121 QLLMGWLYSSMTENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSST 180
Query: 181 MSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFAT 240
M EYL MK ++SL +AG+P N L + L+GL++EY+PIV IE ++ +WQE++ T
Sbjct: 181 MEEYLTQMKTWADSLAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQEIYDT 240
Query: 241 LVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND 300
L+++++ L +N VS A +S SA+ +K N+ N +Q QG +
Sbjct: 241 LLSYDSKLEHINNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRG 300
Query: 301 AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRN 360
GR RGR R NNS+P+CQ+CGK+GH A+VCY R+D+N+ ++SN N
Sbjct: 301 GFRGGGGRFRGRGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSP 360
Query: 361 SAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHT 420
S ++A PE V + +W ADSGAT+HVT+D NL++KSDY G +L VGNG +L+ISH+G
Sbjct: 361 SVFVATPETVDDTTWYADSGATNHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLK 420
Query: 421 CLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVL 480
L + +T ++ L +LHVP+I++NLLS+++L DN+ F+EFH CCFVKDK T VL
Sbjct: 421 SLPS--LTKHSIILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTGMEVL 480
Query: 481 HGVLKDELYQVKLPLQTSNQN 486
G LK+ LYQ+++P S N
Sbjct: 481 RGRLKNGLYQLEIPTTKSAFN 495
BLAST of Moc02g14980 vs. ExPASy Swiss-Prot
Match:
Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)
HSP 1 Score: 182.2 bits (461), Expect = 1.7e-44
Identity = 139/445 (31.24%), Positives = 216/445 (48.54%), Query Frame = 0
Query: 45 KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQ 104
KL NY +W V A+ G + G++ G+ P P T GT +VNP+Y W+
Sbjct: 25 KLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMP------PATIGTDAAPRVNPDYTRWK 84
Query: 105 AVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKN 164
D+ + + G+++ S+ V ++ ++W+ L +Y S + QLR L+ K
Sbjct: 85 RQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKG 144
Query: 165 SLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQE 224
+ + +Y+ + + L L G+P+ + + VL L EY P++ QI KD+ + E
Sbjct: 145 TKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTE 204
Query: 225 LFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY 284
+ L+ E+ ++ VS+AT I +AN V + + N + G Y
Sbjct: 205 IHERLLNHESKIL---AVSSATVIPI---TANAVSHRNTTTTN------NNNNGNRNNRY 264
Query: 285 NSNDAKNNVR--GRGRGRFSPYRGNNSKP---SCQLCGKYGHIAAVCYKRFDENFNNLSS 344
++ + NN + + F P N SKP CQ+CG GH A KR + + LSS
Sbjct: 265 DNRNNNNNSKPWQQSSTNFHP-NNNQSKPYLGKCQICGVQGHSA----KRCSQLQHFLSS 324
Query: 345 SNNN---------RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLT 404
N+ + A +A+ + +WL DSGAT H+TSD +NL++ Y G +
Sbjct: 325 VNSQQPPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVM 384
Query: 405 VGNGNRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHP 464
V +G+ + ISH G T L TK S L L NIL+VP I +NL+S+ +L N VEF P
Sbjct: 385 VADGSTIPISHTGSTSLSTK---SRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFP 443
Query: 465 TCCFVKDKETKKVVLHGVLKDELYQ 475
VKD T +L G KDELY+
Sbjct: 445 ASFQVKDLNTGVPLLQGKTKDELYE 443
BLAST of Moc02g14980 vs. ExPASy Swiss-Prot
Match:
Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)
HSP 1 Score: 164.5 bits (415), Expect = 3.6e-39
Identity = 125/441 (28.34%), Positives = 204/441 (46.26%), Query Frame = 0
Query: 45 KLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQ 104
KL NY +W V A+ G + G++ G+ P P T GT +VNP+Y W+
Sbjct: 25 KLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMP------PATIGTDAVPRVNPDYTRWR 84
Query: 105 AVDQALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKN 164
D+ + + G+++ S+ V ++ ++W+ L +Y S + QLR + +
Sbjct: 85 RQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRFITR----- 144
Query: 165 SLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-TSWQE 224
+ L L G+P+ + + VL L +Y P++ QI KD+ S E
Sbjct: 145 --------------FDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTE 204
Query: 225 LFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY 284
+ L+ E+ L+ LN +AN V + N +++Q+ +G R
Sbjct: 205 IHERLINRESKLLALNSAEVVPI------TANVVTHR-----NTNTNRNQNNRGDNRNYN 264
Query: 285 NSNDAKNNVRGRGRGRFSPYRGNNSKPS-----CQLCGKYGHIAAVC--YKRFDENFNNL 344
N+N+ N+ + G R +N +P CQ+C GH A C +F N
Sbjct: 265 NNNNRSNSWQPSSSGS----RSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQ 324
Query: 345 SSSNNN---RNSAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNG 404
S++ + A +A+ +WL DSGAT H+TSD +NL+ Y G + + +G
Sbjct: 325 QSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADG 384
Query: 405 NRLEISHIGHTCLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCF 464
+ + I+H G L P +S +L L+ +L+VP I +NL+S+ +L N VEF P
Sbjct: 385 STIPITHTGSASL---PTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQ 422
Query: 465 VKDKETKKVVLHGVLKDELYQ 475
VKD T +L G KDELY+
Sbjct: 445 VKDLNTGVPLLQGKTKDELYE 422
BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match:
A0A6J1DU77 (uncharacterized protein LOC111024384 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111024384 PE=4 SV=1)
HSP 1 Score: 766.1 bits (1977), Expect = 1.0e-217
Identity = 385/386 (99.74%), Positives = 385/386 (99.74%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
Query: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
ADSGATDHVTSDLSNLNVKSDYNGKG
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 386
BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match:
A0A6J1DTZ7 (uncharacterized protein LOC111024384 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111024384 PE=4 SV=1)
HSP 1 Score: 764.6 bits (1973), Expect = 2.9e-217
Identity = 384/386 (99.48%), Positives = 385/386 (99.74%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA
Sbjct: 1 MTTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRGMVLA 60
Query: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP
Sbjct: 61 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 120
Query: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE
Sbjct: 121 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 180
Query: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI
Sbjct: 181 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 240
Query: 241 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
VSTATAEGISDGS NYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF
Sbjct: 241 VSTATAEGISDGSXNYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRGRF 300
Query: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL
Sbjct: 301 SPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNNLSSSNNNRNSAYMAIPEIVAEPSWL 360
Query: 361 ADSGATDHVTSDLSNLNVKSDYNGKG 387
ADSGATDHVTSDLSNLNVKSDYNG+G
Sbjct: 361 ADSGATDHVTSDLSNLNVKSDYNGQG 386
BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match:
A0A5C7HHE9 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_020902 PE=4 SV=1)
HSP 1 Score: 366.7 bits (940), Expect = 1.8e-97
Identity = 203/490 (41.43%), Positives = 303/490 (61.84%), Query Frame = 0
Query: 2 TTEETENSSVPPQVVTNVAVPTPNPSPQFNTS--FGHPLGTVLTVKLDDKNYSLWRGMVL 61
TT + +++ P T S N S FG+ L +KLD +N+ LW+ MV
Sbjct: 3 TTSQQQSTLAPSSSSTETPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKTMVT 62
Query: 62 AVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQV-NPEYVEWQAVDQALLGWLFGSM 121
+++G + DG++ T PP+FL SP T G SD NPEY +W DQ L+GWL+ SM
Sbjct: 63 TIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVSDSGSCSNPEYEKWLVNDQLLMGWLYSSM 122
Query: 122 TPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQA 181
T ++A V+ ++ +WKALE+L+GA SK++ N +R +Q T+K S M EYL MK
Sbjct: 123 TENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSSTMEEYLTQMKTW 182
Query: 182 SESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRL 241
++SL +AG+P N L + +L+GL++EY+PIV IE ++ +WQE++ TL+++++ L +
Sbjct: 183 ADSLAIAGDPYPENLLFANILAGLDSEYMPIVVLIEAREHFTWQEIYDTLLSYDSKLEHI 242
Query: 242 NIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSNDAKNNVRGRGRG 301
N VS A +S SA+ +K N+ N +Q QG + GR RG
Sbjct: 243 NNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRGGFRGGGGRFRG 302
Query: 302 RFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRNSAYMAIPEIVA 361
R R NNS+P+CQ+CGK+GH A+VCY R+D+N+ ++SN N S ++A PE V
Sbjct: 303 RGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSPSVFVATPETVD 362
Query: 362 EPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTKPITSGN 421
+ +W ADSGAT+HVT+D NL++KS+Y G +L VGNG +L+ISH+G L + +T +
Sbjct: 363 DTTWYADSGATNHVTNDAGNLDLKSNYRGDESLMVGNGKQLDISHVGLKSLPS--LTKHS 422
Query: 422 LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVLKDELYQV 481
+ L +LHVP+I++NLLS+++L DN+ F+EFH CCFVKDK T+ VL G LK+ LYQ+
Sbjct: 423 IILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTRMEVLRGRLKNGLYQL 482
Query: 482 KLPLQTSNQN 486
++P S N
Sbjct: 483 EIPTTKSAFN 487
BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match:
A0A803PEH4 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 360.5 bits (924), Expect = 1.3e-95
Identity = 217/514 (42.22%), Positives = 307/514 (59.73%), Query Frame = 0
Query: 2 TTEETENSSVPPQVVTNVAVPTPNPSPQFNTSFGHP-LGTVLTVKLDDKNYSLWRGMVLA 61
T NSSV +N N + Q +F P L ++KLD NY+LW+ MV
Sbjct: 12 TASSPTNSSVAGASSSN----NTNQASQLPNAFAPPTLNQPFSLKLDRNNYTLWKTMVST 71
Query: 62 VLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYVEWQAVDQALLGWLFGSMTP 121
++RG + GY+ GTL PP+F++ +T+ T NPEY W DQ L+GWL+ SMT
Sbjct: 72 IVRGHRLHGYLSGTLMCPPEFVMVGDTQVT------NPEYENWIITDQLLMGWLYSSMTE 131
Query: 122 SIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLKMSEYLGLMKQASE 181
IA +V+ S+ + + LE LYGA SK++++ R ++Q T+K S MSEYL K S
Sbjct: 132 GIATEVMGSHSAANLQRNLESLYGAYSKSKMDDTRTLIQTTRKGSTLMSEYLRQKKNWSN 191
Query: 182 SLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFATLVTFENTLMRLNI 241
L LAG+P +L++ VL GL+AEYL IV QIE + +T+WQEL L++F++ + RL
Sbjct: 192 MLALAGDPYPEAHLVANVLFGLDAEYLSIVVQIEARSNTTWQELQDLLLSFDSKIERLQN 251
Query: 242 VSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSY-NSNDAKNNVRGRGRGR 301
++ + + S + +K N+ G + QSQ+ G + NS N RGRGRG
Sbjct: 252 LTLNSNKATSSSPQANMAAKTNNNGRGRGFQSQNASTNSGGLFSNSRGTSNRFRGRGRG- 311
Query: 302 FSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENF----------NNLSSSNNNRNSAYMA 361
G+ S+P+CQ+ GKYGH AAVCY RFDE++ N + NN +SA++A
Sbjct: 312 ----PGSGSRPTCQVYGKYGHTAAVCYNRFDESYMGSDPNNPHNQNKAGQTNNNHSAFVA 371
Query: 362 IPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHTCLQTK 421
PE++ +W ADSGA++H+TSD +NL K DYNGK ++ VGNG++L I+HIG+ L
Sbjct: 372 TPEVLEFDAWFADSGASNHITSDPANLTQKQDYNGKESVVVGNGSKLRITHIGNGKLN-- 431
Query: 422 PITSGN-LQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVLHGVL 481
I SGN L L ++L VPKI +NL+S++KL DNN +EF+ C VKDK TKKV+LHGVL
Sbjct: 432 -IESGNYLLLKDMLLVPKIAKNLVSVSKLATDNNVLIEFYSNFCLVKDKVTKKVLLHGVL 491
Query: 482 KDELYQVKLPLQTSNQNQNQQRSMSSVQQCLASN 503
KDELYQ+ P S+ Q +S+ + SN
Sbjct: 492 KDELYQLDSPFTKSSHPYQQSNFLSAFTISVDSN 507
BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match:
A0A5C7ID32 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008518 PE=4 SV=1)
HSP 1 Score: 360.1 bits (923), Expect = 1.7e-95
Identity = 204/501 (40.72%), Positives = 305/501 (60.88%), Query Frame = 0
Query: 1 MTTEETENSSVPPQVVTNVAVPT----PNPSPQFNTSFGHPLGTVLTVKLDDKNYSLWRG 60
M+T + S++ P ++ A PT + S ++ FG+ L +KLD +N+ LW+
Sbjct: 1 MSTTSQQQSTLAPS-SSSTATPTVLQEGSNSSNESSPFGNKLNQSFAIKLDRQNFILWKT 60
Query: 61 MVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQ---------VNPEYVEWQAVD 120
MV +++G + DG++ T PP+FL SP T G NPEY +W D
Sbjct: 61 MVTTIIKGHRLDGHLYSTRPCPPEFLPSPTTPGVPSPTTPGVSDSGSCSNPEYEKWLVND 120
Query: 121 QALLGWLFGSMTPSIACDVVDFRSSREVWKALEDLYGATSKARINQLRNVLQNTKKNSLK 180
Q L+GWL+ SMT ++A V+ ++ +WKALE+L+GA SK++ N +R +Q T+K S
Sbjct: 121 QLLMGWLYSSMTENVALSVMGSTTAAGLWKALENLFGAYSKSKANTIRTSIQTTRKGSST 180
Query: 181 MSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDSTSWQELFAT 240
M EYL MK ++SL +AG+P N L + L+GL++EY+PIV IE ++ +WQE++ T
Sbjct: 181 MEEYLTQMKTWADSLAIAGDPYPENLLFANSLAGLDSEYMPIVVLIEAREHFTWQEIYDT 240
Query: 241 LVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQGRGSYNSND 300
L+++++ L +N VS A +S SA+ +K N+ N +Q QG +
Sbjct: 241 LLSYDSKLEHINNVS-AKGNLLSSPSAHLATNKPNNTPNTNKTSNQQNLNQGGNRAPNRG 300
Query: 301 AKNNVRGRGRGRFSPYRGNNSKPSCQLCGKYGHIAAVCYKRFDENFNN---LSSSNNNRN 360
GR RGR R NNS+P+CQ+CGK+GH A+VCY R+D+N+ ++SN N
Sbjct: 301 GFRGGGGRFRGRGG--RNNNSRPTCQVCGKFGHSASVCYFRYDDNYMGSVPTANSNANSP 360
Query: 361 SAYMAIPEIVAEPSWLADSGATDHVTSDLSNLNVKSDYNGKGTLTVGNGNRLEISHIGHT 420
S ++A PE V + +W ADSGATDHVT+D NL++KSDY G +L VGNG +L+ISH+G
Sbjct: 361 SVFVATPETVDDTTWYADSGATDHVTNDAGNLDLKSDYRGDESLMVGNGKQLDISHVGLK 420
Query: 421 CLQTKPITSGNLQLSNILHVPKIKRNLLSIAKLTADNNCFVEFHPTCCFVKDKETKKVVL 480
L + +T ++ L +LHVP+I++NLLS+++L DN+ F+EFH CCFVKDK T VL
Sbjct: 421 SLPS--LTKHSIILKQVLHVPEIRKNLLSVSRLVNDNDVFIEFHANCCFVKDKLTGMEVL 480
Query: 481 HGVLKDELYQVKLPLQTSNQN 486
G LK+ LYQ+++P S N
Sbjct: 481 RGRLKNGLYQLEIPTTKSAFN 495
BLAST of Moc02g14980 vs. TAIR 10
Match:
AT5G48050.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G34070.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 60.5 bits (145), Expect = 5.3e-09
Identity = 64/269 (23.79%), Positives = 120/269 (44.61%), Query Frame = 0
Query: 42 LTVKLDDKNYSLWRGMVLAVLRGQKFDGYVLGTLAKPPQFLVSPETEGTSDHLQVNPEYV 101
+T+ L+ NY +WR + + G++ G+ +P TE
Sbjct: 24 VTLDLNKLNYDVWRELFETLCLSFGVLGHIDGSSTP------TPMTE------------K 83
Query: 102 EWQAVDQALLGWLFGSMTPSIACDVVDFR-SSREVWKALEDLYGATSKARINQLRNVLQN 161
W+ D + W++G++T S+ ++ ++R++W +LE+L+ +AR Q N L+
Sbjct: 84 RWKERDGLVKMWIYGTITDSLLDTIIKVGCTARDLWLSLENLFRDNKEARALQFENELRT 143
Query: 162 TKKNSLKMSEYLGLMKQASESLKLAGEPVAFNYLMSCVLSGLEAEYLPIVCQIEGKDS-T 221
T + L + EY +K S+ L P++ L+ +L+GL +Y I+ I+ K
Sbjct: 144 TTIDDLSVHEYCQKLKSLSDLLTNVDSPISDRVLVMHLLNGLTEKYDYILNVIKHKSPFP 203
Query: 222 SWQELFATLVTFENTLMRLNIVSTATAEGISDGSANYVHSKQNSVGNRQFHQSQSGQGQG 281
S+ E + L+ E+ L + S + S + + +Q +++H + S G+G
Sbjct: 204 SFTEARSMLLMEESRLSNKSKSSLSHTNHPSLSNVLFTVPRQQERYPQEYHNNNSNMGRG 263
Query: 282 RGSYNSNDAKNNVRGRGRGRFSPYRGNNS 309
R + KN G GR Y NN+
Sbjct: 264 R-----SKKKNRGGGSSDGR---YNNNNN 266
The following BLAST results are available for this feature:
BLAST of Moc02g14980 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022157748.1 | 2.1e-217 | 99.74 | uncharacterized protein LOC111024384 isoform X1 [Momordica charantia]
XP_022157750.1 | 6.1e-217 | 99.48 | uncharacterized protein LOC111024384 isoform X2 [Momordica charantia]
TXG55646.1 | 3.7e-97 | 41.43 | hypothetical protein EZV62_020902 [Acer yangbiense]
TXG67243.1 | 3.4e-95 | 40.72 | hypothetical protein EZV62_008518 [Acer yangbiense]
TXG69253.1 | 1.3e-94 | 40.52 | hypothetical protein EZV62_004188 [Acer yangbiense]
BLAST of Moc02g14980 vs. ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
Q94HW2 | 1.7e-44 | 31.24 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94 | 3.6e-39 | 28.34 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
BLAST of Moc02g14980 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DU77 | 1.0e-217 | 99.74 | uncharacterized protein LOC111024384 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DTZ7 | 2.9e-217 | 99.48 | uncharacterized protein LOC111024384 isoform X2 OS=Momordica charantia OX=3673 G...
A0A5C7HHE9 | 1.8e-97 | 41.43 | Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_020902 PE=4 SV=1
A0A803PEH4 | 1.3e-95 | 42.22 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A5C7ID32 | 1.7e-95 | 40.72 | Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_008518 PE=4 SV=1
BLAST of Moc02g14980 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G48050.1 | 5.3e-09 | 23.79 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...
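The identity percentages in the summary are simply the identical-residue count divided by the alignment length from each HSP header. The following is a minimal sketch of how those header lines could be parsed and the percentage recomputed; the function name `parse_hsp` and the regular expressions are assumptions made for illustration and are not taken from any BLAST tool.

```python
import re

# Patterns written against the header lines as they appear in this report (assumed layout).
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*([\d.eE+-]+)")
IDENT_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")


def parse_hsp(score_line: str, identity_line: str) -> dict:
    """Extract score, E-value and identity counts from two HSP header lines."""
    score = SCORE_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("unrecognised HSP header")
    identical, aligned = int(ident.group(1)), int(ident.group(2))
    return {
        "bits": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(score.group(3)),
        "identical": identical,
        "aligned": aligned,
        # Recomputed from the counts rather than read from the parenthesised value.
        "percent_identity": round(100 * identical / aligned, 2),
    }


hit = parse_hsp(
    "HSP 1 Score: 766.1 bits (1977), Expect = 2.1e-217",
    "Identity = 385/386 (99.74%), Query Frame = 0",
)
print(hit["percent_identity"], hit["evalue"])
```

For the top NCBI nr hit, 385 identical residues over 386 aligned positions gives 99.74, matching the value listed in the summary.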