MC10g_new0109 (gene) Bitter gourd (Dali-11) v1

Overview
NameMC10g_new0109
Typegene
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationMC10: 4270522 .. 4275291 (-)
RNA-Seq ExpressionMC10g_new0109
SyntenyMC10g_new0109
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
AAACAAATTACTTCTCTTGTAAATTCATTCATAATGTAACTGATTAATTATTCTGATCAGTTGAGTTGGTTATTAGTTGATGTATTCACAGTGGGTCTATAAATACCACTGCTTTTGATCATATTGAATAAGGAAGAAGATTGAATTCTGTGATCACATGCGGAGTAAACTGAAATGGTATCAGAGCACCATAGTCTAACGACCCAATTGTCTTGAATTTTTTTCTTCTCCTCAAAATGACATCCTCATCGTCAAATCCAGATGCTGTGAATTCTTCACAAATAAACAAAATAGTAAATCCAGGGAGCAAAATCTCAACTGTCAAATTAAATGATGAGAATTTTTTGCTATGGAAACTCTAGATCTCAACAGCGCTTCAAGGGCACGATCTTGAACATCATATTACAGGGAAATCTGAAATTCCTCCTCAGATTATTTCGATTACTGGTGAGAATTCTTCCTCCACATTGACAGTAAATCCAAAGTATCATCAATGGATGAAGCAGGACAAATTGATTACCTCTTGGTTATTCAGCTCCATGTTCGAAGAAATTTTGGGAGAGATGATTCATTGCAACACTGCCAGAGAAGTTTGGCAGATTCTTGAAAACCTATACACATCAAGAAACCTTGCAAGAGTTATGCAACTCAAATCGAAGCTAGAAAACATCAAGAAAGGTAACCTACCTTTAAAAGATTACTTTCAAAAAGTAAAAGCGCTTGTTGATTCACTTGCTGCGGCTGGAAAAAAGGTTACTGTTGAAGATCATATTATGCATATCTTGACGGGTCTGAGAAGTGAATTTGAATCGACGGTTTCGGTTATCTCTGCGAGAACACAAACCCAGACACTTCAAGAAGTTTATTCTCTCCTTCTCTCACATGAAGGGCGTAATGAGAGGAATTCGATTAACACTGATGGTACCCTACCATCTGTTAATCTTACCCAGCAAACAAAGAATTCCAATTCTGCTCAAAGCATCGATGGTCAGCGTCCTTATATGCAGAATAACAGAAGCAAAAACAGTGGCAACCCTAATTTTAGAAGAAATTGGAATAGCAATAATCGTCCCCAATGTCAGATCTATGGCAAATTTGGGCATACTGCATTGAGATGCTATCTTCGGTTTGAGAAAACATTCTTGGGGCCAAATGGACAATCACATATGCAACATAAATTTTCTGGTGGATCTAATTCCAGCATAAATTCGAACATGACAGCATTTGGGAATCAACAACAGGCCTTCACCAATGGATTTCAGCCTAACTCAAATTCTCATATGGCTGCCTTTTTAGCTCAACAGGATTTTAATCGGGATACCAATTGGTACCCAGATTCAGGTGCAACAAATCATGTTACAAGTAATTTCAACAATCTGGCCACCAGCACAGAATACACGGGTGACAATCAAGTTCGGATTGGGAATGGTACAGGTTTGCAAGTACTTCACACTGGTTCATCTTTTGTAACTTCATCTAACTCTCTTCACTCTCCCAATACTAAACCTGTTTTTATCTTAGATAATCTACTCCATGTCCCCAATATCACAAAGAACCTCATAAGTGTCAGTCAATTTGCTCGTGATAACTCTGTATTCTTTGAATTTCACCCTTACCATTGCTTTGTGAAGGACCTTCAAACTGGCCACATTTTACTCCAAGGGAAGGTACATGATGGTCTATATCGATTTGAGTTGGAGAAGGCTGCTCCCACGTCCTCTGACAGAACTGCATCTGGTGTGTCTGTTGTCTCCTCCTCTAGTGTCAATACTCTTACACACAAAACATCTTTATCTCATGATTCTTTCTCTTTACCAATTAATTGTAATCTGTCTTTGTTTGATGTTTGGCATAGGCGATTAGGCCATGCTGCTCCAAATGTTGTTCATTCAATTCTTCGTTCATGTAATGTACCCATCCAAAATAAATCAAAATTCTATATGTCATGCTTGTGCTACTAGCAAATGTCATAGTCTTCCCTTTGTTGATTCAAATACTGTCTATTCTTCTCCACTTTCATTAGTAGTTGCTGATCTCTGGGGTCCTGCCTTTGTTTCATCCAGAAATGGGTTTAAATACTACATAAGTTTTGTGATGTTCATTCTCGATATACATGGATTTATTTCCTTACAACAAAATCCGAAGCTTTTAAAGCCATTACACTTTTCAAAGCACAAGCTGAAAAATCCTTGAATTGTTCTATTCGTCGCCTTCAAACTGATGGTGGGGGTGAGTTTTCTTCGTTTATTCCTTTTCTGAAATCCCATGGTATAGAACATAGAGTTTCTTGTCCATATACATCACAACAAAATGGCATCGTTGAGAGGAAACATAGGCATATTGTAGATACGGGTCTCACCATTCTTTCTCAAGCATCTATGCCTCTTACTTTTTGGGATGACGCCTTCTCCACGGCTGTGTATTTGATCAATCGCCTTCCTACCTCTGTTCTTTATGGTACCTCCCCAATTGAAACTCTCTTTGGTTTCCAACCTCACTACTTGTTTCTAAAAACGTTTGGATGCCTCTGTTTCCCATCCTTACGCTTATATAATCAACATAAAATTCAACCTCGCTCCTCTGCTTGTCTTTTCATTGGCTATAGTAATATTCACCATGGTTATAAATGTTTGTCACCTTCTGGTCGTTTATACATATCGCGTCATGTTTTATTTGACGAAACTGTTTTCCCGTATCTGCAGTTATTTCAAAAACCAACCTCTCATTCTTCTAGCCCTACAATCCAAACCACACTCCCTATTATATCACCAACTCCTTCATCACCATCACCTCAGCGTGACACTTCAGTTGTTCGTGACCCATCTGAATCTTGTGAATCTTCCCCTGATATTCATGTTGCATTGCCTGTCAATAATTATGAAAATGTACCTGCTGCAATACCTACTAATGACTTTGATGAAAATTCTAGTACCACTTTACCTACTGGTGTTGCTTCAACTTCTCAAGCAGTTTCAAATTCAAATGTTCAGTCCACTGAACAAATGCCTCCACAGAATCATCATCACATGACCACCCGAGCAAAAAATGGGATTTTTAAGCCAAGGATTTTCTTAGCTAACTATACTGCTGTTGAACCACCAACAGTAAAGGAAGCTTTAACCAGTCCACATTGGGTAAAGGCAATGCAAGATGAGTACGATGCCCTTCTCCGAAATCAAACTTGGTCACTTGTTCCTTTACCAAATAATAAGAAGCTTGTTGGTTGTAAATGGGTGTTCAAAGTAAAGAGAAACTTGGATGGTTCCATTGCTCGGTATAAGGCTCGGCTTGTAGCCAAAGGTTTCCACCAAACTGTTGATATAGACTACACCGAGACATTCAGTCCCGTCGTCAAACAAGTTACCATTCGCATTCTCTTTACTCTTGCCTTGTCACATAACTGGAGTCTTCGGCAAGTTGATATAAACAACACGTTCCTTCATGGAATGCTCACTGAGGATGTTTATATGATGCAGCCTTCTGGCTTTGTTCAAGCTTCTTCTTCTCGATTGGTATGCAAACTACATAAAGCTCTATACGGGTTAAAGCAGGCCCCTAGAGCGTGGTATGAACGTCTCACCAGTTATTTGCATACCTTGGGATTCTGTACATCAAAAGCCGACTCTTCACTGCTTTTTCGATAATATAATCATTACTGGTAGCTCGACGTCTGCCATTGACTCTTTGATTCAACTTTTAAATTCTATCTTTGCTCTAAAAGACCTTGGGCAACTGAGTTACTTCTTAGGCATTGAAGTTTCTTATCCCAAAACTGGAGGGTTATTTTTATCCCAAGCTAAGTATGTTACAGATTTACTTCATAAAACAAAAATGCACGAGGCTAATGCCTTATCTACTCCCATGATAAGTGGTTCAGTAGTATCAGCATTTCATGGTGATCCGTTTCATGATGTTTATTTGTATCGTAGCACCGTTGGGGCCCTTCAGTATGTGACGATTACAAGACCTGAGCTTACCTACTCGGTCAATAAAGTCTCACAATTCATGCATGCTCCAACCTTGACTCATTGGCAGGCAGTTAAGAGAATCTTACGTTACTTGGCGGGCTCATTTGATCATGGATTACTTTTATCTCCGCCTTCTGATTTATCGTTACAAGGCTTCGTAGACTCTGACTGGGCTTCTGACCCAAATGATAGGCATTCCACATCTGGCTTTTGTATTCAATTCGGTGGTAATCTTATGTCTTGGACATCAAAGAAGCAAGCTGTTGTCTCACGTTCCAGTACCGTAGCCTTGCCCATGCTGCTGCAGACCTCATCTGGATACAAATCCTTTTATCTGAATTGCGGCTGTCGCTTCATCAACCACCTATTTTATGGTGTGACAATCTCAGCGCAGTGCATTTAAGTGCCAATCCTGTGTTGCATTCTAGAGCAAAACATGTAGAGATTGACATTTATTTTGTTAGAGATCTTGTTTTACAACATCAGTTACAAGTCACTCATATTCCTGCTGCTGCCCAAATAGCTGATATCTTAACAAAGCCGCTCTCTGCAGCACGATTTCTTCCTCTCAAAGTCAAACTCAATGTTCATTCTCCATCGAACATTGGTTTGCAGGGGGGGGGGGGCTGTTAAGCTCACTCACTAGTCCAAACAAATTACTTCTGTTGTAAATTCATTCATAATGTAACTGATTAGTTATTCTGATTAGTTGAGTTGGTTATTAGTTGATGTATTCACAGTGGGTCTATAAATACCACTGCTTTTGATCATATTGAATAAGGAAGAAGATTGAATTCTGTGA

mRNA sequence

ATGCCTCTTACTTTTTGGGATGACGCCTTCTCCACGGCTGTGTATTTGATCAATCGCCTTCCTACCTCTGTTCTTTATGGTACCTCCCCAATTGAAACTCTCTTTGGTTTCCAACCTCACTACTTGTTTCTAAAAACGTTTGGATGCCTCTGTTTCCCATCCTTACGCTTATATAATCAACATAAAATTCAACCTCGCTCCTCTGCTTGTCTTTTCATTGGCTATAGTAATATTCACCATGGTTATAAATGTTTGTCACCTTCTGGTCGTTTATACATATCGCGTCATGTTTTATTTGACGAAACTGTTTTCCCGTATCTGCAGTTATTTCAAAAACCAACCTCTCATTCTTCTAGCCCTACAATCCAAACCACACTCCCTATTATATCACCAACTCCTTCATCACCATCACCTCAGCGTGACACTTCAGTTGTTCGTGACCCATCTGAATCTTGTGAATCTTCCCCTGATATTCATGTTGCATTGCCTGTCAATAATTATGAAAATGTACCTGCTGCAATACCTACTAATGACTTTGATGAAAATTCTAGTACCACTTTACCTACTGGTGTTGCTTCAACTTCTCAAGCAGTTTCAAATTCAAATGTTCAGTCCACTGAACAAATGCCTCCACAGAATCATCATCACATGACCACCCGAGCAAAAAATGGGATTTTTAAGCCAAGGATTTTCTTAGCTAACTATACTGCTGTTGAACCACCAACAGTAAAGGAAGCTTTAACCAGTCCACATTGGGTAAAGGCAATGCAAGATGAGTACGATGCCCTTCTCCGAAATCAAACTTGGTCACTTGTTCCTTTACCAAATAATAAGAAGCTTGTTGGTTGTAAATGGGTGTTCAAAGTAAAGAGAAACTTGGATGGTTCCATTGCTCGGTATAAGGCTCGGCTTGTAGCCAAAGGTTTCCACCAAACTGTTGATATAGACTACACCGAGACATTCAGTCCCGTCGTCAAACAAGTTACCATTCGCATTCTCTTTACTCTTGCCTTGTCACATAACTGGAGTCTTCGGCAAGTTGATATAAACAACACGTTCCTTCATGGAATGCTCACTGAGGATGTTTATATGATGCAGCCTTCTGGCTTTGTTCAAGCTTCTTCTTCTCGATTGGTATGCAAACTACATAAAGCTCTATACGGGTTAAAGCAGGCCCCTAGAGCGTGGTATGAACGTCTCACCAGTTATTTGCATACCTTGGGATTCTGTACATCAAAAGCCGACTCTTCACTGCTTTTTCGATAA

Coding sequence (CDS)

ATGCCTCTTACTTTTTGGGATGACGCCTTCTCCACGGCTGTGTATTTGATCAATCGCCTTCCTACCTCTGTTCTTTATGGTACCTCCCCAATTGAAACTCTCTTTGGTTTCCAACCTCACTACTTGTTTCTAAAAACGTTTGGATGCCTCTGTTTCCCATCCTTACGCTTATATAATCAACATAAAATTCAACCTCGCTCCTCTGCTTGTCTTTTCATTGGCTATAGTAATATTCACCATGGTTATAAATGTTTGTCACCTTCTGGTCGTTTATACATATCGCGTCATGTTTTATTTGACGAAACTGTTTTCCCGTATCTGCAGTTATTTCAAAAACCAACCTCTCATTCTTCTAGCCCTACAATCCAAACCACACTCCCTATTATATCACCAACTCCTTCATCACCATCACCTCAGCGTGACACTTCAGTTGTTCGTGACCCATCTGAATCTTGTGAATCTTCCCCTGATATTCATGTTGCATTGCCTGTCAATAATTATGAAAATGTACCTGCTGCAATACCTACTAATGACTTTGATGAAAATTCTAGTACCACTTTACCTACTGGTGTTGCTTCAACTTCTCAAGCAGTTTCAAATTCAAATGTTCAGTCCACTGAACAAATGCCTCCACAGAATCATCATCACATGACCACCCGAGCAAAAAATGGGATTTTTAAGCCAAGGATTTTCTTAGCTAACTATACTGCTGTTGAACCACCAACAGTAAAGGAAGCTTTAACCAGTCCACATTGGGTAAAGGCAATGCAAGATGAGTACGATGCCCTTCTCCGAAATCAAACTTGGTCACTTGTTCCTTTACCAAATAATAAGAAGCTTGTTGGTTGTAAATGGGTGTTCAAAGTAAAGAGAAACTTGGATGGTTCCATTGCTCGGTATAAGGCTCGGCTTGTAGCCAAAGGTTTCCACCAAACTGTTGATATAGACTACACCGAGACATTCAGTCCCGTCGTCAAACAAGTTACCATTCGCATTCTCTTTACTCTTGCCTTGTCACATAACTGGAGTCTTCGGCAAGTTGATATAAACAACACGTTCCTTCATGGAATGCTCACTGAGGATGTTTATATGATGCAGCCTTCTGGCTTTGTTCAAGCTTCTTCTTCTCGATTGGTATGCAAACTACATAAAGCTCTATACGGGTTAAAGCAGGCCCCTAGAGCGTGGTATGAACGTCTCACCAGTTATTTGCATACCTTGGGATTCTGTACATCAAAAGCCGACTCTTCACTGCTTTTTCGATAA

Protein sequence

MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQHKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSPTIQTTLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTNDFDENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRIFLANYTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLHGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKADSSLLFR
Homology
BLAST of MC10g_new0109 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 4.9e-86
Identity = 204/493 (41.38%), Postives = 264/493 (53.55%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +P T+W  AF+ AVYLINRLPT +L   SP + LFG  P+Y  L+ FGC C+P LR YNQ
Sbjct: 645  IPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQ 704

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLS-PSGRLYISRHVLFDETVFPY------LQLFQKP 120
            HK+  +S  C+F+GYS     Y CL   + RLYISRHV FDE  FP+      L   Q+ 
Sbjct: 705  HKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQ 764

Query: 121  TSHSS---SPTIQTTLPIISPT---------------PSSPS-PQRDTSVVRDP-----S 180
               SS   SP   TTLP  +P                PSSPS P R++ V         S
Sbjct: 765  RRESSCVWSP--HTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFS 824

Query: 181  ESCESSPD---------------IHVALPVNNYENVPAAIPTNDFDENSSTTLPTGVAST 240
             S  SSP+                      ++ +N     PTN+     + +L T   S+
Sbjct: 825  SSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSS 884

Query: 241  SQAVS---NSNVQSTEQMPPQ---------------------NHHHMTTRAKNGIFKPR- 300
            S + S   +++  ST   PP                      N H M TRAK GI KP  
Sbjct: 885  SSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNP 944

Query: 301  ---IFLANYTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLV-PLPNNKKLVGCKW 360
               + ++     EP T  +AL    W  AM  E +A + N TW LV P P++  +VGC+W
Sbjct: 945  KYSLAVSLAAESEPRTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRW 1004

Query: 361  VFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLR 419
            +F  K N DGS+ RYKARLVAKG++Q   +DY ETFSPV+K  +IRI+  +A+  +W +R
Sbjct: 1005 IFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIR 1064

BLAST of MC10g_new0109 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 319.7 bits (818), Expect = 4.9e-86
Identity = 205/495 (41.41%), Postives = 270/495 (54.55%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +P T+W  AFS AVYLINRLPT +L   SP + LFG  P+Y  LK FGC C+P LR YN+
Sbjct: 624  VPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNR 683

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLS-PSGRLYISRHVLFDETVFPY------LQLFQKP 120
            HK++ +S  C F+GYS     Y CL  P+GRLY SRHV FDE  FP+      +   Q+ 
Sbjct: 684  HKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQ 743

Query: 121  TSHSS----SPTIQTTLPIISPT--------------PSSPSPQRDTSV---------VR 180
             S S+    S T   T P++ P               PSSPSP   T V         + 
Sbjct: 744  RSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSIS 803

Query: 181  DPSESCESSPDIHVALPV-------NNYENVPA-------AIPTNDFDEN--------SS 240
             PS S  ++P  +   P        N+  N P        +   N  ++N        SS
Sbjct: 804  SPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISS 863

Query: 241  TTLPTGVASTSQAVS-NSNVQSTEQMP---------------PQNHHHMTTRAKNGIFKP 300
              +PT   S S+  S +S+  ST  +P               P N H M TRAK+GI KP
Sbjct: 864  PHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKP 923

Query: 301  RIFLANYTAV----EPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLV-PLPNNKKLVGC 360
                +  T++    EP T  +A+    W +AM  E +A + N TW LV P P +  +VGC
Sbjct: 924  NQKYSYATSLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGC 983

Query: 361  KWVFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWS 419
            +W+F  K N DGS+ RYKARLVAKG++Q   +DY ETFSPV+K  +IRI+  +A+  +W 
Sbjct: 984  RWIFTKKFNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWP 1043

BLAST of MC10g_new0109 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 211.5 bits (537), Expect = 1.9e-53
Identity = 141/434 (32.49%), Postives = 207/434 (47.70%), Query Frame = 0

Query: 1   MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
           +P +FW +A  TA YLINR P+  L    P       +  Y  LK FGC  F  +    +
Sbjct: 605 LPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKEQR 664

Query: 61  HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYI-SRHVLFDETVFPYLQLFQKPTSHSSS 120
            K+  +S  C+FIGY +   GY+   P  +  I SR V+F E+         +   +   
Sbjct: 665 TKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKVKNGII 724

Query: 121 PTIQTTLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTNDF 180
           P    T+P  S  P+S     D     + SE  E   ++     +   E +         
Sbjct: 725 PNF-VTIPSTSNNPTSAESTTD-----EVSEQGEQPGEV-----IEQGEQL--------- 784

Query: 181 DENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRIFLANYTAV- 240
                               +  V+  E        H   R      +PR+    Y +  
Sbjct: 785 --------------------DEGVEEVEHPTQGEEQHQPLRRSE---RPRVESRRYPSTE 844

Query: 241 --------EPPTVKEALTSP---HWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVF 300
                   EP ++KE L+ P     +KAMQ+E ++L +N T+ LV LP  K+ + CKWVF
Sbjct: 845 YVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVF 904

Query: 301 KVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQV 360
           K+K++ D  + RYKARLV KGF Q   ID+ E FSPVVK  +IR + +LA S +  + Q+
Sbjct: 905 KLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQL 964

Query: 361 DINNTFLHGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTL 420
           D+   FLHG L E++YM QP GF  A    +VCKL+K+LYGLKQAPR WY +  S++ + 
Sbjct: 965 DVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQ 995

Query: 421 GFCTSKADSSLLFR 422
            +  + +D  + F+
Sbjct: 1025 TYLKTYSDPCVYFK 995

BLAST of MC10g_new0109 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 151.4 bits (381), Expect = 2.3e-35
Identity = 130/465 (27.96%), Postives = 207/465 (44.52%), Query Frame = 0

Query: 4    TFWDDAFSTAVYLINRLPTSVLYGTS--PIETLFGFQPHYLFLKTFGCLCFPSLRLYNQH 63
            +FW +A  TA YLINR+P+  L  +S  P E     +P+   L+ FG   +  ++   Q 
Sbjct: 608  SFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHIK-NKQG 667

Query: 64   KIQPRSSACLFIGYSNIHHGYKCL-SPSGRLYISRHVLFDET------VFPYLQLFQKPT 123
            K   +S   +F+GY    +G+K   + + +  ++R V+ DET         +  +F K +
Sbjct: 668  KFDDKSFKSIFVGYE--PNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDS 727

Query: 124  SHS--------SSPTIQTTLP---------------IISPTPSSPSPQRDTSVVRDPSES 183
              S        S   IQT  P                 S   + P+  R       P+ES
Sbjct: 728  KESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNES 787

Query: 184  --CESSPDIHVALPVNNY---------------ENVPAAIPTNDFDENSSTTL-PTGV-- 243
              C++   +  +   N Y               E+  +  P    +  ++  L   G+  
Sbjct: 788  KECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDN 847

Query: 244  ASTSQAVSNSNVQSTE-QMPPQNHHHMTTRAKNGIFKPRIFLANYTAVEPPTVKEALTSP 303
             + +  +   N +S   +  PQ  ++    + N +      + N        ++      
Sbjct: 848  PTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKS 907

Query: 304  HWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDGSIARYKARLVAKGFH 363
             W +A+  E +A   N TW++   P NK +V  +WVF VK N  G+  RYKARLVA+GF 
Sbjct: 908  SWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFT 967

Query: 364  QTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLHGMLTEDVYMMQPSGF 416
            Q   IDY ETF+PV +  + R + +L + +N  + Q+D+   FL+G L E++YM  P G 
Sbjct: 968  QKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI 1027

BLAST of MC10g_new0109 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.8e-27
Identity = 66/125 (52.80%), Postives = 84/125 (67.20%), Query Frame = 0

Query: 217 MTTRAKNGIFK--PRIFLANYTAV--EPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLV 276
           M TR+K GI K  P+  L   T +  EP +V  AL  P W +AMQ+E DAL RN+TW LV
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 277 PLPNNKKLVGCKWVFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRI 336
           P P N+ ++GCKWVFK K + DG++ R KARLVAKGFHQ   I + ET+SPVV+  TIR 
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120

Query: 337 LFTLA 338
           +  +A
Sbjct: 121 ILNVA 125

BLAST of MC10g_new0109 vs. NCBI nr
Match: KYP64199.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 407 bits (1047), Expect = 2.71e-126
Identity = 224/424 (52.83%), Postives = 278/424 (65.57%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +PL FWD AF TA YLINRLP+S +   SP + +    P Y FLK FGC CFP LR YNQ
Sbjct: 681  LPLNFWDHAFLTATYLINRLPSSSVGFQSPYKLIHHKDPDYTFLKVFGCSCFPLLRPYNQ 740

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSP 120
            HK+QPRS  C+F+GYS +H GYKCLS SGR+YIS+ V+F+E  FPY  LF   TS S   
Sbjct: 741  HKLQPRSEECVFLGYSPLHKGYKCLSKSGRIYISKDVIFNEGRFPYHDLFVTATSDSIPA 800

Query: 121  TIQTTLP--IISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTND 180
            T  +TLP  + S  PSS      +++V     +   S + + +LP               
Sbjct: 801  TSVSTLPSLVCSHNPSS------STLVSPTVPTNSGSSESNFSLP--------------S 860

Query: 181  FDENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRI----FLAN 240
              +NSST  P   +S+S ++S          PP N H M TR+KNGI +PR+     L+N
Sbjct: 861  MPDNSSTNSPPA-SSSSPSLS---------APPLNIHPMITRSKNGILQPRLNPTLLLSN 920

Query: 241  YTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLD 300
               VEP TVK ALT PHW+ AMQ E  AL  N TW+LV LP  +K +GCKWVF++K N D
Sbjct: 921  ---VEPKTVKSALTDPHWLSAMQAELTALHDNHTWTLVDLPPGRKSIGCKWVFRLKENPD 980

Query: 301  GSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFL 360
            GSI +YKARLVAKGFHQ   +D++ETFSPVVK VTIRI+ +LA++  W LRQ+DINN FL
Sbjct: 981  GSINKYKARLVAKGFHQQPGVDFSETFSPVVKPVTIRIVLSLAVTFQWPLRQLDINNAFL 1040

Query: 361  HGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKA 418
            +GML ED+YM QP GF++ +S  LVCKLHK+LYGLKQAPRAW++RLTS L  LGF  SK 
Sbjct: 1041 NGMLEEDIYMSQPPGFIEPNSKHLVCKLHKSLYGLKQAPRAWFDRLTSVLLKLGFHKSKC 1071

BLAST of MC10g_new0109 vs. NCBI nr
Match: PNX89511.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense])

HSP 1 Score: 377 bits (969), Expect = 1.54e-124
Identity = 207/424 (48.82%), Postives = 271/424 (63.92%), Query Frame = 0

Query: 1   MPLTFWDDAFSTAVYLINRLPTSVL-YGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYN 60
           +PL FWD +F+ AVYLIN+LPTS   +  SP   LF  QP Y  LK FGCLCFP LR YN
Sbjct: 11  LPLKFWDHSFTQAVYLINKLPTSAFNHFKSPHHALFKTQPDYSQLKVFGCLCFPHLRPYN 70

Query: 61  QHKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSS 120
           +HK+Q RSS C+++G S  H G+KCL   GR+YIS+ V+F E+ FPY+ +F   T++  +
Sbjct: 71  KHKLQYRSSPCVYLGVSPQHKGHKCLDEQGRIYISKDVIFHESQFPYISMFPNSTTNPDN 130

Query: 121 PTIQTTLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALP----VNNYENVPAAIP 180
                T  I+S     P      S+    S S  +S       P    ++N+  +     
Sbjct: 131 SVTPLTHSILSH--HMPQNGHSLSITNTNSNSESNSLTSPKQAPGDKDISNHHQL----- 190

Query: 181 TNDFDENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRIFLANY 240
                 NS +  PT  ++T++A + S + +T      N H M TR K G  KP+ F    
Sbjct: 191 ---LKTNSPSNPPT--STTNKAQNLSPITTTSH---HNDHPMITRGKTGNLKPKAFT--- 250

Query: 241 TAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDG 300
           T +EP TVK AL  P W++AM  E+ AL  N TWSLVPLP +KK +GCKW+F++K N DG
Sbjct: 251 TVLEPTTVKSALADPKWLQAMHTEFKALTDNNTWSLVPLPPHKKAIGCKWIFRIKENPDG 310

Query: 301 SIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLH 360
           +I +YKARLVAKGF QT   D+TETFSPV+K VTIRI+ TLA++H W ++Q+DINN FL+
Sbjct: 311 TINKYKARLVAKGFLQTPGFDFTETFSPVIKPVTIRIILTLAVTHKWVVQQIDINNAFLN 370

Query: 361 GMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKAD 419
           G+L E+VYM QP+GF ++S   LVCKLHK+LYGLKQAPRAWYERLT  L  +GF  SK D
Sbjct: 371 GILHEEVYMKQPAGF-ESSDKSLVCKLHKSLYGLKQAPRAWYERLTQTLLQMGFIASKCD 415

BLAST of MC10g_new0109 vs. NCBI nr
Match: MCH81678.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium medium])

HSP 1 Score: 388 bits (997), Expect = 1.83e-124
Identity = 217/441 (49.21%), Postives = 282/441 (63.95%), Query Frame = 0

Query: 1   MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
           MPL FWD AF TA YLINR+PT+VL   SP   L+   P Y FLK FGC C+P LR YN 
Sbjct: 1   MPLKFWDYAFITATYLINRMPTAVLNMQSPYFMLYHVVPDYKFLKVFGCACYPHLRPYNP 60

Query: 61  HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLF-----QKPTS 120
           HK    S  C+F+GYS  H GYKCL+  GR+YIS+ V+F+E  FPY  LF     Q   S
Sbjct: 61  HKFAYHSKECVFLGYSPSHKGYKCLASDGRIYISKDVIFNEIRFPYHDLFPSTQSQSTVS 120

Query: 121 HSSSPT-IQTTLPIISPT-----PSSPSPQ---RDTSVVRDPSESCESSPDIHVALPVNN 180
           HS  P  +   LP  SP      P SP P      TS +  P++S +SSP   +++P   
Sbjct: 121 HSLHPIPVSFKLPTTSPPITTNHPFSPPPSPLNAFTSAISSPAQS-QSSP---LSVPT-- 180

Query: 181 YENVPAAIPTNDFDENSSTTLPTGVASTSQAVSNSNVQSTEQMPP--------QNHHHMT 240
               P++  T      +S T P+     S ++S ++       PP         N H M 
Sbjct: 181 ----PSSSHTIPLSPTASVTPPSVHTPISASLSPNSASEGVPTPPPPAHKIHPHNSHSMA 240

Query: 241 TRAKNGIFKPRIF-LANYTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNN 300
           TRAK+GI +PR+      T +EP + K AL  P W+ AM+DEY+ALL+N TW+L  LP++
Sbjct: 241 TRAKHGIVQPRLHPTLLLTELEPTSYKTALQDPKWLAAMKDEYNALLKNNTWTLTLLPSD 300

Query: 301 KKLVGCKWVFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLA 360
           +K++GCKWVF+VK+N DGSI +YKARLVAKGFHQ    D+TETFSPVVK +T+R + T+A
Sbjct: 301 RKVIGCKWVFRVKQNPDGSILKYKARLVAKGFHQQHGFDFTETFSPVVKPITVRTVLTIA 360

Query: 361 LSHNWSLRQVDINNTFLHGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWY 418
           +S  W + ++D+NN FL+G+L E+VYM QP GFV + +S LVCKL+KALYGLKQAPRAW+
Sbjct: 361 ISRQWHITRLDVNNAFLNGILEEEVYMQQPPGFVNSDTS-LVCKLNKALYGLKQAPRAWF 420

BLAST of MC10g_new0109 vs. NCBI nr
Match: PNX92571.1 (histone deacetylase [Trifolium pratense])

HSP 1 Score: 402 bits (1033), Expect = 7.38e-124
Identity = 217/419 (51.79%), Postives = 282/419 (67.30%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +P+TFWD AF TAVYLINRLP+S +   +P   LF   P Y FLK FGC CFP LR Y+ 
Sbjct: 745  LPITFWDYAFPTAVYLINRLPSSSINFQTPYFLLFKQHPDYHFLKVFGCACFPLLRPYHN 804

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSP 120
            HK++ RS  CLF+GYS  H GY+CLSPSGRLY+S+ VLF+E+ FPY +LF   +  S SP
Sbjct: 805  HKLEFRSQECLFLGYSPSHKGYRCLSPSGRLYVSKDVLFNESRFPYKELFPISSGSSHSP 864

Query: 121  TIQT-TLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTNDF 180
              ++  LP   P P+ PS   D +    P+    SSP   +  P  +  N P +   +D 
Sbjct: 865  PSKSFKLP---PLPTFPSITTDITSPLPPTAPHISSPPTPINDP--SPPNSPLSATASDQ 924

Query: 181  DENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRIFLANYTAVE 240
                ST  P+  +STS  VS         + P N H+M TRAK+G  +P++ +A+    E
Sbjct: 925  SSPLSTPSPSTASSTSHHVSIPPRAVPVPIIPVNAHNMQTRAKSGFKQPKLLVAHS---E 984

Query: 241  PPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDGSIAR 300
            P +VK+AL  P W  AMQ EYDALL N TW+LVPLP +++ +GCKWVF++K N DG++ +
Sbjct: 985  PKSVKQALLDPSWHAAMQTEYDALLNNNTWTLVPLPPDRQAIGCKWVFRIKENPDGTVNK 1044

Query: 301  YKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLHGMLT 360
            YKARLVAKGFHQ    D+ ETFSPVVK VTIR++ T+A++  WS++Q+D+NN FL+G+L 
Sbjct: 1045 YKARLVAKGFHQRQGFDFLETFSPVVKPVTIRVILTIAITKGWSIQQLDVNNAFLNGVLD 1104

Query: 361  EDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKADSSL 418
            E+VYM+QP GF ++S S LVCKLHKALYGLKQAPR W+ERL S L  LGF +SK D SL
Sbjct: 1105 EEVYMLQPQGF-ESSDSSLVCKLHKALYGLKQAPRQWFERLQSTLLLLGFKSSKCDPSL 1154

BLAST of MC10g_new0109 vs. NCBI nr
Match: RVX14937.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 398 bits (1023), Expect = 7.84e-123
Identity = 211/432 (48.84%), Postives = 277/432 (64.12%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +PL FWD++F T VYL NRLPT+VL+   PIE LF   P Y FLK FGC CFP+LR YN 
Sbjct: 659  LPLKFWDESFRTVVYLSNRLPTAVLHHKCPIEVLFKSIPDYSFLKVFGCSCFPNLRPYNT 718

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSP 120
            HK+Q RS  C F+GYS  H GYKC+S +GR+YISR V+F+ET FPY +  Q  +   S+ 
Sbjct: 719  HKLQYRSEECTFLGYSLKHKGYKCMSSNGRVYISRDVIFNETSFPYSKTIQVSSCLPST- 778

Query: 121  TIQTTLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTNDFD 180
                    +SP+ S  SP     V+        +SP I  A P++  +N+   + T+   
Sbjct: 779  --------VSPSTSHLSPSASPPVLSPTMLPAPTSP-ISSARPISEMDNI---VSTHPHA 838

Query: 181  ENSSTT-----------LPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPR 240
             NS+ T           + T V     ++++++V  T      N H M TRAK+GI KP+
Sbjct: 839  PNSADTTLTPAQVVSNPVATPVQHVVSSIADASVTRTIAKDADNTHPMITRAKSGIVKPK 898

Query: 241  IFLANYTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKV 300
            IF+A     EP +V  AL    W KAM  EYDAL RN TWSLVPLP  ++ +GCKWV+K 
Sbjct: 899  IFIA--AVREPSSVSAALQQDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAIGCKWVYKT 958

Query: 301  KRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDI 360
            K N DG++ +YKARLVAKGFHQ    D+TETFSPVVK  TIR++FT+ALS NW+++Q+D+
Sbjct: 959  KENPDGTVQKYKARLVAKGFHQQAGFDFTETFSPVVKPSTIRVVFTIALSRNWAIKQLDV 1018

Query: 361  NNTFLHGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGF 420
            NN FL+G L E+V+M QP GF+   +  LVC+LHKALYGLKQAPRAW+E+L   L + GF
Sbjct: 1019 NNAFLNGDLQEEVFMQQPQGFIDEKNPNLVCRLHKALYGLKQAPRAWFEKLHQALLSFGF 1075

BLAST of MC10g_new0109 vs. ExPASy TrEMBL
Match: A0A151TAX5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=3821 GN=KK1_018789 PE=4 SV=1)

HSP 1 Score: 407 bits (1047), Expect = 1.31e-126
Identity = 224/424 (52.83%), Postives = 278/424 (65.57%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +PL FWD AF TA YLINRLP+S +   SP + +    P Y FLK FGC CFP LR YNQ
Sbjct: 681  LPLNFWDHAFLTATYLINRLPSSSVGFQSPYKLIHHKDPDYTFLKVFGCSCFPLLRPYNQ 740

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSP 120
            HK+QPRS  C+F+GYS +H GYKCLS SGR+YIS+ V+F+E  FPY  LF   TS S   
Sbjct: 741  HKLQPRSEECVFLGYSPLHKGYKCLSKSGRIYISKDVIFNEGRFPYHDLFVTATSDSIPA 800

Query: 121  TIQTTLP--IISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTND 180
            T  +TLP  + S  PSS      +++V     +   S + + +LP               
Sbjct: 801  TSVSTLPSLVCSHNPSS------STLVSPTVPTNSGSSESNFSLP--------------S 860

Query: 181  FDENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRI----FLAN 240
              +NSST  P   +S+S ++S          PP N H M TR+KNGI +PR+     L+N
Sbjct: 861  MPDNSSTNSPPA-SSSSPSLS---------APPLNIHPMITRSKNGILQPRLNPTLLLSN 920

Query: 241  YTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLD 300
               VEP TVK ALT PHW+ AMQ E  AL  N TW+LV LP  +K +GCKWVF++K N D
Sbjct: 921  ---VEPKTVKSALTDPHWLSAMQAELTALHDNHTWTLVDLPPGRKSIGCKWVFRLKENPD 980

Query: 301  GSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFL 360
            GSI +YKARLVAKGFHQ   +D++ETFSPVVK VTIRI+ +LA++  W LRQ+DINN FL
Sbjct: 981  GSINKYKARLVAKGFHQQPGVDFSETFSPVVKPVTIRIVLSLAVTFQWPLRQLDINNAFL 1040

Query: 361  HGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKA 418
            +GML ED+YM QP GF++ +S  LVCKLHK+LYGLKQAPRAW++RLTS L  LGF  SK 
Sbjct: 1041 NGMLEEDIYMSQPPGFIEPNSKHLVCKLHKSLYGLKQAPRAWFDRLTSVLLKLGFHKSKC 1071

BLAST of MC10g_new0109 vs. ExPASy TrEMBL
Match: A0A2K3MFF1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratense OX=57577 GN=L195_g045631 PE=4 SV=1)

HSP 1 Score: 377 bits (969), Expect = 7.46e-125
Identity = 207/424 (48.82%), Postives = 271/424 (63.92%), Query Frame = 0

Query: 1   MPLTFWDDAFSTAVYLINRLPTSVL-YGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYN 60
           +PL FWD +F+ AVYLIN+LPTS   +  SP   LF  QP Y  LK FGCLCFP LR YN
Sbjct: 11  LPLKFWDHSFTQAVYLINKLPTSAFNHFKSPHHALFKTQPDYSQLKVFGCLCFPHLRPYN 70

Query: 61  QHKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSS 120
           +HK+Q RSS C+++G S  H G+KCL   GR+YIS+ V+F E+ FPY+ +F   T++  +
Sbjct: 71  KHKLQYRSSPCVYLGVSPQHKGHKCLDEQGRIYISKDVIFHESQFPYISMFPNSTTNPDN 130

Query: 121 PTIQTTLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALP----VNNYENVPAAIP 180
                T  I+S     P      S+    S S  +S       P    ++N+  +     
Sbjct: 131 SVTPLTHSILSH--HMPQNGHSLSITNTNSNSESNSLTSPKQAPGDKDISNHHQL----- 190

Query: 181 TNDFDENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRIFLANY 240
                 NS +  PT  ++T++A + S + +T      N H M TR K G  KP+ F    
Sbjct: 191 ---LKTNSPSNPPT--STTNKAQNLSPITTTSH---HNDHPMITRGKTGNLKPKAFT--- 250

Query: 241 TAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDG 300
           T +EP TVK AL  P W++AM  E+ AL  N TWSLVPLP +KK +GCKW+F++K N DG
Sbjct: 251 TVLEPTTVKSALADPKWLQAMHTEFKALTDNNTWSLVPLPPHKKAIGCKWIFRIKENPDG 310

Query: 301 SIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLH 360
           +I +YKARLVAKGF QT   D+TETFSPV+K VTIRI+ TLA++H W ++Q+DINN FL+
Sbjct: 311 TINKYKARLVAKGFLQTPGFDFTETFSPVIKPVTIRIILTLAVTHKWVVQQIDINNAFLN 370

Query: 361 GMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKAD 419
           G+L E+VYM QP+GF ++S   LVCKLHK+LYGLKQAPRAWYERLT  L  +GF  SK D
Sbjct: 371 GILHEEVYMKQPAGF-ESSDKSLVCKLHKSLYGLKQAPRAWYERLTQTLLQMGFIASKCD 415

BLAST of MC10g_new0109 vs. ExPASy TrEMBL
Match: A0A392M4K2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium medium OX=97028 GN=A2U01_0002469 PE=4 SV=1)

HSP 1 Score: 388 bits (997), Expect = 8.87e-125
Identity = 217/441 (49.21%), Postives = 282/441 (63.95%), Query Frame = 0

Query: 1   MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
           MPL FWD AF TA YLINR+PT+VL   SP   L+   P Y FLK FGC C+P LR YN 
Sbjct: 1   MPLKFWDYAFITATYLINRMPTAVLNMQSPYFMLYHVVPDYKFLKVFGCACYPHLRPYNP 60

Query: 61  HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLF-----QKPTS 120
           HK    S  C+F+GYS  H GYKCL+  GR+YIS+ V+F+E  FPY  LF     Q   S
Sbjct: 61  HKFAYHSKECVFLGYSPSHKGYKCLASDGRIYISKDVIFNEIRFPYHDLFPSTQSQSTVS 120

Query: 121 HSSSPT-IQTTLPIISPT-----PSSPSPQ---RDTSVVRDPSESCESSPDIHVALPVNN 180
           HS  P  +   LP  SP      P SP P      TS +  P++S +SSP   +++P   
Sbjct: 121 HSLHPIPVSFKLPTTSPPITTNHPFSPPPSPLNAFTSAISSPAQS-QSSP---LSVPT-- 180

Query: 181 YENVPAAIPTNDFDENSSTTLPTGVASTSQAVSNSNVQSTEQMPP--------QNHHHMT 240
               P++  T      +S T P+     S ++S ++       PP         N H M 
Sbjct: 181 ----PSSSHTIPLSPTASVTPPSVHTPISASLSPNSASEGVPTPPPPAHKIHPHNSHSMA 240

Query: 241 TRAKNGIFKPRIF-LANYTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNN 300
           TRAK+GI +PR+      T +EP + K AL  P W+ AM+DEY+ALL+N TW+L  LP++
Sbjct: 241 TRAKHGIVQPRLHPTLLLTELEPTSYKTALQDPKWLAAMKDEYNALLKNNTWTLTLLPSD 300

Query: 301 KKLVGCKWVFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLA 360
           +K++GCKWVF+VK+N DGSI +YKARLVAKGFHQ    D+TETFSPVVK +T+R + T+A
Sbjct: 301 RKVIGCKWVFRVKQNPDGSILKYKARLVAKGFHQQHGFDFTETFSPVVKPITVRTVLTIA 360

Query: 361 LSHNWSLRQVDINNTFLHGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWY 418
           +S  W + ++D+NN FL+G+L E+VYM QP GFV + +S LVCKL+KALYGLKQAPRAW+
Sbjct: 361 ISRQWHITRLDVNNAFLNGILEEEVYMQQPPGFVNSDTS-LVCKLNKALYGLKQAPRAWF 420

BLAST of MC10g_new0109 vs. ExPASy TrEMBL
Match: A0A2K3MP35 (Histone deacetylase OS=Trifolium pratense OX=57577 GN=L195_g015711 PE=4 SV=1)

HSP 1 Score: 402 bits (1033), Expect = 3.57e-124
Identity = 217/419 (51.79%), Postives = 282/419 (67.30%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +P+TFWD AF TAVYLINRLP+S +   +P   LF   P Y FLK FGC CFP LR Y+ 
Sbjct: 745  LPITFWDYAFPTAVYLINRLPSSSINFQTPYFLLFKQHPDYHFLKVFGCACFPLLRPYHN 804

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSP 120
            HK++ RS  CLF+GYS  H GY+CLSPSGRLY+S+ VLF+E+ FPY +LF   +  S SP
Sbjct: 805  HKLEFRSQECLFLGYSPSHKGYRCLSPSGRLYVSKDVLFNESRFPYKELFPISSGSSHSP 864

Query: 121  TIQT-TLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTNDF 180
              ++  LP   P P+ PS   D +    P+    SSP   +  P  +  N P +   +D 
Sbjct: 865  PSKSFKLP---PLPTFPSITTDITSPLPPTAPHISSPPTPINDP--SPPNSPLSATASDQ 924

Query: 181  DENSSTTLPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPRIFLANYTAVE 240
                ST  P+  +STS  VS         + P N H+M TRAK+G  +P++ +A+    E
Sbjct: 925  SSPLSTPSPSTASSTSHHVSIPPRAVPVPIIPVNAHNMQTRAKSGFKQPKLLVAHS---E 984

Query: 241  PPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDGSIAR 300
            P +VK+AL  P W  AMQ EYDALL N TW+LVPLP +++ +GCKWVF++K N DG++ +
Sbjct: 985  PKSVKQALLDPSWHAAMQTEYDALLNNNTWTLVPLPPDRQAIGCKWVFRIKENPDGTVNK 1044

Query: 301  YKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLHGMLT 360
            YKARLVAKGFHQ    D+ ETFSPVVK VTIR++ T+A++  WS++Q+D+NN FL+G+L 
Sbjct: 1045 YKARLVAKGFHQRQGFDFLETFSPVVKPVTIRVILTIAITKGWSIQQLDVNNAFLNGVLD 1104

Query: 361  EDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTSKADSSL 418
            E+VYM+QP GF ++S S LVCKLHKALYGLKQAPR W+ERL S L  LGF +SK D SL
Sbjct: 1105 EEVYMLQPQGF-ESSDSSLVCKLHKALYGLKQAPRQWFERLQSTLLLLGFKSSKCDPSL 1154

BLAST of MC10g_new0109 vs. ExPASy TrEMBL
Match: A0A438K147 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2516 PE=4 SV=1)

HSP 1 Score: 398 bits (1023), Expect = 3.80e-123
Identity = 211/432 (48.84%), Postives = 277/432 (64.12%), Query Frame = 0

Query: 1    MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
            +PL FWD++F T VYL NRLPT+VL+   PIE LF   P Y FLK FGC CFP+LR YN 
Sbjct: 659  LPLKFWDESFRTVVYLSNRLPTAVLHHKCPIEVLFKSIPDYSFLKVFGCSCFPNLRPYNT 718

Query: 61   HKIQPRSSACLFIGYSNIHHGYKCLSPSGRLYISRHVLFDETVFPYLQLFQKPTSHSSSP 120
            HK+Q RS  C F+GYS  H GYKC+S +GR+YISR V+F+ET FPY +  Q  +   S+ 
Sbjct: 719  HKLQYRSEECTFLGYSLKHKGYKCMSSNGRVYISRDVIFNETSFPYSKTIQVSSCLPST- 778

Query: 121  TIQTTLPIISPTPSSPSPQRDTSVVRDPSESCESSPDIHVALPVNNYENVPAAIPTNDFD 180
                    +SP+ S  SP     V+        +SP I  A P++  +N+   + T+   
Sbjct: 779  --------VSPSTSHLSPSASPPVLSPTMLPAPTSP-ISSARPISEMDNI---VSTHPHA 838

Query: 181  ENSSTT-----------LPTGVASTSQAVSNSNVQSTEQMPPQNHHHMTTRAKNGIFKPR 240
             NS+ T           + T V     ++++++V  T      N H M TRAK+GI KP+
Sbjct: 839  PNSADTTLTPAQVVSNPVATPVQHVVSSIADASVTRTIAKDADNTHPMITRAKSGIVKPK 898

Query: 241  IFLANYTAVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKV 300
            IF+A     EP +V  AL    W KAM  EYDAL RN TWSLVPLP  ++ +GCKWV+K 
Sbjct: 899  IFIA--AVREPSSVSAALQQDEWKKAMVAEYDALQRNNTWSLVPLPAGRQAIGCKWVYKT 958

Query: 301  KRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDI 360
            K N DG++ +YKARLVAKGFHQ    D+TETFSPVVK  TIR++FT+ALS NW+++Q+D+
Sbjct: 959  KENPDGTVQKYKARLVAKGFHQQAGFDFTETFSPVVKPSTIRVVFTIALSRNWAIKQLDV 1018

Query: 361  NNTFLHGMLTEDVYMMQPSGFVQASSSRLVCKLHKALYGLKQAPRAWYERLTSYLHTLGF 420
            NN FL+G L E+V+M QP GF+   +  LVC+LHKALYGLKQAPRAW+E+L   L + GF
Sbjct: 1019 NNAFLNGDLQEEVFMQQPQGFIDEKNPNLVCRLHKALYGLKQAPRAWFEKLHQALLSFGF 1075

BLAST of MC10g_new0109 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 167.9 bits (424), Expect = 1.7e-41
Identity = 83/189 (43.92%), Postives = 115/189 (60.85%), Query Frame = 0

Query: 237 AVEPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLVPLPNNKKLVGCKWVFKVKRNLDGS 296
           A EP T  EA     W  AM DE  A+    TW +  LP NKK +GCKWV+K+K N DG+
Sbjct: 83  AKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGT 142

Query: 297 IARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRILFTLALSHNWSLRQVDINNTFLHG 356
           I RYKARLVAKG+ Q   ID+ ETFSPV K  +++++  ++  +N++L Q+DI+N FL+G
Sbjct: 143 IERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNG 202

Query: 357 MLTEDVYMMQPSGFVQASSSRL----VCKLHKALYGLKQAPRAWYERLTSYLHTLGFCTS 416
            L E++YM  P G+       L    VC L K++YGLKQA R W+ + +  L   GF  S
Sbjct: 203 DLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQS 262

Query: 417 KADSSLLFR 422
            +D +   +
Sbjct: 263 HSDHTYFLK 271

BLAST of MC10g_new0109 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 125.2 bits (313), Expect = 1.3e-28
Identity = 66/125 (52.80%), Postives = 84/125 (67.20%), Query Frame = 0

Query: 217 MTTRAKNGIFK--PRIFLANYTAV--EPPTVKEALTSPHWVKAMQDEYDALLRNQTWSLV 276
           M TR+K GI K  P+  L   T +  EP +V  AL  P W +AMQ+E DAL RN+TW LV
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 277 PLPNNKKLVGCKWVFKVKRNLDGSIARYKARLVAKGFHQTVDIDYTETFSPVVKQVTIRI 336
           P P N+ ++GCKWVFK K + DG++ R KARLVAKGFHQ   I + ET+SPVV+  TIR 
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120

Query: 337 LFTLA 338
           +  +A
Sbjct: 121 ILNVA 125

BLAST of MC10g_new0109 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 46.2 bits (108), Expect = 7.5e-05
Identity = 23/67 (34.33%), Postives = 40/67 (59.70%), Query Frame = 0

Query: 1  MPLTFWDDAFSTAVYLINRLPTSVLYGTSPIETLFGFQPHYLFLKTFGCLCFPSLRLYNQ 60
          +P TF  DA +TAV++IN+ P++ +    P E  F   P Y +L+ FGC+ +      ++
Sbjct: 18 LPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYLRRFGCVAYIHC---DE 77

Query: 61 HKIQPRS 68
           K++PR+
Sbjct: 78 GKLKPRA 81

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94HW24.9e-8641.38Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT944.9e-8641.41Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109781.9e-5332.49Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041462.3e-3527.96Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925201.8e-2752.80Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
KYP64199.12.71e-12652.83Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
PNX89511.11.54e-12448.82retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium pratense][more]
MCH81678.11.83e-12449.21retrovirus-related Pol polyprotein from transposon TNT 1-94 [Trifolium medium][more]
PNX92571.17.38e-12451.79histone deacetylase [Trifolium pratense][more]
RVX14937.17.84e-12348.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
A0A151TAX51.31e-12652.83Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan OX=... [more]
A0A2K3MFF17.46e-12548.82Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium pratens... [more]
A0A392M4K28.87e-12549.21Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Trifolium medium ... [more]
A0A2K3MP353.57e-12451.79Histone deacetylase OS=Trifolium pratense OX=57577 GN=L195_g015711 PE=4 SV=1[more]
A0A438K1473.80e-12348.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
AT4G23160.11.7e-4143.92cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.11.3e-2852.80Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.17.5e-0534.33Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 266..418
e-value: 2.2E-47
score: 161.7
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 113..150
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 113..157
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 178..216
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 5..418
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 265..409

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
MC10g_new0109.1MC10g_new0109.1mRNA