Pay0016367.1 (mRNA) Melon (Payzawat) v1

Overview
Name: Pay0016367.1
Type: mRNA
Organism: Cucumis melo L. var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Location: chr08: 14122344 .. 14126878 (+)
Sequence length: 3429
RNA-Seq Expression: Pay0016367.1
Synteny: Pay0016367.1
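
The Location field above follows a "chromosome: start .. end (strand)" layout. As a minimal sketch, the following Python parses such a string and computes the genomic span. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper written for illustration, not part of any database API.

```python
import re

# Hypothetical pattern for "chr08: 14122344 .. 14126878 (+)" style strings.
LOCATION_RE = re.compile(r"^(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)$")

def parse_location(text):
    """Split a location string into chromosome, start, end, strand and span."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    # Assumption: coordinates are 1-based and inclusive, hence the +1.
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "span": end - start + 1,
    }

loc = parse_location("chr08: 14122344 .. 14126878 (+)")
print(loc["chrom"], loc["strand"], loc["span"])
```

Under that assumption the genomic span is 4535 bp, which is longer than the listed sequence length of 3429, as would be expected if the gene region contains introns that are absent from the mRNA.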
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATAATCAGAGAAGGATCATCTGCATCACGTCCTCCTATTTTAGATGGTAAAAACTACTCATATTGGAAACCTCGAATGATCTTCTTCATTAAGACGTTAGACAGAAAAGCTTGGAGAGCTCTTGTTGCGGGTTATGATCCTCCTATGATTACCGTAAATGGTGTTTTGGTTCCAAAACCTGAAGTTGGTTGGACCAATGCTGAAGAGCAAGCTTCCGTTGGGAATGCTAGAGCACTTAACGCAATATTCAATGGTGTTAACCTGAACATTTTCAAGTTAATAAATTCTTGTAGTACAGCCAAAGAAGCCTAAAAAATCTTGGAGGTAGCGTATGAAGGTACTTCCAAAGTAAAGATCGTAAGATTACAGTTGATACGTCTAAATTTGAGGCATTGAGAATGACCGAAGATGAATCAGTGTCTGATTACAATAAGAGAGTTCTTGAAATTGCAAATGAATCTTTGCTGCTCGGTGAAAAAAATATCTGACACTAAAATAGTACGAAAAGTACTTCGATCCTTGCCCAGAAAATTTGATATGAAAGTAACTGCCATTGAGGAAGCTCATTGTATTACAACATTGAGACTTGATGAATTGTTTAGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAATAAGAAAGGCAAGGGAATTGATTTTAAGTCCACACATGAAGGTGAGGCGGCAGTAAGTGACACTGAAGCAAACATGGATGAGTCAATAGCTTTGTTAACAAAACAATTCATAAACGTCCTCCGAAAATTTAAAAATACGAATGCCACAGGTATGAATGCTCAAACTTCTAATTAGTATCGAAGAAGGAATGATGAAGGCACTACCAGGAGGAACAATGAAAATTCTAATAGAAGAAGTAATGGTTATATTAAAAAAAAGGAAGGTGACGAGAGGATTTTCAGGTGTAGGGAATCTTGAGGTGTTGGTCACTATCAGGCAGAATGTCCCATATTCCTGAGAAAATAGAAGAAAAACTTTCGTGCCACACTGTCAGATGAAGAATCTGGTGATAGTAGAGATGATGATGGAAACATAAATGCCTTCACAATACAAATTACTGATGAGAACACTGATGATGAAAGTGAATGTTCTGAAGAAAGCAAAAACGATAAGCTGTCAATTGAGAAGCTTGAAGCTCTATGGAAAGAAGATTGTGAAGCAAGGGCAAAACAAAAGGAAAGGATTCAAGATCTTATAGAAGAAAATGAACGGTTGATGTCTGTAATATCTTCCCTAAAGATAAAATTGAGAGAGGTTCAGAATGAAAATGATCGGATTTTAAAATCCGTTAAAATGCTAAACTCAGGAACGGAGAACCTAGATTCAATACTCAATTAAATGGATTTGACATGCACAACCTCGCATGGTCTGGAGAATTAAATCTGCTGAAAGATGTAAGATTGTCTTTACATCCATTTAGACCACAGATGATGCATGATATTTTGATAATGGGTGCTCCAGACATATGACTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTAGTGATGGTGCAAAAGGAAAAATTATGGCTAAAGGTAACATAGATAAAAATAATCTGCCACGTTTAAATGATGTTAGGTATGTGGATGGACTAAAAGCAAATTTGATCTGTATAAGTCAACTATGTGATCAAGGCTACAAAGTCAGTTTTGATGATATTGGTTGTGTTGTTATGAATAAAGAAAATCAGATTTGTATGAGAGGTAAACGACAAACTGATAACTGTTATCACTGGAACTCAAATACGTCATACACCTGTCATTTGACAAGATCAGATCAAACGTGGCTATGACATAAAAAGTTGGGCCATGTCAGTATGAGGGGCTTGGAAAAAATTATTAAAAATGAAGCAATTGTGGGAATTCCTGATTTAGATGTAAATGGAAAATTCTTCTGTGGAGACTGTCAAATTGGCAAAAGACAAGGTCTACTCACAAAAGTCT
GAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACGTGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAAGGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGAGAAAAAGATAACGAGGATCCGAAGTGATAATGGTAAGGAGTTTGATAATGAGGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAGAGATGGCACGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATGGGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTATAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAGTTTTAAATGATCTCGATTCAACCATCAAATAGATGATAGATGAGGAAGATGAGACTTCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAACCATCAAAAGAAATTATCACTAAAAAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAAAAGGAAAGAAAAGATTGTTTACTTAAAGATGGTTGATGATTTATGTTATACTTTTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAGTATTGACTAAATGCTATGCAAGAGGAGCTACTCTAATTCAGACGAAACAATATATGGACGTTAGTTTCCAAGCCAGAAGGTGTAAACATTATTGGCACTAAATGGGTATTTAAAAATAAGACTAATGAAGCTGGATGTGTGACAAAAAATAAAGCCAGATTAGTAGCTCAAGGCTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTGTTACTTGATATATCATGCATACAGAAATTTAAATTATATCAGATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCAAAGCATGTGTATAAGCTCAGCAAAGCTCTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTATTTACTTAAGAGGTAAAGGATATTCCAGAGGAGAAATTGACAAGACCTTGTTTATACACAGGGAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATTTTGTCCATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGTATGGTTGGAGAACTTTCGTGCTTTTTGGGTCTTCAAATCAAGTAAAAGAATGACGGCATATTCATATCTCAAGAAAAGTATGCCAAGGATATGATTAAAAAGTTTGGTTTGGGACAGGCTCAAAATAAGTGGACTCTAACTGCGACACATGTTAAACTTATAAGAAATACTGATGGTGCTGAAGTTGATCACAAACTCTACAGGAGTATAATAGGCAGCTTATTGTATTTAACAGCAAGTCGACCTGACATATCTTATGCTGTGGGAATATGTGCCCGTTATCAGGCTGATCCTCGCATCTCTCATCTAGAAGCTGTTA
AACGAATTCTTAAGTATGTTCAGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACCCCCACTCTTGTTGGATATTGTGATGCTGACTGGGCAGGTTCGGCTGATGATCGAAAAAGTACATCTGAAGGATGTTTCTTTTTAGGAAACAATTTAATTTCTTAGTTGAGTAAGAAGCGAAACTGTGTCTCTTTATCTACAGTTGAAGCTGAATATATAGCAGTAGGCAGTGGTTGTACTCAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCTTTGATCAAGACACTATGATGTTGTATTGTGACAATATGAGCGCAGTTGATATATCGAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGAGAACTTGTTGAAAATAAAGTAATTAGGCTTGATCATATTCATTCCAACTTACAATTAGTCGATATTTTCACTAAGCCTTTGGATGCAAATTCATTTGAATACTTACATGCTGGTTTAGGAGTGTGTCGCACTTAA

mRNA sequence

ATGGAGATAATCAGAGAAGGATCATCTGCATCACGTCCTCCTATTTTAGATGGTAAAAACTACTCATATTGGAAACCTCGAATGATCTTCTTCATTAAGACGTTAGACAGAAAAGCTTGGAGAGCTCTTGTTGCGGGTTATGATCCTCCTATGATTACCGTAAATGGTGTTTTGGTTCCAAAACCTGAAGTTGGTTGGACCAATGCTGAAGAGCAAGCTTCCGTTGGGAATGCTAGAGCACTTAACGCAATATTCAATGGTGTTAACCTGAACATTTTCAAGTTAATAAATTCTTCGTATGAAGGTACTTCCAAAGTAAAGATCGTAAGATTACAAGAGTTCTTGAAATTGCAAATGAATCTTTGCTGCTCGGTGAAAAAAATATCTGACACTAAAATAGTACGAAAAGTACTTCGATCCTTGCCCAGAAAATTTGATATGAAAGTAACTGCCATTGAGGAAGCTCATTGTATTACAACATTGAGACTTGATGAATTGTTTAGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAATAAGAAAGGCAAGGGAATTGATTTTAAGTCCACACATGAAGGTGAGGCGGCAGTAAGTGACACTGAAGCAAACATGGATGAGTCAATAGCTTTGTTAACAAAACAATTCATAAACGTCCTCCGAAAATTTAAAAATACGAATGCCACAGGAACAATGAAAATTCTAATAGAAGAAGTAATGGTTATATTAAAAAAAAGGAAGGTGACGAGAGGATTTTCAGGTGTAGGGAATCTTGAGGTACATATGACTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTAGTGATGGTGCAAAAGGAAAAATTATGGCTAAAGGTAACATAGATAAAAATAATCTGCCACGTTTAAATGATGTTAGGTATGTGGATGGACTAAAAGCAAATTTGATCTGTATAAGTCAACTATGTGATCAAGGCTACAAAGTCAGTTTTGATGATATTGGTTGTGTTGTTATGAATAAAGAAAATCAGATTTGTATGAGAGGTAAACGACAAACTGATAACTGTTATCACTGGAACTCAAATACGTCATACACCTGTCATTTGACAAGATCAGATCAAACACTGTCAAATTGGCAAAAGACAAGGTCTACTCACAAAAGTCTGAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACGTGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAAGGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGAGAAAAAGATAACGAGGATCCGAAGTGATAATGGTAAGGAGTTTGATAATGAGGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAGAGATGGCACGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATGGGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTATAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAATGATAGATGAGGAAGATGAGACTTCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAACCATCAAAAGAAATTATCAC
TAAAAAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAAAAGGAAAGAAAAGATTGTTTACTTAAAGATGGTTGATGATTTATGTTATACTTTTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAACGAAACAATATATGGACGTTAGTTTCCAAGCCAGAAGGTGTAAACATTATTGGCACTAAATGGGTATTTAAAAATAAGACTAATGAAGCTGGATGTGTGACAAAAAATAAAGCCAGATTAGTAGCTCAAGGCTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTGTTACTTGATATATCATGCATACAGAAATTTAAATTATATCAGATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCAAAGCATGTGTATAAGCTCAGCAAAGCTCTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTATTTACTTAAGAGGTAAAGGATATTCCAGAGGAGAAATTGACAAGACCTTGTTTATACACAGGGAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATTTTGTCCATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGTATGGTTGGAGAACTTTCGTGCTTTTTGGAAAAGTATGCCAAGGATATGATTAAAAAGTTTGGTTTGGGACAGGCTCAAAATAAGTGGACTCTAACTGCGACACATGTTAAACTTATAAGAAATACTGATGGTGCTGAAGTTGATCACAAACTCTACAGGAGTATAATAGGCAGCTTATTGTATTTAACAGCAAGTCGACCTGACATATCTTATGCTGTGGGAATATGTGCCCGTTATCAGGCTGATCCTCGCATCTCTCATCTAGAAGCTGTTAAACGAATTCTTAATAAGAAGCGAAACTGTGTCTCTTTATCTACAGTTGAAGCTGAATATATAGCAGTAGGCAGTGGTTGTACTCAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCTTTGATCAAGACACTATGATGTTGTATTGTGACAATATGAGCGCAGTTGATATATCGAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGAGAACTTGTTGAAAATAAAGTAATTAGGCTTGATCATATTCATTCCAACTTACAATTAGTCGATATTTTCACTAAGCCTTTGGATGCAAATTCATTTGAATACTTACATGCTGGTTTAGGAGTGTGTCGCACTTAA

Coding sequence (CDS)

ATGGAGATAATCAGAGAAGGATCATCTGCATCACGTCCTCCTATTTTAGATGGTAAAAACTACTCATATTGGAAACCTCGAATGATCTTCTTCATTAAGACGTTAGACAGAAAAGCTTGGAGAGCTCTTGTTGCGGGTTATGATCCTCCTATGATTACCGTAAATGGTGTTTTGGTTCCAAAACCTGAAGTTGGTTGGACCAATGCTGAAGAGCAAGCTTCCGTTGGGAATGCTAGAGCACTTAACGCAATATTCAATGGTGTTAACCTGAACATTTTCAAGTTAATAAATTCTTCGTATGAAGGTACTTCCAAAGTAAAGATCGTAAGATTACAAGAGTTCTTGAAATTGCAAATGAATCTTTGCTGCTCGGTGAAAAAAATATCTGACACTAAAATAGTACGAAAAGTACTTCGATCCTTGCCCAGAAAATTTGATATGAAAGTAACTGCCATTGAGGAAGCTCATTGTATTACAACATTGAGACTTGATGAATTGTTTAGTTCGTTGCTTACGTTTGAGATGGCCACTGCTGATAGAGAAAATAAGAAAGGCAAGGGAATTGATTTTAAGTCCACACATGAAGGTGAGGCGGCAGTAAGTGACACTGAAGCAAACATGGATGAGTCAATAGCTTTGTTAACAAAACAATTCATAAACGTCCTCCGAAAATTTAAAAATACGAATGCCACAGGAACAATGAAAATTCTAATAGAAGAAGTAATGGTTATATTAAAAAAAAGGAAGGTGACGAGAGGATTTTCAGGTGTAGGGAATCTTGAGGTACATATGACTGGAAACAGATCCTACTTTACGAACTTAAAAGACTGTGTCACTGGACATGTTACCTTTAGTGATGGTGCAAAAGGAAAAATTATGGCTAAAGGTAACATAGATAAAAATAATCTGCCACGTTTAAATGATGTTAGGTATGTGGATGGACTAAAAGCAAATTTGATCTGTATAAGTCAACTATGTGATCAAGGCTACAAAGTCAGTTTTGATGATATTGGTTGTGTTGTTATGAATAAAGAAAATCAGATTTGTATGAGAGGTAAACGACAAACTGATAACTGTTATCACTGGAACTCAAATACGTCATACACCTGTCATTTGACAAGATCAGATCAAACACTGTCAAATTGGCAAAAGACAAGGTCTACTCACAAAAGTCTGAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACGTGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAAGGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAAAAAGAGAAAAAGATAACGAGGATCCGAAGTGATAATGGTAAGGAGTTTGATAATGAGGGCTTTAACAGTTTTTGTCTGTTAGAAGGAATACACCATGAATTTTCTGCACCTATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAGAGATGGCACGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAGCTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCATGTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATGGGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTATAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAATGATAGATGAGGAAGATGAGACTTCAAACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAACCATCAAAAGAAATTATCAC
TAAAAAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAAAAGGAAAGAAAAGATTGTTTACTTAAAGATGGTTGATGATTTATGTTATACTTTTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAACGAAACAATATATGGACGTTAGTTTCCAAGCCAGAAGGTGTAAACATTATTGGCACTAAATGGGTATTTAAAAATAAGACTAATGAAGCTGGATGTGTGACAAAAAATAAAGCCAGATTAGTAGCTCAAGGCTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTGTTACTTGATATATCATGCATACAGAAATTTAAATTATATCAGATGGATGTAAAGAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCAAAGCATGTGTATAAGCTCAGCAAAGCTCTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTATTTACTTAAGAGGTAAAGGATATTCCAGAGGAGAAATTGACAAGACCTTGTTTATACACAGGGAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATGACATCATTTTTGGAGGTTTTCCTCAAGATTTTGTCCATAATTTCATTAACATCATGCAGTCAGAATTCGAAATGAGTATGGTTGGAGAACTTTCGTGCTTTTTGGAAAAGTATGCCAAGGATATGATTAAAAAGTTTGGTTTGGGACAGGCTCAAAATAAGTGGACTCTAACTGCGACACATGTTAAACTTATAAGAAATACTGATGGTGCTGAAGTTGATCACAAACTCTACAGGAGTATAATAGGCAGCTTATTGTATTTAACAGCAAGTCGACCTGACATATCTTATGCTGTGGGAATATGTGCCCGTTATCAGGCTGATCCTCGCATCTCTCATCTAGAAGCTGTTAAACGAATTCTTAATAAGAAGCGAAACTGTGTCTCTTTATCTACAGTTGAAGCTGAATATATAGCAGTAGGCAGTGGTTGTACTCAGTTGATTTGGATGAAAAATATGTTGCATGAATATGGCTTTGATCAAGACACTATGATGTTGTATTGTGACAATATGAGCGCAGTTGATATATCGAAGAATCCTGTTCAACATAGTCGAACAAAGCACATTGACATAAGACATCATTTTATTCGAGAACTTGTTGAAAATAAAGTAATTAGGCTTGATCATATTCATTCCAACTTACAATTAGTCGATATTTTCACTAAGCCTTTGGATGCAAATTCATTTGAATACTTACATGCTGGTTTAGGAGTGTGTCGCACTTAA

Protein sequence

MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMITVNGVLVPKPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLINSSYEGTSKVKIVRLQEFLKLQMNLCCSVKKISDTKIVRKVLRSLPRKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEANMDESIALLTKQFINVLRKFKNTNATGTMKILIEEVMVILKKRKVTRGFSGVGNLEVHMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHLTRSDQTLSNWQKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFLEKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQADPRISHLEAVKRILNKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTMMLYCDNMSAVDISKNPVQHSRTKHIDIRHHFIRELVENKVIRLDHIHSNLQLVDIFTKPLDANSFEYLHAGLGVCRT
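
The protein sequence corresponds to the coding sequence read three bases at a time. The sketch below, assuming the standard nuclear genetic code and a CDS that starts in frame, shows how the first and last codons of the CDS map onto the first and last residues of the protein. The `translate` function is a hypothetical helper written for this illustration; it is not the tool used to generate the page.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGAGATAATCAGAGAAGGA"))
print(translate("TGTCGCACTTAA"))
```

The 3429-nucleotide CDS holds 1143 codons; with the terminal TAA stop codon excluded, that corresponds to a 1142-residue protein.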
Homology
BLAST of Pay0016367.1 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 404.4 bits (1038), Expect = 4.1e-111
Identity = 254/846 (30.02%), Positives = 418/846 (49.41%), Query Frame = 0

Query: 400  VLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQRE 459
            +L L++ D+  PM+ ES+GG +Y +  +DD SR  WV+ LK K    ++ +     ++RE
Sbjct: 480  ILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERE 539

Query: 460  KEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVM 519
              +K+ R+RSDNG E+ +  F  +C   GI HE + P TPQ NGV ER NRT+ E  R M
Sbjct: 540  TGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSM 599

Query: 520  IHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILA 579
            +    LP  FW EAV TAC++ NR             +W  ++ +  +  VFG   +   
Sbjct: 600  LRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHV 659

Query: 580  DREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVMIDEED--ETSNMSEAR 639
             +E R K D +S   IF+GY      YR+++     V+ + +V+  E +    ++MSE  
Sbjct: 660  PKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSEKV 719

Query: 640  TTSTV-----------EVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSS 699
                +             + A++ +D++ +  E+P +  + ++ E +     +  HPT  
Sbjct: 720  KNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGE--VIEQGEQLDEGVEEVEHPTQG 779

Query: 700  ------IIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKD----------- 759
                  +       ++++R     Y+ + DD       +P ++   L             
Sbjct: 780  EEQHQPLRRSERPRVESRRYPSTEYVLISDDR------EPESLKEVLSHPEKNQLMKAMQ 839

Query: 760  ------ERNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDF 819
                  ++N  + LV  P+G   +  KWVFK K +    + + KARLV +G+ Q +G+DF
Sbjct: 840  EEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDF 899

Query: 820  DETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPK 879
            DE F+PV ++ +IR +L ++     ++ Q+DVK+AFL+G L EE+Y+ QP+GF  +    
Sbjct: 900  DEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKH 959

Query: 880  HVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRES-NQLLVAQIYVDD 939
             V KL+K+LYGLKQAPR WY +   +++ + Y +   D  ++  R S N  ++  +YVDD
Sbjct: 960  MVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDD 1019

Query: 940  IIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL-----------------EKYAKDMIK 999
            ++  G  +  +      +   F+M  +G     L                 EKY + +++
Sbjct: 1020 MLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLE 1079

Query: 1000 KFGLGQAQNKWTLTATHVKLIRNTDGAEVDHK------LYRSIIGSLLY-LTASRPDISY 1059
            +F +  A+   T  A H+KL +      V+ K       Y S +GSL+Y +  +RPDI++
Sbjct: 1080 RFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAH 1139

Query: 1060 AVGICARYQADPRISHLEAVKRIL------------------------------------ 1119
            AVG+ +R+  +P   H EAVK IL                                    
Sbjct: 1140 AVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRK 1199

Query: 1120 -----------------NKKRNCVSLSTVEAEYIAVGSGCTQLIWMKNMLHEYGFDQDTM 1132
                             +K + CV+LST EAEYIA      ++IW+K  L E G  Q   
Sbjct: 1200 SSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEY 1259
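
Each HSP summary reports identities and positives as a count over the alignment length, followed by the percentage. As a minimal sketch, the following Python parses such a summary line and recomputes the percentages from the counts. `parse_hsp_stats` is a hypothetical helper, and the pattern is an assumption based only on the lines shown in this report; it tolerates either spelling of the "Positives" label.

```python
import re

# Assumed layout: "Identity = a/n (x%), Positives = b/n (y%), ..."
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_rep, pos, pos_len, pos_rep = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        "identity_pct": 100 * int(ident) / int(ident_len),
        "positive_pct": 100 * int(pos) / int(pos_len),
        "reported_identity_pct": float(ident_rep),
        "reported_positive_pct": float(pos_rep),
    }

stats = parse_hsp_stats(
    "Identity = 254/846 (30.02%), Positives = 418/846 (49.41%), Query Frame = 0"
)
print(round(stats["identity_pct"], 2), round(stats["positive_pct"], 2))
```

For the P10978 hit, the recomputed values agree with the reported 30.02% identity and 49.41% positives.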

BLAST of Pay0016367.1 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 340.1 bits (871), Expect = 9.6e-92
Identity = 262/948 (27.64%), Positives = 429/948 (45.25%), Query Frame = 0

Query: 381  NWQKTRSTHKSLKE-CYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFL 440
            N ++ R   K LK+  +  R L ++H D+  P+   +L  K Y ++ VD ++ Y   + +
Sbjct: 460  NGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLI 519

Query: 441  KGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITP 500
            K K+D   + ++   K +     K+  +  DNG+E+ +     FC+ +GI +  + P TP
Sbjct: 520  KYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTP 579

Query: 501  QQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR--TRTTVTLYEL 560
            Q NGV ER  RT+ E AR M+    L   FW EAV TA ++ NR+  R    ++ T YE+
Sbjct: 580  QLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEM 639

Query: 561  WKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNS-RAYRVFNNKSVSV 620
            W  +KP +K+  VFG+  Y+   +  + K+D +S + IF+GY  N  + +   N K +  
Sbjct: 640  WHNKKPYLKHLRVFGATVYVHI-KNKQGKFDDKSFKSIFVGYEPNGFKLWDAVNEKFIVA 699

Query: 621  -----------------METINVMIDEEDETSNM-------------SEARTTSTVEVSK 680
                              ET+ +   +E E  N              +E++    ++  K
Sbjct: 700  RDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLK 759

Query: 681  ADNPSDDLGKNLEKPSKEII---------------------------------------- 740
                S++  KN    S++II                                        
Sbjct: 760  DSKESEN--KNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHL 819

Query: 741  ---------TKKSELITFAHVKK---NHPTSS----IIGDPSAGMQTK-----RKEKIVY 800
                      +  E  T  H+K+   ++PT +    II   S  ++TK      +E    
Sbjct: 820  NESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSL 879

Query: 801  LKMVDDLCYTFTIKPSTVDS-ALKDER----------------NNIWTLVSKPEGVNIIG 860
             K+V +    F   P++ D    +D++                NN WT+  +PE  NI+ 
Sbjct: 880  NKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVD 939

Query: 861  TKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKF 920
            ++WVF  K NE G   + KARLVA+G+TQ   +D++ETFAPVAR+ + R +L +      
Sbjct: 940  SRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNL 999

Query: 921  KLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTI 980
            K++QMDVK+AFLNG L EE+Y+  P+G   S +  +V KL+KA+YGLKQA R W+E    
Sbjct: 1000 KVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQ 1059

Query: 981  YLRGKGYSRGEIDKTLFIHRES--NQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEM 1040
             L+   +    +D+ ++I  +   N+ +   +YVDD++        ++NF   +  +F M
Sbjct: 1060 ALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRM 1119

Query: 1041 SMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKWTLTATHVKL-IRNTD 1100
            + + E+  F+                 Y K ++ KF +       T   + +   + N+D
Sbjct: 1120 TDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSD 1179

Query: 1101 GAEVDHKLYRSIIGSLLY-LTASRPDISYAVGICARYQADPRISHLEAVKRIL------- 1140
              E  +   RS+IG L+Y +  +RPD++ AV I +RY +       + +KR+L       
Sbjct: 1180 --EDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTI 1239

BLAST of Pay0016367.1 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 327.8 bits (839), Expect = 4.9e-88
Identity = 296/1106 (26.76%), Positives = 458/1106 (41.41%), Query Frame = 0

Query: 282  VTFSDGAKGKIMAKGNIDKNNLPR---LNDVRYVDGLKANLICISQLCD-QGYKVSFDDI 341
            V  +DG+   I   G+   +   R   L+++ YV  +  NLI + +LC+  G  V F   
Sbjct: 360  VMVADGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPA 419

Query: 342  GCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHL---TRSDQTLSNWQK---------- 401
               V +    + +   +  D  Y W   +S    L     S  T S+W            
Sbjct: 420  SFQVKDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSIL 479

Query: 402  -----------TRSTHK--SLKECYTNRVLKL----LHMDLMRPMQ----------TESL 461
                          +HK  S  +C  N+  K+      ++  RP++            S 
Sbjct: 480  NSVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSSPILSH 539

Query: 462  GGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDN 521
               RY ++ VD ++RYTW++ LK K+   E        L+   + +I    SDNG EF  
Sbjct: 540  DNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEF-- 599

Query: 522  EGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTA 581
                 +    GI H  S P TP+ NG+ ERK+R + E    ++   ++P  +W  A   A
Sbjct: 600  VALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVA 659

Query: 582  CHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFL 641
             ++ NR+        + ++      PN     VFG  CY       + K D +S Q +FL
Sbjct: 660  VYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFL 719

Query: 642  GYSQNSRAY--------RVFNNKSV----------SVMETINVMIDEEDETSNMSEARTT 701
            GYS    AY        R++ ++ V          + + T++ + ++  E+S +    TT
Sbjct: 720  GYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTT 779

Query: 702  --------------------------------STVEVSKAD------------------- 761
                                            S V  S  D                   
Sbjct: 780  LPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQN 839

Query: 762  ------------------------NPSDD----LGKNLEKPSKEIITKKSELITFAHVKK 821
                                    NP+++    L ++L  P++   +  S   T A    
Sbjct: 840  GPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSP-TTSASSSS 899

Query: 822  NHPT--SSIIGDP----------------SAGMQTKRKEKIVYLKMVDDLCYTFTI--KP 881
              PT  S +I  P                +  M T+ K  I+       L  +     +P
Sbjct: 900  TSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEP 959

Query: 882  STVDSALKDER--------------NNIWTLVSKPEG-VNIIGTKWVFKNKTNEAGCVTK 941
             T   ALKDER              N+ W LV  P   V I+G +W+F  K N  G + +
Sbjct: 960  RTAIQALKDERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNR 1019

Query: 942  NKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLN 1001
             KARLVA+GY Q  G+D+ ETF+PV +  +IR++L ++  + + + Q+DV +AFL G L 
Sbjct: 1020 YKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLT 1079

Query: 1002 EEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLF 1061
            ++VY++QP GF+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D +LF
Sbjct: 1080 DDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLF 1139

Query: 1062 IHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL--------- 1121
            + +    ++   +YVDDI+  G     +HN ++ +   F +    EL  FL         
Sbjct: 1140 VLQRGKSIVYMLVYVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPT 1199

Query: 1122 ------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLT 1142
                   +Y  D++ +  +  A+   T  A   KL   +     D   YR I+GSL YL 
Sbjct: 1200 GLHLSQRRYILDLLARTNMITAKPVTTPMAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLA 1259

BLAST of Pay0016367.1 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 324.7 bits (831), Expect = 4.2e-87
Identity = 293/1114 (26.30%), Positives = 460/1114 (41.29%), Query Frame = 0

Query: 282  VTFSDGAKGKIMAKGNIDKNNLPR---LNDVRYVDGLKANLICISQLCDQG-YKVSFDDI 341
            V  +DG+   I   G+       R   LN V YV  +  NLI + +LC+     V F   
Sbjct: 339  VMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPA 398

Query: 342  GCVVMNKENQICMRGKRQTDNCYHWNSNTSYTCHLTR---SDQTLSNWQK---------- 401
               V +    + +   +  D  Y W   +S    +     S  T S+W            
Sbjct: 399  SFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPSLAIL 458

Query: 402  -----------TRSTHK--SLKECYTNRVLKL----LHMDLMRPMQ----------TESL 461
                          +HK  S  +C+ N+  K+      +   +P++            S+
Sbjct: 459  NSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPILSI 518

Query: 462  GGKRYVLVVVDDYSRYTWVFFLKGKT---DTVEICKNLCLKLQREKEKKITRIRSDNGKE 521
               RY ++ VD ++RYTW++ LK K+   DT  I K+L   ++   + +I  + SDNG E
Sbjct: 519  DNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSL---VENRFQTRIGTLYSDNGGE 578

Query: 522  FDNEGFNSFCLLEGIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAV 581
            F       +    GI H  S P TP+ NG+ ERK+R + EM   ++   ++P  +W  A 
Sbjct: 579  F--VVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAF 638

Query: 582  NTACHIHNRVTIRTRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQG 641
            + A ++ NR+        + ++    + PN +   VFG  CY       R K + +S+Q 
Sbjct: 639  SVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQC 698

Query: 642  IFLGYSQNSRAY--------RVFNNKSVSVME------TINVMIDEEDETSNMSEARTTS 701
             F+GYS    AY        R++ ++ V   E      T N  +    E  + S     S
Sbjct: 699  AFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPS 758

Query: 702  -----------------------------------TVEVSKADNPSDDLGK--------- 761
                                               T +VS ++ PS  +           
Sbjct: 759  HTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAP 818

Query: 762  ---------------------------NLEKPSKEIITKKSEL----ITFAHV------- 821
                                       N   PS     + S L    I+  H+       
Sbjct: 819  SHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSI 878

Query: 822  -KKNHPTSSIIGDP---------------------SAGMQTKRKEKIVYLKMVDDLCYTF 881
             + N P+SS    P                     +  M T+ K+ I   K      Y  
Sbjct: 879  SEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGI--RKPNQKYSYAT 938

Query: 882  TI----KPSTVDSALKDER--------------NNIWTLV-SKPEGVNIIGTKWVFKNKT 941
            ++    +P T   A+KD+R              N+ W LV   P  V I+G +W+F  K 
Sbjct: 939  SLAANSEPRTAIQAMKDDRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKF 998

Query: 942  NEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKS 1001
            N  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++L ++  + + + Q+DV +
Sbjct: 999  NSDGSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNN 1058

Query: 1002 AFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSR 1061
            AFL G L +EVY++QP GFVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+  
Sbjct: 1059 AFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVN 1118

Query: 1062 GEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL- 1121
               D +LF+ +    ++   +YVDDI+  G     + + ++ +   F +    +L  FL 
Sbjct: 1119 SISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLG 1178

Query: 1122 --------------EKYAKDMIKKFGLGQAQNKWTLTATHVKLIRNTDGAEVDHKLYRSI 1142
                           +Y  D++ +  +  A+   T  AT  KL  ++     D   YR I
Sbjct: 1179 IEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATPMATSPKLTLHSGTKLPDPTEYRGI 1238

BLAST of Pay0016367.1 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 113.6 bits (283), Expect = 1.5e-23
Identity = 72/227 (31.72%), Positives = 110/227 (48.46%), Query Frame = 0

Query: 813  MDVKSAFLNGYLNEEVYVAQPKGFVDSEHPKHVYKLSKALYGLKQAPRAWYERLTIYLRG 872
            MDV +AFLN  ++E +YV QP GFV+  +P +V++L   +YGLKQAP  W E +   L+ 
Sbjct: 1    MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 873  KGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGEL 932
             G+ R E +  L+    S+  +   +YVDD++               +   + M  +G++
Sbjct: 61   IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 933  SCFL------------EKYAKDMIKKFGLGQAQNKWTLTATHV----KLIRNTDGAEVDH 992
              FL                +D I K       N + LT T +     L   T     D 
Sbjct: 121  DKFLGLNIHQSSNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDI 180

Query: 993  KLYRSIIGSLLY-LTASRPDISYAVGICARYQADPRISHLEAVKRIL 1023
              Y+SI+G LL+     RPDISY V + +R+  +PR  HLE+ +R+L
Sbjct: 181  TPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVL 227

BLAST of Pay0016367.1 vs. ExPASy TrEMBL
Match: A0A2Z6P936 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_401320 PE=4 SV=1)

HSP 1 Score: 1106.3 bits (2860), Expect = 0.0e+00
Identity = 651/1459 (44.62%), Positives = 838/1459 (57.44%), Query Frame = 0

Query: 5    REGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMI-TVNG--VLVPK 64
            REG   +RPP+LD  NY+ WK RMI F+K++D + W+A++ G++ P +   NG    V K
Sbjct: 3    REGGYVTRPPLLDDSNYNIWKARMIAFLKSMDSRTWKAVLKGWEHPKVKDANGADTDVLK 62

Query: 65   PEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLI-------------NSSYEGTSKVKI 124
            PE  WT AE+  ++GN++ALNA+FNGV+ N+F+LI              ++ EGT+KVKI
Sbjct: 63   PEEEWTTAEDSLALGNSKALNALFNGVDKNMFRLIKKCEVAKDAWEILKTTQEGTAKVKI 122

Query: 125  VRLQ----EFLKLQMNLCCSV------------------KKISDTKIVRKVLRSLPRKFD 184
             RLQ    +F  L+M    SV                  +K+SD KIVRK+LRSL +KFD
Sbjct: 123  SRLQNLTRKFENLRMKEDESVHNFYMNVMDFANSFDDLGEKLSDEKIVRKILRSLTKKFD 182

Query: 185  MKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEA- 244
            MKV A+EEA  I+T+++DEL  SL T+E +  +R  KK K I F S  E     SD E+ 
Sbjct: 183  MKVIAMEEAQDISTMKVDELIGSLQTYESSVNERIEKKNKSIAFVSNTEDADLESDIEST 242

Query: 245  -NMDESIALLTKQFINVLRKF--------------KNTNATGTMKILIEE---------- 304
             ++ E+I LL +QF  VL++                + N   + K  I+E          
Sbjct: 243  DSVSEAIVLLGRQFNKVLKRMDRRPRQNARHLATDMSRNIGNSRKTKIDEKPAQTNEEID 302

Query: 305  -----------------------VMVILKKRKVTRGFSGVGNLEV--------------- 364
                                   +  IL+K      F G+    V               
Sbjct: 303  RLEVDIGRLRKYSEMLNKTGADKLDEILEKNVRRPKFIGISYENVNKRRSYNPELMYTQP 362

Query: 365  ------------------------------------------------------------ 424
                                                                        
Sbjct: 363  KESPMSRKMLQHSKQHHGVRKPRQFVSPKEWKPKNEDVSSISKIDESLSKDQVAGLIAHT 422

Query: 425  ------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLP 484
                              HMTG   +  ++K   T  VTF DGAKG+I+  G +  NNLP
Sbjct: 423  SFRASSREDWYFDSGCSRHMTGIDKFLVDMKKYSTSFVTFGDGAKGEIVGIGKLINNNLP 482

Query: 485  RLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW-- 544
            +L++V  V GL ANLI ISQLCDQG KV+F    C+V + + ++ M+G    DNCY W  
Sbjct: 483  KLDNVLLVKGLTANLISISQLCDQGLKVNFTKTECLVTDDKGELLMKGVISKDNCYLWVP 542

Query: 545  NSNTSYTCHLTRSDQTLSNW---------------------------------------- 604
              +TS +  L   +  +  W                                        
Sbjct: 543  QEDTSLSTCLIAKEDEVKLWHQRLGHLNLKSMKKAISEEAIRGLPKLKIEEGHICGDCQI 602

Query: 605  -QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKG 664
             ++T++ H+ L+   T RVL+LLHMDLM PMQ  SLGGKRY  VVVDD+SRYTW+ F+K 
Sbjct: 603  GKQTKTPHQKLQHLTTTRVLELLHMDLMGPMQVLSLGGKRYAYVVVDDFSRYTWINFIKE 662

Query: 665  KTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQ 724
            K+DT ++ K+LC++LQREK + I RIRSD+GKEF N  F+ FC  EGI HEFS+PITPQQ
Sbjct: 663  KSDTFDVFKDLCVQLQREKNEVILRIRSDHGKEFQNSRFSEFCASEGIKHEFSSPITPQQ 722

Query: 725  NGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKER 784
            NGVVERKNRTLQE AR M+H K L   FWAEA+NTAC+IHNRVT+R+ TT TLYELWK R
Sbjct: 723  NGVVERKNRTLQEYARAMLHGKKLSYSFWAEAMNTACYIHNRVTLRSGTTSTLYELWKNR 782

Query: 785  KPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETIN 844
            KP VKYFHVFGS CYIL DRE R+K D +S++GIFLGYS NSR+YRV+N+++  +ME+IN
Sbjct: 783  KPTVKYFHVFGSKCYILTDREQRRKLDPKSDEGIFLGYSTNSRSYRVYNSRTKVMMESIN 842

Query: 845  VMIDE--EDETSNMSEARTTSTVEVSKADNPSDD---LGKNLEKPSKEIITKKSELITFA 904
            V+ID+  E  T+++++  TTS  +  + +   +D   +  ++   S   ++KK   I   
Sbjct: 843  VVIDDSAEGRTTDVADDATTSDKQFDETNLLKEDDNNMDTSIITNSTSDLSKKGPSI--- 902

Query: 905  HVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE------ 964
             V+KNHP   IIGDPS G+ T+ K  +     V + C+   I+P  V  AL DE      
Sbjct: 903  RVQKNHPQELIIGDPSQGIATRSKNDV-----VSNACFVSKIEPRNVKEALTDEYWINAM 962

Query: 965  --------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVD 1024
                    RN +W LV +PE VN+IGTKWV+KNK++E G VT+NKARLVAQGY Q+EGVD
Sbjct: 963  QEELGQFKRNEVWDLVPRPENVNVIGTKWVYKNKSDENGNVTRNKARLVAQGYAQIEGVD 1022

Query: 1025 FDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHP 1084
            FDETFAPVA LE+IRLLL ++CI KF+L+QMDVKSAFLNGYLNEEVYV QPKGFVD   P
Sbjct: 1023 FDETFAPVAHLESIRLLLGVACILKFELFQMDVKSAFLNGYLNEEVYVEQPKGFVDPSLP 1082

Query: 1085 KHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDD 1141
             HVYKL KALYGLKQAPRAWYERLT +L  +GY +G  DKTLF+  E  +L++AQIYVDD
Sbjct: 1083 NHVYKLKKALYGLKQAPRAWYERLTEFLLSQGYRKGGNDKTLFVKEEEGKLIIAQIYVDD 1142

BLAST of Pay0016367.1 vs. ExPASy TrEMBL
Match: Q84VH8 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 657/1576 (41.69%), Positives = 858/1576 (54.44%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLINS-------------SYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN+             ++EGTS
Sbjct: 61   DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP
Sbjct: 121  KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV-- 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE     
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  SDTEANMDESIALLTKQFINVLRKF----------------------------------- 300
             DT+  +  ++ LL KQF  VL +                                    
Sbjct: 241  LDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGI 300

Query: 301  ------------------------------------------KNTNA-TG---------- 360
                                                      ++ NA TG          
Sbjct: 301  QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGIFETAEDSSD 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  TDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHEEEISELKGEVGFLN 420

Query: 421  --------TMKIL------IEEVMVILKKRKVTRG------FSG---------------- 480
                    ++K+L      ++EV+++ K     RG      F+G                
Sbjct: 421  SKLETMKKSIKMLNKGSDTLDEVLLLGKNAGNQRGLGFNPKFAGRTTMTEFVPAKNRTGT 480

Query: 481  -----------------------------VGNLEV------------------------- 540
                                          G+++                          
Sbjct: 481  TMSQHLSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWV 540

Query: 541  ------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI 600
                                          HMTG + +  N++ C T +VTF DG+KGKI
Sbjct: 541  PKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKI 600

Query: 601  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRG 660
            +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G
Sbjct: 601  IGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKG 660

Query: 661  KRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW--------------------------- 720
             R  DNCY W    TSY  TC  ++ D+ +  W                           
Sbjct: 661  SRSKDNCYLWTPQETSYSSTCLSSKEDE-VRIWHQRFGHLHLRGMKKIIDKGAVRGIPNL 720

Query: 721  --------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVD 780
                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVD
Sbjct: 721  KIEEGRICGECQIGKQVKMSHQKLRHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVD 780

Query: 781  DYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEG 840
            D+SR+TWV F++ K++T E+ K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EG
Sbjct: 781  DFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEG 840

Query: 841  IHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT 900
            I HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Sbjct: 841  ITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRR 900

Query: 901  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV 960
             T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRV
Sbjct: 901  GTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRV 960

Query: 961  FNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKAD-NPSDDLGKNLEKPSKEIIT 1020
            FN+++ +VME+INV++D+              T+  + AD   S +  +N +  + E   
Sbjct: 961  FNSRTRTVMESINVVVDDLSPARKKDVEEDVRTLGDNVADAAKSGENAENSDSATDESNI 1020

Query: 1021 KKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSAL 1080
             + +  +   ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL
Sbjct: 1021 NQPDKRSSTRIQKMHPKELIIGDPNRGVTTRSRE----VEIVSNSCFVSKIEPKNVKEAL 1080

Query: 1081 KDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQ 1140
             DE              RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQ
Sbjct: 1081 TDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQ 1140

BLAST of Pay0016367.1 vs. ExPASy TrEMBL
Match: Q84VI4 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 661/1585 (41.70%), Positives = 857/1585 (54.07%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLINS-------------SYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN+             ++EGTS
Sbjct: 61   DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVKI RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP
Sbjct: 121  KVKISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSD 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE    D
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  --TEANMDESIALLTKQFINVLRKF----------------------------------- 300
              T+  +  ++ LL KQF  VL +                                    
Sbjct: 241  LNTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKKSDVKPSHSKGI 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGIFETAEDSSD 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  TDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFLN 420

Query: 421  -KNTNATGTMKIL------IEEVMVILKKRKVTRGF------------------------ 480
             K  N T ++K+L      ++EV+++ K     RG                         
Sbjct: 421  SKLENMTKSIKMLNKGSDTLDEVLLLGKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGA 480

Query: 481  ---------------------------SGVGNLEV------------------------- 540
                                          G+++                          
Sbjct: 481  TMSQHRSRHHGMQQKKSKRKKWRCHYCGKYGHIKPFCYHLHPHHGTQSSNSRKKMMWVPK 540

Query: 541  ----------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMA 600
                                        HMTG + +  N++ C T +VTF DG+KGKI+ 
Sbjct: 541  HKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIG 600

Query: 601  KGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKR 660
             G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R
Sbjct: 601  MGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSR 660

Query: 661  QTDNCYHWN-SNTSY--TCHLTRSDQTLSNW----------------------------- 720
              DNCY W    TSY  TC  ++ D+ +  W                             
Sbjct: 661  SKDNCYLWTPQETSYSSTCLSSKEDE-VRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKI 720

Query: 721  ------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDY 780
                        ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+
Sbjct: 721  EEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDF 780

Query: 781  SRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIH 840
            SR+TWV F++ K++T E+ K L L+LQREK+  I RIRSD+G+EF+N     FC  EGI 
Sbjct: 781  SRFTWVKFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRLTEFCTSEGIT 840

Query: 841  HEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT 900
            HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R  T
Sbjct: 841  HEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGT 900

Query: 901  TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFN 960
              TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN
Sbjct: 901  PTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFN 960

Query: 961  NKSVSVMETINVMID----------EED-ETSNMSEARTTSTVE-VSKADNPSDDLGKNL 1020
            +++ +VME+INV++D          EED  TS  + A    + E    +D+ +D+   N+
Sbjct: 961  SRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDE--SNI 1020

Query: 1021 EKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTI 1080
             +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I
Sbjct: 1021 NQPDKRSSTR---------IQKMHPKELIIGDPNRGVTTRSRE----VEIVSNSCFVSKI 1080

Query: 1081 KPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVT 1140
            +P  V  AL DE              RN +W LV +PEG N+IGTKW+FKNKTNE G +T
Sbjct: 1081 EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVIT 1140

BLAST of Pay0016367.1 vs. ExPASy TrEMBL
Match: Q84VI2 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 662/1587 (41.71%), Positives = 857/1587 (54.00%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN-------------SSYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN             S++EGTS
Sbjct: 61   DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP
Sbjct: 121  KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV-- 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE     
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  SDTEANMDESIALLTKQFINVLRKF----------------------------------- 300
             DT+  +  ++ LL KQF  VL +                                    
Sbjct: 241  LDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGI 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALIGIFETAEDSSD 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  TDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFLN 420

Query: 421  -KNTNATGTMKIL------IEEVMVILKKRKVTRGF------------------------ 480
             K  N T ++K+L      ++EV+++ K     RG                         
Sbjct: 421  SKLENMTKSIKMLNKGSDTLDEVLLLGKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGA 480

Query: 481  ---------------------------SGVGNLEV------------------------- 540
                                          G+++                          
Sbjct: 481  TMSQHRSRHHGMQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWV 540

Query: 541  ------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI 600
                                          HMTG + +  N++ C T +VTF DG+KGKI
Sbjct: 541  PKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKI 600

Query: 601  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRG 660
            +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G
Sbjct: 601  IGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKG 660

Query: 661  KRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW--------------------------- 720
             R  DNCY W    TSY  TC  ++ D+ +  W                           
Sbjct: 661  SRSKDNCYLWTPQETSYSSTCLSSKEDE-VRIWHQRFGHLHLRGMKKILDKSAVRGIPNL 720

Query: 721  --------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVD 780
                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVD
Sbjct: 721  KIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVD 780

Query: 781  DYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEG 840
            D+SR+TWV F++ K+ T E+ K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EG
Sbjct: 781  DFSRFTWVNFIREKSGTFEVFKKLSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEG 840

Query: 841  IHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT 900
            I HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Sbjct: 841  ITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRR 900

Query: 901  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV 960
             T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRV
Sbjct: 901  GTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRV 960

Query: 961  FNNKSVSVMETINVMID----------EED-ETSNMSEARTTSTVE-VSKADNPSDDLGK 1020
            FN+++ +VME+INV++D          EED  TS  + A    + E    +D+ +D+   
Sbjct: 961  FNSRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDE--S 1020

Query: 1021 NLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTF 1080
            N+ +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+  
Sbjct: 1021 NINQPDKRSSTR---------IQKMHPKELIIGDPNRGVTTRSRE----VEIVSNSCFVS 1080

Query: 1081 TIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGC 1140
             I+P  V  AL DE              RN +W LV +PEG N+IGTKW+FKNKTNE G 
Sbjct: 1081 KIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGV 1140

BLAST of Pay0016367.1 vs. ExPASy TrEMBL
Match: Q84VH6 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 1102.8 bits (2851), Expect = 0.0e+00
Identity = 662/1588 (41.69%), Positives = 854/1588 (53.78%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN-------------SSYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN             +++EGTS
Sbjct: 61   NELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVK+ RLQ    +F  L+M                 N C ++ ++++D K+VRK+LRSLP
Sbjct: 121  KVKMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV-- 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE     
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  SDTEANMDESIALLTKQFINVLRKF----------------------------------- 300
             DT+  +  ++  L KQF  VL +                                    
Sbjct: 241  LDTDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNISLDIRKGSEYQRKSDEKPSHSKGI 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  QCRGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFESAEDSS 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  DTDSEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISKLKGEVGFL 420

Query: 421  --KNTNATGTMKIL------IEEVMVILKKRKVTRGF----------------------- 480
              K  N T ++K+L      +++V+ + KK    RG                        
Sbjct: 421  NSKLENMTKSIKMLNKGSDMLDZVLQLGKKVGNQRGLGFNHKSAGRTTMTEFVPAKNSTG 480

Query: 481  -----------------------------------------------------SG----- 540
                                                                 SG     
Sbjct: 481  ATMSQHRSRHHGTQQKRSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQGSSSGRKMMW 540

Query: 541  -----VGNLEV--------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGK 600
                 + +L V                    HMTG + +  N++ C T +VTF DG+KGK
Sbjct: 541  VPKHKIVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGK 600

Query: 601  IMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMR 660
            I   G +    LP LN V  V GL  NLI ISQLCD+G+ V+F    C+V N+++++ M+
Sbjct: 601  ITGMGKLVHEGLPSLNKVLLVKGLTVNLISISQLCDEGFNVNFTKSECLVTNEKSEVLMK 660

Query: 661  GKRQTDNCYHW---NSNTSYTCHLTRSDQTLSNW-------------------------- 720
            G R  DNCY W    S+ S TC  ++ D+ +  W                          
Sbjct: 661  GSRSKDNCYLWTPQESSHSSTCLFSKEDE-VKIWHQRFGHLHLRGMKKIIDKGAVRGIPN 720

Query: 721  ---------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVV 780
                           ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVV
Sbjct: 721  LKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVV 780

Query: 781  DDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLE 840
            DD+SR+TWV F++ K+DT E+ K L L+LQREK+  I RIRSD+G+EF+N  F  FC  E
Sbjct: 781  DDFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSE 840

Query: 841  GIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR 900
            GI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R
Sbjct: 841  GITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLR 900

Query: 901  TRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYR 960
              T  TLYE+WK RKP VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYR
Sbjct: 901  RGTPTTLYEIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYR 960

Query: 961  VFNNKSVSVMETINVMID----------EED-ETSNMSEARTTSTVE-VSKADNPSDDLG 1020
            VFN+++ +VME+INV++D          EED  TS  + A T  + E    +D+ +D+  
Sbjct: 961  VFNSRTRTVMESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDE-- 1020

Query: 1021 KNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYT 1080
             N+ +P K              ++K HP   IIGDP+ G+ T+ +E    +++V + C+ 
Sbjct: 1021 PNINQPDKR---------PSIRIQKMHPKELIIGDPNRGVTTRSRE----IEIVSNSCFV 1080

Query: 1081 FTIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAG 1140
              I+P  V  AL DE              RN +W LV +PEG N+IGTKW+FKNKTNE G
Sbjct: 1081 SKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEG 1140

BLAST of Pay0016367.1 vs. NCBI nr
Match: GAU46010.1 (hypothetical protein TSUD_401320 [Trifolium subterraneum])

HSP 1 Score: 1106.3 bits (2860), Expect = 0.0e+00
Identity = 651/1459 (44.62%), Positives = 838/1459 (57.44%), Query Frame = 0

Query: 5    REGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYDPPMI-TVNG--VLVPK 64
            REG   +RPP+LD  NY+ WK RMI F+K++D + W+A++ G++ P +   NG    V K
Sbjct: 3    REGGYVTRPPLLDDSNYNIWKARMIAFLKSMDSRTWKAVLKGWEHPKVKDANGADTDVLK 62

Query: 65   PEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLI-------------NSSYEGTSKVKI 124
            PE  WT AE+  ++GN++ALNA+FNGV+ N+F+LI              ++ EGT+KVKI
Sbjct: 63   PEEEWTTAEDSLALGNSKALNALFNGVDKNMFRLIKKCEVAKDAWEILKTTQEGTAKVKI 122

Query: 125  VRLQ----EFLKLQMNLCCSV------------------KKISDTKIVRKVLRSLPRKFD 184
             RLQ    +F  L+M    SV                  +K+SD KIVRK+LRSL +KFD
Sbjct: 123  SRLQNLTRKFENLRMKEDESVHNFYMNVMDFANSFDDLGEKLSDEKIVRKILRSLTKKFD 182

Query: 185  MKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSDTEA- 244
            MKV A+EEA  I+T+++DEL  SL T+E +  +R  KK K I F S  E     SD E+ 
Sbjct: 183  MKVIAMEEAQDISTMKVDELIGSLQTYESSVNERIEKKNKSIAFVSNTEDADLESDIEST 242

Query: 245  -NMDESIALLTKQFINVLRKF--------------KNTNATGTMKILIEE---------- 304
             ++ E+I LL +QF  VL++                + N   + K  I+E          
Sbjct: 243  DSVSEAIVLLGRQFNKVLKRMDRRPRQNARHLATDMSRNIGNSRKTKIDEKPAQTNEEID 302

Query: 305  -----------------------VMVILKKRKVTRGFSGVGNLEV--------------- 364
                                   +  IL+K      F G+    V               
Sbjct: 303  RLEVDIGRLRKYSEMLNKTGADKLDEILEKNVRRPKFIGISYENVNKRRSYNPELMYTQP 362

Query: 365  ------------------------------------------------------------ 424
                                                                        
Sbjct: 363  KESPMSRKMLQHSKQHHGVRKPRQFVSPKEWKPKNEDVSSISKIDESLSKDQVAGLIAHT 422

Query: 425  ------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMAKGNIDKNNLP 484
                              HMTG   +  ++K   T  VTF DGAKG+I+  G +  NNLP
Sbjct: 423  SFRASSREDWYFDSGCSRHMTGIDKFLVDMKKYSTSFVTFGDGAKGEIVGIGKLINNNLP 482

Query: 485  RLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKRQTDNCYHW-- 544
            +L++V  V GL ANLI ISQLCDQG KV+F    C+V + + ++ M+G    DNCY W  
Sbjct: 483  KLDNVLLVKGLTANLISISQLCDQGLKVNFTKTECLVTDDKGELLMKGVISKDNCYLWVP 542

Query: 545  NSNTSYTCHLTRSDQTLSNW---------------------------------------- 604
              +TS +  L   +  +  W                                        
Sbjct: 543  QEDTSLSTCLIAKEDEVKLWHQRLGHLNLKSMKKAISEEAIRGLPKLKIEEGHICGDCQI 602

Query: 605  -QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKG 664
             ++T++ H+ L+   T RVL+LLHMDLM PMQ  SLGGKRY  VVVDD+SRYTW+ F+K 
Sbjct: 603  GKQTKTPHQKLQHLTTTRVLELLHMDLMGPMQVLSLGGKRYAYVVVDDFSRYTWINFIKE 662

Query: 665  KTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIHHEFSAPITPQQ 724
            K+DT ++ K+LC++LQREK + I RIRSD+GKEF N  F+ FC  EGI HEFS+PITPQQ
Sbjct: 663  KSDTFDVFKDLCVQLQREKNEVILRIRSDHGKEFQNSRFSEFCASEGIKHEFSSPITPQQ 722

Query: 725  NGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKER 784
            NGVVERKNRTLQE AR M+H K L   FWAEA+NTAC+IHNRVT+R+ TT TLYELWK R
Sbjct: 723  NGVVERKNRTLQEYARAMLHGKKLSYSFWAEAMNTACYIHNRVTLRSGTTSTLYELWKNR 782

Query: 785  KPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETIN 844
            KP VKYFHVFGS CYIL DRE R+K D +S++GIFLGYS NSR+YRV+N+++  +ME+IN
Sbjct: 783  KPTVKYFHVFGSKCYILTDREQRRKLDPKSDEGIFLGYSTNSRSYRVYNSRTKVMMESIN 842

Query: 845  VMIDE--EDETSNMSEARTTSTVEVSKADNPSDD---LGKNLEKPSKEIITKKSELITFA 904
            V+ID+  E  T+++++  TTS  +  + +   +D   +  ++   S   ++KK   I   
Sbjct: 843  VVIDDSAEGRTTDVADDATTSDKQFDETNLLKEDDNNMDTSIITNSTSDLSKKGPSI--- 902

Query: 905  HVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALKDE------ 964
             V+KNHP   IIGDPS G+ T+ K  +     V + C+   I+P  V  AL DE      
Sbjct: 903  RVQKNHPQELIIGDPSQGIATRSKNDV-----VSNACFVSKIEPRNVKEALTDEYWINAM 962

Query: 965  --------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVD 1024
                    RN +W LV +PE VN+IGTKWV+KNK++E G VT+NKARLVAQGY Q+EGVD
Sbjct: 963  QEELGQFKRNEVWDLVPRPENVNVIGTKWVYKNKSDENGNVTRNKARLVAQGYAQIEGVD 1022

Query: 1025 FDETFAPVARLEAIRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFVDSEHP 1084
            FDETFAPVA LE+IRLLL ++CI KF+L+QMDVKSAFLNGYLNEEVYV QPKGFVD   P
Sbjct: 1023 FDETFAPVAHLESIRLLLGVACILKFELFQMDVKSAFLNGYLNEEVYVEQPKGFVDPSLP 1082

Query: 1085 KHVYKLSKALYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDD 1141
             HVYKL KALYGLKQAPRAWYERLT +L  +GY +G  DKTLF+  E  +L++AQIYVDD
Sbjct: 1083 NHVYKLKKALYGLKQAPRAWYERLTEFLLSQGYRKGGNDKTLFVKEEEGKLIIAQIYVDD 1142

BLAST of Pay0016367.1 vs. NCBI nr
Match: AAO73527.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 1104.7 bits (2856), Expect = 0.0e+00
Identity = 657/1576 (41.69%), Positives = 858/1576 (54.44%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLINS-------------SYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN+             ++EGTS
Sbjct: 61   DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP
Sbjct: 121  KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV-- 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE     
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  SDTEANMDESIALLTKQFINVLRKF----------------------------------- 300
             DT+  +  ++ LL KQF  VL +                                    
Sbjct: 241  LDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGI 300

Query: 301  ------------------------------------------KNTNA-TG---------- 360
                                                      ++ NA TG          
Sbjct: 301  QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGIFETAEDSSD 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  TDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHEEEISELKGEVGFLN 420

Query: 421  --------TMKIL------IEEVMVILKKRKVTRG------FSG---------------- 480
                    ++K+L      ++EV+++ K     RG      F+G                
Sbjct: 421  SKLETMKKSIKMLNKGSDTLDEVLLLGKNAGNQRGLGFNPKFAGRTTMTEFVPAKNRTGT 480

Query: 481  -----------------------------VGNLEV------------------------- 540
                                          G+++                          
Sbjct: 481  TMSQHLSRHHGTQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWV 540

Query: 541  ------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI 600
                                          HMTG + +  N++ C T +VTF DG+KGKI
Sbjct: 541  PKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKI 600

Query: 601  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRG 660
            +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G
Sbjct: 601  IGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKG 660

Query: 661  KRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW--------------------------- 720
             R  DNCY W    TSY  TC  ++ D+ +  W                           
Sbjct: 661  SRSKDNCYLWTPQETSYSSTCLSSKEDE-VRIWHQRFGHLHLRGMKKIIDKGAVRGIPNL 720

Query: 721  --------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVD 780
                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVD
Sbjct: 721  KIEEGRICGECQIGKQVKMSHQKLRHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVD 780

Query: 781  DYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEG 840
            D+SR+TWV F++ K++T E+ K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EG
Sbjct: 781  DFSRFTWVNFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEG 840

Query: 841  IHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT 900
            I HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Sbjct: 841  ITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRR 900

Query: 901  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV 960
             T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRV
Sbjct: 901  GTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRV 960

Query: 961  FNNKSVSVMETINVMIDEEDETSNMSEARTTSTVEVSKAD-NPSDDLGKNLEKPSKEIIT 1020
            FN+++ +VME+INV++D+              T+  + AD   S +  +N +  + E   
Sbjct: 961  FNSRTRTVMESINVVVDDLSPARKKDVEEDVRTLGDNVADAAKSGENAENSDSATDESNI 1020

Query: 1021 KKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSAL 1080
             + +  +   ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I+P  V  AL
Sbjct: 1021 NQPDKRSSTRIQKMHPKELIIGDPNRGVTTRSRE----VEIVSNSCFVSKIEPKNVKEAL 1080

Query: 1081 KDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQ 1140
             DE              RN +W LV +PEG N+IGTKW+FKNKTNE G +T+NKARLVAQ
Sbjct: 1081 TDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQ 1140

BLAST of Pay0016367.1 vs. NCBI nr
Match: AAO73523.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 662/1587 (41.71%), Positives = 857/1587 (54.00%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN-------------SSYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN             S++EGTS
Sbjct: 61   DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDACEILKSTHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVK+ RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP
Sbjct: 121  KVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV-- 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE     
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  SDTEANMDESIALLTKQFINVLRKF----------------------------------- 300
             DT+  +  ++ LL KQF  VL +                                    
Sbjct: 241  LDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKRSDVKPSHSKGI 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALIGIFETAEDSSD 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  TDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFLN 420

Query: 421  -KNTNATGTMKIL------IEEVMVILKKRKVTRGF------------------------ 480
             K  N T ++K+L      ++EV+++ K     RG                         
Sbjct: 421  SKLENMTKSIKMLNKGSDTLDEVLLLGKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGA 480

Query: 481  ---------------------------SGVGNLEV------------------------- 540
                                          G+++                          
Sbjct: 481  TMSQHRSRHHGMQQKKSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQSSNSRKKMMWV 540

Query: 541  ------------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKI 600
                                          HMTG + +  N++ C T +VTF DG+KGKI
Sbjct: 541  PKHKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKI 600

Query: 601  MAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRG 660
            +  G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G
Sbjct: 601  IGMGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKG 660

Query: 661  KRQTDNCYHWN-SNTSY--TCHLTRSDQTLSNW--------------------------- 720
             R  DNCY W    TSY  TC  ++ D+ +  W                           
Sbjct: 661  SRSKDNCYLWTPQETSYSSTCLSSKEDE-VRIWHQRFGHLHLRGMKKILDKSAVRGIPNL 720

Query: 721  --------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVD 780
                          ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVD
Sbjct: 721  KIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVD 780

Query: 781  DYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEG 840
            D+SR+TWV F++ K+ T E+ K L L+LQREK+  I RIRSD+G+EF+N  F  FC  EG
Sbjct: 781  DFSRFTWVNFIREKSGTFEVFKKLSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEG 840

Query: 841  IHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRT 900
            I HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R 
Sbjct: 841  ITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRR 900

Query: 901  RTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV 960
             T  TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRV
Sbjct: 901  GTPTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRV 960

Query: 961  FNNKSVSVMETINVMID----------EED-ETSNMSEARTTSTVE-VSKADNPSDDLGK 1020
            FN+++ +VME+INV++D          EED  TS  + A    + E    +D+ +D+   
Sbjct: 961  FNSRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDE--S 1020

Query: 1021 NLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTF 1080
            N+ +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+  
Sbjct: 1021 NINQPDKRSSTR---------IQKMHPKELIIGDPNRGVTTRSRE----VEIVSNSCFVS 1080

Query: 1081 TIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGC 1140
             I+P  V  AL DE              RN +W LV +PEG N+IGTKW+FKNKTNE G 
Sbjct: 1081 KIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGV 1140

BLAST of Pay0016367.1 vs. NCBI nr
Match: AAO73521.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 1104.4 bits (2855), Expect = 0.0e+00
Identity = 661/1585 (41.70%), Positives = 857/1585 (54.07%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGSNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLINS-------------SYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN+             ++EGTS
Sbjct: 61   DELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKITHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVKI RLQ    +F  L+M                 N C ++ ++I+D K+VRK+LRSLP
Sbjct: 121  KVKISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGERITDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAVSD 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE    D
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  --TEANMDESIALLTKQFINVLRKF----------------------------------- 300
              T+  +  ++ LL KQF  VL +                                    
Sbjct: 241  LNTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGSKYQKKSDVKPSHSKGI 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  QCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRDVNALTGIFETAEDSSD 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  TDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEAHKEEISELKGEVGFLN 420

Query: 421  -KNTNATGTMKIL------IEEVMVILKKRKVTRGF------------------------ 480
             K  N T ++K+L      ++EV+++ K     RG                         
Sbjct: 421  SKLENMTKSIKMLNKGSDTLDEVLLLGKNAGNQRGLGFNPKSAGRTTMTEFVPAKNRTGA 480

Query: 481  ---------------------------SGVGNLEV------------------------- 540
                                          G+++                          
Sbjct: 481  TMSQHRSRHHGMQQKKSKRKKWRCHYCGKYGHIKPFCYHLHPHHGTQSSNSRKKMMWVPK 540

Query: 541  ----------------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGKIMA 600
                                        HMTG + +  N++ C T +VTF DG+KGKI+ 
Sbjct: 541  HKAVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIG 600

Query: 601  KGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMRGKR 660
             G +  + LP LN V  V GL ANLI ISQLCD+G+ V+F    C+V N+++++ M+G R
Sbjct: 601  MGKLVHDGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSR 660

Query: 661  QTDNCYHWN-SNTSY--TCHLTRSDQTLSNW----------------------------- 720
              DNCY W    TSY  TC  ++ D+ +  W                             
Sbjct: 661  SKDNCYLWTPQETSYSSTCLSSKEDE-VRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKI 720

Query: 721  ------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDY 780
                        ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVVDD+
Sbjct: 721  EEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDF 780

Query: 781  SRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLEGIH 840
            SR+TWV F++ K++T E+ K L L+LQREK+  I RIRSD+G+EF+N     FC  EGI 
Sbjct: 781  SRFTWVKFIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRLTEFCTSEGIT 840

Query: 841  HEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRT 900
            HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R  T
Sbjct: 841  HEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGT 900

Query: 901  TVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFN 960
              TLYE+WK RKP+VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYRVFN
Sbjct: 901  PTTLYEIWKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFN 960

Query: 961  NKSVSVMETINVMID----------EED-ETSNMSEARTTSTVE-VSKADNPSDDLGKNL 1020
            +++ +VME+INV++D          EED  TS  + A    + E    +D+ +D+   N+
Sbjct: 961  SRTRTVMESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDE--SNI 1020

Query: 1021 EKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYTFTI 1080
             +P K   T+         ++K HP   IIGDP+ G+ T+ +E    +++V + C+   I
Sbjct: 1021 NQPDKRSSTR---------IQKMHPKELIIGDPNRGVTTRSRE----VEIVSNSCFVSKI 1080

Query: 1081 KPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAGCVT 1140
            +P  V  AL DE              RN +W LV +PEG N+IGTKW+FKNKTNE G +T
Sbjct: 1081 EPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVIT 1140

BLAST of Pay0016367.1 vs. NCBI nr
Match: AAO73529.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 1102.8 bits (2851), Expect = 0.0e+00
Identity = 662/1588 (41.69%), Positives = 854/1588 (53.78%), Query Frame = 0

Query: 1    MEIIREGSSASRPPILDGKNYSYWKPRMIFFIKTLDRKAWRALVAGYD-PPMITVNGVLV 60
            M + +EG   +RPPILDG NY YWK RM+ F+K+LD + W+A++ G++ P M+   G   
Sbjct: 1    MNMEKEGGPVNRPPILDGTNYEYWKARMVAFLKSLDSRTWKAVIKGWEHPKMLDTEGKPT 60

Query: 61   P--KPEVGWTNAEEQASVGNARALNAIFNGVNLNIFKLIN-------------SSYEGTS 120
               KPE  WT  E++ ++GN++ALNA+FNGV+ NIF+LIN             +++EGTS
Sbjct: 61   NELKPEEDWTKEEDELALGNSKALNALFNGVDKNIFRLINTCTVAKDAWEILKTTHEGTS 120

Query: 121  KVKIVRLQ----EFLKLQM-----------------NLCCSV-KKISDTKIVRKVLRSLP 180
            KVK+ RLQ    +F  L+M                 N C ++ ++++D K+VRK+LRSLP
Sbjct: 121  KVKMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGERMTDEKLVRKILRSLP 180

Query: 181  RKFDMKVTAIEEAHCITTLRLDELFSSLLTFEMATADRENKKGKGIDFKSTHEGEAAV-- 240
            ++FDMKVTAIEEA  I  +R+DEL  SL TFE+  +DR  KK K + F S  EGE     
Sbjct: 181  KRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSKNLAFVSNDEGEEDEYD 240

Query: 241  SDTEANMDESIALLTKQFINVLRKF----------------------------------- 300
             DT+  +  ++  L KQF  VL +                                    
Sbjct: 241  LDTDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNISLDIRKGSEYQRKSDEKPSHSKGI 300

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 301  QCRGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDRDVNALTGRFESAEDSS 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  DTDSEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKEAHEEEISKLKGEVGFL 420

Query: 421  --KNTNATGTMKIL------IEEVMVILKKRKVTRGF----------------------- 480
              K  N T ++K+L      +++V+ + KK    RG                        
Sbjct: 421  NSKLENMTKSIKMLNKGSDMLDZVLQLGKKVGNQRGLGFNHKSAGRTTMTEFVPAKNSTG 480

Query: 481  -----------------------------------------------------SG----- 540
                                                                 SG     
Sbjct: 481  ATMSQHRSRHHGTQQKRSKRKKWRCHYCGKYGHIKPFCYHLHGHPHHGTQGSSSGRKMMW 540

Query: 541  -----VGNLEV--------------------HMTGNRSYFTNLKDCVTGHVTFSDGAKGK 600
                 + +L V                    HMTG + +  N++ C T +VTF DG+KGK
Sbjct: 541  VPKHKIVSLVVHTSLRASAKEDWYLDSGCSRHMTGVKEFLVNIEPCSTSYVTFGDGSKGK 600

Query: 601  IMAKGNIDKNNLPRLNDVRYVDGLKANLICISQLCDQGYKVSFDDIGCVVMNKENQICMR 660
            I   G +    LP LN V  V GL  NLI ISQLCD+G+ V+F    C+V N+++++ M+
Sbjct: 601  ITGMGKLVHEGLPSLNKVLLVKGLTVNLISISQLCDEGFNVNFTKSECLVTNEKSEVLMK 660

Query: 661  GKRQTDNCYHW---NSNTSYTCHLTRSDQTLSNW-------------------------- 720
            G R  DNCY W    S+ S TC  ++ D+ +  W                          
Sbjct: 661  GSRSKDNCYLWTPQESSHSSTCLFSKEDE-VKIWHQRFGHLHLRGMKKIIDKGAVRGIPN 720

Query: 721  ---------------QKTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVV 780
                           ++ + +H+ L+   T+RVL+LLHMDLM PMQ ESLGGKRY  VVV
Sbjct: 721  LKIEEGRICGECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVV 780

Query: 781  DDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNEGFNSFCLLE 840
            DD+SR+TWV F++ K+DT E+ K L L+LQREK+  I RIRSD+G+EF+N  F  FC  E
Sbjct: 781  DDFSRFTWVNFIREKSDTFEVFKELSLRLQREKDCVIKRIRSDHGREFENSKFTEFCTSE 840

Query: 841  GIHHEFSAPITPQQNGVVERKNRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIR 900
            GI HEFSA ITPQQNG+VERKNRTLQE ARVM+HAK LP   WAEA+NTAC+IHNRVT+R
Sbjct: 841  GITHEFSAAITPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLR 900

Query: 901  TRTTVTLYELWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYR 960
              T  TLYE+WK RKP VK+FH+FGS CYILADRE R+K D +S+ GIFLGYS NSRAYR
Sbjct: 901  RGTPTTLYEIWKGRKPTVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYR 960

Query: 961  VFNNKSVSVMETINVMID----------EED-ETSNMSEARTTSTVE-VSKADNPSDDLG 1020
            VFN+++ +VME+INV++D          EED  TS  + A T  + E    +D+ +D+  
Sbjct: 961  VFNSRTRTVMESINVVVDDLTPARKKDVEEDVRTSGDNVADTAKSAENAENSDSATDE-- 1020

Query: 1021 KNLEKPSKEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLKMVDDLCYT 1080
             N+ +P K              ++K HP   IIGDP+ G+ T+ +E    +++V + C+ 
Sbjct: 1021 PNINQPDKR---------PSIRIQKMHPKELIIGDPNRGVTTRSRE----IEIVSNSCFV 1080

Query: 1081 FTIKPSTVDSALKDE--------------RNNIWTLVSKPEGVNIIGTKWVFKNKTNEAG 1140
              I+P  V  AL DE              RN +W LV +PEG N+IGTKW+FKNKTNE G
Sbjct: 1081 SKIEPKNVKEALTDEFWINAMQEELEQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEG 1140

BLAST of Pay0016367.1 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 229.6 bits (584), Expect = 1.3e-59
Identity = 163/558 (29.21%), Positives = 262/558 (46.95%), Query Frame = 0

Query: 631  SNMSEARTTSTVEVSKADNPSDDLGKNLEKPSKEIITKKSELITFAHVKKNHPTSSI-IG 690
            S+   + ++S++++     PS ++  ++ +PS     +++    +      H  +S+ I 
Sbjct: 3    SDADASTSSSSIDIM----PSANIQNDVPEPSVHTSHRRTRKPAYLQDYYCHSVASLTIH 62

Query: 691  DPSAGMQTKRKEKIVYLKMVDDLCYTFTIKPSTVDSALK--------------DERNNIW 750
            D S  +  ++   + +  +V   C     +PST + A +               E  + W
Sbjct: 63   DISQFLSYEKVSPLYHSFLV---CIAKAKEPSTYNEAKEFLVWCGAMDDEIGAMETTHTW 122

Query: 751  TLVSKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEA 810
             + + P     IG KWV+K K N  G + + KARLVA+GYTQ EG+DF ETF+PV +L +
Sbjct: 123  EICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTS 182

Query: 811  IRLLLDISCIQKFKLYQMDVKSAFLNGYLNEEVYVAQPKGFV----DSEHPKHVYKLSKA 870
            ++L+L IS I  F L+Q+D+ +AFLNG L+EE+Y+  P G+     DS  P  V  L K+
Sbjct: 183  VKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKS 242

Query: 871  LYGLKQAPRAWYERLTIYLRGKGYSRGEIDKTLFIHRESNQLLVAQIYVDDIIFGGFPQD 930
            +YGLKQA R W+ + ++ L G G+ +   D T F+   +   L   +YVDDII       
Sbjct: 243  IYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDA 302

Query: 931  FVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKDMIKKFGLGQAQNKW 990
             V    + ++S F++  +G L  FL                KYA D++ + GL   +   
Sbjct: 303  AVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSS 362

Query: 991  TLTATHVKLIRNTDGAEVDHKLYRSIIGSLLYLTASRPDISYAVGICARYQADPRISHLE 1050
                  V    ++ G  VD K YR +IG L+YL  +R DIS+AV   +++   PR++H +
Sbjct: 363  VPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQ 422

Query: 1051 AVKRIL------------------------------------------------------ 1100
            AV +IL                                                      
Sbjct: 423  AVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWK 482

BLAST of Pay0016367.1 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 81.3 bits (199), Expect = 5.7e-15
Identity = 51/125 (40.80%), Positives = 66/125 (52.80%), Query Frame = 0

Query: 695 MQTKRKEKIVYLKMVDDLCYTFTIK--PSTVDSALKD--------------ERNNIWTLV 754
           M T+ K  I  L     L  T TIK  P +V  ALKD               RN  W LV
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 755 SKPEGVNIIGTKWVFKNKTNEAGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRL 804
             P   NI+G KWVFK K +  G + + KARLVA+G+ Q EG+ F ET++PV R   IR 
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRT 120

BLAST of Pay0016367.1 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein)

HSP 1 Score: 61.6 bits (148), Expect = 4.7e-09
Identity = 44/141 (31.21%), Positives = 69/141 (48.94%), Query Frame = 0

Query: 898  IYVDDIIFGGFPQDFVHNFINIMQSEFEMSMVGELSCFL---------------EKYAKD 957
            +YVDDI+  G     ++  I  + S F M  +G +  FL                KYA+ 
Sbjct: 5    LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 958  MIKKFGLGQAQNKWTLTATHVKLIRNTDGAEV-DHKLYRSIIGSLLYLTASRPDISYAVG 1017
            ++   G+     K   T   +KL  +   A+  D   +RSI+G+L YLT +RPDISYAV 
Sbjct: 65   ILNNAGM--LDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAVN 124

Query: 1018 ICARYQADPRISHLEAVKRIL 1023
            I  +   +P ++  + +KR+L
Sbjct: 125  IVCQRMHEPTLADFDLLKRVL 143

BLAST of Pay0016367.1 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein)

HSP 1 Score: 48.1 bits (113), Expect = 5.3e-05
Identity = 35/114 (30.70%), Positives = 52/114 (45.61%), Query Frame = 0

Query: 509 NRTLQEMARVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYELWKERKPNVKYF 568
           NRT+ E  R M+    LP  F A+A NTA HI N+            E+W +  P   Y 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 569 HVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINV 623
             FG + YI  D     K   R+++G      +   +Y +  N+ VS++ TI +
Sbjct: 62  RRFGCVAYIHCD---EGKLKPRAKKG------EEKGSYLI--NRIVSILYTIGI 104
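
The percentages in each HSP summary line are simply the match counts divided by the alignment length. A minimal Python sketch for checking them is given here; the function names `parse_hsp_stats` and `recomputed_pct` are illustrative and not part of any BLAST tooling, and the regular expression is only assumed to fit the summary-line layout shown on this page (it accepts both the "Postives" and "Positives" spellings).

```python
import re

# Matches e.g. "Identity = 661/1585 (41.70%)" or "Positives = 857/1585 (54.07%)".
# "Posi?tives" also accepts the "Postives" spelling seen in some reports.
HSP_STAT = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {"identity": (hits, length, pct), "positives": (...)} for one summary line."""
    stats = {}
    for name, hits, length, pct in HSP_STAT.findall(line):
        key = "identity" if name == "Identity" else "positives"
        stats[key] = (int(hits), int(length), float(pct))
    return stats


def recomputed_pct(hits, length):
    """Percentage of matching columns, rounded to two decimals like the report."""
    return round(100.0 * hits / length, 2)


# Summary line of the AAO73521.1 hit from this page.
line = "Identity = 661/1585 (41.70%), Positives = 857/1585 (54.07%), Query Frame = 0"
for key, (hits, length, reported) in parse_hsp_stats(line).items():
    print(key, reported, recomputed_pct(hits, length))
```

The same check applies to the TAIR 10 hits, for example 163/558 for AT4G23160.1.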

The following BLAST results are available for this feature:
Match Name   E-value    Identity  Description
P10978       4.1e-111   30.02     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146       9.6e-92    27.64     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
Q94HW2       4.9e-88    26.76     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94       4.2e-87    26.30     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P25600       1.5e-23    31.72     Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

Match Name   E-value    Identity  Description
A0A2Z6P936   0.0e+00    44.62     Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
Q84VH8       0.0e+00    41.69     Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1
Q84VI4       0.0e+00    41.70     Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1
Q84VI2       0.0e+00    41.71     Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1
Q84VH6       0.0e+00    41.69     Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1

Match Name   E-value    Identity  Description
GAU46010.1   0.0e+00    44.62     hypothetical protein TSUD_401320 [Trifolium subterraneum]
AAO73527.1   0.0e+00    41.69     gag-pol polyprotein [Glycine max]
AAO73523.1   0.0e+00    41.71     gag-pol polyprotein [Glycine max]
AAO73521.1   0.0e+00    41.70     gag-pol polyprotein [Glycine max]
AAO73529.1   0.0e+00    41.69     gag-pol polyprotein [Glycine max]

Match Name   E-value    Identity  Description
AT4G23160.1  1.3e-59    29.21     cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1  5.7e-15    40.80     Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00810.1  4.7e-09    31.21     DNA/RNA polymerases superfamily protein
ATMG00710.1  5.3e-05    30.70     Polynucleotidyl transferase, ribonuclease H-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Melon (Payzawat) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                      Source       Source Term  Source Description   Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727      RVT_2                coord: 732..937; e-value: 1.8E-62; score: 211.1
IPR036397  Ribonuclease H superfamily                           GENE3D       3.30.420.10  -                    coord: 394..571; e-value: 1.2E-43; score: 150.7
IPR001584  Integrase, catalytic core                            PFAM         PF00665      rve                  coord: 399..499; e-value: 7.9E-13; score: 48.6
IPR001584  Integrase, catalytic core                            PROSITE      PS50994      INTEGRASE            coord: 390..562; score: 22.625793
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 628..658
None       No IPR available                                     PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 401..652; coord: 741..1025
None       No IPR available                                     CDD          cd09272      RNase_HI_RT_Ty1      coord: 1024..1125; e-value: 2.08852E-49; score: 169.571
IPR043502  DNA/RNA polymerase superfamily                       SUPERFAMILY  56672        DNA/RNA polymerases  coord: 731..1093
IPR012337  Ribonuclease H-like superfamily                      SUPERFAMILY  53098        Ribonuclease H-like  coord: 398..556
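
Several of the InterPro hits above overlap on the protein. One way to summarise them is to merge overlapping coordinate ranges into covered regions. The sketch below is only an illustration: the helper name `merge_overlaps` is hypothetical, the coordinates are copied from the alignment column of the table, and they are assumed to be 1-based, inclusive residue positions.

```python
# (label, start, end) copied from the InterPro table; coordinates are assumed
# to be 1-based, inclusive residue positions on the predicted protein.
HITS = [
    ("PFAM PF07727 RVT_2", 732, 937),
    ("GENE3D 3.30.420.10", 394, 571),
    ("PFAM PF00665 rve", 399, 499),
    ("PROSITE PS50994 INTEGRASE", 390, 562),
    ("MOBIDB_LITE mobidb-lite", 628, 658),
    ("PANTHER PTHR45895", 401, 652),
    ("PANTHER PTHR45895", 741, 1025),
    ("CDD cd09272 RNase_HI_RT_Ty1", 1024, 1125),
    ("SUPERFAMILY 56672", 731, 1093),
    ("SUPERFAMILY 53098", 398, 556),
]


def merge_overlaps(hits):
    """Merge overlapping (start, end) ranges into a sorted list of covered regions."""
    regions = []
    for start, end in sorted((s, e) for _, s, e in hits):
        if regions and start <= regions[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            regions[-1][1] = max(regions[-1][1], end)
        else:
            regions.append([start, end])
    return [tuple(r) for r in regions]


for start, end in merge_overlaps(HITS):
    print(f"{start}..{end} ({end - start + 1} residues)")
```

With these ten ranges the hits collapse into two covered regions, one holding the integrase and ribonuclease H-like signatures and one holding the reverse transcriptase and RNase H signatures.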

Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Pay0016367    Pay0016367   gene


The following exon feature(s) are a part of this mRNA:

Feature Name       Unique Name                                   Type
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14122344..14122638    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14122678..14122717    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14122784..14123142    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14123204..14123294    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14123835..14124182    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14124302..14125034    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14125068..14125390    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14125433..14126052    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14126098..14126356    exon
Pay0016367.1-exon  Pay0016367.1-exon-chr08:14126518..14126878    exon
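
The exon coordinates can be checked against the sequence length of 3429 given in the Overview. The following Python sketch sums the exon lengths and derives the intron lengths; it assumes the coordinates are 1-based and inclusive (as in GFF-style annotation), and the helper names are illustrative only.

```python
# Exon coordinates on chr08, copied from the exon list; assumed to be
# 1-based and inclusive.
EXONS = [
    (14122344, 14122638),
    (14122678, 14122717),
    (14122784, 14123142),
    (14123204, 14123294),
    (14123835, 14124182),
    (14124302, 14125034),
    (14125068, 14125390),
    (14125433, 14126052),
    (14126098, 14126356),
    (14126518, 14126878),
]


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def spliced_length(exons):
    """Total length of the spliced transcript (sum of exon lengths)."""
    return sum(span_length(s, e) for s, e in exons)


def intron_lengths(exons):
    """Lengths of the gaps between consecutive exons, in genomic order."""
    ordered = sorted(exons)
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(ordered, ordered[1:])]


total = spliced_length(EXONS)
print(total)                  # 3429, matching the Overview
print(total % 3)              # 0, so the spliced length is a whole number of codons
print(intron_lengths(EXONS))  # nine introns between the ten exons
```

Because every CDS segment listed for this mRNA has the same coordinates as the corresponding exon, the same total applies to the coding sequence.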


The following CDS feature(s) are a part of this mRNA:

Feature Name      Unique Name                                  Type
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14122344..14122638    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14122678..14122717    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14122784..14123142    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14123204..14123294    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14123835..14124182    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14124302..14125034    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14125068..14125390    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14125433..14126052    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14126098..14126356    CDS
Pay0016367.1-cds  Pay0016367.1-cds-chr08:14126518..14126878    CDS


The following polypeptide feature(s) derives from this mRNA:

Feature Name  Unique Name           Type
Pay0016367.1  Pay0016367.1-protein  polypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding