Pay0016463 (gene) Melon (Payzawat) v1

Overview
Name: Pay0016463
Type: gene
Organism: Cucumis melo L. var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Gag-pol polyprotein
Location: chr01: 4377199 .. 4380075 (-)
RNA-Seq Expression: Pay0016463
Synteny: Pay0016463
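The location field above can be turned into a structured interval. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (as in GFF-style annotation; the page itself does not state this). The helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Parse a string like 'chr01: 4377199 .. 4380075 (-)'.

    Returns (seqid, start, end, strand). The field layout is taken from
    this record; other records may be formatted differently.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr01: 4377199 .. 4380075 (-)")
print(seqid, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption the feature spans 2877 bp on the minus strand of chr01.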
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCCTGATTAGTGTTTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCTGGAATCAAAGAACTTAACTAACGACACAGCAAACAGAAAGATAGAAGATCAAGAAGTTATCTTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAAAACCAAAGCTTTCTATCCTCTATAGTAACTCTAAAAGAAGAACTAGCAGAAACCAAGCATCAATTCGAAGAGCTCCTAAAATTTGCAAGGATGCTGACGAAGGGAACCTCGAAACTAGATGACATACTCGACCAAGGAATGAGGGCTGACGACAAAAGAGGCCTCAGATTTGCAGAAAGGGACACACCTGTCAGAAAAACTGTCTTTATCCGAGAAGGTACCCTTCAGAACAGCCCTACAAATAAAGAACAGGGAAAGGGTACTGAGATTACTAGCATGCCTGTGAAATCCCCAAACAAGCGAACACGTAGAATTTGTTATTTTTGTAGATGGGCTGGACATATTCGTTGGAATTACTTGAATTACTGACTACTCACGCGCAGGCAACTCGACACTCGACAGACCAAACACCTGAGGAGTCCTAGAACAGAATGGTGCCGAAAAATCCACATAGAAAACTGCAAGGTGGCTCTCACCTCTGTCAAAAGCCCCAACTCTAGTGACTGGTACTTTGACAGTGGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGAACTGAGTGAATGCAAAGTCGGATCAGTAGTGTTTGGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCATTCAGGTCTACCGTTTCTTCTTGATGTTCGACTAATACAAGGACTGGCTGCAAATCTCATAAGCATCAGCCAATTATGTGACCAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGTAATGTGTTAGATGTTCAAAATAAAGTATTTCTCAGCGGAACAAGGCTGTCAGACAACTGCTATCACTGGGATGCAGAGGTAACCTTATGCAATCTATCAAAAGTGGAAGAAGCTAGACTCTGGCACAAACGACTTGGACACCTTAGTGGCGCTACTATCTCCAAGGTCACCAAAGTTGATGCCATTATCGGTCTTCCCCCACTATCATTTTTGTCACTAGAAAGCTGTTCGGAGTGCACAGCTGGCAAGCAAGTCAAGTCTGTACACAAGCCTGTAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGTAGAAAATGGTATGCAGTAGTGTGTGTAGATGATTTCTCTCGCTACACCTGGATAAAATTTATCCTTGACAAATCGGAAACCTTTAAGACATGTCAGACCCTGTTCACTCAACTCCAAAGAGAGAAAAATACTAGCATTGGCCAAATACAAACTGATCATGGGCATGAATTTGAGAATCAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCATTAACACTACAGCAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTACAGGAGATGGCCCGAGTGATGATCCATGCAAAGCAACTGCCAATTCAATTCTAGGCTGAGGCTCTAAACACTGCATGCCATATACATAACAGAGTTATTCTCCGTCCAGGGACCACTACTACCTCGTATGAGCTGTGGAAAGGAAGAAAACCAAATGTGAAGTTTTCACATCTTTGGCAGCACGTGCTTTATCTTGAGTGATAGGGATCATCGCAGAAAGTGGGACTCAAAGTCAGATCGTGGAATATTTCTGGGATATTTAGCTAACAGCCGAGCCTACAGGGTCTACAACCAATGTTCCAAAATAGTAATGGAATCCATTAACGTGATTATTGATGACCTTGGTAGGAACCTAACAGAAATCTTGATGATGAAGTTGAGGTTTTTTGGAATTCTCTTTCTCATAAACCAGATGAAGGAGAGTTAGAATCGCCGGCCCG
TACTAATGAAACAACATACTTACCCTCTCATCTCGGTTTAAGCAGAATTGACATGTCAACACCATCTACATCAGCCATTCACTGTAACACACATGAAAGTGAAGCAATAGTATCTGCGAGTCAGCACACTCCAGAACAAACTGCGGGTGCAACTGATTCTTCAAAGTGTGACCTCATACCTCCTACGCATACAGCCAAAAATCATCCCTCCAGCTTCATTATTAGAGATATTCACAGTGTGTGCTACACATCTTTACTAGAACCGACCACGGTCTCTGCAGCACTTTCCGATGAACACTGGATCTTGACTATGCAGGAAGAGCTACTGCAGTTTGAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATCCGTAATAAAGCTAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTGCTAAGCTACGCATGTTTTTGGAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATGTGAGGAAGTGTATGTGGCCCAGCCAAAAGGATTTGTTGATCCAGTGCATGAGGATCATGTTTACAAACTTCGAAAGGCACTCTATAGACTTAAACAAGCTCCTAGAGCTTGTTATGAGAGACTCTCCACTTACCTGTTACAACAAGGATATCAAAGGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGATCATTCAGATCTATGTTGATGGAATTATATTTGGTGATACGTCCTAA

mRNA sequence

ATGGCCCTGATTAGTGTTTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCTGGAATCAAAGAACTTAACTAACGACACAGCAAACAGAAAGATAGAAGATCAAGAAGTTATCTTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAAAACCAAAGCTTTCTATCCTCTATAGTAACTCTAAAAGAAGAACTAGCAGAAACCAAGCATCAATTCGAAGAGCTCCTAAAATTTGCAAGGATGGACACACCTGTCAGAAAAACTGTCTTTATCCGAGAAGGTACCCTTCAGAACAGCCCTACAAATAAAGAACAGGGAAAGGGTACTGAGATTACTAGCATGCCTGTGAAATCCCCAAACAAGCGAACACAAAACTGCAAGGTGGCTCTCACCTCTGTCAAAAGCCCCAACTCTAGTGACTGGTACTTTGACAGTGGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGAACTGAGTGAATGCAAAGTCGGATCAGTAGTGTTTGGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCATTCAGGTCTACCGTTTCTTCTTGATGTTCGACTAATACAAGGACTGGCTGCAAATCTCATAAGCATCAGCCAATTATGTGACCAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGTAATGTGTTAGATGTTCAAAATAAAGTATTTCTCAGCGGAACAAGGCTGTCAGACAACTGCTATCACTGGGATGCAGAGGTAACCTTATGCAATCTATCAAAAGTGGAAGAAGCTAGACTCTGGCACAAACGACTTGGACACCTTAGTGGCGCTACTATCTCCAAGGTCACCAAAGTTGATGCCATTATCGGTCTTCCCCCACTATCATTTTTGTCACTAGAAAGCTGTTCGGAGTGCACAGCTGGCAAGCAAGTCAAGTCTGTACACAAGCCTGTAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGTAGAAAATGGTATGCAGTAGTGTGTGTAGATGATTTCTCTCGCTACACCTGGATAAAATTTATCCTTGACAAATCGGAAACCTTTAAGACATGTCAGACCCTGTTCACTCAACTCCAAAGAGAGAAAAATACTAGCATTGGCCAAATACAAACTGATCATGGGCATGAATTTGAGAATCAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCATTAACACTACAGCAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTACAGGAGATGGCCCGAGCTGAGGCTCTAAACACTGCATGCCATATACATAACAGAGTTATTCTCCGTCCAGGGACCACTACTACCTCGTATGAGCTGTGGAAAGGAAGAAAACCAAATGTGAAGGATCATCGCAGAAAGTGGGACTCAAAGTCAGATCGTGGAATATTTCTGGGATATTTAGCTAACAGCCGAGCCTACAGGGTCTACAACCAATGTTCCAAAATAGTAATGGAATCCATTAACGTGATTATTGATGACCTTGGTAGGAACCTAACAGAAATCTTGATGATGAAGTTGAGGTTTTTTGGAATTCTCTTTCTCATAAACCAGATGAAGGAGAAGCTACTGCAGTTTGAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATCCGTAATAAAGCTAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTGCTAAGCTACGCATGTTTTTGGAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATGTGAGGAAGTGTATGTGGCCCAGCCAAAAGGATT
TGTTGATCCAGTGCATGAGGATCATGTTTACAAACTTCGAAAGGCACTCTATAGACTTAAACAAGCTCCTAGAGCTTGTTATGAGAGACTCTCCACTTACCTGTTACAACAAGGATATCAAAGGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGATCATTCAGATCTATGTTGATGGAATTATATTTGGTGATACGTCCTAA

Coding sequence (CDS)

ATGGCCCTGATTAGTGTTTGCACCATGAATGACGAAGAAAATGTTCAAACACATGACCAGCTGGAATCAAAGAACTTAACTAACGACACAGCAAACAGAAAGATAGAAGATCAAGAAGTTATCTTGCAACAACAAGAACGAATTCAAGATCTAGTGGAAGAAAACCAAAGCTTTCTATCCTCTATAGTAACTCTAAAAGAAGAACTAGCAGAAACCAAGCATCAATTCGAAGAGCTCCTAAAATTTGCAAGGATGGACACACCTGTCAGAAAAACTGTCTTTATCCGAGAAGGTACCCTTCAGAACAGCCCTACAAATAAAGAACAGGGAAAGGGTACTGAGATTACTAGCATGCCTGTGAAATCCCCAAACAAGCGAACACAAAACTGCAAGGTGGCTCTCACCTCTGTCAAAAGCCCCAACTCTAGTGACTGGTACTTTGACAGTGGGTGTTCCAGACACATGACAGGTAATGCAGATTTCTTTTCTGAACTGAGTGAATGCAAAGTCGGATCAGTAGTGTTTGGAGATGGAGGAAAAGGAAAAATAATTGGCAAAGGAACGATTAACCATTCAGGTCTACCGTTTCTTCTTGATGTTCGACTAATACAAGGACTGGCTGCAAATCTCATAAGCATCAGCCAATTATGTGACCAAGGCTATCAAGTCAGTTTCAATAAAGATAGATGTAATGTGTTAGATGTTCAAAATAAAGTATTTCTCAGCGGAACAAGGCTGTCAGACAACTGCTATCACTGGGATGCAGAGGTAACCTTATGCAATCTATCAAAAGTGGAAGAAGCTAGACTCTGGCACAAACGACTTGGACACCTTAGTGGCGCTACTATCTCCAAGGTCACCAAAGTTGATGCCATTATCGGTCTTCCCCCACTATCATTTTTGTCACTAGAAAGCTGTTCGGAGTGCACAGCTGGCAAGCAAGTCAAGTCTGTACACAAGCCTGTAAATATCTCCTCGACGTCCCATATTCTGGAACTTCTTCATATAGACCTAATGGGGCCCATGCAAACAGAAAGCTTGGGTAGAAAATGGTATGCAGTAGTGTGTGTAGATGATTTCTCTCGCTACACCTGGATAAAATTTATCCTTGACAAATCGGAAACCTTTAAGACATGTCAGACCCTGTTCACTCAACTCCAAAGAGAGAAAAATACTAGCATTGGCCAAATACAAACTGATCATGGGCATGAATTTGAGAATCAGCACTTTGCTGAGTTCTGTGATAATGAAGGCATCTTTCATGAGTTCTCTGCCCCATTAACACTACAGCAAAATGGAGTTGTAGAGAGAAGGAATCGAACCTTACAGGAGATGGCCCGAGCTGAGGCTCTAAACACTGCATGCCATATACATAACAGAGTTATTCTCCGTCCAGGGACCACTACTACCTCGTATGAGCTGTGGAAAGGAAGAAAACCAAATGTGAAGGATCATCGCAGAAAGTGGGACTCAAAGTCAGATCGTGGAATATTTCTGGGATATTTAGCTAACAGCCGAGCCTACAGGGTCTACAACCAATGTTCCAAAATAGTAATGGAATCCATTAACGTGATTATTGATGACCTTGGTAGGAACCTAACAGAAATCTTGATGATGAAGTTGAGGTTTTTTGGAATTCTCTTTCTCATAAACCAGATGAAGGAGAAGCTACTGCAGTTTGAAAGAAACCAAGTATGGGAATTAGTGCCAAAGCCACCTTATGCTAACATAATTGGTACCAAATGGATCTTTAAGAACAAAACGGATGAAGAAGGTAGAGTTATCCGTAATAAAGCTAGACTGGTTGCTCAAGGGTATTCTCAAATAGAAGGGCTGGATTTTGGAGAAACATTTGCCCCAGTTGCCAGATTAGAAGCCATCCGACTACTGCTAAGCTACGCATGTTTTTGGAGGTTCAAACTGTTCCAAATGGATGTAAAGAGTGCGTTCCTAAATGGGTACTTATGTGAGGAAGTGTATGTGGCCCAGCCAAAAGGATT
TGTTGATCCAGTGCATGAGGATCATGTTTACAAACTTCGAAAGGCACTCTATAGACTTAAACAAGCTCCTAGAGCTTGTTATGAGAGACTCTCCACTTACCTGTTACAACAAGGATATCAAAGGGGCAGTGCGGATCAAACTATGTTTATATATCGTCAAGGCACTGACTTTCTGATCATTCAGATCTATGTTGATGGAATTATATTTGGTGATACGTCCTAA

Protein sequence

MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLSSIVTLKEELAETKHQFEELLKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQEMARAEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVKDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGIIFGDTS
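The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS. This is a minimal sketch, assuming the standard nuclear genetic code and a CDS that starts in frame 0; the function name `translate` is illustrative, and libraries such as Biopython offer equivalent functionality.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    A trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS on this page, plus a stop codon.
print(translate("ATGGCCCTGATTAGTGTTTGCACCTAA"))
```

The first eight codons of the CDS translate to MALISVCT, matching the start of the protein sequence listed above.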
Homology
BLAST of Pay0016463 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 1.4e-64
Identity = 194/728 (26.65%), Positives = 317/728 (43.54%), Query Frame = 0

Query: 143  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTI----NHSGLPFLL 202
            S+W  D+  S H T   D F        G+V  G+    KI G G I    N      L 
Sbjct: 292  SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 351

Query: 203  DVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVT 262
            DVR +  L  NLIS   L   GY+  F   +  +   +  + ++        Y  +AE+ 
Sbjct: 352  DVRHVPDLRMNLISGIALDRDGYESYFANQKWRL--TKGSLVIAKGVARGTLYRTNAEIC 411

Query: 263  LCNLSKVEE---ARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQV 322
               L+  ++     LWHKR+GH+S   +  + K   I         +++ C  C  GKQ 
Sbjct: 412  QGELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLI---SYAKGTTVKPCDYCLFGKQH 471

Query: 323  KSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSET 382
            + V    +     +IL+L++ D+ GPM+ ES+G   Y V  +DD SR  W+  +  K + 
Sbjct: 472  R-VSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQV 531

Query: 383  FKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVV 442
            F+  Q     ++RE    + ++++D+G E+ ++ F E+C + GI HE + P T Q NGV 
Sbjct: 532  FQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVA 591

Query: 443  ERRNRTLQEMARA-------------EALNTACHIHNRVILRPGTTTTSYELWKGRKPNV 502
            ER NRT+ E  R+             EA+ TAC++ NR    P        +W  ++ + 
Sbjct: 592  ERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSY 651

Query: 503  ---------------KDHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVI-- 562
                           K+ R K D KS   IF+GY      YR+++   K V+ S +V+  
Sbjct: 652  SHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFR 711

Query: 563  -----------------------------------------IDDLGRNLTEIL------- 622
                                                     + + G    E++       
Sbjct: 712  ESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLD 771

Query: 623  --------------------------MMKLRFFGILF----------------------- 682
                                      +   R+    +                       
Sbjct: 772  EGVEEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQ 831

Query: 683  LINQMKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQ 736
            L+  M+E++   ++N  ++LV  P     +  KW+FK K D + +++R KARLV +G+ Q
Sbjct: 832  LMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQ 891
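
Each hit in this section opens with two summary lines (score and E-value, then identity and positives counts). The sketch below shows one way to pull those numbers out of the text and recompute the percentages. It assumes the line layout used on this page; the regular expressions and the function name `parse_hsp` are illustrative and are not part of BLAST itself.

```python
import re

# Patterns follow the summary lines as printed on this page.
# "Posi?tives" also accepts the misspelling "Postives" that some
# exports of this output contain.
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
STATS_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)")

def parse_hsp(score_line, stats_line):
    """Return bit score, raw score, E-value and recomputed percentages."""
    s = SCORE_RE.search(score_line)
    t = STATS_RE.search(stats_line)
    if s is None or t is None:
        raise ValueError("unrecognised HSP summary lines")
    identical, aln_len, positives, pos_len = map(int, t.groups())
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "alignment_length": aln_len,
        "identity_pct": round(100 * identical / aln_len, 2),
        "positives_pct": round(100 * positives / pos_len, 2),
    }

hsp = parse_hsp(
    "HSP 1 Score: 249.2 bits (635), Expect = 1.4e-64",
    "Identity = 194/728 (26.65%), Positives = 317/728 (43.54%), Query Frame = 0",
)
print(hsp)
```

Recomputing the percentages from the raw counts is a quick consistency check on the reported values: 194/728 and 317/728 round to the 26.65% and 43.54% shown for the P10978 hit.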

BLAST of Pay0016463 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 172.6 bits (436), Expect = 1.7e-41
Identity = 202/845 (23.91%), Positives = 311/845 (36.80%), Query Frame = 0

Query: 136  SVKSP-NSSDWYFDSGCSRHMTGNADFFSELSECKVG-SVVFGDGGKGKIIGKGTINHSG 195
            +V SP N+++W  DSG + H+T + +  S       G  V+  DG    I   G+ +   
Sbjct: 300  AVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPT 359

Query: 196  LPFLLD---VRLIQGLAANLISISQLCDQG-YQVSFNKDRCNVLDVQNKVFLSGTRLSDN 255
                LD   V  +  +  NLIS+ +LC+     V F      V D+   V L   +  D 
Sbjct: 360  SSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDE 419

Query: 256  CYHW---DAEVTLCNLSKVEEA--RLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLE 315
             Y W    ++      S   +A    WH RLGH S A ++ V    ++  L P     L 
Sbjct: 420  LYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLPVLNPSH--KLL 479

Query: 316  SCSECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYT 375
            SCS+C   K  K       I+S S  LE ++ D+       S+    Y V+ VD F+RYT
Sbjct: 480  SCSDCFINKSHKVPFSNSTITS-SKPLEYIYSDVWS-SPILSIDNYRYYVIFVDHFTRYT 539

Query: 376  WIKFILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFS 435
            W+  +  KS+   T     + ++    T IG + +D+G EF      ++    GI H  S
Sbjct: 540  WLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEF--VVLRDYLSQHGISHFTS 599

Query: 436  APLTLQQNGVVERRNRTLQEMARA-------------EALNTACHIHNRVILRPGTTTTS 495
             P T + NG+ ER++R + EM                 A + A ++ NR+        + 
Sbjct: 600  PPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSP 659

Query: 496  YELWKGRKPNVKD---------------HRRKWDSKSDRGIFLGYLANSRAYRVYNQCSK 555
            ++   G+ PN +                +R K + KS +  F+GY     AY   +  + 
Sbjct: 660  FQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTG 719

Query: 556  IVMESINVIIDD--------------------------------------------LGRN 615
             +  S +V  D+                                            LG +
Sbjct: 720  RLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPH 779

Query: 616  L---------------TEILMMKLRFFGI------------------------------- 675
            L               T++    L    I                               
Sbjct: 780  LDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSN 839

Query: 676  ------------------------------------------------------------ 735
                                                                        
Sbjct: 840  SPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVL 899

BLAST of Pay0016463 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 161.4 bits (407), Expect = 4.0e-38
Identity = 79/174 (45.40%), Positives = 116/174 (66.67%), Query Frame = 0

Query: 563  NQVWELVPKPP-YANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPV 622
            N  W+LVP PP +  I+G +WIF  K + +G + R KARLVA+GY+Q  GLD+ ETF+PV
Sbjct: 982  NHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFSPV 1041

Query: 623  ARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRK 682
             +  +IR++L  A    + + Q+DV +AFL G L ++VY++QP GF+D    ++V KLRK
Sbjct: 1042 IKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRK 1101

Query: 683  ALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQIYVDGII 736
            ALY LKQAPRA Y  L  YLL  G+    +D ++F+ ++G   + + +YVD I+
Sbjct: 1102 ALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDIL 1155

BLAST of Pay0016463 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 134.0 bits (336), Expect = 6.8e-30
Identity = 70/175 (40.00%), Positives = 107/175 (61.14%), Query Frame = 0

Query: 563  NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVA 622
            N  W +  +P   NI+ ++W+F  K +E G  IR KARLVA+G++Q   +D+ ETFAPVA
Sbjct: 920  NNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVA 979

Query: 623  RLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKA 682
            R+ + R +LS    +  K+ QMDVK+AFLNG L EE+Y+  P+G     + D+V KL KA
Sbjct: 980  RISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKA 1039

Query: 683  LYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQG--TDFLIIQIYVDGII 736
            +Y LKQA R  +E     L +  +   S D+ ++I  +G   + + + +YVD ++
Sbjct: 1040 IYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVV 1092

BLAST of Pay0016463 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 8.0e-15
Identity = 48/118 (40.68%), Positives = 67/118 (56.78%), Query Frame = 0

Query: 553 MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGL 612
           M+E+L    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q EG+
Sbjct: 44  MQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGI 103

Query: 613 DFGETFAPVARLEAIRLLLSYA--------CFWRFKL-FQMDVKSAFLNGYLCEEVYV 662
            F ET++PV R   IR +L+ A          W FK+ F M +   F   ++C  + V
Sbjct: 104 YFVETYSPVVRTATIRTILNVAQQLEVGQSINWMFKMHFSMGI---FKKKFICINLLV 158

BLAST of Pay0016463 vs. ExPASy TrEMBL
Match: A0A5A7V046 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G001550 PE=4 SV=1)

HSP 1 Score: 1291.2 bits (3340), Expect = 0.0e+00
Identity = 694/901 (77.03%), Positives = 703/901 (78.02%), Query Frame = 0

Query: 1    MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLS 60
            MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLS
Sbjct: 305  MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLS 364

Query: 61   SIVTLKEELAETKHQFEELLKFARM-----------------------------DTPVRK 120
            SIVTLKEELA+TKHQFEELLKFARM                             DTPVRK
Sbjct: 365  SIVTLKEELAKTKHQFEELLKFARMLTKGTSKLDDILDQGMRADDKRGLRFAERDTPVRK 424

Query: 121  TVFIREGTLQNSPTNKEQGKGTEITSMPVK---SPNK------RTQNCKVALTSVKSPNS 180
            TVFIREGTLQNSPTN EQGKGTEITSMP K   SP          +NCKVALTSVKSPNS
Sbjct: 425  TVFIREGTLQNSPTNNEQGKGTEITSMPTKHLRSPRTEWCRKIHIENCKVALTSVKSPNS 484

Query: 181  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRL 240
            SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRL
Sbjct: 485  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRL 544

Query: 241  IQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL 300
            IQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL
Sbjct: 545  IQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL 604

Query: 301  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPV 360
            SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPL+FLSLESCSECTAGKQVKSVHKPV
Sbjct: 605  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLTFLSLESCSECTAGKQVKSVHKPV 664

Query: 361  NISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTL 420
            NISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDK ETFKTCQTL
Sbjct: 665  NISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKPETFKTCQTL 724

Query: 421  FTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTL 480
            FTQLQREKNT IGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGV        
Sbjct: 725  FTQLQREKNTGIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGV-------- 784

Query: 481  QEMARAEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRR 540
                 AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK               DHRR
Sbjct: 785  -----AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRR 844

Query: 541  KWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGR---------NLTEILM 600
            KWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDL           N T  L 
Sbjct: 845  KWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLDEGELESPARTNETTYLP 904

Query: 601  MKLRFFGI---------------------------------------------------- 660
              L    I                                                    
Sbjct: 905  SHLGLSRIDMSTPSTSAIHCNTHESEAIVSASQHTPEQTAGATDSSKCDLIPPTHTAKNH 964

Query: 661  --LFLINQ---------------------------------------------MKEKLLQ 720
               F+I                                               ++E+LLQ
Sbjct: 965  PSSFIIRDIHSGIITRKKERKDYAKMVANVCYTSLLEPTTVSAALSDEHWILTIQEELLQ 1024

Query: 721  FERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA 741
            FERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA
Sbjct: 1025 FERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA 1084

BLAST of Pay0016463 vs. ExPASy TrEMBL
Match: A0A5D3C9Q6 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G00180 PE=4 SV=1)

HSP 1 Score: 817.4 bits (2110), Expect = 4.9e-233
Identity = 452/699 (64.66%), Positives = 477/699 (68.24%), Query Frame = 0

Query: 80  LKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKS 139
           L+F   DTPVRKTVFIREGTLQNSPTN EQGK                +NCKVALTSVKS
Sbjct: 25  LEFVERDTPVRKTVFIREGTLQNSPTNNEQGK----------------ENCKVALTSVKS 84

Query: 140 PNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLD 199
           PNS DWYFDSGCSRHMTGNADFFSELSECK GSVVF DGGKGKIIGKGTIN  GLPFLLD
Sbjct: 85  PNSGDWYFDSGCSRHMTGNADFFSELSECKAGSVVFEDGGKGKIIGKGTINRPGLPFLLD 144

Query: 200 VRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTL 259
           VRL+QGL+ANLIS SQLCDQGY+V+F+KDRCNVLD QNKVFLSGTRLSDNCYHWDAEVTL
Sbjct: 145 VRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTL 204

Query: 260 CNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVH 319
           CNLSKVEEA LWHKRLGHL GATISKV K +AIIGLPPLSF SLESCSEC AGKQVKSVH
Sbjct: 205 CNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVH 264

Query: 320 KPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTC 379
           KPVNIS TSHILELLHIDLM PMQTESLGRK YAVVCVDDFSRYTWIKFIL+K ETFKTC
Sbjct: 265 KPVNISLTSHILELLHIDLMRPMQTESLGRKRYAVVCVDDFSRYTWIKFILEKLETFKTC 324

Query: 380 QTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRN 439
           QTL TQLQREKNT IG+I+T+HG EFEN+HFAEFCDNEGIFHEFSA LT Q+NGVVE+RN
Sbjct: 325 QTLVTQLQREKNTGIGRIRTEHGCEFENKHFAEFCDNEGIFHEFSAQLTPQENGVVEKRN 384

Query: 440 RTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK--- 499
           +TLQEMAR             AEALNTACHIHNRVILRP TTTTSYELWKGRKPNVK   
Sbjct: 385 QTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFH 444

Query: 500 ------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLG- 559
                       DHRRKWDSKSDRGIFLGY AN+RAYRVYNQ +KIV+ESINVIIDDLG 
Sbjct: 445 IFGGTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYNQRTKIVIESINVIIDDLGK 504

Query: 560 ---RNL--------------------------TEILMMKLRF------------------ 619
              RNL                           E   +   F                  
Sbjct: 505 EPNRNLDDEDEVFWNSLSHKTAEGESESTTPTNETTYLPSHFDSNKIDMSTPSTSTNHSN 564

BLAST of Pay0016463 vs. ExPASy TrEMBL
Match: A0A5A7TGY4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold4170G00030 PE=4 SV=1)

HSP 1 Score: 703.7 bits (1815), Expect = 7.9e-199
Identity = 420/731 (57.46%), Positives = 467/731 (63.89%), Query Frame = 0

Query: 1   MALISVCTMNDEE----NVQTHDQLES---KNLTNDTAN-RKIEDQEVILQQQERIQDLV 60
           MALIS+C MNDEE    N QTHD  ES   K LT+   + +K EDQE+ILQQQERIQDLV
Sbjct: 251 MALISLCNMNDEETAKVNTQTHDPQESTTNKYLTDGLVDKKKTEDQEIILQQQERIQDLV 310

Query: 61  EENQSFLSSIVTLKEELAETKHQFEELLKFARM--------------------------- 120
           EENQSFLSSIVTLK EL ETKHQFEELLKFARM                           
Sbjct: 311 EENQSFLSSIVTLKVELVETKHQFEELLKFARMLTNGTLKLDDILNQGRRVDDKRGLGFV 370

Query: 121 --DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSS 180
             D PVR T+FIREG       + +   G +I+           ++CKVA+TSVKSPNS 
Sbjct: 371 ERDAPVRTTIFIREGEHVEFVISVD---GLDISD----------ESCKVAMTSVKSPNSG 430

Query: 181 DWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLI 240
           DWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINH GLPFLLDV+L+
Sbjct: 431 DWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHPGLPFLLDVQLV 490

Query: 241 QGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLS 300
           QGL+ANL+SISQLCDQGYQVS +KDR NVLD QNKVF S TR+SDNCYHWDAEV LCNLS
Sbjct: 491 QGLSANLLSISQLCDQGYQVSLSKDRSNVLDSQNKVFFSRTRMSDNCYHWDAEVNLCNLS 550

Query: 301 KVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVN 360
           KV+EA LWHKRLGHL G TI KVTK DAIIGLPP SF SL+SC EC AGKQVKSVHKP  
Sbjct: 551 KVKEAGLWHKRLGHLGGTTIFKVTKADAIIGLPPFSFSSLKSCLECPAGKQVKSVHKP-- 610

Query: 361 ISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLF 420
                                                                 TCQTLF
Sbjct: 611 ------------------------------------------------------TCQTLF 670

Query: 421 TQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQ 480
           TQLQREKNT IG+I+TDHG EFEN++F EFCDNE    E S  LT +           + 
Sbjct: 671 TQLQREKNTGIGRIRTDHGREFENKYFTEFCDNEDAESE-STSLTRETTYSPPHSKTNII 730

Query: 481 EMARAEALNTACHIHNRVILRPGTTTTSYELWKGRK-PNVKDHRRKWDSKSDRGIFLGYL 540
           +M+                  P T+    E+ +G    +   H  +W   S         
Sbjct: 731 DMS-----------------TPPTSVNHSEICEGEAVVSASQHTPEWTIDS--------- 790

Query: 541 ANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFER 600
            +S  +++              II D+   +  I   K R      + N   E+LLQFER
Sbjct: 791 TDSLKHKLMPPMHIAKNHPSIFIIGDVHSGI--ITRKKERRDYAKMVAN---EELLQFER 850

Query: 601 NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVA 660
           NQVWELVPKPP+ANIIGTKWIFK KT E+GRVIRN+ARLVAQGYSQIEGLD  ETFA VA
Sbjct: 851 NQVWELVPKPPHANIIGTKWIFKKKTVEQGRVIRNEARLVAQGYSQIEGLDLRETFALVA 880

Query: 661 RLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKA 694
           RLEAIRLLLSYA F RFKLF MDVKSAFLNGYL EEVYVA+PKGFVD VH DHVYKL+KA
Sbjct: 911 RLEAIRLLLSYAWFRRFKLFPMDVKSAFLNGYLYEEVYVAKPKGFVDLVHHDHVYKLQKA 880

BLAST of Pay0016463 vs. ExPASy TrEMBL
Match: Q84VH8 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 685.3 bits (1767), Expect = 2.9e-193
Identity = 369/723 (51.04%), Positives = 452/723 (62.52%), Query Frame = 0

Query: 132  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINH 191
            V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H
Sbjct: 549  VVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVH 608

Query: 192  SGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCY 251
             GLP L  V L++GL ANLISISQLCD+G+ V+F K  C V + +++V + G+R  DNCY
Sbjct: 609  DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCY 668

Query: 252  HWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCS 311
             W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C 
Sbjct: 669  LWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 728

Query: 312  ECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIK 371
            EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+ 
Sbjct: 729  ECQIGKQVKMSHQKLRHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVN 788

Query: 372  FILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL 431
            FI +KSETF+  + L  +LQREK+  I +I++DHG EFEN  F EFC +EGI HEFSA +
Sbjct: 789  FIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAI 848

Query: 432  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYEL 491
            T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+
Sbjct: 849  TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 908

Query: 492  WKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVM 551
            WKGRKP+VK               + RRK D KSD GIFLGY  NSRAYRV+N  ++ VM
Sbjct: 909  WKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 968

Query: 552  ESINVIIDDLG---------------------------------------------RNLT 611
            ESINV++DDL                                              R+ T
Sbjct: 969  ESINVVVDDLSPARKKDVEEDVRTLGDNVADAAKSGENAENSDSATDESNINQPDKRSST 1028

Query: 612  EILMM----------------KLRFFGIL---------------------FLINQMKEKL 671
             I  M                + R   I+                     F IN M+E+L
Sbjct: 1029 RIQKMHPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEEL 1088

Query: 672  LQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET 731
             QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Sbjct: 1089 EQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDET 1148

Query: 732  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVY 741
            FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY
Sbjct: 1149 FAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVY 1208

BLAST of Pay0016463 vs. ExPASy TrEMBL
Match: Q84VI4 (Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1)

HSP 1 Score: 684.9 bits (1766), Expect = 3.8e-193
Identity = 369/723 (51.04%), Positives = 452/723 (62.52%), Query Frame = 0

Query: 132  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINH 191
            V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H
Sbjct: 547  VVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVH 606

Query: 192  SGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCY 251
             GLP L  V L++GL ANLISISQLCD+G+ V+F K  C V + +++V + G+R  DNCY
Sbjct: 607  DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCY 666

Query: 252  HWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCS 311
             W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C 
Sbjct: 667  LWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 726

Query: 312  ECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIK 371
            EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+K
Sbjct: 727  ECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVK 786

Query: 372  FILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL 431
            FI +KSETF+  + L  +LQREK+  I +I++DHG EFEN    EFC +EGI HEFSA +
Sbjct: 787  FIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRLTEFCTSEGITHEFSAAI 846

Query: 432  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYEL 491
            T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+
Sbjct: 847  TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 906

Query: 492  WKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVM 551
            WKGRKP+VK               + RRK D KSD GIFLGY  NSRAYRV+N  ++ VM
Sbjct: 907  WKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 966

Query: 552  ESINVIIDDLG---------------------------------------------RNLT 611
            ESINV++DDL                                              R+ T
Sbjct: 967  ESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSST 1026

Query: 612  EILMM----------------KLRFFGIL---------------------FLINQMKEKL 671
             I  M                + R   I+                     F IN M+E+L
Sbjct: 1027 RIQKMHPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEEL 1086

Query: 672  LQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET 731
             QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Sbjct: 1087 EQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDET 1146

Query: 732  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVY 741
            FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY
Sbjct: 1147 FAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVY 1206

BLAST of Pay0016463 vs. NCBI nr
Match: KAA0059225.1 (gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 1291.2 bits (3340), Expect = 0.0e+00
Identity = 694/901 (77.03%), Positives = 703/901 (78.02%), Query Frame = 0

Query: 1    MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLS 60
            MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLS
Sbjct: 305  MALISVCTMNDEENVQTHDQLESKNLTNDTANRKIEDQEVILQQQERIQDLVEENQSFLS 364

Query: 61   SIVTLKEELAETKHQFEELLKFARM-----------------------------DTPVRK 120
            SIVTLKEELA+TKHQFEELLKFARM                             DTPVRK
Sbjct: 365  SIVTLKEELAKTKHQFEELLKFARMLTKGTSKLDDILDQGMRADDKRGLRFAERDTPVRK 424

Query: 121  TVFIREGTLQNSPTNKEQGKGTEITSMPVK---SPNK------RTQNCKVALTSVKSPNS 180
            TVFIREGTLQNSPTN EQGKGTEITSMP K   SP          +NCKVALTSVKSPNS
Sbjct: 425  TVFIREGTLQNSPTNNEQGKGTEITSMPTKHLRSPRTEWCRKIHIENCKVALTSVKSPNS 484

Query: 181  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRL 240
            SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRL
Sbjct: 485  SDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRL 544

Query: 241  IQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL 300
            IQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL
Sbjct: 545  IQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNL 604

Query: 301  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPV 360
            SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPL+FLSLESCSECTAGKQVKSVHKPV
Sbjct: 605  SKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLTFLSLESCSECTAGKQVKSVHKPV 664

Query: 361  NISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTL 420
            NISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDK ETFKTCQTL
Sbjct: 665  NISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKPETFKTCQTL 724

Query: 421  FTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTL 480
            FTQLQREKNT IGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGV        
Sbjct: 725  FTQLQREKNTGIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGV-------- 784

Query: 481  QEMARAEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK---------------DHRR 540
                 AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK               DHRR
Sbjct: 785  -----AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVKYFHIFGSTCFILSDRDHRR 844

Query: 541  KWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLGR---------NLTEILM 600
            KWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDL           N T  L 
Sbjct: 845  KWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLDEGELESPARTNETTYLP 904

Query: 601  MKLRFFGI---------------------------------------------------- 660
              L    I                                                    
Sbjct: 905  SHLGLSRIDMSTPSTSAIHCNTHESEAIVSASQHTPEQTAGATDSSKCDLIPPTHTAKNH 964

Query: 661  --LFLINQ---------------------------------------------MKEKLLQ 720
               F+I                                               ++E+LLQ
Sbjct: 965  PSSFIIRDIHSGIITRKKERKDYAKMVANVCYTSLLEPTTVSAALSDEHWILTIQEELLQ 1024

Query: 721  FERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA 741
            FERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA
Sbjct: 1025 FERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFA 1084

BLAST of Pay0016463 vs. NCBI nr
Match: KAA0048721.1 (gag-pol polyprotein [Cucumis melo var. makuwa] >TYK07908.1 gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 817.4 bits (2110), Expect = 1.0e-232
Identity = 452/699 (64.66%), Positives = 477/699 (68.24%), Query Frame = 0

Query: 80  LKFARMDTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKS 139
           L+F   DTPVRKTVFIREGTLQNSPTN EQGK                +NCKVALTSVKS
Sbjct: 25  LEFVERDTPVRKTVFIREGTLQNSPTNNEQGK----------------ENCKVALTSVKS 84

Query: 140 PNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLD 199
           PNS DWYFDSGCSRHMTGNADFFSELSECK GSVVF DGGKGKIIGKGTIN  GLPFLLD
Sbjct: 85  PNSGDWYFDSGCSRHMTGNADFFSELSECKAGSVVFEDGGKGKIIGKGTINRPGLPFLLD 144

Query: 200 VRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTL 259
           VRL+QGL+ANLIS SQLCDQGY+V+F+KDRCNVLD QNKVFLSGTRLSDNCYHWDAEVTL
Sbjct: 145 VRLVQGLSANLISTSQLCDQGYKVNFSKDRCNVLDGQNKVFLSGTRLSDNCYHWDAEVTL 204

Query: 260 CNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVH 319
           CNLSKVEEA LWHKRLGHL GATISKV K +AIIGLPPLSF SLESCSEC AGKQVKSVH
Sbjct: 205 CNLSKVEEAGLWHKRLGHLGGATISKVIKANAIIGLPPLSFSSLESCSECPAGKQVKSVH 264

Query: 320 KPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTC 379
           KPVNIS TSHILELLHIDLM PMQTESLGRK YAVVCVDDFSRYTWIKFIL+K ETFKTC
Sbjct: 265 KPVNISLTSHILELLHIDLMRPMQTESLGRKRYAVVCVDDFSRYTWIKFILEKLETFKTC 324

Query: 380 QTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRN 439
           QTL TQLQREKNT IG+I+T+HG EFEN+HFAEFCDNEGIFHEFSA LT Q+NGVVE+RN
Sbjct: 325 QTLVTQLQREKNTGIGRIRTEHGCEFENKHFAEFCDNEGIFHEFSAQLTPQENGVVEKRN 384

Query: 440 RTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYELWKGRKPNVK--- 499
           +TLQEMAR             AEALNTACHIHNRVILRP TTTTSYELWKGRKPNVK   
Sbjct: 385 QTLQEMARVMIHAKQLPIQFWAEALNTACHIHNRVILRPRTTTTSYELWKGRKPNVKYFH 444

Query: 500 ------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVMESINVIIDDLG- 559
                       DHRRKWDSKSDRGIFLGY AN+RAYRVYNQ +KIV+ESINVIIDDLG 
Sbjct: 445 IFGGTCFILSDRDHRRKWDSKSDRGIFLGYSANNRAYRVYNQRTKIVIESINVIIDDLGK 504

Query: 560 ---RNL--------------------------TEILMMKLRF------------------ 619
              RNL                           E   +   F                  
Sbjct: 505 EPNRNLDDEDEVFWNSLSHKTAEGESESTTPTNETTYLPSHFDSNKIDMSTPSTSTNHSN 564

BLAST of Pay0016463 vs. NCBI nr
Match: KAA0040705.1 (retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYK14274.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 703.7 bits (1815), Expect = 1.6e-198
Identity = 420/731 (57.46%), Positives = 467/731 (63.89%), Query Frame = 0

Query: 1   MALISVCTMNDEE----NVQTHDQLES---KNLTNDTAN-RKIEDQEVILQQQERIQDLV 60
           MALIS+C MNDEE    N QTHD  ES   K LT+   + +K EDQE+ILQQQERIQDLV
Sbjct: 251 MALISLCNMNDEETAKVNTQTHDPQESTTNKYLTDGLVDKKKTEDQEIILQQQERIQDLV 310

Query: 61  EENQSFLSSIVTLKEELAETKHQFEELLKFARM--------------------------- 120
           EENQSFLSSIVTLK EL ETKHQFEELLKFARM                           
Sbjct: 311 EENQSFLSSIVTLKVELVETKHQFEELLKFARMLTNGTLKLDDILNQGRRVDDKRGLGFV 370

Query: 121 --DTPVRKTVFIREGTLQNSPTNKEQGKGTEITSMPVKSPNKRTQNCKVALTSVKSPNSS 180
             D PVR T+FIREG       + +   G +I+           ++CKVA+TSVKSPNS 
Sbjct: 371 ERDAPVRTTIFIREGEHVEFVISVD---GLDISD----------ESCKVAMTSVKSPNSG 430

Query: 181 DWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHSGLPFLLDVRLI 240
           DWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINH GLPFLLDV+L+
Sbjct: 431 DWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINHPGLPFLLDVQLV 490

Query: 241 QGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCYHWDAEVTLCNLS 300
           QGL+ANL+SISQLCDQGYQVS +KDR NVLD QNKVF S TR+SDNCYHWDAEV LCNLS
Sbjct: 491 QGLSANLLSISQLCDQGYQVSLSKDRSNVLDSQNKVFFSRTRMSDNCYHWDAEVNLCNLS 550

Query: 301 KVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCSECTAGKQVKSVHKPVN 360
           KV+EA LWHKRLGHL G TI KVTK DAIIGLPP SF SL+SC EC AGKQVKSVHKP  
Sbjct: 551 KVKEAGLWHKRLGHLGGTTIFKVTKADAIIGLPPFSFSSLKSCLECPAGKQVKSVHKP-- 610

Query: 361 ISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIKFILDKSETFKTCQTLF 420
                                                                 TCQTLF
Sbjct: 611 ------------------------------------------------------TCQTLF 670

Query: 421 TQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPLTLQQNGVVERRNRTLQ 480
           TQLQREKNT IG+I+TDHG EFEN++F EFCDNE    E S  LT +           + 
Sbjct: 671 TQLQREKNTGIGRIRTDHGREFENKYFTEFCDNEDAESE-STSLTRETTYSPPHSKTNII 730

Query: 481 EMARAEALNTACHIHNRVILRPGTTTTSYELWKGRK-PNVKDHRRKWDSKSDRGIFLGYL 540
           +M+                  P T+    E+ +G    +   H  +W   S         
Sbjct: 731 DMS-----------------TPPTSVNHSEICEGEAVVSASQHTPEWTIDS--------- 790

Query: 541 ANSRAYRVYNQCSKIVMESINVIIDDLGRNLTEILMMKLRFFGILFLINQMKEKLLQFER 600
            +S  +++              II D+   +  I   K R      + N   E+LLQFER
Sbjct: 791 TDSLKHKLMPPMHIAKNHPSIFIIGDVHSGI--ITRKKERRDYAKMVAN---EELLQFER 850

Query: 601 NQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGETFAPVA 660
           NQVWELVPKPP+ANIIGTKWIFK KT E+GRVIRN+ARLVAQGYSQIEGLD  ETFA VA
Sbjct: 851 NQVWELVPKPPHANIIGTKWIFKKKTVEQGRVIRNEARLVAQGYSQIEGLDLRETFALVA 880

Query: 661 RLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVYKLRKA 694
           RLEAIRLLLSYA F RFKLF MDVKSAFLNGYL EEVYVA+PKGFVD VH DHVYKL+KA
Sbjct: 911 RLEAIRLLLSYAWFRRFKLFPMDVKSAFLNGYLYEEVYVAKPKGFVDLVHHDHVYKLQKA 880

BLAST of Pay0016463 vs. NCBI nr
Match: AAO73527.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 685.3 bits (1767), Expect = 6.0e-193
Identity = 369/723 (51.04%), Positives = 452/723 (62.52%), Query Frame = 0

Query: 132  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINH 191
            V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H
Sbjct: 549  VVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVH 608

Query: 192  SGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCY 251
             GLP L  V L++GL ANLISISQLCD+G+ V+F K  C V + +++V + G+R  DNCY
Sbjct: 609  DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCY 668

Query: 252  HWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCS 311
             W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C 
Sbjct: 669  LWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 728

Query: 312  ECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIK 371
            EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+ 
Sbjct: 729  ECQIGKQVKMSHQKLRHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVN 788

Query: 372  FILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL 431
            FI +KSETF+  + L  +LQREK+  I +I++DHG EFEN  F EFC +EGI HEFSA +
Sbjct: 789  FIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRFTEFCTSEGITHEFSAAI 848

Query: 432  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYEL 491
            T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+
Sbjct: 849  TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 908

Query: 492  WKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVM 551
            WKGRKP+VK               + RRK D KSD GIFLGY  NSRAYRV+N  ++ VM
Sbjct: 909  WKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 968

Query: 552  ESINVIIDDLG---------------------------------------------RNLT 611
            ESINV++DDL                                              R+ T
Sbjct: 969  ESINVVVDDLSPARKKDVEEDVRTLGDNVADAAKSGENAENSDSATDESNINQPDKRSST 1028

Query: 612  EILMM----------------KLRFFGIL---------------------FLINQMKEKL 671
             I  M                + R   I+                     F IN M+E+L
Sbjct: 1029 RIQKMHPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEEL 1088

Query: 672  LQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET 731
             QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Sbjct: 1089 EQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDET 1148

Query: 732  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVY 741
            FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY
Sbjct: 1149 FAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVY 1208

BLAST of Pay0016463 vs. NCBI nr
Match: AAO73521.1 (gag-pol polyprotein [Glycine max])

HSP 1 Score: 684.9 bits (1766), Expect = 7.8e-193
Identity = 369/723 (51.04%), Positives = 452/723 (62.52%), Query Frame = 0

Query: 132  VALTSVKSPNSSDWYFDSGCSRHMTGNADFFSELSECKVGSVVFGDGGKGKIIGKGTINH 191
            V  TS+++    DWY DSGCSRHMTG  +F   +  C    V FGDG KGKIIG G + H
Sbjct: 547  VVHTSLRASAKEDWYLDSGCSRHMTGVKEFLLNIEPCSTSYVTFGDGSKGKIIGMGKLVH 606

Query: 192  SGLPFLLDVRLIQGLAANLISISQLCDQGYQVSFNKDRCNVLDVQNKVFLSGTRLSDNCY 251
             GLP L  V L++GL ANLISISQLCD+G+ V+F K  C V + +++V + G+R  DNCY
Sbjct: 607  DGLPSLNKVLLVKGLTANLISISQLCDEGFNVNFTKSECLVTNEKSEVLMKGSRSKDNCY 666

Query: 252  HWDAEVT----LCNLSKVEEARLWHKRLGHLSGATISKVTKVDAIIGLPPLSFLSLESCS 311
             W  + T     C  SK +E R+WH+R GHL    + K+    A+ G+P L       C 
Sbjct: 667  LWTPQETSYSSTCLSSKEDEVRIWHQRFGHLHLRGMKKIIDKGAVRGIPNLKIEEGRICG 726

Query: 312  ECTAGKQVKSVHKPVNISSTSHILELLHIDLMGPMQTESLGRKWYAVVCVDDFSRYTWIK 371
            EC  GKQVK  H+ +   +TS +LELLH+DLMGPMQ ESLG K YA V VDDFSR+TW+K
Sbjct: 727  ECQIGKQVKMSHQKLQHQTTSRVLELLHMDLMGPMQVESLGGKRYAYVVVDDFSRFTWVK 786

Query: 372  FILDKSETFKTCQTLFTQLQREKNTSIGQIQTDHGHEFENQHFAEFCDNEGIFHEFSAPL 431
            FI +KSETF+  + L  +LQREK+  I +I++DHG EFEN    EFC +EGI HEFSA +
Sbjct: 787  FIREKSETFEVFKELSLRLQREKDCVIKRIRSDHGREFENSRLTEFCTSEGITHEFSAAI 846

Query: 432  TLQQNGVVERRNRTLQEMAR-------------AEALNTACHIHNRVILRPGTTTTSYEL 491
            T QQNG+VER+NRTLQE AR             AEA+NTAC+IHNRV LR GT TT YE+
Sbjct: 847  TPQQNGIVERKNRTLQEAARVMLHAKELPYNLWAEAMNTACYIHNRVTLRRGTPTTLYEI 906

Query: 492  WKGRKPNVK---------------DHRRKWDSKSDRGIFLGYLANSRAYRVYNQCSKIVM 551
            WKGRKP+VK               + RRK D KSD GIFLGY  NSRAYRV+N  ++ VM
Sbjct: 907  WKGRKPSVKHFHIFGSPCYILADREQRRKMDPKSDAGIFLGYSTNSRAYRVFNSRTRTVM 966

Query: 552  ESINVIIDDLG---------------------------------------------RNLT 611
            ESINV++DDL                                              R+ T
Sbjct: 967  ESINVVVDDLSPARKKDVEEDVRTSGDNVADAAKSGENAENSDSATDESNINQPDKRSST 1026

Query: 612  EILMM----------------KLRFFGIL---------------------FLINQMKEKL 671
             I  M                + R   I+                     F IN M+E+L
Sbjct: 1027 RIQKMHPKELIIGDPNRGVTTRSREVEIVSNSCFVSKIEPKNVKEALTDEFWINAMQEEL 1086

Query: 672  LQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGLDFGET 731
             QF+RN+VWELVP+P   N+IGTKWIFKNKT+EEG + RNKARLVAQGY+QIEG+DF ET
Sbjct: 1087 EQFKRNEVWELVPRPEGTNVIGTKWIFKNKTNEEGVITRNKARLVAQGYTQIEGVDFDET 1146

Query: 732  FAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFVDPVHEDHVY 741
            FAPVARLE+IRLLL  AC  +FKL+QMDVKSAFLNGYL EEVYV QPKGF DP H DHVY
Sbjct: 1147 FAPVARLESIRLLLGVACILKFKLYQMDVKSAFLNGYLNEEVYVEQPKGFADPTHPDHVY 1206

BLAST of Pay0016463 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 144.8 bits (364), Expect = 2.7e-34
Identity = 74/187 (39.57%), Positives = 115/187 (61.50%), Query Frame = 0

Query: 553 MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGL 612
           M +++   E    WE+   PP    IG KW++K K + +G + R KARLVA+GY+Q EG+
Sbjct: 102 MDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEGI 161

Query: 613 DFGETFAPVARLEAIRLLLSYACFWRFKLFQMDVKSAFLNGYLCEEVYVAQPKGFV---- 672
           DF ETF+PV +L +++L+L+ +  + F L Q+D+ +AFLNG L EE+Y+  P G+     
Sbjct: 162 DFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQG 221

Query: 673 DPVHEDHVYKLRKALYRLKQAPRACYERLSTYLLQQGYQRGSADQTMFIYRQGTDFLIIQ 732
           D +  + V  L+K++Y LKQA R  + + S  L+  G+ +  +D T F+    T FL + 
Sbjct: 222 DSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVL 281

Query: 733 IYVDGII 736
           +YVD II
Sbjct: 282 VYVDDII 288

BLAST of Pay0016463 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 84.0 bits (206), Expect = 5.7e-16
Identity = 48/118 (40.68%), Positives = 67/118 (56.78%), Query Frame = 0

Query: 553 MKEKLLQFERNQVWELVPKPPYANIIGTKWIFKNKTDEEGRVIRNKARLVAQGYSQIEGL 612
           M+E+L    RN+ W LVP P   NI+G KW+FK K   +G + R KARLVA+G+ Q EG+
Sbjct: 44  MQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGI 103

Query: 613 DFGETFAPVARLEAIRLLLSYA--------CFWRFKL-FQMDVKSAFLNGYLCEEVYV 662
            F ET++PV R   IR +L+ A          W FK+ F M +   F   ++C  + V
Sbjct: 104 YFVETYSPVVRTATIRTILNVAQQLEVGQSINWMFKMHFSMGI---FKKKFICINLLV 158
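
The identity and positive-match percentages in the HSP headers above are the matched-position counts divided by the alignment length. A minimal Python sketch of that arithmetic, using the counts reported for the KAA0059225.1 hit; the function name `percent` is illustrative only and is not part of any BLAST tool:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the KAA0059225.1 HSP header: 694/901 identical, 703/901 positive.
identity = percent(694, 901)
positives = percent(703, 901)
print(f"Identity {identity:.2f}%, Positives {positives:.2f}%")
```

The same ratio reproduces the other headers, for example 74/187 for the AT4G23160.1 hit.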

The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
P10978 | 1.4e-64 | 26.65 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q9ZT94 | 1.7e-41 | 23.91 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 4.0e-38 | 45.40 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P04146 | 6.8e-30 | 40.00 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92520 | 8.0e-15 | 40.68 | Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ...

Match Name | E-value | Identity | Description
A0A5A7V046 | 0.0e+00 | 77.03 | Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold430G...
A0A5D3C9Q6 | 4.9e-233 | 64.66 | Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G...
A0A5A7TGY4 | 7.9e-199 | 57.46 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
Q84VH8 | 2.9e-193 | 51.04 | Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1
Q84VI4 | 3.8e-193 | 51.04 | Gag-pol polyprotein OS=Glycine max OX=3847 GN=gag-pol PE=4 SV=1

Match Name | E-value | Identity | Description
KAA0059225.1 | 0.0e+00 | 77.03 | gag-pol polyprotein [Cucumis melo var. makuwa]
KAA0048721.1 | 1.0e-232 | 64.66 | gag-pol polyprotein [Cucumis melo var. makuwa] >TYK07908.1 gag-pol polyprotein [...
KAA0040705.1 | 1.6e-198 | 57.46 | retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
AAO73527.1 | 6.0e-193 | 51.04 | gag-pol polyprotein [Glycine max]
AAO73521.1 | 7.8e-193 | 51.04 | gag-pol polyprotein [Glycine max]

Match Name | E-value | Identity | Description
AT4G23160.1 | 2.7e-34 | 39.57 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1 | 5.7e-16 | 40.68 | Reverse transcriptase (RNA-dependent DNA polymerase)
InterPro
Analysis Name: InterPro Annotations of Melon (Payzawat) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 28..48
None | No IPR available | COILS | Coil | Coil | coord: 66..86
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 101..124
None | No IPR available | PANTHER | PTHR45895 | FAMILY NOT NAMED | coord: 145..527; coord: 569..736
IPR001584 | Integrase, catalytic core | PFAM | PF00665 | rve | coord: 330..426; e-value: 8.2E-10; score: 38.9
IPR001584 | Integrase, catalytic core | PROSITE | PS50994 | INTEGRASE | coord: 317..447; score: 18.22897
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 322..496; e-value: 6.1E-36; score: 125.6
IPR013103 | Reverse transcriptase, RNA-dependent DNA polymerase | PFAM | PF07727 | RVT_2 | coord: 563..736; e-value: 2.2E-49; score: 168.3
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 260..314; e-value: 2.5E-10; score: 40.1
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 349..480
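
Each InterPro alignment above is given as `coord: start..end` on the protein. A minimal Python sketch for pulling those ranges out of the text and computing the span of each match; the helper name `domain_spans` is hypothetical, and the length calculation assumes the coordinates are inclusive at both ends, which the page itself does not state:

```python
import re

# Matches entries such as "coord: 330..426".
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_spans(alignment_text: str) -> list[tuple[int, int, int]]:
    """Extract (start, end, length) for every 'coord: start..end' entry."""
    spans = []
    for start, end in COORD_RE.findall(alignment_text):
        s, e = int(start), int(end)
        spans.append((s, e, e - s + 1))  # assumption: inclusive coordinates
    return spans


# Values copied from the table: the rve (PF00665) match and the two PANTHER ranges.
print(domain_spans("coord: 330..426"))
print(domain_spans("coord: 145..527 coord: 569..736"))
```

Under the inclusive-coordinate assumption, the rve match spans 97 residues and the two PANTHER ranges span 383 and 168 residues.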

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Pay0016463.1 | Pay0016463.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015074 | DNA integration
molecular_function | GO:0003676 | nucleic acid binding