Pay0003779 (gene) Melon (Payzawat) v1

Overview
NamePay0003779
Typegene
OrganismCucumis melo L. var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Locationchr03: 20756519 .. 20763813 (-)
RNA-Seq ExpressionPay0003779
SyntenyPay0003779
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDSexon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTGGAAAAATGTTCTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATAAGAATTTTTTGGTAACGTTTATCCTTAATTCTTTACCTTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATGTGCATGAATTACAAAGTATGCTCATTCAAGAGGAAGCGAGACTTAAGAAACCAATAATTCACTCTGCCAATCTCATGGGTCATAAAGAGCTGGAAAGAAACCTGGAAAAAGAATGGGAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTCAAAGTTCCAGTTGAAGCTGTGGGAACCTATCGTTTAACTTTAGATACTGGACATCATTTAGACCTTTTTGATACCTTTTATGTTCCTTCTATTTCTCGTAATTTGATTTCCTTGTCAAAACTTGATACTTCAGGTTATTACTTTAAATTTGGGAATGAGTGTTTTAGTTTATTCAAACAGAACATTTTTATTGGTTCTGGTATTCTTTGTGATGACTTATATAAATTAAAGCTTGATAATATTTTTGCTGAGAGTTTGTTAACCTTGCATCATAATGTTGGTACTAAACGTGGTCAAACTAATGAATCGTCAGCTTACTTGTGGCATAAACGTTTAGGTCACATATCCAAAGAAAGAATTAAAAGATTGATAAAAAATGAAATTCTTCCAGATTTGGATTTTACTGACCTTGGAATTTGTGTGGATTGTATTAAAGGAAAACAAACAAAACACACAGTTAATAAAGAAGCCACAAGAAGCTCACATCTCCTTGAAATTATACACACTCATATTTGTGGGCCTTTTGATGTTCCATCTTTTGGTGGAGAAATGTATTTTATCACCTTTATTGATGATTTCTCACGTTATGGTTATATCTATTTATTGCATGAGAAATCTCAAGCAATAGATGTCTTAAAAGTATTTATAAATGAAGTTGAAAGGCAATTAGATAGAAAGGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGTTAATTAATTCATCTTTACCTGTGTCCTTGTGGATGTATGCATTAAGAACAGCTCAATATTTATTAAACAGGGTTCCTAGTAAGTCAGTTCCAAAGACACATTTTGAACTGTGGACATGAAGGAAACCTAGTTTAAGACACCTACATGTTTGGGGTTGTCAAGCGAAGTAAGAATTTATAATCCACATGAAAAGAAACTGGTTTCAAGAACAACCAGTGGTTTCTTCATTGGTTATCCAGAAAAATCAAAAGGGTATAGATTTTATTGTCCTAACCATAGTACGAGAATAGTTGAAACTGGAAATGTAAGGTTCATTGAGAATGACATAATTAGTGGGAGTTTGGAACCGCGAAAAGTGGAAATTCAAGAAGTTAGGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATGAACCTGTAACTGAGGGACCACAAGAAATAGAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGACTTAAGCATTGATAATGATCCAGTTTCGTTTTCACAAGTCATTAAAGGAGATAATTCTACCAAATAGTTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTCTTTAAGACCAAACGTGACTCAAATGGAAATATTGAACGATACAAGGCTAGACTTGTTGCCAAAGGTTATAGTCAGAAAGATGGCATTGACTACAAAGAGACTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCTTAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTCCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACCAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTCAAATTTTGCCGGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAGCTGAAGGAGCAATTTCATGGAAAAGTGCAATGCAGTCTATTATCGCTGCATCCCACTACAAGAAAATGGAGCATTCCCGACGCCGAAAAGCACGTCGGCATAAAGAACGTCGGGAATAAGGGCTTTCCCGACGCGTTGTTGAGGGCGTCGGGAGATGCGTCGGGAAAACCATCTTTCCCGACGCACCATGAAGGCGTCGGCAAGAATACGTCGGGAAAAGGGGTTTTTCCCGACGCCGTGAACAGTGCGTCGAGAGCGGCGTCGGGTTCCCGACGCCGCGTGAAACGTCGGGAAAAGGGGTTTAATGAGCGTCGGGAATTCCCCTTTTCCCGACGCATTTTGAGGGATTTCCCGACGCCACGTCACTGCGTCGGGAAATCCCATTTAAATTCGATTCAGTTCTGTGAATTACAGAACCGAAACCGAGAGGAGGAGAAAAAGAAAGGAGAAGAAGAAAAAGTTGCCGCCGTTGAAATTTCTTGTTGTTCCGCCGCCGTCCGTCGCCGACCGCCCTCGCCGTTGTTCCTCGTCGCCGTCTGCCACTTTAGGTAAGTGAAATATGTAAATTTATTTAGTTTAAGTTTATGTTTTAGAGTTTTTTTTTGATAAAAATTAGTTTAGGGTTTGTTTGTTTTTTTGTTTTTTTGTTTCAAAGTTAAAAAAATGTATGTATTAAATTAACGTATATTCAAATTGTTTGCTAGATAGAGTTTATTGAAATTGTTTAAAGTTCTTTTTTGTTTTTGTTTTAGTTTTGCTTTAGTTAATTAGTTTTAGGATTAGTTTTAGAATTTAGATTTTGAAATAGTTTTAGGATATAGATTTTGAATTAGTTTTAGGTTTTAGTGTTGAATTTGAATTTGAAGTGGTTTTAGGATTTAAGATTGACGTTGAGATTGAAGTTGAAATTGAATTTGAGATTGATAAAAAATTTGAATGTGTAGAGATTTTTGAGGTTATATTGAATTGGGGGGGGGGTGTTTATTGAATTTGAGATTGTGATTTAATTTGAATTTAAGATTGAATTTGAGATAAAGATTTAATTTGAATTTTGAATTTGACACTAAAACAAATTAATTTGAATTTAAGATTATATATCTTTGTCATGCCAACAAAGTGTCTTCATGCTCATTCTATCATTTTGTTTCTTTATGTATAATATAAACTTCTTTTTGATATCCTTCCATTATTCAATTCAATAATAAACTAGAAGCCAAGATATAATTTGTCCAAAATGCTATCTATTTCATTTTCAACAACTACAACAATAATTATTATGAAATGAAATTACTACTTATCTGGGGAGAGGGAGGTTTAAGTTAAATTGAAAATGTTTGTTTTCTATGTCCATAGCCATTATGTCATATCGACGATCAAATTTTATGGAGACGGGCGATATGTTCCTCCAGTTTGAGGACGATTTAGATAATAACATCGCGGAGGGGAGGGTCATCATCTGTGGGCGACAATACGGGTGAGTCTAAGAATTTTCTTCATTTTAACTTACAAATATTTCATGTTCTTTTTTCTAACTTTATATGTTATTGTCTATTTATTGAGCAGGGTCTTTTCTCAACAAACGACTCCGACTCCTAGGAGACGTGCGCAGTCTCGACTCTTGGAGTTAGAGCGCCACGTTGCAATAAATGGGCGCATTCCGATGACGATCGCCCTGGAGCGGAGAAGCCTATTTCTCCACACGCCGTTCGCTTCAGCCAGGCGATAGGCGTGTGCGTGCGAAAGACATTTCCCGTCCGCTGTCTTAAGTGGACGGACGTTGGGAGAGAATACTTGAGGTCGTCAAGGGCGACCTCCAGGTAATTAAATGCACTACACATTTATATATGATTTCATTTGAAACATATCTAACTTTAACCAATCTAATGTGTTATGTTTGTAATGTGCAGCGATTCTTTGTGCTTGATTTCAATGATCAAGCAATGAACAGGTTTGTTGAGCATCAGATGCTCACGACCTTTAAAGAGTTCCGGGCCGACTGTCATAGACATTTCAAAAAGTACAGCGACCCGGAGGAGGCTCGTGCCAACCCACCAAACGCATTGGTTGGACGTGATGAGGATTGGCACTTCCTCTGCGACCATTATATCAGCCGTGCATTCCAGGTATTTGTCATGATTAATTATGATAACAAGTTTTTATATGTATAAGAAATAAACAATATAATTGTTTTAATGCAGGAGCAATCACGGACAAACAAGGCTGCTAGACAGAAGCAGCCTTACAATCATAGTAGCGGGTCCAAGTCGTTTCTACAACGACAGTATGAGCTCGCTGAAAGAAGAGGGCAGCCGGTCGATCGTGTGGAATTGTTCCGGGAAACACACGTTCGAGCTGGGACATTCGTGTCGCAAGCCGCCGAGGATGCGCATGTAAGTTATACCCTTATTAATTATCCACTTTTATTATATATTTTTGCGCGTTACCTAACATAATTTTTAAATTTGTTGCAGAATCAAATGCTGGAACTCCAATCCCAGCCTATCCCAGAGGGTAGTCAGCCACTCTCTGAGGATGAGATATGCGATCAGGTGTTGGGTAGACGACCAGGCTACTCAAAAGGCCTTGGTTGGGGACCCAAGCCGAAGGCCCGCAGAACGGCAAGTGCAAGCAGTTCGTCGACATCTTGTTCGCAGTCCACACAAAAAGAGATTGAATTACAAGCTAAACTTCATGAAGCTTTGGAACGGATTGAAGTACAAGATAGAAATCACCAAGCATTAGCTTCACAAGTGGAAGCTATGAAAAAGATGATTGAAGACCTAACTCGTGCACAACAGGGACCACCACATGATCCCTAGCTCTGCGGTACGTCGTATATGTCTCTATTATTCTTAAGTTTTTGCAATATATGATTATGTATTAAACTTAGTTATTTTTATACCATTTTTAGGACCGAAGGTGTAGAATGACGCGCATACGCACCTCGTTGGGAGATTTTTCATTATGTTTATGTTTTTTCTATATTGAGAACTATATTTTGTTGTAGTCAACTTATTCGTATTATTAATTTTAATTCTATTTTTTTAATTTGTGTTTGTATATTTATTAATTTATTTTAATTTAATCCAAAACGTCGCGAAATTTTTTTGAGCAATCCGAACATCATTATGAATGTTTGAGCAAAATATTATAAGGAAAAAAGTAAAAATAAAATATTTAAAAAAATATATAAAAAAATATATAAAAAAATATTTTCCGACGTTCATTACGTCGGGAAAAAAACGTCGGAAAACAGGTTTTCCCGACGCCGGGAGATGCGTCGGCATAGACGGCGTCGGGAATAAGGTTTTCCCGACGCCGTCATGCCGACGCATCTCCCGGCGTCGGGAAAGGCTTTCCCGACTCTGATTTGGTACGGCGTCGGGAAAGCCTCTCGCGACGCATTTCTGCCGACGTTCTTCCCGACGTCGTTTTGTACGTCGGGATATCCTTTCCCGACGTACTTTGCATTTTCGCCGACGTATTTGTGCGTCGGGAGTACCCCCGTCTCTTGTAGTGTCCACTATGGAAGCTGAATTTGTAGCATGCTTTGAGGCTACAGTTCATGGTTTATGGCTGCGGAACTTTATCTCAGGACTTGGAATTGCCGACAGTATTGCCAAGCCGCTGAGAATTTATTGTGATAATTCTAGTTTTCTTCTAAAAAAAACGACATGTATTCTAAAGGTGCTAAACATATGGAATTAAAATACTTTGCCATTAAAGAAGAAGTTCAGAAAGAGAGGGTGTCAGTTGAACACATTAGCACTAAACTTATGATTGCGGATCCACTGACTAAAGGATTGCCACCAAAGATGTTCAATGATCACGTTGAACGTATGGACATCAGTAGATATCATCATTGA

mRNA sequence

ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTGGAAAAATGTTCTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATAAGAATTTTTTGGTAACGTTTATCCTTAATTCTTTACCTTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATAATGGGAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATGAACCTGTAACTGAGGGACCACAAGAAATAGAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGACTTAAGCATTGATAATGATCCAGTTTCGTTTTCACAATTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTCTTTAAGACCAAACGTGACTCAAATGGAAATATTGAACGATACAAGGCTAGACTTGTTGCCAAAGGTTATAACTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCTTAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTCCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACCAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTCAAATTTTGCCGGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAGCTGAAGGAGCAATTTCATGGAAAATACCCCCGTCTCTTGTAGTGTCCACTATGGAAGCTGAATTTGTAGCATGCTTTGAGGCTACAGTTCATGGTTTATGGCTGCGGAACTTTATCTCAGGACTTGGAATTGCCGACAGTATTGCCAAGCCGCTGAGAATTTATTGTGATAATTCTAGTGCTAAACATATGGAATTAAAATACTTTGCCATTAAAGAAGAAGTTCAGAAAGAGAGGGTGTCAGTTGAACACATTAGCACTAAACTTATGATTGCGGATCCACTGACTAAAGGATTGCCACCAAAGATGTTCAATGATCACGTTGAACGTATGGACATCAGTAGATATCATCATTGA

Coding sequence (CDS)

ATGTTTATGCGAATGACTGTAGCAAACAATATTAAGTCCACAATTAAGAACACTGAAGATGCTAAGGAATTTATGAAATCTGTGGAAAAATGTTCTCAGTCAGAGTCGGCTGACAAGTCACTTGCTGGAACACTTATGAGTACTTTAACCAACATCAAGTTTGATGGTTCTCGTACTATACATGAGCATATCCTTGAAATGACGAACTTGGCAGCAAGGTTAAAGACCATGGGAATGGAAGTTAATAAGAATTTTTTGGTAACGTTTATCCTTAATTCTTTACCTTCAGAGTATGGTCCATTTCACATGAACTATAACACTCTGAAAGATAAATGGAATAATGGGAAGGGCAATCATGGACAATTAAAGGTAAAACAGTCATCTGCCCCAATCCACAAAAAGGGACAAATTAAGGATAAGTGTCGTTTTTGCAACAAACCTGGACACTATCAGAAAGATTGTCTAAAACGTAAGGCATGGTTCGAGAATAAAGGTAAGCATAATGCTTTAGTATGTTTCGAATCAAACTTAACTGAAGTTCCTTATAATACATGGTGGATTGATTCTGGTTGTACCATTCATGTTTCCAATACGATGCAGGGATTCCTTACGACCCGAACCACAAACCCAAATGAGAGATTCATTTTTATGGGAAACAGAGTGAAAATCTTAAGATCTGATAGAGGTGGTGAGTATTATGGAAAATGCCCCGGTCCATTCGCTAAATTCCTAGAAAGTCATGGCATATGTGCTCAATACACAATGCCAGGAACACCACAACAAAATGGTGTTGCAGAAAGGCGAAATCGTACATTAATGAATATGGTTAGAAGCATGGTGGAAATTCCTTCATCTATAACTTCTTCTCAAGTTGTTGTTCCTGTAGTTGTTGACTCTGTTAACAATCCACAAGAACAACAAATTAATGGTCAAACACCACATAATGATATTGTAACAAATGAACCTGTAACTGAGGGACCACAAGAAATAGAATTAAGAAGATCTGTAAGATCAAGAAGATCAGCTATTTCTGATGACTATTTGGTTTATTTGCATGAGTCAGAATTTGACTTAAGCATTGATAATGATCCAGTTTCGTTTTCACAATTAGATGCCATGAAAGAAGAGTTAAAATCTATGAATGATAATGAAGTCTGGGATCTTGTAGAATTGCCTAAAGAAAGTAAAAGAGTTGGGTGTAAATGGGTCTTTAAGACCAAACGTGACTCAAATGGAAATATTGAACGATACAAGGCTAGACTTGTTGCCAAAGGTTATAACTTTTCTCCTGTCTCGAAAAAGGACTCATTAAGAATTATTATGGCTTTGGTAGCTCATTATGATTTAGAGCTTCATCAAATGGATGTGAAAACCGCCTTTCTAAATGGAAATTTAGATGAAGAAGTGTTCATGGATCAACCAGAAGGTTTTATGGTTGAAGGAAAGGAACATATGGTGTGTAAATTAAAGAGGTCAATATATGGACTTAAACAAGCTTCCAGACAGTGGTATCTTAAGTTTAATGATACCATCACATCTTTTGGTTTTAAAGAAAACATCGTTGATCGATGTATATACCTAAAGATCAGTGGGAGTAAGTTTATAATTCTTGTTCTATATGTTGATGACATCTTGCTTGCTACAAATGACTTTGGTTTATTATGTCAAACCAAAGAATTTCTTTCTAAAAACTTTGAAATGAAAGATATGGGTGAGGCATCCTATGTGATTGGAATTGAAATATTCCGTGATCGAACACATGGATTGTTAGGATTGTCTCAAAAGGCCTATATTAATAAAGTTTTAGAGAAATTTAAGATGGACAAATGCTCTTCAAGTGTAGTTCCAATTCAGAAGGGAGATAAATTTAGTCTCATGCAATGTCCAAAAAATGAATTGGAACGAAATCAGATGGAAACTATTCCTTATGCATCTATTGTTGGAAGCTTATTGTATGCACAAACTTGCACTAGACCAGACATCAGTTTTGCTGTGGGTATGCTAGGCAGGTATCAAAGTAATCCAGGAATGGATCATTGGAAAGCTGCAAAGAAAGTTTTAAGGTATCTGCAAGGAACAAAAGATTATATGCTTACTTACAAGAGATCTGATCATCTTGAGGTGATTGGATATTCAGATTCAAATTTTGCCGGATGTGTGGATACAAGAAAATCCACTTTTGGCTATTTGTTCCTTTTAGCTGAAGGAGCAATTTCATGGAAAATACCCCCGTCTCTTGTAGTGTCCACTATGGAAGCTGAATTTGTAGCATGCTTTGAGGCTACAGTTCATGGTTTATGGCTGCGGAACTTTATCTCAGGACTTGGAATTGCCGACAGTATTGCCAAGCCGCTGAGAATTTATTGTGATAATTCTAGTGCTAAACATATGGAATTAAAATACTTTGCCATTAAAGAAGAAGTTCAGAAAGAGAGGGTGTCAGTTGAACACATTAGCACTAAACTTATGATTGCGGATCCACTGACTAAAGGATTGCCACCAAAGATGTTCAATGATCACGTTGAACGTATGGACATCAGTAGATATCATCATTGA

Protein sequence

MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTIHEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWNNGKGNHGQLKVKQSSAPIHKKGQIKDKCRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQLDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYNFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISWKIPPSLVVSTMEAEFVACFEATVHGLWLRNFISGLGIADSIAKPLRIYCDNSSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADPLTKGLPPKMFNDHVERMDISRYHH
Homology
BLAST of Pay0003779 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 444.1 bits (1141), Expect = 3.5e-123
Identity = 288/783 (36.78%), Postives = 403/783 (51.47%), Query Frame = 0

Query: 218  GNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLMNMVR 277
            G ++K LRSD GGEY  +    F ++  SHGI  + T+PGTPQ NGVAER NRT++  VR
Sbjct: 541  GRKLKRLRSDNGGEYTSR---EFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVR 600

Query: 278  SMV-------------------------------EIPSSI-TSSQV-------------- 337
            SM+                               EIP  + T+ +V              
Sbjct: 601  SMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFA 660

Query: 338  ------------------------------------------------------------ 397
                                                                        
Sbjct: 661  HVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMSE 720

Query: 398  -----VVP---VVVDSVNNP-------QEQQINGQTPHNDIVTNEPVTEGPQEIE----- 457
                 ++P    +  + NNP        E    G+ P   I   E + EG +E+E     
Sbjct: 721  KVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPTQG 780

Query: 458  ------LRRSVRSR---RSAISDDYLVYL--HESEFDLSIDNDPVSFSQLDAMKEELKSM 517
                  LRRS R R   R   S +Y++     E E    + + P     + AM+EE++S+
Sbjct: 781  EEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMKAMQEEMESL 840

Query: 518  NDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYN----------FSP 577
              N  + LVELPK  + + CKWVFK K+D +  + RYKARLV KG+           FSP
Sbjct: 841  QKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKGIDFDEIFSP 900

Query: 578  VSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLK 637
            V K  S+R I++L A  DLE+ Q+DVKTAFL+G+L+EE++M+QPEGF V GK+HMVCKL 
Sbjct: 901  VVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLN 960

Query: 638  RSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLK-ISGSKFIILVLYVDDILLATN 697
            +S+YGLKQA RQWY+KF+  + S  + +   D C+Y K  S + FIIL+LYVDD+L+   
Sbjct: 961  KSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGK 1020

Query: 698  DFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDK 757
            D GL+ + K  LSK+F+MKD+G A  ++G++I R+RT   L LSQ+ YI +VLE+F M  
Sbjct: 1021 DKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKN 1080

Query: 758  CSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLG 817
                  P+    K S   CP    E+  M  +PY+S VGSL+YA  CTRPDI+ AVG++ 
Sbjct: 1081 AKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVS 1140

Query: 818  RYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGY 838
            R+  NPG +HW+A K +LRYL+GT    L +  SD + + GY+D++ AG +D RKS+ GY
Sbjct: 1141 RFLENPGKEHWEAVKWILRYLRGTTGDCLCFGGSDPI-LKGYTDADMAGDIDNRKSSTGY 1200

BLAST of Pay0003779 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 324.3 bits (830), Expect = 4.1e-87
Identity = 188/497 (37.83%), Postives = 279/497 (56.14%), Query Frame = 0

Query: 371  DAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY---- 430
            +A+  EL +   N  W + + P+    V  +WVF  K +  GN  RYKARLVA+G+    
Sbjct: 908  EAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKY 967

Query: 431  ------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVE 490
                   F+PV++  S R I++LV  Y+L++HQMDVKTAFLNG L EE++M  P+G  + 
Sbjct: 968  QIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQG--IS 1027

Query: 491  GKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISG--SKFIILV 550
                 VCKL ++IYGLKQA+R W+  F   +    F  + VDRCIY+   G  ++ I ++
Sbjct: 1028 CNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVL 1087

Query: 551  LYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYI 610
            LYVDD+++AT D   +   K +L + F M D+ E  + IGI I  +     + LSQ AY+
Sbjct: 1088 LYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRI--EMQEDKIYLSQSAYV 1147

Query: 611  NKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTR 670
             K+L KF M+ C++   P+     + L       L  ++    P  S++G L+Y   CTR
Sbjct: 1148 KKILSKFNMENCNAVSTPLPSKINYEL-------LNSDEDCNTPCRSLIGCLMYIMLCTR 1207

Query: 671  PDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLE--VIGYSDSNF 730
            PD++ AV +L RY S    + W+  K+VLRYL+GT D  L +K++   E  +IGY DS++
Sbjct: 1208 PDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDW 1267

Query: 731  AGCVDTRKSTFGYLFLLAE-GAISW--KIPPSLVVSTMEAEFVACFEATVHGLWLRNFIS 790
            AG    RKST GYLF + +   I W  K   S+  S+ EAE++A FEA    LWL+  ++
Sbjct: 1268 AGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLT 1327

Query: 791  GLGIADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHIST 838
             + I   +  P++IY DN               AKH+++KY   +E+VQ   + +E+I T
Sbjct: 1328 SINI--KLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPT 1387

BLAST of Pay0003779 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 300.8 bits (769), Expect = 4.8e-80
Identity = 181/504 (35.91%), Postives = 271/504 (53.77%), Query Frame = 0

Query: 371  DAMKEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERYKARLVAKGYN-- 430
            +AM  E+ +   N  WDLV  P      VGC+W+F  K +S+G++ RYKARLVAKGYN  
Sbjct: 970  NAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQR 1029

Query: 431  --------FSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV 490
                    FSPV K  S+RI++ +       + Q+DV  AFL G L ++V+M QP GF+ 
Sbjct: 1030 PGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFID 1089

Query: 491  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL 550
            + + + VCKL++++YGLKQA R WY++  + + + GF  ++ D  +++   G   + +++
Sbjct: 1090 KDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLV 1149

Query: 551  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYIN 610
            YVDDIL+  ND  LL  T + LS+ F +KD  E  Y +GIE  R  T   L LSQ+ YI 
Sbjct: 1150 YVDDILITGNDPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTG--LHLSQRRYIL 1209

Query: 611  KVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRP 670
             +L +  M        P+    K SL    K        +   Y  IVGSL Y    TRP
Sbjct: 1210 DLLARTNMITAKPVTTPMAPSPKLSLYSGTK------LTDPTEYRGIVGSLQYL-AFTRP 1269

Query: 671  DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGC 730
            DIS+AV  L ++   P  +H +A K++LRYL GT ++ +  K+ + L +  YSD+++AG 
Sbjct: 1270 DISYAVNRLSQFMHMPTEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGD 1329

Query: 731  VDTRKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVACFEATVHGLWLRNFISGLGI 790
             D   ST GY+  L    ISW  K    +V S+ EAE+ +    +    W+ + ++ LGI
Sbjct: 1330 KDDYVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGI 1389

Query: 791  ADSIAKPLRIYCDN-------------SSAKHMELKYFAIKEEVQKERVSVEHISTKLMI 849
               + +P  IYCDN             S  KH+ + Y  I+ +VQ   + V H+ST   +
Sbjct: 1390 --RLTRPPVIYCDNVGATYLCANPVFHSRMKHIAIDYHFIRNQVQSGALRVVHVSTHDQL 1449

BLAST of Pay0003779 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 293.5 bits (750), Expect = 7.7e-78
Identity = 206/619 (33.28%), Postives = 304/619 (49.11%), Query Frame = 0

Query: 256  PGTPQQNGVAERRNRTLMNMVRSMVEI--PSSITSSQVVVPVVVDSVNNPQEQQINGQTP 315
            P +P QN    +   +  ++      I  P+S +SS    P +   +  P   Q+N Q P
Sbjct: 847  PNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAP 906

Query: 316  HNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQLDAM 375
             N   T+   T     I       S  ++++ +      E    +    D        AM
Sbjct: 907  VN---THSMATRAKDGIRKPNQKYSYATSLAAN-----SEPRTAIQAMKDD---RWRQAM 966

Query: 376  KEELKSMNDNEVWDLVELPKESKR-VGCKWVFKTKRDSNGNIERYKARLVAKGYN----- 435
              E+ +   N  WDLV  P  S   VGC+W+F  K +S+G++ RYKARLVAKGYN     
Sbjct: 967  GSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRYKARLVAKGYNQRPGL 1026

Query: 436  -----FSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGK 495
                 FSPV K  S+RI++ +       + Q+DV  AFL G L +EV+M QP GF+ + +
Sbjct: 1027 DYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDR 1086

Query: 496  EHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVD 555
               VC+L+++IYGLKQA R WY++    + + GF  +I D  +++   G   I +++YVD
Sbjct: 1087 PDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVD 1146

Query: 556  DILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVL 615
            DIL+  ND  LL  T + LS+ F +K+  +  Y +GIE    R    L LSQ+ Y   +L
Sbjct: 1147 DILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIE--AKRVPQGLHLSQRRYTLDLL 1206

Query: 616  EKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDIS 675
             +  M        P+    K +L    K        +   Y  IVGSL Y    TRPD+S
Sbjct: 1207 ARTNMLTAKPVATPMATSPKLTLHSGTK------LPDPTEYRGIVGSLQYL-AFTRPDLS 1266

Query: 676  FAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGCVDT 735
            +AV  L +Y   P  DHW A K+VLRYL GT D+ +  K+ + L +  YSD+++AG  D 
Sbjct: 1267 YAVNRLSQYMHMPTDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDD 1326

Query: 736  RKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVACFEATVHGLWLRNFISGLGIADS 795
              ST GY+  L    ISW  K    +V S+ EAE+ +    +    W+ + ++ LGI   
Sbjct: 1327 YVSTNGYIVYLGHHPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGI--Q 1386

Query: 796  IAKPLRIYCDN-------------SSAKHMELKYFAIKEEVQKERVSVEHISTKLMIADP 847
            ++ P  IYCDN             S  KH+ L Y  I+ +VQ   + V H+ST   +AD 
Sbjct: 1387 LSHPPVIYCDNVGATYLCANPVFHSRMKHIALDYHFIRNQVQSGALRVVHVSTHDQLADT 1443

BLAST of Pay0003779 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 164.1 bits (414), Expect = 7.0e-39
Identity = 98/313 (31.31%), Postives = 160/313 (51.12%), Query Frame = 0

Query: 454 MDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITS 513
           MDV TAFLN  +DE +++ QP GF+ E     V +L   +YGLKQA   W    N+T+  
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 514 FGFKENIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEA 573
            GF  +  +  +Y + +    I + +YVDD+L+A     +  + K+ L+K + MKD+G+ 
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKV 120

Query: 574 SYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNEL 633
              +G+ I +  ++G + LS + YI K   + +++    +  P+           P    
Sbjct: 121 DKFLGLNIHQS-SNGDITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSP---- 180

Query: 634 ERNQMETIPYASIVGSLLYAQTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGT 693
             +  +  PY SIVG LL+     RPDIS+ V +L R+   P   H ++A++VLRYL  T
Sbjct: 181 --HLKDITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTT 240

Query: 694 KDYMLTYKRSDHLEVIGYSDSNFAGCVDTRKSTFGYLFLLAEGAISW---KIPPSLVVST 753
           +   L Y+    L +  Y D++     D   ST GY+ LLA   ++W   K+   + V +
Sbjct: 241 RSMCLKYRSGSQLALTVYCDASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPS 300

Query: 754 MEAEFVACFEATV 764
            EAE++   E  +
Sbjct: 301 TEAEYITASETVM 306

BLAST of Pay0003779 vs. ExPASy TrEMBL
Match: A0A5D3BWW5 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1856G00300 PE=4 SV=1)

HSP 1 Score: 1228.8 bits (3178), Expect = 0.0e+00
Identity = 669/906 (73.84%), Postives = 685/906 (75.61%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
           MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTI
Sbjct: 85  MFIRMTVANNIKFTIKNTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTI 144

Query: 61  HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
           HEHILEMTNLAARLKTMGMEVN+NFLVTFILNSLPSEYGPFHMNYNTLKDKWN       
Sbjct: 145 HEHILEMTNLAARLKTMGMEVNENFLVTFILNSLPSEYGPFHMNYNTLKDKWNVHELQSM 204

Query: 121 -------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCR 180
                                          NGKGNHGQLKVKQSSAPIHKKG+IKDKCR
Sbjct: 205 LIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGEIKDKCR 264

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGF 240
           FCNKPGHYQKDCLKRKAWFENK +                                    
Sbjct: 265 FCNKPGHYQKDCLKRKAWFENKVERQ---------------------------------- 324

Query: 241 LTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ---- 300
                         +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Sbjct: 325 --------------LDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMP 384

Query: 301 -YTMPGTPQQNGVAERR--------------------------NRTL------------- 360
            YTMPGTPQQN VAER+                          +RT              
Sbjct: 385 GYTMPGTPQQNDVAERKPSLRHLYVWGCQAEARIYNPHEKKQDSRTTSGFFIGYSEKSKG 444

Query: 361 ----------------------------------MNMVRSMVEIPSSITSSQVVVPVVVD 420
                                             + +    VEIPSSITSSQ+VVPVVVD
Sbjct: 445 YRFYCPNHSTRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVD 504

Query: 421 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 480
           SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL
Sbjct: 505 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 564

Query: 481 SIDNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKR 540
           SIDNDPVSFSQ          LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKR
Sbjct: 565 SIDNDPVSFSQAIKGDNSTKWLDAMKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKR 624

Query: 541 DSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 600
           DSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Sbjct: 625 DSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 684

Query: 601 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 660
           AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE
Sbjct: 685 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 744

Query: 661 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIG 720
           NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDM EASYVIG
Sbjct: 745 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIG 804

Query: 721 IEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQM 763
           IEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQM
Sbjct: 805 IEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQM 864

BLAST of Pay0003779 vs. ExPASy TrEMBL
Match: A0A5A7UG95 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43055G00040 PE=4 SV=1)

HSP 1 Score: 1227.6 bits (3175), Expect = 0.0e+00
Identity = 669/906 (73.84%), Postives = 684/906 (75.50%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
           MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTI
Sbjct: 85  MFIRMTVANNIKFTIKNTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTI 144

Query: 61  HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
           HEHILEMTNLAARLKTMGMEVN+NFLV FILNSLPSEYGPFHMNYNTLKDKWN       
Sbjct: 145 HEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSM 204

Query: 121 -------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCR 180
                                          NGKGNHGQLKVKQSSAPIHKKGQIKDKCR
Sbjct: 205 LIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCR 264

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGF 240
           FCNKPGHYQKDCLKRKAWFENK +                                    
Sbjct: 265 FCNKPGHYQKDCLKRKAWFENKVERQ---------------------------------- 324

Query: 241 LTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ---- 300
                         +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Sbjct: 325 --------------LDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMP 384

Query: 301 -YTMPGTPQQNGVAERR--------------------------NRTL------------- 360
            YTMPGTPQQN VAER+                          +RT              
Sbjct: 385 GYTMPGTPQQNDVAERKPSLRHLYVWGCQAEARIYNPHEKKQDSRTTSGFFIGYSEKSKG 444

Query: 361 ----------------------------------MNMVRSMVEIPSSITSSQVVVPVVVD 420
                                             + +    VEIPSSITSSQ+VVPVVVD
Sbjct: 445 YRFYCPNHSTRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVD 504

Query: 421 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 480
           SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL
Sbjct: 505 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 564

Query: 481 SIDNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKR 540
           SIDNDPVSFSQ          LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKR
Sbjct: 565 SIDNDPVSFSQAIKGNNSTKWLDAMKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKR 624

Query: 541 DSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 600
           DSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Sbjct: 625 DSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 684

Query: 601 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 660
           AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE
Sbjct: 685 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 744

Query: 661 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIG 720
           NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDM EASYVIG
Sbjct: 745 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIG 804

Query: 721 IEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQM 763
           IEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQM
Sbjct: 805 IEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQM 864

BLAST of Pay0003779 vs. ExPASy TrEMBL
Match: A0A445LQ30 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_004205 PE=4 SV=1)

HSP 1 Score: 1107.0 bits (2862), Expect = 0.0e+00
Identity = 641/1279 (50.12%), Postives = 738/1279 (57.70%), Query Frame = 0

Query: 1    MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
            MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+
Sbjct: 138  MFMRMTVADSIKTALPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 197

Query: 61   HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
            HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEYGPF M+YNT+KDKWN       
Sbjct: 198  HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYGPFQMSYNTMKDKWNVHELHSM 257

Query: 121  --------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDK 180
                      +G+H                         G LK+K     I KK    + 
Sbjct: 258  LVQEETRLKNQGSHSIHYVSHRGNQGAGKKFVKKHDKGKGPLKIKDGPVQIQKKASKNNN 317

Query: 181  CRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQ 240
            C FC K GH+QKDC KRK+WFE KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTMQ
Sbjct: 318  CHFCGKSGHFQKDCPKRKSWFEKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTMQ 377

Query: 241  GFLTTRTTNPNERFIFMGNR---------------------------------------- 300
            GFLT +T +PNE+F+FMGNR                                        
Sbjct: 378  GFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLS 437

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 438  KLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSL 497

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 498  VNERSAFLWHKRLGHISGERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHTKKGATRS 557

Query: 421  ------------------------------------------------------------ 480
                                                                        
Sbjct: 558  TQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVE 617

Query: 481  ------VKILRSDRGGEYY------GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERR 540
                  VKI+RSDRGGEYY      G+ P PFAK L+  GICAQYTMPGTPQQNGV+ERR
Sbjct: 618  RQLDRKVKIIRSDRGGEYYRRYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSERR 677

Query: 541  NRTLMNMVRSM------------------------------------------------- 600
            N+TLM+MVRSM                                                 
Sbjct: 678  NKTLMDMVRSMLINSTLPVSLWMYALKTAMYLLNRVPSKAVPKTPFELWTNRTPSMRHLH 737

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 738  VWGCQAEIRIYNPQERKLDARTISGYFIGYPEKSKGYMFYCPNHSTRIVETGNARFIENG 797

Query: 661  -----------------VEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHND--IVTN 720
                             V++P +  SS  V+   V + N+ +E Q      HND  ++ N
Sbjct: 798  EISGSTVPREVEIKEVRVQVPLAFASSSKVITTSVTATNSNEEVQ------HNDEPMIHN 857

Query: 721  EPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ---------- 780
            EP+ E PQE+ LR+S R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQ          
Sbjct: 858  EPIMEEPQEVALRKSQRERRPAISNDYVVYLHETETNLSINDNDPVSFSQAISCDNSEKW 917

Query: 781  LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY--- 840
            L+AMKEE+ SM  N+VWDLVELPK  KRVG KWVFKTKRDS+GN+ERYKARLVAKG+   
Sbjct: 918  LNAMKEEIDSMEHNDVWDLVELPKGCKRVGYKWVFKTKRDSHGNLERYKARLVAKGFTQK 977

Query: 841  -------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV 844
                    FSPVS+KDS RIIMALVAHYDLELHQMDVKTAFLNG+L+E+V+MDQP GF V
Sbjct: 978  DGIDYKETFSPVSRKDSFRIIMALVAHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSV 1037

BLAST of Pay0003779 vs. ExPASy TrEMBL
Match: A0A445GJ88 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3848 GN=D0Y65_043849 PE=4 SV=1)

HSP 1 Score: 1045.0 bits (2701), Expect = 1.7e-301
Identity = 567/918 (61.76%), Postives = 659/918 (71.79%), Query Frame = 0

Query: 1    MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
            MFMRMTVA++IK+T+  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+
Sbjct: 326  MFMRMTVADSIKTTLPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 385

Query: 61   HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
            HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEY PF M+YNT+KDKWN       
Sbjct: 386  HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYDPFQMSYNTMKDKWNVHELHSM 445

Query: 121  --------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDK 180
                      +G+H                         G LK+K     I KK    + 
Sbjct: 446  LVQEETRLKNQGSHSIHYVSHRGNQGAGKKFVKKHDKGKGPLKIKDGPVEIQKKASKNNN 505

Query: 181  CRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQ 240
            C FC K GH+QKDC KRK+WFE KG+ NAL                              
Sbjct: 506  CHFCGKSGHFQKDCPKRKSWFEKKGELNAL------------------------------ 565

Query: 241  GFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQ 300
            GFLT +T +PNE+F+FMGNRVK      G        G     LE+  +           
Sbjct: 566  GFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYV----------- 625

Query: 301  QNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTN 360
                    +R L+++  S ++I     +        V + N+ +E Q N +     ++ N
Sbjct: 626  -----PSLSRNLVSL--SKLDITGYSFN-------FVTATNSNEEVQHNNE----PMIHN 685

Query: 361  EPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ---------- 420
            EP+ E PQE+ LR+S R RR AIS+DY+VYLHE E +LSI DNDPVSFSQ          
Sbjct: 686  EPIVEEPQEVALRKSQRERRPAISNDYVVYLHEIETNLSINDNDPVSFSQAVSCDNSEKW 745

Query: 421  LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY--- 480
            L+AMKEE+ SM  N VWDLVELPK  KRVGCKWVFKTKRDS+GN+ERYKARLVAKG+   
Sbjct: 746  LNAMKEEIDSMEHNGVWDLVELPKGCKRVGCKWVFKTKRDSHGNLERYKARLVAKGFTQK 805

Query: 481  -------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV 540
                    FSPVS+KDS RIIMALV HYDLELHQMDVKTAFLNG+L+E+V+MDQP GF V
Sbjct: 806  DGIDYKETFSPVSRKDSFRIIMALVTHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSV 865

Query: 541  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL 600
            EGKEHMVCKLK+SIYGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVL
Sbjct: 866  EGKEHMVCKLKKSIYGLKQASRQWYLKFNDTIVSFGFKENTVDRCVYLKVSGSKVMFLVL 925

Query: 601  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYIN 660
            YVDDIL+ATND GLL +TK+FLS NFEMKDMGEA+YVIGIEIFR+R+ GLLGLSQK YIN
Sbjct: 926  YVDDILIATNDLGLLHETKKFLSSNFEMKDMGEANYVIGIEIFRNRSQGLLGLSQKTYIN 985

Query: 661  KVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRP 720
            KVLE+F+M+KCS+S VPIQK DKFSL QCPKN+LER QME I YAS+VGS++YAQTCTRP
Sbjct: 986  KVLERFRMEKCSASPVPIQKRDKFSLAQCPKNDLERKQMEEISYASVVGSIMYAQTCTRP 1045

Query: 721  DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGC 780
            DISFA GMLGRYQSNPGM+HWKAAKKVLRYLQGTKD+MLTYKRSDHLEVIGYSDS+FAGC
Sbjct: 1046 DISFATGMLGRYQSNPGMEHWKAAKKVLRYLQGTKDHMLTYKRSDHLEVIGYSDSDFAGC 1105

Query: 781  VDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGI 840
            VDTRKST G++FLLA GAISWK     VV  STMEAEFVACFEAT+   WLRNFISGLGI
Sbjct: 1106 VDTRKSTLGFVFLLAGGAISWKSAKQSVVAASTMEAEFVACFEATIQANWLRNFISGLGI 1165

Query: 841  ADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMI 843
             DSIA+PL++YCDNS+             AKHMELKYF +KEEVQK+RVS+EHISTKLMI
Sbjct: 1166 VDSIARPLKMYCDNSAAVFFSKNDKYSTVAKHMELKYFVVKEEVQKQRVSIEHISTKLMI 1182

BLAST of Pay0003779 vs. ExPASy TrEMBL
Match: A0A438D994 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2481 PE=4 SV=1)

HSP 1 Score: 1027.3 bits (2655), Expect = 3.6e-296
Identity = 566/1094 (51.74%), Postives = 688/1094 (62.89%), Query Frame = 0

Query: 1    MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
            MFMRMT+ANNIK+++  TE A EF+KSVE+  + + ADKSLAGTLM+ LT +K+DG + I
Sbjct: 61   MFMRMTIANNIKTSLPQTEFASEFLKSVEE--RFKRADKSLAGTLMAELTTMKYDGQKGI 120

Query: 61   HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN----NGK 120
             +HIL MT  AA+LK +GM ++++FLV F+LNSLPS++ PF ++YNT  D+WN      K
Sbjct: 121  QQHILNMTEKAAKLKALGMGMDESFLVQFVLNSLPSQFAPFKIHYNTNSDQWNLNELTSK 180

Query: 121  GNHGQLKVKQSS-----APIH----KKGQIKD------------------------KCRF 180
                +++++Q       A  H    KKG+ K                          C F
Sbjct: 181  CIQEEVRLRQEGHNLAFAVTHGVTKKKGKFKKGKNFPPKKSGPGEGSQSHDGKFTVSCYF 240

Query: 181  CNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFL 240
            C K GH +KDC+KRKAWFE +G + + VC+ESNL EVP NTWWIDSG T HV+N MQGFL
Sbjct: 241  CGKKGHVKKDCIKRKAWFEKRGINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGFL 300

Query: 241  TTRTTNPNERFIFMGNR------------------------------------------- 300
            TTR    +E+F++MGNR                                           
Sbjct: 301  TTRKPKESEKFLYMGNRLKVEVVAVDDLSRYGYVYLMHEKSQAIDIFEMFITEVERQLDK 360

Query: 301  -VKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLM 360
             +KI++SDRGGEYYG+       PGPFAKFLE HGI AQYTMPGTPQQNGVAERRNRTLM
Sbjct: 361  KIKIVKSDRGGEYYGRYDESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRNRTLM 420

Query: 361  NMVRSM------------------------------------------------------ 420
             MVRSM                                                      
Sbjct: 421  EMVRSMMSYSSVPISLWGEALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHIWGCP 480

Query: 421  ------------------------------------------------------------ 480
                                                                        
Sbjct: 481  AEARIYNPHEKKLDSRTVSGYFIGYPNKSKGYRFYCPNHSVRIVETGNARFLENGEISGS 540

Query: 481  ------------VEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGP 540
                        V+IP      +++VP  V  V + ++   +G  P  +I   E V E P
Sbjct: 541  NEPRKVDIEEIRVDIPPPFLPQEIIVPQPVQQVEDNEQNNRDGSLPLENIAI-ENVVEPP 600

Query: 541  QEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ----------LDAMKEEL 600
            Q   LRRS R RR AI+DDY+VYL ES++D+ I  DPVSFSQ          ++AM EEL
Sbjct: 601  QPAPLRRSQRERRPAITDDYVVYLQESDYDIGIRKDPVSFSQAMESDDSSKWMEAMNEEL 660

Query: 601  KSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------N 660
            KSM  N VWDL+ELP   K VGCKWVFKTKRD+ GNIER+KARLVAKG+           
Sbjct: 661  KSMAHNGVWDLIELPNNCKPVGCKWVFKTKRDAKGNIERFKARLVAKGFTQKEGIDYKDT 720

Query: 661  FSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVC 720
            FSPVSKKDSLRIIMALVAH+DLELHQMDVKTAFLNGN  +    +  +G   +  +H+VC
Sbjct: 721  FSPVSKKDSLRIIMALVAHFDLELHQMDVKTAFLNGNWMKISIWNNLKGSQRKEMKHLVC 780

Query: 721  KLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLA 780
            KLK+SIYGLKQASRQWY+KFN+TITSFGFKENIVD+CIYLK+SGSKFI L+LYVDDILLA
Sbjct: 781  KLKKSIYGLKQASRQWYIKFNNTITSFGFKENIVDQCIYLKVSGSKFIFLILYVDDILLA 840

Query: 781  TNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKM 840
            ++D GLL +TKE+LSKNF M DMGEA+YVIGIEIFRDR+ G+LGLSQK YI++VLE+F M
Sbjct: 841  SSDLGLLRETKEYLSKNFHMVDMGEANYVIGIEIFRDRSRGVLGLSQKGYIDRVLERFNM 900

Query: 841  DKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGM 847
              CSS + PI KGDK S MQCP+N +ER QM+ IPYAS VGSL+YAQTCTRPDISFA+GM
Sbjct: 901  QSCSSGIAPILKGDKLSKMQCPRNNIEREQMKKIPYASAVGSLMYAQTCTRPDISFAIGM 960

BLAST of Pay0003779 vs. NCBI nr
Match: TYK04201.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 1228.8 bits (3178), Expect = 0.0e+00
Identity = 669/906 (73.84%), Postives = 685/906 (75.61%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
           MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTI
Sbjct: 85  MFIRMTVANNIKFTIKNTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTI 144

Query: 61  HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
           HEHILEMTNLAARLKTMGMEVN+NFLVTFILNSLPSEYGPFHMNYNTLKDKWN       
Sbjct: 145 HEHILEMTNLAARLKTMGMEVNENFLVTFILNSLPSEYGPFHMNYNTLKDKWNVHELQSM 204

Query: 121 -------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCR 180
                                          NGKGNHGQLKVKQSSAPIHKKG+IKDKCR
Sbjct: 205 LIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGEIKDKCR 264

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGF 240
           FCNKPGHYQKDCLKRKAWFENK +                                    
Sbjct: 265 FCNKPGHYQKDCLKRKAWFENKVERQ---------------------------------- 324

Query: 241 LTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ---- 300
                         +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Sbjct: 325 --------------LDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMP 384

Query: 301 -YTMPGTPQQNGVAERR--------------------------NRTL------------- 360
            YTMPGTPQQN VAER+                          +RT              
Sbjct: 385 GYTMPGTPQQNDVAERKPSLRHLYVWGCQAEARIYNPHEKKQDSRTTSGFFIGYSEKSKG 444

Query: 361 ----------------------------------MNMVRSMVEIPSSITSSQVVVPVVVD 420
                                             + +    VEIPSSITSSQ+VVPVVVD
Sbjct: 445 YRFYCPNHSTRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVD 504

Query: 421 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 480
           SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL
Sbjct: 505 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 564

Query: 481 SIDNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKR 540
           SIDNDPVSFSQ          LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKR
Sbjct: 565 SIDNDPVSFSQAIKGDNSTKWLDAMKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKR 624

Query: 541 DSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 600
           DSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Sbjct: 625 DSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 684

Query: 601 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 660
           AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE
Sbjct: 685 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 744

Query: 661 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIG 720
           NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDM EASYVIG
Sbjct: 745 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIG 804

Query: 721 IEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQM 763
           IEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQM
Sbjct: 805 IEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQM 864

BLAST of Pay0003779 vs. NCBI nr
Match: KAA0052755.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 1227.6 bits (3175), Expect = 0.0e+00
Identity = 669/906 (73.84%), Postives = 684/906 (75.50%), Query Frame = 0

Query: 1   MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
           MF+RMTVANNIK TIKNTEDAKEFMKSV+KC QSESADKSLAGTLMSTLTNIKFDGSRTI
Sbjct: 85  MFIRMTVANNIKFTIKNTEDAKEFMKSVKKCFQSESADKSLAGTLMSTLTNIKFDGSRTI 144

Query: 61  HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
           HEHILEMTNLAARLKTMGMEVN+NFLV FILNSLPSEYGPFHMNYNTLKDKWN       
Sbjct: 145 HEHILEMTNLAARLKTMGMEVNENFLVMFILNSLPSEYGPFHMNYNTLKDKWNVHELQSM 204

Query: 121 -------------------------------NGKGNHGQLKVKQSSAPIHKKGQIKDKCR 180
                                          NGKGNHGQLKVKQSSAPIHKKGQIKDKCR
Sbjct: 205 LIQEEARLKKPIIHSVNLMGHKGAGKKPGKKNGKGNHGQLKVKQSSAPIHKKGQIKDKCR 264

Query: 181 FCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGF 240
           FCNKPGHYQKDCLKRKAWFENK +                                    
Sbjct: 265 FCNKPGHYQKDCLKRKAWFENKVERQ---------------------------------- 324

Query: 241 LTTRTTNPNERFIFMGNRVKILRSDRGGEYYGK------CPGPFAKFLESHGICAQ---- 300
                         +   VKILRSDRGGEYYGK      CPGPFAKFLESHGICAQ    
Sbjct: 325 --------------LDRNVKILRSDRGGEYYGKYDENGQCPGPFAKFLESHGICAQYTMP 384

Query: 301 -YTMPGTPQQNGVAERR--------------------------NRTL------------- 360
            YTMPGTPQQN VAER+                          +RT              
Sbjct: 385 GYTMPGTPQQNDVAERKPSLRHLYVWGCQAEARIYNPHEKKQDSRTTSGFFIGYSEKSKG 444

Query: 361 ----------------------------------MNMVRSMVEIPSSITSSQVVVPVVVD 420
                                             + +    VEIPSSITSSQ+VVPVVVD
Sbjct: 445 YRFYCPNHSTRIVETGNVRFIENDIINGSLEPRKVEIQEVRVEIPSSITSSQIVVPVVVD 504

Query: 421 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 480
           SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL
Sbjct: 505 SVNNPQEQQINGQTPHNDIVTNEPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDL 564

Query: 481 SIDNDPVSFSQ----------LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKR 540
           SIDNDPVSFSQ          LDAMKEELKSMNDNEVWDLVELPK+SKRVGCKWVFKTKR
Sbjct: 565 SIDNDPVSFSQAIKGNNSTKWLDAMKEELKSMNDNEVWDLVELPKKSKRVGCKWVFKTKR 624

Query: 541 DSNGNIERYKARLVAKGY----------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 600
           DSNGNIER KARLVAKGY           FSPVSKKDSLRIIMALVAHYDLELHQMDVKT
Sbjct: 625 DSNGNIERCKARLVAKGYTQKDGIDYKETFSPVSKKDSLRIIMALVAHYDLELHQMDVKT 684

Query: 601 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 660
           AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE
Sbjct: 685 AFLNGNLDEEVFMDQPEGFMVEGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKE 744

Query: 661 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIG 720
           NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDM EASYVIG
Sbjct: 745 NIVDRCIYLKISGSKFIILVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMSEASYVIG 804

Query: 721 IEIFRDRTHGLLGLSQKAYINKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQM 763
           IEIFRDRTHGLLGLSQ AYINKVLEKFKM+KCSSSVVPIQKGDKFSLMQCPKNELERNQM
Sbjct: 805 IEIFRDRTHGLLGLSQNAYINKVLEKFKMNKCSSSVVPIQKGDKFSLMQCPKNELERNQM 864

BLAST of Pay0003779 vs. NCBI nr
Match: RZC25410.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 1107.0 bits (2862), Expect = 0.0e+00
Identity = 641/1279 (50.12%), Postives = 738/1279 (57.70%), Query Frame = 0

Query: 1    MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
            MFMRMTVA++IK+ +  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+
Sbjct: 138  MFMRMTVADSIKTALPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 197

Query: 61   HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
            HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEYGPF M+YNT+KDKWN       
Sbjct: 198  HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYGPFQMSYNTMKDKWNVHELHSM 257

Query: 121  --------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDK 180
                      +G+H                         G LK+K     I KK    + 
Sbjct: 258  LVQEETRLKNQGSHSIHYVSHRGNQGAGKKFVKKHDKGKGPLKIKDGPVQIQKKASKNNN 317

Query: 181  CRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQ 240
            C FC K GH+QKDC KRK+WFE KG+ NALVCFESNLTEVP+NTWWIDSGCT HVSNTMQ
Sbjct: 318  CHFCGKSGHFQKDCPKRKSWFEKKGELNALVCFESNLTEVPHNTWWIDSGCTTHVSNTMQ 377

Query: 241  GFLTTRTTNPNERFIFMGNR---------------------------------------- 300
            GFLT +T +PNE+F+FMGNR                                        
Sbjct: 378  GFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYVPSLSRNLVSLS 437

Query: 301  ------------------------------------------------------------ 360
                                                                        
Sbjct: 438  KLDITGYSFNFGNGCFSLFKYNHLIGTGVLCDGLYKLKLDGLYVETVLTLHHNVGTKRSL 497

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 498  VNERSAFLWHKRLGHISGERIERLIKNEILPDLDFTDLNICVDCIKGKQTKHTKKGATRS 557

Query: 421  ------------------------------------------------------------ 480
                                                                        
Sbjct: 558  TQLLEIVHTDICGPFDVSSFGRERYFITFIDDYSRYGYVYLLHEKSQAVNALEIYLNEVE 617

Query: 481  ------VKILRSDRGGEYY------GKCPGPFAKFLESHGICAQYTMPGTPQQNGVAERR 540
                  VKI+RSDRGGEYY      G+ P PFAK L+  GICAQYTMPGTPQQNGV+ERR
Sbjct: 618  RQLDRKVKIIRSDRGGEYYRRYDETGQHPSPFAKLLQKRGICAQYTMPGTPQQNGVSERR 677

Query: 541  NRTLMNMVRSM------------------------------------------------- 600
            N+TLM+MVRSM                                                 
Sbjct: 678  NKTLMDMVRSMLINSTLPVSLWMYALKTAMYLLNRVPSKAVPKTPFELWTNRTPSMRHLH 737

Query: 601  ------------------------------------------------------------ 660
                                                                        
Sbjct: 738  VWGCQAEIRIYNPQERKLDARTISGYFIGYPEKSKGYMFYCPNHSTRIVETGNARFIENG 797

Query: 661  -----------------VEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHND--IVTN 720
                             V++P +  SS  V+   V + N+ +E Q      HND  ++ N
Sbjct: 798  EISGSTVPREVEIKEVRVQVPLAFASSSKVITTSVTATNSNEEVQ------HNDEPMIHN 857

Query: 721  EPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ---------- 780
            EP+ E PQE+ LR+S R RR AIS+DY+VYLHE+E +LSI DNDPVSFSQ          
Sbjct: 858  EPIMEEPQEVALRKSQRERRPAISNDYVVYLHETETNLSINDNDPVSFSQAISCDNSEKW 917

Query: 781  LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY--- 840
            L+AMKEE+ SM  N+VWDLVELPK  KRVG KWVFKTKRDS+GN+ERYKARLVAKG+   
Sbjct: 918  LNAMKEEIDSMEHNDVWDLVELPKGCKRVGYKWVFKTKRDSHGNLERYKARLVAKGFTQK 977

Query: 841  -------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV 844
                    FSPVS+KDS RIIMALVAHYDLELHQMDVKTAFLNG+L+E+V+MDQP GF V
Sbjct: 978  DGIDYKETFSPVSRKDSFRIIMALVAHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSV 1037

BLAST of Pay0003779 vs. NCBI nr
Match: RZB61294.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja])

HSP 1 Score: 1045.0 bits (2701), Expect = 3.4e-301
Identity = 567/918 (61.76%), Postives = 659/918 (71.79%), Query Frame = 0

Query: 1    MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
            MFMRMTVA++IK+T+  T+ AKEFM  V +  +S++ADKSLAGTLMSTLT +KFDGSRT+
Sbjct: 326  MFMRMTVADSIKTTLPKTDSAKEFMGLVGE--RSQTADKSLAGTLMSTLTTMKFDGSRTM 385

Query: 61   HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN------- 120
            HEH++EMTN+AARLKT+GM VN+NFLV FILNSLPSEY PF M+YNT+KDKWN       
Sbjct: 386  HEHVIEMTNIAARLKTLGMAVNENFLVQFILNSLPSEYDPFQMSYNTMKDKWNVHELHSM 445

Query: 121  --------NGKGNH-------------------------GQLKVKQSSAPIHKKGQIKDK 180
                      +G+H                         G LK+K     I KK    + 
Sbjct: 446  LVQEETRLKNQGSHSIHYVSHRGNQGAGKKFVKKHDKGKGPLKIKDGPVEIQKKASKNNN 505

Query: 181  CRFCNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQ 240
            C FC K GH+QKDC KRK+WFE KG+ NAL                              
Sbjct: 506  CHFCGKSGHFQKDCPKRKSWFEKKGELNAL------------------------------ 565

Query: 241  GFLTTRTTNPNERFIFMGNRVKILRSDRGGEYYGKCPGPFAKFLESHGICAQYTMPGTPQ 300
            GFLT +T +PNE+F+FMGNRVK      G        G     LE+  +           
Sbjct: 566  GFLTIQTISPNEKFVFMGNRVKAPVEAVGTYRLKLDTGHHLDLLETLYV----------- 625

Query: 301  QNGVAERRNRTLMNMVRSMVEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTN 360
                    +R L+++  S ++I     +        V + N+ +E Q N +     ++ N
Sbjct: 626  -----PSLSRNLVSL--SKLDITGYSFN-------FVTATNSNEEVQHNNE----PMIHN 685

Query: 361  EPVTEGPQEIELRRSVRSRRSAISDDYLVYLHESEFDLSI-DNDPVSFSQ---------- 420
            EP+ E PQE+ LR+S R RR AIS+DY+VYLHE E +LSI DNDPVSFSQ          
Sbjct: 686  EPIVEEPQEVALRKSQRERRPAISNDYVVYLHEIETNLSINDNDPVSFSQAVSCDNSEKW 745

Query: 421  LDAMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY--- 480
            L+AMKEE+ SM  N VWDLVELPK  KRVGCKWVFKTKRDS+GN+ERYKARLVAKG+   
Sbjct: 746  LNAMKEEIDSMEHNGVWDLVELPKGCKRVGCKWVFKTKRDSHGNLERYKARLVAKGFTQK 805

Query: 481  -------NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMV 540
                    FSPVS+KDS RIIMALV HYDLELHQMDVKTAFLNG+L+E+V+MDQP GF V
Sbjct: 806  DGIDYKETFSPVSRKDSFRIIMALVTHYDLELHQMDVKTAFLNGDLEEDVYMDQPMGFSV 865

Query: 541  EGKEHMVCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVL 600
            EGKEHMVCKLK+SIYGLKQASRQWYLKFNDTI SFGFKEN VDRC+YLK+SGSK + LVL
Sbjct: 866  EGKEHMVCKLKKSIYGLKQASRQWYLKFNDTIVSFGFKENTVDRCVYLKVSGSKVMFLVL 925

Query: 601  YVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYIN 660
            YVDDIL+ATND GLL +TK+FLS NFEMKDMGEA+YVIGIEIFR+R+ GLLGLSQK YIN
Sbjct: 926  YVDDILIATNDLGLLHETKKFLSSNFEMKDMGEANYVIGIEIFRNRSQGLLGLSQKTYIN 985

Query: 661  KVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRP 720
            KVLE+F+M+KCS+S VPIQK DKFSL QCPKN+LER QME I YAS+VGS++YAQTCTRP
Sbjct: 986  KVLERFRMEKCSASPVPIQKRDKFSLAQCPKNDLERKQMEEISYASVVGSIMYAQTCTRP 1045

Query: 721  DISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFAGC 780
            DISFA GMLGRYQSNPGM+HWKAAKKVLRYLQGTKD+MLTYKRSDHLEVIGYSDS+FAGC
Sbjct: 1046 DISFATGMLGRYQSNPGMEHWKAAKKVLRYLQGTKDHMLTYKRSDHLEVIGYSDSDFAGC 1105

Query: 781  VDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGLGI 840
            VDTRKST G++FLLA GAISWK     VV  STMEAEFVACFEAT+   WLRNFISGLGI
Sbjct: 1106 VDTRKSTLGFVFLLAGGAISWKSAKQSVVAASTMEAEFVACFEATIQANWLRNFISGLGI 1165

Query: 841  ADSIAKPLRIYCDNSS-------------AKHMELKYFAIKEEVQKERVSVEHISTKLMI 843
             DSIA+PL++YCDNS+             AKHMELKYF +KEEVQK+RVS+EHISTKLMI
Sbjct: 1166 VDSIARPLKMYCDNSAAVFFSKNDKYSTVAKHMELKYFVVKEEVQKQRVSIEHISTKLMI 1182

BLAST of Pay0003779 vs. NCBI nr
Match: RVW32004.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 1027.3 bits (2655), Expect = 7.4e-296
Identity = 566/1094 (51.74%), Postives = 688/1094 (62.89%), Query Frame = 0

Query: 1    MFMRMTVANNIKSTIKNTEDAKEFMKSVEKCSQSESADKSLAGTLMSTLTNIKFDGSRTI 60
            MFMRMT+ANNIK+++  TE A EF+KSVE+  + + ADKSLAGTLM+ LT +K+DG + I
Sbjct: 61   MFMRMTIANNIKTSLPQTEFASEFLKSVEE--RFKRADKSLAGTLMAELTTMKYDGQKGI 120

Query: 61   HEHILEMTNLAARLKTMGMEVNKNFLVTFILNSLPSEYGPFHMNYNTLKDKWN----NGK 120
             +HIL MT  AA+LK +GM ++++FLV F+LNSLPS++ PF ++YNT  D+WN      K
Sbjct: 121  QQHILNMTEKAAKLKALGMGMDESFLVQFVLNSLPSQFAPFKIHYNTNSDQWNLNELTSK 180

Query: 121  GNHGQLKVKQSS-----APIH----KKGQIKD------------------------KCRF 180
                +++++Q       A  H    KKG+ K                          C F
Sbjct: 181  CIQEEVRLRQEGHNLAFAVTHGVTKKKGKFKKGKNFPPKKSGPGEGSQSHDGKFTVSCYF 240

Query: 181  CNKPGHYQKDCLKRKAWFENKGKHNALVCFESNLTEVPYNTWWIDSGCTIHVSNTMQGFL 240
            C K GH +KDC+KRKAWFE +G + + VC+ESNL EVP NTWWIDSG T HV+N MQGFL
Sbjct: 241  CGKKGHVKKDCIKRKAWFEKRGINLSFVCYESNLAEVPSNTWWIDSGATTHVTNLMQGFL 300

Query: 241  TTRTTNPNERFIFMGNR------------------------------------------- 300
            TTR    +E+F++MGNR                                           
Sbjct: 301  TTRKPKESEKFLYMGNRLKVEVVAVDDLSRYGYVYLMHEKSQAIDIFEMFITEVERQLDK 360

Query: 301  -VKILRSDRGGEYYGKC------PGPFAKFLESHGICAQYTMPGTPQQNGVAERRNRTLM 360
             +KI++SDRGGEYYG+       PGPFAKFLE HGI AQYTMPGTPQQNGVAERRNRTLM
Sbjct: 361  KIKIVKSDRGGEYYGRYDESGQNPGPFAKFLEKHGIRAQYTMPGTPQQNGVAERRNRTLM 420

Query: 361  NMVRSM------------------------------------------------------ 420
             MVRSM                                                      
Sbjct: 421  EMVRSMMSYSSVPISLWGEALKTAMYILNRVPSKAVPKTPFELWTGRKPSLRHIHIWGCP 480

Query: 421  ------------------------------------------------------------ 480
                                                                        
Sbjct: 481  AEARIYNPHEKKLDSRTVSGYFIGYPNKSKGYRFYCPNHSVRIVETGNARFLENGEISGS 540

Query: 481  ------------VEIPSSITSSQVVVPVVVDSVNNPQEQQINGQTPHNDIVTNEPVTEGP 540
                        V+IP      +++VP  V  V + ++   +G  P  +I   E V E P
Sbjct: 541  NEPRKVDIEEIRVDIPPPFLPQEIIVPQPVQQVEDNEQNNRDGSLPLENIAI-ENVVEPP 600

Query: 541  QEIELRRSVRSRRSAISDDYLVYLHESEFDLSIDNDPVSFSQ----------LDAMKEEL 600
            Q   LRRS R RR AI+DDY+VYL ES++D+ I  DPVSFSQ          ++AM EEL
Sbjct: 601  QPAPLRRSQRERRPAITDDYVVYLQESDYDIGIRKDPVSFSQAMESDDSSKWMEAMNEEL 660

Query: 601  KSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----------N 660
            KSM  N VWDL+ELP   K VGCKWVFKTKRD+ GNIER+KARLVAKG+           
Sbjct: 661  KSMAHNGVWDLIELPNNCKPVGCKWVFKTKRDAKGNIERFKARLVAKGFTQKEGIDYKDT 720

Query: 661  FSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEGKEHMVC 720
            FSPVSKKDSLRIIMALVAH+DLELHQMDVKTAFLNGN  +    +  +G   +  +H+VC
Sbjct: 721  FSPVSKKDSLRIIMALVAHFDLELHQMDVKTAFLNGNWMKISIWNNLKGSQRKEMKHLVC 780

Query: 721  KLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIILVLYVDDILLA 780
            KLK+SIYGLKQASRQWY+KFN+TITSFGFKENIVD+CIYLK+SGSKFI L+LYVDDILLA
Sbjct: 781  KLKKSIYGLKQASRQWYIKFNNTITSFGFKENIVDQCIYLKVSGSKFIFLILYVDDILLA 840

Query: 781  TNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAYINKVLEKFKM 840
            ++D GLL +TKE+LSKNF M DMGEA+YVIGIEIFRDR+ G+LGLSQK YI++VLE+F M
Sbjct: 841  SSDLGLLRETKEYLSKNFHMVDMGEANYVIGIEIFRDRSRGVLGLSQKGYIDRVLERFNM 900

Query: 841  DKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCTRPDISFAVGM 847
              CSS + PI KGDK S MQCP+N +ER QM+ IPYAS VGSL+YAQTCTRPDISFA+GM
Sbjct: 901  QSCSSGIAPILKGDKLSKMQCPRNNIEREQMKKIPYASAVGSLMYAQTCTRPDISFAIGM 960

BLAST of Pay0003779 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 306.6 bits (784), Expect = 6.2e-83
Identity = 169/442 (38.24%), Postives = 256/442 (57.92%), Query Frame = 0

Query: 372 AMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGY----- 431
           AM +E+ +M     W++  LP   K +GCKWV+K K +S+G IERYKARLVAKGY     
Sbjct: 101 AMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQEG 160

Query: 432 -----NFSPVSKKDSLRIIMALVAHYDLELHQMDVKTAFLNGNLDEEVFMDQPEGFMVEG 491
                 FSPV K  S+++I+A+ A Y+  LHQ+D+  AFLNG+LDEE++M  P G+    
Sbjct: 161 IDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQ 220

Query: 492 KEHM----VCKLKRSIYGLKQASRQWYLKFNDTITSFGFKENIVDRCIYLKISGSKFIIL 551
            + +    VC LK+SIYGLKQASRQW+LKF+ T+  FGF ++  D   +LKI+ + F+ +
Sbjct: 221 GDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCV 280

Query: 552 VLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTHGLLGLSQKAY 611
           ++YVDDI++ +N+   + + K  L   F+++D+G   Y +G+EI R      + + Q+ Y
Sbjct: 281 LVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAG--INICQRKY 340

Query: 612 INKVLEKFKMDKCSSSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYAQTCT 671
              +L++  +  C  S VP+     FS           + ++   Y  ++G L+Y Q  T
Sbjct: 341 ALDLLDETGLLGCKPSSVPMDPSVTFSA------HSGGDFVDAKAYRRLIGRLMYLQ-IT 400

Query: 672 RPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDSNFA 731
           R DISFAV  L ++   P + H +A  K+L Y++GT    L Y     +++  +SD++F 
Sbjct: 401 RLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQ 460

Query: 732 GCVDTRKSTFGYLFLLAEGAISWKIPPSLVV--STMEAEFVACFEATVHGLWLRNFISGL 791
            C DTR+ST GY   L    ISWK     VV  S+ EAE+ A   AT   +WL  F   L
Sbjct: 461 SCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFREL 520

Query: 792 GIADSIAKPLRIYCDNSSAKHM 798
            +   ++KP  ++CDN++A H+
Sbjct: 521 QL--PLSKPTLLFCDNTAAIHI 531

BLAST of Pay0003779 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 115.5 bits (288), Expect = 2.0e-25
Identity = 82/226 (36.28%), Postives = 119/226 (52.65%), Query Frame = 0

Query: 537 LVLYVDDILLATNDFGLLCQTKEFLSKNFEMKDMGEASYVIGIEIFRDRTH-GLLGLSQK 596
           L+LYVDDILL  +   LL      LS  F MKD+G   Y +GI+I   +TH   L LSQ 
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQI---KTHPSGLFLSQT 62

Query: 597 AYINKVLEKFKMDKCS--SSVVPIQKGDKFSLMQCPKNELERNQMETIPYASIVGSLLYA 656
            Y  ++L    M  C   S+ +P++     S  + P         +   + SIVG+L Y 
Sbjct: 63  KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP---------DPSDFRSIVGALQYL 122

Query: 657 QTCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSD 716
            T TRPDIS+AV ++ +    P +  +   K+VLRY++GT  + L   ++  L V  + D
Sbjct: 123 -TLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 182

Query: 717 SNFAGCVDTRKSTFGYLFLLAEGAISW--KIPPSLVVSTMEAEFVA 758
           S++AGC  TR+ST G+   L    ISW  K  P++  S+ E E+ A
Sbjct: 183 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRA 215

BLAST of Pay0003779 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 68.6 bits (166), Expect = 2.8e-11
Identity = 35/80 (43.75%), Postives = 51/80 (63.75%), Query Frame = 0

Query: 372 AMKEELKSMNDNEVWDLVELPKESKRVGCKWVFKTKRDSNGNIERYKARLVAKGYN---- 431
           AM+EEL +++ N+ W LV  P     +GCKWVFKTK  S+G ++R KARLVAKG++    
Sbjct: 43  AMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEG 102

Query: 432 ------FSPVSKKDSLRIIM 442
                 +SPV +  ++R I+
Sbjct: 103 IYFVETYSPVVRTATIRTIL 122

BLAST of Pay0003779 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 62.4 bits (150), Expect = 2.0e-09
Identity = 30/79 (37.97%), Postives = 47/79 (59.49%), Query Frame = 0

Query: 655 TCTRPDISFAVGMLGRYQSNPGMDHWKAAKKVLRYLQGTKDYMLTYKRSDHLEVIGYSDS 714
           T TRPD++FAV  L ++ S       +A  KVL Y++GT    L Y  +  L++  ++DS
Sbjct: 4   TITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAFADS 63

Query: 715 NFAGCVDTRKSTFGYLFLL 734
           ++A C DTR+S  G+  L+
Sbjct: 64  DWASCPDTRRSVTGFCSLV 82

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109783.5e-12336.78Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041464.1e-8737.83Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q94HW24.8e-8035.91Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT947.7e-7833.28Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P256007.0e-3931.31Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT... [more]
Match NameE-valueIdentityDescription
A0A5D3BWW50.0e+0073.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5A7UG950.0e+0073.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A445LQ300.0e+0050.12Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3... [more]
A0A445GJ881.7e-30161.76Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Glycine soja OX=3... [more]
A0A438D9943.6e-29651.74Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
Match NameE-valueIdentityDescription
TYK04201.10.0e+0073.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KAA0052755.10.0e+0073.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
RZC25410.10.0e+0050.12Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja][more]
RZB61294.13.4e-30161.76Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Glycine soja][more]
RVW32004.17.4e-29651.74Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
AT4G23160.16.2e-8338.24cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.12.0e-2536.28DNA/RNA polymerases superfamily protein [more]
ATMG00820.12.8e-1143.75Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00240.12.0e-0937.97Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Melon (Payzawat) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 6..106
e-value: 2.2E-14
score: 53.4
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 369..757
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 709..835
e-value: 4.83696E-47
score: 162.252
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 383..617
e-value: 5.8E-68
score: 229.1
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 197..323
e-value: 2.4E-18
score: 68.4
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 140..154
score: 9.125269
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 212..321
score: 17.073601
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 213..315
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 138..162
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 383..764

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Pay0003779.1Pay0003779.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0050896 response to stimulus
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0016779 nucleotidyltransferase activity
molecular_function GO:0008270 zinc ion binding