Pay0005361 (gene) Melon (Payzawat) v1

Overview
Name: Pay0005361
Type: gene
Organism: Cucumis melo L. var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: chr01: 4445149 .. 4447929 (-)
RNA-Seq Expression: Pay0005361
Synteny: Pay0005361
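
The Location field can be read as chromosome, start, end, and strand. The sketch below parses that string and computes the span of the feature, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page). The names `parse_location` and `span_length` are hypothetical helpers introduced here for illustration.

```python
import re


def parse_location(text):
    """Split a 'chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end, strand = parse_location("chr01: 4445149 .. 4447929 (-)")
print(chrom, strand, span_length(start, end))  # chr01 - 2781
```

Under that assumption the feature spans 2781 positions on the minus strand of chr01.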
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAGTTCAAACTCATCCCTACTCGGTGTTGAGAACACTAATGCATCTTCACCGATTAATCAAATATTTGGATCGGGTAAAAAAATATCTTTAGTGAAGCCCAACGATGATAGTTTTCTCTTATGGAAGTTCCAAATTCTTACAATATTAGAAGCCTATGATCTAGAAATTTTTCTTGAATCTGAATCAGAACCACCATCAAAATATCTCACATCCACTGGGAGTTCATCAACATCTGCTACAAGAACACCAAATCCGGAATATAAGGTATGGAAACGCCAAGATCGCCTTATCTCCTCATGGATTCTAGGGTCCATGAGTGAAGAAATACTGAATCAGATGCTTCATTGCAAATCTGCAAAAGAAATTCGGGGAACTATTTAAGGTATTTTCTCTTCCCGTTACTTGGCACAAGCTATGCAATTTGAAAAAAAACTTTACAATATAAAGAAAGGATCCATGCCATTAAAAGAATACTTTCTCAAAATATAGCAGTGTATTGATGCCTTAGCTTTAATTAACAAATCAGTTTCATCTGATGATCATATTCTATACATATTGGCTGGTTTAGGATCTGATTATCAATCCATGATATCTGTTATTTCCAAAAAGAACTTACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCACGAATGTCAAAATGAGAGCAAATTAATCAGTGAGACTGATATACCTTCTGTTAACATTGTCACCTAAACAACTGAAAAATGAGCATAATCTTACATAAGGAACAACCAACACAATCAAAGGGGTGGTCGTGGAAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGTCAAATCTGTACAAAGCATAGACATAATGTTGCTTCTTTCAATATACTCCAAGATCAAATTCATCAGTTACTCACCAAACTCACATAATACTTAATATACTAATATCAATAATCATCCACAGATGTCTGCTATGGTGGCTTCCCCTGACCTAAATATTGACAACAATTGGTATCCTGATTCGGGAGCTACAAACCATTTAACTCATAGCTTGAGCAACCTATCTACTGGATCTAAGTACGGGGAGGAATTCAAATATATGCAGCAAATGGGTTAGGTTTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTATATTACCATTCAAATCGTTTACACTAAATAACTTTTTCCATGTTCCATTCATTACCAAAAATTTAATCAGTGTTTCACAATTTGCTAAAGATAATCATGTTTTCTTTGAATTTCACCCCACTTTGTGTTATGTGAAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTCAATGATGGGCTCTACAAATTTACCATCCAACCATCACATAAAAGACTTCACCATTCTGAACTCAACACCAAGTCTGTTTTCAATACAGTTGTACCTAAATCTAATTCTCACTTACTTGATCTATCGCATAGAAGACTAGGTCATCCCCATTTACCTACTGTTAAAGATGTTTTGAATCACATTGACTATTCTTCTGGAACTATGAATAAAATGACTTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTTACTCATTATACACATCCTTTACAAATTATTAGTTGTGATTTATGGGGTCCTGCAATAAATGTATCTCATAATTGTTTTAGTTACTACATAAGTTTTGTTGATGCCTATAGTAGATACACCTGGATATATTTCTTATATTCCAAGCCTGATGCTTTTTTAACCTTTCAAAATTCAAAACCTGTGTTGAAAAGTCTCTTGGTCAATCAATTAAAAGTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTAAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACC
TCTATCATTCTGGGATGAAGCCTTTTCCACTAGTGTCTATCTCATAAATTGTTTGCCTACCTCAGTACTTGATAATATAAGCCCGTTAGAGAAGCTATCTGGTTGGAAACCTAACTTTCCTTCTTTTCGAGTCTTTGGCTGCAAATGTTATCCCTAACTTCGACCCTACCAATCACATAAACTATCTCTCCAATCCACACCATGTACTTTCCTAGGATATAGTACATCACATAAAGGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAAACATGTATTATTTGATGAAAATTCATTTCCATATGCATCATTTTCATCTCATTCTAGCATGCTCAAATCTAAAAATGTCCTATCTCCACCATTTCACTCAATAATTCAATCATCCCTTATAAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATCATCTAAACCCTATTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTCGGGATGATGGTAACATTGGAGGTATTACTCAATCTCTAAGTCCTATGGAACCACAACATCAAATTGACTCTGGTATGAACACTCAACTTCAATCTACATCCGTTCATCTCATGATAACACGGAGTAAGCATGGTATTTTCAAACCAAAAACATTCTTAATTGATTATACTCAAACTGAACCTTGTAATGCCAAGGAAGCTTTTAAACATCCTCATTGGAAAAAGGCCATGGAAGAAAAGTTTGAAGCCTTACAGAAAAATGACACAAAATCCTAA

mRNA sequence

ATGAGTTCAAACTCATCCCTACTCGGTGTTGAGAACACTAATGCATCTTCACCGATTAATCAAATATTTGGATCGGGTAAAAAAATATCTTTAGTGAAGCCCAACGATGATAGTTTTCTCTTATGGAAGTTCCAAATTCTTACAATATTAGAAGCCTATGATCTAGAAATTTTTCTTGAATCTGAATCAGAACCACCATCAAAATATCTCACATCCACTGGGAGTTCATCAACATCTGCTACAAGAACACCAAATCCGGAATATAAGGTATGGAAACGCCAAGATCGCCTTATCTCCTCATGGATTCTAGGGTCCATGACTTTAATTAACAAATCAGTTTCATCTGATGATCATATTCTATACATATTGGCTGGTTTAGGATCTGATTATCAATCCATGATATCTGTTATTTCCAAAAAGAACTTACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCACGAATGAACAACCAACACAATCAAAGGGGTGGTCGTGGAAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGTCAAATCTGTACAAAGCATAGACATAATATGTCTGCTATGGTGGCTTCCCCTGACCTAAATATTGACAACAATTGGTATCCTGATTCGGGAGCTACAAACCATTTAACTCATATACGGGGAGGAATTCAAATATATGCAGCAAATGGGTTAGGTTTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTATATTACCATTCAAATCGTTTACACTAAATAACTTTTTCCATGTTCCATTCATTACCAAAAATTTAATCAGTGTTTCACAATTTGCTAAAGATAATCATGTTTTCTTTGAATTTCACCCCACTTTGTGTTATGTGAAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTCAATGATGGGCTCTACAAATTTACCATCCAACCATCACATAAAAGACTTCACCATTCTGAACTCAACACCAAGTCTGTTTTCAATACAGTTGTACCTAAATCTAATTCTCACTTACTTGATCTATCGCATAGAAGACTAGGTCATCCCCATTTACCTACTGTTAAAGATGTTTTGAATCACATTGACTATTCTTCTGGAACTATGAATAAAATGACTTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTTACTCATTATACACATCCTTTACAAATTATTAGTTGTGATTTATGGGGTCCTGCAATAAATGTATCTCATAATTGTTTTAGTTACTACATAAGTTTTGTTGATGCCTATAGTAGATACACCTGGATATATTTCTTATATTCCAAGCCTGATGCTTTTTTAACCTTTCAAAATTCAAAACCTGTTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTAAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTTTCCACTAGTGTCTATCTCATAAATTGTTTGCCTACCTCAGTACTTGATAATATAAGCCCGTTAGAGAAGCTATCTGGTTGGAAACCTAACTTTCCTTCTTTTCGAGTCTTTGGCTGCAAATGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAAACATGTATTATTTGATGAAAATTCATTTCCATATGCATCATTTTCATCTCATTCTAGCATGCTCAAATCTAAAAATGTCCTATCTCCACCATTTCACTCAATAATTCAATCATCCCTTATAAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATCATCTAAACCCTATTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTCGGGATGATGGTAACATTGGAGGTAT
TACTCAATCTCTAAGTCCTATGGAACCACAACATCAAATTGACTCTGGTATGAACACTCAACTTCAATCTACATCCGTTCATCTCATGATAACACGGAGTAAGCATGGTATTTTCAAACCAAAAACATTCTTAATTGATTATACTCAAACTGAACCTTGTAATGCCAAGGAAGCTTTTAAACATCCTCATTGGAAAAAGGCCATGGAAGAAAAGTTTGAAGCCTTACAGAAAAATGACACAAAATCCTAA

Coding sequence (CDS)

ATGAGTTCAAACTCATCCCTACTCGGTGTTGAGAACACTAATGCATCTTCACCGATTAATCAAATATTTGGATCGGGTAAAAAAATATCTTTAGTGAAGCCCAACGATGATAGTTTTCTCTTATGGAAGTTCCAAATTCTTACAATATTAGAAGCCTATGATCTAGAAATTTTTCTTGAATCTGAATCAGAACCACCATCAAAATATCTCACATCCACTGGGAGTTCATCAACATCTGCTACAAGAACACCAAATCCGGAATATAAGGTATGGAAACGCCAAGATCGCCTTATCTCCTCATGGATTCTAGGGTCCATGACTTTAATTAACAAATCAGTTTCATCTGATGATCATATTCTATACATATTGGCTGGTTTAGGATCTGATTATCAATCCATGATATCTGTTATTTCCAAAAAGAACTTACTCTCCTTCTGTACAAGAAGTTATGTCTTTATTACTTACTCACGAATGAACAACCAACACAATCAAAGGGGTGGTCGTGGAAATGGAAGATCAAACAGGGGAGGAAGAGGAAATCGTAATAAACCACAATGTCAAATCTGTACAAAGCATAGACATAATATGTCTGCTATGGTGGCTTCCCCTGACCTAAATATTGACAACAATTGGTATCCTGATTCGGGAGCTACAAACCATTTAACTCATATACGGGGAGGAATTCAAATATATGCAGCAAATGGGTTAGGTTTGCCAATCACTCATTATGGTTCCATGTCATTTAACTCCTCTATATTACCATTCAAATCGTTTACACTAAATAACTTTTTCCATGTTCCATTCATTACCAAAAATTTAATCAGTGTTTCACAATTTGCTAAAGATAATCATGTTTTCTTTGAATTTCACCCCACTTTGTGTTATGTGAAGGATCTGGATACTGGCCAAGTACTTCTTCAAGGACTACTCAATGATGGGCTCTACAAATTTACCATCCAACCATCACATAAAAGACTTCACCATTCTGAACTCAACACCAAGTCTGTTTTCAATACAGTTGTACCTAAATCTAATTCTCACTTACTTGATCTATCGCATAGAAGACTAGGTCATCCCCATTTACCTACTGTTAAAGATGTTTTGAATCACATTGACTATTCTTCTGGAACTATGAATAAAATGACTTTTTGTGAAGCATGTGCATTGGGCAAACATCATGCCCTTCCTTTCTCTCACTCCCTTACTCATTATACACATCCTTTACAAATTATTAGTTGTGATTTATGGGGTCCTGCAATAAATGTATCTCATAATTGTTTTAGTTACTACATAAGTTTTGTTGATGCCTATAGTAGATACACCTGGATATATTTCTTATATTCCAAGCCTGATGCTTTTTTAACCTTTCAAAATTCAAAACCTGTTCTTCAAACTGATGGTGGTACTGAATTTAAACCATTCAAACCTTTTCTTGATCAACATGGCATTAAACATAGGATAACATGTCCTTACACTTCAAAGCAGAATGACATAGTTGAGAGAAAACATAGGCATATCATGGAAATGGGTCTTACATTGCTATCTCAAGCCACTTTACCTCTATCATTCTGGGATGAAGCCTTTTCCACTAGTGTCTATCTCATAAATTGTTTGCCTACCTCAGTACTTGATAATATAAGCCCGTTAGAGAAGCTATCTGGTTGGAAACCTAACTTTCCTTCTTTTCGAGTCTTTGGCTGCAAATGGTACAAATGTCTAGCTTCAGATGGTCGCCTTTTCATTTCTAAACATGTATTATTTGATGAAAATTCATTTCCATATGCATCATTTTCATCTCATTCTAGCATGCTCAAATCTAAAAATGTCCTATCTCCACCATTTCACTCAATAATTCAATCATCCCTTATAAACCATAATGAGGATAGGCGACACACTGACACAGTTTCTGATAACACTGATCATCTAAACCCTATTATTGTGTATCCTTTAGAGACAGGTACTCAAGAGAGCTCTCGGGATGATGGTAACATTGGAGGTAT
TACTCAATCTCTAAGTCCTATGGAACCACAACATCAAATTGACTCTGGTATGAACACTCAACTTCAATCTACATCCGTTCATCTCATGATAACACGGAGTAAGCATGGTATTTTCAAACCAAAAACATTCTTAATTGATTATACTCAAACTGAACCTTGTAATGCCAAGGAAGCTTTTAAACATCCTCATTGGAAAAAGGCCATGGAAGAAAAGTTTGAAGCCTTACAGAAAAATGACACAAAATCCTAA

Protein sequence

MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLESESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMTLINKSVSSDDHILYILAGLGSDYQSMISVISKKNLLSFCTRSYVFITYSRMNNQHNQRGGRGNGRSNRGGRGNRNKPQCQICTKHRHNMSAMVASPDLNIDNNWYPDSGATNHLTHIRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEACALGKHHALPFSHSLTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAFLTFQNSKPVLQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGWKPNFPSFRVFGCKWYKCLASDGRLFISKHVLFDENSFPYASFSSHSSMLKSKNVLSPPFHSIIQSSLINHNEDRRHTDTVSDNTDHLNPIIVYPLETGTQESSRDDGNIGGITQSLSPMEPQHQIDSGMNTQLQSTSVHLMITRSKHGIFKPKTFLIDYTQTEPCNAKEAFKHPHWKKAMEEKFEALQKNDTKS
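
The protein sequence corresponds to a codon-by-codon reading of the coding sequence. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the first 33 nucleotides and the last 18 nucleotides of the CDS above can be translated and compared with the two ends of the protein. The names `CODON_TABLE` and `translate` are illustrative and do not come from the database.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGAGTTCAAACTCATCCCTACTCGGTGTTGAG"  # first 11 codons of the CDS
cds_end = "AATGACACAAAATCCTAA"                   # last 6 codons, ending in TAA
print(translate(cds_start))  # MSSNSSLLGVE
print(translate(cds_end))    # NDTKS
```

Both results agree with the listed protein, which begins MSSNSSLLGVE and ends NDTKS.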
Homology
BLAST of Pay0005361 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 279.3 bits (713), Expect = 1.3e-73
Identity = 229/761 (30.09%), Positives = 331/761 (43.50%), Query Frame = 0

Query: 30  SLVKPNDDSFLLWKFQILTILEAYDLEIFLESESEPPSKYLTSTGSSSTSATRTPNPEYK 89
           ++ K    ++L+W  Q+  + + Y+L  FL+  +  P        +  T A    NP+Y 
Sbjct: 22  NVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMP------PATIGTDAVPRVNPDYT 81

Query: 90  VWKRQDRLISSWILGS-------------------------------------------- 149
            W+RQD+LI S ILG+                                            
Sbjct: 82  RWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRFITRFD 141

Query: 150 -MTLINKSVSSDDHILYILAGLGSDYQSMISVISKKN------------------LLSFC 209
            + L+ K +  D+ +  +L  L  DY+ +I  I+ K+                  LL+  
Sbjct: 142 QLALLGKPMDHDEQVERVLENLPDDYKPVIDQIAAKDTPPSLTEIHERLINRESKLLALN 201

Query: 210 TRSYVFITYS----RMNNQHNQRGGRGNGR--------------SNRGGRGNRNKP---- 269
           +   V IT +    R  N +  +  RG+ R              S+ G R +  +P    
Sbjct: 202 SAEVVPITANVVTHRNTNTNRNQNNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYL 261

Query: 270 -QCQICTKHRHNMSAMVA--------------------SPDLNI-------DNNWYPDSG 329
            +CQIC+   H+                           P  N+        NNW  DSG
Sbjct: 262 GRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSG 321

Query: 330 ATNHLTH----------IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHV 389
           AT+H+T             GG  +  A+G  +PITH GS S  +S    +S  LN   +V
Sbjct: 322 ATHHITSDFNNLSFHQPYTGGDDVMIADGSTIPITHTGSASLPTS---SRSLDLNKVLYV 381

Query: 390 PFITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRL 449
           P I KNLISV +    N V  EF P    VKDL+TG  LLQG   D LY++ I  S    
Sbjct: 382 PNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTGVPLLQGKTKDELYEWPIASS---- 441

Query: 450 HHSELNTKSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVKDVL-NHIDYSSGTMNKMTFCE 509
                   S+F +   K+        H RLGHP L  +  V+ NH        +K+  C 
Sbjct: 442 -----QAVSMFASPCSKATH---SSWHSRLGHPSLAILNSVISNHSLPVLNPSHKLLSCS 501

Query: 510 ACALGKHHALPFSHSLTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYF 569
            C + K H +PFS+S    + PL+ I  D+W   I +S + + YY+ FVD ++RYTW+Y 
Sbjct: 502 DCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSPI-LSIDNYRYYVIFVDHFTRYTWLYP 561

Query: 570 LYSK---PDAFL--------TFQNSKPVLQTDGGTEFKPFKPFLDQHGIKHRITCPYTSK 622
           L  K    D F+         FQ     L +D G EF   + +L QHGI H  + P+T +
Sbjct: 562 LKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVLRDYLSQHGISHFTSPPHTPE 621
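
The percentages in each HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them for the first hit (Q9ZT94) as a sanity check. The regular expression and the name `recompute_percentages` are assumptions made for illustration, and the pattern also tolerates the "Postives" spelling found in some exports of this report.

```python
import re

# Matches the 'Identity = a/n (x%), Positives = b/n (y%)' part of an HSP summary.
SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recompute_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    return (
        round(100 * int(ident) / int(ident_len), 2),
        round(100 * int(pos) / int(pos_len), 2),
    )


line = "Identity = 229/761 (30.09%), Positives = 331/761 (43.50%), Query Frame = 0"
print(recompute_percentages(line))  # (30.09, 43.5)
```

The recomputed values match the reported 30.09% identity and 43.50% positives over the 761-column alignment.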

BLAST of Pay0005361 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 272.3 bits (695), Expect = 1.6e-71
Identity = 217/757 (28.67%), Positives = 329/757 (43.46%), Query Frame = 0

Query: 30  SLVKPNDDSFLLWKFQILTILEAYDLEIFLESESEPPSKYLTSTGSSSTSATRTPNPEYK 89
           ++ K    ++L+W  Q+  + + Y+L  FL+  +  P        +  T A    NP+Y 
Sbjct: 22  NVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMP------PATIGTDAAPRVNPDYT 81

Query: 90  VWKRQDRLISSWILGS-------------------------------------------- 149
            WKRQD+LI S +LG+                                            
Sbjct: 82  RWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYGHVTQLRTQLKQW 141

Query: 150 --------------------MTLINKSVSSDDHILYILAGLGSDYQSMISVISKKN---- 209
                               + L+ K +  D+ +  +L  L  +Y+ +I  I+ K+    
Sbjct: 142 TKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIAAKDTPPT 201

Query: 210 --------------LLSFCTRSYVFITYSRMNNQ------HNQRGGRGNGRSNRGG---- 269
                         +L+  + + + IT + ++++      +N  G R N   NR      
Sbjct: 202 LTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNS 261

Query: 270 ------------RGNRNKP---QCQI----------CTKHRHNMSAMVA----------S 329
                         N++KP   +CQI          C++ +H +S++ +           
Sbjct: 262 KPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPPSPFTPWQ 321

Query: 330 PDLNI-------DNNWYPDSGATNHLTH----------IRGGIQIYAANGLGLPITHYGS 389
           P  N+        NNW  DSGAT+H+T             GG  +  A+G  +PI+H GS
Sbjct: 322 PRANLALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVADGSTIPISHTGS 381

Query: 390 MSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVL 449
            S ++   P     L+N  +VP I KNLISV +    N V  EF P    VKDL+TG  L
Sbjct: 382 TSLSTKSRP---LNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQVKDLNTGVPL 441

Query: 450 LQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVK 509
           LQG   D LY++ I  S      +  ++K+  ++             H RLGHP    + 
Sbjct: 442 LQGKTKDELYEWPIASSQPVSLFASPSSKATHSS------------WHARLGHPAPSILN 501

Query: 510 DVLNHIDYSSGTMN---KMTFCEACALGKHHALPFSHSLTHYTHPLQIISCDLWGPAINV 569
            V++  +YS   +N   K   C  C + K + +PFS S  + T PL+ I  D+W   I +
Sbjct: 502 SVIS--NYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSSPI-L 561

Query: 570 SHNCFSYYISFVDAYSRYTWIYFLYSK---PDAFLTFQNSKP--------VLQTDGGTEF 598
           SH+ + YY+ FVD ++RYTW+Y L  K    + F+TF+N              +D G EF
Sbjct: 562 SHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEF 621

BLAST of Pay0005361 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 136.3 bits (342), Expect = 1.4e-30
Identity = 161/667 (24.14%), Positives = 267/667 (40.03%), Query Frame = 0

Query: 154 TYSRMNNQHNQRGGRGNGRS------------NRGGRGNRNKPQCQ------ICTKHRHN 213
           +Y R +N + + G RG  ++            N+ G   R+ P  +         K+  N
Sbjct: 204 SYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQKNDDN 263

Query: 214 MSAMVASPDLNI---------------DNNWYPDSGATNHLTHIRGGIQIYAANGLGL-- 273
            +AMV + D  +               ++ W  D+ A++H T +R     Y A   G   
Sbjct: 264 TAAMVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRDLFCRYVAGDFGTVK 323

Query: 274 -------PITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFEFHP 333
                   I   G +   +++    +  L +  HVP +  NLIS     +D +  +  + 
Sbjct: 324 MGNTSYSKIAGIGDICIKTNV--GCTLVLKDVRHVPDLRMNLISGIALDRDGYESYFANQ 383

Query: 334 TLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHLLDL 393
                K      V+ +G+    LY+     ++  +   ELN            +   +DL
Sbjct: 384 KWRLTKG---SLVIAKGVARGTLYR-----TNAEICQGELNA---------AQDEISVDL 443

Query: 394 SHRRLGHPHLPTVKDVLNH--IDYSSGTMNKMTFCEACALGKHHALPFSHSLTHYTHPLQ 453
            H+R+GH     ++ +     I Y+ GT  K   C+ C  GK H + F  S     + L 
Sbjct: 444 WHKRMGHMSEKGLQILAKKSLISYAKGTTVKP--CDYCLFGKQHRVSFQTSSERKLNILD 503

Query: 454 IISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAFLTFQNSKPV------- 513
           ++  D+ GP    S     Y+++F+D  SR  W+Y L +K   F  FQ    +       
Sbjct: 504 LVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGR 563

Query: 514 ----LQTDGGTEF--KPFKPFLDQHGIKHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQ 573
               L++D G E+  + F+ +   HGI+H  T P T + N + ER +R I+E   ++L  
Sbjct: 564 KLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSMLRM 623

Query: 574 ATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGWKPNFPSFRVFGCKWYKCLASD 633
           A LP SFW EA  T+ YLIN  P+  L    P    +  + ++   +VFGC+ +  +  +
Sbjct: 624 AKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKE 683

Query: 634 GRLFISKHVLFDENSFPYASFSSHSSMLKSKNVLSPPFHSIIQSS--LINHNEDRRHTDT 693
            R         D+ S P   F  +        +  P    +I+S   +   +E R   D 
Sbjct: 684 QR------TKLDDKSIP-CIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADM 743

Query: 694 VSDNTDHLNPIIVYPLETGTQESSRDDGNIGGITQSLSP---MEPQHQIDSGMNT---QL 748
                + + P  V    T    +S +        Q   P   +E   Q+D G+       
Sbjct: 744 SEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGVEEVEHPT 803

BLAST of Pay0005361 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 110.5 bits (275), Expect = 8.1e-23
Identity = 117/452 (25.88%), Positives = 184/452 (40.71%), Query Frame = 0

Query: 178 RGNRNKPQCQICTKHRHNMSAMVASPDLNIDN-NWYPDSGATNHLTHIRG--GIQIYAAN 237
           +   N+ Q Q  T H         +    +DN  +  DSGA++HL +        +    
Sbjct: 255 KNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLDSGASDHLINDESLYTDSVEVVP 314

Query: 238 GLGLPITHYGSMSF--NSSILPFKS---FTLNNFFHVPFITKNLISVSQFAKDNHVFFEF 297
            L + +   G   +     I+  ++    TL +         NL+SV +  ++  +  EF
Sbjct: 315 PLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCKEAAGNLMSVKRL-QEAGMSIEF 374

Query: 298 HPTLCYVKDLDTGQVLLQGLLND----GLYKFTIQPSHK---RLHHSELNTKSVFNTVVP 357
             +   +       V   G+LN+        ++I   HK   RL H              
Sbjct: 375 DKSGVTISKNGLMVVKNSGMLNNVPVINFQAYSINAKHKNNFRLWHERF----------- 434

Query: 358 KSNSHLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEACALGKHHALPFSH--S 417
               H+ D     +   ++ + + +LN+++ S         CE C  GK   LPF     
Sbjct: 435 ---GHISDGKLLEIKRKNMFSDQSLLNNLELS------CEICEPCLNGKQARLPFKQLKD 494

Query: 418 LTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAFLTFQ--- 477
            TH   PL ++  D+ GP   V+ +  +Y++ FVD ++ Y   Y +  K D F  FQ   
Sbjct: 495 KTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFV 554

Query: 478 -------NSKPV-LQTDGGTEF--KPFKPFLDQHGIKHRITCPYTSKQNDIVERKHRHIM 537
                  N K V L  D G E+     + F  + GI + +T P+T + N + ER  R I 
Sbjct: 555 AKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTIT 614

Query: 538 EMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNIS--PLEKLSGWKPNFPSFRVF 593
           E   T++S A L  SFW EA  T+ YLIN +P+  L + S  P E     KP     RVF
Sbjct: 615 EKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVF 674

BLAST of Pay0005361 vs. ExPASy Swiss-Prot
Match: P0C2I5 (Transposon Ty1-LR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY1B-LR2 PE=3 SV=1)

HSP 1 Score: 84.0 bits (206), Expect = 8.1e-15
Identity = 92/367 (25.07%), Positives = 151/367 (41.14%), Query Frame = 0

Query: 214 DSGATNHLT----HIRGG-----IQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFF 273
           DSGA+  L     HI        I +  A    +PI   G + F+       + T     
Sbjct: 461 DSGASRTLIRSAHHIHSASSNPDINVVDAQKRNIPINAIGDLQFH---FQDNTKTSIKVL 520

Query: 274 HVPFITKNLISVSQFAKDNHVFFEFHPTLCYVKDL---DTGQVLLQGLLNDGLYKFTIQP 333
           H P I  +L+S+++ A           T C+ K++     G VL   +     Y      
Sbjct: 521 HTPNIAYDLLSLNELA-------AVDITACFTKNVLERSDGTVLAPIVKYGDFY----WV 580

Query: 334 SHKRLHHSELNTKSVFNTVVPKS-NSHLLDLSHRRLGHPHLPTVKDVLNH---------- 393
           S K L  S ++  ++ N    +S   +     HR L H + PT++  L +          
Sbjct: 581 SKKYLLPSNISVPTINNVHTSESTRKYPYPFIHRMLAHANAPTIRYSLKNNTITYFNESD 640

Query: 394 IDYSSGTMNKMTFCEACALGK----HHALPFSHSLTHYTHPLQIISCDLWGPAINVSHNC 453
           +D+SS    +   C  C +GK     H         +   P Q +  D++GP  N+ ++ 
Sbjct: 641 VDWSSAIDYQ---CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPNSA 700

Query: 454 FSYYISFVDAYSRYTWIYFLYSKP-----DAFLT--------FQNSKPVLQTDGGTEF-- 513
            SY+ISF D  +++ W+Y L+ +      D F T        FQ S  V+Q D G+E+  
Sbjct: 701 PSYFISFTDETTKFRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTN 760

Query: 514 KPFKPFLDQHGIKHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTS 539
           +    FL+++GI    T    S+ + + ER +R +++   T L  + LP   W  A   S
Sbjct: 761 RTLHKFLEKNGITPCYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFS 810

BLAST of Pay0005361 vs. ExPASy TrEMBL
Match: A0A5D3CH97 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold25G00040 PE=4 SV=1)

HSP 1 Score: 1087.8 bits (2812), Expect = 0.0e+00
Identity = 599/889 (67.38%), Positives = 628/889 (70.64%), Query Frame = 0

Query: 1   MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLE 60
           MSS SSLLGVENT ASSPINQIFGSG KISLVK NDD+FLLWKFQILT LEAYDLE FLE
Sbjct: 1   MSSTSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLE 60

Query: 61  SESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMT------------- 120
           SESEPPSKYL ST SSS SAT TPNP YKVWKRQDRLISSW+LGSM+             
Sbjct: 61  SESEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSA 120

Query: 121 ---------------------------------------------------LINKSVSSD 180
                                                               INK VSSD
Sbjct: 121 KEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSD 180

Query: 181 DHILYILAGLGSDYQSMISVISKK---------------------------------NLL 240
           DHILYILAGLGSDYQSMISVIS +                                 N++
Sbjct: 181 DHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIV 240

Query: 241 SFCT----RSYVFITYSRMNNQH--NQRGGRGNGRSNRGGRGNRNKPQCQICTK------ 300
           +  T     SY+    +  +N H  NQRGGRGNGRSNRG RGNRNKPQCQIC K      
Sbjct: 241 TQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSAD 300

Query: 301 -----------------HRHN-----------MSAMVASPDLNIDNNWYPDSGATNHLTH 360
                            + HN           MSAMVA+ DLNID+NWYPDSGATNHLTH
Sbjct: 301 RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 360

Query: 361 ----------IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNL 420
                       GG QIYAANG GLPITHYGSMSFNSS LPFKSFTLNN   VP ITKNL
Sbjct: 361 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 420

Query: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNT 480
           ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI+PSHKRLHHS  NT
Sbjct: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNT 480

Query: 481 KSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEACALGKHH 540
           K VFNTVVPKSN+ LLDL HRRLGHPHLP VK VLNHID SSGT+NK+ FCEACALGKHH
Sbjct: 481 KPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHH 540

Query: 541 ALPFSHSLTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAF 600
           ALPFSHSLT YTHPLQ+I+CDLWGPA+NVSHN F YYISFVDAYSRYTWIYFL SK DAF
Sbjct: 541 ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF 600

Query: 601 LTFQNSKPV-----------LQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQNDIVERK 660
           L FQ  K             LQTDGGTEFKPFKPFLDQHGI+HRITCPYTSKQNDIVERK
Sbjct: 601 LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERK 660

Query: 661 HRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGWKPNFPSF 702
           HR+IMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPT VLDNISPLEKL   KPNFPS 
Sbjct: 661 HRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSL 720

BLAST of Pay0005361 vs. ExPASy TrEMBL
Match: A0A5A7U233 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G00060 PE=4 SV=1)

HSP 1 Score: 1087.0 bits (2810), Expect = 0.0e+00
Identity = 599/889 (67.38%), Positives = 627/889 (70.53%), Query Frame = 0

Query: 1   MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLE 60
           MSS SSLLGVENT ASSPINQIFGSG KISLVK NDD+FLLWKFQILT LEAYDLE FLE
Sbjct: 1   MSSTSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLE 60

Query: 61  SESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMT------------- 120
           SESEPPSKYL ST SSS SAT TPNP YKVWKRQDRLISSW+LGSM+             
Sbjct: 61  SESEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSA 120

Query: 121 ---------------------------------------------------LINKSVSSD 180
                                                               INK VSSD
Sbjct: 121 KEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSD 180

Query: 181 DHILYILAGLGSDYQSMISVISKK---------------------------------NLL 240
           DHILYILAGLGSDYQSMISVIS +                                 N++
Sbjct: 181 DHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIV 240

Query: 241 SFCT----RSYVFITYSRMNNQH--NQRGGRGNGRSNRGGRGNRNKPQCQICTK------ 300
           +  T     SY+    +  +N H  NQRGGRGNGRSNRG RGNRNKPQCQIC K      
Sbjct: 241 TQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSAD 300

Query: 301 -----------------HRHN-----------MSAMVASPDLNIDNNWYPDSGATNHLTH 360
                            + HN           MSAMVA+ DLNID+NWYPDSGATNHLTH
Sbjct: 301 RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 360

Query: 361 ----------IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNL 420
                       GG QIYAANG GLPITHYGSMSFNSS LPFKSFTLNN   VP ITKNL
Sbjct: 361 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 420

Query: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNT 480
           ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI+PSHKRLHHS  NT
Sbjct: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNT 480

Query: 481 KSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEACALGKHH 540
           K VFNTVVPKSN+ LLDL HRRLGHPHLP VK VLNHID SSGT+NK+ FCEACALGKHH
Sbjct: 481 KPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHH 540

Query: 541 ALPFSHSLTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAF 600
           ALPFSHSLT YTHPLQ+I+CDLWGPA+NVSHN F YYISFVDAYSRYTWIYFL SK DAF
Sbjct: 541 ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF 600

Query: 601 LTFQNSKPV-----------LQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQNDIVERK 660
           L FQ  K             LQTDGGTEFKPFKPFLDQHGI+HRITCPYTSKQNDIVERK
Sbjct: 601 LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERK 660

Query: 661 HRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGWKPNFPSF 702
           HR+IMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPT VLDNISPLEKL   KPNFPS 
Sbjct: 661 HRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSL 720

BLAST of Pay0005361 vs. ExPASy TrEMBL
Match: A0A5A7VFQ6 (Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold418G00150 PE=4 SV=1)

HSP 1 Score: 717.6 bits (1851), Expect = 5.3e-203
Identity = 391/596 (65.60%), Positives = 407/596 (68.29%), Query Frame = 0

Query: 194 HNMSAMVASPDLNIDNNWYPDSGATNHLTH----------IRGGIQIYAANGLGLPITHY 253
           H MSAMVA+PDLNID+NWYPDSGATNHLTH            GG QIYAANG GLPITHY
Sbjct: 40  HQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHY 99

Query: 254 GSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQ 313
           GSMSFNSS LPFKSFTLNN  H                                DLDTGQ
Sbjct: 100 GSMSFNSSTLPFKSFTLNNLLH--------------------------------DLDTGQ 159

Query: 314 VLLQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHLLDLSHRRLGHPHLPT 373
           VLLQGLLNDGLYKFTIQPSHKRLHHS+ NTKSVFNTVVPKSN+ LLDL HRRLGHPHLPT
Sbjct: 160 VLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPT 219

Query: 374 VKDVLNHIDYSSGTMNKMTFCEACALGKHHALPFSHSLTHYTHPLQIISCDLWGPAINVS 433
           VK VLNHID+SS    K   C   +LG+                                
Sbjct: 220 VKAVLNHIDHSS-AFQKFKTCVEKSLGQ-------------------------------- 279

Query: 434 HNCFSYYISFVDAYSRYTWIYFLYSKPDAFLTFQNSKPVLQTDGGTEFKPFKPFLDQHGI 493
                                              S   LQTDGGTEFKPFKPFLDQHGI
Sbjct: 280 -----------------------------------SIKSLQTDGGTEFKPFKPFLDQHGI 339

Query: 494 KHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVL 553
           +HRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPT VL
Sbjct: 340 EHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVL 399

Query: 554 DNISPLEKLSGWKPNFPSFRVFGC------------------------------KWYKCL 613
           DNISPLEKL   KPNFP  RVFGC                              K YKCL
Sbjct: 400 DNISPLEKLFCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL 459

Query: 614 ASDGRLFISKHVLFDENSFPYASFSSHSSMLKSKNVLSPPFHSIIQSSLINHNEDRRHTD 673
           ASDGRLFIS+HVLFDENSFPYASF+SHSS+ KSKNVLSPP HSII SSL+NHNEDRRHTD
Sbjct: 460 ASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTD 519

Query: 674 TVSDNTDHLNPIIVYPLETGTQESSRDDGNIGGITQSLSPMEPQHQIDSGMNTQLQSTSV 733
           TVSDNTD+LN  IVYPLETGTQESSRDDGN GGITQS SPMEP HQ DSGMNTQLQSTS+
Sbjct: 520 TVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSI 535

Query: 734 HLMITRSKHGIFKPKTFLIDYTQTEPCNAKEAFKHPHWKKAMEEKFEALQKNDTKS 750
           H MIT+SKH IFKPK FLIDYTQTE CNAKEAF HPHWKKAMEE+FEALQKN T S
Sbjct: 580 HPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWS 535

BLAST of Pay0005361 vs. ExPASy TrEMBL
Match: A0A5D3D3G2 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold411G00260 PE=4 SV=1)

HSP 1 Score: 489.6 bits (1259), Expect = 2.4e-134
Identity = 264/389 (67.87%), Positives = 287/389 (73.78%), Query Frame = 0

Query: 1   MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLE 60
           MSS SSLLGVENT ASSPINQIFGSG KISLVK NDD+FLLWKFQILT LEAYD+E FLE
Sbjct: 1   MSSTSSLLGVENTEASSPINQIFGSGNKISLVKLNDDNFLLWKFQILTALEAYDMENFLE 60

Query: 61  SESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMTLINKSVSSDDHIL 120
           SESEPP+KYLTSTGSSSTSATRTPNPEYK  + + +LIS   L S+ ++ ++        
Sbjct: 61  SESEPPTKYLTSTGSSSTSATRTPNPEYKESQNESKLISETALPSVNIVTQTTEKG---- 120

Query: 121 YILAGLGSDYQSMISVISKKNLLSFCTRSYVFITYSRMNNQH--NQRGGRGNGRSNRGGR 180
                                       SY+  + +  +N H  NQRGGRGNGRSN GGR
Sbjct: 121 --------------------------AESYIRTSQNNYHNNHSYNQRGGRGNGRSNSGGR 180

Query: 181 GNRNKPQCQICTKHRHNMSAMVASPDLNIDNNWYPDSGATNHLTH----------IRGGI 240
           GNRNKPQCQICTK  H           + D+NWYPDSGATNHLTH            GG 
Sbjct: 181 GNRNKPQCQICTKLGH-----------SADSNWYPDSGATNHLTHSLSNLSTGSEYGGGN 240

Query: 241 QIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFE 300
           QIY ANG GLPITHYGSMSFNSS LPFKSFTL N  HVP ITKNLISVS FAKDNHVFFE
Sbjct: 241 QIYTANGSGLPITHYGSMSFNSSTLPFKSFTLKNLLHVPSITKNLISVSLFAKDNHVFFE 300

Query: 301 FHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHL 360
           FHPTLCYVKDLD GQVLLQGLLNDGLYKFTIQPSHKRLH+S+ NTKSVFNTVVPKSN+ L
Sbjct: 301 FHPTLCYVKDLDNGQVLLQGLLNDGLYKFTIQPSHKRLHNSDPNTKSVFNTVVPKSNTPL 348

Query: 361 LDLSHRRLGHPHLPTVKDVLNHIDYSSGT 378
           +DL HRRLGHPHLPTVK VL H+D+SSGT
Sbjct: 361 IDLWHRRLGHPHLPTVKAVLKHVDHSSGT 348

BLAST of Pay0005361 vs. ExPASy TrEMBL
Match: A0A438EA49 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2917 PE=4 SV=1)

HSP 1 Score: 463.4 bits (1191), Expect = 1.8e-126
Identity = 339/966 (35.09%), Positives = 450/966 (46.58%), Query Frame = 0

Query: 13  TNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLESESEPPSKYLTS 72
           T     +  +     ++  ++  DD+FL+WK+QI   +  Y LE FL    + P K +T 
Sbjct: 26  TTTDESLRMVISPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMVT- 85

Query: 73  TGSSSTSATRTPNPEYKVWKRQDRLISSWILGS--------------------------- 132
                      PNP+++ ++RQD L+ SW+L S                           
Sbjct: 86  ----DKIGVLVPNPKFRDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEDGLTMRDYLT 145

Query: 133 --------MTLINKSVSSDDHILYILAGLGSDYQSMISVISKKN---LLSFCTRSYV--- 192
                   +      +S  DHIL I+ GLG +Y+S+I+VIS K     L + T + +   
Sbjct: 146 KMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQYVTSTLIAHE 205

Query: 193 ------------FITY-SRMNNQ------------------HNQRGG----RGNGRSNRG 252
                        + Y S+ +N+                   NQ GG    RG+   NRG
Sbjct: 206 GRIAHKISSNDLSVNYTSQYSNRGPSSSWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRG 265

Query: 253 -GRGNRN---KPQCQICTKHRH-------------------------------------- 312
            GRG      KPQCQ+C K  H                                      
Sbjct: 266 RGRGRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGS 325

Query: 313 ---------------------NMSAMVASPDLNIDNNWYPDSGATNHLTH---------- 372
                                 M AMVA+P+   +  W+PDSGATNH+TH          
Sbjct: 326 ISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLNSGAE 385

Query: 373 IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDN 432
             G  +I+  NG GL I+H G   F SS  P K   L N   VP I KNL+SVSQFA+DN
Sbjct: 386 YNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDN 445

Query: 433 HVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQP-----------SHKRLHHSELN 492
           +V+FEFHP +C+VKD     +LLQG L+ GLY+F +             S+ +   +  N
Sbjct: 446 NVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCN 505

Query: 493 TKSVFN---TVVPKSNS--HLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEAC 552
              V N       K+NS  H+ DL H+RLGHP    V  VLN       T +  + C AC
Sbjct: 506 ASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSAC 565

Query: 553 ALGKHHALPFSHSLTHYTHPLQIISCDLWGPA-INVSHNCFSYYISFVDAYSRYTWIYFL 612
            LGK H LPF  S T YT PLQ++  DLWGPA IN S+  F+YY+SFVDAYSRYTW+YFL
Sbjct: 566 QLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYG-FTYYVSFVDAYSRYTWVYFL 625

Query: 613 YSKP---DAFLTFQNSKPV--------LQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQ 672
            +K    +AFL F+    +         QTD G EF+  K + +Q+GI HR++CP+TSKQ
Sbjct: 626 KTKSQTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQ 685

Query: 673 NDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGW 732
           N I+ERKHRHI+E+GLTLL+QA+LPL +W +AFST+V+LIN LPT VL    P E L   
Sbjct: 686 NGIIERKHRHIVELGLTLLAQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNS 745

Query: 733 KPNFPSFRVFGC------------------------------KWYKCLASDGRLFISKHV 750
           KPN+   +VFGC                              K YKCL   GR+FIS+ V
Sbjct: 746 KPNYSQLKVFGCLCFPHLRPYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSV 805
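
The percentages in an HSP statistics line follow directly from the match counts and the alignment length, for example 339/966 for identity in the hit above. The following is a minimal sketch, not part of the database page or of BLAST itself, that parses such a line and recomputes the percentages; the names HSP_STATS, parse_hsp_stats and percent are hypothetical helpers introduced only for illustration.

```python
import re

# Hypothetical helper pattern (an assumption, not a BLAST API).
# "Posi?tives" also accepts the "Postives" spelling found in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from an HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


if __name__ == "__main__":
    stats = parse_hsp_stats(
        "Identity = 339/966 (35.09%), Positives = 450/966 (46.58%), Query Frame = 0"
    )
    for name, (count, length, reported) in stats.items():
        # Compare the reported value with the value recomputed from the counts.
        print(name, reported, percent(count, length))
```

Recomputing the value from the counts is a cheap consistency check when alignments are copied between reports.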

BLAST of Pay0005361 vs. NCBI nr
Match: TYK10642.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 1087.8 bits (2812), Expect = 0.0e+00
Identity = 599/889 (67.38%), Positives = 628/889 (70.64%), Query Frame = 0

Query: 1   MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLE 60
           MSS SSLLGVENT ASSPINQIFGSG KISLVK NDD+FLLWKFQILT LEAYDLE FLE
Sbjct: 1   MSSTSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLE 60

Query: 61  SESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMT------------- 120
           SESEPPSKYL ST SSS SAT TPNP YKVWKRQDRLISSW+LGSM+             
Sbjct: 61  SESEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSA 120

Query: 121 ---------------------------------------------------LINKSVSSD 180
                                                               INK VSSD
Sbjct: 121 KEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSD 180

Query: 181 DHILYILAGLGSDYQSMISVISKK---------------------------------NLL 240
           DHILYILAGLGSDYQSMISVIS +                                 N++
Sbjct: 181 DHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIV 240

Query: 241 SFCT----RSYVFITYSRMNNQH--NQRGGRGNGRSNRGGRGNRNKPQCQICTK------ 300
           +  T     SY+    +  +N H  NQRGGRGNGRSNRG RGNRNKPQCQIC K      
Sbjct: 241 TQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSAD 300

Query: 301 -----------------HRHN-----------MSAMVASPDLNIDNNWYPDSGATNHLTH 360
                            + HN           MSAMVA+ DLNID+NWYPDSGATNHLTH
Sbjct: 301 RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 360

Query: 361 ----------IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNL 420
                       GG QIYAANG GLPITHYGSMSFNSS LPFKSFTLNN   VP ITKNL
Sbjct: 361 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 420

Query: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNT 480
           ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI+PSHKRLHHS  NT
Sbjct: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNT 480

Query: 481 KSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEACALGKHH 540
           K VFNTVVPKSN+ LLDL HRRLGHPHLP VK VLNHID SSGT+NK+ FCEACALGKHH
Sbjct: 481 KPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHH 540

Query: 541 ALPFSHSLTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAF 600
           ALPFSHSLT YTHPLQ+I+CDLWGPA+NVSHN F YYISFVDAYSRYTWIYFL SK DAF
Sbjct: 541 ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF 600

Query: 601 LTFQNSKPV-----------LQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQNDIVERK 660
           L FQ  K             LQTDGGTEFKPFKPFLDQHGI+HRITCPYTSKQNDIVERK
Sbjct: 601 LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERK 660

Query: 661 HRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGWKPNFPSF 702
           HR+IMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPT VLDNISPLEKL   KPNFPS 
Sbjct: 661 HRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSL 720

BLAST of Pay0005361 vs. NCBI nr
Match: KAA0048297.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 1087.0 bits (2810), Expect = 0.0e+00
Identity = 599/889 (67.38%), Positives = 627/889 (70.53%), Query Frame = 0

Query: 1   MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLE 60
           MSS SSLLGVENT ASSPINQIFGSG KISLVK NDD+FLLWKFQILT LEAYDLE FLE
Sbjct: 1   MSSTSSLLGVENTEASSPINQIFGSGNKISLVKLNDDTFLLWKFQILTALEAYDLENFLE 60

Query: 61  SESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMT------------- 120
           SESEPPSKYL ST SSS SAT TPNP YKVWKRQDRLISSW+LGSM+             
Sbjct: 61  SESEPPSKYLISTESSSASATGTPNPAYKVWKRQDRLISSWLLGSMSEEILNQMLHCKSA 120

Query: 121 ---------------------------------------------------LINKSVSSD 180
                                                               INK VSSD
Sbjct: 121 KEIWETLQGIFSSRYLAQAMQFKNKLHNIKKGSMPLKEYFLKILQCVDALASINKPVSSD 180

Query: 181 DHILYILAGLGSDYQSMISVISKK---------------------------------NLL 240
           DHILYILAGLGSDYQSMISVIS +                                 N++
Sbjct: 181 DHILYILAGLGSDYQSMISVISARTDSPSVQEVMSLLLTQESQNESKLISETALPSVNIV 240

Query: 241 SFCT----RSYVFITYSRMNNQH--NQRGGRGNGRSNRGGRGNRNKPQCQICTK------ 300
           +  T     SY+    +  +N H  NQRGGRGNGRSNRG RGNRNKPQCQIC K      
Sbjct: 241 TQTTEKGAESYIRTNQNNYHNNHSYNQRGGRGNGRSNRGRRGNRNKPQCQICAKLGYSAD 300

Query: 301 -----------------HRHN-----------MSAMVASPDLNIDNNWYPDSGATNHLTH 360
                            + HN           MSAMVA+ DLNID+NWYPDSGATNHLTH
Sbjct: 301 RCFFRYTPRSNSSGYSPNSHNTSYTNMNNHPQMSAMVAALDLNIDSNWYPDSGATNHLTH 360

Query: 361 ----------IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNL 420
                       GG QIYAANG GLPITHYGSMSFNSS LPFKSFTLNN   VP ITKNL
Sbjct: 361 SLSNLSIGSEYGGGNQIYAANGSGLPITHYGSMSFNSSTLPFKSFTLNNLLQVPSITKNL 420

Query: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNT 480
           ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTI+PSHKRLHHS  NT
Sbjct: 421 ISVSQFAKDNHVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIEPSHKRLHHSNSNT 480

Query: 481 KSVFNTVVPKSNSHLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEACALGKHH 540
           K VFNTVVPKSN+ LLDL HRRLGHPHLP VK VLNHID SSGT+NK+ FCEACALGKHH
Sbjct: 481 KPVFNTVVPKSNTPLLDLWHRRLGHPHLPIVKAVLNHIDNSSGTINKLNFCEACALGKHH 540

Query: 541 ALPFSHSLTHYTHPLQIISCDLWGPAINVSHNCFSYYISFVDAYSRYTWIYFLYSKPDAF 600
           ALPFSHSLT YTHPLQ+I+CDLWGPA+NVSHN F YYISFVDAYSRYTWIYFL SK DAF
Sbjct: 541 ALPFSHSLTLYTHPLQLITCDLWGPAVNVSHNGFRYYISFVDAYSRYTWIYFLNSKSDAF 600

Query: 601 LTFQNSKPV-----------LQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQNDIVERK 660
           L FQ  K             LQTDGGTEFKPFKPFLDQHGI+HRITCPYTSKQNDIVERK
Sbjct: 601 LAFQKFKTCVEKSLGQSIKSLQTDGGTEFKPFKPFLDQHGIEHRITCPYTSKQNDIVERK 660

Query: 661 HRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGWKPNFPSF 702
           HR+IMEMGLTLLSQATLPLSFWDEAFSTSVYLIN LPT VLDNISPLEKL   KPNFPS 
Sbjct: 661 HRYIMEMGLTLLSQATLPLSFWDEAFSTSVYLINRLPTPVLDNISPLEKLFCRKPNFPSL 720

BLAST of Pay0005361 vs. NCBI nr
Match: KAA0067212.1 (retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa])

HSP 1 Score: 717.6 bits (1851), Expect = 1.1e-202
Identity = 391/596 (65.60%), Positives = 407/596 (68.29%), Query Frame = 0

Query: 194 HNMSAMVASPDLNIDNNWYPDSGATNHLTH----------IRGGIQIYAANGLGLPITHY 253
           H MSAMVA+PDLNID+NWYPDSGATNHLTH            GG QIYAANG GLPITHY
Sbjct: 40  HQMSAMVATPDLNIDSNWYPDSGATNHLTHSLSNLSTGSEYGGGNQIYAANGSGLPITHY 99

Query: 254 GSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFEFHPTLCYVKDLDTGQ 313
           GSMSFNSS LPFKSFTLNN  H                                DLDTGQ
Sbjct: 100 GSMSFNSSTLPFKSFTLNNLLH--------------------------------DLDTGQ 159

Query: 314 VLLQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHLLDLSHRRLGHPHLPT 373
           VLLQGLLNDGLYKFTIQPSHKRLHHS+ NTKSVFNTVVPKSN+ LLDL HRRLGHPHLPT
Sbjct: 160 VLLQGLLNDGLYKFTIQPSHKRLHHSDSNTKSVFNTVVPKSNTPLLDLWHRRLGHPHLPT 219

Query: 374 VKDVLNHIDYSSGTMNKMTFCEACALGKHHALPFSHSLTHYTHPLQIISCDLWGPAINVS 433
           VK VLNHID+SS    K   C   +LG+                                
Sbjct: 220 VKAVLNHIDHSS-AFQKFKTCVEKSLGQ-------------------------------- 279

Query: 434 HNCFSYYISFVDAYSRYTWIYFLYSKPDAFLTFQNSKPVLQTDGGTEFKPFKPFLDQHGI 493
                                              S   LQTDGGTEFKPFKPFLDQHGI
Sbjct: 280 -----------------------------------SIKSLQTDGGTEFKPFKPFLDQHGI 339

Query: 494 KHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVL 553
           +HRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAF TSVYLIN LPT VL
Sbjct: 340 EHRITCPYTSKQNDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFFTSVYLINLLPTPVL 399

Query: 554 DNISPLEKLSGWKPNFPSFRVFGC------------------------------KWYKCL 613
           DNISPLEKL   KPNFP  RVFGC                              K YKCL
Sbjct: 400 DNISPLEKLFCRKPNFPFLRVFGCKCYPYFRPYQSHKLSLRSTPCTFLGYSTSHKGYKCL 459

Query: 614 ASDGRLFISKHVLFDENSFPYASFSSHSSMLKSKNVLSPPFHSIIQSSLINHNEDRRHTD 673
           ASDGRLFIS+HVLFDENSFPYASF+SHSS+ KSKNVLSPP HSII SSL+NHNEDRRHTD
Sbjct: 460 ASDGRLFISRHVLFDENSFPYASFASHSSIPKSKNVLSPPLHSIIPSSLMNHNEDRRHTD 519

Query: 674 TVSDNTDHLNPIIVYPLETGTQESSRDDGNIGGITQSLSPMEPQHQIDSGMNTQLQSTSV 733
           TVSDNTD+LN  IVYPLETGTQESSRDDGN GGITQS SPMEP HQ DSGMNTQLQSTS+
Sbjct: 520 TVSDNTDYLNSTIVYPLETGTQESSRDDGNSGGITQSPSPMEPPHQTDSGMNTQLQSTSI 535

Query: 734 HLMITRSKHGIFKPKTFLIDYTQTEPCNAKEAFKHPHWKKAMEEKFEALQKNDTKS 750
           H MIT+SKH IFKPK FLIDYTQTE CNAKEAF HPHWKKAMEE+FEALQKN T S
Sbjct: 580 HPMITQSKHDIFKPKAFLIDYTQTETCNAKEAFNHPHWKKAMEEEFEALQKNGTWS 535

BLAST of Pay0005361 vs. NCBI nr
Match: KAA0068024.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYK18104.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 489.6 bits (1259), Expect = 4.9e-134
Identity = 264/389 (67.87%), Positives = 287/389 (73.78%), Query Frame = 0

Query: 1   MSSNSSLLGVENTNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLE 60
           MSS SSLLGVENT ASSPINQIFGSG KISLVK NDD+FLLWKFQILT LEAYD+E FLE
Sbjct: 1   MSSTSSLLGVENTEASSPINQIFGSGNKISLVKLNDDNFLLWKFQILTALEAYDMENFLE 60

Query: 61  SESEPPSKYLTSTGSSSTSATRTPNPEYKVWKRQDRLISSWILGSMTLINKSVSSDDHIL 120
           SESEPP+KYLTSTGSSSTSATRTPNPEYK  + + +LIS   L S+ ++ ++        
Sbjct: 61  SESEPPTKYLTSTGSSSTSATRTPNPEYKESQNESKLISETALPSVNIVTQTTEKG---- 120

Query: 121 YILAGLGSDYQSMISVISKKNLLSFCTRSYVFITYSRMNNQH--NQRGGRGNGRSNRGGR 180
                                       SY+  + +  +N H  NQRGGRGNGRSN GGR
Sbjct: 121 --------------------------AESYIRTSQNNYHNNHSYNQRGGRGNGRSNSGGR 180

Query: 181 GNRNKPQCQICTKHRHNMSAMVASPDLNIDNNWYPDSGATNHLTH----------IRGGI 240
           GNRNKPQCQICTK  H           + D+NWYPDSGATNHLTH            GG 
Sbjct: 181 GNRNKPQCQICTKLGH-----------SADSNWYPDSGATNHLTHSLSNLSTGSEYGGGN 240

Query: 241 QIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDNHVFFE 300
           QIY ANG GLPITHYGSMSFNSS LPFKSFTL N  HVP ITKNLISVS FAKDNHVFFE
Sbjct: 241 QIYTANGSGLPITHYGSMSFNSSTLPFKSFTLKNLLHVPSITKNLISVSLFAKDNHVFFE 300

Query: 301 FHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQPSHKRLHHSELNTKSVFNTVVPKSNSHL 360
           FHPTLCYVKDLD GQVLLQGLLNDGLYKFTIQPSHKRLH+S+ NTKSVFNTVVPKSN+ L
Sbjct: 301 FHPTLCYVKDLDNGQVLLQGLLNDGLYKFTIQPSHKRLHNSDPNTKSVFNTVVPKSNTPL 348

Query: 361 LDLSHRRLGHPHLPTVKDVLNHIDYSSGT 378
           +DL HRRLGHPHLPTVK VL H+D+SSGT
Sbjct: 361 IDLWHRRLGHPHLPTVKAVLKHVDHSSGT 348

BLAST of Pay0005361 vs. NCBI nr
Match: RVW44519.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 463.4 bits (1191), Expect = 3.8e-126
Identity = 339/966 (35.09%), Positives = 450/966 (46.58%), Query Frame = 0

Query: 13  TNASSPINQIFGSGKKISLVKPNDDSFLLWKFQILTILEAYDLEIFLESESEPPSKYLTS 72
           T     +  +     ++  ++  DD+FL+WK+QI   +  Y LE FL    + P K +T 
Sbjct: 26  TTTDESLRMVISPLSQLITMRLEDDNFLMWKYQIENAVRGYGLEGFLFGTEQVPPKMVT- 85

Query: 73  TGSSSTSATRTPNPEYKVWKRQDRLISSWILGS--------------------------- 132
                      PNP+++ ++RQD L+ SW+L S                           
Sbjct: 86  ----DKIGVLVPNPKFRDYQRQDHLLISWLLSSIGSAFLPQVVGCSSAFEDGLTMRDYLT 145

Query: 133 --------MTLINKSVSSDDHILYILAGLGSDYQSMISVISKKN---LLSFCTRSYV--- 192
                   +      +S  DHIL I+ GLG +Y+S+I+VIS K     L + T + +   
Sbjct: 146 KMKNYCDLLATAGHKISDTDHILAIMQGLGDEYESVIAVISSKKSSPSLQYVTSTLIAHE 205

Query: 193 ------------FITY-SRMNNQ------------------HNQRGG----RGNGRSNRG 252
                        + Y S+ +N+                   NQ GG    RG+   NRG
Sbjct: 206 GRIAHKISSNDLSVNYTSQYSNRGPSSSWNSNGYPSSGFQNRNQFGGNQVTRGSFVHNRG 265

Query: 253 -GRGNRN---KPQCQICTKHRH-------------------------------------- 312
            GRG      KPQCQ+C K  H                                      
Sbjct: 266 RGRGRAQGGIKPQCQLCNKFGHTVHRCFYRYDPNFHGNMPANGPTPGVLGSGARNGASGS 325

Query: 313 ---------------------NMSAMVASPDLNIDNNWYPDSGATNHLTH---------- 372
                                 M AMVA+P+   +  W+PDSGATNH+TH          
Sbjct: 326 ISSAGNVNLTEYDAQENQDYSEMEAMVATPEDLQNCCWFPDSGATNHVTHDLGNLNSGAE 385

Query: 373 IRGGIQIYAANGLGLPITHYGSMSFNSSILPFKSFTLNNFFHVPFITKNLISVSQFAKDN 432
             G  +I+  NG GL I+H G   F SS  P K   L N   VP I KNL+SVSQFA+DN
Sbjct: 386 YNGNSKIHMGNGTGLKISHIGLSVFPSSSSPNKVLFLKNILRVPAIKKNLLSVSQFARDN 445

Query: 433 HVFFEFHPTLCYVKDLDTGQVLLQGLLNDGLYKFTIQP-----------SHKRLHHSELN 492
           +V+FEFHP +C+VKD     +LLQG L+ GLY+F +             S+ +   +  N
Sbjct: 446 NVYFEFHPKVCFVKDKSNHSLLLQGNLHKGLYQFNLSKKLFGKASGLSLSNDKNELTCCN 505

Query: 493 TKSVFN---TVVPKSNS--HLLDLSHRRLGHPHLPTVKDVLNHIDYSSGTMNKMTFCEAC 552
              V N       K+NS  H+ DL H+RLGHP    V  VLN       T +  + C AC
Sbjct: 506 ASLVHNDNSDFPEKTNSSFHVFDLWHKRLGHPASKIVTQVLNDNKIPFSTKSGSSICSAC 565

Query: 553 ALGKHHALPFSHSLTHYTHPLQIISCDLWGPA-INVSHNCFSYYISFVDAYSRYTWIYFL 612
            LGK H LPF  S T YT PLQ++  DLWGPA IN S+  F+YY+SFVDAYSRYTW+YFL
Sbjct: 566 QLGKSHNLPFPISQTVYTKPLQLVVSDLWGPAPINSSYG-FTYYVSFVDAYSRYTWVYFL 625

Query: 613 YSKP---DAFLTFQNSKPV--------LQTDGGTEFKPFKPFLDQHGIKHRITCPYTSKQ 672
            +K    +AFL F+    +         QTD G EF+  K + +Q+GI HR++CP+TSKQ
Sbjct: 626 KTKSQTREAFLMFKAQAELQFGCKLKTFQTDWGGEFRSLKTYFEQNGIIHRLSCPHTSKQ 685

Query: 673 NDIVERKHRHIMEMGLTLLSQATLPLSFWDEAFSTSVYLINCLPTSVLDNISPLEKLSGW 732
           N I+ERKHRHI+E+GLTLL+QA+LPL +W +AFST+V+LIN LPT VL    P E L   
Sbjct: 686 NGIIERKHRHIVELGLTLLAQASLPLKYWPDAFSTAVFLINRLPTEVLKQKCPYEFLFNS 745

Query: 733 KPNFPSFRVFGC------------------------------KWYKCLASDGRLFISKHV 750
           KPN+   +VFGC                              K YKCL   GR+FIS+ V
Sbjct: 746 KPNYSQLKVFGCLCFPHLRPYNKHKLDFRSSPCTFLGYSSKHKGYKCLNQQGRMFISRSV 805

The following BLAST results are available for this feature:
Match Name  E-value  Identity  Description
Q9ZT94      1.3e-73  30.09     Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2      1.6e-71  28.67     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P10978      1.4e-30  24.14     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146      8.1e-23  25.88     Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P0C2I5      8.1e-15  25.07     Transposon Ty1-LR2 Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC ...
Match Name  E-value   Identity  Description
A0A5D3CH97  0.0e+00   67.38     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5A7U233  0.0e+00   67.38     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5A7VFQ6  5.3e-203  65.60     Retrotransposon protein, putative, Ty1-copia subclass OS=Cucumis melo var. makuw...
A0A5D3D3G2  2.4e-134  67.87     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A438EA49  1.8e-126  35.09     Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX...
Match Name    E-value   Identity  Description
TYK10642.1    0.0e+00   67.38     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0048297.1  0.0e+00   67.38     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0067212.1  1.1e-202  65.60     retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]
KAA0068024.1  4.9e-134  67.87     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
RVW44519.1    3.8e-126  35.09     Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]
InterPro
Analysis Name: InterPro Annotations of Melon (Payzawat) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                  Source       Source Term     Source Description   Alignment
IPR025724  GAG-pre-integrase domain         PFAM         PF13976         gag_pre-integrs      coord: 313..392; e-value: 8.8E-8; score: 31.9
IPR036397  Ribonuclease H superfamily       GENE3D       3.30.420.10     -                    coord: 399..564; e-value: 2.1E-29; score: 104.2
None       No IPR available                 MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 65..85
None       No IPR available                 MOBIDB_LITE  mobidb-lite     disorder_prediction  coord: 161..184
None       No IPR available                 PANTHER      PTHR34222:SF48  SUBFAMILY NOT NAMED  coord: 39..107; coord: 107..249
None       No IPR available                 PANTHER      PTHR34222       FAMILY NOT NAMED     coord: 107..249
None       No IPR available                 PANTHER      PTHR34222       FAMILY NOT NAMED     coord: 39..107
IPR001584  Integrase, catalytic core        PROSITE      PS50994         INTEGRASE            coord: 403..556; score: 15.87009
IPR012337  Ribonuclease H-like superfamily  SUPERFAMILY  53098           Ribonuclease H-like  coord: 405..565
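
Several of the InterPro hits above cover nearly the same stretch of the protein, for example the integrase catalytic core (403..556) and the Ribonuclease H-like superfamily (405..565). The following is a minimal sketch, assuming the "coord: start..end" notation used in the table with 1-based inclusive positions, that parses such coordinates and reports where two hits overlap. The names parse_coord, overlap and span_length are hypothetical helpers, not part of InterProScan.

```python
import re

# Hypothetical pattern for the "coord: start..end" notation in the table above.
COORD = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from the first 'coord: start..end' found in text."""
    m = COORD.search(text)
    if m is None:
        raise ValueError("no coordinate found in %r" % text)
    return int(m.group(1)), int(m.group(2))


def overlap(a, b):
    """Return the shared (start, end) of two inclusive spans, or None."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None


def span_length(span):
    """Length of an inclusive span in residues (assumes 1-based inclusive ends)."""
    return span[1] - span[0] + 1


if __name__ == "__main__":
    integrase = parse_coord("coord: 403..556")
    rnase_h_like = parse_coord("coord: 405..565")
    shared = overlap(integrase, rnase_h_like)
    print(shared, span_length(shared))
```

Treating the spans as inclusive is an assumption about the export; if the coordinates were half-open, the length would be one residue shorter.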

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Pay0005361.1  Pay0005361.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0015074      DNA integration
molecular_function  GO:0003676      nucleic acid binding