IVF0021562 (gene) Melon (IVF77) v1

Overview
NameIVF0021562
Typegene
OrganismCucumis melo L. ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag/pol protein
Locationchr01: 14704723 .. 14714965 (+)
RNA-Seq ExpressionIVF0021562
SyntenyIVF0021562
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATAGCTCGATAGTTCAATTGTTAGCTTTCGAAAAACTTAACCGCGATAATCTTAACACAATACTAGTGGTTGATGATTTAAGGTTTGTCTTAATTGAGGAATGTCCTCAAACCCCTACCTCAAATGTAAACCGAATTAGTCGGAAAGCATACGATCGATGGATAAAAGCAAATGAGAAAGCCCGTGTCTACATCCTTGCTAGCATGTCTGATGTTTTAGCAAAGAAACATGAATCCTTAGCAACGACTAAAGAGATAATGGATTCATTAAAAGGAATGTTTGGGCAACCAGAATGGTCCTTGAGACACAAGGCAATTAAATACATTTACACTAAGCAAATGAAGGAGGGGGCCTCTGTTAGAGAACATGTCCTGGACATGATGATGCACTTCAATATTGCTGAGGTAAATGGTGGTGCCATCGATGAGGCTAATCAGGTTAGCTTTATCTTAGAGTCTCTTCCAAAGAGCTTCATACCATTTCAAACAAATGCGTCCTTGAACAAGATAGAGTATAATATGACAACCCTTCTAAATGAGCTCTATCGTTTCCAGAATCTTACCATGGGTAAGAGAAAAGAAATAGAAGCAAATGTTGCTACTACTACAAAAAGAAAATTCTCAAGAGGATCGTCCTCTAAATCTAAAGCTGGACCCTCAAAACCTAATTGAAAGATAGAAAAGAAGGGAAAGGGGAAGACTCCCAAACAGAACAAGGGAAAGAAGACTACAGAAAAAGGTAAGTGTTACAAGACATGATGATTGAAGGGATGAAAACCCTTTATAGTAGAAGAATTACAGAAACCAAATTTAATTGGAACCTTACAAAATACCAATACATTGGCAAAAATTACAAAAACAATTTTTATGGTTAAACATGCTTTGAAGAAAAATACAGAGAAGAACTTACGCTCGTTGACGACTCTGAATCTACTGATCACCAATTTGGGAACCACCACTTGGAAACCTTCCTATTCTTTGATTGAGATGATTTGTGGGAACCAAATTGGAATAAGGGAATTTTTTCTCTACAGAGAAATTTTTTTCAGAGAGAACGGGAGAGCAACTCAGTTTTTGCAGAAATCACTCCTAAAAAAATTTTTCATTACGCAAAAATTCCCTCTTTTTCAGTTGGAAGAAGTGGAAAGTTGGAGCGAATAAATGGGAAAGTGGGAAAACCACCCACGTTCCCTATTAACTTAATAATTATTAAATTAATATATATTAAACTAATTAATTAATTTAATAATTAATTAAATCATATTTAATTAATATTTTCATTTTAATCATATTTAAATGAATATCTCTCGCATAACCTATAGTTTTAATTTAATTAATTTAATTAAATCAAAATAAATTAAACTATTAATTAATTTTCCAATAATTAATTTCTAAATTAAATATCTTATATTTAATTTAATCCATAATTTGAATCATATTCAAATATAAATTTCTCTCATAACCTATAGTTTTAATATGTATCATATACACATTAAATTTCAACTTATAGTTTTAATATGAATCTAATTCATATTAAACTAATATTTGAACTCATTCAAATATTTGATTCTCTCGTAATTTAATTTTGAATCATATCCAAAATTAAATTTATATAATAAAGTCTAATTAAAATATAAACTTTATATTATATAATAAAGTCTAATTAAAATATAAACTTTATATTATAATGTATCAATATACATTATATTAATTCCCAAAGTAAATTTGAACATTTCAAATTACAACTAATATAAATAAATCTCATTACTCTTTATGAGCTAGGAAGGGGACCTAATGGACCTATAGATCAGAAGCTACAACGATATGAGATTAATTGGCTAAACTCATTAACCACATTAATCAATATTCGTTAACTGTGTGTACACTCCACTAAAGACTCACAGCTGAACTCTTCTCACTGTAGATATATTTCTGTGTCCATGGATATAGACCAATACCAGTAAGTTAGTCTTTCACAAGTGTTCGTAAAACCAGTTGGGTCAAATTACTGTTTTACCCCTGGGTTACTTCTAGTCCTTAAATACCAATGCTCCTCTAATGAACAACCTGTTTATGGTCCAACCACTAAACAAAAAACCCTTCTCGTGCCATAGAGAGGGTAGGGCCCTTTGTTAAAGTTTCGGAGACACTATTTAAGGGAACACTTATCTACTTACCCTAAAGGTGAGAATGAGTGAATTTCATCTTGTGTGATTATGTTCCCAACTCCCCACTCGGTGTTGTCCCCAAAATGATAAGTATATTGAGTCAAAAATTTGGCCACTCTCACCCGTACAAATCAAAGGACAATCCATCGCAAACAGAAGTTCATAATACACTCAGAATTAAGACTAAGTCACCTAGGTCATCCTAATGAAATAGAAATCCAACTAGTTAACGGAGTTACATCTAGTGATTACTATTTCGTGGTCCAATCTTATGCAAACTCATTGCATAAGATACATTCACTCGCATGTCGCATACATGAACACATTGGATCAATGTATTTGTATCAAATACAAAGTGAGCCATATCCATAGTGTTAACAGGATAAGGTACCTAACCTTAACCCTATACTATAGACCCTTTAAGCTGATCTTGAACATTGATCCCCATATGTCTCTACATACTGTTCAAGACTCATCAAACAACTTAGGATGTTAGTTTATTGGATTTAGGTTATTAAGACAAAACTAATAATATAATCAATAACACTTATTGAAATTATAATAATAAAACACTTTATTAATGACAGTCAATTGATTATATTTACTATCTACGAATTTTAGGACATAAAACCCAACAATGATGCATTTCAATATTGCTGAGGTAAATGGTGATGCCATCGATGAGGCTAATCATGTTAGCTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATACCATTTCAAACAAATGCATCCCTGAACAAGATAGAGTTTAATCTGACAACCCTTCTAAATGAGCTTCAGCGTTTCCAGAATCTTACCATGGGTAAGGGAAAAGAAATATAAGCAAATGTTGCTACTACTACAAAAAGAAAATTCTCAAGAGGATCGTCATCTAAATCTAAAGCTAGACCCTCAAAACCTAATCAAAAGATAGAAAAGAAGGGAAAGAAGAAGACTCCCAATCAGAACAAGGGAAAGAAGACTACAGAAAAAGGTAAGTGTTACCACTATGGTGAAAATGGGCATTGGTTAAGAAACTGCCCAAAATACCTTGCTCATAAAAAAACGGAGAAGGAAGCACAAGGTAAATATGATTTACTTGTTCTTGAAACATGTTTAGTGGAAAATGAAAATTCTACCTGGATATTAGATTCATGAGCCACTAATCATATTTGCTTCTCATTTCAGGAAAATAATTCTTGGAAAAGACTTTCTGAGGGCGAGATTACTCTCAAAGTTGGAACTGGAGAAATGGTCTCAGCTAAAGCAATGGGAGACTTGAAGTTGTTTTTTAATGATAGATATATCATGCTCAAGAATGTCTTGTATGCTCCTCATATGAAGAGAAATTTAATATCTATCTCTTGTATATTAGAACATATGTATAAAATATCTTTTGAAATTAATGAAGCCTTCATTTTCCAAAAAGGTATTTATGTTTATTCCGCTATACTTGAAGACAACTTATATAAGTTAAGACCAACACAAGCAAATTTTGTCTTGAATACTGAAATGTTTAGAACAACTGAAACTCAGGATAAAAGACAAAAAGTTTCTTCCAATGCCTTCTTGTGGCACTTAACACTTGGTCACATAAATCTCAATAGGATTGAGAGATTGGTTAAGAGTGGACTTCTAAGTCAGTTAGAAGATAACTCTTTACCTCCATGTGATTCTTGTCTGGAAGGAAAAATGAGTAAAAGATCTTTTACTGGAAAAGGTCTTAGAGCCAAAACATCTTCAGAGCTCGTAAATTCGGACCTATGTGGATCAATGAATGTCAAAGCTCGGGGAGGATATGAATATTTCATTAGTTTTATTGATGATTATTCAAGGTATGGTCATGTTTACCTAATTCAGAACAAGTCTGATTCTTTTGAAAAGTTCAAAGAATATAAGGCTGAAGTTGAAAATGAATAAGAAAAAACTATAAAAACACTTTGATCAGATCGAGGTGGAGAGTATATGGACTTGCGATTCCAAGACTACTTGATAGGACATGGAATCCAATCCAACTCTTTGCACCTAGTACGTCTCAGCAGAACGGTGTATCAGAAAGGAGAAACTGAACTTTGTTGGACATGGTTCACTCTATGATGAGTTTTGCTCAATTGTCGGATTCTTTTTAGGGATATGCTTTAGAAACAACTATCTATATTTTAAACAACGTTCCCTTTAAAAGTGTTTCTGAAACAATTTATGAGCTATACAAAGGGCGTAAAGGTAGTTTACGTAACTTTAGAATTTGGGGATGTTCAGCACACGTGTTAGTGAAAAACCCTAAAAAGTTGGAACATCGTTCAAAATTATGCCTATTTGTAGGTTGTCCAAAAGAATCTAAAGGTGGTTTGTTTTATAACCCTCAAGAAAATAAAGTATTTGTGTCGACAAATTCTACGTTCTTAGAGAAAGACCACATAAGAAATCATTAAACTTGCAGTAAACTAGTATTAGAAGAAATTTCCAAAAATACTACAGATAGACCTAGTTCATCCACTAAAGTAGTTGATAAAACTAGGAATATTGGTCAAACACATCTTCCTCAAGAGTTGGGAGAACCTCGACGTAGTGGGAGGGTTATACGACAACCTGATCGCTATTTGGGTTTAAGTGAAGCTCAAATTGTCACACCTAATAATGGAATAGAGAATCCATTGACCTTTAAACAGACAATGAATGATGTGGACTGTGACCAATGGGTCAAAGCCATGGACCTTGAAATGAAATCTATGTATTCCAATTCTGTTTGGACTCTAGTAGATTAACCAAATAATGTAAAACCTATTGGTTGTAAATGGATCTACAAAAGAAAACGAGACCAAGCTGGTGAATTACGAATTTCAAAGCTCGACTTGTGGCAAAATGTTATACACAAAAGGAGGGAGTGGATTATGAAGAAACCTTCTCTCCTGTTGCCATGTTGAAGTCGATTAGAATACTCTTATTCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAACCTTTTTGGATGGAAATCTTGAGGAGAGTATTTATAAGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGTTTGTAAGATTAAAAAATCCATATATGGATTAAAACAAGCATCTAGATCCTGGAATATAAGGTTTGATACTGCGATCAAATCTTATGGTTTTGAACAAAATGTTGATGAACCTTGTGTTTACAAAAGGATCATCAAATCAACTGTAGCATTCTTAGTTCTATATGTAGATGACATTCTACTCATTGGGAATGATATAGGTCATTTAACTAATATTAAGAAATGGCTAGCTACACAATTCCAAATGAAAGATTTGGGAAATGCTCAATATGTTATTGGTATCCAAATAGTTCGGAATCGAAAGAACAAAACACTAGCCATGTCTCAAACATCTTATATAGATAAAATGTTGTTTGGATATAAGATGCAGAATTCAAAAAAGGGTCTGTTGCCGTACAGATATAGAATTCATTTATCAAAAGACAATGTCCAAAAACACATCAAGAAGTTGAGGATATGAGTAACATTCCCTATGCTTCTGCTGTTGGGAGCCTGATGTATGCAATGTTATGTACTAGACCTGACATTTGCTATTCAGTATGGATAGTTAGTAGATATTAGTCCAATCCTTGACGTGATCATTAGACAGCTGTTAAGAATATTGTAAAATATCTTAGAAGAACAAAAGACTACATGCTTGTGTATGGTTCTAAGGATATGATCCTTTCTGGATACACTGACTCTGATTTTCACCTGATAAAGATGCTAGAAAGTCTACATCAGGATAAGTTATCACTCTGAATGGAGGAGCAGTAGTATGGAGAAGCATAAAACAATCTTGTATTGCTGACTCCACTATGGAAGATGAATATGTAGCAACCTGTGAAGCAACAAAGGAAGCAGTATAGCTTAAAAAGTTCTTAACAGATTGGGAAATTGTTCTAAATATGCATTTGCCAATCACCTTATACCGTGACAACAGTGGTGCAGTTGCAAATTCACGAGAACCTAGAAATCATAAACTAGGAAAGCACATTGAACAAAAGTACCATCTTATCAGGGAAATCGTACATCGAGGAGATGTTACATTAACAAAAATCTCCTTCGAGCAAAACATAGGTGATTCGTTTACAAAAGCTCTCACGGCTAAAGTGTTGAGAGCCTCCTACATGGTTTAGGTCTGCGTTATTTGTAAACTAGGACAAGTGGAGGACTTCTTGGGAATATGCCTTTAGTTTAATATATATATGTTTATTGTAGTCATGTATATTCTTCTTTCATTGTTAAAATTCTAACTTTGTATATCCTACTGGGAGTTTTAGTCCAAGTGGGAGTTTGTTGGAATTTATGTCCTAAAACTCGTATTTTGTTATTTGATTCAGTAAAATTTATTATTGAATACTATAATCTTAAAACCAATAAATTAAGGTTTCGAGGCTATTTTACTAAGTTTGTCAATACACTTGAACTTTATGTAGAGACATAAGCATGGATTAAGTTCGAGTTAATAGCCCAAATAGTCTATAGTGTATGAATAAGGTTGGGCGCCTTATTCTGGAAAAACACTATGGATGTGGCCCACTCCGTAGTTAGTACAAACGATGTAATCTTGAATCGTTCATGTAGGGACATGAGAGTGGGGACGTTCTATGTAAATGGTTTGCATAAGACTGGAACCACGAAATAGTCACTTTTAGTTATAACGCCCTAAACTATAAACTGACTATTTTATTTATAATAACCTAGGTAACTTGATCTTAATCCTAAGCTAACTATGACGTTATGTTCGTTCGGTAGTATCTTTAGATCTGCATAGGTGAGGGCAGCTAATTATCGTTGGCCCAATAAGCCTCCGATTTCAGGGATAAGACCGAGTGGATAGCCAGGAACATAGGGTGCAAGACGGAATTCACTCCTACTGACTTTTGGAATAGTAGATAGGTTGTTCTCTTAAGGATTGAATCCAAGTCTTGAACAAGGGGTCCTACCTTCTCATTGGCCCGAGAAGGATTCAGGTTTATAGGTTGGACCTTAAACCAATTGTTTAATAGTGGATCAGTGGGTCTTAAGGCGCAAAATGTAATCTCAAGGGTAAAAGGGTATTTTGACCCAGCCAAGATTACGAACAACCTGTGAAGGATTAACCTACTAATCATGGTTATATCAGGTGGACAGAAATATATCTATAGTGAGGGGAGAGCAACTAAGAGTCTTTAGTGGTATGAGTGTTTAGTTAACGAATGTTGATTAAGCTTGGTCTAAAGGAGTTTAACCAGTTAATCTTGACTCGTTGGAGCCCATGATCTATAGGTCCATTAGGTTCCCCTACTAGCTCATATGGATTCAACTAAGAACAATATGTTGGAATAATTCGAATTATTCGAATTAAGTAAGGAGAGAGAAATCGACCAGTGTATATGATATAAGTTGGTAAATATAAACTTTAAACTTTATGTTTAAATATGATTTAAATAATGAATATGGATTCATATTTTGAAGCTTGGAAAGTTTTGAAACGATCTAAGCTATAAAAGTCAACATGTTGACTTTTCACTTTTAAAAAACAAAACATTGACCGAATTCATACTCAAATATGATTTAAGTTTTAGAAAAATGAATGCGGATTCATACTCGGAAGGTTGAAATTAGTCAAGACAGATAAAATAGCAAAAAGTCAAAACGTTGACTTTTGGCTAAGAAAAGTCAAAGTTTGATTTTCACTTAATTGGTCAAATGACCAAATTGCCCTTTGACTAATATACTTATTAACTAAATGTTAATGGGAAATGTGGCCGCTTATAATGTATTTAGAAGCCACTAATTCCATTAATAGTTAATGGATTAATTAGGTGTTTTGATTTTATGAAATAAATTGCATGCATTTTGCATGTAATTTTTCTATATATAAACTTCCATTTACAGAATGAGAAGATGATGATTCTGATGAAAAATCTCTAAACGATACACCTACCTCCATCCATCTCTTTAAAGATTTCCTTAAAAAAAAACGAGTCCCACGACTCAGTTCTTAGTCCTGAGATTAGTAGTTCAACATAGTGGTGTCCTTTGCTCGTGATCTTTAGGCAGAGAAGAAGTTTTGGAACGAAGAAGAAGTTGAGAACTACAAAGGTAAGCTCATCGTTTACCTTCTATATTCTTCGTTTAGGTATAGCATGTTAGTTTCTAAATTTTTTAGATGCATATAAAGTAAAGCATGATTCTTGTCTTCTGCTGCATGTTTCTCTAGGTTTTTTTTTGCTTCATGTGGTATCAGAGCGTGCTTAGTTTTACTCATATGCATATGGTGGGTTCTTACAATTTATATGATAAATTGTTATGGTTGAACTAAAATGGATTAGTTATGGATTTGACTCTAAATTAAGTTGTTTATTTGCATTAGTTATTGTAATTAGCCATCTTGTTGGCTTAATTTACTTTTTGCAAAGGGTCTGTAATTTGTTGGAGTCATTAGAGTTGATTTTAGGTTGTAATTGCTTCATTTGGGAAAAAGAAGTGTTGTTTTGAAGCTCATACCCGCGAAGAAGTTGTTTCAGACCTATCGTTCACCCTCTCAACGATCGCGTATGGTACGTGCATCGCTTGTGCATGTAAATGTCGCTATTACCAGCTAAACGATCGCGTATGGTACGCGTAGAATAAATCAGGATTCTTGTCTTCCGCTGCATGTTTCTCTAGGTTTTTTTTTTCCATCATGTACTATATCAAATTTATCTTTATTTTCTATGCTCAAACATTTTAAAATAAATTAAGTGTGAAATGCTGTTACACCCATCCTTTATGGAATCTATTGAAATTCTATCGACGCTCTCGTAACAAAAGATCTCGAAACTTGCCAGTCCTTGCCTTCTTAAATTTTAGGAAATGTGTACAATTGGTTCATTCTTTCATAAACTGTTACGTTGCAAAACTCTTTTGAAATATTTAATCCCACAACTAGTATTATTAATTCATACACTCTAGTTTGGATCGATGCGGTCTACTATTTTATTATATTCATCACCTTTTTCATTGAAACATATATATGAAGTAACAGTTTTGTCTGAAATTAAGGTAGTGACCTTTGAACAACTTTATATTGTCCAGACGACGAAAAATTTCACATTTGTTAAGAAACCTCTCGAGTTACTTGTTTACCGATCAAAGTACAAGAGGTAGCGAACCACCCTTCATGATTTTTGTATAACAAAATTAGAGTTTGGTATATCCTCAAATATTTAAAGCTACGTGGGTTCAATCGTTACATGTATTGGCCCTCGAGTTTTGCAACAGAGGTTTAATCGTATCAACAGTTGTAACGATTAATATATGAAGAAATTGGAGTTTGGTACCCTTTCATATTTAAATCTACATCGATTCAATTGTTAGAAATGTTGACCCTCGAGGTTTTCAGTTGACGATAAAGAAGAAGTACCAAAACACCTCATTCTTTTCTGGGGAATGGTTGAACATGGGTGCTTAAATGTAGAGGATGGAGTTCGATGATAGACCCAAATTAATGATCCAATTATGTGATGATTTAATTTGTTCGGGGTGTTAAATCATGTGGACATCACGATGAAAACAACGAATTATATAAGAGATATTGCCTACTTTTGGATACAAAAGAGGAGAGTGCAAGCGAAGGAGCAAAGTTATATATATATTTTTAAGTTATTTCATTATAGTTGATTGAGATGGAATTTAAATACTTGGAAACAACTGCATGAAAAAGATCTAACGTCAAAATATTTCATTACACAATGCTTGGCTCCTTATCTCCGTAGTTATAGCATATATCCTTGCCAAGGTTTTTCGGGTCTTGCAAGCACATGCTATATTAATGAATGATCCATCTGCATGTGGTGCATTTCCCTACATCCCTTTTGTCATCAAAGATATTGAACTAGTAAATTTGGTGTTGTCAAGTCCATGTGAAGCTGCAGAAAAACAATGTCGTCCACATGAGGTTGGGTATGAACTTGAACGAGTGGCTGTCCAAGCGGAAGGACATGGACTCCTAGGTCATCGTTCTTGGATTTGCAATGAACGGCGATAG

mRNA sequence

ATGAATAGCTCGATAGTTCAATTGTTAGCTTTCGAAAAACTTAACCGCGATAATCTTAACACAATACTAGTGGTTGATGATTTAAGGTTTGTCTTAATTGAGGAATGTCCTCAAACCCCTACCTCAAATGTAAACCGAATTAGTCGGAAAGCATACGATCGATGGATAAAAGCAAATGAGAAAGCCCGTGTCTACATCCTTGCTAGCATGTCTGATGTTTTAGCAAAGAAACATGAATCCTTAGCAACGACTAAAGAGATAATGGATTCATTAAAAGGAATGTTTGGGCAACCAGAATGGTCCTTGAGACACAAGGCAATTAAATACATTTACACTAAGCAAATGAAGGAGGGGGCCTCTGTTAGAGAACATGTCCTGGACATGATGATGCACTTCAATATTGCTGAGGTAAATGGTGGTGCCATCGATGAGGCTAATCAGGAAAATAATTCTTGGAAAAGACTTTCTGAGGGCGAGATTACTCTCAAAGTTGGAACTGGAGAAATGGTCTCAGCTAAAGCAATGGGAGACTTGAAGTTGTTTTTTAATGATAGATATATCATGCTCAAGAATGTCTTAACAACTGAAACTCAGGATAAAAGACAAAAAGTTTCTTCCAATGCCTTCTTGTGGCACTTAACACTTGGTCACATAAATCTCAATAGGATTGAGAGATTGGTTAAGAGTGGACTTCTAAGTCAGTTAGAAGATAACTCTTTACCTCCATGTGATTCTTGTCTGGAAGGAAAAATGAGTAAAAGATCTTTTACTGGAAAAGGTCTTAGAGCCAAAACATCTTCAGAGCTCGTAAATTCGGACCTATGTGGATCAATGAATGTCAAAGCTCGGGGAGGATATGAATATTTCATTAGTTTTATTGATGATTATTCAAGTAAACTAGTATTAGAAGAAATTTCCAAAAATACTACAGATAGACCTAGTTCATCCACTAAAGTAGTTGATAAAACTAGGAATATTGGTCAAACACATCTTCCTCAAGAGTTGGGAGAACCTCGACGTAGTGGGAGGGTTATACGACAACCTGATCGCTATTTGGGTTTAAGTGAAGCTCAAATTGTCACACCTAATAATGGAATAGAGAATCCATTGACCTTTAAACAGACAATGAATGATGTGGACTGTGACCAATGGGTCAAAGCCATGGACCTTGAAATGAAATCTATGTATTCCAATTCTGTTTGGACTCTAATGGATGTCAAGACAACCTTTTTGGATGGAAATCTTGAGGAGAGTATTTATAAGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGCAGAGAAGAAGTTTTGGAACGAAGAAGAAGTTGAGAACTACAAAGTTATAGCATATATCCTTGCCAAGGTTTTTCGGGTCTTGCAAGCACATGCTATATTAATGAATGATCCATCTGCATGTGGTGCATTTCCCTACATCCCTTTTGTCATCAAAGATATTGAACTAGTAAATTTGGTGTTGTCAAGTCCATGTGAAGCTGCAGAAAAACAATGTCGTCCACATGAGGTTGGGTATGAACTTGAACGAGTGGCTGTCCAAGCGGAAGGACATGGACTCCTAGGTCATCGTTCTTGGATTTGCAATGAACGGCGATAG

Coding sequence (CDS)

ATGAATAGCTCGATAGTTCAATTGTTAGCTTTCGAAAAACTTAACCGCGATAATCTTAACACAATACTAGTGGTTGATGATTTAAGGTTTGTCTTAATTGAGGAATGTCCTCAAACCCCTACCTCAAATGTAAACCGAATTAGTCGGAAAGCATACGATCGATGGATAAAAGCAAATGAGAAAGCCCGTGTCTACATCCTTGCTAGCATGTCTGATGTTTTAGCAAAGAAACATGAATCCTTAGCAACGACTAAAGAGATAATGGATTCATTAAAAGGAATGTTTGGGCAACCAGAATGGTCCTTGAGACACAAGGCAATTAAATACATTTACACTAAGCAAATGAAGGAGGGGGCCTCTGTTAGAGAACATGTCCTGGACATGATGATGCACTTCAATATTGCTGAGGTAAATGGTGGTGCCATCGATGAGGCTAATCAGGAAAATAATTCTTGGAAAAGACTTTCTGAGGGCGAGATTACTCTCAAAGTTGGAACTGGAGAAATGGTCTCAGCTAAAGCAATGGGAGACTTGAAGTTGTTTTTTAATGATAGATATATCATGCTCAAGAATGTCTTAACAACTGAAACTCAGGATAAAAGACAAAAAGTTTCTTCCAATGCCTTCTTGTGGCACTTAACACTTGGTCACATAAATCTCAATAGGATTGAGAGATTGGTTAAGAGTGGACTTCTAAGTCAGTTAGAAGATAACTCTTTACCTCCATGTGATTCTTGTCTGGAAGGAAAAATGAGTAAAAGATCTTTTACTGGAAAAGGTCTTAGAGCCAAAACATCTTCAGAGCTCGTAAATTCGGACCTATGTGGATCAATGAATGTCAAAGCTCGGGGAGGATATGAATATTTCATTAGTTTTATTGATGATTATTCAAGTAAACTAGTATTAGAAGAAATTTCCAAAAATACTACAGATAGACCTAGTTCATCCACTAAAGTAGTTGATAAAACTAGGAATATTGGTCAAACACATCTTCCTCAAGAGTTGGGAGAACCTCGACGTAGTGGGAGGGTTATACGACAACCTGATCGCTATTTGGGTTTAAGTGAAGCTCAAATTGTCACACCTAATAATGGAATAGAGAATCCATTGACCTTTAAACAGACAATGAATGATGTGGACTGTGACCAATGGGTCAAAGCCATGGACCTTGAAATGAAATCTATGTATTCCAATTCTGTTTGGACTCTAATGGATGTCAAGACAACCTTTTTGGATGGAAATCTTGAGGAGAGTATTTATAAGGTCCAACCAGAGGGGTTTATACAAAAGGGTCAAGAACAAAAGGCAGAGAAGAAGTTTTGGAACGAAGAAGAAGTTGAGAACTACAAAGTTATAGCATATATCCTTGCCAAGGTTTTTCGGGTCTTGCAAGCACATGCTATATTAATGAATGATCCATCTGCATGTGGTGCATTTCCCTACATCCCTTTTGTCATCAAAGATATTGAACTAGTAAATTTGGTGTTGTCAAGTCCATGTGAAGCTGCAGAAAAACAATGTCGTCCACATGAGGTTGGGTATGAACTTGAACGAGTGGCTGTCCAAGCGGAAGGACATGGACTCCTAGGTCATCGTTCTTGGATTTGCAATGAACGGCGATAG

Protein sequence

MNSSIVQLLAFEKLNRDNLNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYDRWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTKQMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAKAMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLSQLEDNSLPPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGGYEYFISFIDDYSSKLVLEEISKNTTDRPSSSTKVVDKTRNIGQTHLPQELGEPRRSGRVIRQPDRYLGLSEAQIVTPNNGIENPLTFKQTMNDVDCDQWVKAMDLEMKSMYSNSVWTLMDVKTTFLDGNLEESIYKVQPEGFIQKGQEQKAEKKFWNEEEVENYKVIAYILAKVFRVLQAHAILMNDPSACGAFPYIPFVIKDIELVNLVLSSPCEAAEKQCRPHEVGYELERVAVQAEGHGLLGHRSWICNERR
Homology
BLAST of IVF0021562 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 79.7 bits (195), Expect = 1.1e-13
Identity = 52/167 (31.14%), Postives = 83/167 (49.70%), Query Frame = 0

Query: 137 VNGGAIDEANQEN---NSWKRLSEGEITLKVGTGEMVSAKAMGDLKLFFNDRYIMLKNVL 196
           ++G A+D    E+   N   RL++G + +  G        A G L       Y     + 
Sbjct: 364 ISGIALDRDGYESYFANQKWRLTKGSLVIAKGV-------ARGTL-------YRTNAEIC 423

Query: 197 TTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLSQLEDNSLPPCDSCLEGKMSK 256
             E    + ++S +  LWH  +GH++   ++ L K  L+S  +  ++ PCD CL GK  +
Sbjct: 424 QGELNAAQDEISVD--LWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHR 483

Query: 257 RSFTGKGLRAKTSSELVNSDLCGSMNVKARGGYEYFISFIDDYSSKL 301
            SF     R     +LV SD+CG M +++ GG +YF++FIDD S KL
Sbjct: 484 VSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKL 514

BLAST of IVF0021562 vs. ExPASy TrEMBL
Match: A0A5D3DE90 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold482G00310 PE=4 SV=1)

HSP 1 Score: 545.4 bits (1404), Expect = 2.6e-151
Identity = 315/533 (59.10%), Postives = 343/533 (64.35%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRD-------NLNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNSSIVQLLA +KLN D       NLNTILVVD L FVL EECPQTP+SN NR SRKAYD
Sbjct: 1   MNSSIVQLLASKKLNGDNYAAWKSNLNTILVVDGLSFVLTEECPQTPSSNANRASRKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RWIK NEKA VYILASM DVLAKKHESLAT KEIMDSLKGMFGQP+WSLRH+AIKYIYTK
Sbjct: 61  RWIKVNEKAHVYILASMLDVLAKKHESLATAKEIMDSLKGMFGQPKWSLRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQ-------------------------- 180
           +MKEG SVREHV+DMMMHFNIAEVN GAIDEANQ                          
Sbjct: 121 RMKEGTSVREHVMDMMMHFNIAEVNRGAIDEANQNLTKSKGKEVEANVATTKGKFKRGSS 180

Query: 181 ---------------------------ENNSWKRLSEGEITLKVGTGEMVSAKAMGDLKL 240
                                      E +S KRLS+G ITLKVGTGEMVSAKA+GDLKL
Sbjct: 181 SRSKTGPLKPNRKIEKKEKGKTSKRNKETSSSKRLSKGGITLKVGTGEMVSAKAVGDLKL 240

Query: 241 FFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLSQLEDNSL 300
           F NDRYI+LKNVL                             IERLVK  LL+QLEDNSL
Sbjct: 241 FCNDRYILLKNVL-----------------------------IERLVKIRLLNQLEDNSL 300

Query: 301 PPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGGYEYFISFIDDYS--- 360
           PPCD CL+GKM+K SF GKGLRAKT  ELV+S+LCG MNVKA+GGYEYFISFIDDYS   
Sbjct: 301 PPCDFCLKGKMTKISFAGKGLRAKTPLELVHSNLCGPMNVKAQGGYEYFISFIDDYSRYG 360

Query: 361 ------------------------------------------------------------ 406
                                                                       
Sbjct: 361 HVYLIHNKSDSFEKFKKYKAEVENESGYSKESRGGLFYDPQENKVFVLTNATFLEEDHIR 420

BLAST of IVF0021562 vs. ExPASy TrEMBL
Match: A0A5D3E3F1 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold349G00180 PE=4 SV=1)

HSP 1 Score: 540.0 bits (1390), Expect = 1.1e-149
Identity = 305/482 (63.28%), Postives = 333/482 (69.09%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRDN-------LNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MN  IVQLLA EKLN DN       LNTILVVDDLRFVL EECPQTP SN NR SRK YD
Sbjct: 35  MNGLIVQLLASEKLNGDNYAAWKLYLNTILVVDDLRFVLTEECPQTPASNANRTSRKVYD 94

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RWIKA+EKA VYILASMSDVLAKKHESLATTK+I+DSLKGMFGQ EWS+RH+ IKYIYTK
Sbjct: 95  RWIKASEKAHVYILASMSDVLAKKHESLATTKKIIDSLKGMFGQQEWSVRHETIKYIYTK 154

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAK 180
           +MKE  S+REHVLDMMMH NIAEVNGGAIDEANQEN+SWK+LSEGE+TLKVGTGEMVSAK
Sbjct: 155 RMKEETSIREHVLDMMMHLNIAEVNGGAIDEANQENSSWKKLSEGEVTLKVGTGEMVSAK 214

Query: 181 AMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLS 240
           A+GDLKLFF+DRYIMLKNVLT                             +RLVKSGLLS
Sbjct: 215 AVGDLKLFFDDRYIMLKNVLT-----------------------------KRLVKSGLLS 274

Query: 241 QLEDNSLPPCDSCLEGKMSKRSFTGKG-----LRAKTSSEL-------------VNSDLC 300
           QLEDNSLPPCDSCLEGKM+KRSFTGKG     LR+    E              + S L 
Sbjct: 275 QLEDNSLPPCDSCLEGKMTKRSFTGKGKTIKTLRSDRGGEYMDLQFQDYLIEHGIQSQLS 334

Query: 301 GSMNVKARGGYE-----YFISFIDDYS--------------------------------- 360
                +  GGY      Y ++ +   S                                 
Sbjct: 335 APSMPQQNGGYALETTIYILNNVPSKSVSETPYELRKGRKGYPKESKGGLFYDPQENKVF 394

Query: 361 -------------------SKLVLEEISKNTTDRPSSSTKVVDKTRNIGQTHLPQELGEP 401
                              SKLVLEEISKNTTDRPSSSTKVVDKTR+IGQTHL QELGEP
Sbjct: 395 VSINATFLQEDHIRNHQTCSKLVLEEISKNTTDRPSSSTKVVDKTRDIGQTHLSQELGEP 454

BLAST of IVF0021562 vs. ExPASy TrEMBL
Match: A0A5D3E2A3 (Putative polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold64G00300 PE=4 SV=1)

HSP 1 Score: 512.7 bits (1319), Expect = 1.9e-141
Identity = 303/515 (58.83%), Postives = 340/515 (66.02%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLN-------RDNLNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNS+IVQLL+ EKLN       + NLNTILVVDDLRF L ++CPQTPTSN NR S KAYD
Sbjct: 1   MNSAIVQLLSSEKLNSVNYATWKSNLNTILVVDDLRFALTKKCPQTPTSNTNRASWKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RWIKANEK RVYILASMSDVLAKKHESLAT KEIM+SLKGMFGQPEWSLRH+AIKYIYTK
Sbjct: 61  RWIKANEKVRVYILASMSDVLAKKHESLATAKEIMNSLKGMFGQPEWSLRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAK 180
           +MKEG SVREHVLDMMMHFNIAEVNGGAIDEANQ                          
Sbjct: 121 RMKEGTSVREHVLDMMMHFNIAEVNGGAIDEANQ-------------------------- 180

Query: 181 AMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLS 240
                                         VSSNAFLWHL LGHINLNRI RLV+SGLL+
Sbjct: 181 ------------------------------VSSNAFLWHLRLGHINLNRIGRLVESGLLN 240

Query: 241 QLEDNSLPPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGGYEYFISFI 300
           QLEDNSLPP DSC EGKM+KRSFTGKGLRAKT  ELV+SDLCG MNVKAR GYEYFISFI
Sbjct: 241 QLEDNSLPPYDSCFEGKMTKRSFTGKGLRAKTPLELVHSDLCGPMNVKARVGYEYFISFI 300

Query: 301 DDYS--SKLVLEEISKNTTDR-------------------PSSSTKVVDKTRNIGQTHLP 360
           DDYS    + L +   N+ ++                     ++  +++   +      P
Sbjct: 301 DDYSRYGHVYLIQNKSNSFEKFKEYKVEVENESGSFWGYALETTIYILNNVPSKSVFETP 360

Query: 361 QELGEPRRS-----------GRVIRQPDRYLGLSEAQIVTPNNGIENPLTFKQTMNDVDC 420
            EL + R+              V+  PDRYLGLSEAQI+  ++GIE+PLT+KQ MNDVDC
Sbjct: 361 YELWKGRKGSLHHFRIWGCPAHVL--PDRYLGLSEAQIIILDDGIEDPLTYKQAMNDVDC 420

Query: 421 DQWVKAMDLEMKSMYSNSVWTL------MDVKTTFLDGNLEESIYKVQPEGFIQKGQEQK 471
           DQW+K MDLEM+SMYSNSVWTL      MDVKT FL+GNLEESIY VQPEGFIQKG+EQK
Sbjct: 421 DQWIKGMDLEMESMYSNSVWTLVDQPSEMDVKTAFLNGNLEESIYMVQPEGFIQKGKEQK 450

BLAST of IVF0021562 vs. ExPASy TrEMBL
Match: A0A5A7T2N1 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold403G001250 PE=4 SV=1)

HSP 1 Score: 500.7 bits (1288), Expect = 7.4e-138
Identity = 293/509 (57.56%), Postives = 324/509 (63.65%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRD-------NLNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNS IVQLLA EKLN D       NLNTILVVDDLRF L EE  QTP SN NR SRKAYD
Sbjct: 1   MNSLIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFFLTEEFLQTPASNANRASRKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           + IKAN+KARVYILASMSDVLAKKHESLAT K+IMDSLKGMFGQ EWS+RH+AIKYIYTK
Sbjct: 61  QMIKANKKARVYILASMSDVLAKKHESLATAKKIMDSLKGMFGQLEWSIRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQ-------------------------- 180
            MK+G SVREHV+DMMMHFNI EVNGG IDEANQ                          
Sbjct: 121 HMKKGISVREHVMDMMMHFNIVEVNGGTIDEANQVSFILESLPKSFIPFQTNASLNKIEF 180

Query: 181 ------------------------------------------------------------ 240
                                                                       
Sbjct: 181 NLTTFLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKVGPLEPNHKILKKGKGK 240

Query: 241 ---ENNSWKRLSEGEITLKVGTGEMVSAKAMGDLKLFFNDRYIMLKNVLTTETQDKRQKV 300
               N   K   +GEITLKV TG+MVSAKA+GDLKLF+NDRYI+LKNVL           
Sbjct: 241 TPKHNKEKKTTVKGEITLKVRTGDMVSAKAVGDLKLFYNDRYIILKNVL----------- 300

Query: 301 SSNAFLWHLTLGHINLNRIERLVKSGLLSQLEDNSLPPCDSCLEGKMSKRSFTGKGLRAK 360
                             I RLVK+GLLSQLEDNSLPPCDSCLEGKM KRSFTGKGLR K
Sbjct: 301 ------------------IGRLVKNGLLSQLEDNSLPPCDSCLEGKMIKRSFTGKGLRTK 360

Query: 361 TSSELVNSDLCGSMNVKARGGYEYFISFIDDYSS-----------------KLVLEEISK 397
            S EL++SDLCG MNVKARGGYEYFISFIDDYS                  K    EI +
Sbjct: 361 ISLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHVYLIQNKFDSFEKFKEYNAEI-E 420

BLAST of IVF0021562 vs. ExPASy TrEMBL
Match: A0A5D3C7X2 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold852G00110 PE=4 SV=1)

HSP 1 Score: 496.1 bits (1276), Expect = 1.8e-136
Identity = 280/452 (61.95%), Postives = 316/452 (69.91%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRD-------NLNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNSSIVQLLA EKLN D       NLNTILVVDDLR VL +ECPQTPTSN NR SRKAYD
Sbjct: 1   MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRVVLTKECPQTPTSNANRTSRKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RW+KA+EKA VYILA+M+DVLAKKH+ LAT K I+D+LK MFG+PEWSLRH+AIKYIYTK
Sbjct: 61  RWVKADEKACVYILANMTDVLAKKHKFLATAKNIIDALKAMFGRPEWSLRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAK 180
           +MK+  SVREHVLDMMMHFNI EVN GAIDEANQENNSWK LSEGEITLKVGTGEMVSAK
Sbjct: 121 RMKKRTSVREHVLDMMMHFNIVEVNSGAIDEANQENNSWKTLSEGEITLKVGTGEMVSAK 180

Query: 181 AMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLS 240
           A+G+LK                                           IERLVKSG LS
Sbjct: 181 AVGNLK-------------------------------------------IERLVKSGFLS 240

Query: 241 QLEDNSLPPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGG-----YE- 300
           +LEDNSLPPC+S LEGKM+KRSFTGKGLRAK   ELV+SD+CG MNVKARGG     YE 
Sbjct: 241 KLEDNSLPPCESFLEGKMTKRSFTGKGLRAKIPLELVHSDICGPMNVKARGGVSETPYEL 300

Query: 301 --------------------------YFIS---------FIDDYS--SKLVLEEISKNTT 360
                                      F+S          I D+   +KLVL EISKN  
Sbjct: 301 WKGRKGSLRYPKESRGGLFYDPQENKVFVSTNATFLEEDHIKDHQPRNKLVLNEISKNAL 360

Query: 361 DRPSSSTKVVDKTRNIGQTHLPQELGEPRRSGRVIRQPDRYLGLSEAQIVTPNNGIENPL 403
           D+PSSSTKVVDKT+  GQTH  QEL EPR S RV+ QP+RYLGLSE+ +V PN+GIE+PL
Sbjct: 361 DKPSSSTKVVDKTKISGQTHPSQELREPRCSERVVHQPNRYLGLSESHVVIPNDGIEDPL 409

BLAST of IVF0021562 vs. NCBI nr
Match: KAA0035987.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK30416.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 544 bits (1401), Expect = 3.07e-187
Identity = 305/482 (63.28%), Postives = 333/482 (69.09%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRDN-------LNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MN  IVQLLA EKLN DN       LNTILVVDDLRFVL EECPQTP SN NR SRK YD
Sbjct: 35  MNGLIVQLLASEKLNGDNYAAWKLYLNTILVVDDLRFVLTEECPQTPASNANRTSRKVYD 94

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RWIKA+EKA VYILASMSDVLAKKHESLATTK+I+DSLKGMFGQ EWS+RH+ IKYIYTK
Sbjct: 95  RWIKASEKAHVYILASMSDVLAKKHESLATTKKIIDSLKGMFGQQEWSVRHETIKYIYTK 154

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAK 180
           +MKE  S+REHVLDMMMH NIAEVNGGAIDEANQEN+SWK+LSEGE+TLKVGTGEMVSAK
Sbjct: 155 RMKEETSIREHVLDMMMHLNIAEVNGGAIDEANQENSSWKKLSEGEVTLKVGTGEMVSAK 214

Query: 181 AMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLS 240
           A+GDLKLFF+DRYIMLKNVLT                             +RLVKSGLLS
Sbjct: 215 AVGDLKLFFDDRYIMLKNVLT-----------------------------KRLVKSGLLS 274

Query: 241 QLEDNSLPPCDSCLEGKMSKRSFTGKG-----LRAKTSSEL-------------VNSDLC 300
           QLEDNSLPPCDSCLEGKM+KRSFTGKG     LR+    E              + S L 
Sbjct: 275 QLEDNSLPPCDSCLEGKMTKRSFTGKGKTIKTLRSDRGGEYMDLQFQDYLIEHGIQSQLS 334

Query: 301 GSMNVKARGGYE-----YFISFIDDYS--------------------------------- 360
                +  GGY      Y ++ +   S                                 
Sbjct: 335 APSMPQQNGGYALETTIYILNNVPSKSVSETPYELRKGRKGYPKESKGGLFYDPQENKVF 394

Query: 361 -------------------SKLVLEEISKNTTDRPSSSTKVVDKTRNIGQTHLPQELGEP 400
                              SKLVLEEISKNTTDRPSSSTKVVDKTR+IGQTHL QELGEP
Sbjct: 395 VSINATFLQEDHIRNHQTCSKLVLEEISKNTTDRPSSSTKVVDKTRDIGQTHLSQELGEP 454

BLAST of IVF0021562 vs. NCBI nr
Match: KAA0032972.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK21997.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 550 bits (1416), Expect = 1.49e-184
Identity = 315/533 (59.10%), Postives = 343/533 (64.35%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRDN-------LNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNSSIVQLLA +KLN DN       LNTILVVD L FVL EECPQTP+SN NR SRKAYD
Sbjct: 1   MNSSIVQLLASKKLNGDNYAAWKSNLNTILVVDGLSFVLTEECPQTPSSNANRASRKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RWIK NEKA VYILASM DVLAKKHESLAT KEIMDSLKGMFGQP+WSLRH+AIKYIYTK
Sbjct: 61  RWIKVNEKAHVYILASMLDVLAKKHESLATAKEIMDSLKGMFGQPKWSLRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQ-------------------------- 180
           +MKEG SVREHV+DMMMHFNIAEVN GAIDEANQ                          
Sbjct: 121 RMKEGTSVREHVMDMMMHFNIAEVNRGAIDEANQNLTKSKGKEVEANVATTKGKFKRGSS 180

Query: 181 ---------------------------ENNSWKRLSEGEITLKVGTGEMVSAKAMGDLKL 240
                                      E +S KRLS+G ITLKVGTGEMVSAKA+GDLKL
Sbjct: 181 SRSKTGPLKPNRKIEKKEKGKTSKRNKETSSSKRLSKGGITLKVGTGEMVSAKAVGDLKL 240

Query: 241 FFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLSQLEDNSL 300
           F NDRYI+LKNVL                             IERLVK  LL+QLEDNSL
Sbjct: 241 FCNDRYILLKNVL-----------------------------IERLVKIRLLNQLEDNSL 300

Query: 301 PPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGGYEYFISFIDDYS--- 360
           PPCD CL+GKM+K SF GKGLRAKT  ELV+S+LCG MNVKA+GGYEYFISFIDDYS   
Sbjct: 301 PPCDFCLKGKMTKISFAGKGLRAKTPLELVHSNLCGPMNVKAQGGYEYFISFIDDYSRYG 360

Query: 361 ------------------------------------------------------------ 405
                                                                       
Sbjct: 361 HVYLIHNKSDSFEKFKKYKAEVENESGYSKESRGGLFYDPQENKVFVLTNATFLEEDHIR 420

BLAST of IVF0021562 vs. NCBI nr
Match: TYK29680.1 (putative polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 516 bits (1328), Expect = 1.14e-174
Identity = 305/519 (58.77%), Postives = 342/519 (65.90%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLN-------RDNLNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNS+IVQLL+ EKLN       + NLNTILVVDDLRF L ++CPQTPTSN NR S KAYD
Sbjct: 1   MNSAIVQLLSSEKLNSVNYATWKSNLNTILVVDDLRFALTKKCPQTPTSNTNRASWKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RWIKANEK RVYILASMSDVLAKKHESLAT KEIM+SLKGMFGQPEWSLRH+AIKYIYTK
Sbjct: 61  RWIKANEKVRVYILASMSDVLAKKHESLATAKEIMNSLKGMFGQPEWSLRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAK 180
           +MKEG SVREHVLDMMMHFNIAEVNGGAIDEANQ                          
Sbjct: 121 RMKEGTSVREHVLDMMMHFNIAEVNGGAIDEANQ-------------------------- 180

Query: 181 AMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLS 240
                                         VSSNAFLWHL LGHINLNRI RLV+SGLL+
Sbjct: 181 ------------------------------VSSNAFLWHLRLGHINLNRIGRLVESGLLN 240

Query: 241 QLEDNSLPPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGGYEYFISFI 300
           QLEDNSLPP DSC EGKM+KRSFTGKGLRAKT  ELV+SDLCG MNVKAR GYEYFISFI
Sbjct: 241 QLEDNSLPPYDSCFEGKMTKRSFTGKGLRAKTPLELVHSDLCGPMNVKARVGYEYFISFI 300

Query: 301 DDYS--SKLVLEEISKNTTDR-------------------PSSSTKVVDKTRNIGQTHLP 360
           DDYS    + L +   N+ ++                     ++  +++   +      P
Sbjct: 301 DDYSRYGHVYLIQNKSNSFEKFKEYKVEVENESGSFWGYALETTIYILNNVPSKSVFETP 360

Query: 361 QELGEPRRSG-----------RVIRQPDRYLGLSEAQIVTPNNGIENPLTFKQTMNDVDC 420
            EL + R+              V+  PDRYLGLSEAQI+  ++GIE+PLT+KQ MNDVDC
Sbjct: 361 YELWKGRKGSLHHFRIWGCPAHVL--PDRYLGLSEAQIIILDDGIEDPLTYKQAMNDVDC 420

Query: 421 DQWVKAMDLEMKSMYSNSVWTL------MDVKTTFLDGNLEESIYKVQPEGFIQKGQEQK 472
           DQW+K MDLEM+SMYSNSVWTL      MDVKT FL+GNLEESIY VQPEGFIQKG+EQK
Sbjct: 421 DQWIKGMDLEMESMYSNSVWTLVDQPSEMDVKTAFLNGNLEESIYMVQPEGFIQKGKEQK 454

BLAST of IVF0021562 vs. NCBI nr
Match: KAA0035260.1 (gag/pol protein [Cucumis melo var. makuwa] >TYK07575.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 500 bits (1287), Expect = 3.88e-171
Identity = 280/452 (61.95%), Postives = 316/452 (69.91%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRDN-------LNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNSSIVQLLA EKLN DN       LNTILVVDDLR VL +ECPQTPTSN NR SRKAYD
Sbjct: 1   MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRVVLTKECPQTPTSNANRTSRKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           RW+KA+EKA VYILA+M+DVLAKKH+ LAT K I+D+LK MFG+PEWSLRH+AIKYIYTK
Sbjct: 61  RWVKADEKACVYILANMTDVLAKKHKFLATAKNIIDALKAMFGRPEWSLRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQENNSWKRLSEGEITLKVGTGEMVSAK 180
           +MK+  SVREHVLDMMMHFNI EVN GAIDEANQENNSWK LSEGEITLKVGTGEMVSAK
Sbjct: 121 RMKKRTSVREHVLDMMMHFNIVEVNSGAIDEANQENNSWKTLSEGEITLKVGTGEMVSAK 180

Query: 181 AMGDLKLFFNDRYIMLKNVLTTETQDKRQKVSSNAFLWHLTLGHINLNRIERLVKSGLLS 240
           A+G+LK                                           IERLVKSG LS
Sbjct: 181 AVGNLK-------------------------------------------IERLVKSGFLS 240

Query: 241 QLEDNSLPPCDSCLEGKMSKRSFTGKGLRAKTSSELVNSDLCGSMNVKARGG-----YEY 300
           +LEDNSLPPC+S LEGKM+KRSFTGKGLRAK   ELV+SD+CG MNVKARGG     YE 
Sbjct: 241 KLEDNSLPPCESFLEGKMTKRSFTGKGLRAKIPLELVHSDICGPMNVKARGGVSETPYEL 300

Query: 301 ---------------------------FIS---------FIDDYS--SKLVLEEISKNTT 360
                                      F+S          I D+   +KLVL EISKN  
Sbjct: 301 WKGRKGSLRYPKESRGGLFYDPQENKVFVSTNATFLEEDHIKDHQPRNKLVLNEISKNAL 360

Query: 361 DRPSSSTKVVDKTRNIGQTHLPQELGEPRRSGRVIRQPDRYLGLSEAQIVTPNNGIENPL 402
           D+PSSSTKVVDKT+  GQTH  QEL EPR S RV+ QP+RYLGLSE+ +V PN+GIE+PL
Sbjct: 361 DKPSSSTKVVDKTKISGQTHPSQELREPRCSERVVHQPNRYLGLSESHVVIPNDGIEDPL 409

BLAST of IVF0021562 vs. NCBI nr
Match: KAA0035827.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 504 bits (1299), Expect = 7.81e-170
Identity = 293/509 (57.56%), Postives = 324/509 (63.65%), Query Frame = 0

Query: 1   MNSSIVQLLAFEKLNRDN-------LNTILVVDDLRFVLIEECPQTPTSNVNRISRKAYD 60
           MNS IVQLLA EKLN DN       LNTILVVDDLRF L EE  QTP SN NR SRKAYD
Sbjct: 1   MNSLIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFFLTEEFLQTPASNANRASRKAYD 60

Query: 61  RWIKANEKARVYILASMSDVLAKKHESLATTKEIMDSLKGMFGQPEWSLRHKAIKYIYTK 120
           + IKAN+KARVYILASMSDVLAKKHESLAT K+IMDSLKGMFGQ EWS+RH+AIKYIYTK
Sbjct: 61  QMIKANKKARVYILASMSDVLAKKHESLATAKKIMDSLKGMFGQLEWSIRHEAIKYIYTK 120

Query: 121 QMKEGASVREHVLDMMMHFNIAEVNGGAIDEANQ-------------------------- 180
            MK+G SVREHV+DMMMHFNI EVNGG IDEANQ                          
Sbjct: 121 HMKKGISVREHVMDMMMHFNIVEVNGGTIDEANQVSFILESLPKSFIPFQTNASLNKIEF 180

Query: 181 ------------------------------------------------------------ 240
                                                                       
Sbjct: 181 NLTTFLNELQRFQNLTKGKGKEVEANVATTKRKFKRGSSSKSKVGPLEPNHKILKKGKGK 240

Query: 241 ---ENNSWKRLSEGEITLKVGTGEMVSAKAMGDLKLFFNDRYIMLKNVLTTETQDKRQKV 300
               N   K   +GEITLKV TG+MVSAKA+GDLKLF+NDRYI+LKNVL           
Sbjct: 241 TPKHNKEKKTTVKGEITLKVRTGDMVSAKAVGDLKLFYNDRYIILKNVL----------- 300

Query: 301 SSNAFLWHLTLGHINLNRIERLVKSGLLSQLEDNSLPPCDSCLEGKMSKRSFTGKGLRAK 360
                             I RLVK+GLLSQLEDNSLPPCDSCLEGKM KRSFTGKGLR K
Sbjct: 301 ------------------IGRLVKNGLLSQLEDNSLPPCDSCLEGKMIKRSFTGKGLRTK 360

Query: 361 TSSELVNSDLCGSMNVKARGGYEYFISFIDDYSS-----------------KLVLEEISK 396
            S EL++SDLCG MNVKARGGYEYFISFIDDYS                  K    EI +
Sbjct: 361 ISLELIHSDLCGPMNVKARGGYEYFISFIDDYSRYGHVYLIQNKFDSFEKFKEYNAEI-E 420

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109781.1e-1331.14Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Match NameE-valueIdentityDescription
A0A5D3DE902.6e-15159.10Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold482G0031... [more]
A0A5D3E3F11.1e-14963.28Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold349G0018... [more]
A0A5D3E2A31.9e-14158.83Putative polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold64G... [more]
A0A5A7T2N17.4e-13857.56Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold403G0012... [more]
A0A5D3C7X21.8e-13661.95Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold852G0011... [more]
Match NameE-valueIdentityDescription
KAA0035987.13.07e-18763.28gag/pol protein [Cucumis melo var. makuwa] >TYK30416.1 gag/pol protein [Cucumis ... [more]
KAA0032972.11.49e-18459.10gag/pol protein [Cucumis melo var. makuwa] >TYK21997.1 gag/pol protein [Cucumis ... [more]
TYK29680.11.14e-17458.77putative polyprotein [Cucumis melo var. makuwa][more]
KAA0035260.13.88e-17161.95gag/pol protein [Cucumis melo var. makuwa] >TYK07575.1 gag/pol protein [Cucumis ... [more]
KAA0035827.17.81e-17057.56gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Melon (IVF77) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 55..145
e-value: 1.9E-8
score: 34.2
NoneNo IPR availablePANTHERPTHR35317OS04G0629600 PROTEINcoord: 48..151
NoneNo IPR availablePANTHERPTHR35317:SF8POLYPROTEIN-LIKE PROTEINcoord: 48..151
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 202..251
e-value: 7.8E-11
score: 41.7

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
IVF0021562.2IVF0021562.2mRNA