Lag0015614 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0015614
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionGag/pol protein
Locationchr12: 17701343 .. 17710680 (+)
RNA-Seq ExpressionLag0015614
SyntenyLag0015614
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGTTATTCCGGGAATAGGGATTGGCTGAGATTGTCTCAATTCAGAGGAGGAGTAGCTGACTACCCTACGGTAGCTGTTGTTCTAAACCTCTGAAGTATCTTTGCAAAATGAATGTTTATTCTATGGGTTGATCGTTACTTTTGCTAAATTGATTGGATTAAAATAATATTATTTTGTGTAATGGGTTAATGCAACAATACTTATTAAGTTAATTTATTTGTTCTATCATCAATGAAACGTTCTTTGTATTCCTCTAATATCGCCAATTACAAAACTGGTATTTCTAATACAACTAAGAAAATGAACTTAAACACTAAATGCGTGGAATTGATCTTAACTAAGGACGGTCCTCAAGCCCATATGTCTAATTTATCCGAAATTGTTCATGATGAATATGACATAGACTGCTTAGCACATCTCAATGAAAATAAATTTCATTTATTGATCATTGAAACATGTATAGTGGAAAATGATGATACAGCCTCGATCCTTGATTCAAGTGCCACTCATCACGTATGCTCTTCCTTTGAGGGAATGAGCTCCTGGCAATAGTTAGATGCTGGGGAAATGACTCTCAGAGTTGGGATTGGGGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGTTATATGTTATTAGATAACGTATATATAGTTCCCAGAATAAAAAGAAATTTGATTTCTATTTCTTGTTTGCTAGAACAAGAGTATTCAGTGTCCTTTTCTGTGAATGGAGCTTTCATTACTAAAAAGGGTGCTGATATATGTTCTGCAAAAATGGAGAACAATTTGTATGTCTTAAGACCAATAGAGTCTAGGGCCATTTTGAACAATGAAATGTTTAAAACAGCTGAGACTCAACCAAAAAGGCAAAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGCCACATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTCTATGTGAGTCATGTCTCGAAGGTAAGATGACCAAACGACCTTTTACTGGAAAAGGTTATCGTGCCAATGAGCCCTTAGAACTCAAGGTATGGTTATTTATACCTAATGAGTCATAAGTCTGAAGCCCTTGAAAAGTTCAAGGAGTACAGGACTGAAGTTGAGAACTTGTTAGGTAAGACCATTAAAACACTTCGATCTGATCGAGGTGGAGACTATTTGGACTTAAAATTCCAAAACTATTTGATAGAACATGGAATATCTCTCAGCACTAGGTACACCTCAGCAGAATGGTGTATCTGAAAGGAGAAACAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCCAGTTCTTTTTGGGGTTATGCAGTAGAGACTGCAGTTTATATTTTGAATGTTGCTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTTTGGAAAGGGCGTAAAGTTAGTTTACGTCATTTCAGAATTTGGGGCTGTCCTGCACATGTGTTAGTAGCAAACCCAAAGAAATTGGATTCACGTTCAAAATTATGTCTATTCGTAGACTACCCAAAAGAAACATGAGGTGGATACTTCTTTGATCCAAAAGAAAATAAAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACGTTAAAGAACACATACCACGCAGTAAAATTGTATTAAGTGAACTTTCTGGTAAAGCTACAAATGAATCAACAAGAGTTGTTGATGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTTCAGAGGCTGGACCTTCAACAAGAGTTATTGATGAGCCTAGTACATCTGGTCAAGCCCATCCTTCTCAAGAGTTGAGAATGCTTCGACGTAGTGGGAGGATTGTTATTCAACCTGAACGCTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGATGGGGTTGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTGGGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGAGACACAGCTGGGAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAGGGTTATACCCAAAAGGAGGAGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAAGTCGATAAGAATTCTCTTGTCCATTGCCACCTTTTATGATTATGAGATTTGGCAAAAGGACGTCAAGACTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGTCTCAACCAGAGGGGTTCGTTGCTCAAGGTCAGGAGCAAAAAGTTTGCAAGCTTAGACGATCCATTTATGGGTTGAAGTAAGCTTCCAGATCTTGGAATATAAAGTTTGACACTGCTGTCAAATCTTATGGCTTTGACCAGAACGTTGATGAGCCTTGTGTTTACAAAAGGATTATCAACAATTCAGTAGCTTTTCTGGTGTTGTATGTAGATGATATCCTTCTCATTGGGAATGATGTAGAATATCTGGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATGAAATATTTGGGTGAAGCTCAGTATGTTCTAGGGATTCAAATTATCAGGAATCGTAAGAACAGAACGTTAGCCCTGTCTCAAACATCATATATAGACAAGATGTTGTCTAGATATAAGATGCAGAATTCCAAGAGGGGTTTACTACCCTTCAGGCATGGAGTTCATTTGTTTGTTATCTGCTTTCCTTATATATATGCTGAGAATGAAATGTTATGTATACACTTGTGAACATTGAAATGGTTGTATTAATCTTAAATTGTGGTTATTGCATTGCTTGAGACGTGTACGCTTAAGAAGTATAAGCATGATAGGTACTTGTCTTAGGCTTAGTGACTGTTGAGTCAAGCATGTTGAGTCTGGCTTGCATGTTGCATGGGGGATGTGTAGCATGAGTCGGGTGTGAGGCACAGCACATGGTGCGTAATTCATGTGTAGCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGGAAATCATGTAAGAGTGGCGCAACACATGGCGCGGAATTCATGTGGGGTAGGTGCGGAACATGACGCGAAAATCATATAAGTAGCGCAGCACATGGCGCGGAATTCATATGTAGCGGAGCATGACGCGAAAATCATGTAAGTAGCGCAACAAATGGCGCAGAATTCATGTGTAGCGAAGCATGACGCGGAAATCATGTAGTTAGCGGAGCATGACGCGGAAATCATGTAGTTAGCGGAGCATGACGCGAAAATCATGTGGGGTCTAGTGCGTGTGTTGCATAACATGTGGTGCATTGGCATGAGTTGATTGTTTTTAAGAAAGTGGTTAGTTTTTAGCTTGAAAACATGTTTCTTATCGTGGTTGTGTAACCTGTTGGATGCGTATGCCGAGCATGATGCTTGTATTGGTTGTTTGTTTGCGGTTTGGTAAGTGTTAGCTGCTTACCAGTACCACGGTTGTACTGATACCCCCTTCCCCACCTTCCCCAAACATTTTAGATGTTACAGGTATCGAGGATGATCCGGACCTTGGTGGCGAGGAGGAGAACTGGGAGGAAAGATTCCTAGAGTGTTAGTTTCCTTGTTGATATGTTTTCCTAGTGATGTAATAAGTTAGCTGGAGAATGGCTTATTTTCTAGTTTATAAACTAGAGTGGATAGTTTGGTTGAGTTTAATAAAATGTATAGCTAGTGGACTAGCTCCTTTGTTAATAGAATGTCTTTATGATTCCGCTGCATGATTTTTGGTTATTCTATGTTGCTGTGAGTTTCTGTCTCGTCTAGAGGAGATAGGATGTGAAATCTAACTGCAGTTTGGTGGAGATTTACTTATGCATGATTAGCAGTGTTAAAATGTCTTTATCAGGTGTTGTAAAAAAATTTGGTTCAGGTCATTTTATGTCGTTATGTTGCCGAAATTTTCAGTACATCCGGTTTAAAGTGGTTCAGTTCTAGAGTTCAGAGTTAAGGGTTGTAGTAGTGTCCCTAGGCTAAGGGTTGGAAAACCTGGGGCGTTACAAAACTACAGTCCATCTTCTAAGATGAGTAGTCTCTCTCTCTGTGGAACTCTCTGTAAGATTGAAAGAAAAATATTTTCTCTCAAAGATTTTCTTCTCTCACACACAAAAACAAAGCATCATTCTTGCTGTTGTTTTTCCTTCAAAAATTCCCTTCAAAGTCGAGTTCCCACAAACTCTATCTCAAGAATCCAGAGATATAGTGGGTGTACTATTTTGGTAGTGTTCTTGGAGCAAAAAGGCAGTACGTGAATTTTCATATTGGAGCATTGTGCTGAGGTGTTTGGCCACTACAATTCACGTAAGTTTCTTCAACTCTAAAGTAGTTTAGAGTAATTTACAACATGTTTAATCTTAGAATTCTGAAGAAATAGTTTCTGAATTGGCCCAATGTATCGGTGTTTTTCAAAGAAACTAAATTATTTTCTTTCTTGCATTTCTTATTCTTAGATAGGTTGTTTGTCATTGTTTGTGTCTTTTGTTTTTTGTTTTGTTTGGAAAAAATTATGAAAAAAGTATATCTCTATGAAAAAACCACTAGATGTCTAGTTGGCTTAGTCTTCTAAAGGCACCTACAGCAGAGCTGAATAAGAGTAGAACTTAACTTACAATCTAGTTGTCAGGGTGTTTGCAAGCACAACCGGAGGAATCTGTCTGCACCATGTCTATGTGGGCATTGTAGATAGCATTCTCCAATCGTTTGCAATGCATGAGTGAGTCGTTAGCGCAAAGGTGGGTGCAATCTAAAGTTTTTGGTTGATGTTATTTGAATTCCCCGTTTATCTCTATATCTTTACCGCTTTCATTTACTTTCTTTATTTTATCTTTTTGTCTTCAATAAAAAAACCCTTAATCAAATTATTTCTTTGGTTACCATCCTCAGTTGCAAACTTAGAGGAGGTTCTTGTTTTGCAGAATTGAGTGTTTTCAACTAGTCACCATCTCCCTTGGATACGAAATCCGAAATAATTAACTCTCCACTGTATTACTTCATAGTGTATACTTGAACTCACATCAGCCATCACTGCTTTAGAACTTTTTGTTAGATCTTGGATGAGAGAAAAAGATGAAAGGTAGAGAGGTTCCACCAGACGTTCCGAACTATTCACACGAGATTTCAGGTGAGAAATTTGATTTGTGACACAACTTCTAGTATGTATTGATTCTTGTATAAGTATAGTGGAGTACAATCTCTAGTTACAATCCTTTGATATTTTCTCTCACAAACTTAAGGTAGCTTTGATCTACAAAGTCTTCAAGAAGCTTTTGTCTTGAGTCTTGAAGGTTTGTTCGCTCTTTAAAAGGATTTGAGCTCTTTTGGATAGATTAATGTAGACTTCAGTTTTCAGGTCTTCATTCTTCCAATCTTTAGAAGGATTTCACTTCAATCTTCAATTTTGCAGGAGGATTTGAGCTTGAACCTTCATTTTTGCAGGAGAATCTGAGCTTCAATCTTCAGTTCTGTAAGAGGATTTGAGCTTTAATGTAAACCCGAGACTTTCCTTGGATTATTTTCTCTTGTCACATATTGTAGCACTACAGGAAAATTCACATATGATAGCATTCAAAAAACGCTATCGTAAGGCTACTATAGCGTTTTATGGAACGTTGTCGTAGCCCATGTTATTGAAAGTCTGAGACTTTTAATAGCGTTTTTTTCCCGCTATCGAAATCGCTGTAGAAAGTCTAAATTTATTGCGTTATTCTAATGTTATGGATACCTTTCATAGCGCTGGAAAAAACGCTATGGTTTCTCTTTAATAATGTTTTTATAGCTGTAAAAAAACGCTATAAAAGGCATATCATAGCGTTTTTACCACGTTGTTGTAGCCGATGTTATTGAAAGTCTCTAACTTTCTATAGCATTTTTTCGACGCTATTGAAAATGCTATAAAAAGTTTATTTTTATTGCGTTACTTTAATGTTATGGAAACTTGTCATAGCGTTGAAAAAACAATATGATTTCTCTTTAATAGCATTTTTATAGCAAAAAAAAAAGCTATAAAAGACATATCATAGTGTTTTTACCATGTTGTTGTAGCCAATGTTATTGAAAGTCTCTAACTTTTCATTACATTTATTCGACGCTATTGAAAATGCTATAAAAACGTCCATTTTTTATTTCGTAATTATAATGTTATGGAATTTTTTCACAGCGTTACGGAAAAAGCTATGATTGGTCTTTAATAACATTTTTTATGCAGTGAAATAATGCTATAGAAAATATAGTATAGCATTTCTAATCTTGTTATGATAGACACACTAATTGACTTTTAATAGTTTTTTTATATATATATTATATTGTTTTGTTTACATTTATATATATATATTTTTGTTTCAAATGTTTTTATATTGAAAAATCAACTTGAATTTTTTTATAATGTATTTAAATTCATTTTCAAAACTAATAACATATATATCTATTGAAGTCGGTTGTCTCATCATGTATAGACAACTAAAAAATGCAAATCTTCAAATTTCTCATAAAGATACACATAAAATCTTCAAATTCCCAGCACAAAAAATGAAGTACTACAATAAAATCAAAAGTTTGTTATAGCAAATAAATATAAAGAAAGTTGCAAATTTATAATCCAAATAACTAAGGCCCCGTTTGATAACCATTTCGTTTTTGTTTTTTGTTTTTGAAAATTATGCTTGTTTTCTCCTAAATTCCCTATCATGACTTTCATCCTCATTAATGATCCATTTGAATTCTTAGCCAAATTCTAAAAACAAAACAAGTTTTTGAAAACTACTTTTTTTTTGTTTTCAAAATTGGACTTGGTTTTTGAAAACAAAGAAGAAGGTTGATACTAAAACAAAGAAACTTATAAGTAAAATTAGGTGTGTATAAGCTTAATTTTCAAAAACAAAAAACAAAAAACGAAATGGTTATCAAACGGGACCTAAAATAACTACATATAGTCCCAAAAAGTTCATATACTCAAAAACAACTAAGAGAGTTGATGAAATGGTTGCGTACTAACAATTCCCCCATGAACATGCAGTTGCATGTGTCTTATCCTTTCATTCACCTAATAAATTAAAAAACCAAATGTACTAAAAAAACAAATTAAAAACCAAAATATCTTTTATAAATCATGATTTAAACTCTAACCCATGAACCAATATAATAAATAAACATTAAAACTCATAAACAGTTTTACACATTAGAATTGCAACATTATAAAACACCAGCTCATGTAGAAATCAACAAACAGTTAAACCTTGCAAGAAACAATAACAAAAAGACCAAAAGTGTTTGAATGTGTAACGTCTCAGGTTTCTATACCTAACCTAGCGACATTCCTAAAGCCCCTATCTTAATTCTAATCAGATACTTTGAAAATTTCAGCAACATAACGACATACATATAGGACTAACCCAAATTTTTTTAAACTTGTTAAACAAGATAAACGCACTTCTATATGAAAAACCGCCCAACTACAACCTCACAACACTTCAGGTTCACAACAAGACTCCAAATATCCTCAAACTAAGCAAGCCAGGTCTCGCATCAGATACTATTCAAAATCTAATCCAAGAATTATACAAATTTAATTTAAACGTAGGGACTCAATCACAAACTCTAATTTTCAAATCTGAATTCAAACCAAGTAATGAACGGTAAACATGCTATACTTATTTCCAAAAGTCATAAGAAAACTTTTAGAATCGCAAACCAAACCACTGCAAATACTATAGGTAGCCACATCCACAAATCTAAACTACCATTGCGACCTATAATTTAAGCAAGAATCTCAAACTATTTAATCAAAATCTCAAAACTAACAAAAACCTTGGTCAAAATTATAAAACATGATCACACTCAAAACTAACAAACCTGGACTAGCAAAACTAACCCCTCATTGCTCTAAAATATATAAACTAAACAACAAAATTGATGTTCAACAAAACAACGAGTTCAAAGCACCAATTGTAACCCACCCCCTCTATTAACAAAATCTTTAAGCAAAAACAACTACTTCAAATCAAATCCAAAGCAATCTACCCTCACATGCTCAAAGCCTTTAACAAGTGAACTAGAAAAGCTAAAACAATCAAAGTTTTAGAAGCATAACATCAAAACGTATAACTGATGTGGAATTTTCCTTACCGCAAGCGCACGGGTCAAGTAATAATAAAGTGTTCGTATAAACGAGTGTCGTCCTCTGGATTGGATTTTTAAGCAAATCAAGTATTGTGAATTATCTTGCTCAATTTTATTTAGGGATCAAAAAGTAGTGAATGAATGGTAGGAAACTTAATTCAAAGCAAGTAAAAGGTGGAAATTCAATTGGAAACAACACTTAGGGAATTGATTTCGCCGATTTAGTCTAGGATTTATTTTAAGGAGTAAGTTATTATGCACAATGGAATTATGAGCCAAAGACCCTTTAACCACAACTACCTCTCCTGAGTATAATTGATTCTCATGCAAATTAACCAACTTGATATCTCTATCTAAGTTTATTAACAAGCAAGGCAATGAGATCTTTTAACCAATTCTAAAGTTAATCTAAAACCGTCACGAATTAGAAAATAAACTTTAGTCATGAAATCTTGAGTAAGTCTATATCAAATCCCCTCTCCCGAGCTAGATTCGAAACTTAAATCATTCAACTAATGGCCAAGTAATTGAAAGCATTAAATCAAGATTACAAACATAATAAACATTGCATAACTTAAACCATAAATCAATCCATAAACACAATCAACTACATCAATCCCTAAGCTTACAAGTTTAGCTACTCATGAATAGAATTACAAGCTCAACTAAACATGTAAAAGCCATAAAATCTAGCTAAAGAAAGGAAAAGGGAAGGAAAAACTCAATGTAGGCCGAGACCGAACGTCCCCGTCACGATCCCCACGAACAATGGGTTGATTTCCGAATCTCTCCCTTTGTTTCATCCTTCAATTGCTCTGTTTCTGCCCTCTTTACGGCTCCAAGAAGTCCCAAAATTGAATCCCCAAAAACCTAGCTGAAAAGAACTATTTAATCTGAAATTTCATCGTAGCGTCGAGACGCTGTAAAGACAGCGTCGCGACGCTGCCTCTAAACACGCGCGTTCTGGAAAGGAAATTACGGCAGCGTCGAGACGCTAAGGTCGCAGCGTCGAGACGCTGCGATGATTTCGCGTCTATCCAAATAAGGAATGCTAGCGTCGAGACGCCTAAGGGCTAG

mRNA sequence

ATGAGTTATTCCGGGAATAGGGATTGGCTGAGATTGTCTCAATTCAGAGGAGGAGTAGCTGACTACCCTACGTTAGATGCTGGGGAAATGACTCTCAGAGTTGGGATTGGGGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGTTATATGTTATTAGATAACGTATATATAGTTCCCAGAATAAAAAGAAATTTGATTTCTATTTCTTGTTTGCTAGAACAAGAGTATTCAGTGTCCTTTTCTGTGAATGGAGCTTTCATTACTAAAAAGGGTGCTGATATATGTTCTGCAAAAATGGAGAACAATTTGTATGTCTTAAGACCAATAGAGTCTAGGGCCATTTTGAACAATGAAATGTTTAAAACAGCTGAGACTCAACCAAAAAGGCAAAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGCCACATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTCTATGTGAGTCATGTCTCGAAGGTAAGATGACCAAACGACCTTTTACTGGAAAAGGTTATCGTGCCAATGAGCCCTTAGAACTCAAGAACATGGAATATCTCTCAGCACTAGGTACACCTCAGCAGAATGGTGTATCTGAAAGGAGAAACAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCCAGTTCTTTTTGGGGTTATGCAGTAGAGACTGCAGTTTATATTTTGAATGTTGCTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTTTGGAAAGGGCGTAAAGTTAGTTTACGTCATTTCAGAATTTGGGGCTGTCCTGCACATGTTAAAATTGTATTAAGTGAACTTTCTGGTAAAGCTACAAATGAATCAACAAGAGTTGTTGATGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTTCAGAGGCTGGACCTTCAACAAGAGTTATTGATGAGCCTAGTACATCTGGTCAAGCCCATCCTTCTCAAGAGTTGAGAATGCTTCGACGTAGTGGGAGGATTGTTATTCAACCTGAACGCTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGATGGGGTTGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTGGGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGAGACACAGCTGGGAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAGGGTTATACCCAAAAGGAGGAGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAAGTCGATAAGAATTCTCTTGTCCATTGCCACCTTTTATGATTATGAGATTTGGCAAAAGGACGTCAAGACTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGTCTCAACCAGAGGGGTTCGTTGCTCAAGGTCAGGAGCAAAAAAACGTTGATGAGCCTTGTGTTTACAAAAGGATTATCAACAATTCAGTAGCTTTTCTGGTGTTGTATGTAGATGATATCCTTCTCATTGGGAATGATGTAGAATATCTGGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATGAAATATTTGGGTGAAGCTCAGTATGTTCTAGGGATTCAAATTATCAGGAATCCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGGAAATCATTACCACGGTTGTACTGATACCCCCTTCCCCACCTTCCCCAAACATTTTAGATGTTACAGGTATCGAGGATGATCCGGACCTTGGTGGCGAGGAGGAGAACTGGGAGGAAAGATTCCTAGAGTTTCTAGAGTTCAGAGTTAAGGGTTGTAGTAGTGTCCCTAGGCTAAGGGTTGGAAAACCTGGGGCGTTACAAAACTACAGTCCATCTTCTAAGATGAGTAGTCTCTCTCTCTGTGGAACTCTCTGTAGCTTTGATCTACAAAGTCTTCAAGAAGCTTTTGTCTTGAGTCTTGAAGGAGGATTTGAGCTTGAACCTTCATTTTTGCAGGAGAATCTGAGCTTCAATCTTCAGTTCTCGTCGAGACGCTGTAAAGACAGCGTCGCGACGCTGCCTCTAAACACGCGCGTTCTGGAAAGGAAATTACGGCAGCGTCGAGACGCTAAGGTCGCAGCGTCGAGACGCTGCGATGATTTCGCGTCTATCCAAATAAGGAATGCTAGCGTCGAGACGCCTAAGGGCTAG

Coding sequence (CDS)

ATGAGTTATTCCGGGAATAGGGATTGGCTGAGATTGTCTCAATTCAGAGGAGGAGTAGCTGACTACCCTACGTTAGATGCTGGGGAAATGACTCTCAGAGTTGGGATTGGGGAGCTCATTTCAGCTAAAGTAGTGGGAGATATTAAAATGTTCTTCTCCAGGGAACGTTATATGTTATTAGATAACGTATATATAGTTCCCAGAATAAAAAGAAATTTGATTTCTATTTCTTGTTTGCTAGAACAAGAGTATTCAGTGTCCTTTTCTGTGAATGGAGCTTTCATTACTAAAAAGGGTGCTGATATATGTTCTGCAAAAATGGAGAACAATTTGTATGTCTTAAGACCAATAGAGTCTAGGGCCATTTTGAACAATGAAATGTTTAAAACAGCTGAGACTCAACCAAAAAGGCAAAAAGTTTCTCAAAGTACCTATCTTTGGCACTTGAGACTTGGCCACATAAATCTCAATAGGATTGGGAGATTGGTTAAGAACGGACTTCTAAGCCAGTTAGAAGATGGTACTTTACCTCTATGTGAGTCATGTCTCGAAGGTAAGATGACCAAACGACCTTTTACTGGAAAAGGTTATCGTGCCAATGAGCCCTTAGAACTCAAGAACATGGAATATCTCTCAGCACTAGGTACACCTCAGCAGAATGGTGTATCTGAAAGGAGAAACAGAACCTTGTTAGACATGGTTCGATCTATGATGAGCTATGCTCAGTTGCCCAGTTCTTTTTGGGGTTATGCAGTAGAGACTGCAGTTTATATTTTGAATGTTGCTCCCTCAAAGAGTGTTTCTGAAACACCATTTGAGCTTTGGAAAGGGCGTAAAGTTAGTTTACGTCATTTCAGAATTTGGGGCTGTCCTGCACATGTTAAAATTGTATTAAGTGAACTTTCTGGTAAAGCTACAAATGAATCAACAAGAGTTGTTGATGAGGCTGGACCTTCAACAAGAGTTGCTGATGAGCCCAGTTCAGAGGCTGGACCTTCAACAAGAGTTATTGATGAGCCTAGTACATCTGGTCAAGCCCATCCTTCTCAAGAGTTGAGAATGCTTCGACGTAGTGGGAGGATTGTTATTCAACCTGAACGCTATCTGGGTTTAGCAGAAACTCCAACTGTCATACCTGATGATGGGGTTGAGGATCCATTGACCTTTAGACAGGCAATGAATGATGTGGACCATGACCAGTGGGTCAAAGCTATGGACCTAGAGATGGAGTCTATGTACTTCAATCAAGTCTGGGATCTTGTAGACCTACCTGAAGGGGTTAAACCCATAGGGTGCAAATGGATCTACAAGAGAAAACGAGACACAGCTGGGAAAGTACAGACTTTCAAAGCTCGACTTGTGGCAAAGGGTTATACCCAAAAGGAGGAGGTGGACTATGAGGAAACCTTCTCCCCAGTTGCCATGTTAAAGTCGATAAGAATTCTCTTGTCCATTGCCACCTTTTATGATTATGAGATTTGGCAAAAGGACGTCAAGACTGCCTTTCTGAATGGCAATCTTGATGAGAACATCTTTATGTCTCAACCAGAGGGGTTCGTTGCTCAAGGTCAGGAGCAAAAAAACGTTGATGAGCCTTGTGTTTACAAAAGGATTATCAACAATTCAGTAGCTTTTCTGGTGTTGTATGTAGATGATATCCTTCTCATTGGGAATGATGTAGAATATCTGGCTGACATTAAAAAATGGCTATCGACTGAGTTCCAAATGAAATATTTGGGTGAAGCTCAGTATGTTCTAGGGATTCAAATTATCAGGAATCCGGAGCGTGACGCGGAAATCACGTGGGTTGTGAGTGCGGAGCGTGACGCGGAAATCATTACCACGGTTGTACTGATACCCCCTTCCCCACCTTCCCCAAACATTTTAGATGTTACAGGTATCGAGGATGATCCGGACCTTGGTGGCGAGGAGGAGAACTGGGAGGAAAGATTCCTAGAGTTTCTAGAGTTCAGAGTTAAGGGTTGTAGTAGTGTCCCTAGGCTAAGGGTTGGAAAACCTGGGGCGTTACAAAACTACAGTCCATCTTCTAAGATGAGTAGTCTCTCTCTCTGTGGAACTCTCTGTAGCTTTGATCTACAAAGTCTTCAAGAAGCTTTTGTCTTGAGTCTTGAAGGAGGATTTGAGCTTGAACCTTCATTTTTGCAGGAGAATCTGAGCTTCAATCTTCAGTTCTCGTCGAGACGCTGTAAAGACAGCGTCGCGACGCTGCCTCTAAACACGCGCGTTCTGGAAAGGAAATTACGGCAGCGTCGAGACGCTAAGGTCGCAGCGTCGAGACGCTGCGATGATTTCGCGTCTATCCAAATAAGGAATGCTAGCGTCGAGACGCCTAAGGGCTAG

Protein sequence

MSYSGNRDWLRLSQFRGGVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQKVSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLELKNMEYLSALGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHVKIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQGQEQKNVDEPCVYKRIINNSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPERDAEITWVVSAERDAEIITTVVLIPPSPPSPNILDVTGIEDDPDLGGEEENWEERFLEFLEFRVKGCSSVPRLRVGKPGALQNYSPSSKMSSLSLCGTLCSFDLQSLQEAFVLSLEGGFELEPSFLQENLSFNLQFSSRRCKDSVATLPLNTRVLERKLRQRRDAKVAASRRCDDFASIQIRNASVETPKG
Homology
BLAST of Lag0015614 vs. NCBI nr
Match: KAA0025945.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0035786.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0040492.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0041262.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 808.9 bits (2088), Expect = 3.8e-230
Identity = 444/751 (59.12%), Postives = 499/751 (66.44%), Query Frame = 0

Query: 20  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCL 79
           + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL
Sbjct: 223 SSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCL 282

Query: 80  LEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQK 139
           +E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E++A+LN+EMF+TA TQ KRQ+
Sbjct: 283 IEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQR 342

Query: 140 VS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGY 199
           +S   +TYLWHLRLGHINL+RIGRLVKNGLL++L+D +LP CESCLEGKMTKRPFTGKGY
Sbjct: 343 ISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGY 402

Query: 200 RANEPLEL--------------------------------------------KNMEY--- 259
           RA EPLEL                                            K  EY   
Sbjct: 403 RAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTE 462

Query: 260 -------------------------------------LSALGTPQQNGVSERRNRTLLDM 319
                                                LSA GTPQQNGVSERRNRTLLDM
Sbjct: 463 VENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDM 522

Query: 320 VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAH 379
           VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAH
Sbjct: 523 VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH 582

Query: 380 V----------------------------------------------------------K 439
           V                                                          K
Sbjct: 583 VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSK 642

Query: 440 IVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRML 499
           +VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM 
Sbjct: 643 LVLSE----ATDESTRVVDEVGPSSRV---------------DETTTSGQSHPSQSLRMP 702

Query: 500 RRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFN 559
           RRSGR+V QP RYLGL ET  VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN
Sbjct: 703 RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN 762

Query: 560 QVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM 594
            VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+E VDYEETFSPVAM
Sbjct: 763 SVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAM 822

BLAST of Lag0015614 vs. NCBI nr
Match: KAA0035907.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 799.7 bits (2064), Expect = 2.3e-227
Identity = 440/751 (58.59%), Postives = 496/751 (66.05%), Query Frame = 0

Query: 20  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCL 79
           + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL
Sbjct: 223 SSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCL 282

Query: 80  LEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQK 139
           +E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E++A+LN+EMF+TA TQ KRQ+
Sbjct: 283 IEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQR 342

Query: 140 VS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGY 199
           +S   +TYLWHLRLGHINL+RIGRLVK+GLL++L+D +LP CESCLEGKMTKRPFTGKGY
Sbjct: 343 ISPNNNTYLWHLRLGHINLDRIGRLVKDGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGY 402

Query: 200 RANEPLEL--------------------------------------------KNMEY--- 259
           RA EPLEL                                            K  EY   
Sbjct: 403 RAKEPLELIHSDLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTE 462

Query: 260 -------------------------------------LSALGTPQQNGVSERRNRTLLDM 319
                                                LSA GTPQQNGVSERRNRTLLDM
Sbjct: 463 VENLLSKKIKIFRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDM 522

Query: 320 VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAH 379
           VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAH
Sbjct: 523 VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH 582

Query: 380 V----------------------------------------------------------K 439
           V                                                          K
Sbjct: 583 VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRSK 642

Query: 440 IVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRML 499
           +VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM 
Sbjct: 643 LVLSE----ATDESTRVVDEVGPSSRV---------------DETTTSGQSHPSQSLRMP 702

Query: 500 RRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFN 559
           RRSGR+V QP RYLGL ET  VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN
Sbjct: 703 RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN 762

Query: 560 QVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM 594
            VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT+KE VDYEETFS VAM
Sbjct: 763 SVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAM 822

BLAST of Lag0015614 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 753.4 bits (1944), Expect = 1.9e-213
Identity = 416/732 (56.83%), Postives = 482/732 (65.85%), Query Frame = 0

Query: 20   ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCL 79
            + +  L  GE+TL+VG GE++SA+ VGD+ +FF ++RY++L +V  VP +KRNLISI+C+
Sbjct: 319  SSWKKLKEGEITLKVGTGEVVSAEAVGDLTLFF-QDRYLILKDVLYVPLMKRNLISIACI 378

Query: 80   LEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQK 139
            LE  Y++SF VN  FI  KG  ICSA  ENNLY LRP  +  +LN EMF+T ETQ K+QK
Sbjct: 379  LEHIYTISFEVNEVFILCKGIQICSAIRENNLYKLRPTRANVVLNTEMFRTLETQNKKQK 438

Query: 140  VSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRA 199
            VS + YLWHLRLGHINLNRI RLVK+G+L+QLED +LP CESCLEGKMTKR FTGKG RA
Sbjct: 439  VSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPPCESCLEGKMTKRSFTGKGLRA 498

Query: 200  NEPLEL--------------------------------------------KNMEY----- 259
              PLEL                                            K  EY     
Sbjct: 499  KVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYLLHHKSESFEKFKEYKAEVE 558

Query: 260  -----------------------------------LSALGTPQQNGVSERRNRTLLDMVR 319
                                               LSA  TPQQNGVSERRNRTLLDMVR
Sbjct: 559  NEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPSTPQQNGVSERRNRTLLDMVR 618

Query: 320  SMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV- 379
            SMMSYAQLP SFWGYA+ETA++ILN  PSKSV ETP+ELWKGRK SLR+FRIWGCPAHV 
Sbjct: 619  SMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGRKSSLRYFRIWGCPAHVL 678

Query: 380  -----------KIVL-----SELSG------------KATNESTRVVDEA---GPSTRV- 439
                       K+ L      E  G             +TN +    D      P +++ 
Sbjct: 679  VQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNATFLEEDHXRNHQPRSKIV 738

Query: 440  --------ADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAET 499
                     D+PSS    ST+V+D+ + S Q+H SQELR+ RRSGR+V QP RYLGL ET
Sbjct: 739  LKEMFKNATDKPSS----STKVVDKANISDQSHTSQELRVPRRSGRVVHQPNRYLGLVET 798

Query: 500  PTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKW 559
              +IPDDGVEDPLT++QAMNDVD DQW+KAM+LEMESMYFN VW LVDLP  VKPIGCKW
Sbjct: 799  QIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLVDLPSDVKPIGCKW 858

Query: 560  IYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIW 594
            IYKRKRD AGKVQTFKARLVAKGYTQKE VDYEETFSPVAMLKSIRILLSIATFY+YEIW
Sbjct: 859  IYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATFYNYEIW 918

BLAST of Lag0015614 vs. NCBI nr
Match: KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 730.7 bits (1885), Expect = 1.3e-206
Identity = 409/753 (54.32%), Postives = 470/753 (62.42%), Query Frame = 0

Query: 18   GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISIS 77
            G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ 
Sbjct: 323  GISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS-FLLLENVYVVPDLKRNLISVK 382

Query: 78   CLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKR 137
            CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR + S+A+LN EMFKTA TQ KR
Sbjct: 383  CLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKR 442

Query: 138  QKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGK 197
             K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGK
Sbjct: 443  LKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGK 502

Query: 198  GYRANEPLEL--------------------------------------------KNMEY- 257
            G+RA EPLEL                                            K  EY 
Sbjct: 503  GHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYK 562

Query: 258  ---------------------------------------LSALGTPQQNGVSERRNRTLL 317
                                                   LSA GTPQQNGVSERRNRTLL
Sbjct: 563  AEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLL 622

Query: 318  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCP 377
            DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCP
Sbjct: 623  DMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCP 682

Query: 378  AHV--------------------------------------------------------- 437
            AHV                                                         
Sbjct: 683  AHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPR 742

Query: 438  -KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELR 497
             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR
Sbjct: 743  SKIVLNELSKETTEPSTRVVEEPSALTRVV-----HVGSSTR----------THQPQSLR 802

Query: 498  MLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMY 557
              RRSGR+   P RY+ L ET TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMY
Sbjct: 803  EPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY 862

Query: 558  FNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV 594
            FN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Sbjct: 863  FNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPV 922

BLAST of Lag0015614 vs. NCBI nr
Match: TYK14550.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 730.7 bits (1885), Expect = 1.3e-206
Identity = 409/753 (54.32%), Postives = 470/753 (62.42%), Query Frame = 0

Query: 18   GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISIS 77
            G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ 
Sbjct: 324  GISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS-FLLLENVYVVPDLKRNLISVK 383

Query: 78   CLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKR 137
            CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR + S+A+LN EMFKTA TQ KR
Sbjct: 384  CLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKR 443

Query: 138  QKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGK 197
             K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGK
Sbjct: 444  LKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGK 503

Query: 198  GYRANEPLEL--------------------------------------------KNMEY- 257
            G+RA EPLEL                                            K  EY 
Sbjct: 504  GHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYK 563

Query: 258  ---------------------------------------LSALGTPQQNGVSERRNRTLL 317
                                                   LSA GTPQQNGVSERRNRTLL
Sbjct: 564  AEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLL 623

Query: 318  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCP 377
            DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCP
Sbjct: 624  DMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCP 683

Query: 378  AHV--------------------------------------------------------- 437
            AHV                                                         
Sbjct: 684  AHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPR 743

Query: 438  -KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELR 497
             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR
Sbjct: 744  SKIVLNELSKETTEPSTRVVEEPSALTRVV-----HVGSSTR----------THQPQSLR 803

Query: 498  MLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMY 557
              RRSGR+   P RY+ L ET TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMY
Sbjct: 804  EPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY 863

Query: 558  FNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV 594
            FN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Sbjct: 864  FNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPV 923

BLAST of Lag0015614 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 2.0e-64
Identity = 167/490 (34.08%), Postives = 249/490 (50.82%), Query Frame = 0

Query: 215  GTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFE 274
            GTPQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA Y++N +PS  ++ E P  
Sbjct: 577  GTPQHNGVAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPER 636

Query: 275  LWKGRKVSLRHFRIWGCP--AHV-KIVLSELSGK-------------------------- 334
            +W  ++VS  H +++GC   AHV K   ++L  K                          
Sbjct: 637  VWTNKEVSYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKV 696

Query: 335  ---------------ATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQ---- 394
                           A + S +V +   P+       S+    +    DE S  G+    
Sbjct: 697  IRSRDVVFRESEVRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGE 756

Query: 395  ---------------AHPSQ---ELRMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPL 454
                            HP+Q   + + LRRS R  ++  RY     T  V+  D   +P 
Sbjct: 757  VIEQGEQLDEGVEEVEHPTQGEEQHQPLRRSERPRVESRRY---PSTEYVLISDD-REPE 816

Query: 455  TFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQ 514
            + ++ ++  + +Q +KAM  EMES+  N  + LV+LP+G +P+ CKW++K K+D   K+ 
Sbjct: 817  SLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLV 876

Query: 515  TFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNL 574
             +KARLV KG+ QK+ +D++E FSPV  + SIR +LS+A   D E+ Q DVKTAFL+G+L
Sbjct: 877  RYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDL 936

Query: 575  DENIFMSQPEGFVAQGQEQ---------------------------------KNVDEPCV 604
            +E I+M QPEGF   G++                                  K   +PCV
Sbjct: 937  EEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCV 996

BLAST of Lag0015614 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 164.9 bits (416), Expect = 3.8e-39
Identity = 151/580 (26.03%), Postives = 241/580 (41.55%), Query Frame = 0

Query: 194  GKGYRANEPLEL---KNMEY-LSALGTPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWG 253
            G+ Y +NE  +    K + Y L+   TPQ NGVSER  RT+ +  R+M+S A+L  SFWG
Sbjct: 552  GREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWG 611

Query: 254  YAVETAVYILNVAPSKSV---SETPFELWKGRKVSLRHFRIWGCPAHVKIVLSELSGKAT 313
             AV TA Y++N  PS+++   S+TP+E+W  +K  L+H R++G   +V I      GK  
Sbjct: 612  EAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFGATVYVHI--KNKQGKFD 671

Query: 314  NESTR---------------------------VVDEAG---------PSTRVADEPSSE- 373
            ++S +                           VVDE            +  + D   SE 
Sbjct: 672  DKSFKSIFVGYEPNGFKLWDAVNEKFIVARDVVVDETNMVNSRAVKFETVFLKDSKESEN 731

Query: 374  ---AGPSTRVI--DEPSTSGQAHPSQELRMLRRS--------GRIVIQPE---------- 433
                  S ++I  + P+ S +    Q L+  + S         R +IQ E          
Sbjct: 732  KNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDN 791

Query: 434  -----------RYL-------------------------GLAETPTVIPDDGVEDP---- 493
                       +Y                            +ET   + + G+++P    
Sbjct: 792  IQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKND 851

Query: 494  -------------------------------LTFRQAMNDV-----------DHDQWVKA 553
                                           L      NDV           D   W +A
Sbjct: 852  GIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEA 911

Query: 554  MDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEV 591
            ++ E+ +   N  W +   PE    +  +W++  K +  G    +KARLVA+G+TQK ++
Sbjct: 912  INTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQI 971

BLAST of Lag0015614 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 141.7 bits (356), Expect = 3.4e-32
Identity = 87/245 (35.51%), Postives = 125/245 (51.02%), Query Frame = 0

Query: 385  DPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLV-DLPEGVKPIGCKWIYKRKRDTA 444
            +P T  QAM D   D+W +AM  E+ +   N  WDLV   P  V  +GC+WI+ +K ++ 
Sbjct: 938  EPRTAIQAMKD---DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSD 997

Query: 445  GKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFL 504
            G +  +KARLVAKGY Q+  +DY ETFSPV    SIRI+L +A    + I Q DV  AFL
Sbjct: 998  GSLNRYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFL 1057

Query: 505  NGNLDENIFMSQPEGFVAQ--------------GQEQ-------------------KNVD 564
             G L + ++MSQP GFV +              G +Q                    ++ 
Sbjct: 1058 QGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSIS 1117

Query: 565  EPCVYKRIINNSVAFLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQI 596
            +  ++      S+ ++++YVDDIL+ GND   L      LS  F +K   +  Y LGI+ 
Sbjct: 1118 DTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLDALSQRFSVKEHEDLHYFLGIEA 1177


HSP 2 Score: 66.2 bits (160), Expect = 1.8e-09
Identity = 39/135 (28.89%), Postives = 75/135 (55.56%), Query Frame = 0

Query: 169 SQLEDGTLPLCESCLEGKMTKRPFTGKGYRANEPLELKNMEYLSALG---------TPQQ 228
           SQ++D T  + +S +E +   R  T       E + L+  +YLS  G         TP+ 
Sbjct: 544 SQVKD-TFIIFKSLVENRFQTRIGTLYSDNGGEFVVLR--DYLSQHGISHFTSPPHTPEH 603

Query: 229 NGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGR 288
           NG+SER++R +++M  +++S+A +P ++W YA   AVY++N  P+  +  ++PF+   G+
Sbjct: 604 NGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQ 663

Query: 289 KVSLRHFRIWGCPAH 294
             +    +++GC  +
Sbjct: 664 PPNYEKLKVFGCACY 675

BLAST of Lag0015614 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 140.2 bits (352), Expect = 1.0e-31
Identity = 109/372 (29.30%), Postives = 170/372 (45.70%), Query Frame = 0

Query: 299  SELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEP------STSGQAHPSQEL 358
            S+L+   +  +        P+T  +   +S   PS  +   P      + + QA P    
Sbjct: 869  SQLAQSLSTPAQSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQA-PLNTH 928

Query: 359  RMLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESM 418
             M  R+   +I+P     LA     +      +P T  QA+ D   ++W  AM  E+ + 
Sbjct: 929  SMGTRAKAGIIKPNPKYSLA-----VSLAAESEPRTAIQALKD---ERWRNAMGSEINAQ 988

Query: 419  YFNQVWDLVDLPEG-VKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFS 478
              N  WDLV  P   V  +GC+WI+ +K ++ G +  +KARLVAKGY Q+  +DY ETFS
Sbjct: 989  IGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLNRYKARLVAKGYNQRPGLDYAETFS 1048

Query: 479  PVAMLKSIRILLSIATFYDYEIWQKDVKTAFLNGNLDENIFMSQPEGFVAQ--------- 538
            PV    SIRI+L +A    + I Q DV  AFL G L ++++MSQP GF+ +         
Sbjct: 1049 PVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKL 1108

Query: 539  -----GQEQ-------------------KNVDEPCVYKRIINNSVAFLVLYVDDILLIGN 598
                 G +Q                    +V +  ++      S+ ++++YVDDIL+ GN
Sbjct: 1109 RKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGN 1168

Query: 599  DVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNPE--RDAEITWVVSAERDAEIITTV 629
            D   L +    LS  F +K   E  Y LGI+  R P     ++  +++       +IT  
Sbjct: 1169 DPTLLHNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAK 1228


HSP 2 Score: 62.0 bits (149), Expect = 3.5e-08
Identity = 26/79 (32.91%), Postives = 50/79 (63.29%), Query Frame = 0

Query: 216 TPQQNGVSERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFEL 275
           TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N  P+  +  E+PF+ 
Sbjct: 618 TPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQK 677

Query: 276 WKGRKVSLRHFRIWGCPAH 294
             G   +    R++GC  +
Sbjct: 678 LFGTSPNYDKLRVFGCACY 696

BLAST of Lag0015614 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 78.2 bits (191), Expect = 4.7e-13
Identity = 35/86 (40.70%), Postives = 55/86 (63.95%), Query Frame = 0

Query: 401 WVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQ 460
           W +AM  E++++  N+ W LV  P     +GCKW++K K  + G +   KARLVAKG+ Q
Sbjct: 40  WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQ 99

Query: 461 KEEVDYEETFSPVAMLKSIRILLSIA 487
           +E + + ET+SPV    +IR +L++A
Sbjct: 100 EEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of Lag0015614 vs. ExPASy TrEMBL
Match: A0A5A7TZD0 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G00090 PE=4 SV=1)

HSP 1 Score: 808.9 bits (2088), Expect = 1.9e-230
Identity = 444/751 (59.12%), Postives = 499/751 (66.44%), Query Frame = 0

Query: 20  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCL 79
           + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL
Sbjct: 223 SSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCL 282

Query: 80  LEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQK 139
           +E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E++A+LN+EMF+TA TQ KRQ+
Sbjct: 283 IEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQR 342

Query: 140 VS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGY 199
           +S   +TYLWHLRLGHINL+RIGRLVKNGLL++L+D +LP CESCLEGKMTKRPFTGKGY
Sbjct: 343 ISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGY 402

Query: 200 RANEPLEL--------------------------------------------KNMEY--- 259
           RA EPLEL                                            K  EY   
Sbjct: 403 RAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTE 462

Query: 260 -------------------------------------LSALGTPQQNGVSERRNRTLLDM 319
                                                LSA GTPQQNGVSERRNRTLLDM
Sbjct: 463 VENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDM 522

Query: 320 VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAH 379
           VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAH
Sbjct: 523 VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH 582

Query: 380 V----------------------------------------------------------K 439
           V                                                          K
Sbjct: 583 VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSK 642

Query: 440 IVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRML 499
           +VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM 
Sbjct: 643 LVLSE----ATDESTRVVDEVGPSSRV---------------DETTTSGQSHPSQSLRMP 702

Query: 500 RRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFN 559
           RRSGR+V QP RYLGL ET  VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN
Sbjct: 703 RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN 762

Query: 560 QVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM 594
            VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYTQ+E VDYEETFSPVAM
Sbjct: 763 SVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQREGVDYEETFSPVAM 822

BLAST of Lag0015614 vs. ExPASy TrEMBL
Match: A0A5A7T2V9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760 PE=4 SV=1)

HSP 1 Score: 799.7 bits (2064), Expect = 1.1e-227
Identity = 440/751 (58.59%), Postives = 496/751 (66.05%), Query Frame = 0

Query: 20  ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCL 79
           + +  L+  EMTL+VG G++ISA+ VGD K+FF   ++M L+N+YIVP+IKRNL+S+SCL
Sbjct: 223 SSFKQLEDSEMTLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCL 282

Query: 80  LEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQK 139
           +E  YS++FS+N AFI K G  ICSAK+ENNLYVLRP E++A+LN+EMF+TA TQ KRQ+
Sbjct: 283 IEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQR 342

Query: 140 VS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGY 199
           +S   +TYLWHLRLGHINL+RIGRLVK+GLL++L+D +LP CESCLEGKMTKRPFTGKGY
Sbjct: 343 ISPNNNTYLWHLRLGHINLDRIGRLVKDGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGY 402

Query: 200 RANEPLEL--------------------------------------------KNMEY--- 259
           RA EPLEL                                            K  EY   
Sbjct: 403 RAKEPLELIHSDLCGPMNVKARGSFEYFISFIDDYSRYGYLYLMEHKSEALEKFKEYKTE 462

Query: 260 -------------------------------------LSALGTPQQNGVSERRNRTLLDM 319
                                                LSA GTPQQNGVSERRNRTLLDM
Sbjct: 463 VENLLSKKIKIFRSDRGGEYMDLIFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLDM 522

Query: 320 VRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAH 379
           VRSMMSYAQLPSSFWGYAVETAV+ILN  PSKSVSETPFELW+GRK SL HFRIWGCPAH
Sbjct: 523 VRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAH 582

Query: 380 V----------------------------------------------------------K 439
           V                                                          K
Sbjct: 583 VLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPKENRVFVSTNATFLEEDHMRNHKPRSK 642

Query: 440 IVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELRML 499
           +VLSE    AT+ESTRVVDE GPS+RV               DE +TSGQ+HPSQ LRM 
Sbjct: 643 LVLSE----ATDESTRVVDEVGPSSRV---------------DETTTSGQSHPSQSLRMP 702

Query: 500 RRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFN 559
           RRSGR+V QP RYLGL ET  VIPDDGVEDPL+++QAMNDVD DQWVKAMDLEMESMYFN
Sbjct: 703 RRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFN 762

Query: 560 QVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAM 594
            VW+LVDLPEGVKPIGCKWIYKRKRD+AGKVQTFKARLVAKGYT+KE VDYEETFS VAM
Sbjct: 763 SVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTRKEGVDYEETFSSVAM 822

BLAST of Lag0015614 vs. ExPASy TrEMBL
Match: E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)

HSP 1 Score: 753.4 bits (1944), Expect = 9.2e-214
Identity = 416/732 (56.83%), Postives = 482/732 (65.85%), Query Frame = 0

Query: 20   ADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISISCL 79
            + +  L  GE+TL+VG GE++SA+ VGD+ +FF ++RY++L +V  VP +KRNLISI+C+
Sbjct: 319  SSWKKLKEGEITLKVGTGEVVSAEAVGDLTLFF-QDRYLILKDVLYVPLMKRNLISIACI 378

Query: 80   LEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKRQK 139
            LE  Y++SF VN  FI  KG  ICSA  ENNLY LRP  +  +LN EMF+T ETQ K+QK
Sbjct: 379  LEHIYTISFEVNEVFILCKGIQICSAIRENNLYKLRPTRANVVLNTEMFRTLETQNKKQK 438

Query: 140  VSQSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGKGYRA 199
            VS + YLWHLRLGHINLNRI RLVK+G+L+QLED +LP CESCLEGKMTKR FTGKG RA
Sbjct: 439  VSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPPCESCLEGKMTKRSFTGKGLRA 498

Query: 200  NEPLEL--------------------------------------------KNMEY----- 259
              PLEL                                            K  EY     
Sbjct: 499  KVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYLLHHKSESFEKFKEYKAEVE 558

Query: 260  -----------------------------------LSALGTPQQNGVSERRNRTLLDMVR 319
                                               LSA  TPQQNGVSERRNRTLLDMVR
Sbjct: 559  NEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPSTPQQNGVSERRNRTLLDMVR 618

Query: 320  SMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCPAHV- 379
            SMMSYAQLP SFWGYA+ETA++ILN  PSKSV ETP+ELWKGRK SLR+FRIWGCPAHV 
Sbjct: 619  SMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYELWKGRKSSLRYFRIWGCPAHVL 678

Query: 380  -----------KIVL-----SELSG------------KATNESTRVVDEA---GPSTRV- 439
                       K+ L      E  G             +TN +    D      P +++ 
Sbjct: 679  VQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVSTNATFLEEDHXRNHQPRSKIV 738

Query: 440  --------ADEPSSEAGPSTRVIDEPSTSGQAHPSQELRMLRRSGRIVIQPERYLGLAET 499
                     D+PSS    ST+V+D+ + S Q+H SQELR+ RRSGR+V QP RYLGL ET
Sbjct: 739  LKEMFKNATDKPSS----STKVVDKANISDQSHTSQELRVPRRSGRVVHQPNRYLGLVET 798

Query: 500  PTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKW 559
              +IPDDGVEDPLT++QAMNDVD DQW+KAM+LEMESMYFN VW LVDLP  VKPIGCKW
Sbjct: 799  QIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSVWTLVDLPSDVKPIGCKW 858

Query: 560  IYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIW 594
            IYKRKRD AGKVQTFKARLVAKGYTQKE VDYEETFSPVAMLKSIRILLSIATFY+YEIW
Sbjct: 859  IYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLKSIRILLSIATFYNYEIW 918

BLAST of Lag0015614 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 6.4e-207
Identity = 409/753 (54.32%), Postives = 470/753 (62.42%), Query Frame = 0

Query: 18   GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISIS 77
            G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ 
Sbjct: 324  GISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS-FLLLENVYVVPDLKRNLISVK 383

Query: 78   CLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKR 137
            CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR + S+A+LN EMFKTA TQ KR
Sbjct: 384  CLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKR 443

Query: 138  QKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGK 197
             K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGK
Sbjct: 444  LKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGK 503

Query: 198  GYRANEPLEL--------------------------------------------KNMEY- 257
            G+RA EPLEL                                            K  EY 
Sbjct: 504  GHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYK 563

Query: 258  ---------------------------------------LSALGTPQQNGVSERRNRTLL 317
                                                   LSA GTPQQNGVSERRNRTLL
Sbjct: 564  AEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLL 623

Query: 318  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCP 377
            DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCP
Sbjct: 624  DMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCP 683

Query: 378  AHV--------------------------------------------------------- 437
            AHV                                                         
Sbjct: 684  AHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPR 743

Query: 438  -KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELR 497
             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR
Sbjct: 744  SKIVLNELSKETTEPSTRVVEEPSALTRVV-----HVGSSTR----------THQPQSLR 803

Query: 498  MLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMY 557
              RRSGR+   P RY+ L ET TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMY
Sbjct: 804  EPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY 863

Query: 558  FNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV 594
            FN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Sbjct: 864  FNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPV 923

BLAST of Lag0015614 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 6.4e-207
Identity = 409/753 (54.32%), Postives = 470/753 (62.42%), Query Frame = 0

Query: 18   GVADYPTLDAGEMTLRVGIGELISAKVVGDIKMFFSRERYMLLDNVYIVPRIKRNLISIS 77
            G++ +  L+ GEMT+RVG G ++SA  VG +++   +  ++LL+NVY+VP +KRNLIS+ 
Sbjct: 324  GISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKS-FLLLENVYVVPDLKRNLISVK 383

Query: 78   CLLEQEYSVSFSVNGAFITKKGADICSAKMENNLYVLRPIESRAILNNEMFKTAETQPKR 137
            CLLEQ YS++F+VN  FI K G +ICSAK+ENNLYVLR + S+A+LN EMFKTA TQ KR
Sbjct: 384  CLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKR 443

Query: 138  QKVS--QSTYLWHLRLGHINLNRIGRLVKNGLLSQLEDGTLPLCESCLEGKMTKRPFTGK 197
             K+S  ++ +LWHLRLGHINLNRI RLVKNGLLS+LE+ +LP+CESCLEGKMTKRPFTGK
Sbjct: 444  LKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELEENSLPVCESCLEGKMTKRPFTGK 503

Query: 198  GYRANEPLEL--------------------------------------------KNMEY- 257
            G+RA EPLEL                                            K  EY 
Sbjct: 504  GHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYK 563

Query: 258  ---------------------------------------LSALGTPQQNGVSERRNRTLL 317
                                                   LSA GTPQQNGVSERRNRTLL
Sbjct: 564  AEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLL 623

Query: 318  DMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVSETPFELWKGRKVSLRHFRIWGCP 377
            DMVRSMMSYA LP+SFWGYAV+TAVYILN  PSKSVSETP +LW GRK SLRHFRIWGCP
Sbjct: 624  DMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCP 683

Query: 378  AHV--------------------------------------------------------- 437
            AHV                                                         
Sbjct: 684  AHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPR 743

Query: 438  -KIVLSELSGKATNESTRVVDEAGPSTRVADEPSSEAGPSTRVIDEPSTSGQAHPSQELR 497
             KIVL+ELS + T  STRVV+E    TRV        G STR           H  Q LR
Sbjct: 744  SKIVLNELSKETTEPSTRVVEEPSALTRVV-----HVGSSTR----------THQPQSLR 803

Query: 498  MLRRSGRIVIQPERYLGLAETPTVIPDDGVEDPLTFRQAMNDVDHDQWVKAMDLEMESMY 557
              RRSGR+   P RY+ L ET TVI D  +EDPLTF++AM DVD D+W+KAM+LE+ESMY
Sbjct: 804  EPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMY 863

Query: 558  FNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQKEEVDYEETFSPV 594
            FN VWDLVD P+GVKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ E VDYEETFSPV
Sbjct: 864  FNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPV 923

BLAST of Lag0015614 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 153.7 bits (387), Expect = 6.2e-37
Identity = 83/247 (33.60%), Postives = 136/247 (55.06%), Query Frame = 0

Query: 384 EDPLTFRQAMNDVDHDQWVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTA 443
           ++P T+ +A   +    W  AMD E+ +M     W++  LP   KPIGCKW+YK K ++ 
Sbjct: 84  KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 444 GKVQTFKARLVAKGYTQKEEVDYEETFSPVAMLKSIRILLSIATFYDYEIWQKDVKTAFL 503
           G ++ +KARLVAKGYTQ+E +D+ ETFSPV  L S++++L+I+  Y++ + Q D+  AFL
Sbjct: 144 GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 504 NGNLDENIFMSQPEGFVAQGQEQKNVDEPCVYKRII------------------------ 563
           NG+LDE I+M  P G+ A+  +    +  C  K+ I                        
Sbjct: 204 NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 564 ---NNSVAFL----------VLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVL 594
              ++   FL          ++YVDDI++  N+   + ++K  L + F+++ LG  +Y L
Sbjct: 264 QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

BLAST of Lag0015614 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 78.2 bits (191), Expect = 3.3e-14
Identity = 35/86 (40.70%), Postives = 55/86 (63.95%), Query Frame = 0

Query: 401 WVKAMDLEMESMYFNQVWDLVDLPEGVKPIGCKWIYKRKRDTAGKVQTFKARLVAKGYTQ 460
           W +AM  E++++  N+ W LV  P     +GCKW++K K  + G +   KARLVAKG+ Q
Sbjct: 40  WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQ 99

Query: 461 KEEVDYEETFSPVAMLKSIRILLSIA 487
           +E + + ET+SPV    +IR +L++A
Sbjct: 100 EEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of Lag0015614 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 51.2 bits (121), Expect = 4.3e-06
Identity = 27/80 (33.75%), Postives = 45/80 (56.25%), Query Frame = 0

Query: 227 NRTLLDMVRSMMSYAQLPSSFWGYAVETAVYILNVAPSKSVS-ETPFELWKGRKVSLRHF 286
           NRT+++ VRSM+    LP +F   A  TAV+I+N  PS +++   P E+W     +  + 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 287 RIWGCPAHVKIVLSELSGKA 306
           R +GC A++     +L  +A
Sbjct: 62  RRFGCVAYIHCDEGKLKPRA 81

BLAST of Lag0015614 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 44.7 bits (104), Expect = 4.1e-04
Identity = 25/50 (50.00%), Postives = 31/50 (62.00%), Query Frame = 0

Query: 545 FLVLYVDDILLIGNDVEYLADIKKWLSTEFQMKYLGEAQYVLGIQIIRNP 595
           +L+LYVDDILL G+    L  +   LS+ F MK LG   Y LGIQI  +P
Sbjct: 2   YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHP 51

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KAA0025945.13.8e-23059.12gag/pol protein [Cucumis melo var. makuwa] >KAA0026303.1 gag/pol protein [Cucumi... [more]
KAA0035907.12.3e-22758.59gag/pol protein [Cucumis melo var. makuwa][more]
ADJ18449.11.9e-21356.83gag/pol protein, partial [Bryonia dioica][more]
KAA0048404.11.3e-20654.32gag/pol protein [Cucumis melo var. makuwa][more]
TYK14550.11.3e-20654.32gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
P109782.0e-6434.08Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041463.8e-3926.03Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT943.4e-3235.51Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW21.0e-3129.30Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
P925204.7e-1340.70Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A5A7TZD01.9e-23059.12Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold1163G000... [more]
A0A5A7T2V91.1e-22758.59Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold56G00760... [more]
E2GK519.2e-21456.83Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1[more]
A0A5A7SMH86.4e-20754.32Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
A0A5D3CPJ66.4e-20754.32Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... [more]
Match NameE-valueIdentityDescription
AT4G23160.16.2e-3733.60cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.13.3e-1440.70Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00710.14.3e-0633.75Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
ATMG00810.14.1e-0450.00DNA/RNA polymerases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 206..287
e-value: 2.4E-15
score: 58.3
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 130..187
e-value: 2.1E-12
score: 46.7
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 415..528
e-value: 4.1E-37
score: 128.1
coord: 534..593
e-value: 4.6E-10
score: 39.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 307..351
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 334..349
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 384..595
coord: 217..294
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 204..279
score: 9.082294
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 415..590
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 210..287

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0015614.1Lag0015614.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding