Spg021704 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg021704
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Location: scaffold2: 3426574 .. 3436484 (-)
RNA-Seq Expression: Spg021704
Synteny: Spg021704
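
The location field above packs the scaffold name, start, end, and strand into a single string. A minimal sketch of how it could be split into parts is shown here; the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive (the GFF3 convention), which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a string like 'scaffold2: 3426574 .. 3436484 (-)' into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("scaffold2: 3426574 .. 3436484 (-)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under the inclusive-coordinate assumption the gene spans 9,911 bp on the minus strand of scaffold2.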
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAGCCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGTGCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCATCTGTTGAATTCAGGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATCATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGATTCCATAAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATATGATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTATGACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACTGAAATATCCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTACCTTCATCCCTTGCCGGCGGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCCTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAGTGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGGTTTATCTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGTAAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATTCAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAACAAGATAATAGCATCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAATTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGATGTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCATCATGGTTCTTCCAACTGGCTCAATACCCAAACCACCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGGTACAAAATTTACAGAGTACAGTTCATTACGATAAATCGGCTACTTTGGCTTTATTGGAAGGAGAATCAATTGATCAATGAAGTTTCTTTCATGGAATGTTAGAGGTTTGGGCTCTTGGAAAAAAAAGAGCATTAATTAAGAAGACCATCCA
ACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTTATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTCCCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGAAGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATTTTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTTACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATATCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTCTTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCTCTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCTTATTTCTCAATTGAAGAGTTTGGACAGTATTGGGGATGAGCACATTTTATCTACAGATCAGAAAGTACAGAGACGATTATTGAGGGAACAAATTGAAGACCAGACAGCCCGTGATCATATTGCTTGGCAACAAAGATGTAAGTTACAATGGCTCAAGGAAGGTGATGAAAATACTAGATTTTTTCATCGTATCATGGCTGCCCGTAAACGGAAAAACTCTATTCATGAGGTTCTTTCTAGAAATGGAATCAGTTTACTCACTGCTAGTGATATTGAGACGGAGTTCATTAGCTTCTACCAGTCTTTATTTACTAAAGACTATAATCACCGTTTTCTCCCAATAAACGTTGATTGGAGCCCTATCAGTGCAAATCAGTCAGCAGGCTTGGAATTGGCTTTTTCGGAGGAAGAAGTTTATCAGGCTGTAAAATCGCTAGGTACAAATAAATCTCCAGGTCCAGATGGTTTTACTGCCGAATTTTTTAAATACTCATGGCATATTATTAAATCGGATGTTATGACAATGATCAGAGATTTTTTCACCACAGGTATTATTAATGTGAGTCTGAATGAAACATATATTTGTCTAATCCCAAAGAAACTGGACTCCAAATCTGTCTCAGATTATCGGCCTATTAGTCTGATTCCATGTGCATACAAGATCATTGCTCCTATTCTGTCCAACAGATTAAAGTTAGTTTTGCCATCTACTATCGCATATAATCAATTGGCTTTTGTAGCCAACAGACAAATTTTAGATGCTTCTTTAATGGCCAATGAGTTGATTGATGATTGGACTATTTCTAATAAGAAAGGTGTGGTTCTTAAACTCGATCTGGAGAAGGCTTTTGATAAGATTGATTGGGACTTCTTGGATGCAGTTCTTCAAGCCAAAGGTTTTGGTTTGATTTGGAGAAAATGGATTCATGGTTGCATTTCAAGTGTTAACTACTCTATTATCATTAATGGTAGACCTAGGGGAAAAATTATTCCTTCTCGTGGTATTCGCCAAGGTGATCCTCTTTCTCCTTTTCTGTTCATATTGGTTTCTGATTGCCTCAGTCGTCTTTTATCTCACAGTGCAAGATTGGGTAGAATTATTTCTCATCCAATAGGTAACTCTCGTCTCTCATTGACACACTTGCAATTTGCGGACGATACTCTTC
TTTTCTCCATCTATGATTCTAAAGCATTGGATAATCTTTTTGAGATTATCAAACTCTTTGAGATGGCTTCTGGTTTGAACATCAATTTTGCTAAGAGTGAGCTTTTGGGGATTCACATTGCCGACTCAGATATGGAATGTTTGACAGCAAAATTTGGTTGTAAGCAGGGTTCATGGCCTTCCACATATCTGGGGCTTCCTTTGGGAGGCAGTTCGAAAGGTTCTCAGTTTTGGCAGCCTGTTATTGAAAGAATCCAACATAAGCTTCATAACTGGAAATACTCTTTTATCTCCAAAGGTGGTAGGCACACCCTCATCCAATCTACTCTCTCAAGTATGCCCACATATTACTTATCCTTATTTAAATTACCTTCTAAGGTTGCAAAATCTCTGGATAAGCTTATTCGAGATTTTTTTTGGGAGGGCTCCAGAGGCGAGGGCGGCATGCATAATGTAAATTGGGAGACAACTCAGCTTCCAAAATTTATGGGTGGAGTAGGCATTGGAAATTTCCATCACCGTAATTTGGCTCTCTTATCAAAATGGATTTGGAGGTTTTTACATGAAGGTAATGCCCTTTGGCGCCAACTTATTATTGCTAAATATTATTTCTCAGAATCAACTTGTATTTGGCCTACACATATTCAGAGAGGATCTTTTAAATCTCCTTGGCGCTTTATTTGTTCTACCATAGATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTGAGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGACGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAATGATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCACAGTTAAATCTCTTATGGGAGATATGGTTGGTGATTCTGACCCCACATCGAGCAAATTATATAATGTGGTGTGGAAAGACGTTTATCCAAAGAAGATCAAAATTTTTATCTGGGAGCTTAGTCTTGGAGCTATTAATACGTCTGATCGACTTCAAAGACGAATGCCTTATTTGCACCTTTCTCCATCCTGGTGTGTTATGTGTTGTTCTGATGCTGAAAATACTTGTCATCTATTTGTGCATTGCTCCTTTGCTTCCCGTTATTGGTCTACAATCTTCAATGCCTTTGAGTGGTCCTTGGCTCTACCAAACAACATTTATGATGTTCTTGCTTCCATTTTTGTGGGACATCCCTTCCATGGTGTGAAGAAGATCCTTTGGCTTGCTCTTAACCGGGTCTTCCTCTGGTTTCTTTGGGGCGAAAGGAATGGTCGAATTTTCAGGGATTCTTTCTCATCTTTTGAGAACTTTATGGATTTGATCCTTTTTTATGCTTTATATTGGTGCAAATGTAAACATCCATTTTCTGATTATAGTCTTTCTTCTTTAATTCTTAATTGGAGATCTTTCTTGTAATCACCTTCTTAGGTTTTGGAGTTTTACTCCTTTTATTTCATTTATCAATGAAATTTGTTTCCCTTCTCTAAAAAAAGATAAGAATTAAGAACCAAGAACAACAGACTCTCACCTTCAAAAAGAAGCTTATTGAAAAAGCGAACGAATTTTCTGGAGTAAACTTTTTTGTCAAATGCATAGTCGATCAAGATACTTTTCAACTTTAAAGGGAAAAAGAATCTAGACTTGAGTAAAGATAATTCCTTTGTGATTCTCTTACCTGCGCTTATGATGAATATGATATCAGACCTGCTCACTGCCAAGTGTTAAAATAAAATTGCATCTGATGGACCAATGTTTTGGGTTCTCAAATGGACATTTTTAAGGTTTCAAGCTCAGATGGATTAACGGGAGGGGCATTGGATATCAGGAATGGAATGC
TCCAATGGGAAGTTTGAAACAAAATGAAATAGTATTAACTTAGCTGAAATCTTATGTGAATTTCAAATATTCTCTTGAATGAAACATCCTAAATAACAACTATGAATGTTCAAATTTTATGCTTGGCAAGTCTTACTGTGGGAGGGTTAACACTTTGGATCATGTTCAAAGACATTCTTCTTTAGTTTTGTCCCCGCAATGGTGCACTCTTTGCAAAAGACATGAGGAGGATTTGTTTCATTTGCTGTGGGAGTGTCAGTATGCTAACCACCTTTGGGATTGTTGGAGGAGCTCGTTCAGTGTTAGTGGTCCGCGTAACAGAGATGGGCGAGGGCCGTTGGAGGAGGTGCTCTTGCATCCTCCCTTTAGGGAGAAAGGAAATGTGTTGTGGCAAGCTTGTTTTTTTGCAGTTTTGTGGGGCGTTTGACTTGAGAGAAATAATAGAATTTTTAGGGAGAAGGAGAGATCAGGTGAGGAAGTTTGGAAGGTTGTTAGGTTTAATACTTCCTTGTGGGCGTCGGTCACGAGACCCTTTTGTAATTATGATCTTGGCATGGTGCTTTTGGATTGGAGTCCCTTTTTGTAATTTTGTCGGACTCCATCTTCTTTAGGGCTTTTTTTTTTTTTTGGTATGCCCTTGTATTATTTCATTTGTTCAATGAAAATTCGGTCTCTTATCAAAAAAAAAAAAAAAAAAAAAACTATGAATGTTCAATGTTGTACAATTCTAGAAATGTATGCAGCTTGGTTACTGCATCACAGAATTGTTGCCTGTATTTACTCATTGTACAACAAATTGCACTAGTCATTGCAATCCTAATTACAGTTAAGAAAAAATATGAATTGTTCCAGGCTGGTGGTGTTGTTGCTTCAAATGGTCGACTGCAGGGTCGTCTTGAAGTTTCTCGGGCATTCGGTGATCGCCAATTTAAAAAGGTTTTATGGCGTTCATTCATACACCCTGTTTATTATCCCATTTGTTCTTGTTTTTATTGTTTATAGTTTAAAATCTCATGGCAAAGATTAAGTTGGACGGGAAACATTGCATCTTGAAAAAAATTTATAACTAAAATCTTTTAAGAAACAATCCTTCTTCTGCAATAGATTAGAATATGAGGTTTTCCATTTCCCGTATGATATAAAAAACTACAACCAAATTTTCTTTTTAAGTTTCCTTTATGCATAGGCATGGCCAAACTATTACAAACGTTCTATCCTTAGCATTATCTTTTGAACCCAAATCTGAGCAGCAACACATACCACTTGCAGTTGGGTGTTATTGCAACTCCAGACATCCATTCTTTCGAATTGACTGATAAGGAGCATTTTATCATTCTCGGATGTGATGGATTTTGGGGGGTTAGTTTCACTTTCAGCACTGTTTACGTTTTTGTTTTATCATTCCTATTATTGTCTCTCATGTCTTTATCTAGTTTCCTCTTTCCTTTTTGACTTTTCTTTCATAATTGTTTCAGGTCTTTGGACCAAGCGATGCTGTTGATTTTGTCCAGAAATTATTGAAGGTAGGGGCACATGTTTCATATTTTACTAAGAATCTAACTGAGGGAGCGTTTTATTATCATGTGGAAGTACTCCATTTATTTGTTTATCCTTTCTTTTGGGCACCACTTTTCACATTAATCTCATCGTTAAGCAAAGTTTCCATTGTTCTAACTAATGATTTTTGGTTGGCTCTGTCAAACTATGTCCTTAATTATGACATTCTTCTTTTTCTTGCAAATTCATACTTTGGCATTCGTTTATGCTTTTACTCAGCACCAAGTCGAGTTGTTGCATAGATAATAAATTTTAATTATTGTTTTTCATTATTTCATTTTATCTTACTCTTTCAAATGTCATAACTTTAAAGAGGAAAAATGAACTCTCGATGGAGAATATTGGTTTTCTCCTTTCTCTGAAAAATTTTAATGTTTAAATTGAGTTTGATTGTGTTGAAAGGCATTTCTTCACTATGTGGTACATGCTAGTCTTGGAAATTT
GTTAGTTCGTGTGTATGTGCGCACATGCGCTTGTGTGTTTGTATAAGTAGAAACCTGTTCGTATATGTCAATTAGAAATCAAGGTCACAAAAAAGAAATCAAGGTCACAATAAAGGGAAGGTAGAGGGATCCTCCCTATTAGCCAAGAGAGTTACAAAAAGAAATAAATAAAATTAATAGAATAGCTCTCTTGTAAAATTCTTAAAGAGAATACAATACACCATCAAAAGGTCCAAAGATAACCCCATCAAAGGTCTATTGAGTAGTTGAAATTATATCCTTAAAAAGTTCTTTTGTTCCTTTCCAACAAAATAGCCCGGAAGGGTCCTAATCATAGTCTTCCTAAAGGTCTCTGTCTTGTCGAAAAGGCCCCAGCCGTAGATGCTTTGGACAAGAATTTCTGCTTAGTTGGAGGGGAACAACAATGAAAATTAGCAGCTTGAAGAAAATTGCCTTGGACATCGCGAGGGGACAAAGAACAAAAATATGACCGACCAATTCCCCCCCTTTTGACAAATAATACACTAAGAGGACGAGAGATACAGGTTCAAACACTTCTTTTATACCCTTATTCTATGTGTTAATCTTGCGATGGAGAGTCTAAAAATTGAATCAAACTATTTTCATGATCTTCCAACCCTGAATTGGCTTAGCAATTCCTTTTAGTTTTAGATGCACCCAAGATATGTCAAGAATATGATTGAATGAAGAGGAGATAGTAATGAACCCTAAAGTGTCAATACTCCTAGTAAAGGGAAGAAGACGAGCTCATTCCTTAACTTCCAAATCAATTAAGGTTTTCTTGACCTCAAGTTCCAACTAAGCATGGAGACTATTCCAAACATTTTCCATACGCTTCATGTTGTGGTTTGAGAGTTTAAAATTCTTAGGAAAATAATCTTTGAAGGACATGCCCCCAATCCAAACATCCTTCAAAAACTTTATTTTGCTTCCATCACCGAGTTTGGAACAAAAAAAAAGCCTCCACTGTCCACTGAGGATTTGTAAGAAGCTGTCAGCTTCCATGGCCTTCTTTCTATCACGACATCCTTTGGACTTCTTTGTAAACCAACCAAAGGAAGTCAAATGCTTCCTCATTTACAAAATCTCCAAATTTCCTCTAGCAATTAGAGCCATGTGTTTCAGCAACGTGAGCATAAAATAAATTTTTAGCTTTATGTAGCTGCAAAATTTTCTTTGCTTTCAACTTTTACTCAGGCCACCTGTTGAACAGTAAGTCAGTGTTATATCTCTTCTGGCCTTTGCATGAGGGGGTGACTATGGTAAGTTGTTTTGTGTTATTAAAGACTCACCTTGATCAAGTATCACGAAAGATATCCTTGTTGAACAGGAGGGCTTGCCCATAACATCAGTAAGTCGACGCCTTGTGCGGGAAGCTATTCGTGAACGGCGCTGTAAAGATAACTGTACTGCAATGGTTATTGTCTTCAGGCCCAAATGATTAAGCGAGAAATAATTCCGAATGATTTGAGATTTCTGTATTACTGGTGCTGCTCAAGCGAACTCCCTAGACAGATAACTTTTTTGTACAAGTGGGTAGACTGCTTTACTAGGGATTGTGGAATCCGCCAGCATTAGAGATGCAGGGTTTGGAGGGTTTGGAAGCTGATGTGGAAACAGTGAGATACCCAGTGTTCCATTTTGTGTGCAACTTCAGTTTCAAGATTTTGTAAGATACTGATGATACTACCTTCCAACAATATTGTGCAAGTAAAGATTTTCTTAGTTGTTTGTATGTTGGTGGAAATATGTGAAAACTCTACAACAATGTTGATTTCCTGCTTAGATATGTATTCTTGTTCATGACAAAACAATCAGGAGTGATGATTCCATGATTGGAACTCAATGATATTCATAACTAAAATCACTTAAAAGTAGGGACTTGTTGAT

mRNA sequence

ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAGCCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGTGCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCATCTGTTGAATTCAGGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATCATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGATTCCATAAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATATGATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTATGACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACTGAAATATCCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTACCTTCATCCCTTGCCGGCGGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCCTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAGTGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGGTTTATCTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGTAAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATTCAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAACAAGATAATAGCATCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAATTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGATGTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCATCATGGTTCTTCCAACTGGCTCAATACCCAAACCACCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGGTTTGGGCTCTTGGAAAAAAAAGAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTTATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTC
CCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGAAGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATTTTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTTACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATATCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTCTTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCTCTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCTTATTTCTCAATTGAAGAGTTTGGACAGTATTGGGGATGAGCACATTTTATCTACAGATCAGAAAGTACAGAGACGATTATTGAGGGAACAAATTGAAGACCAGACAGCCCGTGATCATATTGCTTGGCAACAAAGATGTAAGTTACAATGGCTCAAGGAAGGTGATGAAAATACTAGATTTTTTCATCGTATCATGGCTGCCCGTAACTCTCGTCTCTCATTGACACACTTGCAATTTGCGGACGATACTCTTCTTTTCTCCATCTATGATTCTAAAGCATTGGATAATCTTTTTGAGATTATCAAACTCTTTGAGATGGCTTCTGGTTTGAACATCAATTTTGCTAAGAATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTGAGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGACGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAATGATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCACAGTTAAATCTCTTATGGGAGATATGGTTGGTGATTCTGACCCCACATCGAGCAAATTATATAATGTGGTGTGGAAAGACGTTTATCCAAAGAAGATCAAAATTTTTATCTGGGAGCTTAGTCTTGGAGCTATTAATACGTCTGATCGACTTCAAAGACGAATGCCTTATTTGCACCTTTCTCCATCCTGGTGTGTTATGTGTTGTTCTGATGCTGAAAATACTTGTCATCTATTTGTGCATTGCTCCTTTGCTTCCCGTTATTGGTCTACAATCTTCAATGCCTTTGAGTGGTCCTTGGCTCTACCAAACAACATTTATGATGTTCTTGCTTCCATTTTTGTGGGACATCCCTTCCATGGTGTGAAGAAGATCCTTTGGCTTGCTCTTAACCGGGTCTTCCTCTGGTTTCTTTGGGGCGAAAGGAATGGTCGAATTTTCAGGGATTCTTTCTCATCTTTTGAGAACTTTATGGATTTGATCCTTTTTTATGCTTTATATTGA

Coding sequence (CDS)

ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGGAACCACTCCACCAGATCTATCTACATTGATCGAAAAACTTTCTCCATTGAATTTGATGAACCTTCTAGGGGAAGCCGAGCAAAAATCACAGAGCATAGTAGAGCCTCCTCCCATTCCTTAACTTTGTCTTGGAAATCTCTCCATTGGCTAGCATCCTCCTTCAAAACTCTTGCCCATGAACCGTGCTCCTACAAATTCTCCTCCAAGATAAGAACTGATGACTATGTTCTCTGGTTGGAAAAACTCAGCAATAAGTATGGCTTCTTTGTGGAAATTAATCATCTGTTGAATTCAGGTGATCGACGTCGACTCCTTATACCATCTGAAGACAACAAGCAAGGTTGGTTCTCCTTTTTCTCCCTCATCTCTGATTACCCAGGAGAGGCTCATCGATCAACAAAATCATATAAAGATGTCCTCCAACAAAAGGAAAGTCATGTTGTCACCACTCACCCTTCATCATCTGTCCCTTCACCACAGCCTCTCGACAGTGAGATTATTGTTGTTCAACGATTCCATAAAAAGGATGATTGGCCTTCCATTCGGAACACCATTCTTGCCGGCATATCCCACCGTTGCTCCATCAATCCATTTCAAGATAATAAAGCTTTGTTACATGTATATGATCAAAATATCGTGTCAAAACTTTGCAACAACAAGGATTGGTCCTCCATTGGCAAATACCGATTGAAGTTTTACCCATTGACTACCGACTCATTTTATCAAGACACTATGACTAATTCTTTTGGTGGATGGATTGAAGTGCTGCAACTTCCTTTACCTTTATGGACAGAACAAATTTTCAGATATATTGGTGATGTTTGTGGAGGCTTCACTGAAATATCCAACCACACCAGCAGGAAGCTAAATCTCACAGCGGCAAAGATTAAAATCCGGCAAAATTCCATCGGTTTCATCCCGGCCAGAATTAAGCTACCTTCATCCCTTGCCGGCGGCGACGTTACAGTGGAAATCAAAGGGTTGACGGCCAGCCTTTTCAAATCAGCGAGATTTGAGGAATCCCCGTCATTTTCGGAACAAAATAATTTAGAAATTAAGAGGAGTGAGAAATTGGATGGAAAAAATTTGAAACTACCAGAGGAAATCAAATCGCCTCCTAAAAATCAGGAATTCATTCCTATTTTGGCCGAAACTGCTTCTCCAAAGGGTTTATCTCCTTGTCTGGTTTCTCCCCAAACTATATCACAGCCAAGAAGTATTTTGCAAGCGGCAGATCACTCCAATATATCTCCTAAAAAAAAGACAGCACACTCCAACAAAGGTAAATCTCCCCTCCACGTGGCCTCCCCCATTGAGGCAAAGAATCATTCAAATTATTTTCTTCCAGTGGGACCCACCACTCTTGGTTTAGGAGAAAAGAAATCAACAGGCAACAAGATAATAGCATCTGATACTGAGGCTTACTTATCAAGTCCAGCCAATGACAAATCCCCTCATACTTCGGTTTGTGACCCTACATCGCCTCGAAACTTTGACCTTGCAATTTTTGATGAGTTACATTTACCCGAGTCGGAACAAATTCCATTGGCAAGCTTACCTCCGACCTCCCACATTCCATCATCCCCCCATATTATTCCCTCTCCCACAAAAGATGTTACCCCACTACAACAACCATCTAGCTCTCCACCCGAACCATCACCCCTTTCCCTCCCAACATATCTCTGTCATTTAGCTCCAATGCTTAGTAAACATGGTTTATGCATCATGGTTCTTCCAACTGGCTCAATACCCAAACCACCAACTAAGAAAACTAAAGCTACTACAGGGAAAAAGTCAAAACTTAAGAGAGAGGTTTGGGCTCTTGGAAAAAAAAGAGCATTAATTAAGAAGACCATCCAACAACAAAATCCGAGCTTCGTGCTACTTCAAGAAACTAAAAAGACATCGGTTGATGGAAAATTTATTAAATCTATATGGAGTTCTTCTTGCATTGGTTGGGCTTC
CCTTGATTCCATTGGAGCATCCGGAGGCATCCTTATTCTTTGGAGTGATCCTGATTTCACGATCAAAGAAGTTATTCAAGGTCACTTTTCAATCTCAATTCATGTTTTTATGGCTGACGGTTTTTCTTTTTGGCTTTCGGCTATTTATGGTCCTTCTAGGCGTGAGCACCGTGCAGATTTTTGGCAAGAACTCCATGATTTGGCTGGTTTAGGTGGTGATCGATGGATCCTTGGAGGGGATTTTAATGTTACTCGCTGGTCTTGGGAGAAATCTCATGGTCGAAATGTTACTAGGAGCATGCGCACTTTCAATCAATGGATTGCCAATTACCATCTCTTGGACATTCCACTACAAAATGGCTCTTACACTTGGTCTAGCTTTGGGGATGACATTGAATATCTCTCACTTCTGGATAGATTTTTATTAACAAATGATTGCCTTCACAAATTTGGGTCAGCAAATCTCCTTCGTCTTGATAGAGTCACATCAGATCACTACCCTTTAGCTCTTTCTTTTGGAGACATAGCTTGGGGGCCTTGTCCTTTCCGTTTTGACAATGCTTGGTTACATATTGAGTCCTTTCGTGAAGTTCTGAAAAACTGGTGGAACCAAAATCCTCTCCAAGGCTGGCCAGGGCATGGTTTTATGATGAAACTCAAGGGATTGAAAATGGAACTAAGAAAATGGAACATCACGAATCGTAATGATGTTTCCCAACTACCATCTCTTATTTCTCAATTGAAGAGTTTGGACAGTATTGGGGATGAGCACATTTTATCTACAGATCAGAAAGTACAGAGACGATTATTGAGGGAACAAATTGAAGACCAGACAGCCCGTGATCATATTGCTTGGCAACAAAGATGTAAGTTACAATGGCTCAAGGAAGGTGATGAAAATACTAGATTTTTTCATCGTATCATGGCTGCCCGTAACTCTCGTCTCTCATTGACACACTTGCAATTTGCGGACGATACTCTTCTTTTCTCCATCTATGATTCTAAAGCATTGGATAATCTTTTTGAGATTATCAAACTCTTTGAGATGGCTTCTGGTTTGAACATCAATTTTGCTAAGAATCTTGTTGCTAGTCGTATTCAGAGACGGCTTGGAAATGGTTGTTCCACCCTTTTTTGGCATGATTCTTGGCTAAGTTGTGGAGTCTTGTCTGAGGCTTTCCCTCGTCTTTATAGATTATCTAATCGCTCGGACGGTACAGTTGCTGACTTTTGGGTTTCATTGAATTCGGCTTGGGATTTGAGTCTTCGTCGAAATTTAAATGATTCGGAGACAAATGAGTGGGCTAGTCTCTCTCATCTGCTTTCTTCCATCAGAATTCGAGTTATTGATGACACTTGGTCTTGGCCTATTGATTCGTCTAATGCATTCACAGTTAAATCTCTTATGGGAGATATGGTTGGTGATTCTGACCCCACATCGAGCAAATTATATAATGTGGTGTGGAAAGACGTTTATCCAAAGAAGATCAAAATTTTTATCTGGGAGCTTAGTCTTGGAGCTATTAATACGTCTGATCGACTTCAAAGACGAATGCCTTATTTGCACCTTTCTCCATCCTGGTGTGTTATGTGTTGTTCTGATGCTGAAAATACTTGTCATCTATTTGTGCATTGCTCCTTTGCTTCCCGTTATTGGTCTACAATCTTCAATGCCTTTGAGTGGTCCTTGGCTCTACCAAACAACATTTATGATGTTCTTGCTTCCATTTTTGTGGGACATCCCTTCCATGGTGTGAAGAAGATCCTTTGGCTTGCTCTTAACCGGGTCTTCCTCTGGTTTCTTTGGGGCGAAAGGAATGGTCGAATTTTCAGGGATTCTTTCTCATCTTTTGAGAACTTTATGGATTTGATCCTTTTTTATGCTTTATATTGA

Protein sequence

MTTAISATAQPWNHSTRSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSYKFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRLLIPSEDNKQGWFSFFSLISDYPGEAHRSTKSYKDVLQQKESHVVTTHPSSSVPSPQPLDSEIIVVQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVYDQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIFRYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAGGDVTVEIKGLTASLFKSARFEESPSFSEQNNLEIKRSEKLDGKNLKLPEEIKSPPKNQEFIPILAETASPKGLSPCLVSPQTISQPRSILQAADHSNISPKKKTAHSNKGKSPLHVASPIEAKNHSNYFLPVGPTTLGLGEKKSTGNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTSHIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSIPKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLISQLKSLDSIGDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNLVASRIQRRLGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTCHLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLWFLWGERNGRIFRDSFSSFENFMDLILFYALY
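
The protein sequence above should correspond to a translation of the coding sequence. The sketch below shows one way to check this, assuming the standard nuclear genetic code; the record does not say which translation table was used, and the names `CODON_TABLE` and `translate` are illustrative only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 36 nucleotides of the CDS listed above.
print(translate("ATGACCACTGCTATCTCTGCCACAGCTCAACCTTGG"))  # MTTAISATAQPW
```

The twelve residues printed match the start of the protein sequence, and the final codons of the CDS (GCT TTA TAT TGA) translate to ALY followed by a stop, matching its end.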
Homology
BLAST of Spg021704 vs. NCBI nr
Match: XP_022158956.1 (uncharacterized protein LOC111025405 [Momordica charantia])

HSP 1 Score: 359.0 bits (920), Expect = 1.7e-94
Identity = 174/367 (47.41%), Positives = 236/367 (64.31%), Query Frame = 0

Query: 622 KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILW 681
           KK ALIK+ I + NP+ V+LQETK + +D   +KS+WS+  I W++LD+ G + GILILW
Sbjct: 15  KKGALIKQFISRLNPNVVILQETKLSYMDILIVKSLWSAHGINWSALDASGMASGILILW 74

Query: 682 SDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDR 741
           +DPD    E+I+G FS++I+  ++DGF FW+S IYGPS  E    FWQEL DL+ L  + 
Sbjct: 75  NDPDLKAAEMIEGVFSLTINFCLSDGFLFWVSGIYGPSTTEFHYLFWQELLDLSDLCENH 134

Query: 742 WILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYL 801
           WIL GDFNVTRWSWEKS+GR +T+SM  FN +I +  L+D+PL NG +TWS         
Sbjct: 135 WILAGDFNVTRWSWEKSNGRPLTKSMWLFNSFIEDSSLIDVPLTNGQHTWSRNTS----F 194

Query: 802 SLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESF 861
           SL+D FLLTN C+ K G     R+ R TSDH+P+ L FG   WG  PFRF+N WL  ++F
Sbjct: 195 SLIDCFLLTNGCIDKLGMPIAKRMTRTTSDHFPILLDFGQNNWGLTPFRFENMWLSHKTF 254

Query: 862 REVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDV-SQLPSLISQLKSLDSI 921
           +  L+ WW   PL GWPGHG MMKLK LK  ++ W   +   + SQ   L + + SLD +
Sbjct: 255 KPFLETWWGNKPLHGWPGHGLMMKLKSLKYAIKLWITEHFRCIHSQKEDLTNLMNSLDDL 314

Query: 922 GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNS 981
                ++ DQ   R   +E +    A++   W+QRCK +WL EGDENT+FFHR +A +  
Sbjct: 315 EGSQPVTPDQSRARIQAKEDLLSVVAKEEAFWRQRCKQKWLCEGDENTKFFHRFLANKRR 374

Query: 982 RLSLTHL 988
           R  +T +
Sbjct: 375 RSIITEI 377

BLAST of Spg021704 vs. NCBI nr
Match: RVX15530.1 (putative ribonuclease H protein [Vitis vinifera])

HSP 1 Score: 337.0 bits (863), Expect = 7.0e-88
Identity = 242/920 (26.30%), Positives = 364/920 (39.57%), Query Frame = 0

Query: 622  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILW 681
            KKR ++++ +  QNP  V+LQETK+ + D +F+ S+W    + WA+L + GASGGI+ILW
Sbjct: 406  KKRRIVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIVILW 465

Query: 682  SDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDR 741
                F   E + G FS+++     +  SFWL+++YGP     R DFW EL DL GL   R
Sbjct: 466  DSSKFECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLTFPR 525

Query: 742  WILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYL 801
            W +GGDFNV R   EK     +T +MR F+++I    LLD PL+N ++TWS+   D    
Sbjct: 526  WCVGGDFNVIRRISEKLGETRLTFNMRCFDEFIRESGLLDPPLRNAAFTWSNMQAD-PIC 585

Query: 802  SLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESF 861
              LDRFL +++    F  +    L R TSDH P+ L    + WGP PFRF+N WL    F
Sbjct: 586  KRLDRFLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLIHPEF 645

Query: 862  REVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLI-SQLKSLDSI 921
            +E  + WW +   +GW GH FM KLK +K +L++WNI    D+ +   LI   L  +D I
Sbjct: 646  KEKFRVWWQECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILMDLSRIDLI 705

Query: 922  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNS 981
              E  L+ D  ++R L R ++ED   ++ + W+Q+ +++W+KEGD N++FFHR+    + 
Sbjct: 706  EQEGNLNPDLVLERTLRRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGESW 765

Query: 982  RLS--------------------------------------------------------- 1041
            R+                                                          
Sbjct: 766  RVEGIDWVPISGESGVWLDRPFTEEEDLMRVFLEFHTNGVINQSTNATFIALVPKKRRLR 825

Query: 1042 ------------------------------------------------------------ 1101
                                                                        
Sbjct: 826  KVLHETISGSQGAFVEGRHILDAVLIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFL 885

Query: 1102 --------------------LTHLQFA--------------------------------- 1161
                                L+   FA                                 
Sbjct: 886  DHVLQRKGFSQKWRSWIRGCLSSSSFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVAD 945

Query: 1162 --DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNL------------------VA 1221
              +DT+ FS    + L NL  I+ +F   SGL IN  K+                   V 
Sbjct: 946  VLNDTIFFSKASLEHLQNLKIILLVFGQVSGLKINLEKSTISGLPLGGNPKTIGFWDPVV 1005

Query: 1222 SRIQRRLGNGCST----LFWHDSWLSCG-------VLSEAFPR----------------- 1281
             RI RRL    ++    +  +  W   G       V  E   R                 
Sbjct: 1006 ERISRRLDASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRPKELGGLGFGKISLRNI 1065

Query: 1282 ------LYRLSNRSDG----TVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIR 1296
                  L+R      G     +   + +  + WD ++   +  S    W +++ +     
Sbjct: 1066 ALLGKWLWRFPRERSGLWHKVIVSIYGTHPNGWDANM--VVRWSHRCPWKAIAQVFQEFS 1125

BLAST of Spg021704 vs. NCBI nr
Match: RVW99869.1 (Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera])

HSP 1 Score: 318.5 bits (815), Expect = 2.6e-82
Identity = 247/825 (29.94%), Positives = 348/825 (42.18%), Query Frame = 0

Query: 616  EVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASG 675
            E W L  +R+    T++ + P  V++QETKK   D + + S+W+     W  L + GASG
Sbjct: 484  EKW-LKCERSYALWTLRLEKPDVVMIQETKKEKCDRRLVGSVWTVRSKDWVILPACGASG 543

Query: 676  GILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLA 735
            GIL +W       +EV+ G FSIS+   +      W+SAIYGP+    R DFW EL+D+ 
Sbjct: 544  GILFIWDSKKLCKEEVVLGSFSISVKFALEGCGPLWISAIYGPNSPSLRKDFWVELYDIY 603

Query: 736  GLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFG 795
            GL    W +GGDFNV R S EK  G  +T SMR F+ +I+   LLD PL+N S+TWS+  
Sbjct: 604  GLTFPLWCVGGDFNVIRRSSEKLGGSRLTSSMRDFDSFISESELLDPPLRNASFTWSNM- 663

Query: 796  DDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAW 855
             +      LDRFL +N+    F       L R TSDH+P+AL      WGP PFRF+N W
Sbjct: 664  QESPVCKRLDRFLYSNEWGQLFPQGIQETLIRRTSDHWPIALDTNPFMWGPTPFRFENMW 723

Query: 856  LHIESFREVLKNWWNQNPLQGWPGH--------------GFMM----------------- 915
            L   SF+E  +NWW      GW GH              G ++                 
Sbjct: 724  LQHPSFKENFRNWWRGFQGNGWEGHKRNRKYIKSLENERGLVLNNVVSITEEILLYFEKL 783

Query: 916  ------KLKGLKMELRK---------------WNI---------------------TNRN 975
                   L  LKM+  K               W++                     TN +
Sbjct: 784  YANPKESLGVLKMDRDKAPGPDGFTIAVFQDCWDVIKEDLVRVFAEFHRSGVINQSTNAS 843

Query: 976  DVSQLP--------------SLISQLKS---------LDSIGDEHILSTD-------QKV 1035
             +  LP              SLI+ L           L  +  E I ST        Q +
Sbjct: 844  FIVLLPKKSTTKKISDFRPISLITSLYKIIAKVLSGRLRGVLHETIHSTQGAFVQGRQIM 903

Query: 1036 QRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADD 1095
               L+  +I D+  R         KL   +      ++       RN R  ++HLQFADD
Sbjct: 904  DAVLIANEIVDERRRSG-RKVSSLKLILKRLTITGEKYVGGFRVGRN-RTRVSHLQFADD 963

Query: 1096 TLLFSIYDSKALDNLFEIIKLFEMASGLNINFAK------NLVASRIQRRLGN-GCSTLF 1155
            T+ FS    + L  L  ++  F   SGL +N  K      NL  + I R      C    
Sbjct: 964  TIFFSNTREEDLQTLKSLLLAFGHISGLKVNLDKSNIYGINLDQAHISRLAETLECKASG 1023

Query: 1156 WHDSWLSCGV-----------------------LSEAFPRLYRLSNRSDGTVADFWVSLN 1215
            W   +L   +                       L   +P L+R+       V D  + ++
Sbjct: 1024 WPILYLGLPLAGTPGQEMGKEFVLGRFVVGDQPLGSQYPSLFRV-------VLDKNIPIS 1083

Query: 1216 S--------AWDLSLRRNLNDSETNEWASLSHLLSSIRIR-VIDDTWSWPIDSSNAFTVK 1275
            S        +W+L+ RRNL+DSE  +   L   L  + +   + D   WP+ SS  F+VK
Sbjct: 1084 SVLGPTRPFSWNLNFRRNLSDSEIEDLEGLMRSLDGVHLSPSVPDARLWPLSSSGLFSVK 1143

Query: 1276 SL---MGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHL 1296
            S    +    G      SK    VW    P K++ FIW ++   +NT+D LQ R PY  L
Sbjct: 1144 SFFLALSQFFGSPQVFPSKF---VWNSQIPFKVQSFIWLVAHKKVNTNDMLQVRRPYKAL 1203

BLAST of Spg021704 vs. NCBI nr
Match: TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 312.0 bits (798), Expect = 2.4e-80
Identity = 270/1062 (25.42%), Positives = 447/1062 (42.09%), Query Frame = 0

Query: 17   RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSY 76
            RS  ++RK F +  D+ S+ +   +TE     + S+ +S + L W+  + K+L   P + 
Sbjct: 63   RSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTN 122

Query: 77   KFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRLLIPSEDNKQGWFSFFSLIS- 136
            +F  + R  +  +W+ K  N  G   EI  +     +  +L+P   +K GW SF S+I+ 
Sbjct: 123  RFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPDKSGWVSFLSMITP 182

Query: 137  --------------------------DYPGEAHR---------STKSYKDVLQQKESHVV 196
                                      DY   ++          +T    D     +S   
Sbjct: 183  KVEVKAKTRPTFLPRTSPDCRLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHS 242

Query: 197  TTHPSSSVPSPQPLDSEIIVVQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVY 256
            +++     PS   L++ +++V+RF   DDW  I   +        + N F   KAL+H  
Sbjct: 243  SSNSFCDSPSSDLLENTVVIVRRFF-HDDWHKILQNLRKQTEESFTYNAFHAEKALVHFS 302

Query: 257  DQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF 316
                 + LC NK WS++GKY ++F   +        +  S+GGW     +PL LW    F
Sbjct: 303  SNIPANLLCQNKGWSTVGKYSVRFEKWSPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTF 362

Query: 317  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG---------- 376
            + IG  C G  +++  T    NL  A+IK+R N  GF+PA +++  +             
Sbjct: 363  QQIGKACEGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHP 422

Query: 377  -GDVTVEIKGLTASLFK---SARFEESPSFSEQNNLEIKR-------SEKLDGKNLKLPE 436
             G   +E        FK   +A F++    SEQ   E          S   DG+    P+
Sbjct: 423  EGKWLIERNVRLHGTFKRQAAASFDDFNPESEQFFFEGSEAISPDFLSTSSDGRKSSTPD 482

Query: 437  E-------IKSPPKNQEFIPILAE----------TASP------KGLSPCLVSPQTISQP 496
            +       I  P +N      L E          TA+        G+S   V  +   + 
Sbjct: 483  QPSALKSVIIKPDRNATLPSFLNEELVNDSNLHATANKSKLEILSGISNDGVLDKGKQKV 542

Query: 497  RSILQAADHSNISPKKKTAHSNKGKSPLHVASPIEA-KNHSNYFLPVGPTTLGLGEKKST 556
               LQ     N+   K+    N   +  ++ +P  A  NHS         +L   EKK  
Sbjct: 543  DIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHS--------PSLNSPEKKQK 602

Query: 557  GNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTS 616
             ++  +   ++  + P N K+        T P        D      S  + L  LP   
Sbjct: 603  VSRERSIKKKSSSTQP-NSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALD 662

Query: 617  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSI 676
               S      S   +V  +      P  P  + +P      +   + +     V      
Sbjct: 663  PNKSLEDHHNSDNAEVVDITNTEVVPETPE-MKMPVNENSNSSSEANYRKPKHVHKRKYY 722

Query: 677  PKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQN----PSFVLLQETKK--TSV 736
             +   +K K    +  K +   W   KK  L   T    +     + VLL +        
Sbjct: 723  YRKKEEKEKDPDSEAFKKQLVSWL--KKNGLKLSTDTDSSGATTSTNVLLNQMNSGLKIT 782

Query: 737  DGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFS 796
            + + IKS+W S+ I W + ++ G+SGGILILW   + ++    +G FS+S +  + +  S
Sbjct: 783  NKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSS 842

Query: 797  FWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRT 856
            +WL+ +YGP +R  R  FW ELH+L  L    WILGGD NV R   E +   + + + R 
Sbjct: 843  WWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHNSRM 902

Query: 857  FNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVT 916
             N +I+N  L+D PL N  +TWS+  +   + S +DRFL  +   + F       L R T
Sbjct: 903  LNNFISNNLLIDPPLTNNRFTWSNLRNPPTF-SRIDRFLYNSSWENLFSPHTTRTLPRST 962

Query: 917  SDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLK 976
            SDH+PL    S   ++WGP PFR ++  L    F+  +  WW  +   G+PG  F+ +LK
Sbjct: 963  SDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPEFKRNMGRWWENSIQAGYPGFSFIQRLK 1022

Query: 977  GLKMELRKWNITNRNDVSQL-PSLISQLKSLDSIGDEHILSTDQKVQRRLLREQIEDQTA 989
             L   ++ W     + ++    ++I ++ S+D    +  L+ ++  +R  L+  + + + 
Sbjct: 1023 SLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSL 1082

BLAST of Spg021704 vs. NCBI nr
Match: TYJ99315.1 (LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa])

HSP 1 Score: 103.6 bits (257), Expect = 1.3e-17
Identity = 54/161 (33.54%), Positives = 84/161 (52.17%), Query Frame = 0

Query: 1037 LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDS 1096
            L NG    FW+ +W   G LS A+PRL+ L+   + +V D W + ++ W++  RR LND 
Sbjct: 1597 LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDR 1656

Query: 1097 ETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPT----SSKLYN 1156
            E   W  +  +L + R        +W  DS+N+F++ S    +    D T     +KL  
Sbjct: 1657 ERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASAKVLISRQLDQTPGDPRAKLLE 1716

Query: 1157 VVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPS 1194
            ++WK   P KIK F+W L    INT + +Q++MP   L P+
Sbjct: 1717 IIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN 1757


HSP 2 Score: 299.3 bits (765), Expect = 1.6e-76
Identity = 156/395 (39.49%), Positives = 232/395 (58.73%), Query Frame = 0

Query: 622  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILW 681
            KKR ++K  +  + P  V++QETKK   D + + S+WS     WA+L + GASGGILI+W
Sbjct: 117  KKRRVVKNFLSSEKPDVVMIQETKKEECDRRLVGSVWSVRNKDWAALPASGASGGILIIW 176

Query: 682  SDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDR 741
                   +EV+ G FS+SI   M    S WLSA+YGP+    R DFW EL D+AGL   R
Sbjct: 177  DSIKMRREEVVLGSFSVSIKFAMDGCESLWLSAVYGPNNSALRKDFWVELSDIAGLSHPR 236

Query: 742  WILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYL 801
            W +GGDFNV R S EK  G  +T  M+ F+++I +  L+D PL++ SYTWS+  ++    
Sbjct: 237  WCVGGDFNVIRRSSEKLGGSRLTPCMKDFDEFIRDCELIDSPLRSVSYTWSNMQEN-PVC 296

Query: 802  SLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESF 861
              LDRFL +N+    F  +    L R TSDH+P+ L      WGP PFRF+N WL   SF
Sbjct: 297  KRLDRFLYSNEWEQVFPQSLQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHSSF 356

Query: 862  REVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVS-QLPSLISQLKSLDSI 921
            +E    WW++    GW GH FM KL+ +K +L++WN T+  ++S +   +++ L + DS+
Sbjct: 357  KENFGRWWSEFQGNGWEGHKFMRKLQFVKAKLKEWNKTSFGELSKKKKDILAVLANFDSL 416

Query: 922  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNS 981
              E  LS +  VQR   + ++E+   R+ I W+Q+ +++W+KEGD N++FFH++   R +
Sbjct: 417  EQEGGLSQELLVQRAFSKGELEELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRN 476

Query: 982  RLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFE 1016
            R  +  L+     +L +    K      EI+K FE
Sbjct: 477  RKFIKELENESGLMLNNPESIKE-----EILKYFE 505

BLAST of Spg021704 vs. ExPASy TrEMBL
Match: A0A6J1E2G6 (uncharacterized protein LOC111025405 OS=Momordica charantia OX=3673 GN=LOC111025405 PE=4 SV=1)

HSP 1 Score: 359.0 bits (920), Expect = 8.4e-95
Identity = 174/367 (47.41%), Positives = 236/367 (64.31%), Query Frame = 0

Query: 622 KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILW 681
           KK ALIK+ I + NP+ V+LQETK + +D   +KS+WS+  I W++LD+ G + GILILW
Sbjct: 15  KKGALIKQFISRLNPNVVILQETKLSYMDILIVKSLWSAHGINWSALDASGMASGILILW 74

Query: 682 SDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDR 741
           +DPD    E+I+G FS++I+  ++DGF FW+S IYGPS  E    FWQEL DL+ L  + 
Sbjct: 75  NDPDLKAAEMIEGVFSLTINFCLSDGFLFWVSGIYGPSTTEFHYLFWQELLDLSDLCENH 134

Query: 742 WILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYL 801
           WIL GDFNVTRWSWEKS+GR +T+SM  FN +I +  L+D+PL NG +TWS         
Sbjct: 135 WILAGDFNVTRWSWEKSNGRPLTKSMWLFNSFIEDSSLIDVPLTNGQHTWSRNTS----F 194

Query: 802 SLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESF 861
           SL+D FLLTN C+ K G     R+ R TSDH+P+ L FG   WG  PFRF+N WL  ++F
Sbjct: 195 SLIDCFLLTNGCIDKLGMPIAKRMTRTTSDHFPILLDFGQNNWGLTPFRFENMWLSHKTF 254

Query: 862 REVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDV-SQLPSLISQLKSLDSI 921
           +  L+ WW   PL GWPGHG MMKLK LK  ++ W   +   + SQ   L + + SLD +
Sbjct: 255 KPFLETWWGNKPLHGWPGHGLMMKLKSLKYAIKLWITEHFRCIHSQKEDLTNLMNSLDDL 314

Query: 922 GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNS 981
                ++ DQ   R   +E +    A++   W+QRCK +WL EGDENT+FFHR +A +  
Sbjct: 315 EGSQPVTPDQSRARIQAKEDLLSVVAKEEAFWRQRCKQKWLCEGDENTKFFHRFLANKRR 374

Query: 982 RLSLTHL 988
           R  +T +
Sbjct: 375 RSIITEI 377

BLAST of Spg021704 vs. ExPASy TrEMBL
Match: A0A438K2W1 (Putative ribonuclease H protein OS=Vitis vinifera OX=29760 GN=VvCHDp000001_863 PE=4 SV=1)

HSP 1 Score: 337.0 bits (863), Expect = 3.4e-88
Identity = 242/920 (26.30%), Positives = 364/920 (39.57%), Query Frame = 0

Query: 622  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILW 681
            KKR ++++ +  QNP  V+LQETK+ + D +F+ S+W    + WA+L + GASGGI+ILW
Sbjct: 406  KKRRIVRRFLSTQNPDIVMLQETKRETWDRRFVSSVWKGKRVEWAALPACGASGGIVILW 465

Query: 682  SDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDR 741
                F   E + G FS+++     +  SFWL+++YGP     R DFW EL DL GL   R
Sbjct: 466  DSSKFECTEKVLGSFSVTVKFNSGEEGSFWLTSVYGPINPLWRKDFWLELQDLYGLTFPR 525

Query: 742  WILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYL 801
            W +GGDFNV R   EK     +T +MR F+++I    LLD PL+N ++TWS+   D    
Sbjct: 526  WCVGGDFNVIRRISEKLGETRLTFNMRCFDEFIRESGLLDPPLRNAAFTWSNMQAD-PIC 585

Query: 802  SLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESF 861
              LDRFL +++    F  +    L R TSDH P+ L    + WGP PFRF+N WL    F
Sbjct: 586  KRLDRFLFSSEWDTFFSQSFQEALPRWTSDHSPICLETNPLKWGPTPFRFENMWLIHPEF 645

Query: 862  REVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVSQLPSLI-SQLKSLDSI 921
            +E  + WW +   +GW GH FM KLK +K +L++WNI    D+ +   LI   L  +D I
Sbjct: 646  KEKFRVWWQECTGEGWEGHKFMRKLKFVKSKLKEWNIMTFGDLKERKKLILMDLSRIDLI 705

Query: 922  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNS 981
              E  L+ D  ++R L R ++ED   ++ + W+Q+ +++W+KEGD N++FFHR+    + 
Sbjct: 706  EQEGNLNPDLVLERTLRRRELEDVLLKEEVQWRQKSRVKWIKEGDCNSKFFHRVATGESW 765

Query: 982  RLS--------------------------------------------------------- 1041
            R+                                                          
Sbjct: 766  RVEGIDWVPISGESGVWLDRPFTEEEDLMRVFLEFHTNGVINQSTNATFIALVPKKRRLR 825

Query: 1042 ------------------------------------------------------------ 1101
                                                                        
Sbjct: 826  KVLHETISGSQGAFVEGRHILDAVLIANEVVDEKRRSGEEGIVFKIDFEKAYDHVDWGFL 885

Query: 1102 --------------------LTHLQFA--------------------------------- 1161
                                L+   FA                                 
Sbjct: 886  DHVLQRKGFSQKWRSWIRGCLSSSSFAILVNGNAKGWVKASRGLRQGDPLSPFLFTLVAD 945

Query: 1162 --DDTLLFSIYDSKALDNLFEIIKLFEMASGLNINFAKNL------------------VA 1221
              +DT+ FS    + L NL  I+ +F   SGL IN  K+                   V 
Sbjct: 946  VLNDTIFFSKASLEHLQNLKIILLVFGQVSGLKINLEKSTISGLPLGGNPKTIGFWDPVV 1005

Query: 1222 SRIQRRLGNGCST----LFWHDSWLSCG-------VLSEAFPR----------------- 1281
             RI RRL    ++    +  +  W   G       V  E   R                 
Sbjct: 1006 ERISRRLDASIASKIEKMQRNFLWSGAGEGKKDHLVRWEVVSRPKELGGLGFGKISLRNI 1065

Query: 1282 ------LYRLSNRSDG----TVADFWVSLNSAWDLSLRRNLNDSETNEWASLSHLLSSIR 1296
                  L+R      G     +   + +  + WD ++   +  S    W +++ +     
Sbjct: 1066 ALLGKWLWRFPRERSGLWHKVIVSIYGTHPNGWDANM--VVRWSHRCPWKAIAQVFQEFS 1125

BLAST of Spg021704 vs. ExPASy TrEMBL
Match: A0A438IT16 (Transposon TX1 uncharacterized 149 kDa protein OS=Vitis vinifera OX=29760 GN=YTX2_785 PE=4 SV=1)

HSP 1 Score: 318.5 bits (815), Expect = 1.3e-82
Identity = 247/825 (29.94%), Positives = 348/825 (42.18%), Query Frame = 0

Query: 616  EVWALGKKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASG 675
            E W L  +R+    T++ + P  V++QETKK   D + + S+W+     W  L + GASG
Sbjct: 484  EKW-LKCERSYALWTLRLEKPDVVMIQETKKEKCDRRLVGSVWTVRSKDWVILPACGASG 543

Query: 676  GILILWSDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLA 735
            GIL +W       +EV+ G FSIS+   +      W+SAIYGP+    R DFW EL+D+ 
Sbjct: 544  GILFIWDSKKLCKEEVVLGSFSISVKFALEGCGPLWISAIYGPNSPSLRKDFWVELYDIY 603

Query: 736  GLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFG 795
            GL    W +GGDFNV R S EK  G  +T SMR F+ +I+   LLD PL+N S+TWS+  
Sbjct: 604  GLTFPLWCVGGDFNVIRRSSEKLGGSRLTSSMRDFDSFISESELLDPPLRNASFTWSNM- 663

Query: 796  DDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAW 855
             +      LDRFL +N+    F       L R TSDH+P+AL      WGP PFRF+N W
Sbjct: 664  QESPVCKRLDRFLYSNEWGQLFPQGIQETLIRRTSDHWPIALDTNPFMWGPTPFRFENMW 723

Query: 856  LHIESFREVLKNWWNQNPLQGWPGH--------------GFMM----------------- 915
            L   SF+E  +NWW      GW GH              G ++                 
Sbjct: 724  LQHPSFKENFRNWWRGFQGNGWEGHKRNRKYIKSLENERGLVLNNVVSITEEILLYFEKL 783

Query: 916  ------KLKGLKMELRK---------------WNI---------------------TNRN 975
                   L  LKM+  K               W++                     TN +
Sbjct: 784  YANPKESLGVLKMDRDKAPGPDGFTIAVFQDCWDVIKEDLVRVFAEFHRSGVINQSTNAS 843

Query: 976  DVSQLP--------------SLISQLKS---------LDSIGDEHILSTD-------QKV 1035
             +  LP              SLI+ L           L  +  E I ST        Q +
Sbjct: 844  FIVLLPKKSTTKKISDFRPISLITSLYKIIAKVLSGRLRGVLHETIHSTQGAFVQGRQIM 903

Query: 1036 QRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADD 1095
               L+  +I D+  R         KL   +      ++       RN R  ++HLQFADD
Sbjct: 904  DAVLIANEIVDERRRSG-RKVSSLKLILKRLTITGEKYVGGFRVGRN-RTRVSHLQFADD 963

Query: 1096 TLLFSIYDSKALDNLFEIIKLFEMASGLNINFAK------NLVASRIQRRLGN-GCSTLF 1155
            T+ FS    + L  L  ++  F   SGL +N  K      NL  + I R      C    
Sbjct: 964  TIFFSNTREEDLQTLKSLLLAFGHISGLKVNLDKSNIYGINLDQAHISRLAETLECKASG 1023

Query: 1156 WHDSWLSCGV-----------------------LSEAFPRLYRLSNRSDGTVADFWVSLN 1215
            W   +L   +                       L   +P L+R+       V D  + ++
Sbjct: 1024 WPILYLGLPLAGTPGQEMGKEFVLGRFVVGDQPLGSQYPSLFRV-------VLDKNIPIS 1083

Query: 1216 S--------AWDLSLRRNLNDSETNEWASLSHLLSSIRIR-VIDDTWSWPIDSSNAFTVK 1275
            S        +W+L+ RRNL+DSE  +   L   L  + +   + D   WP+ SS  F+VK
Sbjct: 1084 SVLGPTRPFSWNLNFRRNLSDSEIEDLEGLMRSLDGVHLSPSVPDARLWPLSSSGLFSVK 1143

Query: 1276 SL---MGDMVGDSDPTSSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHL 1296
            S    +    G      SK    VW    P K++ FIW ++   +NT+D LQ R PY  L
Sbjct: 1144 SFFLALSQFFGSPQVFPSKF---VWNSQIPFKVQSFIWLVAHKKVNTNDMLQVRRPYKAL 1203

BLAST of Spg021704 vs. ExPASy TrEMBL
Match: A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)

HSP 1 Score: 312.0 bits (798), Expect = 1.2e-80
Identity = 270/1062 (25.42%), Positives = 447/1062 (42.09%), Query Frame = 0

Query: 17   RSIYIDRKTFSIEFDEPSRGSRAKITEHSRASSHSLTLSWKSLHWLASSFKTLAHEPCSY 76
            RS  ++RK F +  D+ S+ +   +TE     + S+ +S + L W+  + K+L   P + 
Sbjct: 63   RSCKVERKEFVLHLDKYSKHTHYWLTETGAHKAFSIEVSPRDLDWIRCTLKSLIATPNTN 122

Query: 77   KFSSKIRTDDYVLWLEKLSNKYGFFVEINHLLNSGDRRRLLIPSEDNKQGWFSFFSLIS- 136
            +F  + R  +  +W+ K  N  G   EI  +     +  +L+P   +K GW SF S+I+ 
Sbjct: 123  RFFLETRDSEQRIWIRKTRNSKGCTAEIFRVDQKNRKSCILVPEGPDKSGWVSFLSMITP 182

Query: 137  --------------------------DYPGEAHR---------STKSYKDVLQQKESHVV 196
                                      DY   ++          +T    D     +S   
Sbjct: 183  KVEVKAKTRPTFLPRTSPDCRLSPPIDYHKRSYAKAVTEGRPFATSDSSDSYDSSDSSHS 242

Query: 197  TTHPSSSVPSPQPLDSEIIVVQRFHKKDDWPSIRNTILAGISHRCSINPFQDNKALLHVY 256
            +++     PS   L++ +++V+RF   DDW  I   +        + N F   KAL+H  
Sbjct: 243  SSNSFCDSPSSDLLENTVVIVRRFF-HDDWHKILQNLRKQTEESFTYNAFHAEKALVHFS 302

Query: 257  DQNIVSKLCNNKDWSSIGKYRLKFYPLTTDSFYQDTMTNSFGGWIEVLQLPLPLWTEQIF 316
                 + LC NK WS++GKY ++F   +        +  S+GGW     +PL LW    F
Sbjct: 303  SNIPANLLCQNKGWSTVGKYSVRFEKWSPVYHATPKLIPSYGGWTTFRGIPLHLWNMMTF 362

Query: 317  RYIGDVCGGFTEISNHTSRKLNLTAAKIKIRQNSIGFIPARIKLPSSLAG---------- 376
            + IG  C G  +++  T    NL  A+IK+R N  GF+PA +++  +             
Sbjct: 363  QQIGKACEGLIKVAEETRSAKNLIEARIKVRYNYSGFLPANVRIFDNEGNKFFVQVVTHP 422

Query: 377  -GDVTVEIKGLTASLFK---SARFEESPSFSEQNNLEIKR-------SEKLDGKNLKLPE 436
             G   +E        FK   +A F++    SEQ   E          S   DG+    P+
Sbjct: 423  EGKWLIERNVRLHGTFKRQAAASFDDFNPESEQFFFEGSEAISPDFLSTSSDGRKSSTPD 482

Query: 437  E-------IKSPPKNQEFIPILAE----------TASP------KGLSPCLVSPQTISQP 496
            +       I  P +N      L E          TA+        G+S   V  +   + 
Sbjct: 483  QPSALKSVIIKPDRNATLPSFLNEELVNDSNLHATANKSKLEILSGISNDGVLDKGKQKV 542

Query: 497  RSILQAADHSNISPKKKTAHSNKGKSPLHVASPIEA-KNHSNYFLPVGPTTLGLGEKKST 556
               LQ     N+   K+    N   +  ++ +P  A  NHS         +L   EKK  
Sbjct: 543  DIQLQPNSALNLDKSKRKVSFNSPSNKTNIFNPDSAPANHS--------PSLNSPEKKQK 602

Query: 557  GNKIIASDTEAYLSSPANDKSPHTSVCDPTSPRNFDLAIFDELHLPESEQIPLASLPPTS 616
             ++  +   ++  + P N K+        T P        D      S  + L  LP   
Sbjct: 603  VSRERSIKKKSSSTQP-NSKANQNKGVFITQPIQIVAHDRDAAKKGLSLTVDLGDLPALD 662

Query: 617  HIPSSPHIIPSPTKDVTPLQQPSSSPPEPSPLSLPTYLCHLAPMLSKHGLCIMVLPTGSI 676
               S      S   +V  +      P  P  + +P      +   + +     V      
Sbjct: 663  PNKSLEDHHNSDNAEVVDITNTEVVPETPE-MKMPVNENSNSSSEANYRKPKHVHKRKYY 722

Query: 677  PKPPTKKTKATTGKKSKLKREVWALGKKRALIKKTIQQQN----PSFVLLQETKK--TSV 736
             +   +K K    +  K +   W   KK  L   T    +     + VLL +        
Sbjct: 723  YRKKEEKEKDPDSEAFKKQLVSWL--KKNGLKLSTDTDSSGATTSTNVLLNQMNSGLKIT 782

Query: 737  DGKFIKSIWSSSCIGWASLDSIGASGGILILWSDPDFTIKEVIQGHFSISIHVFMADGFS 796
            + + IKS+W S+ I W + ++ G+SGGILILW   + ++    +G FS+S +  + +  S
Sbjct: 783  NKRIIKSLWPSNSINWIAKNASGSSGGILILWDAQNHSLLSQEEGLFSLSANFLLNNNSS 842

Query: 797  FWLSAIYGPSRREHRADFWQELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRNVTRSMRT 856
            +WL+ +YGP +R  R  FW ELH+L  L    WILGGD NV R   E +   + + + R 
Sbjct: 843  WWLTGLYGPVKRRERIHFWAELHNLQHLNSFPWILGGDLNVIRMREESTSVLSSSHNSRM 902

Query: 857  FNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLRLDRVT 916
             N +I+N  L+D PL N  +TWS+  +   + S +DRFL  +   + F       L R T
Sbjct: 903  LNNFISNNLLIDPPLTNNRFTWSNLRNPPTF-SRIDRFLYNSSWENLFSPHTTRTLPRST 962

Query: 917  SDHYPLAL--SFGDIAWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGFMMKLK 976
            SDH+PL    S   ++WGP PFR ++  L    F+  +  WW  +   G+PG  F+ +LK
Sbjct: 963  SDHFPLVCEDSNPKLSWGPIPFRLNSITLSDPEFKRNMGRWWENSIQAGYPGFSFIQRLK 1022

Query: 977  GLKMELRKWNITNRNDVSQL-PSLISQLKSLDSIGDEHILSTDQKVQRRLLREQIEDQTA 989
             L   ++ W     + ++    ++I ++ S+D    +  L+ ++  +R  L+  + + + 
Sbjct: 1023 SLANFIKPWQKEKLHSLTYAKEAIIREVDSIDKKELDTPLTQEESNRRLALKADLSELSL 1082

BLAST of Spg021704 vs. ExPASy TrEMBL
Match: A0A5D3BLV7 (LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G005290 PE=4 SV=1)

HSP 1 Score: 103.6 bits (257), Expect = 6.3e-18
Identity = 54/161 (33.54%), Positives = 84/161 (52.17%), Query Frame = 0

Query: 1037 LGNGCSTLFWHDSWLSCGVLSEAFPRLYRLSNRSDGTVADFWVSLNSAWDLSLRRNLNDS 1096
            L NG    FW+ +W   G LS A+PRL+ L+   + +V D W + ++ W++  RR LND 
Sbjct: 1597 LNNGDQISFWYSNWSQEGRLSTAYPRLFALTLDKEISVKDAWNTFDNQWNIIFRRELNDR 1656

Query: 1097 ETNEWASLSHLLSSIRIRVIDDTWSWPIDSSNAFTVKSLMGDMVGDSDPT----SSKLYN 1156
            E   W  +  +L + R        +W  DS+N+F++ S    +    D T     +KL  
Sbjct: 1657 ERCNWEKILEILPTPRSNRGSSKPTWIPDSNNSFSIASAKVLISRQLDQTPGDPRAKLLE 1716

Query: 1157 VVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPS 1194
            ++WK   P KIK F+W L    INT + +Q++MP   L P+
Sbjct: 1717 IIWKSSIPMKIKFFMWCLIQRRINTMEVIQQKMPNTLLQPN 1757


HSP 2 Score: 299.3 bits (765), Expect = 7.9e-77
Identity = 156/395 (39.49%), Positives = 232/395 (58.73%), Query Frame = 0

Query: 622  KKRALIKKTIQQQNPSFVLLQETKKTSVDGKFIKSIWSSSCIGWASLDSIGASGGILILW 681
            KKR ++K  +  + P  V++QETKK   D + + S+WS     WA+L + GASGGILI+W
Sbjct: 117  KKRRVVKNFLSSEKPDVVMIQETKKEECDRRLVGSVWSVRNKDWAALPASGASGGILIIW 176

Query: 682  SDPDFTIKEVIQGHFSISIHVFMADGFSFWLSAIYGPSRREHRADFWQELHDLAGLGGDR 741
                   +EV+ G FS+SI   M    S WLSA+YGP+    R DFW EL D+AGL   R
Sbjct: 177  DSIKMRREEVVLGSFSVSIKFAMDGCESLWLSAVYGPNNSALRKDFWVELSDIAGLSHPR 236

Query: 742  WILGGDFNVTRWSWEKSHGRNVTRSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYL 801
            W +GGDFNV R S EK  G  +T  M+ F+++I +  L+D PL++ SYTWS+  ++    
Sbjct: 237  WCVGGDFNVIRRSSEKLGGSRLTPCMKDFDEFIRDCELIDSPLRSVSYTWSNMQEN-PVC 296

Query: 802  SLLDRFLLTNDCLHKFGSANLLRLDRVTSDHYPLALSFGDIAWGPCPFRFDNAWLHIESF 861
              LDRFL +N+    F  +    L R TSDH+P+ L      WGP PFRF+N WL   SF
Sbjct: 297  KRLDRFLYSNEWEQVFPQSLQGVLPRWTSDHWPIVLETNPFKWGPTPFRFENMWLQHSSF 356

Query: 862  REVLKNWWNQNPLQGWPGHGFMMKLKGLKMELRKWNITNRNDVS-QLPSLISQLKSLDSI 921
            +E    WW++    GW GH FM KL+ +K +L++WN T+  ++S +   +++ L + DS+
Sbjct: 357  KENFGRWWSEFQGNGWEGHKFMRKLQFVKAKLKEWNKTSFGELSKKKKDILAVLANFDSL 416

Query: 922  GDEHILSTDQKVQRRLLREQIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNS 981
              E  LS +  VQR   + ++E+   R+ I W+Q+ +++W+KEGD N++FFH++   R +
Sbjct: 417  EQEGGLSQELLVQRAFSKGELEELILREEIHWRQKARVKWVKEGDCNSKFFHKVANGRRN 476

Query: 982  RLSLTHLQFADDTLLFSIYDSKALDNLFEIIKLFE 1016
            R  +  L+     +L +    K      EI+K FE
Sbjct: 477  RKFIKELENESGLMLNNPESIKE-----EILKYFE 505

BLAST of Spg021704 vs. TAIR 10
Match: AT1G43760.1 (DNAse I-like superfamily protein)

HSP 1 Score: 71.2 bits (173), Expect = 6.7e-12
Identity = 59/245 (24.08%), Positives = 106/245 (43.27%), Query Frame = 0

Query: 765  RSMRTFNQWIANYHLLDIPLQNGSYTWSSFGDDIEYLSLLDRFLLTNDCLHKFGSANLLR 824
            R +  F   + +  L+DIP +   YTWS+  DD   +  LDR +   D    F SA  + 
Sbjct: 247  RGLEEFQNCLRDSDLVDIPSRGVHYTWSNHQDDNPIIRKLDRAIANGDWFSSFPSAIAVF 306

Query: 825  LDRVTSDHYPLALSFGDI--AWGPCPFRFDNAWLHIESFREVLKNWWNQNPLQGWPGHGF 884
                 SDH P  +   ++      C   F     H      +   W  Q P+     H F
Sbjct: 307  ELSGVSDHSPCIIILENLPKRSKKCFRYFSFLSTHPTFLVSLTVAWEEQIPV---GSHMF 366

Query: 885  MMKLKGLKMELRKWNITNRNDVSQLPSLISQ-LKSLDSIGDEHILSTDQKVQR--RLLRE 944
             +  + LK   +   + NR     +     + L SL+SI  + + +    + R   + R+
Sbjct: 367  SLG-EHLKAAKKCCKLLNRQGFGNIQHKTKEALDSLESIQSQLLTNPSDSLFRVEHVARK 426

Query: 945  QIEDQTARDHIAWQQRCKLQWLKEGDENTRFFHRIMAARNSRLSLTHLQFADDTLLFSIY 1004
            +     A     ++Q+ +++WL++GD NTRFFH+++ A  ++  +  L+  DD  + ++ 
Sbjct: 427  KWNFFAAALESFYRQKSRIKWLQDGDANTRFFHKVILANQAKNLIKFLRMDDDVRVENVT 486

BLAST of Spg021704 vs. TAIR 10
Match: AT4G04650.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 61.6 bits (148), Expect = 5.3e-09
Identity = 63/262 (24.05%), Positives = 107/262 (40.84%), Query Frame = 0

Query: 1037 LGNGCSTLFWHDSWLSCGVLSEAF----PRLYRLSNRSDGTVADFWVSLNSAWDLSLRRN 1096
            +G+G +  FWHD+W+  G L E      PR   L    D  V D      ++W ++  R+
Sbjct: 16   VGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLP--IDAVVRD--ALRGTSWWIASSRS 75

Query: 1097 LNDSETNEWASLSHLLSSIRIRV---IDDTWSWPID---SSNAFTVKSLMGDMVGDSDPT 1156
             N         L +LL   +  +    DD++ W  D    SN F+       +   S   
Sbjct: 76   RNPI----IVQLKNLLPEAQGLLDCQHDDSFLWKTDLHAPSNRFSAPRTWSALHPQSH-- 135

Query: 1157 SSKLYNVVWKDVYPKKIKIFIWELSLGAINTSDRLQRRMPYLHLSPSWCVMCCSDAENTC 1216
            +   +  VW   +  K     W ++   ++T DRLQ    +    P+ C++C +  ++  
Sbjct: 136  TVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQN---WGLSIPAECLLCNAHDDSRA 195

Query: 1217 HLFVHCSFASRYWSTIFNAFEWSLALPNNIYDVLASIFVGHPFHGVKKILWLALNRVFLW 1276
            HLF  C F+   W   F     +L  P  + D L  +        +  I+ LA +   ++
Sbjct: 196  HLFFECQFSGVVWR--FFTASTNLNPPAQLMDCLNWLLSPSREKNICLIIRLAFHSC-VY 255

Query: 1277 FLWGERNGRIFRDSFSSFENFM 1289
             +W ERN R+      S E+ +
Sbjct: 256  AIWRERNQRLHSGVSRSTESIL 261
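
The percentages in each HSP summary follow directly from the counts printed beside them (for example, 54 identical positions out of 161 aligned columns gives 33.54%). A minimal sketch of how these two summary lines could be parsed and re-checked is shown below. The function name `parse_hsp` and the returned field names are hypothetical, and the regular expressions assume the exact line layout printed in the alignments above.

```python
import re

# Patterns follow the two HSP summary lines as printed in the alignments above.
# "Posi?tives" also accepts the "Postives" misspelling found in some report exports.
SCORE_RE = re.compile(r"HSP (\d+) Score: ([\d.]+) bits \((\d+)\), Expect = (\S+)")
IDENT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(score_line, identity_line):
    """Parse one HSP summary (hypothetical helper) and recompute its percentages."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    identical, aligned = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "hsp": int(s.group(1)),
        "bits": float(s.group(2)),
        "raw_score": int(s.group(3)),
        "evalue": float(s.group(4)),
        # Recomputed from the counts, rounded to two decimals like the report.
        "identity_pct": round(100.0 * identical / aligned, 2),
        "positives_pct": round(100.0 * positives / aligned, 2),
        # Value as printed in the report, kept for comparison.
        "reported_identity_pct": float(i.group(3)),
    }


hsp = parse_hsp(
    "HSP 1 Score: 61.6 bits (148), Expect = 5.3e-09",
    "Identity = 63/262 (24.05%), Positives = 107/262 (40.84%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positives_pct"])
```

Comparing `identity_pct` with `reported_identity_pct` is a quick consistency check when the summary lines are copied by hand.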

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_022158956.1 | 1.7e-94 | 47.41 | uncharacterized protein LOC111025405 [Momordica charantia]
RVX15530.1 | 7.0e-88 | 26.30 | putative ribonuclease H protein [Vitis vinifera]
RVW99869.1 | 2.6e-82 | 29.94 | Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]
TYJ99315.1 | 2.4e-80 | 25.42 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
TYJ99315.1 | 1.3e-17 | 33.54 | LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]
Match Name | E-value | Identity (%) | Description
A0A6J1E2G6 | 8.4e-95 | 47.41 | uncharacterized protein LOC111025405 OS=Momordica charantia OX=3673 GN=LOC111025...
A0A438K2W1 | 3.4e-88 | 26.30 | Putative ribonuclease H protein OS=Vitis vinifera OX=29760 GN=VvCHDp000001_863 P...
A0A438IT16 | 1.3e-82 | 29.94 | Transposon TX1 uncharacterized 149 kDa protein OS=Vitis vinifera OX=29760 GN=YTX...
A0A5D3BLV7 | 1.2e-80 | 25.42 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
A0A5D3BLV7 | 6.3e-18 | 33.54 | LINE-1 retrotransposable element ORF2 protein OS=Cucumis melo var. makuwa OX=119...
Match Name | E-value | Identity (%) | Description
AT1G43760.1 | 6.7e-12 | 24.08 | DNAse I-like superfamily protein
AT4G04650.1 | 5.3e-09 | 24.05 | RNA-directed DNA polymerase (reverse transcriptase)-related family protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR025558 | Domain of unknown function DUF4283 | PFAM | PF14111 | DUF4283 | coord: 180..310; e-value: 1.1E-12; score: 47.8
IPR036691 | Endonuclease/exonuclease/phosphatase superfamily | GENE3D | 3.60.10.10 | Endonuclease/exonuclease/phosphatase | coord: 611..839; e-value: 3.8E-31; score: 110.7
IPR036691 | Endonuclease/exonuclease/phosphatase superfamily | SUPERFAMILY | 56219 | DNase I-like | coord: 620..839
IPR026960 | Reverse transcriptase zinc-binding domain | PFAM | PF13966 | zf-RVT | coord: 1130..1219; e-value: 1.3E-13; score: 51.4
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 528..556
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 424..445
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 528..566
None | No IPR available | PANTHER | PTHR33710:SF32 | SUBFAMILY NOT NAMED | coord: 699..937
None | No IPR available | PANTHER | PTHR33710 | BNAC02G09200D PROTEIN | coord: 699..937
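
The `coord: start..end` values above give each annotated region as a residue range on the protein. A minimal sketch for extracting a range, computing its length, and testing whether two regions overlap could look like the following. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly, and the helper names `domain_span` and `overlaps` are hypothetical.

```python
import re

# Matches the "coord: 180..310" notation used in the InterPro rows above.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def domain_span(text):
    """Return (start, end, length) for the first coord range in text, or None.

    Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    """
    m = COORD_RE.search(text)
    if m is None:
        return None
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1


def overlaps(a, b):
    """True if two inclusive (start, end) ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


pfam_duf = domain_span("PF14111 DUF4283 coord: 180..310")
gene3d = domain_span("3.60.10.10 coord: 611..839")
panther = domain_span("PTHR33710 coord: 699..937")

print(pfam_duf)                           # start, end and length of the DUF4283 hit
print(overlaps(gene3d[:2], panther[:2]))  # do the GENE3D and PANTHER regions overlap?
```

Under the inclusive-coordinate assumption, the DUF4283 hit spans 131 residues, and the GENE3D endonuclease/exonuclease/phosphatase region overlaps the PANTHER family region.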

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Spg021704.1 | Spg021704.1 | mRNA