Spg035607.1 (mRNA) Sponge gourd (cylindrica) v1

Overview
NameSpg035607.1
TypemRNA
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotrans_gag domain-containing protein
Locationscaffold3: 1873692 .. 1883103 (+)
Sequence length2394
RNA-Seq ExpressionSpg035607.1
SyntenySpg035607.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGCGATTCGCCTGGGGTAAGGTTTGAGTTGTATCCAGAAATTGAGAGGACATTTAGGAGAAAAAGGAGAGAGCAGTGAAGAAATCAAAATCCAATGGATAACGTGTCGCGTCTCCCAGAAGTTCCAGAAGATCCAATTAATCCCCAGCAGAATCGTGTGCTGCAACCTAATCCACCGATTGAGCAAAATGGACAGCAAAATAATCAGGCTGAGAACCCTATCTTGGTAGCGAACGATAGGGCTAGAGCCATTCGAGCGTATGCTTTCCTGATGTTTGATAAGTTAAATCCAGGAATTGCACGTCCTCAGATTGAGGCAACAAATTTTGAAATGAAACCGGTAATGTTTCAGATGTTACAAATCGTGGGTCAGTTCCATGGTTTGTCATCTGAAGACCCCCATTTACATCTTAAGTCTTTTCTTGGAGTTAGTGATTCGTTTGTAATTCAAGGAGTGTCTAGAGATGCCCTTAGATTAACTTTGTTCCCGTATTCTCTCAGAGATGGAGCAAAAGCATGGTTAAATTCTTTTGCTCCAGAATCAATTAGGACATGGAATGAGTTAGCGGAAAAATTTCTTAGTAAGTATTTTTCACCAACTAGGAATGCCAAGTTAAGGAGTGAGATAGAGAGATTTAGGCAACTTGAAGATAAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGATTAAACATGACAACCCAAAGCATGGTCGATGCCTCGGCTGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTTAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCTACCATTAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAGAATGTGATAGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGTTGTAGTGGTGAACCAAGTTGCAGAGGAATCATGTGTCTATTGTGATGAAGAGCACAACTACGAGTTTTGCCCCAGCAATCCAGCCTCTGTTTTTTTTGTAGTTAATCAAAGGAATAACCCTTACTCTAACTTTTATAATCCAGGTTGGCGCAACCACCCTAACTTTGCATGGGGAGAGCAAGGAAGCAATTCACAAGTCCCTCAAACACAGCAAAAGGTGAACCAGCCAGGATTTGCTAAATTACAAGCATTGCCTCAGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAGGAATATATGGCTCGTACAGATGCCGCTATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGTCAGCGAGCTAATGAGCTGAAGGCAAAAACTCAAGGGAAACTTCCTGCGGATACTGAACACCCTAAAAGGGAAGGTAAGGAACAGGTACATGCAGTAACTCTAAGAAGTGATAAGCCACTAGAAGAGAGAAAGAAACCTAGTAAACCCCAGGATGTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAGAGAGTTGGAGTCTGGTAAAGGTGATGGAGGCAGCAATAATAATGCTGGAGCATCTGTTTTTGTCCCAGATGTGGATCCACCTTATGTGCCGCCCCCACCGTATGTACCACCTCTACCTTTTCCATAAAGGCAGAAGCCTATGAATCAGGATGGTCAATTTAAGAATTTTTTAGAGATTCTTAAGCAATTGCAGATAAATATTCCTTTAGTAGAAGCTATAGAGCAAATGCCAAATTATGCTAAATTTCTTAAGGATATTTTAACTAAAAAGAAGAGATTAAGTGAATTTGAAACTGTATCTCTTACTGAGGAATGTAGTGCTATTCTCAAGAATGGGCTACCAACCAAGGCTAAGGATCCAAGATCATTCACTATTCCTGTCTCAATAGGTGGAAAAGGGTTGGGTAGAGCACTTTGTGATTTAGAAGCATTAACCTTATGCCTCTTTCGGTCTATCAAAAGTTAGGTATTGGTGAATCTAGGCCTACCACAGTCACACTCCAATTAGCTGATAGGTTTATCGCATATCCAGAGGGTAAAATTGAGGATGTCTTAGTGAAAGTAGTTAAATTCATATTTCCTATTGATTTTATTATATTAGATTATGAGGAAGCTAAAGATATCCCAATTATTCTTGGTCGTCCATTTTTGGCTACTGGTAGAGCATTGATAGATGTTCAAAAAGGGGAACTAACAATGAGGGTTTATGATGAGGAAGTAAAGTTCAATGTTTTTAAGGCCATGAAGTATCCAGACGAAGTGGAAGATTGTTCTTTCATTAGGATTTTGGAGAACACAATTGTTGAGACAGAAATGGAGGATTTGACAAATAAACATTTGGAAGACTATGGAGAGATTAGTGGAAGTTTGTTCTTTAGAAAGAAAAAGTGAAAAAGAAGTGTCTAGGTGTGAGGATATTTTTGAGTTTTAGATTTGGATGAAAGAAAGGCTCCTCCTATTAAGCCATCCCTGATTGAGGCACCCACTTTAGATTTGAAGCCCTTGCCGGATCATCTAAAGTATGTGTATCTTGGGGAGGGTGAGACGTTGCCCATTATTGTTGCATTAGATTTAATGTCGGAGGATGAAGAGGCATTAATAAAGTTGCTGCAGCAATATCGCAAAGCAATAGGTTGGACATTGGCTGACATTCAGGGAATTAGCCCGTCTTTCTGTATGCACAAAATCACTCTAGAGGAGGGATCCTTTAGGAGTATTGAGCAACAGAGAAGGCTTAACCCTGCAATGAAAGAGGTAGTTAAGAAGGAGGTGATTAAATGGTTGGATGCTGGGATATCCAATTGTAGATAGCAATTGGGTAAGCCCGGTCCAATGTGTTCCTAAGAAAGGAAGTGTTACGGTAGTGACCAATAAAGACAATGAGTTGATCCCAACCAGGACAGTAACTGGCTGGAGGGTTTGCATGGATTATAGGAGACTTAACAAAGCCACCCGTAAGGACCATTTCCCTCTGCCATTTATTGACCAGATGTTGGATAGATTGGTTGGTCAGGCCTACTACTGCTTCTTAAATTGTTATTCTGGGTATAACCAGATTACCATTGCTCCTGAGGATCAGGAAAAAACCACTTTCACCTTCCCTTATGGGACATTTGCTTTCAGGCGAATGCCTTTTGGACTTTGCAATGCTCCAGCAACATTCAGCAGTGTATGTTAGCAATTTTCTCTAATATGATTGAGTCCACTGTTGAGGTATTTATGGACGATTTTTCAGTATTTGGAGGGTCTTTTCAGCATTGTTTGAATAATTTAGGTAAGGTGTTAAAGAGATGTGAGGATACTCATTTAGTTCTTAATTGGGAGAAATGCCACTTCATGGTGAAGGAGGGCATTGTATTAGGTCATAAGATTTCAAATAAAGGTTTAGAAGTTGATCGAGCAAAAATAGAGGTTATTGACTGATTAGAACCACTGAATTCAGTTAAAAGGATTCAGAGTTTTTTAGGTCATGCTGGATTTTATAGAAGGTTCATAAAGGATTTTTCCAAAATCAGTAAACCTCTTTGTAACTTATTGTGTGCTGATAGTGTTTTTTACTTTAATGCAGATTGTAGGAAGGCTTTTGAGACCTTTAAGGCTGCTTTAATCTCAGCCCCCATTTTGTGTGCACCTAACTGGAATTTACCATTTGAGGTAATGTGTGATGCGAGTGATGTTGCAGTAGGTTCAATGCTGGGGCAAAAGCAGGACAAATTTATCCATCCTATTTATTATGCAAGCAGGGTTTTGAATGAGGCACAAATCAACTATACAACTACTGAAAAGGAGTTGTTAGCGGTTGTGTTTGCCTTTGAGAAATTTCGCCCATATTTGGTTGGAACCAAAGTCACGGTTTTCACAGATCATGCAGCAATAAGGTACTTAATGTCTAAAAAAGATGCAAAGCCTAGACTGATTCGTTGAGTTTTATTATTGCAGGAATTTGACTTGGAGTAAAGGACAAGAAGGGATCGGAAAATGTTATTGCAGATCATTTGTCTCGCCTTGATCCAGCATCATCTTTCCTGGAGCAATTTGACATTTCAGATTCCTTTCCAAATGAGCATCTCTTTGCTGTTAATGTAAAGGTAGTACAGGATATCCCTTGGTATGCTGATATTGCCAACTTTTTGGTAAAAAGAGTTATTCCTATGACATGGATTGGAGACAGGTGAAACGGTTTAAGCATGATGCAACATTCTTTTATTGGGACGAGCCATTTATGTATAAACAATGCTCTGATGGTATTATTCGAAGATGTGTTTCAGGTGATGAAGCAAAGGAAATCCTGGCGCAATGTCACTCTTCGCCGTATGGAGGTCATTTCAGCGGACAGAGGACAACTATGAGGATTTTGCAATGTGGATTCTTCTGGCCTTCCTTGTTTAAGGATAGTCATTGGTTCTACAAGCAATGTGATGCTTGCCAAAGGAGAGGAAATTTAGGACCTAGAGATGAAATGCCTCCTACTTACATTTTGGAATTGGAGTTATTCGATGTGTGGGGTATTGATTTTATGGGGCCATTTCCCCCTTCTAATGGTAATAATTTTATCTTATTGGCAGTTGATTACGTTTCCAAGTGGGTGGAGGCCATTGCATGTCATCAGAATGATGCCAAGACAGTATCAAGGTTTCTCCAATCGCACATCTTTGCGCGGTTTGGGACACCTAGAGCTCTTGTGAGTGATGAGGGTACACACTTTGTGAATAATGTTTTAATTAAGATTTTAGCTAAGTATGGAATTAAGCATATGATAGCTACCCCTCATCACCCACAAACAAATGGTCAGGCTGAAGTTAGTAATAGGGAAATTAAATCTATTTTCGAGAAAGCAGTCCATCCATCTAGGAAAGATTGGTCTTTTCGGTTAGATGAGGCTCTTTGGGCTTATAGGACAGCCTATAAGACTTCTTTAGGTATGTCTCCCTATAGGTTAGTATATGAAAAAGCTTGCCATTTACCATTAGAGCTTGAGCATAAAACGTTTTGGGCTTTAAAGAAATTGAATTTTGATCTAAGTCGTGCAGGTGCAATAAGAATGCTGTAGCTTAATGAATTAGAGGAGTTTCGTCAGTTTTCTTATGAAAATGCGAAGTTGTATAAAGAGAAGACTAAGTGGTGGCATGATAAGAAGATAAAACCTAAGGAATTTGTTAAAAGTCAGAGAGTCTTGCTTTACAACTCTAGATTGAAATTGTTTCCTGGAAAATTAAAATCTAAATGGTCTGGGCCGTTTATTGTGATTGAAGTTTTTCCTCATGGAGCAATTACTTTGAGAGATGAAAAGGATGAGCGAGTGTTCAAGGTTAATGGACAGCGAGTGAAGCATTATTGGGGAGAGGAGTTTCGTTTGAAATATCCTTCTCTAAAACTGGTCGATGAATGAACGAGCGAGAGTTTTACGGGAGCATTTCATGTTGCAATATTTTTAGCTCCCAGTGTTTGTTATGATTTTATTTTTCGTTGATTGATTTTGATTTTAATTCGAATTATTTTGAATTAGATTTGAGTTATTTGGATTTTATCTGGTTTATTTTAATTTGTTGGTATTTTTAAACTTTTGTAGAATTTGTGTTTGTTTTATTTTTGCCGATTTAATTTAATTTGATCGGTTTCGTTTAGTTTAGATTTAATGTTAATTTGTATTTTCGTTATTTTAAATTTCTATTTAGATTAGATTTTAATAAAATAATTTCCTGATTTATTCGGCAGAAGATAAAAAGGCGAGATCGATTTTTGGCGGTGTGAATTTAAATTGAAATTTTCCGAATAAAAGATTCTTCTAGAAGAAAATCTTTTGATGATTAGTTTGTTTTTACAATCTGTTGGAAGGATCAAATATTTCGTCAGATTGTGAAGGGATTGGTTGTGAATTAAATAAGTTTGTCGTTTGGAATTTAATTTTTCTGTGGGTAGGTAAATCGCGTGAAGCATCTGGCGCTTCCACGTGTCACACAGTGTCAATCCAATGTTCTATCCGTTAGCCAGCGCAGAAGAGATTTACCACGGGGCAAAATTTTACCGTTTAAAATTTAATTACCCATGATTACATGCAATTTAAACGGTAATTTTTATTTAATTGGGGCAGAATCTTTCGGTTATGATTTAAAGTGCTGGAGCCCTATTTATTTCAAAACCCTGGTACGTTTCGTTACTTCATCTTCTTGCCTTCATTCTTTGTTTTTATTTCTCTCCTCCTTTCTTTCGATTATTGTGAGAGACTTTGCATCTGAAATGGCTAAAACAAGAGGCCGTAAAGAAAAAGATGTTGAGGAAGAGGAAGTGCCGATTACCCCTGAGGCACCGAAGACAAAAGCAAAGAGAAGAAAGACGCCGGAAGAGAGGGAAGCTAAGAGGCGAAGAAGACAACAGCGAGCAGAGGTTGTGGAAATAGAACGGAAAGTGGTTGTAGATATTGTTGAAGAAGTGGTTGAGGTAGAACAACCAAAGGACCCTGAGGAAAAGAAAGATCCTGAACAAGGTAACCCGACAGTTGAGAATCCGCAAGAGGAACAAGAAAAGCGAGTGGAAGATGTGCAGGAGCAGGGAAATGGTCAGGAGCAGCAGAATTAAGAAGTCGTACCGGAGATTCCACGTCGTCGCCGCCGCAACCAAAAGGCAGGACGAATTAAGGTGATCAAGACATACACTCCATCTCCGTCGACGACAAAATCTGAGAAAGAAAATTCTGAAAAAGAAGAGGCTGAGAAGAAAGCGGAAGAAGAAACCTTGGCGAAGCAGCAAGAAGACAAGGGCAAAGGAGTTGCTGCAGCACAGACAGAGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGCACGCGCGCTTCGTCAACGATCTTGCGCGAGCAAAATATTTAGAGATGTTGAAAAGGGATTTTCTGTTCGAAATGGGATTCGGTGATGATCTGCAGCATTTCTTAAGAGCTGGAATTTCAAAGCATGGTTGGGATCAATTTTGCGCAAAACCAGAGCCGGTAAATTCGAATGTTCATGAATTTTACGCGAATATAGATGAAGAATAGGGGTTTCAGGCAATTTTTCGAGGTGTGGCAGTCGATTGGAGCCCTGGTGCGGTAAATTCTCTGTTTAACCTTCAAGATTTCCCACATGTCGGGTATAATGAGATGGCGGCAGCGCCATCCAATGAACAGTTGAATGCAGTTGTTCGAGAGGTGGGAATTGAGGGGGCCTAGTGGCATTTGACAAAAACAGAGAAGAGAACCTTTCAAGCGGCCTATCTTAAGAGTGAAGCCAACAGTTGGTTGAGATTCATCAAACTGCGCCTGCTACCAACAACTCACGACTCTACTGTCTCCCGTGATCGAGTTCTTCTGGTATTTGCTATTCTGAGGTCATTAAGTATAGATGTTGGAAAAATTATCTCCAGTGAAATACATTCTTGTTGGAGGAAGAAGGTGGGTAAGCTATGTTTCCCGAACACAATAACTATGTTGTGTCAAAGAGCTGGGGTTCCCATGAGTGCAGAGGATGTTATTCTGATGGATAAGGGAATAATAGACACACCAAACCTGGCAAGGCTTCAGAGGACTCAGGAAGCACGCCAAGGCGGTTTGGTGTGTGGCATCTACCAAATTCAAGAACATATGCAAACGCATTCCAGCAGAACGGAGTTTGCCGAAAGGCAATTCCAAACTTTATGGAATTATGTTAAGAGAAGGGATGCCACGTTGAAGAGGGCTTTGCAATCTAATTTTTCCAAATCGTATCCGGCCTTCCCAGTATTCCCTGATGATCTATTGAACCAGTGGATCCCACCACCGCCAATAGAAAGAGAAGGAGATGAGGAGGAGGACGCTGGTAAGGAGGATTAGAAGATAAGTTGAGCGAGCTTTAATTTTATTTTGAAAGATCACTTTTGTTGCTCGCGCATTTCCCTAGCCAAGAAAAGTTTATCTACTTTTGATATTCCTGTTTCTGTGTTTTTCTTATTTTTCTACTGACCTTTTAAATTTTTATCTTATATTGTAAATCTTTTTCTGTTATTTTAATGAAATTTCCAAGGTTGTTTCTGTTCTATAGCACTGGGCATCCCTAACTTAACCATTTTCTTCTGAATTTTTGAAGCTTGGACTAGTTAGCTTAATTAGATCTAGGTGTTCTGATCTCCAGAAACCTTTTGCTTGAACATTTCTTCTAGCCTGGTCATCGCTCGGCAAGAAGATTCTGAGGTAGTATTGGCTTATTTGATCCACCTTAAGCTTAATTATAGTCCCACACTTAGTTTAAAATTCAGATAGAAAAGGAATTAAGTTTGGGGGTATAATTTATCTTATTAATAATTCATTGAGACAATGAATAATTCAAGTTTGGGGGACTTTTTGCTACTCCGCTTGTTTTTCTCGAGCATTCCTCTCCTTCATAATTATTTTTTTCTAGGTTATTGAGTAACCAGGGACATTTGTGACACCCACAGTTTTTCGTTATGTCAAAAGATGACCTAGGAAATATTGAAAAATGATGAAAAAAAAAAGTATTGCTGAAAGAAAAAAATTGTGAAAATTTTGCTAAGAAAATTCATAGAGCAAAAATTTTGAAGGATGAATGATAATCTAGCAGTTTTCTGTCTCCCAGGGCTTAGGTGAGCTTCATCATAAAATGTCGCCATTCTGAGAAGTAAGGGATAGAATAACTAAGGATTTTTTAGCTTAGCAATGAGTATTTAGCCTAGTTGGTGATAAGCTTGAGGCAAGGGTATACTGCACCATAAAGTGGGTCATCCCATGCTTAAGGGCTTATGACTGTAGGGTTGCTTTAAGTCTGAAGAACTAAAGAATAAAACCCCTTAAAAATGTGTTCAAATATGTCGGATAATATGGTTAAGATGTGGTGAGTTCTTAGGGTTGAGTTAAAGGAGGAGATTATCAGTTCAATGCTGAAAGAATTATTTTGCTGCAGCAGTGCTTGGTTTTGCAGAATGCTCAGGTAAAGGTTGAAGGTAGTGTTGGAATATCTGCTTTGATTGAGTTATGCTATGACTAATGTTTTGAATTCTCAAGGGAATGTGAGTTAGTATTGCTCGGGACGCGCAATAGTTCAAGTTTGGGGGTATGATAACTCTCCAAAAAGGAGTTATTTAGACCTTATTTTTATAGTGATTTGTGTGTCTCTTGTGTTATTTAGGGTTGCTTTTAGATGAATATTGTATGTTTTAACCCATTTGGAAGAATGGATGCTTTGGAAGTTATTTTGTGCAGAATATAATGCTGAGCGACTTGAGGGAGCAAAATCCGTGTTCCAGCAAAGCATGGAGCAAAACTGCCACGTAAAATGTGCATAA

mRNA sequence

ATGAGCGATTCGCCTGGGGTAAGAGATGGAGCAAAAGCATGGTTAAATTCTTTTGCTCCAGAATCAATTAGGACATGGAATGAGTTAGCGGAAAAATTTCTTAGTAAGTATTTTTCACCAACTAGGAATGCCAAGTTAAGGAGTGAGATAGAGAGATTTAGGCAACTTGAAGATAAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGATTAAACATGACAACCCAAAGCATGGTCGATGCCTCGGCTGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTTAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCTACCATTAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAGAATGTGATAGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGTTGTAGTGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAGGAATATATGGCTCGTACAGATGCCGCTATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGTCAGCGAGCTAATGAGCTGAAGGCAAAAACTCAAGGGAAACTTCCTGCGGATACTGAACACCCTAAAAGGGAAGGTAAGGAACAGGTACATGCAGTAACTCTAAGAAGTGATAAGCCACTAGAAGAGAGAAAGAAACCTAGTAAACCCCAGGATGTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAGAGAGTTGGAGTCTGGTAAAGACTTTGCATCTGAAATGGCTAAAACAAGAGGCCGTAAAGAAAAAGATGTTGAGGAAGAGGAAGTGCCGATTACCCCTGAGGCACCGAAGACAAAAGCAAAGAGAAGAAAGACGCCGGAAGAGAGGGAAGCTAAGAGGCGAAGAAGACAACAGCGAGCAGAGGTTGTGGAAATAGAACGGAAAGTGGTTGTAGATATTGTTGAAGAAGTGGTTGAGGTAGAACAACCAAAGGACCCTGAGGAAAAGAAAGATCCTGAACAAGAAGTCGTACCGGAGATTCCACGTCGTCGCCGCCGCAACCAAAAGGCAGGACGAATTAAGGTGATCAAGACATACACTCCATCTCCGTCGACGACAAAATCTGAGAAAGAAAATTCTGAAAAAGAAGAGGCTGAGAAGAAAGCGGAAGAAGAAACCTTGGCGAAGCAGCAAGAAGACAAGGGCAAAGGAGTTGCTGCAGCACAGACAGAGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGCACGCGCGCTTCGTCAACGATCTTGCGCGAGCAAAATATTTAGAGATGTTGAAAAGGGATTTTCTGTTCGAAATGGGATTCGGTGATGATCTGCAGCATTTCTTAAGAGCTGGAATTTCAAAGCATGGTTGGGATCAATTTTGCGCAAAACCAGAGCCGTGGCATTTGACAAAAACAGAGAAGAGAACCTTTCAAGCGGCCTATCTTAAGAGTGAAGCCAACAGTTGGTTGAGATTCATCAAACTGCGCCTGCTACCAACAACTCACGACTCTACTGTCTCCCGTGATCGAGTTCTTCTGGTATTTGCTATTCTGAGGTCATTAAGTATAGATGTTGGAAAAATTATCTCCAGTGAAATACATTCTTGTTGGAGGAAGAAGGTGGGTAAGCTATGTTTCCCGAACACAATAACTATGTTGTGTCAAAGAGCTGGGGTTCCCATGAGTGCAGAGGATGTTATTCTGATGGATAAGGGAATAATAGACACACCAAACCTGGCAAGGCTTCAGAGGACTCAGGAAGCACGCCAAGGCGGTTTGGTGTGTGGCATCTACCAAATTCAAGAACATATGCAAACGCATTCCAGCAGAACGGAGTTTGCCGAAAGGCAATTCCAAACTTTATGGAATTATGTTAAGAGAAGGGATGCCACGTTGAAGAGGGCTTTGCAATCTAATTTTTCCAAATCGTATCCGGCCTTCCCAGTATTCCCTGATGATCTATTGAACCAGTGGATCCCACCACCGCCAATAGAAAGAGAAGGAGATGAGGAGGAGGACGCTGAAACCTTTTGCTTGAACATTTCTTCTAGCCTGGTCATCGCTCGGCAAGAAGATTCTGAGGTAGTATTGGCTTATTTGATCCACCTTAAGCTTAATTATACAGTGCTTGGTTTTGCAGAATGCTCAGAATATAATGCTGAGCGACTTGAGGGAGCAAAATCCGTGTTCCAGCAAAGCATGGAGCAAAACTGCCACGTAAAATGTGCATAA

Coding sequence (CDS)

ATGAGCGATTCGCCTGGGGTAAGAGATGGAGCAAAAGCATGGTTAAATTCTTTTGCTCCAGAATCAATTAGGACATGGAATGAGTTAGCGGAAAAATTTCTTAGTAAGTATTTTTCACCAACTAGGAATGCCAAGTTAAGGAGTGAGATAGAGAGATTTAGGCAACTTGAAGATAAAACTTTTAGTGAGGCTTGGGAAAGGTTTAAGGAGCTTTTGCGAAAGTGTCCCCACCATGGTTTACCTCATTGTATTCAAATGGAAACATTTTACAATGGATTAAACATGACAACCCAAAGCATGGTCGATGCCTCGGCTGGAGGGGCCCTTTTGGCAAAAACCTTTAATGAAGCCCATGAAATTTTAGAAAGAATATCAACCAATAGTTGTCAGTGGTCGGATGTTAGAGGCTTAAATAAAAAGGTTAAGAGTGTGTTAGAGGTTGATGGTGTGTCTACCATTAGGGTTGATATTGCAATGTTAGCTAACGCTCTTAAGAATGTGATAGTGGTTAGTCATCAGCAGCCGCCAGCTGTGGAGCCTGTTGTAGTGCAAAATAAGCAGGCTTTGCCCCAGCAAAATTCAGAGAGTTCTCTTGAGGCAATGATGAAGGAATATATGGCTCGTACAGATGCCGCTATTCAAAGTAATCAAGCTTCAATGAGAGCCCTGGAATTGCAAATGGGTCAGCGAGCTAATGAGCTGAAGGCAAAAACTCAAGGGAAACTTCCTGCGGATACTGAACACCCTAAAAGGGAAGGTAAGGAACAGGTACATGCAGTAACTCTAAGAAGTGATAAGCCACTAGAAGAGAGAAAGAAACCTAGTAAACCCCAGGATGTAGAGAAGAATAGTGATAAAAATGTTGTTGTTGAGAGAGAGTTGGAGTCTGGTAAAGACTTTGCATCTGAAATGGCTAAAACAAGAGGCCGTAAAGAAAAAGATGTTGAGGAAGAGGAAGTGCCGATTACCCCTGAGGCACCGAAGACAAAAGCAAAGAGAAGAAAGACGCCGGAAGAGAGGGAAGCTAAGAGGCGAAGAAGACAACAGCGAGCAGAGGTTGTGGAAATAGAACGGAAAGTGGTTGTAGATATTGTTGAAGAAGTGGTTGAGGTAGAACAACCAAAGGACCCTGAGGAAAAGAAAGATCCTGAACAAGAAGTCGTACCGGAGATTCCACGTCGTCGCCGCCGCAACCAAAAGGCAGGACGAATTAAGGTGATCAAGACATACACTCCATCTCCGTCGACGACAAAATCTGAGAAAGAAAATTCTGAAAAAGAAGAGGCTGAGAAGAAAGCGGAAGAAGAAACCTTGGCGAAGCAGCAAGAAGACAAGGGCAAAGGAGTTGCTGCAGCACAGACAGAGACAGAAGAGACTGACGTTGAGGAACCGAGTCTGCCGCACGCGCGCTTCGTCAACGATCTTGCGCGAGCAAAATATTTAGAGATGTTGAAAAGGGATTTTCTGTTCGAAATGGGATTCGGTGATGATCTGCAGCATTTCTTAAGAGCTGGAATTTCAAAGCATGGTTGGGATCAATTTTGCGCAAAACCAGAGCCGTGGCATTTGACAAAAACAGAGAAGAGAACCTTTCAAGCGGCCTATCTTAAGAGTGAAGCCAACAGTTGGTTGAGATTCATCAAACTGCGCCTGCTACCAACAACTCACGACTCTACTGTCTCCCGTGATCGAGTTCTTCTGGTATTTGCTATTCTGAGGTCATTAAGTATAGATGTTGGAAAAATTATCTCCAGTGAAATACATTCTTGTTGGAGGAAGAAGGTGGGTAAGCTATGTTTCCCGAACACAATAACTATGTTGTGTCAAAGAGCTGGGGTTCCCATGAGTGCAGAGGATGTTATTCTGATGGATAAGGGAATAATAGACACACCAAACCTGGCAAGGCTTCAGAGGACTCAGGAAGCACGCCAAGGCGGTTTGGTGTGTGGCATCTACCAAATTCAAGAACATATGCAAACGCATTCCAGCAGAACGGAGTTTGCCGAAAGGCAATTCCAAACTTTATGGAATTATGTTAAGAGAAGGGATGCCACGTTGAAGAGGGCTTTGCAATCTAATTTTTCCAAATCGTATCCGGCCTTCCCAGTATTCCCTGATGATCTATTGAACCAGTGGATCCCACCACCGCCAATAGAAAGAGAAGGAGATGAGGAGGAGGACGCTGAAACCTTTTGCTTGAACATTTCTTCTAGCCTGGTCATCGCTCGGCAAGAAGATTCTGAGGTAGTATTGGCTTATTTGATCCACCTTAAGCTTAATTATACAGTGCTTGGTTTTGCAGAATGCTCAGAATATAATGCTGAGCGACTTGAGGGAGCAAAATCCGTGTTCCAGCAAAGCATGGAGCAAAACTGCCACGTAAAATGTGCATAA

Protein sequence

MSDSPGVRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERISTNSCQWSDVRGLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAVEPVVVQNKQALPQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKNVVVERELESGKDFASEMAKTRGRKEKDVEEEEVPITPEAPKTKAKRRKTPEEREAKRRRRQQRAEVVEIERKVVVDIVEEVVEVEQPKDPEEKKDPEQEVVPEIPRRRRRNQKAGRIKVIKTYTPSPSTTKSEKENSEKEEAEKKAEEETLAKQQEDKGKGVAAAQTETEETDVEEPSLPHARFVNDLARAKYLEMLKRDFLFEMGFGDDLQHFLRAGISKHGWDQFCAKPEPWHLTKTEKRTFQAAYLKSEANSWLRFIKLRLLPTTHDSTVSRDRVLLVFAILRSLSIDVGKIISSEIHSCWRKKVGKLCFPNTITMLCQRAGVPMSAEDVILMDKGIIDTPNLARLQRTQEARQGGLVCGIYQIQEHMQTHSSRTEFAERQFQTLWNYVKRRDATLKRALQSNFSKSYPAFPVFPDDLLNQWIPPPPIEREGDEEEDAETFCLNISSSLVIARQEDSEVVLAYLIHLKLNYTVLGFAECSEYNAERLEGAKSVFQQSMEQNCHVKCA
Homology
BLAST of Spg035607.1 vs. NCBI nr
Match: XP_030494802.1 (uncharacterized protein LOC115710583 [Cannabis sativa])

HSP 1 Score: 269.2 bits (687), Expect = 1.1e-67
Identity = 160/357 (44.82%), Postives = 215/357 (60.22%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+Q ED+T S+AWE
Sbjct: 39  LRDRARAWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQSEDETTSDAWE 98

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKE+LRKCPHHG+PHCIQ+ETFYNGLN  ++ ++DASA GA+L+K++NEA EILERI++
Sbjct: 99  RFKEVLRKCPHHGIPHCIQLETFYNGLNAASRMVLDASANGAILSKSYNEAFEILERIAS 158

Query: 127 NSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV------- 186
           N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP A        
Sbjct: 159 NNYQWSTNRAPTSRKVAGVLEVDALTALTAQMASMTNILKNMNMGGSVQPAATIQRAEIS 218

Query: 187 ---------------EPVVV-------------QNKQAL---------------PQQNSE 246
                           P  V             Q KQ+                PQ +  
Sbjct: 219 CVYCGDGHTFENCPSNPASVCYVGASSSGAEQAQGKQSFPPGFSQQPRPQQPHQPQGSQT 278

Query: 247 SSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLPADTEHPKREGKE 306
           SSLE++M++YMA+ DA IQS  AS++ LE+Q+GQ AN+LK + QG LP+DTE+P+R+GKE
Sbjct: 279 SSLESLMRDYMAKNDAVIQSQAASLQNLEIQLGQLANDLKNRPQGTLPSDTENPRRDGKE 338

BLAST of Spg035607.1 vs. NCBI nr
Match: XP_030503898.1 (uncharacterized protein LOC115719117 [Cannabis sativa])

HSP 1 Score: 267.7 bits (683), Expect = 3.2e-67
Identity = 155/345 (44.93%), Postives = 208/345 (60.29%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED+T S+AWE
Sbjct: 90  LRDRARAWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDETTSDAWE 149

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKELLRKCPHHG+PHCIQ+ETFYNGLN  ++ ++DASA GA+L+K++NEA EILERI++
Sbjct: 150 RFKELLRKCPHHGIPHCIQLETFYNGLNAASRMVLDASANGAILSKSYNEAFEILERIAS 209

Query: 127 NSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAVEPVVV-- 186
           N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP       +  
Sbjct: 210 NNYQWSTNRAPTSRKVAGVLEVDALTALTAQMASMTNILKNMNMGGSVQPARHSKGEISS 269

Query: 187 ---------------------------------------------QNKQAL--------- 246
                                                        Q KQ+          
Sbjct: 270 FLCYVGNQNFNRNNNPYSNSYNPAWKHHPNFSWGGQGASSSGAQGQGKQSFPPGFSQQPR 329

Query: 247 ------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQMGQRANELKAKTQGKLP 286
                 PQ +  SSLE++M++YMA+ DA IQS  AS+R LE+Q+GQ AN+LK + QG LP
Sbjct: 330 PQQPHQPQGSQTSSLESLMRDYMAKNDAVIQSQAASLRNLEVQLGQLANDLKNRPQGTLP 389

BLAST of Spg035607.1 vs. NCBI nr
Match: XP_030507648.1 (uncharacterized protein LOC115722545 [Cannabis sativa])

HSP 1 Score: 262.7 bits (670), Expect = 1.0e-65
Identity = 155/342 (45.32%), Postives = 203/342 (59.36%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED+T S+AWE
Sbjct: 52  LRDRARAWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDETTSDAWE 111

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKELLRKCPHHG+PHCIQ+ETFYNGLN+ T+ ++DASA GA+L+K++NEA EILERI++
Sbjct: 112 RFKELLRKCPHHGIPHCIQLETFYNGLNL-TRMVLDASANGAILSKSYNEAFEILERIAS 171

Query: 127 NSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV------- 186
           N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP A        
Sbjct: 172 NNYQWSTNRAPTSRKVAGVLEVDALTALTAQMASMTNILKNMNMGGSVQPAAAIQRAEIS 231

Query: 187 ---------------------------------------EPV------------------ 246
                                                   P                   
Sbjct: 232 CVYCGDGHTFENCPSNLASVCYVGNQNFNRNNNPYSNSYNPAWKHHPNFSWGGQGASSSG 291

Query: 247 --VVQNKQAL------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQM 270
               Q KQ+             PQ +  SSLE++M++YMA+ DA IQS  AS+R LE+Q+
Sbjct: 292 AQQAQGKQSFPPGFSQQPRSHQPQGSQTSSLESLMRDYMAKNDAVIQSQAASLRNLEVQL 351

BLAST of Spg035607.1 vs. NCBI nr
Match: XP_030505184.1 (uncharacterized protein LOC115720166 [Cannabis sativa])

HSP 1 Score: 262.7 bits (670), Expect = 1.0e-65
Identity = 161/381 (42.26%), Postives = 220/381 (57.74%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A++WLN+ +P+S+  WN+ AEKFL KYF PTRNAK RSEI  F QLED++ S+AWE
Sbjct: 110 LRDRARSWLNTLSPDSVTNWNDFAEKFLRKYFPPTRNAKFRSEIMSFHQLEDESASDAWE 169

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKELLRKCPHHG+PHCIQMETFYNGLN T+Q ++DASA GA+L+K++NEA EILE I++
Sbjct: 170 RFKELLRKCPHHGIPHCIQMETFYNGLNATSQMVLDASANGAILSKSYNEAFEILETIAS 229

Query: 127 NSCQWSDVRGL-NKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQ--QPPAV----- 186
           N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+ + + +  QP A      
Sbjct: 230 NNYQWSNTRAPGSRKVAGVLEVDAITALTTQMASMTNVLKNLSIGNSKNIQPAAAIQSDD 289

Query: 187 -----------------EPVVV-------------------------------------- 246
                             P  V                                      
Sbjct: 290 VSCVFCREGHAFEKCPSNPESVCYMGNQNFNRNNGAFSNSYNQAWKNHPNLSWGSRSKLK 349

Query: 247 ---QNKQALP-------------QQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQM 306
              Q +QA P             Q +  SSLE++M++YMA+ DA IQS  A +R LELQ+
Sbjct: 350 HFDQGRQAYPPGFSQQLRHPQHAQNSQPSSLESLMRDYMAKNDAVIQSQAAFLRNLELQL 409

Query: 307 GQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDKN 309
           G  ANELKA+ QG LP+DTE+P+R+GKEQ  ++ LRS K L+        ++  K S + 
Sbjct: 410 GHLANELKARPQGSLPSDTENPRRDGKEQCKSIHLRSGKHLK------NSEEEIKGSGEP 469

BLAST of Spg035607.1 vs. NCBI nr
Match: XP_030509259.1 (uncharacterized protein LOC115723937 [Cannabis sativa])

HSP 1 Score: 261.5 bits (667), Expect = 2.3e-65
Identity = 163/380 (42.89%), Postives = 216/380 (56.84%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A+AWLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED+T S+AWE
Sbjct: 117 LRDRARAWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDETTSDAWE 176

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKELLRKCPHHG+PHCIQ+ETFYNGLN  ++ ++DASA GA+L+K++NEA EILERI++
Sbjct: 177 RFKELLRKCPHHGIPHCIQLETFYNGLNAASRMVLDASANGAILSKSYNEAFEILERIAS 236

Query: 127 NSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPAV------- 186
           N+ QWS  R   ++KV  VLEVD ++ +   +A + N LKN+ +    QP A        
Sbjct: 237 NNYQWSTNRAPTSRKVAGVLEVDALTALTAQMASMTNILKNMNMGGSVQPAAAIQRAEIS 296

Query: 187 ---------------EPVVV---------------------------------------- 246
                           P  V                                        
Sbjct: 297 CVYCGDGHTFENCPSNPASVCYVGNQNFNRNNNPYSNSYNPAWKHHPNFSWGGQGASSSG 356

Query: 247 ----QNKQAL---------------PQQNSESSLEAMMKEYMARTDAAIQSNQASMRALE 295
               Q KQ+                PQ +  SSLE++M++YMA+ DA IQS  AS+R LE
Sbjct: 357 AQQAQGKQSFPPGFSQQPRPQQPHQPQGSQTSSLESLMRDYMAKNDAVIQSQAASLRNLE 416

BLAST of Spg035607.1 vs. ExPASy TrEMBL
Match: A0A6J1G7Q6 (uncharacterized protein LOC111451598 OS=Cucurbita moschata OX=3662 GN=LOC111451598 PE=4 SV=1)

HSP 1 Score: 237.7 bits (605), Expect = 1.7e-58
Identity = 148/351 (42.17%), Postives = 189/351 (53.85%), Query Frame = 0

Query: 4   SPGVRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSE 63
           S  +RDGAK+WLN  A   I +WN LAEKFL KYF PTR+A+ R+EI  F++ E++T SE
Sbjct: 101 SYSLRDGAKSWLNILALGIIDSWNSLAEKFLFKYFPPTRSARFRNEIVAFQKFENETLSE 160

Query: 64  AWERFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILER 123
           AWERFKE LRKCPHHGLPHCIQ+ETFYNGLN  T+ +VDASA G +L+KT+NEA+EILER
Sbjct: 161 AWERFKETLRKCPHHGLPHCIQIETFYNGLNTATKQVVDASANGDILSKTYNEAYEILER 220

Query: 124 ISTNSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSH---QQPPAVE 183
           I++N+CQW DVR    KK + VLEVD +S+I   +A + N L+N+        + P    
Sbjct: 221 IASNNCQWVDVRSNPGKKTREVLEVDALSSINAQLASMTNILQNLAFGQGSMIKAPAHTA 280

Query: 184 PVVVQ------------------------------------------------------- 243
            V++Q                                                       
Sbjct: 281 TVMIQTATESCVYCGEKHTFDQCPSNPASIFYVGNQASQGNPKTNPSSNTYNPGWRNHPN 340

Query: 244 ---------NKQALPQQN-------------------------------SESSLEAMMKE 256
                    N+Q  P+ N                               S + LE+++KE
Sbjct: 341 FLCKGQGSYNQQMPPKANYPPGFGLQNQLTYDSQQATTQGEGTSQAQHISGTLLESLIKE 400

BLAST of Spg035607.1 vs. ExPASy TrEMBL
Match: A0A6J1H7E4 (uncharacterized protein LOC111461168 OS=Cucurbita moschata OX=3662 GN=LOC111461168 PE=4 SV=1)

HSP 1 Score: 232.3 bits (591), Expect = 7.3e-57
Identity = 107/162 (66.05%), Postives = 135/162 (83.33%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RDGAK+WLN+ AP +I +WN LAEKFL KYF PTRNA+ R+EI  F+Q ED+T SEAWE
Sbjct: 125 LRDGAKSWLNTLAPRTIDSWNSLAEKFLIKYFPPTRNARFRNEIVAFQQFEDETLSEAWE 184

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKE+LRKCPHHGLPHCIQMETFYNGLN+ T+ +VDASA GA+L+KT+NEA+EILERI++
Sbjct: 185 RFKEMLRKCPHHGLPHCIQMETFYNGLNIATKQVVDASANGAMLSKTYNEAYEILERIAS 244

Query: 127 NSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNV 168
           N+CQW+DVR    KK + VLEVD +S+I   +A + N L+N+
Sbjct: 245 NNCQWADVRSNPGKKTRGVLEVDALSSINAQLASVTNILQNL 286

BLAST of Spg035607.1 vs. ExPASy TrEMBL
Match: A0A6J1EEI2 (uncharacterized protein LOC111433394 OS=Cucurbita moschata OX=3662 GN=LOC111433394 PE=4 SV=1)

HSP 1 Score: 221.5 bits (563), Expect = 1.3e-53
Identity = 110/184 (59.78%), Postives = 140/184 (76.09%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RDGAK+WLN+ A  +I +WN L EKFL KYF PTRNA+ R+EI  F+Q ED T SEAWE
Sbjct: 134 LRDGAKSWLNTLALGTIDSWNSLVEKFLIKYFPPTRNARFRNEIVVFQQFEDDTLSEAWE 193

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKE+LRKCPHHGLPHCIQMETFYNGLN+ T+ +VDASA GA+L+KT+NEA+EILERI++
Sbjct: 194 RFKEMLRKCPHHGLPHCIQMETFYNGLNIATKQVVDASANGAILSKTYNEAYEILERIAS 253

Query: 127 NSCQWSDVR-GLNKKVKSVLEVDGVSTIRVDIAMLANALKNVIVVSHQQPPA-VEPVVVQ 186
           N+CQW+DVR    +K + VLEVD +S+I   +A + N L+N+ +       A V  V V 
Sbjct: 254 NNCQWADVRSNPGRKTRGVLEVDALSSINAQLASVTNILQNLALGQDSMIKAPVHTVAVI 313

Query: 187 NKQA 189
           N+ A
Sbjct: 314 NQTA 317

BLAST of Spg035607.1 vs. ExPASy TrEMBL
Match: A0A5B6VWJ0 (Retroelement pol polyprotein-like OS=Gossypium australe OX=47621 GN=EPI10_024080 PE=4 SV=1)

HSP 1 Score: 214.9 bits (546), Expect = 1.2e-51
Identity = 162/463 (34.99%), Postives = 232/463 (50.11%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A+AWLNS  P SI TW ELAE+FL KYF P++NAKLR+EI  F  ++D++  EAWE
Sbjct: 10  LRDRARAWLNSLPPNSISTWQELAERFLVKYFLPSKNAKLRNEITAFHHMDDESLYEAWE 69

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKELL+KCPHHG+PHCIQ+ETFYNGL   T+ +VDASA GALL+K++NEA+EI+ERI++
Sbjct: 70  RFKELLQKCPHHGIPHCIQLETFYNGLKAHTRMVVDASANGALLSKSYNEAYEIIERIAS 129

Query: 127 NSCQWSDVRGLN-KKVKSVLEVDGVSTIRVDIAMLANALKNVI----------------- 186
           N+ QW   R  + ++V  + EVD ++++   ++ +++  KN+                  
Sbjct: 130 NNYQWPTSRAASGRRVAGIHEVDAITSLASQVSSISSMTKNLTTNGSNSFAAQPPNQFEN 189

Query: 187 ------------------------------------------------------------ 246
                                                                       
Sbjct: 190 IAYVYCGEGHLLEECPSNPESVYYMGNQNQNRGRQGLQSNFYNSSWRNHLDFSWSNQGAG 249

Query: 247 --VVSHQQPPAVEPVVVQNKQALPQQNSESSLEAMMKEYMARTDAAIQSNQASMRALELQ 306
              V  Q  P   P   Q  Q L Q  + +SLE+++K YMA+ DA IQS  A+++ LE Q
Sbjct: 250 TSTVYTQPRPTQLPNFPQQVQKLVQAKASNSLESLLKTYMAKNDALIQSQAATLKNLENQ 309

Query: 307 MGQRANELKAKTQGKLPADTEHPKREGKEQVHAVTLRSDKPLEERKKPSKPQDVEKNSDK 366
           +GQ A EL+ + QG LP+DTE+P+  GKE   A+TLRS+K +E       P  VE     
Sbjct: 310 VGQLATELRNRLQGALPSDTENPRNLGKEHCKALTLRSEKIIE-------PNTVE----- 369

Query: 367 NVVVERELESGKDFASEMAKTRGRKEKDVEEEEVPITPEAPKTKAKRRKTPEEREAKRRR 390
              VE+E  + +D            E+     E P++PE   TK      P++  +    
Sbjct: 370 ---VEKEQANAQD-----------AEEVQPSVETPVSPEPKSTK------PDKVTSGPVN 429

BLAST of Spg035607.1 vs. ExPASy TrEMBL
Match: U5CUI2 (Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMTR_s04947p00003620 PE=4 SV=1)

HSP 1 Score: 214.2 bits (544), Expect = 2.0e-51
Identity = 98/164 (59.76%), Postives = 131/164 (79.88%), Query Frame = 0

Query: 7   VRDGAKAWLNSFAPESIRTWNELAEKFLSKYFSPTRNAKLRSEIERFRQLEDKTFSEAWE 66
           +RD A++WLN+  P+S+  WN+LAEKFL KYF PTRNAK RSEI  F+QLED++ S+AWE
Sbjct: 105 LRDRARSWLNTLPPDSVTNWNDLAEKFLRKYFPPTRNAKFRSEIMSFQQLEDESTSDAWE 164

Query: 67  RFKELLRKCPHHGLPHCIQMETFYNGLNMTTQSMVDASAGGALLAKTFNEAHEILERIST 126
           RFKELLRKCPHHG+PHCIQMETFYNGLN  ++ ++DASA GA+L+K++NEA EILE I++
Sbjct: 165 RFKELLRKCPHHGIPHCIQMETFYNGLNAASRMVLDASANGAILSKSYNEAFEILETIAS 224

Query: 127 NSCQWSDVRG-LNKKVKSVLEVDGVSTIRVDIAMLANALKNVIV 170
           N+ QWS+ R   ++KV  VLEVD ++ +   +A + N LKN+ +
Sbjct: 225 NNYQWSNTRAPTSRKVAGVLEVDAITALTAQMASMTNVLKNLSI 268

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_030494802.11.1e-6744.82uncharacterized protein LOC115710583 [Cannabis sativa][more]
XP_030503898.13.2e-6744.93uncharacterized protein LOC115719117 [Cannabis sativa][more]
XP_030507648.11.0e-6545.32uncharacterized protein LOC115722545 [Cannabis sativa][more]
XP_030505184.11.0e-6542.26uncharacterized protein LOC115720166 [Cannabis sativa][more]
XP_030509259.12.3e-6542.89uncharacterized protein LOC115723937 [Cannabis sativa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1G7Q61.7e-5842.17uncharacterized protein LOC111451598 OS=Cucurbita moschata OX=3662 GN=LOC1114515... [more]
A0A6J1H7E47.3e-5766.05uncharacterized protein LOC111461168 OS=Cucurbita moschata OX=3662 GN=LOC1114611... [more]
A0A6J1EEI21.3e-5359.78uncharacterized protein LOC111433394 OS=Cucurbita moschata OX=3662 GN=LOC1114333... [more]
A0A5B6VWJ01.2e-5134.99Retroelement pol polyprotein-like OS=Gossypium australe OX=47621 GN=EPI10_024080... [more]
U5CUI22.0e-5159.76Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMT... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 416..444
NoneNo IPR availableCOILSCoilCoilcoord: 339..359
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 419..443
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 240..353
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 233..353
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 368..395
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 368..465
NoneNo IPR availablePANTHERPTHR24559:SF334SUBFAMILY NOT NAMEDcoord: 8..143
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 8..143
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 8..94
e-value: 1.3E-18
score: 67.1

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
Spg035607Spg035607gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1873692..1873714CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1874195..1874720CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1874973..1875318CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1880022..1880282CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1880376..1880779CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1880999..1881623CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1881936..1882034CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1882722..1882748CDS
Spg035607.1-cdsSpg035607.1-cds-scaffold3:1883021..1883103CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
Spg035607.1Spg035607.1-proteinpolypeptide