Spg029397 (gene) Sponge gourd (cylindrica) v1

Overview
Name: Spg029397
Type: gene
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: nuclear intron maturase 4, mitochondrial isoform X2
Location: scaffold12: 35565438 .. 35581156 (-)
RNA-Seq Expression: Spg029397
Synteny: Spg029397
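
The location field can be split into its parts and turned into a span length. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a 'scaffold: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = m.groups()
    return scaffold, int(start), int(end), strand

scaffold, start, end, strand = parse_location("scaffold12: 35565438 .. 35581156 (-)")

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(scaffold, strand, span)
```

Under that assumption the feature spans 15,719 bp on the minus strand of scaffold12.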
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GAAAATCAAGTTCAGAAACACCCTGTCAAGCTATGTGGCGGTTGCCGTGATTGGAAAGGAATGTCGAATGAGGCGATGCAGGTTGATAACCGACGGGACCATGATGTCGCGATACAAATTATGGCCAGAAAATTCAAAGGGAGGGAGCCAATTTTCAATTCTAGTTTCAAAAAAGGGAATATAGGAGAGGGGAAAGGGAATAAGGAGAAAGGGAAGAGAGAGAAAGGGGAAGAGGGAAAAAAGAAAAGAAAATGACATGTAATCTAAAAAGAATAATATAATTATTAAATTAGTCACCTCAATCACATCATTTTTTGTCTGGTAGTCAATTAACAATAAAAAAAACAGTTAGCACTATTTTTCAAACCTAAGGACTAAGGGGCCGTTTGGCCCACGGGTTTGGAATGGATTGGAAAGGGATGCCCATGTTTGGTTGCCCTGGTGTGGGTTTGGAATGAAATGGTGGGCATGTGATTCCAACCCATTCCCACATTTTTTTGAGTCCCACTATTTCTCTCTCCATTTTTCTTCATAATTTTGATGGGACCCACCATTTCTTTTTCTATTTTAATCACATTTTGGTGGGTCCCACTATTTCTCCATCCATTAATTCCAACCCAAATCCAATGCACCAAACACAGGCATGAGAATCATGATTCCAGCAAAATATTCCCATGGGCATCAGATTCTATTCCAAACCCATTCAATTCCGGCGCACCAAACGGCTCTAAAGGCTAACAAGAGCATAACTCAACTAATGTACTAATAACCACGAGATTTGTGTTTCTAATCTCCCACCCCATATTGTTAAAGAAAAAAAAAAAAACCTAAGGACTAAAGTGTATTTTAGTCTATAAATTAAAAAAAGGAAAAAAGGAAAAAGAAGCTTGGCCCTAAACCCATCTTCTTCTTCCCTTCTCCTTATCCACGCCCACCGAAATCACGAGCGGCGCGATTTTTCCGGCAACATTGTGCGTCGACGACAGTTGCACCCCTCTTCTATTTCTTTTTCCTCGACGAGCCACACACGAGGGATCCTATGGCGGTGTCTCGGCGGCGTCTTGACTGCGCGCAAGGCTGCAACAAGTGCTGGTCGTATCGTTCAACGTGCGGCATATTCAACATTTGAACTTTTCTACTCCGATGAAGCGTTTTCCGTTGGTTCCAGCAGCACCCACGCGTTTTCGGTAAGGTTTGGCTTCGACCCACTCTATTTCCGCAAGTTTAAGCACACCCATTTTGTTTTGGATCCCAAAATCACTTTTTAAATCATGACCCACAGTCTATCAACTTGTAATTTGGATTACACACTCCCTACTGACATCGAATTTGCAGGTTTAAGCCTTCTTGAACGCCCATCAGCCAGCTCTTGGACTTGGGTATAACTTTTAGTGGCTACCCACAACCTGTTTAGTCCCAATTTGGTTTACCCATAGCCCGAAAACTTCATAATTGTATTTTCAGACACATTCAATCTCGTTCCAGCATTCGTTCGGACTTGTTGGGAAAAGTTCACGTAGTTTCGTACGCCTTTTAGACTATTTCGGATTGGATTAATTTAGTTCAGACCATTCCCTAAAACGTGTAAGGCTCTTGTTTGCAGTTTCGAACCAATTCAGTCTCATTCAACCCTATTCAAATACATTCCATCAAAGGCGCAAGCTTTTGATAAGTCTTAAGATGTTTCAAATTAGTTCAGAGTGTTCGCATTAGCAGTTAAGGTTGTTTTTGGGTAAGAACAACCTTTTGGAGCACTCTTTTTGCTTTAGTTAAGCTTTTTGAATCAAGTTCTATCATTTAAGCGTTGTCTTTGTTGTTTAGAAGTTTTTTGGATTAATTTAGGAGCGCTTTCAAACGATAGTAAGTGGAATTCTGCTTGAAGAGTCTCTTTGTTGGGTGTTAGCATGTTGTGTGTTTACTTGTTTAGTTATATTATATGCTGTGTATATGTCGTAGTATCTCAGAATTCTACCTCCAAGTCACCTTCCTTAGAGATCAGA
TTTGTAATTTTCCCATGAATACATGCTTCGAATACGAATTCCAGAACCGTAACAAGTGTTCATTGGAAGTGGATTTCTTCAAGTACACCGTATTTTGTTATAATCAATGTCTCTGTTTAACTCGAATCCTGGAGATTGGTGTCCTATTAATGGGGATATGGCTGTTGATTTAGAAAAAGAATTTACAGAGGAGGAAGTTTATTGTGCGGTCAAATCCTTGGGTGTATGTAAATCTCCAGGCCCTGGTAGCTGAATTCTTTAAGCATTCTTGGCACATTATTAAATCTGATATTACGACCATGATCAGAGAATTTCATAACTCGGGTGTTATTAATGCAACCTTAAATGAAACCTATATTTGCTTAATCCCAAAGAAACTAGTTGCTAAATCAGTGTCAGATTATCGTCCTATTAGCCTTATCCCCCGTGCCTACAAGATTATTGCTAGAGTCTTGTCTAATAGATTAAAGTATGTTCTGCCTCATACTATTGCTATTGCTGAACATCAAATGGCCTTTGTAGCCAATAGACAAATCTTGGATTTAGCCAATGAAGTGATTGATGACTGGAATATCTCTAAAAAGGAAGATGTGATTCTTAAATTGGACCTGGAAAAGGCATTTGATAAAGTGGATTGAGACTTTCTAGATGCGGTCTTAAATGCTAAAGGTTTCGGCTTAAAATGGAGGAATTGGATTTGGGGTTGTATCTCTTCAATTAACCACTCGATAATTATCAATGGAAGGCCTCGAGGTAAAATTATTCCCTCAAGAGGTATTCGTCAAGGGGATTCGTTTTTCCAATTTCTTTTCATCATTGTGGCTGATTGTCTTAGTAGACTATTATCTCATGGAGTGCAGTCAGGAAAAATTATATCTCATCGGGTTGGTATCTCTTCTGTTAGATTGACTCATTTGCAATTTGATGATGATACATTATTGTTCTCCATTTTTTATTTGAAAGCTCTTGAGAATTTGTTTGATCTTATTTAAATCTTTGAATTGGCTTCTAGATTGAATATCAATTTTGGTAAAAGTGAATTTTTTGGGAATCAACCTTGAAGACCATCAAATGGATTGGCTGAAGACAACTTTTGGATGTAAACAGGGTAATTGGCCTATAACATATCTTGGGGTTCCCTTAGGTAGTAATCCAAAGAATGTGTCCTTTTGGCAATCAGTGATTGAAAAAATTCAGAAAAAGCTTCATAGTTGGAAATATGCATTTATCTCAAAAGGTGGTAGACATACGCTCATTCAAGCTACTCTCTCAAGCATATCCATATACTATCTTTCCTCGTTTAAACTCCCGAACAAGGTTGCCAAAATTTTGAGGTTCTCTTTGGATGATATCATTGTTGGTAAATATTATTATGCTGATGATAATAATATTAAATGGCCAAACAAAATTTTGAGAGGTCCTTACAAATCACCTTGACATCATATCTGTTCTACAATTGATCTTATTGAAAATCAGATTAGAGTTCTTGGAAATGGTCATGATATATATTTTTGACATGACTGTTGGCTACAATGTGGAATTATTGCAGAAGCTTATCCAAAATTACCAGATCTACTGCTATGGTGGATCAAGTCTGGAAAAGTTCCAATGCGGCATGGGATTTGACACTTAGACGTAATCTAAATGACTTGGAGGTAAATGAATGGACTAGTCTCTCTCATATTCTTTCATCTGTTACGATACGATTAGTGAATGACTCATGGTGTCGGCCTCTTGATCAATCAAAGTGTTTTACTGCCAAGTCTCTCAAGTCTGATATGCTTACTTCCAGTGTTATTGAGGCGACCCCAGGCGCTCGCCTATGGCGAAAGGCGAGGTGCCCAGAGGGTTATGCGCCTCGTAGGTAGCTGGGGCGAGCAGATTCAATCAGGCGCTCGCCTTTTTGCGCCTTTGGTCGCCTGTCGTGTTTTTCGCCTGATTGAAGGTTGCCGTTTGTGCCTTTTTTTTTTTTGTTTTTTTTTATAATTTTAAATTATGTT
TAACATAACTTAAGAAACCCTAATAATATCCCACAAAAAAAGCCCATGAAGGCCATGCAGAGAGGGACGATTCAACTATTGCTTATTGGGCAAGTTGCATTTCTCATGCAAAAAGAAAAAACTCCACGGCGGCTGGCTTTCCACCTTCTTCGGCTTCTTTTGTTGGTTCTTCTTCAACTTTCAACTTCTTCTGCCAGTTTTTCTTCAACCTTTTGCTTCTTCGCTGGCTTTTCATCTTCTTCGGCGGTATTTTCATCTTCTTCAACTTCTTCTGTTGGTTTTTCAAGTTTTCATCTTCACGACTTCTTCGGTTGGTTTTTGTTCTTCATCTTCACAGTTTCAACTTTTCATCTTCTTCGGCGGTGTTTTCATCTTCACGGCTTCTTCTGTTGGCTTTTCAAGTTTTCATCTTCACGCCTTCTTTTGGTTCTTCATCTTCACGGTTTCTTCTTCATCTCCACAGTTTCAGCTTCTCTAGTTCTCCTTCTCCATCTCCTTCAACCGATTTCTTCTTGCTGAGGTAGGCCATAGGCTTTAGTTGTAATCTAGGAAGCACGGACACGCTAAAATGAAGGAAGAATCTGTGTCGGACACGCGTCGGACACGGATTCGTCCGGACACACTCCGGACACGTGTCGGACACGCAATTTTGCGTGTCCGAAAATATATTTATTTTTTTTAATTTCGGACACGCTGGGACACGACCCAGACACGCCCACATCAATATTTTAAAGATTTCTTTGGGCAGACTGTCTTTAATTAAAAAAAAAAAGAAAGAGAAAGATGAATAGAAAATGAAGCTGCCGTTCACGATTTCTTCACCATCACCACGCTAGGGCTTCTTCGCATTCTGAAGGTGAAAACTGAAAAGAAAACAAGTGAGGTTCTTCTTCTTCACAAATTCCGTTGACCCATTTGTTCTCTCTTCTTCACCGGTGCAGTCACCGTCCGACTTCACCACCACCGCCGTTTGCTGCCACTAGTCGGTTTGCTGCTACAGAAAGTCGCTTCTGTTTCTCTTCGTACCTTTGGTTTTTCTTTTCTTTTTTCTTTTTATATTTAAACCTCTAAATTAAAGAAAACCTCATGTTTAACATAGACAAATGGTTGTATAGGCCCAGCCCAATAAAAGTAGCATTTTAAGTTAAAAAAACAGGAAGGATGAAGTTAAACTACTATCAATAAAAATTTATATTTTAAAAAAATTAGTATATACTATAATTTTTTTAAAAAAAATTACATATATACTAAAAATTTTAAAAATTATATATATATACTAAAAAATAATAATAATAATAATAATAATAATAAACAGCGTATCCTCAACGTATCCGTATCTTCGTTTTTTAGAAATTGGCGTATCGCCGTGTCCCGTGTCCGTGTCCGTGCTTCTTAGGTTGTAATATATATATATATATATATATATATATATTCTCTAACTTTCTTGTTTGAAATTTGAATATTCCATTTCCATAGAATTGTTAAATTAAAAAAATTATATTATAATTATAATAGGAACCAGTGATTTCAAAGCGCAAGGCGCATCTCAAGGCGACAAAAAGACGACGCCTGAGTGAAGCGAGGCGCAATAAATAGAAACATAAATAAATAATATATATAGAGCATAAATTTCTATTAGTAAATATAGTTACATAAAGTGTCAAATTCTTATCCTCCTTACAAATTAATATTAAATAGTAAAAACATATAATGGCCAAAACATCAAACAAGAATCAATTTCTAGTAGTTCCAAAGAGACTTAAGAGTGAACAAAAACTAAAAGTCTAAAAGCCAAAAGTAAAACATTAAAGGTCAAAATCATCATCCTCCTCTCCTTCTAAATCTAAGGCCTCATCTACTGCTCCTCTTTGGCCATTATCCTCTTTAATATCCAACTCTTCTTCATTGTCGCCATCGTTGTCACTAACCACCTGTATAGTCGGTGGTCGACTTCTAGAAGTTGCACTTGCCGGTGACTTTCCTTTAGATCTTGTGTACTT
TAAGGGTTCTCTAACACCACTAGCACGAGCTACGTCTCCCCATGTGAGGTCTTCGTCATCAAAAACTAGCTCGTGATCTACTTCAGCTTCATCATTTTCCTCTTCAATTGTTCCGACTAGCCACTCATGGCTTTCATCAATATGATCTAGAGAAATTGGATCAAGTTGATCCTTCAGATCAAAGCGTTCTTTAAGTGCTTGGTTATATTTTATATAAACCAAATCATTTAAACGCTTTTGCTCTAATATGTTTCTCTTCTTCGAACGAATCTACAAAATTATTAACCAACATAGAGTTAAAAGTAAACTTTTAAATCTCAAATTAATAATGTAAAAAACAAATTATCTGCTTACGTGTTCAAAAACACTCCAATTGCGCTCACAACCGGAAGCCCTACACGTTAGATTAAGTACTCTCATGGCTAGCTGTTGTAAGATTGGAGTACTACCTCCATAAAGGGCCCAACATGATGTTGTTCACATATTTTGAAATGGTTTAGAACATTAAACTTTGGATTTGGGAATAACTTTAACTTTAACACAACACTATTTAAGTAATAAAGAACGTCATTACCTGGTGTCTTAATTTTCCTAGTTTTGATTGCCAAATTGATACCGAACATGCCTTTAGCTTCAGAATACAACTCTAATTCCACTATGCATTGATTTTGGTCACTAGTCGATGGTATCAATCGTTGCACAACTTTTAGCAATCCACTCATGACCTCAACATCTTGAATAATTCTTTCTTTGTGGTCATAGAAAAGAGATGAATTCAAATAATATCCAGCTGCATGTAATGGCCGATGAAGTTGGCAATCCCATCTTTTATCAATGATATTCCATATTGGTTAATACTAGGGGTGAGCATCATAGACCCTAATAATTTCGGTCGGAAACTGACCAGTCGGTCGGTCGGTGGGCCTAAAAAGGCCCAAACCGACCGATTTGTCTTCATTTAAAAAAAAAAAAAACGTAAAGTATAAAGGAAAAGCATGCAGTCTGCTTCTTTAGATTTCCAAGGTAAGAGAAAAGGTGCGTAGGTAGGTATAGGTAATGATTACCTAGGTAATCATTACTTTTTCTTTTACATTTTTTATTTATTGTTTCTCAAAGCTTGAACCCACATATACTTTCATCTTCTTCTCAGAACCCTCGATTTTCTTCTTCTCTTTCACACAGTCTCCGCCACTCTTCTGTGCCGTTGAAGCTCCGTCAAAGCTTCGATGTTTCTCTTCTCCTTCACACCACCGAAGCCAAACCTCACGCCGTTGCCCCCTCCATTGATGCTTTGATTCTCTCTATTGTTCCTCCCCTTCGACCTCCGTTGTGCCGCTCCTCCATTTCTTTTCAGTCAAAACCATCGGTTTAAACCGACCAAAAGTAAGAGATGAATTGAAGGAATACCAAATAAGATCAAAAGCAAAGAGGGAAGAAATCAATGCTCTGCCAAGCTACGATGCTTATGATTTGAACACAATTGATGAAGAGGATGAGATAGAAGTCATTGGTGGATCATGTTCAACGTGTGCAACATCTAGAAAGCGGCCAAGTATGAATAGTAGTGGAGTAGTATCTAGTGGAAGCCAATTTCAGTCACAACCAAAGCCACCACGACAAAAGGGAGCAATGGATAAATTTTTTTTATCCAAATCCGGCCAAGGTTGTTGATGATAGAAATAGATTGAAGAAGCTTAAGCAAACAAGCATCACTGAAAAATACAACAAAGAAGGAAGGGATGCTACTGTTCATGGTCTGGTTCCTCCTTCTTATCATGAGGCTAGAGTAACATGTTTGAAAAAGGAAGTTCAATTCACGAAAAACCTAATGAGCAGTCATGAGGTGGAATGGAAAAAAAATGGTTGTTCATTGATGTCAGATGGGTGGACGGATAGAAGAGATAGGACTTTAATAAACTTTTTAGTTCATTCTGCTGCTGGAACGTTGTTTTTAGAATCCATTGATGCATCTTCGTGCATCAAAATCGGAGAAAAGGTAT
TTGAACTCTTGGATGGAATGGTAGAGAAGATTGGAAAAGAAAATGTCATCCAAGTTGTTACCGATAATGCCTCAAATTATGTCCTAGCTGGAAAGTATTTGGAAGCAAAACGACCACATCTTTATTGGACACCATGTGCTGCTCACTGTTTAGATTTGATTTTAAAGGATATTGGAAAGATAATCCAAATCAAAACATGTATTAAAAGGGTTGTGGCGCTTAGTGGTTTTATTTATAACCACTTATATGTCCTAAACATGATGAGGGAGTTTACAAATCAGCATGAATTGGTGAGACCAGCTGTTACTCGTTTTGCAACCTCTTTTTTGACATTGACAAGTATTCATAGAAATAAATCCAGTTTGAGGAAGATGTTTATTTCTGAGAAATGGACAACTTCTAAATGAGCAAAGGATCCAAAGGGTAAGAGAGCAGCTAACACCATTTTGATGCCATCCTTTTGGAATTTAGTTGTCTATACTTCAAAGGCATCAAGACCATTAGTTCGAGTTCTTAGACTTGTTGATGGTGATAAACCTGCGATGAGATATATATGAGGCTATGGATAGAGCAAAAGAGGCCATTAAGACTGAGTTTAACAATAATGAAGTGAAGTATCAACTAGGGGTGAGCATCGGTCGGTCGGGGTCGATTTTTGGTCCCAAACCGACGCCGAACCGACCTCGTCGGTTTCCATCCTTCTGGAAACCAACTTTTTTTTTTGGGTCTAAATTTCGCCTACCAACTGACCGACCAGTCAATCGGTCGGTTTAAACCGATGGTTGTGACTGAAAAGAAATGGAGGAGCGGCACGACGGAGGTCGAAGGGGAGGAACAGTACAGCGAATCAAAGTGTCAATGGAGGGGGCGGCGTCGTGAGGTTTGGCTTCGGTTGTGTGAGGGAGAAGAGAAACATCGAAGCTTCAACGACACGACTTCAGCCTTCAATGGCACGACAAAGCTTCAACGACACGGAAGAGTGGCGACGGTTGTGTGAGGGAGAAGAAGAAAATCGAGGGTTCTGAGAAGAAGATGAAAGTATATGTGGTTTCAAGCTTTGAGAAACAATAAATAAAAAGTATTGATTACCTATACCTACCTACACGATTTTTTTTTTCCCTTCAATGAAGGCAAATCGGTCGGTTTGGGCCTTTTTAGGCCCACCGATCGACCGAAATTATTAGAAAACCGACCGACCGATGTCAGTTTGGTCGGTCGGATCGGTTTTTCGTTCTATGATGCTCACCCCTAGTATCAACAAATATGGGATATCATTGATAAAAGATGGGATTGCCAACTTCATCGGCCATTACATGCAGCTGGATATTATTTGAATCCATCTCTTTTCTATGACCACAAAGAAAGAATTATTCAAGATGTTGAGGTCATGAGTGGATTGCTTAAAGTTGTGCAACGATTGATACCATCGACTAGTGACCAAAATCGATGCACAGTGGAATTAGAGTTGTATTATGAAGCTAAAGGCATGTTCGGTATCGATTTGGCAATTAACCCTAGGAAAATTAAGACACCAGGTAATGACATTCTTTATTACTTAAATAGTGTTGTGTTAAAGTTAAAGTTATTCCAAAATCCAAAGTTTAGGCCCCGTTTGATAACCATTTAGTTTTTGGTTTTTTGTTTTTGAAAATTAGGCTTGTTTTCTCCAAAATTCCCTATCATGGTTTTCCACCTTGTTAAGGAACCATTTGAATTCCTTGCCAAATTTCAAAAACAAAAACAAGTTTTTGGAAACTACTTTTTTTGGTTTTCAAAATTTGACTTGGTTTTTGAAAACAAGAAACAAGGTAGATGCTAAAACAAAGAAACTCATGGGTAGGAATAGATGTGTATAAGCTTAATTTCCAAAAACCAAAAACCAAATGGTTATCAAACGGGGCCTTAATGTTCTAAACCATTTCAAAATATGTGAACAGCATCATGTTGGGCCCTTTATGGAGGTAGTACTCCAATCTTACAAAAGCTAGTCATGAGAGTAC
TTTATCTAACGTGTAGTGCTTCCGGTTGTGAGCGCAATTGGAGTGTTTTTGAACACGTAAGCAGATAATTTGTTTTTTACATTATTAATTTGAGATTTAAAAGTTAAATTTTAACTCTATGTTGGTTAATAATTTTGTAGATTCGTTCGAAGAAGAGAAACATATTAGAGCAAAAGCGTTTAAATGATTTGGTTTATATAAAATATAACCAAGCACTTAAAGAACGCTTTGATCTGAAGGATCAACTTGATCCAATTTCTCTAGATCATATTGATGAAAGCCATGAGTGGCTAGTCGGAACAGTTGAAGAGGAAAATGATGAAGCTGAAGTAGATCACGAGCTAGTTTTTGATGATGAAGACCTCACATGGGGAGACGTAGCTCGTGCTAGTGGTGTTAGAGAACCCTAAGTACACAAGATCTAAAGGAAAGTCACCGGCAAGTGCAACTTCTAGAAGTCGACCACCGACTATATAGGTGGTTAGTGACGACGATGAAGAAGAGGAGGATGATGATTTTGACCTTTAATGTTTTACTTTTTGCTTTTAGACTTTTAGGGTCCGTTTGGTTGGTGATCTGAAAATAGAAATTTGAAAACAAGGTATTCAATGAAAACAGAGTTGTATTTCATGTTTTCAGATATGTGTTTGGTAGCAGATTCAGAAATTGGATCCCAATTTAAACAAGTGTTCAAAATGTGTTTGATAATATATTTATAAACCATAAACTATTTGTAGTTATCAACTAAATACTAAGTTGAATATAAACCATTATTAATTAATTTTTAAACAGGTTATGTGTTATAATTTTTTTATAATTATTATTTTACAATATCATATAATTTATAATACATTATATTATACAAATTTTAATTTCGCAAAACTAAATTGAGTTTATAACATAAAATATTATATTTATTAAAGTTTTTATCTTTTTTAATTTAATTCTGTATTTAAAAACCTAAAATTCAAAATCTGGATACACTGAAAACATAAAAAAATTGTTTTCAGAATTTCTACTATTTGGATTACAGAATCCGAAAACAGTTTTCAAAAACACGTTTGCCAAAAACATATTCACTGAATTCAGTGAATCTGAAAACATAAAACAGAATCTAGATTGCCTACCAAACAAGCCCTAGTTTTTGTTCACTCTTAAGTCTCGTCTTCATGGTGTTTTGGAACTACTAGAAATTGATTCTTGTTTGATGTTTTGGCCATTACATGTTTTTACTATTTAATATTAATGTGTAAGGAGGATAAGAATATGACACTTTATGTAACTATATTTACTAATAGAAATTTATGCTCTATATATATTATTTATTTATGTTTCTATTTATTGCGCCTTGCTTCACTTGGGCGTCGCCTTTTTGTCGCCTCTCGCCTTGAGGCGATCAAGGGGCTTGTCGCCTTGAGGTGTGCCTTGTGCTTTGAAATCACTGCTTACTGCTTATGGTTCAACATCATCCAACTTTTACTTGAAGCTTTGGAAAGATGCCTATCCTAAGAAAATCAAAATTTTTTTATGGGAGCTTAGCCATGGAGGTATTAATACGGTAGATCGCTTGCAAAAGAGAATGCCTCATTTGACTCTCTCTCCTTGTTGTGTGATGTGTTATTCTGCTCCATTAAATCCTTGTCATCTCTTTGTTCATTGCTTCTATGCCTCTCAATACTGGCTAATCATGAAGAATGCCTTTGGTTGGTCCTTGATCTTACCCAACAATATATATGGAATTCTCGATTAGGTCTTTACGGGACACTCGTTTCATGGTGTGAAAAAGATCTTATGGTTATCTATTAACAAAGTATTCTTTTGGTATCTCTGGTGTGAGCGAAATGGAAGACTATTCAGGGACGTCTCCTCAACTTTTGATTCTTTTTGGGATTTAATTATCTTTCATGCTATGTATTGGTGTAAAAATAAACACCCTTTTAATGCTTATAGTCTCTCTACTCTACTTTGATTACTAACTGGAGATCTTTAGTGTAATTGTC
TTTGGACGTAAGGTGTTCTTTAGCATTCTTTATTTCATCCATCAATTAAATGAATTGTTTCTCTTCTAAAAAAAAATAATAAAAAAATTTATCTTCCGAAGATGCACACCATCTGTTTGTTAAAATGCCTCAGTTAGCAGTTGATATATGGGTTCTTCATGTGTTGTTTTGACTGTTCTTATTTCAGCAACGACATGTTCATGGAGTCATTAAACCCTGATGTTTTCCGATGGAGGGGTTTTTACTTCAGGAATTTTAGCGTTATGACAAGATATTTCATCATCCCTCTTGGTTGATAAAAGCTGAAGTAAGTTTGGATAATGGCAACTTATCGATTGTTTAGTGTGCTCACAGAAATTTCTCGTCTGATGGCAAGTTTCTAAATTACAGTTCTGATGCTATTCTCTCAATTTTCTTTACAGATATGCAAAAATGCAATTTGGGGGGTTTCGAAGATTTTGCGGGATGAACATGTGGAACTTTGCAGTTTTGCGAAGTGTACATATTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGTAAATATCCAATCTGCTATGCTGCTCCGTATAACTTCTCCTTACTATGTTTATATTAAGAAATATTTGGTTTCTTTCTTTATGAAATTTGGAATTTGATAAAATTTGTTGATTTCCTTGCGATGATGTAAAGAACCTTCTGGAAACCTTCTATTTTCTGCAATCGACCTTTACTTCCGCAAGCTACCTATCGTTCTTGAATCTTCCTAATCCCCACCCCTTCCAAAAGAAATGTACAAGCCATGAGAAAGTCCGAATACTGATAATTCAAAGTTTATATGCATACAAGTTTCAATAATTTTCTTGTTGAATAATCAGTTCATATTTTGAAAATCAAGCATTTCATTTGTAATTTTTCCTCTAACATATAGACCTTTATTACTTGTGGCAGGAAAATGTGTTCAGAGAATTCAGAGTTCTGAAAATTATTCAGCTCTCGCATGTGTTGATGATGAAATTGATAAGGGAATGGAGAAAAAGAAATTGGCCATCAACCTGGCCTCGCTTGTTGAAGAATCTCTTATTGTTGATCTCAAAAGACCAAAGACTCGAATGGAACTCAGGAGATCCCTTGAAATTCAGATTAAGAGGAGGGTGAAGGCCCAATATATGAATGGGAAGTTTATGGACTTGATGGGGAAAGTGATTGCCTGCCCCACAACTCTTCAGAATGCTTATGATTGCATTAGACTTAACTCAAATGTAGATATAGCATCGAATGATCATTTAATGTCGTTTGATTCTATGGCTGAAGAGCTTCGTAATGGTAGTTTTGATGTCAACACCAATACTTTCTCCATATCAAGTTCGAGAAAAGAAGTACTGATCTTACCGAAGCTAAAGTTGAAGGTTCTTCAGGAGGCCATTAGGATAGTTTTGGAGTGTGTTTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAGAAAGGAGATCGATAATCCCGATTGGTGGTTCACACTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAAATCATTACAGTAATGGAGGACAAGATAGAAGATCCTAATTTATTTGCTGTCATTAGAAGTATATTTGATGCTGGAGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCATGGTCTTCCACAAGAGGGCGTTCTGTCTCCTATATTAATGAACATCTATCTAAATCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATATGAAGCTATCAATAAATATGGTAATGCTGGTCGAGATGTGTCACAATCAAAGCTGCGGAGCTGGTTTAGAAGACATTTGAAAGGAAATGATTCTGAGTATCCAGGTGAGGAGAAAGATAACATAAGAGTATACTGTTGTCGCTATATGGATGAAATCTTTTTGGCGGTATCAGGTTCTAAAGATGTTGCTATTAGTTTTCGGTCTGAGATTC
TAGAATTCATACAGAAGTCTTTGCATTTGGATCTTAATCATCAAGGGGAAATGGTATCATGTGCGGAGACTCGCGGAATTCGTTTTCTTGGTTGTTTGGTCAGAAGAAGTATGAAGGAAAGTCCTGCTGTAAAAGCTGTCCACAAGTTGAAAGAAAAAGTTGAGTTGTTTGCTTTACAAAAGCAGGAGGCTTGGAATGCTTGGACAGTGTGGTTGGGGAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTCAAAGAGTCAGAGATCAAGCATTTAGCTAAAAATAGCCCTTCTTTGAATCAAATTTCGAGTTTTCGTAAAGCTGGGATGGAAACCGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATATAAACGCAAAGGCTGCAGACAGTGAAGAAACAATCTTATCTAAGCATGTAGTGGAACCTTCTCTTCCTTTGGAACTTAGAGACTCCTTCCATGAATTTCAAAGGTGTGTGGAAGAATATGTTTCATCCGAGACAGCTTCTACTATTGCTCTTTTACCAAATTATGACCCTTCTGTCAAATCTACTTTCATAACTGAGATTATAGCTCCAGTCGATTCTATCCGAAAACGACTATTGCGATATAGGTTAATCACGAATAAAGGGTATCCATGCCCCTCACCTTTCCTCATCTTACAAGATAACAACCAAATTATCGACTGGTTTTCAGGAGTATCACGGCGTTGGCTTCGATGGTACAGCAATTGTTCAAACTTCAGCGAGTTGATCTTAATATGCGATCAAGTTAGGAAATCTTGTATCCGAACACTAGCAGCAAAGCATCGAATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCGGAACTGAGTAGGATTTACTCCTCCCCTGAAATAGAGCAAGAAGAAGAGAAGACGACATCAGATACCCATGTTTTAGACCATGATGAGGCACTGATGTATGGAATTTCATACAGTGGTTTGTGTTTGCTCTCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTTGTCATGGGGTGTTTGGCTCCTGCGCCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAATACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAGACGACGATTCGGGTTATGCAAGCAACATTTGAAGGATCTGTATCTGGGCCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAATGAAGTTCTTCTTTTGTGTATGATCTCATTTAACCACTGGATTGCTGAGAGATCCAGGGTGGCAGCCACGACTGAGCCTTCGTGGCGCTTGGTGTCACAGGATCAAAACTCGATTATGAATTGGATTGTTGCACTGATGAAGAATATCTCAACTGCTCGGATGAATTGATGGTCACAAGTTTATTTGAGAGAAAGTGATAGTCACACCATATGCTTCAAGTTGAGAAGATTGGAACGAGGTAAATGTATGCTGTGTTCGGTCTGAGCTGATGGGATATTAATCAAACATGATTTGATTAGATACATGTCTTAAAAATCTCTTTTATATGTACAGTGTAAAAGTAATTTACTTTTATACGTATAAAAATGCAAAACTATCTTAACCTTTGTAAAATTGAGCCTAGCTTAGTGGTAATTTGCATGTGCCTTCGTACCAAGAGGTCAAAAGATTTGAACCCACGACTCTATATATTTTTCTTAGTAC

mRNA sequence

GAAAATCAAGTTCAGAAACACCCTGTCAAGCTATGTGGCGGTTGCCGTGATTGGAAAGGAATGTCGAATGAGGCGATGCAGTTGCACCCCTCTTCTATTTCTTTTTCCTCGACGAGCCACACACGAGGGATCCTATGGCGGTGTCTCGGCGGCGTCTTGACTGCGCGCAAGGCTGCAACAAGTGCTGGTCGTATCGTTCAACGTGCGGCATATTCAACATTTGAACTTTTCTACTCCGATGAAGCGTTTTCCGTTGGTTCCAGCAGCACCCACGCGTTTTCGGTAAGAAGCTTATCCAAAATTACCAGATCTACTGCTATGGTGGATCAAGTCTGGAAAAGTTCCAATGCGGCATGGGATTTGACACTTAGACGTAATCTAAATGACTTGGAGGTAAATGAATGGACTAGTCTCTCTCATATTCTTTCATCTGTTACGATACGATTAGTGAATGACTCATGGTGTCGGCCTCTTGATCAATCAAAGTGTTTTACTGCCAAGTCTCTCAAGTCTGATATGCTTACTTCCAGTGTTATTGAGGCGACCCCAGGCGCTCGCCTATGGCGAAAGGCGAGGTGCCCAGAGGGTTATGCGCCTCGTAGTTTCAACTTTTCATCTTCTTCGGCGGTGTTTTCATCTTCACGGCTTCTTCTGTTGGCTTTTCAAGTTTTCATCTTCACGCCTTCTTTTGGTTCTTCATCTTCACGGTTTCTTCTTCATCTCCACAGTTTCAGCTTCTCTAGTTCTCCTTCTCCATCTCCTTCAACCGATTTCTTCTTGCTGAGTCACCGTCCGACTTCACCACCACCGCCGTTTGCTGCCACTAGTCGGTTTGCTGCTACAGAAAGTCGCTTCTGTTTCTCTTCGTACCTTTGTCTCCGCCACTCTTCTGTGCCGTTGAAGCTCCGTCAAAGCTTCGATGTTTCTCTTCTCCTTCACACCACCGAAGCCAAACCTCACGCCGTTGCCCCCTCCATTGATGCTTTGATTCTCTCTATTGTTCCTCCCCTTCGACCTCCGTTATCAAAAGCAAAGAGGGAAGAAATCAATGCTCTGCCAAGCTACGATGCTTATGATTTGAACACAATTGATGAAGAGGATGAGATAGAAGTCATTGGTGGATCATGTTCAACATATGCAAAAATGCAATTTGGGGGGTTTCGAAGATTTTGCGGGATGAACATGTGGAACTTTGCAGTTTTGCGAAGTGTACATATTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAAAATGTGTTCAGAGAATTCAGAGTTCTGAAAATTATTCAGCTCTCGCATGTGTTGATGATGAAATTGATAAGGGAATGGAGAAAAAGAAATTGGCCATCAACCTGGCCTCGCTTGTTGAAGAATCTCTTATTGTTGATCTCAAAAGACCAAAGACTCGAATGGAACTCAGGAGATCCCTTGAAATTCAGATTAAGAGGAGGGTGAAGGCCCAATATATGAATGGGAAGTTTATGGACTTGATGGGGAAAGTGATTGCCTGCCCCACAACTCTTCAGAATGCTTATGATTGCATTAGACTTAACTCAAATGTAGATATAGCATCGAATGATCATTTAATGTCGTTTGATTCTATGGCTGAAGAGCTTCGTAATGGTAGTTTTGATGTCAACACCAATACTTTCTCCATATCAAGTTCGAGAAAAGAAGTACTGATCTTACCGAAGCTAAAGTTGAAGGTTCTTCAGGAGGCCATTAGGATAGTTTTGGAGTGTGTTTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAGAAAGGAGATCGATAATCCCGATTGGTGGTTCACACTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAAATCATTACAGTAATGGAGGACAAGATAGAAGATCCTAATTTATTTGCTGTCATTAGAAGTATATTTGATGCTGGAGCACTGAATTTGGAGTTTGGGGG
TTTCCCAAAAGGTCATGGTCTTCCACAAGAGGGCGTTCTGTCTCCTATATTAATGAACATCTATCTAAATCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATATGAAGCTATCAATAAATATGGTAATGCTGGTCGAGATGTGTCACAATCAAAGCTGCGGAGCTGGTTTAGAAGACATTTGAAAGGAAATGATTCTGAGTATCCAGGTGAGGAGAAAGATAACATAAGAGTATACTGTTGTCGCTATATGGATGAAATCTTTTTGGCGGTATCAGGTTCTAAAGATGTTGCTATTAGTTTTCGGTCTGAGATTCTAGAATTCATACAGAAGTCTTTGCATTTGGATCTTAATCATCAAGGGGAAATGGTATCATGTGCGGAGACTCGCGGAATTCGTTTTCTTGGTTGTTTGGTCAGAAGAAGTATGAAGGAAAGTCCTGCTGTAAAAGCTGTCCACAAGTTGAAAGAAAAAGTTGAGTTGTTTGCTTTACAAAAGCAGGAGGCTTGGAATGCTTGGACAGTGTGGTTGGGGAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTCAAAGAGTCAGAGATCAAGCATTTAGCTAAAAATAGCCCTTCTTTGAATCAAATTTCGAGTTTTCGTAAAGCTGGGATGGAAACCGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATATAAACGCAAAGGCTGCAGACAGTGAAGAAACAATCTTATCTAAGCATGTAGTGGAACCTTCTCTTCCTTTGGAACTTAGAGACTCCTTCCATGAATTTCAAAGGTGTGTGGAAGAATATGTTTCATCCGAGACAGCTTCTACTATTGCTCTTTTACCAAATTATGACCCTTCTGTCAAATCTACTTTCATAACTGAGATTATAGCTCCAGTCGATTCTATCCGAAAACGACTATTGCGATATAGGTTAATCACGAATAAAGGGTATCCATGCCCCTCACCTTTCCTCATCTTACAAGATAACAACCAAATTATCGACTGGTTTTCAGGAGTATCACGGCGTTGGCTTCGATGGTACAGCAATTGTTCAAACTTCAGCGAGTTGATCTTAATATGCGATCAAGTTAGGAAATCTTGTATCCGAACACTAGCAGCAAAGCATCGAATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCGGAACTGAGTAGGATTTACTCCTCCCCTGAAATAGAGCAAGAAGAAGAGAAGACGACATCAGATACCCATGTTTTAGACCATGATGAGGCACTGATGTATGGAATTTCATACAGTGGTTTGTGTTTGCTCTCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTTGTCATGGGGTGTTTGGCTCCTGCGCCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAATACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAGACGACGATTCGGGTTATGCAAGCAACATTTGAAGGATCTGTATCTGGGCCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAATGA

Coding sequence (CDS)

GAAAATCAAGTTCAGAAACACCCTGTCAAGCTATGTGGCGGTTGCCGTGATTGGAAAGGAATGTCGAATGAGGCGATGCAGTTGCACCCCTCTTCTATTTCTTTTTCCTCGACGAGCCACACACGAGGGATCCTATGGCGGTGTCTCGGCGGCGTCTTGACTGCGCGCAAGGCTGCAACAAGTGCTGGTCGTATCGTTCAACGTGCGGCATATTCAACATTTGAACTTTTCTACTCCGATGAAGCGTTTTCCGTTGGTTCCAGCAGCACCCACGCGTTTTCGGTAAGAAGCTTATCCAAAATTACCAGATCTACTGCTATGGTGGATCAAGTCTGGAAAAGTTCCAATGCGGCATGGGATTTGACACTTAGACGTAATCTAAATGACTTGGAGGTAAATGAATGGACTAGTCTCTCTCATATTCTTTCATCTGTTACGATACGATTAGTGAATGACTCATGGTGTCGGCCTCTTGATCAATCAAAGTGTTTTACTGCCAAGTCTCTCAAGTCTGATATGCTTACTTCCAGTGTTATTGAGGCGACCCCAGGCGCTCGCCTATGGCGAAAGGCGAGGTGCCCAGAGGGTTATGCGCCTCGTAGTTTCAACTTTTCATCTTCTTCGGCGGTGTTTTCATCTTCACGGCTTCTTCTGTTGGCTTTTCAAGTTTTCATCTTCACGCCTTCTTTTGGTTCTTCATCTTCACGGTTTCTTCTTCATCTCCACAGTTTCAGCTTCTCTAGTTCTCCTTCTCCATCTCCTTCAACCGATTTCTTCTTGCTGAGTCACCGTCCGACTTCACCACCACCGCCGTTTGCTGCCACTAGTCGGTTTGCTGCTACAGAAAGTCGCTTCTGTTTCTCTTCGTACCTTTGTCTCCGCCACTCTTCTGTGCCGTTGAAGCTCCGTCAAAGCTTCGATGTTTCTCTTCTCCTTCACACCACCGAAGCCAAACCTCACGCCGTTGCCCCCTCCATTGATGCTTTGATTCTCTCTATTGTTCCTCCCCTTCGACCTCCGTTATCAAAAGCAAAGAGGGAAGAAATCAATGCTCTGCCAAGCTACGATGCTTATGATTTGAACACAATTGATGAAGAGGATGAGATAGAAGTCATTGGTGGATCATGTTCAACATATGCAAAAATGCAATTTGGGGGGTTTCGAAGATTTTGCGGGATGAACATGTGGAACTTTGCAGTTTTGCGAAGTGTACATATTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAAAATGTGTTCAGAGAATTCAGAGTTCTGAAAATTATTCAGCTCTCGCATGTGTTGATGATGAAATTGATAAGGGAATGGAGAAAAAGAAATTGGCCATCAACCTGGCCTCGCTTGTTGAAGAATCTCTTATTGTTGATCTCAAAAGACCAAAGACTCGAATGGAACTCAGGAGATCCCTTGAAATTCAGATTAAGAGGAGGGTGAAGGCCCAATATATGAATGGGAAGTTTATGGACTTGATGGGGAAAGTGATTGCCTGCCCCACAACTCTTCAGAATGCTTATGATTGCATTAGACTTAACTCAAATGTAGATATAGCATCGAATGATCATTTAATGTCGTTTGATTCTATGGCTGAAGAGCTTCGTAATGGTAGTTTTGATGTCAACACCAATACTTTCTCCATATCAAGTTCGAGAAAAGAAGTACTGATCTTACCGAAGCTAAAGTTGAAGGTTCTTCAGGAGGCCATTAGGATAGTTTTGGAGTGTGTTTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAGAAAGGAGATCGATAATCCCGATTGGTGGTTCACACTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAAATCATTACAGTAATGGAGGACAAGATAGAAGATCCTAATTTATTTGCTGTCATTAGAAGTATATTTGATGCTGGAGCACTGAATTTGGAGTTTGGGGG
TTTCCCAAAAGGTCATGGTCTTCCACAAGAGGGCGTTCTGTCTCCTATATTAATGAACATCTATCTAAATCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATATGAAGCTATCAATAAATATGGTAATGCTGGTCGAGATGTGTCACAATCAAAGCTGCGGAGCTGGTTTAGAAGACATTTGAAAGGAAATGATTCTGAGTATCCAGGTGAGGAGAAAGATAACATAAGAGTATACTGTTGTCGCTATATGGATGAAATCTTTTTGGCGGTATCAGGTTCTAAAGATGTTGCTATTAGTTTTCGGTCTGAGATTCTAGAATTCATACAGAAGTCTTTGCATTTGGATCTTAATCATCAAGGGGAAATGGTATCATGTGCGGAGACTCGCGGAATTCGTTTTCTTGGTTGTTTGGTCAGAAGAAGTATGAAGGAAAGTCCTGCTGTAAAAGCTGTCCACAAGTTGAAAGAAAAAGTTGAGTTGTTTGCTTTACAAAAGCAGGAGGCTTGGAATGCTTGGACAGTGTGGTTGGGGAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTCAAAGAGTCAGAGATCAAGCATTTAGCTAAAAATAGCCCTTCTTTGAATCAAATTTCGAGTTTTCGTAAAGCTGGGATGGAAACCGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATATAAACGCAAAGGCTGCAGACAGTGAAGAAACAATCTTATCTAAGCATGTAGTGGAACCTTCTCTTCCTTTGGAACTTAGAGACTCCTTCCATGAATTTCAAAGGTGTGTGGAAGAATATGTTTCATCCGAGACAGCTTCTACTATTGCTCTTTTACCAAATTATGACCCTTCTGTCAAATCTACTTTCATAACTGAGATTATAGCTCCAGTCGATTCTATCCGAAAACGACTATTGCGATATAGGTTAATCACGAATAAAGGGTATCCATGCCCCTCACCTTTCCTCATCTTACAAGATAACAACCAAATTATCGACTGGTTTTCAGGAGTATCACGGCGTTGGCTTCGATGGTACAGCAATTGTTCAAACTTCAGCGAGTTGATCTTAATATGCGATCAAGTTAGGAAATCTTGTATCCGAACACTAGCAGCAAAGCATCGAATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCGGAACTGAGTAGGATTTACTCCTCCCCTGAAATAGAGCAAGAAGAAGAGAAGACGACATCAGATACCCATGTTTTAGACCATGATGAGGCACTGATGTATGGAATTTCATACAGTGGTTTGTGTTTGCTCTCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTTGTCATGGGGTGTTTGGCTCCTGCGCCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAATACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAGACGACGATTCGGGTTATGCAAGCAACATTTGAAGGATCTGTATCTGGGCCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAATGA

Protein sequence

ENQVQKHPVKLCGGCRDWKGMSNEAMQLHPSSISFSSTSHTRGILWRCLGGVLTARKAATSAGRIVQRAAYSTFELFYSDEAFSVGSSSTHAFSVRSLSKITRSTAMVDQVWKSSNAAWDLTLRRNLNDLEVNEWTSLSHILSSVTIRLVNDSWCRPLDQSKCFTAKSLKSDMLTSSVIEATPGARLWRKARCPEGYAPRSFNFSSSSAVFSSSRLLLLAFQVFIFTPSFGSSSSRFLLHLHSFSFSSSPSPSPSTDFFLLSHRPTSPPPPFAATSRFAATESRFCFSSYLCLRHSSVPLKLRQSFDVSLLLHTTEAKPHAVAPSIDALILSIVPPLRPPLSKAKREEINALPSYDAYDLNTIDEEDEIEVIGGSCSTYAKMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVDDEIDKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMGKVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFTLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEEKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGIRFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPSLPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRYRLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLCLLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFGLCKQHLKDLYLGHISLQSIDFGAWK
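
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a sketch under one assumption: that the standard genetic code applies (the page does not name a translation table). The function name `translate` and the compact codon-table construction are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0; '*' marks stop codons."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 nucleotides of the CDS listed above.
prefix = translate("GAAAATCAAGTTCAGAAACACCCTGTCAAG")
print(prefix)
```

The first ten codons of the listed CDS translate to ENQVQKHPVK, which matches the start of the protein sequence.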
Homology
BLAST of Spg029397 vs. NCBI nr
Match: XP_022146068.1 (nuclear intron maturase 4, mitochondrial isoform X2 [Momordica charantia])

HSP 1 Score: 1453.7 bits (3762), Expect = 0.0e+00
Identity = 719/805 (89.32%), Positives = 759/805 (94.29%), Query Frame = 0

Query: 381  KMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVDDEI 440
            +MQFGGF+RFC MNM NFAVLR   ICKVNSSFVSDIGKCVQR+Q+SENYSALAC DD+ 
Sbjct: 8    EMQFGGFQRFCRMNMRNFAVLR---ICKVNSSFVSDIGKCVQRVQTSENYSALACADDDF 67

Query: 441  DKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMG 500
             KGMEKKKLA NLASLVEESL VD +RPK+RMEL+RSLEIQIK+RVKAQY+NGKFMDLMG
Sbjct: 68   CKGMEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLMG 127

Query: 501  KVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKE 560
            KVIACP TLQNAYDC+R+NSNVDIASNDHL+SF+SMAEEL NGSFDVN NTFSISSS+KE
Sbjct: 128  KVIACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKKE 187

Query: 561  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFT 620
            VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI+NPDWWFT
Sbjct: 188  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWFT 247

Query: 621  LDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVL 680
            +D+SKKMDEL MAK+I+VMEDKIEDP  FA+IRSIF+AGALNLEFGGFPKGHGLPQEGVL
Sbjct: 248  VDISKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGVL 307

Query: 681  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEE 740
            SPILMNIYLNLFDQEFFRLSMKYEAINKYGNA +D SQSKLRSWFRR LKGNDSEYP +E
Sbjct: 308  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQE 367

Query: 741  KDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGI 800
            KDNIRVYCCRYMDEIF+AVSGSKDVA+SFRSEI +FIQKSLHLD+NHQ EMVSC ETRGI
Sbjct: 368  KDNIRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRGI 427

Query: 801  RFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKES 860
            RFLGCLVRRS KESPAVKAVHKLKEKVELFALQKQEAWN WTVWLGKKWLAHGLKKVKES
Sbjct: 428  RFLGCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKES 487

Query: 861  EIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPS 920
            EIKHLAKNSPSLNQISSFRK GMETDHWYKVLLKIWMQDINAKAA++EETILS +VVEPS
Sbjct: 488  EIKHLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEPS 547

Query: 921  LPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRY 980
            LPLELRDSF+EFQR VEEYVSSETAST+ALLPNYDPSVKSTFITEIIAPV+SIRKRLLRY
Sbjct: 548  LPLELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLRY 607

Query: 981  RLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIR 1040
            RLITNKGYPC SPFLIL DN QIIDWF GV RRWL+WYSNCSNFSE+ILICDQVRKSCIR
Sbjct: 608  RLITNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCIR 667

Query: 1041 TLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLC 1100
            TLAAKHR HESEIEKKFD ELSRI S+PEIEQEEE+  SDTH L HDEA  YGISYSGLC
Sbjct: 668  TLAAKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGLC 727

Query: 1101 LLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFG 1160
            LLSLARMVSQSRPCNCFVMGCLA APSVYTLHVMERQKFPGW TGFSSSIHPSLNRRR G
Sbjct: 728  LLSLARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRVG 787

Query: 1161 LCKQHLKDLYLGHISLQSIDFGAWK 1186
            LCKQHLKDLYLGHISLQS++FGAWK
Sbjct: 788  LCKQHLKDLYLGHISLQSVNFGAWK 809
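
The percentages in an HSP summary line are the match counts divided by the alignment length. The sketch below recomputes them from the summary of the hit above. The helper name `hsp_percentages` is hypothetical, and the regular expression only assumes the `label = matches/length` layout seen on this page.

```python
import re

def hsp_percentages(summary):
    """Return {label: (matches, length, percent)} for each 'X = a/b' field."""
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", summary):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats

line = "Identity = 719/805 (89.32%), Positives = 759/805 (94.29%), Query Frame = 0"
stats = hsp_percentages(line)
for label, (matches, length, percent) in stats.items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

Fields without a `matches/length` pair, such as Query Frame, are skipped by the pattern.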

BLAST of Spg029397 vs. NCBI nr
Match: XP_022146067.1 (nuclear intron maturase 4, mitochondrial isoform X1 [Momordica charantia])

HSP 1 Score: 1449.1 bits (3750), Expect = 0.0e+00
Identity = 719/806 (89.21%), Positives = 759/806 (94.17%), Query Frame = 0

Query: 381  KMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDI-GKCVQRIQSSENYSALACVDDE 440
            +MQFGGF+RFC MNM NFAVLR   ICKVNSSFVSDI GKCVQR+Q+SENYSALAC DD+
Sbjct: 8    EMQFGGFQRFCRMNMRNFAVLR---ICKVNSSFVSDIAGKCVQRVQTSENYSALACADDD 67

Query: 441  IDKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLM 500
              KGMEKKKLA NLASLVEESL VD +RPK+RMEL+RSLEIQIK+RVKAQY+NGKFMDLM
Sbjct: 68   FCKGMEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLM 127

Query: 501  GKVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRK 560
            GKVIACP TLQNAYDC+R+NSNVDIASNDHL+SF+SMAEEL NGSFDVN NTFSISSS+K
Sbjct: 128  GKVIACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKK 187

Query: 561  EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWF 620
            EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI+NPDWWF
Sbjct: 188  EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWF 247

Query: 621  TLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGV 680
            T+D+SKKMDEL MAK+I+VMEDKIEDP  FA+IRSIF+AGALNLEFGGFPKGHGLPQEGV
Sbjct: 248  TVDISKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGV 307

Query: 681  LSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGE 740
            LSPILMNIYLNLFDQEFFRLSMKYEAINKYGNA +D SQSKLRSWFRR LKGNDSEYP +
Sbjct: 308  LSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQ 367

Query: 741  EKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRG 800
            EKDNIRVYCCRYMDEIF+AVSGSKDVA+SFRSEI +FIQKSLHLD+NHQ EMVSC ETRG
Sbjct: 368  EKDNIRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRG 427

Query: 801  IRFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKE 860
            IRFLGCLVRRS KESPAVKAVHKLKEKVELFALQKQEAWN WTVWLGKKWLAHGLKKVKE
Sbjct: 428  IRFLGCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKE 487

Query: 861  SEIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEP 920
            SEIKHLAKNSPSLNQISSFRK GMETDHWYKVLLKIWMQDINAKAA++EETILS +VVEP
Sbjct: 488  SEIKHLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEP 547

Query: 921  SLPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLR 980
            SLPLELRDSF+EFQR VEEYVSSETAST+ALLPNYDPSVKSTFITEIIAPV+SIRKRLLR
Sbjct: 548  SLPLELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLR 607

Query: 981  YRLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCI 1040
            YRLITNKGYPC SPFLIL DN QIIDWF GV RRWL+WYSNCSNFSE+ILICDQVRKSCI
Sbjct: 608  YRLITNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCI 667

Query: 1041 RTLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGL 1100
            RTLAAKHR HESEIEKKFD ELSRI S+PEIEQEEE+  SDTH L HDEA  YGISYSGL
Sbjct: 668  RTLAAKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGL 727

Query: 1101 CLLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRF 1160
            CLLSLARMVSQSRPCNCFVMGCLA APSVYTLHVMERQKFPGW TGFSSSIHPSLNRRR 
Sbjct: 728  CLLSLARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRV 787

Query: 1161 GLCKQHLKDLYLGHISLQSIDFGAWK 1186
            GLCKQHLKDLYLGHISLQS++FGAWK
Sbjct: 788  GLCKQHLKDLYLGHISLQSVNFGAWK 810

BLAST of Spg029397 vs. NCBI nr
Match: XP_022146069.1 (nuclear intron maturase 4, mitochondrial isoform X3 [Momordica charantia])

HSP 1 Score: 1448.3 bits (3748), Expect = 0.0e+00
Identity = 719/805 (89.32%), Positives = 758/805 (94.16%), Query Frame = 0

Query: 382  MQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDI-GKCVQRIQSSENYSALACVDDEI 441
            MQFGGF+RFC MNM NFAVLR   ICKVNSSFVSDI GKCVQR+Q+SENYSALAC DD+ 
Sbjct: 1    MQFGGFQRFCRMNMRNFAVLR---ICKVNSSFVSDIAGKCVQRVQTSENYSALACADDDF 60

Query: 442  DKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMG 501
             KGMEKKKLA NLASLVEESL VD +RPK+RMEL+RSLEIQIK+RVKAQY+NGKFMDLMG
Sbjct: 61   CKGMEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLMG 120

Query: 502  KVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKE 561
            KVIACP TLQNAYDC+R+NSNVDIASNDHL+SF+SMAEEL NGSFDVN NTFSISSS+KE
Sbjct: 121  KVIACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKKE 180

Query: 562  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFT 621
            VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI+NPDWWFT
Sbjct: 181  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWFT 240

Query: 622  LDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVL 681
            +D+SKKMDEL MAK+I+VMEDKIEDP  FA+IRSIF+AGALNLEFGGFPKGHGLPQEGVL
Sbjct: 241  VDISKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGVL 300

Query: 682  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEE 741
            SPILMNIYLNLFDQEFFRLSMKYEAINKYGNA +D SQSKLRSWFRR LKGNDSEYP +E
Sbjct: 301  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQE 360

Query: 742  KDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGI 801
            KDNIRVYCCRYMDEIF+AVSGSKDVA+SFRSEI +FIQKSLHLD+NHQ EMVSC ETRGI
Sbjct: 361  KDNIRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRGI 420

Query: 802  RFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKES 861
            RFLGCLVRRS KESPAVKAVHKLKEKVELFALQKQEAWN WTVWLGKKWLAHGLKKVKES
Sbjct: 421  RFLGCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKES 480

Query: 862  EIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPS 921
            EIKHLAKNSPSLNQISSFRK GMETDHWYKVLLKIWMQDINAKAA++EETILS +VVEPS
Sbjct: 481  EIKHLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEPS 540

Query: 922  LPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRY 981
            LPLELRDSF+EFQR VEEYVSSETAST+ALLPNYDPSVKSTFITEIIAPV+SIRKRLLRY
Sbjct: 541  LPLELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLRY 600

Query: 982  RLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIR 1041
            RLITNKGYPC SPFLIL DN QIIDWF GV RRWL+WYSNCSNFSE+ILICDQVRKSCIR
Sbjct: 601  RLITNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCIR 660

Query: 1042 TLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLC 1101
            TLAAKHR HESEIEKKFD ELSRI S+PEIEQEEE+  SDTH L HDEA  YGISYSGLC
Sbjct: 661  TLAAKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGLC 720

Query: 1102 LLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFG 1161
            LLSLARMVSQSRPCNCFVMGCLA APSVYTLHVMERQKFPGW TGFSSSIHPSLNRRR G
Sbjct: 721  LLSLARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRVG 780

Query: 1162 LCKQHLKDLYLGHISLQSIDFGAWK 1186
            LCKQHLKDLYLGHISLQS++FGAWK
Sbjct: 781  LCKQHLKDLYLGHISLQSVNFGAWK 802
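
The percentages on the Identity/Positives line above are the counts divided by the alignment length (for this hit, 719 of 805 aligned columns are identical). Below is a minimal sketch of reading such a line and re-deriving the percentages. The function names parse_hsp_stats and percent are hypothetical helpers for illustration and are not part of any BLAST tool.

```python
import re

# Matches the statistics line of an HSP block, e.g.
# "Identity = 719/805 (89.32%), Positives = 758/805 (94.16%), Query Frame = 0".
# Some report exports misspell the second field as "Postives"; accept both.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_stats(line):
    """Extract (count, alignment length, reported percent) for both fields."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity": (int(ident), int(ident_len), float(ident_pct)),
        "positives": (int(pos), int(pos_len), float(pos_pct)),
    }


# Statistics line copied from the XP_022146069.1 hit above.
stats = parse_hsp_stats(
    "Identity = 719/805 (89.32%), Positives = 758/805 (94.16%), Query Frame = 0"
)
ident_count, ident_len, ident_reported = stats["identity"]
pos_count, pos_len, pos_reported = stats["positives"]

# The recomputed values agree with the percentages printed in the report.
print(percent(ident_count, ident_len), ident_reported)
print(percent(pos_count, pos_len), pos_reported)
```

The same check can be applied to any other hit on this page, since every HSP block carries a statistics line in this layout.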

BLAST of Spg029397 vs. NCBI nr
Match: XP_038882003.1 (nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida])

HSP 1 Score: 1426.8 bits (3692), Expect = 0.0e+00
Identity = 699/808 (86.51%), Positives = 756/808 (93.56%), Query Frame = 0

Query: 378  TYAKMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVD 437
            +YAKMQFGG +RFC +NM N   L  V++CKV+SS VS IGK VQR+Q+SENYS L C D
Sbjct: 31   SYAKMQFGGLQRFCRINMRNLTNLLCVNVCKVDSSVVSVIGKSVQRVQNSENYSTLTCAD 90

Query: 438  DEIDKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMD 497
            DEIDKGMEK KLA+NLASLVEESL VDLKR KT+MEL+RSLEIQIK RVKAQY+NGKF+D
Sbjct: 91   DEIDKGMEKMKLAMNLASLVEESLDVDLKRSKTQMELKRSLEIQIKERVKAQYLNGKFLD 150

Query: 498  LMGKVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSS 557
            LMGKVIACPTTLQNAYDC+R+NSNVDI SND L+SF+SMAEEL NG+FDVN NTFSI SS
Sbjct: 151  LMGKVIACPTTLQNAYDCVRINSNVDIMSNDCLISFESMAEELSNGNFDVNANTFSILSS 210

Query: 558  RKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDW 617
            RKEVL+LPK++LKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI NPDW
Sbjct: 211  RKEVLVLPKIELKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIKNPDW 270

Query: 618  WFTLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQE 677
            WFT+DLSKKMDELVMAK+ITVMEDKIEDP LFAVIRSI+ AGALNLEFGGFPKGHGLPQE
Sbjct: 271  WFTIDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYVAGALNLEFGGFPKGHGLPQE 330

Query: 678  GVLSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYP 737
            G+LSPIL NIYLNLFDQEFFRLSMKYEAIN+YGN G+D SQS+LRSWFRR LKGN S+YP
Sbjct: 331  GILSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYP 390

Query: 738  GEEKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAET 797
            GE+KD IRVYCCRYMDEIFLAVSGSKDVA+SFRSEI  F+QK+LHLD+NHQ EMVSC ET
Sbjct: 391  GEQKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFYFLQKTLHLDVNHQEEMVSCGET 450

Query: 798  RGIRFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKV 857
             GIRFLGCLVRRS++ESPAVK+VHKLK+KVELFALQKQE WNAWTVWLGKKWLAHGLKKV
Sbjct: 451  HGIRFLGCLVRRSVQESPAVKSVHKLKKKVELFALQKQETWNAWTVWLGKKWLAHGLKKV 510

Query: 858  KESEIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVV 917
            KESEIKHLAKNS SLNQISSFRKAGMETDHWYKVLLKIWMQD+NA+AA+SEE ILSKH V
Sbjct: 511  KESEIKHLAKNS-SLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAV 570

Query: 918  EPSLPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRL 977
            EPSLPLELRDSF+EFQRCV+EY+S+ETAST+ALLPNYDPSVK TFITEIIAPV+SIRKRL
Sbjct: 571  EPSLPLELRDSFYEFQRCVQEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRL 630

Query: 978  LRYRLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKS 1037
            LRYRL+TNKG+PC SPFLILQDN QIIDWF GVSRRW RWY+NCSNFSELILICD VRKS
Sbjct: 631  LRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDLVRKS 690

Query: 1038 CIRTLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYS 1097
            CIRTLAAKHRIHESEIEKKFDSELS++YSSPEIEQEEEK + DTH LDHDEAL YGISYS
Sbjct: 691  CIRTLAAKHRIHESEIEKKFDSELSKMYSSPEIEQEEEK-SPDTHGLDHDEALKYGISYS 750

Query: 1098 GLCLLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRR 1157
            GLCLLSLARMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGW TGFSSSIHPSLN+R
Sbjct: 751  GLCLLSLARMVSQSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKR 810

Query: 1158 RFGLCKQHLKDLYLGHISLQSIDFGAWK 1186
            RFGLCK+HL+DLYLGHISLQSIDFGAWK
Sbjct: 811  RFGLCKKHLEDLYLGHISLQSIDFGAWK 836

BLAST of Spg029397 vs. NCBI nr
Match: XP_038882001.1 (nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida] >XP_038882002.1 nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida])

HSP 1 Score: 1426.0 bits (3690), Expect = 0.0e+00
Identity = 699/807 (86.62%), Positives = 755/807 (93.56%), Query Frame = 0

Query: 379  YAKMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVDD 438
            YAKMQFGG +RFC +NM N   L  V++CKV+SS VS IGK VQR+Q+SENYS L C DD
Sbjct: 78   YAKMQFGGLQRFCRINMRNLTNLLCVNVCKVDSSVVSVIGKSVQRVQNSENYSTLTCADD 137

Query: 439  EIDKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDL 498
            EIDKGMEK KLA+NLASLVEESL VDLKR KT+MEL+RSLEIQIK RVKAQY+NGKF+DL
Sbjct: 138  EIDKGMEKMKLAMNLASLVEESLDVDLKRSKTQMELKRSLEIQIKERVKAQYLNGKFLDL 197

Query: 499  MGKVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSR 558
            MGKVIACPTTLQNAYDC+R+NSNVDI SND L+SF+SMAEEL NG+FDVN NTFSI SSR
Sbjct: 198  MGKVIACPTTLQNAYDCVRINSNVDIMSNDCLISFESMAEELSNGNFDVNANTFSILSSR 257

Query: 559  KEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWW 618
            KEVL+LPK++LKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI NPDWW
Sbjct: 258  KEVLVLPKIELKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIKNPDWW 317

Query: 619  FTLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEG 678
            FT+DLSKKMDELVMAK+ITVMEDKIEDP LFAVIRSI+ AGALNLEFGGFPKGHGLPQEG
Sbjct: 318  FTIDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYVAGALNLEFGGFPKGHGLPQEG 377

Query: 679  VLSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPG 738
            +LSPIL NIYLNLFDQEFFRLSMKYEAIN+YGN G+D SQS+LRSWFRR LKGN S+YPG
Sbjct: 378  ILSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPG 437

Query: 739  EEKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETR 798
            E+KD IRVYCCRYMDEIFLAVSGSKDVA+SFRSEI  F+QK+LHLD+NHQ EMVSC ET 
Sbjct: 438  EQKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFYFLQKTLHLDVNHQEEMVSCGETH 497

Query: 799  GIRFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVK 858
            GIRFLGCLVRRS++ESPAVK+VHKLK+KVELFALQKQE WNAWTVWLGKKWLAHGLKKVK
Sbjct: 498  GIRFLGCLVRRSVQESPAVKSVHKLKKKVELFALQKQETWNAWTVWLGKKWLAHGLKKVK 557

Query: 859  ESEIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVE 918
            ESEIKHLAKNS SLNQISSFRKAGMETDHWYKVLLKIWMQD+NA+AA+SEE ILSKH VE
Sbjct: 558  ESEIKHLAKNS-SLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE 617

Query: 919  PSLPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLL 978
            PSLPLELRDSF+EFQRCV+EY+S+ETAST+ALLPNYDPSVK TFITEIIAPV+SIRKRLL
Sbjct: 618  PSLPLELRDSFYEFQRCVQEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLL 677

Query: 979  RYRLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSC 1038
            RYRL+TNKG+PC SPFLILQDN QIIDWF GVSRRW RWY+NCSNFSELILICD VRKSC
Sbjct: 678  RYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDLVRKSC 737

Query: 1039 IRTLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSG 1098
            IRTLAAKHRIHESEIEKKFDSELS++YSSPEIEQEEEK + DTH LDHDEAL YGISYSG
Sbjct: 738  IRTLAAKHRIHESEIEKKFDSELSKMYSSPEIEQEEEK-SPDTHGLDHDEALKYGISYSG 797

Query: 1099 LCLLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRR 1158
            LCLLSLARMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGW TGFSSSIHPSLN+RR
Sbjct: 798  LCLLSLARMVSQSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRR 857

Query: 1159 FGLCKQHLKDLYLGHISLQSIDFGAWK 1186
            FGLCK+HL+DLYLGHISLQSIDFGAWK
Sbjct: 858  FGLCKKHLEDLYLGHISLQSIDFGAWK 882

BLAST of Spg029397 vs. ExPASy Swiss-Prot
Match: Q9CA78 (Nuclear intron maturase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT4 PE=3 SV=2)

HSP 1 Score: 879.4 bits (2271), Expect = 4.6e-254
Identity = 449/743 (60.43%), Positives = 557/743 (74.97%), Query Frame = 0

Query: 449  LAINLASLVEESL--IVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMGKVIACP 508
            LA  LASLVEES   + D  +P++RMEL+RSLE+++K+RVK Q +NGKF DL+ KVIA P
Sbjct: 56   LAGELASLVEESSSHVDDDSKPRSRMELKRSLELRLKKRVKEQCINGKFSDLLKKVIARP 115

Query: 509  TTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSI--SSSRKEVLIL 568
             TL++AYDCIRLNSNV I   +  ++FDS+AEEL +G FDV +NTFSI      KEVL+L
Sbjct: 116  ETLRDAYDCIRLNSNVSITERNGSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVL 175

Query: 569  PKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFTLDLS 628
            P + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FTL L+
Sbjct: 176  PSVALKVVQEAIRIVLEVVFSPHFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLN 235

Query: 629  KKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVLSPIL 688
            KK+D  V   +++VME+K+ED +L  ++RS+F+A  LNLEFGGFPKGHGLPQEGVLS +L
Sbjct: 236  KKLDVSVFENLLSVMEEKVEDSSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVL 295

Query: 689  MNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEEKDNI 748
            MNIYL+ FD EF+R+SM++EA+        D   SKLRSWFRR       +   E+   +
Sbjct: 296  MNIYLDRFDHEFYRISMRHEALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVAL 355

Query: 749  RVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGIRFLG 808
            RVYCCR+MDEI+ +VSG K VA   RSE + F++ SLHLD+  + +   C  T G+R LG
Sbjct: 356  RVYCCRFMDEIYFSVSGPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG 415

Query: 809  CLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKH 868
             LVR++++ESP VKAVHKLKEKV LFALQK+EAW   TV +GKKWL HGLKKVKESEIK 
Sbjct: 416  TLVRKNVRESPTVKAVHKLKEKVRLFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKG 475

Query: 869  LAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAAD-SEETILSKHVVEPSLPL 928
            LA ++ +L+QIS  RKAGMETDHWYK+LL+IWM+D+   +AD SEE +LSKHVVEP++P 
Sbjct: 476  LADSNSTLSQISCHRKAGMETDHWYKILLRIWMEDVLRTSADRSEEFVLSKHVVEPTVPQ 535

Query: 929  ELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRYRLI 988
            ELRD+F++FQ     YVSSETA+  ALLP      +  F  +++AP ++I +RL RY LI
Sbjct: 536  ELRDAFYKFQNAAAAYVSSETANLEALLPCPQSHDRPVFFGDVVAPTNAIGRRLYRYGLI 595

Query: 989  TNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSEL-ILICDQVRKSCIRTL 1048
            T KGY   +  LIL D  QIIDW+SG+ RRW+ WY  CSNF E+  LI +Q+R SCIRTL
Sbjct: 596  TAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIKALIDNQIRMSCIRTL 655

Query: 1049 AAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLCLL 1108
            AAK+RIHE+EIEK+ D ELS I S+ +IEQE +    D+   D DE L YG+S SGLCLL
Sbjct: 656  AAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRDEHLTYGLSNSGLCLL 715

Query: 1109 SLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFGLC 1168
            SLAR+VS+SRPCNCFV+GC   AP+VYTLH MERQKFPGW TGFS  I  SLN RR GLC
Sbjct: 716  SLARLVSESRPCNCFVIGCSMAAPAVYTLHAMERQKFPGWKTGFSVCIPSSLNGRRIGLC 775

Query: 1169 KQHLKDLYLGHISLQSIDFGAWK 1186
            KQHLKDLY+G ISLQ++DFGAW+
Sbjct: 776  KQHLKDLYIGQISLQAVDFGAWR 798

BLAST of Spg029397 vs. ExPASy Swiss-Prot
Match: Q9LZA5 (Nuclear intron maturase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT3 PE=3 SV=2)

HSP 1 Score: 227.6 bits (579), Expect = 7.2e-58
Identity = 196/749 (26.17%), Positives = 328/749 (43.79%), Query Frame = 0

Query: 481  QIKRRVKAQYMNGKFMDLMGKVIACPTTLQNAYDCIRL--NSNVDIASN-DHLMSFDSMA 540
            +++  V  QY +GKF  L+   ++ P  L  A   + L  NS+ D+A       S + M 
Sbjct: 48   ELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNLSLSANSSGDLADRVSRRFSIEEMG 107

Query: 541  EELRNGSFDVNTNTFSISSSRKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRS 600
             E+R G FD+ +      SS    L+LP LKLKVL EAIR+VLE V+   F+  S+G R 
Sbjct: 108  REIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIEAIRMVLEIVYDDRFATFSYGGRV 167

Query: 601  GRGHSTALKYIRKEIDNPDWWFTLDLSKKMDELVMAKIIT-VMEDKIEDPNLFAVIRSIF 660
            G G  TA++Y++  ++NP WWF +  +++M E     I+   + +KI D  L  +I+ +F
Sbjct: 168  GMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVDILCGFVGEKINDVMLIEMIKKLF 227

Query: 661  DAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDV 720
            + G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N     G + 
Sbjct: 228  EFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLDKEIQDLRLKMKVKNPRVGTGDEE 287

Query: 721  SQSKLRSWFRRHLKGNDSEYPGEEKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEF 780
            S             GN    P      + +Y  RY+DEI +  SGSK + +  +  I++ 
Sbjct: 288  S------------TGNVFFKP------VNIYAVRYLDEILVITSGSKMLTMDLKKRIVDI 347

Query: 781  IQKSLHLDLNHQGEMVSCAETRGIRFLGCL-------VRRSMKESPAVKAVHKLKEKVEL 840
            +++ L L ++     +  A +  I FLG         V R  K   AV+A+ K + + ++
Sbjct: 348  LEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKYQRQKDV 407

Query: 841  FALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSPSLNQISSFRKAGMETDHWY 900
              L+ + A       LG K   H LKK+K+S               + F+  G E ++  
Sbjct: 408  RKLELRNARERNRKTLGLKIFRHVLKKIKQS---------------NGFKFEG-EIENEV 467

Query: 901  KVLLKIW----MQDINAKAAD--------SEETILSKHVVEPSLPLELRDSFHEFQRCVE 960
            + + + W    MQD      +        +    LS   +   LP +L D++ EFQ  V+
Sbjct: 468  RDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQVD 527

Query: 961  EYVSSETASTIALLPNYDPSVK---------------STFITEIIAPVDSIRKRLLRYRL 1020
            ++++   A  +  L + +  V+               +    ++ AP + +RK +     
Sbjct: 528  KHLAPTQAKKV--LEDEERRVEEEEEQRYAERTVEDLTKLCMKVSAPEELVRKAIKLVGF 587

Query: 1021 ITNKGYPCPSPFLILQDNNQIIDWFS-----GVSRRWLRWYSNCSNFSELILICDQVRKS 1080
              + G P P   L+  +++ II W++     G +++ +R Y+     S+L          
Sbjct: 588  TNSMGRPRPIIHLVTLEDSDIIKWYARHEKHGSTKKLIRHYTKDLRVSDL---------- 647

Query: 1081 CIRTLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYS 1140
                        +   E  F SE        E++   +K  SD   +D            
Sbjct: 648  ------------DGREEAHFPSE-------REVKMMGDKNLSDPKPVD------------ 707

Query: 1141 GLCLLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKF------PGWNTGFSSSIH 1181
            G   L L R+ S     +C    C      ++ +H+++ +          W  G   +IH
Sbjct: 708  GTLSLLLIRLASDEPLHHCAASFCERSDTIMHRVHLLQNRLHINPLDEEKWVPGM-GTIH 715

BLAST of Spg029397 vs. ExPASy Swiss-Prot
Match: P0A3U0 (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris OX=1359 GN=ltrA PE=1 SV=1)

HSP 1 Score: 106.7 bits (265), Expect = 1.9e-21
Identity = 83/291 (28.52%), Positives = 146/291 (50.17%), Query Frame = 0

Query: 555 SSSRKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDN 614
           +S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK I++E   
Sbjct: 93  NSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGG 152

Query: 615 PDWWFTLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGH-G 674
             W+   D+    D +    +I ++  KI+D  +  +I     AG   LE   + K + G
Sbjct: 153 ARWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSG 212

Query: 675 LPQEGVLSPILMNIYLNLFDQEFFRLSMKY--EAINKYGNAGRDV-SQSKLRSWFRRHLK 734
            PQ G+LSP+L NIYL+  D+   +L MK+  E+  +     R++ ++ K  S   + L+
Sbjct: 213 TPQGGILSPLLANIYLHELDKFVLQLKMKFDRESPERITPEYRELHNEIKRISHRLKKLE 272

Query: 735 GNDS-----EYPGEEKDNIRVYC----------CRYMDEIFLAVSGSKDVAISFRSEILE 794
           G +      EY  + K    + C           RY D+  ++V GSK+     + ++  
Sbjct: 273 GEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDCQWIKEQLKL 332

Query: 795 FIQKSLHLDLNHQGEMVSCAETRGIRFLGCLVRRSMKESPAVKAVHKLKEK 827
           FI   L ++L+ +  +++   ++  RFLG  +R  ++ S  +K   K+K++
Sbjct: 333 FIHNKLKMELSEEKTLIT-HSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378

BLAST of Spg029397 vs. ExPASy Swiss-Prot
Match: P0A3U1 (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (strain MG1363) OX=416870 GN=ltrA PE=1 SV=1)

HSP 1 Score: 106.7 bits (265), Expect = 1.9e-21
Identity = 83/291 (28.52%), Positives = 146/291 (50.17%), Query Frame = 0

Query: 555 SSSRKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDN 614
           +S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK I++E   
Sbjct: 93  NSKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGG 152

Query: 615 PDWWFTLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGH-G 674
             W+   D+    D +    +I ++  KI+D  +  +I     AG   LE   + K + G
Sbjct: 153 ARWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSG 212

Query: 675 LPQEGVLSPILMNIYLNLFDQEFFRLSMKY--EAINKYGNAGRDV-SQSKLRSWFRRHLK 734
            PQ G+LSP+L NIYL+  D+   +L MK+  E+  +     R++ ++ K  S   + L+
Sbjct: 213 TPQGGILSPLLANIYLHELDKFVLQLKMKFDRESPERITPEYRELHNEIKRISHRLKKLE 272

Query: 735 GNDS-----EYPGEEKDNIRVYC----------CRYMDEIFLAVSGSKDVAISFRSEILE 794
           G +      EY  + K    + C           RY D+  ++V GSK+     + ++  
Sbjct: 273 GEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDCQWIKEQLKL 332

Query: 795 FIQKSLHLDLNHQGEMVSCAETRGIRFLGCLVRRSMKESPAVKAVHKLKEK 827
           FI   L ++L+ +  +++   ++  RFLG  +R  ++ S  +K   K+K++
Sbjct: 333 FIHNKLKMELSEEKTLIT-HSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378

BLAST of Spg029397 vs. ExPASy Swiss-Prot
Match: P03876 (Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=AI2 PE=4 SV=2)

HSP 1 Score: 102.8 bits (255), Expect = 2.7e-20
Identity = 93/367 (25.34%), Positives = 171/367 (46.59%), Query Frame = 0

Query: 487 KAQYMNGKFMDLMGKVIACPTTLQNAYDCIR-LNSNVDIASNDHLMSFDSM-AEELRNGS 546
           K + +N + + LM  +      L  AY+ I+    N+   SN+  ++ D +    L   S
Sbjct: 277 KTETINTRILKLMSDI----RMLLIAYNKIKSKKGNMSKGSNN--ITLDGINISYLNKLS 336

Query: 547 FDVNTNTFSISSSRK----------EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHG 606
            D+NTN F  S  R+            L +   + K++QE++R++LE ++   FS  SHG
Sbjct: 337 KDINTNMFKFSPVRRVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSFSYYSHG 396

Query: 607 CRSGRGHSTALKYIRKEIDNPDWWFTLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRS 666
            R      TA+   +  +   +W+  +DL+K  D +    +I V+ ++I+D     ++  
Sbjct: 397 FRPNLSCLTAIIQCKNYMQYCNWFIKVDLNKCFDTIPHNMLINVLNERIKDKGFMDLLYK 456

Query: 667 IFDAGALNLEFGGFPKGHGLPQEGVLSPILMNIYL--------NLFDQEFFRLSMKYEAI 726
           +  AG ++          G+PQ  V+SPIL NI+L        N F+ EF   +M     
Sbjct: 457 LLRAGYVDKNNNYHNTTLGIPQGSVVSPILCNIFLDKLDKYLENKFENEFNTGNMSNRGR 516

Query: 727 NK-YGNAGRDVSQSKLRSWFRR--HLKGNDSEYPGEEKDNIRVYCCRYMDEIFLAVSGSK 786
           N  Y +    + + KL S   +   L+ +     G +K   R Y  RY D+I + V GS 
Sbjct: 517 NPIYNSLSSKIYRCKLLSEKLKLIRLRDHYQRNMGSDKSFKRAYFVRYADDIIIGVMGSH 576

Query: 787 DVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGIRFLGCLVRRSMKESPAVKAVHKL 831
           +   +  ++I  F++++L + +N    ++  ++  G+ FLG      +K +P  K  +++
Sbjct: 577 NDCKNILNDINNFLKENLGMSINMDKSVIKHSK-EGVSFLG----YDVKVTPWEKRPYRM 632

BLAST of Spg029397 vs. ExPASy TrEMBL
Match: A0A6J1CXL0 (nuclear intron maturase 4, mitochondrial isoform X2 OS=Momordica charantia OX=3673 GN=LOC111015360 PE=4 SV=1)

HSP 1 Score: 1453.7 bits (3762), Expect = 0.0e+00
Identity = 719/805 (89.32%), Positives = 759/805 (94.29%), Query Frame = 0

Query: 381  KMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVDDEI 440
            +MQFGGF+RFC MNM NFAVLR   ICKVNSSFVSDIGKCVQR+Q+SENYSALAC DD+ 
Sbjct: 8    EMQFGGFQRFCRMNMRNFAVLR---ICKVNSSFVSDIGKCVQRVQTSENYSALACADDDF 67

Query: 441  DKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMG 500
             KGMEKKKLA NLASLVEESL VD +RPK+RMEL+RSLEIQIK+RVKAQY+NGKFMDLMG
Sbjct: 68   CKGMEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLMG 127

Query: 501  KVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKE 560
            KVIACP TLQNAYDC+R+NSNVDIASNDHL+SF+SMAEEL NGSFDVN NTFSISSS+KE
Sbjct: 128  KVIACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKKE 187

Query: 561  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFT 620
            VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI+NPDWWFT
Sbjct: 188  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWFT 247

Query: 621  LDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVL 680
            +D+SKKMDEL MAK+I+VMEDKIEDP  FA+IRSIF+AGALNLEFGGFPKGHGLPQEGVL
Sbjct: 248  VDISKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGVL 307

Query: 681  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEE 740
            SPILMNIYLNLFDQEFFRLSMKYEAINKYGNA +D SQSKLRSWFRR LKGNDSEYP +E
Sbjct: 308  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQE 367

Query: 741  KDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGI 800
            KDNIRVYCCRYMDEIF+AVSGSKDVA+SFRSEI +FIQKSLHLD+NHQ EMVSC ETRGI
Sbjct: 368  KDNIRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRGI 427

Query: 801  RFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKES 860
            RFLGCLVRRS KESPAVKAVHKLKEKVELFALQKQEAWN WTVWLGKKWLAHGLKKVKES
Sbjct: 428  RFLGCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKES 487

Query: 861  EIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPS 920
            EIKHLAKNSPSLNQISSFRK GMETDHWYKVLLKIWMQDINAKAA++EETILS +VVEPS
Sbjct: 488  EIKHLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEPS 547

Query: 921  LPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRY 980
            LPLELRDSF+EFQR VEEYVSSETAST+ALLPNYDPSVKSTFITEIIAPV+SIRKRLLRY
Sbjct: 548  LPLELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLRY 607

Query: 981  RLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIR 1040
            RLITNKGYPC SPFLIL DN QIIDWF GV RRWL+WYSNCSNFSE+ILICDQVRKSCIR
Sbjct: 608  RLITNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCIR 667

Query: 1041 TLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLC 1100
            TLAAKHR HESEIEKKFD ELSRI S+PEIEQEEE+  SDTH L HDEA  YGISYSGLC
Sbjct: 668  TLAAKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGLC 727

Query: 1101 LLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFG 1160
            LLSLARMVSQSRPCNCFVMGCLA APSVYTLHVMERQKFPGW TGFSSSIHPSLNRRR G
Sbjct: 728  LLSLARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRVG 787

Query: 1161 LCKQHLKDLYLGHISLQSIDFGAWK 1186
            LCKQHLKDLYLGHISLQS++FGAWK
Sbjct: 788  LCKQHLKDLYLGHISLQSVNFGAWK 809

BLAST of Spg029397 vs. ExPASy TrEMBL
Match: A0A6J1CYJ7 (nuclear intron maturase 4, mitochondrial isoform X1 OS=Momordica charantia OX=3673 GN=LOC111015360 PE=4 SV=1)

HSP 1 Score: 1449.1 bits (3750), Expect = 0.0e+00
Identity = 719/806 (89.21%), Positives = 759/806 (94.17%), Query Frame = 0

Query: 381  KMQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDI-GKCVQRIQSSENYSALACVDDE 440
            +MQFGGF+RFC MNM NFAVLR   ICKVNSSFVSDI GKCVQR+Q+SENYSALAC DD+
Sbjct: 8    EMQFGGFQRFCRMNMRNFAVLR---ICKVNSSFVSDIAGKCVQRVQTSENYSALACADDD 67

Query: 441  IDKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLM 500
              KGMEKKKLA NLASLVEESL VD +RPK+RMEL+RSLEIQIK+RVKAQY+NGKFMDLM
Sbjct: 68   FCKGMEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLM 127

Query: 501  GKVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRK 560
            GKVIACP TLQNAYDC+R+NSNVDIASNDHL+SF+SMAEEL NGSFDVN NTFSISSS+K
Sbjct: 128  GKVIACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKK 187

Query: 561  EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWF 620
            EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI+NPDWWF
Sbjct: 188  EVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWF 247

Query: 621  TLDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGV 680
            T+D+SKKMDEL MAK+I+VMEDKIEDP  FA+IRSIF+AGALNLEFGGFPKGHGLPQEGV
Sbjct: 248  TVDISKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGV 307

Query: 681  LSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGE 740
            LSPILMNIYLNLFDQEFFRLSMKYEAINKYGNA +D SQSKLRSWFRR LKGNDSEYP +
Sbjct: 308  LSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQ 367

Query: 741  EKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRG 800
            EKDNIRVYCCRYMDEIF+AVSGSKDVA+SFRSEI +FIQKSLHLD+NHQ EMVSC ETRG
Sbjct: 368  EKDNIRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRG 427

Query: 801  IRFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKE 860
            IRFLGCLVRRS KESPAVKAVHKLKEKVELFALQKQEAWN WTVWLGKKWLAHGLKKVKE
Sbjct: 428  IRFLGCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKE 487

Query: 861  SEIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEP 920
            SEIKHLAKNSPSLNQISSFRK GMETDHWYKVLLKIWMQDINAKAA++EETILS +VVEP
Sbjct: 488  SEIKHLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEP 547

Query: 921  SLPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLR 980
            SLPLELRDSF+EFQR VEEYVSSETAST+ALLPNYDPSVKSTFITEIIAPV+SIRKRLLR
Sbjct: 548  SLPLELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLR 607

Query: 981  YRLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCI 1040
            YRLITNKGYPC SPFLIL DN QIIDWF GV RRWL+WYSNCSNFSE+ILICDQVRKSCI
Sbjct: 608  YRLITNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCI 667

Query: 1041 RTLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGL 1100
            RTLAAKHR HESEIEKKFD ELSRI S+PEIEQEEE+  SDTH L HDEA  YGISYSGL
Sbjct: 668  RTLAAKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGL 727

Query: 1101 CLLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRF 1160
            CLLSLARMVSQSRPCNCFVMGCLA APSVYTLHVMERQKFPGW TGFSSSIHPSLNRRR 
Sbjct: 728  CLLSLARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRV 787

Query: 1161 GLCKQHLKDLYLGHISLQSIDFGAWK 1186
            GLCKQHLKDLYLGHISLQS++FGAWK
Sbjct: 788  GLCKQHLKDLYLGHISLQSVNFGAWK 810

BLAST of Spg029397 vs. ExPASy TrEMBL
Match: A0A6J1CX32 (nuclear intron maturase 4, mitochondrial isoform X3 OS=Momordica charantia OX=3673 GN=LOC111015360 PE=4 SV=1)

HSP 1 Score: 1448.3 bits (3748), Expect = 0.0e+00
Identity = 719/805 (89.32%), Positives = 758/805 (94.16%), Query Frame = 0

Query: 382  MQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDI-GKCVQRIQSSENYSALACVDDEI 441
            MQFGGF+RFC MNM NFAVLR   ICKVNSSFVSDI GKCVQR+Q+SENYSALAC DD+ 
Sbjct: 1    MQFGGFQRFCRMNMRNFAVLR---ICKVNSSFVSDIAGKCVQRVQTSENYSALACADDDF 60

Query: 442  DKGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMG 501
             KGMEKKKLA NLASLVEESL VD +RPK+RMEL+RSLEIQIK+RVKAQY+NGKFMDLMG
Sbjct: 61   CKGMEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLMG 120

Query: 502  KVIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKE 561
            KVIACP TLQNAYDC+R+NSNVDIASNDHL+SF+SMAEEL NGSFDVN NTFSISSS+KE
Sbjct: 121  KVIACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKKE 180

Query: 562  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFT 621
            VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEI+NPDWWFT
Sbjct: 181  VLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWFT 240

Query: 622  LDLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVL 681
            +D+SKKMDEL MAK+I+VMEDKIEDP  FA+IRSIF+AGALNLEFGGFPKGHGLPQEGVL
Sbjct: 241  VDISKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGVL 300

Query: 682  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEE 741
            SPILMNIYLNLFDQEFFRLSMKYEAINKYGNA +D SQSKLRSWFRR LKGNDSEYP +E
Sbjct: 301  SPILMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQE 360

Query: 742  KDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGI 801
            KDNIRVYCCRYMDEIF+AVSGSKDVA+SFRSEI +FIQKSLHLD+NHQ EMVSC ETRGI
Sbjct: 361  KDNIRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRGI 420

Query: 802  RFLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKES 861
            RFLGCLVRRS KESPAVKAVHKLKEKVELFALQKQEAWN WTVWLGKKWLAHGLKKVKES
Sbjct: 421  RFLGCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKES 480

Query: 862  EIKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPS 921
            EIKHLAKNSPSLNQISSFRK GMETDHWYKVLLKIWMQDINAKAA++EETILS +VVEPS
Sbjct: 481  EIKHLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEPS 540

Query: 922  LPLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRY 981
            LPLELRDSF+EFQR VEEYVSSETAST+ALLPNYDPSVKSTFITEIIAPV+SIRKRLLRY
Sbjct: 541  LPLELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLRY 600

Query: 982  RLITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIR 1041
            RLITNKGYPC SPFLIL DN QIIDWF GV RRWL+WYSNCSNFSE+ILICDQVRKSCIR
Sbjct: 601  RLITNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCIR 660

Query: 1042 TLAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLC 1101
            TLAAKHR HESEIEKKFD ELSRI S+PEIEQEEE+  SDTH L HDEA  YGISYSGLC
Sbjct: 661  TLAAKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGLC 720

Query: 1102 LLSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFG 1161
            LLSLARMVSQSRPCNCFVMGCLA APSVYTLHVMERQKFPGW TGFSSSIHPSLNRRR G
Sbjct: 721  LLSLARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRVG 780

Query: 1162 LCKQHLKDLYLGHISLQSIDFGAWK 1186
            LCKQHLKDLYLGHISLQS++FGAWK
Sbjct: 781  LCKQHLKDLYLGHISLQSVNFGAWK 802

BLAST of Spg029397 vs. ExPASy TrEMBL
Match: A0A1S3B491 (uncharacterized protein LOC103486008 OS=Cucumis melo OX=3656 GN=LOC103486008 PE=4 SV=1)

HSP 1 Score: 1386.3 bits (3587), Expect = 0.0e+00
Identity = 688/804 (85.57%), Positives = 742/804 (92.29%), Query Frame = 0

Query: 382  MQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVDDEID 441
            MQFGG RRFC +N  NF+  +SV++C VNSSFVSDIGKC Q +QSSENYS LA  DDEID
Sbjct: 1    MQFGGLRRFCKINKRNFSNSQSVNVCIVNSSFVSDIGKCFQIVQSSENYSTLARADDEID 60

Query: 442  KGMEKKKLAINLASLVEESLIVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMGK 501
            KGMEK KLA+NLASLVEESL VDL+R KTRMEL+RSLEIQIK RVKAQY+NGKF+DLMG 
Sbjct: 61   KGMEKMKLAMNLASLVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGN 120

Query: 502  VIACPTTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKEV 561
            VIACP TLQNAYDCIR+NSNVDI SND L+SF+SMA+EL +G+FDVNTNTFSI SSRKEV
Sbjct: 121  VIACPNTLQNAYDCIRINSNVDIKSNDCLISFESMAKELSHGNFDVNTNTFSILSSRKEV 180

Query: 562  LILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFTL 621
            LILPK+KLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYI+KEI +PDWWFT+
Sbjct: 181  LILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTV 240

Query: 622  DLSKKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVLS 681
            DLSKKMDELVMAK+ITVMEDKIEDP LFAVIRSI  AGALNLEFG FPKGHGLPQEGVLS
Sbjct: 241  DLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLS 300

Query: 682  PILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEEK 741
            PIL NIYLNLFDQEFFRLSMKYEAIN+YGN G+D SQSKLRSWFRR LK N S+YPGEEK
Sbjct: 301  PILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKENSSDYPGEEK 360

Query: 742  DNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGIR 801
            D IRVYCCRYMDEIFLAVSGSKDVA+SFRSEI +F+QK+LHLD+NH+ EMVSC ET GIR
Sbjct: 361  DKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSC-ETHGIR 420

Query: 802  FLGCLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKESE 861
            FLGCLVRRS++ESPAVK++HKLKEKVELF LQKQE W +WTVWLGKKWLAHGLKKVKESE
Sbjct: 421  FLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESE 480

Query: 862  IKHLAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPSL 921
            IKHLAKNS SLNQISSFRK GMETDHWYKVLLKIWMQD+NA+AA+SEE ILSKH VEPSL
Sbjct: 481  IKHLAKNS-SLNQISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSL 540

Query: 922  PLELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRYR 981
            P ELRDSF+EFQR VEEY+SSETAST+ALLPNYDPSVK TFITEIIAPV+SIRKRL RYR
Sbjct: 541  PFELRDSFYEFQRRVEEYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYR 600

Query: 982  LITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIRT 1041
            L+TNKG+PC SPFLILQDN QIIDWF GVSRRW RWY+  SNFSEL LI DQVRKSCIRT
Sbjct: 601  LVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRT 660

Query: 1042 LAAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLCL 1101
            LAAKH+IHESEIEKKFDSELS+IYSSPEIEQE+EK+T DTHVLDHDEAL YGISYSGLCL
Sbjct: 661  LAAKHQIHESEIEKKFDSELSKIYSSPEIEQEKEKST-DTHVLDHDEALNYGISYSGLCL 720

Query: 1102 LSLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFGL 1161
            LSLARMVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGW TGFSSSIHPSLN+RRFGL
Sbjct: 721  LSLARMVSRSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGL 780

Query: 1162 CKQHLKDLYLGHISLQSIDFGAWK 1186
            CKQHL DLYLG ISLQS+DFGAWK
Sbjct: 781  CKQHLADLYLGRISLQSVDFGAWK 801
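
The percentages in each HSP summary line follow directly from the match counts over the alignment length. A minimal Python sketch, assuming the summary line keeps the layout shown in this record; `parse_hsp_stats` and `percent` are hypothetical helper names, not part of any BLAST tool, and the pattern tolerates either spelling of the Positives label:

```python
import re

# Hypothetical pattern for the HSP summary line as it appears in this record.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_stats(line):
    """Extract identity and positive counts from one HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": (int(ident), int(ident_len)),
        "identity_pct": float(ident_pct),
        "positives": (int(pos), int(pos_len)),
        "positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 688/804 (85.57%), Positives = 742/804 (92.29%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Recompute each percentage from the raw counts and compare with the reported value.
    for key in ("identity", "positives"):
        matches, length = stats[key]
        print(key, percent(matches, length), stats[key + "_pct"])
```

Recomputing the percentage from the counts is a quick consistency check when copying values out of the alignment blocks into the summary tables.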

BLAST of Spg029397 vs. ExPASy TrEMBL
Match: A0A6J1EMZ4 (nuclear intron maturase 4, mitochondrial isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111435824 PE=4 SV=1)

HSP 1 Score: 1384.4 bits (3582), Expect = 0.0e+00
Identity = 685/841 (81.45%), Positives = 752/841 (89.42%), Query Frame = 0

Query: 382  MQFGGFRRFCGMNMWNFAVLRSVHICKVNSSFVSDIGKCVQRIQSSENYSALACVDDEI- 441
            MQFGG RRFC +NM +F+V RSV+ C+V++SFVS+IG+CV   QSS+NYSALA  DDEI 
Sbjct: 1    MQFGGIRRFCWINMRSFSVSRSVNACRVDTSFVSEIGECV---QSSKNYSALAFTDDEIG 60

Query: 442  -------------------------------------DKGMEKKKLAINLASLVEESLIV 501
                                                  KGMEK KLA+NLASLVEESL V
Sbjct: 61   KGTEKRKLAMNLASLVEESIDVDMKTSKTRMELERSLGKGMEKSKLAMNLASLVEESLDV 120

Query: 502  DLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMGKVIACPTTLQNAYDCIRLNSNVD 561
            D K  KTRMEL+RSLEIQIK+RVKAQY+NGKF+DLM KV+ACP TLQNAY+C+R+NSNVD
Sbjct: 121  DTKTSKTRMELKRSLEIQIKKRVKAQYVNGKFLDLMEKVVACPKTLQNAYNCVRINSNVD 180

Query: 562  IASNDHLMSFDSMAEELRNGSFDVNTNTFSISSSRKEVLILPKLKLKVLQEAIRIVLECV 621
            + SNDHL+SF+ MAEELRNG+FD+N NTFSISSSRKEVLILPKLKLKVLQEAIRIVLECV
Sbjct: 181  VTSNDHLISFEPMAEELRNGNFDINANTFSISSSRKEVLILPKLKLKVLQEAIRIVLECV 240

Query: 622  FRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFTLDLSKKMDELVMAKIITVMEDKI 681
            FRPHFSKISHGCRSGRGHSTALKYI+KEI NPDWWFT++LSK MDEL+MAK+ITVM+DKI
Sbjct: 241  FRPHFSKISHGCRSGRGHSTALKYIKKEIKNPDWWFTVNLSKMMDELMMAKLITVMKDKI 300

Query: 682  EDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKY 741
            EDPNLFAVIR+IFDAGALNLEFG FPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKY
Sbjct: 301  EDPNLFAVIRTIFDAGALNLEFGDFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKY 360

Query: 742  EAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEEKDNIRVYCCRYMDEIFLAVSGSK 801
            EAIN+Y NAG+  SQSKLRSWFRR LKGNDSEYPG+EK NIRVYCCR MDEIFLA+SGSK
Sbjct: 361  EAINEYSNAGQGGSQSKLRSWFRRQLKGNDSEYPGDEKGNIRVYCCRCMDEIFLAISGSK 420

Query: 802  DVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGIRFLGCLVRRSMKESPAVKAVHKL 861
            DVA+ FRSEIL+F+Q SLHLD++HQ EMV C  T GIRFLGCLVRRS +ESPAVKAVHK+
Sbjct: 421  DVALRFRSEILDFLQNSLHLDVHHQEEMVPCQATHGIRFLGCLVRRSEQESPAVKAVHKM 480

Query: 862  KEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSPSLNQISSFRKAGM 921
            KEKVELFA QKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLA+ SPSLNQISSFRKAGM
Sbjct: 481  KEKVELFAFQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLAEQSPSLNQISSFRKAGM 540

Query: 922  ETDHWYKVLLKIWMQDINAKAADSEETILSKHVVEPSLPLELRDSFHEFQRCVEEYVSSE 981
            ETDHWYK LLKIWMQ+INA+AA+SEETILSKHVVEPSLP ELRDSF+EFQRCVEEYVSSE
Sbjct: 541  ETDHWYKALLKIWMQNINARAAESEETILSKHVVEPSLPQELRDSFYEFQRCVEEYVSSE 600

Query: 982  TASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRYRLITNKGYPCPSPFLILQDNNQI 1041
            TASTIALLPNYDPSVK TF+TEIIAPV+SI KRL RYRL+TNKGYPCPSPFLILQD+ QI
Sbjct: 601  TASTIALLPNYDPSVKPTFVTEIIAPVNSIGKRLRRYRLLTNKGYPCPSPFLILQDDTQI 660

Query: 1042 IDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSR 1101
            IDWF GVSRRW RWY+NCSNFSELILICDQVR+SCIRTLAAKHR+HES+IEKKF+SELS+
Sbjct: 661  IDWFFGVSRRWFRWYTNCSNFSELILICDQVRQSCIRTLAAKHRMHESDIEKKFESELSK 720

Query: 1102 IYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLCLLSLARMVSQSRPCNCFVMGCLA 1161
            IYS+P+IEQEEEK +SDT+ LD+DEALMYGISYSGLCLLSLARMVSQSRPCNCFV+GCL+
Sbjct: 721  IYSTPDIEQEEEKKSSDTNGLDNDEALMYGISYSGLCLLSLARMVSQSRPCNCFVIGCLS 780

Query: 1162 PAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFGLCKQHLKDLYLGHISLQSIDFGA 1185
            PAPSVYTLHVMERQKFPGW TGFSSSIHPSLNRRRFGLC++HLKDLYLGHISLQS+DFGA
Sbjct: 781  PAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRFGLCEKHLKDLYLGHISLQSVDFGA 838

BLAST of Spg029397 vs. TAIR 10
Match: AT1G74350.1 (Intron maturase, type II family protein)

HSP 1 Score: 879.4 bits (2271), Expect = 3.2e-255
Identity = 449/743 (60.43%), Positives = 557/743 (74.97%), Query Frame = 0

Query: 449  LAINLASLVEESL--IVDLKRPKTRMELRRSLEIQIKRRVKAQYMNGKFMDLMGKVIACP 508
            LA  LASLVEES   + D  +P++RMEL+RSLE+++K+RVK Q +NGKF DL+ KVIA P
Sbjct: 11   LAGELASLVEESSSHVDDDSKPRSRMELKRSLELRLKKRVKEQCINGKFSDLLKKVIARP 70

Query: 509  TTLQNAYDCIRLNSNVDIASNDHLMSFDSMAEELRNGSFDVNTNTFSI--SSSRKEVLIL 568
             TL++AYDCIRLNSNV I   +  ++FDS+AEEL +G FDV +NTFSI      KEVL+L
Sbjct: 71   ETLRDAYDCIRLNSNVSITERNGSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVL 130

Query: 569  PKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFTLDLS 628
            P + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FTL L+
Sbjct: 131  PSVALKVVQEAIRIVLEVVFSPHFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLN 190

Query: 629  KKMDELVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKGHGLPQEGVLSPIL 688
            KK+D  V   +++VME+K+ED +L  ++RS+F+A  LNLEFGGFPKGHGLPQEGVLS +L
Sbjct: 191  KKLDVSVFENLLSVMEEKVEDSSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVL 250

Query: 689  MNIYLNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEEKDNI 748
            MNIYL+ FD EF+R+SM++EA+        D   SKLRSWFRR       +   E+   +
Sbjct: 251  MNIYLDRFDHEFYRISMRHEALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVAL 310

Query: 749  RVYCCRYMDEIFLAVSGSKDVAISFRSEILEFIQKSLHLDLNHQGEMVSCAETRGIRFLG 808
            RVYCCR+MDEI+ +VSG K VA   RSE + F++ SLHLD+  + +   C  T G+R LG
Sbjct: 311  RVYCCRFMDEIYFSVSGPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG 370

Query: 809  CLVRRSMKESPAVKAVHKLKEKVELFALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKH 868
             LVR++++ESP VKAVHKLKEKV LFALQK+EAW   TV +GKKWL HGLKKVKESEIK 
Sbjct: 371  TLVRKNVRESPTVKAVHKLKEKVRLFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKG 430

Query: 869  LAKNSPSLNQISSFRKAGMETDHWYKVLLKIWMQDINAKAAD-SEETILSKHVVEPSLPL 928
            LA ++ +L+QIS  RKAGMETDHWYK+LL+IWM+D+   +AD SEE +LSKHVVEP++P 
Sbjct: 431  LADSNSTLSQISCHRKAGMETDHWYKILLRIWMEDVLRTSADRSEEFVLSKHVVEPTVPQ 490

Query: 929  ELRDSFHEFQRCVEEYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRYRLI 988
            ELRD+F++FQ     YVSSETA+  ALLP      +  F  +++AP ++I +RL RY LI
Sbjct: 491  ELRDAFYKFQNAAAAYVSSETANLEALLPCPQSHDRPVFFGDVVAPTNAIGRRLYRYGLI 550

Query: 989  TNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNFSEL-ILICDQVRKSCIRTL 1048
            T KGY   +  LIL D  QIIDW+SG+ RRW+ WY  CSNF E+  LI +Q+R SCIRTL
Sbjct: 551  TAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIKALIDNQIRMSCIRTL 610

Query: 1049 AAKHRIHESEIEKKFDSELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLCLL 1108
            AAK+RIHE+EIEK+ D ELS I S+ +IEQE +    D+   D DE L YG+S SGLCLL
Sbjct: 611  AAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRDEHLTYGLSNSGLCLL 670

Query: 1109 SLARMVSQSRPCNCFVMGCLAPAPSVYTLHVMERQKFPGWNTGFSSSIHPSLNRRRFGLC 1168
            SLAR+VS+SRPCNCFV+GC   AP+VYTLH MERQKFPGW TGFS  I  SLN RR GLC
Sbjct: 671  SLARLVSESRPCNCFVIGCSMAAPAVYTLHAMERQKFPGWKTGFSVCIPSSLNGRRIGLC 730

Query: 1169 KQHLKDLYLGHISLQSIDFGAWK 1186
            KQHLKDLY+G ISLQ++DFGAW+
Sbjct: 731  KQHLKDLYIGQISLQAVDFGAWR 753

BLAST of Spg029397 vs. TAIR 10
Match: AT5G04050.1 (RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 218.8 bits (556), Expect = 2.4e-56
Identity = 163/582 (28.01%), Positives = 272/582 (46.74%), Query Frame = 0

Query: 481  QIKRRVKAQYMNGKFMDLMGKVIACPTTLQNAYDCIRL--NSNVDIASN-DHLMSFDSMA 540
            +++  V  QY +GKF  L+   ++ P  L  A   + L  NS+ D+A       S + M 
Sbjct: 48   ELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNLSLSANSSGDLADRVSRRFSIEEMG 107

Query: 541  EELRNGSFDVNTNTFSISSSRKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRS 600
             E+R G FD+ +      SS    L+LP LKLKVL EAIR+VLE V+   F+  S+G R 
Sbjct: 108  REIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIEAIRMVLEIVYDDRFATFSYGGRV 167

Query: 601  GRGHSTALKYIRKEIDNPDWWFTLDLSKKMDELVMAKIIT-VMEDKIEDPNLFAVIRSIF 660
            G G  TA++Y++  ++NP WWF +  +++M E     I+   + +KI D  L  +I+ +F
Sbjct: 168  GMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVDILCGFVGEKINDVMLIEMIKKLF 227

Query: 661  DAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDV 720
            + G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N     G + 
Sbjct: 228  EFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLDKEIQDLRLKMKVKNPRVGTGDEE 287

Query: 721  SQSKLRSWFRRHLKGNDSEYPGEEKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEF 780
            S             GN    P      + +Y  RY+DEI +  SGSK + +  +  I++ 
Sbjct: 288  S------------TGNVFFKP------VNIYAVRYLDEILVITSGSKMLTMDLKKRIVDI 347

Query: 781  IQKSLHLDLNHQGEMVSCAETRGIRFLGCL-------VRRSMKESPAVKAVHKLKEKVEL 840
            +++ L L ++     +  A +  I FLG         V R  K   AV+A+ K + + ++
Sbjct: 348  LEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKYQRQKDV 407

Query: 841  FALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSPSLNQISSFRKAGMETDHWY 900
              L+ + A       LG K   H LKK+K+S               + F+  G E ++  
Sbjct: 408  RKLELRNARERNRKTLGLKIFRHVLKKIKQS---------------NGFKFEG-EIENEV 467

Query: 901  KVLLKIW----MQDINAKAAD--------SEETILSKHVVEPSLPLELRDSFHEFQRCVE 960
            + + + W    MQD      +        +    LS   +   LP +L D++ EFQ  V+
Sbjct: 468  RDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQVD 527

Query: 961  EYVSSETASTIALLPNYDPSVK---------------STFITEIIAPVDSIRKRLLRYRL 1020
            ++++   A  +  L + +  V+               +    ++ AP + +RK +     
Sbjct: 528  KHLAPTQAKKV--LEDEERRVEEEEEQRYAERTVEDLTKLCMKVSAPEELVRKAIKLVGF 587

Query: 1021 ITNKGYPCPSPFLILQDNNQIIDWFSGVSRRWLRWYSNCSNF 1025
              + G P P   L+  +++ II W++GV R+WL ++  C N+
Sbjct: 588  TNSMGRPRPIIHLVTLEDSDIIKWYAGVGRKWLDFFCCCHNY 590

BLAST of Spg029397 vs. TAIR 10
Match: AT5G04050.2 (RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 203.8 bits (517), Expect = 7.9e-52
Identity = 184/730 (25.21%), Positives = 310/730 (42.47%), Query Frame = 0

Query: 481  QIKRRVKAQYMNGKFMDLMGKVIACPTTLQNAYDCIRL--NSNVDIASN-DHLMSFDSMA 540
            +++  V  QY +GKF  L+   ++ P  L  A   + L  NS+ D+A       S + M 
Sbjct: 48   ELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNLSLSANSSGDLADRVSRRFSIEEMG 107

Query: 541  EELRNGSFDVNTNTFSISSSRKEVLILPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRS 600
             E+R G FD+ +      SS    L+LP LKLKVL EAIR+VLE V+   F+  S+G R 
Sbjct: 108  REIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIEAIRMVLEIVYDDRFATFSYGGRV 167

Query: 601  GRGHSTALKYIRKEIDNPDWWFTLDLSKKMDELVMAKIIT-VMEDKIEDPNLFAVIRSIF 660
            G G  TA++Y++  ++NP WWF +  +++M E     I+   + +KI D  L  +I+ +F
Sbjct: 168  GMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVDILCGFVGEKINDVMLIEMIKKLF 227

Query: 661  DAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINKYGNAGRDV 720
            + G L +E GG   G G PQE  L  IL+N+Y +  D+E   L +K +  N     G + 
Sbjct: 228  EFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLDKEIQDLRLKMKVKNPRVGTGDEE 287

Query: 721  SQSKLRSWFRRHLKGNDSEYPGEEKDNIRVYCCRYMDEIFLAVSGSKDVAISFRSEILEF 780
            S             GN    P      + +Y  RY+DEI +  SGSK + +  +  I++ 
Sbjct: 288  S------------TGNVFFKP------VNIYAVRYLDEILVITSGSKMLTMDLKKRIVDI 347

Query: 781  IQKSLHLDLNHQGEMVSCAETRGIRFLGCL-------VRRSMKESPAVKAVHKLKEKVEL 840
            +++ L L ++     +  A +  I FLG         V R  K   AV+A+ K + + ++
Sbjct: 348  LEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKYQRQKDV 407

Query: 841  FALQKQEAWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSPSLNQISSFRKAGMETDHWY 900
              L+ + A       LG K   H LKK+K+S               + F+  G E ++  
Sbjct: 408  RKLELRNARERNRKTLGLKIFRHVLKKIKQS---------------NGFKFEG-EIENEV 467

Query: 901  KVLLKIW----MQDINAKAAD--------SEETILSKHVVEPSLPLELRDSFHEFQRCVE 960
            + + + W    MQD      +        +    LS   +   LP +L D++ EFQ  V+
Sbjct: 468  RDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQVD 527

Query: 961  EYVSSETASTIALLPNYDPSVKSTFITEIIAPVDSIRKRLLRYRLITNKGYPCPSPFLIL 1020
            ++                           +AP  + +                     +L
Sbjct: 528  KH---------------------------LAPTQAKK---------------------VL 587

Query: 1021 QDNNQIIDWFSGVSRRWLRWYSNCSNFSELILICDQVRKSCIRTLAAKHRIHESEIEKKF 1080
            +D  + ++                  ++E  +  + + K C++  A +  + ++      
Sbjct: 588  EDEERRVE------------EEEEQRYAERTV--EDLTKLCMKVSAPEELVRKAIKVSDL 647

Query: 1081 DS-ELSRIYSSPEIEQEEEKTTSDTHVLDHDEALMYGISYSGLCLLSLARMVSQSRPCNC 1140
            D  E +   S  E++   +K  SD   +D            G   L L R+ S     +C
Sbjct: 648  DGREEAHFPSEREVKMMGDKNLSDPKPVD------------GTLSLLLIRLASDEPLHHC 665

Query: 1141 FVMGCLAPAPSVYTLHVMERQKF------PGWNTGFSSSIHPSLNRRRFGLCKQHLKDLY 1181
                C      ++ +H+++ +          W  G   +IH +LNR+   LC  H+ D+Y
Sbjct: 708  AASFCERSDTIMHRVHLLQNRLHINPLDEEKWVPGM-GTIHSALNRKCLPLCSTHISDVY 665

BLAST of Spg029397 vs. TAIR 10
Match: ATMG00520.1 (Intron maturase, type II family protein)

HSP 1 Score: 87.0 bits (214), Expect = 1.1e-16
Identity = 56/172 (32.56%), Positives = 90/172 (52.33%), Query Frame = 0

Query: 570 KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIDNPDWWFTLDLSKKMDE 629
           K+++EAIR+VLE ++ P F   SH  RSG+G  + L+ I++E     W+   D+ K    
Sbjct: 15  KIMKEAIRMVLESIYDPEFPDTSH-FRSGQGCHSVLRRIKEEWGISRWFLEFDIRKCFHT 74

Query: 630 LVMAKIITVMEDKIEDPNLFAVIRSIFDAGALNLEFGGFPKG-HGLPQEGVLSPILMNIY 689
           +   ++I +++++I+DP  F  I+ +F AG L     G  +G + +P   +LS +  NIY
Sbjct: 75  IDRHRLIQILKEEIDDPKFFYSIQKVFSAGRL----VGVERGPYSVPHSVLLSALPGNIY 134

Query: 690 LNLFDQEFFRLSMKYEAINKYGNAGRDVSQSKLRSWFRRHLKGNDSEYPGEE 741
           L+  DQE  R+  KYE           + Q       R   + +D E PGEE
Sbjct: 135 LHKLDQEIGRIRQKYEI---------PIVQRVRSVLLRTGRRIDDQENPGEE 172

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_022146068.1  0.0e+00   89.32         nuclear intron maturase 4, mitochondrial isoform X2 [Momordica charantia]
XP_022146067.1  0.0e+00   89.21         nuclear intron maturase 4, mitochondrial isoform X1 [Momordica charantia]
XP_022146069.1  0.0e+00   89.32         nuclear intron maturase 4, mitochondrial isoform X3 [Momordica charantia]
XP_038882003.1  0.0e+00   86.51         nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida]
XP_038882001.1  0.0e+00   86.62         nuclear intron maturase 4, mitochondrial isoform X1 [Benincasa hispida] >XP_0388...

Match Name  E-value   Identity (%)  Description
Q9CA78      4.6e-254  60.43         Nuclear intron maturase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT...
Q9LZA5      7.2e-58   26.17         Nuclear intron maturase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT...
P0A3U0      1.9e-21   28.52         Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris OX=13...
P0A3U1      1.9e-21   28.52         Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (stra...
P03876      2.7e-20   25.34         Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204...

Match Name  E-value  Identity (%)  Description
A0A6J1CXL0  0.0e+00  89.32         nuclear intron maturase 4, mitochondrial isoform X2 OS=Momordica charantia OX=36...
A0A6J1CYJ7  0.0e+00  89.21         nuclear intron maturase 4, mitochondrial isoform X1 OS=Momordica charantia OX=36...
A0A6J1CX32  0.0e+00  89.32         nuclear intron maturase 4, mitochondrial isoform X3 OS=Momordica charantia OX=36...
A0A1S3B491  0.0e+00  85.57         uncharacterized protein LOC103486008 OS=Cucumis melo OX=3656 GN=LOC103486008 PE=...
A0A6J1EMZ4  0.0e+00  81.45         nuclear intron maturase 4, mitochondrial isoform X2 OS=Cucurbita moschata OX=366...

Match Name   E-value   Identity (%)  Description
AT1G74350.1  3.2e-255  60.43         Intron maturase, type II family protein
AT5G04050.1  2.4e-56   28.01         RNA-directed DNA polymerase (reverse transcriptase)
AT5G04050.2  7.9e-52   25.21         RNA-directed DNA polymerase (reverse transcriptase)
ATMG00520.1  1.1e-16   32.56         Intron maturase, type II family protein

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (cylindrica) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                Source       Source Term    Source Description                  Alignment
IPR024937  Domain X                       PFAM         PF01348        Intron_maturas2                     coord: 965..1073; e-value: 1.2E-10; score: 41.5
IPR000477  Reverse transcriptase domain   PFAM         PF00078        RVT_1                               coord: 563..804; e-value: 1.2E-13; score: 51.1
IPR000477  Reverse transcriptase domain   PROSITE      PS50878        RT_POL                              coord: 545..807; score: 9.039609
None       No IPR available               PANTHER      PTHR33642:SF3  COX1/OXI3 INTRON 1 PROTEIN-RELATED  coord: 408..1179
None       No IPR available               PANTHER      PTHR33642      COX1/OXI3 INTRON 1 PROTEIN-RELATED  coord: 408..1179
None       No IPR available               CDD          cd01651        RT_G2_intron                        coord: 565..804; e-value: 3.73521E-41; score: 149.274
IPR043502  DNA/RNA polymerase superfamily SUPERFAMILY  56672          DNA/RNA polymerases                 coord: 565..810

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name  Type
Spg029397.1   Spg029397.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0000373      Group II intron splicing
biological_process  GO:0006315      homing of group II introns
biological_process  GO:0090615      mitochondrial mRNA processing
biological_process  GO:1900864      mitochondrial RNA modification
biological_process  GO:0007005      mitochondrion organization
biological_process  GO:0032885      regulation of polysaccharide biosynthetic process
biological_process  GO:0006278      RNA-dependent DNA biosynthetic process
biological_process  GO:0009845      seed germination
biological_process  GO:0006397      mRNA processing
cellular_component  GO:0005739      mitochondrion
molecular_function  GO:0003964      RNA-directed DNA polymerase activity