CsaV3_4G012020 (gene) Cucumber (Chinese Long) v3

Overview
NameCsaV3_4G012020
Typegene
OrganismCucumis sativus L. var. sativus cv. Chinese Long (Cucumber (Chinese Long) v3)
Descriptionnuclear intron maturase 4, mitochondrial isoform X2
Locationchr4: 9323828 .. 9328675 (+)
RNA-Seq ExpressionCsaV3_4G012020
SyntenyCsaV3_4G012020
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTTAAGAGATCACTTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCCAATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAGAAGTATATATTTGGCCGGGGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGGAGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAAAGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTTATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAATTTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTTGCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTTCTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAATTGAGTAAGATTTACTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTTTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTGTATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGAATTGTTTTTGTTCTGCTTATGATTTCATTTAACTTCTAAATTACTTGTTGAGATAAGATTTGCCAAGAGAAATGCCATGACTGAGCCTTCGTGATGCATAATTTGTGCTAGCATAATGATTTTCTGGTCTCAATGGTGTCTGAAATCCTAATTGGAATGTTGCGCCGATTGAACATTGTTAAAACTAATGTGAGGACCAGGCTGGAACAACCCAATGTGAGAATCAGGTTTGAACAACGTTCAGTTGAATGGGTGATGTTTATATCGATATCTTGGAAACCTTTTCAATTGCTCGGATGAAGAATATTAGTATGGTGAGGAGTACATGCCAATGGTCTATGCAAACTTCAACTACTATAGATCGTCAAGATTATGATATTGGCTTTGTGCCTAGCTTAACCCATGGTGGGTGGCTCTATGTGCACATAGGAGCGTGGATACGAAGGTTGCTACTGTGGCAATTTCTAAATATGCGTGAGCTCTGTCTAGCCGTGTGTTTCCCTTATGGATATGGATCACATGTGAGCTATTCTCGGTCGGTATTTAGTATTCTACCATGGTTGGTCGAATGAAGGTTGCTATTGTGGCAAGTACTAAATATGCGTGAGCTTAGTTTGGCCGCATGATTCCTTAATCATGTGCGAACTATTCTCGGTCGTATATTTAGGCACTTTGCTATAGTAGACTTTTATATGGTTCAGGTTTGTTGGCAAATACTTGTGGTTTATATTAATATGTGATTTTAACGTTTTAATTATTATTTCATTATCGATGCCATATTTAAACAGTTTTGAGAGCATGTTTTACGACTATTTACGTCTAAAAACTTGCCACTCACTAAGTTTTATCTAACATTTTCAATGCCCCCTCCCCTACCCCAGGTAGCAGTTAAGGACCAAGCATTCGTTGAGCTTCCTGTCATTTATTGTAGATTCTCATTCTTTTTGTATGTATGTACTTAAGTTTTGTATTGCATCATTGGGTGGTTAGAAGCAGCGAGAGTCATGTTATTATATTGTGTTTTGAGGGATTCTTGTTGGAATGTAATTATTTTGTAACTAATTCCTAGTTTGGATTGTAACTAATTCCTAGTTTGGATATGAACTCTGATTGAGACTCGGACTAGTTTTGATGAGTGTGTGGTGAGATACACTCGATGTGATTTTTTAGTATGAAGATTATATATATATTGAAAATTTCTTGAAAATTTCCTCCATTTTCAGGTTTGGCTAACCCATATTTTAAGGAAAATGTTGTCGGAATTTCTATAGAATTTAATAAAACTTGTTCACTTGGCTTTTAGGTTGAAAATTGTTATTTTCAAAAAGTGTTTTCATGCCATGTCTAAGTTAAAAGGGAAAACCTAAAACGGAGTGTGACCACTAGCGGATTTAATATAGGCCTAGACCCCTCAACTTTATCGTTAAGAAAACTATTAAATTTTTAATACATGTAGATAGTTTAGTGGTTACTTTGTTTGTCAGCCCTCTTTCAACCAGGTTTGAATCGTATCATGTAATTTATTTTTCTTTTTTCTTCAAATTACAACTTTTTTATTTTCACCTTCAACTTATTTTTTTTAAGTTTTTTAATTTTTTCAATTTTTTGTTGCTTATTCAATATCTAAGTTTTTATTTATTTATTTATTTGTATTTGTATTTTTTTTTTAAGTTTGTTATTTTATTTATATGCATTTGTTCTAATTTTTAGAATTATTTTAATTGAACTATAAAGTTCCTATTAATAAATTTTTTAGATCTGACACTGAATGTGATACCATGACAGTCAATGTGCATGTCTAGATGCACATCTCCATAAGCATATTCAGATGCCCATGATTGCATGGCACTAATGCTTAACACAACCTGTGTGCATAAGTGGCAGCCCAGGCATATGTGCACATAAGCACACGCCCAACTCAGATGGTTGCCTATGCTCCCACGCACATGGTGCATGTGTCCCACACAGGCCCATGGCATGTGTGTCGCCTTGTGCACAGACAGTGCTCTCATGTGCGCCCCAAACATAACATGGGCATTGTGTGTCACCTCGTGCATACACAGTGCTTTTCTTATGCATTGACACAAGTCTCAGATGCATGCGACACCTTAAGTGCATGCGTTTACACTAAACGCCCCTATAAGCATTCGCAAGTCTCTCAAAGGTTTCAGAAAGGCCCAAACATGCCATGATCGTGTTGTAGCTCATCCATGGCCTAGATATTCTCAAAAGGCTTTAGAAGGCTCGAGACAGTGCCAAAATGTGCTAGAAGCATCTTGACCACTCTGAGAAGGCATTGGAACGTGTTGGAGAATGCTAGGTAATGTTGGAACATTCCATAACGATACTTTTAATGACGGTTATGTATCTATGACGGTCTAGAAGTGTCATGAAAGGTCTATAATGTTCTAAGGCTTAATGTAAGGGCTAGAACTTATTGAGATGGCTATAGAAGAAGCCATAATCGACCTCAATGCCTTAAGGACGGTCACGTGAGTCTATAAATACCCTAAGGGGGCCTCATTTGTACTCAAGGAATTCAAATGATTGAGTTAATCAAAGCTCTCATTCTCTCAAGCTCACTCTTATTCACTCATCTCTAAATTCTTTTGCTGCAAAACTCATCCC

mRNA sequence

ATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTTAAGAGATCACTTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCCAATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAGAAGTATATATTTGGCCGGGGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGGAGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAAAGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTTATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAATTTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTTGCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTTCTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAATTGAGTAAGATTTACTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTTTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTGTATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGA

Coding sequence (CDS)

ATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTTAAGAGATCACTTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCCAATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAGAAGTATATATTTGGCCGGGGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGGAGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAAAGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTTATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAATTTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTTGCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTTCTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAATTGAGTAAGATTTACTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTTTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTGTATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGA

Protein sequence

MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK*
Homology
BLAST of CsaV3_4G012020 vs. NCBI nr
Match: XP_011653460.1 (nuclear intron maturase 4, mitochondrial [Cucumis sativus])

HSP 1 Score: 1480.7 bits (3832), Expect = 0.0e+00
Identity = 739/739 (100.00%), Postives = 739/739 (100.00%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI
Sbjct: 61  MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 120

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI
Sbjct: 121 ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 180

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 181 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 240

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI
Sbjct: 241 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 300

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK
Sbjct: 301 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 360

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG
Sbjct: 361 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 420

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 421 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 480

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL
Sbjct: 481 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 540

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN
Sbjct: 541 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 600

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 601 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 660

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR
Sbjct: 661 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 720

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 721 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 780

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 781 ADLYLGRISLQSVDFGAWK 799

BLAST of CsaV3_4G012020 vs. NCBI nr
Match: KAE8649366.1 (hypothetical protein Csa_019152 [Cucumis sativus])

HSP 1 Score: 1480.7 bits (3832), Expect = 0.0e+00
Identity = 739/739 (100.00%), Postives = 739/739 (100.00%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI
Sbjct: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI
Sbjct: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI
Sbjct: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK
Sbjct: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG
Sbjct: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL
Sbjct: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN
Sbjct: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR
Sbjct: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 721 ADLYLGRISLQSVDFGAWK 739

BLAST of CsaV3_4G012020 vs. NCBI nr
Match: KAA0041778.1 (hypothetical protein E6C27_scaffold67G001360 [Cucumis melo var. makuwa] >TYK27066.1 hypothetical protein E5676_scaffold95G00640 [Cucumis melo var. makuwa])

HSP 1 Score: 1413.7 bits (3658), Expect = 0.0e+00
Identity = 704/739 (95.26%), Postives = 721/739 (97.56%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+MKLA+NLASLVEESLDVDLRRSKT+MELKRSLEI+IKERVKAQYLNGKFLDLMGNVI
Sbjct: 85  MEKMKLAMNLASLVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGNVI 144

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQN YDCIRINSNVDIKSND LISFESMAEELS+GNFDVNTNTFSILSSRKEVLI
Sbjct: 145 ACPNTLQNAYDCIRINSNVDIKSNDCLISFESMAEELSHGNFDVNTNTFSILSSRKEVLI 204

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 205 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 264

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMD+LVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPI
Sbjct: 265 SKKMDDLVMAKLITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLSPI 324

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLKGN+SDY GEEKDK
Sbjct: 325 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKGNSSDYPGEEKDK 384

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTLHLDVN EEEMVSCETHGIRFLG
Sbjct: 385 IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSCETHGIRFLG 444

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 445 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESEIKH 504

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLN+ISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFEL
Sbjct: 505 LAKNSSLNQISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPFEL 564

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTN
Sbjct: 565 RDSFYEFQRRVEEYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTN 624

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 625 KGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRTLAAK 684

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           HRIHESEIEKKFDSELSKIYSS EI+Q KEKSTDTHVLDHDEAL YGISYSGLCLLS AR
Sbjct: 685 HRIHESEIEKKFDSELSKIYSSPEIEQGKEKSTDTHVLDHDEALNYGISYSGLCLLSLAR 744

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 745 MVSRSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 804

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 805 ADLYLGRISLQSVDFGAWK 823

BLAST of CsaV3_4G012020 vs. NCBI nr
Match: XP_008442019.1 (PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo])

HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 703/739 (95.13%), Postives = 721/739 (97.56%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+MKLA+NLASLVEESLDVDLRRSKT+MELKRSLEI+IKERVKAQYLNGKFLDLMGNVI
Sbjct: 63  MEKMKLAMNLASLVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGNVI 122

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQN YDCIRINSNVDIKSND LISFESMA+ELS+GNFDVNTNTFSILSSRKEVLI
Sbjct: 123 ACPNTLQNAYDCIRINSNVDIKSNDCLISFESMAKELSHGNFDVNTNTFSILSSRKEVLI 182

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 183 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 242

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDELVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPI
Sbjct: 243 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLSPI 302

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK N+SDY GEEKDK
Sbjct: 303 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKENSSDYPGEEKDK 362

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTLHLDVN EEEMVSCETHGIRFLG
Sbjct: 363 IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSCETHGIRFLG 422

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 423 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESEIKH 482

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLN+ISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFEL
Sbjct: 483 LAKNSSLNQISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPFEL 542

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTN
Sbjct: 543 RDSFYEFQRRVEEYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTN 602

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 603 KGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRTLAAK 662

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           H+IHESEIEKKFDSELSKIYSS EI+QEKEKSTDTHVLDHDEAL YGISYSGLCLLS AR
Sbjct: 663 HQIHESEIEKKFDSELSKIYSSPEIEQEKEKSTDTHVLDHDEALNYGISYSGLCLLSLAR 722

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 723 MVSRSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 782

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 783 ADLYLGRISLQSVDFGAWK 801

BLAST of CsaV3_4G012020 vs. NCBI nr
Match: XP_038882003.1 (nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida])

HSP 1 Score: 1380.2 bits (3571), Expect = 0.0e+00
Identity = 684/740 (92.43%), Postives = 713/740 (96.35%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+MKLA+NLASLVEESLDVDL+RSKTQMELKRSLEI+IKERVKAQYLNGKFLDLMG VI
Sbjct: 97  MEKMKLAMNLASLVEESLDVDLKRSKTQMELKRSLEIQIKERVKAQYLNGKFLDLMGKVI 156

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACP TLQN YDC+RINSNVDI SND LISFESMAEELSNGNFDVN NTFSILSSRKEVL+
Sbjct: 157 ACPTTLQNAYDCVRINSNVDIMSNDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLV 216

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKI+LKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYI+KEIK+PDWWFT+DL
Sbjct: 217 LPKIELKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEIKNPDWWFTIDL 276

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIY+AGALNLEFGGFPKGHGLPQEG+LSPI
Sbjct: 277 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGILSPI 336

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGN+SDY GE+KDK
Sbjct: 337 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNSSDYPGEQKDK 396

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSC-ETHGIRFL 360
           IRVYCCRYMDEIFLAVSGSKDVA SFRSEIFYF+QKTLHLDVN +EEMVSC ETHGIRFL
Sbjct: 397 IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFYFLQKTLHLDVNHQEEMVSCGETHGIRFL 456

Query: 361 GCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIK 420
           GCLVRRSVQESPAVKS+HKLK+KVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIK
Sbjct: 457 GCLVRRSVQESPAVKSVHKLKKKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIK 516

Query: 421 HLAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFE 480
           HLAKNSSLN+ISSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLP E
Sbjct: 517 HLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPLE 576

Query: 481 LRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVT 540
           LRDSFYEFQR V+EYIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRLLRYRLVT
Sbjct: 577 LRDSFYEFQRCVQEYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLLRYRLVT 636

Query: 541 NKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAA 600
           NKGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI D VRKSCIRTLAA
Sbjct: 637 NKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELILICDLVRKSCIRTLAA 696

Query: 601 KHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFA 660
           KHRIHESEIEKKFDSELSK+YSS EI+QE+EKS DTH LDHDEALKYGISYSGLCLLS A
Sbjct: 697 KHRIHESEIEKKFDSELSKMYSSPEIEQEEEKSPDTHGLDHDEALKYGISYSGLCLLSLA 756

Query: 661 RMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQH 720
           RMVSQSRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+H
Sbjct: 757 RMVSQSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKH 816

Query: 721 LADLYLGRISLQSVDFGAWK 740
           L DLYLG ISLQS+DFGAWK
Sbjct: 817 LEDLYLGHISLQSIDFGAWK 836

BLAST of CsaV3_4G012020 vs. ExPASy Swiss-Prot
Match: Q9CA78 (Nuclear intron maturase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT4 PE=3 SV=2)

HSP 1 Score: 843.2 bits (2177), Expect = 2.3e-243
Identity = 439/743 (59.08%), Postives = 548/743 (73.76%), Query Frame = 0

Query: 6   LAINLASLVEESLD--VDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACP 65
           LA  LASLVEES     D  + +++MELKRSLE+R+K+RVK Q +NGKF DL+  VIA P
Sbjct: 56  LAGELASLVEESSSHVDDDSKPRSRMELKRSLELRLKKRVKEQCINGKFSDLLKKVIARP 115

Query: 66  NTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILS--SRKEVLIL 125
            TL++ YDCIR+NSNV I   +  ++F+S+AEELS+G FDV +NTFSI++    KEVL+L
Sbjct: 116 ETLRDAYDCIRLNSNVSITERNGSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVL 175

Query: 126 PKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLS 185
           P + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FT+ L+
Sbjct: 176 PSVALKVVQEAIRIVLEVVFSPHFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLN 235

Query: 186 KKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPIL 245
           KK+D  V   L++VME+K+ED  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +L
Sbjct: 236 KKLDVSVFENLLSVMEEKVEDSSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVL 295

Query: 246 TNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI 305
            NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ        + E+   +
Sbjct: 296 MNIYLDRFDHEFYRISMRHEALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVAL 355

Query: 306 RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCE-THGIRFLG 365
           RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  E +   CE T G+R LG
Sbjct: 356 RVYCCRFMDEIYFSVSGPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG 415

Query: 366 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 425
            LVR++V+ESP VK++HKLKEKV LF LQK+E W   TV +GKKWL HGLKKVKESEIK 
Sbjct: 416 TLVRKNVRESPTVKAVHKLKEKVRLFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKG 475

Query: 426 LA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPF 485
           LA  NS+L++IS  RK GMETDHWYK+LL+IWM+D L   A  SEE +LSKH VE ++P 
Sbjct: 476 LADSNSTLSQISCHRKAGMETDHWYKILLRIWMEDVLRTSADRSEEFVLSKHVVEPTVPQ 535

Query: 486 ELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLV 545
           ELRD+FY+FQ     Y+SSETA+  ALLP      +P F  +++AP N+I +RL RY L+
Sbjct: 536 ELRDAFYKFQNAAAAYVSSETANLEALLPCPQSHDRPVFFGDVVAPTNAIGRRLYRYGLI 595

Query: 546 TNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL 605
           T KG+  S+  LIL D  QIIDW+ G+ RR   WY   SNF E+  LI +Q+R SCIRTL
Sbjct: 596 TAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIKALIDNQIRMSCIRTL 655

Query: 606 AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLL 665
           AAK+RIHE+EIEK+ D ELS I S+ +I+QE + +  D+   D DE L YG+S SGLCLL
Sbjct: 656 AAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRDEHLTYGLSNSGLCLL 715

Query: 666 SFARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLC 725
           S AR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPGWKTGFS  I  SLN RR GLC
Sbjct: 716 SLARLVSESRPCNCFVIGCSMAAPAVYTLHAMERQKFPGWKTGFSVCIPSSLNGRRIGLC 775

Query: 726 KQHLADLYLGRISLQSVDFGAWK 740
           KQHL DLY+G+ISLQ+VDFGAW+
Sbjct: 776 KQHLKDLYIGQISLQAVDFGAWR 798

BLAST of CsaV3_4G012020 vs. ExPASy Swiss-Prot
Match: Q9LZA5 (Nuclear intron maturase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT3 PE=3 SV=2)

HSP 1 Score: 206.1 bits (523), Expect = 1.4e-51
Identity = 196/770 (25.45%), Postives = 330/770 (42.86%), Query Frame = 0

Query: 17  SLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDC 76
           SL ++  ++ T+  +K  LE  + +    QY +GKF  L+ N ++ P  L    QN+   
Sbjct: 31  SLFLNSDQTITEPLVKSELEALVLK----QYSHGKFYSLVKNAVSLPCVLLAACQNL--S 90

Query: 77  IRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQE 136
           +  NS+ D+     R  S E M  E+  G FD+ +     +SS    L+LP +KLKVL E
Sbjct: 91  LSANSSGDLADRVSRRFSIEEMGREIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIE 150

Query: 137 AIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMA 196
           AIR+VLE V+   F+  S+G R G G  TA++Y+K  +++P WWF V  +++M +E  + 
Sbjct: 151 AIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVD 210

Query: 197 KLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD 256
            L   + +KI D  L  +I+ ++  G L +E GG   G G PQE  L  IL N+Y +  D
Sbjct: 211 ILCGFVGEKINDVMLIEMIKKLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLD 270

Query: 257 QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMD 316
           +E   L +K +  N    TG + S   +  +F+                 + +Y  RY+D
Sbjct: 271 KEIQDLRLKMKVKNPRVGTGDEESTGNV--FFK----------------PVNIYAVRYLD 330

Query: 317 EIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMV-SCETHGIRFLGCL------- 376
           EI +  SGSK +    +  I   +++ L L V+R    + S  +  I FLG         
Sbjct: 331 EILVITSGSKMLTMDLKKRIVDILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPS 390

Query: 377 VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLA 436
           V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S      
Sbjct: 391 VLRPPKSEKAVRAMKKYQRQKDVRKLELRNARERNRKTLGLKIFRHVLKKIKQS------ 450

Query: 437 KNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKH 496
                   + F+  G E ++  + + + W    MQD      E        +    LS  
Sbjct: 451 --------NGFKFEG-EIENEVRDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLR 510

Query: 497 AVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAK---------------P 556
            +   LP +L D++ EFQ  V ++++   A    +L + +   +                
Sbjct: 511 HIREKLPQDLIDAYDEFQEQVDKHLAPTQAK--KVLEDEERRVEEEEEQRYAERTVEDLT 570

Query: 557 TFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFV-----GVSRRLF 616
               ++ AP   +RK +       + G P     L+  +++ II W+      G +++L 
Sbjct: 571 KLCMKVSAPEELVRKAIKLVGFTNSMGRPRPIIHLVTLEDSDIIKWYARHEKHGSTKKLI 630

Query: 617 RWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKE 676
           R Y      S+L                      +   E  F SE           +E +
Sbjct: 631 RHYTKDLRVSDL----------------------DGREEAHFPSE-----------REVK 690

Query: 677 KSTDTHVLDHDEALKYGISYSGLCLLSFARMVSQSRPCNCFVIGCLAPAPSVYTLHVMER 735
              D ++ D            G   L   R+ S     +C    C      ++ +H+++ 
Sbjct: 691 MMGDKNLSDPKPV-------DGTLSLLLIRLASDEPLHHCAASFCERSDTIMHRVHLLQN 715

BLAST of CsaV3_4G012020 vs. ExPASy Swiss-Prot
Match: P0A3U0 (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris OX=1359 GN=ltrA PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 1.2e-23
Identity = 84/299 (28.09%), Postives = 141/299 (47.16%), Query Frame = 0

Query: 113 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDP 172
           S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK IK+E    
Sbjct: 94  SKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGA 153

Query: 173 DWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGH-GL 232
            W+   D+    D +    LI ++  KI+D K+  +I     AG   LE   + K + G 
Sbjct: 154 RWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSGT 213

Query: 233 PQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQ 292
           PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      ++
Sbjct: 214 PQGGILSPLLANIYLHELDKFVLQLKMKFDR----------ESPERITPEYRELHNEIKR 273

Query: 293 LKGNNSDYSGEEKDKIR----------------------VYCCRYMDEIFLAVSGSKDVA 352
           +        GEEK K+                       +   RY D+  ++V GSK+  
Sbjct: 274 ISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDC 333

Query: 353 HSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK 383
              + ++  F+   L ++++ E+ +++  +   RFLG  +R  V+ S  +K   K+K++
Sbjct: 334 QWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378

BLAST of CsaV3_4G012020 vs. ExPASy Swiss-Prot
Match: P0A3U1 (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (strain MG1363) OX=416870 GN=ltrA PE=1 SV=1)

HSP 1 Score: 113.2 bits (282), Expect = 1.2e-23
Identity = 84/299 (28.09%), Postives = 141/299 (47.16%), Query Frame = 0

Query: 113 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDP 172
           S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK IK+E    
Sbjct: 94  SKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGA 153

Query: 173 DWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGH-GL 232
            W+   D+    D +    LI ++  KI+D K+  +I     AG   LE   + K + G 
Sbjct: 154 RWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSGT 213

Query: 233 PQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQ 292
           PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      ++
Sbjct: 214 PQGGILSPLLANIYLHELDKFVLQLKMKFDR----------ESPERITPEYRELHNEIKR 273

Query: 293 LKGNNSDYSGEEKDKIR----------------------VYCCRYMDEIFLAVSGSKDVA 352
           +        GEEK K+                       +   RY D+  ++V GSK+  
Sbjct: 274 ISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDC 333

Query: 353 HSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK 383
              + ++  F+   L ++++ E+ +++  +   RFLG  +R  V+ S  +K   K+K++
Sbjct: 334 QWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378

BLAST of CsaV3_4G012020 vs. ExPASy Swiss-Prot
Match: P03876 (Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=AI2 PE=4 SV=2)

HSP 1 Score: 107.1 bits (266), Expect = 8.8e-22
Identity = 91/361 (25.21%), Postives = 164/361 (45.43%), Query Frame = 0

Query: 44  KAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFD 103
           K + +N + L LM ++      L   Y+ I+       K ++ +         L+  + D
Sbjct: 277 KTETINTRILKLMSDI----RMLLIAYNKIKSKKGNMSKGSNNITLDGINISYLNKLSKD 336

Query: 104 VNTNTFSILSSRK----------EVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR 163
           +NTN F     R+            L +   + K++QE++R++LE ++   FS  SHG R
Sbjct: 337 INTNMFKFSPVRRVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSFSYYSHGFR 396

Query: 164 SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIY 223
                 TA+   K  ++  +W+  VDL+K  D +    LI V+ ++I+D     ++  + 
Sbjct: 397 PNLSCLTAIIQCKNYMQYCNWFIKVDLNKCFDTIPHNMLINVLNERIKDKGFMDLLYKLL 456

Query: 224 LAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEY-----GN 283
            AG ++          G+PQ  V+SPIL NI+L+  D+    L  K+E  NE+      N
Sbjct: 457 RAGYVDKNNNYHNTTLGIPQGSVVSPILCNIFLDKLDK---YLENKFE--NEFNTGNMSN 516

Query: 284 TGQDGSQSRLRSWFRR-----------QLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVS 343
            G++   + L S   R           +L+ +     G +K   R Y  RY D+I + V 
Sbjct: 517 RGRNPIYNSLSSKIYRCKLLSEKLKLIRLRDHYQRNMGSDKSFKRAYFVRYADDIIIGVM 576

Query: 344 GSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIH 379
           GS +   +  ++I  F+++ L + +N ++ ++     G+ FLG  V+ +  E    + I 
Sbjct: 577 GSHNDCKNILNDINNFLKENLGMSINMDKSVIKHSKEGVSFLGYDVKVTPWEKRPYRMIK 628

BLAST of CsaV3_4G012020 vs. ExPASy TrEMBL
Match: A0A0A0KWB0 (Reverse transcriptase domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G188380 PE=4 SV=1)

HSP 1 Score: 1480.7 bits (3832), Expect = 0.0e+00
Identity = 739/739 (100.00%), Postives = 739/739 (100.00%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI
Sbjct: 61  MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 120

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI
Sbjct: 121 ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 180

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 181 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 240

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI
Sbjct: 241 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 300

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK
Sbjct: 301 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 360

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG
Sbjct: 361 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 420

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 421 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 480

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL
Sbjct: 481 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 540

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN
Sbjct: 541 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 600

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 601 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 660

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR
Sbjct: 661 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 720

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 721 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 780

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 781 ADLYLGRISLQSVDFGAWK 799

BLAST of CsaV3_4G012020 vs. ExPASy TrEMBL
Match: A0A5A7TFZ7 (Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold95G00640 PE=4 SV=1)

HSP 1 Score: 1413.7 bits (3658), Expect = 0.0e+00
Identity = 704/739 (95.26%), Postives = 721/739 (97.56%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+MKLA+NLASLVEESLDVDLRRSKT+MELKRSLEI+IKERVKAQYLNGKFLDLMGNVI
Sbjct: 85  MEKMKLAMNLASLVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGNVI 144

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQN YDCIRINSNVDIKSND LISFESMAEELS+GNFDVNTNTFSILSSRKEVLI
Sbjct: 145 ACPNTLQNAYDCIRINSNVDIKSNDCLISFESMAEELSHGNFDVNTNTFSILSSRKEVLI 204

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 205 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 264

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMD+LVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPI
Sbjct: 265 SKKMDDLVMAKLITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLSPI 324

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLKGN+SDY GEEKDK
Sbjct: 325 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKGNSSDYPGEEKDK 384

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTLHLDVN EEEMVSCETHGIRFLG
Sbjct: 385 IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSCETHGIRFLG 444

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 445 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESEIKH 504

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLN+ISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFEL
Sbjct: 505 LAKNSSLNQISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPFEL 564

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTN
Sbjct: 565 RDSFYEFQRRVEEYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTN 624

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 625 KGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRTLAAK 684

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           HRIHESEIEKKFDSELSKIYSS EI+Q KEKSTDTHVLDHDEAL YGISYSGLCLLS AR
Sbjct: 685 HRIHESEIEKKFDSELSKIYSSPEIEQGKEKSTDTHVLDHDEALNYGISYSGLCLLSLAR 744

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 745 MVSRSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 804

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 805 ADLYLGRISLQSVDFGAWK 823

BLAST of CsaV3_4G012020 vs. ExPASy TrEMBL
Match: A0A1S3B491 (uncharacterized protein LOC103486008 OS=Cucumis melo OX=3656 GN=LOC103486008 PE=4 SV=1)

HSP 1 Score: 1411.4 bits (3652), Expect = 0.0e+00
Identity = 703/739 (95.13%), Postives = 721/739 (97.56%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+MKLA+NLASLVEESLDVDLRRSKT+MELKRSLEI+IKERVKAQYLNGKFLDLMGNVI
Sbjct: 63  MEKMKLAMNLASLVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGNVI 122

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACPNTLQN YDCIRINSNVDIKSND LISFESMA+ELS+GNFDVNTNTFSILSSRKEVLI
Sbjct: 123 ACPNTLQNAYDCIRINSNVDIKSNDCLISFESMAKELSHGNFDVNTNTFSILSSRKEVLI 182

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL
Sbjct: 183 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 242

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDELVMAKLITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPI
Sbjct: 243 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLSPI 302

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK N+SDY GEEKDK
Sbjct: 303 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKENSSDYPGEEKDK 362

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLG 360
           IRVYCCRYMDEIFLAVSGSKDVA SFRSEIF F+QKTLHLDVN EEEMVSCETHGIRFLG
Sbjct: 363 IRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSCETHGIRFLG 422

Query: 361 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 420
           CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKH
Sbjct: 423 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESEIKH 482

Query: 421 LAKNSSLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFEL 480
           LAKNSSLN+ISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFEL
Sbjct: 483 LAKNSSLNQISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPFEL 542

Query: 481 RDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTN 540
           RDSFYEFQR V+EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTN
Sbjct: 543 RDSFYEFQRRVEEYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTN 602

Query: 541 KGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAK 600
           KGHPCSSPFLILQDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAK
Sbjct: 603 KGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRTLAAK 662

Query: 601 HRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFAR 660
           H+IHESEIEKKFDSELSKIYSS EI+QEKEKSTDTHVLDHDEAL YGISYSGLCLLS AR
Sbjct: 663 HQIHESEIEKKFDSELSKIYSSPEIEQEKEKSTDTHVLDHDEALNYGISYSGLCLLSLAR 722

Query: 661 MVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 720
           MVS+SRPCNCFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL
Sbjct: 723 MVSRSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHL 782

Query: 721 ADLYLGRISLQSVDFGAWK 740
           ADLYLGRISLQSVDFGAWK
Sbjct: 783 ADLYLGRISLQSVDFGAWK 801

BLAST of CsaV3_4G012020 vs. ExPASy TrEMBL
Match: A0A6J1CYJ7 (nuclear intron maturase 4, mitochondrial isoform X1 OS=Momordica charantia OX=3673 GN=LOC111015360 PE=4 SV=1)

HSP 1 Score: 1251.5 bits (3237), Expect = 0.0e+00
Identity = 623/742 (83.96%), Postives = 679/742 (91.51%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+ KLA NLASLVEESLDVD RR K++MELKRSLEI+IK+RVKAQY+NGKF+DLMG VI
Sbjct: 69  MEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLMGKVI 128

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACP TLQN YDC+RINSNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLI
Sbjct: 129 ACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKKEVLI 188

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPK+KLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYI+KEI +PDWWFTVD+
Sbjct: 189 LPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWFTVDI 248

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDEL MAKLI+VMEDKIEDP+ FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVLSPI
Sbjct: 249 SKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGVLSPI 308

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           L NIYLNLFDQEFFRLSMKYEAIN+YGN  QDGSQS+LRSWFRR+LKGN+S+Y  +EKD 
Sbjct: 309 LMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQEKDN 368

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSC-ETHGIRFL 360
           IRVYCCRYMDEIF+AVSGSKDVA SFRSEI  F+QK+LHLDVN +EEMVSC ET GIRFL
Sbjct: 369 IRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRGIRFL 428

Query: 361 GCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIK 420
           GCLVRRS +ESPAVK++HKLKEKVELF LQKQE WN WTVWLGKKWLAHGLKKVKESEIK
Sbjct: 429 GCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKESEIK 488

Query: 421 HLAKNS-SLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPF 480
           HLAKNS SLN+ISSFRK GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS + VE SLP 
Sbjct: 489 HLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEPSLPL 548

Query: 481 ELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLV 540
           ELRDSFYEFQR V+EY+SSETAST+ALLPNYDPS K TFITEIIAPVNSIRKRLLRYRL+
Sbjct: 549 ELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLRYRLI 608

Query: 541 TNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLA 600
           TNKG+PC+SPFLIL DNTQIIDWF+GV RR  +WY+N SNFSE+ LI DQVRKSCIRTLA
Sbjct: 609 TNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCIRTLA 668

Query: 601 AKHRIHESEIEKKFDSELSKIYSSSEIDQ-EKEKSTDTHVLDHDEALKYGISYSGLCLLS 660
           AKHR HESEIEKKFD ELS+I S+ EI+Q E+E+++DTH L HDEA  YGISYSGLCLLS
Sbjct: 669 AKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGLCLLS 728

Query: 661 FARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK 720
            ARMVSQSRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCK
Sbjct: 729 LARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRVGLCK 788

Query: 721 QHLADLYLGRISLQSVDFGAWK 740
           QHL DLYLG ISLQSV+FGAWK
Sbjct: 789 QHLKDLYLGHISLQSVNFGAWK 810

BLAST of CsaV3_4G012020 vs. ExPASy TrEMBL
Match: A0A6J1CX32 (nuclear intron maturase 4, mitochondrial isoform X3 OS=Momordica charantia OX=3673 GN=LOC111015360 PE=4 SV=1)

HSP 1 Score: 1251.5 bits (3237), Expect = 0.0e+00
Identity = 623/742 (83.96%), Postives = 679/742 (91.51%), Query Frame = 0

Query: 1   MERMKLAINLASLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVI 60
           ME+ KLA NLASLVEESLDVD RR K++MELKRSLEI+IK+RVKAQY+NGKF+DLMG VI
Sbjct: 61  MEKKKLAENLASLVEESLDVDSRRPKSRMELKRSLEIQIKKRVKAQYVNGKFMDLMGKVI 120

Query: 61  ACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLI 120
           ACP TLQN YDC+RINSNVDI SND LISFESMAEEL NG+FDVN NTFSI SS+KEVLI
Sbjct: 121 ACPPTLQNAYDCVRINSNVDIASNDHLISFESMAEELHNGSFDVNANTFSISSSKKEVLI 180

Query: 121 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 180
           LPK+KLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYI+KEI +PDWWFTVD+
Sbjct: 181 LPKLKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIRKEINNPDWWFTVDI 240

Query: 181 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPI 240
           SKKMDEL MAKLI+VMEDKIEDP+ FA+IRSI+ AGALNLEFGGFPKGHGLPQEGVLSPI
Sbjct: 241 SKKMDELEMAKLISVMEDKIEDPEFFAIIRSIFEAGALNLEFGGFPKGHGLPQEGVLSPI 300

Query: 241 LTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK 300
           L NIYLNLFDQEFFRLSMKYEAIN+YGN  QDGSQS+LRSWFRR+LKGN+S+Y  +EKD 
Sbjct: 301 LMNIYLNLFDQEFFRLSMKYEAINKYGNAVQDGSQSKLRSWFRRKLKGNDSEYPAQEKDN 360

Query: 301 IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSC-ETHGIRFL 360
           IRVYCCRYMDEIF+AVSGSKDVA SFRSEI  F+QK+LHLDVN +EEMVSC ET GIRFL
Sbjct: 361 IRVYCCRYMDEIFMAVSGSKDVALSFRSEIQDFIQKSLHLDVNHQEEMVSCRETRGIRFL 420

Query: 361 GCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIK 420
           GCLVRRS +ESPAVK++HKLKEKVELF LQKQE WN WTVWLGKKWLAHGLKKVKESEIK
Sbjct: 421 GCLVRRSEKESPAVKAVHKLKEKVELFALQKQEAWNDWTVWLGKKWLAHGLKKVKESEIK 480

Query: 421 HLAKNS-SLNKISSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPF 480
           HLAKNS SLN+ISSFRK GMETDHWYKVLLKIWMQD+NA+AAE+EE ILS + VE SLP 
Sbjct: 481 HLAKNSPSLNQISSFRKVGMETDHWYKVLLKIWMQDINAKAAETEETILSNYVVEPSLPL 540

Query: 481 ELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLV 540
           ELRDSFYEFQR V+EY+SSETAST+ALLPNYDPS K TFITEIIAPVNSIRKRLLRYRL+
Sbjct: 541 ELRDSFYEFQRRVEEYVSSETASTVALLPNYDPSVKSTFITEIIAPVNSIRKRLLRYRLI 600

Query: 541 TNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLA 600
           TNKG+PC+SPFLIL DNTQIIDWF+GV RR  +WY+N SNFSE+ LI DQVRKSCIRTLA
Sbjct: 601 TNKGYPCASPFLILHDNTQIIDWFLGVYRRWLKWYSNCSNFSEVILICDQVRKSCIRTLA 660

Query: 601 AKHRIHESEIEKKFDSELSKIYSSSEIDQ-EKEKSTDTHVLDHDEALKYGISYSGLCLLS 660
           AKHR HESEIEKKFD ELS+I S+ EI+Q E+E+++DTH L HDEA  YGISYSGLCLLS
Sbjct: 661 AKHRTHESEIEKKFDLELSRICSTPEIEQEEEEEASDTHGLGHDEASTYGISYSGLCLLS 720

Query: 661 FARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK 720
            ARMVSQSRPCNCFV+GCLA APSVYTLHVMERQKFPGWKTGFSSSIHPSLN+RR GLCK
Sbjct: 721 LARMVSQSRPCNCFVMGCLASAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNRRRVGLCK 780

Query: 721 QHLADLYLGRISLQSVDFGAWK 740
           QHL DLYLG ISLQSV+FGAWK
Sbjct: 781 QHLKDLYLGHISLQSVNFGAWK 802

BLAST of CsaV3_4G012020 vs. TAIR 10
Match: AT1G74350.1 (Intron maturase, type II family protein )

HSP 1 Score: 843.2 bits (2177), Expect = 1.6e-244
Identity = 439/743 (59.08%), Postives = 548/743 (73.76%), Query Frame = 0

Query: 6   LAINLASLVEESLD--VDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACP 65
           LA  LASLVEES     D  + +++MELKRSLE+R+K+RVK Q +NGKF DL+  VIA P
Sbjct: 11  LAGELASLVEESSSHVDDDSKPRSRMELKRSLELRLKKRVKEQCINGKFSDLLKKVIARP 70

Query: 66  NTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILS--SRKEVLIL 125
            TL++ YDCIR+NSNV I   +  ++F+S+AEELS+G FDV +NTFSI++    KEVL+L
Sbjct: 71  ETLRDAYDCIRLNSNVSITERNGSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVL 130

Query: 126 PKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLS 185
           P + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FT+ L+
Sbjct: 131 PSVALKVVQEAIRIVLEVVFSPHFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLN 190

Query: 186 KKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPIL 245
           KK+D  V   L++VME+K+ED  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +L
Sbjct: 191 KKLDVSVFENLLSVMEEKVEDSSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVL 250

Query: 246 TNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI 305
            NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ        + E+   +
Sbjct: 251 MNIYLDRFDHEFYRISMRHEALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVAL 310

Query: 306 RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCE-THGIRFLG 365
           RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  E +   CE T G+R LG
Sbjct: 311 RVYCCRFMDEIYFSVSGPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG 370

Query: 366 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 425
            LVR++V+ESP VK++HKLKEKV LF LQK+E W   TV +GKKWL HGLKKVKESEIK 
Sbjct: 371 TLVRKNVRESPTVKAVHKLKEKVRLFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKG 430

Query: 426 LA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPF 485
           LA  NS+L++IS  RK GMETDHWYK+LL+IWM+D L   A  SEE +LSKH VE ++P 
Sbjct: 431 LADSNSTLSQISCHRKAGMETDHWYKILLRIWMEDVLRTSADRSEEFVLSKHVVEPTVPQ 490

Query: 486 ELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLV 545
           ELRD+FY+FQ     Y+SSETA+  ALLP      +P F  +++AP N+I +RL RY L+
Sbjct: 491 ELRDAFYKFQNAAAAYVSSETANLEALLPCPQSHDRPVFFGDVVAPTNAIGRRLYRYGLI 550

Query: 546 TNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL 605
           T KG+  S+  LIL D  QIIDW+ G+ RR   WY   SNF E+  LI +Q+R SCIRTL
Sbjct: 551 TAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIKALIDNQIRMSCIRTL 610

Query: 606 AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLL 665
           AAK+RIHE+EIEK+ D ELS I S+ +I+QE + +  D+   D DE L YG+S SGLCLL
Sbjct: 611 AAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRDEHLTYGLSNSGLCLL 670

Query: 666 SFARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLC 725
           S AR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPGWKTGFS  I  SLN RR GLC
Sbjct: 671 SLARLVSESRPCNCFVIGCSMAAPAVYTLHAMERQKFPGWKTGFSVCIPSSLNGRRIGLC 730

Query: 726 KQHLADLYLGRISLQSVDFGAWK 740
           KQHL DLY+G+ISLQ+VDFGAW+
Sbjct: 731 KQHLKDLYIGQISLQAVDFGAWR 753

BLAST of CsaV3_4G012020 vs. TAIR 10
Match: AT5G04050.1 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 188.0 bits (476), Expect = 2.8e-47
Identity = 162/604 (26.82%), Postives = 273/604 (45.20%), Query Frame = 0

Query: 17  SLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDC 76
           SL ++  ++ T+  +K  LE  + +    QY +GKF  L+ N ++ P  L    QN+   
Sbjct: 31  SLFLNSDQTITEPLVKSELEALVLK----QYSHGKFYSLVKNAVSLPCVLLAACQNL--S 90

Query: 77  IRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQE 136
           +  NS+ D+     R  S E M  E+  G FD+ +     +SS    L+LP +KLKVL E
Sbjct: 91  LSANSSGDLADRVSRRFSIEEMGREIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIE 150

Query: 137 AIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMA 196
           AIR+VLE V+   F+  S+G R G G  TA++Y+K  +++P WWF V  +++M +E  + 
Sbjct: 151 AIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVD 210

Query: 197 KLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD 256
            L   + +KI D  L  +I+ ++  G L +E GG   G G PQE  L  IL N+Y +  D
Sbjct: 211 ILCGFVGEKINDVMLIEMIKKLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLD 270

Query: 257 QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMD 316
           +E   L +K +  N    TG + S   +  +F+                 + +Y  RY+D
Sbjct: 271 KEIQDLRLKMKVKNPRVGTGDEESTGNV--FFK----------------PVNIYAVRYLD 330

Query: 317 EIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMV-SCETHGIRFLGCL------- 376
           EI +  SGSK +    +  I   +++ L L V+R    + S  +  I FLG         
Sbjct: 331 EILVITSGSKMLTMDLKKRIVDILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPS 390

Query: 377 VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLA 436
           V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S      
Sbjct: 391 VLRPPKSEKAVRAMKKYQRQKDVRKLELRNARERNRKTLGLKIFRHVLKKIKQS------ 450

Query: 437 KNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKH 496
                   + F+  G E ++  + + + W    MQD      E        +    LS  
Sbjct: 451 --------NGFKFEG-EIENEVRDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLR 510

Query: 497 AVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAK---------------P 556
            +   LP +L D++ EFQ  V ++++   A    +L + +   +                
Sbjct: 511 HIREKLPQDLIDAYDEFQEQVDKHLAPTQAK--KVLEDEERRVEEEEEQRYAERTVEDLT 570

Query: 557 TFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNN 580
               ++ AP   +RK +       + G P     L+  +++ II W+ GV R+   ++  
Sbjct: 571 KLCMKVSAPEELVRKAIKLVGFTNSMGRPRPIIHLVTLEDSDIIKWYAGVGRKWLDFFCC 590

BLAST of CsaV3_4G012020 vs. TAIR 10
Match: AT5G04050.2 (RNA-directed DNA polymerase (reverse transcriptase) )

HSP 1 Score: 185.3 bits (469), Expect = 1.8e-46
Identity = 183/750 (24.40%), Postives = 316/750 (42.13%), Query Frame = 0

Query: 17  SLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTL----QNVYDC 76
           SL ++  ++ T+  +K  LE  + +    QY +GKF  L+ N ++ P  L    QN+   
Sbjct: 31  SLFLNSDQTITEPLVKSELEALVLK----QYSHGKFYSLVKNAVSLPCVLLAACQNL--S 90

Query: 77  IRINSNVDIKSN-DRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQE 136
           +  NS+ D+     R  S E M  E+  G FD+ +     +SS    L+LP +KLKVL E
Sbjct: 91  LSANSSGDLADRVSRRFSIEEMGREIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIE 150

Query: 137 AIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMA 196
           AIR+VLE V+   F+  S+G R G G  TA++Y+K  +++P WWF V  +++M +E  + 
Sbjct: 151 AIRMVLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVD 210

Query: 197 KLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD 256
            L   + +KI D  L  +I+ ++  G L +E GG   G G PQE  L  IL N+Y +  D
Sbjct: 211 ILCGFVGEKINDVMLIEMIKKLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLD 270

Query: 257 QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMD 316
           +E   L +K +  N    TG + S   +  +F+                 + +Y  RY+D
Sbjct: 271 KEIQDLRLKMKVKNPRVGTGDEESTGNV--FFK----------------PVNIYAVRYLD 330

Query: 317 EIFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMV-SCETHGIRFLGCL------- 376
           EI +  SGSK +    +  I   +++ L L V+R    + S  +  I FLG         
Sbjct: 331 EILVITSGSKMLTMDLKKRIVDILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPS 390

Query: 377 VRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLA 436
           V R  +   AV+++ K + + ++  L+ +         LG K   H LKK+K+S      
Sbjct: 391 VLRPPKSEKAVRAMKKYQRQKDVRKLELRNARERNRKTLGLKIFRHVLKKIKQS------ 450

Query: 437 KNSSLNKISSFRKPGMETDHWYKVLLKIW----MQDLNARAAE--------SEEKILSKH 496
                   + F+  G E ++  + + + W    MQD      E        +    LS  
Sbjct: 451 --------NGFKFEG-EIENEVRDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLR 510

Query: 497 AVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRK 556
            +   LP +L D++ EFQ  V ++++   A                              
Sbjct: 511 HIREKLPQDLIDAYDEFQEQVDKHLAPTQAKK---------------------------- 570

Query: 557 RLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVR 616
                               +L+D  + ++                  ++E  +  + + 
Sbjct: 571 --------------------VLEDEERRVE------------EEEEQRYAERTV--EDLT 630

Query: 617 KSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISY 676
           K C++  A +  + ++      D      + S   ++E +   D ++ D           
Sbjct: 631 KLCMKVSAPEELVRKAIKVSDLDGREEAHFPS---EREVKMMGDKNLSDPKPV------- 665

Query: 677 SGLCLLSFARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKF------PGWKTGFSSSI 735
            G   L   R+ S     +C    C      ++ +H+++ +          W  G   +I
Sbjct: 691 DGTLSLLLIRLASDEPLHHCAASFCERSDTIMHRVHLLQNRLHINPLDEEKWVPGM-GTI 665

BLAST of CsaV3_4G012020 vs. TAIR 10
Match: ATMG00520.1 (Intron maturase, type II family protein )

HSP 1 Score: 86.3 bits (212), Expect = 1.1e-16
Identity = 50/136 (36.76%), Postives = 80/136 (58.82%), Query Frame = 0

Query: 127 KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDE 186
           K+++EAIR+VLE ++ P F   SH  RSG+G  + L+ IK+E     W+   D+ K    
Sbjct: 15  KIMKEAIRMVLESIYDPEFPDTSH-FRSGQGCHSVLRRIKEEWGISRWFLEFDIRKCFHT 74

Query: 187 LVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKG-HGLPQEGVLSPILTNIY 246
           +   +LI +++++I+DPK F  I+ ++ AG L     G  +G + +P   +LS +  NIY
Sbjct: 75  IDRHRLIQILKEEIDDPKFFYSIQKVFSAGRL----VGVERGPYSVPHSVLLSALPGNIY 134

Query: 247 LNLFDQEFFRLSMKYE 262
           L+  DQE  R+  KYE
Sbjct: 135 LHKLDQEIGRIRQKYE 145

BLAST of CsaV3_4G012020 vs. TAIR 10
Match: ATCG00040.1 (maturase K )

HSP 1 Score: 45.8 bits (107), Expect = 1.7e-04
Identity = 28/101 (27.72%), Postives = 49/101 (48.51%), Query Frame = 0

Query: 524 PVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELF 583
           P++SI   L + +     GHP S        ++ I++ FV + R +  +Y+ SS    L+
Sbjct: 388 PISSIIGSLAKDKFCNVLGHPISKATWTDSSDSDILNRFVRICRNISHYYSGSSKKKNLY 447

Query: 584 LIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSE 625
            I   +R  C++TLA KH+       K+  S L + + + E
Sbjct: 448 RIKYILRLCCVKTLARKHKSTVRTFLKRLGSGLLEEFLTGE 488

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_011653460.10.0e+00100.00nuclear intron maturase 4, mitochondrial [Cucumis sativus][more]
KAE8649366.10.0e+00100.00hypothetical protein Csa_019152 [Cucumis sativus][more]
KAA0041778.10.0e+0095.26hypothetical protein E6C27_scaffold67G001360 [Cucumis melo var. makuwa] >TYK2706... [more]
XP_008442019.10.0e+0095.13PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo][more]
XP_038882003.10.0e+0092.43nuclear intron maturase 4, mitochondrial isoform X2 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
Q9CA782.3e-24359.08Nuclear intron maturase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT... [more]
Q9LZA51.4e-5125.45Nuclear intron maturase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT... [more]
P0A3U01.2e-2328.09Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris OX=13... [more]
P0A3U11.2e-2328.09Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (stra... [more]
P038768.8e-2225.21Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204... [more]
Match NameE-valueIdentityDescription
A0A0A0KWB00.0e+00100.00Reverse transcriptase domain-containing protein OS=Cucumis sativus OX=3659 GN=Cs... [more]
A0A5A7TFZ70.0e+0095.26Reverse transcriptase domain-containing protein OS=Cucumis melo var. makuwa OX=1... [more]
A0A1S3B4910.0e+0095.13uncharacterized protein LOC103486008 OS=Cucumis melo OX=3656 GN=LOC103486008 PE=... [more]
A0A6J1CYJ70.0e+0083.96nuclear intron maturase 4, mitochondrial isoform X1 OS=Momordica charantia OX=36... [more]
A0A6J1CX320.0e+0083.96nuclear intron maturase 4, mitochondrial isoform X3 OS=Momordica charantia OX=36... [more]
Match NameE-valueIdentityDescription
AT1G74350.11.6e-24459.08Intron maturase, type II family protein [more]
AT5G04050.12.8e-4726.82RNA-directed DNA polymerase (reverse transcriptase) [more]
AT5G04050.21.8e-4624.40RNA-directed DNA polymerase (reverse transcriptase) [more]
ATMG00520.11.1e-1636.76Intron maturase, type II family protein [more]
ATCG00040.11.7e-0427.72maturase K [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Chinese Long) v3
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR024937Domain XPFAMPF01348Intron_maturas2coord: 520..632
e-value: 1.6E-12
score: 47.5
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 120..361
e-value: 4.6E-14
score: 52.5
IPR000477Reverse transcriptase domainPROSITEPS50878RT_POLcoord: 1..363
score: 10.86402
NoneNo IPR availablePANTHERPTHR33642:SF3COX1/OXI3 INTRON 1 PROTEIN-RELATEDcoord: 2..733
NoneNo IPR availablePANTHERPTHR33642COX1/OXI3 INTRON 1 PROTEIN-RELATEDcoord: 2..733
NoneNo IPR availableCDDcd01651RT_G2_introncoord: 122..360
e-value: 4.12554E-41
score: 148.118
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 124..382

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsaV3_4G012020.1CsaV3_4G012020.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006397 mRNA processing