Cla97C10G189660 (gene) Watermelon (97103) v2

Name: Cla97C10G189660
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v2)
Description: Maturase-like protein
Location: Cla97Chr10 : 5964139 .. 5966442 (+)
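
As a rough illustration of how the Location field can be read, the sketch below splits it into sequence ID, start, end and strand, and computes the span length. It assumes the coordinates are 1-based and inclusive (the usual GFF-style convention, not stated on this page); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'SEQID : START .. END (STRAND)' into (seqid, start, end, strand)."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("Cla97Chr10 : 5964139 .. 5966442 (+)")
print(seqid, strand, span_length(start, end))
```

Under that assumption the feature spans 2304 bp on the plus strand, which is a multiple of three (768 codons, including a stop codon).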
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCATCCACTTCAGAAAAATAGCGACTCCATTGAAACCCAAGCTCTCAAATTCCCTCACCAAGCTTTACTCCCATCTGCCATTGAAATTGAAGCATTCGCCTCAGCTCAAGCTTCCGTCTCAACATTCTCCAGAAACCCTCACAAGGCCCCAACTGAAGGCTCTAGTTCTCAGCCATTTCTCCCGGGGCAAGTTCTTCGACCTTTTTCAAAATGTTGTCGCCTCCCCCTCTGTTCTTCTCACTGCCTCCCAAAACCTCATCACTCCGCCACTCAGTAATGCCCCCGATTCCTTACCTGTTTTCGATTTGGTCTCCAATTGCTTTTCGGTCGAGGTCATGGCTCGGGAGCTCTCCGAAAATCGTTTCGATGTTGGAGCTTGCTGTGTTCGGATGGTACCATCGGAAGAGAAAGGTGAGTCTCTGGTCTTACCGAATATGAAATTGAAGGTCTTAATCGAGGCTATTAGGATGGTGTTGGAAATTGTTTATGATGAACGATTTGTAACGTTCTCTTATGGTGGGCGTGTCGGTATGGGGCGACACACTGCGATTAGATACCTCAAAAACTCAGTGCAAAACCCTAGTTGGTGGTTTACTGTTGCATTTCGTCACAAAAAGTTCGAATCTGTACATGTAAATACGTTGTGCTTATTGATGCAAGAGAAAATTAAGGATGATATTTTGATTTATATGTTAAGGAAGCTGTTTGAACTGGAAGCAATTCAAATTGAATTGGGTGCTTGTTATTTAGGAAGGGGTTTCCCTCAGGAAAGTAGTTTGTGTTCTATCTTGATTAATATATACTTCAATGGCTTTGATAAAGAAATTCAACAAATACGTCTCCAAAAAAATGAAGAAAATCCTAAGTTCAATCTGGACGAGATTGTTTCTTTTCATAATCCGGTGAAAATATATGCTGTTAGATATCTAGATGAGATATTAGTTATAACATCAGGGTCAAAGATGCTAACAATGGAGTTGAAAAGCCAGGTGCTAAAGTACTTAGAAGGGAATTTAGAATTGGAAGTGGATCGAATGAATACTGCAATTCATAGTGCTGTCTCAGAGAAAATTAGTTTCTTAGGAATGGAACTACAGGCGGTACCACCTTCAGTTCTGCATCCACCAATGTCGGAGAAGGCAATTAGGGCAAGGAAGAAGTACCTTAGACAGAAGGAAGTTAGAGCGATAGAATTGAGAAATGCCCGTGAGAGAAACAGGAAAAAATTGGGATTAAAAATATTGAGTCACGTATTCAAGAAATTGAAGCAAACTAGTGGATTTAAATCTGAATTCCAAATCGAGAAGGAAGTTAGAGAAATCTTCAGAAACTGGGCCGATGAAGTGGTGCAAGATTTCTTTGAGTCTTCTGAAGATCATGCAGAGTGGCACCGTGTGCTGTCAGCAGGTGATTTCCTCTCTTTAAAACACATAAGAAATCAATTGCCAGAAGATCTTGTGAATGCTTATGATAGGTTTCAAGATCAGATAAACAAGCACTTGAATCCAGTTAAGGCTAAAAAGGAAAAGGCTCGGGAGGATGAAGAGAAAAGAGTGGAGGAAGAAGAACTATATGCTAAAAGAACAGTTGAGGACTTAACAAGGCTATGCATCAAAGTCGATGCTCCTATAGAGCTTGTTAGGAAGGCAGTCAAGATGGTTGGGTTTACAAATAAAATGGGTCGTCCTCAGCCAATCAGCTCACTCATTGCTCTTGAAGACACTGATATTATCAAGTGGTATTCTGGTGTAGGAAGACGGTGGCTAGACTTCTTCTGCTGTTGTCATAACTACAAGATGGTCAAAACTGTTGTAACTTACCACTTAAGGTTTTCTTGTATTTTGACATTGGCAGAAAAGCATGAATCAACCAAACGGGAAGCCATGAAACATTACAGTAAGGATTTGAAAGTCGTTGATCTGAATGGCAATGAAGAAATACACTTCCCAACAGAAAGAGAAGTTAAGATGTTGGGAGAAAGAAATCTTGTAGACCC
ATACCCTGTGGATGGGGCTTTATCTTTGCTTGTGATTAGGTTAGTCACTGATGAAGCTTCATATCCTTGTATTGCTCATTTTTGCAATAGAACAGACTCTATATTATACCGTGTTCGATTACTGCAAAGGACTCTGAATGTCAATCCATTTAATGGAGTGGAATGGGTTAGAGGGATGGGAGTGATTCATGAAAGTTTAAATCAGAGATGCCTCCCTCTTTGTGCTGATCACATTAGTGCCTTATACATGGGGAAAATCAACCTTCAAGACTTGGACTGCACCTTATCATTGGATATGGACTGA

mRNA sequence

ATGTTCATCCACTTCAGAAAAATAGCGACTCCATTGAAACCCAAGCTCTCAAATTCCCTCACCAAGCTTTACTCCCATCTGCCATTGAAATTGAAGCATTCGCCTCAGCTCAAGCTTCCGTCTCAACATTCTCCAGAAACCCTCACAAGGCCCCAACTGAAGGCTCTAGTTCTCAGCCATTTCTCCCGGGGCAAGTTCTTCGACCTTTTTCAAAATGTTGTCGCCTCCCCCTCTGTTCTTCTCACTGCCTCCCAAAACCTCATCACTCCGCCACTCAGTAATGCCCCCGATTCCTTACCTGTTTTCGATTTGGTCTCCAATTGCTTTTCGGTCGAGGTCATGGCTCGGGAGCTCTCCGAAAATCGTTTCGATGTTGGAGCTTGCTGTGTTCGGATGGTACCATCGGAAGAGAAAGGTGAGTCTCTGGTCTTACCGAATATGAAATTGAAGGTCTTAATCGAGGCTATTAGGATGGTGTTGGAAATTGTTTATGATGAACGATTTGTAACGTTCTCTTATGGTGGGCGTGTCGGTATGGGGCGACACACTGCGATTAGATACCTCAAAAACTCAGTGCAAAACCCTAGTTGGTGGTTTACTGTTGCATTTCGTCACAAAAAGTTCGAATCTGTACATGTAAATACGTTGTGCTTATTGATGCAAGAGAAAATTAAGGATGATATTTTGATTTATATGTTAAGGAAGCTGTTTGAACTGGAAGCAATTCAAATTGAATTGGGTGCTTGTTATTTAGGAAGGGGTTTCCCTCAGGAAAGTAGTTTGTGTTCTATCTTGATTAATATATACTTCAATGGCTTTGATAAAGAAATTCAACAAATACGTCTCCAAAAAAATGAAGAAAATCCTAAGTTCAATCTGGACGAGATTGTTTCTTTTCATAATCCGGTGAAAATATATGCTGTTAGATATCTAGATGAGATATTAGTTATAACATCAGGGTCAAAGATGCTAACAATGGAGTTGAAAAGCCAGGTGCTAAAGTACTTAGAAGGGAATTTAGAATTGGAAGTGGATCGAATGAATACTGCAATTCATAGTGCTGTCTCAGAGAAAATTAGTTTCTTAGGAATGGAACTACAGGCGGTACCACCTTCAGTTCTGCATCCACCAATGTCGGAGAAGGCAATTAGGGCAAGGAAGAAGTACCTTAGACAGAAGGAAGTTAGAGCGATAGAATTGAGAAATGCCCGTGAGAGAAACAGGAAAAAATTGGGATTAAAAATATTGAGTCACGTATTCAAGAAATTGAAGCAAACTAGTGGATTTAAATCTGAATTCCAAATCGAGAAGGAAGTTAGAGAAATCTTCAGAAACTGGGCCGATGAAGTGGTGCAAGATTTCTTTGAGTCTTCTGAAGATCATGCAGAGTGGCACCGTGTGCTGTCAGCAGGTGATTTCCTCTCTTTAAAACACATAAGAAATCAATTGCCAGAAGATCTTGTGAATGCTTATGATAGGTTTCAAGATCAGATAAACAAGCACTTGAATCCAGTTAAGGCTAAAAAGGAAAAGGCTCGGGAGGATGAAGAGAAAAGAGTGGAGGAAGAAGAACTATATGCTAAAAGAACAGTTGAGGACTTAACAAGGCTATGCATCAAAGTCGATGCTCCTATAGAGCTTGTTAGGAAGGCAGTCAAGATGGTTGGGTTTACAAATAAAATGGGTCGTCCTCAGCCAATCAGCTCACTCATTGCTCTTGAAGACACTGATATTATCAAGTGGTATTCTGGTGTAGGAAGACGGTGGCTAGACTTCTTCTGCTGTTGTCATAACTACAAGATGGTCAAAACTGTTGTAACTTACCACTTAAGGTTTTCTTGTATTTTGACATTGGCAGAAAAGCATGAATCAACCAAACGGGAAGCCATGAAACATTACAGTAAGGATTTGAAAGTCGTTGATCTGAATGGCAATGAAGAAATACACTTCCCAACAGAAAGAGAAGTTAAGATGTTGGGAGAAAGAAATCTTGTAGACCC
ATACCCTGTGGATGGGGCTTTATCTTTGCTTGTGATTAGGTTAGTCACTGATGAAGCTTCATATCCTTGTATTGCTCATTTTTGCAATAGAACAGACTCTATATTATACCGTGTTCGATTACTGCAAAGGACTCTGAATGTCAATCCATTTAATGGAGTGGAATGGGTTAGAGGGATGGGAGTGATTCATGAAAGTTTAAATCAGAGATGCCTCCCTCTTTGTGCTGATCACATTAGTGCCTTATACATGGGGAAAATCAACCTTCAAGACTTGGACTGCACCTTATCATTGGATATGGACTGA

Coding sequence (CDS)

ATGTTCATCCACTTCAGAAAAATAGCGACTCCATTGAAACCCAAGCTCTCAAATTCCCTCACCAAGCTTTACTCCCATCTGCCATTGAAATTGAAGCATTCGCCTCAGCTCAAGCTTCCGTCTCAACATTCTCCAGAAACCCTCACAAGGCCCCAACTGAAGGCTCTAGTTCTCAGCCATTTCTCCCGGGGCAAGTTCTTCGACCTTTTTCAAAATGTTGTCGCCTCCCCCTCTGTTCTTCTCACTGCCTCCCAAAACCTCATCACTCCGCCACTCAGTAATGCCCCCGATTCCTTACCTGTTTTCGATTTGGTCTCCAATTGCTTTTCGGTCGAGGTCATGGCTCGGGAGCTCTCCGAAAATCGTTTCGATGTTGGAGCTTGCTGTGTTCGGATGGTACCATCGGAAGAGAAAGGTGAGTCTCTGGTCTTACCGAATATGAAATTGAAGGTCTTAATCGAGGCTATTAGGATGGTGTTGGAAATTGTTTATGATGAACGATTTGTAACGTTCTCTTATGGTGGGCGTGTCGGTATGGGGCGACACACTGCGATTAGATACCTCAAAAACTCAGTGCAAAACCCTAGTTGGTGGTTTACTGTTGCATTTCGTCACAAAAAGTTCGAATCTGTACATGTAAATACGTTGTGCTTATTGATGCAAGAGAAAATTAAGGATGATATTTTGATTTATATGTTAAGGAAGCTGTTTGAACTGGAAGCAATTCAAATTGAATTGGGTGCTTGTTATTTAGGAAGGGGTTTCCCTCAGGAAAGTAGTTTGTGTTCTATCTTGATTAATATATACTTCAATGGCTTTGATAAAGAAATTCAACAAATACGTCTCCAAAAAAATGAAGAAAATCCTAAGTTCAATCTGGACGAGATTGTTTCTTTTCATAATCCGGTGAAAATATATGCTGTTAGATATCTAGATGAGATATTAGTTATAACATCAGGGTCAAAGATGCTAACAATGGAGTTGAAAAGCCAGGTGCTAAAGTACTTAGAAGGGAATTTAGAATTGGAAGTGGATCGAATGAATACTGCAATTCATAGTGCTGTCTCAGAGAAAATTAGTTTCTTAGGAATGGAACTACAGGCGGTACCACCTTCAGTTCTGCATCCACCAATGTCGGAGAAGGCAATTAGGGCAAGGAAGAAGTACCTTAGACAGAAGGAAGTTAGAGCGATAGAATTGAGAAATGCCCGTGAGAGAAACAGGAAAAAATTGGGATTAAAAATATTGAGTCACGTATTCAAGAAATTGAAGCAAACTAGTGGATTTAAATCTGAATTCCAAATCGAGAAGGAAGTTAGAGAAATCTTCAGAAACTGGGCCGATGAAGTGGTGCAAGATTTCTTTGAGTCTTCTGAAGATCATGCAGAGTGGCACCGTGTGCTGTCAGCAGGTGATTTCCTCTCTTTAAAACACATAAGAAATCAATTGCCAGAAGATCTTGTGAATGCTTATGATAGGTTTCAAGATCAGATAAACAAGCACTTGAATCCAGTTAAGGCTAAAAAGGAAAAGGCTCGGGAGGATGAAGAGAAAAGAGTGGAGGAAGAAGAACTATATGCTAAAAGAACAGTTGAGGACTTAACAAGGCTATGCATCAAAGTCGATGCTCCTATAGAGCTTGTTAGGAAGGCAGTCAAGATGGTTGGGTTTACAAATAAAATGGGTCGTCCTCAGCCAATCAGCTCACTCATTGCTCTTGAAGACACTGATATTATCAAGTGGTATTCTGGTGTAGGAAGACGGTGGCTAGACTTCTTCTGCTGTTGTCATAACTACAAGATGGTCAAAACTGTTGTAACTTACCACTTAAGGTTTTCTTGTATTTTGACATTGGCAGAAAAGCATGAATCAACCAAACGGGAAGCCATGAAACATTACAGTAAGGATTTGAAAGTCGTTGATCTGAATGGCAATGAAGAAATACACTTCCCAACAGAAAGAGAAGTTAAGATGTTGGGAGAAAGAAATCTTGTAGACCC
ATACCCTGTGGATGGGGCTTTATCTTTGCTTGTGATTAGGTTAGTCACTGATGAAGCTTCATATCCTTGTATTGCTCATTTTTGCAATAGAACAGACTCTATATTATACCGTGTTCGATTACTGCAAAGGACTCTGAATGTCAATCCATTTAATGGAGTGGAATGGGTTAGAGGGATGGGAGTGATTCATGAAAGTTTAAATCAGAGATGCCTCCCTCTTTGTGCTGATCACATTAGTGCCTTATACATGGGGAAAATCAACCTTCAAGACTTGGACTGCACCTTATCATTGGATATGGACTGA

Protein sequence

MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPETLTRPQLKALVLSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVSNCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIVSFHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD
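
The relationship between the coding sequence and the protein sequence above can be sketched as a plain codon-by-codon translation. This is a minimal illustration that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, and whitespace is stripped first because the sequence is wrapped over more than one line on this page.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last codons of the CDS listed above.
print(translate("ATGTTCATCCACTTCAGAAAA"))
print(translate("GATATGGACTGA"))
```

The two fragments give `MFIHFRK` and `DMD`, matching the start and end of the protein sequence listed above.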
BLAST of Cla97C10G189660 vs. NCBI nr
Match: XP_011650528.1 (PREDICTED: uncharacterized protein LOC101217546 [Cucumis sativus])

HSP 1 Score: 1420.6 bits (3676), Expect = 0.0e+00
Identity = 715/767 (93.22%), Positives = 736/767 (95.96%), Query Frame = 0

Query: 1   MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPETLTRPQLKALVLSH 60
           MFIHFRKIAT LKPK SNSL KLYSHLPLKLKHSP+L LPSQHSPETLT  QLKALVLS 
Sbjct: 1   MFIHFRKIATSLKPKFSNSLNKLYSHLPLKLKHSPELNLPSQHSPETLTSSQLKALVLSR 60

Query: 61  FSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVSNCFSVEVMARELSE 120
           FS GKF DLFQNVVASPSVLLTASQNLITPP SNAPDSLP+FDLVS CFSVEVMARELSE
Sbjct: 61  FSHGKFVDLFQNVVASPSVLLTASQNLITPPFSNAPDSLPLFDLVSKCFSVEVMARELSE 120

Query: 121 NRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG 180
           NRFDVGACCV M P EEKGESL+LPN+KLKVLIEAIRMV+EIVYDERFVTFSYGGRVGMG
Sbjct: 121 NRFDVGACCVPMAPLEEKGESLLLPNLKLKVLIEAIRMVMEIVYDERFVTFSYGGRVGMG 180

Query: 181 RHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFELE 240
           RHTAIRYLKNSVQNPSWWFTVAFR KKFESVHVNTLCLLMQEKIKDDILIYMLRKLFE+E
Sbjct: 181 RHTAIRYLKNSVQNPSWWFTVAFRRKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFEVE 240

Query: 241 AIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIVSFH 300
           AIQIELG CYLGRGFPQES LCSIL+NIYFNGFDKEIQ+IRLQK EENPKFNLDEIVSFH
Sbjct: 241 AIQIELGGCYLGRGFPQESGLCSILLNIYFNGFDKEIQRIRLQKIEENPKFNLDEIVSFH 300

Query: 301 NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKIS 360
           NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKI 
Sbjct: 301 NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKIG 360

Query: 361 FLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVF 420
           FLGMEL+AVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVF
Sbjct: 361 FLGMELRAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVF 420

Query: 421 KKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR 480
           KK K+T GFKSEFQIEKEVR IFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR
Sbjct: 421 KKFKRTGGFKSEFQIEKEVRSIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR 480

Query: 481 NQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIK 540
           NQLPEDLVNAYDRFQDQ+NKHLNPVK KKEKAREDEEKR+EEEELYAKRTV+DLTRLCIK
Sbjct: 481 NQLPEDLVNAYDRFQDQVNKHLNPVKFKKEKAREDEEKRLEEEELYAKRTVDDLTRLCIK 540

Query: 541 VDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYK 600
           VDAPIELVRKAV+MVGFTN MGRP+PISSLI LED DIIKWYSGVGRRWLDFFCCCHNYK
Sbjct: 541 VDAPIELVRKAVRMVGFTNNMGRPRPISSLIVLEDADIIKWYSGVGRRWLDFFCCCHNYK 600

Query: 601 MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLG 660
           MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKV DLNGNEE+HFPTE+EVKMLG
Sbjct: 601 MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGNEEMHFPTEKEVKMLG 660

Query: 661 ERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGV 720
           ERNL DPYPVDGALSLL+IRL TDEASYPCIA+FCNRT+SILYRVRLLQRTLNVNP +GV
Sbjct: 661 ERNLADPYPVDGALSLLLIRLATDEASYPCIANFCNRTNSILYRVRLLQRTLNVNPSDGV 720

Query: 721 EWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           EWVRGMGVIHESLNQRCLPLCADHIS LYMGKINLQDLDCTLSLDMD
Sbjct: 721 EWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 767
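
The percentages in each HSP header are the matched positions divided by the alignment length. The sketch below recomputes them for the hit above; `hsp_percent` is a hypothetical helper name, and rounding to two decimals is an assumption chosen to match the figures shown on this page.

```python
def hsp_percent(matches, alignment_length):
    """Share of aligned positions that match, as a percentage (2 decimals)."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * matches / alignment_length, 2)

# Counts taken from the HSP header of the Cucumis sativus hit.
print(hsp_percent(715, 767))  # identical residues
print(hsp_percent(736, 767))  # identical plus conservative substitutions
```

The same calculation applies to the other hits, where gaps in the alignment make the denominator longer than the query protein.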

BLAST of Cla97C10G189660 vs. NCBI nr
Match: XP_008463418.1 (PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903029.1 PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903030.1 PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903031.1 PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903032.1 PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903033.1 PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903034.1 PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo])

HSP 1 Score: 1394.4 bits (3608), Expect = 0.0e+00
Identity = 707/767 (92.18%), Positives = 728/767 (94.92%), Query Frame = 0

Query: 1   MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPETLTRPQLKALVLSH 60
           MFIHFRKIAT LKPK SNSL KLYSHLPL+ K    L LPSQHSPE+LT  QLKALVLS 
Sbjct: 1   MFIHFRKIATSLKPKFSNSLNKLYSHLPLRKK---VLNLPSQHSPESLTSSQLKALVLSR 60

Query: 61  FSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVSNCFSVEVMARELSE 120
           FS GKF DLFQNVVASPSVLLTAS+NLITPP SNAPDSLP+ DLVS CFSVEVMARELSE
Sbjct: 61  FSHGKFVDLFQNVVASPSVLLTASRNLITPPFSNAPDSLPLSDLVSKCFSVEVMARELSE 120

Query: 121 NRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG 180
           NRFDVGACCV M P EEKGESLVLPN+KLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG
Sbjct: 121 NRFDVGACCVPMAPLEEKGESLVLPNLKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG 180

Query: 181 RHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFELE 240
           RHTAIRYLKNSVQNPSWWFTVAFR KKFESVHVNTLCLLMQEKIKDDILIYMLRKLFE+E
Sbjct: 181 RHTAIRYLKNSVQNPSWWFTVAFRRKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFEVE 240

Query: 241 AIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIVSFH 300
           AIQIELG CYLGRGFPQES LCSIL+NIYFNGFDKEIQ+IRLQKNEENPKFNLDEIVSFH
Sbjct: 241 AIQIELGGCYLGRGFPQESGLCSILLNIYFNGFDKEIQRIRLQKNEENPKFNLDEIVSFH 300

Query: 301 NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKIS 360
           NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKI 
Sbjct: 301 NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKIG 360

Query: 361 FLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVF 420
           FLGMELQAV PSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSH+F
Sbjct: 361 FLGMELQAVTPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHMF 420

Query: 421 KKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR 480
           KK K+T GFKSEFQIE EVR IFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR
Sbjct: 421 KKFKRTGGFKSEFQIETEVRSIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR 480

Query: 481 NQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIK 540
           NQLPEDLVNAYDRFQ Q+NKHLNPVK KKEKAREDEEKR+EEEELYAKRTVEDLTRLCIK
Sbjct: 481 NQLPEDLVNAYDRFQYQVNKHLNPVKVKKEKAREDEEKRLEEEELYAKRTVEDLTRLCIK 540

Query: 541 VDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYK 600
           VDAPIELVRKAV+MVGFTNKMGRP+PISSLIALED DIIKWYSGVGRRWLDFFCCCHNYK
Sbjct: 541 VDAPIELVRKAVRMVGFTNKMGRPRPISSLIALEDADIIKWYSGVGRRWLDFFCCCHNYK 600

Query: 601 MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLG 660
           MVKTVVTYHLRFSCILTLAEKHESTKREAMKHY KDLKV DLNGNEE+HFPTE+ VKMLG
Sbjct: 601 MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYGKDLKVFDLNGNEEMHFPTEKAVKMLG 660

Query: 661 ERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGV 720
           ERNL DPYPVDGALSLL+IRL TDEASYPCIA+FCNRT+SILYRVRLLQRTLNVNP +GV
Sbjct: 661 ERNLADPYPVDGALSLLLIRLATDEASYPCIANFCNRTNSILYRVRLLQRTLNVNPSDGV 720

Query: 721 EWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           EWV+GMGVIHESLNQRCLPLCADHIS LYMGKINLQDLDCTLSLDMD
Sbjct: 721 EWVKGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 764

BLAST of Cla97C10G189660 vs. NCBI nr
Match: XP_022976163.1 (nuclear intron maturase 3, mitochondrial [Cucurbita maxima])

HSP 1 Score: 1343.9 bits (3477), Expect = 0.0e+00
Identity = 684/770 (88.83%), Positives = 717/770 (93.12%), Query Frame = 0

Query: 1   MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPETLTRPQLKALVLSH 60
           M IH RKIATPLKPKLS+SL KLYS  P  LKHSP LKLP QHS +TLTRPQL+ALVLS 
Sbjct: 1   MLIHLRKIATPLKPKLSSSLNKLYSDRP--LKHSPLLKLPFQHSVQTLTRPQLEALVLSR 60

Query: 61  FSRGKFFDLFQNVVASPSVLLTASQNLITPPLS---NAPDSLPVFDLVSNCFSVEVMARE 120
           FS+GKFFDL QNVVASPSVL TASQNLITP  S   NAP+SL   D+VS+CFSVE MARE
Sbjct: 61  FSQGKFFDLLQNVVASPSVLFTASQNLITPLPSNRLNAPESLLSLDMVSSCFSVEDMARE 120

Query: 121 LSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRV 180
           L ENRFDVGACCVR+  SEEKGE LVLPN+KLKVL+EA++MVLEIVYDERFVTFSYGGRV
Sbjct: 121 LYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAMKMVLEIVYDERFVTFSYGGRV 180

Query: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLF 240
           GMGRHTAIRYLKNSVQNPSWWFTVAFR +KF+SVHVN LCLL+QEKIKDDILI ML+KLF
Sbjct: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLF 240

Query: 241 ELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIV 300
           ELEA+QIELG CYLGRGFPQES LCSIL NIYFNGFDKEIQQIRLQKNEENPKF+L+  V
Sbjct: 241 ELEAVQIELGGCYLGRGFPQESGLCSILSNIYFNGFDKEIQQIRLQKNEENPKFSLNGTV 300

Query: 301 SFHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSE 360
           SFHNPVKIYAVRYLDEILVITSGSKM  MELKSQVL+YLEGNLELEVDRMNTAIHSAVSE
Sbjct: 301 SFHNPVKIYAVRYLDEILVITSGSKMQIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSE 360

Query: 361 KISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS 420
           KISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS
Sbjct: 361 KISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS 420

Query: 421 HVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLK 480
           HVFKKLK+T G KSEFQIEKEV EIFRNWADEVV+DFFES ED+ EWHR LSAGDFLSLK
Sbjct: 421 HVFKKLKRTDGLKSEFQIEKEVTEIFRNWADEVVRDFFESLEDNTEWHRPLSAGDFLSLK 480

Query: 481 HIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRL 540
           HIRNQLP DLVNAYDRFQDQ+NKHLNP+KAKKEKAREDEEKR+ EEE YAKRTVEDLTRL
Sbjct: 481 HIRNQLPVDLVNAYDRFQDQVNKHLNPLKAKKEKAREDEEKRLVEEERYAKRTVEDLTRL 540

Query: 541 CIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCH 600
           CIKV+APIELVRKAVKM+GFTNKMGRPQPISSLIALEDTDIIKWY+GVGRRWLDFFCCCH
Sbjct: 541 CIKVEAPIELVRKAVKMIGFTNKMGRPQPISSLIALEDTDIIKWYAGVGRRWLDFFCCCH 600

Query: 601 NYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVK 660
           NYKMVKT+VTYHLRFSCILTLAEKHESTKREAMKHYSKDLKV DLNG EE+HFPTEREVK
Sbjct: 601 NYKMVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVK 660

Query: 661 MLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPF 720
           MLGERNL DPYPVDGA SL +IRLVTDE SYPCIAHFCNRTDSILYRVRLLQ+TLNVNP 
Sbjct: 661 MLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPS 720

Query: 721 NGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           NGVEWVRGMGVIHESLNQRCLPLCADHIS LYMGKINLQDLDCTLSLDMD
Sbjct: 721 NGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 768

BLAST of Cla97C10G189660 vs. NCBI nr
Match: XP_022942578.1 (nuclear intron maturase 3, mitochondrial [Cucurbita moschata])

HSP 1 Score: 1337.0 bits (3459), Expect = 0.0e+00
Identity = 681/770 (88.44%), Positives = 715/770 (92.86%), Query Frame = 0

Query: 1   MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPETLTRPQLKALVLSH 60
           M IH RKIATPLKPKLS+S  KLYS  P  LKHSP LKLP QHS +TLTRPQL+ALVLS 
Sbjct: 1   MLIHLRKIATPLKPKLSSSFNKLYSDRP--LKHSPLLKLPFQHSVQTLTRPQLEALVLSR 60

Query: 61  FSRGKFFDLFQNVVASPSVLLTASQNLITPPLS---NAPDSLPVFDLVSNCFSVEVMARE 120
           FS+GKFFDL QNVVASPSVL TASQNLITP  S   NAPDSL   D+VS+CFSVE MARE
Sbjct: 61  FSQGKFFDLLQNVVASPSVLFTASQNLITPLPSNRLNAPDSLLSLDMVSSCFSVEDMARE 120

Query: 121 LSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRV 180
           L ENRFDVGACCVR+  SEEKGE LVLPN+KLKVL+EAIR+VLEIVYDERFVTFSYGGRV
Sbjct: 121 LYENRFDVGACCVRLESSEEKGEFLVLPNLKLKVLLEAIRIVLEIVYDERFVTFSYGGRV 180

Query: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLF 240
           GMGRHTAIRYLKNSVQNPSWWFTVAFR +KF+SVHVN LCLL+QEKIKDDILI ML+KLF
Sbjct: 181 GMGRHTAIRYLKNSVQNPSWWFTVAFRRRKFDSVHVNKLCLLVQEKIKDDILILMLKKLF 240

Query: 241 ELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIV 300
           ELEA+QIELG CYLGRG PQES LCSIL NIYFNGFDKEIQQIRL+KNEENPKF+LD  V
Sbjct: 241 ELEAVQIELGGCYLGRGLPQESGLCSILSNIYFNGFDKEIQQIRLEKNEENPKFSLDGTV 300

Query: 301 SFHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSE 360
           SFHNPVKIYAVRYLDEILVITSGSKML MELKSQVL+YLEGNLELEVDRMNTAIHSAVSE
Sbjct: 301 SFHNPVKIYAVRYLDEILVITSGSKMLIMELKSQVLRYLEGNLELEVDRMNTAIHSAVSE 360

Query: 361 KISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS 420
           KISFLGMELQAV PSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS
Sbjct: 361 KISFLGMELQAVLPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILS 420

Query: 421 HVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLK 480
           HVFKKLK+T G KSEFQIEKEV EIFRNWADEVV+DFFES ED+ EWHR LSAGDFLSLK
Sbjct: 421 HVFKKLKRTDGLKSEFQIEKEVTEIFRNWADEVVRDFFESLEDNTEWHRPLSAGDFLSLK 480

Query: 481 HIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRL 540
           HIRNQLP DLVNAYDRFQDQ+N+HLNP+KAK+EKAREDEEKR+ EEE YAKRTVEDLTRL
Sbjct: 481 HIRNQLPVDLVNAYDRFQDQVNQHLNPLKAKEEKAREDEEKRLGEEERYAKRTVEDLTRL 540

Query: 541 CIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCH 600
           CIKV+APIELVRKAVKM+GFTNKMGRP+PISSLIALEDTDIIKWY+GVGRRWLDFFCCCH
Sbjct: 541 CIKVEAPIELVRKAVKMIGFTNKMGRPRPISSLIALEDTDIIKWYAGVGRRWLDFFCCCH 600

Query: 601 NYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVK 660
           NYKMVKT+VTYHLRFSCILTLAEKHESTKREAMKHYSKDLKV DLNG EE+HFPTEREVK
Sbjct: 601 NYKMVKTIVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVFDLNGKEEMHFPTEREVK 660

Query: 661 MLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPF 720
           MLGERNL DPYPVDGA SL +IRLVTDE SYPCIAHFCNRTDSILYRVRLLQ+TLNVNP 
Sbjct: 661 MLGERNLADPYPVDGAFSLFLIRLVTDEDSYPCIAHFCNRTDSILYRVRLLQKTLNVNPS 720

Query: 721 NGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           NGVEWVRGMGVIHESLNQRCLPLCADHIS LYMGKINLQDLDCTLSLDMD
Sbjct: 721 NGVEWVRGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 768

BLAST of Cla97C10G189660 vs. NCBI nr
Match: XP_022155823.1 (nuclear intron maturase 3, mitochondrial [Momordica charantia] >XP_022155825.1 nuclear intron maturase 3, mitochondrial [Momordica charantia])

HSP 1 Score: 1305.4 bits (3377), Expect = 0.0e+00
Identity = 670/772 (86.79%), Positives = 704/772 (91.19%), Query Frame = 0

Query: 1   MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQL-KLPSQHSPETLTRPQLKALVLS 60
           MFI  RKIATPLK  LS S  KLYS LP  LKHSP L   P QHS ETLT P+LKALVLS
Sbjct: 1   MFIPLRKIATPLKSILSTSFNKLYSDLP--LKHSPPLHNFPFQHSTETLTWPELKALVLS 60

Query: 61  HFSRGKFFDLFQNVVASPSVLLTASQNLITPPLS----NAPDSLPVFDLVSNCFSVEVMA 120
            F+ GKF DL QNVVASPSVLLTASQNLITPP      NAPD LP+ DLVS CFSVE MA
Sbjct: 61  RFTHGKFLDLLQNVVASPSVLLTASQNLITPPPPSNGLNAPDPLPILDLVSKCFSVEEMA 120

Query: 121 RELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGG 180
           REL E+RFDVGACCVRM   ++KG+SLVLPN+KLKVLIEAIRMVLEIVYDERFVTFSYGG
Sbjct: 121 RELYEDRFDVGACCVRM---DQKGQSLVLPNLKLKVLIEAIRMVLEIVYDERFVTFSYGG 180

Query: 181 RVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRK 240
           RVGMGRHTAIRYLKNSVQNPSWWFTVAFR KKF+SVHV+ LCLL++EKIKD +LI ML+K
Sbjct: 181 RVGMGRHTAIRYLKNSVQNPSWWFTVAFRRKKFDSVHVHKLCLLVEEKIKDHLLICMLKK 240

Query: 241 LFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDE 300
           LFELEAIQIELGACYLGRGFPQES LCSIL NI+FNGFDK+IQQIRLQKNEENPKF+LDE
Sbjct: 241 LFELEAIQIELGACYLGRGFPQESGLCSILCNIFFNGFDKDIQQIRLQKNEENPKFSLDE 300

Query: 301 IVSFHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAV 360
           IVSFH+PVKIYAVRYLDEILVITSGSKMLTM+LKSQVLKYLEGNLELEVDRMNTAIHSAV
Sbjct: 301 IVSFHSPVKIYAVRYLDEILVITSGSKMLTMDLKSQVLKYLEGNLELEVDRMNTAIHSAV 360

Query: 361 SEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKI 420
           SEKISFLGMELQAVPPSVLHPPMSEKAIRARKKY+RQKEVR IELRNARERNRKKLGLKI
Sbjct: 361 SEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYIRQKEVRVIELRNARERNRKKLGLKI 420

Query: 421 LSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLS 480
           LSHVFKKLKQT GFK EFQIEKEVREIFRNWADEV Q FFES E+HAEWH  LSAGDFLS
Sbjct: 421 LSHVFKKLKQTDGFKFEFQIEKEVREIFRNWADEVAQHFFESLENHAEWHHALSAGDFLS 480

Query: 481 LKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLT 540
           LKHIRNQLPEDLVNAYDRFQDQ++KHLNPVK K  KAREDEEKRVEEE+ YA+RTVEDLT
Sbjct: 481 LKHIRNQLPEDLVNAYDRFQDQVDKHLNPVKVKYVKAREDEEKRVEEEQKYARRTVEDLT 540

Query: 541 RLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCC 600
           RLCIKVDAPIELVRKAVKMVGFTNKMGRPQPIS L+ALED DIIKWY+GVGRRWLDFFCC
Sbjct: 541 RLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISLLVALEDIDIIKWYAGVGRRWLDFFCC 600

Query: 601 CHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTERE 660
           CHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKH+SKDLKV DLNG+EEIHFPTERE
Sbjct: 601 CHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHFSKDLKVFDLNGDEEIHFPTERE 660

Query: 661 VKMLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVN 720
           VKMLG+R L DPYPVDG LSL +IRL  DE S PCIAHFCNRTDSILYRVRLLQRTLNVN
Sbjct: 661 VKMLGDRMLADPYPVDGTLSLFLIRLAIDEPSCPCIAHFCNRTDSILYRVRLLQRTLNVN 720

Query: 721 PFNGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
             +G +WVRGMGVIHESLNQRCLPLCADHIS LYMG+INLQDLDCTLSLDMD
Sbjct: 721 SCDGEKWVRGMGVIHESLNQRCLPLCADHISDLYMGRINLQDLDCTLSLDMD 767

BLAST of Cla97C10G189660 vs. TrEMBL
Match: tr|A0A1S4E4Y1|A0A1S4E4Y1_CUCME (uncharacterized protein LOC103501588 OS=Cucumis melo OX=3656 GN=LOC103501588 PE=4 SV=1)

HSP 1 Score: 1394.4 bits (3608), Expect = 0.0e+00
Identity = 707/767 (92.18%), Positives = 728/767 (94.92%), Query Frame = 0

Query: 1   MFIHFRKIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPETLTRPQLKALVLSH 60
           MFIHFRKIAT LKPK SNSL KLYSHLPL+ K    L LPSQHSPE+LT  QLKALVLS 
Sbjct: 1   MFIHFRKIATSLKPKFSNSLNKLYSHLPLRKK---VLNLPSQHSPESLTSSQLKALVLSR 60

Query: 61  FSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVSNCFSVEVMARELSE 120
           FS GKF DLFQNVVASPSVLLTAS+NLITPP SNAPDSLP+ DLVS CFSVEVMARELSE
Sbjct: 61  FSHGKFVDLFQNVVASPSVLLTASRNLITPPFSNAPDSLPLSDLVSKCFSVEVMARELSE 120

Query: 121 NRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG 180
           NRFDVGACCV M P EEKGESLVLPN+KLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG
Sbjct: 121 NRFDVGACCVPMAPLEEKGESLVLPNLKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMG 180

Query: 181 RHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFELE 240
           RHTAIRYLKNSVQNPSWWFTVAFR KKFESVHVNTLCLLMQEKIKDDILIYMLRKLFE+E
Sbjct: 181 RHTAIRYLKNSVQNPSWWFTVAFRRKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFEVE 240

Query: 241 AIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIVSFH 300
           AIQIELG CYLGRGFPQES LCSIL+NIYFNGFDKEIQ+IRLQKNEENPKFNLDEIVSFH
Sbjct: 241 AIQIELGGCYLGRGFPQESGLCSILLNIYFNGFDKEIQRIRLQKNEENPKFNLDEIVSFH 300

Query: 301 NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKIS 360
           NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKI 
Sbjct: 301 NPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKIG 360

Query: 361 FLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVF 420
           FLGMELQAV PSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSH+F
Sbjct: 361 FLGMELQAVTPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHMF 420

Query: 421 KKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR 480
           KK K+T GFKSEFQIE EVR IFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR
Sbjct: 421 KKFKRTGGFKSEFQIETEVRSIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIR 480

Query: 481 NQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIK 540
           NQLPEDLVNAYDRFQ Q+NKHLNPVK KKEKAREDEEKR+EEEELYAKRTVEDLTRLCIK
Sbjct: 481 NQLPEDLVNAYDRFQYQVNKHLNPVKVKKEKAREDEEKRLEEEELYAKRTVEDLTRLCIK 540

Query: 541 VDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYK 600
           VDAPIELVRKAV+MVGFTNKMGRP+PISSLIALED DIIKWYSGVGRRWLDFFCCCHNYK
Sbjct: 541 VDAPIELVRKAVRMVGFTNKMGRPRPISSLIALEDADIIKWYSGVGRRWLDFFCCCHNYK 600

Query: 601 MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLG 660
           MVKTVVTYHLRFSCILTLAEKHESTKREAMKHY KDLKV DLNGNEE+HFPTE+ VKMLG
Sbjct: 601 MVKTVVTYHLRFSCILTLAEKHESTKREAMKHYGKDLKVFDLNGNEEMHFPTEKAVKMLG 660

Query: 661 ERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGV 720
           ERNL DPYPVDGALSLL+IRL TDEASYPCIA+FCNRT+SILYRVRLLQRTLNVNP +GV
Sbjct: 661 ERNLADPYPVDGALSLLLIRLATDEASYPCIANFCNRTNSILYRVRLLQRTLNVNPSDGV 720

Query: 721 EWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           EWV+GMGVIHESLNQRCLPLCADHIS LYMGKINLQDLDCTLSLDMD
Sbjct: 721 EWVKGMGVIHESLNQRCLPLCADHISDLYMGKINLQDLDCTLSLDMD 764

BLAST of Cla97C10G189660 vs. TrEMBL
Match: tr|A0A2I4ECV4|A0A2I4ECV4_9ROSI (uncharacterized protein LOC108988417 OS=Juglans regia OX=51240 GN=LOC108988417 PE=4 SV=1)

HSP 1 Score: 1056.6 bits (2731), Expect = 2.7e-305
Identity = 555/779 (71.25%), Positives = 636/779 (81.64%), Query Frame = 0

Query: 1   MFIHFRKIATPL--KPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSP-ETLTRPQLKALV 60
           M +H R+I  PL  +P +S SL KL+S L       PQ  LPS   P E LT+PQL+ LV
Sbjct: 1   MLMHLRRI-NPLGFRPSISYSL-KLFSTLLPNPSAVPQ-TLPSTPDPTEPLTKPQLEHLV 60

Query: 61  LSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNA---PDSLPVFDLVSNCFSVEVM 120
           L  +S GKFF+L QNVVA P+VLLTA QNL T   +NA   PDS  +   VS  F +  M
Sbjct: 61  LRQYSHGKFFNLVQNVVALPAVLLTACQNLTTRRPNNALKPPDSSSLLHYVSKRFDIADM 120

Query: 121 ARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYG 180
            REL ENRFDV ACCV M+PS +KGESLVLPN+KLKVLIEAIRMVLEIVYDERFVTFSYG
Sbjct: 121 GRELCENRFDVKACCVTMLPSRKKGESLVLPNLKLKVLIEAIRMVLEIVYDERFVTFSYG 180

Query: 181 GRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLR 240
           GRVGMGRHTA RYLK SV+NPSWWF V+F  + FE+ HVN LCL ++EKI D ILI ++ 
Sbjct: 181 GRVGMGRHTAFRYLKKSVENPSWWFNVSFDREMFENRHVNRLCLFIEEKINDRILINIIN 240

Query: 241 KLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLD 300
            LFE E ++IELG CYLGRGFPQES L SI INIYFNGFDKEIQ  RL KN+EN KF+ +
Sbjct: 241 TLFECEVVRIELGGCYLGRGFPQESGLSSIFINIYFNGFDKEIQDKRLLKNQENLKFDPN 300

Query: 301 EIVS----FHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTA 360
           E+VS    F+ PVKIY VRYLDEILVITSGSK+LTM+LK+ V+ YLEG LE +VDRM TA
Sbjct: 301 ELVSTTGVFYKPVKIYVVRYLDEILVITSGSKVLTMDLKNWVVNYLEGRLEFKVDRMKTA 360

Query: 361 IHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKK 420
           IHSAVSE I+FLGMELQAV PSVLHPPM+EKAIRARKKYLRQKEVR +EL+NARERNRKK
Sbjct: 361 IHSAVSENINFLGMELQAVTPSVLHPPMTEKAIRARKKYLRQKEVRTLELKNARERNRKK 420

Query: 421 LGLKILSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSA 480
           LGLKI  HVFKKLKQ  GFK EFQIE EV++IFR+WADEVV+DF  S E+  EWHR L+A
Sbjct: 421 LGLKIFQHVFKKLKQCDGFKFEFQIENEVQKIFRSWADEVVRDFLGSLEERWEWHRNLTA 480

Query: 481 GDFLSLKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEE--LYAK 540
           GDFLSL+HIR+QLP++LV+AYD+FQ+QI KHL+P KA+KE   E EE+RVEEEE   YA 
Sbjct: 481 GDFLSLRHIRDQLPQELVDAYDKFQEQIYKHLSPAKARKE--LEKEERRVEEEEELKYAN 540

Query: 541 RTVEDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRR 600
           RTVEDLTRLC+KVDAPIELVRK VKM GFTN MGRP+PI  L+ALEDTDIIKWY+GVGRR
Sbjct: 541 RTVEDLTRLCMKVDAPIELVRKGVKMAGFTNSMGRPRPIKLLVALEDTDIIKWYAGVGRR 600

Query: 601 WLDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEI 660
           WLDFFCCCHN+KMVKTVVTYHLRFSCILTLAEKHESTKREAMKHY+KDLKV DL+GNEE+
Sbjct: 601 WLDFFCCCHNFKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYTKDLKVSDLDGNEEV 660

Query: 661 HFPTEREVKMLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLL 720
           +FPTEREVKM+G++NL DP PVDG LSL +IRL +DE S  CIAHFC++  ++ YRVRLL
Sbjct: 661 YFPTEREVKMMGDKNLSDPKPVDGTLSLALIRLASDEPSCSCIAHFCDQMATVFYRVRLL 720

Query: 721 QRTLNVNPFNGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           Q  LNVNP +  +WV GMG IHESLN++CLPLC+DHIS LYMGKI LQD+DCT  +D D
Sbjct: 721 QNCLNVNPSDQEKWVPGMGAIHESLNRKCLPLCSDHISDLYMGKITLQDIDCTSFVDED 774

BLAST of Cla97C10G189660 vs. TrEMBL
Match: tr|F6HLP1|F6HLP1_VITVI (Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_08s0007g05780 PE=4 SV=1)

HSP 1 Score: 1040.8 bits (2690), Expect = 1.5e-300
Identity = 529/766 (69.06%), Positives = 628/766 (81.98%), Query Frame = 0

Query: 7   KIATPLKPKLSNSLTKLYSHLPLKLKHSPQLKLPSQHSPET-LTRPQLKALVLSHFSRGK 66
           +++  L PK   +L    S L L  +HS    LP   +P T LT+PQLKALV++H+SRGK
Sbjct: 21  RVSMLLNPKRIATLHSRVSILSLLRRHS---TLPPNPNPTTPLTKPQLKALVINHYSRGK 80

Query: 67  FFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVSNCFSVEVMARELSENRFDV 126
           F +L QNVVASP VLL A QNL   P SN  +SL     V+  FSVE + REL ENRFDV
Sbjct: 81  FSNLIQNVVASPPVLLLACQNL--TPRSNDVNSL-ASPAVALRFSVEELGRELGENRFDV 140

Query: 127 GACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFSYGGRVGMGRHTAI 186
            +CCVRMVPS +KGESLVLPN+KLKV+IEAIRMVLEIVYDER VTF+YGGRVGMGRHTAI
Sbjct: 141 ESCCVRMVPSRKKGESLVLPNLKLKVVIEAIRMVLEIVYDERLVTFAYGGRVGMGRHTAI 200

Query: 187 RYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYMLRKLFELEAIQIE 246
           RYLKNSVQNP+WWF V F  +KFE  +VN LCL+++EKIKD +LI ++RKLFE E +QIE
Sbjct: 201 RYLKNSVQNPNWWFKVTFDREKFEHKNVNKLCLIIEEKIKDTVLIGIVRKLFECEVLQIE 260

Query: 247 LGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQKNEENPKFNLDEIVS----FHN 306
           LG CYLGRGFPQE  L SILIN+YFNGFDKEIQ +R++ N+ENP+F+ +E++S    F+ 
Sbjct: 261 LGGCYLGRGFPQECGLSSILINVYFNGFDKEIQDLRIRTNQENPRFDSNEVLSGSSVFYK 320

Query: 307 PVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKISF 366
           PVKIYAVRYLDEILVITSGSKMLTM+LK+QV+K+LEG LEL+VDR+  AIHSA  EKI F
Sbjct: 321 PVKIYAVRYLDEILVITSGSKMLTMDLKNQVMKFLEGKLELKVDRLKMAIHSATMEKIDF 380

Query: 367 LGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVFK 426
           LGMELQAV PSVL PPMSEKAIRA+KKYLRQKEV+AIELRNARE NRKKLGLKIL+HVFK
Sbjct: 381 LGMELQAVQPSVLRPPMSEKAIRAQKKYLRQKEVKAIELRNARETNRKKLGLKILAHVFK 440

Query: 427 KLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKHIRN 486
           KLKQ+  FK +F IE EVREIFR WADEVV++F  S E+ A W+R+LS GDFLSL+HIR+
Sbjct: 441 KLKQSDEFKFDFHIENEVREIFRTWADEVVKEFLGSLEEQANWYRMLSVGDFLSLRHIRH 500

Query: 487 QLPEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIKV 546
           QLP++LV+AYD FQ+Q++KH+ PVKA+K     +          YA+RTV++LTRLC+KV
Sbjct: 501 QLPQELVDAYDHFQEQVDKHIKPVKARKALEEAERXXXXXXXXXYAERTVQELTRLCMKV 560

Query: 547 DAPIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYKM 606
           DAPIELVRKAVKM GFTN MGRP+PI  LIALEDTDIIKWY+GVGRRWLDFFCCCHN+KM
Sbjct: 561 DAPIELVRKAVKMAGFTNNMGRPRPIKLLIALEDTDIIKWYAGVGRRWLDFFCCCHNFKM 620

Query: 607 VKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLGE 666
           VKTVVTYHLRFSC+LTLAEKHESTK E ++HY+KDLKV D NG EE+HFP ERE+KM+G+
Sbjct: 621 VKTVVTYHLRFSCLLTLAEKHESTKLETIRHYTKDLKVSDFNGIEEVHFPAEREIKMMGD 680

Query: 667 RNLVDPYPVDGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGVE 726
           +NL DP PVDGALSL +IRL +DE +Y C+AHFC+R D+I+YRVRLLQ  LNVNP +  +
Sbjct: 681 KNLSDPKPVDGALSLALIRLASDEPAYSCVAHFCDRKDTIVYRVRLLQNRLNVNPLDEKK 740

Query: 727 WVRGMGVIHESLNQRCLPLCADHISALYMGKINLQDLDCTLSLDMD 768
           WV GMG IHE LN++CLPLC+DHI  LYMG I+LQD+DCT  +D+D
Sbjct: 741 WVPGMGAIHEGLNRKCLPLCSDHIHDLYMGTISLQDIDCTSFVDVD 780

BLAST of Cla97C10G189660 vs. TrEMBL
Match: tr|A0A2P6RTE0|A0A2P6RTE0_ROSCH (Putative reverse transcriptase domain, domain X OS=Rosa chinensis OX=74649 GN=RchiOBHm_Chr2g0124831 PE=4 SV=1)

HSP 1 Score: 1008.8 bits (2607), Expect = 6.4e-291
Identity = 503/733 (68.62%), Positives = 606/733 (82.67%), Query Frame = 0

Query: 40  PSQHSPETLTRPQLKALVLSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPD-S 99
           P+ +S + L+  QLK+LVLS ++RGKF +L QNV+A P+VLLTA QNL TP   N    S
Sbjct: 53  PNSNSTQPLSESQLKSLVLSRYARGKFTNLLQNVIALPAVLLTACQNLTTPQTQNGLRLS 112

Query: 100 LPVFDLVSNCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRM 159
           LP  D VS  FS+  M REL ENRFDV A  V M      GESLVLP++KLKVLIEAIR+
Sbjct: 113 LP--DSVSKRFSIHEMGRELCENRFDVAASSVTMAAPRNGGESLVLPSLKLKVLIEAIRI 172

Query: 160 VLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCL 219
           VL +VYDERFVTFSYGGRV MGRHTAIRYLKNSV NPSWWF+V+F   KFE  HVN LCL
Sbjct: 173 VLGVVYDERFVTFSYGGRVNMGRHTAIRYLKNSVANPSWWFSVSFNGGKFEQRHVNKLCL 232

Query: 220 LMQEKIKDDILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQ 279
            M EKI+D++L  +++ LFE  A++IELG+C  GRGFPQES L SIL+NIYFNGFDKEIQ
Sbjct: 233 FMHEKIEDEVLTNIIKTLFECGAVRIELGSCCFGRGFPQESGLSSILMNIYFNGFDKEIQ 292

Query: 280 QIRLQKNEENPKFNLDEIVS----FHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLK 339
           ++RL+KN+E+PKF  +E+VS    F+ PVKIYAVRYLDEILV+TSGSKMLTM+LK+ V+K
Sbjct: 293 EMRLKKNQEHPKFESNELVSEDGVFYKPVKIYAVRYLDEILVMTSGSKMLTMDLKNWVVK 352

Query: 340 YLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKE 399
           YLEG+LEL VD++ T+IHSAVSEKI F+GMELQAVPPSVLHPPMSEKAIRARKKYLRQKE
Sbjct: 353 YLEGSLELMVDKIKTSIHSAVSEKIDFMGMELQAVPPSVLHPPMSEKAIRARKKYLRQKE 412

Query: 400 VRAIELRNARERNRKKLGLKILSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDF 459
           VRAIEL+NARERNRKKLG+KI+SHVFKKLK + G KSE+QIE +VREIFR WADEVVQ+F
Sbjct: 413 VRAIELKNARERNRKKLGMKIMSHVFKKLKSSDGLKSEYQIENQVREIFRTWADEVVQEF 472

Query: 460 FESSEDHAEWHRVLSAGDFLSLKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKARE 519
            ES ++  +W+R LSAG+FLSL+HIR QLPE+LV+AYD+FQ+Q++KHLNPV+ +K +  E
Sbjct: 473 LESLDERWDWYRKLSAGNFLSLRHIRQQLPEELVDAYDKFQEQVDKHLNPVRDRKAREEE 532

Query: 520 DEEKRVEEEELYAKRTVEDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALE 579
           +        + YAKRTVEDLT+LC+K DAPIE++RK VK++GFTN MGRP+PI+ L ALE
Sbjct: 533 EXXXXXXXXQKYAKRTVEDLTKLCVKADAPIEVLRKMVKLIGFTNHMGRPRPITLLTALE 592

Query: 580 DTDIIKWYSGVGRRWLDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYS 639
           D DIIKWY+GVGRR LDF+CCCHN+KMVKT+VTYHLRFSCILTLAEKHESTK EAMKHY+
Sbjct: 593 DADIIKWYAGVGRRLLDFYCCCHNFKMVKTIVTYHLRFSCILTLAEKHESTKSEAMKHYT 652

Query: 640 KDLKVVDLNGNEEIHFPTEREVKMLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHF 699
           KDLKV D+NGN+E++FPTEREVKM+G++NL DP PVDGA SL +IRL +DE  Y C+AHF
Sbjct: 653 KDLKVFDINGNQEVYFPTEREVKMMGDKNLSDPIPVDGAFSLALIRLASDEPPYSCVAHF 712

Query: 700 CNRTDSILYRVRLLQRTLNVNPFNGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKIN 759
           C+RTD+I+YRVRLLQ  LN+ P +  +WV GMG I+ESL+ +C PLC DHI  LYMG I 
Sbjct: 713 CDRTDTIVYRVRLLQSRLNLTPVDDKKWVPGMGAINESLHLKCFPLCPDHIHDLYMGSIT 772

Query: 760 LQDLDCTLSLDMD 768
            QD+DCT  +D+D
Sbjct: 773 FQDIDCTSFVDVD 783

BLAST of Cla97C10G189660 vs. TrEMBL
Match: tr|A0A2N9I3C4|A0A2N9I3C4_FAGSY (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47064 PE=4 SV=1)

HSP 1 Score: 1004.6 bits (2596), Expect = 1.2e-289
Identity = 497/678 (73.30%), Positives = 576/678 (84.96%), Query Frame = 0

Query: 94  NAPDSLPVFDLVSNCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLI 153
           N+PDS  + D VS  F +E M REL E+RFDV  CCV MVPS +KGE+LVLPN+KLKVLI
Sbjct: 26  NSPDSQSLLDSVSRRFDIEEMGRELCEDRFDVEGCCVTMVPSRKKGENLVLPNLKLKVLI 85

Query: 154 EAIRMVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHV 213
           EAIRMVLEIVYDERF+TFSYGGRVGMGRHTAIRYLKNSV+NPSWWF+V F  +KF+  HV
Sbjct: 86  EAIRMVLEIVYDERFLTFSYGGRVGMGRHTAIRYLKNSVENPSWWFSVTFDREKFDDRHV 145

Query: 214 NTLCLLMQEKIKDDILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGF 273
           N LCL ++EKIKD   I ++++LFE E + IELG CYLGRGFPQES L SILINIY NGF
Sbjct: 146 NKLCLFIEEKIKDGFFIGIIKRLFECEVVGIELGGCYLGRGFPQESGLSSILINIYLNGF 205

Query: 274 DKEIQQIRLQKNEENPKFNLDEIVS----FHNPVKIYAVRYLDEILVITSGSKMLTMELK 333
           DKEIQ +RL+K++ENPKF  +E+VS    F+ PVK++ VRYLDEILVITSGSKMLTM+LK
Sbjct: 206 DKEIQDMRLRKSQENPKFESEELVSMSRMFYKPVKMFVVRYLDEILVITSGSKMLTMDLK 265

Query: 334 SQVLKYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKY 393
           +  LKYLEG+LEL+VD+M TAIHSAVSE ISFLGMELQAVPPSVLHPPM+EKA+RARKKY
Sbjct: 266 NWALKYLEGSLELKVDKMKTAIHSAVSENISFLGMELQAVPPSVLHPPMTEKAMRARKKY 325

Query: 394 LRQKEVRAIELRNARERNRKKLGLKILSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADE 453
           LRQKEVRA+EL+NARERNRKKLG+KIL HVFKKLKQ+ GFK EFQIE +VREIF  WADE
Sbjct: 326 LRQKEVRALELKNARERNRKKLGMKILQHVFKKLKQSDGFKFEFQIENKVREIFTTWADE 385

Query: 454 VVQDFFESSEDHAEWHRVLSAGDFLSLKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKK 513
           VVQ+F  S E+  EWHR L+AGDFLSL+HIR+QLP++LV AYD+FQ+Q++K+L+PV+A+K
Sbjct: 386 VVQNFLGSLEERWEWHRKLTAGDFLSLRHIRDQLPQELVEAYDKFQEQVDKYLSPVQARK 445

Query: 514 EKAREDEEKRVEEEELYAKRTVEDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISS 573
              +            YAKRTVEDLTRLC+KVDAPIELVRKAVKM GFTN MGRP+PI  
Sbjct: 446 ALEKXXXXXXXXXXXXYAKRTVEDLTRLCVKVDAPIELVRKAVKMAGFTNSMGRPRPIKL 505

Query: 574 LIALEDTDIIKWYSGVGRRWLDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREA 633
           LIALEDTDIIKWY+GVGRRWLDFFCCCHN+KMVK VVTYHLRFSCILTLAEKHESTK EA
Sbjct: 506 LIALEDTDIIKWYAGVGRRWLDFFCCCHNFKMVKIVVTYHLRFSCILTLAEKHESTKLEA 565

Query: 634 MKHYSKDLKVVDLNGNEEIHFPTEREVKMLGERNLVDPYPVDGALSLLVIRLVTDEASYP 693
           +KHY+KDLKV DLNGNEE++FPTEREVKM+G++NL DP PVDGALSL +IRL  DE SY 
Sbjct: 566 IKHYTKDLKVSDLNGNEEVYFPTEREVKMMGDQNLSDPRPVDGALSLALIRLAYDEPSYS 625

Query: 694 CIAHFCNRTDSILYRVRLLQRTLNVNPFNGVEWVRGMGVIHESLNQRCLPLCADHISALY 753
           CIAHFC+R D++ YRVRLLQ  LNVNP    +WV GMG IHESLN++CLPLC+DHI+ LY
Sbjct: 626 CIAHFCDRMDTVFYRVRLLQNRLNVNPSVEEKWVSGMGAIHESLNRKCLPLCSDHINDLY 685

Query: 754 MGKINLQDLDCTLSLDMD 768
            GKI LQD+DCT  +D+D
Sbjct: 686 TGKITLQDIDCTSFVDVD 703

BLAST of Cla97C10G189660 vs. Swiss-Prot
Match: sp|Q9LZA5|NMAT3_ARATH (Nuclear intron maturase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT3 PE=3 SV=2)

HSP 1 Score: 793.1 bits (2047), Expect = 2.7e-228
Identity = 419/732 (57.24%), Positives = 527/732 (71.99%), Query Frame = 0

Query: 43  HSPETLTRP----QLKALVLSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDS 102
           +S +T+T P    +L+ALVL  +S GKF+ L +N V+ P VLL A QNL      +A  S
Sbjct: 35  NSDQTITEPLVKSELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNLSL----SANSS 94

Query: 103 LPVFDLVSNCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRM 162
             + D VS  FS+E M RE+ E RFD+ +CCV  + S     SLVLPN+KLKVLIEAIRM
Sbjct: 95  GDLADRVSRRFSIEEMGREIREGRFDIRSCCVEFISS-----SLVLPNLKLKVLIEAIRM 154

Query: 163 VLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCL 222
           VLEIVYD+RF TFSYGGRVGMGRHTAIRYLKNSV+NP WWF V+F  + FE  +V+ LC 
Sbjct: 155 VLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVDILCG 214

Query: 223 LMQEKIKDDILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQ 282
            + EKI D +LI M++KLFE   ++IELG C  GRGFPQE  LCSILIN+YF+G DKEIQ
Sbjct: 215 FVGEKINDVMLIEMIKKLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLDKEIQ 274

Query: 283 QIRLQKNEENPKFNLDEIVS----FHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLK 342
            +RL+   +NP+    +  S    F  PV IYAVRYLDEILVITSGSKMLTM+LK +++ 
Sbjct: 275 DLRLKMKVKNPRVGTGDEESTGNVFFKPVNIYAVRYLDEILVITSGSKMLTMDLKKRIVD 334

Query: 343 YLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKE 402
            LE  LEL VDR+NT+IHSAVSEKI+FLGM LQAVPPSVL PP SEKA+RA KKY RQK+
Sbjct: 335 ILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKYQRQKD 394

Query: 403 VRAIELRNARERNRKKLGLKILSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDF 462
           VR +ELRNARERNRK LGLKI  HV KK+KQ++GFK E +IE EVR+IF++W +EV+QDF
Sbjct: 395 VRKLELRNARERNRKTLGLKIFRHVLKKIKQSNGFKFEGEIENEVRDIFQSWGEEVMQDF 454

Query: 463 FESSEDHAEWHRVLSAGDFLSLKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKARE 522
             S E+  +WH +L+ GDFLSL+HIR +LP+DL++AYD FQ+Q++KHL P +AKK     
Sbjct: 455 MGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQVDKHLAPTQAKKVLEXX 514

Query: 523 DEEKRVEEEELYAKRTVEDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALE 582
                       A+RTVEDLT+LC+KV AP ELVRKA+K+VGFTN MGRP+PI  L+ LE
Sbjct: 515 XXXXXXXXXXXXAERTVEDLTKLCMKVSAPEELVRKAIKLVGFTNSMGRPRPIIHLVTLE 574

Query: 583 DTDIIKWYSGVGRRWLDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYS 642
           D+DIIKWY+                                    EKH STK+  ++HY+
Sbjct: 575 DSDIIKWYA----------------------------------RHEKHGSTKK-LIRHYT 634

Query: 643 KDLKVVDLNGNEEIHFPTEREVKMLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHF 702
           KDL+V DL+G EE HFP+EREVKM+G++NL DP PVDG LSLL+IRL +DE  + C A F
Sbjct: 635 KDLRVSDLDGREEAHFPSEREVKMMGDKNLSDPKPVDGTLSLLLIRLASDEPLHHCAASF 694

Query: 703 CNRTDSILYRVRLLQRTLNVNPFNGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKIN 762
           C R+D+I++RV LLQ  L++NP +  +WV GMG IH +LN++CLPLC+ HIS +Y+GKI 
Sbjct: 695 CERSDTIMHRVHLLQNRLHINPLDEEKWVPGMGTIHSALNRKCLPLCSTHISDVYLGKIT 722

Query: 763 LQDLDCTLSLDM 767
           LQD+D +  +D+
Sbjct: 755 LQDVDSSSFIDL 722

BLAST of Cla97C10G189660 vs. Swiss-Prot
Match: sp|Q9CA78|NMAT4_ARATH (Nuclear intron maturase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT4 PE=3 SV=2)

HSP 1 Score: 222.2 bits (565), Expect = 1.9e-56
Identity = 201/759 (26.48%), Positives = 336/759 (44.27%), Query Frame = 0

Query: 52  QLKALVLSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVS----- 111
           +LK  V      GKF DL + V+A P              L +A D + +   VS     
Sbjct: 90  RLKKRVKEQCINGKFSDLLKKVIARPET------------LRDAYDCIRLNSNVSITERN 149

Query: 112 NCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDE 171
              + + +A ELS   FDV +    +V  ++  E LVLP++ LKV+ EAIR+VLE+V+  
Sbjct: 150 GSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVLPSVALKVVQEAIRIVLEVVFSP 209

Query: 172 RFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKD 231
            F   S+  R G GR +A++Y+ N++    W FT++   K   SV  N L  +M+EK++D
Sbjct: 210 HFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLNKKLDVSVFENLLS-VMEEKVED 269

Query: 232 DILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQI------ 291
             L  +LR +FE   + +E G    G G PQE  L  +L+NIY + FD E  +I      
Sbjct: 270 SSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVLMNIYLDRFDHEFYRISMRHEA 329

Query: 292 --------------------RLQKNEENPKFNLDEIVSFHNPVKIYAVRYLDEILVITSG 351
                               R Q  E+  K   ++ V+    +++Y  R++DEI    SG
Sbjct: 330 LGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVA----LRVYCCRFMDEIYFSVSG 389

Query: 352 SKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSE 411
            K +  +++S+ + +L  +L L++           +  +  LG     V  +V   P + 
Sbjct: 390 PKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG---TLVRKNVRESP-TV 449

Query: 412 KAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVFKKLKQTS------------- 471
           KA+   K+ +R   + A++   A      ++G K L H  KK+K++              
Sbjct: 450 KAVHKLKEKVR---LFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKGLADSNSTLSQ 509

Query: 472 ---GFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKH-IRNQL 531
                K+  + +   + + R W ++V++    +S D +E        +F+  KH +   +
Sbjct: 510 ISCHRKAGMETDHWYKILLRIWMEDVLR----TSADRSE--------EFVLSKHVVEPTV 569

Query: 532 PEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIKVDA 591
           P++L +A+ +FQ+    +++   A               E L       D       V A
Sbjct: 570 PQELRDAFYKFQNAAAAYVSSETANL-------------EALLPCPQSHDRPVFFGDVVA 629

Query: 592 PIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYKMVK 651
           P   + + +   G     G  +  S LI L+   II WYSG+ RRW+ ++  C N+  +K
Sbjct: 630 PTNAIGRRLYRYGLITAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIK 689

Query: 652 TVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLGERN 711
            ++   +R SCI TLA K+   + E  K    +L  +    + E     E+      +R+
Sbjct: 690 ALIDNQIRMSCIRTLAAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRD 749

Query: 712 LVDPYPV--DGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGVE 760
               Y +   G   L + RLV++     C    C+     +Y +  ++R      F G  
Sbjct: 750 EHLTYGLSNSGLCLLSLARLVSESRPCNCFVIGCSMAAPAVYTLHAMER----QKFPG-- 793

BLAST of Cla97C10G189660 vs. Swiss-Prot
Match: sp|P03876|AI2M_YEAST (Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=AI2 PE=4 SV=2)

HSP 1 Score: 77.4 bits (189), Expect = 7.6e-13
Identity = 72/291 (24.74%), Positives = 130/291 (44.67%), Query Frame = 0

Query: 114 MARELSENRFDVGACCVRMVPSEEKG-ESLVLPNMKLKVLIEAIRMVLEIVYDERFVTFS 173
           ++++++ N F         +P    G   L + N + K++ E++RM+LEI+Y+  F  +S
Sbjct: 329 LSKDINTNMFKFSPVRRVEIPKTSGGFRPLSVGNPREKIVQESMRMMLEIIYNNSFSYYS 388

Query: 174 YGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDILIYM 233
           +G R  +   TAI   KN +Q  +W+  V   +K F+++  N L  ++ E+IKD   + +
Sbjct: 389 HGFRPNLSCLTAIIQCKNYMQYCNWFIKVDL-NKCFDTIPHNMLINVLNERIKDKGFMDL 448

Query: 234 LRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYF--------NGFDKEIQQIRLQK 293
           L KL     +           G PQ S +  IL NI+         N F+ E     +  
Sbjct: 449 LYKLLRAGYVDKNNNYHNTTLGIPQGSVVSPILCNIFLDKLDKYLENKFENEFNTGNMSN 508

Query: 294 NEENPKFN-----------LDE--------------IVSFHNPVKIYAVRYLDEILVITS 353
              NP +N           L E              + S  +  + Y VRY D+I++   
Sbjct: 509 RGRNPIYNSLSSKIYRCKLLSEKLKLIRLRDHYQRNMGSDKSFKRAYFVRYADDIIIGVM 568

Query: 354 GSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVP 371
           GS      + + +  +L+ NL + ++ M+ ++     E +SFLG +++  P
Sbjct: 569 GSHNDCKNILNDINNFLKENLGMSIN-MDKSVIKHSKEGVSFLGYDVKVTP 617

BLAST of Cla97C10G189660 vs. Swiss-Prot
Match: sp|P38478|YMF40_MARPO (Uncharacterized mitochondrial protein ymf40 OS=Marchantia polymorpha OX=3197 GN=YMF40 PE=4 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 1.1e-11
Identity = 68/287 (23.69%), Positives = 128/287 (44.60%), Query Frame = 0

Query: 111 VEVMARELSENRFDVGACCVRMVP-SEEKGESLVLPNMKLKVLIEAIRMVLEIVYDERFV 170
           VE + R+L +  F         +P ++ K  SL +P+ + K++ E +R +LE V++ RF+
Sbjct: 46  VEKVVRQLKDESFQFRPSRREFIPKADGKLRSLGIPSPRDKIVQEVMRRILEPVFEPRFL 105

Query: 171 TFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKDDIL 230
             S+G R     HTA+R ++      + W         F+++  + L   + E +KD  L
Sbjct: 106 DSSHGFRPHRSPHTALRQIRR--WTGTSWMIEGDIKGYFDNIDHHLLAGFIAELVKDQRL 165

Query: 231 IYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQIRLQ------ 290
           + +  KL     +       +L  G PQ   L  +L NIY + FD  +++I+++      
Sbjct: 166 LALYWKLVRAGYVNQGKAEPHLLTGVPQGRILSPLLSNIYLHQFDLFMEEIKVKYTTTGA 225

Query: 291 KNEENP----------------KFNLDEIVSFHNPV-----------KIYAVRYLDEILV 350
            ++ NP                K +  EI+     +           ++  VRY D+ ++
Sbjct: 226 LSKNNPIYLKARNKYYKLVKSLKASSAEIIRARRDMLKMTYGIQTGSRVRYVRYADDWVI 285

Query: 351 ITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKISFLG 364
             +G K L +++K +V  +L+  L+L +    T I +    +  FLG
Sbjct: 286 GVTGPKALAVQIKEEVSTFLQEKLKLSLQAEKTRITNLSRSEALFLG 330

BLAST of Cla97C10G189660 vs. Swiss-Prot
Match: sp|B1N1A3|NICA_PSEPU (Putative nicotine oxidoreductase OS=Pseudomonas putida OX=303 GN=nicA PE=4 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 2.5e-11
Identity = 71/264 (26.89%), Positives = 122/264 (46.21%), Query Frame = 0

Query: 150 KVLIEAIRMVLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFE 209
           KV+ E IR +LE +Y+  F   S+G R G   HTA++ ++ S    +W      +   F+
Sbjct: 109 KVVQEVIRSILEAIYEPTFSKNSHGFRAGKSCHTALKQVRESWSGVTWVIEGDIK-GCFD 168

Query: 210 SVHVNTLCLLMQEKIKDDILIYMLRKLFELEAIQIELGACYLGR-GFPQESSLCSILINI 269
           ++  + L   ++ +IKD+  I ++RK   L A   E GA +    G PQ S +  IL N+
Sbjct: 169 NISHSKLIDQLRLRIKDERFINLIRK--ALNAGYFENGAFFSATLGTPQGSIISPILANV 228

Query: 270 YFNGFDKEIQQI-------------------RLQK------------------------N 329
           + +  D++++Q+                   +LQ+                         
Sbjct: 229 FLDQLDRKVEQLIKDHHQGEEGDKITDPAYRKLQRQKTSLRKKAEKQEGAERDATLSLAR 288

Query: 330 EENPK-FNLDEIVSFHNP-VKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELE 368
           E N K  ++   ++ +N  +++  VRY D+ ++  +G K+L  EL+S V ++LE N  LE
Sbjct: 289 EANSKLLSMSPYLTRNNGFIRVKYVRYADDWIIGVNGPKLLAEELRSVVGEFLE-NAGLE 348

BLAST of Cla97C10G189660 vs. TAIR10
Match: AT5G04050.2 (RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 704.9 bits (1818), Expect = 5.4e-203
Identity = 387/732 (52.87%), Positives = 485/732 (66.26%), Query Frame = 0

Query: 43  HSPETLTRP----QLKALVLSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDS 102
           +S +T+T P    +L+ALVL  +S GKF+ L +N V+ P VLL A QNL      +A  S
Sbjct: 35  NSDQTITEPLVKSELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNLSL----SANSS 94

Query: 103 LPVFDLVSNCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRM 162
             + D VS  FS+E M RE+ E RFD+ +CCV  + S     SLVLPN+KLKVLIEAIRM
Sbjct: 95  GDLADRVSRRFSIEEMGREIREGRFDIRSCCVEFISS-----SLVLPNLKLKVLIEAIRM 154

Query: 163 VLEIVYDERFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCL 222
           VLEIVYD+RF TFSYGGRVGMGRHTAIRYLKNSV+NP WWF V+F  + FE  +V+ LC 
Sbjct: 155 VLEIVYDDRFATFSYGGRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVDILCG 214

Query: 223 LMQEKIKDDILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQ 282
            + EKI D +LI M++KLFE   ++IELG C  GRGFPQE  LCSILIN+YF+G DKEIQ
Sbjct: 215 FVGEKINDVMLIEMIKKLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLDKEIQ 274

Query: 283 QIRLQKNEENPKFNLDEIVS----FHNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLK 342
            +RL+   +NP+    +  S    F  PV IYAVRYLDEILVITSGSKMLTM+LK +++ 
Sbjct: 275 DLRLKMKVKNPRVGTGDEESTGNVFFKPVNIYAVRYLDEILVITSGSKMLTMDLKKRIVD 334

Query: 343 YLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKE 402
            LE  LEL VDR+NT+IHSAVSEKI+FLGM LQAVPPSVL PP SEKA+RA KKY RQK+
Sbjct: 335 ILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKYQRQKD 394

Query: 403 VRAIELRNARERNRKKLGLKILSHVFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDF 462
           VR +ELRNARERNRK LGLKI  HV KK+KQ++GFK E +IE EVR+IF++W +EV+QDF
Sbjct: 395 VRKLELRNARERNRKTLGLKIFRHVLKKIKQSNGFKFEGEIENEVRDIFQSWGEEVMQDF 454

Query: 463 FESSEDHAEWHRVLSAGDFLSLKHIRNQLPEDLVNAYDRFQDQINKHLNPVKAKKEKARE 522
             S E+  +WH +L+ GDFLSL+HIR +LP+DL++AYD FQ+Q++KHL P +AKK     
Sbjct: 455 MGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQVDKHLAPTQAKKVLEXX 514

Query: 523 DEEKRVEEEELYAKRTVEDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPISSLIALE 582
                       A+RTVEDLT+LC+KV AP ELVRKA+                      
Sbjct: 515 XXXXXXXXXXXXAERTVEDLTKLCMKVSAPEELVRKAI---------------------- 574

Query: 583 DTDIIKWYSGVGRRWLDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYS 642
                                                                       
Sbjct: 575 ------------------------------------------------------------ 634

Query: 643 KDLKVVDLNGNEEIHFPTEREVKMLGERNLVDPYPVDGALSLLVIRLVTDEASYPCIAHF 702
              KV DL+G EE HFP+EREVKM+G++NL DP PVDG LSLL+IRL +DE  + C A F
Sbjct: 635 ---KVSDLDGREEAHFPSEREVKMMGDKNLSDPKPVDGTLSLLLIRLASDEPLHHCAASF 672

Query: 703 CNRTDSILYRVRLLQRTLNVNPFNGVEWVRGMGVIHESLNQRCLPLCADHISALYMGKIN 762
           C R+D+I++RV LLQ  L++NP +  +WV GMG IH +LN++CLPLC+ HIS +Y+GKI 
Sbjct: 695 CERSDTIMHRVHLLQNRLHINPLDEEKWVPGMGTIHSALNRKCLPLCSTHISDVYLGKIT 672

Query: 763 LQDLDCTLSLDM 767
           LQD+D +  +D+
Sbjct: 755 LQDVDSSSFIDL 672

BLAST of Cla97C10G189660 vs. TAIR10
Match: AT1G74350.1 (Intron maturase, type II family protein)

HSP 1 Score: 222.2 bits (565), Expect = 1.1e-57
Identity = 201/759 (26.48%), Positives = 336/759 (44.27%), Query Frame = 0

Query: 52  QLKALVLSHFSRGKFFDLFQNVVASPSVLLTASQNLITPPLSNAPDSLPVFDLVS----- 111
           +LK  V      GKF DL + V+A P              L +A D + +   VS     
Sbjct: 45  RLKKRVKEQCINGKFSDLLKKVIARPET------------LRDAYDCIRLNSNVSITERN 104

Query: 112 NCFSVEVMARELSENRFDVGACCVRMVPSEEKGESLVLPNMKLKVLIEAIRMVLEIVYDE 171
              + + +A ELS   FDV +    +V  ++  E LVLP++ LKV+ EAIR+VLE+V+  
Sbjct: 105 GSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVLPSVALKVVQEAIRIVLEVVFSP 164

Query: 172 RFVTFSYGGRVGMGRHTAIRYLKNSVQNPSWWFTVAFRHKKFESVHVNTLCLLMQEKIKD 231
            F   S+  R G GR +A++Y+ N++    W FT++   K   SV  N L  +M+EK++D
Sbjct: 165 HFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLNKKLDVSVFENLLS-VMEEKVED 224

Query: 232 DILIYMLRKLFELEAIQIELGACYLGRGFPQESSLCSILINIYFNGFDKEIQQI------ 291
             L  +LR +FE   + +E G    G G PQE  L  +L+NIY + FD E  +I      
Sbjct: 225 SSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVLMNIYLDRFDHEFYRISMRHEA 284

Query: 292 --------------------RLQKNEENPKFNLDEIVSFHNPVKIYAVRYLDEILVITSG 351
                               R Q  E+  K   ++ V+    +++Y  R++DEI    SG
Sbjct: 285 LGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVA----LRVYCCRFMDEIYFSVSG 344

Query: 352 SKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKISFLGMELQAVPPSVLHPPMSE 411
            K +  +++S+ + +L  +L L++           +  +  LG     V  +V   P + 
Sbjct: 345 PKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG---TLVRKNVRESP-TV 404

Query: 412 KAIRARKKYLRQKEVRAIELRNARERNRKKLGLKILSHVFKKLKQTS------------- 471
           KA+   K+ +R   + A++   A      ++G K L H  KK+K++              
Sbjct: 405 KAVHKLKEKVR---LFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKGLADSNSTLSQ 464

Query: 472 ---GFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKH-IRNQL 531
                K+  + +   + + R W ++V++    +S D +E        +F+  KH +   +
Sbjct: 465 ISCHRKAGMETDHWYKILLRIWMEDVLR----TSADRSE--------EFVLSKHVVEPTV 524

Query: 532 PEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTVEDLTRLCIKVDA 591
           P++L +A+ +FQ+    +++   A               E L       D       V A
Sbjct: 525 PQELRDAFYKFQNAAAAYVSSETANL-------------EALLPCPQSHDRPVFFGDVVA 584

Query: 592 PIELVRKAVKMVGFTNKMGRPQPISSLIALEDTDIIKWYSGVGRRWLDFFCCCHNYKMVK 651
           P   + + +   G     G  +  S LI L+   II WYSG+ RRW+ ++  C N+  +K
Sbjct: 585 PTNAIGRRLYRYGLITAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIK 644

Query: 652 TVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEIHFPTEREVKMLGERN 711
            ++   +R SCI TLA K+   + E  K    +L  +    + E     E+      +R+
Sbjct: 645 ALIDNQIRMSCIRTLAAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRD 704

Query: 712 LVDPYPV--DGALSLLVIRLVTDEASYPCIAHFCNRTDSILYRVRLLQRTLNVNPFNGVE 760
               Y +   G   L + RLV++     C    C+     +Y +  ++R      F G  
Sbjct: 705 EHLTYGLSNSGLCLLSLARLVSESRPCNCFVIGCSMAAPAVYTLHAMER----QKFPG-- 748

BLAST of Cla97C10G189660 vs. TAIR10
Match: ATMG00520.1 (Intron maturase, type II family protein)

HSP 1 Score: 71.2 bits (173), Expect = 3.0e-12
Identity = 88/372 (23.66%), Positives = 158/372 (42.47%), Query Frame = 0

Query: 300 HNPVKIYAVRYLDEILVITSGSKMLTMELKSQVLKYLEGNLELEVDRMNTAIHSAVSEKI 359
           H  ++I   RY D++L+   G+  L +E++ ++  +L+  L L V    +   +A S  +
Sbjct: 311 HYLIRICYARYADDLLLGIVGAVELLIEIQKRIAHFLQSGLNLWVGSAGSTTIAARS-TV 370

Query: 360 SFLGMELQAVPPSVLHPPMSEKAIRARKKYLRQKEVRAIELRNARERNRKKLGLKI-LSH 419
            FLG  ++ VPP    P    + +  R +   +  + A  LR+A     + LG  I +  
Sbjct: 371 EFLGTVIREVPPRTT-PIQFLRELEKRLRVKHRIHITACHLRSAIHSKFRNLGDSIPIKQ 430

Query: 420 VFKKLKQTSGFKSEFQIEKEVREIFRNWADEVVQDFFESSEDHAEWHRVLSAGDFLSLKH 479
           + K + +T   +   Q+           A+ +      S +    W  V         KH
Sbjct: 431 LTKGMSKTGSLQDGVQL-----------AETLGTAGVRSPQVSVLWGTV---------KH 490

Query: 480 IRNQL-------PEDLVNAYDRFQDQINKHLNPVKAKKEKAREDEEKRVEEEELYAKRTV 539
           IR               NA    Q  +++     +          +   E    +A    
Sbjct: 491 IRQGSRGISFLHSSGRSNASSDVQQVVSRSGTHARKLSLYTPPGRKAAGEGGGHWAGSIS 550

Query: 540 EDLTRLCIKVDAPIELVRKAVKMVGFTNKMGRPQPI--SSLIALEDTDIIKWYSGVGRRW 599
            +     IK++API+ + + ++  G  ++  RP PI  + L  + D DI+ W +G+    
Sbjct: 551 SEFP---IKIEAPIKKILRRLRDRGIISRR-RPWPIHVACLTNVSDEDIVNWSAGIAISP 610

Query: 600 LDFFCCCHNYKMVKTVVTYHLRFSCILTLAEKHESTKREAMKHYSKDLKVVDLNGNEEI- 659
           L ++ C  N   V+T+V + +R+S I TLA KH+S+    +  YSKD  +V+  G + + 
Sbjct: 611 LSYYRCRDNLYQVRTIVDHQIRWSAIFTLAHKHKSSAPNIILKYSKDSNIVNQEGGKILA 656

Query: 660 HFPTEREVKMLG 661
            FP   E+  LG
Sbjct: 671 EFPNSIELGKLG 656

The following BLAST results are available for this feature:
Match Name      E-value   Identity (%)  Description
XP_011650528.1  0.0e+00   93.22         PREDICTED: uncharacterized protein LOC101217546 [Cucumis sativus]
XP_008463418.1  0.0e+00   92.18         PREDICTED: uncharacterized protein LOC103501588 [Cucumis melo] >XP_016903029.1 P...
XP_022976163.1  0.0e+00   88.83         nuclear intron maturase 3, mitochondrial [Cucurbita maxima]
XP_022942578.1  0.0e+00   88.44         nuclear intron maturase 3, mitochondrial [Cucurbita moschata]
XP_022155823.1  0.0e+00   86.79         nuclear intron maturase 3, mitochondrial [Momordica charantia] >XP_022155825.1 n...

Match Name                      E-value   Identity (%)  Description
tr|A0A1S4E4Y1|A0A1S4E4Y1_CUCME  0.0e+00   92.18         uncharacterized protein LOC103501588 OS=Cucumis melo OX=3656 GN=LOC103501588 PE=...
tr|A0A2I4ECV4|A0A2I4ECV4_9ROSI  2.7e-305  71.25         uncharacterized protein LOC108988417 OS=Juglans regia OX=51240 GN=LOC108988417 P...
tr|F6HLP1|F6HLP1_VITVI          1.5e-300  69.06         Uncharacterized protein OS=Vitis vinifera OX=29760 GN=VIT_08s0007g05780 PE=4 SV=...
tr|A0A2P6RTE0|A0A2P6RTE0_ROSCH  6.4e-291  68.62         Putative reverse transcriptase domain, domain X OS=Rosa chinensis OX=74649 GN=Rc...
tr|A0A2N9I3C4|A0A2N9I3C4_FAGSY  1.2e-289  73.30         Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS47064 PE=4 SV=1

Match Name             E-value   Identity (%)  Description
sp|Q9LZA5|NMAT3_ARATH  2.7e-228  57.24         Nuclear intron maturase 3, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT...
sp|Q9CA78|NMAT4_ARATH  1.9e-56   26.48         Nuclear intron maturase 4, mitochondrial OS=Arabidopsis thaliana OX=3702 GN=NMAT...
sp|P03876|AI2M_YEAST   7.6e-13   24.74         Putative COX1/OXI3 intron 2 protein OS=Saccharomyces cerevisiae (strain ATCC 204...
sp|P38478|YMF40_MARPO  1.1e-11   23.69         Uncharacterized mitochondrial protein ymf40 OS=Marchantia polymorpha OX=3197 GN=...
sp|B1N1A3|NICA_PSEPU   2.5e-11   26.89         Putative nicotine oxidoreductase OS=Pseudomonas putida OX=303 GN=nicA PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT5G04050.2  5.4e-203  52.87         RNA-directed DNA polymerase (reverse transcriptase)
AT1G74350.1  1.1e-57   26.48         Intron maturase, type II family protein
ATMG00520.1  3.0e-12   23.66         Intron maturase, type II family protein
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0006397  mRNA processing
Vocabulary: INTERPRO
Term        Definition
IPR000477   RT_dom
IPR024937   Domain_X
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006397      mRNA processing
biological_process  GO:0022904      respiratory electron transport chain
biological_process  GO:0032259      methylation
biological_process  GO:0006278      RNA-dependent DNA biosynthetic process
biological_process  GO:0000373      Group II intron splicing
biological_process  GO:0090615      mitochondrial mRNA processing
cellular_component  GO:0016021      integral component of membrane
cellular_component  GO:0016020      membrane
cellular_component  GO:0005739      mitochondrion
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0009055      electron carrier activity
molecular_function  GO:0016491      oxidoreductase activity
molecular_function  GO:0008168      methyltransferase activity
molecular_function  GO:0003964      RNA-directed DNA polymerase activity
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name       Unique Name        Type
Cla97C10G189660.1  Cla97C10G189660.1  mRNA


Analysis Name: InterPro Annotations of watermelon 97103 v2
Date Performed: 2019-05-12
IPR Term   IPR Description               Source   Source Term    Source Description                                 Alignment
None       No IPR available              COILS    Coil           Coil                                               coord: 505..525
None       No IPR available              PANTHER  PTHR33642      FAMILY NOT NAMED                                   coord: 44..762
None       No IPR available              PANTHER  PTHR33642:SF1  RNA-DIRECTED DNA POLYMERASE REVERSE TRANSCRIPTASE  coord: 44..762
None       No IPR available              CDD      cd01651        RT_G2_intron                                       coord: 141..366; e-value: 4.68011E-38; score: 140.799
IPR024937  Domain X                      PFAM     PF01348        Intron_maturas2                                    coord: 541..646; e-value: 9.4E-16; score: 57.9
IPR000477  Reverse transcriptase domain  PFAM     PF00078        RVT_1                                              coord: 143..365; e-value: 9.8E-10; score: 38.2
IPR000477  Reverse transcriptase domain  PROSITE  PS50878        RT_POL                                             coord: 114..366; score: 9.06
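
Several of the domain-level hits above (CDD, PFAM, PROSITE) cover overlapping stretches of the protein. A minimal sketch, assuming the `coord: START..END` notation used in this table is 1-based and inclusive, of how the spans could be read and merged into non-redundant regions. `parse_coord` and `merge_spans` are hypothetical helper names introduced for illustration only.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from a 'coord: START..END' fragment."""
    match = COORD_RE.search(text)
    if match is None:
        raise ValueError(f"no coordinate found in: {text!r}")
    return int(match.group(1)), int(match.group(2))


def merge_spans(spans):
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous region instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Coordinates copied from the CDD, PFAM and PROSITE rows of the table.
hits = ["coord: 141..366", "coord: 541..646", "coord: 143..365", "coord: 114..366"]
regions = merge_spans(parse_coord(h) for h in hits)
for start, end in regions:
    print(f"{start}..{end} ({end - start + 1} residues)")
# prints:
# 114..366 (253 residues)
# 541..646 (106 residues)
```

Under these assumptions the three reverse-transcriptase hits collapse into a single region, with the Domain X hit remaining separate further along the protein.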

The following gene(s) are paralogous to this gene:

None