CSPI04G10980 (gene) Wild cucumber (PI 183967)

Name: CSPI04G10980
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Intron maturase, type II family protein
Location: Chr4 : 9294372 .. 9297350 (+)
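The location field can be turned into a feature length with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(text):
    """Parse a string such as 'Chr4 : 9294372 .. 9297350 (+)'.

    Returns (chromosome, start, end, strand). The field layout is taken
    from the record above; any label before the chromosome name must be
    stripped by the caller.
    """
    m = re.match(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both
    # endpoints count towards the length.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr4 : 9294372 .. 9297350 (+)")
print(chrom, strand, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 2979 bp on the plus strand of Chr4.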
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS, three_prime_UTR
ATGCGGAACTTTTCAAATTTGCAAAGTGTAAATGTTTGCATATTCAATTCATCCTTTGTTTCTGACATTGGTAAATATCCAATCTACTCAGCTGCTTCTTGGTTAGATTCCAGTTTCTTTTTTCAGTGTTCATGGTATGAAATGTTTGATTTTATTCTTTGTGATGTTCCGAATTTGATAGAATGCAATGATGCAATGAACCTTCTCTTCTGCAATTGATCTTTACTTTCTTGTTGAATAATCCGTTTAAAAGTTTGAAAATCAAACATTTTATTTGTATTTTTTCTTTTAATATCTGGTAAATTATGCGACTTGTACGTGATTGTCTGTGGACCTTTGTTACTTTTGACAGGAAAGTGTGTTCAAATAGTTCAGGGTTCTGAAAATTATTCAACTCTCGCCCGTGCTGAAATTGACAAGGGAATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTTAAGAGATCAATTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCCAATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAGAAGTATATATTTGGCCGGAGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGGAGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAAAGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTGATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAATTTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTTGCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAA
TTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTTCTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAATTGAGTAAGATTTACTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTGTATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGAATTGTTTTTGTTCTGCTTATGATTTCATTTAACTTCTAAATTACTTGTTGAGATAAGATTTGCCAAGAGAAATGCCATGACTGAGCCTTCGTGATGCATAATTTGTGCTAGCATAATGATTTTCTGGTCTCAATGGTGTCTGAAATCCTAATTGGAATGTTGCGCCGATTGAACATTGTTAAAACTAATGTGAGGACCAGGCTGGAACAACCCAATGTGAGAATCAGGTTTGAACAACGTTCAGTTGAATGGGTGATGTTTATATCGATATCTTGGAAACCTTTTCAATTGCTCGGATGAAGAATATTAGTATGGTGAGGAGTACATGCCAATGGT

mRNA sequence

ATGCGGAACTTTTCAAATTTGCAAAGTGTAAATGTTTGCATATTCAATTCATCCTTTGTTTCTGACATTGGAAAGTGTGTTCAAATAGTTCAGGGTTCTGAAAATTATTCAACTCTCGCCCGTGCTGAAATTGACAAGGGAATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTTAAGAGATCAATTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCCAATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAGAAGTATATATTTGGCCGGAGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGGAGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAAAGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTGATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAATTTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTTGCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTTCTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAATTGAGTAAGATTTA
CTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTGTATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGA

Coding sequence (CDS)

ATGCGGAACTTTTCAAATTTGCAAAGTGTAAATGTTTGCATATTCAATTCATCCTTTGTTTCTGACATTGGAAAGTGTGTTCAAATAGTTCAGGGTTCTGAAAATTATTCAACTCTCGCCCGTGCTGAAATTGACAAGGGAATGGAGAGAATGAAACTGGCCATAAACTTGGCCTCGCTTGTTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTTAAGAGATCAATTGAAATTCGGATTAAGGAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAATGTAATTGCCTGCCCCAATACTCTTCAAAATGTTTACGACTGTATTAGAATTAACTCAAATGTTGACATTAAGTCGAATGATCGTTTGATCTCATTTGAATCTATGGCTGAAGAGCTTTCTAATGGTAATTTTGATGTCAATACCAATACTTTCTCCATATTAAGTTCAAGAAAAGAAGTACTAATTTTACCAAAGATAAAGTTGAAGGTTCTTCAGGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGTTGTCGAAGTGGAAGAGGACACTCAACAGCATTGAAGTACATCAAAAAAGAGATAAAAGATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTGATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGAGGACCCCAAATTATTTGCTGTTATCAGAAGTATATATTTGGCCGGAGCACTGAATTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGAGTTCTGTCTCCTATATTAACGAACATTTATCTAAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATACGAAGCTATTAATGAGTATGGTAATACTGGTCAAGATGGGTCACAATCAAGGCTACGGAGTTGGTTTAGGAGACAATTGAAAGGAAATAATTCTGATTATTCAGGTGAGGAGAAAGACAAGATAAGAGTATATTGTTGTCGCTATATGGATGAAATCTTTTTAGCAGTATCAGGTTCTAAAGATGTTGCTCATAGTTTTAGGTCTGAGATTTTTGATTTCGTGCAGAAGACTTTGCATTTGGACGTTAACCGTGAAGAGGAAATGGTATCATGTGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCTGCTGTAAAATCCATCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGGTTTACAAAAGCAGGAGACTTGGAATGCTTGGACAGTGTGGTTGGGAAAGAAATGGCTTGCTCATGGTTTGAAGAAGGTTAAAGAGTCTGAGATTAAGCATTTAGCTAAAAATAGCTCTTTAAATAAAATTTCCAGTTTTCGTAAACCTGGAATGGAAACTGATCACTGGTACAAGGTTCTGTTGAAAATTTGGATGCAAGATCTAAATGCAAGAGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGCATGCAGTGGAACTTTCTCTTCCTTTTGAACTTCGAGATTCCTTTTATGAATTCCAAAGGCATGTCAAAGAATACATTTCTTCTGAGACAGCGTCTACTCTTGCCCTTTTACCAAATTATGACCCTTCTGCCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATCAGAAAACGACTTTTGCGATATAGATTAGTCACAAATAAAGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACCCAAATTATTGACTGGTTTGTAGGAGTATCTCGTCGTTTGTTTAGATGGTACAACAATTCTTCTAACTTCAGCGAGTTGTTCTTAATTTTCGATCAAGTTAGGAAATCTTGTATCCGAACGCTAGCAGCAAAGCACCGGATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAATTGAGTAAGATTTA
CTCCTCTTCTGAAATAGATCAAGAAAAAGAGAAGTCAACAGATACCCATGTTTTAGACCACGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTTTGTGTTTGCTATCTCTTGCTAGAATGGTCAGCCAATCTCGTCCTTGCAATTGTTTCGTCATTGGGTGTTTGGCTCCTGCACCAAGTGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCGGGATGGAAGACTGGGTTCTCGAGTTCCATTCATCCTAGCTTGAACAAACGACGATTTGGGTTATGCAAACAACATTTGGCAGATTTGTATTTGGGTCGCATTTCTTTGCAATCTGTTGATTTTGGTGCATGGAAGTGA
BLAST of CSPI04G10980 vs. Swiss-Prot
Match: LTRA_LACLM (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (strain MG1363) GN=ltrA PE=1 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 2.8e-23
Identity = 84/299 (28.09%), Positives = 141/299 (47.16%), Query Frame = 1

Query: 160 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDP 219
           S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK IK+E    
Sbjct: 94  SKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGA 153

Query: 220 DWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGH-GL 279
            W+   D+    D +    LI ++  KI+D K+  +I     AG   LE   + K + G 
Sbjct: 154 RWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSGT 213

Query: 280 PQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQ 339
           PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      ++
Sbjct: 214 PQGGILSPLLANIYLHELDKFVLQLKMKFDR----------ESPERITPEYRELHNEIKR 273

Query: 340 LKGNNSDYSGEEKDKIR----------------------VYCCRYMDEIFLAVSGSKDVA 399
           +        GEEK K+                       +   RY D+  ++V GSK+  
Sbjct: 274 ISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDC 333

Query: 400 HSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK 430
              + ++  F+   L ++++ E+ +++  +   RFLG  +R  V+ S  +K   K+K++
Sbjct: 334 QWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378
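The percentages in each HSP summary line are the matched count divided by the alignment length. The sketch below rechecks them for the hit above. It is illustrative only: `check_hsp_line` is a hypothetical helper, and the regular expression assumes the summary line keeps the exact layout shown in this report.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" also accepts the misspelling "Postives" that some report
# generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_line(line):
    """Recompute identity and positive percentages from an HSP summary line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    identity = round(100 * int(ident) / int(ident_len), 2)
    positives = round(100 * int(pos) / int(pos_len), 2)
    return {
        "identity_pct": identity,
        "positives_pct": positives,
        # True when the recomputed value agrees with the printed one.
        "identity_ok": abs(identity - float(ident_pct)) < 0.005,
        "positives_ok": abs(positives - float(pos_pct)) < 0.005,
    }


line = "Identity = 84/299 (28.09%), Positives = 141/299 (47.16%), Query Frame = 1"
print(check_hsp_line(line))
```

For this HSP, 84/299 and 141/299 reproduce the printed 28.09% and 47.16%.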

BLAST of CSPI04G10980 vs. Swiss-Prot
Match: LTRA_LACLC (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris GN=ltrA PE=1 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 2.8e-23
Identity = 84/299 (28.09%), Positives = 141/299 (47.16%), Query Frame = 1

Query: 160 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDP 219
           S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   TALK IK+E    
Sbjct: 94  SKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGA 153

Query: 220 DWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGH-GL 279
            W+   D+    D +    LI ++  KI+D K+  +I     AG   LE   + K + G 
Sbjct: 154 RWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSGT 213

Query: 280 PQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFR------RQ 339
           PQ G+LSP+L NIYL+  D+   +L MK++            S  R+   +R      ++
Sbjct: 214 PQGGILSPLLANIYLHELDKFVLQLKMKFDR----------ESPERITPEYRELHNEIKR 273

Query: 340 LKGNNSDYSGEEKDKIR----------------------VYCCRYMDEIFLAVSGSKDVA 399
           +        GEEK K+                       +   RY D+  ++V GSK+  
Sbjct: 274 ISHRLKKLEGEEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDC 333

Query: 400 HSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKEK 430
              + ++  F+   L ++++ E+ +++  +   RFLG  +R  V+ S  +K   K+K++
Sbjct: 334 QWIKEQLKLFIHNKLKMELSEEKTLITHSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378

BLAST of CSPI04G10980 vs. Swiss-Prot
Match: NICA_PSEPU (Putative nicotine oxidoreductase OS=Pseudomonas putida GN=nicA PE=3 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 2.1e-18
Identity = 78/282 (27.66%), Positives = 130/282 (46.10%), Query Frame = 1

Query: 174 KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDE 233
           KV+QE IR +LE ++ P FSK SHG R+G+   TALK +++      W    D+    D 
Sbjct: 109 KVVQEVIRSILEAIYEPTFSKNSHGFRAGKSCHTALKQVRESWSGVTWVIEGDIKGCFDN 168

Query: 234 LVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGH-GLPQEGVLSPILTNIY 293
           +  +KLI  +  +I+D +   +IR    AG    E G F     G PQ  ++SPIL N++
Sbjct: 169 ISHSKLIDQLRLRIKDERFINLIRKALNAG--YFENGAFFSATLGTPQGSIISPILANVF 228

Query: 294 LNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDK----- 353
           L+  D++  +L +K     E G+   D +  +L+   +  L+       G E+D      
Sbjct: 229 LDQLDRKVEQL-IKDHHQGEEGDKITDPAYRKLQRQ-KTSLRKKAEKQEGAERDATLSLA 288

Query: 354 --------------------IRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKT-LH 413
                               IRV   RY D+  + V+G K +A   RS + +F++   L 
Sbjct: 289 REANSKLLSMSPYLTRNNGFIRVKYVRYADDWIIGVNGPKLLAEELRSVVGEFLENAGLE 348

Query: 414 LDVNREEEMVSCETHGIRFLGCLVRRSVQESPAVKSIHKLKE 429
           L + +   +   ++   +FLG  +R   + S  +K +   K+
Sbjct: 349 LSIEK-THIRHAKSETAKFLGTNLRIGSENSKIMKVLRNGKK 385

BLAST of CSPI04G10980 vs. Swiss-Prot
Match: YMF40_MARPO (Uncharacterized mitochondrial protein ymf40 OS=Marchantia polymorpha GN=YMF40 PE=3 SV=1)

HSP 1 Score: 91.3 bits (225), Expect = 5.2e-17
Identity = 70/258 (27.13%), Positives = 121/258 (46.90%), Query Frame = 1

Query: 168 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDL 227
           +P  + K++QE +R +LE VF P F   SHG R  R   TAL+ I++      W    D+
Sbjct: 80  IPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRSPHTALRQIRR-WTGTSWMIEGDI 139

Query: 228 SKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGH---GLPQEGVL 287
               D +    L   + + ++D +L A+   +  AG +N    G  + H   G+PQ  +L
Sbjct: 140 KGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAGYVN---QGKAEPHLLTGVPQGRIL 199

Query: 288 SPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSW-FRRQLKGNNSDYSGE 347
           SP+L+NIYL+ FD     + +KY              ++R + +   + LK ++++    
Sbjct: 200 SPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYLKARNKYYKLVKSLKASSAEIIRA 259

Query: 348 EKDKI----------RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREE 407
            +D +          RV   RY D+  + V+G K +A   + E+  F+Q+ L L +  E+
Sbjct: 260 RRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKALAVQIKEEVSTFLQEKLKLSLQAEK 319

Query: 408 -EMVSCETHGIRFLGCLV 411
             + +       FLG L+
Sbjct: 320 TRITNLSRSEALFLGTLI 333

BLAST of CSPI04G10980 vs. Swiss-Prot
Match: YMC6_SCHPO (Uncharacterized 91 kDa protein in cob intron OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPMIT.06 PE=3 SV=4)

HSP 1 Score: 90.9 bits (224), Expect = 6.7e-17
Identity = 76/270 (28.15%), Positives = 121/270 (44.81%), Query Frame = 1

Query: 160 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDP 219
           S  K  L +   + K++QE +RIVLE ++ P F+  SHG R GR   +AL+ I    K  
Sbjct: 304 SGGKRPLTIGSPRDKLVQEILRIVLEAIYEPLFNTASHGFRPGRSCHSALRSIFTNFKGC 363

Query: 220 DWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLP 279
            WW   D+    D +   KLI ++  KI+D +   +IR    AG L      +    G P
Sbjct: 364 TWWIEGDIKACFDSIPHDKLIALLSSKIKDQRFIQLIRKALNAGYLTENRYKYDI-VGTP 423

Query: 280 QEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSD 339
           Q  ++SPIL NIYL+  D+    L  +++         +  S+SR   +   + K  N+D
Sbjct: 424 QGSIVSPILANIYLHQLDEFIENLKSEFDYKGPIAR--KRTSESRHLHYLMAKAKRENAD 483

Query: 340 YSGEEKDKI---------------RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKT 399
                K  I               ++   RY D+  +AV+GS        ++I  F   +
Sbjct: 484 SKTIRKIAIEMRNVPNKIHGIQSNKLMYVRYADDWIVAVNGSYTQTKEILAKITCFC-SS 543

Query: 400 LHLDVN-REEEMVSCETHGIRFLGCLVRRS 414
           + L V+  + ++ +  T  I FLG  +  S
Sbjct: 544 IGLTVSPTKTKITNSYTDKILFLGTNISHS 569

BLAST of CSPI04G10980 vs. TrEMBL
Match: A0A0A0KWB0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G188380 PE=4 SV=1)

HSP 1 Score: 1565.1 bits (4051), Expect = 0.0e+00
Identity = 783/786 (99.62%), Positives = 784/786 (99.75%), Query Frame = 1

Query: 1   MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASL 60
           MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASL
Sbjct: 14  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASL 73

Query: 61  VEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCI 120
           VEESLDVDLRRSKTQMELKRS+EIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCI
Sbjct: 74  VEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCI 133

Query: 121 RINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAI 180
           RINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAI
Sbjct: 134 RINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAI 193

Query: 181 RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLI 240
           RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLI
Sbjct: 194 RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLI 253

Query: 241 TVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF 300
           TVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF
Sbjct: 254 TVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF 313

Query: 301 FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIF 360
           FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIF
Sbjct: 314 FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIF 373

Query: 361 LAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAV 420
           LAVSGSKDVAHSFRSEIF FVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAV
Sbjct: 374 LAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAV 433

Query: 421 KSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSF 480
           KSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSF
Sbjct: 434 KSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSF 493

Query: 481 RKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKE 540
           RKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKE
Sbjct: 494 RKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKE 553

Query: 541 YISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ 600
           YISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ
Sbjct: 554 YISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ 613

Query: 601 DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD 660
           DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD
Sbjct: 614 DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD 673

Query: 661 SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVI 720
           SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLS ARMVSQSRPCNCFVI
Sbjct: 674 SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFARMVSQSRPCNCFVI 733

Query: 721 GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSV 780
           GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSV
Sbjct: 734 GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSV 793

Query: 781 DFGAWK 787
           DFGAWK
Sbjct: 794 DFGAWK 799

BLAST of CSPI04G10980 vs. TrEMBL
Match: W9QLZ5_9ROSA (Group II intron-encoded protein ltrA OS=Morus notabilis GN=L484_020694 PE=4 SV=1)

HSP 1 Score: 986.9 bits (2550), Expect = 1.4e-284
Identity = 502/793 (63.30%), Positives = 624/793 (78.69%), Query Frame = 1

Query: 1   MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAE-IDKGMERMKLAINLAS 60
           +R  + +Q +N  +  SSF  D GK  + +Q   ++ST A A+ I+    +  LA NLAS
Sbjct: 14  LRKLATMQRINQILLYSSFFIDRGKSSERIQEPRHFSTAAAADAINMCSGKNTLATNLAS 73

Query: 61  LVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDC 120
           L+EES++VD R+  ++MELKRS+E R+K+RVK QY+NGKF +L+  VIA P TLQ+ Y+C
Sbjct: 74  LLEESVEVDERKPSSRMELKRSLEYRVKKRVKEQYVNGKFHNLLEKVIANPETLQDAYNC 133

Query: 121 IRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILS--SRKEVLILPKIKLKVLQ 180
           IR+NSNVDI  N+   SFES+ EEL  GNFDV  NT SI +  +RKEVL+LP +KLKV+Q
Sbjct: 134 IRLNSNVDIMLNNETTSFESVPEELFCGNFDVKANTVSISTRGARKEVLVLPNLKLKVIQ 193

Query: 181 EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMA 240
           EAIRIVLE V+RPHFSKISHGCRSGRGH TALK+IKK+I  P WW T+ ++KK+D  ++ 
Sbjct: 194 EAIRIVLEVVYRPHFSKISHGCRSGRGHFTALKFIKKDICAPIWWSTLIVNKKLDTCILD 253

Query: 241 KLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD 300
           KLI+V+E+KI DP LF++IRS++ +  +NLEFGGFPKGHGLPQEG+LSPIL NIYL+LFD
Sbjct: 254 KLISVLEEKIVDPGLFSIIRSMFESQVINLEFGGFPKGHGLPQEGILSPILMNIYLDLFD 313

Query: 301 QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMD 360
           +EF RLS+KYEA++         SQS+LRSWFRR LK  +   +GEEK  +RV+ CR+MD
Sbjct: 314 REFCRLSLKYEALDLDLEANHQKSQSKLRSWFRRNLKAKDLSGAGEEKFSLRVHSCRFMD 373

Query: 361 EIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQE 420
           EIFLAVSGSKD A  F+SEI ++++ +LHLDV+ E E++ C+  HGIRF+G LVRR+V+E
Sbjct: 374 EIFLAVSGSKDAALGFKSEIQNYLKNSLHLDVDDETELLPCDGLHGIRFMGTLVRRTVKE 433

Query: 421 SPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLA-KNSSLN 480
           SPA K+IHKLKEKVELF +QKQE W+  TV +GKKWL HGLKKVKESEI+HLA   S L+
Sbjct: 434 SPATKAIHKLKEKVELFAIQKQEAWDVGTVRIGKKWLGHGLKKVKESEIRHLADPESVLS 493

Query: 481 KISSFRKPGMETDHWYKVLLKIWMQDLNAR-AAESEEKILSKHAVELSLPFELRDSFYEF 540
           +IS FRK GMETDHWYK LLKIWMQD+ A+ AAE EE ILSK+  E +LP EL++SFY F
Sbjct: 494 QISHFRKAGMETDHWYKHLLKIWMQDIKAKAAAECEETILSKYVAEPALPQELKNSFYVF 553

Query: 541 QRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSS 600
           QRH +EY+SSETA T ALL + D S +P  IT+I AP+N+I+KRLLRY LVT KG+  + 
Sbjct: 554 QRHAQEYVSSETAFTCALLKSSDASTQPVIITQIFAPINAIKKRLLRYGLVTTKGYSRAC 613

Query: 601 PFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTLAAKHRIHES 660
             LILQD+ QIIDWF+G+ RR FRWY+   NFS++ FL+  Q+RKSCIRTLA+KH IHE+
Sbjct: 614 SCLILQDDNQIIDWFLGIVRRWFRWYSECDNFSDIKFLVCGQIRKSCIRTLASKHHIHET 673

Query: 661 EIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSR 720
           EIEK+FD+ELS+I SS +++QE      + V + DEAL YGISYSGLC LSLARMVSQSR
Sbjct: 674 EIEKRFDAELSRIPSSEDLEQEMVNDETSDVFEKDEALMYGISYSGLCALSLARMVSQSR 733

Query: 721 PCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLG 780
           PC CFV GC APA SVY+LHVMERQKFPGWKTGFSS IHPSLN+RRFGLCKQHL DLYLG
Sbjct: 734 PCTCFVTGCQAPAQSVYSLHVMERQKFPGWKTGFSSCIHPSLNRRRFGLCKQHLKDLYLG 793

Query: 781 RISLQSVDFGAWK 787
            ISLQS+DFG+WK
Sbjct: 794 HISLQSIDFGSWK 806

BLAST of CSPI04G10980 vs. TrEMBL
Match: F6I1Y2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0174g00300 PE=4 SV=1)

HSP 1 Score: 979.2 bits (2530), Expect = 3.0e-282
Identity = 500/765 (65.36%), Positives = 611/765 (79.87%), Query Frame = 1

Query: 30  VQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRIK 89
           +Q    YSTL     + DK + +  LA NLA L+EES +  + R   +MELKRS E+RIK
Sbjct: 1   MQACAVYSTLGAVSGDADKDIGKPTLAKNLAFLMEESSN-HVIRPMARMELKRSFELRIK 60

Query: 90  ERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNG 149
           +RVK QY+NGKF DLM  VIA P TL++ Y+CIRINSNVD+  +   ISF+SMAEEL  G
Sbjct: 61  KRVKEQYVNGKFQDLMVKVIANPQTLEDAYNCIRINSNVDLALDGDNISFKSMAEELLGG 120

Query: 150 NFDVNTNTFSIL--SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGH 209
           +F+VN NTFSI   S+RKEVLILP +KLKV+QEAIRIVLE V+RP+FSKISHGCRSGRGH
Sbjct: 121 SFNVNVNTFSISTKSARKEVLILPSLKLKVVQEAIRIVLEIVYRPYFSKISHGCRSGRGH 180

Query: 210 STALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGAL 269
           STALKYI KEI +PDWWF + ++KK+D +V+AKLI+ M+DKIEDP LF +I++++ A  L
Sbjct: 181 STALKYISKEISNPDWWFILHVNKKLDAVVLAKLISTMQDKIEDPNLFVMIQNMFHAQVL 240

Query: 270 NLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRL 329
           NLEFGGFPKGHGLPQEGVLSPIL NIYL+LFD EF+R+SM+YEA++       D S S+L
Sbjct: 241 NLEFGGFPKGHGLPQEGVLSPILMNIYLDLFDHEFYRMSMRYEALDPGMCIDHDKSHSKL 300

Query: 330 RSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTL 389
           RSWFRRQLKGN+  Y+G E    RV+ CR+MDEIF A+SGSKD+A  F+SEI +++Q +L
Sbjct: 301 RSWFRRQLKGNDVKYTGRESSNFRVHSCRFMDEIFFAISGSKDIAIEFKSEILNYMQNSL 360

Query: 390 HLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAW 449
           HLDV+ + E++ C   HGI+FLG LV+RSV+ESP V+++HKLKEKV LF  QKQE W+A 
Sbjct: 361 HLDVSNQSELLPCHGPHGIQFLGTLVKRSVRESPTVRAVHKLKEKVRLFASQKQEAWDAG 420

Query: 450 TVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNKISSFRKPGMETDHWYKVLLKIWMQDLN 509
           T+ +GKKWLAHGLKKVKESEI+HLA   S L++IS FRK GMETDHWYK+LLKIW+ D+ 
Sbjct: 421 TLRIGKKWLAHGLKKVKESEIRHLADTDSVLSQISCFRKTGMETDHWYKLLLKIWLHDVK 480

Query: 510 ARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPT 569
           A+AAE+E  ILSK+  E  LP ELRDSFYEFQ+  ++Y++SETAS LALLPN     +  
Sbjct: 481 AKAAENEGVILSKYIAEPLLPKELRDSFYEFQKRAEDYVASETASMLALLPNSKSCTESV 540

Query: 570 FITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNS 629
            I +IIAPVN I+KRLLRYRL   KG+PC+SP LILQD+ QI+DWF G++RR   WY+  
Sbjct: 541 PIIKIIAPVNVIKKRLLRYRLTNAKGYPCASPMLILQDDIQIVDWFSGLARRWLIWYSEC 600

Query: 630 SNFSELFLIF-DQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEK-EKSTD 689
            NFSE+ LI  DQ+RKSCIRTLAAK+R+HE+EIEK+ D+EL +I S+ EI+QEK  +++D
Sbjct: 601 DNFSEVKLIICDQLRKSCIRTLAAKYRLHETEIEKRSDTELCRIPSTLEIEQEKVNETSD 660

Query: 690 THVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFP 749
           +   D +EAL YGISYSGLCLLSLARMVSQSR CNCFV+GCLA APSVYTLHVMERQKFP
Sbjct: 661 SQASDTNEALMYGISYSGLCLLSLARMVSQSRRCNCFVMGCLAAAPSVYTLHVMERQKFP 720

Query: 750 GWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK 787
           GWKTGFSS IHPSLN RR GLCKQHL DLYLG ISLQS++FGAWK
Sbjct: 721 GWKTGFSSCIHPSLNGRRIGLCKQHLKDLYLGHISLQSIEFGAWK 764

BLAST of CSPI04G10980 vs. TrEMBL
Match: A0A061DIX4_THECC (Intron maturase isoform 1 OS=Theobroma cacao GN=TCM_001510 PE=4 SV=1)

HSP 1 Score: 964.9 bits (2493), Expect = 5.9e-278
Identity = 491/771 (63.68%), Positives = 609/771 (78.99%), Query Frame = 1

Query: 24  GKCVQIVQGSENYSTLA-RAEIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSI 83
           GK ++ +     YS+ +   ++    E+M LA +LA LVEES   D R++K++MELKRS+
Sbjct: 31  GKPIEKLHAWVCYSSFSTNGDLKGAHEKMTLAKDLACLVEESSHQDERKAKSRMELKRSL 90

Query: 84  EIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAE 143
           E+R+K+RVK QYLNG F +LM  VIA P TLQ+ Y+CIR+NSNVDI      + F+SMAE
Sbjct: 91  ELRVKKRVKEQYLNGNFHNLMAKVIANPATLQDAYNCIRLNSNVDISVKHDSVCFKSMAE 150

Query: 144 ELSNGNFDVNTNTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR 203
           EL  G+FDV  NTFS+ +  + KEVL+LP +K++++QEAIRIVLE V++PHFSKISHGCR
Sbjct: 151 ELLEGSFDVKANTFSVSTRGASKEVLVLPNLKMRIVQEAIRIVLEVVYKPHFSKISHGCR 210

Query: 204 SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIY 263
           SGR HSTAL+YI KEI  P WWFT+ L+KK+D  ++AKLI+ ++DK+ED +L A I+S++
Sbjct: 211 SGRDHSTALRYISKEIASPSWWFTLILNKKVDSSILAKLISKLQDKVEDNQLLATIQSMF 270

Query: 264 LAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDG 323
            A  LN EFGGFPKGHGLPQEGVLSPIL NIYL+LFDQEF+RLSM+YEA++   +  +D 
Sbjct: 271 DAQVLNFEFGGFPKGHGLPQEGVLSPILMNIYLHLFDQEFYRLSMRYEALHPGFDKDEDM 330

Query: 324 SQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDF 383
           S S+LR+WFRRQLK N+  Y+  +    RV+CCR+MDEIF A+SGSKDVA SF+SEI DF
Sbjct: 331 SYSKLRNWFRRQLKENDVKYTVNDDSSPRVHCCRFMDEIFFAISGSKDVALSFKSEIVDF 390

Query: 384 VQKTLHLDVNREE-EMVSC-ETHGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQ 443
            + +L LDV+ E+ E++ C E++GIRFLG LVRRSVQE PA +++HKLKEKV+LF  QKQ
Sbjct: 391 FKNSLELDVDDEQTEILPCNESNGIRFLGALVRRSVQEGPATRAVHKLKEKVKLFASQKQ 450

Query: 444 ETWNAWTVWLGKKWLAHGLKKVKESEIKHLA-KNSSLNKISSFRKPGMETDHWYKVLLKI 503
           + WNA TV +G+KWLAHGLKKVKESEI+HLA   S+L+KIS FRK GMETDHWYKVL KI
Sbjct: 451 DAWNAGTVGIGRKWLAHGLKKVKESEIEHLADSGSTLSKISCFRKAGMETDHWYKVLTKI 510

Query: 504 WMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYD 563
           WMQD+ A+AAE+EE ILSK  VE +LP EL++S+YEF +   EY+ SETA+TLALLPN  
Sbjct: 511 WMQDIKAKAAENEESILSKCVVEPALPQELKESYYEFLKRANEYVYSETAATLALLPNSS 570

Query: 564 PSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLF 623
            +A    ITEIIAPVN+I+KRLLRY L T++G+P     L+LQDN QIIDWF G+  R  
Sbjct: 571 SNAGSVAITEIIAPVNAIKKRLLRYGLTTSEGYPRVVSLLVLQDNFQIIDWFSGIVCRWL 630

Query: 624 RWYNNSSNFSELFLIFDQV-RKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQE- 683
           RWY    NF+E+ LI   + RKSCIRTLAAK+RIHESEIEK+FDSEL +I S+ E++QE 
Sbjct: 631 RWYRECDNFNEIKLIISTILRKSCIRTLAAKYRIHESEIEKQFDSELCRIPSTEEVEQEL 690

Query: 684 KEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVM 743
             +++D+H  D DEAL YGISYSGLCLLSLARMVSQSRPCNCFV+GC   APSVYTLH M
Sbjct: 691 TYETSDSHSFDSDEALMYGISYSGLCLLSLARMVSQSRPCNCFVMGCSMAAPSVYTLHAM 750

Query: 744 ERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK 787
           ERQKFPGWKTGFSS IHPSLNKRR GLCK+HL DLYLG ISLQS++FGAWK
Sbjct: 751 ERQKFPGWKTGFSSCIHPSLNKRRIGLCKKHLKDLYLGHISLQSINFGAWK 801

BLAST of CSPI04G10980 vs. TrEMBL
Match: A0A067JY85_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25078 PE=4 SV=1)

HSP 1 Score: 943.7 bits (2438), Expect = 1.4e-271
Identity = 484/771 (62.78%), Positives = 601/771 (77.95%), Query Frame = 1

Query: 24  GKCVQIVQGSENYSTLARAE--IDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRS 83
           G+ ++I+Q   NYSTLA      D    ++ LA NLA ++EES +V+ RR K++MELKRS
Sbjct: 46  GRPLEILQLWANYSTLAEVNDSFDNDAGKITLAKNLAFVLEESSNVNDRRPKSRMELKRS 105

Query: 84  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMA 143
           +E+RIK+RVK Q+LNGKF DL+  VIA P TLQ+ Y+CIR+N+NVDI S+    SFES+A
Sbjct: 106 LELRIKKRVKEQFLNGKFRDLITKVIANPETLQDAYNCIRLNANVDIASDKDDTSFESVA 165

Query: 144 EELSNGNFDVNTNTFSILSS--RKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC 203
           EELSNG+FD++ NTFSI +   RKE+L+LPK+KLKV+QEA+RI LE V+RPHFSKISHGC
Sbjct: 166 EELSNGSFDISANTFSISTKGVRKEILVLPKLKLKVVQEALRIALEVVYRPHFSKISHGC 225

Query: 204 RSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSI 263
           RSGRGH +ALKYI KEI +PDWWFT+ +SKK+D  V+ KLI++MEDKIEDP L+ +I+ +
Sbjct: 226 RSGRGHHSALKYISKEISNPDWWFTLTISKKLDACVLDKLISIMEDKIEDPCLYDIIQGM 285

Query: 264 YLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQD 323
             A  LN+EFGG+PKGHGLPQEGVLSPIL NIYLN+FD E +RLSMKYEA++   +    
Sbjct: 286 DAAKVLNMEFGGYPKGHGLPQEGVLSPILMNIYLNVFDHEIYRLSMKYEALSSAFHLEGG 345

Query: 324 GSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFD 383
              S+LR WFRRQLKGN    +GE     +++ CR+MDE+F AVSGSKD+A  F SEI  
Sbjct: 346 QLNSKLRRWFRRQLKGNGLKTTGEVNSCPKIHSCRFMDELFFAVSGSKDIALGFMSEIMG 405

Query: 384 FVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQ 443
           ++Q TL LDV  + E+  C     IRFLG L+RR V++SPAV+++HKL+EKV+LF  QKQ
Sbjct: 406 YLQNTLLLDVTGKMEVAPCAGPQVIRFLGTLLRRRVKDSPAVRAVHKLREKVKLFASQKQ 465

Query: 444 ETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNKISSFRKPGMETDHWYKVLLKI 503
           E W+  T+ +GKKWLAHGL+KVKESEIKHLA +SS L++IS FRK GMETDHWYK+L+KI
Sbjct: 466 EAWDVGTIRIGKKWLAHGLRKVKESEIKHLADSSSVLSQISCFRKVGMETDHWYKLLIKI 525

Query: 504 WMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYD 563
           WMQD+ A+AAESEE ILSK+  E +LP ELRDSF+EFQ+    Y++SETA+TLALLPN  
Sbjct: 526 WMQDITAKAAESEEFILSKYIAEPALPKELRDSFHEFQKRANGYVNSETATTLALLPNSS 585

Query: 564 PSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLF 623
            S +   ITEIIAPVN I+KRLLRY L+T  GH C +  LILQD   II WF G+ RR  
Sbjct: 586 SSTE--MITEIIAPVNVIKKRLLRYGLITPAGHSCVNRQLILQDKAHIIYWFSGIVRRWQ 645

Query: 624 RWYNNSSNFSELFLIFD-QVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEK 683
           RWY +  NF++L LI   QV KSCIRTLAAK+RIHE E+EK+FD EL+ I S+ +I +E+
Sbjct: 646 RWYGDCKNFADLELIIKFQVWKSCIRTLAAKYRIHEDEVEKRFDLELNNILSTQDIKEER 705

Query: 684 E-KSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVM 743
           E +++++   D+DE L YGISYSGLCLLSLARMVSQSRPCNCFV+GC A APSVYTLHVM
Sbjct: 706 ENQASNSLAFDNDEMLTYGISYSGLCLLSLARMVSQSRPCNCFVMGCSAAAPSVYTLHVM 765

Query: 744 ERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK 787
           ERQKFPGWKTGFS+ IHPSLN RR GLC QHL D Y+G ISLQS+DF +WK
Sbjct: 766 ERQKFPGWKTGFSTCIHPSLNGRRIGLCNQHLKDFYVGDISLQSIDFSSWK 814

BLAST of CSPI04G10980 vs. TAIR10
Match: AT1G74350.1 (AT1G74350.1 Intron maturase, type II family protein)

HSP 1 Score: 845.1 bits (2182), Expect = 3.4e-245
Identity = 439/743 (59.08%), Positives = 549/743 (73.89%), Query Frame = 1

Query: 53  LAINLASLVEESLDV--DLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACP 112
           LA  LASLVEES     D  + +++MELKRS+E+R+K+RVK Q +NGKF DL+  VIA P
Sbjct: 11  LAGELASLVEESSSHVDDDSKPRSRMELKRSLELRLKKRVKEQCINGKFSDLLKKVIARP 70

Query: 113 NTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILS--SRKEVLIL 172
            TL++ YDCIR+NSNV I   +  ++F+S+AEELS+G FDV +NTFSI++    KEVL+L
Sbjct: 71  ETLRDAYDCIRLNSNVSITERNGSVAFDSIAEELSSGVFDVASNTFSIVARDKTKEVLVL 130

Query: 173 PKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLS 232
           P + LKV+QEAIRIVLE VF PHFSKISH CRSGRG ++ALKYI   I   DW FT+ L+
Sbjct: 131 PSVALKVVQEAIRIVLEVVFSPHFSKISHSCRSGRGRASALKYINNNISRSDWCFTLSLN 190

Query: 233 KKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPIL 292
           KK+D  V   L++VME+K+ED  L  ++RS++ A  LNLEFGGFPKGHGLPQEGVLS +L
Sbjct: 191 KKLDVSVFENLLSVMEEKVEDSSLSILLRSMFEARVLNLEFGGFPKGHGLPQEGVLSRVL 250

Query: 293 TNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKI 352
            NIYL+ FD EF+R+SM++EA+     T +D   S+LRSWFRRQ        + E+   +
Sbjct: 251 MNIYLDRFDHEFYRISMRHEALGLDSKTDEDSPGSKLRSWFRRQAGEQGLKSTTEQDVAL 310

Query: 353 RVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLG 412
           RVYCCR+MDEI+ +VSG K VA   RSE   F++ +LHLD+  E +   CE T G+R LG
Sbjct: 311 RVYCCRFMDEIYFSVSGPKKVASDIRSEAIGFLRNSLHLDITDETDPSPCEATSGLRVLG 370

Query: 413 CLVRRSVQESPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKH 472
            LVR++V+ESP VK++HKLKEKV LF LQK+E W   TV +GKKWL HGLKKVKESEIK 
Sbjct: 371 TLVRKNVRESPTVKAVHKLKEKVRLFALQKEEAWTLGTVRIGKKWLGHGLKKVKESEIKG 430

Query: 473 LA-KNSSLNKISSFRKPGMETDHWYKVLLKIWMQD-LNARAAESEEKILSKHAVELSLPF 532
           LA  NS+L++IS  RK GMETDHWYK+LL+IWM+D L   A  SEE +LSKH VE ++P 
Sbjct: 431 LADSNSTLSQISCHRKAGMETDHWYKILLRIWMEDVLRTSADRSEEFVLSKHVVEPTVPQ 490

Query: 533 ELRDSFYEFQRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLV 592
           ELRD+FY+FQ     Y+SSETA+  ALLP      +P F  +++AP N+I +RL RY L+
Sbjct: 491 ELRDAFYKFQNAAAAYVSSETANLEALLPCPQSHDRPVFFGDVVAPTNAIGRRLYRYGLI 550

Query: 593 TNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTL 652
           T KG+  S+  LIL D  QIIDW+ G+ RR   WY   SNF E+  LI +Q+R SCIRTL
Sbjct: 551 TAKGYARSNSMLILLDTAQIIDWYSGLVRRWVIWYEGCSNFDEIKALIDNQIRMSCIRTL 610

Query: 653 AAKHRIHESEIEKKFDSELSKIYSSSEIDQE-KEKSTDTHVLDHDEALKYGISYSGLCLL 712
           AAK+RIHE+EIEK+ D ELS I S+ +I+QE + +  D+   D DE L YG+S SGLCLL
Sbjct: 611 AAKYRIHENEIEKRLDLELSTIPSAEDIEQEIQHEKLDSPAFDRDEHLTYGLSNSGLCLL 670

Query: 713 SLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLC 772
           SLAR+VS+SRPCNCFVIGC   AP+VYTLH MERQKFPGWKTGFS  I  SLN RR GLC
Sbjct: 671 SLARLVSESRPCNCFVIGCSMAAPAVYTLHAMERQKFPGWKTGFSVCIPSSLNGRRIGLC 730

Query: 773 KQHLADLYLGRISLQSVDFGAWK 787
           KQHL DLY+G+ISLQ+VDFGAW+
Sbjct: 731 KQHLKDLYIGQISLQAVDFGAWR 753

BLAST of CSPI04G10980 vs. TAIR10
Match: AT5G04050.2 (AT5G04050.2 RNA-directed DNA polymerase (reverse transcriptase))

HSP 1 Score: 183.0 bits (463), Expect = 7.4e-46
Identity = 140/490 (28.57%), Positives = 230/490 (46.94%), Query Frame = 1

Query: 82  IEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRL---ISFE 141
           ++  ++  V  QY +GKF  L+ N ++ P  L      + +++N      DR+    S E
Sbjct: 45  VKSELEALVLKQYSHGKFYSLVKNAVSLPCVLLAACQNLSLSANSSGDLADRVSRRFSIE 104

Query: 142 SMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHG 201
            M  E+  G FD+ +     +SS    L+LP +KLKVL EAIR+VLE V+   F+  S+G
Sbjct: 105 EMGREIREGRFDIRSCCVEFISSS---LVLPNLKLKVLIEAIRMVLEIVYDDRFATFSYG 164

Query: 202 CRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKM-DELVMAKLITVMEDKIEDPKLFAVIR 261
            R G G  TA++Y+K  +++P WWF V  +++M +E  +  L   + +KI D  L  +I+
Sbjct: 165 GRVGMGRHTAIRYLKNSVENPRWWFRVSFAREMFEERNVDILCGFVGEKINDVMLIEMIK 224

Query: 262 SIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTG 321
            ++  G L +E GG   G G PQE  L  IL N+Y +  D+E   L +K +  N    TG
Sbjct: 225 KLFEFGILKIELGGCNSGRGFPQECGLCSILINVYFDGLDKEIQDLRLKMKVKNPRVGTG 284

Query: 322 QDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEI 381
            + S   +  +F+                 + +Y  RY+DEI +  SGSK +    +  I
Sbjct: 285 DEESTGNV--FFK----------------PVNIYAVRYLDEILVITSGSKMLTMDLKKRI 344

Query: 382 FDFVQKTLHLDVNREEEMV-SCETHGIRFLGCL-------VRRSVQESPAVKSIHKLKEK 441
            D +++ L L V+R    + S  +  I FLG         V R  +   AV+++ K + +
Sbjct: 345 VDILEQRLELRVDRLNTSIHSAVSEKINFLGMYLQAVPPSVLRPPKSEKAVRAMKKYQRQ 404

Query: 442 VELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSFRKPGMETDH 501
            ++  L+ +         LG K   H LKK+K+S              + F+  G E ++
Sbjct: 405 KDVRKLELRNARERNRKTLGLKIFRHVLKKIKQS--------------NGFKFEG-EIEN 464

Query: 502 WYKVLLKIW----MQDLNARAAE--------SEEKILSKHAVELSLPFELRDSFYEFQRH 548
             + + + W    MQD      E        +    LS   +   LP +L D++ EFQ  
Sbjct: 465 EVRDIFQSWGEEVMQDFMGSLEERWKWHWLLTRGDFLSLRHIREKLPQDLIDAYDEFQEQ 498

BLAST of CSPI04G10980 vs. NCBI nr
Match: gi|778692419|ref|XP_011653460.1| (PREDICTED: uncharacterized protein LOC101219510 [Cucumis sativus])

HSP 1 Score: 1565.1 bits (4051), Expect = 0.0e+00
Identity = 783/786 (99.62%), Positives = 784/786 (99.75%), Query Frame = 1

Query: 1   MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASL 60
           MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASL
Sbjct: 14  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAEIDKGMERMKLAINLASL 73

Query: 61  VEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCI 120
           VEESLDVDLRRSKTQMELKRS+EIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCI
Sbjct: 74  VEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCI 133

Query: 121 RINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAI 180
           RINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAI
Sbjct: 134 RINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEAI 193

Query: 181 RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLI 240
           RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLI
Sbjct: 194 RIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLI 253

Query: 241 TVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF 300
           TVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF
Sbjct: 254 TVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEF 313

Query: 301 FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIF 360
           FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIF
Sbjct: 314 FRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIF 373

Query: 361 LAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAV 420
           LAVSGSKDVAHSFRSEIF FVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAV
Sbjct: 374 LAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPAV 433

Query: 421 KSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSF 480
           KSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSF
Sbjct: 434 KSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISSF 493

Query: 481 RKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKE 540
           RKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKE
Sbjct: 494 RKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKE 553

Query: 541 YISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ 600
           YISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ
Sbjct: 554 YISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQ 613

Query: 601 DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD 660
           DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD
Sbjct: 614 DNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKFD 673

Query: 661 SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVI 720
           SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLS ARMVSQSRPCNCFVI
Sbjct: 674 SELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSFARMVSQSRPCNCFVI 733

Query: 721 GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSV 780
           GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSV
Sbjct: 734 GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSV 793

Query: 781 DFGAWK 787
           DFGAWK
Sbjct: 794 DFGAWK 799

BLAST of CSPI04G10980 vs. NCBI nr
Match: gi|659082762|ref|XP_008442019.1| (PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo])

HSP 1 Score: 1490.7 bits (3858), Expect = 0.0e+00
Identity = 746/787 (94.79%), Positives = 765/787 (97.20%), Query Frame = 1

Query: 2   RNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLAS 61
           RNFSN QSVNVCI NSSFVSDIGKC QIVQ SENYSTLARA  EIDKGME+MKLA+NLAS
Sbjct: 15  RNFSNSQSVNVCIVNSSFVSDIGKCFQIVQSSENYSTLARADDEIDKGMEKMKLAMNLAS 74

Query: 62  LVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDC 121
           LVEESLDVDLRRSKT+MELKRS+EI+IKERVKAQYLNGKFLDLMGNVIACPNTLQN YDC
Sbjct: 75  LVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGNVIACPNTLQNAYDC 134

Query: 122 IRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEA 181
           IRINSNVDIKSND LISFESMA+ELS+GNFDVNTNTFSILSSRKEVLILPKIKLKVLQEA
Sbjct: 135 IRINSNVDIKSNDCLISFESMAKELSHGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEA 194

Query: 182 IRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKL 241
           IRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKL
Sbjct: 195 IRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKL 254

Query: 242 ITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQE 301
           ITVMEDKIEDPKLFAVIRSI+LAGALNLEFG FPKGHGLPQEGVLSPILTNIYLNLFDQE
Sbjct: 255 ITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLSPILTNIYLNLFDQE 314

Query: 302 FFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEI 361
           FFRLSMKYEAINEYGNTGQDGSQS+LRSWFRRQLK N+SDY GEEKDKIRVYCCRYMDEI
Sbjct: 315 FFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKENSSDYPGEEKDKIRVYCCRYMDEI 374

Query: 362 FLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCETHGIRFLGCLVRRSVQESPA 421
           FLAVSGSKDVA SFRSEIFDF+QKTLHLDVN EEEMVSCETHGIRFLGCLVRRSVQESPA
Sbjct: 375 FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSCETHGIRFLGCLVRRSVQESPA 434

Query: 422 VKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKISS 481
           VKSIHKLKEKVELFGLQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+ISS
Sbjct: 435 VKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISS 494

Query: 482 FRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVK 541
           FRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVE SLPFELRDSFYEFQR V+
Sbjct: 495 FRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPFELRDSFYEFQRRVE 554

Query: 542 EYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLIL 601
           EYISSETASTLALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFLIL
Sbjct: 555 EYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLIL 614

Query: 602 QDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEKKF 661
           QDNTQIIDWF+GVSRR FRWYN SSNFSELFLIFDQVRKSCIRTLAAKH+IHESEIEKKF
Sbjct: 615 QDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRTLAAKHQIHESEIEKKF 674

Query: 662 DSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFV 721
           DSELSKIYSS EI+QEKEKSTDTHVLDHDEAL YGISYSGLCLLSLARMVS+SRPCNCFV
Sbjct: 675 DSELSKIYSSPEIEQEKEKSTDTHVLDHDEALNYGISYSGLCLLSLARMVSRSRPCNCFV 734

Query: 722 IGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQS 781
           +GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQS
Sbjct: 735 VGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQS 794

Query: 782 VDFGAWK 787
           VDFGAWK
Sbjct: 795 VDFGAWK 801

BLAST of CSPI04G10980 vs. NCBI nr
Match: gi|657950519|ref|XP_008348277.1| (PREDICTED: uncharacterized protein LOC103411417 [Malus domestica])

HSP 1 Score: 990.7 bits (2560), Expect = 1.4e-285
Identity = 507/768 (66.02%), Positives = 613/768 (79.82%), Query Frame = 1

Query: 30  VQGSENYSTLARAEID---KGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSIEIRI 89
           VQ S ++  +A A  D    G+ +MKLA NLA LVEES  ++ RR K +M+LKR +E+RI
Sbjct: 55  VQESADHCAVASAAADGVNSGIRKMKLAENLACLVEESSHINERRPKGRMQLKRCLELRI 114

Query: 90  KERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAEELSN 149
           K+RVK QY+NGKF DLM  VIA P TLQ+ YDCIR+NSNVDI  +D   SF+SMAEE+ +
Sbjct: 115 KKRVKEQYINGKFRDLMVKVIANPETLQDAYDCIRLNSNVDIALSDAKNSFDSMAEEMRH 174

Query: 150 GNFDVNTNTFSILSSR---KEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGR 209
           G+FD N NTFSI S R    EVL+LP + LKV+QEAIR+VLE V++P FSKISHG RSGR
Sbjct: 175 GSFDANANTFSI-SKRGVGNEVLVLPNLNLKVIQEAIRVVLEVVYKPDFSKISHGYRSGR 234

Query: 210 GHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIYLAG 269
           GHSTALKYI KEI +PDWWFTV L+KK+D  ++ +L+  ME KI DP LF +I+S++ A 
Sbjct: 235 GHSTALKYISKEISNPDWWFTVLLNKKLDACILGELLKAMEGKIVDPSLFDMIKSMFHAN 294

Query: 270 ALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDGSQS 329
            LNLEFGGFPKGHGLPQEG+LSPIL NIYL+ FD+EF+RLSMKYEA++      Q+ SQS
Sbjct: 295 VLNLEFGGFPKGHGLPQEGILSPILMNIYLDQFDREFYRLSMKYEALSLDSQNDQN-SQS 354

Query: 330 RLRSWFRRQLKGNNS-DYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDFVQ 389
           +LRSWFRR LKGNN    +GEE    RV+ CR+MDEIF + SGSKD A  F+SE+ +++Q
Sbjct: 355 KLRSWFRRHLKGNNDLGCAGEESCSARVHSCRFMDEIFFSXSGSKDAALEFKSEVLNYLQ 414

Query: 390 KTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQETW 449
           K+LHL+V+ + E++ C+  HGIRFLG LVRR+V ESPA K++HKLKEKV LFGLQKQE W
Sbjct: 415 KSLHLEVDDQTELLPCQKPHGIRFLGTLVRRNVIESPATKAVHKLKEKVALFGLQKQEAW 474

Query: 450 NAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNKISSFRKPGMETDHWYKVLLKIWMQ 509
           N  TV +GKKWL HGLKKVKESEIKHLA +SS LN+IS FRK GMETDHWYK LLKIWMQ
Sbjct: 475 NVGTVHIGKKWLGHGLKKVKESEIKHLADSSSVLNQISHFRKFGMETDHWYKHLLKIWMQ 534

Query: 510 DLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDPSA 569
           D+NA+A ESEE +LSKH  E +LP EL +SFYEFQR V++Y+SSET+S LALLPN   SA
Sbjct: 535 DVNAKAEESEEAVLSKHVAEPALPEELTNSFYEFQRQVEKYVSSETSSILALLPNAGSSA 594

Query: 570 KPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFRWY 629
           +   ITEIIAPVN+++KRL RY L T+ G+P +S  L+LQDN QIIDWF G+ RR  RWY
Sbjct: 595 ESVVITEIIAPVNAVKKRLQRYGLTTSDGYPRTSSLLVLQDNDQIIDWFSGIVRRWLRWY 654

Query: 630 NNSSNFSEL-FLIFDQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEK-EK 689
               NF E+  LI D VRKSCIRTLAAK+R+HE+EIE++FD+ELS+I S+ EI+QE  ++
Sbjct: 655 AECVNFKEVKLLISDLVRKSCIRTLAAKYRVHENEIERRFDTELSRIPSTQEIEQEMVDE 714

Query: 690 STDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVMERQ 749
           ++DT   ++DEAL YGISYSGLC+LSLARMVS+SRPCNCFV GC+A APSVYTLHVMERQ
Sbjct: 715 TSDTQAFENDEALMYGISYSGLCVLSLARMVSESRPCNCFVFGCMASAPSVYTLHVMERQ 774

Query: 750 KFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK 787
           KFPGWKTGFSS IHPSLN+RR GLCKQHL DLYLG +SLQSVDFGAWK
Sbjct: 775 KFPGWKTGFSSCIHPSLNRRRIGLCKQHLKDLYLGHVSLQSVDFGAWK 820

BLAST of CSPI04G10980 vs. NCBI nr
Match: gi|703078258|ref|XP_010090835.1| (Group II intron-encoded protein ltrA [Morus notabilis])

HSP 1 Score: 986.9 bits (2550), Expect = 2.1e-284
Identity = 502/793 (63.30%), Positives = 624/793 (78.69%), Query Frame = 1

Query: 1   MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARAE-IDKGMERMKLAINLAS 60
           +R  + +Q +N  +  SSF  D GK  + +Q   ++ST A A+ I+    +  LA NLAS
Sbjct: 14  LRKLATMQRINQILLYSSFFIDRGKSSERIQEPRHFSTAAAADAINMCSGKNTLATNLAS 73

Query: 61  LVEESLDVDLRRSKTQMELKRSIEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDC 120
           L+EES++VD R+  ++MELKRS+E R+K+RVK QY+NGKF +L+  VIA P TLQ+ Y+C
Sbjct: 74  LLEESVEVDERKPSSRMELKRSLEYRVKKRVKEQYVNGKFHNLLEKVIANPETLQDAYNC 133

Query: 121 IRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILS--SRKEVLILPKIKLKVLQ 180
           IR+NSNVDI  N+   SFES+ EEL  GNFDV  NT SI +  +RKEVL+LP +KLKV+Q
Sbjct: 134 IRLNSNVDIMLNNETTSFESVPEELFCGNFDVKANTVSISTRGARKEVLVLPNLKLKVIQ 193

Query: 181 EAIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMA 240
           EAIRIVLE V+RPHFSKISHGCRSGRGH TALK+IKK+I  P WW T+ ++KK+D  ++ 
Sbjct: 194 EAIRIVLEVVYRPHFSKISHGCRSGRGHFTALKFIKKDICAPIWWSTLIVNKKLDTCILD 253

Query: 241 KLITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFD 300
           KLI+V+E+KI DP LF++IRS++ +  +NLEFGGFPKGHGLPQEG+LSPIL NIYL+LFD
Sbjct: 254 KLISVLEEKIVDPGLFSIIRSMFESQVINLEFGGFPKGHGLPQEGILSPILMNIYLDLFD 313

Query: 301 QEFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMD 360
           +EF RLS+KYEA++         SQS+LRSWFRR LK  +   +GEEK  +RV+ CR+MD
Sbjct: 314 REFCRLSLKYEALDLDLEANHQKSQSKLRSWFRRNLKAKDLSGAGEEKFSLRVHSCRFMD 373

Query: 361 EIFLAVSGSKDVAHSFRSEIFDFVQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQE 420
           EIFLAVSGSKD A  F+SEI ++++ +LHLDV+ E E++ C+  HGIRF+G LVRR+V+E
Sbjct: 374 EIFLAVSGSKDAALGFKSEIQNYLKNSLHLDVDDETELLPCDGLHGIRFMGTLVRRTVKE 433

Query: 421 SPAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLA-KNSSLN 480
           SPA K+IHKLKEKVELF +QKQE W+  TV +GKKWL HGLKKVKESEI+HLA   S L+
Sbjct: 434 SPATKAIHKLKEKVELFAIQKQEAWDVGTVRIGKKWLGHGLKKVKESEIRHLADPESVLS 493

Query: 481 KISSFRKPGMETDHWYKVLLKIWMQDLNAR-AAESEEKILSKHAVELSLPFELRDSFYEF 540
           +IS FRK GMETDHWYK LLKIWMQD+ A+ AAE EE ILSK+  E +LP EL++SFY F
Sbjct: 494 QISHFRKAGMETDHWYKHLLKIWMQDIKAKAAAECEETILSKYVAEPALPQELKNSFYVF 553

Query: 541 QRHVKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSS 600
           QRH +EY+SSETA T ALL + D S +P  IT+I AP+N+I+KRLLRY LVT KG+  + 
Sbjct: 554 QRHAQEYVSSETAFTCALLKSSDASTQPVIITQIFAPINAIKKRLLRYGLVTTKGYSRAC 613

Query: 601 PFLILQDNTQIIDWFVGVSRRLFRWYNNSSNFSEL-FLIFDQVRKSCIRTLAAKHRIHES 660
             LILQD+ QIIDWF+G+ RR FRWY+   NFS++ FL+  Q+RKSCIRTLA+KH IHE+
Sbjct: 614 SCLILQDDNQIIDWFLGIVRRWFRWYSECDNFSDIKFLVCGQIRKSCIRTLASKHHIHET 673

Query: 661 EIEKKFDSELSKIYSSSEIDQEKEKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSR 720
           EIEK+FD+ELS+I SS +++QE      + V + DEAL YGISYSGLC LSLARMVSQSR
Sbjct: 674 EIEKRFDAELSRIPSSEDLEQEMVNDETSDVFEKDEALMYGISYSGLCALSLARMVSQSR 733

Query: 721 PCNCFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLG 780
           PC CFV GC APA SVY+LHVMERQKFPGWKTGFSS IHPSLN+RRFGLCKQHL DLYLG
Sbjct: 734 PCTCFVTGCQAPAQSVYSLHVMERQKFPGWKTGFSSCIHPSLNRRRFGLCKQHLKDLYLG 793

Query: 781 RISLQSVDFGAWK 787
            ISLQS+DFG+WK
Sbjct: 794 HISLQSIDFGSWK 806

BLAST of CSPI04G10980 vs. NCBI nr
Match: gi|731440172|ref|XP_010646090.1| (PREDICTED: uncharacterized protein LOC100251856 [Vitis vinifera])

HSP 1 Score: 980.7 bits (2534), Expect = 1.5e-282
Identity = 501/770 (65.06%), Positives = 614/770 (79.74%), Query Frame = 1

Query: 25  KCVQIVQGSENYSTLARA--EIDKGMERMKLAINLASLVEESLDVDLRRSKTQMELKRSI 84
           + V+ +Q    YSTL     + DK + +  LA NLA L+EES +  + R   +MELKRS 
Sbjct: 33  RLVERMQACAVYSTLGAVSGDADKDIGKPTLAKNLAFLMEESSN-HVIRPMARMELKRSF 92

Query: 85  EIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYDCIRINSNVDIKSNDRLISFESMAE 144
           E+RIK+RVK QY+NGKF DLM  VIA P TL++ Y+CIRINSNVD+  +   ISF+SMAE
Sbjct: 93  ELRIKKRVKEQYVNGKFQDLMVKVIANPQTLEDAYNCIRINSNVDLALDGDNISFKSMAE 152

Query: 145 ELSNGNFDVNTNTFSIL--SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCR 204
           EL  G+F+VN NTFSI   S+RKEVLILP +KLKV+QEAIRIVLE V+RP+FSKISHGCR
Sbjct: 153 ELLGGSFNVNVNTFSISTKSARKEVLILPSLKLKVVQEAIRIVLEIVYRPYFSKISHGCR 212

Query: 205 SGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKLITVMEDKIEDPKLFAVIRSIY 264
           SGRGHSTALKYI KEI +PDWWF + ++KK+D +V+AKLI+ M+DKIEDP LF +I++++
Sbjct: 213 SGRGHSTALKYISKEISNPDWWFILHVNKKLDAVVLAKLISTMQDKIEDPNLFVMIQNMF 272

Query: 265 LAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQEFFRLSMKYEAINEYGNTGQDG 324
            A  LNLEFGGFPKGHGLPQEGVLSPIL NIYL+LFD EF+R+SM+YEA++       D 
Sbjct: 273 HAQVLNLEFGGFPKGHGLPQEGVLSPILMNIYLDLFDHEFYRMSMRYEALDPGMCIDHDK 332

Query: 325 SQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDEIFLAVSGSKDVAHSFRSEIFDF 384
           S S+LRSWFRRQLKGN+  Y+G E    RV+ CR+MDEIF A+SGSKD+A  F+SEI ++
Sbjct: 333 SHSKLRSWFRRQLKGNDVKYTGRESSNFRVHSCRFMDEIFFAISGSKDIAIEFKSEILNY 392

Query: 385 VQKTLHLDVNREEEMVSCE-THGIRFLGCLVRRSVQESPAVKSIHKLKEKVELFGLQKQE 444
           +Q +LHLDV+ + E++ C   HGI+FLG LV+RSV+ESP V+++HKLKEKV LF  QKQE
Sbjct: 393 MQNSLHLDVSNQSELLPCHGPHGIQFLGTLVKRSVRESPTVRAVHKLKEKVRLFASQKQE 452

Query: 445 TWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNKISSFRKPGMETDHWYKVLLKIW 504
            W+A T+ +GKKWLAHGLKKVKESEI+HLA   S L++IS FRK GMETDHWYK+LLKIW
Sbjct: 453 AWDAGTLRIGKKWLAHGLKKVKESEIRHLADTDSVLSQISCFRKTGMETDHWYKLLLKIW 512

Query: 505 MQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRHVKEYISSETASTLALLPNYDP 564
           + D+ A+AAE+E  ILSK+  E  LP ELRDSFYEFQ+  ++Y++SETAS LALLPN   
Sbjct: 513 LHDVKAKAAENEGVILSKYIAEPLLPKELRDSFYEFQKRAEDYVASETASMLALLPNSKS 572

Query: 565 SAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFLILQDNTQIIDWFVGVSRRLFR 624
             +   I +IIAPVN I+KRLLRYRL   KG+PC+SP LILQD+ QI+DWF G++RR   
Sbjct: 573 CTESVPIIKIIAPVNVIKKRLLRYRLTNAKGYPCASPMLILQDDIQIVDWFSGLARRWLI 632

Query: 625 WYNNSSNFSELFLIF-DQVRKSCIRTLAAKHRIHESEIEKKFDSELSKIYSSSEIDQEK- 684
           WY+   NFSE+ LI  DQ+RKSCIRTLAAK+R+HE+EIEK+ D+EL +I S+ EI+QEK 
Sbjct: 633 WYSECDNFSEVKLIICDQLRKSCIRTLAAKYRLHETEIEKRSDTELCRIPSTLEIEQEKV 692

Query: 685 EKSTDTHVLDHDEALKYGISYSGLCLLSLARMVSQSRPCNCFVIGCLAPAPSVYTLHVME 744
            +++D+   D +EAL YGISYSGLCLLSLARMVSQSR CNCFV+GCLA APSVYTLHVME
Sbjct: 693 NETSDSQASDTNEALMYGISYSGLCLLSLARMVSQSRRCNCFVMGCLAAAPSVYTLHVME 752

Query: 745 RQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISLQSVDFGAWK 787
           RQKFPGWKTGFSS IHPSLN RR GLCKQHL DLYLG ISLQS++FGAWK
Sbjct: 753 RQKFPGWKTGFSSCIHPSLNGRRIGLCKQHLKDLYLGHISLQSIEFGAWK 801
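
The percentages in each HSP header follow directly from the residue counts printed beside them (for example, 439 identical positions over an alignment length of 743). A minimal Python sketch for reading the two header lines and re-deriving the percentages is shown here; the function names `parse_hsp` and `percent` are hypothetical, and the line layout is assumed to match the headers on this page rather than any general BLAST output format.

```python
import re

# Patterns modelled on the header lines of this page only (an assumption,
# not a general BLAST report parser).
SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = ([\d.eE+-]+)")
# Accept "Positives" as well as the "Postives" spelling some report pages use.
RATIO_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp(score_line, stats_line):
    """Return bit score, raw score, E-value and the identity/positives counts."""
    match = SCORE_RE.search(score_line)
    if match is None:
        raise ValueError("unrecognised score line: " + score_line)
    hsp = {
        "bits": float(match.group(1)),
        "raw_score": int(match.group(2)),
        "expect": float(match.group(3)),
    }
    for label, num, den, pct in RATIO_RE.findall(stats_line):
        key = "identity" if label == "Identity" else "positives"
        hsp[key] = (int(num), int(den), float(pct))
    return hsp


def percent(num, den):
    """Percentage of matching positions, rounded to two decimals as on the page."""
    return round(100.0 * num / den, 2)


hsp = parse_hsp(
    "HSP 1 Score: 845.1 bits (2182), Expect = 3.4e-245",
    "Identity = 439/743 (59.08%), Positives = 549/743 (73.89%), Query Frame = 1",
)
num, den, reported = hsp["identity"]
print(percent(num, den), reported)
```

Comparing the recomputed value with the reported one is a cheap consistency check when the alignments are copied between files.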

The following BLAST results are available for this feature:
Match Name    E-value   Identity  Description
LTRA_LACLM    2.8e-23   28.09     Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (stra...
LTRA_LACLC    2.8e-23   28.09     Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris GN=lt...
NICA_PSEPU    2.1e-18   27.66     Putative nicotine oxidoreductase OS=Pseudomonas putida GN=nicA PE=3 SV=1
YMF40_MARPO   5.2e-17   27.13     Uncharacterized mitochondrial protein ymf40 OS=Marchantia polymorpha GN=YMF40 PE...
YMC6_SCHPO    6.7e-17   28.15     Uncharacterized 91 kDa protein in cob intron OS=Schizosaccharomyces pombe (strai...
Match Name         E-value    Identity  Description
A0A0A0KWB0_CUCSA   0.0e+00    99.62     Uncharacterized protein OS=Cucumis sativus GN=Csa_4G188380 PE=4 SV=1
W9QLZ5_9ROSA       1.4e-284   63.30     Group II intron-encoded protein ltrA OS=Morus notabilis GN=L484_020694 PE=4 SV=1
F6I1Y2_VITVI       3.0e-282   65.36     Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0174g00300 PE=4 SV=...
A0A061DIX4_THECC   5.9e-278   63.68     Intron maturase isoform 1 OS=Theobroma cacao GN=TCM_001510 PE=4 SV=1
A0A067JY85_JATCU   1.4e-271   62.78     Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25078 PE=4 SV=1
Match Name    E-value    Identity  Description
AT1G74350.1   3.4e-245   59.08     Intron maturase, type II family protein
AT5G04050.2   7.4e-46    28.57     RNA-directed DNA polymerase (reverse transcriptase)
Match Name                         E-value    Identity  Description
gi|778692419|ref|XP_011653460.1|   0.0e+00    99.62     PREDICTED: uncharacterized protein LOC101219510 [Cucumis sativus]
gi|659082762|ref|XP_008442019.1|   0.0e+00    94.79     PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo]
gi|657950519|ref|XP_008348277.1|   1.4e-285   66.02     PREDICTED: uncharacterized protein LOC103411417 [Malus domestica]
gi|703078258|ref|XP_010090835.1|   2.1e-284   63.30     Group II intron-encoded protein ltrA [Morus notabilis]
gi|731440172|ref|XP_010646090.1|   1.5e-282   65.06     PREDICTED: uncharacterized protein LOC100251856 [Vitis vinifera]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR000477   RT_dom
IPR024937   Domain_X
Vocabulary: Biological Process
Term         Definition
GO:0006397   mRNA processing
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006397 mRNA processing
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name     Unique Name      Type
CSPI04G10980.1   CSPI04G10980.1   mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description                Source    Source Term     Source Description                        Alignment
IPR000477   Reverse transcriptase domain   PFAM      PF00078         RVT_1                                     coord: 167..408, score: 1.4
IPR000477   Reverse transcriptase domain   PROFILE   PS50878         RT_POL                                    coord: 1..410, score: 10
IPR024937   Domain X                       PFAM      PF01348         Intron_maturas2                           coord: 567..679, score: 1.5
None        No IPR available               PANTHER   PTHR33642       FAMILY NOT NAMED                          coord: 31..785, score:
None        No IPR available               PANTHER   PTHR33642:SF3   INTRON MATURASE, TYPE II FAMILY PROTEIN   coord: 31..785, score:
None        No IPR available               unknown   SSF56672        DNA/RNA polymerases                       coord: 171..308, score: 5.19E-12; coord: 346..430, score: 5.19
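
The `coord:` entries in the Alignment column give the start and end of each domain match on the protein. A short Python sketch for turning those ranges into span lengths follows; `span_lengths` is a hypothetical helper, and the lengths assume the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

# Matches entries such as "coord: 167..408" anywhere in a line of text.
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def span_lengths(text):
    """Return (start, end, length) for every coord range found in text.

    Length is end - start + 1, i.e. it assumes 1-based inclusive coordinates.
    """
    return [
        (int(start), int(end), int(end) - int(start) + 1)
        for start, end in COORD_RE.findall(text)
    ]


# The Pfam reverse transcriptase match and the two SSF56672 segments.
for start, end, length in span_lengths(
    "coord: 167..408 coord: 171..308 coord: 346..430"
):
    print(start, end, length)
```

Under that assumption the Pfam RVT_1 match covers 242 residues and the two SSF56672 segments cover 138 and 85 residues.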