Cla012109 (gene) Watermelon (97103) v1

Name: Cla012109
Type: gene
Organism: Citrullus lanatus (Watermelon (97103) v1)
Description: Maturase (Fragment) (AHRD V1 *-*- A1XJ30_9ROSI); contains InterPro domain(s) IPR000477 RNA-directed DNA polymerase (reverse transcriptase)
Location: Chr4 : 16048204 .. 16050858 (-)
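The location string can be split into chromosome, coordinates, and strand to recover the span of the feature. The following is a minimal sketch, not part of the database record: the function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

def parse_location(text):
    """Split a location such as 'Chr4 : 16048204 .. 16050858 (-)' into fields."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1
        "length": end - start + 1,
    }

loc = parse_location("Chr4 : 16048204 .. 16050858 (-)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr4 - 2655
```

Under that assumption the feature spans 2655 bp on the minus strand of Chr4.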
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCGGAACATTTCAATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGTAAATATCCAATCTACTATGCTGCTCTAGTTAGGTTATAGTTGCTTTTTTCAGTGTTCATGTTATGAAATGTTAGATTTACTTCTTTATGATGTTCCAAATTTGATAGAATGTGATGATGTAATGAACCTTCTCTTTTCTGCAATTGATCTTTACTTTCTTGTTAAATAATCAGTTTGAATTTTTGAAAATCAAACATTTCATTTGTAATTTTTCTTCTAACATCTAGAAAATTATGTGACTTGTATGTGATTGCCTGTGGACCTTTATTACTTATGGCAGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATGCTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCTTCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTGATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATATATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATTGAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTGCTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAATGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCAGTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCTTTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGC
ACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAACTTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAGTAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTATTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA

mRNA sequence

ATGCGGAACATTTCAATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATGCTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCTTCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTGATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATATATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATTGAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTGCTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAATGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCAGTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCTTTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAACTTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAG
TAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTATTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA

Coding sequence (CDS)

ATGCGGAACATTTCAATTTTGCAAAATGTAAATGTTTGCAAAGTCAATTCATCCTTTGTTTCTGACATTGGAGAGTGTGTTCAGAGAGTTCAAAGTTCTGGAAATTATTCAACTCTCGCCTATGCTGATGATGAAATTGACAAGGGAATAGAGAAAAAGAAACTGGCCACAAACTTGGCCTCACTTATTGAAGAATCTCTTGATGTTGATCTGAGAAGATCAAAGACTCAAATGGAACTGAAGAGATCACTTGAAATTCAGATCAAGAAGAGGGTGAAGGCACAATATTTGAATGGGAAGTTTTTGGACTTGATGGGGAAAGTGATTGCTTGCCCCACAACTCTTCAGAATGCTTATGACTGTGTTAGAATTAACTCAAATGTTGATATAATGTCGACTGACTGTTTAATCTCATTTGAATCTATGGCTGAAGAGCTATCTAATGGTAACTTTGATGTCAATGCCAATACTTTCTCCATATTAAGCTCAAGAAAAGAAGTACTCATTTTACCGAAGATAAAGTTGAAGGTTCTTCAAGAAGCCATTAGGATAGTTTTGGAGTGTGTGTTTAGGCCACATTTTTCCAAGATATCTCATGGCTGTCGAAGTGGAAGAGGACACTCAACGGTATTGAAGTACATCAGAAAAGAGATAAAAAATCCTGATTGGTGGTTCACAGTTGACTTAAGCAAAAAGATGGATGAGCTTGTAATGGCTAAACTCATTACAGTAATGGAGGACAAGATAGATGACCCCAGATTATTTGCTGTTATTAGAAGTATATATGTGGCTGGGGCACTGAACTTGGAGTTTGGGGGTTTCCCAAAAGGTCACGGTCTTCCACAAGAGGGGGTTTTGTCTCCTATATTAATGAACATCTATCTCAACCTCTTTGACCAAGAATTTTTCAGATTATCTATGAAATATGAGGCTATTAATGAGAATGGCAATACCGGTCAAGATGGGTCACAATCAAGGCTGCGGAGCTGGTTTAGGAGACAATTGAAAGGAAATAGTTTTGAATATCCAGGTGAGGAGAAAGACAAAATAAGAGTATACTGTTGTCGCTATATGGATGAAATTTTTTTGGCGGTATCGGGTTCTAAAGATGTTGCTCTTAGTTTTAGGTCTGAGATTTTTGATTTCATGCAGAAGACTTTGCATTTGGACGTCAATCATCAAGAGGAAATGGTATCATGTGGGGAGACTCATGGAATTCGTTTTCTTGGTTGTTTGGTCAGACGAAGTGTGCAGGAAAGTCCAGCTGTGAAATCTGTCCACAAGTTGAAGGAAAAAGTTGAGCTATTTGCTTTACAAAAGCAGGAGACTTGGAATGCTTGGACGGTGTGGTTGGGAAAGAAATGGCTCGCTCATGGTTTGAAGAAGGTTAAAGAGTCGGAGATCAAGCATTTAGCTAAAAATAGCTCTTTGAATCAAATTTCCAGTTTTCGTAAAGCTGGAATGGAAACTGATCACTGGTACAAGGTTCTATTGAAAATTTGGATGCAAGATCTAAATGCAAGGGCTGCAGAGAGTGAAGAAAAAATCTTATCTAAGTATGCAGTGGAACCTTCTCTTCCTATGGAACTTCGAGATTCCTTTTATGAGTTCCAAAGGTGTGTGAAACAATATATTTCTGCTGAGACAGCTTCTACTGTTGCCCTTTTACCAAATTATGATCCTTCTGTCAAACCTACTTTCATAACTGAGATTATAGCACCTGTCAATTCTATAAGAAAACGACTTTTTCGATATAGGTTAGTTACAAATAAGGGACATCCATGCTCCTCTCCTTTCCTCATCTTACAAGATAACACTCAAATTATTGACTGGTTTTTAGGAGTATCTCGCCGTTGGTTTAGATGGTATAACAACTGTTCTAACTTCAGCGAGTTGGTCTTAATTTGTGATCAAGTTAGGAAATCCTGTATCCGAACGCTAGCAGCAAAGCATCGTATACACGAAAGTGAAATAGAAAAGAAGTTTGACTCAGAACTGAG
TAACATTTACTCCTCTCCTGAACTAGAGCAAGAAGAAGAGAAGAAGTCATCAGATACTCATGGTTTAGACCATGATGAGGCACTAAAGTATGGAATTTCATATAGTGGTCTGTGTTTGCTATCTCTTGCTAGAATGGTCAACCCATCTCGTCCATGCAATTGTTTTGTCGTTGGGTGTTTGGCTCCTGCACCAAGCGTTTATACTCTTCATGTCATGGAGAGACAAAAGTTTCCAGGATGGAAGACTGGATTCTCGAGTTCCATCCATCCTAGCTTGAACAAACGACGATTCGGGTTATGCAAAAAACATTTGGAGGATTTGTATTTGGGTCACATTTCATTGCAATCTATTGACTTTGGTGCATGGAAGTGA

Protein sequence

MRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQISSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK
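The relationship between the coding sequence and the protein sequence can be checked by translating codons. The sketch below is illustrative only: it assumes the standard genetic code (the record does not say which table was used), the helper name `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used as input.

```python
# Standard genetic code, written in the compact TCAG ordering.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First 30 nucleotides of the CDS listed above.
print(translate("ATGCGGAACATTTCAATTTTGCAAAATGTA"))  # MRNISILQNV
```

The result matches the first ten residues of the protein sequence; the final TGA codon of the CDS translates to a stop (`*`).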
BLAST of Cla012109 vs. Swiss-Prot
Match: LTRA_LACLC (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris GN=ltrA PE=1 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 5.0e-20
Identity = 80/290 (27.59%), Positives = 140/290 (48.28%), Query Frame = 1

Query: 162 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNP 221
           S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   T LK I++E    
Sbjct: 94  SKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGA 153

Query: 222 DWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGH-GL 281
            W+   D+    D +    LI ++  KI D ++  +I     AG   LE   + K + G 
Sbjct: 154 RWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSGT 213

Query: 282 PQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKG 341
           PQ G+LSP+L NIYL+  D+   +L MK++  +    T +     ++ +  S   ++L+G
Sbjct: 214 PQGGILSPLLANIYLHELDKFVLQLKMKFDRESPERITPEYRELHNEIKRISHRLKKLEG 273

Query: 342 NS-----FEYPGEEKDKIRVYC----------CRYMDEIFLAVSGSKDVALSFRSEIFDF 401
                   EY  + K    + C           RY D+  ++V GSK+     + ++  F
Sbjct: 274 EEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDCQWIKEQLKLF 333

Query: 402 MQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK 433
           +   L ++++ ++ +++   +   RFLG  +R  V+ S  +K   K+K++
Sbjct: 334 IHNKLKMELSEEKTLIT-HSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378
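The percentages on an HSP summary line are simply the identity and positive counts divided by the alignment length. As a rough sketch (the function name `hsp_percentages` is hypothetical, and the pattern is written to also tolerate the misspelling "Postives"), they can be recomputed like this:

```python
import re

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = re.search(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)", line)
    if m is None:
        raise ValueError("not an HSP summary line")
    ident, ident_len, pos, pos_len = map(int, m.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

summary = "Identity = 80/290 (27.59%), Positives = 140/290 (48.28%), Query Frame = 1"
print(hsp_percentages(summary))  # (27.59, 48.28)
```

Here 80 identical and 140 positive positions over a 290-column alignment give 27.59% identity and 48.28% positives, as reported for the LTRA_LACLC hit.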

BLAST of Cla012109 vs. Swiss-Prot
Match: LTRA_LACLM (Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (strain MG1363) GN=ltrA PE=1 SV=1)

HSP 1 Score: 101.3 bits (251), Expect = 5.0e-20
Identity = 80/290 (27.59%), Positives = 140/290 (48.28%), Query Frame = 1

Query: 162 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNP 221
           S +   L +P    K++QEA+RI+LE ++ P F  +SHG R  R   T LK I++E    
Sbjct: 94  SKKMRPLGIPTFTDKLIQEAVRIILESIYEPVFEDVSHGFRPQRSCHTALKTIKREFGGA 153

Query: 222 DWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGH-GL 281
            W+   D+    D +    LI ++  KI D ++  +I     AG   LE   + K + G 
Sbjct: 154 RWFVEGDIKGCFDNIDHVTLIGLINLKIKDMKMSQLIYKFLKAG--YLENWQYHKTYSGT 213

Query: 282 PQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD---GSQSRLRSWFRRQLKG 341
           PQ G+LSP+L NIYL+  D+   +L MK++  +    T +     ++ +  S   ++L+G
Sbjct: 214 PQGGILSPLLANIYLHELDKFVLQLKMKFDRESPERITPEYRELHNEIKRISHRLKKLEG 273

Query: 342 NS-----FEYPGEEKDKIRVYC----------CRYMDEIFLAVSGSKDVALSFRSEIFDF 401
                   EY  + K    + C           RY D+  ++V GSK+     + ++  F
Sbjct: 274 EEKAKVLLEYQEKRKRLPTLPCTSQTNKVLKYVRYADDFIISVKGSKEDCQWIKEQLKLF 333

Query: 402 MQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEK 433
           +   L ++++ ++ +++   +   RFLG  +R  V+ S  +K   K+K++
Sbjct: 334 IHNKLKMELSEEKTLIT-HSSQPARFLGYDIR--VRRSGTIKRSGKVKKR 378

BLAST of Cla012109 vs. Swiss-Prot
Match: YMF40_MARPO (Uncharacterized mitochondrial protein ymf40 OS=Marchantia polymorpha GN=YMF40 PE=3 SV=1)

HSP 1 Score: 92.0 bits (227), Expect = 3.0e-17
Identity = 72/258 (27.91%), Positives = 116/258 (44.96%), Query Frame = 1

Query: 170 LPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDL 229
           +P  + K++QE +R +LE VF P F   SHG R  R   T L+ IR+      W    D+
Sbjct: 80  IPSPRDKIVQEVMRRILEPVFEPRFLDSSHGFRPHRSPHTALRQIRR-WTGTSWMIEGDI 139

Query: 230 SKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGH---GLPQEGVL 289
               D +    L   + + + D RL A+   +  AG +N    G  + H   G+PQ  +L
Sbjct: 140 KGYFDNIDHHLLAGFIAELVKDQRLLALYWKLVRAGYVN---QGKAEPHLLTGVPQGRIL 199

Query: 290 SPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSW-FRRQLKGNSFEYPGE 349
           SP+L NIYL+ FD     + +KY              ++R + +   + LK +S E    
Sbjct: 200 SPLLSNIYLHQFDLFMEEIKVKYTTTGALSKNNPIYLKARNKYYKLVKSLKASSAEIIRA 259

Query: 350 EKDKI----------RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQE 409
            +D +          RV   RY D+  + V+G K +A+  + E+  F+Q+ L L +  ++
Sbjct: 260 RRDMLKMTYGIQTGSRVRYVRYADDWVIGVTGPKALAVQIKEEVSTFLQEKLKLSLQAEK 319

Query: 410 EMVSCGETHGIRFLGCLV 414
             ++        FLG L+
Sbjct: 320 TRITNLSRSEALFLGTLI 333

BLAST of Cla012109 vs. Swiss-Prot
Match: YMC6_SCHPO (Uncharacterized 91 kDa protein in cob intron OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) GN=SPMIT.06 PE=3 SV=4)

HSP 1 Score: 87.4 bits (215), Expect = 7.5e-16
Identity = 76/270 (28.15%), Positives = 117/270 (43.33%), Query Frame = 1

Query: 162 SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNP 221
           S  K  L +   + K++QE +RIVLE ++ P F+  SHG R GR   + L+ I    K  
Sbjct: 304 SGGKRPLTIGSPRDKLVQEILRIVLEAIYEPLFNTASHGFRPGRSCHSALRSIFTNFKGC 363

Query: 222 DWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLP 281
            WW   D+    D +   KLI ++  KI D R   +IR    AG L      +    G P
Sbjct: 364 TWWIEGDIKACFDSIPHDKLIALLSSKIKDQRFIQLIRKALNAGYLTENRYKYDI-VGTP 423

Query: 282 QEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFE 341
           Q  ++SPIL NIYL+  D+  F  ++K E   +     +  S+SR   +   + K  + +
Sbjct: 424 QGSIVSPILANIYLHQLDE--FIENLKSEFDYKGPIARKRTSESRHLHYLMAKAKRENAD 483

Query: 342 YPGEEKDKI---------------RVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKT 401
                K  I               ++   RY D+  +AV+GS        ++I  F   +
Sbjct: 484 SKTIRKIAIEMRNVPNKIHGIQSNKLMYVRYADDWIVAVNGSYTQTKEILAKITCFC-SS 543

Query: 402 LHLDVNHQEEMVSCGETHGIRFLGCLVRRS 417
           + L V+  +  ++   T  I FLG  +  S
Sbjct: 544 IGLTVSPTKTKITNSYTDKILFLGTNISHS 569

BLAST of Cla012109 vs. Swiss-Prot
Match: AI1M_YEAST (Putative COX1/OXI3 intron 1 protein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=AI1 PE=3 SV=2)

HSP 1 Score: 83.6 bits (205), Expect = 1.1e-14
Identity = 65/246 (26.42%), Positives = 114/246 (46.34%), Query Frame = 1

Query: 176 KVLQEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDE 235
           K++QE +R++L+ +F    S  SHG R      T +  +R      +W+  VDL K  D 
Sbjct: 333 KIVQEVMRMILDTIFDKKMSTHSHGFRKNMSCQTAIWEVRNMFGGSNWFIEVDLKKCFDT 392

Query: 236 LVMAKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGH-GLPQEGVLSPILMNIY 295
           +    +I  ++  I D     ++  +  AG ++ E G + K   GLPQ  ++SPIL NI 
Sbjct: 393 ISHDLIIKELKRYISDKGFIDLVYKLLRAGYID-EKGTYHKPMLGLPQGSLISPILCNIV 452

Query: 296 LNLFD---QEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQL-------KGNSFEYPG 355
           + L D   +++  L  K +   ++    +          F  +L       KG +F Y  
Sbjct: 453 MTLVDNWLEDYINLYNKGKVKKQHPTYKKLSRMIAKAKMFSTRLKLHKERAKGPTFIY-- 512

Query: 356 EEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETH 411
            + +  R+   RY D+I + V GSK+     + ++ +F+  +L L +N ++ +++C    
Sbjct: 513 NDPNFKRMKYVRYADDILIGVLGSKNDCKMIKRDLNNFL-NSLGLTMNEEKTLITCATET 572

BLAST of Cla012109 vs. TrEMBL
Match: A0A0A0KWB0_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G188380 PE=4 SV=1)

HSP 1 Score: 1417.1 bits (3667), Expect = 0.0e+00
Identity = 711/790 (90.00%), Positives = 743/790 (94.05%), Query Frame = 1

Query: 1   MRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLA 60
           MRN S LQ+VNVC  NSSFVSDIG+CVQ VQ S NYSTLA A  EIDKG+E+ KLA NLA
Sbjct: 14  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLA 73

Query: 61  SLIEESLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYD 120
           SL+EESLDVDLRRSKTQMELKRSLEI+IK+RVKAQYLNGKFLDLMG VIACP TLQN YD
Sbjct: 74  SLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYD 133

Query: 121 CVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQE 180
           C+RINSNVDI S D LISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQE
Sbjct: 134 CIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQE 193

Query: 181 AIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAK 240
           AIRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAK
Sbjct: 194 AIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAK 253

Query: 241 LITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQ 300
           LITVMEDKI+DP+LFAVIRSIY+AGALNLEFGGFPKGHGLPQEGVLSPIL NIYLNLFDQ
Sbjct: 254 LITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQ 313

Query: 301 EFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDE 360
           EFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGN+ +Y GEEKDKIRVYCCRYMDE
Sbjct: 314 EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDE 373

Query: 361 IFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQES 420
           IFLAVSGSKDVA SFRSEIF F+QKTLHLDVN +EEMVSC ETHGIRFLGCLVRRSVQES
Sbjct: 374 IFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSC-ETHGIRFLGCLVRRSVQES 433

Query: 421 PAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQI 480
           PAVKS+HKLKEKVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+I
Sbjct: 434 PAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKI 493

Query: 481 SSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRC 540
           SSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVE SLP ELRDSFYEFQR 
Sbjct: 494 SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRH 553

Query: 541 VKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFL 600
           VK+YIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFL
Sbjct: 554 VKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFL 613

Query: 601 ILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEK 660
           ILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEK
Sbjct: 614 ILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEK 673

Query: 661 KFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCN 720
           KFDSELS IYSS E++QE+E KS+DTH LDHDEALKYGISYSGLCLLS ARMV+ SRPCN
Sbjct: 674 KFDSELSKIYSSSEIDQEKE-KSTDTHVLDHDEALKYGISYSGLCLLSFARMVSQSRPCN 733

Query: 721 CFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHIS 780
           CFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG IS
Sbjct: 734 CFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRIS 793

Query: 781 LQSIDFGAWK 791
           LQS+DFGAWK
Sbjct: 794 LQSVDFGAWK 799

BLAST of Cla012109 vs. TrEMBL
Match: F6I1Y2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0174g00300 PE=4 SV=1)

HSP 1 Score: 1022.7 bits (2643), Expect = 2.4e-295
Identity = 508/765 (66.41%), Positives = 618/765 (80.78%), Query Frame = 1

Query: 30  VQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEIQIK 89
           +Q+   YSTL     + DK I K  LA NLA L+EES +  + R   +MELKRS E++IK
Sbjct: 1   MQACAVYSTLGAVSGDADKDIGKPTLAKNLAFLMEESSN-HVIRPMARMELKRSFELRIK 60

Query: 90  KRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEELSNG 149
           KRVK QY+NGKF DLM KVIA P TL++AY+C+RINSNVD+      ISF+SMAEEL  G
Sbjct: 61  KRVKEQYVNGKFQDLMVKVIANPQTLEDAYNCIRINSNVDLALDGDNISFKSMAEELLGG 120

Query: 150 NFDVNANTFSIL--SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSGRGH 209
           +F+VN NTFSI   S+RKEVLILP +KLKV+QEAIRIVLE V+RP+FSKISHGCRSGRGH
Sbjct: 121 SFNVNVNTFSISTKSARKEVLILPSLKLKVVQEAIRIVLEIVYRPYFSKISHGCRSGRGH 180

Query: 210 STVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVAGAL 269
           ST LKYI KEI NPDWWF + ++KK+D +V+AKLI+ M+DKI+DP LF +I++++ A  L
Sbjct: 181 STALKYISKEISNPDWWFILHVNKKLDAVVLAKLISTMQDKIEDPNLFVMIQNMFHAQVL 240

Query: 270 NLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQSRL 329
           NLEFGGFPKGHGLPQEGVLSPILMNIYL+LFD EF+R+SM+YEA++       D S S+L
Sbjct: 241 NLEFGGFPKGHGLPQEGVLSPILMNIYLDLFDHEFYRMSMRYEALDPGMCIDHDKSHSKL 300

Query: 330 RSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQKTL 389
           RSWFRRQLKGN  +Y G E    RV+ CR+MDEIF A+SGSKD+A+ F+SEI ++MQ +L
Sbjct: 301 RSWFRRQLKGNDVKYTGRESSNFRVHSCRFMDEIFFAISGSKDIAIEFKSEILNYMQNSL 360

Query: 390 HLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETWNAW 449
           HLDV++Q E++ C   HGI+FLG LV+RSV+ESP V++VHKLKEKV LFA QKQE W+A 
Sbjct: 361 HLDVSNQSELLPCHGPHGIQFLGTLVKRSVRESPTVRAVHKLKEKVRLFASQKQEAWDAG 420

Query: 450 TVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNQISSFRKAGMETDHWYKVLLKIWMQDLN 509
           T+ +GKKWLAHGLKKVKESEI+HLA   S L+QIS FRK GMETDHWYK+LLKIW+ D+ 
Sbjct: 421 TLRIGKKWLAHGLKKVKESEIRHLADTDSVLSQISCFRKTGMETDHWYKLLLKIWLHDVK 480

Query: 510 ARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSVKPT 569
           A+AAE+E  ILSKY  EP LP ELRDSFYEFQ+  + Y+++ETAS +ALLPN     +  
Sbjct: 481 AKAAENEGVILSKYIAEPLLPKELRDSFYEFQKRAEDYVASETASMLALLPNSKSCTESV 540

Query: 570 FITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWYNNC 629
            I +IIAPVN I+KRL RYRL   KG+PC+SP LILQD+ QI+DWF G++RRW  WY+ C
Sbjct: 541 PIIKIIAPVNVIKKRLLRYRLTNAKGYPCASPMLILQDDIQIVDWFSGLARRWLIWYSEC 600

Query: 630 SNFSEL-VLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKKSSD 689
            NFSE+ ++ICDQ+RKSCIRTLAAK+R+HE+EIEK+ D+EL  I S+ E+EQE+  ++SD
Sbjct: 601 DNFSEVKLIICDQLRKSCIRTLAAKYRLHETEIEKRSDTELCRIPSTLEIEQEKVNETSD 660

Query: 690 THGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQKFP 749
           +   D +EAL YGISYSGLCLLSLARMV+ SR CNCFV+GCLA APSVYTLHVMERQKFP
Sbjct: 661 SQASDTNEALMYGISYSGLCLLSLARMVSQSRRCNCFVMGCLAAAPSVYTLHVMERQKFP 720

Query: 750 GWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK 791
           GWKTGFSS IHPSLN RR GLCK+HL+DLYLGHISLQSI+FGAWK
Sbjct: 721 GWKTGFSSCIHPSLNGRRIGLCKQHLKDLYLGHISLQSIEFGAWK 764

BLAST of Cla012109 vs. TrEMBL
Match: W9QLZ5_9ROSA (Group II intron-encoded protein ltrA OS=Morus notabilis GN=L484_020694 PE=4 SV=1)

HSP 1 Score: 1016.5 bits (2627), Expect = 1.7e-293
Identity = 512/796 (64.32%), Positives = 634/796 (79.65%), Query Frame = 1

Query: 1   MRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLA 60
           +R ++ +Q +N   + SSF  D G+  +R+Q   ++ST A AD  I+    K  LATNLA
Sbjct: 14  LRKLATMQRINQILLYSSFFIDRGKSSERIQEPRHFSTAAAAD-AINMCSGKNTLATNLA 73

Query: 61  SLIEESLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYD 120
           SL+EES++VD R+  ++MELKRSLE ++KKRVK QY+NGKF +L+ KVIA P TLQ+AY+
Sbjct: 74  SLLEESVEVDERKPSSRMELKRSLEYRVKKRVKEQYVNGKFHNLLEKVIANPETLQDAYN 133

Query: 121 CVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILS--SRKEVLILPKIKLKVL 180
           C+R+NSNVDIM  +   SFES+ EEL  GNFDV ANT SI +  +RKEVL+LP +KLKV+
Sbjct: 134 CIRLNSNVDIMLNNETTSFESVPEELFCGNFDVKANTVSISTRGARKEVLVLPNLKLKVI 193

Query: 181 QEAIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVM 240
           QEAIRIVLE V+RPHFSKISHGCRSGRGH T LK+I+K+I  P WW T+ ++KK+D  ++
Sbjct: 194 QEAIRIVLEVVYRPHFSKISHGCRSGRGHFTALKFIKKDICAPIWWSTLIVNKKLDTCIL 253

Query: 241 AKLITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLF 300
            KLI+V+E+KI DP LF++IRS++ +  +NLEFGGFPKGHGLPQEG+LSPILMNIYL+LF
Sbjct: 254 DKLISVLEEKIVDPGLFSIIRSMFESQVINLEFGGFPKGHGLPQEGILSPILMNIYLDLF 313

Query: 301 DQEFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYM 360
           D+EF RLS+KYEA++ +       SQS+LRSWFRR LK       GEEK  +RV+ CR+M
Sbjct: 314 DREFCRLSLKYEALDLDLEANHQKSQSKLRSWFRRNLKAKDLSGAGEEKFSLRVHSCRFM 373

Query: 361 DEIFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQ 420
           DEIFLAVSGSKD AL F+SEI ++++ +LHLDV+ + E++ C   HGIRF+G LVRR+V+
Sbjct: 374 DEIFLAVSGSKDAALGFKSEIQNYLKNSLHLDVDDETELLPCDGLHGIRFMGTLVRRTVK 433

Query: 421 ESPAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLA-KNSSL 480
           ESPA K++HKLKEKVELFA+QKQE W+  TV +GKKWL HGLKKVKESEI+HLA   S L
Sbjct: 434 ESPATKAIHKLKEKVELFAIQKQEAWDVGTVRIGKKWLGHGLKKVKESEIRHLADPESVL 493

Query: 481 NQISSFRKAGMETDHWYKVLLKIWMQDLNAR-AAESEEKILSKYAVEPSLPMELRDSFYE 540
           +QIS FRKAGMETDHWYK LLKIWMQD+ A+ AAE EE ILSKY  EP+LP EL++SFY 
Sbjct: 494 SQISHFRKAGMETDHWYKHLLKIWMQDIKAKAAAECEETILSKYVAEPALPQELKNSFYV 553

Query: 541 FQRCVKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCS 600
           FQR  ++Y+S+ETA T ALL + D S +P  IT+I AP+N+I+KRL RY LVT KG+  +
Sbjct: 554 FQRHAQEYVSSETAFTCALLKSSDASTQPVIITQIFAPINAIKKRLLRYGLVTTKGYSRA 613

Query: 601 SPFLILQDNTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTLAAKHRIHE 660
              LILQD+ QIIDWFLG+ RRWFRWY+ C NFS++  L+C Q+RKSCIRTLA+KH IHE
Sbjct: 614 CSCLILQDDNQIIDWFLGIVRRWFRWYSECDNFSDIKFLVCGQIRKSCIRTLASKHHIHE 673

Query: 661 SEIEKKFDSELSNIYSSPELEQE-EEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVN 720
           +EIEK+FD+ELS I SS +LEQE    ++SD    + DEAL YGISYSGLC LSLARMV+
Sbjct: 674 TEIEKRFDAELSRIPSSEDLEQEMVNDETSDV--FEKDEALMYGISYSGLCALSLARMVS 733

Query: 721 PSRPCNCFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDL 780
            SRPC CFV GC APA SVY+LHVMERQKFPGWKTGFSS IHPSLN+RRFGLCK+HL+DL
Sbjct: 734 QSRPCTCFVTGCQAPAQSVYSLHVMERQKFPGWKTGFSSCIHPSLNRRRFGLCKQHLKDL 793

Query: 781 YLGHISLQSIDFGAWK 791
           YLGHISLQSIDFG+WK
Sbjct: 794 YLGHISLQSIDFGSWK 806

BLAST of Cla012109 vs. TrEMBL
Match: A0A061DIX4_THECC (Intron maturase isoform 1 OS=Theobroma cacao GN=TCM_001510 PE=4 SV=1)

HSP 1 Score: 996.9 bits (2576), Expect = 1.4e-287
Identity = 495/772 (64.12%), Positives = 617/772 (79.92%), Query Frame = 1

Query: 24  GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRS 83
           G+ ++++ +   YS+ +  + ++    EK  LA +LA L+EES   D R++K++MELKRS
Sbjct: 31  GKPIEKLHAWVCYSSFS-TNGDLKGAHEKMTLAKDLACLVEESSHQDERKAKSRMELKRS 90

Query: 84  LEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMA 143
           LE+++KKRVK QYLNG F +LM KVIA P TLQ+AY+C+R+NSNVDI      + F+SMA
Sbjct: 91  LELRVKKRVKEQYLNGNFHNLMAKVIANPATLQDAYNCIRLNSNVDISVKHDSVCFKSMA 150

Query: 144 EELSNGNFDVNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC 203
           EEL  G+FDV ANTFS+ +  + KEVL+LP +K++++QEAIRIVLE V++PHFSKISHGC
Sbjct: 151 EELLEGSFDVKANTFSVSTRGASKEVLVLPNLKMRIVQEAIRIVLEVVYKPHFSKISHGC 210

Query: 204 RSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI 263
           RSGR HST L+YI KEI +P WWFT+ L+KK+D  ++AKLI+ ++DK++D +L A I+S+
Sbjct: 211 RSGRDHSTALRYISKEIASPSWWFTLILNKKVDSSILAKLISKLQDKVEDNQLLATIQSM 270

Query: 264 YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD 323
           + A  LN EFGGFPKGHGLPQEGVLSPILMNIYL+LFDQEF+RLSM+YEA++   +  +D
Sbjct: 271 FDAQVLNFEFGGFPKGHGLPQEGVLSPILMNIYLHLFDQEFYRLSMRYEALHPGFDKDED 330

Query: 324 GSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFD 383
            S S+LR+WFRRQLK N  +Y   +    RV+CCR+MDEIF A+SGSKDVALSF+SEI D
Sbjct: 331 MSYSKLRNWFRRQLKENDVKYTVNDDSSPRVHCCRFMDEIFFAISGSKDVALSFKSEIVD 390

Query: 384 FMQKTLHLDV-NHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQK 443
           F + +L LDV + Q E++ C E++GIRFLG LVRRSVQE PA ++VHKLKEKV+LFA QK
Sbjct: 391 FFKNSLELDVDDEQTEILPCNESNGIRFLGALVRRSVQEGPATRAVHKLKEKVKLFASQK 450

Query: 444 QETWNAWTVWLGKKWLAHGLKKVKESEIKHLA-KNSSLNQISSFRKAGMETDHWYKVLLK 503
           Q+ WNA TV +G+KWLAHGLKKVKESEI+HLA   S+L++IS FRKAGMETDHWYKVL K
Sbjct: 451 QDAWNAGTVGIGRKWLAHGLKKVKESEIEHLADSGSTLSKISCFRKAGMETDHWYKVLTK 510

Query: 504 IWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNY 563
           IWMQD+ A+AAE+EE ILSK  VEP+LP EL++S+YEF +   +Y+ +ETA+T+ALLPN 
Sbjct: 511 IWMQDIKAKAAENEESILSKCVVEPALPQELKESYYEFLKRANEYVYSETAATLALLPNS 570

Query: 564 DPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRW 623
             +     ITEIIAPVN+I+KRL RY L T++G+P     L+LQDN QIIDWF G+  RW
Sbjct: 571 SSNAGSVAITEIIAPVNAIKKRLLRYGLTTSEGYPRVVSLLVLQDNFQIIDWFSGIVCRW 630

Query: 624 FRWYNNCSNFSELVLICDQV-RKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQE 683
            RWY  C NF+E+ LI   + RKSCIRTLAAK+RIHESEIEK+FDSEL  I S+ E+EQE
Sbjct: 631 LRWYRECDNFNEIKLIISTILRKSCIRTLAAKYRIHESEIEKQFDSELCRIPSTEEVEQE 690

Query: 684 EEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHV 743
              ++SD+H  D DEAL YGISYSGLCLLSLARMV+ SRPCNCFV+GC   APSVYTLH 
Sbjct: 691 LTYETSDSHSFDSDEALMYGISYSGLCLLSLARMVSQSRPCNCFVMGCSMAAPSVYTLHA 750

Query: 744 MERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK 791
           MERQKFPGWKTGFSS IHPSLNKRR GLCKKHL+DLYLGHISLQSI+FGAWK
Sbjct: 751 MERQKFPGWKTGFSSCIHPSLNKRRIGLCKKHLKDLYLGHISLQSINFGAWK 801

BLAST of Cla012109 vs. TrEMBL
Match: A0A067JY85_JATCU (Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25078 PE=4 SV=1)

HSP 1 Score: 983.8 bits (2542), Expect = 1.2e-283
Identity = 492/771 (63.81%), Positives = 607/771 (78.73%), Query Frame = 1

Query: 24  GECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRS 83
           G  ++ +Q   NYSTLA  +D  D    K  LA NLA ++EES +V+ RR K++MELKRS
Sbjct: 46  GRPLEILQLWANYSTLAEVNDSFDNDAGKITLAKNLAFVLEESSNVNDRRPKSRMELKRS 105

Query: 84  LEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMA 143
           LE++IKKRVK Q+LNGKF DL+ KVIA P TLQ+AY+C+R+N+NVDI S     SFES+A
Sbjct: 106 LELRIKKRVKEQFLNGKFRDLITKVIANPETLQDAYNCIRLNANVDIASDKDDTSFESVA 165

Query: 144 EELSNGNFDVNANTFSILSS--RKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGC 203
           EELSNG+FD++ANTFSI +   RKE+L+LPK+KLKV+QEA+RI LE V+RPHFSKISHGC
Sbjct: 166 EELSNGSFDISANTFSISTKGVRKEILVLPKLKLKVVQEALRIALEVVYRPHFSKISHGC 225

Query: 204 RSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSI 263
           RSGRGH + LKYI KEI NPDWWFT+ +SKK+D  V+ KLI++MEDKI+DP L+ +I+ +
Sbjct: 226 RSGRGHHSALKYISKEISNPDWWFTLTISKKLDACVLDKLISIMEDKIEDPCLYDIIQGM 285

Query: 264 YVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQD 323
             A  LN+EFGG+PKGHGLPQEGVLSPILMNIYLN+FD E +RLSMKYEA++   +    
Sbjct: 286 DAAKVLNMEFGGYPKGHGLPQEGVLSPILMNIYLNVFDHEIYRLSMKYEALSSAFHLEGG 345

Query: 324 GSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFD 383
              S+LR WFRRQLKGN  +  GE     +++ CR+MDE+F AVSGSKD+AL F SEI  
Sbjct: 346 QLNSKLRRWFRRQLKGNGLKTTGEVNSCPKIHSCRFMDELFFAVSGSKDIALGFMSEIMG 405

Query: 384 FMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQ 443
           ++Q TL LDV  + E+  C     IRFLG L+RR V++SPAV++VHKL+EKV+LFA QKQ
Sbjct: 406 YLQNTLLLDVTGKMEVAPCAGPQVIRFLGTLLRRRVKDSPAVRAVHKLREKVKLFASQKQ 465

Query: 444 ETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNQISSFRKAGMETDHWYKVLLKI 503
           E W+  T+ +GKKWLAHGL+KVKESEIKHLA +SS L+QIS FRK GMETDHWYK+L+KI
Sbjct: 466 EAWDVGTIRIGKKWLAHGLRKVKESEIKHLADSSSVLSQISCFRKVGMETDHWYKLLIKI 525

Query: 504 WMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYD 563
           WMQD+ A+AAESEE ILSKY  EP+LP ELRDSF+EFQ+    Y+++ETA+T+ALLPN  
Sbjct: 526 WMQDITAKAAESEEFILSKYIAEPALPKELRDSFHEFQKRANGYVNSETATTLALLPNSS 585

Query: 564 PSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWF 623
            S +   ITEIIAPVN I+KRL RY L+T  GH C +  LILQD   II WF G+ RRW 
Sbjct: 586 SSTE--MITEIIAPVNVIKKRLLRYGLITPAGHSCVNRQLILQDKAHIIYWFSGIVRRWQ 645

Query: 624 RWYNNCSNFSELVLICD-QVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEE 683
           RWY +C NF++L LI   QV KSCIRTLAAK+RIHE E+EK+FD EL+NI S+ ++++E 
Sbjct: 646 RWYGDCKNFADLELIIKFQVWKSCIRTLAAKYRIHEDEVEKRFDLELNNILSTQDIKEER 705

Query: 684 EKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVM 743
           E ++S++   D+DE L YGISYSGLCLLSLARMV+ SRPCNCFV+GC A APSVYTLHVM
Sbjct: 706 ENQASNSLAFDNDEMLTYGISYSGLCLLSLARMVSQSRPCNCFVMGCSAAAPSVYTLHVM 765

Query: 744 ERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK 791
           ERQKFPGWKTGFS+ IHPSLN RR GLC +HL+D Y+G ISLQSIDF +WK
Sbjct: 766 ERQKFPGWKTGFSTCIHPSLNGRRIGLCNQHLKDFYVGDISLQSIDFSSWK 814

BLAST of Cla012109 vs. NCBI nr
Match: gi|659082762|ref|XP_008442019.1| (PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo])

HSP 1 Score: 1444.9 bits (3739), Expect = 0.0e+00
Identity = 719/789 (91.13%), Positives = 751/789 (95.18%), Query Frame = 1

Query: 2   RNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLAS 61
           RN S  Q+VNVC VNSSFVSDIG+C Q VQSS NYSTLA ADDEIDKG+EK KLA NLAS
Sbjct: 15  RNFSNSQSVNVCIVNSSFVSDIGKCFQIVQSSENYSTLARADDEIDKGMEKMKLAMNLAS 74

Query: 62  LIEESLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDC 121
           L+EESLDVDLRRSKT+MELKRSLEIQIK+RVKAQYLNGKFLDLMG VIACP TLQNAYDC
Sbjct: 75  LVEESLDVDLRRSKTRMELKRSLEIQIKERVKAQYLNGKFLDLMGNVIACPNTLQNAYDC 134

Query: 122 VRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQEA 181
           +RINSNVDI S DCLISFESMA+ELS+GNFDVN NTFSILSSRKEVLILPKIKLKVLQEA
Sbjct: 135 IRINSNVDIKSNDCLISFESMAKELSHGNFDVNTNTFSILSSRKEVLILPKIKLKVLQEA 194

Query: 182 IRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKL 241
           IRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAKL
Sbjct: 195 IRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAKL 254

Query: 242 ITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQE 301
           ITVMEDKI+DP+LFAVIRSI++AGALNLEFG FPKGHGLPQEGVLSPIL NIYLNLFDQE
Sbjct: 255 ITVMEDKIEDPKLFAVIRSIHLAGALNLEFGSFPKGHGLPQEGVLSPILTNIYLNLFDQE 314

Query: 302 FFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEI 361
           FFRLSMKYEAINE GNTGQDGSQS+LRSWFRRQLK NS +YPGEEKDKIRVYCCRYMDEI
Sbjct: 315 FFRLSMKYEAINEYGNTGQDGSQSKLRSWFRRQLKENSSDYPGEEKDKIRVYCCRYMDEI 374

Query: 362 FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESP 421
           FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNH+EEMVSC ETHGIRFLGCLVRRSVQESP
Sbjct: 375 FLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHEEEMVSC-ETHGIRFLGCLVRRSVQESP 434

Query: 422 AVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQIS 481
           AVKS+HKLKEKVELF LQKQETW +WTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQIS
Sbjct: 435 AVKSIHKLKEKVELFGLQKQETWKSWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQIS 494

Query: 482 SFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCV 541
           SFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVEPSLP ELRDSFYEFQR V
Sbjct: 495 SFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVEPSLPFELRDSFYEFQRRV 554

Query: 542 KQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLI 601
           ++YIS+ETAST+ALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLI
Sbjct: 555 EEYISSETASTLALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLI 614

Query: 602 LQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEKK 661
           LQDNTQIIDWFLGVSRRWFRWYN  SNFSEL LI DQVRKSCIRTLAAKH+IHESEIEKK
Sbjct: 615 LQDNTQIIDWFLGVSRRWFRWYNKSSNFSELFLIFDQVRKSCIRTLAAKHQIHESEIEKK 674

Query: 662 FDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNC 721
           FDSELS IYSSPE+EQE+E KS+DTH LDHDEAL YGISYSGLCLLSLARMV+ SRPCNC
Sbjct: 675 FDSELSKIYSSPEIEQEKE-KSTDTHVLDHDEALNYGISYSGLCLLSLARMVSRSRPCNC 734

Query: 722 FVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISL 781
           FVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG ISL
Sbjct: 735 FVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRISL 794

Query: 782 QSIDFGAWK 791
           QS+DFGAWK
Sbjct: 795 QSVDFGAWK 801
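
The identity and positives percentages in the HSP header lines can be re-derived from the fractions printed next to them. The following is a minimal sketch, not part of the database output; the names `parse_hsp_stats` and `percent` are hypothetical helpers introduced here for illustration, and the pattern assumes the header layout shown on this page (it accepts either spelling of the "Positives" label).

```python
import re

# Matches lines such as:
#   Identity = 719/789 (91.13%), Positives = 751/789 (95.18%), Query Frame = 1
# "Posi?tives" also accepts the misspelled label "Postives".
HSP_RE = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)"
)


def percent(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to 2 decimals."""
    return round(100.0 * numerator / denominator, 2)


def parse_hsp_stats(line):
    """Extract (matches, aligned length, reported %) for identity and positives."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    id_n, id_d, id_pct, pos_n, pos_d, pos_pct = m.groups()
    return {
        "identity": (int(id_n), int(id_d), float(id_pct)),
        "positives": (int(pos_n), int(pos_d), float(pos_pct)),
    }


if __name__ == "__main__":
    header = "Identity = 719/789 (91.13%), Positives = 751/789 (95.18%), Query Frame = 1"
    stats = parse_hsp_stats(header)
    for key, (n, d, reported) in stats.items():
        # Compare the recomputed percentage with the one printed in the header.
        print(key, n, d, reported, percent(n, d))
```

The denominator is the alignment length of the HSP (including gap columns), not the full query length, which is why it differs between hits.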

BLAST of Cla012109 vs. NCBI nr
Match: gi|778692419|ref|XP_011653460.1| (PREDICTED: uncharacterized protein LOC101219510 [Cucumis sativus])

HSP 1 Score: 1417.1 bits (3667), Expect = 0.0e+00
Identity = 711/790 (90.00%), Positives = 743/790 (94.05%), Query Frame = 1

Query: 1   MRNISILQNVNVCKVNSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLA 60
           MRN S LQ+VNVC  NSSFVSDIG+CVQ VQ S NYSTLA A  EIDKG+E+ KLA NLA
Sbjct: 14  MRNFSNLQSVNVCIFNSSFVSDIGKCVQIVQGSENYSTLARA--EIDKGMERMKLAINLA 73

Query: 61  SLIEESLDVDLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYD 120
           SL+EESLDVDLRRSKTQMELKRSLEI+IK+RVKAQYLNGKFLDLMG VIACP TLQN YD
Sbjct: 74  SLVEESLDVDLRRSKTQMELKRSLEIRIKERVKAQYLNGKFLDLMGNVIACPNTLQNVYD 133

Query: 121 CVRINSNVDIMSTDCLISFESMAEELSNGNFDVNANTFSILSSRKEVLILPKIKLKVLQE 180
           C+RINSNVDI S D LISFESMAEELSNGNFDVN NTFSILSSRKEVLILPKIKLKVLQE
Sbjct: 134 CIRINSNVDIKSNDRLISFESMAEELSNGNFDVNTNTFSILSSRKEVLILPKIKLKVLQE 193

Query: 181 AIRIVLECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAK 240
           AIRIVLECVFRPHFSKISHGCRSGRGHST LKYI+KEIK+PDWWFTVDLSKKMDELVMAK
Sbjct: 194 AIRIVLECVFRPHFSKISHGCRSGRGHSTALKYIKKEIKDPDWWFTVDLSKKMDELVMAK 253

Query: 241 LITVMEDKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQ 300
           LITVMEDKI+DP+LFAVIRSIY+AGALNLEFGGFPKGHGLPQEGVLSPIL NIYLNLFDQ
Sbjct: 254 LITVMEDKIEDPKLFAVIRSIYLAGALNLEFGGFPKGHGLPQEGVLSPILTNIYLNLFDQ 313

Query: 301 EFFRLSMKYEAINENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDE 360
           EFFRLSMKYEAINE GNTGQDGSQSRLRSWFRRQLKGN+ +Y GEEKDKIRVYCCRYMDE
Sbjct: 314 EFFRLSMKYEAINEYGNTGQDGSQSRLRSWFRRQLKGNNSDYSGEEKDKIRVYCCRYMDE 373

Query: 361 IFLAVSGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQES 420
           IFLAVSGSKDVA SFRSEIF F+QKTLHLDVN +EEMVSC ETHGIRFLGCLVRRSVQES
Sbjct: 374 IFLAVSGSKDVAHSFRSEIFYFVQKTLHLDVNREEEMVSC-ETHGIRFLGCLVRRSVQES 433

Query: 421 PAVKSVHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNQI 480
           PAVKS+HKLKEKVELF LQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLN+I
Sbjct: 434 PAVKSIHKLKEKVELFGLQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSSLNKI 493

Query: 481 SSFRKAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRC 540
           SSFRK GMETDHWYKVLLKIWMQDLNARAAESEEKILSK+AVE SLP ELRDSFYEFQR 
Sbjct: 494 SSFRKPGMETDHWYKVLLKIWMQDLNARAAESEEKILSKHAVELSLPFELRDSFYEFQRH 553

Query: 541 VKQYISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFL 600
           VK+YIS+ETAST+ALLPNYDPS KPTFITEIIAPVNSIRKRL RYRLVTNKGHPCSSPFL
Sbjct: 554 VKEYISSETASTLALLPNYDPSAKPTFITEIIAPVNSIRKRLLRYRLVTNKGHPCSSPFL 613

Query: 601 ILQDNTQIIDWFLGVSRRWFRWYNNCSNFSELVLICDQVRKSCIRTLAAKHRIHESEIEK 660
           ILQDNTQIIDWF+GVSRR FRWYNN SNFSEL LI DQVRKSCIRTLAAKHRIHESEIEK
Sbjct: 614 ILQDNTQIIDWFVGVSRRLFRWYNNSSNFSELFLIFDQVRKSCIRTLAAKHRIHESEIEK 673

Query: 661 KFDSELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCN 720
           KFDSELS IYSS E++QE+E KS+DTH LDHDEALKYGISYSGLCLLS ARMV+ SRPCN
Sbjct: 674 KFDSELSKIYSSSEIDQEKE-KSTDTHVLDHDEALKYGISYSGLCLLSFARMVSQSRPCN 733

Query: 721 CFVVGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHIS 780
           CFV+GCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCK+HL DLYLG IS
Sbjct: 734 CFVIGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKQHLADLYLGRIS 793

Query: 781 LQSIDFGAWK 791
           LQS+DFGAWK
Sbjct: 794 LQSVDFGAWK 799

BLAST of Cla012109 vs. NCBI nr
Match: gi|731440172|ref|XP_010646090.1| (PREDICTED: uncharacterized protein LOC100251856 [Vitis vinifera])

HSP 1 Score: 1026.9 bits (2654), Expect = 1.8e-296
Identity = 510/768 (66.41%), Positives = 621/768 (80.86%), Query Frame = 1

Query: 27  VQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSKTQMELKRSLEI 86
           V+R+Q+   YSTL     + DK I K  LA NLA L+EES +  + R   +MELKRS E+
Sbjct: 35  VERMQACAVYSTLGAVSGDADKDIGKPTLAKNLAFLMEESSN-HVIRPMARMELKRSFEL 94

Query: 87  QIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDCLISFESMAEEL 146
           +IKKRVK QY+NGKF DLM KVIA P TL++AY+C+RINSNVD+      ISF+SMAEEL
Sbjct: 95  RIKKRVKEQYVNGKFQDLMVKVIANPQTLEDAYNCIRINSNVDLALDGDNISFKSMAEEL 154

Query: 147 SNGNFDVNANTFSIL--SSRKEVLILPKIKLKVLQEAIRIVLECVFRPHFSKISHGCRSG 206
             G+F+VN NTFSI   S+RKEVLILP +KLKV+QEAIRIVLE V+RP+FSKISHGCRSG
Sbjct: 155 LGGSFNVNVNTFSISTKSARKEVLILPSLKLKVVQEAIRIVLEIVYRPYFSKISHGCRSG 214

Query: 207 RGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPRLFAVIRSIYVA 266
           RGHST LKYI KEI NPDWWF + ++KK+D +V+AKLI+ M+DKI+DP LF +I++++ A
Sbjct: 215 RGHSTALKYISKEISNPDWWFILHVNKKLDAVVLAKLISTMQDKIEDPNLFVMIQNMFHA 274

Query: 267 GALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAINENGNTGQDGSQ 326
             LNLEFGGFPKGHGLPQEGVLSPILMNIYL+LFD EF+R+SM+YEA++       D S 
Sbjct: 275 QVLNLEFGGFPKGHGLPQEGVLSPILMNIYLDLFDHEFYRMSMRYEALDPGMCIDHDKSH 334

Query: 327 SRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVALSFRSEIFDFMQ 386
           S+LRSWFRRQLKGN  +Y G E    RV+ CR+MDEIF A+SGSKD+A+ F+SEI ++MQ
Sbjct: 335 SKLRSWFRRQLKGNDVKYTGRESSNFRVHSCRFMDEIFFAISGSKDIAIEFKSEILNYMQ 394

Query: 387 KTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKVELFALQKQETW 446
            +LHLDV++Q E++ C   HGI+FLG LV+RSV+ESP V++VHKLKEKV LFA QKQE W
Sbjct: 395 NSLHLDVSNQSELLPCHGPHGIQFLGTLVKRSVRESPTVRAVHKLKEKVRLFASQKQEAW 454

Query: 447 NAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNQISSFRKAGMETDHWYKVLLKIWMQ 506
           +A T+ +GKKWLAHGLKKVKESEI+HLA   S L+QIS FRK GMETDHWYK+LLKIW+ 
Sbjct: 455 DAGTLRIGKKWLAHGLKKVKESEIRHLADTDSVLSQISCFRKTGMETDHWYKLLLKIWLH 514

Query: 507 DLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETASTVALLPNYDPSV 566
           D+ A+AAE+E  ILSKY  EP LP ELRDSFYEFQ+  + Y+++ETAS +ALLPN     
Sbjct: 515 DVKAKAAENEGVILSKYIAEPLLPKELRDSFYEFQKRAEDYVASETASMLALLPNSKSCT 574

Query: 567 KPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWFLGVSRRWFRWY 626
           +   I +IIAPVN I+KRL RYRL   KG+PC+SP LILQD+ QI+DWF G++RRW  WY
Sbjct: 575 ESVPIIKIIAPVNVIKKRLLRYRLTNAKGYPCASPMLILQDDIQIVDWFSGLARRWLIWY 634

Query: 627 NNCSNFSEL-VLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYSSPELEQEEEKK 686
           + C NFSE+ ++ICDQ+RKSCIRTLAAK+R+HE+EIEK+ D+EL  I S+ E+EQE+  +
Sbjct: 635 SECDNFSEVKLIICDQLRKSCIRTLAAKYRLHETEIEKRSDTELCRIPSTLEIEQEKVNE 694

Query: 687 SSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAPSVYTLHVMERQ 746
           +SD+   D +EAL YGISYSGLCLLSLARMV+ SR CNCFV+GCLA APSVYTLHVMERQ
Sbjct: 695 TSDSQASDTNEALMYGISYSGLCLLSLARMVSQSRRCNCFVMGCLAAAPSVYTLHVMERQ 754

Query: 747 KFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK 791
           KFPGWKTGFSS IHPSLN RR GLCK+HL+DLYLGHISLQSI+FGAWK
Sbjct: 755 KFPGWKTGFSSCIHPSLNGRRIGLCKQHLKDLYLGHISLQSIEFGAWK 801

BLAST of Cla012109 vs. NCBI nr
Match: gi|657950519|ref|XP_008348277.1| (PREDICTED: uncharacterized protein LOC103411417 [Malus domestica])

HSP 1 Score: 1023.5 bits (2645), Expect = 2.0e-295
Identity = 515/787 (65.44%), Positives = 627/787 (79.67%), Query Frame = 1

Query: 11  NVCKVNSSFVSDIGECVQRVQSSGNYSTLAYAD-DEIDKGIEKKKLATNLASLIEESLDV 70
           N  KV ++  +      +RVQ S ++  +A A  D ++ GI K KLA NLA L+EES  +
Sbjct: 36  NTEKVTTAAXAGYDRASERVQESADHCAVASAAADGVNSGIRKMKLAENLACLVEESSHI 95

Query: 71  DLRRSKTQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVD 130
           + RR K +M+LKR LE++IKKRVK QY+NGKF DLM KVIA P TLQ+AYDC+R+NSNVD
Sbjct: 96  NERRPKGRMQLKRCLELRIKKRVKEQYINGKFRDLMVKVIANPETLQDAYDCIRLNSNVD 155

Query: 131 IMSTDCLISFESMAEELSNGNFDVNANTFSILSSR---KEVLILPKIKLKVLQEAIRIVL 190
           I  +D   SF+SMAEE+ +G+FD NANTFSI S R    EVL+LP + LKV+QEAIR+VL
Sbjct: 156 IALSDAKNSFDSMAEEMRHGSFDANANTFSI-SKRGVGNEVLVLPNLNLKVIQEAIRVVL 215

Query: 191 ECVFRPHFSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVME 250
           E V++P FSKISHG RSGRGHST LKYI KEI NPDWWFTV L+KK+D  ++ +L+  ME
Sbjct: 216 EVVYKPDFSKISHGYRSGRGHSTALKYISKEISNPDWWFTVLLNKKLDACILGELLKAME 275

Query: 251 DKIDDPRLFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLS 310
            KI DP LF +I+S++ A  LNLEFGGFPKGHGLPQEG+LSPILMNIYL+ FD+EF+RLS
Sbjct: 276 GKIVDPSLFDMIKSMFHANVLNLEFGGFPKGHGLPQEGILSPILMNIYLDQFDREFYRLS 335

Query: 311 MKYEAINENGNTGQDGSQSRLRSWFRRQLKGNS-FEYPGEEKDKIRVYCCRYMDEIFLAV 370
           MKYEA++ +    Q+ SQS+LRSWFRR LKGN+     GEE    RV+ CR+MDEIF + 
Sbjct: 336 MKYEALSLDSQNDQN-SQSKLRSWFRRHLKGNNDLGCAGEESCSARVHSCRFMDEIFFSX 395

Query: 371 SGSKDVALSFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKS 430
           SGSKD AL F+SE+ +++QK+LHL+V+ Q E++ C + HGIRFLG LVRR+V ESPA K+
Sbjct: 396 SGSKDAALEFKSEVLNYLQKSLHLEVDDQTELLPCQKPHGIRFLGTLVRRNVIESPATKA 455

Query: 431 VHKLKEKVELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNQISSFR 490
           VHKLKEKV LF LQKQE WN  TV +GKKWL HGLKKVKESEIKHLA +SS LNQIS FR
Sbjct: 456 VHKLKEKVALFGLQKQEAWNVGTVHIGKKWLGHGLKKVKESEIKHLADSSSVLNQISHFR 515

Query: 491 KAGMETDHWYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQY 550
           K GMETDHWYK LLKIWMQD+NA+A ESEE +LSK+  EP+LP EL +SFYEFQR V++Y
Sbjct: 516 KFGMETDHWYKHLLKIWMQDVNAKAEESEEAVLSKHVAEPALPEELTNSFYEFQRQVEKY 575

Query: 551 ISAETASTVALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQD 610
           +S+ET+S +ALLPN   S +   ITEIIAPVN+++KRL RY L T+ G+P +S  L+LQD
Sbjct: 576 VSSETSSILALLPNAGSSAESVVITEIIAPVNAVKKRLQRYGLTTSDGYPRTSSLLVLQD 635

Query: 611 NTQIIDWFLGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTLAAKHRIHESEIEKKFD 670
           N QIIDWF G+ RRW RWY  C NF E+ +LI D VRKSCIRTLAAK+R+HE+EIE++FD
Sbjct: 636 NDQIIDWFSGIVRRWLRWYAECVNFKEVKLLISDLVRKSCIRTLAAKYRVHENEIERRFD 695

Query: 671 SELSNIYSSPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFV 730
           +ELS I S+ E+EQE   ++SDT   ++DEAL YGISYSGLC+LSLARMV+ SRPCNCFV
Sbjct: 696 TELSRIPSTQEIEQEMVDETSDTQAFENDEALMYGISYSGLCVLSLARMVSESRPCNCFV 755

Query: 731 VGCLAPAPSVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQS 790
            GC+A APSVYTLHVMERQKFPGWKTGFSS IHPSLN+RR GLCK+HL+DLYLGH+SLQS
Sbjct: 756 FGCMASAPSVYTLHVMERQKFPGWKTGFSSCIHPSLNRRRIGLCKQHLKDLYLGHVSLQS 815

BLAST of Cla012109 vs. NCBI nr
Match: gi|470142878|ref|XP_004307117.1| (PREDICTED: uncharacterized protein LOC101309387 [Fragaria vesca subsp. vesca])

HSP 1 Score: 1017.7 bits (2630), Expect = 1.1e-293
Identity = 502/779 (64.44%), Positives = 630/779 (80.87%), Query Frame = 1

Query: 16  NSSFVSDIGECVQRVQSSGNYSTLAYADDEIDKGIEKKKLATNLASLIEESLDVDLRRSK 75
           +++ V+       R+Q   ++ST+  A  +I+ G+ + KLA NLA L++ES  ++ RR +
Sbjct: 40  STALVAHYDRASDRIQELADHSTVTTAGHDINNGVHETKLAKNLACLVDESSHINERRPR 99

Query: 76  TQMELKRSLEIQIKKRVKAQYLNGKFLDLMGKVIACPTTLQNAYDCVRINSNVDIMSTDC 135
           ++MELKRS+E++IKKRVK QYLNGKF  LM KVIA P TLQ+AYDC+R+NSN+DI+ TD 
Sbjct: 100 SRMELKRSIELRIKKRVKEQYLNGKFQHLMAKVIATPETLQDAYDCIRLNSNIDIVLTDG 159

Query: 136 LISFESMAEELSNGNFDVNANTFSILS--SRKEVLILPKIKLKVLQEAIRIVLECVFRPH 195
             +F SMAEEL  G+FDVNANTFSI +  +RK+VL+LP + LK++QEAIRIVLE V++PH
Sbjct: 160 KTTFGSMAEELYLGSFDVNANTFSISTKGARKDVLVLPNVNLKIIQEAIRIVLEVVYKPH 219

Query: 196 FSKISHGCRSGRGHSTVLKYIRKEIKNPDWWFTVDLSKKMDELVMAKLITVMEDKIDDPR 255
           FSKISHG RSGRGHST LKYI KE    DWWFT+ ++KK+D  ++AKLI+VME+KI+DP 
Sbjct: 220 FSKISHGYRSGRGHSTALKYISKETAGSDWWFTLLVNKKLDACILAKLISVMEEKIEDPS 279

Query: 256 LFAVIRSIYVAGALNLEFGGFPKGHGLPQEGVLSPILMNIYLNLFDQEFFRLSMKYEAIN 315
           L+ +I+S++ A  LN EFGGFPKGHGLPQEGVLSPILMNIYL+LFD+EF+RLSMKYEA+ 
Sbjct: 280 LYVMIQSMFHANVLNFEFGGFPKGHGLPQEGVLSPILMNIYLDLFDREFYRLSMKYEALV 339

Query: 316 ENGNTGQDGSQSRLRSWFRRQLKGNSFEYPGEEKDKIRVYCCRYMDEIFLAVSGSKDVAL 375
              +T Q  S+S+LRSWFRR LKGN     GEE    RV+ CR+MDEIF + +GSKD AL
Sbjct: 340 PGFHTDQK-SKSKLRSWFRRNLKGNDLGCAGEES--FRVHSCRFMDEIFFSFAGSKDAAL 399

Query: 376 SFRSEIFDFMQKTLHLDVNHQEEMVSCGETHGIRFLGCLVRRSVQESPAVKSVHKLKEKV 435
           +F+SE+ +++QK+LHL+V+ Q E++ C  + GIRFLG L++R+V+ESPA K+VHKLKEKV
Sbjct: 400 NFKSEVLNYVQKSLHLEVDDQTELLPCQMSQGIRFLGTLIKRNVKESPATKAVHKLKEKV 459

Query: 436 ELFALQKQETWNAWTVWLGKKWLAHGLKKVKESEIKHLAKNSS-LNQISSFRKAGMETDH 495
            LF LQKQE W++ TV +GKKWL HGLKKVKESEIKHLA + S L+QIS  RK+GMETDH
Sbjct: 460 VLFGLQKQEAWDSGTVSIGKKWLGHGLKKVKESEIKHLANSRSVLSQISHLRKSGMETDH 519

Query: 496 WYKVLLKIWMQDLNARAAESEEKILSKYAVEPSLPMELRDSFYEFQRCVKQYISAETAST 555
           WYK LLKIWMQD+NA+AAESEE ILSKY  EP+LP ELR+SFYEFQR V++Y+S+ETA+T
Sbjct: 520 WYKYLLKIWMQDVNAKAAESEEAILSKYVSEPALPEELRNSFYEFQRQVEKYVSSETAAT 579

Query: 556 VALLPNYDPSVKPTFITEIIAPVNSIRKRLFRYRLVTNKGHPCSSPFLILQDNTQIIDWF 615
           +ALLPN   S     +TEIIAPV +I+KRL RY L+T  G+P ++  L+LQDN QIIDWF
Sbjct: 580 LALLPNAGSSTDSVIVTEIIAPVIAIKKRLQRYGLITRDGYPRATSLLVLQDNLQIIDWF 639

Query: 616 LGVSRRWFRWYNNCSNFSEL-VLICDQVRKSCIRTLAAKHRIHESEIEKKFDSELSNIYS 675
            G+ RRW RWY  C NF+E+ +LICD VRKSCIRTLA+K+R+HE++IE +FD+ELS+I S
Sbjct: 640 AGIVRRWLRWYAKCDNFNEVKLLICDLVRKSCIRTLASKYRVHEADIENRFDTELSSIPS 699

Query: 676 SPELEQEEEKKSSDTHGLDHDEALKYGISYSGLCLLSLARMVNPSRPCNCFVVGCLAPAP 735
           + E+EQE   ++SD    ++DEAL YGISYSGLCLLSLARMV+ SRPCNCFV+GC APAP
Sbjct: 700 TLEVEQEMVDETSDPQAFENDEALMYGISYSGLCLLSLARMVSQSRPCNCFVIGCTAPAP 759

Query: 736 SVYTLHVMERQKFPGWKTGFSSSIHPSLNKRRFGLCKKHLEDLYLGHISLQSIDFGAWK 791
            VYTLHVMERQKFPGWKTGFSS IHPSLN+RR  LCK+HL++LYLG ISLQSIDFGAWK
Sbjct: 760 CVYTLHVMERQKFPGWKTGFSSCIHPSLNRRRVALCKQHLKNLYLGDISLQSIDFGAWK 815

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
LTRA_LACLC    5.0e-20    27.59       Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris GN=lt...
LTRA_LACLM    5.0e-20    27.59       Group II intron-encoded protein LtrA OS=Lactococcus lactis subsp. cremoris (stra...
YMF40_MARPO   3.0e-17    27.91       Uncharacterized mitochondrial protein ymf40 OS=Marchantia polymorpha GN=YMF40 PE...
YMC6_SCHPO    7.5e-16    28.15       Uncharacterized 91 kDa protein in cob intron OS=Schizosaccharomyces pombe (strai...
AI1M_YEAST    1.1e-14    26.42       Putative COX1/OXI3 intron 1 protein OS=Saccharomyces cerevisiae (strain ATCC 204...

Match Name          E-value     Identity    Description
A0A0A0KWB0_CUCSA    0.0e+00     90.00       Uncharacterized protein OS=Cucumis sativus GN=Csa_4G188380 PE=4 SV=1
F6I1Y2_VITVI        2.4e-295    66.41       Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0174g00300 PE=4 SV=...
W9QLZ5_9ROSA        1.7e-293    64.32       Group II intron-encoded protein ltrA OS=Morus notabilis GN=L484_020694 PE=4 SV=1
A0A061DIX4_THECC    1.4e-287    64.12       Intron maturase isoform 1 OS=Theobroma cacao GN=TCM_001510 PE=4 SV=1
A0A067JY85_JATCU    1.2e-283    63.81       Uncharacterized protein OS=Jatropha curcas GN=JCGZ_25078 PE=4 SV=1

Match Name                          E-value     Identity    Description
gi|659082762|ref|XP_008442019.1|    0.0e+00     91.13       PREDICTED: uncharacterized protein LOC103486008 [Cucumis melo]
gi|778692419|ref|XP_011653460.1|    0.0e+00     90.00       PREDICTED: uncharacterized protein LOC101219510 [Cucumis sativus]
gi|731440172|ref|XP_010646090.1|    1.8e-296    66.41       PREDICTED: uncharacterized protein LOC100251856 [Vitis vinifera]
gi|657950519|ref|XP_008348277.1|    2.0e-295    65.44       PREDICTED: uncharacterized protein LOC103411417 [Malus domestica]
gi|470142878|ref|XP_004307117.1|    1.1e-293    64.44       PREDICTED: uncharacterized protein LOC101309387 [Fragaria vesca subsp. vesca]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR000477    RT_dom
IPR024937    Domain_X
Vocabulary: Biological Process
Term          Definition
GO:0006397    mRNA processing
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0006397 mRNA processing
biological_process GO:0000373 Group II intron splicing
biological_process GO:0006315 homing of group II introns
biological_process GO:0090615 mitochondrial mRNA processing
biological_process GO:1900864 mitochondrial RNA modification
biological_process GO:0007005 mitochondrion organization
biological_process GO:0032885 regulation of polysaccharide biosynthetic process
biological_process GO:0006278 RNA-dependent DNA biosynthetic process
biological_process GO:0009845 seed germination
cellular_component GO:0005575 cellular_component
cellular_component GO:0005739 mitochondrion
molecular_function GO:0003674 molecular_function
molecular_function GO:0003964 RNA-directed DNA polymerase activity
This gene is associated with the following unigenes:
Unigene Name    Analysis Name                            Sequence type in Unigene
WMU44714        watermelon EST collection version 2.0    transcribed_cluster
WMU48772        watermelon EST collection version 2.0    transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
Cla012109       Cla012109.1    mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name    Unique Name    Type
WMU44714        WMU44714       transcribed_cluster
WMU48772        WMU48772       transcribed_cluster


Analysis Name: InterPro Annotations of watermelon (97103)
Date Performed: 2016-09-28
IPR Term     IPR Description                Source     Source Term      Source Description                         Alignment
IPR000477    Reverse transcriptase domain   PFAM       PF00078          RVT_1                                      coord: 170..411, score: 2.0
IPR000477    Reverse transcriptase domain   PROFILE    PS50878          RT_POL                                     coord: 1..413, score: 9
IPR024937    Domain X                       PFAM       PF01348          Intron_maturas2                            coord: 570..680, score: 2.4
None         No IPR available               PANTHER    PTHR33642        FAMILY NOT NAMED                           coord: 33..789, score:
None         No IPR available               PANTHER    PTHR33642:SF3    INTRON MATURASE, TYPE II FAMILY PROTEIN    coord: 33..789, score:
None         No IPR available               unknown    SSF56672         DNA/RNA polymerases                        coord: 171..307, score: 1.63E-11; coord: 348..416, score: 1.63
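
The `coord:` ranges in the InterPro annotations above can be turned into domain lengths with a few lines of code. This is an illustrative sketch only: it assumes the coordinates are 1-based, inclusive positions on the predicted protein (the page does not state this explicitly), and `spans` and `covered_length` are hypothetical helper names.

```python
import re

# Coordinate strings copied from the InterPro annotation table above.
ALIGNMENTS = {
    "PF00078 (RVT_1)": "coord: 170..411",
    "PS50878 (RT_POL)": "coord: 1..413",
    "PF01348 (Intron_maturas2)": "coord: 570..680",
    "SSF56672 (DNA/RNA polymerases)": "coord: 171..307 coord: 348..416",
}

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def spans(text):
    """Return every (start, end) pair found in an alignment string."""
    return [(int(start), int(end)) for start, end in COORD_RE.findall(text)]


def covered_length(text):
    """Sum the lengths of all ranges, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in spans(text))


if __name__ == "__main__":
    for name, text in ALIGNMENTS.items():
        print(name, spans(text), covered_length(text))
```

Under these assumptions the reverse transcriptase domain (PF00078) precedes Domain X (PF01348) along the protein.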