CSPI01G21320 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI01G21320
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ribonuclease H
Location: Chr1: 16872579 .. 16876683 (-)
RNA-Seq Expression: CSPI01G21320
Synteny: CSPI01G21320
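The location field in the overview can be read programmatically. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (common for genome browser records, but not stated on this page), and the helper names `parse_location` and `feature_span` are hypothetical, not part of any database API.

```python
import re

# Pattern for a location string such as "Chr1: 16872579 .. 16876683 (-)".
LOCATION_RE = re.compile(r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)")


def parse_location(text: str) -> tuple[str, int, int, str]:
    """Split a location string into (chromosome, start, end, strand)."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def feature_span(start: int, end: int) -> int:
    """Number of bases covered, ASSUMING 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


chrom, start, end, strand = parse_location("Chr1: 16872579 .. 16876683 (-)")
span = feature_span(start, end)
print(chrom, strand, span)
```

Under that assumption the record spans 4105 bases on the minus strand of Chr1. If the coordinates were 0-based and half-open instead, the span would be one base shorter.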
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAAAGAAAAACCTTTGTTACTCTCAATACAAGTCAAGGTTCGTTGAATGTAAAAAGACATAATGTTATAGTAACGAATCCTAAAAAAGAAGAGCCAGAACACGGGGAAGGTGAAACTTCATGTCATTATATCACCATTATTGAGGGATCAAAGGCTGAAACTCATGAAGATGACGCAAAAGATGTCCCACAGAGTTTAGAGGATGGTGGCCAATCTACTGTAGATGAGCTAAAAGAGGTGAACGTTGGTACAATAGAGGAACCATGCTCAACTTTCATTAGTGCATCCCTCTCTAACGAGGAGGAGGATAAATACATGAGTTTGCTCACCGAATACAGAGACATCTTTGCTTGGTCGTACAAGGAGATGTCAGAACTTGATCCAAAAGTAGCAGTCCATCATCTTGCAATTAAACCAGGGTATCGATCGATTAAACAAGCACAACGACATTTTCAAGTGTATCTTATTCCTCGTATCGAGGTGGAAGTCAACAAGTTGATTGAAGAAGGGTTCATTCGCGAGGTCAATTATCCAACGTGGATAGAAAACATTGTCCCTGTTAGAAAAAAGAACGGGCAACTTCGCGTTTGTGTAGAATTTTGCGACCTGAATAATGCATGCCCTAAAGGTGATTTTCCCTTGCCTATCATAGAAATCATGGTTGATGCAACTACTGGATACAGAGCATTGTCCTTTATGGATGGGTCGTCTGGATATAATCTAACCCTCTCAGATGAAGAAATGATAGCTTTTAGGACTCCAAAGGGAATATACTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTCGGTGCTACTTACCAATGTGTCGTGCATAAAGTGTTTGATGATATGTTGCAAAGTATGTCGAATGTTATGTTGATGACCTTGTAGTCAAATCAAAAAGACAATAAGACCATTTGAAGGATCTAAAAGTTGTGTTCGACCGATTGCGAAAATAACAGCTAAAGATGAACCCTCTCAAGAGTGCGTTTGGTATGACTTCAGAAAAGTTTCTTGGCTTCATTGTAAGGCACCGAGGGATCGAAATAGACCAATCCAAGTTGATTCAGAAGATGCCAAAACCTAAGAGTTTGCATGACCTAAGAAGTCTCCAAGGACGATTGGCTTACATCCAAAGGTTCATCTCTAACATGGCCGGTCGGTGTCAACCTTTTGAAAAGTTGATGAGAAAAGAAAAAAATTTGTATGGGATGAGGCTTGTTAGAATGCTTTTGATAGCATAAAGAAATATTTGCTTACTCCCCTAGTGTTGAGAGCTCCAGTACCTGACAAACCATTAATATTGTACATTGCTGCACAAGATGGGTCCTTAGGAGCATTACTAGTACAAGAGGAGGAAAAGGGAAAGGAGCGTGCTCTCTACTATTTAAGCAGAACATTAATTAGGGCTGAAGTTAACTATTCTCCCATCGAGAACATGTGCCTCACACTTTTCTTTGCCATTGATAAGTTGAGTCATTATATGCAAACCTTCACGATTTATCTAGTGGCAAAAGTATACCTATAAAATATGTTCTGTTCAGGCCAATTATCTCTAGACGCTTAGCCAAATGGGTGGTTCTACTCCAACAATATGACATTGTCAATATTCTCCAAAAGGTGATAAAAGACAAGTGCTAGCAACCTTTTTAGCAGACCACCCAATTCCTTCGGATTGGAAGTTATGTGAAGACCTGTCAGACGATGAGGTTTTCTTCACGCGAACCTTGGACTATGTTTTTTTACGGTGCAACATTAAGAAGTGGTGCAGGGGCTGACATCGTTCTCATTGCTCCTGAGAAGCATATGTTGCCTTATAGCTTTGCATTTGTTGAACTGTGTTCAAATAATGTGGTTGAATATCAGGCTTTGATAATTGGCCTTCAAATGGCATTGGAAATCGAAGTATCATTCATAGAAATTTATGGTGATTCAAAGTTGATAATCAATCATCTTTCGCTTCAGTATGATGTGAAACATGAAAACTTGAAGAC
ATACTTCTCTTATGCTCGAAAATTGATGGAAAAGTTCGTTAGTATGATGCTAGAACACGTCCCTAGAGTAGAAAATAAGAGGGCAGACACAATGACGAATTTAGCCACTGCCTTAATGATGCTAGATGATGTAACTTCGAACATACCACTCTGTCAACGATGAATTATGCCTCCAATTATGTCTGAATGTCAGGAAGTGAACGCGACGACATTCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAACATAGAAAGCTTCCAAAGGATTCCCGTCATAAAACTGAGGTACGAAGAAGAGCTACACGCTTCATTTATTACAAGGGAACCTTTTATCGCCGTTCTCTTCAAGGGTTCTTCCTTCGATGCCTTGGAAAGGAAGAGTTAGTAAAAGCTCTAAAGGAAGTACATGCAGATGTTTGTGGAGCACATCAATCGAGACCAAAACTTCAATGCAAACTAAGAAGAATGAGCTACTATTGGCGTAAGATGATCCAAGACTCAATAGACTATGTAAAGAAGTGTGAAGCTTGTCAATACCATGCAAACTTCATACACCAACCTCTAGAGCCTCTTCATCCAATTGTGGCTTCTTGGTCGTTTGAGGCTTGGAGACTCGATCTGGTTGACTCTATTACACAAAAATCACCAGTAGGGCATGATAGAACTAAGACATGCACAACGGAAAAAGAATCGAATTTCTACCCTAATTGCCATAATTAACATGTAACCCTAAACTAATCAAATTAGGGTTTTAAGAAGTATTACCTTTGAAGCTTTCAAAAGGTTTGATGTCTTCTTACCAATTCAAACCGAGACCACCACTAGCACTCAATTACTATCCTCTGGACAAAGAACCGAGTTGTGGGACCCAATTTGGTGTACAAAGTAATGGAGATAAAAGGGGATTGGAAGGTAGAAATTTTTTTTCTTGGTTGAGATATTGGAGAATGAAAAAAAGGCAAAAATTCTCAACCATTTGAAAACACCCCTATTTATAACCAATTTGTCAAATCTCAACACCTAATGTTCCACTAACAATTAGTGGAACTTAGTGGGCTAGGTGTCTATATCTCATATAGCCACATATCCCACTAAGAGTTAGTGGGGTTATCCAACAAAATGTTGGATTTTCCCAGTAACTTAGTCCAAGGGTAAAATGGTCATTAGATATTTCAAGTCAAAAGTCAAACTTTGACTTTTCTAAGTCAAAAGTTTTTTACCATTTTATCCATCTTGACTAATTCCAACCTCTCGAGCATGAATCCGCATTCATTTTTTCAAAATTTAAATCATATTTGAATATAAGGCCGGTCAAAGTTTGACTTTTCGAAGTCAAAAGTCAACATTTTGACTTTTTTATAATTTTTGACAAATTCCATCAATTTTGAGCTTCTTAGTATGAATCCGCATTCATACTTATAGTATGTAAAACATAAAGCTCTATTTCTAATTAGAAGACCGACGACTATATCACTATATTTGTCGGTTTCCCTTTCTTCTCCCAATTCGAACAATTCGACTTATTTCATCACACTGTTCTAAGTTTAATCCATATGAGCTAGCAGAGGAACCTAATGGACCTATAGATCATGGGCTCCAACGATTCAAGATTAACTAGCTAAACTCTTTTAAACCGAGTTAATCAACATTCGTTAACTAACGGGTCATTCCACTAAAGTCCCGTAGTTGCACTCCCCTCACTATAGATATATTTGTGTCCATTTGATAAAACCATAATCAGTAAGTTAATCCTTCACAGGTTGCTCGTAACCTTGGCTGGGTCAAAATACCGTTTTACCCCCAAGATTACATCTTGCTCCTTAAGTCCCACTAATCCACTATTGAACAATTGGTTTAAGGTTCAACCTATAAACTTAATCCCTCTCGGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGACTTGGATTCAGTGCTTAAGAGAGCAACCTATCTACTAACCCTAAAGCGGGTAGGAGTGAAT
TCCATCTTGTACCCTATGTTCCCAGCTATCCACCCGATCTTACCCCTGAAATGGGAGGCTTATTGGGCCAACGCTGATGAGCTGCCCTCACCTATGCAGATCTAA

mRNA sequence

ATGAAAAGAAAAACCTTTGTTACTCTCAATACAAGTCAAGGTTCGTTGAATGTAAAAAGACATAATGTTATAGTAACGAATCCTAAAAAAGAAGAGCCAGAACACGGGGAAGGTGAAACTTCATGTCATTATATCACCATTATTGAGGGATCAAAGGCTGAAACTCATGAAGATGACGCAAAAGATGTCCCACAGAGTTTAGAGGATGGTGGCCAATCTACTGTAGATGAGCTAAAAGAGGTGAACGTTGGTACAATAGAGGAACCATGCTCAACTTTCATTAGTGCATCCCTCTCTAACGAGGAGGAGGATAAATACATGAGTTTGCTCACCGAATACAGAGACATCTTTGCTTGGTCGTACAAGGAGATGTCAGAACTTGATCCAAAAGTAGCAGTCCATCATCTTGCAATTAAACCAGGGTATCGATCGATTAAACAAGCACAACGACATTTTCAAGTGTATCTTATTCCTCGTATCGAGGTGGAAGTCAACAAGTTGATTGAAGAAGGGTTCATTCGCGAGGTCAATTATCCAACGTGGATAGAAAACATTGTCCCTGTTAGAAAAAAGAACGGGCAACTTCGCGTTTGTGTAGAATTTTGCGACCTGAATAATGCATGCCCTAAAGGTGATTTTCCCTTGCCTATCATAGAAATCATGGTTGATGCAACTACTGGATACAGAGCATTGTCCTTTATGGATGGGTCGTCTGGATATAATCTAACCCTCTCAGATGAAGAAATGATAGCTTTTAGGACTCCAAAGGGAATATACTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTCGGTGCTACTTACCAATGTGTCGTGCATAAAGTGTTTGATGATATGTTGCAAAAAAAGTTTCTTGGCTTCATTGTAAGGCACCGAGGGATCGAAATAGACCAATCCAAGTTGATTCAGAAGATGCCAAAACCTAAGAGTTTGCATGACCTAAGAAGTCTCCAAGGACGATTGGCTTACATCCAAAGGTTCATCTCTAACATGGCCGTGTTGAGAGCTCCAGTACCTGACAAACCATTAATATTGTACATTGCTGCACAAGATGGGTCCTTAGGAGCATTACTAGTACAAGAGGAGGAAAAGGGAAAGGAGCGTGCTCTCTACTATTTAAGCAGAACATTAATTAGGGCTGAAGTTAACTATTCTCCCATCGAGAACATTGGCAAAAGTATACCTATAAAATATGTTCTGTTCAGGCCAATTATCTCTAGACGCTTAGCCAAATGGGTGGTTCTACTCCAACAATATGACATTGTCAATATTCTCCAAAAGACCACCCAATTCCTTCGGATTGGAAGTTATGTGAAGACCTGTCAGACGATGAGGTTTTCTTCACGCGAACCTTGGACTATGTTTTTTTACGGTGCAACATTAAGAAGTGGTGCAGGGGCTGACATCGTTCTCATTGCTCCTGAGAAGCATATGTTGCCTTATAGCTTTGCATTTGTTGAACTGTGTTCAAATAATGTGGTTGAATATCAGGCTTTGATAATTGGCCTTCAAATGGCATTGGAAATCGAAGTATCATTCATAGAAATTTATGGTGATTCAAAGTTGATAATCAATCATCTTTCGCTTCAGTATGATGTGAAACATGAAAACTTGAAGACATACTTCTCTTATGCTCGAAAATTGATGGAAAAGTTCGTTAGTATGATGCTAGAACACGTCCCTAGAGTAGAAAATAAGAGGGCAGACACAATGACGAATTTAGCCACTGCCTTAATGATGCTAGATGATGAAGTGAACGCGACGACATTCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAACATAGAAAGCTTCCAAAGGATTCCCGTCATAAAACTGAGGTACGAAGAAGAGCTACACGCTTCATTTATTACAAGGGAACCTTTTATCGCCGTTCTCTTCAAGGGTTCTTCCTTCGATGCCTTGGAAAGGAAGAGTTAGTAAAAGCTCTAAAGGAAGT
ACATGCAGATGTTTGTGGAGCACATCAATCGAGACCAAAACTTCAATGCAAACTAAGAAGAATGAGCTACTATTGGCGTAAGATGATCCAAGACTCAATAGACTATGTAAAGAAGTGTGAAGCTTGTCAATACCATGCAAACTTCATACACCAACCTCTAGAGCCTCTTCATCCAATTGTGGCTTCTTGGTCGTTTGAGGCTTGGAGACTCGATCTGGTTGACTCTATTACACAAAAATCACCACTTTCAAAAGGTTTGATGTCTTCTTACCAATTCAAACCGAGACCACCACTAGCACTCAATTACTATCCTCTGGACAAAGAACCGAGTTGTGGGACCCAATTTGGTGTACAAAGTAATGGAGATAAAAGGGGATTGGAAGGTTCAACCTATAAACTTAATCCCTCTCGGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGACTTGGATTCAGTGCTTAAGAGAGCAACCTATCTACTAACCCTAAAGCGGGTAGGAGTGAATTCCATCTTGTACCCTATGTTCCCAGCTATCCACCCGATCTTACCCCTGAAATGGGAGGCTTATTGGGCCAACGCTGATGAGCTGCCCTCACCTATGCAGATCTAA

Coding sequence (CDS)

ATGAAAAGAAAAACCTTTGTTACTCTCAATACAAGTCAAGGTTCGTTGAATGTAAAAAGACATAATGTTATAGTAACGAATCCTAAAAAAGAAGAGCCAGAACACGGGGAAGGTGAAACTTCATGTCATTATATCACCATTATTGAGGGATCAAAGGCTGAAACTCATGAAGATGACGCAAAAGATGTCCCACAGAGTTTAGAGGATGGTGGCCAATCTACTGTAGATGAGCTAAAAGAGGTGAACGTTGGTACAATAGAGGAACCATGCTCAACTTTCATTAGTGCATCCCTCTCTAACGAGGAGGAGGATAAATACATGAGTTTGCTCACCGAATACAGAGACATCTTTGCTTGGTCGTACAAGGAGATGTCAGAACTTGATCCAAAAGTAGCAGTCCATCATCTTGCAATTAAACCAGGGTATCGATCGATTAAACAAGCACAACGACATTTTCAAGTGTATCTTATTCCTCGTATCGAGGTGGAAGTCAACAAGTTGATTGAAGAAGGGTTCATTCGCGAGGTCAATTATCCAACGTGGATAGAAAACATTGTCCCTGTTAGAAAAAAGAACGGGCAACTTCGCGTTTGTGTAGAATTTTGCGACCTGAATAATGCATGCCCTAAAGGTGATTTTCCCTTGCCTATCATAGAAATCATGGTTGATGCAACTACTGGATACAGAGCATTGTCCTTTATGGATGGGTCGTCTGGATATAATCTAACCCTCTCAGATGAAGAAATGATAGCTTTTAGGACTCCAAAGGGAATATACTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTCGGTGCTACTTACCAATGTGTCGTGCATAAAGTGTTTGATGATATGTTGCAAAAAAAGTTTCTTGGCTTCATTGTAAGGCACCGAGGGATCGAAATAGACCAATCCAAGTTGATTCAGAAGATGCCAAAACCTAAGAGTTTGCATGACCTAAGAAGTCTCCAAGGACGATTGGCTTACATCCAAAGGTTCATCTCTAACATGGCCGTGTTGAGAGCTCCAGTACCTGACAAACCATTAATATTGTACATTGCTGCACAAGATGGGTCCTTAGGAGCATTACTAGTACAAGAGGAGGAAAAGGGAAAGGAGCGTGCTCTCTACTATTTAAGCAGAACATTAATTAGGGCTGAAGTTAACTATTCTCCCATCGAGAACATTGGCAAAAGTATACCTATAAAATATGTTCTGTTCAGGCCAATTATCTCTAGACGCTTAGCCAAATGGGTGGTTCTACTCCAACAATATGACATTGTCAATATTCTCCAAAAGACCACCCAATTCCTTCGGATTGGAAGTTATGTGAAGACCTGTCAGACGATGAGGTTTTCTTCACGCGAACCTTGGACTATGTTTTTTTACGGTGCAACATTAAGAAGTGGTGCAGGGGCTGACATCGTTCTCATTGCTCCTGAGAAGCATATGTTGCCTTATAGCTTTGCATTTGTTGAACTGTGTTCAAATAATGTGGTTGAATATCAGGCTTTGATAATTGGCCTTCAAATGGCATTGGAAATCGAAGTATCATTCATAGAAATTTATGGTGATTCAAAGTTGATAATCAATCATCTTTCGCTTCAGTATGATGTGAAACATGAAAACTTGAAGACATACTTCTCTTATGCTCGAAAATTGATGGAAAAGTTCGTTAGTATGATGCTAGAACACGTCCCTAGAGTAGAAAATAAGAGGGCAGACACAATGACGAATTTAGCCACTGCCTTAATGATGCTAGATGATGAAGTGAACGCGACGACATTCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAACATAGAAAGCTTCCAAAGGATTCCCGTCATAAAACTGAGGTACGAAGAAGAGCTACACGCTTCATTTATTACAAGGGAACCTTTTATCGCCGTTCTCTTCAAGGGTTCTTCCTTCGATGCCTTGGAAAGGAAGAGTTAGTAAAAGCTCTAAAGGAAGT
ACATGCAGATGTTTGTGGAGCACATCAATCGAGACCAAAACTTCAATGCAAACTAAGAAGAATGAGCTACTATTGGCGTAAGATGATCCAAGACTCAATAGACTATGTAAAGAAGTGTGAAGCTTGTCAATACCATGCAAACTTCATACACCAACCTCTAGAGCCTCTTCATCCAATTGTGGCTTCTTGGTCGTTTGAGGCTTGGAGACTCGATCTGGTTGACTCTATTACACAAAAATCACCACTTTCAAAAGGTTTGATGTCTTCTTACCAATTCAAACCGAGACCACCACTAGCACTCAATTACTATCCTCTGGACAAAGAACCGAGTTGTGGGACCCAATTTGGTGTACAAAGTAATGGAGATAAAAGGGGATTGGAAGGTTCAACCTATAAACTTAATCCCTCTCGGGCCAATGAGAGGGTGGGGCCCCTTGTTCAAGACTTGGATTCAGTGCTTAAGAGAGCAACCTATCTACTAACCCTAAAGCGGGTAGGAGTGAATTCCATCTTGTACCCTATGTTCCCAGCTATCCACCCGATCTTACCCCTGAAATGGGAGGCTTATTGGGCCAACGCTGATGAGCTGCCCTCACCTATGCAGATCTAA

Protein sequence

MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDAKDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWSYKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYNLTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQKKFLGFIVRHRGIEIDQSKLIQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEVNYSPIENIGKSIPIKYVLFRPIISRRLAKWVVLLQQYDIVNILQKTTQFLRIGSYVKTCQTMRFSSREPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDDEVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEVHADVCGAHQSRPKLQCKLRRMSYYWRKMIQDSIDYVKKCEACQYHANFIHQPLEPLHPIVASWSFEAWRLDLVDSITQKSPLSKGLMSSYQFKPRPPLALNYYPLDKEPSCGTQFGVQSNGDKRGLEGSTYKLNPSRANERVGPLVQDLDSVLKRATYLLTLKRVGVNSILYPMFPAIHPILPLKWEAYWANADELPSPMQI*
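The relationship between the coding sequence and the protein sequence can be spot-checked with a short translation routine. This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first and last few codons of the CDS are used as input rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First five codons and last five codons of the CDS shown above.
print(translate("ATGAAAAGAAAAACC"))  # start of the CDS
print(translate("CCTATGCAGATCTAA"))  # end of the CDS, including the stop codon
```

The two fragments translate to `MKRKT` and `PMQI*`, which agree with the beginning and end of the protein sequence listed above.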
Homology
BLAST of CSPI01G21320 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 82.0 bits (201), Expect = 3.6e-14
Identity = 71/264 (26.89%), Positives = 106/264 (40.15%), Query Frame = 0

Query: 134 HHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNG 193
           H + IKPG R  +    H        I   V KL++  FI     P     +V V KK+G
Sbjct: 586 HDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPC-SSPVVLVPKKDG 645

Query: 194 QLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMI 253
             R+CV++  LN A     FPLP I+ ++      +  + +D  SGY+   +   D    
Sbjct: 646 TFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKT 705

Query: 254 AFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVF----------DDML------------- 313
           AF TP G Y Y VMPFGL N  +T+   +   F          DD+L             
Sbjct: 706 AFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKHL 765

Query: 314 ------------------------QKKFLGFIVRHRGIEIDQSK--LIQKMPKPKSLHDL 346
                                   + +FLG+ +  + I   Q K   I+  P PK++   
Sbjct: 766 DTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQA 825
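The percentages reported in an HSP summary line are the match counts divided by the alignment length. The sketch below reproduces them from the count fractions; the helper name `fraction_to_percent` is hypothetical, and rounding to two decimals is an assumption chosen to match how the values are displayed on this page.

```python
def fraction_to_percent(fraction: str) -> float:
    """Convert a 'matches/length' string such as '71/264' to a percentage."""
    matches, length = (int(part) for part in fraction.split("/"))
    if length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100 * matches / length, 2)


# Counts taken from the HSP summary of the hit above (alignment length 264).
identity = fraction_to_percent("71/264")
positives = fraction_to_percent("106/264")
print(identity, positives)
```

For this hit the counts give 26.89% identity and 40.15% positives, the same values shown in the summary line.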

BLAST of CSPI01G21320 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 82.0 bits (201), Expect = 3.6e-14
Identity = 71/264 (26.89%), Positives = 106/264 (40.15%), Query Frame = 0

Query: 134 HHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPTWIENIVPVRKKNG 193
           H + IKPG R  +    H        I   V KL++  FI     P     +V V KK+G
Sbjct: 612 HDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSKSPC-SSPVVLVPKKDG 671

Query: 194 QLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGYN---LTLSDEEMI 253
             R+CV++  LN A     FPLP I+ ++      +  + +D  SGY+   +   D    
Sbjct: 672 TFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHSGYHQIPMEPKDRYKT 731

Query: 254 AFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVF----------DDML------------- 313
           AF TP G Y Y VMPFGL N  +T+   +   F          DD+L             
Sbjct: 732 AFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDLRFVNVYLDDILIFSESPEEHWKHL 791

Query: 314 ------------------------QKKFLGFIVRHRGIEIDQSK--LIQKMPKPKSLHDL 346
                                   + +FLG+ +  + I   Q K   I+  P PK++   
Sbjct: 792 DTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQHKCAAIRDFPTPKTVKQA 851

BLAST of CSPI01G21320 vs. ExPASy Swiss-Prot
Match: P0CT41 (Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-12 PE=3 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 8.8e-13
Identity = 60/240 (25.00%), Positives = 102/240 (42.50%), Query Frame = 0

Query: 163 EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIE 222
           E+N+ ++ G IRE    +   N  PV    KK G LR+ V++  LN       +PLP+IE
Sbjct: 431 EINQGLKSGIIRE----SKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIE 490

Query: 223 IMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQ 282
            ++    G    + +D  S Y+L      DE  +AFR P+G++ Y VMP+G+    A +Q
Sbjct: 491 QLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQ 550

Query: 283 CVVHKV------------FDDML------------------------------------- 342
             ++ +             DD+L                                     
Sbjct: 551 YFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQS 610

Query: 343 QKKFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV 346
           Q KF+G+ +  +G    Q  +  + +  +PK+  +LR   G + Y+++FI   + L  P+
Sbjct: 611 QVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPL 666

BLAST of CSPI01G21320 vs. ExPASy Swiss-Prot
Match: P0CT34 (Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-1 PE=3 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 8.8e-13
Identity = 60/240 (25.00%), Positives = 102/240 (42.50%), Query Frame = 0

Query: 163 EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIE 222
           E+N+ ++ G IRE    +   N  PV    KK G LR+ V++  LN       +PLP+IE
Sbjct: 431 EINQGLKSGIIRE----SKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIE 490

Query: 223 IMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQ 282
            ++    G    + +D  S Y+L      DE  +AFR P+G++ Y VMP+G+    A +Q
Sbjct: 491 QLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQ 550

Query: 283 CVVHKV------------FDDML------------------------------------- 342
             ++ +             DD+L                                     
Sbjct: 551 YFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQS 610

Query: 343 QKKFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV 346
           Q KF+G+ +  +G    Q  +  + +  +PK+  +LR   G + Y+++FI   + L  P+
Sbjct: 611 QVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPL 666

BLAST of CSPI01G21320 vs. ExPASy Swiss-Prot
Match: P0CT35 (Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843) OX=284812 GN=Tf2-2 PE=3 SV=1)

HSP 1 Score: 77.4 bits (189), Expect = 8.8e-13
Identity = 60/240 (25.00%), Positives = 102/240 (42.50%), Query Frame = 0

Query: 163 EVNKLIEEGFIREVNYPTWIENIVPVR---KKNGQLRVCVEFCDLNNACPKGDFPLPIIE 222
           E+N+ ++ G IRE    +   N  PV    KK G LR+ V++  LN       +PLP+IE
Sbjct: 431 EINQGLKSGIIRE----SKAINACPVMFVPKKEGTLRMVVDYKPLNKYVKPNIYPLPLIE 490

Query: 223 IMVDATTGYRALSFMDGSSGYNL---TLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQ 282
            ++    G    + +D  S Y+L      DE  +AFR P+G++ Y VMP+G+    A +Q
Sbjct: 491 QLLAKIQGSTIFTKLDLKSAYHLIRVRKGDEHKLAFRCPRGVFEYLVMPYGISTAPAHFQ 550

Query: 283 CVVHKV------------FDDML------------------------------------- 342
             ++ +             DD+L                                     
Sbjct: 551 YFINTILGEAKESHVVCYMDDILIHSKSESEHVKHVKDVLQKLKNANLIINQAKCEFHQS 610

Query: 343 QKKFLGFIVRHRGIEIDQSKL--IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPV 346
           Q KF+G+ +  +G    Q  +  + +  +PK+  +LR   G + Y+++FI   + L  P+
Sbjct: 611 QVKFIGYHISEKGFTPCQENIDKVLQWKQPKNRKELRQFLGSVNYLRKFIPKTSQLTHPL 666

BLAST of CSPI01G21320 vs. ExPASy TrEMBL
Match: A0A5D3DB95 (RNase H domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold284G00330 PE=4 SV=1)

HSP 1 Score: 1010.4 bits (2611), Expect = 4.6e-291
Identity = 547/848 (64.50%), Positives = 604/848 (71.23%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE+ + GEGE S H+ITI+E  + ET E+D 
Sbjct: 197  MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEDSKQGEGEISFHHITILEELEIETPEEDV 256

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            +DVPQSLEDGGQS VDELKEVN+GTIEEP  TFISASLS+EEE KYMSLLTEY+DIFAWS
Sbjct: 257  EDVPQSLEDGGQSIVDELKEVNLGTIEEPRPTFISASLSSEEEGKYMSLLTEYKDIFAWS 316

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVA+HHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE  FIR+V YPT
Sbjct: 317  YKEMPRLDPKVAIHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEARFIRQVKYPT 376

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NI+PV+KKNGQLRVCV FCDLNNACPK +FPL I EI+VDATTG+  LSFMDGSSGY
Sbjct: 377  WIANIIPVKKKNGQLRVCVGFCDLNNACPKDEFPLSITEIIVDATTGHEVLSFMDGSSGY 436

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKVMPFGLKN+ ATYQ  + KVF DML K        
Sbjct: 437  NQIQMVLSDEEMTAFRTPKGIYCYKVMPFGLKNISATYQRAMQKVFGDMLHKYVECYIDD 496

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KF+ FIVRHRGIEIDQSK+
Sbjct: 497  LVVKSKRRQDHLKDLQVVFDRLRKYQLRMNPLECVFSVTSGKFIDFIVRHRGIEIDQSKI 556

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVPDKPLILYIAAQDGSLGALLV 420
              IQKMP+PKSLHDLRSLQGRLAYI+R                          SLGALL 
Sbjct: 557  DVIQKMPRPKSLHDLRSLQGRLAYIRR--------------------------SLGALLA 616

Query: 421  QEEEKGKERALYYLSRTLIRAEVNYSPIEN-----------------------IGKSIPI 480
            QE+EKGKE ALYYLSRTL+ AEVNYSPIE                        + K+ PI
Sbjct: 617  QEKEKGKEHALYYLSRTLVGAEVNYSPIEKMCLALFFAIDKLRHYMQVFTVHLVAKANPI 676

Query: 481  KYVLFRPIISRRLAKWVVLLQQYDIVNILQKT------TQFLR---IGSYVKTCQTMRFS 540
            KYVL RPIISRRLAKW V++QQYDIV I QK         FL    I S +K C+ +   
Sbjct: 677  KYVLSRPIISRRLAKWAVIVQQYDIVYIFQKAIKDQALADFLADHPIPSDLKLCKDLPDD 736

Query: 541  S------REPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALII 600
                    EPWTM+F GA   SGAG  IVLI+PEKHMLPYS A  ELCSNNV EYQALII
Sbjct: 737  EVFFTEVVEPWTMYFDGAARMSGAGPGIVLISPEKHMLPYSSALAELCSNNVAEYQALII 796

Query: 601  GLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVP 660
            GLQM LEI VSFIEIYGDSKLIIN LSLQ DVKHE+LK YF+YAR+LME+F S+MLEHVP
Sbjct: 797  GLQMVLEIGVSFIEIYGDSKLIINQLSLQDDVKHEDLKPYFTYARQLMERFDSVMLEHVP 856

Query: 661  RVENKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQ 720
            R ENKRAD + NLAT LMM ++                     + N TT HLIDEEDW Q
Sbjct: 857  RTENKRADALANLATTLMMPNNVALNIPLCQQWIMPPLLPECQKANVTTSHLIDEEDWHQ 916

Query: 721  PIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEV 736
            PIIEYLEHRKL KDS HKTEVRRR   FIYYKGT YRRSL+G FLRCLGKEE +KAL+E 
Sbjct: 917  PIIEYLEHRKLSKDSCHKTEVRRRDAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEA 976

BLAST of CSPI01G21320 vs. ExPASy TrEMBL
Match: A0A5A7SPY4 (RNase H domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold43053G00550 PE=4 SV=1)

HSP 1 Score: 1010.4 bits (2611), Expect = 4.6e-291
Identity = 547/848 (64.50%), Positives = 604/848 (71.23%), Query Frame = 0

Query: 1   MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
           MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE+ + GEGE S H+ITI+E  + ET E+D 
Sbjct: 160 MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEDSKQGEGEISFHHITILEELEIETPEEDV 219

Query: 61  KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
           +DVPQSLEDGGQS VDELKEVN+GTIEEP  TFISASLS+EEE KYMSLLTEY+DIFAWS
Sbjct: 220 EDVPQSLEDGGQSIVDELKEVNLGTIEEPRPTFISASLSSEEEGKYMSLLTEYKDIFAWS 279

Query: 121 YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
           YKEM  LDPKVA+HHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE  FIR+V YPT
Sbjct: 280 YKEMPRLDPKVAIHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEARFIRQVKYPT 339

Query: 181 WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
           WI NI+PV+KKNGQLRVCV FCDLNNACPK +FPL I EI+VDATTG+  LSFMDGSSGY
Sbjct: 340 WIANIIPVKKKNGQLRVCVGFCDLNNACPKDEFPLSITEIIVDATTGHEVLSFMDGSSGY 399

Query: 241 N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
           N   + LSDEEM AFRTPKGIYCYKVMPFGLKN+ ATYQ  + KVF DML K        
Sbjct: 400 NQIQMVLSDEEMTAFRTPKGIYCYKVMPFGLKNISATYQRAMQKVFGDMLHKYVECYIDD 459

Query: 301 -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                    KF+ FIVRHRGIEIDQSK+
Sbjct: 460 LVVKSKRRQDHLKDLQVVFDRLRKYQLRMNPLECVFSVTSGKFIDFIVRHRGIEIDQSKI 519

Query: 361 --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMAVLRAPVPDKPLILYIAAQDGSLGALLV 420
             IQKMP+PKSLHDLRSLQGRLAYI+R                          SLGALL 
Sbjct: 520 DVIQKMPRPKSLHDLRSLQGRLAYIRR--------------------------SLGALLA 579

Query: 421 QEEEKGKERALYYLSRTLIRAEVNYSPIEN-----------------------IGKSIPI 480
           QE+EKGKE ALYYLSRTL+ AEVNYSPIE                        + K+ PI
Sbjct: 580 QEKEKGKEHALYYLSRTLVGAEVNYSPIEKMCLALFFAIDKLRHYMQVFTVHLVAKANPI 639

Query: 481 KYVLFRPIISRRLAKWVVLLQQYDIVNILQKT------TQFLR---IGSYVKTCQTMRFS 540
           KYVL RPIISRRLAKW V++QQYDIV I QK         FL    I S +K C+ +   
Sbjct: 640 KYVLSRPIISRRLAKWAVIVQQYDIVYIFQKAIKDQALADFLADHPIPSDLKLCKDLPDD 699

Query: 541 S------REPWTMFFYGATLRSGAGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALII 600
                   EPWTM+F GA   SGAG  IVLI+PEKHMLPYS A  ELCSNNV EYQALII
Sbjct: 700 EVFFTEVVEPWTMYFDGAARMSGAGPGIVLISPEKHMLPYSSALAELCSNNVAEYQALII 759

Query: 601 GLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVP 660
           GLQM LEI VSFIEIYGDSKLIIN LSLQ DVKHE+LK YF+YAR+LME+F S+MLEHVP
Sbjct: 760 GLQMVLEIGVSFIEIYGDSKLIINQLSLQDDVKHEDLKPYFTYARQLMERFDSVMLEHVP 819

Query: 661 RVENKRADTMTNLATALMMLDD---------------------EVNATTFHLIDEEDWRQ 720
           R ENKRAD + NLAT LMM ++                     + N TT HLIDEEDW Q
Sbjct: 820 RTENKRADALANLATTLMMPNNVALNIPLCQQWIMPPLLPECQKANVTTSHLIDEEDWHQ 879

Query: 721 PIIEYLEHRKLPKDSRHKTEVRRRATRFIYYKGTFYRRSLQGFFLRCLGKEELVKALKEV 736
           PIIEYLEHRKL KDS HKTEVRRR   FIYYKGT YRRSL+G FLRCLGKEE +KAL+E 
Sbjct: 880 PIIEYLEHRKLSKDSCHKTEVRRRDAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEA 939

BLAST of CSPI01G21320 vs. ExPASy TrEMBL
Match: A0A5A7SPV8 (Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold9G00010 PE=4 SV=1)

HSP 1 Score: 971.8 bits (2511), Expect = 1.8e-279
Identity = 542/897 (60.42%), Positives = 599/897 (66.78%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQ                      GEGE SCH+ITI+E  + ET E+DA
Sbjct: 212  MKRKTFVTLNTSQ----------------------GEGEISCHHITILEKLEIETPEEDA 271

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            +D PQSLEDGGQS VDELKE+N+                            EY+DIFAWS
Sbjct: 272  EDAPQSLEDGGQSIVDELKEINL----------------------------EYKDIFAWS 331

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+I+VEVNKLIE GFIREV YPT
Sbjct: 332  YKEMPGLDPKVAVHHLAIKPGYRLIKQAQRRFRPELIPQIQVEVNKLIEAGFIREVKYPT 391

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI EIMVDATTG+  LSFMDGSSGY
Sbjct: 392  WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPITEIMVDATTGHETLSFMDGSSGY 451

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKV+PFGLKN GATYQ  + KVFDDML K        
Sbjct: 452  NQIRMALSDEEMTAFRTPKGIYCYKVIPFGLKNAGATYQRAMQKVFDDMLHKYVECYVDD 511

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIEIDQSK+
Sbjct: 512  LVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFSVTSGKFLGFIVRHRGIEIDQSKI 571

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKMP+PKSLHDLRSLQGRLAYI+RFISN+A                           
Sbjct: 572  DAIQKMPRPKSLHDLRSLQGRLAYIRRFISNLAGRCQPFQKLMRKGENFVWDEACQNAFD 631

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL AP+P +PLILYIAAQ+ SLGALL QE+EKGKE ALYYLSRTL+ AEV
Sbjct: 632  SIKKYLLNPPVLGAPIPGEPLILYIAAQERSLGALLAQEKEKGKEHALYYLSRTLVGAEV 691

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K  PIKYVL RPIIS  LAKW V+LQQY
Sbjct: 692  NYSPIEKMCLALFFAIDKLRHYMQAFTVHLVAKPDPIKYVLSRPIISGHLAKWAVILQQY 751

Query: 541  DIVNILQKTTQFLRIGSYV---------KTCQTMR----FSSR--EPWTMFFYGATLRSG 600
            DIV I QKT +   + +++         K C+ +     F ++  EPWTM+F GA  RSG
Sbjct: 752  DIVYISQKTIKGQALANFLADHPIPSDWKLCEYLPDDEVFFTKMVEPWTMYFDGAARRSG 811

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AGA IVLI+ E+HMLPYSF   ELC NNV EYQALIIGLQMALEI VSFI+IYGDSKLII
Sbjct: 812  AGAGIVLISSEQHMLPYSFMLAELCLNNVAEYQALIIGLQMALEIRVSFIKIYGDSKLII 871

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LSLQYDVKHE+LK YF+YAR+LME+F S+ML+HVPR ENKRAD + NLATALMM D+ 
Sbjct: 872  NQLSLQYDVKHEDLKPYFTYARQLMERFDSVMLKHVPRTENKRADALANLATALMMPDNV 931

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                E N T  HLI+EEDW QPIIEYLEH KLPKDSRHKTEVRR
Sbjct: 932  ALNIPLCQKWIMPPLLLECQEANITKSHLINEEDWHQPIIEYLEHGKLPKDSRHKTEVRR 991

BLAST of CSPI01G21320 vs. ExPASy TrEMBL
Match: A0A5A7TZU9 (Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G00940 PE=4 SV=1)

HSP 1 Score: 944.5 bits (2440), Expect = 3.1e-271
Identity = 507/897 (56.52%), Positives = 595/897 (66.33%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRK FV++NT +GSL VKRH+V+ T P+  EPE       C+++TI E S  +  E+DA
Sbjct: 1221 MKRKMFVSVNT-EGSLKVKRHDVVFTRPEDNEPEDEPDVAGCYHVTIEETSDHDIFEEDA 1280

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            +  P SLEDGGQST+DELKEVN+GT EEP  TFIS  LS+ +E++Y++LL  Y+D+FAWS
Sbjct: 1281 EAAPLSLEDGGQSTIDELKEVNLGTKEEPRPTFISTQLSDNDENEYVNLLKAYKDVFAWS 1340

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVH LAIKP +R +KQAQR F+  LI +IE EVNKLIE GFIREV YPT
Sbjct: 1341 YKEMPGLDPKVAVHRLAIKPEHRPVKQAQRRFRPELISQIEEEVNKLIEAGFIREVKYPT 1400

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI+EIM+DAT G+ ALSFMDGSSGY
Sbjct: 1401 WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPIMEIMIDATAGHEALSFMDGSSGY 1460

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + L DEE  AFRTPKGIYCYKVMPFGLKN GATYQ  + ++FDDML K        
Sbjct: 1461 NQIRMALEDEEKTAFRTPKGIYCYKVMPFGLKNAGATYQRAMQRIFDDMLHKHVECYVDD 1520

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIE+D SK+
Sbjct: 1521 LVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEVDHSKI 1580

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKMP PK+LH+LR LQGRLAYI+RFISN+A                           
Sbjct: 1581 DAIQKMPSPKNLHELRRLQGRLAYIRRFISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFD 1640

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL AP   KPLILYIAAQ+ SLGALL QE +KGKE ALYYLSRTL  AE+
Sbjct: 1641 SIKKYLLNPPVLSAPATGKPLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAEL 1700

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ P+KY+L RP+IS RLAKW ++LQQY
Sbjct: 1701 NYSPIEKMCLALFFAIDKLRHYMQAFTIHLVAKADPVKYILSRPVISGRLAKWAIILQQY 1760

Query: 541  DIVNILQKTTQFLRIGSYV---------KTC------QTMRFSSREPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C      + +   S EPW MFF GA  RSG
Sbjct: 1761 DIVYIPQKAVKGQALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFDGAARRSG 1820

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AG  IV I+PEKHMLPYSF   ELCSNNV EYQA IIGLQMA E  +  IEI+GDSKLII
Sbjct: 1821 AGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGIKCIEIFGDSKLII 1880

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LS QY+VKH++LK YFSYAR+LM++F S++LEH+PR ENK+AD + NLATAL + +D 
Sbjct: 1881 NQLSYQYEVKHQDLKPYFSYARRLMDRFDSIILEHIPRSENKKADALANLATALTVSEDI 1940

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                E +  + + IDEEDWRQPII+YLEH KLP D RH+ E+RR
Sbjct: 1941 PINISLCQKWIVPSIESQYEEADVISVYAIDEEDWRQPIIDYLEHGKLPTDPRHRAEIRR 2000

BLAST of CSPI01G21320 vs. ExPASy TrEMBL
Match: A0A5D3D1E5 (Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G004020 PE=4 SV=1)

HSP 1 Score: 944.5 bits (2440), Expect = 3.1e-271
Identity = 507/897 (56.52%), Positives = 595/897 (66.33%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRK FV++NT +GSL VKRH+V+ T P+  EPE       C+++TI E S  +  E+DA
Sbjct: 1151 MKRKMFVSVNT-EGSLKVKRHDVVFTRPEDNEPEDEPDVAGCYHVTIEETSDHDIFEEDA 1210

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            +  P SLEDGGQST+DELKEVN+GT EEP  TFIS  LS+ +E++Y++LL  Y+D+FAWS
Sbjct: 1211 EAAPLSLEDGGQSTIDELKEVNLGTKEEPRPTFISTQLSDNDENEYVNLLKAYKDVFAWS 1270

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVH LAIKP +R +KQAQR F+  LI +IE EVNKLIE GFIREV YPT
Sbjct: 1271 YKEMPGLDPKVAVHRLAIKPEHRPVKQAQRRFRPELISQIEEEVNKLIEAGFIREVKYPT 1330

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI+EIM+DAT G+ ALSFMDGSSGY
Sbjct: 1331 WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPIMEIMIDATAGHEALSFMDGSSGY 1390

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + L DEE  AFRTPKGIYCYKVMPFGLKN GATYQ  + ++FDDML K        
Sbjct: 1391 NQIRMALEDEEKTAFRTPKGIYCYKVMPFGLKNAGATYQRAMQRIFDDMLHKHVECYVDD 1450

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIE+D SK+
Sbjct: 1451 LVVKSKKKCDHLKDLKLVLDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEVDHSKI 1510

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKMP PK+LH+LR LQGRLAYI+RFISN+A                           
Sbjct: 1511 DAIQKMPSPKNLHELRRLQGRLAYIRRFISNLAGRCQPFQRLMRKDAVFDWDQSCQNAFD 1570

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL AP   KPLILYIAAQ+ SLGALL QE +KGKE ALYYLSRTL  AE+
Sbjct: 1571 SIKKYLLNPPVLSAPATGKPLILYIAAQETSLGALLAQENDKGKECALYYLSRTLTGAEL 1630

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ P+KY+L RP+IS RLAKW ++LQQY
Sbjct: 1631 NYSPIEKMCLALFFAIDKLRHYMQAFTIHLVAKADPVKYILSRPVISGRLAKWAIILQQY 1690

Query: 541  DIVNILQKTTQFLRIGSYV---------KTC------QTMRFSSREPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C      + +   S EPW MFF GA  RSG
Sbjct: 1691 DIVYIPQKAVKGQALADFLADHPVPSNWKLCDDLPDEEVLFVESMEPWIMFFDGAARRSG 1750

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AG  IV I+PEKHMLPYSF   ELCSNNV EYQA IIGLQMA E  +  IEI+GDSKLII
Sbjct: 1751 AGVGIVFISPEKHMLPYSFTLGELCSNNVAEYQAFIIGLQMASEFGIKCIEIFGDSKLII 1810

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LS QY+VKH++LK YFSYAR+LM++F S++LEH+PR ENK+AD + NLATAL + +D 
Sbjct: 1811 NQLSYQYEVKHQDLKPYFSYARRLMDRFDSIILEHIPRSENKKADALANLATALTVSEDI 1870

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                E +  + + IDEEDWRQPII+YLEH KLP D RH+ E+RR
Sbjct: 1871 PINISLCQKWIVPSIESQYEEADVISVYAIDEEDWRQPIIDYLEHGKLPTDPRHRAEIRR 1930

BLAST of CSPI01G21320 vs. NCBI nr
Match: XP_031735972.1 (uncharacterized protein LOC116401693 [Cucumis sativus])

HSP 1 Score: 1112.1 bits (2875), Expect = 0.0e+00
Identity = 606/897 (67.56%), Positives = 650/897 (72.46%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GEGETSCH+ITIIE S+  THE+DA
Sbjct: 1263 MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEGSEQGEGETSCHHITIIEESETGTHEEDA 1322

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            ++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSNEE DKYMSLLTEYRDIFAWS
Sbjct: 1323 ENAPQSLEDGGQSTVDELKEVNLGTIEEPRPTFISASLSNEEVDKYMSLLTEYRDIFAWS 1382

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPT
Sbjct: 1383 YKEMPGLDPKVAVHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEAGFIREVKYPT 1442

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGY
Sbjct: 1443 WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPITEIMVDATTGHEALSFMDGSSGY 1502

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKVMPFGLKNVGATYQ  + KVFDDML +        
Sbjct: 1503 NQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHRYVECYVDD 1562

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIEIDQSK+
Sbjct: 1563 LVVKTKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEIDQSKI 1622

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKM +PKSLHDLRSLQGRLAYI+RFISN+A                           
Sbjct: 1623 DAIQKMSRPKSLHDLRSLQGRLAYIRRFISNLAGRCQPFQKLMRKGENFVWDEACQNAFD 1682

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEV
Sbjct: 1683 SIKKYLLTPPVLGAPVPDKPLILYIAAQERSLGALLAQEEVKGKERSLYYLSRTLIGAEV 1742

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ PIKYVL RPIIS RLAKW VLLQQY
Sbjct: 1743 NYSPIEKMCLALFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWAVLLQQY 1802

Query: 541  DIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Sbjct: 1803 DIVYIPQKAIKGQALADFLADHPIPSDWKLCDDLPDDEVFFTEVMEPWTMYFDGAARRSG 1862

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLII
Sbjct: 1863 AGAGIVLISPEKHMLPYSFALSELCSNNVAEYQALIIGLQIALEIGVSFIEVYGDSKLII 1922

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVENKRAD + NLATAL M DD 
Sbjct: 1923 NQLSLQYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTMPDDV 1982

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RR
Sbjct: 1983 TLNIPLCQRWIIPPVRPECQEVNMATSYLIDEEDWRQPIIEYLEHGKLPKDSRHKIEIRR 2042

BLAST of CSPI01G21320 vs. NCBI nr
Match: XP_031737039.1 (uncharacterized protein LOC116402129 [Cucumis sativus])

HSP 1 Score: 1110.9 bits (2872), Expect = 0.0e+00
Identity = 605/897 (67.45%), Positives = 650/897 (72.46%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GEGETSCH+ITIIE S+  THE+DA
Sbjct: 237  MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEGSEQGEGETSCHHITIIEESETGTHEEDA 296

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            ++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSNEE DKYMSLLTEYRDIFAWS
Sbjct: 297  ENAPQSLEDGGQSTVDELKEVNLGTIEEPRPTFISASLSNEEVDKYMSLLTEYRDIFAWS 356

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPT
Sbjct: 357  YKEMPGLDPKVAVHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEAGFIREVKYPT 416

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGY
Sbjct: 417  WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPITEIMVDATTGHEALSFMDGSSGY 476

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKVMPFGLKNVGATYQ  + KVFDDML +        
Sbjct: 477  NQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHRYVECYVDD 536

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIEIDQSK+
Sbjct: 537  LVVKTKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEIDQSKI 596

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKM +PKSLHDLRSLQGRLAYI+RFISN+A                           
Sbjct: 597  DAIQKMSRPKSLHDLRSLQGRLAYIRRFISNLAGRCQPFQKLMRKGENFVWDEACQNAFD 656

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEV
Sbjct: 657  SIKKYLLTPPVLGAPVPDKPLILYIAAQERSLGALLAQEEVKGKERSLYYLSRTLIGAEV 716

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ PIKYVL RPII+ RLAKW VLLQQY
Sbjct: 717  NYSPIEKMCLALFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIIAGRLAKWAVLLQQY 776

Query: 541  DIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Sbjct: 777  DIVYIPQKAIKGQALADFLADHPIPSDWKLCDDLPDDEVFFTEVMEPWTMYFDGAARRSG 836

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLII
Sbjct: 837  AGAGIVLISPEKHMLPYSFALSELCSNNVAEYQALIIGLQIALEIGVSFIEVYGDSKLII 896

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVENKRAD + NLATAL M DD 
Sbjct: 897  NQLSLQYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTMPDDV 956

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RR
Sbjct: 957  TLNIPLCQRWIIPPVRPECQEVNMATSYLIDEEDWRQPIIEYLEHGKLPKDSRHKIEIRR 1016

BLAST of CSPI01G21320 vs. NCBI nr
Match: XP_031737045.1 (uncharacterized protein LOC116402134 [Cucumis sativus])

HSP 1 Score: 1105.9 bits (2859), Expect = 0.0e+00
Identity = 603/897 (67.22%), Positives = 648/897 (72.24%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GE ETSCH+ITIIE S+  THE+DA
Sbjct: 638  MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEGSEQGECETSCHHITIIEESETGTHEEDA 697

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            ++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSNEE DKYMSLLTEYRDIFAWS
Sbjct: 698  ENAPQSLEDGGQSTVDELKEVNLGTIEEPRPTFISASLSNEEVDKYMSLLTEYRDIFAWS 757

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPT
Sbjct: 758  YKEMPGLDPKVAVHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEAGFIREVKYPT 817

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGY
Sbjct: 818  WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPITEIMVDATTGHEALSFMDGSSGY 877

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKVMPFGLKN GATYQ  + KVFDDML +        
Sbjct: 878  NQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHRYVECYVDD 937

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIEIDQSK+
Sbjct: 938  LVVKTKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEIDQSKI 997

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKM +PKSLHDLRSLQGRLAYI+RFISN+A                           
Sbjct: 998  DAIQKMSRPKSLHDLRSLQGRLAYIRRFISNLAGRCQPFQKLMRKGENFVWDEACQNAFD 1057

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEV
Sbjct: 1058 SIKKYLLTPPVLGAPVPDKPLILYIAAQERSLGALLAQEEVKGKERSLYYLSRTLIGAEV 1117

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ PIKYVL RPII+ RLAKW VLLQQY
Sbjct: 1118 NYSPIEKMCLALFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIIAGRLAKWAVLLQQY 1177

Query: 541  DIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Sbjct: 1178 DIVYIPQKAIKGQALADFLADHPIPSDWKLCDDLPDDEVFFTEVMEPWTMYFDGAARRSG 1237

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLII
Sbjct: 1238 AGAGIVLISPEKHMLPYSFALSELCSNNVAEYQALIIGLQIALEIGVSFIEVYGDSKLII 1297

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVENKRAD + NLATAL M DD 
Sbjct: 1298 NQLSLQYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTMPDDV 1357

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RR
Sbjct: 1358 TLNIPLCQRWIIPPVRPECQEVNMATSYLIDEEDWRQPIIEYLEHGKLPKDSRHKIEIRR 1417

BLAST of CSPI01G21320 vs. NCBI nr
Match: XP_031739134.1 (uncharacterized protein LOC116402863 [Cucumis sativus])

HSP 1 Score: 1105.9 bits (2859), Expect = 0.0e+00
Identity = 603/897 (67.22%), Positives = 648/897 (72.24%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GE ETSCH+ITIIE S+  THE+DA
Sbjct: 1263 MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEGSEQGECETSCHHITIIEESETGTHEEDA 1322

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            ++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSNEE DKYMSLLTEYRDIFAWS
Sbjct: 1323 ENAPQSLEDGGQSTVDELKEVNLGTIEEPRPTFISASLSNEEVDKYMSLLTEYRDIFAWS 1382

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPT
Sbjct: 1383 YKEMPGLDPKVAVHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEAGFIREVKYPT 1442

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGY
Sbjct: 1443 WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPITEIMVDATTGHEALSFMDGSSGY 1502

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKVMPFGLKN GATYQ  + KVFDDML +        
Sbjct: 1503 NQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHRYVECYVDD 1562

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIEIDQSK+
Sbjct: 1563 LVVKTKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEIDQSKI 1622

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKM +PKSLHDLRSLQGRLAYI+RFISN+A                           
Sbjct: 1623 DAIQKMSRPKSLHDLRSLQGRLAYIRRFISNLAGRCQPFQKLMRKGENFVWDEACQNAFD 1682

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEV
Sbjct: 1683 SIKKYLLTPPVLGAPVPDKPLILYIAAQERSLGALLAQEEVKGKERSLYYLSRTLIGAEV 1742

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ PIKYVL RPII+ RLAKW VLLQQY
Sbjct: 1743 NYSPIEKMCLALFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIIAGRLAKWAVLLQQY 1802

Query: 541  DIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Sbjct: 1803 DIVYIPQKAIKGQALADFLADHPIPSDWKLCDDLPDDEVFFTEVMEPWTMYFDGAARRSG 1862

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLII
Sbjct: 1863 AGAGIVLISPEKHMLPYSFALSELCSNNVAEYQALIIGLQIALEIGVSFIEVYGDSKLII 1922

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVENKRAD + NLATAL M DD 
Sbjct: 1923 NQLSLQYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTMPDDV 1982

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RR
Sbjct: 1983 TLNIPLCQRWIIPPVRPECQEVNMATSYLIDEEDWRQPIIEYLEHGKLPKDSRHKIEIRR 2042

BLAST of CSPI01G21320 vs. NCBI nr
Match: XP_031742032.1 (uncharacterized protein LOC116404025 [Cucumis sativus])

HSP 1 Score: 1105.9 bits (2859), Expect = 0.0e+00
Identity = 603/897 (67.22%), Positives = 648/897 (72.24%), Query Frame = 0

Query: 1    MKRKTFVTLNTSQGSLNVKRHNVIVTNPKKEEPEHGEGETSCHYITIIEGSKAETHEDDA 60
            MKRKTFVTLNTSQGSL VKRH+VI+TNP+KE  E GE ETSCH+ITIIE S+  THE+DA
Sbjct: 1263 MKRKTFVTLNTSQGSLKVKRHDVILTNPEKEGSEQGECETSCHHITIIEESETGTHEEDA 1322

Query: 61   KDVPQSLEDGGQSTVDELKEVNVGTIEEPCSTFISASLSNEEEDKYMSLLTEYRDIFAWS 120
            ++ PQSLEDGGQSTVDELKEVN+GTIEEP  TFISASLSNEE DKYMSLLTEYRDIFAWS
Sbjct: 1323 ENAPQSLEDGGQSTVDELKEVNLGTIEEPRPTFISASLSNEEVDKYMSLLTEYRDIFAWS 1382

Query: 121  YKEMSELDPKVAVHHLAIKPGYRSIKQAQRHFQVYLIPRIEVEVNKLIEEGFIREVNYPT 180
            YKEM  LDPKVAVHHLAIKPGYR IKQAQR F+  LIP+IEVEVNKLIE GFIREV YPT
Sbjct: 1383 YKEMPGLDPKVAVHHLAIKPGYRPIKQAQRRFRPELIPQIEVEVNKLIEAGFIREVKYPT 1442

Query: 181  WIENIVPVRKKNGQLRVCVEFCDLNNACPKGDFPLPIIEIMVDATTGYRALSFMDGSSGY 240
            WI NIVPVRKKNGQLRVCV+F DLNNACPK DFPLPI EIMVDATTG+ ALSFMDGSSGY
Sbjct: 1443 WIANIVPVRKKNGQLRVCVDFRDLNNACPKDDFPLPITEIMVDATTGHEALSFMDGSSGY 1502

Query: 241  N---LTLSDEEMIAFRTPKGIYCYKVMPFGLKNVGATYQCVVHKVFDDMLQK-------- 300
            N   + LSDEEM AFRTPKGIYCYKVMPFGLKN GATYQ  + KVFDDML +        
Sbjct: 1503 NQIRMALSDEEMTAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHRYVECYVDD 1562

Query: 301  -----------------------------------------KFLGFIVRHRGIEIDQSKL 360
                                                     KFLGFIVRHRGIEIDQSK+
Sbjct: 1563 LVVKTKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRHRGIEIDQSKI 1622

Query: 361  --IQKMPKPKSLHDLRSLQGRLAYIQRFISNMA--------------------------- 420
              IQKM +PKSLHDLRSLQGRLAYI+RFISN+A                           
Sbjct: 1623 DAIQKMSRPKSLHDLRSLQGRLAYIRRFISNLAGRCQPFQKLMRKGENFVWDEACQNAFD 1682

Query: 421  ----------VLRAPVPDKPLILYIAAQDGSLGALLVQEEEKGKERALYYLSRTLIRAEV 480
                      VL APVPDKPLILYIAAQ+ SLGALL QEE KGKER+LYYLSRTLI AEV
Sbjct: 1683 SIKKYLLTPPVLGAPVPDKPLILYIAAQERSLGALLAQEEVKGKERSLYYLSRTLIGAEV 1742

Query: 481  NYSPIEN-----------------------IGKSIPIKYVLFRPIISRRLAKWVVLLQQY 540
            NYSPIE                        + K+ PIKYVL RPII+ RLAKW VLLQQY
Sbjct: 1743 NYSPIEKMCLALFFAIDKLRHYMQAFTVHLVAKADPIKYVLSRPIIAGRLAKWAVLLQQY 1802

Query: 541  DIVNILQKTTQFLRIGSYV---------KTCQTMRFSS------REPWTMFFYGATLRSG 600
            DIV I QK  +   +  ++         K C  +           EPWTM+F GA  RSG
Sbjct: 1803 DIVYIPQKAIKGQALADFLADHPIPSDWKLCDDLPDDEVFFTEVMEPWTMYFDGAARRSG 1862

Query: 601  AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 660
            AGA IVLI+PEKHMLPYSFA  ELCSNNV EYQALIIGLQ+ALEI VSFIE+YGDSKLII
Sbjct: 1863 AGAGIVLISPEKHMLPYSFALSELCSNNVAEYQALIIGLQIALEIGVSFIEVYGDSKLII 1922

Query: 661  NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATALMMLDD- 720
            N LSLQYDVKHE+LK YF+YAR+LMEKF ++MLEHVPRVENKRAD + NLATAL M DD 
Sbjct: 1923 NQLSLQYDVKHEDLKPYFAYARQLMEKFDNVMLEHVPRVENKRADALANLATALTMPDDV 1982

Query: 721  --------------------EVNATTFHLIDEEDWRQPIIEYLEHRKLPKDSRHKTEVRR 748
                                EVN  T +LIDEEDWRQPIIEYLEH KLPKDSRHK E+RR
Sbjct: 1983 TLNIPLCQRWIIPPVRPECQEVNMATSYLIDEEDWRQPIIEYLEHGKLPKDSRHKIEIRR 2042

BLAST of CSPI01G21320 vs. TAIR 10
Match: AT3G01410.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 61.6 bits (148), Expect = 3.6e-09
Identity = 37/114 (32.46%), Positives = 58/114 (50.88%), Query Frame = 0

Query: 472 AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 531
           AGA  VL A +  +L Y    V   +NNV EY+AL++GL+ AL+     + + GDS L+ 
Sbjct: 170 AGAGAVLRASDNSVLFYLREGVGNATNNVAEYRALLLGLRSALDKGFKNVHVLGDSMLVC 229

Query: 532 NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATAL 586
             +   +   H  +      A++LM  F +  ++H+ R +N  AD   N A  L
Sbjct: 230 MQVQGAWKTNHPKMAELCKQAKELMNSFKTFDIKHIAREKNSEADKQANSAIFL 283

BLAST of CSPI01G21320 vs. TAIR 10
Match: AT3G01410.2 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 61.6 bits (148), Expect = 3.6e-09
Identity = 37/114 (32.46%), Positives = 58/114 (50.88%), Query Frame = 0

Query: 472 AGADIVLIAPEKHMLPYSFAFVELCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLII 531
           AGA  VL A +  +L Y    V   +NNV EY+AL++GL+ AL+     + + GDS L+ 
Sbjct: 170 AGAGAVLRASDNSVLFYLREGVGNATNNVAEYRALLLGLRSALDKGFKNVHVLGDSMLVC 229

Query: 532 NHLSLQYDVKHENLKTYFSYARKLMEKFVSMMLEHVPRVENKRADTMTNLATAL 586
             +   +   H  +      A++LM  F +  ++H+ R +N  AD   N A  L
Sbjct: 230 MQVQGAWKTNHPKMAELCKQAKELMNSFKTFDIKHIAREKNSEADKQANSAIFL 283

BLAST of CSPI01G21320 vs. TAIR 10
Match: AT1G24090.1 (RNase H family protein )

HSP 1 Score: 60.8 bits (146), Expect = 6.1e-09
Identity = 37/91 (40.66%), Positives = 49/91 (53.85%), Query Frame = 0

Query: 495 LCSNNVVEYQALIIGLQMALEIEVSFIEIYGDSKLIINHLSLQYDVKHENLKTYFSYARK 554
           + +NN  EY ALI+GL+ A+E     I++ GDSKL+   +  Q+ V HE L      A+ 
Sbjct: 255 IATNNAAEYHALILGLKYAIEKGYKNIKVKGDSKLVCMQIKGQWKVNHEVLAKLHKEAKL 314

Query: 555 LMEKFVSMMLEHVPRVENKRADTMTNLATAL 586
           L  K VS  + HV R  N  AD   NLA  L
Sbjct: 315 LCNKCVSFEISHVLRNLNADADEQANLAVRL 345
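
Each alignment above is preceded by a score line and an identity line. The following is a minimal Python sketch of how those two header lines could be parsed and how the identity percentage relates to the residue counts. The function name `parse_hsp_header` and the returned field names are hypothetical, chosen only for illustration; the regular expressions assume the exact header layout shown on this page.

```python
import re

# Matches e.g. "HSP 1 Score: 61.6 bits (148), Expect = 3.6e-09"
SCORE_RE = re.compile(r"Score:\s*([\d.]+)\s*bits\s*\((\d+)\),\s*Expect\s*=\s*(\S+)")
# Matches the leading part of e.g. "Identity = 37/114 (32.46%), ..."
IDENT_RE = re.compile(r"Identity\s*=\s*(\d+)/(\d+)")


def parse_hsp_header(score_line, identity_line):
    """Extract score, E-value and identity counts from one HSP header."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognized HSP header layout")
    identical, aligned = int(i.group(1)), int(i.group(2))
    return {
        "bits": float(s.group(1)),
        "raw_score": int(s.group(2)),
        "evalue": float(s.group(3)),
        "identical": identical,
        "aligned": aligned,
        # Percent identity = identical positions / alignment length
        "pct_identity": round(100.0 * identical / aligned, 2),
    }


if __name__ == "__main__":
    hsp = parse_hsp_header(
        "HSP 1 Score: 61.6 bits (148), Expect = 3.6e-09",
        "Identity = 37/114 (32.46%)",
    )
    print(hsp)
```

The percentage is recomputed from the counts rather than read from the parentheses, which gives a quick consistency check against the value printed in the header.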

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q99315	3.6e-14	26.89	Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
Q7LHG5	3.6e-14	26.89	Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...
P0CT41	8.8e-13	25.00	Transposon Tf2-12 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 24...
P0CT34	8.8e-13	25.00	Transposon Tf2-1 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...
P0CT35	8.8e-13	25.00	Transposon Tf2-2 polyprotein OS=Schizosaccharomyces pombe (strain 972 / ATCC 248...

Match Name	E-value	Identity	Description
A0A5D3DB95	4.6e-291	64.50	RNase H domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A5A7SPY4	4.6e-291	64.50	RNase H domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A5A7SPV8	1.8e-279	60.42	Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold9G00010 P...
A0A5A7TZU9	3.1e-271	56.52	Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold498G00940...
A0A5D3D1E5	3.1e-271	56.52	Ribonuclease H OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold306G00402...

Match Name	E-value	Identity	Description
XP_031735972.1	0.0e+00	67.56	uncharacterized protein LOC116401693 [Cucumis sativus]
XP_031737039.1	0.0e+00	67.45	uncharacterized protein LOC116402129 [Cucumis sativus]
XP_031737045.1	0.0e+00	67.22	uncharacterized protein LOC116402134 [Cucumis sativus]
XP_031739134.1	0.0e+00	67.22	uncharacterized protein LOC116402863 [Cucumis sativus]
XP_031742032.1	0.0e+00	67.22	uncharacterized protein LOC116404025 [Cucumis sativus]

Match Name	E-value	Identity	Description
AT3G01410.1	3.6e-09	32.46	Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT3G01410.2	3.6e-09	32.46	Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT1G24090.1	6.1e-09	40.66	RNase H family protein

InterPro
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR041588	Integrase zinc-binding domain	PFAM	PF17921	Integrase_H2C2	coord: 662..712; e-value: 1.3E-7; score: 31.6
None	No IPR available	GENE3D	1.10.340.70	-	coord: 620..712; e-value: 2.7E-10; score: 42.3
None	No IPR available	GENE3D	3.10.10.10	HIV Type 1 Reverse Transcriptase, subunit A, domain 1	coord: 120..270; e-value: 1.1E-31; score: 111.5
None	No IPR available	PANTHER	PTHR24559	TRANSPOSON TY3-I GAG-POL POLYPROTEIN	coord: 100..289
None	No IPR available	PANTHER	PTHR24559:SF341	RNA-DIRECTED DNA POLYMERASE HOMOLOG	coord: 100..289
None	No IPR available	CDD	cd09279	RNase_HI_like	coord: 459..582; e-value: 1.08706E-38; score: 137.989
None	No IPR available	CDD	cd01647	RT_LTR	coord: 171..295; e-value: 2.87928E-37; score: 135.801
IPR041577	Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain	PFAM	PF17919	RT_RNaseH_2	coord: 331..399; e-value: 1.1E-10; score: 41.4
IPR002156	Ribonuclease H domain	PFAM	PF13456	RVT_3	coord: 492..581; e-value: 2.9E-17; score: 62.6
IPR002156	Ribonuclease H domain	PROSITE	PS50879	RNASE_H	coord: 455..584; score: 12.479687
IPR043128	Reverse transcriptase/Diguanylate cyclase domain	GENE3D	3.30.70.270	-	coord: 305..355; e-value: 4.6E-5; score: 25.3
IPR036397	Ribonuclease H superfamily	GENE3D	3.30.420.10	-	coord: 460..595; e-value: 9.4E-25; score: 89.1
IPR043502	DNA/RNA polymerase superfamily	SUPERFAMILY	56672	DNA/RNA polymerases	coord: 131..429
IPR012337	Ribonuclease H-like superfamily	SUPERFAMILY	53098	Ribonuclease H-like	coord: 457..578
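
The `coord:` values in the InterPro annotations give the span of each domain hit on the protein. A minimal Python sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated on this page), shows how a few of the hits listed above could be ordered along the sequence and their lengths computed. The helper names `parse_coord` and `domain_length` are hypothetical.

```python
import re

# Matches the "coord: START..END" fragment of an alignment entry
COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) as integers from a 'coord: a..b' string."""
    m = COORD_RE.search(text)
    if m is None:
        raise ValueError("no 'coord: a..b' fragment found")
    return int(m.group(1)), int(m.group(2))


def domain_length(start, end):
    """Length of a hit, assuming 1-based inclusive coordinates."""
    return end - start + 1


# A few hits copied from the InterPro annotations above
hits = [
    ("PF17921 Integrase_H2C2", "coord: 662..712"),
    ("cd01647 RT_LTR", "coord: 171..295"),
    ("PF13456 RVT_3", "coord: 492..581"),
]

# Order the hits by start position along the protein
ordered = sorted(
    ((name,) + parse_coord(coord) for name, coord in hits),
    key=lambda hit: hit[1],
)

if __name__ == "__main__":
    for name, start, end in ordered:
        print(name, start, end, domain_length(start, end))
```

Ordered this way, the reverse transcriptase hit precedes the RNase H hit, which in turn precedes the integrase zinc-binding hit.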

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI01G21320.1	CSPI01G21320.1	mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category	Term Accession	Term Name
biological_process	GO:0006259	DNA metabolic process
molecular_function	GO:0003676	nucleic acid binding
molecular_function	GO:0004523	RNA-DNA hybrid ribonuclease activity