CSPI03G24620 (gene) Wild cucumber (PI 183967)

NameCSPI03G24620
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
LocationChr3 : 21711884 .. 21713827 (-)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCGCTTCGGATCGAGTCGAAGGTCACCACCCCAAGGATAGGGATTCTCTTCAGAGGCAAGCCACTGGCTCGACTCTTTCTTTCTCCCTTTCAGTTATTTAAGAGCTAAAAAAGCTCTTTCCCAACCAAAAAAGGCATAATGAACTACTAGAATACGACCCAAGCAAGCCAACCTATTGTCCACGAATGCAGTTCTTTCATCTCTTTCGGAATGAAAATAAGGAAGAGGAATTTAGTTACTCGATGCCCGTCATTCAAGGCGAGGCCAATGCTTGTCCCATCTGTAGCTTTTGAAGACACATAAGTTTTGTAGTTTAAGCAAAAGCTTAGTTCATTCACTCATTAATCTCAAAAATAATAAAAGATTCTCCATTCTCAATTCAATAAGGAATAAAAAGAAGAAAAGACGATAAAGGAGCTAGGTAATAAAAAGAAGAGAAGACGAGTCAAGAGGTTTTGATGATTGTAAAGACTATTACAAAGACCTGCAGAAACCTAAAAAGTACCAATGCTAAGAGAAAGAAGAGTATAGAATGAACGAGTCAATTCAATATGAACCTAGAAAAGTAGGCGTTTAAGATGAACCGAAAATCTTCAACAATATCCCTGGAGGGCATATCTGTGGAGAGGTAAGACGGAGGGTAGAGGTGGTGCTTAAGTTGCCTGGGATGAGGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTAAAGATCTTATGGTTGCTTCTAGTTAAATCTGGTAGTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGCTCTTGGAGATCGATGTTGGGGTGGGTCGATCTTGGTGTTTTAGGGCAATCTTGCGTAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGGTGGGCAATGGTAAGAAGTGTAGAGTGTGGTTGGATCCAAGGATTCAGGGTGGACCGATTATCCAGCAGTTTGGGGAGAGGGTGATTTATGATGCAGATAGTCGGCGGGATGCGAGGCTTGTGGATTTCATGGGTCGGGATGGAGATTGGAGGTGGTCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGAATTCAGGGAGTGAGGCCGAGTCCGAGTGTTGAGGATATGTGGGTCCGGGTGCCAGGTAGTCATGAGAGTTTTTCGATCGTCAGTGCGTGCGAGGCTATTCGTCCTCATAGTAGTAGGGTTGGCTGGTCGGGTTTACTGTGGGGTGGGGGAAATATTCCTAAGCACTCCTTCTATACTTGGTTGGCCATCAAGGATAGGTTGGGTACTAGAGATAGATTAAGTCGGTGGGATAGTTCGATTCCTTTATCGTGTATTCTTTGTGGAGGGAACTATGAGTCTCGTGACCATTTGTTTTTTCCTTGTCCTTTTGGGTGGGAGATTTGGTCTAGAATCCTTTTGTTTATGTCATCTTCTCACAGGATACGGTATTGGGGGGTTGAGTTATCTTAGATTTGCAATTAGGGTATTGGGAAGGGTGTGAGGAGAAAATTGTGGCGCCTTCTCTGGTGTGCTACAATTTATTTCATTTGGCAGGAGCGAAATCTTCGTCTTCATGGAGGTGCTGTTTGGGAGCCTATGGTTATTTTCCAGCTCATTCGGTCGTGTATTAAAGTCGTGCTGCTTCTTGGTCCAATTGAGTTCATGGTCTTATTTAGTGTGCTTTTCTTTTGCATGTCCTCGGGCTGTGGGGTTTTTTTTCTCTTTTTTGTCTTTGTTGTCTTTCTCTTTTCTTCTTGTCCTTTTATTGTTTGATACACTAGTTGGTTTCTTTTGGTTTTGGTTCTTGTCCCCGAGATGTGGGGTTGTTTTGGGTACTTATGGGTTGTTTTGTCTAGTTACTTGTTCTATGAGTGTTGTTCGCTCTTGTGCCTTGACCTCAGGCTGTGA

mRNA sequence

ATGGCGCTTCGGATCGAGTCGAAGGTCACCACCCCAAGGATAGGGATTCTCTTCAGAGGCAAGCCACTGGCTCGACTCTTTCTTTCTCCCTTTCATTCTTTCATCTCTTTCGGAATGAAAATAAGGAAGAGGAATTTAGTTACTCGATGCCCGTCATTCAAGGCGAGGCCAATGCTTGTCCCATCTGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTAAAGATCTTATGGTTGCTTCTAGTTAAATCTGGTAGTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGCTCTTGGAGATCGATGTTGGGGTGGGTCGATCTTGGTGTTTTAGGGCAATCTTGCGTAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGGTGGGCAATGGTAAGAAGTGTAGAGTGTGGTTGGATCCAAGGATTCAGGGTGGACCGATTATCCAGCAGTTTGGGGAGAGGGTGATTTATGATGCAGATAGTCGGCGGGATGCGAGGCTTGTGGATTTCATGGGTCGGGATGGAGATTGGAGGTGGTCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGAATTCAGGGAGTGAGGCCGAGTCCGAGTGTTGAGGATATGTGGGTCCGGGTGCCAGGTAGTCATGAGAGTTTTTCGATCGTCAGTGCGTGCGAGGCTATTCGTCCTCATAGTAGTAGGGTTGGCTGGTCGGGTTTACTGTGGGGTGGGGGAAATATTCCTAAGCACTCCTTCTATACTTGGTTGGCCATCAAGGATAGGTTGGGTACTAGAGATAGATTAAGTCGGTGGGATAGTTCGATTCCTTTATCGTGTATTCTTTGTGGAGGGAACTATGAGTCTCGTGACCATTTGTTTTTTCCTTGTCCTTTTGGGTGGGAGATTTGGTCTAGAATCCTTTTGTTTATGTCATCTTCTCACAGGATACGGTATTGGGGGGAGCGAAATCTTCGTCTTCATGGAGGTGCTGTTTGGGAGCCTATGGTTATTTTCCAGCTCATTCGGTCGTGTATTAAAGTCGTGCTGCTTCTTGGTCCAATTGAGTTCATGATGTGGGGTTGTTTTGGGTACTTATGGGTTGTTTTGTCTAGTTACTTGTTCTATGAGTGTTGTTCGCTCTTGTGCCTTGACCTCAGGCTGTGA

Coding sequence (CDS)

ATGGCGCTTCGGATCGAGTCGAAGGTCACCACCCCAAGGATAGGGATTCTCTTCAGAGGCAAGCCACTGGCTCGACTCTTTCTTTCTCCCTTTCATTCTTTCATCTCTTTCGGAATGAAAATAAGGAAGAGGAATTTAGTTACTCGATGCCCGTCATTCAAGGCGAGGCCAATGCTTGTCCCATCTGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTAAAGATCTTATGGTTGCTTCTAGTTAAATCTGGTAGTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGCTCTTGGAGATCGATGTTGGGGTGGGTCGATCTTGGTGTTTTAGGGCAATCTTGCGTAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGGTGGGCAATGGTAAGAAGTGTAGAGTGTGGTTGGATCCAAGGATTCAGGGTGGACCGATTATCCAGCAGTTTGGGGAGAGGGTGATTTATGATGCAGATAGTCGGCGGGATGCGAGGCTTGTGGATTTCATGGGTCGGGATGGAGATTGGAGGTGGTCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGAATTCAGGGAGTGAGGCCGAGTCCGAGTGTTGAGGATATGTGGGTCCGGGTGCCAGGTAGTCATGAGAGTTTTTCGATCGTCAGTGCGTGCGAGGCTATTCGTCCTCATAGTAGTAGGGTTGGCTGGTCGGGTTTACTGTGGGGTGGGGGAAATATTCCTAAGCACTCCTTCTATACTTGGTTGGCCATCAAGGATAGGTTGGGTACTAGAGATAGATTAAGTCGGTGGGATAGTTCGATTCCTTTATCGTGTATTCTTTGTGGAGGGAACTATGAGTCTCGTGACCATTTGTTTTTTCCTTGTCCTTTTGGGTGGGAGATTTGGTCTAGAATCCTTTTGTTTATGTCATCTTCTCACAGGATACGGTATTGGGGGGAGCGAAATCTTCGTCTTCATGGAGGTGCTGTTTGGGAGCCTATGGTTATTTTCCAGCTCATTCGGTCGTGTATTAAAGTCGTGCTGCTTCTTGGTCCAATTGAGTTCATGATGTGGGGTTGTTTTGGGTACTTATGGGTTGTTTTGTCTAGTTACTTGTTCTATGAGTGTTGTTCGCTCTTGTGCCTTGACCTCAGGCTGTGA
BLAST of CSPI03G24620 vs. TrEMBL
Match: A0A068F615_BRANA (Uncharacterized protein OS=Brassica napus PE=4 SV=1)

HSP 1 Score: 179.5 bits (454), Expect = 8.2e-42
Identity = 92/265 (34.72%), Postives = 142/265 (53.58%), Query Frame = 1

Query: 62  SVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGV 121
           SVC P + GGL ++  + WN+   LK++WLL    GSLWV+WV   ++   +   +    
Sbjct: 73  SVCTPKEAGGLGLKRLADWNVVLGLKLIWLLFAAGGSLWVSWVRRNLIGRENFWMLVPNR 132

Query: 122 GRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRD 181
             SW +R++ + R   +  +K +VG+G     W+D     GP+I+  GER         +
Sbjct: 133 RGSWIWRSLCKLRPKARPFIKCQVGSGITASFWMDDWTSLGPLIEVVGERGPVVTGLSIN 192

Query: 182 ARLVDFMGRDGDW---RWSLVSLDLMDIWDRIQGVRP--SPSVEDMWVRVPGSH---ESF 241
           AR+VD +  DG W   R    S  +M + D +   +P  +  V+D +V V   H   E+F
Sbjct: 193 ARVVDALTSDG-WCFERSRSRSPSIMLLRDSMPDAQPILNSEVDDTYVWVTSGHNGSETF 252

Query: 242 SIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSC 301
           S     + + P    V W  ++W  G IPKH+F +W+A +DR+ TRDRL RW   +P +C
Sbjct: 253 STSETWKRLFPCMLEVFWHEVIWFSGRIPKHAFLSWVAARDRMVTRDRLLRWGLLVPATC 312

Query: 302 ILCGGNYESRDHLFFPCPFGWEIWS 319
           +LC G+ E R HLFF C F  ++W+
Sbjct: 313 VLCVGHNEDRQHLFFDCNFSKQVWT 336

BLAST of CSPI03G24620 vs. TrEMBL
Match: A0A087GG31_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA7G055000 PE=4 SV=1)

HSP 1 Score: 159.5 bits (402), Expect = 8.8e-36
Identity = 85/275 (30.91%), Postives = 141/275 (51.27%), Query Frame = 1

Query: 64   CLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVGR 123
            CLP DEGGL +R    WN    LK++W+L +K+ SLWVAW  A+ LK  S          
Sbjct: 1060 CLPKDEGGLGLRSFRLWNKVFNLKLIWMLFIKTDSLWVAWNRAHRLKRISFWAAKPNNNS 1119

Query: 124  SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDAR 183
            SW ++ +L  +D+ K  ++ ++G+G+    W D     G +I   G            A 
Sbjct: 1120 SWIWKNLLSLKDLAKGFIRCQIGDGQVASFWFDQWNDVGCLIDYIGVDGPRLMGIPLHAP 1179

Query: 184  LVDFMGRDGDWRWSLVS--------LDLMDIWDRIQGVRPSPSVEDMWVRVPGSHES--F 243
            +   M ++   RW+ ++         +L+D    + G  P+P+ +D+++   G+ +S  F
Sbjct: 1180 VSAVMNKE---RWNFLTRAQRNPAVKNLLD--SVLSGPFPNPNDKDLFMWGLGTKDSQVF 1239

Query: 244  SIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSC 303
            S     + IRP + +V W+ ++W    IPKHSF+ W+A  +RL  + RL  W   + L+C
Sbjct: 1240 SSGKTWDWIRPSAGKVPWAKMVWFKHAIPKHSFHFWIANLNRLPVKQRLLSWGLVVDLNC 1299

Query: 304  ILCGGNYESRDHLFFPCPFGWEIWSRILLFMSSSH 329
             LCG   E+R+HLF  C +   IW +++  + + H
Sbjct: 1300 CLCGSALETRNHLFLHCDYSLVIWGKLMHRLCADH 1329

BLAST of CSPI03G24620 vs. TrEMBL
Match: Q9T0D8_ARATH (Putative uncharacterized protein AT4g11710 OS=Arabidopsis thaliana GN=At4g11710 PE=4 SV=1)

HSP 1 Score: 157.5 bits (397), Expect = 3.3e-35
Identity = 89/261 (34.10%), Postives = 131/261 (50.19%), Query Frame = 1

Query: 63  VCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEI--DVG 122
           VC P +EGGL +R     N    LK++W ++  + SLWV W+++ +LK  S   +  +  
Sbjct: 114 VCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVSFWAVRENTS 173

Query: 123 VGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRR 182
           +G SW +R IL+ RDI +   K+E+ NG +   W D     G +I   G+R   D    +
Sbjct: 174 LG-SWMWRKILKFRDIARTLCKVEINNGARTSFWYDDWSDLGRLIDSAGDRGAIDLGINK 233

Query: 183 DARLVDFMGRDGDWRWSLVSLDLMDIWDRI---QGVRPSPSVEDMWVRVPGSHES-FSIV 242
            A +V+  G     R     L+ ++  +R+      R       +W        S FS  
Sbjct: 234 HATVVEAWGNRRRRRHRTNFLNRVE--ERLILSWNSRNQAEDRALWKGKENRFRSIFSTK 293

Query: 243 SACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILC 302
                IR  S++V W   +W    IPKH+F  WLA+ +RL T DR++ W+  +  +CILC
Sbjct: 294 DTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVDATCILC 353

Query: 303 GGNYESRDHLFFPCPFGWEIW 318
               ESRDHLFF CPF  EIW
Sbjct: 354 NKALESRDHLFFSCPFATEIW 371

BLAST of CSPI03G24620 vs. TrEMBL
Match: Q9FL83_ARATH (Non-LTR retroelement reverse transcriptase-like protein OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 154.8 bits (390), Expect = 2.2e-34
Identity = 87/278 (31.29%), Postives = 134/278 (48.20%), Query Frame = 1

Query: 63   VCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVG 122
            VC P DEGGL +R     N    LK++W ++  S SLWV WV+ ++L+  S  E+   V 
Sbjct: 865  VCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASFWEVKQTVS 924

Query: 123  R-SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRD 182
            + SW ++ +L+ R++ K   K+EVGNGK+   W D     G ++++ G+R + D    R 
Sbjct: 925  QGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGLIDLGISRR 984

Query: 183  ARLVDFMGRDGDWR-----WSLVSLDLMDIWDRIQGVRPSPSVEDMW-VRVPGSHESFSI 242
              + +        R     ++++   L   WD     R     + +W  +      +FS 
Sbjct: 985  MTVEEAWTNRRQRRHRNDVYNVIEDALKKSWD----TRTETEDKVLWRGKSDVFRTTFST 1044

Query: 243  VSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCIL 302
                   R  S+RV W  ++W     PK+SF +WLA   RL T DR+  W + I   CI 
Sbjct: 1045 RDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWANGIATDCIF 1104

Query: 303  CGGNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYW 334
            C G  E+RDHLFF C F   IW  +   +  +    +W
Sbjct: 1105 CQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHW 1138

BLAST of CSPI03G24620 vs. TrEMBL
Match: Q9FYJ4_ARATH (F17F8.5 OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 149.1 bits (375), Expect = 1.2e-32
Identity = 83/263 (31.56%), Postives = 131/263 (49.81%), Query Frame = 1

Query: 63  VCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVG 122
           VC P  EGGL +R+    N  S LK++W ++  S SLW  WV  Y+++ +S+  +     
Sbjct: 512 VCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSIWSLKQSTS 571

Query: 123 R-SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRD 182
             SW +R IL+ RD+ K+  ++EVGNG+    W D     G +I   G++   D    R+
Sbjct: 572 MGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDLGIPRE 631

Query: 183 ARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGS--HESFSIVSAC 242
           A + D   R    R     L+ ++     Q +  S + + +  R         FS     
Sbjct: 632 ASVADAWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKPHFSTRDTW 691

Query: 243 EAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLS--CILCG 302
             I+  SS V W   +W     PK++  TWLAI +RL T DR+ +W+SS  +S  C+LC 
Sbjct: 692 HLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSVSGNCVLCT 751

Query: 303 GNYESRDHLFFPCPFGWEIWSRI 321
            N ++ +HLFF C +   +W+ +
Sbjct: 752 NNSKTLEHLFFSCSYASTVWAAL 774

BLAST of CSPI03G24620 vs. TAIR10
Match: AT1G60720.1 (AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 120.6 bits (301), Expect = 2.3e-27
Identity = 86/274 (31.39%), Postives = 129/274 (47.08%), Query Frame = 1

Query: 130 ILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRR---DARLVD 189
           +L  R + +  VK  +GNG+    W D     GP+I+  G+   Y + S R   +AR+V+
Sbjct: 2   LLFLRPLAEQFVKCNLGNGRIAHFWHDSWTSLGPLIKVMGD---YGSRSLRIPLNARVVE 61

Query: 190 FMGRDGDWRWSLV-SLDLMDIWDRIQGVR-PSPS-VEDMWVRVPGSH--ESFSIVSACEA 249
            +G +G W+  L  S     I D I  +  PSP+ +ED +  V G    + FS     +A
Sbjct: 62  ALGVNG-WKLPLSRSAPAQAIHDHISTITTPSPATIEDSFDWVVGGVVCQGFSSARTWDA 121

Query: 250 IRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGGNYE 309
           IRP +  + W+  +W  G +PKH+F  W++  DRL TR RL+ W       C LC    E
Sbjct: 122 IRPRAPELDWAKAVWFKGAVPKHAFNMWISQLDRLPTRQRLASWGHIQSFDCCLCTIETE 181

Query: 310 SRDHLFFPCPFGWEIW----SRI--------------------------LLFMSSSHRIR 364
           SRDHL F C F  ++W    SR+                          LL   S+H I 
Sbjct: 182 SRDHLLFSCEFAAQVWRLAFSRLCPRQRLFCSWAELLSWMRSSSSSAPSLLRKVSAHAII 241

BLAST of CSPI03G24620 vs. TAIR10
Match: AT5G16486.1 (AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 114.0 bits (284), Expect = 2.1e-25
Identity = 68/204 (33.33%), Postives = 98/204 (48.04%), Query Frame = 1

Query: 124 SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDAR 183
           SW +++I + R + +  V  +VG+G  C  W +     GP+I   G+     +   R+A 
Sbjct: 8   SWIWKSICKLRPMAREFVVCKVGSGITCNFWSENWTNLGPLIHLTGDLGPRVSGLPRNAS 67

Query: 184 LVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWV---------RVPGSHESFS 243
           + D + RDG W W   S     I   ++   P  SV ++ V         +V GS  S  
Sbjct: 68  VADAL-RDGVW-WINGSRSRNPIIQLLKNCLPLSSVVNLQVDAEDDLFMWKVGGSEASVG 127

Query: 244 IVSACEAIR--PHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLS 303
             SA   I   P   +V W   +W  G IPKH+F +W+ I+ RL TRD+L  W   +P  
Sbjct: 128 FSSAATWIHLNPVGEKVDWHKAIWFKGRIPKHAFISWVNIRHRLPTRDKLLSWGLHVPSL 187

Query: 304 CILCGGNYESRDHLFFPCPFGWEI 317
           C+LC    E+R HLFF C F  EI
Sbjct: 188 CLLCNAFDETRQHLFFDCVFAGEI 209

BLAST of CSPI03G24620 vs. TAIR10
Match: AT3G24255.1 (AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 112.8 bits (281), Expect = 4.8e-25
Identity = 98/340 (28.82%), Postives = 138/340 (40.59%), Query Frame = 1

Query: 25  RLFLSPFHSFISFGMKIRK------RNLVTRCPSF---------KARPMLVPSVCLPFDE 84
           +L  S  HS  +F M   +      + + + C SF         K   +    VC P DE
Sbjct: 68  QLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDE 127

Query: 85  GGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVGRSWCFRA 144
           GGL IR     N               GS W        + G + L        SW ++ 
Sbjct: 128 GGLGIRSLKEAN--------------KGSFWS-------ISGNTTLG-------SWMWKK 187

Query: 145 ILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDARLVDFMG 204
           IL+ R +    VK ++ NG     W D   + G +I   G R   D      A + + + 
Sbjct: 188 ILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAEAVV 247

Query: 205 RDGDWRWSLVSLDLMDIWDRIQGVRPS--PSVEDMWVRVPGSHE----SFSIVSACEAIR 264
                R    +L  + I D I  VR     S ED  VR  G+ +     F+      A R
Sbjct: 248 NHRPRRHRHDTL--LRIEDVIAEVRHQGLTSGEDT-VRWKGNGDIFKPCFNTKETWAATR 307

Query: 265 PHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGGNYESR 324
               +V W   +W     PK+S   W+AIK+RL T DR+  W++    SC+LC    E+R
Sbjct: 308 EPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVETR 367

Query: 325 DHLFFPCPFGWEI-WSRILLFMSSSHRIRYWGERNLRLHG 343
           DHLFF CP+  E+ +     F  + H +  W ERN R HG
Sbjct: 368 DHLFFTCPYSAEVPFLTRYTFQLTLHSL--WKERNGRRHG 374

BLAST of CSPI03G24620 vs. TAIR10
Match: AT4G04650.1 (AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 99.4 bits (246), Expect = 5.5e-21
Identity = 58/191 (30.37%), Postives = 85/191 (44.50%), Query Frame = 1

Query: 134 RDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDARLVDFMGRDGD 193
           R + +  +  EVG+G   + W D  I  GP+I+  G           DA + D +     
Sbjct: 5   RVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGTSW 64

Query: 194 WRWSLVS-----LDLMDIWDRIQGVRPSPSVED-MW-VRVPGSHESFSIVSACEAIRPHS 253
           W  S  S     + L ++    QG+      +  +W   +      FS      A+ P S
Sbjct: 65  WIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDSFLWKTDLHAPSNRFSAPRTWSALHPQS 124

Query: 254 SRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGGNYESRDHL 313
             V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP  C+LC  + +SR HL
Sbjct: 125 HTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHL 184

Query: 314 FFPCPFGWEIW 318
           FF C F   +W
Sbjct: 185 FFECQFSGVVW 195

BLAST of CSPI03G24620 vs. TAIR10
Match: AT1G43730.1 (AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 97.4 bits (241), Expect = 2.1e-20
Identity = 58/197 (29.44%), Postives = 88/197 (44.67%), Query Frame = 1

Query: 124 SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDAR 183
           SW +R + + R++ +  V  +VG+G   + W D     GP                    
Sbjct: 50  SWIWRRLCKLREVARPFVVCDVGSGVTAKFWHDNWTGHGP-------------------- 109

Query: 184 LVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACE--- 243
           L+D +G  G     L  +D + + D           +D ++     H   +I S  +   
Sbjct: 110 LIDLVGPLGPQTVGL-PIDAVGLIDCQH--------DDSFIWKTDLHAPSNIFSTAKTSL 169

Query: 244 AIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGGNY 303
           A+ P +  V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP  C+LC  + 
Sbjct: 170 ALHPQNHIVPWYKAVWFKNHVPKHAFICWVVAWNRLHTRDRLRSWGLSIPAVCLLCNSHD 217

Query: 304 ESRDHLFFPCPFGWEIW 318
           ESR HLFF CPF   +W
Sbjct: 230 ESRAHLFFECPFCGAVW 217

BLAST of CSPI03G24620 vs. NCBI nr
Match: gi|659121154|ref|XP_008460525.1| (PREDICTED: LOW QUALITY PROTEIN: putative ribonuclease H protein At1g65750 [Cucumis melo])

HSP 1 Score: 392.5 bits (1007), Expect = 8.8e-106
Identity = 183/272 (67.28%), Postives = 209/272 (76.84%), Query Frame = 1

Query: 63  VCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVG 122
           VCLPF+EGGL IRDG SWNIASTLKILWL+L  SGSLWVAWVEAYILKGRSL ++D  VG
Sbjct: 227 VCLPFEEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDVDSRVG 286

Query: 123 RSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDA 182
           +SWC RAILRKR+ LK  V+M+VGNG   RVWLDP +  G I++Q GERV+YDA SRR A
Sbjct: 287 KSWCLRAILRKREKLKHLVRMKVGNGNSFRVWLDPWLPEGAILEQVGERVMYDAASRRKA 346

Query: 183 RLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACEAI 242
           RL DF+  DG+W W  VSL+L+D+W+R+Q V P  SV D WV VPG    FSI SA EA+
Sbjct: 347 RLSDFIDPDGEWLWPRVSLELIDLWERVQEVSPCLSVSDSWVWVPGRRGGFSIASAWEAV 406

Query: 243 RPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGGNYES 302
           RP   RV W GLLWGGGNI KH F  WLAIKDRLGT DRL RWDSS+P+ CILC G  ES
Sbjct: 407 RPRGGRVLWDGLLWGGGNIQKHFFCAWLAIKDRLGTIDRLHRWDSSVPMLCILCRGXVES 466

Query: 303 RDHLFFPCPF-GWEIWSRILLFMSSSHRIRYW 334
           RDHLFF C F G ++WSR+L  M SSHRI +W
Sbjct: 467 RDHLFFSCSFGGGDVWSRVLRIMGSSHRIGHW 498

BLAST of CSPI03G24620 vs. NCBI nr
Match: gi|659102432|ref|XP_008452126.1| (PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo])

HSP 1 Score: 353.6 bits (906), Expect = 4.6e-94
Identity = 177/310 (57.10%), Postives = 206/310 (66.45%), Query Frame = 1

Query: 40  KIRKRNLVTRCPSFKARPMLVPSVC------------LP---FDEGGLAIRDGSSWNIAS 99
           +IR R+   R  SF  R  LV SV             LP    +EGGL IRDG++W  AS
Sbjct: 607 RIRSRS--ARVLSFAGRLQLVCSVLCSLQVYWAIVFVLPAYVHNEGGLGIRDGTAWKFAS 666

Query: 100 TLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVGRSWCFRAILRKRDILKAHVKME 159
           TLKILWL+L  SGSLWVAWVEAY+LKGRSL ++D  VGRSWC RAILRK++ LK HV+M+
Sbjct: 667 TLKILWLMLTNSGSLWVAWVEAYVLKGRSLWDVDSRVGRSWCLRAILRKQEKLKQHVRMK 726

Query: 160 VGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDARLVDFMGRDGDWRWSLVSLDLM 219
           VGNG +CRVWLDP +Q G I+++ GERV+YDA SRR+A L +F+G DG+W W        
Sbjct: 727 VGNGNRCRVWLDPWLQRGAILERVGERVLYDAASRREASLSNFIGPDGEWLW-------- 786

Query: 220 DIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACEAIRPHSSRVGWSGLLWGGGNIPKH 279
                                       FSI SA EAIRP   RV W GLLWGGGNIPKH
Sbjct: 787 ------------------------PRGGFSIASAWEAIRPRGGRVLWDGLLWGGGNIPKH 846

Query: 280 SFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGGNYESRDHLFFPCPFGWEIWSRILLFM 335
           SF  WLAIKDRLGTRDR  RWDSS+PLSCILC G  ESRDHLFF CPFG ++WSR+L  M
Sbjct: 847 SFCAWLAIKDRLGTRDRFHRWDSSVPLSCILCEGGMESRDHLFFSCPFGGDVWSRVLRIM 882

BLAST of CSPI03G24620 vs. NCBI nr
Match: gi|659116542|ref|XP_008458124.1| (PREDICTED: putative ribonuclease H protein At1g65750 [Cucumis melo])

HSP 1 Score: 272.3 bits (695), Expect = 1.3e-69
Identity = 125/187 (66.84%), Postives = 149/187 (79.68%), Query Frame = 1

Query: 63  VCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEIDVGVG 122
           VCLPF+EGGL IRDG SWNIA+TLKILWL+L  SGSLWVAW+EAYILKG+SL ++D  VG
Sbjct: 76  VCLPFEEGGLGIRDGPSWNIANTLKILWLMLTNSGSLWVAWMEAYILKGKSLWDVDSRVG 135

Query: 123 RSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDA 182
           RSWCFRAILRKR+ LK HV+M+VGNG +CRVWLDP +QGG I++Q GERV+YDA SRR+A
Sbjct: 136 RSWCFRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQGGAILEQVGERVLYDAASRREA 195

Query: 183 RLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACEAI 242
           RL DF+  +G+W W  VSL+L+D+W+R+Q V P  SV D WV VPG    FSI SA EAI
Sbjct: 196 RLSDFIDPNGEWLWPRVSLELIDLWERVQEVSPCLSVSDSWVWVPGRQGGFSIASAWEAI 255

Query: 243 RPHSSRV 250
            P    V
Sbjct: 256 CPRGCEV 262

BLAST of CSPI03G24620 vs. NCBI nr
Match: gi|659114595|ref|XP_008457134.1| (PREDICTED: uncharacterized protein LOC103496880 [Cucumis melo])

HSP 1 Score: 190.3 bits (482), Expect = 6.7e-45
Identity = 107/234 (45.73%), Postives = 135/234 (57.69%), Query Frame = 1

Query: 78   SSWNIASTLKIL----WLLLVKS--GSLWVAWVEAYILKGRSLLEIDVGVGRSWCFRAIL 137
            +SW  + T ++L     L LV+S   SL V W   ++L      E+D  + RS+ +R   
Sbjct: 958  TSWIRSWTARVLSFFGTLQLVRSVLHSLQVYWASMFVLPAYVHNEVDK-ILRSYLWRGKE 1017

Query: 138  RKRDILKA---HVKMEVGNGKKC----RVWLDPRIQGGPIIQQFGERVIYDADSRRDARL 197
              R   K     V +    G+        W    I  GPI++Q GERV YDA SRR+ARL
Sbjct: 1018 EGRGGFKVAWVEVCLPFEEGELAIQDGPSW---NIARGPILEQVGERVFYDAASRREARL 1077

Query: 198  VDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACEAIRP 257
             DF+G DG+W+W  VS++L+D+WDR+Q V P  SV D WV +PG    FSI S  E IRP
Sbjct: 1078 SDFIGLDGEWQWPRVSMELIDLWDRVQAVSPCLSVRDRWVWIPGRQGGFSIASTWETIRP 1137

Query: 258  HSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGG 299
               RV W+ LLWGG NIPKHSF  WLAIKD LGTRDRL RWDSS+ +      G
Sbjct: 1138 RGGRVRWASLLWGGENIPKHSFCAWLAIKDMLGTRDRLHRWDSSVSMCAFFVRG 1187

BLAST of CSPI03G24620 vs. NCBI nr
Match: gi|923697088|ref|XP_013658511.1| (PREDICTED: uncharacterized protein LOC106363284 [Brassica napus])

HSP 1 Score: 182.2 bits (461), Expect = 1.8e-42
Identity = 103/309 (33.33%), Postives = 163/309 (52.75%), Query Frame = 1

Query: 63  VCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSGSLWVAWVEAYILKGRSLLEI-DVGV 122
           VC P++EGGL IR     +   +LK++W L  +S SLW AWV+ Y+L+G ++ ++ D G+
Sbjct: 366 VCCPYEEGGLEIRRVMEVSTVFSLKLIWRLCTQSPSLWGAWVKRYLLRGETIWDVKDTGL 425

Query: 123 GRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRD 182
           G SW +R +LR R + K +++M +GNG+  R W D     G +++  G          R+
Sbjct: 426 G-SWNWRKLLRYRSLAKQYIRMSIGNGQLVRFWTDIWFPKGRLLEITGAFGTQKMGIARN 485

Query: 183 ARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVED------MWVRVPGSH-ESFS 242
           AR+ D +  DG WR  L       I   +Q ++ +P   +       W   P S+ + F 
Sbjct: 486 ARICDVL-VDGVWR--LRDCRDQQIAALVQEIKSTPITLNHEADGVKWKLGPDSYGDCFI 545

Query: 243 IVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCI 302
                + IR    +V WS L+W    +P+++F TWLA +DRL T  R S+W S  P  C+
Sbjct: 546 SAEVLQQIRSRKGKVPWSKLVWFPQRVPRYAFITWLAFRDRLSTGHRTSKWGS--PQGCL 605

Query: 303 LCGGNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGERNLRLHGGAVWEPMVIFQL 362
            CG   E+RDHLFF CP+ + +W +++  +  +     WG   LRL  G      + F L
Sbjct: 606 HCGEPDETRDHLFFACPYTYTLWLKVVGNLFGAEPDPDWGITILRLQTGTY--DRITFIL 665

Query: 363 IRSCIKVVL 364
           +R  ++V +
Sbjct: 666 LRMVLQVTI 666

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A068F615_BRANA8.2e-4234.72Uncharacterized protein OS=Brassica napus PE=4 SV=1[more]
A0A087GG31_ARAAL8.8e-3630.91Uncharacterized protein OS=Arabis alpina GN=AALP_AA7G055000 PE=4 SV=1[more]
Q9T0D8_ARATH3.3e-3534.10Putative uncharacterized protein AT4g11710 OS=Arabidopsis thaliana GN=At4g11710 ... [more]
Q9FL83_ARATH2.2e-3431.29Non-LTR retroelement reverse transcriptase-like protein OS=Arabidopsis thaliana ... [more]
Q9FYJ4_ARATH1.2e-3231.56F17F8.5 OS=Arabidopsis thaliana PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G60720.12.3e-2731.39 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT5G16486.12.1e-2533.33 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT3G24255.14.8e-2528.82 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT4G04650.15.5e-2130.37 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT1G43730.12.1e-2029.44 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
Match NameE-valueIdentityDescription
gi|659121154|ref|XP_008460525.1|8.8e-10667.28PREDICTED: LOW QUALITY PROTEIN: putative ribonuclease H protein At1g65750 [Cucum... [more]
gi|659102432|ref|XP_008452126.1|4.6e-9457.10PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo][more]
gi|659116542|ref|XP_008458124.1|1.3e-6966.84PREDICTED: putative ribonuclease H protein At1g65750 [Cucumis melo][more]
gi|659114595|ref|XP_008457134.1|6.7e-4545.73PREDICTED: uncharacterized protein LOC103496880 [Cucumis melo][more]
gi|923697088|ref|XP_013658511.1|1.8e-4233.33PREDICTED: uncharacterized protein LOC106363284 [Brassica napus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR026960RVT-Znf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G24620.1CSPI03G24620.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 233..317
score: 1.1
NoneNo IPR availablePANTHERPTHR25952FAMILY NOT NAMEDcoord: 164..349
score: 2.1
NoneNo IPR availablePANTHERPTHR25952:SF191PROTEIN T12G3.2, ISOFORM Dcoord: 164..349
score: 2.1

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
CSPI03G24620Csa3G511980Cucumber (Chinese Long) v2cpicuB128
CSPI03G24620CsaV3_3G026370Cucumber (Chinese Long) v3cpicucB144
CSPI03G24620Cucsa.300790Cucumber (Gy14) v1cgycpiB451
CSPI03G24620CsGy3G023370Cucumber (Gy14) v2cgybcpiB110
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CSPI03G24620Cucumber (Chinese Long) v2cpicuB120
CSPI03G24620Melon (DHL92) v3.6.1cpimedB247