Cucsa.300790 (gene) Cucumber (Gy14) v1

NameCucsa.300790
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionRNA-directed DNA polymerase (reverse transcriptase)-related family protein
Locationscaffold02931 : 390505 .. 392538 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGACAACGACAACAACAACAATAATAATGAAAAAAGGAAGTGTCAATTTGAAACTAGAAGTTGAGAGTTAAAATTGTGTTAGATAAGTGTAGCATTGAAAGTATTAAAGGCATTGACGAAGGGACCATATGTATTTTTACTTCATGTGGTCCTCCATTTATGTGAAAAGAAGATAGAAAAAAGGTTAGTTTTGTAATATGAAAACTATTTTAGGTTTAAAAAATAGTTTTAGGTTAAAAAATAGCATTGTTTAATTAATAATAAATATCAAATAGCAAAGACTAAAAATCAGTATTATTGACTATCAAATAACAAAAACTTTGCGCCGTATGGCGCTTCGGATCGAGTCGAAGGTCACCACCCCAAGGATAGGGATTCTCTTCAGAGGCAAGCCACTGGCTCGACTCTTTCTTTCTCCCTTTCAGTTATTTAAGAGCTAAAAAAGCTCTTTCCCAACCAAAAAAGGCATAATGAACTACTAGAATACGACCCAAGCAAGCCAACCTATTGTCCACGAATGCAGTTCTTTCATCTCTTTCGGAATGAAAATAAGGAAGAGGAATTTAGTTACTCGATGCCCGTCATTCAAGGCGAGGCCAATGCTTGTCCCATCTGTAGCTTTTGAAGACACATAAGTTTTGTAGTTTAAGCAAAAGCTTAGTTCATTCACTCATTAATCTCAAAAATAATAAAAGATTCTCCATTCTCAATTCAATAAGGAATAAAAAGAAGAAAAGACGATAAAGGAGCTAGGTAATAAAAAGAAGAGAAGACGAGTCAAGAGGTTTTGATGATTGTAAAGACTATTACAAAGACCTGCAGAAACCTAAAAAGTACCAATGCTAAGAGAAAGAAGAGTATAGAATGAACGAGTCAATTCAATATGAACCTAGAAAAGTAGGCGTTTAAGATGAACCGAAAATCTTCAACAATATCCATGGAGGGCATATCTGTGGAGAGGTAAGACGGAGGGTAGAGGTGGTGCTTAAGTTGCCTGGGATGAGGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTAAAGATCTTATGGTTGCTTCTAGTTAAATCTTGTAGTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGTTGTGGGAGATCGATGTTGGGGTGGGTCGATCTTGGTGTTTTAGGGCAATCTTGCGTAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGGTGGGCAATGGTAAGAAGTGTAGAGTGTGGTTGGATCCAAGGATTCAGGGTGGACCGATTATCCAGCAGTTTGGGGAGAGGGTGATTTATGATGCAGATAGTCGGCGGGATGCGAGGCTTGTGGATTTCATGGGTCGGGATGGAGATTGGAGGTGGTCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGAATTCAGGGAGTGAGGCCGAGTCCGAGTGTTGAGGATATGTGGGTCCGGGTGCCAGGTAGTCATGAGAGTTTTTCGATCGTCAGTGCGTGCGAGGCTATTCGTCCTCATAGTAGTAGGGTTGGCTGGTCGGGTTTACTGTGGGGTGGGGGAAaTATTCCTAAGCACTCCTTCTATACTTGGTTGGCCATCAAGGATAGGTTGGGTACTAGAGATAGATTAAGTCGGTGGGATAGTTCGATTCCTTTATCGTGTATTCTTTGTGGACGGAACTATGAGTCTCGTGACCATTTGTTTTTtCCTTGTCCTTTTGGGTGGGAGATTTGGTCTAGAATCCTTTTGTTTATGTCATCTTCTCACAGGATACGGTATTGGGGGGTTGAGTTATCTTAGATTTGCAATTAGGGTATTGGGAAGGGTGTGAGGAGAAAATTGTGGCGCCTTCTCTGGTGTGCTACAATTTATTTCATTTGGCAGGAGCGAAATCTTCGTCTTCATGGAGGTGCTGTTTGGGAGCCTATGGTTATTTTCCAGCTCATTCGGTCGTGTATTAAAGTCGTGCTGCTTCTTGGTCCAATTGAGTTCATG

mRNA sequence

atgacgacaacgacaacaacaacaataataaTGAAAAAAGGAAttgcctgggatgaggtttgtcttccttttgatgaaggaggtcttgctattcgcgatggatcttcttggaatatagcaagcacgttaaagatcttatggttgcttctagttaaatcttgtagtttgtgggttgcttgggtggaagcttatatccttaaagggagatcgttgtgggagatcgatgttggggtgggtcgatcttggtgttttagggcaatcttgcgtaagcgggatatccttaaagctcatgttaagatggaggtgggcaatggtaagaagtgtagagtgtggttggatccaaggattcagggtggaccgattatccagcagtttggggagagggtgatttatgatgcagatagtcggcgggatgcgaggcttgtggatttcatgggtcgggatggagattggaggtggtcgcttgtttctttggatttgatggacatttgggatagaattcagggagtgaggccgagtccgagtgttgaggatatgtgggtccgggtgccaggtagtcatgagagtttttcgatcgtcagtgcgtgcgaggctattcgtcctcatagtagtagggttggctggtcgggtttactgtggggtgggggaaatattcctaagcactccttctatacttggttggccatcaaggataggttgggtactagagatagattaagtcggtgggatagttcgattcctttatcgtgtattctttgtggacggaactatgagtctcgtgaccatttgttttttccttgtccttttgggtgggagatttggtctagaatccttttgtttatgtcatcttctcacaggatacggtattgggggggtgtgaggagaaaattgtggcgccttctctggtgtgctacaatttatttcatttggcaggagcgaaatcttcgtcttcatggaggtgctgtttgggagcctatggttattttccagctcattcggtCGTGTATTAAAGTCGTGCTGCTTCTTGGTCCAATTGAGTTCATG

Coding sequence (CDS)

ATGACGACAACGACAACAACAACAATAATAATGAAAAAAGGAATTGCCTGGGATGAGGTTTGTCTTCCTTTTGATGAAGGAGGTCTTGCTATTCGCGATGGATCTTCTTGGAATATAGCAAGCACGTTAAAGATCTTATGGTTGCTTCTAGTTAAATCTTGTAGTTTGTGGGTTGCTTGGGTGGAAGCTTATATCCTTAAAGGGAGATCGTTGTGGGAGATCGATGTTGGGGTGGGTCGATCTTGGTGTTTTAGGGCAATCTTGCGTAAGCGGGATATCCTTAAAGCTCATGTTAAGATGGAGGTGGGCAATGGTAAGAAGTGTAGAGTGTGGTTGGATCCAAGGATTCAGGGTGGACCGATTATCCAGCAGTTTGGGGAGAGGGTGATTTATGATGCAGATAGTCGGCGGGATGCGAGGCTTGTGGATTTCATGGGTCGGGATGGAGATTGGAGGTGGTCGCTTGTTTCTTTGGATTTGATGGACATTTGGGATAGAATTCAGGGAGTGAGGCCGAGTCCGAGTGTTGAGGATATGTGGGTCCGGGTGCCAGGTAGTCATGAGAGTTTTTCGATCGTCAGTGCGTGCGAGGCTATTCGTCCTCATAGTAGTAGGGTTGGCTGGTCGGGTTTACTGTGGGGTGGGGGAAaTATTCCTAAGCACTCCTTCTATACTTGGTTGGCCATCAAGGATAGGTTGGGTACTAGAGATAGATTAAGTCGGTGGGATAGTTCGATTCCTTTATCGTGTATTCTTTGTGGACGGAACTATGAGTCTCGTGACCATTTGTTTTTtCCTTGTCCTTTTGGGTGGGAGATTTGGTCTAGAATCCTTTTGTTTATGTCATCTTCTCACAGGATACGGTATTGGGGGGGTGTGAGGAGAAAATTGTGGCGCCTTCTCTGGTGTGCTACAATTTATTTCATTTGGCAGGAGCGAAATCTTCGTCTTCATGGAGGTGCTGTTTGGGAGCCTATGGTTATTTTCCAGCTCATTCGGTCGTGTATTAAAGTCGTGCTGCTTCTTGGTCCAATTGAGTTCATG

Protein sequence

MTTTTTTTIIMKKGIAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEIDVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGGVRRKLWRLLWCATIYFIWQERNLRLHGGAVWEPMVIFQLIRSCIKVVLLLGPIEFM
BLAST of Cucsa.300790 vs. TrEMBL
Match: A0A068F615_BRANA (Uncharacterized protein OS=Brassica napus PE=4 SV=1)

HSP 1 Score: 194.9 bits (494), Expect = 1.6e-46
Identity = 111/345 (32.17%), Postives = 175/345 (50.72%), Query Frame = 1

Query: 15  IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
           IAW+ VC P + GGL ++  + WN+   LK++WLL     SLWV+WV   ++   + W +
Sbjct: 69  IAWESVCTPKEAGGLGLKRLADWNVVLGLKLIWLLFAAGGSLWVSWVRRNLIGRENFWML 128

Query: 75  DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
                 SW +R++ + R   +  +K +VG+G     W+D     GP+I+  GER      
Sbjct: 129 VPNRRGSWIWRSLCKLRPKARPFIKCQVGSGITASFWMDDWTSLGPLIEVVGERGPVVTG 188

Query: 135 SRRDARLVDFMGRDGDW---RWSLVSLDLMDIWDRIQGVRP--SPSVEDMWVRVPGSH-- 194
              +AR+VD +  DG W   R    S  +M + D +   +P  +  V+D +V V   H  
Sbjct: 189 LSINARVVDALTSDG-WCFERSRSRSPSIMLLRDSMPDAQPILNSEVDDTYVWVTSGHNG 248

Query: 195 -ESFSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSI 254
            E+FS     + + P    V W  ++W  G IPKH+F +W+A +DR+ TRDRL RW   +
Sbjct: 249 SETFSTSETWKRLFPCMLEVFWHEVIWFSGRIPKHAFLSWVAARDRMVTRDRLLRWGLLV 308

Query: 255 PLSCILCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGGVR-----------R 314
           P +C+LC  + E R HLFF C F  ++W+     +     + +  G+R           +
Sbjct: 309 PATCVLCVGHNEDRQHLFFDCNFSKQVWTFFTSRLRLDPPVLFEDGLRWLKNPAREKNVK 368

Query: 315 KLWRLLWCATIYFIWQERNLRLHGGAVWEPMVIFQLIRSCIKVVL 341
            + RLL  A +Y IW+ERN R+H      P  I   ++  I++ L
Sbjct: 369 LIVRLLHQACLYIIWKERNSRIHTDEARSPEAIIAEVKQIIRLRL 412

BLAST of Cucsa.300790 vs. TrEMBL
Match: Q9FL83_ARATH (Non-LTR retroelement reverse transcriptase-like protein OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 170.2 bits (430), Expect = 4.3e-39
Identity = 104/327 (31.80%), Postives = 157/327 (48.01%), Query Frame = 1

Query: 12   KKGIAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSL 71
            K  I+W  VC P DEGGL +R     N    LK++W ++  S SLWV WV+ ++L+  S 
Sbjct: 857  KAKISWHMVCKPKDEGGLGLRSLKEANDVCCLKLVWKIVSHSNSLWVKWVDQHLLRNASF 916

Query: 72   WEIDVGVGR-SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVI 131
            WE+   V + SW ++ +L+ R++ K   K+EVGNGK+   W D     G ++++ G+R +
Sbjct: 917  WEVKQTVSQGSWIWKKLLKYREVAKTLSKVEVGNGKQTSFWYDNWSDLGQLLERTGDRGL 976

Query: 132  YDADSRRDARLVDFMGRDGDWR-----WSLVSLDLMDIWDRIQGVRPSPSVEDMW-VRVP 191
             D    R   + +        R     ++++   L   WD     R     + +W  +  
Sbjct: 977  IDLGISRRMTVEEAWTNRRQRRHRNDVYNVIEDALKKSWD----TRTETEDKVLWRGKSD 1036

Query: 192  GSHESFSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDS 251
                +FS        R  S+RV W  ++W     PK+SF +WLA   RL T DR+  W +
Sbjct: 1037 VFRTTFSTRDTWHHTRSTSARVPWHKVIWFSHATPKYSFCSWLAAHGRLPTGDRMINWAN 1096

Query: 252  SIPLSCILCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGGV---------RR 311
             I   CI C    E+RDHLFF C F   IW  +   +  +    +W  +          R
Sbjct: 1097 GIATDCIFCQGTLETRDHLFFTCSFTSVIWVDLARGIFKTQYTSHWQSIIEAITNSQHHR 1156

Query: 312  KLW---RLLWCATIYFIWQERNLRLHG 320
              W   R ++ ATIY +W+ERN R HG
Sbjct: 1157 VEWFLRRYVFQATIYIVWRERNGRRHG 1179

BLAST of Cucsa.300790 vs. TrEMBL
Match: M4DQ66_BRARP (Uncharacterized protein OS=Brassica rapa subsp. pekinensis PE=4 SV=1)

HSP 1 Score: 169.5 bits (428), Expect = 7.3e-39
Identity = 99/338 (29.29%), Postives = 150/338 (44.38%), Query Frame = 1

Query: 15   IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
            +AW++V  P   GGL +R+ + WN    +K+LWLLL +  S+W +W++  ++K  S WE+
Sbjct: 1153 VAWEKVATPRKNGGLGLRNLAIWNRTCIIKLLWLLLFRPESVWASWMQDNVIKDESFWEL 1212

Query: 75   DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
               +  +W F+ IL +R      + +  GNG+    WLDP    G +I   G +      
Sbjct: 1213 KPRLNHTWIFKRILEERQTALQWIMISPGNGRNVNFWLDPWTPFGQLISFIGHQGPRLTG 1272

Query: 135  SRRDARLVDFMGRDGDWRWSLVSLDLMD---IWDRIQGVRPSPSVEDMWVRVPGSHESFS 194
              R   + D +  +G W +       M+    +     +   PS    W+       +FS
Sbjct: 1273 ISRATNVAD-LWINGQWSFRHARSPQMEELLTYLTTVTLTDEPS-NAQWIMDGAPKHTFS 1332

Query: 195  IVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCI 254
              +   +   HS  V W  ++W    IPKH    W+ + +R  TRDRL  W       C+
Sbjct: 1333 SSAVYNSFFEHSPIVPWHPIIWIKKGIPKHKSLAWIMLLNRSPTRDRLLSWGLQTDPRCL 1392

Query: 255  LCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGGVRRKLWRLL---------- 314
            LC  + ESR+HL+F CPF   IWS     +        W  V   L  L           
Sbjct: 1393 LCNLSDESRNHLYFECPFSIAIWSYYAARLRIPQTSNSWADVTHSLLSLTGTKDHIYLSI 1452

Query: 315  --WCATIYFIWQERNLRLHGGAVWEPMVIFQLIRSCIK 338
              W AT+Y +W ERN RLH G      ++ + I   IK
Sbjct: 1453 LSWQATVYEVWWERNERLHRGKFRSVDMVVKKINGLIK 1488

BLAST of Cucsa.300790 vs. TrEMBL
Match: A0A087GG31_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA7G055000 PE=4 SV=1)

HSP 1 Score: 167.9 bits (424), Expect = 2.1e-38
Identity = 98/326 (30.06%), Postives = 164/326 (50.31%), Query Frame = 1

Query: 15   IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
            ++W   CLP DEGGL +R    WN    LK++W+L +K+ SLWVAW  A+ LK  S W  
Sbjct: 1054 VSWSACCLPKDEGGLGLRSFRLWNKVFNLKLIWMLFIKTDSLWVAWNRAHRLKRISFWAA 1113

Query: 75   DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
                  SW ++ +L  +D+ K  ++ ++G+G+    W D     G +I   G        
Sbjct: 1114 KPNNNSSWIWKNLLSLKDLAKGFIRCQIGDGQVASFWFDQWNDVGCLIDYIGVDGPRLMG 1173

Query: 135  SRRDARLVDFMGRDGDWRWSLVS--------LDLMDIWDRIQGVRPSPSVEDMWVRVPGS 194
                A +   M ++   RW+ ++         +L+D    + G  P+P+ +D+++   G+
Sbjct: 1174 IPLHAPVSAVMNKE---RWNFLTRAQRNPAVKNLLD--SVLSGPFPNPNDKDLFMWGLGT 1233

Query: 195  HES--FSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDS 254
             +S  FS     + IRP + +V W+ ++W    IPKHSF+ W+A  +RL  + RL  W  
Sbjct: 1234 KDSQVFSSGKTWDWIRPSAGKVPWAKMVWFKHAIPKHSFHFWIANLNRLPVKQRLLSWGL 1293

Query: 255  SIPLSCILCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSH-------RIRYW-----GG 314
             + L+C LCG   E+R+HLF  C +   IW +++  + + H        +  W       
Sbjct: 1294 VVDLNCCLCGSALETRNHLFLHCDYSLVIWGKLMHRLCADHVTFADWDSLMSWLSAKTPQ 1353

Query: 315  VRRKLWRLLWCATIYFIWQERNLRLH 319
            V  +L + +  + +Y +W+ERN RLH
Sbjct: 1354 VSSRLKKYVVHSLLYNLWKERNARLH 1374

BLAST of Cucsa.300790 vs. TrEMBL
Match: Q9FYJ4_ARATH (F17F8.5 OS=Arabidopsis thaliana PE=4 SV=1)

HSP 1 Score: 167.5 bits (423), Expect = 2.8e-38
Identity = 107/337 (31.75%), Postives = 163/337 (48.37%), Query Frame = 1

Query: 12  KKGIAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSL 71
           K  I+WD VC P  EGGL +R+    N  S LK++W ++  S SLW  WV  Y+++ +S+
Sbjct: 504 KAKISWDIVCKPKAEGGLGLRNLKEANDVSCLKLVWRIISNSNSLWTKWVAEYLIRKKSI 563

Query: 72  WEIDVGVGR-SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVI 131
           W +       SW +R IL+ RD+ K+  ++EVGNG+    W D     G +I   G++  
Sbjct: 564 WSLKQSTSMGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGT 623

Query: 132 YDADSRRDARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGS--HE 191
            D    R+A + D   R    R     L+ ++     Q +  S + + +  R        
Sbjct: 624 IDLGIPREASVADAWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVFKP 683

Query: 192 SFSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPL 251
            FS       I+  SS V W   +W     PK++  TWLAI +RL T DR+ +W+SS  +
Sbjct: 684 HFSTRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSGSV 743

Query: 252 S--CILCGRNYESRDHLFFPCPFGWEIWSRI---------------LLFMSSSHRIRYWG 311
           S  C+LC  N ++ +HLFF C +   +W+ +               LL   S+H   +  
Sbjct: 744 SGNCVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTH---FQD 803

Query: 312 GVRRKLWRLLWCATIYFIWQERNLRLHGGAVWEPMVI 329
            V   L R ++ ATIY +W+ERN R H  A   P  +
Sbjct: 804 RVEGFLTRYIFQATIYHVWRERNGRRHDAAPNTPATV 837

BLAST of Cucsa.300790 vs. TAIR10
Match: AT1G60720.1 (AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 119.0 bits (297), Expect = 5.8e-27
Identity = 87/274 (31.75%), Postives = 134/274 (48.91%), Query Frame = 1

Query: 87  ILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRR---DARLVD 146
           +L  R + +  VK  +GNG+    W D     GP+I+  G+   Y + S R   +AR+V+
Sbjct: 2   LLFLRPLAEQFVKCNLGNGRIAHFWHDSWTSLGPLIKVMGD---YGSRSLRIPLNARVVE 61

Query: 147 FMGRDGDWRWSLV-SLDLMDIWDRIQGVR-PSPS-VEDMWVRVPGSH--ESFSIVSACEA 206
            +G +G W+  L  S     I D I  +  PSP+ +ED +  V G    + FS     +A
Sbjct: 62  ALGVNG-WKLPLSRSAPAQAIHDHISTITTPSPATIEDSFDWVVGGVVCQGFSSARTWDA 121

Query: 207 IRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGRNYE 266
           IRP +  + W+  +W  G +PKH+F  W++  DRL TR RL+ W       C LC    E
Sbjct: 122 IRPRAPELDWAKAVWFKGAVPKHAFNMWISQLDRLPTRQRLASWGHIQSFDCCLCTIETE 181

Query: 267 SRDHLFFPCPFGWEIW--------SRILLFMSSSHRIRYW----GGVRRKLWRLLWCATI 326
           SRDHL F C F  ++W         R  LF S +  + +           L ++   A I
Sbjct: 182 SRDHLLFSCEFAAQVWRLAFSRLCPRQRLFCSWAELLSWMRSSSSSAPSLLRKVSAHAII 241

Query: 327 YFIWQERNLRLHGGAVWEPMVIFQLIRSCIKVVL 341
           Y IW++RN  LH      P++IF+++   I+ ++
Sbjct: 242 YNIWRQRNNVLHNNLRIAPIIIFKIVDREIRNII 271

BLAST of Cucsa.300790 vs. TAIR10
Match: AT1G43730.1 (AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 106.3 bits (264), Expect = 3.9e-23
Identity = 79/266 (29.70%), Postives = 117/266 (43.98%), Query Frame = 1

Query: 69  RSLWEIDVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGER 128
           R+ W ++     SW +R + + R++ +  V  +VG+G   + W D     GP        
Sbjct: 38  RNFWTLNSTTSDSWIWRRLCKLREVARPFVVCDVGSGVTAKFWHDNWTGHGP-------- 97

Query: 129 VIYDADSRRDARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHE 188
                       L+D +G  G     L  +D + + D           +D ++     H 
Sbjct: 98  ------------LIDLVGPLGPQTVGL-PIDAVGLIDCQH--------DDSFIWKTDLHA 157

Query: 189 SFSIVSACE---AIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSS 248
             +I S  +   A+ P +  V W   +W   ++PKH+F  W+   +RL TRDRL  W  S
Sbjct: 158 PSNIFSTAKTSLALHPQNHIVPWYKAVWFKNHVPKHAFICWVVAWNRLHTRDRLRSWGLS 217

Query: 249 IPLSCILCGRNYESRDHLFFPCPFGWEIW----SRILLFMSSSHR---IRYWGGVRRK-- 308
           IP  C+LC  + ESR HLFF CPF   +W     R  LF  +      I      R K  
Sbjct: 218 IPAVCLLCNSHDESRAHLFFECPFCGAVWRFFTGRANLFPPAQLMYCLIWLLNPSRDKNT 274

Query: 309 --LWRLLWCATIYFIWQERNLRLHGG 321
             + RL + A +Y IW+ERNL LH G
Sbjct: 278 TLIIRLAFQAYVYAIWRERNLCLHTG 274

BLAST of Cucsa.300790 vs. TAIR10
Match: AT5G16486.1 (AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 105.1 bits (261), Expect = 8.6e-23
Identity = 68/204 (33.33%), Postives = 98/204 (48.04%), Query Frame = 1

Query: 81  SWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDAR 140
           SW +++I + R + +  V  +VG+G  C  W +     GP+I   G+     +   R+A 
Sbjct: 8   SWIWKSICKLRPMAREFVVCKVGSGITCNFWSENWTNLGPLIHLTGDLGPRVSGLPRNAS 67

Query: 141 LVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWV---------RVPGSHESFS 200
           + D + RDG W W   S     I   ++   P  SV ++ V         +V GS  S  
Sbjct: 68  VADAL-RDGVW-WINGSRSRNPIIQLLKNCLPLSSVVNLQVDAEDDLFMWKVGGSEASVG 127

Query: 201 IVSACEAIR--PHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLS 260
             SA   I   P   +V W   +W  G IPKH+F +W+ I+ RL TRD+L  W   +P  
Sbjct: 128 FSSAATWIHLNPVGEKVDWHKAIWFKGRIPKHAFISWVNIRHRLPTRDKLLSWGLHVPSL 187

Query: 261 CILCGRNYESRDHLFFPCPFGWEI 274
           C+LC    E+R HLFF C F  EI
Sbjct: 188 CLLCNAFDETRQHLFFDCVFAGEI 209

BLAST of Cucsa.300790 vs. TAIR10
Match: AT4G04650.1 (AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 102.4 bits (254), Expect = 5.6e-22
Identity = 75/252 (29.76%), Postives = 110/252 (43.65%), Query Frame = 1

Query: 91  RDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDARLVDFMGRDGD 150
           R + +  +  EVG+G   + W D  I  GP+I+  G           DA + D +     
Sbjct: 5   RVVARPFIVCEVGSGVTAKFWHDNWIGLGPLIEVIGPLGPRTVGLPIDAVVRDALRGTSW 64

Query: 151 WRWSLVS-----LDLMDIWDRIQGVRPSPSVED-MW-VRVPGSHESFSIVSACEAIRPHS 210
           W  S  S     + L ++    QG+      +  +W   +      FS      A+ P S
Sbjct: 65  WIASSRSRNPIIVQLKNLLPEAQGLLDCQHDDSFLWKTDLHAPSNRFSAPRTWSALHPQS 124

Query: 211 SRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGRNYESRDHL 270
             V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP  C+LC  + +SR HL
Sbjct: 125 HTVPWHKAVWFKNHVPKHAFICWVVAWNRLHTRDRLQNWGLSIPAECLLCNAHDDSRAHL 184

Query: 271 FFPCPFGWEIWSRILLFMSSSHRIR---------YW--GGVRRK----LWRLLWCATIYF 321
           FF C F   +W     F ++S  +           W     R K    + RL + + +Y 
Sbjct: 185 FFECQFSGVVWR----FFTASTNLNPPAQLMDCLNWLLSPSREKNICLIIRLAFHSCVYA 244

BLAST of Cucsa.300790 vs. TAIR10
Match: AT2G02520.1 (AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein)

HSP 1 Score: 94.4 bits (233), Expect = 1.5e-19
Identity = 52/153 (33.99%), Postives = 75/153 (49.02%), Query Frame = 1

Query: 199 IRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGRNYE 258
           + P   RV W   +W  G IPKH+F  W+ ++ RL T+DR+  W    P  C+ C  + E
Sbjct: 34  LNPIGERVDWFKAIWFKGKIPKHAFIAWVNMRHRLHTKDRMISWGFIFPPLCLFCNTHDE 93

Query: 259 SRDHLFFPCPFGWEIWSRILLFMSSSH---RIRYWGGVR-----------RKLWRLLWCA 318
           +R HLFF C F  E+W   + F S  H    + +  G+R             + RL   A
Sbjct: 94  TRQHLFFDCEFAREVW---IYFTSRVHVFPPLLFEDGIRWLKNPCQDKNVTTILRLSHHA 153

Query: 319 TIYFIWQERNLRLHGGAVWEPMVIFQLIRSCIK 338
           ++Y IW+ERN RLH  A      +   I+S I+
Sbjct: 154 SVYTIWKERNARLHDSASRPAAALILEIKSVIR 183

BLAST of Cucsa.300790 vs. NCBI nr
Match: gi|659102432|ref|XP_008452126.1| (PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo])

HSP 1 Score: 404.1 bits (1037), Expect = 2.5e-109
Identity = 196/326 (60.12%), Postives = 228/326 (69.94%), Query Frame = 1

Query: 25  DEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEIDVGVGRSWCF 84
           +EGGL IRDG++W  ASTLKILWL+L  S SLWVAWVEAY+LKGRSLW++D  VGRSWC 
Sbjct: 648 NEGGLGIRDGTAWKFASTLKILWLMLTNSGSLWVAWVEAYVLKGRSLWDVDSRVGRSWCL 707

Query: 85  RAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDADSRRDARLVDF 144
           RAILRK++ LK HV+M+VGNG +CRVWLDP +Q G I+++ GERV+YDA SRR+A L +F
Sbjct: 708 RAILRKQEKLKQHVRMKVGNGNRCRVWLDPWLQRGAILERVGERVLYDAASRREASLSNF 767

Query: 145 MGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVSACEAIRPHSS 204
           +G DG+W W                                    FSI SA EAIRP   
Sbjct: 768 IGPDGEWLW--------------------------------PRGGFSIASAWEAIRPRGG 827

Query: 205 RVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCGRNYESRDHLF 264
           RV W GLLWGGGNIPKHSF  WLAIKDRLGTRDR  RWDSS+PLSCILC    ESRDHLF
Sbjct: 828 RVLWDGLLWGGGNIPKHSFCAWLAIKDRLGTRDRFHRWDSSVPLSCILCEGGMESRDHLF 887

Query: 265 FPCPFGWEIWSRILLFMSSSHRIRYWG-------------GVRRKLWRLLWCATIYFIWQ 324
           F CPFG ++WSR+L  M+SSHRI +WG             GVRRKLWR+LWCATIYFIW 
Sbjct: 888 FSCPFGGDVWSRVLRIMASSHRIGHWGVELSWICHQGIRKGVRRKLWRVLWCATIYFIWN 941

Query: 325 ERNLRLHGGAVWEPMVIFQLIRSCIK 338
           ERN RLHGG   +P+VIF LI + I+
Sbjct: 948 ERNHRLHGGQAPDPIVIFHLICTWIR 941

BLAST of Cucsa.300790 vs. NCBI nr
Match: gi|659121154|ref|XP_008460525.1| (PREDICTED: LOW QUALITY PROTEIN: putative ribonuclease H protein At1g65750 [Cucumis melo])

HSP 1 Score: 388.7 bits (997), Expect = 1.1e-104
Identity = 184/277 (66.43%), Postives = 212/277 (76.53%), Query Frame = 1

Query: 15  IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
           +AW +VCLPF+EGGL IRDG SWNIASTLKILWL+L  S SLWVAWVEAYILKGRSLW++
Sbjct: 222 VAWVDVCLPFEEGGLGIRDGPSWNIASTLKILWLMLTNSGSLWVAWVEAYILKGRSLWDV 281

Query: 75  DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
           D  VG+SWC RAILRKR+ LK  V+M+VGNG   RVWLDP +  G I++Q GERV+YDA 
Sbjct: 282 DSRVGKSWCLRAILRKREKLKHLVRMKVGNGNSFRVWLDPWLPEGAILEQVGERVMYDAA 341

Query: 135 SRRDARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVS 194
           SRR ARL DF+  DG+W W  VSL+L+D+W+R+Q V P  SV D WV VPG    FSI S
Sbjct: 342 SRRKARLSDFIDPDGEWLWPRVSLELIDLWERVQEVSPCLSVSDSWVWVPGRRGGFSIAS 401

Query: 195 ACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSIPLSCILCG 254
           A EA+RP   RV W GLLWGGGNI KH F  WLAIKDRLGT DRL RWDSS+P+ CILC 
Sbjct: 402 AWEAVRPRGGRVLWDGLLWGGGNIQKHFFCAWLAIKDRLGTIDRLHRWDSSVPMLCILCR 461

Query: 255 RNYESRDHLFFPCPF-GWEIWSRILLFMSSSHRIRYW 291
              ESRDHLFF C F G ++WSR+L  M SSHRI +W
Sbjct: 462 GXVESRDHLFFSCSFGGGDVWSRVLRIMGSSHRIGHW 498

BLAST of Cucsa.300790 vs. NCBI nr
Match: gi|659116542|ref|XP_008458124.1| (PREDICTED: putative ribonuclease H protein At1g65750 [Cucumis melo])

HSP 1 Score: 275.0 bits (702), Expect = 1.8e-70
Identity = 127/192 (66.15%), Postives = 153/192 (79.69%), Query Frame = 1

Query: 15  IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
           +AW +VCLPF+EGGL IRDG SWNIA+TLKILWL+L  S SLWVAW+EAYILKG+SLW++
Sbjct: 71  VAWVDVCLPFEEGGLGIRDGPSWNIANTLKILWLMLTNSGSLWVAWMEAYILKGKSLWDV 130

Query: 75  DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
           D  VGRSWCFRAILRKR+ LK HV+M+VGNG +CRVWLDP +QGG I++Q GERV+YDA 
Sbjct: 131 DSRVGRSWCFRAILRKREKLKHHVRMKVGNGNRCRVWLDPWLQGGAILEQVGERVLYDAA 190

Query: 135 SRRDARLVDFMGRDGDWRWSLVSLDLMDIWDRIQGVRPSPSVEDMWVRVPGSHESFSIVS 194
           SRR+ARL DF+  +G+W W  VSL+L+D+W+R+Q V P  SV D WV VPG    FSI S
Sbjct: 191 SRREARLSDFIDPNGEWLWPRVSLELIDLWERVQEVSPCLSVSDSWVWVPGRQGGFSIAS 250

Query: 195 ACEAIRPHSSRV 207
           A EAI P    V
Sbjct: 251 AWEAICPRGCEV 262

BLAST of Cucsa.300790 vs. NCBI nr
Match: gi|685297015|ref|XP_009139329.1| (PREDICTED: putative ribonuclease H protein At1g65750 [Brassica rapa])

HSP 1 Score: 201.4 bits (511), Expect = 2.5e-48
Identity = 118/346 (34.10%), Postives = 176/346 (50.87%), Query Frame = 1

Query: 15  IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
           +AW ++CLP +EGGL +R+ S WN    LK++WLL  KS SLWVAW+  + L+  S W  
Sbjct: 21  VAWHQICLPKEEGGLGLRNFSLWNKTLNLKLIWLLFSKSDSLWVAWMRNHYLRQGSFWNA 80

Query: 75  DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
            +    SW ++ +L  R + +  ++ +V +G+    W D  +  GP+I   G+       
Sbjct: 81  PIRTASSWIWKTMLSLRPLARRFLRCDVSDGQSSSFWFDHWLDIGPLIDVVGQDGPQPMG 140

Query: 135 SRRDARLVDFMGRDGDWRW---SLVSLDLMDIWDRIQGVRPSPSVEDM----WVRVPGSH 194
              ++R+ D +   G WR       +  L  + D +       SVE      W+ VPGS 
Sbjct: 141 IPIESRVSDAVTSRG-WRLPPSRTRNPSLAAVRDCLLRTPLPSSVEHSDCFEWI-VPGSA 200

Query: 195 ES-FSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSI 254
            + FS     + +RP   R  W   +W  G IPKH+F+ W+ I +RL  RDRL RW   +
Sbjct: 201 TTIFSSALTWDHLRPRMPRPPWHTGVWFKGCIPKHAFHFWVVILNRLPVRDRLVRWGLEV 260

Query: 255 PLSCILCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGGVRR----------- 314
              C+LC +N E+RDHLF  C +  E+W+ +   + +S     WG               
Sbjct: 261 SDKCLLCTQNIETRDHLFLSCDYSREVWNIVSRRLGASWLCDNWGAFMNWISDTASPPAT 320

Query: 315 KLWRLLWCATIYFIWQERNLRLHGGAVWEPMVIFQLIRSCIKVVLL 342
            L RL++ ATIY IW+ERN RLH G    P ++F+ I  CIK  +L
Sbjct: 321 LLKRLVFQATIYLIWRERNSRLHAGPSLLPSLLFRQIDRCIKDAIL 364

BLAST of Cucsa.300790 vs. NCBI nr
Match: gi|923628714|ref|XP_013749866.1| (PREDICTED: uncharacterized protein LOC106452330 isoform X1 [Brassica napus])

HSP 1 Score: 194.9 bits (494), Expect = 2.3e-46
Identity = 111/345 (32.17%), Postives = 175/345 (50.72%), Query Frame = 1

Query: 15  IAWDEVCLPFDEGGLAIRDGSSWNIASTLKILWLLLVKSCSLWVAWVEAYILKGRSLWEI 74
           IAW+ VC P + GGL ++  + WN+   LK++WLL     SLWV+WV   ++   + W +
Sbjct: 347 IAWESVCTPKEAGGLGLKRLADWNVVLGLKLIWLLFAAGGSLWVSWVRRNLIGRENFWML 406

Query: 75  DVGVGRSWCFRAILRKRDILKAHVKMEVGNGKKCRVWLDPRIQGGPIIQQFGERVIYDAD 134
                 SW +R++ + R   +  +K +VG+G     W+D     GP+I+  GER      
Sbjct: 407 VPNRRGSWIWRSLCKLRPKARPFIKCQVGSGITASFWMDDWTSLGPLIEVVGERGPVVTG 466

Query: 135 SRRDARLVDFMGRDGDW---RWSLVSLDLMDIWDRIQGVRP--SPSVEDMWVRVPGSH-- 194
              +AR+VD +  DG W   R    S  +M + D +   +P  +  V+D +V V   H  
Sbjct: 467 LSINARVVDALTSDG-WCFERSRSRSPSIMLLRDSMPDAQPILNSEVDDTYVWVTSGHNG 526

Query: 195 -ESFSIVSACEAIRPHSSRVGWSGLLWGGGNIPKHSFYTWLAIKDRLGTRDRLSRWDSSI 254
            E+FS     + + P    V W  ++W  G IPKH+F +W+A +DR+ TRDRL RW   +
Sbjct: 527 SETFSTSETWKRLFPCMLEVFWHEVIWFSGRIPKHAFLSWVAARDRMVTRDRLLRWGLLV 586

Query: 255 PLSCILCGRNYESRDHLFFPCPFGWEIWSRILLFMSSSHRIRYWGGVR-----------R 314
           P +C+LC  + E R HLFF C F  ++W+     +     + +  G+R           +
Sbjct: 587 PATCVLCVGHNEDRQHLFFDCNFSKQVWTFFTSRLRLDPPVLFEDGLRWLKNPAREKNVK 646

Query: 315 KLWRLLWCATIYFIWQERNLRLHGGAVWEPMVIFQLIRSCIKVVL 341
            + RLL  A +Y IW+ERN R+H      P  I   ++  I++ L
Sbjct: 647 LIVRLLHQACLYIIWKERNSRIHTDEARSPEAIIAEVKQIIRLRL 690

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A068F615_BRANA1.6e-4632.17Uncharacterized protein OS=Brassica napus PE=4 SV=1[more]
Q9FL83_ARATH4.3e-3931.80Non-LTR retroelement reverse transcriptase-like protein OS=Arabidopsis thaliana ... [more]
M4DQ66_BRARP7.3e-3929.29Uncharacterized protein OS=Brassica rapa subsp. pekinensis PE=4 SV=1[more]
A0A087GG31_ARAAL2.1e-3830.06Uncharacterized protein OS=Arabis alpina GN=AALP_AA7G055000 PE=4 SV=1[more]
Q9FYJ4_ARATH2.8e-3831.75F17F8.5 OS=Arabidopsis thaliana PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G60720.15.8e-2731.75 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT1G43730.13.9e-2329.70 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT5G16486.18.6e-2333.33 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT4G04650.15.6e-2229.76 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
AT2G02520.11.5e-1933.99 RNA-directed DNA polymerase (reverse transcriptase)-related family p... [more]
Match NameE-valueIdentityDescription
gi|659102432|ref|XP_008452126.1|2.5e-10960.12PREDICTED: uncharacterized protein LOC103493225 [Cucumis melo][more]
gi|659121154|ref|XP_008460525.1|1.1e-10466.43PREDICTED: LOW QUALITY PROTEIN: putative ribonuclease H protein At1g65750 [Cucum... [more]
gi|659116542|ref|XP_008458124.1|1.8e-7066.15PREDICTED: putative ribonuclease H protein At1g65750 [Cucumis melo][more]
gi|685297015|ref|XP_009139329.1|2.5e-4834.10PREDICTED: putative ribonuclease H protein At1g65750 [Brassica rapa][more]
gi|923628714|ref|XP_013749866.1|2.3e-4632.17PREDICTED: uncharacterized protein LOC106452330 isoform X1 [Brassica napus][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR026960RVT-Znf
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.300790.1Cucsa.300790.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 190..274
score: 3.1
NoneNo IPR availablePANTHERPTHR25952FAMILY NOT NAMEDcoord: 121..333
score: 7.9
NoneNo IPR availablePANTHERPTHR25952:SF191PROTEIN T12G3.2, ISOFORM Dcoord: 121..333
score: 7.9

The following gene(s) are orthologous to this gene:
GeneOrthologueOrganismBlock
Cucsa.300790CSPI03G24620Wild cucumber (PI 183967)cgycpiB451
The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
Cucsa.300790Cucumber (Chinese Long) v2cgycuB427
Cucsa.300790Cucumber (Chinese Long) v2cgycuB428
Cucsa.300790Cucumber (Chinese Long) v3cgycucB464