Cucsa.001350 (gene) Cucumber (Gy14) v1

Name: Cucsa.001350
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Retrotransposon protein, putative, Ty1-copia subclass
Location: scaffold00020 : 116022 .. 118120 (+)
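
The location string above packs scaffold, start, end and strand into one field. As a minimal sketch, the following Python splits it apart and derives the span length. The function name `parse_location` and the returned dictionary keys are hypothetical, and the length assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

# Pattern for the "scaffold : start .. end (strand)" layout used on this page.
LOCATION_RE = re.compile(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*")


def parse_location(text):
    """Split a location string into scaffold, start, end, strand and span length."""
    match = LOCATION_RE.fullmatch(text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    scaffold, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "scaffold": scaffold,
        "start": start,
        "end": end,
        "strand": strand,
        # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }


location = parse_location("scaffold00020 : 116022 .. 118120 (+)")
print(location["length"])  # 2099 under the inclusive-coordinate assumption
```
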
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGGATCATTACAAGGCGATTTGAGTTCCTCTGACCTCAACCATCCTTTTCTCTTTGAAGGAGAAAATTTTAAGAGGTGGAAACAAAAGATGCTGTTTTTtCTTACTTTGAAGAAGGTGGCTACCGCATGTACCTCTACAGAGCAAAAAATCTCAGACTCCAATCCATTTGAAGAGAAAGTCAAAACTCATATCGCGTGGACAGAGACCAATTTTATATGTAAGAATTTAATACTTAATGGTCTTACTAATGAGTTGTATGATTATTACAGCGTAATGTCTACCGCGAAAGAAGTTTGGGATGCGTTACAAAAGAAGTATGACATTGAGGAAGCTGGATCTAAGAAGTATGCAGAAAGTCGATACTTGCGTTATCAAATCACTAATGATAGATCAGTGGAAGGATCAATAGTATGTGATTTAAAAGATAACACATGAAATCATAAACGAAGATATGCCGCTTCACAATCAATTTCAAGTTGCGGTTATTATTGATAAATTACCTCCTCTGTGGAAGGATTTCAAAAATACTATAAGACACAAAACCAAGGAGTTCTCGCTGAAAAGTCTCATCACACAATTAAGGATTGAGGAAGAGGCGAGAAGACATGACCAAAAAGAAGAGGGGAACACAATTTCTAGAAAGAAGTCAAACGCAATGCTGAAACTAGATCTGAAGCCTAAAGGAAATAAAATAAAATGTGGATCCAACAAGCAAAGAAATTCGCAAAGACCCTAGTCCAGAAGTACAGTACAAATTATTTGCTACAATTGTAATAAGGCTTGGACACTTTAGCTAGAAATTGTAGAAACGTAAATTATCCTACTGCGTAGGCAAACCTGGTAGGAGATGAATATGTAGCTATGATTTCTGAAGTTAATTTCATTGGGGGATCAGAAGGTTGGTGGCTAGACACAGGTGCATCTCGCCATGTCAGTCATGACCTTAATTTATTTAGAAAGTATAATGAGATAAAGGATAAGAATATCCTTCTAGGAGATCATCACATGACTCAGGCGGCAGGCATTAAAAAAGTAGAACGGAAATTCACATCTGGCAAAATGCTTGTGTTAAAGGAAGTTCTTCACAGGACTGAAATTCGAAAGAATTTGATCTCTGGATATCTCCTCAACAAAGCTAGCTTCTTGTAAACTATAGGGTCATATTTATTTACTTTAACTAAGAACAATGTTTTGTGGGAAAGGGCTGCGTGACAGAAGGCATGTTTAAAGTAAATTTAGAAACGAATAAGAATTTAACTTTCGCTCACATGTTATCTTCTTTTAATGTTTGGCATGCTAGACTTTGTCATATTAATAAAAGACTAACTAGCAACATGAGTAGATTAAATCTCATACCTAAGTTATCTATGCATGAATTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACAAAAACCTCACATAAGTCTATAAATAGAGTAACAAAGCCTTTAGAATTAATACATTCTGACTTATGTGAATTTGATGGCACATTAACTAGGAACAGTAAAAGGTATGTAATAACCTTTATAGATGACTCTTCCGACTACACTTTTATTTATCTACTTAAAGATAAAAGTGATGCCTTTGACATGTTCAAGGTGTATGTAACTGAAATAGAGAATCAATTTAACAAAAGAATTAAGAGACTTCGTAGTGATAGAGGAACATAATATTATTTAGTTGCCTTCAATGAGTTTTATAACTCAAAAGGAATAATACATGAAAGTACTGCACCCTATTCTCCTGAAATGAATGGAAAAGCATAAAAGAAAAATAGAACACTAACGGAGTTAGTAGTTGCTATCTTACTTGAATCAGAAGCAACACCATCTTTGTGGGGTGAAATAATTAAGACTATTAATTATTTTCTTGATAGGATTCCTAAATCAAACAGTAAATCCTCGCCATGCAAAGTCCTTAAGAATAAAACACCAAACCTGTCTTATCTTAGAACTTGGGATTGTCTAGCCTATGTTAGAATACTTGAT
CCACAGAGAAGAAAATTAGCCAGTAGAGCTTACGAATTTGTTTTCATAGGATATGCTGAAAATAATAAGACCTATAAGTTCTATGACTTAGAAAACAAA

mRNA sequence

atggccggatcattacaaggcgatttgagttcctctgacctcaaccatccttttctctttgaaggagaaaattttaagaggtggaaacaaaagatgctgttttttcttactttgaagaaggtggctaccgcatgtacctctacagagcaaaaaatctcagactccaatccatttgaagagaaagtcaaaactcatatcgcgtggacagagaccaattttatatgtaagaatttaatacttaatggtcttactaatgagttgtatgattattacagcgtaatgtctaccgcgaaagaagtttgggatgcgttacaaaagaagtatgacattgaggaagctggatctaagaagtatgcagaaagtcgatacttgcgttatcaaatcactaatgatagatcagtggaaggatcaatagtatatatgccgcttcacaatcaatttcaagttgcggttattattgataaattacctcctctgtggaaggatttcaaaaatactataagacacaaaaccaaggagttctcgctgaaaagtctcatcacacaattaaggattgaggaagaggcgagaagacatgaccaaaaagaagaggggaacacaatttctagaaagaagtcaaacgcaatgctgaaactagatctgaagcctaaaggaaataaaataaaatgttggtggctagacacaggtgcatctcgccatgtcagtcatgaccttaatttatttagaaagtataatgagataaaggataagaatatccttctaggagatcatcacatgactcaggcggcaggcattaaaaaagtagaacggaaattcacatctggcaaaatgcttgtgttaaaggaagttcttcacaggactgaaattcgaaagaatttgatctctggatatctcctcaacaaagctagcttcttactttgtcatattaataaaagactaactagcaacatgagtagattaaatctcatacctaagttatctatgcatgaatttgagaaatgtgcatgttgtagtcaagctaagataacaaaaacctcacataagtctataaatagagtaacaaagcctttagaattaatacattctgacttatgtgaatttgatggcacattaactaggaacagtaaaaggtatgtaataacctttatagatgactcttccgactacacttttatttatctacttaaagataaaagtgatgcctttgacatgttcaaggtgtataaaaatagaacactaacggagttagtagttgctatcttacttgaatcagaagcaacaccatctttgtggggtgaaataattaagactattaattattttcttgataggattcctaaatcaaacagtaaatcctcgccatgcaaagtccttaagaataaaacaccaaacctgtcttatcttagaacttgggattgtctagcctatgttagaatacttgatccacagagaagaaaattagccagtagagcttacgaatttgttttcataggatatgctgaaaataataagacctataagttctatgacttagaaaacaaa

Coding sequence (CDS)

ATGGCCGGATCATTACAAGGCGATTTGAGTTCCTCTGACCTCAACCATCCTTTTCTCTTTGAAGGAGAAAATTTTAAGAGGTGGAAACAAAAGATGCTGTTTTTtCTTACTTTGAAGAAGGTGGCTACCGCATGTACCTCTACAGAGCAAAAAATCTCAGACTCCAATCCATTTGAAGAGAAAGTCAAAACTCATATCGCGTGGACAGAGACCAATTTTATATGTAAGAATTTAATACTTAATGGTCTTACTAATGAGTTGTATGATTATTACAGCGTAATGTCTACCGCGAAAGAAGTTTGGGATGCGTTACAAAAGAAGTATGACATTGAGGAAGCTGGATCTAAGAAGTATGCAGAAAGTCGATACTTGCGTTATCAAATCACTAATGATAGATCAGTGGAAGGATCAATAGTATATATGCCGCTTCACAATCAATTTCAAGTTGCGGTTATTATTGATAAATTACCTCCTCTGTGGAAGGATTTCAAAAATACTATAAGACACAAAACCAAGGAGTTCTCGCTGAAAAGTCTCATCACACAATTAAGGATTGAGGAAGAGGCGAGAAGACATGACCAAAAAGAAGAGGGGAACACAATTTCTAGAAAGAAGTCAAACGCAATGCTGAAACTAGATCTGAAGCCTAAAGGAAATAAAATAAAATGTTGGTGGCTAGACACAGGTGCATCTCGCCATGTCAGTCATGACCTTAATTTATTTAGAAAGTATAATGAGATAAAGGATAAGAATATCCTTCTAGGAGATCATCACATGACTCAGGCGGCAGGCATTAAAAAAGTAGAACGGAAATTCACATCTGGCAAAATGCTTGTGTTAAAGGAAGTTCTTCACAGGACTGAAATTCGAAAGAATTTGATCTCTGGATATCTCCTCAACAAAGCTAGCTTCTTACTTTGTCATATTAATAAAAGACTAACTAGCAACATGAGTAGATTAAATCTCATACCTAAGTTATCTATGCATGAATTTGAGAAATGTGCATGTTGTAGTCAAGCTAAGATAACAAAAACCTCACATAAGTCTATAAATAGAGTAACAAAGCCTTTAGAATTAATACATTCTGACTTATGTGAATTTGATGGCACATTAACTAGGAACAGTAAAAGGTATGTAATAACCTTTATAGATGACTCTTCCGACTACACTTTTATTTATCTACTTAAAGATAAAAGTGATGCCTTTGACATGTTCAAGGTGTATAAAAATAGAACACTAACGGAGTTAGTAGTTGCTATCTTACTTGAATCAGAAGCAACACCATCTTTGTGGGGTGAAATAATTAAGACTATTAATTATTTTCTTGATAGGATTCCTAAATCAAACAGTAAATCCTCGCCATGCAAAGTCCTTAAGAATAAAACACCAAACCTGTCTTATCTTAGAACTTGGGATTGTCTAGCCTATGTTAGAATACTTGATCCACAGAGAAGAAAATTAGCCAGTAGAGCTTACGAATTTGTTTTCATAGGATATGCTGAAAATAATAAGACCTATAAGTTCTATGACTTAGAAAACAAA

Protein sequence

MAGSLQGDLSSSDLNHPFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTETNFICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITNDRSVEGSIVYMPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSLITQLRIEEEARRHDQKEEGNTISRKKSNAMLKLDLKPKGNKIKCWWLDTGASRHVSHDLNLFRKYNEIKDKNILLGDHHMTQAAGIKKVERKFTSGKMLVLKEVLHRTEIRKNLISGYLLNKASFLLCHINKRLTSNMSRLNLIPKLSMHEFEKCACCSQAKITKTSHKSINRVTKPLELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYTFIYLLKDKSDAFDMFKVYKNRTLTELVVAILLESEATPSLWGEIIKTINYFLDRIPKSNSKSSPCKVLKNKTPNLSYLRTWDCLAYVRILDPQRRKLASRAYEFVFIGYAENNKTYKFYDLENK
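
The protein sequence should be the translation of the CDS. A minimal sketch of that check, assuming the standard genetic code: it translates the first 33 nucleotides of the CDS listed above and compares the result with the first 11 residues of the protein. The names `CODON_TABLE` and `translate` are illustrative only and not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()  # the listed sequence contains a lowercase base
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_prefix = "ATGGCCGGATCATTACAAGGCGATTTGAGTTCC"  # first 33 nt of the CDS above
protein_prefix = "MAGSLQGDLSS"                    # first 11 residues of the protein above
print(translate(cds_prefix) == protein_prefix)
```

Passing the full CDS string to the same function would give the complete translation for comparison with the protein sequence.
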
BLAST of Cucsa.001350 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 59.3 bits (142), Expect = 1.4e-07
Identity = 33/107 (30.84%), Positives = 57/107 (53.27%), Query Frame = 1

Query: 411 RTLTELVVAILLESEATPSLWGEIIKTINYFLDRIPKS---NSKSSPCKVLKNKTPNLSY 470
           RT+TE    ++  ++   S WGE + T  Y ++RIP     +S  +P ++  NK P L +
Sbjct: 590 RTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKH 649

Query: 471 LRTWDCLAYVRILDPQRRKLASRAYEFVFIGYAENNKTYKFYDLENK 515
           LR +    YV I + Q  K   ++++ +F+GY  N   +K +D  N+
Sbjct: 650 LRVFGATVYVHIKNKQ-GKFDDKSFKSIFVGYEPNG--FKLWDAVNE 693


HSP 2 Score: 58.5 bits (140), Expect = 2.4e-07
Identity = 33/96 (34.38%), Positives = 51/96 (53.12%), Query Frame = 1

Query: 311 KRLTSNMSRLNLIPKLSMHEFEKCACCSQAKITKTSHKSINRVTKPLELIHSDLCEFDGT 370
           K + S+ S LN + +LS    E C    QA++     K    + +PL ++HSD+C     
Sbjct: 436 KNMFSDQSLLNNL-ELSCEICEPCLNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITP 495

Query: 371 LTRNSKRYVITFIDDSSDYTFIYLLKDKSDAFDMFK 407
           +T + K Y + F+D  + Y   YL+K KSD F MF+
Sbjct: 496 VTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQ 530

BLAST of Cucsa.001350 vs. TrEMBL
Match: A5B994_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_023328 PE=4 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 3.0e-73
Identity = 164/434 (37.79%), Positives = 247/434 (56.91%), Query Frame = 1

Query: 17  PFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTETNFICK 76
           P  F G  FKRW+QKMLF++T   +A         + +    +EKV    AW  ++F+C+
Sbjct: 18  PEKFNGNEFKRWQQKMLFYVTTLNLARFLHEECPILEEGETNKEKVAAVDAWKHSDFLCR 77

Query: 77  NLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITNDRSVEG 136
           N +LNGL N LY+ Y  + TA+E+WD+L KKY  E+AG KK+   ++L +++ + ++V  
Sbjct: 78  NYVLNGLDNTLYNVYCSLKTAEELWDSLDKKYKTEDAGMKKFIVGKFLDFKMIDSKTVRS 137

Query: 137 SIVY------------MPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSLITQLR 196
            +              M L++ FQVA +I+KLPPLWKDFKN ++HK KE +L+ LI +LR
Sbjct: 138 QVQELQVILHEIHSEGMSLNDSFQVAAVIEKLPPLWKDFKNYLKHKRKEMNLEELIVRLR 197

Query: 197 IEEEARRHDQKEEGNTISRKKSNAMLKL---DLKPKGNKIKCWWLDTGASRHVSHDLNLF 256
           IEE+ R+ ++K  GN     K+N  L     +    GN  K WW+DTGA+RH+  +  +F
Sbjct: 198 IEEDNRKFEKK--GNNSMEAKANMNLSAVVSECNIVGN-TKEWWVDTGATRHICSNKWMF 257

Query: 257 RKYNEI-KDKNILLGDHHMTQAAGIKKVERKFTSGKMLVLKEVLHRTEIRKNLISGYLLN 316
             Y  + +++ + +G    ++  G  KV  K TSGK L L +VLH  +IRKNL+S  LL+
Sbjct: 258 STYKPVEQNEELFMGYSSSSKVEGRGKVILKMTSGKELTLNDVLHVPDIRKNLVSRSLLS 317

Query: 317 KASFLLCHINKR--LTSN---------------MSRLNLIPKLSMHEFEKCACCSQAKIT 376
           K  F L  ++ +  LT N               M+ + ++PK            S A + 
Sbjct: 318 KNGFKLVFVSDKFVLTKNEMFVGKGYLSDGLFKMNVMTVVPK----SINNNKIDSSAYLL 377

Query: 377 KTSH-KSINRVTKPLELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYTFIYLLKDKSDAF 417
           ++S+   + R T+PL+LIHSD+C+     TR  K+Y ITFIDD + Y ++YLLK K +A 
Sbjct: 378 ESSNIWHVERSTEPLDLIHSDICDLKFVQTRGGKKYFITFIDDCTRYCYVYLLKSKDEAI 437

BLAST of Cucsa.001350 vs. TrEMBL
Match: Q7Y017_ORYSJ (Putative polyprotein, 3'-partial (Fragment) OS=Oryza sativa subsp. japonica GN=OJ1112_G08.22 PE=4 SV=1)

HSP 1 Score: 223.0 bits (567), Expect = 8.3e-55
Identity = 169/571 (29.60%), Positives = 264/571 (46.23%), Query Frame = 1

Query: 12  SDLNHPFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTET 71
           +D   P  F G +FKRW+ ++  +LT  K     T   + +  +       K    + E 
Sbjct: 5   ADALRPDKFTGVHFKRWQIRVTLWLTAMKCFWVSTGKPEGVLTA-------KQQKQFEEA 64

Query: 72  NFICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITND 131
             +    IL+ L + L + Y  M+ AKE+WDAL  K+   +A +  Y   ++  Y++ ++
Sbjct: 65  TTLFVGCILSVLGDRLVEVYMHMTDAKELWDALNTKFGATDASNDLYIMEQFHDYKMADN 124

Query: 132 RSV------------EGSIVYMPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSL 191
           RSV            E  ++   L ++F    II KLPP W+ F   ++HK +E+S++ L
Sbjct: 125 RSVVEQAHEIQTMAKELELLKCVLPDKFVAGCIIAKLPPSWRSFGTALKHKRQEYSVEGL 184

Query: 192 ITQLRIEEEAR---------------------------RHDQKEEGNTISRKKSNAMLKL 251
           I  L +EE+AR                           ++  ++  N   +KK+N     
Sbjct: 185 IASLDVEEKAREKDAASKGDGGQSSANVVHKAQNKSKGKYKAQQTTNFKKQKKNNNNPNQ 244

Query: 252 D------------LKPKGNKIKCWWLDTGASRHVSHDLNLFRKYNEIK--DKNILLGDHH 311
           D            L  K N+   WW+DTGA+   S  +   +    +   D+N++ G   
Sbjct: 245 DERTCFVCGQVGHLARKFNQSTNWWVDTGAN-FTSGKIVQLKNVQHVPSIDRNLVSGSRL 304

Query: 312 MTQAAGIKKVERKFTSGKMLVLKEV-----------LHRTEIRK------NLISGYLLNK 371
                G K V   F S K++V K             L R  +        N I G + ++
Sbjct: 305 TRD--GFKLV---FESNKVVVSKHGYFIGKGYECGGLFRFSLSDFCNKSVNHICGSVDDE 364

Query: 372 ASFL---LCHINKRLTSNMSRLNLIPKLSMHEFEKCACCSQAKITKTSHKSIN-RVTKPL 431
           A+     LCHIN  L S +S + LIPK S+ +  KC  C Q+K  +  HK+   R   PL
Sbjct: 365 ANVWHSRLCHINFGLMSRLSSMCLIPKFSIVKGSKCHSCVQSKQPRKPHKAAEERNLAPL 424

Query: 432 ELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYTFIYLLKDKSDAFDMFKVYKNRTLTELV 491
           EL+HSDLCE +G LT+  KRY +T IDD++ + ++YLLK K +A D FK+YK     +L 
Sbjct: 425 ELLHSDLCEMNGVLTKGGKRYFMTLIDDATRFCYVYLLKTKDEALDYFKIYKAEVENQL- 484

Query: 492 VAILLESEATPSLWGEIIKTINYFLDRIPKSNSKSSPCKVLKNKTPNLSYLRTWDCLAYV 509
                            IK +     R+P  N   +P ++   + P+LSYLRTW CLA V
Sbjct: 485 --------------DRKIKRL-----RVPNRNKDKTPYEIWIGRKPSLSYLRTWGCLAKV 542

BLAST of Cucsa.001350 vs. TrEMBL
Match: M5X570_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb012171mg PE=4 SV=1)

HSP 1 Score: 218.0 bits (554), Expect = 2.7e-53
Identity = 115/207 (55.56%), Positives = 144/207 (69.57%), Query Frame = 1

Query: 13  DLNHPFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTETN 72
           D N P  FEG +FKRW+QKMLF+ T KK+A+ CTS +   SD NP  E+      WTE +
Sbjct: 11  DPNKPSKFEGLHFKRWRQKMLFYPTTKKLASVCTSDKPYASD-NPTPEQTWALQTWTEND 70

Query: 73  FICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITNDR 132
           F+CKN ILNGL+++LYDYYS   TAK++WDALQK Y+ EEAG+KK+A SRYL++Q+ +++
Sbjct: 71  FLCKNYILNGLSDDLYDYYSSYDTAKDLWDALQKNYNTEEAGAKKFAVSRYLKFQMIDEK 130

Query: 133 SVEGS------------IVYMPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSLI 192
           SVE              I  M L  QFQVAVIIDKLPP WKDFKN +     +FSL+SLI
Sbjct: 131 SVEAQSHELQKNAHEIIIEGMNLDEQFQVAVIIDKLPPNWKDFKNAL-----QFSLESLI 190

Query: 193 TQLRIEEEARRHDQKEEGNTISRKKSN 208
           T+LRIEEEAR+HD KEE   +S  K N
Sbjct: 191 TRLRIEEEARKHDMKEEVLLVSNNKKN 211

BLAST of Cucsa.001350 vs. TrEMBL
Match: A5CBN2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016964 PE=4 SV=1)

HSP 1 Score: 212.2 bits (539), Expect = 1.5e-51
Identity = 125/348 (35.92%), Positives = 193/348 (55.46%), Query Frame = 1

Query: 17  PFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTETNFICK 76
           P  F G  FKRW+QKMLF+LT   +A         + +    +EKV    AW  ++F+C+
Sbjct: 211 PEKFNGNEFKRWQQKMLFYLTTLNLARFLHEECPILEEGETNKEKVAVVDAWKHSDFLCR 270

Query: 77  NLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITNDRSV-- 136
           N +LNGL N LY+ Y  + TAKE+WD+L KKY  E+AG KK+   ++L +++ + ++V  
Sbjct: 271 NYMLNGLDNTLYNVYCSLKTAKELWDSLDKKYKTEDAGIKKFIVGKFLDFKMIDSKTVIS 330

Query: 137 ---EGSIVYMPLHNQ-------FQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSLITQLR 196
              E  ++   +H++       FQVA +I+KLPPLWKDFKN ++HK KE +L+ LI +LR
Sbjct: 331 QVQELQVILHEIHSEGMSFSDSFQVAAVIEKLPPLWKDFKNYLKHKRKEMNLEELIVRLR 390

Query: 197 IEEEARR----------------------HDQKEEGNTISRKKSNAMLKLDLK-----PK 256
           IEE+ R+                      + +++ G+    ++SN   K   K       
Sbjct: 391 IEEDNRKSEKKGNNSMEAKANVIEQGPKTNKKRKHGDQNQNQESNVAKKFKGKCYNCGKT 450

Query: 257 GNK------IKCWWLDTGASRHVSHDLNLFRKYNEIK-DKNILLGDHHMTQAAGIKKVER 316
           G+K       K WW+DTGA+RH+  +  +F  Y  ++ ++ + +G+   ++  G  KV  
Sbjct: 451 GHKSNDYRNTKEWWVDTGATRHICSNKWMFSTYKPVEQNEELFMGNSSSSKIEGRGKVIL 510

BLAST of Cucsa.001350 vs. TrEMBL
Match: A5CBN2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016964 PE=4 SV=1)

HSP 1 Score: 127.9 bits (320), Expect = 3.6e-26
Identity = 81/255 (31.76%), Positives = 123/255 (48.24%), Query Frame = 1

Query: 276 KMLVLKEVLHRTEIRKNLISGYLLNKASFL---LCHINKRLTSNMSRLNLIPKLSMHEFE 335
           KM V+  V       K   S YLL  ++     L H+N      +  L+ +PK ++    
Sbjct: 573 KMNVMTVVPKSINNNKIDSSAYLLKSSNIWHGRLGHVNYDTLCRLIHLDYLPKFNIDPNH 632

Query: 336 KCACCSQAKITKTSHKSINRVTKPLELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYTFI 395
           KC  C ++K+TK    S+ R T+PL+L HSD+C+     TR  K+Y ITFIDD + Y ++
Sbjct: 633 KCETCVESKLTKVPFHSVERSTEPLDLFHSDICDLKFVQTRGGKKYFITFIDDCTRYCYV 692

Query: 396 YLLKDKSDAFDMFKVYK---NRTLTELVVAILLESEAT-PSLWGEIIKTINYFLDRIPKS 455
           YLLK K +A +MFK YK      L++ + AI  +      S + E           I   
Sbjct: 693 YLLKSKDEAIEMFKHYKIEVENQLSKKIKAIRSDRGGEYESPFEEFCLEHGIIHQTIAPY 752

Query: 456 NSKSSPCKVLKNKT---------------PNLSYLRTWDCLAYVRILDPQRRKLASRAYE 509
           + +S+     KN+T               P   YL+ W CLA V +  P++ K+  +  +
Sbjct: 753 SPQSNGMAERKNRTLKEMMNAMLLSSGHKPCYKYLKVWGCLAKVEVPKPKKVKIGPKTID 812


HSP 2 Score: 48.9 bits (115), Expect = 2.1e-02
Identity = 23/61 (37.70%), Positives = 33/61 (54.10%), Query Frame = 1

Query: 17   PFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTETNFICK 76
            P  F G  FKRW+QKMLF+LT   +A         + +    +EKV    AW  ++F+C+
Sbjct: 1256 PEKFNGNEFKRWQQKMLFYLTTLNLARFLHEECPILEEGETNKEKVAAVDAWKHSDFLCR 1315

Query: 77   N 78
            N
Sbjct: 1316 N 1316


HSP 3 Score: 199.9 bits (507), Expect = 7.5e-48
Identity = 124/386 (32.12%), Positives = 187/386 (48.45%), Query Frame = 1

Query: 87  LYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITNDRSVEGSIVY------ 146
           +Y+ YS + T KE+W+ L+KKY  E AG KK+   R+L Y++ + ++V   +        
Sbjct: 1   MYNVYSQVKTTKELWEFLEKKYKTENAGMKKFIVGRFLDYKMVDSKTVTSEVQELQIILH 60

Query: 147 ------MPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSLITQLRIEEE------ 206
                 M L   FQV  I++KLPP WKDFKN+++HK KE  L+ LI +L+IEE+      
Sbjct: 61  ELHAEKMELSESFQVTTIVEKLPPSWKDFKNSLKHKRKEMGLEDLIVRLKIEEDNCVSEK 120

Query: 207 -ARRHDQKEEGNTISRKKSNAMLKLDLKPKG--NKIKCWWLDTGASRHVSHDLNLFRKYN 266
              +H  + + N +  K +  +      P    + +  WW+DTGA+RHV  + N+F   N
Sbjct: 121 KVGKHPMESKVNLVEPKANKKIKHFGEGPTNMVDNLNEWWVDTGATRHVCGERNMFSSTN 180

Query: 267 E------------IKDKNILLGDHHMTQAAGIKKVERKFTSGKMLVLK------------ 326
           +               K ++L D  +   + I+K    F   K + +K            
Sbjct: 181 QWVVGIGKVVLNMTSGKELVLTD--VLHVSDIRKNLLVFEPDKFVFMKIGMYVGKGFMTN 240

Query: 327 --------EVLHRTEIRKNLISGYLLNKASFL---LCHINKRLTSNMSRLNLIPKLSMHE 386
                    V H     K   S YL+   +F    L H+N +    +  +NL+   S+  
Sbjct: 241 GLFNMNVMTVKHDFNNNKVSTSVYLIESFTFWHNRLGHVNNKTLKMLINMNLLHFFSIDF 300

Query: 387 FEKCACCSQAKITKTSHKSINRVTKPLELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYT 417
             KC  C +AK+ K    SI R TKP +LIH+D+C+     TR  K Y ITFIDD   Y 
Sbjct: 301 KHKCEVCVEAKMAKPPFHSIERNTKPPDLIHNDICDLKFVQTRGGKMYFITFIDDCIRYC 360

BLAST of Cucsa.001350 vs. NCBI nr
Match: gi|659123591|ref|XP_008461736.1| (PREDICTED: uncharacterized protein LOC103500269 [Cucumis melo])

HSP 1 Score: 285.0 bits (728), Expect = 2.5e-73
Identity = 151/234 (64.53%), Positives = 174/234 (74.36%), Query Frame = 1

Query: 1   MAGSLQGDLSSSDLNHPFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEE 60
           MAG  Q  L S DLN PF FEG +FKRWKQKMLFFLTLKKVAT CT+ + K+S+ +P EE
Sbjct: 1   MAGQFQSYLMSFDLNCPFRFEGAHFKRWKQKMLFFLTLKKVATTCTTEKLKVSEKDPTEE 60

Query: 61  KVKTHIAWTETNFICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAE 120
           ++K    WTET+FI KNLILNGLT+ELYDYYS M+TAKEVWDA QKKYD EE GSKKYA 
Sbjct: 61  QLKNLATWTETDFIYKNLILNGLTDELYDYYSTMTTAKEVWDAQQKKYDTEETGSKKYAV 120

Query: 121 SRYLRYQITNDRSVEG------SIVY------MPLHNQFQVAVIIDKLPPLWKDFKNTIR 180
           + YLRYQITNDRSVE        I +      MPL +QFQVAVIIDKLP LWKDFKNT+R
Sbjct: 121 NPYLRYQITNDRSVEAQSHEIQKIAHEIISEGMPLDDQFQVAVIIDKLPSLWKDFKNTLR 180

Query: 181 HKTKEFSLKSLITQLRIEEEARRHDQKEEGNTISRKKSNAMLKLDLKPKGNKIK 223
           HKTK FSL+SLIT+LRIE E      K  G   S+K+ +   + DLKPKGN++K
Sbjct: 181 HKTKVFSLESLITRLRIEVE-EESMIKRRGERYSQKEVHCSFETDLKPKGNQMK 233

BLAST of Cucsa.001350 vs. NCBI nr
Match: gi|147863092|emb|CAN82978.1| (hypothetical protein VITISV_023328 [Vitis vinifera])

HSP 1 Score: 284.3 bits (726), Expect = 4.3e-73
Identity = 164/434 (37.79%), Positives = 247/434 (56.91%), Query Frame = 1

Query: 17  PFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTETNFICK 76
           P  F G  FKRW+QKMLF++T   +A         + +    +EKV    AW  ++F+C+
Sbjct: 18  PEKFNGNEFKRWQQKMLFYVTTLNLARFLHEECPILEEGETNKEKVAAVDAWKHSDFLCR 77

Query: 77  NLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITNDRSVEG 136
           N +LNGL N LY+ Y  + TA+E+WD+L KKY  E+AG KK+   ++L +++ + ++V  
Sbjct: 78  NYVLNGLDNTLYNVYCSLKTAEELWDSLDKKYKTEDAGMKKFIVGKFLDFKMIDSKTVRS 137

Query: 137 SIVY------------MPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSLITQLR 196
            +              M L++ FQVA +I+KLPPLWKDFKN ++HK KE +L+ LI +LR
Sbjct: 138 QVQELQVILHEIHSEGMSLNDSFQVAAVIEKLPPLWKDFKNYLKHKRKEMNLEELIVRLR 197

Query: 197 IEEEARRHDQKEEGNTISRKKSNAMLKL---DLKPKGNKIKCWWLDTGASRHVSHDLNLF 256
           IEE+ R+ ++K  GN     K+N  L     +    GN  K WW+DTGA+RH+  +  +F
Sbjct: 198 IEEDNRKFEKK--GNNSMEAKANMNLSAVVSECNIVGN-TKEWWVDTGATRHICSNKWMF 257

Query: 257 RKYNEI-KDKNILLGDHHMTQAAGIKKVERKFTSGKMLVLKEVLHRTEIRKNLISGYLLN 316
             Y  + +++ + +G    ++  G  KV  K TSGK L L +VLH  +IRKNL+S  LL+
Sbjct: 258 STYKPVEQNEELFMGYSSSSKVEGRGKVILKMTSGKELTLNDVLHVPDIRKNLVSRSLLS 317

Query: 317 KASFLLCHINKR--LTSN---------------MSRLNLIPKLSMHEFEKCACCSQAKIT 376
           K  F L  ++ +  LT N               M+ + ++PK            S A + 
Sbjct: 318 KNGFKLVFVSDKFVLTKNEMFVGKGYLSDGLFKMNVMTVVPK----SINNNKIDSSAYLL 377

Query: 377 KTSH-KSINRVTKPLELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYTFIYLLKDKSDAF 417
           ++S+   + R T+PL+LIHSD+C+     TR  K+Y ITFIDD + Y ++YLLK K +A 
Sbjct: 378 ESSNIWHVERSTEPLDLIHSDICDLKFVQTRGGKKYFITFIDDCTRYCYVYLLKSKDEAI 437

BLAST of Cucsa.001350 vs. NCBI nr
Match: gi|571447932|ref|XP_006577615.1| (PREDICTED: uncharacterized protein LOC102665777 [Glycine max])

HSP 1 Score: 274.2 bits (700), Expect = 4.5e-70
Identity = 146/265 (55.09%), Positives = 185/265 (69.81%), Query Frame = 1

Query: 59  EEKVKTHIA-WTETNFICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKK 118
           ++K+   +A W E +++CKN ILNGLTN+LYDYYS   +AK VW AL+KKYD EEAG+ K
Sbjct: 24  KDKMTMELALWNENDYLCKNFILNGLTNDLYDYYSSYKSAKLVWLALEKKYDTEEAGTTK 83

Query: 119 YAESRYLRYQITNDRSVEGS------IVY------MPLHNQFQVAVIIDKLPPLWKDFKN 178
           Y  SRYL+YQ+T+D+SVE        I +      M L  QFQVAVIIDKLPP WKDFKN
Sbjct: 84  YVVSRYLKYQMTDDKSVESQSHEIHKIAHNIISEGMALDEQFQVAVIIDKLPPGWKDFKN 143

Query: 179 TIRHKTKEFSLKSLITQLRIEEEARRHDQKEEGNTISRKKSNAMLKLDLKPKGNKIKC-- 238
            +RHKTKEFSL+SLIT+LRIEEEARR DQK+E  T+S   +N M +        +I    
Sbjct: 144 LLRHKTKEFSLESLITRLRIEEEARRQDQKDEVLTVSH--NNTMTQEPYIAVITEINMIG 203

Query: 239 ----WWLDTGASRHVSHDLNLFRKYNEIKDKNILLGDHHMTQAAGIKKVERKFTSGKMLV 298
               WW+DTGASRHV +D  +F+ Y  +++K +LLGD H T  AG   VE KFTSGK L+
Sbjct: 204 GSDGWWVDTGASRHVCYDRAMFKTYMNVENKKVLLGDSHTTIVAGTGDVELKFTSGKTLI 263

Query: 299 LKEVLHRTEIRKNLISGYLLNKASF 305
           LK+V+H  E+RKN +SG+LLNK+ F
Sbjct: 264 LKDVMHTPEMRKNRVSGFLLNKSGF 286

BLAST of Cucsa.001350 vs. NCBI nr
Match: gi|571447932|ref|XP_006577615.1| (PREDICTED: uncharacterized protein LOC102665777 [Glycine max])

HSP 1 Score: 106.7 bits (265), Expect = 1.2e-19
Identity = 51/70 (72.86%), Positives = 57/70 (81.43%), Query Frame = 1

Query: 445 IPKSNSKSSPCKVLKNKTPNLSYLRTWDCLAYVRILDPQRRKLASRAYEFVFIGYAENNK 504
           IPKS SK+SP ++LK + PNLSYLRTW CLAYVRI DP+R KLASRAYE VFIGYA N K
Sbjct: 331 IPKSKSKTSPYEILKKRQPNLSYLRTWGCLAYVRIPDPKRVKLASRAYECVFIGYAINRK 390

Query: 505 TYKFYDLENK 515
            Y+FYDL  K
Sbjct: 391 AYRFYDLNAK 400


HSP 2 Score: 223.0 bits (567), Expect = 1.2e-54
Identity = 169/571 (29.60%), Positives = 264/571 (46.23%), Query Frame = 1

Query: 12  SDLNHPFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIAWTET 71
           +D   P  F G +FKRW+ ++  +LT  K     T   + +  +       K    + E 
Sbjct: 5   ADALRPDKFTGVHFKRWQIRVTLWLTAMKCFWVSTGKPEGVLTA-------KQQKQFEEA 64

Query: 72  NFICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRYQITND 131
             +    IL+ L + L + Y  M+ AKE+WDAL  K+   +A +  Y   ++  Y++ ++
Sbjct: 65  TTLFVGCILSVLGDRLVEVYMHMTDAKELWDALNTKFGATDASNDLYIMEQFHDYKMADN 124

Query: 132 RSV------------EGSIVYMPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEFSLKSL 191
           RSV            E  ++   L ++F    II KLPP W+ F   ++HK +E+S++ L
Sbjct: 125 RSVVEQAHEIQTMAKELELLKCVLPDKFVAGCIIAKLPPSWRSFGTALKHKRQEYSVEGL 184

Query: 192 ITQLRIEEEAR---------------------------RHDQKEEGNTISRKKSNAMLKL 251
           I  L +EE+AR                           ++  ++  N   +KK+N     
Sbjct: 185 IASLDVEEKAREKDAASKGDGGQSSANVVHKAQNKSKGKYKAQQTTNFKKQKKNNNNPNQ 244

Query: 252 D------------LKPKGNKIKCWWLDTGASRHVSHDLNLFRKYNEIK--DKNILLGDHH 311
           D            L  K N+   WW+DTGA+   S  +   +    +   D+N++ G   
Sbjct: 245 DERTCFVCGQVGHLARKFNQSTNWWVDTGAN-FTSGKIVQLKNVQHVPSIDRNLVSGSRL 304

Query: 312 MTQAAGIKKVERKFTSGKMLVLKEV-----------LHRTEIRK------NLISGYLLNK 371
                G K V   F S K++V K             L R  +        N I G + ++
Sbjct: 305 TRD--GFKLV---FESNKVVVSKHGYFIGKGYECGGLFRFSLSDFCNKSVNHICGSVDDE 364

Query: 372 ASFL---LCHINKRLTSNMSRLNLIPKLSMHEFEKCACCSQAKITKTSHKSIN-RVTKPL 431
           A+     LCHIN  L S +S + LIPK S+ +  KC  C Q+K  +  HK+   R   PL
Sbjct: 365 ANVWHSRLCHINFGLMSRLSSMCLIPKFSIVKGSKCHSCVQSKQPRKPHKAAEERNLAPL 424

Query: 432 ELIHSDLCEFDGTLTRNSKRYVITFIDDSSDYTFIYLLKDKSDAFDMFKVYKNRTLTELV 491
           EL+HSDLCE +G LT+  KRY +T IDD++ + ++YLLK K +A D FK+YK     +L 
Sbjct: 425 ELLHSDLCEMNGVLTKGGKRYFMTLIDDATRFCYVYLLKTKDEALDYFKIYKAEVENQL- 484

Query: 492 VAILLESEATPSLWGEIIKTINYFLDRIPKSNSKSSPCKVLKNKTPNLSYLRTWDCLAYV 509
                            IK +     R+P  N   +P ++   + P+LSYLRTW CLA V
Sbjct: 485 --------------DRKIKRL-----RVPNRNKDKTPYEIWIGRKPSLSYLRTWGCLAKV 542

BLAST of Cucsa.001350 vs. NCBI nr
Match: gi|955304629|ref|XP_014634108.1| (PREDICTED: uncharacterized protein LOC106799690 [Glycine max])

HSP 1 Score: 221.9 bits (564), Expect = 2.6e-54
Identity = 115/203 (56.65%), Positives = 145/203 (71.43%), Query Frame = 1

Query: 10  SSSDLNHPFLFEGENFKRWKQKMLFFLTLKKVATACTSTEQKISDSNPFEEKVKTHIA-- 69
           +S+DLN PF FEG +FK W+QKM+F+LT++KVA    +    + +    E K K  +   
Sbjct: 10  NSNDLNKPFRFEGYHFKHWQQKMMFYLTMRKVAYVLNTGIPVVLEDAEKEVKDKMTMELA 69

Query: 70  -WTETNFICKNLILNGLTNELYDYYSVMSTAKEVWDALQKKYDIEEAGSKKYAESRYLRY 129
            W E +++CKN ILNGL ++LYDYYS   +AK VW AL+KKYD EE G+KKYA SRYL+Y
Sbjct: 70  LWNENDYLCKNFILNGLADDLYDYYSPYKSAKLVWLALEKKYDTEETGTKKYAASRYLKY 129

Query: 130 QITNDRSVEG------SIVY------MPLHNQFQVAVIIDKLPPLWKDFKNTIRHKTKEF 189
           Q+T+D+S+E        I +      M L  QFQVAVIIDKLPP WKDFKN +RHKTKEF
Sbjct: 130 QMTDDKSIESQSHEIQKIAHDIISEGMTLDEQFQVAVIIDKLPPGWKDFKNNLRHKTKEF 189

Query: 190 SLKSLITQLRIEEEARRHDQKEE 198
           SL+SLIT+LRIEEEARR DQK+E
Sbjct: 190 SLESLITRLRIEEEARRQDQKDE 212

The following BLAST results are available for this feature:
Match Name                        E-value   Identity (%)  Description
COPIA_DROME                       1.4e-07   30.84         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3

Match Name                        E-value   Identity (%)  Description
A5B994_VITVI                      3.0e-73   37.79         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_023328 PE=4 SV=1
Q7Y017_ORYSJ                      8.3e-55   29.60         Putative polyprotein, 3'-partial (Fragment) OS=Oryza sativa subsp. japonica GN=O...
M5X570_PRUPE                      2.7e-53   55.56         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppb012171mg PE=4 SV=1
A5CBN2_VITVI                      1.5e-51   35.92         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016964 PE=4 SV=1
A5CBN2_VITVI                      3.6e-26   31.76         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_016964 PE=4 SV=1

Match Name                        E-value   Identity (%)  Description
gi|659123591|ref|XP_008461736.1|  2.5e-73   64.53         PREDICTED: uncharacterized protein LOC103500269 [Cucumis melo]
gi|147863092|emb|CAN82978.1|      4.3e-73   37.79         hypothetical protein VITISV_023328 [Vitis vinifera]
gi|571447932|ref|XP_006577615.1|  4.5e-70   55.09         PREDICTED: uncharacterized protein LOC102665777 [Glycine max]
gi|571447932|ref|XP_006577615.1|  1.2e-19   72.86         PREDICTED: uncharacterized protein LOC102665777 [Glycine max]
gi|955304629|ref|XP_014634108.1|  2.6e-54   56.65         PREDICTED: uncharacterized protein LOC106799690 [Glycine max]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term        Definition
IPR012337   RNaseH-like_sf
Vocabulary: Molecular Function
Term        Definition
GO:0003676  nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.001350.1  Cucsa.001350.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term   IPR Description             Source   Source Term        Source Description               Alignment
IPR012337  Ribonuclease H-like domain  GENE3D   G3DSA:3.30.420.10                                   coord: 352..410; score: 4.
IPR012337  Ribonuclease H-like domain  unknown  SSF53098           Ribonuclease H-like              coord: 352..456; score: 4.1
None       No IPR available            PANTHER  PTHR11439          GAG-POL-RELATED RETROTRANSPOSON  coord: 17..512; score: 6.4
None       No IPR available            PANTHER  PTHR11439:SF192    SUBFAMILY NOT NAMED              coord: 17..512; score: 6.4
None       No IPR available            PFAM     PF14223            Retrotran_gag_2                  coord: 68..192; score: 2.1

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None