CSPI02G16360 (gene) Wild cucumber (PI 183967)

Name: CSPI02G16360
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrotransposon protein, putative, Ty1-copia subclass
Location: Chr2 : 15759836 .. 15761626 (-)
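The location string can be turned into numeric coordinates with a few lines of Python. This is only a sketch: it assumes the coordinates are 1-based and inclusive (the page does not say so), and `parse_location` is a hypothetical helper name, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a 'Chr : start .. end (strand)' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.search(r"(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)", text)
    if m is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end, strand = m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        "length": end - start + 1,
    }

loc = parse_location("Chr2 : 15759836 .. 15761626 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

The `(-)` strand means the feature is annotated on the reverse strand of the reference.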
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS
AATGATGAACGCAATGCTTATAAGTTCAGGTTTACCCCAAAATTTGTGGGGAGAAGCTTTGTTAACATCAAATTATTTATTAAACAGAATTCCTCATAAGAAGTCACAAAATATTTCTTATGAAAAATGGAAAGGAAGAAAACTTTCATATAAATTCTTAAAAGTATGGGGGTGCCTAGCAAAGGTTGTCATGCCTAAACCTAAAATGGTTAAAATTGGACCAAAAACTATTGATTGCATATTCATTGGTTATGCTAGTAACAGTTGTGCATATCGATTTATAGTTCATAAATCAGATATTTCAGATATACATGTTAATACAATCATGGTATCGAGGAATGCAACATTCTTTGAGAATATTTTTCCTCATAAAATGTTTTGTGAAGCAAGGTTACAAAAACGTTCGTTTGATGCTATAACGAGTGAATATCACAATAGATCAAATGTTGAGTTAACGAACAATGAGGAACTCCGACAAAGTAAAAGGATGAGGATCTCAAAATCATTTGGTCCTGATTATTTAACTTATTTGTTAGAAAACGAACCTCAAACATTTAAAGAGGCCATGTCCTCTCCTGAAGCTTCATATTGGAAAGAGGCTGTGAATAGTGAAATTGAGTCCATTATGCATAACCATACTTGGGAACCAGTTAATCTTCCATTAGTAAGTAAACCACCTGGTTGCAAGTGGATTTTCAAATGGAAATTGAAGACCGATGGGTCAATAGATAAATATAAAGCAAGACTTGTCGCTAAAGGTTACAAGCAACAAGAAGGACTTGACTATTTTGTTACATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAACATCAGCTTTGCATGGATTTGAGATACATCAGATGGATGTCAAGACGACATTTTTAAACGGTGAGTTAGATGAAGAGATCTACATGCAACAACCCGAAAGGTTTGTTTCTTTAAGTCAAGAAAAGAAAGTCTGCAAAATTAATTAGGTCTCTTTATGAACTAAAACAAGCATCGAAACAATGGCATGAAAAATTTGGCAGTGCAATGATGGCCAATGGGTTTAAAATCAATGAATGTGACAAATGTGTATATGTCAAAAACAATGAGCATGACCATGTCATTGTGTGCTTATATGTTGATGATATGCTAATTATAGGTAGCAATATCAACATTATTAAGAAAACCAAACAAATGTTGGCCAATAAATTTGAGATGAAAGACATGGGTGTCGCAGATGTTATTCTAGGTATCAAAAAATTCTAGAACCCCACAAGGGTTAGTGCTATCTCAATCACATTATATTGATAAAATATTGAAGAAATATACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAATAATGGAGATAGTATAGCACAATTGGAATACTCTCGCATCATTGGTAGTTTGATGTGCATCATGAGTTGTACACGTCCTGATATAGCGTATGCGGTAAGCAAATTAAGTCGCTATACAAGTAATCCAGGTCGTGATCATTGGAAAGTTATCTTGAGAGTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATAAACTATACTCGATATCCTGCTGTACTTGAAGGCTATAGTGATGTCAATTGGATATCGAGCACTAAAGACTCCAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAAACATGTATAGCACGATCCACAATGGAATTTGAATTTATAGCCTTAGACAAGGCTGGAGAAGAAGCATAA

mRNA sequence

ATGATGAACGCAATGCTTATAAGTTCAGGTTTACCCCAAAATTTGTGGGGAGAAGCTTTGTTAACATCAAATTATTTATTAAACAGAATTCCTCATAAGAAGTCACAAAATATTTCTTATGAAAAATGGAAAGGAAGAAAACTTTCATATAAATTCTTAAAAGTATGGGGGTGCCTAGCAAAGGTTGTCATGCCTAAACCTAAAATGGTTAAAATTGGACCAAAAACTATTGATTGCATATTCATTGGTTATGCTAGTAACAGTTGTGCATATCGATTTATAGTTCATAAATCAGATATTTCAGATATACATGTTAATACAATCATGGTATCGAGGAATGCAACATTCTTTGAGAATATTTTTCCTCATAAAATGTTTTGTGAAGCAAGGTTACAAAAACGTTCGTTTGATGCTATAACGAGTGAATATCACAATAGATCAAATGTTGAGTTAACGAACAATGAGGAACTCCGACAAAGTAAAAGGATGAGGATCTCAAAATCATTTGGTCCTGATTATTTAACTTATTTGTTAGAAAACGAACCTCAAACATTTAAAGAGGCCATGTCCTCTCCTGAAGCTTCATATTGGAAAGAGGCTGTGAATAGTGAAATTGAGTCCATTATGCATAACCATACTTGGGAACCAGTTAATCTTCCATTAGTAAGTAAACCACCTGGTTGCAAGTGGATTTTCAAATGGAAATTGAAGACCGATGGGTCAATAGATAAATATAAAGCAAGACTTGTCGCTAAAGGTTACAAGCAACAAGAAGGACTTGACTATTTTGTTACATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAACATCAGCTTTGCATGGATTTGAGATACATCAGATGGATGTCAAGACGACATTTTTAAACGGTGAGTTAGATGAAGAGATCTACATGCAACAACCCGAAAGTGCAATGATGGCCAATGGGTTTAAAATCAATGAATGTGACAAATGTGTATATGTCAAAAACAATGAGCATGACCATGTCATTGTATCAAAAAATTCTAGAACCCCACAAGGGTTAGTGCTATCTCAATCACATTATATTGATAAAATATTGAAGAAATATACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAATAATGGAGATAGTATAGCACAATTGGAATACTCTCGCATCATTGGTAGTTTGATGTGCATCATGAGTTGTACACGTCCTGATATAGCGTATGCGGTAAGCAAATTAAGTCGCTATACAAGTAATCCAGGTCGTGATCATTGGAAAGTTATCTTGAGAGTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATAAACTATACTCGATATCCTGCTGTACTTGAAGGCTATAGTGATGTCAATTGGATATCGAGCACTAAAGACTCCAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAAACATGTATAGCACGATCCACAATGGAATTTGAATTTATAGCCTTAGACAAGGCTGGAGAAGAAGCATAA

Coding sequence (CDS)

ATGATGAACGCAATGCTTATAAGTTCAGGTTTACCCCAAAATTTGTGGGGAGAAGCTTTGTTAACATCAAATTATTTATTAAACAGAATTCCTCATAAGAAGTCACAAAATATTTCTTATGAAAAATGGAAAGGAAGAAAACTTTCATATAAATTCTTAAAAGTATGGGGGTGCCTAGCAAAGGTTGTCATGCCTAAACCTAAAATGGTTAAAATTGGACCAAAAACTATTGATTGCATATTCATTGGTTATGCTAGTAACAGTTGTGCATATCGATTTATAGTTCATAAATCAGATATTTCAGATATACATGTTAATACAATCATGGTATCGAGGAATGCAACATTCTTTGAGAATATTTTTCCTCATAAAATGTTTTGTGAAGCAAGGTTACAAAAACGTTCGTTTGATGCTATAACGAGTGAATATCACAATAGATCAAATGTTGAGTTAACGAACAATGAGGAACTCCGACAAAGTAAAAGGATGAGGATCTCAAAATCATTTGGTCCTGATTATTTAACTTATTTGTTAGAAAACGAACCTCAAACATTTAAAGAGGCCATGTCCTCTCCTGAAGCTTCATATTGGAAAGAGGCTGTGAATAGTGAAATTGAGTCCATTATGCATAACCATACTTGGGAACCAGTTAATCTTCCATTAGTAAGTAAACCACCTGGTTGCAAGTGGATTTTCAAATGGAAATTGAAGACCGATGGGTCAATAGATAAATATAAAGCAAGACTTGTCGCTAAAGGTTACAAGCAACAAGAAGGACTTGACTATTTTGTTACATACTCACCAGTTACTCGAATTACTTCCATTCGCATGCTCATAGCAACATCAGCTTTGCATGGATTTGAGATACATCAGATGGATGTCAAGACGACATTTTTAAACGGTGAGTTAGATGAAGAGATCTACATGCAACAACCCGAAAGTGCAATGATGGCCAATGGGTTTAAAATCAATGAATGTGACAAATGTGTATATGTCAAAAACAATGAGCATGACCATGTCATTGTATCAAAAAATTCTAGAACCCCACAAGGGTTAGTGCTATCTCAATCACATTATATTGATAAAATATTGAAGAAATATACAAAACATGAAATTGTGATTGCAAAGACCCCAATTGATGGAAGTCTCCATTTAAGTAAAAATAATGGAGATAGTATAGCACAATTGGAATACTCTCGCATCATTGGTAGTTTGATGTGCATCATGAGTTGTACACGTCCTGATATAGCGTATGCGGTAAGCAAATTAAGTCGCTATACAAGTAATCCAGGTCGTGATCATTGGAAAGTTATCTTGAGAGTTTTGGGATACTTAAAGCATACTAAAAATTATGGAATAAACTATACTCGATATCCTGCTGTACTTGAAGGCTATAGTGATGTCAATTGGATATCGAGCACTAAAGACTCCAAATCCACCAGTGGTTACATTTTTACCCTTGGAGGCGGTGCTGTTTCTTGGAAATCTTCCAAACAAACATGTATAGCACGATCCACAATGGAATTTGAATTTATAGCCTTAGACAAGGCTGGAGAAGAAGCATAA
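The protein query used in the BLAST reports below can be related to the coding sequence by translating it. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 60 nucleotides of the CDS above are translated.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon.

    Trailing bases that do not fill a codon are ignored; a stop codon
    is rendered as '*'.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 60 nucleotides of the CDS listed above.
cds_prefix = "ATGATGAACGCAATGCTTATAAGTTCAGGTTTACCCCAAAATTTGTGGGGAGAAGCTTTG"
print(translate(cds_prefix))
```

The result can be compared with the start of the `Query:` lines in the alignments.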
BLAST of CSPI02G16360 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 149.8 bits (377), Expect = 8.1e-35
Identity = 111/359 (30.92%), Positives = 172/359 (47.91%), Query Frame = 1

Query: 2   MNAMLISSGLPQNLWGEALLTSNYLLNRIPHKK-SQNISYEKWKGRKLSYKFLKVWGCLA 61
           + +ML  + LP++ WGEA+ T+ YL+NR P    +  I    W  +++SY  LKV+GC A
Sbjct: 596 VRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRA 655

Query: 62  KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 121
              +PK +  K+  K+I CIFIGY      YR       + D     ++ SR+  F E+ 
Sbjct: 656 FAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYR-------LWDPVKKKVIRSRDVVFRESE 715

Query: 122 FPHKMFCEARLQK---RSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYL 181
                    +++     +F  I S  +N ++ E T +E   Q ++       G      +
Sbjct: 716 VRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGV 775

Query: 182 LENEPQTFKEAMSSPEASYWKEAVNS----EIESIMHNHTWEPVNL-------------- 241
            E E  T  E    P     +  V S      E ++ +   EP +L              
Sbjct: 776 EEVEHPTQGEEQHQPLRRSERPRVESRRYPSTEYVLISDDREPESLKEVLSHPEKNQLMK 835

Query: 242 ---------------PLVSKPPG-----CKWIFKWKLKTDGSIDKYKARLVAKGYKQQEG 301
                           LV  P G     CKW+FK K   D  + +YKARLV KG++Q++G
Sbjct: 836 AMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFEQKKG 895

Query: 302 LDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPESAMMA 319
           +D+   +SPV ++TSIR +++ +A    E+ Q+DVKT FL+G+L+EEIYM+QPE   +A
Sbjct: 896 IDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVA 947
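The percentages in each HSP header are the match count divided by the alignment length. The sketch below recomputes them from a header line; `hsp_percentages` is a hypothetical helper, and the regular expression is written only for the header layout used in this report.

```python
import re

# Matches the 'Identity = a/n (x%), Positives = b/n (y%)' part of an HSP header.
# 'Pos\w*' tolerates variant spellings of the second label.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), Pos\w* = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Return (identity %, positives %) recomputed from the raw counts."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, ident_len, _, positive, pos_len, _ = m.groups()
    identity_pct = 100 * int(identical) / int(ident_len)
    positives_pct = 100 * int(positive) / int(pos_len)
    return round(identity_pct, 2), round(positives_pct, 2)

header = "Identity = 111/359 (30.92%), Positives = 172/359 (47.91%), Query Frame = 1"
print(hsp_percentages(header))
```

Identity counts exactly matching residues; the positives count also includes the conservative substitutions marked `+` in the alignment midline.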

BLAST of CSPI02G16360 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 114.0 bits (284), Expect = 4.9e-24
Identity = 63/190 (33.16%), Positives = 103/190 (54.21%), Query Frame = 1

Query: 149  VELTNNEELRQSKRMRISKSFGPDYLTYLLENE-------PQTFKEAMSSPEASYWKEAV 208
            +E+ N    R   + +IS +   + L  ++ N        P +F E     + S W+EA+
Sbjct: 851  IEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAI 910

Query: 209  NSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQEGLD 268
            N+E+ +   N+TW     P        +W+F  K    G+  +YKARLVA+G+ Q+  +D
Sbjct: 911  NTELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQID 970

Query: 269  YFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPESAMMANGF 328
            Y  T++PV RI+S R +++    +  ++HQMDVKT FLNG L EEIYM+ P+  +  N  
Sbjct: 971  YEETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQ-GISCNSD 1030

Query: 329  KINECDKCVY 332
             + + +K +Y
Sbjct: 1031 NVCKLNKAIY 1039

BLAST of CSPI02G16360 vs. Swiss-Prot
Match: M820_ARATH (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 4.5e-17
Identity = 43/106 (40.57%), Positives = 68/106 (64.15%), Query Frame = 1

Query: 174 LTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFK 233
           +T  ++ EP++   A+  P    W +A+  E++++  N TW  V  P+     GCKW+FK
Sbjct: 20  ITTTIKKEPKSVIFALKDPG---WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFK 79

Query: 234 WKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLI 280
            KL +DG++D+ KARLVAKG+ Q+EG+ +  TYSPV R  +IR ++
Sbjct: 80  TKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTIL 122

BLAST of CSPI02G16360 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 82.0 bits (201), Expect = 2.1e-14
Identity = 49/167 (29.34%), Positives = 88/167 (52.69%), Query Frame = 1

Query: 349 PQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCI 408
           P GL LSQ+ Y ++IL      +     TP+   L+ S +        ++  I+G+L   
Sbjct: 51  PSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQ-Y 110

Query: 409 MSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSD 468
           ++ TRPDI+YAV+ + +    P    + ++ RVL Y+K T  +G+   +   + ++ + D
Sbjct: 111 LTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 170

Query: 469 VNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIAL 515
            +W   T   +ST+G+   LG   +SW + +Q  ++RS+ E E+ AL
Sbjct: 171 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRAL 216

BLAST of CSPI02G16360 vs. TrEMBL
Match: Q2QY02_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g04310 PE=4 SV=1)

HSP 1 Score: 543.9 bits (1400), Expect = 2.2e-151
Identity = 284/563 (50.44%), Positives = 372/563 (66.07%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            ++NAML ++GLP+  WGEALLTSN++LNR+P++      YE W GRK S  +L+ WGCLA
Sbjct: 475  LVNAMLDTAGLPKAWWGEALLTSNHVLNRVPNRNKDKTPYEIWIGRKPSLSYLRTWGCLA 534

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV +P  K  K+GPKT+DC+F+GYA +S AYRF++ KS++ D+HV TIM SR+ATFFE+ 
Sbjct: 535  KVNVPITKKRKLGPKTVDCVFLGYAHHSIAYRFLIVKSEVPDMHVGTIMESRDATFFESF 594

Query: 121  FPHKMFCEARLQKRSF--DAITSEYHNRSNVELTNNEEL----RQSKRMRISKSFGPDYL 180
            FP K       Q       +IT         EL + E++    R+SKR R +KSFG D+ 
Sbjct: 595  FPMKDTHSGSNQPSEIIPSSITPPEQTEHTHELVSEEDVSEAPRRSKRQRTAKSFGDDFT 654

Query: 181  TYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKW 240
             YL+++ P++  EA +SP+A YWKEAV SE++SI+ N TWE    P   KP GCKW+FK 
Sbjct: 655  VYLVDDTPKSISEAYASPDADYWKEAVRSEMDSIIANGTWEVTERPYGCKPVGCKWVFKK 714

Query: 241  KLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDV 300
            KL+ +G+I+KYKARLVAKGY Q+EG D+F TYSPV R+T+IR+L++ +A HG  +HQMDV
Sbjct: 715  KLRPNGTIEKYKARLVAKGYTQKEGEDFFDTYSPVARLTTIRVLLSLAASHGLLVHQMDV 774

Query: 301  KTTFLNGELDEEIYMQQPE---------------------------------SAMMANGF 360
            KT FLNGELDEEIYM QP+                                   + + GF
Sbjct: 775  KTAFLNGELDEEIYMDQPDGFVVEGQEGKVCKLLKSLYGLKQAPKQWHEKFDKTLTSAGF 834

Query: 361  KINECDKCVYVKNNEHDHVIVSKNSRTPQGLVLS---QSHYIDKILKKYTKHEIVIAKTP 420
             +NE DKCVY ++   + VI+         L+     +SHY++KIL ++   +   + TP
Sbjct: 835  AVNEADKCVYYRHGGGEGVILCLY--VDDILIFGTNLESHYVEKILNRFGYIDSKPSPTP 894

Query: 421  IDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVI 480
             D SL L KN   +  QLEYS+IIGSLM + S TRPDI++AVSKLSR+TSNPG DHW+ +
Sbjct: 895  YDPSLLLRKNKRIARNQLEYSQIIGSLMYLTSATRPDISFAVSKLSRFTSNPGDDHWRAL 954

Query: 481  LRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSK 522
             RV+ YLK T   G++YT YPAVLEGYSD NWIS   + K+TSGY+FTLGGGAVSW+S K
Sbjct: 955  ERVMRYLKGTVELGLHYTGYPAVLEGYSDSNWISDVDEIKATSGYVFTLGGGAVSWRSCK 1014

BLAST of CSPI02G16360 vs. TrEMBL
Match: A0A151R256_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_042281 PE=4 SV=1)

HSP 1 Score: 542.0 bits (1395), Expect = 8.2e-151
Identity = 281/561 (50.09%), Positives = 374/561 (66.67%), Query Frame = 1

Query: 2   MNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAK 61
           MNA+L SS  P NLWGEALLT+ +L NRIPH+++    YE WKG   + K+L+VWGCLAK
Sbjct: 1   MNALLNSSYAPDNLWGEALLTACFLQNRIPHRRTGKTPYELWKGYVPNLKYLRVWGCLAK 60

Query: 62  VVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENIF 121
           V++P PK  KIGPKT DC+FIGYA  S AYRF+V KS + D   NTI+ ++NA FFENIF
Sbjct: 61  VLLPDPKKRKIGPKTSDCMFIGYAERSAAYRFLVLKSSVIDC--NTIVETKNAEFFENIF 120

Query: 122 PHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENE 181
           P K        + S    +SE+           E+LR+SKR R   SFG D+ TYL+EN+
Sbjct: 121 PLKSSINTSSTQPSPLETSSEHMF---------EDLRRSKRQRKETSFGSDFYTYLVEND 180

Query: 182 PQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGS 241
           P +F EA+SS +A +WKEA+  EI+SI  N+TW  V+LP  +KP GCKWIFK K   DGS
Sbjct: 181 PISFSEAISSSDAIFWKEAIRIEIDSINENNTWTLVDLPKGAKPIGCKWIFKRKYNPDGS 240

Query: 242 IDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNG 301
           I+KYKARLVAKG+ Q++ +D+F T++PV RI+SIR+LIA +++H   IHQMDVKT FLNG
Sbjct: 241 IEKYKARLVAKGFTQKQDIDFFDTFAPVARISSIRVLIALASIHRLVIHQMDVKTAFLNG 300

Query: 302 ELDEEIYMQQPE---------------------------------SAMMANGFKINECDK 361
           EL+EEIYM QPE                                   ++ +GF  +  DK
Sbjct: 301 ELEEEIYMTQPEGCEVPGQENKVCRLLKSLYGLKQAPKQWHEKFDQVLLNDGFSSSSADK 360

Query: 362 CVYVKNNEHDHVIVSKN--------SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPID 421
           CVY K+   D VI+            R    ++LSQ HY++++LKK+  ++     TP D
Sbjct: 361 CVYTKSMNDDCVIICLYVDDMLIFVVRKGDSILLSQRHYVERLLKKFDYYDCKYVTTPYD 420

Query: 422 GSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILR 481
            +  L +N GDS+AQ +Y++IIGSL+ +M+ +RPDIAYAVS+LSRYT  P +DHW+ + R
Sbjct: 421 VNSQLKQNKGDSLAQSQYAQIIGSLLHLMNFSRPDIAYAVSRLSRYTHCPNQDHWEALAR 480

Query: 482 VLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQT 522
           ++ YL+ T +YGI Y+ +PAVLEGYSD NWIS + ++KSTS Y+FTLGGGA+SW+S +Q+
Sbjct: 481 LMRYLRGTMDYGIEYSGFPAVLEGYSDANWISDSDETKSTSYYVFTLGGGAISWRSVRQS 540

BLAST of CSPI02G16360 vs. TrEMBL
Match: A0A151TB57_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_018882 PE=4 SV=1)

HSP 1 Score: 534.6 bits (1376), Expect = 1.3e-148
Identity = 278/573 (48.52%), Positives = 374/573 (65.27%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            MMNA+L SS  P NLWGEALLT+ +L NRIPH+++    YE WKG   + K+L+VWGCLA
Sbjct: 546  MMNALLNSSSAPDNLWGEALLTACFLQNRIPHRRTGKTPYELWKGYVPNLKYLRVWGCLA 605

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV++P PK  KIGPKT DC+FIGYA  S AYRF+V KS + D   NTI+ ++NA FFENI
Sbjct: 606  KVLLPDPKKRKIGPKTSDCMFIGYAERSAAYRFLVLKSSVIDC--NTIVETKNAEFFENI 665

Query: 121  FPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLEN 180
            FP K        + S    +SE+           E+LR+SKR R   SFG D+ TYL+EN
Sbjct: 666  FPLKSSINTSSTQPSPLETSSEHMF---------EDLRRSKRQRKETSFGSDFYTYLVEN 725

Query: 181  EPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDG 240
            +P +F EA+SS +A +WKEA+  EI+SI  N+TW  V+LP  +KP GCKWIFK K   DG
Sbjct: 726  DPISFSEAISSSDAIFWKEAIRIEIDSIKENNTWTLVDLPKGAKPIGCKWIFKRKYNPDG 785

Query: 241  SIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN 300
            SI+KYKARLVAKG+ Q++ +D+F T++PV RI+SIR+LIA +++H   IHQMDVKT FLN
Sbjct: 786  SIEKYKARLVAKGFTQKQDIDFFDTFAPVARISSIRVLIALASIHRLVIHQMDVKTAFLN 845

Query: 301  GELDEEIYMQQPES---------------------------------AMMANGFKINECD 360
            GEL+EEIYM QPE                                   ++ +GF  +  D
Sbjct: 846  GELEEEIYMTQPEGCEVPGQENKVCRLLKSLYGLKQAPKQWHEKFDQVLLNDGFSSSSAD 905

Query: 361  KCVYVKNNEHDHVIV-------------------SKNSRTPQGLVLSQSHYIDKILKKYT 420
            +CVY K  + D VI+                   +K+       +    HY++++LKK+ 
Sbjct: 906  RCVYTKCMDDDCVIICLYVDDMLIFGTCDDIVFKTKSFLASNFDMKDMGHYVERLLKKFD 965

Query: 421  KHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTS 480
             ++     TP D +  L +N GDS+AQ +Y++IIGSL+ +M+ +RPDIAYAVS+LSRYT 
Sbjct: 966  YYDCKSVTTPYDVNSQLKQNKGDSLAQSQYAQIIGSLLHLMNFSRPDIAYAVSRLSRYTH 1025

Query: 481  NPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLG 522
             P +DHW+ + R++ YL+ T +YGI Y+ +PAVLEGYSD NWIS + ++KSTSGY+FTLG
Sbjct: 1026 CPNQDHWEALARLMRYLRGTMDYGIEYSGFPAVLEGYSDANWISDSDETKSTSGYVFTLG 1085

BLAST of CSPI02G16360 vs. TrEMBL
Match: Q2QRF6_ORYSJ (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica GN=LOC_Os12g27730 PE=4 SV=2)

HSP 1 Score: 534.6 bits (1376), Expect = 1.3e-148
Identity = 283/554 (51.08%), Positives = 365/554 (65.88%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            ++NAML ++GLP+  WGEALLTSN++LNR+P++      YE W GRK S  +L+ WGCLA
Sbjct: 604  LVNAMLDTAGLPKAWWGEALLTSNHVLNRVPNRNKDKTPYEIWIGRKPSLSYLRTWGCLA 663

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV +P  K  K+GPKT+DC+F+GYA +S AYRF++ KS++ D+HV TIM SR+ATFFE+ 
Sbjct: 664  KVNVPITKKRKLGPKTVDCVFLGYAHHSIAYRFLIVKSEVPDMHVGTIMESRDATFFESF 723

Query: 121  FPHKMFCEARLQKRSF--DAITSEYHNRSNVELTNNEEL----RQSKRMRISKSFGPDYL 180
            FP K       Q       +IT         EL + E++    R+SKR R +KSFG D+ 
Sbjct: 724  FPMKDTHSGSNQPFEIIPSSITPPEQTEHTHELVSEEDVSEAPRRSKRQRTAKSFGDDFT 783

Query: 181  TYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKW 240
             YL+++ P++  EA +SP+A YWKE V+SE++SI+ N TWE    P   KP GCKW+FK 
Sbjct: 784  VYLVDDTPKSISEAYASPDADYWKEVVHSEMDSIIANGTWEVTERPYGCKPVGCKWVFKK 843

Query: 241  KLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDV 300
            KL+ DG+I+KYKARLVAKGY Q+EG D F TYSPV R+T+IR+L++ +A HG  +HQMDV
Sbjct: 844  KLRPDGTIEKYKARLVAKGYTQKEGEDLFDTYSPVARLTTIRVLLSLAASHGLLVHQMDV 903

Query: 301  KTTFLNGELDEEIYMQQPESAMMANGFKINECDKCVYVKNNEHDHVIV------------ 360
            KT FLNGELDEEIYM QP+   +  GF +NE DKCVY ++   + VI+            
Sbjct: 904  KTAFLNGELDEEIYMYQPDG-FVVEGFAVNEADKCVYYRHGGGEGVILCLYVDDILIFGT 963

Query: 361  ------SKNSRTPQGLVLSQSHYIDKILK-KYTKHEIVI--------AKTPIDGSLHLSK 420
                     S   Q   +      D IL  K  + ++          + TP D SL L K
Sbjct: 964  NLEVINEVKSFLSQNFDMKDLGVADVILNIKLIREDLESFGYIDSKPSPTPYDPSLLLRK 1023

Query: 421  NNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKH 480
            N   +  QLEYS+IIGSLM + S TRPDI++AVSKLSR+TSNPG DHW+   RV+ YLK 
Sbjct: 1024 NKRIARNQLEYSQIIGSLMYLASATRPDISFAVSKLSRFTSNPGDDHWRAFERVMRYLKG 1083

Query: 481  TKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTM 522
            T   G++YT YPAVLEGYSD NWIS   + K+TSGY+FTLGGGAVSW+S KQT + RSTM
Sbjct: 1084 TMELGLHYTGYPAVLEGYSDSNWISDVDEIKATSGYVFTLGGGAVSWRSCKQTILTRSTM 1143

BLAST of CSPI02G16360 vs. TrEMBL
Match: Q5WMW8_ORYSJ (Putative polyprotein OS=Oryza sativa subsp. japonica GN=OJ1037_G10.4 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 1.7e-143
Identity = 277/561 (49.38%), Positives = 365/561 (65.06%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            M+NAML ++GL +  WGEA+LT+ ++LN+IP K  +   +E+W+ +KL+  +L+ WGCLA
Sbjct: 732  MVNAMLDTAGLSKEWWGEAVLTACHVLNKIPMKHKEVTPFEEWEWKKLNLSYLRTWGCLA 791

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV +P  K  K+GPKT+DC+F+GYA +S  YRF++  S + D+H  TI  SR+ATFFEN 
Sbjct: 792  KVNVPIAKKRKLGPKTVDCVFLGYAIHSVGYRFLIANSGVPDMHAGTIFESRDATFFENE 851

Query: 121  FPHKMFC-----EARLQKRSFDAIT-SEYHNRSNVELTNNEELRQSKRMRISKSFGPDYL 180
            FP K        E  +    F  I  ++     N E  N  + R+SKR R++KSFG DY+
Sbjct: 852  FPMKYTPSTSSKETVMPHEHFAPIEHNDQMPEENPEEDNIVDTRKSKRQRVAKSFGDDYI 911

Query: 181  TYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKW 240
             YL+++ P+T +EA SSP+A YWKEAV SE++SIM N TWE V  P   KP GCKW+FK 
Sbjct: 912  VYLVDDTPRTVEEAYSSPDADYWKEAVRSEMDSIMSNGTWEVVERPYGCKPIGCKWVFKK 971

Query: 241  KLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDV 300
            KL+ DG+I+KYKARLVAK Y Q+EG D+F TYSPV R+T+IR+L+A +A HG  +HQMDV
Sbjct: 972  KLRPDGTIEKYKARLVAKSYTQKEGEDFFDTYSPVARLTTIRVLLALAASHGLLVHQMDV 1031

Query: 301  KTTFLNGELDEEIY--------------------------MQQP-------ESAMMANGF 360
            KT FLNGEL+EEIY                           Q P       ++ + + GF
Sbjct: 1032 KTAFLNGELEEEIYMDQPDGYVLEGQEGMVCKLLKSLYGLKQAPKQWHEKFDTTLTSAGF 1091

Query: 361  KINECDKCVYVKNNEHDHVIVSKNSRTPQGLVLSQS-HYIDKILKKYTKHEIVIAKTPID 420
             +NE DKCVY +    + VI+         L+   S + I+++L ++   +   A TP D
Sbjct: 1092 VVNEADKCVYYRYGGGEGVILCLY--VDDILIFGTSLNVIEEVLSRFGYSDCKPAPTPYD 1151

Query: 421  GSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILR 480
             S+ L KN   +  QL YS+IIGSLM + S TRPDI++AVSKLSR+ SNPG DHW+ + R
Sbjct: 1152 PSVLLRKNRRIARDQLRYSQIIGSLMYLASATRPDISFAVSKLSRFVSNPGDDHWQALER 1211

Query: 481  VLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQT 522
            V+ YLK T +YGI+YT YP VLEGYSD NWIS   + K+TSGY FTLGGGAVSWKS KQT
Sbjct: 1212 VMRYLKGTMSYGIHYTGYPKVLEGYSDSNWISDADEIKATSGYAFTLGGGAVSWKSCKQT 1271

BLAST of CSPI02G16360 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 149.4 bits (376), Expect = 6.0e-36
Identity = 90/257 (35.02%), Positives = 135/257 (52.53%), Query Frame = 1

Query: 138 AITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENEPQTFKEAMSSPEASYW 197
           A   +Y+  S   LT ++  +     ++S  +    +      EP T+ EA    E   W
Sbjct: 42  AYLQDYYCHSVASLTIHDISQFLSYEKVSPLYHSFLVCIAKAKEPSTYNEAK---EFLVW 101

Query: 198 KEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGSIDKYKARLVAKGYKQQ 257
             A++ EI ++   HTWE   LP   KP GCKW++K K  +DG+I++YKARLVAKGY QQ
Sbjct: 102 CGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIERYKARLVAKGYTQQ 161

Query: 258 EGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNGELDEEIYMQQPESAMM 317
           EG+D+  T+SPV ++TS+++++A SA++ F +HQ+D+   FLNG+LDEEIYM+ P     
Sbjct: 162 EGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAA 221

Query: 318 ANGFKINECDKCVYVKNNEHDHVIVS-----KNSRTPQGLVLSQSHYIDKILKKYTKHEI 377
             G  +     C Y+K + +     S     K S T  G    QSH       K T    
Sbjct: 222 RQGDSLPPNAVC-YLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLF 281

Query: 378 VIAKTPIDGSLHLSKNN 390
           +     +D  +  S N+
Sbjct: 282 LCVLVYVDDIIICSNND 294

BLAST of CSPI02G16360 vs. TAIR10
Match: ATMG00820.1 (ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 90.9 bits (224), Expect = 2.5e-18
Identity = 43/106 (40.57%), Positives = 68/106 (64.15%), Query Frame = 1

Query: 174 LTYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFK 233
           +T  ++ EP++   A+  P    W +A+  E++++  N TW  V  P+     GCKW+FK
Sbjct: 20  ITTTIKKEPKSVIFALKDPG---WCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFK 79

Query: 234 WKLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLI 280
            KL +DG++D+ KARLVAKG+ Q+EG+ +  TYSPV R  +IR ++
Sbjct: 80  TKLHSDGTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTIL 122

BLAST of CSPI02G16360 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 82.0 bits (201), Expect = 1.2e-15
Identity = 49/167 (29.34%), Positives = 88/167 (52.69%), Query Frame = 1

Query: 349 PQGLVLSQSHYIDKILKKYTKHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCI 408
           P GL LSQ+ Y ++IL      +     TP+   L+ S +        ++  I+G+L   
Sbjct: 51  PSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQ-Y 110

Query: 409 MSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKHTKNYGINYTRYPAV-LEGYSD 468
           ++ TRPDI+YAV+ + +    P    + ++ RVL Y+K T  +G+   +   + ++ + D
Sbjct: 111 LTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCD 170

Query: 469 VNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTMEFEFIAL 515
            +W   T   +ST+G+   LG   +SW + +Q  ++RS+ E E+ AL
Sbjct: 171 SDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRAL 216

BLAST of CSPI02G16360 vs. NCBI nr
Match: gi|77552972|gb|ABA95768.1| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 543.9 bits (1400), Expect = 3.1e-151
Identity = 284/563 (50.44%), Positives = 372/563 (66.07%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            ++NAML ++GLP+  WGEALLTSN++LNR+P++      YE W GRK S  +L+ WGCLA
Sbjct: 475  LVNAMLDTAGLPKAWWGEALLTSNHVLNRVPNRNKDKTPYEIWIGRKPSLSYLRTWGCLA 534

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV +P  K  K+GPKT+DC+F+GYA +S AYRF++ KS++ D+HV TIM SR+ATFFE+ 
Sbjct: 535  KVNVPITKKRKLGPKTVDCVFLGYAHHSIAYRFLIVKSEVPDMHVGTIMESRDATFFESF 594

Query: 121  FPHKMFCEARLQKRSF--DAITSEYHNRSNVELTNNEEL----RQSKRMRISKSFGPDYL 180
            FP K       Q       +IT         EL + E++    R+SKR R +KSFG D+ 
Sbjct: 595  FPMKDTHSGSNQPSEIIPSSITPPEQTEHTHELVSEEDVSEAPRRSKRQRTAKSFGDDFT 654

Query: 181  TYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKW 240
             YL+++ P++  EA +SP+A YWKEAV SE++SI+ N TWE    P   KP GCKW+FK 
Sbjct: 655  VYLVDDTPKSISEAYASPDADYWKEAVRSEMDSIIANGTWEVTERPYGCKPVGCKWVFKK 714

Query: 241  KLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDV 300
            KL+ +G+I+KYKARLVAKGY Q+EG D+F TYSPV R+T+IR+L++ +A HG  +HQMDV
Sbjct: 715  KLRPNGTIEKYKARLVAKGYTQKEGEDFFDTYSPVARLTTIRVLLSLAASHGLLVHQMDV 774

Query: 301  KTTFLNGELDEEIYMQQPE---------------------------------SAMMANGF 360
            KT FLNGELDEEIYM QP+                                   + + GF
Sbjct: 775  KTAFLNGELDEEIYMDQPDGFVVEGQEGKVCKLLKSLYGLKQAPKQWHEKFDKTLTSAGF 834

Query: 361  KINECDKCVYVKNNEHDHVIVSKNSRTPQGLVLS---QSHYIDKILKKYTKHEIVIAKTP 420
             +NE DKCVY ++   + VI+         L+     +SHY++KIL ++   +   + TP
Sbjct: 835  AVNEADKCVYYRHGGGEGVILCLY--VDDILIFGTNLESHYVEKILNRFGYIDSKPSPTP 894

Query: 421  IDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVI 480
             D SL L KN   +  QLEYS+IIGSLM + S TRPDI++AVSKLSR+TSNPG DHW+ +
Sbjct: 895  YDPSLLLRKNKRIARNQLEYSQIIGSLMYLTSATRPDISFAVSKLSRFTSNPGDDHWRAL 954

Query: 481  LRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSK 522
             RV+ YLK T   G++YT YPAVLEGYSD NWIS   + K+TSGY+FTLGGGAVSW+S K
Sbjct: 955  ERVMRYLKGTVELGLHYTGYPAVLEGYSDSNWISDVDEIKATSGYVFTLGGGAVSWRSCK 1014

BLAST of CSPI02G16360 vs. NCBI nr
Match: gi|1012324613|gb|KYP36589.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 542.0 bits (1395), Expect = 1.2e-150
Identity = 281/561 (50.09%), Positives = 374/561 (66.67%), Query Frame = 1

Query: 2   MNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLAK 61
           MNA+L SS  P NLWGEALLT+ +L NRIPH+++    YE WKG   + K+L+VWGCLAK
Sbjct: 1   MNALLNSSYAPDNLWGEALLTACFLQNRIPHRRTGKTPYELWKGYVPNLKYLRVWGCLAK 60

Query: 62  VVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENIF 121
           V++P PK  KIGPKT DC+FIGYA  S AYRF+V KS + D   NTI+ ++NA FFENIF
Sbjct: 61  VLLPDPKKRKIGPKTSDCMFIGYAERSAAYRFLVLKSSVIDC--NTIVETKNAEFFENIF 120

Query: 122 PHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLENE 181
           P K        + S    +SE+           E+LR+SKR R   SFG D+ TYL+EN+
Sbjct: 121 PLKSSINTSSTQPSPLETSSEHMF---------EDLRRSKRQRKETSFGSDFYTYLVEND 180

Query: 182 PQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDGS 241
           P +F EA+SS +A +WKEA+  EI+SI  N+TW  V+LP  +KP GCKWIFK K   DGS
Sbjct: 181 PISFSEAISSSDAIFWKEAIRIEIDSINENNTWTLVDLPKGAKPIGCKWIFKRKYNPDGS 240

Query: 242 IDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLNG 301
           I+KYKARLVAKG+ Q++ +D+F T++PV RI+SIR+LIA +++H   IHQMDVKT FLNG
Sbjct: 241 IEKYKARLVAKGFTQKQDIDFFDTFAPVARISSIRVLIALASIHRLVIHQMDVKTAFLNG 300

Query: 302 ELDEEIYMQQPE---------------------------------SAMMANGFKINECDK 361
           EL+EEIYM QPE                                   ++ +GF  +  DK
Sbjct: 301 ELEEEIYMTQPEGCEVPGQENKVCRLLKSLYGLKQAPKQWHEKFDQVLLNDGFSSSSADK 360

Query: 362 CVYVKNNEHDHVIVSKN--------SRTPQGLVLSQSHYIDKILKKYTKHEIVIAKTPID 421
           CVY K+   D VI+            R    ++LSQ HY++++LKK+  ++     TP D
Sbjct: 361 CVYTKSMNDDCVIICLYVDDMLIFVVRKGDSILLSQRHYVERLLKKFDYYDCKYVTTPYD 420

Query: 422 GSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILR 481
            +  L +N GDS+AQ +Y++IIGSL+ +M+ +RPDIAYAVS+LSRYT  P +DHW+ + R
Sbjct: 421 VNSQLKQNKGDSLAQSQYAQIIGSLLHLMNFSRPDIAYAVSRLSRYTHCPNQDHWEALAR 480

Query: 482 VLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQT 522
           ++ YL+ T +YGI Y+ +PAVLEGYSD NWIS + ++KSTS Y+FTLGGGA+SW+S +Q+
Sbjct: 481 LMRYLRGTMDYGIEYSGFPAVLEGYSDANWISDSDETKSTSYYVFTLGGGAISWRSVRQS 540

BLAST of CSPI02G16360 vs. NCBI nr
Match: gi|108862657|gb|ABA98136.2| (retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Group])

HSP 1 Score: 534.6 bits (1376), Expect = 1.9e-148
Identity = 283/554 (51.08%), Positives = 365/554 (65.88%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            ++NAML ++GLP+  WGEALLTSN++LNR+P++      YE W GRK S  +L+ WGCLA
Sbjct: 604  LVNAMLDTAGLPKAWWGEALLTSNHVLNRVPNRNKDKTPYEIWIGRKPSLSYLRTWGCLA 663

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV +P  K  K+GPKT+DC+F+GYA +S AYRF++ KS++ D+HV TIM SR+ATFFE+ 
Sbjct: 664  KVNVPITKKRKLGPKTVDCVFLGYAHHSIAYRFLIVKSEVPDMHVGTIMESRDATFFESF 723

Query: 121  FPHKMFCEARLQKRSF--DAITSEYHNRSNVELTNNEEL----RQSKRMRISKSFGPDYL 180
            FP K       Q       +IT         EL + E++    R+SKR R +KSFG D+ 
Sbjct: 724  FPMKDTHSGSNQPFEIIPSSITPPEQTEHTHELVSEEDVSEAPRRSKRQRTAKSFGDDFT 783

Query: 181  TYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKW 240
             YL+++ P++  EA +SP+A YWKE V+SE++SI+ N TWE    P   KP GCKW+FK 
Sbjct: 784  VYLVDDTPKSISEAYASPDADYWKEVVHSEMDSIIANGTWEVTERPYGCKPVGCKWVFKK 843

Query: 241  KLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDV 300
            KL+ DG+I+KYKARLVAKGY Q+EG D F TYSPV R+T+IR+L++ +A HG  +HQMDV
Sbjct: 844  KLRPDGTIEKYKARLVAKGYTQKEGEDLFDTYSPVARLTTIRVLLSLAASHGLLVHQMDV 903

Query: 301  KTTFLNGELDEEIYMQQPESAMMANGFKINECDKCVYVKNNEHDHVIV------------ 360
            KT FLNGELDEEIYM QP+   +  GF +NE DKCVY ++   + VI+            
Sbjct: 904  KTAFLNGELDEEIYMYQPDG-FVVEGFAVNEADKCVYYRHGGGEGVILCLYVDDILIFGT 963

Query: 361  ------SKNSRTPQGLVLSQSHYIDKILK-KYTKHEIVI--------AKTPIDGSLHLSK 420
                     S   Q   +      D IL  K  + ++          + TP D SL L K
Sbjct: 964  NLEVINEVKSFLSQNFDMKDLGVADVILNIKLIREDLESFGYIDSKPSPTPYDPSLLLRK 1023

Query: 421  NNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILRVLGYLKH 480
            N   +  QLEYS+IIGSLM + S TRPDI++AVSKLSR+TSNPG DHW+   RV+ YLK 
Sbjct: 1024 NKRIARNQLEYSQIIGSLMYLASATRPDISFAVSKLSRFTSNPGDDHWRAFERVMRYLKG 1083

Query: 481  TKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQTCIARSTM 522
            T   G++YT YPAVLEGYSD NWIS   + K+TSGY+FTLGGGAVSW+S KQT + RSTM
Sbjct: 1084 TMELGLHYTGYPAVLEGYSDSNWISDVDEIKATSGYVFTLGGGAVSWRSCKQTILTRSTM 1143

BLAST of CSPI02G16360 vs. NCBI nr
Match: gi|1012353102|gb|KYP64290.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 534.6 bits (1376), Expect = 1.9e-148
Identity = 278/573 (48.52%), Positives = 374/573 (65.27%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            MMNA+L SS  P NLWGEALLT+ +L NRIPH+++    YE WKG   + K+L+VWGCLA
Sbjct: 546  MMNALLNSSSAPDNLWGEALLTACFLQNRIPHRRTGKTPYELWKGYVPNLKYLRVWGCLA 605

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV++P PK  KIGPKT DC+FIGYA  S AYRF+V KS + D   NTI+ ++NA FFENI
Sbjct: 606  KVLLPDPKKRKIGPKTSDCMFIGYAERSAAYRFLVLKSSVIDC--NTIVETKNAEFFENI 665

Query: 121  FPHKMFCEARLQKRSFDAITSEYHNRSNVELTNNEELRQSKRMRISKSFGPDYLTYLLEN 180
            FP K        + S    +SE+           E+LR+SKR R   SFG D+ TYL+EN
Sbjct: 666  FPLKSSINTSSTQPSPLETSSEHMF---------EDLRRSKRQRKETSFGSDFYTYLVEN 725

Query: 181  EPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKWKLKTDG 240
            +P +F EA+SS +A +WKEA+  EI+SI  N+TW  V+LP  +KP GCKWIFK K   DG
Sbjct: 726  DPISFSEAISSSDAIFWKEAIRIEIDSIKENNTWTLVDLPKGAKPIGCKWIFKRKYNPDG 785

Query: 241  SIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDVKTTFLN 300
            SI+KYKARLVAKG+ Q++ +D+F T++PV RI+SIR+LIA +++H   IHQMDVKT FLN
Sbjct: 786  SIEKYKARLVAKGFTQKQDIDFFDTFAPVARISSIRVLIALASIHRLVIHQMDVKTAFLN 845

Query: 301  GELDEEIYMQQPES---------------------------------AMMANGFKINECD 360
            GEL+EEIYM QPE                                   ++ +GF  +  D
Sbjct: 846  GELEEEIYMTQPEGCEVPGQENKVCRLLKSLYGLKQAPKQWHEKFDQVLLNDGFSSSSAD 905

Query: 361  KCVYVKNNEHDHVIV-------------------SKNSRTPQGLVLSQSHYIDKILKKYT 420
            +CVY K  + D VI+                   +K+       +    HY++++LKK+ 
Sbjct: 906  RCVYTKCMDDDCVIICLYVDDMLIFGTCDDIVFKTKSFLASNFDMKDMGHYVERLLKKFD 965

Query: 421  KHEIVIAKTPIDGSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTS 480
             ++     TP D +  L +N GDS+AQ +Y++IIGSL+ +M+ +RPDIAYAVS+LSRYT 
Sbjct: 966  YYDCKSVTTPYDVNSQLKQNKGDSLAQSQYAQIIGSLLHLMNFSRPDIAYAVSRLSRYTH 1025

Query: 481  NPGRDHWKVILRVLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLG 522
             P +DHW+ + R++ YL+ T +YGI Y+ +PAVLEGYSD NWIS + ++KSTSGY+FTLG
Sbjct: 1026 CPNQDHWEALARLMRYLRGTMDYGIEYSGFPAVLEGYSDANWISDSDETKSTSGYVFTLG 1085

BLAST of CSPI02G16360 vs. NCBI nr
Match: gi|54291731|gb|AAV32100.1| (putative polyprotein [Oryza sativa Japonica Group])

HSP 1 Score: 517.7 bits (1332), Expect = 2.4e-143
Identity = 277/561 (49.38%), Positives = 365/561 (65.06%), Query Frame = 1

Query: 1    MMNAMLISSGLPQNLWGEALLTSNYLLNRIPHKKSQNISYEKWKGRKLSYKFLKVWGCLA 60
            M+NAML ++GL +  WGEA+LT+ ++LN+IP K  +   +E+W+ +KL+  +L+ WGCLA
Sbjct: 732  MVNAMLDTAGLSKEWWGEAVLTACHVLNKIPMKHKEVTPFEEWEWKKLNLSYLRTWGCLA 791

Query: 61   KVVMPKPKMVKIGPKTIDCIFIGYASNSCAYRFIVHKSDISDIHVNTIMVSRNATFFENI 120
            KV +P  K  K+GPKT+DC+F+GYA +S  YRF++  S + D+H  TI  SR+ATFFEN 
Sbjct: 792  KVNVPIAKKRKLGPKTVDCVFLGYAIHSVGYRFLIANSGVPDMHAGTIFESRDATFFENE 851

Query: 121  FPHKMFC-----EARLQKRSFDAIT-SEYHNRSNVELTNNEELRQSKRMRISKSFGPDYL 180
            FP K        E  +    F  I  ++     N E  N  + R+SKR R++KSFG DY+
Sbjct: 852  FPMKYTPSTSSKETVMPHEHFAPIEHNDQMPEENPEEDNIVDTRKSKRQRVAKSFGDDYI 911

Query: 181  TYLLENEPQTFKEAMSSPEASYWKEAVNSEIESIMHNHTWEPVNLPLVSKPPGCKWIFKW 240
             YL+++ P+T +EA SSP+A YWKEAV SE++SIM N TWE V  P   KP GCKW+FK 
Sbjct: 912  VYLVDDTPRTVEEAYSSPDADYWKEAVRSEMDSIMSNGTWEVVERPYGCKPIGCKWVFKK 971

Query: 241  KLKTDGSIDKYKARLVAKGYKQQEGLDYFVTYSPVTRITSIRMLIATSALHGFEIHQMDV 300
            KL+ DG+I+KYKARLVAK Y Q+EG D+F TYSPV R+T+IR+L+A +A HG  +HQMDV
Sbjct: 972  KLRPDGTIEKYKARLVAKSYTQKEGEDFFDTYSPVARLTTIRVLLALAASHGLLVHQMDV 1031

Query: 301  KTTFLNGELDEEIY--------------------------MQQP-------ESAMMANGF 360
            KT FLNGEL+EEIY                           Q P       ++ + + GF
Sbjct: 1032 KTAFLNGELEEEIYMDQPDGYVLEGQEGMVCKLLKSLYGLKQAPKQWHEKFDTTLTSAGF 1091

Query: 361  KINECDKCVYVKNNEHDHVIVSKNSRTPQGLVLSQS-HYIDKILKKYTKHEIVIAKTPID 420
             +NE DKCVY +    + VI+         L+   S + I+++L ++   +   A TP D
Sbjct: 1092 VVNEADKCVYYRYGGGEGVILCLY--VDDILIFGTSLNVIEEVLSRFGYSDCKPAPTPYD 1151

Query: 421  GSLHLSKNNGDSIAQLEYSRIIGSLMCIMSCTRPDIAYAVSKLSRYTSNPGRDHWKVILR 480
             S+ L KN   +  QL YS+IIGSLM + S TRPDI++AVSKLSR+ SNPG DHW+ + R
Sbjct: 1152 PSVLLRKNRRIARDQLRYSQIIGSLMYLASATRPDISFAVSKLSRFVSNPGDDHWQALER 1211

Query: 481  VLGYLKHTKNYGINYTRYPAVLEGYSDVNWISSTKDSKSTSGYIFTLGGGAVSWKSSKQT 522
            V+ YLK T +YGI+YT YP VLEGYSD NWIS   + K+TSGY FTLGGGAVSWKS KQT
Sbjct: 1212 VMRYLKGTMSYGIHYTGYPKVLEGYSDSNWISDADEIKATSGYAFTLGGGAVSWKSCKQT 1271
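
The identity and positive percentages in the two HSP header lines above follow directly from the residue counts over the aligned length. A minimal Python sketch that reproduces them is given here; it uses only the counts printed in those headers, and the dictionary and function names are illustrative assumptions, not part of the database output.

```python
# Counts copied from the HSP headers: (identical, positive, aligned length).
hsps = {
    "KYP64290.1": (278, 374, 573),
    "AAV32100.1": (277, 365, 561),
}


def percent(count: int, length: int) -> float:
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


for name, (identical, positive, length) in hsps.items():
    # Prints the accession, then percent identity, then percent positives.
    print(name, percent(identical, length), percent(positive, length))
# KYP64290.1 48.52 65.27
# AAV32100.1 49.38 65.06
```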

The following BLAST results are available for this feature:
Match Name    E-value    Identity    Description
POLX_TOBAC    8.1e-35    30.92       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME   4.9e-24    33.16       Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M820_ARATH    4.5e-17    40.57       Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana GN=AtMg0...
M810_ARATH    2.1e-14    29.34       Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
Match Name          E-value     Identity    Description
Q2QY02_ORYSJ        2.2e-151    50.44       Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap...
A0A151R256_CAJCA    8.2e-151    50.09       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
A0A151TB57_CAJCA    1.3e-148    48.52       Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=...
Q2QRF6_ORYSJ        1.3e-148    51.08       Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap...
Q5WMW8_ORYSJ        1.7e-143    49.38       Putative polyprotein OS=Oryza sativa subsp. japonica GN=OJ1037_G10.4 PE=4 SV=1
Match Name     E-value    Identity    Description
AT4G23160.1    6.0e-36    35.02       cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00820.1    2.5e-18    40.57       ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)
ATMG00810.1    1.2e-15    29.34       ATMG00810.1 DNA/RNA polymerases superfamily protein
Match Name                      E-value     Identity    Description
gi|77552972|gb|ABA95768.1|      3.1e-151    50.44       retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro...
gi|1012324613|gb|KYP36589.1|    1.2e-150    50.09       Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|108862657|gb|ABA98136.2|     1.9e-148    51.08       retrotransposon protein, putative, Ty1-copia subclass [Oryza sativa Japonica Gro...
gi|1012353102|gb|KYP64290.1|    1.9e-148    48.52       Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]
gi|54291731|gb|AAV32100.1|      2.4e-143    49.38       putative polyprotein [Oryza sativa Japonica Group]
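
The hits in the last table above appear to be listed from most to least significant, that is, by ascending E-value. A short Python sketch of that check follows, assuming the E-values are read as ordinary floating-point numbers; the `rows` list is a hand-copied illustration of the table, not an export format offered by the database.

```python
# (match name, E-value string) pairs copied from the table of gi-numbered hits.
rows = [
    ("gi|77552972|gb|ABA95768.1|", "3.1e-151"),
    ("gi|1012324613|gb|KYP36589.1|", "1.2e-150"),
    ("gi|108862657|gb|ABA98136.2|", "1.9e-148"),
    ("gi|1012353102|gb|KYP64290.1|", "1.9e-148"),
    ("gi|54291731|gb|AAV32100.1|", "2.4e-143"),
]

# Python floats represent values down to roughly 1e-308, so these parse exactly enough.
evalues = [float(text) for _, text in rows]

# True when each E-value is less than or equal to the one that follows it.
is_ascending = evalues == sorted(evalues)
print(is_ascending)
```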
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR013103    RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0006310        DNA recombination
cellular_component    GO:0031410        cytoplasmic vesicle
molecular_function    GO:0003677        DNA binding

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI02G16360.1    CSPI02G16360.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term     IPR Description                                        Source     Source Term        Source Description                 Alignment
IPR013103    Reverse transcriptase, RNA-dependent DNA polymerase    PFAM       PF07727            RVT_2                              coord: 211..315; score: 1.8
None         No IPR available                                       PANTHER    PTHR11439          GAG-POL-RELATED RETROTRANSPOSON    coord: 1..521; score: 2.5E
None         No IPR available                                       PANTHER    PTHR11439:SF192    SUBFAMILY NOT NAMED                coord: 1..521; score: 2.5E
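
The `coord:` entries above give the span of each match on the protein. A minimal Python sketch for turning such an entry into a match length is shown here, under the assumption that the coordinates are 1-based and inclusive amino-acid positions; the function name `coord_length` is hypothetical.

```python
import re


def coord_length(text: str) -> int:
    """Return the number of residues covered by a 'coord: start..end' entry.

    Assumes start and end are 1-based, inclusive positions.
    """
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coord entry found in {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


print(coord_length("coord: 211..315"))  # 105 residues for the PF07727 match
print(coord_length("coord: 1..521"))    # 521 residues for the PANTHER matches
```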

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None