CSPI07G01990 (gene) Wild cucumber (PI 183967)

Name: CSPI07G01990
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr7 : 1691571 .. 1692767 (-)
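The location entry can be turned into a span length with a few lines of code. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this); `parse_location` is a hypothetical helper, not part of the database.

```python
import re

def parse_location(text):
    """Parse 'Chr7 : 1691571 .. 1692767 (-)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

chrom, start, end, strand = parse_location("Chr7 : 1691571 .. 1692767 (-)")

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
gene_length = end - start + 1
print(chrom, strand, gene_length)  # Chr7 - 1197
```

The minus strand means the sequences listed for this feature would be the reverse complement of the reference chromosome over this span.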
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGAATTGGAAGAGGAAGATATATGTTACTCATCCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAATGCTCTTTACGGATTGAGGCAAGCTCCACAAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCACTCAAGAGCAAGCAATCTACACAAGAAGAGAAAAAGAGGAATATGTTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGTAGCATTGAAAAGGTCAATAAGTTCAAGCAATAAATGATGGCAAAATTTGAAATGAGTGACTTAGGCCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAGAAGGGTCGAATCATGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACAGAAGAAGCACCAATTGATGCTACGAAGTATAGAAGCATCTTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAGATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGGTGGTCAAGCAAATACTTAGGTATTTGAGAGGGACGATTCATTTTGGGCTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCAGACAGTGATTTAGCTGGTGATCTCGACAGGAGGAAAAGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCACTCTCATCTTGCGAAGCCGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCATTGTGGTTAAGATGCCTTGTTAGCGAGATAGTCGGAATGGAGCCAAGGCCGGTAACATTATTTGTGGACAACAAATCCGCGATAGCTCTCATGAAGAATCCCGTATTTCATGGTCGCAACAAGCACATAGATACATGTTTTCATTTCATTCGAGAGTGTGTCAAGAATGGACAAATTATCGTTGAAGTTGTCAACACTGGAGAACAACGAGCCGATGTCCTGACTAAAGCATTGACGGGAGTAAAGTTAGCTGCTATGCGTCAACTACTTGGTGTTCGTAAGTTAGAATCATGCCAGAATTAG

mRNA sequence

ATGGAGAATTGGAAGAGGAAGATATATGTTACTCATCCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAATGCTCTTTACGGATTGAGGCAAGCTCCACAAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCACTCAAGAGCAAGCAATCTACACAAGAAGAGAAAAAGAGGAATATGTTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGAATTGAAGTTGAACAACAGAAGGGTCGAATCATGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACAGAAGAAGCACCAATTGATGCTACGAAGTATAGAAGCATCTTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAGATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGGTGGTCAAGCAAATACTTAGGTATTTGAGAGGGACGATTCATTTTGGGCTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCAGACAGTGATTTAGCTGGTGATCTCGACAGGAGGAAAAGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCACTCTCATCTTGCGAAGCCGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCATTGTGGTTAAGATGCCTTGTTAGCGAGATAGTCGGAATGGAGCCAAGGCCGGTAACATTATTTGTGGACAACAAATCCGCGATAGCTCTCATGAAGAATCCCGTATTTCATGGTCGCAACAAGCACATAGATACATGTTTTCATTTCATTCGAGAGTGTGTCAAGAATGGACAAATTATCGTTGAAGTTGTCAACACTGGAGAACAACGAGCCGATGTCCTGACTAAAGCATTGACGGGAGTAAAGTTAGCTGCTATGCGTCAACTACTTGGTGTTCGTAAGTTAGAATCATGCCAGAATTAG
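The gene and mRNA sequences differ only by the intron, so the intron can be located by comparing the two strings. The sketch below assumes a single contiguous deletion and uses short excerpts copied from the two sequences above; `find_single_deletion` is a hypothetical helper name.

```python
def find_single_deletion(genomic, spliced):
    """Return (start, end) of the one genomic segment absent from `spliced`.

    Only valid when the sequences differ by a single contiguous deletion.
    """
    if len(genomic) < len(spliced):
        raise ValueError("genomic sequence is shorter than the spliced one")
    # Longest common prefix.
    prefix = 0
    while prefix < len(spliced) and genomic[prefix] == spliced[prefix]:
        prefix += 1
    # Longest common suffix that does not overlap the prefix.
    suffix = 0
    while suffix < len(spliced) - prefix and genomic[-1 - suffix] == spliced[-1 - suffix]:
        suffix += 1
    if prefix + suffix != len(spliced):
        raise ValueError("sequences differ by more than one deletion")
    return prefix, len(genomic) - suffix

# Excerpts around the splice site, copied from the gene and mRNA sequences.
gene_frag = ("ATTGTAACAGGAAGTAGCATTGAAAAGGTCAATAAGTTCAAGCAATAAATGATGGCAAAATTT"
             "GAAATGAGTGACTTAGGCCTTCTCTCTTACTACTTAGGAATTGAAGTTGAACAACAG")
mrna_frag = "ATTGTAACAGGAAGAATTGAAGTTGAACAACAG"

start, end = find_single_deletion(gene_frag, mrna_frag)
segment = gene_frag[start:end]
print(start, end, len(segment))
```

Because the base just before the reported segment equals the segment's last base, the boundaries can be shifted one position to the left without changing the spliced product; the shifted segment begins with GT and ends with AG, the usual splice-site dinucleotides.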

Coding sequence (CDS)

ATGGAGAATTGGAAGAGGAAGATATATGTTACTCATCCGGAGGGTTTTGAGGTTCCAAATGAAAAACACAAGGTGTATAGATTGTCAAATGCTCTTTACGGATTGAGGCAAGCTCCACAAGCTTGGAACATTCGACTTGATAGGAGTCTCAAAGATCTTGGTTTTAGAAAATGCACTCAAGAGCAAGCAATCTACACAAGAAGAGAAAAAGAGGAATATGTTCTTGTTGGAGTGTATGTTGACGATCTCATTGTAACAGGAAGAATTGAAGTTGAACAACAGAAGGGTCGAATCATGCTCAAACAACCAACTTATGCCAAAAGAATTTTGTCCCAATTTGGAATGGCTGATTGCAATGCCACAAAGTACCCGATGGAACCCAAGGCACAACTTCATAAAGACACAGAAGAAGCACCAATTGATGCTACGAAGTATAGAAGCATCTTTGGTTGTCTTAGATACTTACTGAACACAAGGCCAGATCTTTCATATGTTGTTGGGATGGCGAGTAGGTATATGGAAAGGCCTACAACCATGCATTACAAGGTGGTCAAGCAAATACTTAGGTATTTGAGAGGGACGATTCATTTTGGGCTCACTTATACGAAAGGTCCCAGAGAATTCAATATATTCGGTTACTCAGACAGTGATTTAGCTGGTGATCTCGACAGGAGGAAAAGCACAAGTGGAATGACATTCTACTTAAACGAAAGCTTGGTTTCATGGAATTCGCAAAAGCAAAAGACGGTGGCACTCTCATCTTGCGAAGCCGAGTTCATTGCAGCTACTACCGCAGCTTGCCAAGCATTGTGGTTAAGATGCCTTGTTAGCGAGATAGTCGGAATGGAGCCAAGGCCGGTAACATTATTTGTGGACAACAAATCCGCGATAGCTCTCATGAAGAATCCCGTATTTCATGGTCGCAACAAGCACATAGATACATGTTTTCATTTCATTCGAGAGTGTGTCAAGAATGGACAAATTATCGTTGAAGTTGTCAACACTGGAGAACAACGAGCCGATGTCCTGACTAAAGCATTGACGGGAGTAAAGTTAGCTGCTATGCGTCAACTACTTGGTGTTCGTAAGTTAGAATCATGCCAGAATTAG
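As a quick consistency check, the start of the CDS can be translated and compared with the BLAST query lines, which begin at residue 5 with KRKIYVTHPEGF. This is a minimal sketch assuming the standard genetic code; the `translate` function is illustrative and only the first 48 bases of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 48 bases of the CDS listed above.
cds_prefix = "ATGGAGAATTGGAAGAGGAAGATATATGTTACTCATCCGGAGGGTTTT"
print(translate(cds_prefix))  # MENWKRKIYVTHPEGF
```

Residues 5 onward of the translated prefix (KRKIYVTHPEGF) agree with the query sequence shown in the alignments.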
BLAST of CSPI07G01990 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 235.0 bits (598), Expect = 1.4e-60
Identity = 134/396 (33.84%), Positives = 219/396 (55.30%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + +IY+  PEGFEV  +KH V +L+ +LYGL+QAP+ W ++ D  +K   + K   +  +
Sbjct: 933  EEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCV 992

Query: 65   YTRREKEE-YVLVGVYVDDLIVTGR-------------------------------IEVE 124
            Y +R  E  ++++ +YVDD+++ G+                               I  E
Sbjct: 993  YFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRE 1052

Query: 125  QQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDT------EEAPIDATKYR 184
            +   ++ L Q  Y +R+L +F M +      P+    +L K        E+  +    Y 
Sbjct: 1053 RTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYS 1112

Query: 185  SIFGCLRY-LLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGP 244
            S  G L Y ++ TRPD+++ VG+ SR++E P   H++ VK ILRYLRGT    L +  G 
Sbjct: 1113 SAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGDCLCF--GG 1172

Query: 245  REFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTA 304
             +  + GY+D+D+AGD+D RKS++G  F  +   +SW S+ QK VALS+ EAE+IAAT  
Sbjct: 1173 SDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATET 1232

Query: 305  ACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKN 362
              + +WL+  + E+ G+  +   ++ D++SAI L KN ++H R KHID  +H+IRE V +
Sbjct: 1233 GKEMIWLKRFLQEL-GLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHWIREMVDD 1292
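Each HSP header in these reports carries a bit score, a raw score, an E-value, and identity/positive fractions. The sketch below shows one way to parse those two header lines and recompute the percentages; the regular expressions and the `parse_hsp` helper are assumptions tailored to the line layout seen here, not a general BLAST parser.

```python
import re

SCORE_RE = re.compile(r"Score:\s*([\d.]+) bits \((\d+)\), Expect = (\S+)")
# Accepts the label spelled either 'Positives' or 'Postives'.
FRACTION_RE = re.compile(r"(Identity|Posi?tives) = (\d+)/(\d+)")

def parse_hsp(score_line, stats_line):
    """Extract scores and recompute percentages from two HSP header lines."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("no score found")
    hsp = {"bits": float(m.group(1)),
           "raw_score": int(m.group(2)),
           "evalue": float(m.group(3))}
    for label, num, den in FRACTION_RE.findall(stats_line):
        key = "identity_pct" if label == "Identity" else "positives_pct"
        hsp[key] = 100.0 * int(num) / int(den)
    return hsp

hsp = parse_hsp(
    "HSP 1 Score: 235.0 bits (598), Expect = 1.4e-60",
    "Identity = 134/396 (33.84%), Positives = 219/396 (55.30%), Query Frame = 1",
)
print(round(hsp["identity_pct"], 2), round(hsp["positives_pct"], 2))  # 33.84 55.3
```

Recomputing the percentages from the fractions is a cheap way to confirm that the reported values were copied correctly.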

BLAST of CSPI07G01990 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 215.3 bits (547), Expect = 1.1e-54
Identity = 136/391 (34.78%), Positives = 210/391 (53.71%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            K +IY+  P+G    ++   V +L+ A+YGL+QA + W    +++LK+  F   + ++ I
Sbjct: 1013 KEEIYMRLPQGISCNSDN--VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSSVDRCI 1072

Query: 65   YT--RREKEEYVLVGVYVDDLIV-TGR----------------------------IEVEQ 124
            Y   +    E + V +YVDD+++ TG                             I +E 
Sbjct: 1073 YILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEM 1132

Query: 125  QKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLR 184
            Q+ +I L Q  Y K+ILS+F M +CNA   P+  K        +   + T  RS+ GCL 
Sbjct: 1133 QEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCN-TPCRSLIGCLM 1192

Query: 185  YL-LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPR-EFNIF 244
            Y+ L TRPDL+  V + SRY  +  +  ++ +K++LRYL+GTI   L + K    E  I 
Sbjct: 1193 YIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKII 1252

Query: 245  GYSDSDLAGDLDRRKSTSGMTFYLNE-SLVSWNSQKQKTVALSSCEAEFIAATTAACQAL 304
            GY DSD AG    RKST+G  F + + +L+ WN+++Q +VA SS EAE++A   A  +AL
Sbjct: 1253 GYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREAL 1312

Query: 305  WLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIV 362
            WL+ L++ I      P+ ++ DN+  I++  NP  H R KHID  +HF RE V+N  I +
Sbjct: 1313 WLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICL 1372

BLAST of CSPI07G01990 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 125.2 bits (313), Expect = 1.5e-27
Identity = 66/183 (36.07%), Positives = 104/183 (56.83%), Query Frame = 1

Query: 89  IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSI 148
           I+++     + L Q  YA++IL+  GM DC     P+  K      T + P D + +RSI
Sbjct: 45  IQIKTHPSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSI 104

Query: 149 FGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREF 208
            G L+YL  TRPD+SY V +  + M  PT   + ++K++LRY++GTI  GL Y     + 
Sbjct: 105 VGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGL-YIHKNSKL 164

Query: 209 NIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQ 268
           N+  + DSD AG    R+ST+G   +L  +++SW++++Q TV+ SS E E+ A    A +
Sbjct: 165 NVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAE 224

Query: 269 ALW 272
             W
Sbjct: 225 LTW 225

BLAST of CSPI07G01990 vs. Swiss-Prot
Match: YCH4_YEAST (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY5A PE=5 SV=2)

HSP 1 Score: 120.2 bits (300), Expect = 4.9e-26
Identity = 87/288 (30.21%), Positives = 130/288 (45.14%), Query Frame = 1

Query: 8   IYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAIYTR 67
           IYV  P GF        V+ L   +YGL+QAP  WN  ++ +LK +GF +   E  +Y R
Sbjct: 16  IYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKKIGFCRHEGEHGLYFR 75

Query: 68  REKEEYVLVGVYVDDLIVTG-------RIEVEQQK-----------------------GR 127
              +  + + VYVDDL+V         R++ E  K                       G 
Sbjct: 76  STSDGPIYIAVYVDDLLVAAPSPKIYDRVKQELTKLYSMKDLGKVDKFLGLNIHQSSNGD 135

Query: 128 IMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLN 187
           I L    Y  +  S+  +     T+ P+     L + T     D T Y+SI G L +  N
Sbjct: 136 ITLSLQDYIAKAASESEINTFKLTQTPLCNSKPLFETTSPHLKDITPYQSIVGQLLFCAN 195

Query: 188 T-RPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDS 247
           T RPD+SY V + SR++  P  +H +  +++LRYL  T    L Y  G  +  +  Y D+
Sbjct: 196 TGRPDISYPVSLLSRFLREPRAIHLESARRVLRYLYTTRSMCLKYRSG-SQLALTVYCDA 255

Query: 248 DLAGDLDRRKSTSGMTFYLNESLVSWNSQKQK-TVALSSCEAEFIAAT 264
                 D   ST G    L  + V+W+S+K K  + + S EAE+I A+
Sbjct: 256 SHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGVIPVPSTEAEYITAS 302

BLAST of CSPI07G01990 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 53.5 bits (127), Expect = 5.6e-06
Identity = 26/86 (30.23%), Positives = 48/86 (55.81%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + +IY+ HP      +++  V +L+ ALYGL+Q+P+ WN  L + L  +G +  +    +
Sbjct: 1394 EEEIYIPHP------HDRRCVVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGL 1453

Query: 65   YTRREKEEYVLVGVYVDDLIVTGRIE 91
            Y   +K   +++ VYVDD ++    E
Sbjct: 1454 YQTEDKN--LMIAVYVDDCVIAASNE 1471

BLAST of CSPI07G01990 vs. TrEMBL
Match: B8BDZ6_ORYSI (Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_30754 PE=4 SV=1)

HSP 1 Score: 478.4 bits (1230), Expect = 7.9e-132
Identity = 242/394 (61.42%), Positives = 289/394 (73.35%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+
Sbjct: 1034 EEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQAPRAWNTRLDKCLKELGFARCTQEQAV 1093

Query: 65   YTRREKEEYVLVGVYVDDLIVTG-----------------------------RIEVEQQK 124
            YTR + +  V+VGVYVDDLIVTG                              IEV Q +
Sbjct: 1094 YTRGKGQAGVIVGVYVDDLIVTGENPQEIAMFKQQMMGEFEMSDLGLLSYYLGIEVIQGE 1153

Query: 125  GRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYL 184
              I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYL
Sbjct: 1154 NGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPRSLLHKDADGNPIDATEYRRVIGCLRYL 1213

Query: 185  LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSD 244
            L+TRPDLSY VG+ASR+MERPTTMH K VK ILRYL+GT+  GL +  G    +I G++D
Sbjct: 1214 LHTRPDLSYAVGVASRFMERPTTMHLKAVKMILRYLKGTLDSGLVFASGSGSLDITGFTD 1273

Query: 245  SDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL 304
            SDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Sbjct: 1274 SDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQKTVALSSCEAEFMAATAAACHALWLRAL 1333

Query: 305  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNT 364
            +SE++G E +PV LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V++
Sbjct: 1334 LSEMMGTEAKPVKLFVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGQILIEFVSS 1393

Query: 365  GEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN 370
             EQRAD +TK L   KLA  R LLGVR L   Q+
Sbjct: 1394 EEQRADAMTKGLPAAKLATARHLLGVRDLRPRQD 1427

BLAST of CSPI07G01990 vs. TrEMBL
Match: Q10RM4_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=LOC_Os03g05850 PE=4 SV=1)

HSP 1 Score: 477.6 bits (1228), Expect = 1.3e-131
Identity = 240/394 (60.91%), Positives = 290/394 (73.60%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV  PEGF    ++H V +L  ALYGLRQAP+AWNIRLDRSL++LGF +CTQEQA+
Sbjct: 1016 EEEVYVAQPEGFARSGKEHLVLKLHKALYGLRQAPRAWNIRLDRSLRELGFDRCTQEQAV 1075

Query: 65   YTRREKEEYVLVGVYVDDLIVT---------------GR--------------IEVEQQK 124
            YTR    + ++VGVYVDDLIVT               G               IEV+Q +
Sbjct: 1076 YTRGRGSDGIIVGVYVDDLIVTGENPSELKVFKEQMMGEFEMSDLGLLTYYLGIEVDQDE 1135

Query: 125  GRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYL 184
                LKQ  YAK++LSQFGM +CN+   P++P++QL KD E  P+DAT+YR I G LRYL
Sbjct: 1136 SATTLKQTAYAKKLLSQFGMMECNSVSIPIDPRSQLSKDPEGHPVDATEYRRIIGSLRYL 1195

Query: 185  LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSD 244
            L+TRPDLSY VG+ASR+MERPT MH+K VKQILRY++GT+ +GL Y  G     I GY+D
Sbjct: 1196 LHTRPDLSYAVGVASRFMERPTVMHFKAVKQILRYIKGTMDYGLVYAAGTGALKITGYTD 1255

Query: 245  SDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL 304
            SDLAGDLD R+ST GM FY+N+SLV+W+SQKQKTVALSSCEAEF+AATTAACQALWLR L
Sbjct: 1256 SDLAGDLDDRRSTGGMAFYINQSLVAWSSQKQKTVALSSCEAEFMAATTAACQALWLRLL 1315

Query: 305  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNT 364
            ++E+ G+E + V LFVDN+SAIALMKNPVFHGR+KHIDT +HFIRECV  GQI+VE V T
Sbjct: 1316 LAEVAGVEEKAVKLFVDNRSAIALMKNPVFHGRSKHIDTRYHFIRECVDGGQIVVEFVRT 1375

Query: 365  GEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN 370
             EQRAD LTK L   KL   R LLGVR L S QN
Sbjct: 1376 EEQRADALTKGLPAAKLVTARHLLGVRSLGSRQN 1409

BLAST of CSPI07G01990 vs. TrEMBL
Match: Q0J8A6_ORYSJ (Os08g0125300 protein OS=Oryza sativa subsp. japonica GN=Os08g0125300 PE=2 SV=1)

HSP 1 Score: 474.6 bits (1220), Expect = 1.1e-130
Identity = 241/394 (61.17%), Positives = 287/394 (72.84%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+
Sbjct: 1034 EEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQAPRAWNTRLDKCLKELGFARCTQEQAV 1093

Query: 65   YTRREKEEYVLVGVYVDDLIVTG-----------------------------RIEVEQQK 124
            YTR + +  V+VGVYVDDLIVTG                              IEV Q +
Sbjct: 1094 YTRGKGQAGVIVGVYVDDLIVTGENPHEIAMFKQQMMGEFEMSDLGLLSYYLGIEVIQGE 1153

Query: 125  GRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYL 184
              I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYL
Sbjct: 1154 NGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPRSLLHKDADGNPIDATEYRRVIGCLRYL 1213

Query: 185  LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSD 244
            L+TRPDLSY VG+ASR+MERPTTMH K VK ILRYL+GT+  GL +  G    +I G++D
Sbjct: 1214 LHTRPDLSYAVGVASRFMERPTTMHLKAVKMILRYLKGTLDSGLVFASGSGSLDITGFTD 1273

Query: 245  SDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL 304
            SDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Sbjct: 1274 SDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQKTVALSSCEAEFMAATAAACHALWLRAL 1333

Query: 305  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNT 364
            +SE++G E + V LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V +
Sbjct: 1334 LSEMMGTEAKRVKLFVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGQILIEFVRS 1393

Query: 365  GEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN 370
             EQRAD +TK L   KLA  R LLGVR L   Q+
Sbjct: 1394 EEQRADAMTKGLPAAKLATARHLLGVRDLRPRQD 1427

BLAST of CSPI07G01990 vs. TrEMBL
Match: Q84SW8_ORYSJ (Gag-pol polyprotein OS=Oryza sativa subsp. japonica GN=LOC_Os03g47410 PE=4 SV=1)

HSP 1 Score: 430.3 bits (1105), Expect = 2.5e-117
Identity = 218/358 (60.89%), Positives = 268/358 (74.86%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV+ PEGF    ++H VY+LS ALYGLRQAP+AWN RLDRS+K+LGF +C QEQA+
Sbjct: 989  EEEVYVSQPEGFVEKGKEHLVYKLSKALYGLRQAPRAWNTRLDRSMKELGFSRCAQEQAV 1048

Query: 65   YTRREKEEYVLVGVYVDDLIVTGRIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYP 124
            YTR      ++VGVYVDDLIVTG    E        KQ     +++ +F M+D     Y 
Sbjct: 1049 YTRGTGSTGIIVGVYVDDLIVTG----ESPSDITAFKQ-----QMMGEFEMSDLGLLTYY 1108

Query: 125  MEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVV 184
            +    +LHKD + + +D T+YR + GCLRYLL+TRPDLSY VG+ASR+MERPT MH+K V
Sbjct: 1109 LG--IELHKDAQGSTVDPTEYRRVIGCLRYLLHTRPDLSYAVGVASRFMERPTVMHFKAV 1168

Query: 185  KQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNS 244
            KQILRYL+GTI+ GL ++ G     I G++DSDLAGD D R+STSGM FY N SLVSW+S
Sbjct: 1169 KQILRYLKGTINCGLMFSGGNGAVEITGFTDSDLAGDSDDRRSTSGMAFYFNGSLVSWSS 1228

Query: 245  QKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPV 304
            QKQKTVALSSCEAEF+AAT AAC ALWLR L+ E++G E RPV L+VDNKSAIALMKNPV
Sbjct: 1229 QKQKTVALSSCEAEFMAATAAACHALWLRGLLIEMIGAEARPVKLYVDNKSAIALMKNPV 1288

Query: 305  FHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVR 363
            FHGR+KHIDT +HFIRECV++G+I +E V   EQRAD LTK L   +LA  R LLGVR
Sbjct: 1289 FHGRSKHIDTRYHFIRECVESGKIQIEFVRIEEQRADALTKGLPAARLATARHLLGVR 1335

BLAST of CSPI07G01990 vs. TrEMBL
Match: Q84MV7_ORYSJ (Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica GN=Os03g28110 PE=4 SV=1)

HSP 1 Score: 419.9 bits (1078), Expect = 3.3e-114
Identity = 216/369 (58.54%), Positives = 263/369 (71.27%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV   EGF    ++H V +LS ALYGLRQAP+AWNI LD+SLK+LGF +C QEQA+
Sbjct: 919  EEEVYVAQLEGFIKKGQEHLVLKLSKALYGLRQAPRAWNICLDKSLKELGFMRCKQEQAV 978

Query: 65   YTRREKEEYVLVGVYVDDLIVTGRIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKY- 124
            YTR    E ++VGVYVDDLIVTG     +   R+      + K+++ +F M D     Y 
Sbjct: 979  YTRGRGAEAMIVGVYVDDLIVTG-----ENPARV----EAFKKQMMGEFEMNDLGLLSYY 1038

Query: 125  --------PMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER 184
                    P++PK QL KD +  P+DA +YR + GCLRYLL+TRPDLSY VG+ASR+ME 
Sbjct: 1039 LGIEVGQVPVDPKTQLQKDADGHPVDAIEYRRVIGCLRYLLHTRPDLSYAVGVASRFMEH 1098

Query: 185  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYL 244
            PT  H K VKQILRYL+GTI  GL YT G  E  I GY+DSDLAGD+D R+ST GM FY+
Sbjct: 1099 PTMTHLKAVKQILRYLKGTIDCGLVYTAGIGEITITGYTDSDLAGDVDDRRSTGGMAFYI 1158

Query: 245  NESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKS 304
            N SLV+W+SQKQKTV LSSCEAEF+AAT AAC ALWLR L+ E++G E + V LFVDNKS
Sbjct: 1159 NNSLVAWSSQKQKTVTLSSCEAEFMAATAAACHALWLRALLGELLGEEAKLVKLFVDNKS 1218

Query: 305  AIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAM 364
            AIALMKNPVF GR+KHIDT + FIREC++  QI+VE V + EQRAD LTK L   KL   
Sbjct: 1219 AIALMKNPVFRGRSKHIDTRYQFIRECIERRQILVEFVRSEEQRADALTKGLPAAKLVTA 1278

BLAST of CSPI07G01990 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 167.9 bits (424), Expect = 1.2e-41
Identity = 104/348 (29.89%), Positives = 166/348 (47.70%), Query Frame = 1

Query: 7   KIYVTHPEGFEVPN----EKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQ 66
           +IY+  P G+          + V  L  ++YGL+QA + W ++   +L   GF +   + 
Sbjct: 207 EIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDH 266

Query: 67  AIYTRREKEEYVLVGVYVDDLIVTG-----------------------------RIEVEQ 126
             + +     ++ V VYVDD+I+                                +E+ +
Sbjct: 267 TYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIAR 326

Query: 127 QKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLR 186
               I + Q  YA  +L + G+  C  +  PM+P       +    +DA  YR + G L 
Sbjct: 327 SAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLM 386

Query: 187 YLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGY 246
           YL  TR D+S+ V   S++ E P   H + V +IL Y++GT+  GL Y+    E  +  +
Sbjct: 387 YLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSS-QAEMQLQVF 446

Query: 247 SDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLR 306
           SD+      D R+ST+G   +L  SL+SW S+KQ+ V+ SS EAE+ A + A  + +WL 
Sbjct: 447 SDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLA 506

Query: 307 CLVSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRE 322
               E+     +P  LF DN +AI +  N VFH R KHI++  H +RE
Sbjct: 507 QFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of CSPI07G01990 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 125.2 bits (313), Expect = 8.6e-29
Identity = 66/183 (36.07%), Positives = 104/183 (56.83%), Query Frame = 1

Query: 89  IEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSI 148
           I+++     + L Q  YA++IL+  GM DC     P+  K      T + P D + +RSI
Sbjct: 45  IQIKTHPSGLFLSQTKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP-DPSDFRSI 104

Query: 149 FGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREF 208
            G L+YL  TRPD+SY V +  + M  PT   + ++K++LRY++GTI  GL Y     + 
Sbjct: 105 VGALQYLTLTRPDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGL-YIHKNSKL 164

Query: 209 NIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQ 268
           N+  + DSD AG    R+ST+G   +L  +++SW++++Q TV+ SS E E+ A    A +
Sbjct: 165 NVQAFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAE 224

Query: 269 ALW 272
             W
Sbjct: 225 LTW 225

BLAST of CSPI07G01990 vs. NCBI nr
Match: gi|218201855|gb|EEC84282.1| (hypothetical protein OsI_30754 [Oryza sativa Indica Group])

HSP 1 Score: 478.4 bits (1230), Expect = 1.1e-131
Identity = 242/394 (61.42%), Positives = 289/394 (73.35%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+
Sbjct: 1034 EEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQAPRAWNTRLDKCLKELGFARCTQEQAV 1093

Query: 65   YTRREKEEYVLVGVYVDDLIVTG-----------------------------RIEVEQQK 124
            YTR + +  V+VGVYVDDLIVTG                              IEV Q +
Sbjct: 1094 YTRGKGQAGVIVGVYVDDLIVTGENPQEIAMFKQQMMGEFEMSDLGLLSYYLGIEVIQGE 1153

Query: 125  GRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYL 184
              I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYL
Sbjct: 1154 NGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPRSLLHKDADGNPIDATEYRRVIGCLRYL 1213

Query: 185  LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSD 244
            L+TRPDLSY VG+ASR+MERPTTMH K VK ILRYL+GT+  GL +  G    +I G++D
Sbjct: 1214 LHTRPDLSYAVGVASRFMERPTTMHLKAVKMILRYLKGTLDSGLVFASGSGSLDITGFTD 1273

Query: 245  SDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL 304
            SDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Sbjct: 1274 SDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQKTVALSSCEAEFMAATAAACHALWLRAL 1333

Query: 305  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNT 364
            +SE++G E +PV LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V++
Sbjct: 1334 LSEMMGTEAKPVKLFVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGQILIEFVSS 1393

Query: 365  GEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN 370
             EQRAD +TK L   KLA  R LLGVR L   Q+
Sbjct: 1394 EEQRADAMTKGLPAAKLATARHLLGVRDLRPRQD 1427

BLAST of CSPI07G01990 vs. NCBI nr
Match: gi|108706239|gb|ABF94034.1| (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 477.6 bits (1228), Expect = 1.9e-131
Identity = 240/394 (60.91%), Positives = 290/394 (73.60%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV  PEGF    ++H V +L  ALYGLRQAP+AWNIRLDRSL++LGF +CTQEQA+
Sbjct: 1016 EEEVYVAQPEGFARSGKEHLVLKLHKALYGLRQAPRAWNIRLDRSLRELGFDRCTQEQAV 1075

Query: 65   YTRREKEEYVLVGVYVDDLIVT---------------GR--------------IEVEQQK 124
            YTR    + ++VGVYVDDLIVT               G               IEV+Q +
Sbjct: 1076 YTRGRGSDGIIVGVYVDDLIVTGENPSELKVFKEQMMGEFEMSDLGLLTYYLGIEVDQDE 1135

Query: 125  GRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYL 184
                LKQ  YAK++LSQFGM +CN+   P++P++QL KD E  P+DAT+YR I G LRYL
Sbjct: 1136 SATTLKQTAYAKKLLSQFGMMECNSVSIPIDPRSQLSKDPEGHPVDATEYRRIIGSLRYL 1195

Query: 185  LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSD 244
            L+TRPDLSY VG+ASR+MERPT MH+K VKQILRY++GT+ +GL Y  G     I GY+D
Sbjct: 1196 LHTRPDLSYAVGVASRFMERPTVMHFKAVKQILRYIKGTMDYGLVYAAGTGALKITGYTD 1255

Query: 245  SDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL 304
            SDLAGDLD R+ST GM FY+N+SLV+W+SQKQKTVALSSCEAEF+AATTAACQALWLR L
Sbjct: 1256 SDLAGDLDDRRSTGGMAFYINQSLVAWSSQKQKTVALSSCEAEFMAATTAACQALWLRLL 1315

Query: 305  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNT 364
            ++E+ G+E + V LFVDN+SAIALMKNPVFHGR+KHIDT +HFIRECV  GQI+VE V T
Sbjct: 1316 LAEVAGVEEKAVKLFVDNRSAIALMKNPVFHGRSKHIDTRYHFIRECVDGGQIVVEFVRT 1375

Query: 365  GEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN 370
             EQRAD LTK L   KL   R LLGVR L S QN
Sbjct: 1376 EEQRADALTKGLPAAKLVTARHLLGVRSLGSRQN 1409

BLAST of CSPI07G01990 vs. NCBI nr
Match: gi|113622864|dbj|BAF22809.1| (Os08g0125300 [Oryza sativa Japonica Group])

HSP 1 Score: 474.6 bits (1220), Expect = 1.6e-130
Identity = 241/394 (61.17%), Positives = 287/394 (72.84%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV  PEGF    E+H V RLS ALYGLRQAP+AWN RLD+ LK+LGF +CTQEQA+
Sbjct: 1034 EEEVYVAQPEGFVKRGEEHLVLRLSKALYGLRQAPRAWNTRLDKCLKELGFARCTQEQAV 1093

Query: 65   YTRREKEEYVLVGVYVDDLIVTG-----------------------------RIEVEQQK 124
            YTR + +  V+VGVYVDDLIVTG                              IEV Q +
Sbjct: 1094 YTRGKGQAGVIVGVYVDDLIVTGENPHEIAMFKQQMMGEFEMSDLGLLSYYLGIEVIQGE 1153

Query: 125  GRIMLKQPTYAKRILSQFGMADCNATKYPMEPKAQLHKDTEEAPIDATKYRSIFGCLRYL 184
              I +KQ  YAK+ILSQFGM  CN T  PMEP++ LHKD +  PIDAT+YR + GCLRYL
Sbjct: 1154 NGIAIKQAAYAKKILSQFGMQGCNPTSIPMEPRSLLHKDADGNPIDATEYRRVIGCLRYL 1213

Query: 185  LNTRPDLSYVVGMASRYMERPTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSD 244
            L+TRPDLSY VG+ASR+MERPTTMH K VK ILRYL+GT+  GL +  G    +I G++D
Sbjct: 1214 LHTRPDLSYAVGVASRFMERPTTMHLKAVKMILRYLKGTLDSGLVFASGSGSLDITGFTD 1273

Query: 245  SDLAGDLDRRKSTSGMTFYLNESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCL 304
            SDLAGD+D R+ST GM FY+N SLVSW SQKQKTVALSSCEAEF+AAT AAC ALWLR L
Sbjct: 1274 SDLAGDMDDRRSTGGMAFYVNSSLVSWCSQKQKTVALSSCEAEFMAATAAACHALWLRAL 1333

Query: 305  VSEIVGMEPRPVTLFVDNKSAIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNT 364
            +SE++G E + V LFVDNKSAIALMKNPVFHGR+KHIDT +HFIRECV++GQI++E V +
Sbjct: 1334 LSEMMGTEAKRVKLFVDNKSAIALMKNPVFHGRSKHIDTRYHFIRECVESGQILIEFVRS 1393

Query: 365  GEQRADVLTKALTGVKLAAMRQLLGVRKLESCQN 370
             EQRAD +TK L   KLA  R LLGVR L   Q+
Sbjct: 1394 EEQRADAMTKGLPAAKLATARHLLGVRDLRPRQD 1427

BLAST of CSPI07G01990 vs. NCBI nr
Match: gi|29150404|gb|AAO72413.1| (gag-pol polyprotein [Oryza sativa Japonica Group])

HSP 1 Score: 430.3 bits (1105), Expect = 3.5e-117
Identity = 218/358 (60.89%), Positives = 268/358 (74.86%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV+ PEGF    ++H VY+LS ALYGLRQAP+AWN RLDRS+K+LGF +C QEQA+
Sbjct: 989  EEEVYVSQPEGFVEKGKEHLVYKLSKALYGLRQAPRAWNTRLDRSMKELGFSRCAQEQAV 1048

Query: 65   YTRREKEEYVLVGVYVDDLIVTGRIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKYP 124
            YTR      ++VGVYVDDLIVTG    E        KQ     +++ +F M+D     Y 
Sbjct: 1049 YTRGTGSTGIIVGVYVDDLIVTG----ESPSDITAFKQ-----QMMGEFEMSDLGLLTYY 1108

Query: 125  MEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMERPTTMHYKVV 184
            +    +LHKD + + +D T+YR + GCLRYLL+TRPDLSY VG+ASR+MERPT MH+K V
Sbjct: 1109 LG--IELHKDAQGSTVDPTEYRRVIGCLRYLLHTRPDLSYAVGVASRFMERPTVMHFKAV 1168

Query: 185  KQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYLNESLVSWNS 244
            KQILRYL+GTI+ GL ++ G     I G++DSDLAGD D R+STSGM FY N SLVSW+S
Sbjct: 1169 KQILRYLKGTINCGLMFSGGNGAVEITGFTDSDLAGDSDDRRSTSGMAFYFNGSLVSWSS 1228

Query: 245  QKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKSAIALMKNPV 304
            QKQKTVALSSCEAEF+AAT AAC ALWLR L+ E++G E RPV L+VDNKSAIALMKNPV
Sbjct: 1229 QKQKTVALSSCEAEFMAATAAACHALWLRGLLIEMIGAEARPVKLYVDNKSAIALMKNPV 1288

Query: 305  FHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAMRQLLGVR 363
            FHGR+KHIDT +HFIRECV++G+I +E V   EQRAD LTK L   +LA  R LLGVR
Sbjct: 1289 FHGRSKHIDTRYHFIRECVESGKIQIEFVRIEEQRADALTKGLPAARLATARHLLGVR 1335

BLAST of CSPI07G01990 vs. NCBI nr
Match: gi|30017513|gb|AAP12935.1| (retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group])

HSP 1 Score: 419.9 bits (1078), Expect = 4.8e-114
Identity = 216/369 (58.54%), Positives = 263/369 (71.27%), Query Frame = 1

Query: 5    KRKIYVTHPEGFEVPNEKHKVYRLSNALYGLRQAPQAWNIRLDRSLKDLGFRKCTQEQAI 64
            + ++YV   EGF    ++H V +LS ALYGLRQAP+AWNI LD+SLK+LGF +C QEQA+
Sbjct: 919  EEEVYVAQLEGFIKKGQEHLVLKLSKALYGLRQAPRAWNICLDKSLKELGFMRCKQEQAV 978

Query: 65   YTRREKEEYVLVGVYVDDLIVTGRIEVEQQKGRIMLKQPTYAKRILSQFGMADCNATKY- 124
            YTR    E ++VGVYVDDLIVTG     +   R+      + K+++ +F M D     Y 
Sbjct: 979  YTRGRGAEAMIVGVYVDDLIVTG-----ENPARV----EAFKKQMMGEFEMNDLGLLSYY 1038

Query: 125  --------PMEPKAQLHKDTEEAPIDATKYRSIFGCLRYLLNTRPDLSYVVGMASRYMER 184
                    P++PK QL KD +  P+DA +YR + GCLRYLL+TRPDLSY VG+ASR+ME 
Sbjct: 1039 LGIEVGQVPVDPKTQLQKDADGHPVDAIEYRRVIGCLRYLLHTRPDLSYAVGVASRFMEH 1098

Query: 185  PTTMHYKVVKQILRYLRGTIHFGLTYTKGPREFNIFGYSDSDLAGDLDRRKSTSGMTFYL 244
            PT  H K VKQILRYL+GTI  GL YT G  E  I GY+DSDLAGD+D R+ST GM FY+
Sbjct: 1099 PTMTHLKAVKQILRYLKGTIDCGLVYTAGIGEITITGYTDSDLAGDVDDRRSTGGMAFYI 1158

Query: 245  NESLVSWNSQKQKTVALSSCEAEFIAATTAACQALWLRCLVSEIVGMEPRPVTLFVDNKS 304
            N SLV+W+SQKQKTV LSSCEAEF+AAT AAC ALWLR L+ E++G E + V LFVDNKS
Sbjct: 1159 NNSLVAWSSQKQKTVTLSSCEAEFMAATAAACHALWLRALLGELLGEEAKLVKLFVDNKS 1218

Query: 305  AIALMKNPVFHGRNKHIDTCFHFIRECVKNGQIIVEVVNTGEQRADVLTKALTGVKLAAM 364
            AIALMKNPVF GR+KHIDT + FIREC++  QI+VE V + EQRAD LTK L   KL   
Sbjct: 1219 AIALMKNPVFRGRSKHIDTRYQFIRECIERRQILVEFVRSEEQRADALTKGLPAAKLVTA 1278

The following BLAST results are available for this feature:

Swiss-Prot
Match Name    E-value   Identity (%)  Description
POLX_TOBAC    1.4e-60   33.84         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
COPIA_DROME   1.1e-54   34.78         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
M810_ARATH    1.5e-27   36.07         Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
YCH4_YEAST    4.9e-26   30.21         Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...
YH41B_YEAST   5.6e-06   30.23         Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20...

TrEMBL
Match Name    E-value   Identity (%)  Description
B8BDZ6_ORYSI  7.9e-132  61.42         Putative uncharacterized protein OS=Oryza sativa subsp. indica GN=OsI_30754 PE=4...
Q10RM4_ORYSJ  1.3e-131  60.91         Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ...
Q0J8A6_ORYSJ  1.1e-130  61.17         Os08g0125300 protein OS=Oryza sativa subsp. japonica GN=Os08g0125300 PE=2 SV=1
Q84SW8_ORYSJ  2.5e-117  60.89         Gag-pol polyprotein OS=Oryza sativa subsp. japonica GN=LOC_Os03g47410 PE=4 SV=1
Q84MV7_ORYSJ  3.3e-114  58.54         Retrotransposon protein, putative, unclassified OS=Oryza sativa subsp. japonica ...

TAIR10
Match Name    E-value   Identity (%)  Description
AT4G23160.1   1.2e-41   29.89         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1   8.6e-29   36.07         ATMG00810.1 DNA/RNA polymerases superfamily protein

NCBI nr
Match Name                    E-value   Identity (%)  Description
gi|218201855|gb|EEC84282.1|   1.1e-131  61.42         hypothetical protein OsI_30754 [Oryza sativa Indica Group]
gi|108706239|gb|ABF94034.1|   1.9e-131  60.91         retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]
gi|113622864|dbj|BAF22809.1|  1.6e-130  61.17         Os08g0125300 [Oryza sativa Japonica Group]
gi|29150404|gb|AAO72413.1|    3.5e-117  60.89         gag-pol polyprotein [Oryza sativa Japonica Group]
gi|30017513|gb|AAP12935.1|    4.8e-114  58.54         retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]
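Summary rows like these are typically filtered by an E-value cutoff before further use. The following sketch uses the Swiss-Prot hits listed for this feature; the cutoff of 1e-10 and the `filter_hits` helper are illustrative assumptions, not values prescribed by the database.

```python
# (match name, E-value, percent identity) for the Swiss-Prot hits of this feature.
swissprot_hits = [
    ("POLX_TOBAC", 1.4e-60, 33.84),
    ("COPIA_DROME", 1.1e-54, 34.78),
    ("M810_ARATH", 1.5e-27, 36.07),
    ("YCH4_YEAST", 4.9e-26, 30.21),
    ("YH41B_YEAST", 5.6e-06, 30.23),
]

def filter_hits(hits, max_evalue):
    """Keep hits at or below the E-value cutoff, most significant first."""
    kept = [hit for hit in hits if hit[1] <= max_evalue]
    return sorted(kept, key=lambda hit: hit[1])

# Hypothetical cutoff chosen only for illustration.
strong = filter_hits(swissprot_hits, max_evalue=1e-10)
print([name for name, _, _ in strong])
# ['POLX_TOBAC', 'COPIA_DROME', 'M810_ARATH', 'YCH4_YEAST']
```

Note that the highest percent identity (M810_ARATH) does not belong to the most significant hit, because the E-value also depends on alignment length and score.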
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR013103  RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0009536      plastid
molecular_function  GO:0046872      metal ion binding

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI07G01990.1  CSPI07G01990.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term       Source Description               Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727           RVT_2                            coord: 89..126, score: 6.8E-6; coord: 5..87, score: 2.1
None       No IPR available                                     PANTHER  PTHR11439         GAG-POL-RELATED RETROTRANSPOSON  coord: 8..286, score: 5.5E
None       No IPR available                                     PANTHER  PTHR11439:SF127   SUBFAMILY NOT NAMED              coord: 8..286, score: 5.5E
None       No IPR available                                     unknown  SSF56672          DNA/RNA polymerases              coord: 16..317, score: 8.45
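The alignment coordinates in the InterPro annotations can be converted to match lengths on the protein. This is a minimal sketch, assuming the coordinates are 1-based and inclusive; the `coord_lengths` helper and its regular expression are illustrative.

```python
import re

COORD_RE = re.compile(r"coord:\s*(\d+)\.\.(\d+)")

def coord_lengths(text):
    """Return (start, end, length) for every 'coord: a..b' entry in the text."""
    spans = []
    for start, end in COORD_RE.findall(text):
        start, end = int(start), int(end)
        # Assumption: 1-based inclusive protein coordinates.
        spans.append((start, end, end - start + 1))
    return spans

annotations = """
PF07727    coord: 89..126  coord: 5..87
PTHR11439  coord: 8..286
SSF56672   coord: 16..317
"""
for start, end, length in coord_lengths(annotations):
    print(f"{start}..{end}: {length} residues")
```

The two PFAM segments (5..87 and 89..126) sit on either side of residue 88, which may reflect a single reverse-transcriptase domain reported in two pieces.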

The following gene(s) are orthologous to this gene:
Gene          Orthologue      Organism                    Block
CSPI07G01990  CsaV3_7G018740  Cucumber (Chinese Long) v3  cpicucB416
The following gene(s) are paralogous to this gene:

None