CSPI02G09070 (gene) Wild cucumber (PI 183967)

NameCSPI02G09070
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr2 : 8766371 .. 8767751 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGTTGCTAGATTAGAAGCTATTAGAATGTTGCTTGCTTTTGCTTCTTATAAAAATTTCATTTTGTATCAAATGGATGTAAAAAAGTGCTTTTTTAAACGGTTATATTGTGGAGGAAGTTTATGTAGAACAACCACCGGGCTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGGAAAAGACTCTTTATGCCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATATTGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATACTAGTGCCATAAATTTGACTAAGAATCCTATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTGTAAGTTCCAATAATCAATTAGCGGATATATTTACCAAACCTTTGAGTGAAGAAAGCTTTTGTAAAAATAGGCTTGAGCTTGGAATTATTCGTTGTGATGCATCTTGA

mRNA sequence

ATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGAAGTTTATGTAGAACAACCACCGGGCTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGGAAAAGACTCTTTATGCCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATATTGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATACTAGTGCCATAAATTTGACTAAGAATCCTATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTGTAAGTTCCAATAATCAATTAGCGGATATATTTACCAAACCTTTGAGTGAAGAAAGCTTTTGTAAAAATAGGCTTGAGCTTGGAATTATTCGTTGTGATGCATCTTGA

Coding sequence (CDS)

ATGGATGAAAATGGAAATATCATTAGAAATAAAGCTAGACTTGTAGCTCAAGGTTATTGTCAAGAAGAAGGTATAGATTATGAAGAGACTTTTGCACCGGAAGTTTATGTAGAACAACCACCGGGCTTTGAAAGTTTTGATTTACCTAATCATGTTTATAAGTTGGAAAAGACTCTTTATGCCTTAAAACAAGCTCCAAGAGCTTGGTATGATAGACTTAGCAAGTTTTTACTTGAGAATGACTTTAAGATGGGAAAAATTGATAATACTCTATTTATTAAAGTTAAAAATAATGACATGCTTATAGTACAAATTTATGTGGATGATATTATATTTGGTTCTACTAATTCATCTTTGTGTGAAGAATTTTCCAAGTGTATGCATAATGAGTTTGAGATGAGTATGATGGGAGAACTTAGTTTCTTCCTTGGTCTTCAAATCAAACAACTCAAGGATGGCATCTTCATAAGTCAAGAAAAATACACAAGGGATTTGCTCAAGAAATTCAAATTAAATGAAGGTAAAGTTGCAAAAACTCCTATGAGCACTACCACTAAGCTTGACAAAGATGAAAAAGGTAAGTGTGTGGATATAAAGACTTATCGAGGTATGATCGGATCTTTACTTTATTTGACCGCTAGTAGACCCGATATCATGTTTAGTGTATGTCTTTGTGCTAGATTTCAATCTTGTCCTAAAGAATCACATTTCCATGCCGTTAAAAGGATACTTAAATATTTGCTTGGAACTATTGATGTTGGATTATATTGGTATCCTAGAAATGTTGAGTTTAATTTGGTAGGATATTCCGATGCGGACTTTGCCGGTAGTTTACTTGACCGTAAAAGTACTAGTGGGACTTGTCAATTTCTTGGTAGTTCCTTAGTATCTTGGTTTAGTAAAAAGCAAAATTCGGTTGCCTTATCCACTACCGAAGCGGAATATATTGCGGTTGCTAGTTGTTGTGCACAAATTCTTTGGATGAAACAAACTCTTTGTGATTTTGGATTAAAATTTGATAATGTGCCTATATTTTGTGATAATACTAGTGCCATAAATTTGACTAAGAATCCTATTCATCATTCTAGAACTAAGCATATAGATATTAGGCATCACTTTATTAGAGAGCATGTACAAAATGGTCATATTACTCTTGAGTTTGTAAGTTCCAATAATCAATTAGCGGATATATTTACCAAACCTTTGAGTGAAGAAAGCTTTTGTAAAAATAGGCTTGAGCTTGGAATTATTCGTTGTGATGCATCTTGA
BLAST of CSPI02G09070 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 276.2 bits (705), Expect = 6.1e-73
Identity = 166/466 (35.62%), Postives = 260/466 (55.79%), Query Frame = 1

Query: 2    DENGNIIRNKARLVAQGYCQEEGIDYEETFAP---------------------------- 61
            +E GN IR KARLVA+G+ Q+  IDYEETFAP                            
Sbjct: 946  NELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKT 1005

Query: 62   ---------EVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFKM 121
                     E+Y+  P G       ++V KL K +Y LKQA R W++   + L E +F  
Sbjct: 1006 AFLNGTLKEEIYMRLPQGISCNS--DNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVN 1065

Query: 122  GKIDNTLFIKVKNN--DMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFF 181
              +D  ++I  K N  + + V +YVDD++  + + +    F + +  +F M+ + E+  F
Sbjct: 1066 SSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHF 1125

Query: 182  LGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKT-Y 241
            +G++I+  +D I++SQ  Y + +L KF +       TP+ +  K++ +      D  T  
Sbjct: 1126 IGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPS--KINYELLNSDEDCNTPC 1185

Query: 242  RGMIGSLLYLT-ASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPR 301
            R +IG L+Y+   +RPD+  +V + +R+ S      +  +KR+L+YL GTID+ L  + +
Sbjct: 1186 RSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLI-FKK 1245

Query: 302  NVEFN--LVGYSDADFAGSLLDRKSTSGTC-QFLGSSLVSWFSKKQNSVALSTTEAEYIA 361
            N+ F   ++GY D+D+AGS +DRKST+G   +    +L+ W +K+QNSVA S+TEAEY+A
Sbjct: 1246 NLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMA 1305

Query: 362  VASCCAQILWMKQTLCDFGLKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIRE 421
            +     + LW+K  L    +K +N + I+ DN   I++  NP  H R KHIDI++HF RE
Sbjct: 1306 LFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFARE 1365

Query: 422  HVQNGHITLEFVSSNNQLADIFTKPLSEESFCKNRLELGIIRCDAS 423
             VQN  I LE++ + NQLADIFTKPL    F + R +LG+++ D S
Sbjct: 1366 QVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQDDQS 1406

BLAST of CSPI02G09070 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 251.5 bits (641), Expect = 1.6e-65
Identity = 157/458 (34.28%), Postives = 245/458 (53.49%), Query Frame = 1

Query: 2    DENGNIIRNKARLVAQGYCQEEGIDYEETFAP---------------------------- 61
            D +  ++R KARLV +G+ Q++GID++E F+P                            
Sbjct: 866  DGDCKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKT 925

Query: 62   ---------EVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFKM 121
                     E+Y+EQP GFE     + V KL K+LY LKQAPR WY +   F+    +  
Sbjct: 926  AFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLK 985

Query: 122  GKIDNTLFIK-VKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 181
               D  ++ K    N+ +I+ +YVDD++    +  L  +    +   F+M  +G     L
Sbjct: 986  TYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQIL 1045

Query: 182  GLQIKQLKDG--IFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDK-------DEKGK 241
            G++I + +    +++SQEKY   +L++F +   K   TP++   KL K       +EKG 
Sbjct: 1046 GMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGN 1105

Query: 242  CVDIKTYRGMIGSLLY-LTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDV 301
               +  Y   +GSL+Y +  +RPDI  +V + +RF   P + H+ AVK IL+YL GT   
Sbjct: 1106 MAKVP-YSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTTGD 1165

Query: 302  GLYWYPRNVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEA 361
             L +     +  L GY+DAD AG + +RKS++G         +SW SK Q  VALSTTEA
Sbjct: 1166 CLCF--GGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEA 1225

Query: 362  EYIAVASCCAQILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHF 410
            EYIA      +++W+K+ L + GL      ++CD+ SAI+L+KN ++H+RTKHID+R+H+
Sbjct: 1226 EYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSKNSMYHARTKHIDVRYHW 1285

BLAST of CSPI02G09070 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 7.3e-34
Identity = 82/226 (36.28%), Postives = 133/226 (58.85%), Query Frame = 1

Query: 105 IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRD 164
           +YVDDI+   ++++L       + + F M  +G + +FLG+QIK    G+F+SQ KY   
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 165 LLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFS 224
           +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++
Sbjct: 65  ILN----NAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 124

Query: 225 VCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVEFNLVGYSDADFAGSLLDR 284
           V +  +    P  + F  +KR+L+Y+ GTI  GLY + +N + N+  + D+D+AG    R
Sbjct: 125 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIH-KNSKLNVQAFCDSDWAGCTSTR 184

Query: 285 KSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW 328
           +ST+G C FLG +++SW +K+Q +V+ S+TE EY A+A   A++ W
Sbjct: 185 RSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI02G09070 vs. Swiss-Prot
Match: YJ41B_YEAST (Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-J PE=3 SV=3)

HSP 1 Score: 114.0 bits (284), Expect = 4.0e-24
Identity = 101/377 (26.79%), Postives = 178/377 (47.21%), Query Frame = 1

Query: 52   VYKLEKTLYALKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDII 111
            V KL K LY LKQ+P+ W D L ++L     K       L+     N  L++ +YVDD +
Sbjct: 1409 VVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGLYQTEDKN--LMIAVYVDDCV 1468

Query: 112  FGSTNSSLCEEFSKCMHNEFEMSMMGEL------SFFLGLQIKQLK--DGIFISQEKYTR 171
              ++N    +EF   + + FE+ + G L      +  LG+ +   K    I ++ + +  
Sbjct: 1469 IAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTIDLTLKSFIN 1528

Query: 172  DLLKKFKLNEGKVAKT--PMSTTTKLDKDEKGKCVDIKTYR-------GMIGSLLYLT-A 231
             + KK+     K+ K+  P  +T K+D  +    +  + +R        ++G L Y+   
Sbjct: 1529 RMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEFRQGVLKLQQLLGELNYVRHK 1588

Query: 232  SRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPR--NVEFNLVGYSD 291
             R DI F+V   AR  + P E  F+ + +I++YL+   D+G++ Y R  N +  ++  +D
Sbjct: 1589 CRYDIEFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIH-YDRDCNKDKKVIAITD 1648

Query: 292  ADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQT 351
            A   GS  D +S  G   + G ++ + +S K  +  +S+TEAE  A+    A    +K T
Sbjct: 1649 AS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVSSTEAELHAIYEGYADSETLKVT 1708

Query: 352  LCDFGLKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSS 408
            L + G   +N + +  D+  AI          + K   I+   I+E ++   I L  ++ 
Sbjct: 1709 LKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTEIIKEKIKEKSIKLLKITG 1768

BLAST of CSPI02G09070 vs. Swiss-Prot
Match: YH41B_YEAST (Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) GN=TY4B-H PE=3 SV=1)

HSP 1 Score: 114.0 bits (284), Expect = 4.0e-24
Identity = 101/377 (26.79%), Postives = 178/377 (47.21%), Query Frame = 1

Query: 52   VYKLEKTLYALKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIYVDDII 111
            V KL K LY LKQ+P+ W D L ++L     K       L+     N  L++ +YVDD +
Sbjct: 1408 VVKLNKALYGLKQSPKEWNDHLRQYLNGIGLKDNSYTPGLYQTEDKN--LMIAVYVDDCV 1467

Query: 112  FGSTNSSLCEEFSKCMHNEFEMSMMGEL------SFFLGLQIKQLK--DGIFISQEKYTR 171
              ++N    +EF   + + FE+ + G L      +  LG+ +   K    I ++ + +  
Sbjct: 1468 IAASNEQRLDEFINKLKSNFELKITGTLIDDVLDTDILGMDLVYNKRLGTIDLTLKSFIN 1527

Query: 172  DLLKKFKLNEGKVAKT--PMSTTTKLDKDEKGKCVDIKTYR-------GMIGSLLYLT-A 231
             + KK+     K+ K+  P  +T K+D  +    +  + +R        ++G L Y+   
Sbjct: 1528 RMDKKYNEELKKIRKSSIPHMSTYKIDPKKDVLQMSEEEFRQGVLKLQQLLGELNYVRHK 1587

Query: 232  SRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPR--NVEFNLVGYSD 291
             R DI F+V   AR  + P E  F+ + +I++YL+   D+G++ Y R  N +  ++  +D
Sbjct: 1588 CRYDINFAVKKVARLVNYPHERVFYMIYKIIQYLVRYKDIGIH-YDRDCNKDKKVIAITD 1647

Query: 292  ADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQT 351
            A   GS  D +S  G   + G ++ + +S K  +  +S+TEAE  A+    A    +K T
Sbjct: 1648 AS-VGSEYDAQSRIGVILWYGMNIFNVYSNKSTNRCVSSTEAELHAIYEGYADSETLKVT 1707

Query: 352  LCDFGLKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSS 408
            L + G   +N + +  D+  AI          + K   I+   I+E ++   I L  ++ 
Sbjct: 1708 LKELGEGDNNDIVMITDSKPAIQGLNRSYQQPKEKFTWIKTEIIKEKIKEKSIKLLKITG 1767

BLAST of CSPI02G09070 vs. TrEMBL
Match: A5BEZ1_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013310 PE=4 SV=1)

HSP 1 Score: 615.5 bits (1586), Expect = 4.7e-173
Identity = 309/454 (68.06%), Postives = 353/454 (77.75%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETFAP--------------------------- 60
           MDENG I+RNKARLVAQG+ QEEGIDYEETFAP                           
Sbjct: 1   MDENGIIVRNKARLVAQGFNQEEGIDYEETFAPVARLEAIRMLLAFACFKDFVLYQMDVK 60

Query: 61  ----------EVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                     EVYVEQPPGF+SF+ PNHV++L+KTLY LKQA RAWY+RLSKF L+  FK
Sbjct: 61  SAFLNGFINEEVYVEQPPGFQSFNFPNHVFRLKKTLYGLKQAXRAWYERLSKFXLKKGFK 120

Query: 121 MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
           MGKID TLFIK K NDML+VQIYVDDIIFG+TN SLCEEFSKCMH+EFEMSMMGEL+FF+
Sbjct: 121 MGKIDXTLFIKTKXNDMLLVQIYVDDIIFGATNVSLCEEFSKCMHSEFEMSMMGELNFFI 180

Query: 181 GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
           GLQIKQLK+G FI+Q KY RDLLK+F + E K  KTPMS++ KLD D KGK ++   YRG
Sbjct: 181 GLQIKQLKEGTFINQAKYIRDLLKRFNMEEAKTMKTPMSSSIKLDMDXKGKLINSTMYRG 240

Query: 241 MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
           MIGSLLYLTASRPDIM+S+CLCARFQSCPKESH  AVKRIL+YL G +D+GL WYP+   
Sbjct: 241 MIGSLLYLTASRPDIMYSICLCARFQSCPKESHLSAVKRILRYLKGIMDIGL-WYPKGDN 300

Query: 301 FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
           F L+GY DADFAG  ++RKSTS TC FLG SLVSW SKKQNS+ALST EAEYIA    CA
Sbjct: 301 FELIGYLDADFAGCKVERKSTSDTCHFLGHSLVSWHSKKQNSIALSTAEAEYIAAGLYCA 360

Query: 361 QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
           QILWMKQTL DF L F++VPI CDNTSAIN++KN + HSRTKHI+IRHHF+R+H Q G I
Sbjct: 361 QILWMKQTLSDFNLIFEHVPIKCDNTSAINISKNLVQHSRTKHIEIRHHFLRDHAQKGDI 420

BLAST of CSPI02G09070 vs. TrEMBL
Match: A0A151TIF5_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_013123 PE=4 SV=1)

HSP 1 Score: 598.2 bits (1541), Expect = 7.8e-168
Identity = 301/454 (66.30%), Postives = 351/454 (77.31%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETF----------------------------- 60
           +DE+G +IRNKARLVA+GY QEEGIDYEET+                             
Sbjct: 441 LDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVK 500

Query: 61  --------APEVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                     EVYVEQPPGFE+ + PNHV+KL+K LY LKQAPRAWY+RLSKFLLE +F 
Sbjct: 501 SAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFT 560

Query: 121 MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
            GK+D TLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M +EFEMSMMGEL+FFL
Sbjct: 561 RGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFL 620

Query: 181 GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
           GLQI+Q K+GIFI+Q KY ++LLK+F +   K   TPMSTT  LDKDE GK +D+K YRG
Sbjct: 621 GLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRG 680

Query: 241 MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
           MIGSLLYL+ASRPDIMFSVCLCAR+QS PKESH  AVKRI++ LLGT ++GL WYP+N+ 
Sbjct: 681 MIGSLLYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRCLLGTTNLGL-WYPKNMP 740

Query: 301 FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
           FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA
Sbjct: 741 FNLVGYSDSDFAGCKTDRKSTSGTCHFIGSALVSWHSKKQNSVALSTAEAEYIAAGSCCA 800

Query: 361 QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
           QILWMKQ L D+GL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G  
Sbjct: 801 QILWMKQQLSDYGLSLDHIPIKCDNTSAINLSKNPVLHSRTKHIEIRHHFLRDHVQKGDY 860

BLAST of CSPI02G09070 vs. TrEMBL
Match: A0A151QU14_CAJCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=KK1_045365 PE=4 SV=1)

HSP 1 Score: 595.9 bits (1535), Expect = 3.9e-167
Identity = 299/454 (65.86%), Postives = 349/454 (76.87%), Query Frame = 1

Query: 1    MDENGNIIRNKARLVAQGYCQEEGIDYEETF----------------------------- 60
            +DE+G +IRNKARLVA+GY QEEGIDYEET+                             
Sbjct: 552  LDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVK 611

Query: 61   --------APEVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                      EVYVEQPPGFE+ + PNHV+KL+K LY LKQAPRAWY+RLSKFLLE +F 
Sbjct: 612  SAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFT 671

Query: 121  MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
             GK+D TLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M +EFEMSMMGEL+FFL
Sbjct: 672  RGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFL 731

Query: 181  GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
            GLQI+Q K+GIFI+Q KY ++LLK+F +   K   TPMSTT  LDKDE GK +D+K YRG
Sbjct: 732  GLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRG 791

Query: 241  MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
            MIGSLLYL+ASRPDIMFSVC CAR+QS PKESH  AVKRI++YLL T ++GL WYP+N+ 
Sbjct: 792  MIGSLLYLSASRPDIMFSVCFCARYQSNPKESHLSAVKRIMRYLLRTTNLGL-WYPKNMS 851

Query: 301  FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
            FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA
Sbjct: 852  FNLVGYSDSDFAGCKTDRKSTSGTCHFIGSALVSWHSKKQNSVALSTAEAEYIAAGSCCA 911

Query: 361  QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
            QILWMKQ L D+GL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G  
Sbjct: 912  QILWMKQQLSDYGLSLDHIPIKCDNTSAINLSKNPVLHSRTKHIEIRHHFLRDHVQKGDC 971

BLAST of CSPI02G09070 vs. TrEMBL
Match: A0A151SK19_CAJCA (Copia protein OS=Cajanus cajan GN=KK1_001310 PE=4 SV=1)

HSP 1 Score: 595.5 bits (1534), Expect = 5.0e-167
Identity = 298/454 (65.64%), Postives = 350/454 (77.09%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETF----------------------------- 60
           +DE+G +IRNKARLVA+GY QEEGIDYEET+                             
Sbjct: 36  LDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVK 95

Query: 61  --------APEVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                     EVYVEQPPGFE+ + PNHV+KL+K LY LKQAPRAWY+RLSKFLLE +F 
Sbjct: 96  SAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFT 155

Query: 121 MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
            GK+D TLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M +EFEMSM+GEL+FF+
Sbjct: 156 RGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMIGELNFFI 215

Query: 181 GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
           GLQI+Q K+GIFI+Q KY ++ LK+F +   K    P+STT  LDKDE GK +D+K YRG
Sbjct: 216 GLQIRQTKNGIFINQSKYCKEFLKRFGMENAKSMTAPISTTCYLDKDEVGKSIDVKKYRG 275

Query: 241 MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
           MIGSLLYL+ASRPDIMFSVCLCAR+QS PKESH  AVKRI++YLLGT ++GL WYP+N+ 
Sbjct: 276 MIGSLLYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRYLLGTTNLGL-WYPKNMP 335

Query: 301 FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
           FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA
Sbjct: 336 FNLVGYSDSDFAGCKTDRKSTSGTCHFIGSALVSWHSKKQNSVALSTAEAEYIAAGSCCA 395

Query: 361 QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
           QILWMKQ L DFGL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G  
Sbjct: 396 QILWMKQQLSDFGLSLDHIPIKCDNTSAINLSKNPVLHSRTKHIEIRHHFLRDHVQKGDC 455

BLAST of CSPI02G09070 vs. TrEMBL
Match: A5BLV7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025872 PE=4 SV=1)

HSP 1 Score: 594.3 bits (1531), Expect = 1.1e-166
Identity = 305/431 (70.77%), Postives = 345/431 (80.05%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETFAPEVYVEQP------PGFESFDLPNHVYK 60
           MDENG IIRNKARLVA G+ QEEGIDYEETFAP V +E          F+ F L     K
Sbjct: 456 MDENGIIIRNKARLVAXGFNQEEGIDYEETFAPVVRLEAIRMLLAFACFKDFVLYQMDVK 515

Query: 61  L--------EKTLYALKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIY 120
                    E+ LY LKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K+NDML+VQIY
Sbjct: 516 SAFLNNFINEEALYGLKQAPRAWYERLSKFLLKKGFKMGKIDTTLFIKTKDNDMLLVQIY 575

Query: 121 VDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLL 180
           VDDIIFG+TN SLCE FSKCMH+EFEMSMMGEL+FFLGLQIKQLK+G FI+Q KY RDLL
Sbjct: 576 VDDIIFGATNVSLCEGFSKCMHSEFEMSMMGELNFFLGLQIKQLKEGTFINQAKYIRDLL 635

Query: 181 KKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCA 240
           K+F + E K  KTPMS++ KLD DEK K V+   YRGMIGSLLYLT SRPDIM+SVCLCA
Sbjct: 636 KRFNMEEAKTMKTPMSSSIKLDMDEKCKPVNSTMYRGMIGSLLYLTTSRPDIMYSVCLCA 695

Query: 241 RFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVEFNLVGYSDADFAGSLLDRKSTSG 300
           RFQSCPK+SH  AVKRIL+YL GT+D+GL WYP+   F L+GYSDADF G  ++RKSTS 
Sbjct: 696 RFQSCPKKSHLSAVKRILRYLKGTMDIGL-WYPKGDNFELIGYSDADFDGCKVERKSTSD 755

Query: 301 TCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLKFDNVPIFC 360
           TC FLG SLVSW+SKKQNSVALST EAEYIAV  CCAQILWMKQTL DF L F++VPI C
Sbjct: 756 TCHFLGHSLVSWYSKKQNSVALSTVEAEYIAVGLCCAQILWMKQTLSDFNLIFEHVPIKC 815

Query: 361 DNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEES 418
           DNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE 
Sbjct: 816 DNTSAINISKNPVQHSRTKHIEIRHHFLRDHAQKGDITLEFVSTKDQLADIFTKPLSEEQ 875

BLAST of CSPI02G09070 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 253.8 bits (647), Expect = 1.8e-67
Identity = 162/445 (36.40%), Postives = 234/445 (52.58%), Query Frame = 1

Query: 2   DENGNIIRNKARLVAQGYCQEEGIDYEETFAP---------------------------- 61
           + +G I R KARLVA+GY Q+EGID+ ETF+P                            
Sbjct: 138 NSDGTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISN 197

Query: 62  ---------EVYVEQPPGFESFD----LPNHVYKLEKTLYALKQAPRAWYDRLSKFLLEN 121
                    E+Y++ PPG+ +       PN V  L+K++Y LKQA R W+ + S  L+  
Sbjct: 198 AFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGF 257

Query: 122 DFKMGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELS 181
            F     D+T F+K+     L V +YVDDII  S N +  +E    + + F++  +G L 
Sbjct: 258 GFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLK 317

Query: 182 FFLGLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKT 241
           +FLGL+I +   GI I Q KY  DLL +  L   K +  PM  +        G  VD K 
Sbjct: 318 YFLGLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKA 377

Query: 242 YRGMIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPR 301
           YR +IG L+YL  +R DI F+V   ++F   P+ +H  AV +IL Y+ GT+  GL+ Y  
Sbjct: 378 YRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLF-YSS 437

Query: 302 NVEFNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVAS 361
             E  L  +SDA F      R+ST+G C FLG+SL+SW SKKQ  V+ S+ EAEY A++ 
Sbjct: 438 QAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSF 497

Query: 362 CCAQILWMKQTLCDFGLKFDN-VPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREH-V 404
              +++W+ Q   +  L       +FCDNT+AI++  N + H RTKHI+   H +RE  V
Sbjct: 498 ATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRERSV 557

BLAST of CSPI02G09070 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 146.4 bits (368), Expect = 4.1e-35
Identity = 82/226 (36.28%), Postives = 133/226 (58.85%), Query Frame = 1

Query: 105 IYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRD 164
           +YVDDI+   ++++L       + + F M  +G + +FLG+QIK    G+F+SQ KY   
Sbjct: 5   LYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYAEQ 64

Query: 165 LLKKFKLNEGKVAKTPMSTTTKLDKDEK---GKCVDIKTYRGMIGSLLYLTASRPDIMFS 224
           +L     N G +   PMST   L  +      K  D   +R ++G+L YLT +RPDI ++
Sbjct: 65  ILN----NAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYA 124

Query: 225 VCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVEFNLVGYSDADFAGSLLDR 284
           V +  +    P  + F  +KR+L+Y+ GTI  GLY + +N + N+  + D+D+AG    R
Sbjct: 125 VNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIH-KNSKLNVQAFCDSDWAGCTSTR 184

Query: 285 KSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILW 328
           +ST+G C FLG +++SW +K+Q +V+ S+TE EY A+A   A++ W
Sbjct: 185 RSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI02G09070 vs. NCBI nr
Match: gi|147861798|emb|CAN83181.1| (hypothetical protein VITISV_013310 [Vitis vinifera])

HSP 1 Score: 617.1 bits (1590), Expect = 2.3e-173
Identity = 309/454 (68.06%), Postives = 354/454 (77.97%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETFAP--------------------------- 60
           MDENG I+RNKARLVAQG+ QEEGIDYEETFAP                           
Sbjct: 1   MDENGIIVRNKARLVAQGFNQEEGIDYEETFAPVARLEAIRMLLAFACFKDFVLYQMDVK 60

Query: 61  ----------EVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                     EVYVEQPPGF+SF+ PNHV++L+KTLY LKQA RAWY+RLSKF+L+  FK
Sbjct: 61  SAFLNGFINEEVYVEQPPGFQSFNFPNHVFRLKKTLYGLKQAXRAWYERLSKFJLKKGFK 120

Query: 121 MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
           MGKID TLFIK K NDML+VQIYVDDIIFG+TN SLCEEFSKCMH+EFEMSMMGEL+FF+
Sbjct: 121 MGKIDXTLFIKTKXNDMLLVQIYVDDIIFGATNVSLCEEFSKCMHSEFEMSMMGELNFFI 180

Query: 181 GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
           GLQIKQLK+G FI+Q KY RDLLK+F + E K  KTPMS++ KLD D KGK ++   YRG
Sbjct: 181 GLQIKQLKEGTFINQAKYIRDLLKRFNMEEAKTMKTPMSSSIKLDMDXKGKLINSTMYRG 240

Query: 241 MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
           MIGSLLYLTASRPDIM+S+CLCARFQSCPKESH  AVKRIL+YL G +D+GL WYP+   
Sbjct: 241 MIGSLLYLTASRPDIMYSICLCARFQSCPKESHLSAVKRILRYLKGIMDIGL-WYPKGDN 300

Query: 301 FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
           F L+GY DADFAG  ++RKSTS TC FLG SLVSW SKKQNS+ALST EAEYIA    CA
Sbjct: 301 FELIGYLDADFAGCKVERKSTSDTCHFLGHSLVSWHSKKQNSIALSTAEAEYIAAGLYCA 360

Query: 361 QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
           QILWMKQTL DF L F++VPI CDNTSAIN++KN + HSRTKHI+IRHHF+R+H Q G I
Sbjct: 361 QILWMKQTLSDFNLIFEHVPIKCDNTSAINISKNLVQHSRTKHIEIRHHFLRDHAQKGDI 420

BLAST of CSPI02G09070 vs. NCBI nr
Match: gi|1012355625|gb|KYP66812.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 598.2 bits (1541), Expect = 1.1e-167
Identity = 301/454 (66.30%), Postives = 351/454 (77.31%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETF----------------------------- 60
           +DE+G +IRNKARLVA+GY QEEGIDYEET+                             
Sbjct: 441 LDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVK 500

Query: 61  --------APEVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                     EVYVEQPPGFE+ + PNHV+KL+K LY LKQAPRAWY+RLSKFLLE +F 
Sbjct: 501 SAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFT 560

Query: 121 MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
            GK+D TLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M +EFEMSMMGEL+FFL
Sbjct: 561 RGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFL 620

Query: 181 GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
           GLQI+Q K+GIFI+Q KY ++LLK+F +   K   TPMSTT  LDKDE GK +D+K YRG
Sbjct: 621 GLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRG 680

Query: 241 MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
           MIGSLLYL+ASRPDIMFSVCLCAR+QS PKESH  AVKRI++ LLGT ++GL WYP+N+ 
Sbjct: 681 MIGSLLYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRCLLGTTNLGL-WYPKNMP 740

Query: 301 FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
           FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA
Sbjct: 741 FNLVGYSDSDFAGCKTDRKSTSGTCHFIGSALVSWHSKKQNSVALSTAEAEYIAAGSCCA 800

Query: 361 QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
           QILWMKQ L D+GL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G  
Sbjct: 801 QILWMKQQLSDYGLSLDHIPIKCDNTSAINLSKNPVLHSRTKHIEIRHHFLRDHVQKGDY 860

BLAST of CSPI02G09070 vs. NCBI nr
Match: gi|1012321187|gb|KYP33754.1| (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan])

HSP 1 Score: 595.9 bits (1535), Expect = 5.5e-167
Identity = 299/454 (65.86%), Postives = 349/454 (76.87%), Query Frame = 1

Query: 1    MDENGNIIRNKARLVAQGYCQEEGIDYEETF----------------------------- 60
            +DE+G +IRNKARLVA+GY QEEGIDYEET+                             
Sbjct: 552  LDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVK 611

Query: 61   --------APEVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                      EVYVEQPPGFE+ + PNHV+KL+K LY LKQAPRAWY+RLSKFLLE +F 
Sbjct: 612  SAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFT 671

Query: 121  MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
             GK+D TLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M +EFEMSMMGEL+FFL
Sbjct: 672  RGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMMGELNFFL 731

Query: 181  GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
            GLQI+Q K+GIFI+Q KY ++LLK+F +   K   TPMSTT  LDKDE GK +D+K YRG
Sbjct: 732  GLQIRQTKNGIFINQSKYCKELLKRFGMENAKSMATPMSTTCYLDKDEVGKSIDVKKYRG 791

Query: 241  MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
            MIGSLLYL+ASRPDIMFSVC CAR+QS PKESH  AVKRI++YLL T ++GL WYP+N+ 
Sbjct: 792  MIGSLLYLSASRPDIMFSVCFCARYQSNPKESHLSAVKRIMRYLLRTTNLGL-WYPKNMS 851

Query: 301  FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
            FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA
Sbjct: 852  FNLVGYSDSDFAGCKTDRKSTSGTCHFIGSALVSWHSKKQNSVALSTAEAEYIAAGSCCA 911

Query: 361  QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
            QILWMKQ L D+GL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G  
Sbjct: 912  QILWMKQQLSDYGLSLDHIPIKCDNTSAINLSKNPVLHSRTKHIEIRHHFLRDHVQKGDC 971

BLAST of CSPI02G09070 vs. NCBI nr
Match: gi|1012343913|gb|KYP55105.1| (Copia protein [Cajanus cajan])

HSP 1 Score: 595.5 bits (1534), Expect = 7.2e-167
Identity = 298/454 (65.64%), Postives = 350/454 (77.09%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETF----------------------------- 60
           +DE+G +IRNKARLVA+GY QEEGIDYEET+                             
Sbjct: 36  LDEHGLVIRNKARLVAKGYNQEEGIDYEETYAPVARLEAIRMLLAYASIMDFKLYQMDVK 95

Query: 61  --------APEVYVEQPPGFESFDLPNHVYKLEKTLYALKQAPRAWYDRLSKFLLENDFK 120
                     EVYVEQPPGFE+ + PNHV+KL+K LY LKQAPRAWY+RLSKFLLE +F 
Sbjct: 96  SAFLNGFIQEEVYVEQPPGFENSEFPNHVFKLKKALYGLKQAPRAWYERLSKFLLEKEFT 155

Query: 121 MGKIDNTLFIKVKNNDMLIVQIYVDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFL 180
            GK+D TLFIK K ND+L+VQIYVDDIIFG+TN  LC+EFS  M +EFEMSM+GEL+FF+
Sbjct: 156 RGKVDTTLFIKRKMNDILLVQIYVDDIIFGATNDYLCKEFSNDMQSEFEMSMIGELNFFI 215

Query: 181 GLQIKQLKDGIFISQEKYTRDLLKKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRG 240
           GLQI+Q K+GIFI+Q KY ++ LK+F +   K    P+STT  LDKDE GK +D+K YRG
Sbjct: 216 GLQIRQTKNGIFINQSKYCKEFLKRFGMENAKSMTAPISTTCYLDKDEVGKSIDVKKYRG 275

Query: 241 MIGSLLYLTASRPDIMFSVCLCARFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVE 300
           MIGSLLYL+ASRPDIMFSVCLCAR+QS PKESH  AVKRI++YLLGT ++GL WYP+N+ 
Sbjct: 276 MIGSLLYLSASRPDIMFSVCLCARYQSNPKESHLSAVKRIMRYLLGTTNLGL-WYPKNMP 335

Query: 301 FNLVGYSDADFAGSLLDRKSTSGTCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCA 360
           FNLVGYSD+DFAG   DRKSTSGTC F+GS+LVSW SKKQNSVALST EAEYIA  SCCA
Sbjct: 336 FNLVGYSDSDFAGCKTDRKSTSGTCHFIGSALVSWHSKKQNSVALSTAEAEYIAAGSCCA 395

Query: 361 QILWMKQTLCDFGLKFDNVPIFCDNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHI 418
           QILWMKQ L DFGL  D++PI CDNTSAINL+KNP+ HSRTKHI+IRHHF+R+HVQ G  
Sbjct: 396 QILWMKQQLSDFGLSLDHIPIKCDNTSAINLSKNPVLHSRTKHIEIRHHFLRDHVQKGDC 455

BLAST of CSPI02G09070 vs. NCBI nr
Match: gi|147816020|emb|CAN72461.1| (hypothetical protein VITISV_025872 [Vitis vinifera])

HSP 1 Score: 594.3 bits (1531), Expect = 1.6e-166
Identity = 305/431 (70.77%), Postives = 345/431 (80.05%), Query Frame = 1

Query: 1   MDENGNIIRNKARLVAQGYCQEEGIDYEETFAPEVYVEQP------PGFESFDLPNHVYK 60
           MDENG IIRNKARLVA G+ QEEGIDYEETFAP V +E          F+ F L     K
Sbjct: 456 MDENGIIIRNKARLVAXGFNQEEGIDYEETFAPVVRLEAIRMLLAFACFKDFVLYQMDVK 515

Query: 61  L--------EKTLYALKQAPRAWYDRLSKFLLENDFKMGKIDNTLFIKVKNNDMLIVQIY 120
                    E+ LY LKQAPRAWY+RLSKFLL+  FKMGKID TLFIK K+NDML+VQIY
Sbjct: 516 SAFLNNFINEEALYGLKQAPRAWYERLSKFLLKKGFKMGKIDTTLFIKTKDNDMLLVQIY 575

Query: 121 VDDIIFGSTNSSLCEEFSKCMHNEFEMSMMGELSFFLGLQIKQLKDGIFISQEKYTRDLL 180
           VDDIIFG+TN SLCE FSKCMH+EFEMSMMGEL+FFLGLQIKQLK+G FI+Q KY RDLL
Sbjct: 576 VDDIIFGATNVSLCEGFSKCMHSEFEMSMMGELNFFLGLQIKQLKEGTFINQAKYIRDLL 635

Query: 181 KKFKLNEGKVAKTPMSTTTKLDKDEKGKCVDIKTYRGMIGSLLYLTASRPDIMFSVCLCA 240
           K+F + E K  KTPMS++ KLD DEK K V+   YRGMIGSLLYLT SRPDIM+SVCLCA
Sbjct: 636 KRFNMEEAKTMKTPMSSSIKLDMDEKCKPVNSTMYRGMIGSLLYLTTSRPDIMYSVCLCA 695

Query: 241 RFQSCPKESHFHAVKRILKYLLGTIDVGLYWYPRNVEFNLVGYSDADFAGSLLDRKSTSG 300
           RFQSCPK+SH  AVKRIL+YL GT+D+GL WYP+   F L+GYSDADF G  ++RKSTS 
Sbjct: 696 RFQSCPKKSHLSAVKRILRYLKGTMDIGL-WYPKGDNFELIGYSDADFDGCKVERKSTSD 755

Query: 301 TCQFLGSSLVSWFSKKQNSVALSTTEAEYIAVASCCAQILWMKQTLCDFGLKFDNVPIFC 360
           TC FLG SLVSW+SKKQNSVALST EAEYIAV  CCAQILWMKQTL DF L F++VPI C
Sbjct: 756 TCHFLGHSLVSWYSKKQNSVALSTVEAEYIAVGLCCAQILWMKQTLSDFNLIFEHVPIKC 815

Query: 361 DNTSAINLTKNPIHHSRTKHIDIRHHFIREHVQNGHITLEFVSSNNQLADIFTKPLSEES 418
           DNTSAIN++KNP+ HSRTKHI+IRHHF+R+H Q G ITLEFVS+ +QLADIFTKPLSEE 
Sbjct: 816 DNTSAINISKNPVQHSRTKHIEIRHHFLRDHAQKGDITLEFVSTKDQLADIFTKPLSEEQ 875

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
COPIA_DROME6.1e-7335.62Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3[more]
POLX_TOBAC1.6e-6534.28Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
M810_ARATH7.3e-3436.28Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0... [more]
YJ41B_YEAST4.0e-2426.79Transposon Ty4-J Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
YH41B_YEAST4.0e-2426.79Transposon Ty4-H Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A5BEZ1_VITVI4.7e-17368.06Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_013310 PE=4 SV=1[more]
A0A151TIF5_CAJCA7.8e-16866.30Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
A0A151QU14_CAJCA3.9e-16765.86Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cajanus cajan GN=... [more]
A0A151SK19_CAJCA5.0e-16765.64Copia protein OS=Cajanus cajan GN=KK1_001310 PE=4 SV=1[more]
A5BLV7_VITVI1.1e-16670.77Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_025872 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G23160.11.8e-6736.40 cysteine-rich RLK (RECEPTOR-like protein kinase) 8[more]
ATMG00810.14.1e-3536.28ATMG00810.1 DNA/RNA polymerases superfamily protein[more]
Match NameE-valueIdentityDescription
gi|147861798|emb|CAN83181.1|2.3e-17368.06hypothetical protein VITISV_013310 [Vitis vinifera][more]
gi|1012355625|gb|KYP66812.1|1.1e-16766.30Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
gi|1012321187|gb|KYP33754.1|5.5e-16765.86Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan][more]
gi|1012343913|gb|KYP55105.1|7.2e-16765.64Copia protein [Cajanus cajan][more]
gi|147816020|emb|CAN72461.1|1.6e-16670.77hypothetical protein VITISV_025872 [Vitis vinifera][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR013103RVT_2
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G09070.1CSPI02G09070.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 2..33
score: 2.3E-8coord: 34..182
score: 1.1
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 2..342
score: 1.7E
NoneNo IPR availablePANTHERPTHR11439:SF192SUBFAMILY NOT NAMEDcoord: 2..342
score: 1.7E
NoneNo IPR availableunknownSSF56672DNA/RNA polymerasescoord: 46..161
score: 4.45E-18coord: 191..370
score: 4.45

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None

The following block(s) are covering this gene:
GeneOrganismBlock
CSPI02G09070Cucurbita pepo (Zucchini)cpecpiB132
CSPI02G09070Cucurbita pepo (Zucchini)cpecpiB304
CSPI02G09070Bottle gourd (USVL1VR-Ls)cpilsiB088
CSPI02G09070Bottle gourd (USVL1VR-Ls)cpilsiB124
CSPI02G09070Bottle gourd (USVL1VR-Ls)cpilsiB128
CSPI02G09070Melon (DHL92) v3.6.1cpimedB127
CSPI02G09070Melon (DHL92) v3.6.1cpimedB139
CSPI02G09070Melon (DHL92) v3.6.1cpimedB141
CSPI02G09070Cucumber (Gy14) v2cgybcpiB056
CSPI02G09070Cucumber (Gy14) v2cgybcpiB109
CSPI02G09070Silver-seed gourdcarcpiB0258
CSPI02G09070Silver-seed gourdcarcpiB1111
CSPI02G09070Cucumber (Chinese Long) v3cpicucB080
CSPI02G09070Cucumber (Chinese Long) v3cpicucB093
CSPI02G09070Cucumber (Chinese Long) v3cpicucB100
CSPI02G09070Watermelon (97103) v2cpiwmbB142
CSPI02G09070Watermelon (97103) v2cpiwmbB144
CSPI02G09070Wax gourdcpiwgoB126
CSPI02G09070Wild cucumber (PI 183967)cpicpiB063
CSPI02G09070Wild cucumber (PI 183967)cpicpiB071
CSPI02G09070Cucumber (Gy14) v1cgycpiB377
CSPI02G09070Cucurbita maxima (Rimu)cmacpiB239
CSPI02G09070Cucurbita maxima (Rimu)cmacpiB358
CSPI02G09070Cucurbita maxima (Rimu)cmacpiB902
CSPI02G09070Cucurbita moschata (Rifu)cmocpiB227
CSPI02G09070Cucurbita moschata (Rifu)cmocpiB351
CSPI02G09070Cucurbita moschata (Rifu)cmocpiB882
CSPI02G09070Cucumber (Chinese Long) v2cpicuB065
CSPI02G09070Cucumber (Chinese Long) v2cpicuB072
CSPI02G09070Melon (DHL92) v3.5.1cpimeB134
CSPI02G09070Melon (DHL92) v3.5.1cpimeB145
CSPI02G09070Melon (DHL92) v3.5.1cpimeB147
CSPI02G09070Watermelon (Charleston Gray)cpiwcgB149
CSPI02G09070Watermelon (Charleston Gray)cpiwcgB150
CSPI02G09070Watermelon (97103) v1cpiwmB128
CSPI02G09070Watermelon (97103) v1cpiwmB134
CSPI02G09070Watermelon (97103) v1cpiwmB170