CSPI01G19310 (gene) Wild cucumber (PI 183967)

NameCSPI01G19310
Typegene
OrganismCucumis sativus (Wild cucumber (PI 183967))
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr1 : 14782324 .. 14783642 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAAGACATATTGTATGTAAATAACTTGCACCTTTCTGTTTTTTCTGATGAGAAGCCTGACGACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGGGTTTATGAGGCTATGGGTAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCTCTAAAACTGGAAATAATAAAATGTTTCTGATTAAACATATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGTTATCTAGAATGAATATCAAGTTTGAGGATGAGATACATGAGTTATCGGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCCCAAATGGTGTACTAAGTATGGACCTAGTAAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAGAGGGGGAGGAGTAAAAGTAAGAGTCCAAGAGGTAATAACATAAGCAAAAGTAGAAGTGACCGGTTTGCCAATGTTGAGTGTCACTATTGCCATGAAAAGCATATAAAGAAGTATTATTGAAAATTGAAAAGAGACAGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGATGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACATAGCATTTGGGTGATTGATAGTGGTGCATCAGTTCATGCTACTTTGAAGAGGGATTTGCATCCTATACTCCTGGTGATTTTGGCAGTGTTAGGATGGGTAATGACAGATCAACAAATACAGTTGGCATCGTTGATGTACACTTGAAGAACAAAAATGGTTTTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCCACATGTAAGCTTGATGATGAAGGTTTCTGCAGTACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGCACAAAAATTTTCTTCACTGTACTACATGGATGCAAAAATCATGGAGTCTGATATAAATATGGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCCTAATTTAAAGAGTACACCTCTAAAACGGTGTCCTCATTGTTTGGCAGGAAAGTAG

mRNA sequence

ATGGAAGACATATTGTATGTAAATAACTTGCACCTTTCTGTTTTTTCTGATGAGAAGCCTGACGACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGGGTTTATGAGGCTATGGGTAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCTCTAAAACTGGAAATAATAAAATGTTTCTGATTAAACATATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGTTATCTAGAATGAATATCAAGTTTGAGGATGAGATACATGAGTTATCGGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCCCAAATGGTGTACTAAGTATGGACCTAGTAAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAGAGGGGGAGGAGTAAAAGTAAGAGTCCAAGAGACAGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGATGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACATAGCATTTGGGTGATTGATAGTGGTGCATCAGTTCATGCTACTTTGAAGAGGGATTTGCATCCTATACTCCTGGTGATTTTGGCAGTGTTAGGATGGAACAAAAATGGTTTTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCCACATGTAAGCTTGATGATGAAGGTTTCTGCAGTACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGCACAAAAATTTTCTTCACTGTACTACATGGATGCAAAAATCATGGAGTCTGATATAAATATGGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCCTAATTTAAAGAGTACACCTCTAAAACGGTGTCCTCATTGTTTGGCAGGAAAGTAG

Coding sequence (CDS)

ATGGAAGACATATTGTATGTAAATAACTTGCACCTTTCTGTTTTTTCTGATGAGAAGCCTGACGACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGGGTTTATGAGGCTATGGGTAGAAGATAACTTTCTAAACCATATTTGTGAAGAAACTCATGCGCGAACTATGTGGAATAAGCTTGAATCGCTATGTGCCTCTAAAACTGGAAATAATAAAATGTTTCTGATTAAACATATGATGGAGTTAAAGTATCAAGATGGAGCGCCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGTTATCTAGAATGAATATCAAGTTTGAGGATGAGATACATGAGTTATCGGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCCCAAATGGTGTACTAAGTATGGACCTAGTAAAAAGTAGCGTGTTGAACGAGGAGATGAGAAGAAAGTCTCAAAGTTCTTCTTCACAGTCAGATGTTCTGGTTACTGAAAAGAGGGGGAGGAGTAAAAGTAAGAGTCCAAGAGACAGTAAAAATCATAAGGGCAAGGAAAAGAAGAATGATGATGATAGTGATGCTGATACAATCATTGTAGCCACTGAAGATTTTTACATCTTGTCTGATGGTGATGTTGTAAATCTTGCCACACAACATAGCATTTGGGTGATTGATAGTGGTGCATCAGTTCATGCTACTTTGAAGAGGGATTTGCATCCTATACTCCTGGTGATTTTGGCAGTGTTAGGATGGAACAAAAATGGTTTTAGGCTGATTTTGAAAAATGTGAAACATATTCCTGATATTCGCATGAACTTGATTTCCACATGTAAGCTTGATGATGAAGGTTTCTGCAGTACCTTCGACAATGGCATATGGAAGCTTACTAAAGGTTCAATGGTTATAGCAAAGGCACAAAAATTTTCTTCACTGTACTACATGGATGCAAAAATCATGGAGTCTGATATAAATATGGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGCCATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCCTAATTTAAAGAGTACACCTCTAAAACGGTGTCCTCATTGTTTGGCAGGAAAGTAG
BLAST of CSPI01G19310 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 224.9 bits (572), Expect = 1.5e-57
Identity = 151/448 (33.71%), Postives = 231/448 (51.56%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSD-EKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHART 60
           M D+L    LH  +  D +KPD    ++W     +    +RL + D+ +N+I +E  AR 
Sbjct: 24  MRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIRLHLSDDVVNNIIDEDTARG 83

Query: 61  MWNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEI 120
           +W +LESL  SKT  NK++L K +  L   +G   L HLN F G++ QL+ + +K E+E 
Sbjct: 84  IWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFNGLITQLANLGVKIEEED 143

Query: 121 HELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTE 180
             + +L +LP S+    T++ +      L  D+  + +LNE+MR+K +   +Q   L+TE
Sbjct: 144 KAILLLNSLPSSYDNLATTILHGKTTIELK-DVTSALLLNEKMRKKPE---NQGQALITE 203

Query: 181 KRGRSKSKSPRD--SKNHKGKEK------------------------------------K 240
            RGRS  +S  +      +GK K                                    K
Sbjct: 204 GRGRSYQRSSNNYGRSGARGKSKNRSKSRVRNCYNCNQPGHFKRDCPNPRKGKGETSGQK 263

Query: 241 NDDDSDADTIIVATED---FYILSDGDVVNLATQHSIWVIDSGASVHATLKRDL------ 300
           NDD++ A   +V   D    +I  + + ++L+   S WV+D+ AS HAT  RDL      
Sbjct: 264 NDDNTAA---MVQNNDNVVLFINEEEECMHLSGPESEWVVDTAASHHATPVRDLFCRYVA 323

Query: 301 ------------HPILLVILAVLGWNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCS 360
                       +  +  I  +      G  L+LK+V+H+PD+RMNLIS   LD +G+ S
Sbjct: 324 GDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLKDVRHVPDLRMNLISGIALDRDGYES 383

Query: 361 TFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKG 389
            F N  W+LTKGS+VIAK     +LY  +A+I + ++N   DE +V+LWHKR+ H+SEKG
Sbjct: 384 YFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNAAQDEISVDLWHKRMGHMSEKG 443

BLAST of CSPI01G19310 vs. Swiss-Prot
Match: M300_ARATH (Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana GN=AtMg00300 PE=4 SV=1)

HSP 1 Score: 64.3 bits (155), Expect = 3.3e-09
Identity = 32/94 (34.04%), Postives = 50/94 (53.19%), Query Frame = 1

Query: 296 EGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEAN-VELWHKRLS 355
           E  CS    G+ K+ KG   I K  +  SLY +   +   + N+     +   LWH RL+
Sbjct: 21  EASCS---EGVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLA 80

Query: 356 HISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK 389
           H+S++G+++L KK  L + K + LK C  C+ GK
Sbjct: 81  HMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGK 111

BLAST of CSPI01G19310 vs. TrEMBL
Match: A5C9D7_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007304 PE=4 SV=1)

HSP 1 Score: 456.1 bits (1172), Expect = 4.4e-125
Identity = 246/441 (55.78%), Postives = 292/441 (66.21%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LYV + +L VF  E+P++KTD EW L HR+VCG++R WV+DN LNH+ EE H R+ 
Sbjct: 1   MEDLLYVKDYYLXVFXSERPENKTDAEWNLLHRQVCGYIRXWVDDNXLNHVSEEKHXRSX 60

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KT NNK+FLIK MM LKYQDG    DHLNTFQGI+NQL+ MNIKFE+E+ 
Sbjct: 61  WNKLEQLYARKTXNNKLFLIKKMMSLKYQDGTXXTDHLNTFQGIINQLAGMNIKFEEEVQ 120

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTLPDSW+ FRTSLSNSAP+G+++MDLVKS VLNEEMRRKSQ SSSQS+VLV  K
Sbjct: 121 GLWLLGTLPDSWETFRTSLSNSAPDGIMNMDLVKSCVLNEEMRRKSQGSSSQSNVLVIXK 180

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
            GRSKS+ P                                   RD K  K KEKKND+ 
Sbjct: 181 XGRSKSRGPKNRDRSKSKTNKFANVECHYCHLKGHIKKYCRQLKRDMKQGKVKEKKNDNG 240

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILL------VI 300
            + D +     DF I+ D DVVN A Q S WVID GAS+HAT ++D            V 
Sbjct: 241 GEDDQVATTISDFLIVYDSDVVNFACQESXWVIDXGASIHATPQKDFFTSYTSGDFGSVR 300

Query: 301 LAVLGWNK------------NGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIW 360
           +   G  K            NG  L LKNVKHIPDIRMNLIST KLDDEGFC+TF +  W
Sbjct: 301 MGNDGSAKAIGMGDVRLETSNGTMLTLKNVKHIPDIRMNLISTGKLDDEGFCNTFRDSQW 360

Query: 361 KLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKK 389
           KLT+GSMVIAK  K SSLY M A++++S IN V+D++  ELWH +L H+SEKGL IL KK
Sbjct: 361 KLTRGSMVIAKGNKSSSLYLMQARVIDSSINAVDDDSTFELWHNKLGHMSEKGLMILAKK 420

BLAST of CSPI01G19310 vs. TrEMBL
Match: A5BAF2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027081 PE=4 SV=1)

HSP 1 Score: 454.9 bits (1169), Expect = 9.8e-125
Identity = 242/428 (56.54%), Postives = 290/428 (67.76%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LY+ + +L VF+ E+P++KTD EW L HR+VCG++R WV+DN LNH+ +E HAR++
Sbjct: 1   MEDLLYMKDYYLPVFASERPENKTDAEWNLLHRQVCGYIRQWVDDNVLNHVSKEKHARSL 60

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KTGNNK+FLIK MM LKYQDG PM DHLNTFQGI+NQL+RMNIKFE+E+ 
Sbjct: 61  WNKLEQLYARKTGNNKLFLIKKMMSLKYQDGTPMTDHLNTFQGIINQLARMNIKFEEEVQ 120

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTLPDSW+ FRTSLSNSAP               +EMRRKSQ SSSQS+VLVTEK
Sbjct: 121 GLWLLGTLPDSWETFRTSLSNSAP---------------DEMRRKSQGSSSQSNVLVTEK 180

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
           RGRSKSK P                                   RD K  K KEKKND+ 
Sbjct: 181 RGRSKSKGPKNRDRSKSKTNKFANVECHYCHLKGHIKKYCHQLKRDMKQGKVKEKKNDNG 240

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLVILAV-LG 300
            + D +   T DF I+ D DVVN A   + WVIDSGAS+HATL++D         A+ +G
Sbjct: 241 GEDDQVATTTSDFLIVYDSDVVNFACXETSWVIDSGASIHATLRKDFFTSYTSAKAIGMG 300

Query: 301 ----WNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQ 360
                  NG  L LKNVKHIPDIRMNLIST KLDDEGFC+TF +  W LT+GSMVIAK  
Sbjct: 301 DVRLETSNGTMLTLKNVKHIPDIRMNLISTRKLDDEGFCNTFRDSQWNLTRGSMVIAKGN 360

Query: 361 KFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKR 389
           K SSLY M A++++S+IN V+D++  ELWH RL H+SEKGL IL KKN L  +K   LKR
Sbjct: 361 KSSSLYLMQARVIDSNINAVDDDSTFELWHNRLGHMSEKGLMILAKKNLLSGMKKGSLKR 413

BLAST of CSPI01G19310 vs. TrEMBL
Match: A5AJF5_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010919 PE=4 SV=1)

HSP 1 Score: 422.2 bits (1084), Expect = 7.0e-115
Identity = 233/441 (52.83%), Postives = 285/441 (64.63%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LY  + +L VF+ EKP++KTD +W+L HR VCG++R WV++N LNH+ EE HAR++
Sbjct: 23  MEDLLYAKDYYLLVFASEKPENKTDAKWDLLHRHVCGYIRQWVDNNVLNHVSEEKHARSL 82

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KTGNNK+FLIK M+ LKYQD   M DHLNTFQGI+NQL RMNIKFE+E+ 
Sbjct: 83  WNKLEQLYARKTGNNKLFLIKKMISLKYQDETTMTDHLNTFQGIINQLVRMNIKFEEEMQ 142

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTL DSW+ FRTSLSNSAP+G ++MDLVKS VLNEEM RKSQ SSSQ DVLVT+K
Sbjct: 143 GLWLLGTLSDSWETFRTSLSNSAPDGTMNMDLVKSCVLNEEMGRKSQGSSSQLDVLVTKK 202

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
           + RSKS+ P                                   RD K  K K+KKND+ 
Sbjct: 203 KERSKSRGPNNRDRRKSKTNKFANVECHYFHLKGHIVKYCRQLKRDMKQGKVKDKKNDNG 262

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPI---------- 300
            + D +   T DF I+ D DVVN A Q + WVIDSGA +HAT ++D              
Sbjct: 263 GEDDRVATTTSDFLIVYDSDVVNFACQETSWVIDSGALIHATPQKDFFTSYTFGDFGSVX 322

Query: 301 --------LLVILAVLGWNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIW 360
                    + +  V     NG  LILKNVKHIPDIR   IST KLDDEGF +TF +  W
Sbjct: 323 MDNEGSAKAIGMRYVRLETSNGTMLILKNVKHIPDIR---ISTGKLDDEGFYNTFHDSQW 382

Query: 361 KLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKK 389
           KLT GSMV  K +K SSL  M A++++S IN V+D++ VELWH +L H+SEKGL IL KK
Sbjct: 383 KLTIGSMVATKGKKCSSLCLMQARVIDSSINAVDDDSIVELWHNKLGHMSEKGLMILAKK 442

BLAST of CSPI01G19310 vs. TrEMBL
Match: A5C3L0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007384 PE=4 SV=1)

HSP 1 Score: 389.8 bits (1000), Expect = 3.9e-105
Identity = 216/431 (50.12%), Postives = 266/431 (61.72%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LYV + +  VF+ E+P++K D EW L HR+VCG++R WV+DN LNH+ EE HAR++
Sbjct: 1   MEDLLYVKDYYXPVFASERPENKXDAEWNLLHRQVCGYIRQWVDDNVLNHVSEEKHARSL 60

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KTGNNK+ LIK MM LKYQDG PM DHLNTFQGI+NQL  MNIKFE+E+ 
Sbjct: 61  WNKLEQLYARKTGNNKLLLIKKMMSLKYQDGTPMTDHLNTFQGIINQLVGMNIKFEEEVQ 120

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTLP+ W+ FRTSLSNSA +G+++MDLVKS VLNEEMRRKSQ SSSQS+VLVTEK
Sbjct: 121 GLWLLGTLPNLWETFRTSLSNSALDGIMNMDLVKSCVLNEEMRRKSQGSSSQSNVLVTEK 180

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
           +G+SKS+ P                                   RD K  K KEKKND+ 
Sbjct: 181 KGKSKSRGPKNRDRSKSKTNKFANVECHYCHLKGHIKKYCRQLKRDMKQGKVKEKKNDNG 240

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLVILAVLGW 300
            + D +   T DF I+ D DVVN A Q + WVIDSGAS+HAT ++D             +
Sbjct: 241 GEDDQVATTTSDFLIVYDSDVVNFACQETSWVIDSGASIHATPRKDFFT---------SY 300

Query: 301 NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGI--------WKLTKGSMVIA 360
               F            +RM    + K    G  S    G             +GSMVIA
Sbjct: 301 TSGDFG----------SVRMGNDGSAKAIGMGDESLMMKGSATPSVIVSGSSLRGSMVIA 360

Query: 361 KAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTP 389
           K  K SSLY M A++++S IN V+D++  ELWH RL H+SEKGL IL K N L  +K   
Sbjct: 361 KGNKSSSLYLMQARVIDSSINAVDDDSTFELWHNRLGHMSEKGLMILAKNNLLSGMKKGS 412

BLAST of CSPI01G19310 vs. TrEMBL
Match: Q9ZRJ0_TOBAC (Retrotransposon Tto1 DNA OS=Nicotiana tabacum PE=4 SV=1)

HSP 1 Score: 385.6 bits (989), Expect = 7.3e-104
Identity = 202/442 (45.70%), Postives = 285/442 (64.48%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           M+D+L+V  +HL VFS +KP+DK+D++WE  H +VCG++R +VEDN  NHI   THAR++
Sbjct: 23  MKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGYIRQFVEDNVYNHISGVTHARSL 82

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           W+KLE L ASKTGNNK+F +  +M++KY +G  + DHLN  QGI++QLS M IKF+DE+ 
Sbjct: 83  WDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHLNEIQGIVDQLSGMGIKFDDEVL 142

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQ-SSSSQSDVLVTE 180
            L VL TLP+SW+  + S++NSAPNGV++M+ VKS +LNEEMRR+SQ +SSSQS+VL   
Sbjct: 143 ALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGILNEEMRRRSQGTSSSQSEVLAVT 202

Query: 181 KRGRSKSKSP-----------------------------------RDSKNHKGKEKKNDD 240
            RGRS++KS                                     D K +KGK+ K ++
Sbjct: 203 TRGRSQNKSQSNRDKSRGKSNKFANVECHYCKKKGHIKRFCRQFQNDQKKNKGKKVKPEE 262

Query: 241 DSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV------ 300
            SD +T      +F ++ D D++NL TQ   WVIDSGA++HAT +R+L     +      
Sbjct: 263 SSDDETNSFG--EFNVVYDDDIINLTTQEMTWVIDSGATIHATPRRELFSSYTLGDFGRV 322

Query: 301 ------ILAVLG------WNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGI 360
                    V+G         NG +L+L++V+H+PD+R+NLIS  KLD+EG+C+TF NG 
Sbjct: 323 KMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRLNLISVDKLDEEGYCNTFHNGQ 382

Query: 361 WKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTK 389
           WKLTKGS+++A+  K S LY   A I +  IN+  +++N++LWH+RL H+SEK +  L K
Sbjct: 383 WKLTKGSLMVARGTKQSKLYVTQASISQQVINVAENDSNIKLWHRRLGHMSEKSMARLVK 442

BLAST of CSPI01G19310 vs. TAIR10
Match: ATMG00300.1 (ATMG00300.1 Gag-Pol-related retrotransposon family protein)

HSP 1 Score: 64.3 bits (155), Expect = 1.9e-10
Identity = 32/94 (34.04%), Postives = 50/94 (53.19%), Query Frame = 1

Query: 296 EGFCSTFDNGIWKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEAN-VELWHKRLS 355
           E  CS    G+ K+ KG   I K  +  SLY +   +   + N+     +   LWH RL+
Sbjct: 21  EASCS---EGVLKVLKGCRTILKGNRHDSLYILQGSVETGESNLAETAKDETRLWHSRLA 80

Query: 356 HISEKGLKILTKKNHLPNLKSTPLKRCPHCLAGK 389
           H+S++G+++L KK  L + K + LK C  C+ GK
Sbjct: 81  HMSQRGMELLVKKGFLDSSKVSSLKFCEDCIYGK 111

BLAST of CSPI01G19310 vs. NCBI nr
Match: gi|147784778|emb|CAN75440.1| (hypothetical protein VITISV_007304 [Vitis vinifera])

HSP 1 Score: 456.1 bits (1172), Expect = 6.3e-125
Identity = 246/441 (55.78%), Postives = 292/441 (66.21%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LYV + +L VF  E+P++KTD EW L HR+VCG++R WV+DN LNH+ EE H R+ 
Sbjct: 1   MEDLLYVKDYYLXVFXSERPENKTDAEWNLLHRQVCGYIRXWVDDNXLNHVSEEKHXRSX 60

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KT NNK+FLIK MM LKYQDG    DHLNTFQGI+NQL+ MNIKFE+E+ 
Sbjct: 61  WNKLEQLYARKTXNNKLFLIKKMMSLKYQDGTXXTDHLNTFQGIINQLAGMNIKFEEEVQ 120

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTLPDSW+ FRTSLSNSAP+G+++MDLVKS VLNEEMRRKSQ SSSQS+VLV  K
Sbjct: 121 GLWLLGTLPDSWETFRTSLSNSAPDGIMNMDLVKSCVLNEEMRRKSQGSSSQSNVLVIXK 180

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
            GRSKS+ P                                   RD K  K KEKKND+ 
Sbjct: 181 XGRSKSRGPKNRDRSKSKTNKFANVECHYCHLKGHIKKYCRQLKRDMKQGKVKEKKNDNG 240

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILL------VI 300
            + D +     DF I+ D DVVN A Q S WVID GAS+HAT ++D            V 
Sbjct: 241 GEDDQVATTISDFLIVYDSDVVNFACQESXWVIDXGASIHATPQKDFFTSYTSGDFGSVR 300

Query: 301 LAVLGWNK------------NGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIW 360
           +   G  K            NG  L LKNVKHIPDIRMNLIST KLDDEGFC+TF +  W
Sbjct: 301 MGNDGSAKAIGMGDVRLETSNGTMLTLKNVKHIPDIRMNLISTGKLDDEGFCNTFRDSQW 360

Query: 361 KLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKK 389
           KLT+GSMVIAK  K SSLY M A++++S IN V+D++  ELWH +L H+SEKGL IL KK
Sbjct: 361 KLTRGSMVIAKGNKSSSLYLMQARVIDSSINAVDDDSTFELWHNKLGHMSEKGLMILAKK 420

BLAST of CSPI01G19310 vs. NCBI nr
Match: gi|147776056|emb|CAN69911.1| (hypothetical protein VITISV_027081 [Vitis vinifera])

HSP 1 Score: 454.9 bits (1169), Expect = 1.4e-124
Identity = 242/428 (56.54%), Postives = 290/428 (67.76%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LY+ + +L VF+ E+P++KTD EW L HR+VCG++R WV+DN LNH+ +E HAR++
Sbjct: 1   MEDLLYMKDYYLPVFASERPENKTDAEWNLLHRQVCGYIRQWVDDNVLNHVSKEKHARSL 60

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KTGNNK+FLIK MM LKYQDG PM DHLNTFQGI+NQL+RMNIKFE+E+ 
Sbjct: 61  WNKLEQLYARKTGNNKLFLIKKMMSLKYQDGTPMTDHLNTFQGIINQLARMNIKFEEEVQ 120

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTLPDSW+ FRTSLSNSAP               +EMRRKSQ SSSQS+VLVTEK
Sbjct: 121 GLWLLGTLPDSWETFRTSLSNSAP---------------DEMRRKSQGSSSQSNVLVTEK 180

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
           RGRSKSK P                                   RD K  K KEKKND+ 
Sbjct: 181 RGRSKSKGPKNRDRSKSKTNKFANVECHYCHLKGHIKKYCHQLKRDMKQGKVKEKKNDNG 240

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLVILAV-LG 300
            + D +   T DF I+ D DVVN A   + WVIDSGAS+HATL++D         A+ +G
Sbjct: 241 GEDDQVATTTSDFLIVYDSDVVNFACXETSWVIDSGASIHATLRKDFFTSYTSAKAIGMG 300

Query: 301 ----WNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIWKLTKGSMVIAKAQ 360
                  NG  L LKNVKHIPDIRMNLIST KLDDEGFC+TF +  W LT+GSMVIAK  
Sbjct: 301 DVRLETSNGTMLTLKNVKHIPDIRMNLISTRKLDDEGFCNTFRDSQWNLTRGSMVIAKGN 360

Query: 361 KFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTPLKR 389
           K SSLY M A++++S+IN V+D++  ELWH RL H+SEKGL IL KKN L  +K   LKR
Sbjct: 361 KSSSLYLMQARVIDSNINAVDDDSTFELWHNRLGHMSEKGLMILAKKNLLSGMKKGSLKR 413

BLAST of CSPI01G19310 vs. NCBI nr
Match: gi|147777716|emb|CAN66809.1| (hypothetical protein VITISV_010919 [Vitis vinifera])

HSP 1 Score: 422.2 bits (1084), Expect = 1.0e-114
Identity = 233/441 (52.83%), Postives = 285/441 (64.63%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LY  + +L VF+ EKP++KTD +W+L HR VCG++R WV++N LNH+ EE HAR++
Sbjct: 23  MEDLLYAKDYYLLVFASEKPENKTDAKWDLLHRHVCGYIRQWVDNNVLNHVSEEKHARSL 82

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KTGNNK+FLIK M+ LKYQD   M DHLNTFQGI+NQL RMNIKFE+E+ 
Sbjct: 83  WNKLEQLYARKTGNNKLFLIKKMISLKYQDETTMTDHLNTFQGIINQLVRMNIKFEEEMQ 142

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTL DSW+ FRTSLSNSAP+G ++MDLVKS VLNEEM RKSQ SSSQ DVLVT+K
Sbjct: 143 GLWLLGTLSDSWETFRTSLSNSAPDGTMNMDLVKSCVLNEEMGRKSQGSSSQLDVLVTKK 202

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
           + RSKS+ P                                   RD K  K K+KKND+ 
Sbjct: 203 KERSKSRGPNNRDRRKSKTNKFANVECHYFHLKGHIVKYCRQLKRDMKQGKVKDKKNDNG 262

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPI---------- 300
            + D +   T DF I+ D DVVN A Q + WVIDSGA +HAT ++D              
Sbjct: 263 GEDDRVATTTSDFLIVYDSDVVNFACQETSWVIDSGALIHATPQKDFFTSYTFGDFGSVX 322

Query: 301 --------LLVILAVLGWNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGIW 360
                    + +  V     NG  LILKNVKHIPDIR   IST KLDDEGF +TF +  W
Sbjct: 323 MDNEGSAKAIGMRYVRLETSNGTMLILKNVKHIPDIR---ISTGKLDDEGFYNTFHDSQW 382

Query: 361 KLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKK 389
           KLT GSMV  K +K SSL  M A++++S IN V+D++ VELWH +L H+SEKGL IL KK
Sbjct: 383 KLTIGSMVATKGKKCSSLCLMQARVIDSSINAVDDDSIVELWHNKLGHMSEKGLMILAKK 442

BLAST of CSPI01G19310 vs. NCBI nr
Match: gi|147816208|emb|CAN66323.1| (hypothetical protein VITISV_007384 [Vitis vinifera])

HSP 1 Score: 389.8 bits (1000), Expect = 5.5e-105
Identity = 216/431 (50.12%), Postives = 266/431 (61.72%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           MED+LYV + +  VF+ E+P++K D EW L HR+VCG++R WV+DN LNH+ EE HAR++
Sbjct: 1   MEDLLYVKDYYXPVFASERPENKXDAEWNLLHRQVCGYIRQWVDDNVLNHVSEEKHARSL 60

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           WNKLE L A KTGNNK+ LIK MM LKYQDG PM DHLNTFQGI+NQL  MNIKFE+E+ 
Sbjct: 61  WNKLEQLYARKTGNNKLLLIKKMMSLKYQDGTPMTDHLNTFQGIINQLVGMNIKFEEEVQ 120

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQSSSSQSDVLVTEK 180
            L +LGTLP+ W+ FRTSLSNSA +G+++MDLVKS VLNEEMRRKSQ SSSQS+VLVTEK
Sbjct: 121 GLWLLGTLPNLWETFRTSLSNSALDGIMNMDLVKSCVLNEEMRRKSQGSSSQSNVLVTEK 180

Query: 181 RGRSKSKSP-----------------------------------RDSKNHKGKEKKNDDD 240
           +G+SKS+ P                                   RD K  K KEKKND+ 
Sbjct: 181 KGKSKSRGPKNRDRSKSKTNKFANVECHYCHLKGHIKKYCRQLKRDMKQGKVKEKKNDNG 240

Query: 241 SDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLVILAVLGW 300
            + D +   T DF I+ D DVVN A Q + WVIDSGAS+HAT ++D             +
Sbjct: 241 GEDDQVATTTSDFLIVYDSDVVNFACQETSWVIDSGASIHATPRKDFFT---------SY 300

Query: 301 NKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGI--------WKLTKGSMVIA 360
               F            +RM    + K    G  S    G             +GSMVIA
Sbjct: 301 TSGDFG----------SVRMGNDGSAKAIGMGDESLMMKGSATPSVIVSGSSLRGSMVIA 360

Query: 361 KAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTKKNHLPNLKSTP 389
           K  K SSLY M A++++S IN V+D++  ELWH RL H+SEKGL IL K N L  +K   
Sbjct: 361 KGNKSSSLYLMQARVIDSSINAVDDDSTFELWHNRLGHMSEKGLMILAKNNLLSGMKKGS 412

BLAST of CSPI01G19310 vs. NCBI nr
Match: gi|1167523|dbj|BAA11674.1| (unnamed protein product [Nicotiana tabacum])

HSP 1 Score: 385.6 bits (989), Expect = 1.0e-103
Identity = 202/442 (45.70%), Postives = 285/442 (64.48%), Query Frame = 1

Query: 1   MEDILYVNNLHLSVFSDEKPDDKTDKEWELCHRKVCGFMRLWVEDNFLNHICEETHARTM 60
           M+D+L+V  +HL VFS +KP+DK+D++WE  H +VCG++R +VEDN  NHI   THAR++
Sbjct: 23  MKDLLFVTKMHLPVFSSQKPEDKSDEDWEFEHNQVCGYIRQFVEDNVYNHISGVTHARSL 82

Query: 61  WNKLESLCASKTGNNKMFLIKHMMELKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIH 120
           W+KLE L ASKTGNNK+F +  +M++KY +G  + DHLN  QGI++QLS M IKF+DE+ 
Sbjct: 83  WDKLEELYASKTGNNKLFYLTKLMQVKYVEGTTVADHLNEIQGIVDQLSGMGIKFDDEVL 142

Query: 121 ELSVLGTLPDSWKIFRTSLSNSAPNGVLSMDLVKSSVLNEEMRRKSQ-SSSSQSDVLVTE 180
            L VL TLP+SW+  + S++NSAPNGV++M+ VKS +LNEEMRR+SQ +SSSQS+VL   
Sbjct: 143 ALMVLATLPESWETLKVSITNSAPNGVVNMETVKSGILNEEMRRRSQGTSSSQSEVLAVT 202

Query: 181 KRGRSKSKSP-----------------------------------RDSKNHKGKEKKNDD 240
            RGRS++KS                                     D K +KGK+ K ++
Sbjct: 203 TRGRSQNKSQSNRDKSRGKSNKFANVECHYCKKKGHIKRFCRQFQNDQKKNKGKKVKPEE 262

Query: 241 DSDADTIIVATEDFYILSDGDVVNLATQHSIWVIDSGASVHATLKRDLHPILLV------ 300
            SD +T      +F ++ D D++NL TQ   WVIDSGA++HAT +R+L     +      
Sbjct: 263 SSDDETNSFG--EFNVVYDDDIINLTTQEMTWVIDSGATIHATPRRELFSSYTLGDFGRV 322

Query: 301 ------ILAVLG------WNKNGFRLILKNVKHIPDIRMNLISTCKLDDEGFCSTFDNGI 360
                    V+G         NG +L+L++V+H+PD+R+NLIS  KLD+EG+C+TF NG 
Sbjct: 323 KMGNANFSTVVGKGDVCLETMNGMKLLLRDVRHVPDMRLNLISVDKLDEEGYCNTFHNGQ 382

Query: 361 WKLTKGSMVIAKAQKFSSLYYMDAKIMESDINMVNDEANVELWHKRLSHISEKGLKILTK 389
           WKLTKGS+++A+  K S LY   A I +  IN+  +++N++LWH+RL H+SEK +  L K
Sbjct: 383 WKLTKGSLMVARGTKQSKLYVTQASISQQVINVAENDSNIKLWHRRLGHMSEKSMARLVK 442

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
POLX_TOBAC1.5e-5733.71Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
M300_ARATH3.3e-0934.04Uncharacterized mitochondrial protein AtMg00300 OS=Arabidopsis thaliana GN=AtMg0... [more]
Match NameE-valueIdentityDescription
A5C9D7_VITVI4.4e-12555.78Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007304 PE=4 SV=1[more]
A5BAF2_VITVI9.8e-12556.54Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_027081 PE=4 SV=1[more]
A5AJF5_VITVI7.0e-11552.83Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_010919 PE=4 SV=1[more]
A5C3L0_VITVI3.9e-10550.12Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_007384 PE=4 SV=1[more]
Q9ZRJ0_TOBAC7.3e-10445.70Retrotransposon Tto1 DNA OS=Nicotiana tabacum PE=4 SV=1[more]
Match NameE-valueIdentityDescription
ATMG00300.11.9e-1034.04ATMG00300.1 Gag-Pol-related retrotransposon family protein[more]
Match NameE-valueIdentityDescription
gi|147784778|emb|CAN75440.1|6.3e-12555.78hypothetical protein VITISV_007304 [Vitis vinifera][more]
gi|147776056|emb|CAN69911.1|1.4e-12456.54hypothetical protein VITISV_027081 [Vitis vinifera][more]
gi|147777716|emb|CAN66809.1|1.0e-11452.83hypothetical protein VITISV_010919 [Vitis vinifera][more]
gi|147816208|emb|CAN66323.1|5.5e-10550.12hypothetical protein VITISV_007384 [Vitis vinifera][more]
gi|1167523|dbj|BAA11674.1|1.0e-10345.70unnamed protein product [Nicotiana tabacum][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR025724GAG-pre-integrase_dom
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090304 nucleic acid metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0005488 binding
molecular_function GO:0016787 hydrolase activity

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI01G19310.1CSPI01G19310.1mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 325..388
score: 8.1
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 1..388
score: 1.1
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 28..166
score: 1.8