CSPI07G14380.1 (mRNA) Wild cucumber (PI 183967)

Name: CSPI07G14380.1
Type: mRNA
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr7: 12896094..12897274 (+)
Sequence length: 813
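The location and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the chromosome coordinates are 1-based and inclusive, and that the 813-nt sequence ends in a stop codon (the listed sequences end in TGA). The variable and function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
CHROM_START = 12896094
CHROM_END = 12897274
TRANSCRIPT_LENGTH = 813  # the "Sequence length" field


def genomic_span(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


span = genomic_span(CHROM_START, CHROM_END)

# Bases inside the genomic span that are absent from the transcript.
# Under the assumption that the transcript consists only of its CDS
# segments, these would be intronic bases.
non_exonic = span - TRANSCRIPT_LENGTH

# A length divisible by 3 is consistent with a complete reading frame.
assert TRANSCRIPT_LENGTH % 3 == 0
codons = TRANSCRIPT_LENGTH // 3
residues = codons - 1  # assumes the final codon is a stop codon

print(span, non_exonic, codons, residues)
```

Under these assumptions the predicted protein has 270 residues, which agrees with the highest query coordinate (270) reported in the BLAST alignments on this page.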
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTGCTACCCATAAATGGTTATTGCATCAACTTGACATTAACAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGTGAGAGTGATAAAGTATGCAACCTTCGATAAACTTTGTATGGTTTGAAACAGAGTCCTTGTGCGTGGTTTGGTAAGTTTAGTCAAACTCCTGTATCCTTTGGTATGCAGAAGAGTACATATGATCATTCAGTTTTCTATCGCCGATCTGATAACGGTATAGTTCTACTTGTTGTATATGTTGATGATATTGTTATTACTGGAAATTATGCATCGGGTATTTTGTCTCTCAAAAATTTCCTTCATGGTTAGTTTTATACGAAAGATTTGGGCCAATTGAAATATTTTTTGGGCATTGAAGTAATGAGAAGCAAGAAAGATATTTATTTCTCATCGAAAATATGTATTTAATTTGTTGTCTGAAACATAAAAATTAGGAGCCAAACCAAGTGGCACTCCTATGATGCCAAATCAACAACTTGTTAAAGGAGATTTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCGACTAGACATTGCTTATTCTGTAAGTGTAAATGTTGTAAGTTAGTTCATGTCTTCCCCTACAGCGGATCATTAGGCTGCAGTAGAGCAGATTTTGTGTTATCTGAAAGCTGCTCCTGGATGTGGGATCTTATACAAAGATCATGGACATACGAGAATTGAATGTTTTTCTGATGCTGATTGGGTGGGATCTCGTGAGGATAGAAGATCAACTTCTGGATATTGTGTCTTTGTAGGTGGAAACTTAGTCTCATGGAAGAGTAAGAAAAATAGTATGGATTCACCAACTATTATCTGAGATAGGCTACAGTATTACAGTGCCAGCTAAATTATGGTGTGATAATCAAGCTGCACTTCACATTGCATCTAATCCAGTATTTCGTGAACAAACTAAACATATTAAGGTGGATTGTCACTTCATTTGTGAGAAAATCCAAGATGGGTTGGTGTCCATAGGATATGTAATGACTGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGGAATGATAAGTTATCTGCGCAACAAGCTGGACGTGATTGACATATTTGCTCCAACTTGA

mRNA sequence

ATGGCTGCTACCCATAAATGGTTATTGCATCAACTTGACATTAACAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGTGAGAGTGATAAAAAGAGTACATATGATCATTCAGTTTTCTATCGCCGATCTGATAACGGTATAGTTCTACTTGTTGTATATGTTGATGATATTGTTATTACTGGAAATTATGCATCGGGTATTTTGTCTCTCAAAAATTTCCTTCATGGAGCCAAACCAAGTGGCACTCCTATGATGCCAAATCAACAACTTGTTAAAGGAGATTTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCGACTAGACATTGCTTATTCTGCTGCAGTAGAGCAGATTTTGTGTTATCTGAAAGCTGCTCCTGGATGTGGGATCTTATACAAAGATCATGGACATACGAGAATTGAATGTTTTTCTGATGCTGATTGGGTGGGATCTCGTGAGGATAGAAGATCAACTTCTGGATATTGTGTCTTTGTAGGTGGAAACTTAGTCTCATGGAAGACTGCACTTCACATTGCATCTAATCCAGTATTTCGTGAACAAACTAAACATATTAAGGTGGATTGTCACTTCATTTGTGAGAAAATCCAAGATGGGTTGGTGTCCATAGGATATGTAATGACTGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGGAATGATAAGTTATCTGCGCAACAAGCTGGACGTGATTGACATATTTGCTCCAACTTGA

Coding sequence (CDS)

ATGGCTGCTACCCATAAATGGTTATTGCATCAACTTGACATTAACAATGCTTTTCTGCATGGTAATCTTCAAGAGGAAGTTTATATGGAGCAACCACCTGGGTTTGTTGCTCAGGGTGAGAGTGATAAAAAGAGTACATATGATCATTCAGTTTTCTATCGCCGATCTGATAACGGTATAGTTCTACTTGTTGTATATGTTGATGATATTGTTATTACTGGAAATTATGCATCGGGTATTTTGTCTCTCAAAAATTTCCTTCATGGAGCCAAACCAAGTGGCACTCCTATGATGCCAAATCAACAACTTGTTAAAGGAGATTTATGTAAAGATCCTGAGAGATATAGGAGATTAGTTGGGAAGTTGAACTACTTAATAGTGACTCGACTAGACATTGCTTATTCTGCTGCAGTAGAGCAGATTTTGTGTTATCTGAAAGCTGCTCCTGGATGTGGGATCTTATACAAAGATCATGGACATACGAGAATTGAATGTTTTTCTGATGCTGATTGGGTGGGATCTCGTGAGGATAGAAGATCAACTTCTGGATATTGTGTCTTTGTAGGTGGAAACTTAGTCTCATGGAAGACTGCACTTCACATTGCATCTAATCCAGTATTTCGTGAACAAACTAAACATATTAAGGTGGATTGTCACTTCATTTGTGAGAAAATCCAAGATGGGTTGGTGTCCATAGGATATGTAATGACTGGAGAACAATTGGGAGATATTCTAACTAAAGCTTTAAATGGAGGAATGATAAGTTATCTGCGCAACAAGCTGGACGTGATTGACATATTTGCTCCAACTTGA
BLAST of CSPI07G14380.1 vs. Swiss-Prot
Match: M810_ARATH (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 75.9 bits (185), Expect = 7.7e-13
Identity = 43/137 (31.39%), Positives = 66/137 (48.18%), Query Frame = 1

Query: 76  YASGILSLKNFLHGAKPSGTPM-MPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAY 135
           YA  IL+    L   KP  TP+ +     V      DP  +R +VG L YL +TR DI+Y
Sbjct: 61  YAEQILNNAGMLD-CKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISY 120

Query: 136 SAAV----------------EQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDR 195
           +  +                +++L Y+K     G+    +    ++ F D+DW G    R
Sbjct: 121 AVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTR 180

BLAST of CSPI07G14380.1 vs. Swiss-Prot
Match: COPIA_DROME (Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3)

HSP 1 Score: 65.5 bits (158), Expect = 1.0e-09
Identity = 77/328 (23.48%), Positives = 134/328 (40.85%), Query Frame = 1

Query: 21   GNLQEEVY--MEQPPGFVAQGESDKKSTYDHSVF--YRRSD-NGI-----VLLVVYVDDI 80
            GN+ E +Y  +      +A G+  + + +   +   +R +D N I     + + +  D I
Sbjct: 1076 GNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKI 1135

Query: 81   VITGN-YASGILSLKNFLH-GAKPSGTPMMPNQQLVKGDL-CKDPERYRRLVGKLNYLIV 140
             ++ + Y   ILS  N  +  A  +  P   N +L+  D  C  P   R L+G L Y+++
Sbjct: 1136 YLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDEDCNTP--CRSLIGCLMYIML 1195

Query: 141  -TRLDIA--------YSAA--------VEQILCYLKAAPGCGILYKDH--GHTRIECFSD 200
             TR D+         YS+         ++++L YLK      +++K +     +I  + D
Sbjct: 1196 CTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVD 1255

Query: 201  ADWVGSREDRRSTSGY------------------------------CVFVGGNLVSW--- 260
            +DW GS  DR+ST+GY                               +F       W   
Sbjct: 1256 SDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREALWLKF 1315

Query: 261  -------------------KTALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVM 265
                               +  + IA+NP   ++ KHI +  HF  E++Q+ ++ + Y+ 
Sbjct: 1316 LLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIP 1375

BLAST of CSPI07G14380.1 vs. Swiss-Prot
Match: POLX_TOBAC (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1)

HSP 1 Score: 60.5 bits (145), Expect = 3.4e-08
Identity = 40/120 (33.33%), Positives = 58/120 (48.33%), Query Frame = 1

Query: 1    MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESD------------------ 60
            +AA+    + QLD+  AFLHG+L+EE+YMEQP GF   G+                    
Sbjct: 910  LAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQ 969

Query: 61   ---------KKSTY-----DHSVFYRR-SDNGIVLLVVYVDDIVITGNYASGILSLKNFL 88
                     K  TY     D  V+++R S+N  ++L++YVDD++I G     I  LK  L
Sbjct: 970  WYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDL 1029

BLAST of CSPI07G14380.1 vs. TrEMBL
Match: A0A061DJK1_THECC (Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 OS=Theobroma cacao GN=TCM_001639 PE=4 SV=1)

HSP 1 Score: 275.4 bits (703), Expect = 7.4e-71
Identity = 164/328 (50.00%), Positives = 196/328 (59.76%), Query Frame = 1

Query: 1   MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK-----KSTY-----DHS 60
           MAAT+ W LHQLDI NA LHG+LQEEVYMEQPP FVAQGE  K     KS Y      H+
Sbjct: 385 MAATYDWPLHQLDIKNALLHGDLQEEVYMEQPPEFVAQGEYGKVYHLRKSLYGLKQNPHA 444

Query: 61  ----------------------VFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLH 120
                                 VFY++S  GI+LLVVYVDDIVITG+  +  L       
Sbjct: 445 WFGKFSETIQEFGMKKSKCDHSVFYKQSKAGIILLVVYVDDIVITGSDTARKL------- 504

Query: 121 GAKPSGTPMMPNQQLVK--GDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAVEQILCYLK 180
           GAKP   PM PN QL K  G+L +DPE+YRRLVGKL+YL VTR DIAYS +V   +    
Sbjct: 505 GAKPCNAPMTPNLQLTKKDGELFEDPEKYRRLVGKLDYLTVTRPDIAYSVSV---VSQFM 564

Query: 181 AAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK---------- 240
           +AP             +E    A+W GS+ DRRST+GYCVF+GGNLVSWK          
Sbjct: 565 SAPTINY------WAALEQILYANWAGSKSDRRSTTGYCVFIGGNLVSWKIWMYQLLSEV 624

Query: 241 ---------------TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQL 270
                           ALHIASNPVF E+TKHI++D HFI EKIQ   ++ GYV T +QL
Sbjct: 625 GPKSFLPTKLWCDNQAALHIASNPVFHERTKHIEIDYHFIREKIQQKFIATGYVKTKDQL 684

BLAST of CSPI07G14380.1 vs. TrEMBL
Match: A5APJ0_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035380 PE=4 SV=1)

HSP 1 Score: 255.0 bits (650), Expect = 1.0e-64
Identity = 155/349 (44.41%), Positives = 191/349 (54.73%), Query Frame = 1

Query: 1   MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDKK---------------- 60
           M A   W L+QLDI NAFLHG+L EEVYMEQP GFVAQGES                   
Sbjct: 634 MDAMRSWPLYQLDIKNAFLHGDLAEEVYMEQPLGFVAQGESGLVCRLRRSLYGLKQSPRA 693

Query: 61  ----------------STYDHSVFYRRSDNG-IVLLVVYVDDIVITG------------- 120
                           ST +H VFY  + +G  + LVVYVDDIVITG             
Sbjct: 694 WFSRFSYVVQEFGMFCSTTNHFVFYHHNSSGQCIYLVVYVDDIVITGIEIAQSSSGVVLS 753

Query: 121 --NYASGILSLKNFLHGAKPSGTPMMPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDI 180
              YA  IL   + L   KP  TPM PN   VK +  +DP RYRRLVGKL+YL +TR DI
Sbjct: 754 QRKYALDILEETDMLD-CKPVDTPMDPN---VKLEPLRDPGRYRRLVGKLSYLTITRPDI 813

Query: 181 AYSAAVE----------------QILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSRE 240
            +  ++                 +IL Y+K+ PG G+LY++ GHT++  ++DADWVGS  
Sbjct: 814 YFPVSIVSQLLQSPCDCHWDVVIRILRYIKSTPGQGVLYENRGHTQVVGYTDADWVGSLT 873

Query: 241 DRRSTSGYCVFVGGNLVSWK-----------------------TALHIASNPVFREQTKH 263
           DRRSTSGYCV +GGNL+ WK                       +ALHIASN VF E+TKH
Sbjct: 874 DRRSTSGYCVIIGGNLIYWKSKKQVVVAKFGAEAEYRAMALATSALHIASNLVFHERTKH 933

BLAST of CSPI07G14380.1 vs. TrEMBL
Match: Q6L3Q0_SOLDE (Polyprotein, putative OS=Solanum demissum GN=SDM1_42t00018 PE=4 SV=2)

HSP 1 Score: 245.7 bits (626), Expect = 6.3e-62
Identity = 126/227 (55.51%), Positives = 150/227 (66.08%), Query Frame = 1

Query: 90   AKPSGTPMMPNQQLVK--GDLCKDPERYRRLVGKLNYLIVTRLDIAYS------------ 149
            AKP  TPM+PN QL    GD   DPERYRRLVGKLNYL VTR DI+++            
Sbjct: 1175 AKPCSTPMVPNTQLTNDDGDPFDDPERYRRLVGKLNYLTVTRPDISFAVSIVSQFMSTPT 1234

Query: 150  ----AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGN 209
                AA+EQILCYLK APG GI+Y+++ HTRIECF+D DW GS+ DRRST+GYCVFVGGN
Sbjct: 1235 IKHWAALEQILCYLKGAPGLGIVYRNNEHTRIECFADVDWAGSKIDRRSTTGYCVFVGGN 1294

Query: 210  LVSW-----------------------------KTALHIASNPVFREQTKHIKVDCHFIC 269
            LVSW                             + ALHIASNPV+ E+TKHI+VDCHFI 
Sbjct: 1295 LVSWRMQNPSTELWHNPQVRLCGYFNPKLWCDNQAALHIASNPVYHERTKHIEVDCHFIR 1354

BLAST of CSPI07G14380.1 vs. TrEMBL
Match: A5C163_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_015916 PE=4 SV=1)

HSP 1 Score: 224.9 bits (572), Expect = 1.1e-55
Identity = 147/374 (39.30%), Positives = 184/374 (49.20%), Query Frame = 1

Query: 1    MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK----------------- 60
            MAA   W L+QLDI N FLHG+L EEVYMEQPPGFVAQGES                   
Sbjct: 905  MAAMRSWHLYQLDIKNVFLHGDLVEEVYMEQPPGFVAQGESSLVCRLRRSLYGLKQSPRA 964

Query: 61   ---------------KSTYDHSVFYRRSDNG-IVLLVVYVDDIVITGNYASGI------- 120
                           ++T DHSVFY  + +G  + LVVYVDDIVITG+  +GI       
Sbjct: 965  WFSRFSSVVQEFGMFRNTADHSVFYHHNSSGQCIYLVVYVDDIVITGSDQNGIEIAQSSS 1024

Query: 121  ---LSLKNFLHG---------AKPSGTPMMPNQQLV--KGDLCKDPERYRRLVGKLNYLI 180
               LS + ++            KP  TPM PN +L+  +G    D  RY RLVGKLNYL 
Sbjct: 1025 GEVLSQRKYVLDILEETGTLDCKPVDTPMEPNVKLIPGQGKPLGDLGRYWRLVGKLNYLA 1084

Query: 181  VTRLDIAYSAAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCV 240
            +TR +I++                      + GHT++  ++DADW GS  DRRSTSGYCV
Sbjct: 1085 ITRPNISFP---------------------NRGHTQVVGYTDADWAGSPTDRRSTSGYCV 1144

Query: 241  FVGGNLV-----------------------------SW---------------------- 270
            F+GGNL+                              W                      
Sbjct: 1145 FIGGNLISWKSKKQDVVARSSAEAEYRAMTLATCELIWLKYLLPELRFGKDEQMKLICDN 1204

BLAST of CSPI07G14380.1 vs. TrEMBL
Match: A0A061EWC9_THECC (Uncharacterized protein OS=Theobroma cacao GN=TCM_024268 PE=4 SV=1)

HSP 1 Score: 222.2 bits (565), Expect = 7.4e-55
Identity = 128/274 (46.72%), Positives = 150/274 (54.74%), Query Frame = 1

Query: 1   MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK----------------- 60
           M AT+ W LHQLDI NAFLHG+LQ+EVYMEQP G VAQGE  K                 
Sbjct: 444 MVATYDWPLHQLDIKNAFLHGDLQDEVYMEQPLGLVAQGEYGKVCHLQKCLYGLKQSPRA 503

Query: 61  ---------------KSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLH 120
                          KS  DHSVFY++S+ GI+LLVVYVDDIVITG+             
Sbjct: 504 WFGKFSEVVQEFGMKKSKCDHSVFYKQSEAGIILLVVYVDDIVITGS------------- 563

Query: 121 GAKPSGTPMMPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAV---------- 180
                            G+L +D E+YR LVGKLNYL VTR DIAYS +V          
Sbjct: 564 -------------DTADGELFEDSEKYRGLVGKLNYLTVTRPDIAYSVSVVNQFMSDPTI 623

Query: 181 ------EQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNL 226
                 +QILCYLK APGCG+ Y +HGHT IECFS+ADW  S+ DRRST+ YCVF+GGNL
Sbjct: 624 NHWTDLKQILCYLKGAPGCGLFYGNHGHTNIECFSNADWASSKSDRRSTTRYCVFIGGNL 683

BLAST of CSPI07G14380.1 vs. TAIR10
Match: AT4G23160.1 (AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 99.4 bits (246), Expect = 3.7e-21
Identity = 75/249 (30.12%), Positives = 109/249 (43.78%), Query Frame = 1

Query: 76  YASGILSLKNFLHGAKPSGTPMMPNQQLVK---GDLCKDPERYRRLVGKLNYLIVTRLDI 135
           YA  +L     L G KPS  PM P+        GD   D + YRRL+G+L YL +TRLDI
Sbjct: 338 YALDLLDETGLL-GCKPSSVPMDPSVTFSAHSGGDFV-DAKAYRRLIGRLMYLQITRLDI 397

Query: 136 AYSA----------------AVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSRE 195
           +++                 AV +IL Y+K   G G+ Y      +++ FSDA +   ++
Sbjct: 398 SFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKD 457

Query: 196 DRRSTSGYCVFVGGNLVSWKT----------------ALHIA------------------ 252
            RRST+GYC+F+G +L+SWK+                AL  A                  
Sbjct: 458 TRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPL 517

BLAST of CSPI07G14380.1 vs. TAIR10
Match: ATMG00810.1 (ATMG00810.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 75.9 bits (185), Expect = 4.4e-14
Identity = 43/137 (31.39%), Positives = 66/137 (48.18%), Query Frame = 1

Query: 76  YASGILSLKNFLHGAKPSGTPM-MPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAY 135
           YA  IL+    L   KP  TP+ +     V      DP  +R +VG L YL +TR DI+Y
Sbjct: 61  YAEQILNNAGMLD-CKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISY 120

Query: 136 SAAV----------------EQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDR 195
           +  +                +++L Y+K     G+    +    ++ F D+DW G    R
Sbjct: 121 AVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTR 180

BLAST of CSPI07G14380.1 vs. NCBI nr
Match: gi|590709600|ref|XP_007048598.1| (Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao])

HSP 1 Score: 275.4 bits (703), Expect = 1.1e-70
Identity = 164/328 (50.00%), Positives = 196/328 (59.76%), Query Frame = 1

Query: 1   MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK-----KSTY-----DHS 60
           MAAT+ W LHQLDI NA LHG+LQEEVYMEQPP FVAQGE  K     KS Y      H+
Sbjct: 385 MAATYDWPLHQLDIKNALLHGDLQEEVYMEQPPEFVAQGEYGKVYHLRKSLYGLKQNPHA 444

Query: 61  ----------------------VFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLH 120
                                 VFY++S  GI+LLVVYVDDIVITG+  +  L       
Sbjct: 445 WFGKFSETIQEFGMKKSKCDHSVFYKQSKAGIILLVVYVDDIVITGSDTARKL------- 504

Query: 121 GAKPSGTPMMPNQQLVK--GDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAVEQILCYLK 180
           GAKP   PM PN QL K  G+L +DPE+YRRLVGKL+YL VTR DIAYS +V   +    
Sbjct: 505 GAKPCNAPMTPNLQLTKKDGELFEDPEKYRRLVGKLDYLTVTRPDIAYSVSV---VSQFM 564

Query: 181 AAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNLVSWK---------- 240
           +AP             +E    A+W GS+ DRRST+GYCVF+GGNLVSWK          
Sbjct: 565 SAPTINY------WAALEQILYANWAGSKSDRRSTTGYCVFIGGNLVSWKIWMYQLLSEV 624

Query: 241 ---------------TALHIASNPVFREQTKHIKVDCHFICEKIQDGLVSIGYVMTGEQL 270
                           ALHIASNPVF E+TKHI++D HFI EKIQ   ++ GYV T +QL
Sbjct: 625 GPKSFLPTKLWCDNQAALHIASNPVFHERTKHIEIDYHFIREKIQQKFIATGYVKTKDQL 684

BLAST of CSPI07G14380.1 vs. NCBI nr
Match: gi|147856196|emb|CAN80284.1| (hypothetical protein VITISV_035380 [Vitis vinifera])

HSP 1 Score: 255.0 bits (650), Expect = 1.5e-64
Identity = 155/349 (44.41%), Positives = 191/349 (54.73%), Query Frame = 1

Query: 1   MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDKK---------------- 60
           M A   W L+QLDI NAFLHG+L EEVYMEQP GFVAQGES                   
Sbjct: 634 MDAMRSWPLYQLDIKNAFLHGDLAEEVYMEQPLGFVAQGESGLVCRLRRSLYGLKQSPRA 693

Query: 61  ----------------STYDHSVFYRRSDNG-IVLLVVYVDDIVITG------------- 120
                           ST +H VFY  + +G  + LVVYVDDIVITG             
Sbjct: 694 WFSRFSYVVQEFGMFCSTTNHFVFYHHNSSGQCIYLVVYVDDIVITGIEIAQSSSGVVLS 753

Query: 121 --NYASGILSLKNFLHGAKPSGTPMMPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDI 180
              YA  IL   + L   KP  TPM PN   VK +  +DP RYRRLVGKL+YL +TR DI
Sbjct: 754 QRKYALDILEETDMLD-CKPVDTPMDPN---VKLEPLRDPGRYRRLVGKLSYLTITRPDI 813

Query: 181 AYSAAVE----------------QILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSRE 240
            +  ++                 +IL Y+K+ PG G+LY++ GHT++  ++DADWVGS  
Sbjct: 814 YFPVSIVSQLLQSPCDCHWDVVIRILRYIKSTPGQGVLYENRGHTQVVGYTDADWVGSLT 873

Query: 241 DRRSTSGYCVFVGGNLVSWK-----------------------TALHIASNPVFREQTKH 263
           DRRSTSGYCV +GGNL+ WK                       +ALHIASN VF E+TKH
Sbjct: 874 DRRSTSGYCVIIGGNLIYWKSKKQVVVAKFGAEAEYRAMALATSALHIASNLVFHERTKH 933

BLAST of CSPI07G14380.1 vs. NCBI nr
Match: gi|113205323|gb|AAT38747.2| (Polyprotein, putative [Solanum demissum])

HSP 1 Score: 245.7 bits (626), Expect = 9.0e-62
Identity = 126/227 (55.51%), Positives = 150/227 (66.08%), Query Frame = 1

Query: 90   AKPSGTPMMPNQQLVK--GDLCKDPERYRRLVGKLNYLIVTRLDIAYS------------ 149
            AKP  TPM+PN QL    GD   DPERYRRLVGKLNYL VTR DI+++            
Sbjct: 1175 AKPCSTPMVPNTQLTNDDGDPFDDPERYRRLVGKLNYLTVTRPDISFAVSIVSQFMSTPT 1234

Query: 150  ----AAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGN 209
                AA+EQILCYLK APG GI+Y+++ HTRIECF+D DW GS+ DRRST+GYCVFVGGN
Sbjct: 1235 IKHWAALEQILCYLKGAPGLGIVYRNNEHTRIECFADVDWAGSKIDRRSTTGYCVFVGGN 1294

Query: 210  LVSW-----------------------------KTALHIASNPVFREQTKHIKVDCHFIC 269
            LVSW                             + ALHIASNPV+ E+TKHI+VDCHFI 
Sbjct: 1295 LVSWRMQNPSTELWHNPQVRLCGYFNPKLWCDNQAALHIASNPVYHERTKHIEVDCHFIR 1354

BLAST of CSPI07G14380.1 vs. NCBI nr
Match: gi|147833373|emb|CAN66243.1| (hypothetical protein VITISV_015916 [Vitis vinifera])

HSP 1 Score: 224.9 bits (572), Expect = 1.6e-55
Identity = 147/374 (39.30%), Positives = 184/374 (49.20%), Query Frame = 1

Query: 1    MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK----------------- 60
            MAA   W L+QLDI N FLHG+L EEVYMEQPPGFVAQGES                   
Sbjct: 905  MAAMRSWHLYQLDIKNVFLHGDLVEEVYMEQPPGFVAQGESSLVCRLRRSLYGLKQSPRA 964

Query: 61   ---------------KSTYDHSVFYRRSDNG-IVLLVVYVDDIVITGNYASGI------- 120
                           ++T DHSVFY  + +G  + LVVYVDDIVITG+  +GI       
Sbjct: 965  WFSRFSSVVQEFGMFRNTADHSVFYHHNSSGQCIYLVVYVDDIVITGSDQNGIEIAQSSS 1024

Query: 121  ---LSLKNFLHG---------AKPSGTPMMPNQQLV--KGDLCKDPERYRRLVGKLNYLI 180
               LS + ++            KP  TPM PN +L+  +G    D  RY RLVGKLNYL 
Sbjct: 1025 GEVLSQRKYVLDILEETGTLDCKPVDTPMEPNVKLIPGQGKPLGDLGRYWRLVGKLNYLA 1084

Query: 181  VTRLDIAYSAAVEQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCV 240
            +TR +I++                      + GHT++  ++DADW GS  DRRSTSGYCV
Sbjct: 1085 ITRPNISFP---------------------NRGHTQVVGYTDADWAGSPTDRRSTSGYCV 1144

Query: 241  FVGGNLV-----------------------------SW---------------------- 270
            F+GGNL+                              W                      
Sbjct: 1145 FIGGNLISWKSKKQDVVARSSAEAEYRAMTLATCELIWLKYLLPELRFGKDEQMKLICDN 1204

BLAST of CSPI07G14380.1 vs. NCBI nr
Match: gi|590634785|ref|XP_007028466.1| (Uncharacterized protein TCM_024268 [Theobroma cacao])

HSP 1 Score: 222.2 bits (565), Expect = 1.1e-54
Identity = 128/274 (46.72%), Positives = 150/274 (54.74%), Query Frame = 1

Query: 1   MAATHKWLLHQLDINNAFLHGNLQEEVYMEQPPGFVAQGESDK----------------- 60
           M AT+ W LHQLDI NAFLHG+LQ+EVYMEQP G VAQGE  K                 
Sbjct: 444 MVATYDWPLHQLDIKNAFLHGDLQDEVYMEQPLGLVAQGEYGKVCHLQKCLYGLKQSPRA 503

Query: 61  ---------------KSTYDHSVFYRRSDNGIVLLVVYVDDIVITGNYASGILSLKNFLH 120
                          KS  DHSVFY++S+ GI+LLVVYVDDIVITG+             
Sbjct: 504 WFGKFSEVVQEFGMKKSKCDHSVFYKQSEAGIILLVVYVDDIVITGS------------- 563

Query: 121 GAKPSGTPMMPNQQLVKGDLCKDPERYRRLVGKLNYLIVTRLDIAYSAAV---------- 180
                            G+L +D E+YR LVGKLNYL VTR DIAYS +V          
Sbjct: 564 -------------DTADGELFEDSEKYRGLVGKLNYLTVTRPDIAYSVSVVNQFMSDPTI 623

Query: 181 ------EQILCYLKAAPGCGILYKDHGHTRIECFSDADWVGSREDRRSTSGYCVFVGGNL 226
                 +QILCYLK APGCG+ Y +HGHT IECFS+ADW  S+ DRRST+ YCVF+GGNL
Sbjct: 624 NHWTDLKQILCYLKGAPGCGLFYGNHGHTNIECFSNADWASSKSDRRSTTRYCVFIGGNL 683

The following BLAST results are available for this feature:

Swiss-Prot
Match Name   E-value  Identity (%)  Description
M810_ARATH   7.7e-13  31.39         Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana GN=AtMg0...
COPIA_DROME  1.0e-09  23.48         Copia protein OS=Drosophila melanogaster GN=GIP PE=1 SV=3
POLX_TOBAC   3.4e-08  33.33         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...

TrEMBL
Match Name        E-value  Identity (%)  Description
A0A061DJK1_THECC  7.4e-71  50.00         Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 OS=Theobroma cacao GN=TCM_001...
A5APJ0_VITVI      1.0e-64  44.41         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_035380 PE=4 SV=1
Q6L3Q0_SOLDE      6.3e-62  55.51         Polyprotein, putative OS=Solanum demissum GN=SDM1_42t00018 PE=4 SV=2
A5C163_VITVI      1.1e-55  39.30         Putative uncharacterized protein OS=Vitis vinifera GN=VITISV_015916 PE=4 SV=1
A0A061EWC9_THECC  7.4e-55  46.72         Uncharacterized protein OS=Theobroma cacao GN=TCM_024268 PE=4 SV=1

TAIR10
Match Name   E-value  Identity (%)  Description
AT4G23160.1  3.7e-21  30.12         cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1  4.4e-14  31.39         ATMG00810.1 DNA/RNA polymerases superfamily protein

NCBI nr
Match Name                        E-value  Identity (%)  Description
gi|590709600|ref|XP_007048598.1|  1.1e-70  50.00         Cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [Theobroma cacao]
gi|147856196|emb|CAN80284.1|      1.5e-64  44.41         hypothetical protein VITISV_035380 [Vitis vinifera]
gi|113205323|gb|AAT38747.2|       9.0e-62  55.51         Polyprotein, putative [Solanum demissum]
gi|147833373|emb|CAN66243.1|      1.6e-55  39.30         hypothetical protein VITISV_015916 [Vitis vinifera]
gi|590634785|ref|XP_007028466.1|  1.1e-54  46.72         Uncharacterized protein TCM_024268 [Theobroma cacao]

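The identity values in the summary are the per-HSP identity counts from the alignments above, expressed as percentages of the alignment length. A minimal sketch of that arithmetic follows, using three HSPs copied from this page; the helper name `percent` and the dictionary layout are hypothetical, chosen only for illustration.

```python
def percent(matches: int, aligned: int) -> str:
    """Format matches/aligned as a percentage with two decimals."""
    if aligned <= 0:
        raise ValueError("alignment length must be positive")
    return f"{100 * matches / aligned:.2f}"


# (identical residues, positive-scoring residues, alignment length),
# copied from the HSP header lines of the alignments above.
hsps = {
    "M810_ARATH": (43, 66, 137),
    "A0A061DJK1_THECC": (164, 196, 328),
    "Q6L3Q0_SOLDE": (126, 150, 227),
}

for name, (identical, positives, length) in hsps.items():
    # Prints the identity and positives percentages for each match.
    print(name, percent(identical, length), percent(positives, length))
```

The alignment length is the denominator, not the full query or subject length, so a short but well-conserved HSP such as Q6L3Q0_SOLDE can show a higher identity than a longer alignment with a better E-value.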
The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR013103  RVT_2
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name   Type
CSPI07G14380  CSPI07G14380  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name    Unique Name             Type
CSPI07G14380.1  CSPI07G14380.1-protein  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name         Unique Name          Type
CSPI07G14380.1.cds1  CSPI07G14380.1.cds1  CDS
CSPI07G14380.1.cds2  CSPI07G14380.1.cds2  CDS
CSPI07G14380.1.cds3  CSPI07G14380.1.cds3  CDS
CSPI07G14380.1.cds4  CSPI07G14380.1.cds4  CDS
CSPI07G14380.1.cds5  CSPI07G14380.1.cds5  CDS


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term   IPR Description                                      Source   Source Term      Source Description               Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM     PF07727          RVT_2                            coord: 44..87, score: 7.6E-6; coord: 1..43, score: 1.2
None       No IPR available                                     PANTHER  PTHR11439        GAG-POL-RELATED RETROTRANSPOSON  coord: 2..197, score: 1.2
None       No IPR available                                     PANTHER  PTHR11439:SF185  SUBFAMILY NOT NAMED              coord: 2..197, score: 1.2
None       No IPR available                                     unknown  SSF56672         DNA/RNA polymerases              coord: 5..215, score: 4.89