CSPI03G20720 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI03G20720
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: Chr3: 16745740 .. 16746695 (-)
RNA-Seq Expression: CSPI03G20720
Synteny: CSPI03G20720
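
The location field above can be read programmatically. The following is a minimal sketch, not part of the database record: the names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genomic interval (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'Chr3: 16745740 .. 16746695 (-)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("Chr3: 16745740 .. 16746695 (-)")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption the interval spans 956 bp on the minus strand of Chr3.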
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCAAACATCACTAGTGATCAAAAGACAGCGGAGGTGTGTTTCATTGATAGCGGGTGTTCGAATCACATGACAGGCTTGAAGCCTATATTCAATGAGCTTAAAGAAGGAGAAAAGTTGAAGGTGGAACTTGGAAACAACAAGGAGCTACAAGTAGAACGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAGCTAATGGAGAGTGGGCATTCTATCTTGTTTGATGATGGTGCGTGCTTGATAAAAAATAAGCAAACATGATGAGTTTTGGCAAAAGTAAAGATGACCAAAAGCAAAATGTTTCCGCTGGAAGTTTTAAATGTGGAAAATTTTGCTCTTACTGCAACTGCAACTAATACAACAAAGAACAACTCGAAGTTATGACATTTAAGGTATGTACATCTTAATATTAAAGGGCTTTCTTTGCTAAATCAAAGAGATATGGTTATTCGGTTATCGAAAATCAGTGCTATCAATATCTGTGAGGGATGTGTGTATGGAAAACAAACTCGAAAATCTTTACCTATTGGGAAAGCTTGGAGAGCCTCGAAGTATCTCGAGCTAATTCATGTTGATTTGTGCGAGCCAATGCAAACAAAGTCTCTTGGTGGGAGATTTTATTTCTTGATTTTTATAGACGATTATAGTCGTGTATGAGTTGGATATATTTTCTAGAAAGAAAATCAGAAACATCTGAGAAGTTCAAGCATTTCAAGGCAAAGGTAGAAAAGCAAAGTGGCATGTTCATCAAATCTCTTCGCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTCAACCATTTTTGTGAGGAACATGGCATCCATAGGGAGTTGACAACACCTTACACTCCAGAGCAAAATGGGAAAGCTGAGAGGAAGAATTGA

mRNA sequence

ATGGCAAACATCACTAGTGATCAAAAGACAGCGGAGGTGTGTTTCATTGATAGCGGGTGTTCGAATCACATGACAGGCTTGAAGCCTATATTCAATGAGCTTAAAGAAGGAGAAAAGTTGAAGGTGGAACTTGGAAACAACAAGGAGCTACAAGTAGAACGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAGCTAATGGAGAGTGGGCATTCTATCTTGTTTGATGATGAGATATGGTTATTCGGTTATCGAAAATCAGTGCTATCAATATCTGTGAGGGATGTGTGTATGGAAAACAAACTCGAAAATCTTTACCTATTGGGAAAGCTTGGAGAGCCTCGAAAAAGAAAATCAGAAACATCTGAGAAGTTCAAGCATTTCAAGGCAAAGGTAGAAAAGCAAAGTGGCATGTTCATCAAATCTCTTCGCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTCAACCATTTTTGTGAGGAACATGGCATCCATAGGGAGTTGACAACACCTTACACTCCAGAGCAAAATGGGAAAGCTGAGAGGAAGAATTGA

Coding sequence (CDS)

ATGGCAAACATCACTAGTGATCAAAAGACAGCGGAGGTGTGTTTCATTGATAGCGGGTGTTCGAATCACATGACAGGCTTGAAGCCTATATTCAATGAGCTTAAAGAAGGAGAAAAGTTGAAGGTGGAACTTGGAAACAACAAGGAGCTACAAGTAGAACGCAAAGGAACGGTGGGAATTGAAACTCACCATGGAAATAGAATTCTCACAAATGTTCAGTATGTGCCCGATATTGGATATAATTTGCTGAGTGTTGGACAGCTAATGGAGAGTGGGCATTCTATCTTGTTTGATGATGAGATATGGTTATTCGGTTATCGAAAATCAGTGCTATCAATATCTGTGAGGGATGTGTGTATGGAAAACAAACTCGAAAATCTTTACCTATTGGGAAAGCTTGGAGAGCCTCGAAAAAGAAAATCAGAAACATCTGAGAAGTTCAAGCATTTCAAGGCAAAGGTAGAAAAGCAAAGTGGCATGTTCATCAAATCTCTTCGCAGTGATAGAGGTGGAGAATTTTTGTCCAACAACTTCAACCATTTTTGTGAGGAACATGGCATCCATAGGGAGTTGACAACACCTTACACTCCAGAGCAAAATGGGAAAGCTGAGAGGAAGAATTGA

Protein sequence

MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGIETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCMENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCEEHGIHRELTTPYTPEQNGKAERKN*
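
The relationship between the coding sequence and the protein sequence can be checked with a short translation routine. This is a sketch that assumes the standard genetic code (NCBI translation table 1); the record does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First nine codons and last three codons of the CDS listed above.
print(translate("ATGGCAAACATCACTAGTGATCAAAAG"))  # MANITSDQK
print(translate("AAGAATTGA"))                    # KN*
```

The two fragments reproduce the start (MANITSDQK) and end (KN*) of the protein sequence shown above; passing the full CDS string to `translate` would allow the whole protein to be compared in the same way.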
Homology
BLAST of CSPI03G20720 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 4.4e-11
Identity = 40/111 (36.04%), Positives = 58/111 (52.25%), Query Frame = 0

Query: 109 SVLSISVRDVCMENKLE----NLYLLGKLGEPR--------KRKSETSEKFKHFKAKVEK 168
           ++L +   DVC   ++E    N Y +  + +          K K +  + F+ F A VE+
Sbjct: 479 NILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVER 538

Query: 169 QSGMFIKSLRSDRGGEFLSNNFNHFCEEHGIHRELTTPYTPEQNGKAERKN 208
           ++G  +K LRSD GGE+ S  F  +C  HGI  E T P TP+ NG AER N
Sbjct: 539 ETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMN 589
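
The percentages in each HSP header are the stated match counts divided by the alignment length. A minimal sketch for recomputing them from a header line follows; the function name `hsp_fractions` is hypothetical, and the parser simply picks up every `Label = n/m` field rather than relying on any particular BLAST library.

```python
import re


def hsp_fractions(header: str) -> dict:
    """Return {label: percent} for every 'Label = n/m' field in an HSP header."""
    return {
        label: round(100 * int(num) / int(den), 2)
        for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", header)
    }


# Identity field of the first Swiss-Prot hit (P10978) above.
print(hsp_fractions("Identity = 40/111 (36.04%)"))  # {'Identity': 36.04}
```

Here 40 identical positions over an alignment length of 111 (gaps included) gives 36.04%, matching the reported value.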

BLAST of CSPI03G20720 vs. ExPASy Swiss-Prot
Match: Q87040 (Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol PE=3 SV=1)

HSP 1 Score: 51.2 bits (121), Expect = 1.6e-05
Identity = 22/45 (48.89%), Positives = 30/45 (66.67%), Query Frame = 0

Query: 163 KSLRSDRGGEFLSNNFNHFCEEHGIHRELTTPYTPEQNGKAERKN 208
           K + SD+G  F S+ F  + +E GIH E +TPY P+ +GK ERKN
Sbjct: 931 KVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSSGKVERKN 975

BLAST of CSPI03G20720 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 50.4 bits (119), Expect = 2.8e-05
Identity = 29/72 (40.28%), Positives = 42/72 (58.33%), Query Frame = 0

Query: 136 PRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCEEHGIHRELTTPY 195
           P K+KS+  + F  FK+ VE +    I +L SD GGEF+      +  +HGI    + P+
Sbjct: 539 PLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVV--LRDYLSQHGISHFTSPPH 598

Query: 196 TPEQNGKAERKN 208
           TPE NG +ERK+
Sbjct: 599 TPEHNGLSERKH 608

BLAST of CSPI03G20720 vs. ExPASy Swiss-Prot
Match: P14350 (Pro-Pol polyprotein OS=Human spumaretrovirus OX=11963 GN=pol PE=1 SV=2)

HSP 1 Score: 48.5 bits (114), Expect = 1.0e-04
Identity = 21/45 (46.67%), Positives = 28/45 (62.22%), Query Frame = 0

Query: 163 KSLRSDRGGEFLSNNFNHFCEEHGIHRELTTPYTPEQNGKAERKN 208
           K + SD+G  F S+ F  + +E GIH E +TPY P+   K ERKN
Sbjct: 931 KVIHSDQGAAFTSSTFAEWAKERGIHLEFSTPYHPQSGSKVERKN 975

BLAST of CSPI03G20720 vs. ExPASy Swiss-Prot
Match: P23074 (Pro-Pol polyprotein OS=Simian foamy virus type 1 OX=338478 GN=pol PE=1 SV=3)

HSP 1 Score: 48.1 bits (113), Expect = 1.4e-04
Identity = 22/45 (48.89%), Positives = 29/45 (64.44%), Query Frame = 0

Query: 163 KSLRSDRGGEFLSNNFNHFCEEHGIHRELTTPYTPEQNGKAERKN 208
           K L SD+G  F S+ F  + +E GI  E +TPY P+ +GK ERKN
Sbjct: 930 KVLHSDQGAAFTSSTFADWAKEKGIQLEFSTPYHPQSSGKVERKN 974

BLAST of CSPI03G20720 vs. ExPASy TrEMBL
Match: A0A5A7SV62 (Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold845G001630 PE=4 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 2.5e-70
Identity = 147/207 (71.01%), Positives = 151/207 (72.95%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQK A+V FIDSGCSN MT LKP+F EL EGEKLKVELGN KELQVE KGTVGI
Sbjct: 52  MTNIPSDQKIAKVWFIDSGCSNQMTSLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGI 111

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ETHHGNRILTNVQY  DIGYNLLSVGQLMESGHSILFDDE                    
Sbjct: 112 ETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSILFDDE-------------------- 171

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
                             RKSE  EKFKHF AKVEKQSGMF+KSLRSDRGGEFLSNNFNH
Sbjct: 172 ------------------RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNH 220

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FC+E GIHREL TPYTPEQNG AERKN
Sbjct: 232 FCKERGIHRELITPYTPEQNGIAERKN 220

BLAST of CSPI03G20720 vs. ExPASy TrEMBL
Match: A0A5D3DDC3 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold275G00620 PE=4 SV=1)

HSP 1 Score: 275.0 bits (702), Expect = 2.5e-70
Identity = 147/207 (71.01%), Positives = 151/207 (72.95%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQK A+V FIDSGCSN MT LKP+F EL EGEKLKVELGN KELQVE KGTVGI
Sbjct: 52  MTNIPSDQKIAKVWFIDSGCSNQMTSLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGI 111

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ETHHGNRILTNVQY  DIGYNLLSVGQLMESGHSILFDDE                    
Sbjct: 112 ETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSILFDDE-------------------- 171

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
                             RKSE  EKFKHF AKVEKQSGMF+KSLRSDRGGEFLSNNFNH
Sbjct: 172 ------------------RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNH 220

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FC+E GIHREL TPYTPEQNG AERKN
Sbjct: 232 FCKERGIHRELITPYTPEQNGIAERKN 220

BLAST of CSPI03G20720 vs. ExPASy TrEMBL
Match: A0A5A7V170 (UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold39G00590 PE=4 SV=1)

HSP 1 Score: 273.5 bits (698), Expect = 7.4e-70
Identity = 148/207 (71.50%), Positives = 160/207 (77.29%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQKTAEV FIDSGCSNHMT LKP+F EL EGEKLKVEL N K+LQVE KGTVGI
Sbjct: 180 MTNIPSDQKTAEVWFIDSGCSNHMTCLKPVFKELNEGEKLKVELENGKKLQVEGKGTVGI 239

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ET+ GNRILTNVQYVPDIGYNLLSV QLMESG+SILFDD        KS    +      
Sbjct: 240 ETYRGNRILTNVQYVPDIGYNLLSVEQLMESGYSILFDD----VSNAKSFAVTATATNTT 299

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
           +N  E  +L        +++    EKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFN+
Sbjct: 300 KNNSELWHL--------RKQIRNIEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNN 359

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FC+EHGI RELTTPYTPEQNG AERKN
Sbjct: 360 FCKEHGILRELTTPYTPEQNGVAERKN 374

BLAST of CSPI03G20720 vs. ExPASy TrEMBL
Match: A0A5D3DWC7 (Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold289G00230 PE=4 SV=1)

HSP 1 Score: 267.7 bits (683), Expect = 4.1e-68
Identity = 154/253 (60.87%), Positives = 169/253 (66.80%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQKT EV FIDSG  NHMT LKP+F EL EGEKLKVELGN KELQVE K T+GI
Sbjct: 281 MTNIPSDQKTTEVWFIDSGHLNHMTDLKPVFKELNEGEKLKVELGNGKELQVEGKRTMGI 340

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKS----------- 120
           ETH+GNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDD   L   +++           
Sbjct: 341 ETHNGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDGACLIKNKQTGRVLAKVKMTQ 400

Query: 121 --VLSISVRDV-----------CMENKLE--------------NLYLLGKLGEPR----- 180
             +  + V +V             +N  E              + Y L  + +       
Sbjct: 401 SKMFPLEVSNVENFALTATATNTTKNNSELWHLRPMQTKSLGGSFYFLIFIDDYSRMSWI 460

Query: 181 ---KRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCEEHGIHRELTTP 208
              K +SET EKFKHFKAKVEKQSGMFIKS RSDRGG+FLSNNFNHFCEEHGIHRELTTP
Sbjct: 461 YFLKSQSETFEKFKHFKAKVEKQSGMFIKSFRSDRGGDFLSNNFNHFCEEHGIHRELTTP 520

BLAST of CSPI03G20720 vs. ExPASy TrEMBL
Match: A0A5A7TSJ0 (DUF4219 domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold243G003550 PE=4 SV=1)

HSP 1 Score: 265.8 bits (678), Expect = 1.5e-67
Identity = 141/199 (70.85%), Positives = 147/199 (73.87%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQKT EV FIDS CSNHMTGLK +F EL EGEKLKVEL N KELQVE KGTVGI
Sbjct: 333 MTNIPSDQKTMEVWFIDSSCSNHMTGLKRVFKELNEGEKLKVELENGKELQVEGKGTVGI 392

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ETHHGNRILTNVQYVPDIGYNLLSVGQL+ESG+SILFDDE                    
Sbjct: 393 ETHHGNRILTNVQYVPDIGYNLLSVGQLIESGYSILFDDE-------------------- 452

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
                              KS+T EKFKHFKAKV+KQSG+FIKSLRSDRGGEFL NNFNH
Sbjct: 453 ------------------SKSKTFEKFKHFKAKVKKQSGVFIKSLRSDRGGEFLFNNFNH 493

Query: 181 FCEEHGIHRELTTPYTPEQ 200
           FCEEHGIHRELTTPYTPEQ
Sbjct: 513 FCEEHGIHRELTTPYTPEQ 493

BLAST of CSPI03G20720 vs. NCBI nr
Match: XP_031739225.1 (uncharacterized protein LOC101208246 [Cucumis sativus])

HSP 1 Score: 320.5 bits (820), Expect = 1.1e-83
Identity = 166/207 (80.19%), Positives = 167/207 (80.68%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           MAN+TSDQKTAEVCFIDSGCSNHMTGLKPIFNEL EGEKLKVELGNNKELQVERKGTVGI
Sbjct: 1   MANVTSDQKTAEVCFIDSGCSNHMTGLKPIFNELNEGEKLKVELGNNKELQVERKGTVGI 60

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDE                    
Sbjct: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDE-------------------- 120

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
                             RKSETSEKFKHFKAKVEKQSGMFIKSLRSD GGEFLSNNFNH
Sbjct: 121 ------------------RKSETSEKFKHFKAKVEKQSGMFIKSLRSDGGGEFLSNNFNH 169

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FCEEHGIHRELTTPYTPEQNGKAERKN
Sbjct: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 169

BLAST of CSPI03G20720 vs. NCBI nr
Match: TYK21566.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 275.0 bits (702), Expect = 5.3e-70
Identity = 147/207 (71.01%), Positives = 151/207 (72.95%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQK A+V FIDSGCSN MT LKP+F EL EGEKLKVELGN KELQVE KGTVGI
Sbjct: 52  MTNIPSDQKIAKVWFIDSGCSNQMTSLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGI 111

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ETHHGNRILTNVQY  DIGYNLLSVGQLMESGHSILFDDE                    
Sbjct: 112 ETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSILFDDE-------------------- 171

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
                             RKSE  EKFKHF AKVEKQSGMF+KSLRSDRGGEFLSNNFNH
Sbjct: 172 ------------------RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNH 220

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FC+E GIHREL TPYTPEQNG AERKN
Sbjct: 232 FCKERGIHRELITPYTPEQNGIAERKN 220

BLAST of CSPI03G20720 vs. NCBI nr
Match: KAA0033341.1 (putative gag-pol polyprotein, identical [Cucumis melo var. makuwa])

HSP 1 Score: 275.0 bits (702), Expect = 5.3e-70
Identity = 147/207 (71.01%), Positives = 151/207 (72.95%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQK A+V FIDSGCSN MT LKP+F EL EGEKLKVELGN KELQVE KGTVGI
Sbjct: 52  MTNIPSDQKIAKVWFIDSGCSNQMTSLKPVFKELNEGEKLKVELGNGKELQVEGKGTVGI 111

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ETHHGNRILTNVQY  DIGYNLLSVGQLMESGHSILFDDE                    
Sbjct: 112 ETHHGNRILTNVQYGLDIGYNLLSVGQLMESGHSILFDDE-------------------- 171

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
                             RKSE  EKFKHF AKVEKQSGMF+KSLRSDRGGEFLSNNFNH
Sbjct: 172 ------------------RKSEIFEKFKHFNAKVEKQSGMFVKSLRSDRGGEFLSNNFNH 220

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FC+E GIHREL TPYTPEQNG AERKN
Sbjct: 232 FCKERGIHRELITPYTPEQNGIAERKN 220

BLAST of CSPI03G20720 vs. NCBI nr
Match: KAA0061308.1 (UBN2 domain-containing protein [Cucumis melo var. makuwa] >TYK09887.1 UBN2 domain-containing protein [Cucumis melo var. makuwa])

HSP 1 Score: 273.5 bits (698), Expect = 1.5e-69
Identity = 148/207 (71.50%), Positives = 160/207 (77.29%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQKTAEV FIDSGCSNHMT LKP+F EL EGEKLKVEL N K+LQVE KGTVGI
Sbjct: 180 MTNIPSDQKTAEVWFIDSGCSNHMTCLKPVFKELNEGEKLKVELENGKKLQVEGKGTVGI 239

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKSVLSISVRDVCM 120
           ET+ GNRILTNVQYVPDIGYNLLSV QLMESG+SILFDD        KS    +      
Sbjct: 240 ETYRGNRILTNVQYVPDIGYNLLSVEQLMESGYSILFDD----VSNAKSFAVTATATNTT 299

Query: 121 ENKLENLYLLGKLGEPRKRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNH 180
           +N  E  +L        +++    EKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFN+
Sbjct: 300 KNNSELWHL--------RKQIRNIEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNN 359

Query: 181 FCEEHGIHRELTTPYTPEQNGKAERKN 208
           FC+EHGI RELTTPYTPEQNG AERKN
Sbjct: 360 FCKEHGILRELTTPYTPEQNGVAERKN 374

BLAST of CSPI03G20720 vs. NCBI nr
Match: TYK28117.1 (putative gag-pol polyprotein, identical [Cucumis melo var. makuwa])

HSP 1 Score: 267.7 bits (683), Expect = 8.4e-68
Identity = 154/253 (60.87%), Positives = 169/253 (66.80%), Query Frame = 0

Query: 1   MANITSDQKTAEVCFIDSGCSNHMTGLKPIFNELKEGEKLKVELGNNKELQVERKGTVGI 60
           M NI SDQKT EV FIDSG  NHMT LKP+F EL EGEKLKVELGN KELQVE K T+GI
Sbjct: 281 MTNIPSDQKTTEVWFIDSGHLNHMTDLKPVFKELNEGEKLKVELGNGKELQVEGKRTMGI 340

Query: 61  ETHHGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDEIWLFGYRKS----------- 120
           ETH+GNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDD   L   +++           
Sbjct: 341 ETHNGNRILTNVQYVPDIGYNLLSVGQLMESGHSILFDDGACLIKNKQTGRVLAKVKMTQ 400

Query: 121 --VLSISVRDV-----------CMENKLE--------------NLYLLGKLGEPR----- 180
             +  + V +V             +N  E              + Y L  + +       
Sbjct: 401 SKMFPLEVSNVENFALTATATNTTKNNSELWHLRPMQTKSLGGSFYFLIFIDDYSRMSWI 460

Query: 181 ---KRKSETSEKFKHFKAKVEKQSGMFIKSLRSDRGGEFLSNNFNHFCEEHGIHRELTTP 208
              K +SET EKFKHFKAKVEKQSGMFIKS RSDRGG+FLSNNFNHFCEEHGIHRELTTP
Sbjct: 461 YFLKSQSETFEKFKHFKAKVEKQSGMFIKSFRSDRGGDFLSNNFNHFCEEHGIHRELTTP 520

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name        E-value    Identity (%)    Description
P10978            4.4e-11    36.04           Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q87040            1.6e-05    48.89           Pro-Pol polyprotein OS=Simian foamy virus (isolate chimpanzee) OX=298339 GN=pol ...
Q9ZT94            2.8e-05    40.28           Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P14350            1.0e-04    46.67           Pro-Pol polyprotein OS=Human spumaretrovirus OX=11963 GN=pol PE=1 SV=2
P23074            1.4e-04    48.89           Pro-Pol polyprotein OS=Simian foamy virus type 1 OX=338478 GN=pol PE=1 SV=3

ExPASy TrEMBL
Match Name        E-value    Identity (%)    Description
A0A5A7SV62        2.5e-70    71.01           Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5D3DDC3        2.5e-70    71.01           Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A5A7V170        7.4e-70    71.50           UBN2 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s...
A0A5D3DWC7        4.1e-68    60.87           Putative gag-pol polyprotein, identical OS=Cucumis melo var. makuwa OX=1194695 G...
A0A5A7TSJ0        1.5e-67    70.85           DUF4219 domain-containing protein/UBN2 domain-containing protein OS=Cucumis melo...

NCBI nr
Match Name        E-value    Identity (%)    Description
XP_031739225.1    1.1e-83    80.19           uncharacterized protein LOC101208246 [Cucumis sativus]
TYK21566.1        5.3e-70    71.01           Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAA0033341.1      5.3e-70    71.01           putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]
KAA0061308.1      1.5e-69    71.50           UBN2 domain-containing protein [Cucumis melo var. makuwa] >TYK09887.1 UBN2 domai...
TYK28117.1        8.4e-68    60.87           putative gag-pol polyprotein, identical [Cucumis melo var. makuwa]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term     IPR Description                    Source         Source Term       Source Description     Alignment
IPR036397    Ribonuclease H superfamily         GENE3D         3.30.420.10       -                      coord: 131..207; e-value: 8.6E-19; score: 69.7
None         No IPR available                   PANTHER        PTHR34222         FAMILY NOT NAMED       coord: 4..117
None         No IPR available                   PANTHER        PTHR34222:SF31    SUBFAMILY NOT NAMED    coord: 4..117
IPR001584    Integrase, catalytic core          PROSITE        PS50994           INTEGRASE              coord: 163..207; score: 12.452122
IPR012337    Ribonuclease H-like superfamily    SUPERFAMILY    53098             Ribonuclease H-like    coord: 139..207

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name      Unique Name       Type
CSPI03G20720.1    CSPI03G20720.1    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category              Term Accession    Term Name
biological_process    GO:0015074        DNA integration
molecular_function    GO:0003676        nucleic acid binding