CSPI03G34520 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI03G34520
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy1-copia retrotransposon protein
LocationChr3: 30327524 .. 30329328 (-)
RNA-Seq ExpressionCSPI03G34520
SyntenyCSPI03G34520
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCAGTTCAAGGCTTTTGGAGGAAAAATCAAGAAAAATAGACTCATCTGCTATGCCTGCGGAAAGGAGGGCCACAATTCCTACCAGTGCAATCAAAGAAAGGGAAAATCAAACAACCAAAGACCAACACCTCAAGTCAACCTTGCTGAACCAGATGATGAAGTCATAGCTGCAGTAGTGGAAGTAAACCTAATTGAAAACAAGACCGAATAGATTCTGGATACTGGAGCATCAAGACACTTCTGTACCAACAGAGATCTTTTCCATGACTATGAAGTATGAAGACGCTGCTGATGGAAAACACGTGTTCATGGGAAACTCAGCCACCGCAGGAGTAATCAGGAAAGGGAAGGTTATTTTAAAATTAACCTCTGGAAAAACTTTATCCTTAAGTAATGTTTTATATGTCCCTTCCCTAAGTAGGAACCTGGTGTCTGGGACTCTTTTGAATCAGACGGGTCTTAAAATTACACGGGAAGGTGACAAAGTGGTCCTCACCAAAAATGGAGAGTTTATCGGTAAGGGGTATCTGTCTAATGGACTTTTTGTACTCAATAATGCCTTTAGGAATGCAAATGTTTCTTGTTTTGCCTACATAACTGAATTTATTGATCTATGGCATGGTAAACTAGGACATGCAAACTTTGCATCAATTAGGAAGCTTAAGGATTTAAAGCTAATTAATGCATGTGAATCGCATGAAAATGGTAAATGTCATGTATGTGTAGAAAGTAAGTGCGTTAAGAAACCTTACAAATCTATTTTGACAAGAAGTACAGAATTATTAGAGTTAATTCACTCAGATCTAGCTGATTTTGGAAATACCCCTAGTAGAGGGAGCAAGGATTATTATGTATCTTTTGTTGATGACTTCTCTAGATATACTAAGATTTACTTCATAAAAACAAAAGATGAAGCTAGTAGCATGTTTATGAAATTTAAGGTAGAGTCTGAAAATCAACTGAGCAAAAGAATAAAAAGATTAAGATCAGATAGAGGTGGTGAGTATTCTGATAGAACTCTAAAAGAATATTTTGAATCAAATGGAATTATTCATGAATTTACTACGCCTTACTCACGACAGCAAAATGGTAGAGCAGAACGTAGAAATAGAACTCTTAAGGAAATGATGAATGCTATGTTATTAAGCTCTGGTTTATCTAACAATATGTGAGGGGAAGCCATGTTGTCTGCTTGTTTTGTTCTTAACAGGGTTCCCGACAGAAAACTGGACAAAACCTCCTACGAACTCTGGAAAGGCGGTCCAAAACTCAGCTACCTGAAAGTGTGGGGTTGCTTGGGTAAGGTACCATTTCCTGCACTAAAGAAATCCACGACAGGGTCTAAAACCTTTGACTGTATATTTATCGGTTATGCTCAAAATAGTGTTGCATATAGGTTTATGTGTTTGAATAAGCATAAATGAATCTAGGGATGTAGAATTCTTTGAGCATGTACTACCTTTAAAGAAATCTCTGTCTTTTTCTTGTCAATCGGAAAATATGCATGATCTAAATAATCCCAAGAATGTCAGTGATACACCTGAGGTAGATACTTCTAGCATAAGATATGATTTAGAACCTGGAAGAAGCAAAAGACAAAGAACTGAGAGAAGTTTTGGACCTGATTTCCTAAGCACCTTTATAGTAGAAAGGCACGATGAAATTAATTGCAATTTCACTAGCTTGTTTTTAATTGATGAAGATCCTAAAACTTATCCAGAGGCCTTAAGCTCGATAGAATCCAGTATGCGGAAAGAGGCCATTAAAAGCGAATTGGATTCACTGACCATGAATCAAACATAG

mRNA sequence

ATGCAGTTCAAGGCTTTTGGAGGAAAAATCAAGAAAAATAGACTCATCTGCTATGCCTGCGGAAAGGAGGGCCACAATTCCTACCAGTGCAATCAAAGAAAGGGAAAATCAAACAACCAAAGACCAACACCTCAAGTCAACCTTGCTGAACCAGATGATGAAGTCATAGCTGCAGTAGTGGAATATGAAGACGCTGCTGATGGAAAACACGTGTTCATGGGAAACTCAGCCACCGCAGGAGTAATCAGGAAAGGGAAGACGGGTCTTAAAATTACACGGGAAGGTGACAAAGTGGTCCTCACCAAAAATGGAGAGTTTATCGGTAAGGGGTATCTGTCTAATGGACTTTTTGTACTCAATAATGCCTTTAGGAATGCAAATGTTTCTTGTTTTGCCTACATAACTGAATTTATTGATCTATGGCATGGTAAACTAGGACATGCAAACTTTGCATCAATTAGGAAGCTTAAGGATTTAAAGCTAATTAATGCATGTGAATCGCATGAAAATGGTAAATGTCATGTATGTGTAGAAAGTAAGTGCGTTAAGAAACCTTACAAATCTATTTTGACAAGAAGTACAGAATTATTAGAGTTAATTCACTCAGATCTAGCTGATTTTGGAAATACCCCTAGTAGAGGGAGCAAGGATTATTATGTATCTTTTGTTGATGACTTCTCTAGATATACTAAGATTTACTTCATAAAAACAAAAGATGAAGCTAGTAGCATGTTTATGAAATTTAAGGTAGAGTCTGAAAATCAACTGAGCAAAAGAATAAAAAGATTAAGATCAGATAGAGGTGGTGAGTATTCTGATAGAACTCTAAAAGAATATTTTGAATCAAATGGAATTATTCATGAATTTACTACGCCTTACTCACGACAGCAAAATGGTAGAGCAGAACGTAGAAATAGAACTCTTAAGGAAATGATGAATGCTATGTTATTAAGCTCTGGTTTATCTAACAATATGGTTCCCGACAGAAAACTGGACAAAACCTCCTACGAACTCTGGAAAGGCGGTCCAAAACTCAGCTACCTGAAAGTGTGGGGTTGCTTGGGTAAGGTACCATTTCCTGCACTAAAGAAATCCACGACAGGGGATGTAGAATTCTTTGAGCATGTACTACCTTTAAAGAAATCTCTGTCTTTTTCTTGTCAATCGGAAAATATGCATGATCTAAATAATCCCAAGAATGTCAGTGATACACCTGAGGTAGATACTTCTAGCATAAGATATGATTTAGAACCTGGAAGAAGCAAAAGACAAAGAACTGAGAGAAGTTTTGGACCTGATTTCCTAAGCACCTTTATAGTAGAAAGGCACGATGAAATTAATTGCAATTTCACTAGCTTGTTTTTAATTGATGAAGATCCTAAAACTTATCCAGAGGCCTTAAGCTCGATAGAATCCAGTATGCGGAAAGAGGCCATTAAAAGCGAATTGGATTCACTGACCATGAATCAAACATAG

Coding sequence (CDS)

ATGCAGTTCAAGGCTTTTGGAGGAAAAATCAAGAAAAATAGACTCATCTGCTATGCCTGCGGAAAGGAGGGCCACAATTCCTACCAGTGCAATCAAAGAAAGGGAAAATCAAACAACCAAAGACCAACACCTCAAGTCAACCTTGCTGAACCAGATGATGAAGTCATAGCTGCAGTAGTGGAATATGAAGACGCTGCTGATGGAAAACACGTGTTCATGGGAAACTCAGCCACCGCAGGAGTAATCAGGAAAGGGAAGACGGGTCTTAAAATTACACGGGAAGGTGACAAAGTGGTCCTCACCAAAAATGGAGAGTTTATCGGTAAGGGGTATCTGTCTAATGGACTTTTTGTACTCAATAATGCCTTTAGGAATGCAAATGTTTCTTGTTTTGCCTACATAACTGAATTTATTGATCTATGGCATGGTAAACTAGGACATGCAAACTTTGCATCAATTAGGAAGCTTAAGGATTTAAAGCTAATTAATGCATGTGAATCGCATGAAAATGGTAAATGTCATGTATGTGTAGAAAGTAAGTGCGTTAAGAAACCTTACAAATCTATTTTGACAAGAAGTACAGAATTATTAGAGTTAATTCACTCAGATCTAGCTGATTTTGGAAATACCCCTAGTAGAGGGAGCAAGGATTATTATGTATCTTTTGTTGATGACTTCTCTAGATATACTAAGATTTACTTCATAAAAACAAAAGATGAAGCTAGTAGCATGTTTATGAAATTTAAGGTAGAGTCTGAAAATCAACTGAGCAAAAGAATAAAAAGATTAAGATCAGATAGAGGTGGTGAGTATTCTGATAGAACTCTAAAAGAATATTTTGAATCAAATGGAATTATTCATGAATTTACTACGCCTTACTCACGACAGCAAAATGGTAGAGCAGAACGTAGAAATAGAACTCTTAAGGAAATGATGAATGCTATGTTATTAAGCTCTGGTTTATCTAACAATATGGTTCCCGACAGAAAACTGGACAAAACCTCCTACGAACTCTGGAAAGGCGGTCCAAAACTCAGCTACCTGAAAGTGTGGGGTTGCTTGGGTAAGGTACCATTTCCTGCACTAAAGAAATCCACGACAGGGGATGTAGAATTCTTTGAGCATGTACTACCTTTAAAGAAATCTCTGTCTTTTTCTTGTCAATCGGAAAATATGCATGATCTAAATAATCCCAAGAATGTCAGTGATACACCTGAGGTAGATACTTCTAGCATAAGATATGATTTAGAACCTGGAAGAAGCAAAAGACAAAGAACTGAGAGAAGTTTTGGACCTGATTTCCTAAGCACCTTTATAGTAGAAAGGCACGATGAAATTAATTGCAATTTCACTAGCTTGTTTTTAATTGATGAAGATCCTAAAACTTATCCAGAGGCCTTAAGCTCGATAGAATCCAGTATGCGGAAAGAGGCCATTAAAAGCGAATTGGATTCACTGACCATGAATCAAACATAG

Protein sequence

MQFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVEYEDAADGKHVFMGNSATAGVIRKGKTGLKITREGDKVVLTKNGEFIGKGYLSNGLFVLNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVESKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTKDEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQNGRAERRNRTLKEMMNAMLLSSGLSNNMVPDRKLDKTSYELWKGGPKLSYLKVWGCLGKVPFPALKKSTTGDVEFFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTPEVDTSSIRYDLEPGRSKRQRTERSFGPDFLSTFIVERHDEINCNFTSLFLIDEDPKTYPEALSSIESSMRKEAIKSELDSLTMNQT*
Homology
BLAST of CSPI03G34520 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 7.7e-30
Identity = 80/237 (33.76%), Postives = 123/237 (51.90%), Query Frame = 0

Query: 87  TGLKITREG-------DKVVLTKNGEFIGKGYLSNGLFVLNNAFRNANVSCFAYITEFID 146
           +G+ + R+G        K  LTK    I KG     L+  N       ++  A     +D
Sbjct: 365 SGIALDRDGYESYFANQKWRLTKGSLVIAKGVARGTLYRTNAEICQGELNA-AQDEISVD 424

Query: 147 LWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVESKCVKKPYKSILTRSTELLEL 206
           LWH ++GH +   ++ L    LI+  +      C  C+  K  +  +++   R   +L+L
Sbjct: 425 LWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSFQTSSERKLNILDL 484

Query: 207 IHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTKDEASSMFMKFKVESENQLSKR 266
           ++SD+       S G   Y+V+F+DD SR   +Y +KTKD+   +F KF    E +  ++
Sbjct: 485 VYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVERETGRK 544

Query: 267 IKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQNGRAERRNRTLKEMMNAML 317
           +KRLRSD GGEY+ R  +EY  S+GI HE T P + Q NG AER NRT+ E + +ML
Sbjct: 545 LKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKVRSML 600

BLAST of CSPI03G34520 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 105.9 bits (263), Expect = 1.3e-21
Identity = 85/301 (28.24%), Postives = 132/301 (43.85%), Query Frame = 0

Query: 90  KITREGDKVVLTKNGEFIGKGYLSNGLFVLNNAFRNANVSCFAYITEFID--------LW 149
           ++   G  +   K+G  I K    NGL V+ N+    NV    +    I+        LW
Sbjct: 363 RLQEAGMSIEFDKSGVTISK----NGLMVVKNSGMLNNVPVINFQAYSINAKHKNNFRLW 422

Query: 150 HGKLGHANFASIRKLK------DLKLIN----ACESHENGKCHVCVESKCVKKPYKSI-- 209
           H + GH +   + ++K      D  L+N    +CE      C  C+  K  + P+K +  
Sbjct: 423 HERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEI-----CEPCLNGKQARLPFKQLKD 482

Query: 210 LTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTKDEASSMFMKFK 269
            T     L ++HSD+       +   K+Y+V FVD F+ Y   Y IK K +  SMF  F 
Sbjct: 483 KTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFV 542

Query: 270 VESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQNGRAERRNRTLK 329
            +SE   + ++  L  D G EY    ++++    GI +  T P++ Q NG +ER  RT+ 
Sbjct: 543 AKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTIT 602

Query: 330 EMMNAMLLSSGLSN--------------NMVPDRKL---DKTSYELWKG-GPKLSYLKVW 353
           E    M+  + L                N +P R L    KT YE+W    P L +L+V+
Sbjct: 603 EKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVF 654

BLAST of CSPI03G34520 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 97.1 bits (240), Expect = 6.1e-19
Identity = 75/234 (32.05%), Postives = 116/234 (49.57%), Query Frame = 0

Query: 141 WHGKLGHANFA---SIRKLKDLKLINACESHENGKCHVCVESKCVKKPYKSILTRSTELL 200
           WH +LGH + A   S+     L ++N   SH+   C  C  +K  K P+ +    S++ L
Sbjct: 446 WHSRLGHPSLAILNSVISNHSLPVLN--PSHKLLSCSDCFINKSHKVPFSNSTITSSKPL 505

Query: 201 ELIHSDLADFGNTP--SRGSKDYYVSFVDDFSRYTKIYFIKTKDEASSMFMKFKVESENQ 260
           E I+SD+    ++P  S  +  YYV FVD F+RYT +Y +K K +    F+ FK   EN+
Sbjct: 506 EYIYSDV---WSSPILSIDNYRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENR 565

Query: 261 LSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQNGRAERRNRTLKEMMNAM 320
              RI  L SD GGE+    L++Y   +GI H  + P++ + NG +ER++R + EM   +
Sbjct: 566 FQTRIGTLYSDNGGEFV--VLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTL 625

Query: 321 LLSSG---------------LSNNM-VPDRKLDKTSYELWKGGPKLSYLKVWGC 354
           L  +                L N +  P  +L     +L+   P    LKV+GC
Sbjct: 626 LSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGC 672

BLAST of CSPI03G34520 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 95.9 bits (237), Expect = 1.4e-18
Identity = 72/234 (30.77%), Postives = 111/234 (47.44%), Query Frame = 0

Query: 141 WHGKLGH---ANFASIRKLKDLKLINACESHENGKCHVCVESKCVKKPYKSILTRSTELL 200
           WH +LGH   +   S+     L ++N   SH+   C  C+ +K  K P+      ST  L
Sbjct: 467 WHARLGHPAPSILNSVISNYSLSVLN--PSHKFLSCSDCLINKSNKVPFSQSTINSTRPL 526

Query: 201 ELIHSDLADFGNTP--SRGSKDYYVSFVDDFSRYTKIYFIKTKDEASSMFMKFKVESENQ 260
           E I+SD+    ++P  S  +  YYV FVD F+RYT +Y +K K +    F+ FK   EN+
Sbjct: 527 EYIYSDV---WSSPILSHDNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENR 586

Query: 261 LSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQNGRAERRNRTLKEMMNAM 320
              RI    SD GGE+    L EYF  +GI H  + P++ + NG +ER++R + E    +
Sbjct: 587 FQTRIGTFYSDNGGEFV--ALWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTL 646

Query: 321 LLSSGLSNNM----------------VPDRKLDKTSYELWKGGPKLSYLKVWGC 354
           L  + +                     P  +L+    +L+   P    L+V+GC
Sbjct: 647 LSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGC 693

BLAST of CSPI03G34520 vs. ExPASy Swiss-Prot
Match: Q12491 (Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-B PE=3 SV=1)

HSP 1 Score: 84.7 bits (208), Expect = 3.1e-15
Identity = 71/244 (29.10%), Postives = 120/244 (49.18%), Query Frame = 0

Query: 100 LTKNGEF--IGKGYL---SNGLFVLNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIR 159
           + K+G+F  + K YL         +NN  ++ +V+ + Y      L H  LGHANF SI+
Sbjct: 553 IVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPY-----PLIHRMLGHANFRSIQ 612

Query: 160 K-LKDLKLINACESH------ENGKCHVCVESKCVK----KPYKSILTRSTELLELIHSD 219
           K LK   +    ES          +C  C+  K  K    K  +     S E  + +H+D
Sbjct: 613 KSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEPFQYLHTD 672

Query: 220 LADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTKDEAS--SMFMKFKVESENQLSKRIK 279
           +    +   + +  Y++SF D+ +R+  +Y +  + E S  ++F       +NQ + R+ 
Sbjct: 673 IFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKNQFNARVL 732

Query: 280 RLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQNGRAERRNRTLKEMMNAMLLSSGL 326
            ++ DRG EY+++TL ++F + GI   +TT    + +G AER NRTL      +L  SGL
Sbjct: 733 VIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRTLLHCSGL 791

BLAST of CSPI03G34520 vs. ExPASy TrEMBL
Match: A0A5D3DWC4 (Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G001190 PE=4 SV=1)

HSP 1 Score: 658.3 bits (1697), Expect = 2.5e-185
Identity = 363/603 (60.20%), Postives = 411/603 (68.16%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SR+TKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRFTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFVKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. ExPASy TrEMBL
Match: A0A5D3DJE2 (Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold142G004120 PE=4 SV=1)

HSP 1 Score: 657.9 bits (1696), Expect = 3.3e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 163 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 222

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 223 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 282

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 283 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 342

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 343 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 402

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 403 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 462

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 463 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 522

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 523 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 582

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 583 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 642

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 643 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 702

BLAST of CSPI03G34520 vs. ExPASy TrEMBL
Match: A0A5D3DSQ3 (Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold124G00930 PE=4 SV=1)

HSP 1 Score: 657.9 bits (1696), Expect = 3.3e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. ExPASy TrEMBL
Match: A0A5D3C5T2 (Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold70G00500 PE=4 SV=1)

HSP 1 Score: 657.9 bits (1696), Expect = 3.3e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. ExPASy TrEMBL
Match: A0A5D3C7Z8 (Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold265G001160 PE=4 SV=1)

HSP 1 Score: 657.9 bits (1696), Expect = 3.3e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. NCBI nr
Match: TYK27931.1 (ty1-copia retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 658.3 bits (1697), Expect = 5.2e-185
Identity = 363/603 (60.20%), Postives = 411/603 (68.16%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SR+TKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRFTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFVKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. NCBI nr
Match: KAA0056761.1 (ty1-copia retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 657.9 bits (1696), Expect = 6.8e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. NCBI nr
Match: TYK06518.1 (ty1-copia retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 657.9 bits (1696), Expect = 6.8e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. NCBI nr
Match: TYK07981.1 (ty1-copia retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 657.9 bits (1696), Expect = 6.8e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

BLAST of CSPI03G34520 vs. NCBI nr
Match: TYK02676.1 (ty1-copia retrotransposon protein [Cucumis melo var. makuwa])

HSP 1 Score: 657.9 bits (1696), Expect = 6.8e-185
Identity = 363/603 (60.20%), Postives = 410/603 (67.99%), Query Frame = 0

Query: 2   QFKAFGGKIKKNRLICYACGKEGHNSYQCNQRKGKSNNQRPTPQVNLAEPDDEVIAAVVE 61
           QFK  GG+IKK +L+CY CGKEGH SYQCNQRKG+  +Q+PTPQ NLAE D E+IAA+VE
Sbjct: 267 QFKTTGGQIKKKKLVCYVCGKEGHKSYQCNQRKGRP-SQKPTPQANLAEQDSEIIAAIVE 326

Query: 62  -----------------------------YEDAADGKHVFMGNSATAGVIRKGK------ 121
                                        YED ADG+ VFMGNSATAGVI KGK      
Sbjct: 327 ANLIENKTDWILDTGASRHFCTNRELLHDYEDTADGECVFMGNSATAGVIGKGKVILKLT 386

Query: 122 ----------------------------TGLKITREGDKVVLTKNGEFIGKGYLSNGLFV 181
                                        GLKI  EGDKVVLTKNG+F+GKGYLSNGLFV
Sbjct: 387 SGKTLSLSNVLYVPSLRRNLVSGSLLNRAGLKIVLEGDKVVLTKNGDFVGKGYLSNGLFV 446

Query: 182 LNNAFRNANVSCFAYITEFIDLWHGKLGHANFASIRKLKDLKLINACESHENGKCHVCVE 241
           LN    NAN S  AY+ E  +LWHG+LGH NFASIRKLKD++LIN  E+HE GKC +C+E
Sbjct: 447 LNTISMNANASSSAYLIESANLWHGRLGHVNFASIRKLKDMRLINTSETHETGKCSICIE 506

Query: 242 SKCVKKPYKSILTRSTELLELIHSDLADFGNTPSRGSKDYYVSFVDDFSRYTKIYFIKTK 301
           SK  KKP+K +  R+TELLELIHSDLADF  T SRG K+YYVSFVDD+SRYTKIY I+TK
Sbjct: 507 SKFHKKPFKPVEYRTTELLELIHSDLADFRTTASRGGKNYYVSFVDDYSRYTKIYLIRTK 566

Query: 302 DEASSMFMKFKVESENQLSKRIKRLRSDRGGEYSDRTLKEYFESNGIIHEFTTPYSRQQN 361
           +EA SMF+KFK ESENQL KRIKRLRSDRGGEYSD+TLKE+ ESNGIIHEFT PYS QQN
Sbjct: 567 NEAVSMFIKFKAESENQLGKRIKRLRSDRGGEYSDKTLKEFCESNGIIHEFTAPYSPQQN 626

Query: 362 GRAERRNRTLKEMMNAMLLSSGLSNNM--------------VPDRKLDKTSYELWKG-GP 421
           G AER+NRTLKEMMNAMLLSSGLS+NM              +P ++LDKT YELWKG  P
Sbjct: 627 GIAERKNRTLKEMMNAMLLSSGLSDNMWGEAVLSACFILNRIPHKRLDKTPYELWKGHAP 686

Query: 422 KLSYLKVWGCLGKVPFPALKKSTTG--------------------------------DVE 481
            LSYLKVWGCL KVP PALKK+T G                                D E
Sbjct: 687 NLSYLKVWGCLAKVPLPALKKTTVGPKTFDCIFIGYAQNSAAYRFMCLNDKTINESRDAE 746

Query: 482 FFEHVLPLKKSLSFSCQSENMHDLNNPKNVSDTP---EVDTSSIRYDLEPGRSKRQRTER 492
           FFEHV PLK+SL     S  MHD   P+ VS+ P    VDT ++  +LEP RSKRQRTE+
Sbjct: 747 FFEHVFPLKQSLYAPSLSNRMHD---PEIVSEIPVSETVDTPNLSCELEPRRSKRQRTEK 806

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109787.7e-3033.76Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041461.3e-2128.24Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT946.1e-1932.05Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW21.4e-1830.77Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q124913.1e-1529.10Transposon Ty2-B Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A0A5D3DWC42.5e-18560.20Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5D3DJE23.3e-18560.20Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5D3DSQ33.3e-18560.20Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5D3C5T23.3e-18560.20Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
A0A5D3C7Z83.3e-18560.20Ty1-copia retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
Match NameE-valueIdentityDescription
TYK27931.15.2e-18560.20ty1-copia retrotransposon protein [Cucumis melo var. makuwa][more]
KAA0056761.16.8e-18560.20ty1-copia retrotransposon protein [Cucumis melo var. makuwa][more]
TYK06518.16.8e-18560.20ty1-copia retrotransposon protein [Cucumis melo var. makuwa][more]
TYK07981.16.8e-18560.20ty1-copia retrotransposon protein [Cucumis melo var. makuwa][more]
TYK02676.16.8e-18560.20ty1-copia retrotransposon protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 486..491
NoneNo IPR availablePANTHERPTHR11439:SF324RIBONUCLEASE H-LIKE DOMAIN, GAG-PRE-INTEGRASE DOMAIN, GAG-POLYPEPTIDE OF LTR COPIA-TYPE-RELATEDcoord: 137..401
NoneNo IPR availablePANTHERPTHR11439GAG-POL-RELATED RETROTRANSPOSONcoord: 137..401
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 115..180
e-value: 4.0E-9
score: 36.2
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 189..349
e-value: 1.4E-31
score: 111.3
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 197..294
e-value: 6.6E-13
score: 48.8
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 191..366
score: 19.159683
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 17..32
score: 9.537339
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 7..39
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 192..336

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G34520.1CSPI03G34520.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding