CSPI04G09820 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI04G09820
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
LocationChr4: 7793028 .. 7794293 (-)
RNA-Seq ExpressionCSPI04G09820
SyntenyCSPI04G09820
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGACGTCTGTTCGTAGTCTTTTAGCCATTGCAGCTGCAAAACAGTGGCCCCTCTTACAAATGGATGTTAAAAATGCATTTCTCAATGGAACTCTATCTGAAGAAGTCTATATGAAACCACCACCTGGTACTACCCCGCCACATCAGAAAGTATGTCTTCTTCGACGAGCTCTCTATGGTCTCAAACAGGCTCCCCGAGCCTGGTTTGCAACTTTTAGCTCCACTATTACTCAACTTGGGTTCACTTCCAGCCCTCATGATTCAGCCTTGTTCACCCGCCAGACACCTAATGGTATTGTACTCCTTCTTTTATATGTTGATGACATGATTATTACAGGCGATGACCCTCAAGCTATATCTGAATTGCAATGCTACTTGGGAAAGCACTTTGAGATGAAGGATTTAGGACCTCTCAGTTATTTCCTTGGCCTTGAGATCACTTTCGTGTCTGATGGTTATTACTTATCTCAAGCTAAATATGCTTCTGACCTACTCAGTCGATCTGGTATTACTGATTCTACAACATCCTCAACACCTCTGGATCCAAATGTTCGACTTACTCCTTATGATGGTGTTCCCCTTGACGATCCTACTTTGTACCGGCAACTTGTTGGTAGCTTGATTTACTTAACTGTAACTCGCCCTGATATTGCGTTCGCAGTTCACATAGTCAGTCAATTCATGGCCGCTCCTCGTACTATTCATTTCACTGCTGTCCTTCGGATTCTTCGCTACATTAAAGGCACTTTGGGTCATGGTCTCCACTTCTCCTCACAGTCTTCTCTGGTTCTCTCTGGATTCTCTGATGCTGATTGGGCAGGAGATCCTACTGATAGAAGATCCACCACTGGCTATTGTTTTTATTTAGGTGATGCTCTTATCTCCTGGCGCAGCAAGAAACAATCTGTTGTTTCCCGGTCTAGCACTGAGTCTGAATATCGTGCTCTAGCTGATGCCACTTCAGAATTATTATGGCTTCGTTGGCTTCTTACTGATATGGGAGCCCCACAAACATCATCTACTACTCTCCATTGTGATAACCGCAGTGCCATTCAGATTGCACACAATGATGTATTTCATGAACGAACAAAGCACATAGAGAACGATTGTCATTTTGTTCGTCATCATCTTCAAAGCAACACTCTCCATCTTCAATCTATCTCTACCATTGATCAACCTGCAGATATCTTCACCAAAGCTCTCCATTCTCCTCGTTTCACTCTGTTACTTCACAAACTCAAGGTGGTTTCTACTCTACCAACTTGA

mRNA sequence

ATGACGTCTGTTCGTAGTCTTTTAGCCATTGCAGCTGCAAAACAGTGGCCCCTCTTACAAATGGATGTTAAAAATGCATTTCTCAATGGAACTCTATCTGAAGAAGTCTATATGAAACCACCACCTGGTACTACCCCGCCACATCAGAAAGTATGTCTTCTTCGACGAGCTCTCTATGGTCTCAAACAGGCTCCCCGAGCCTGGTTTGCAACTTTTAGCTCCACTATTACTCAACTTGGGTTCACTTCCAGCCCTCATGATTCAGCCTTGTTCACCCGCCAGACACCTAATGGTATTGTACTCCTTCTTTTATATGTTGATGACATGATTATTACAGGCGATGACCCTCAAGCTATATCTGAATTGCAATGCTACTTGGGAAAGCACTTTGAGATGAAGGATTTAGGACCTCTCAGTTATTTCCTTGGCCTTGAGATCACTTTCGTGTCTGATGGTTATTACTTATCTCAAGCTAAATATGCTTCTGACCTACTCAGTCGATCTGGTATTACTGATTCTACAACATCCTCAACACCTCTGGATCCAAATGTTCGACTTACTCCTTATGATGGTGTTCCCCTTGACGATCCTACTTTGTACCGGCAACTTGTTGGTAGCTTGATTTACTTAACTGTAACTCGCCCTGATATTGCGTTCGCAGTTCACATAGTCAGTCAATTCATGGCCGCTCCTCGTACTATTCATTTCACTGCTGTCCTTCGGATTCTTCGCTACATTAAAGGCACTTTGGGTCATGGTCTCCACTTCTCCTCACAGTCTTCTCTGGTTCTCTCTGGATTCTCTGATGCTGATTGGGCAGGAGATCCTACTGATAGAAGATCCACCACTGGCTATTGTTTTTATTTAGGTGATGCTCTTATCTCCTGGCGCAGCAAGAAACAATCTGTTGTTTCCCGGTCTAGCACTGAGTCTGAATATCGTGCTCTAGCTGATGCCACTTCAGAATTATTATGGCTTCGTTGGCTTCTTACTGATATGGGAGCCCCACAAACATCATCTACTACTCTCCATTGTGATAACCGCAGTGCCATTCAGATTGCACACAATGATGTATTTCATGAACGAACAAAGCACATAGAGAACGATTGTCATTTTGTTCGTCATCATCTTCAAAGCAACACTCTCCATCTTCAATCTATCTCTACCATTGATCAACCTGCAGATATCTTCACCAAAGCTCTCCATTCTCCTCGTTTCACTCTGTTACTTCACAAACTCAAGGTGGTTTCTACTCTACCAACTTGA

Coding sequence (CDS)

ATGACGTCTGTTCGTAGTCTTTTAGCCATTGCAGCTGCAAAACAGTGGCCCCTCTTACAAATGGATGTTAAAAATGCATTTCTCAATGGAACTCTATCTGAAGAAGTCTATATGAAACCACCACCTGGTACTACCCCGCCACATCAGAAAGTATGTCTTCTTCGACGAGCTCTCTATGGTCTCAAACAGGCTCCCCGAGCCTGGTTTGCAACTTTTAGCTCCACTATTACTCAACTTGGGTTCACTTCCAGCCCTCATGATTCAGCCTTGTTCACCCGCCAGACACCTAATGGTATTGTACTCCTTCTTTTATATGTTGATGACATGATTATTACAGGCGATGACCCTCAAGCTATATCTGAATTGCAATGCTACTTGGGAAAGCACTTTGAGATGAAGGATTTAGGACCTCTCAGTTATTTCCTTGGCCTTGAGATCACTTTCGTGTCTGATGGTTATTACTTATCTCAAGCTAAATATGCTTCTGACCTACTCAGTCGATCTGGTATTACTGATTCTACAACATCCTCAACACCTCTGGATCCAAATGTTCGACTTACTCCTTATGATGGTGTTCCCCTTGACGATCCTACTTTGTACCGGCAACTTGTTGGTAGCTTGATTTACTTAACTGTAACTCGCCCTGATATTGCGTTCGCAGTTCACATAGTCAGTCAATTCATGGCCGCTCCTCGTACTATTCATTTCACTGCTGTCCTTCGGATTCTTCGCTACATTAAAGGCACTTTGGGTCATGGTCTCCACTTCTCCTCACAGTCTTCTCTGGTTCTCTCTGGATTCTCTGATGCTGATTGGGCAGGAGATCCTACTGATAGAAGATCCACCACTGGCTATTGTTTTTATTTAGGTGATGCTCTTATCTCCTGGCGCAGCAAGAAACAATCTGTTGTTTCCCGGTCTAGCACTGAGTCTGAATATCGTGCTCTAGCTGATGCCACTTCAGAATTATTATGGCTTCGTTGGCTTCTTACTGATATGGGAGCCCCACAAACATCATCTACTACTCTCCATTGTGATAACCGCAGTGCCATTCAGATTGCACACAATGATGTATTTCATGAACGAACAAAGCACATAGAGAACGATTGTCATTTTGTTCGTCATCATCTTCAAAGCAACACTCTCCATCTTCAATCTATCTCTACCATTGATCAACCTGCAGATATCTTCACCAAAGCTCTCCATTCTCCTCGTTTCACTCTGTTACTTCACAAACTCAAGGTGGTTTCTACTCTACCAACTTGA

Protein sequence

MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYGLKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAISELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPLDPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVLRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKKQSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFHERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLPT*
Homology
BLAST of CSPI04G09820 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 365.9 bits (938), Expect = 6.0e-100
Identity = 194/422 (45.97%), Postives = 262/422 (62.09%), Query Frame = 0

Query: 2    TSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQK--VCLLRRALY 61
            TS+R +L +A  + WP+ Q+DV NAFL GTL+++VYM  PPG     +   VC LR+ALY
Sbjct: 1045 TSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALY 1104

Query: 62   GLKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAI 121
            GLKQAPRAW+    + +  +GF +S  D++LF  Q    IV +L+YVDD++ITG+DP  +
Sbjct: 1105 GLKQAPRAWYVELRNYLLTIGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLL 1164

Query: 122  SELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTP 181
                  L + F +KD   L YFLG+E   V  G +LSQ +Y  DLL+R+ +  +   +TP
Sbjct: 1165 HNTLDNLSQRFSVKDHEELHYFLGIEAKRVPTGLHLSQRRYILDLLARTNMITAKPVTTP 1224

Query: 182  LDPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAV 241
            + P+ +L+ Y G  L DPT YR +VGSL YL  TRPDI++AV+ +SQFM  P   H  A+
Sbjct: 1225 MAPSPKLSLYSGTKLTDPTEYRGIVGSLQYLAFTRPDISYAVNRLSQFMHMPTEEHLQAL 1284

Query: 242  LRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSK 301
             RILRY+ GT  HG+     ++L L  +SDADWAGD  D  ST GY  YLG   ISW SK
Sbjct: 1285 KRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGHHPISWSSK 1344

Query: 302  KQSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVF 361
            KQ  V RSSTE+EYR++A+ +SE+ W+  LLT++G   T    ++CDN  A  +  N VF
Sbjct: 1345 KQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGATYLCANPVF 1404

Query: 362  HERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTL 421
            H R KHI  D HF+R+ +QS  L +  +ST DQ AD  TK L    F     K+ V    
Sbjct: 1405 HSRMKHIAIDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRTAFQNFASKIGVTRVP 1464

BLAST of CSPI04G09820 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 357.1 bits (915), Expect = 2.8e-97
Identity = 187/422 (44.31%), Postives = 262/422 (62.09%), Query Frame = 0

Query: 2    TSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQK--VCLLRRALY 61
            TS+R +L +A  + WP+ Q+DV NAFL GTL++EVYM  PPG     +   VC LR+A+Y
Sbjct: 1028 TSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIY 1087

Query: 62   GLKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAI 121
            GLKQAPRAW+    + +  +GF +S  D++LF  Q    I+ +L+YVDD++ITG+D   +
Sbjct: 1088 GLKQAPRAWYVELRTYLLTVGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLL 1147

Query: 122  SELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTP 181
                  L + F +K+   L YFLG+E   V  G +LSQ +Y  DLL+R+ +  +   +TP
Sbjct: 1148 KHTLDALSQRFSVKEHEDLHYFLGIEAKRVPQGLHLSQRRYTLDLLARTNMLTAKPVATP 1207

Query: 182  LDPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAV 241
            +  + +LT + G  L DPT YR +VGSL YL  TRPD+++AV+ +SQ+M  P   H+ A+
Sbjct: 1208 MATSPKLTLHSGTKLPDPTEYRGIVGSLQYLAFTRPDLSYAVNRLSQYMHMPTDDHWNAL 1267

Query: 242  LRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSK 301
             R+LRY+ GT  HG+     ++L L  +SDADWAGD  D  ST GY  YLG   ISW SK
Sbjct: 1268 KRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGHHPISWSSK 1327

Query: 302  KQSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVF 361
            KQ  V RSSTE+EYR++A+ +SEL W+  LLT++G   +    ++CDN  A  +  N VF
Sbjct: 1328 KQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDNVGATYLCANPVF 1387

Query: 362  HERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTL 421
            H R KHI  D HF+R+ +QS  L +  +ST DQ AD  TK L    F     K+ V+   
Sbjct: 1388 HSRMKHIALDYHFIRNQVQSGALRVVHVSTHDQLADTLTKPLSRVAFQNFSRKIGVIKVP 1447

BLAST of CSPI04G09820 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 284.3 bits (726), Expect = 2.3e-75
Identity = 171/420 (40.71%), Postives = 250/420 (59.52%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQK--VCLLRRAL 60
            MTS+R++L++AA+    + Q+DVK AFL+G L EE+YM+ P G     +K  VC L ++L
Sbjct: 901  MTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSL 960

Query: 61   YGLKQAPRAWFATFSSTITQLGFTSSPHDSAL-FTRQTPNGIVLLLLYVDDMIITGDDPQ 120
            YGLKQAPR W+  F S +    +  +  D  + F R + N  ++LLLYVDDM+I G D  
Sbjct: 961  YGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKG 1020

Query: 121  AISELQCYLGKHFEMKDLGPLSYFLGLEIT--FVSDGYYLSQAKYASDLLSRSGITDSTT 180
             I++L+  L K F+MKDLGP    LG++I     S   +LSQ KY   +L R  + ++  
Sbjct: 1021 LIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKP 1080

Query: 181  SSTPLDPNVRL------TPYDGVPLDDPTLYRQLVGSLIYLTV-TRPDIAFAVHIVSQFM 240
             STPL  +++L      T  +         Y   VGSL+Y  V TRPDIA AV +VS+F+
Sbjct: 1081 VSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFL 1140

Query: 241  AAPRTIHFTAVLRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFY 300
              P   H+ AV  ILRY++GT G  L F   S  +L G++DAD AGD  +R+S+TGY F 
Sbjct: 1141 ENPGKEHWEAVKWILRYLRGTTGDCLCFGG-SDPILKGYTDADMAGDIDNRKSSTGYLFT 1200

Query: 301  LGDALISWRSKKQSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNR 360
                 ISW+SK Q  V+ S+TE+EY A  +   E++WL+  L ++G  Q     ++CD++
Sbjct: 1201 FSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQ-KEYVVYCDSQ 1260

Query: 361  SAIQIAHNDVFHERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTL 409
            SAI ++ N ++H RTKHI+   H++R  +   +L +  IST + PAD+ TK +   +F L
Sbjct: 1261 SAIDLSKNSMYHARTKHIDVRYHWIREMVDDESLKVLKISTNENPADMLTKVVPRNKFEL 1318

BLAST of CSPI04G09820 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 275.8 bits (704), Expect = 8.2e-73
Identity = 162/421 (38.48%), Postives = 244/421 (57.96%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            ++S R +L++       + QMDVK AFLNGTL EE+YM+ P G +     VC L +A+YG
Sbjct: 981  ISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDNVCKLNKAIYG 1040

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALF--TRQTPNGIVLLLLYVDDMIITGDDPQA 120
            LKQA R WF  F   + +  F +S  D  ++   +   N  + +LLYVDD++I   D   
Sbjct: 1041 LKQAARCWFEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTR 1100

Query: 121  ISELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSST 180
            ++  + YL + F M DL  + +F+G+ I    D  YLSQ+ Y   +LS+  + +    ST
Sbjct: 1101 MNNFKRYLMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVST 1160

Query: 181  PLDPNVRLTPYDGVPLDDP--TLYRQLVGSLIYLTV-TRPDIAFAVHIVSQFMAAPRTIH 240
            PL   +    Y+ +  D+   T  R L+G L+Y+ + TRPD+  AV+I+S++ +   +  
Sbjct: 1161 PLPSKIN---YELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSEL 1220

Query: 241  FTAVLRILRYIKGTLGHGLHFSSQSSL--VLSGFSDADWAGDPTDRRSTTGYCFYLGD-A 300
            +  + R+LRY+KGT+   L F    +    + G+ D+DWAG   DR+STTGY F + D  
Sbjct: 1221 WQNLKRVLRYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFN 1280

Query: 301  LISWRSKKQSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQ 360
            LI W +K+Q+ V+ SSTE+EY AL +A  E LWL++LLT +     +   ++ DN+  I 
Sbjct: 1281 LICWNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCIS 1340

Query: 361  IAHNDVFHERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHK 414
            IA+N   H+R KHI+   HF R  +Q+N + L+ I T +Q ADIFTK L + RF  L  K
Sbjct: 1341 IANNPSCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDK 1398

BLAST of CSPI04G09820 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 204.9 bits (520), Expect = 1.8e-51
Identity = 108/224 (48.21%), Postives = 145/224 (64.73%), Query Frame = 0

Query: 102 LLLYVDDMIITGDDPQAISELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYA 161
           LLLYVDD+++TG     ++ L   L   F MKDLGP+ YFLG++I     G +LSQ KYA
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 162 SDLLSRSGITDSTTSSTPLDPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAV 221
             +L+ +G+ D    STPL P    +        DP+ +R +VG+L YLT+TRPDI++AV
Sbjct: 63  EQILNNAGMLDCKPMSTPL-PLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 222 HIVSQFMAAPRTIHFTAVLRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRS 281
           +IV Q M  P    F  + R+LRY+KGT+ HGL+    S L +  F D+DWAG  + RRS
Sbjct: 123 NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 282 TTGYCFYLGDALISWRSKKQSVVSRSSTESEYRALADATSELLW 326
           TTG+C +LG  +ISW +K+Q  VSRSSTE+EYRALA   +EL W
Sbjct: 183 TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI04G09820 vs. ExPASy TrEMBL
Match: A0A5D3DG18 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold195G00850 PE=4 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 5.6e-226
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 926  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 985

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 986  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 1045

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 1046 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1105

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1106 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1165

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1166 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1225

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 1226 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 1285

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 1286 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 1345

Query: 421  T 422
            +
Sbjct: 1346 S 1346

BLAST of CSPI04G09820 vs. ExPASy TrEMBL
Match: A0A5D3DWU7 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold289G00160 PE=4 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 5.6e-226
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1   MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
           MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 342 MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 401

Query: 61  LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
           LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 402 LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 461

Query: 121 ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
           +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 462 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 521

Query: 181 DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
           DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 522 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 581

Query: 241 RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
           RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 582 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 641

Query: 301 QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
           QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 642 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 701

Query: 361 ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
           ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 702 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 761

Query: 421 T 422
           +
Sbjct: 762 S 762

BLAST of CSPI04G09820 vs. ExPASy TrEMBL
Match: A0A5A7SZ66 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold191G00840 PE=4 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 5.6e-226
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 926  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 985

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 986  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 1045

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 1046 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1105

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1106 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1165

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1166 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1225

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 1226 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 1285

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 1286 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 1345

Query: 421  T 422
            +
Sbjct: 1346 S 1346

BLAST of CSPI04G09820 vs. ExPASy TrEMBL
Match: A0A5A7VDW0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold17G00430 PE=4 SV=1)

HSP 1 Score: 793.1 bits (2047), Expect = 5.6e-226
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 894  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 953

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 954  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 1013

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 1014 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1073

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1074 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1133

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1134 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1193

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 1194 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 1253

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 1254 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 1313

Query: 421  T 422
            +
Sbjct: 1314 S 1314

BLAST of CSPI04G09820 vs. ExPASy TrEMBL
Match: A0A5A7UVX4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold274G006520 PE=4 SV=1)

HSP 1 Score: 785.0 bits (2026), Expect = 1.5e-223
Identity = 392/421 (93.11%), Postives = 403/421 (95.72%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 843  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 902

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 903  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 962

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YF+GLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 963  DLQCYLGKHFEMKDLGNLNYFIGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1022

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1023 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1082

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1083 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1142

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDM APQTS   LHCDN SAIQIAHNDVFH
Sbjct: 1143 QSVVSRSSTESEYRALADATSELIWLRWLLTDMRAPQTSPIILHCDNHSAIQIAHNDVFH 1202

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKV STLP
Sbjct: 1203 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVFSTLP 1262

Query: 421  T 422
            +
Sbjct: 1263 S 1263

BLAST of CSPI04G09820 vs. NCBI nr
Match: KAA0043149.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYK22647.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 793.1 bits (2047), Expect = 1.2e-225
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 926  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 985

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 986  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 1045

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 1046 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1105

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1106 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1165

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1166 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1225

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 1226 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 1285

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 1286 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 1345

Query: 421  T 422
            +
Sbjct: 1346 S 1346

BLAST of CSPI04G09820 vs. NCBI nr
Match: KAA0065380.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 793.1 bits (2047), Expect = 1.2e-225
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 894  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 953

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 954  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 1013

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 1014 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1073

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1074 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1133

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1134 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1193

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 1194 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 1253

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 1254 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 1313

Query: 421  T 422
            +
Sbjct: 1314 S 1314

BLAST of CSPI04G09820 vs. NCBI nr
Match: KAA0036574.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 793.1 bits (2047), Expect = 1.2e-225
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 926  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 985

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 986  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 1045

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 1046 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1105

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1106 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1165

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1166 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1225

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 1226 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 1285

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 1286 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 1345

Query: 421  T 422
            +
Sbjct: 1346 S 1346

BLAST of CSPI04G09820 vs. NCBI nr
Match: TYK12316.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] >TYK28111.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 793.1 bits (2047), Expect = 1.2e-225
Identity = 396/421 (94.06%), Postives = 406/421 (96.44%), Query Frame = 0

Query: 1   MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
           MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 342 MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 401

Query: 61  LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
           LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 402 LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 461

Query: 121 ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
           +LQCYLGKHFEMKDLG L+YFLGLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 462 DLQCYLGKHFEMKDLGNLNYFLGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 521

Query: 181 DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
           DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 522 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 581

Query: 241 RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
           RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 582 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 641

Query: 301 QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
           QSVVSRSSTESEYRALADATSEL+WLRWLLTDMGAPQTS T LHCDN SAIQIAHNDVFH
Sbjct: 642 QSVVSRSSTESEYRALADATSELIWLRWLLTDMGAPQTSPTILHCDNHSAIQIAHNDVFH 701

Query: 361 ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
           ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKVVSTLP
Sbjct: 702 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVVSTLP 761

Query: 421 T 422
           +
Sbjct: 762 S 762

BLAST of CSPI04G09820 vs. NCBI nr
Match: KAA0058316.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 785.0 bits (2026), Expect = 3.2e-223
Identity = 392/421 (93.11%), Postives = 403/421 (95.72%), Query Frame = 0

Query: 1    MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPHQKVCLLRRALYG 60
            MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPP QKVCLLRRALYG
Sbjct: 843  MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPPGTTPPPQKVCLLRRALYG 902

Query: 61   LKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 120
            LKQAPRAWFATFSSTITQLGFTSS HDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS
Sbjct: 903  LKQAPRAWFATFSSTITQLGFTSSSHDSALFTRQTPNGIVLLLLYVDDMIITGDDPQAIS 962

Query: 121  ELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDSTTSSTPL 180
            +LQCYLGKHFEMKDLG L+YF+GLEI+  S GYYLSQAKYASDLL+RSGITDS T STPL
Sbjct: 963  DLQCYLGKHFEMKDLGNLNYFIGLEISSSSSGYYLSQAKYASDLLNRSGITDSATFSTPL 1022

Query: 181  DPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVL 240
            DPNVRLTP+DGVPL+DPTLYRQLVGSLIYLTVTRPDIA+AVHIVSQFMAAPRTIHFTAVL
Sbjct: 1023 DPNVRLTPFDGVPLEDPTLYRQLVGSLIYLTVTRPDIAYAVHIVSQFMAAPRTIHFTAVL 1082

Query: 241  RILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 300
            RILRYIKGTLGHGL FSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK
Sbjct: 1083 RILRYIKGTLGHGLQFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALISWRSKK 1142

Query: 301  QSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIAHNDVFH 360
            QSVVSRSSTESEYRALADATSEL+WLRWLLTDM APQTS   LHCDN SAIQIAHNDVFH
Sbjct: 1143 QSVVSRSSTESEYRALADATSELIWLRWLLTDMRAPQTSPIILHCDNHSAIQIAHNDVFH 1202

Query: 361  ERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKALHSPRFTLLLHKLKVVSTLP 420
            ERTKHIENDCHFVRHHLQSNTLHLQ IST DQPADIFTKALHSPRFT L+HKLKV STLP
Sbjct: 1203 ERTKHIENDCHFVRHHLQSNTLHLQPISTTDQPADIFTKALHSPRFTQLIHKLKVFSTLP 1262

Query: 421  T 422
            +
Sbjct: 1263 S 1263

BLAST of CSPI04G09820 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 358.6 bits (919), Expect = 6.8e-99
Identity = 194/407 (47.67%), Postives = 257/407 (63.14%), Query Frame = 0

Query: 1   MTSVRSLLAIAAAKQWPLLQMDVKNAFLNGTLSEEVYMKPPP------GTTPPHQKVCLL 60
           +TSV+ +LAI+A   + L Q+D+ NAFLNG L EE+YMK PP      G + P   VC L
Sbjct: 173 LTSVKLILAISAIYNFTLHQLDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYL 232

Query: 61  RRALYGLKQAPRAWFATFSSTITQLGFTSSPHDSALFTRQTPNGIVLLLLYVDDMIITGD 120
           ++++YGLKQA R WF  FS T+   GF  S  D   F + T    + +L+YVDD+II  +
Sbjct: 233 KKSIYGLKQASRQWFLKFSVTLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSN 292

Query: 121 DPQAISELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYASDLLSRSGITDST 180
           +  A+ EL+  L   F+++DLGPL YFLGLEI   + G  + Q KYA DLL  +G+    
Sbjct: 293 NDAAVDELKSQLKSCFKLRDLGPLKYFLGLEIARSAAGINICQRKYALDLLDETGLLGCK 352

Query: 181 TSSTPLDPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAVHIVSQFMAAPRTI 240
            SS P+DP+V  + + G    D   YR+L+G L+YL +TR DI+FAV+ +SQF  APR  
Sbjct: 353 PSSVPMDPSVTFSAHSGGDFVDAKAYRRLIGRLMYLQITRLDISFAVNKLSQFSEAPRLA 412

Query: 241 HFTAVLRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRSTTGYCFYLGDALI 300
           H  AV++IL YIKGT+G GL +SSQ+ + L  FSDA +      RRST GYC +LG +LI
Sbjct: 413 HQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSLI 472

Query: 301 SWRSKKQSVVSRSSTESEYRALADATSELLWLRWLLTDMGAPQTSSTTLHCDNRSAIQIA 360
           SW+SKKQ VVS+SS E+EYRAL+ AT E++WL     ++  P +  T L CDN +AI IA
Sbjct: 473 SWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPTLLFCDNTAAIHIA 532

Query: 361 HNDVFHERTKHIENDCHFVRHHLQSNTLHLQSISTIDQPADIFTKAL 402
            N VFHERTKHIE+DCH VR           S    D+  D FT+ L
Sbjct: 533 TNAVFHERTKHIESDCHSVRERSVYQATLSYSFQAYDE-QDGFTEYL 578

BLAST of CSPI04G09820 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 204.9 bits (520), Expect = 1.3e-52
Identity = 108/224 (48.21%), Postives = 145/224 (64.73%), Query Frame = 0

Query: 102 LLLYVDDMIITGDDPQAISELQCYLGKHFEMKDLGPLSYFLGLEITFVSDGYYLSQAKYA 161
           LLLYVDD+++TG     ++ L   L   F MKDLGP+ YFLG++I     G +LSQ KYA
Sbjct: 3   LLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSGLFLSQTKYA 62

Query: 162 SDLLSRSGITDSTTSSTPLDPNVRLTPYDGVPLDDPTLYRQLVGSLIYLTVTRPDIAFAV 221
             +L+ +G+ D    STPL P    +        DP+ +R +VG+L YLT+TRPDI++AV
Sbjct: 63  EQILNNAGMLDCKPMSTPL-PLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISYAV 122

Query: 222 HIVSQFMAAPRTIHFTAVLRILRYIKGTLGHGLHFSSQSSLVLSGFSDADWAGDPTDRRS 281
           +IV Q M  P    F  + R+LRY+KGT+ HGL+    S L +  F D+DWAG  + RRS
Sbjct: 123 NIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTRRS 182

Query: 282 TTGYCFYLGDALISWRSKKQSVVSRSSTESEYRALADATSELLW 326
           TTG+C +LG  +ISW +K+Q  VSRSSTE+EYRALA   +EL W
Sbjct: 183 TTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI04G09820 vs. TAIR 10
Match: ATMG00240.1 (Gag-Pol-related retrotransposon family protein )

HSP 1 Score: 92.4 bits (228), Expect = 9.1e-19
Identity = 42/79 (53.16%), Postives = 57/79 (72.15%), Query Frame = 0

Query: 208 IYLTVTRPDIAFAVHIVSQFMAAPRTIHFTAVLRILRYIKGTLGHGLHFSSQSSLVLSGF 267
           +YLT+TRPD+ FAV+ +SQF +A RT    AV ++L Y+KGT+G GL +S+ S L L  F
Sbjct: 1   MYLTITRPDLTFAVNRLSQFSSASRTAQMQAVYKVLHYVKGTVGQGLFYSATSDLQLKAF 60

Query: 268 SDADWAGDPTDRRSTTGYC 287
           +D+DWA  P  RRS TG+C
Sbjct: 61  ADSDWASCPDTRRSVTGFC 79

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q94HW26.0e-10045.97Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT942.8e-9744.31Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P109782.3e-7540.71Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041468.2e-7338.48Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P925191.8e-5148.21Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A5D3DG185.6e-22694.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5D3DWU75.6e-22694.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5A7SZ665.6e-22694.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5A7VDW05.6e-22694.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
A0A5A7UVX41.5e-22393.11Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var.... [more]
Match NameE-valueIdentityDescription
KAA0043149.11.2e-22594.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KAA0065380.11.2e-22594.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KAA0036574.11.2e-22594.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
TYK12316.11.2e-22594.06Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
KAA0058316.13.2e-22393.11Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m... [more]
Match NameE-valueIdentityDescription
AT4G23160.16.8e-9947.67cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00810.11.3e-5248.21DNA/RNA polymerases superfamily protein [more]
ATMG00240.19.1e-1953.16Gag-Pol-related retrotransposon family protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 2..180
e-value: 7.9E-51
score: 173.0
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 2..314
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 266..404
e-value: 3.20484E-83
score: 249.308
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 11..372

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI04G09820.1CSPI04G09820.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
cellular_component GO:0016021 integral component of membrane
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding