CSPI05G01350 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI05G01350
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Gag-pol polyprotein
Location: Chr5: 1849038 .. 1853483 (+)
RNA-Seq Expression: CSPI05G01350
Synteny: CSPI05G01350
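The Location field combines a chromosome name, a start and end coordinate, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for GFF-style annotation, though this page does not state it), the genomic span of the gene can be derived as below. The function name `parse_location` is hypothetical and used only for illustration.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr5: 1849038 .. 1853483 (+)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("Chr5: 1849038 .. 1853483 (+)")
print(loc["chrom"], loc["strand"], loc["length"])  # Chr5 + 4446
```

Under that assumption the gene, introns included, spans 4446 bp on the forward strand of Chr5.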
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGAGGTAAATTTTGAGAAAGGTAAATTACCAAATGATAATGTAAATGAGTCCATTGCCCTGTTGACTAAACACTTCTCTAATGTTATAAGAAAAATTAAAAAACACAAACAATCATGGTTTAAATAATAGAGAACAAAACAATTATAGAAAGAGAGATTATGACAAACCAAATGGTAGATATAGCAAATCATTCAAATATAAGAGATGTGGTGGTCATGGCTATTATCAGGCTGAATGTCTGACTTACCTGAGGAGACAGAAGAAAAGTTTTGGTGCTACCCTATCTAATGAAGAGTCTGATGTAAGTAACGAGGAAGCAGAATATACCAATGTTGTTATCAGTATTACATCTAAAGATGAATCTGTATTTGGCTTTCAAGATTGAAATTGTCAAACAGAGGATACTCTATCATTTGGTCAGTTGCAACGTCAATGGAAGGAGAATTCGCCAGCTCGAGCTATTCAGAAGGAAAATATACAAAAGTTGCTGGATGAAAATCAACAATTATTAACTGTTATATCATCTTTGAAACCAAAATTGAAAGAAGCGAAAATAGAGTATGAAAAAATGTTCAAATTAGTAAAAATGTTGAATTCTGGCACAAATAGTCTAAATCAGATCCTAAATAGGGTAAAAATAGTTCAAATAAGCATGGTCTTGGTTATAACTCTTTGAAACATGCTTATAAATAGAAGAACTCAACAGTTTTTGTCCCAGCAAAAGACAAGAGTAATAATGAAGAAAGATCAGTAGAGGAAATAGACAATAAATTGATCTATAAAAAACCAAATGTCTGGATATGTCACTACTGTGGACAAAGGAGCCATATTAGACCCTTCTATCGCAAACTAGAAAGTGACATGATTTATCATCAGAAAATGTCATACATCTCAGAAGGATAAGCCAGTTGCTCGTATGCTTAAAAGAACTAGGATGATGTGAAAAATTAAGCAATCTTCTGTAAATTATCATATTGCTTTTACTACAATTAAAACTTCAAATGAAGATTGGTAATTTGATAGCGGTTGTTCGAGACACATGACTGGCAATTGCTCATACTTTTCTGAGTTGACAAAATGTTCTTGAGGACATGTAACTTTTGACGATGGAGCAAAAGGAAGAATTTTAGCCAAAAAAAAATATTGTTAATACAAATCTTGCTAGACTTGATGACGTGAGATATGTTGAAGGGCTCACAACCAATTTAATTAGCATCAGTCAATTGTGTGACCAAGGTTACACAATGAATTTCTGCAAAGAAGAATGTATTGTGACTGATAATAATAATGCTGAGATTATAAAAGGGATTTGTCAAATTGATAACTGTTATCATTGGATTTCTAACAAAAATATATTTTGCAACTTGTCTAAGGAAGAACAAACTCAGCTGTGGCATAGGAAACTTGGACATGCAAGTCTCAGTACAGTAAGCAACGATCTGAAACACGATGCTATCTTGAGAATACCAAATCTGGATATAAATAGTCAGTTATTCTATAAAGATTGTCAACGTGGCAAGAAAATCAGAACATCACATAAAAGTATTAGTGAATGCTATACTAATAGAGTTCTTGAATTTCTTCATATGGATCTAATGGGTCCAATGCAAACCAAGAGCCTTGACAGAAAGAAGTATGTATTTATTTGTGTTGATGATTATTCATGGTTTACGTGGGTAAGATTTTTAAGAGGTAAAGTTGAGATTTCTAAAGTGTGTATAAGTTTGTGCCTGAGTCTACAACGTGAGCAGGGAAAGAATATTGTGATAATCCGTAGTGATCATGGAAGGGACATTGAGAATGAAGAATTTGATAATTTATGTGAAAAAGAAGGGATTCATCGTGAATACTCAGCTCCTTTCTCAACAAAATGGAGTCGTAGAAAGAAAAAACAGAACATTACAGAAGATGGTCAGAATAATGCTTCATGCTAAGAACCTCTCACTACAATTTTGGGCTGAAGCTATGCATACAACTTGTCACATTCATAACAGA
ATTACAGTTCACACTAGCACCACATTAACAATATATGAACTATGGAAAGGTAAAAAAACCTAGTGTAATATTTTTTCATATATTTGAAAGTGTTTGCTTTATTCTTGCAGATCGAGATTATCATAGAAAATGAGATGTCAAGTCAGATAAAGGAATTTTCTTGGGATATTCTCAAAATAGCCGTGCATACAGAGTCTTCAATATACATACTCAATTTGTTATGGAAACTATCAATGTTGTGATAAATGATGGTGAGAAAATTTCGCTTAGAGGATGTGACGATGAAGATGCAGTATTCATGCAAAATACAAATATTCTTCTGAACCTGTGCAAAGTGTTGACAGTTTGTCTTCAACTAACGAGGATGATAAAAATGATAACAACAACTTTGATCCTCTTCAAAGAACCTTGGAAACTGAGACTGAAACAGAGACTCCCTCTAAACATGTTGCTCCATCATCACATGCCAAAAAGAATCAGTACCCGACAAGTTGTATTATATATCGGTGATCTCAACACTGGAGTCACAACCAAAAAGAAGAACATAATAGATTGTGCAAAGTTAATTGCTAATATCTTTTACACTTCATCTTTTGAACCTATTTTAGTTAGTGAAGCGCTTAAAGATGAGCTCTAGATAAATGCAATGCAAGAAAAGCTTCTGCAGTTTCGAAGAAATAATGTCTAGACCATGGTGCCTAAGCTTGAACCTACCAATGTCATCGGAACAAAATTGGATATTAATAAATAAAACTGATGAGAAGGACTGTGTTACAAGAACCAAGGCAAGATTGGTGGCTCAAGGTTATTCTCAAGTTGAAGTGTGGATTTTGATGAAACTTTTGCACCTGTAGCTAGACTTGAAGTCATATGCTTGTTGGTTATCATAGCGTGTATGCAAAAGATCAAATTATATCAAATGGATGTCAAAAGTGCCTTCTTAAATTGGTACTTAAATGAAGAAGTATATGTTGCACAACCTAAGGGATTCATAGATCCAGTTTTTTCTCAGCACGCGTCCAAATTGAATAAAACTCTTTATGGTCTCAAGCAAAGCACCTAGGGCATGGTATGGATGACTTACTATCTATCTTCGTCAGCAAGGATATGTTGGAGGAGGCACTGATAAAACTCTATTTATTAGTTGGACAAACAAAAACATAATTATGGCTCAACTATATGTTGATGATATCGTCTTTGGTGGTTTTCAAGACGATATTGTAAATAGTTTTATTAATATTATGTAATCTGAATTTGAGATGAGCATGGTTTGGGAATTGTCCTTTTTCCTTAGGTTGCAAATCAAGCAAGGCAAAAATGGTATATTTATCTCTTAAGAAAAATATGCCAAGAATATTATGAAGAAATTCGGTTTAGAAAAATCTCAGCACAAGAGAACACCAACAACTACATAGATTAAATTGACAAAAGATTATGAAGCTGAAGCTGTGGATCACAAATTATATAGAAGTATGATTGCTAGTTTGTTTTTTAACAGCTACATAGAATTACTTTCTAGTTATATCGTATGCTGAAGGAGTTTGTGCTCTCTTTCAGGTTGATCTAAGTAACTCACATCTTATGGCTGCTAAAATAATTATCAAATATGTTCATGGAACCTATGACTTTGGTATTTTGTACTCTTTTGTTACAAATTCATTTTTGGTTGGATATTATGATGCAGGTTGGGCTGGATGCTCTGATGATAGGAAAAACACTTTAGAAGGGTGTTTCTTTCTAGGAAACAATTTAATATCTTGGTTCAGTAAGAAGCAAAATTCTATTTCTCTATCTAAATCTGAAGTTGAATACATTGTTGTAGGAAGTGCACGTTCTCAACTAATTTGGATGAAACAAATGTTGTGTGAGTATGGTATTTCTCGAGATACCATGATTCTTTACAGTGATAGTATAAGTGTAATTGACATTTCGATGAATCTTGTTCAACACAGTAGAACTAAACATGTAAGCCTCGAACCCTAAGGAACTTAGATCTTTTT
AAGAGCTTGAAAAGATTTGATTTTAAGTGATGTTAATACAAGAAAGTGTTAGTTTTGTGTATCATAGAGAAAAAGGGTAAAACTAAGTTGTTAAGGTTTAAAAGTTAAATAAGTTAGTTAAAAATAAGATAAAATTTGGCTAAGTATGAAAGTTTAGCGTTTTAGCTTTTGCTTAGGCGTTTAGAGATTGCATTGGACGACCAAGAAAAGGAAAGAATCTCTAAAGAGGTTATTTGGCATCTTGGAGATGGGTCTAGGCGTTTTGAGATGGCTTAACGAAACGACTTTAGAGGTTATCATGCATTTGGAGATTATTTTTGGCTTGTGGCTAGGCGAGAAGAACTTCCAAGTGGGTGTCATTGGGTGAGAAAGGTGCTATGCAAGAAGACATAGAGGTTAGTAAGGGATTTTGGACGTATAAGGTTGGTAAGAAAGAAGGATCATAG

mRNA sequence

ATGAAGAGAGAACAAAACAATTATAGAAAGAGAGATTATGACAAACCAAATGGTAGATATAGCAAATCATTCAAATATAAGAGATGTGGTGGTCATGGCTATTATCAGGCTGAATGTCTGACTTACCTGAGGAGACAGAAGAAAAGTTTTGGTGCTACCCTATCTAATGAAGAGTCTGATGTAAGTAACGAGGAAGCAGAATATACCAATGTTGTTATCAGTATTACATCTAAAGATGAATCTTTGCAACGTCAATGGAAGGAGAATTCGCCAGCTCGAGCTATTCAGAAGGAAAATATACAAAAGTTGCTGGATGAAAATCAACAATTATTAACTGAAGAACAAACTCAGCTGTGGCATAGGAAACTTGGACATGCAAGTCTCAGTACAGTAAGCAACGATCTGAAACACGATGCTATCTTGAGAATACCAAATCTGGATATAAATAGTCAGTTATTCTATAAAGATTGTCAACGTGGCAAGAAAATCAGAACATCACATAAAAGTATTAGTGAATGCTATACTAATAGAGTTCTTGAATTTCTTCATATGGATCTAATGGGTCCAATGCAAACCAAGAGCCTTGACAGAAAGAAGTATGTATTTATTTGTGTTGATGATTATTCATGGTTTACGTGGATCGAGATTATCATAGAAAATGAGATGTCAAGTCAGATAAAGGAATTTTCTTGGGATATTCTCAAAATAGCCGTGCATACAGATATTCATGCAAAATACAAATATTCTTCTGAACCTGTGCAAAGTGTTGACAGTTTGTCTTCAACTAACGAGGATGATAAAAATGATAACAACAACTTTGATCCTCTTCAAAGAACCTTGGAAACTGAGACTGAAACAGAGACTCCCTCTAAACATGTTGCTCCATCATCACATGCCAAAAAGAATCACGTGTATGCAAAAGATCAAATTATATCAAATGGATGTCAAAAGTGCCTTCTTAAATTGCAAGGATATGTTGGAGGAGGCACTGATAAAACTCTATTTATTAGTTGGACAAACAAAAACATAATTATGGCTCAACTATATGTTGATGATATCGTCTTTGGTGGTTTTCAAGACGATATTGTTGATCTAAGTAACTCACATCTTATGGCTGCTAAAATAATTATCAAATATGTTCATGGAACCTATGACTTTGGTTGGGCTGGATGCTCTGATGATAGGAAAAACACTTTAGAAGGGTGTTTCTTTCTAGGAAACAATTTAATATCTTGGTTCAGTAAGAAGCAAAATTCTATTTCTCTATCTAAATCTGAAGTTGAATACATTGTTGTAGGAAGTGCACGTTCTCAACTAATTTGGATGAAACAAATGTTGTGTGAGTATGGTATTTCTCGAGATACCATGATTCTTTACAGTGATAGTATAAGTGTAATTGACATTTCGATGAATCTTGTTCAACACAGTAGAACTAAACATAGATTGCATTGGACGACCAAGAAAAGGAAAGAATCTCTAAAGAGGTTATTTGGCATCTTGGAGATGGGTCTAGGCGTTTTGAGATGGCTTAACGAAACGACTTTAGAGGCGAGAAGAACTTCCAAGTGGGTGTCATTGGGTGAGAAAGGTGCTATGCAAGAAGACATAGAGGTTAGTAAGGGATTTTGGACGTATAAGGTTGGTAAGAAAGAAGGATCATAG

Coding sequence (CDS)

ATGAAGAGAGAACAAAACAATTATAGAAAGAGAGATTATGACAAACCAAATGGTAGATATAGCAAATCATTCAAATATAAGAGATGTGGTGGTCATGGCTATTATCAGGCTGAATGTCTGACTTACCTGAGGAGACAGAAGAAAAGTTTTGGTGCTACCCTATCTAATGAAGAGTCTGATGTAAGTAACGAGGAAGCAGAATATACCAATGTTGTTATCAGTATTACATCTAAAGATGAATCTTTGCAACGTCAATGGAAGGAGAATTCGCCAGCTCGAGCTATTCAGAAGGAAAATATACAAAAGTTGCTGGATGAAAATCAACAATTATTAACTGAAGAACAAACTCAGCTGTGGCATAGGAAACTTGGACATGCAAGTCTCAGTACAGTAAGCAACGATCTGAAACACGATGCTATCTTGAGAATACCAAATCTGGATATAAATAGTCAGTTATTCTATAAAGATTGTCAACGTGGCAAGAAAATCAGAACATCACATAAAAGTATTAGTGAATGCTATACTAATAGAGTTCTTGAATTTCTTCATATGGATCTAATGGGTCCAATGCAAACCAAGAGCCTTGACAGAAAGAAGTATGTATTTATTTGTGTTGATGATTATTCATGGTTTACGTGGATCGAGATTATCATAGAAAATGAGATGTCAAGTCAGATAAAGGAATTTTCTTGGGATATTCTCAAAATAGCCGTGCATACAGATATTCATGCAAAATACAAATATTCTTCTGAACCTGTGCAAAGTGTTGACAGTTTGTCTTCAACTAACGAGGATGATAAAAATGATAACAACAACTTTGATCCTCTTCAAAGAACCTTGGAAACTGAGACTGAAACAGAGACTCCCTCTAAACATGTTGCTCCATCATCACATGCCAAAAAGAATCACGTGTATGCAAAAGATCAAATTATATCAAATGGATGTCAAAAGTGCCTTCTTAAATTGCAAGGATATGTTGGAGGAGGCACTGATAAAACTCTATTTATTAGTTGGACAAACAAAAACATAATTATGGCTCAACTATATGTTGATGATATCGTCTTTGGTGGTTTTCAAGACGATATTGTTGATCTAAGTAACTCACATCTTATGGCTGCTAAAATAATTATCAAATATGTTCATGGAACCTATGACTTTGGTTGGGCTGGATGCTCTGATGATAGGAAAAACACTTTAGAAGGGTGTTTCTTTCTAGGAAACAATTTAATATCTTGGTTCAGTAAGAAGCAAAATTCTATTTCTCTATCTAAATCTGAAGTTGAATACATTGTTGTAGGAAGTGCACGTTCTCAACTAATTTGGATGAAACAAATGTTGTGTGAGTATGGTATTTCTCGAGATACCATGATTCTTTACAGTGATAGTATAAGTGTAATTGACATTTCGATGAATCTTGTTCAACACAGTAGAACTAAACATAGATTGCATTGGACGACCAAGAAAAGGAAAGAATCTCTAAAGAGGTTATTTGGCATCTTGGAGATGGGTCTAGGCGTTTTGAGATGGCTTAACGAAACGACTTTAGAGGCGAGAAGAACTTCCAAGTGGGTGTCATTGGGTGAGAAAGGTGCTATGCAAGAAGACATAGAGGTTAGTAAGGGATTTTGGACGTATAAGGTTGGTAAGAAAGAAGGATCATAG

Protein sequence

MKREQNNYRKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEESDVSNEEAEYTNVVISITSKDESLQRQWKENSPARAIQKENIQKLLDENQQLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEIIIENEMSSQIKEFSWDILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSKHVAPSSHAKKNHVYAKDQIISNGCQKCLLKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDLSNSHLMAAKIIIKYVHGTYDFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMNLVQHSRTKHRLHWTTKKRKESLKRLFGILEMGLGVLRWLNETTLEARRTSKWVSLGEKGAMQEDIEVSKGFWTYKVGKKEGS*
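The protein sequence is the conceptual translation of the coding sequence above it. The sketch below illustrates that relationship with the standard genetic code, applied to the first and last codons of the CDS as listed on this page. It is a plain-Python illustration, not the pipeline that produced the annotation; the function name `translate` is hypothetical, and ambiguous bases are not handled.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame coding sequence; stop codons are written as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 8 and last 8 codons of the CDS shown above.
print(translate("ATGAAGAGAGAACAAAACAATTAT"))  # MKREQNNY
print(translate("GTTGGTAAGAAAGAAGGATCATAG"))  # VGKKEGS*
```

Both fragments agree with the start (MKREQNNY...) and end (...VGKKEGS*) of the protein sequence listed above.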
Homology
BLAST of CSPI05G01350 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 82.8 bits (203), Expect = 1.3e-14
Identity = 53/149 (35.57%), Positives = 74/149 (49.66%), Query Frame = 0

Query: 369  HLMAAKIIIKYVHGT-----------------YDFGWAGCSDDRKNTLEGCFFLGNNLIS 428
            H  A K I++Y+ GT                  D   AG  D+RK++    F      IS
Sbjct: 1147 HWEAVKWILRYLRGTTGDCLCFGGSDPILKGYTDADMAGDIDNRKSSTGYLFTFSGGAIS 1206

Query: 429  WFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGISRDTMILYSDSISVIDISMN 488
            W SK Q  ++LS +E EYI       ++IW+K+ L E G+ +   ++Y DS S ID+S N
Sbjct: 1207 WQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQELGLHQKEYVVYCDSQSAIDLSKN 1266

Query: 489  LVQHSRTKH---RLHWTTKK-RKESLKRL 497
             + H+RTKH   R HW  +    ESLK L
Sbjct: 1267 SMYHARTKHIDVRYHWIREMVDDESLKVL 1295

BLAST of CSPI05G01350 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 71.6 bits (174), Expect = 3.1e-11
Identity = 44/134 (32.84%), Positives = 66/134 (49.25%), Query Frame = 0

Query: 366  SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGN 425
            ++ H  A K +++Y+ GT D G                  WAG +DD  +T     +LG+
Sbjct: 1260 TDDHWNALKRVLRYLAGTPDHGIFLKKGNTLSLHAYSDADWAGDTDDYVSTNGYIVYLGH 1319

Query: 426  NLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGIS-RDTMILYSDSISVI 481
            + ISW SKKQ  +  S +E EY  V +  S+L W+  +L E GI      ++Y D++   
Sbjct: 1320 HPISWSSKKQKGVVRSSTEAEYRSVANTSSELQWICSLLTELGIQLSHPPVIYCDNVGAT 1379

BLAST of CSPI05G01350 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 70.9 bits (172), Expect = 5.3e-11
Identity = 44/134 (32.84%), Positives = 65/134 (48.51%), Query Frame = 0

Query: 366  SNSHLMAAKIIIKYVHGTYDFG------------------WAGCSDDRKNTLEGCFFLGN 425
            +  HL A K I++Y+ GT + G                  WAG  DD  +T     +LG+
Sbjct: 1277 TEEHLQALKRILRYLAGTPNHGIFLKKGNTLSLHAYSDADWAGDKDDYVSTNGYIVYLGH 1336

Query: 426  NLISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCEYGIS-RDTMILYSDSISVI 481
            + ISW SKKQ  +  S +E EY  V +  S++ W+  +L E GI      ++Y D++   
Sbjct: 1337 HPISWSSKKQKGVVRSSTEAEYRSVANTSSEMQWICSLLTELGIRLTRPPVIYCDNVGAT 1396

BLAST of CSPI05G01350 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 63.9 bits (154), Expect = 6.4e-09
Identity = 37/101 (36.63%), Positives = 55/101 (54.46%), Query Frame = 0

Query: 382  GTYDFGWAGCSDDRKNTLEGCFFLGN-NLISWFSKKQNSISLSKSEVEYIVVGSARSQLI 441
            G  D  WAG   DRK+T    F + + NLI W +K+QNS++ S +E EY+ +  A  + +
Sbjct: 1250 GYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTKRQNSVAASSTEAEYMALFEAVREAL 1309

Query: 442  WMKQMLCEYGISRDTMI-LYSDSISVIDISMNLVQHSRTKH 481
            W+K +L    I  +  I +Y D+   I I+ N   H R KH
Sbjct: 1310 WLKFLLTSINIKLENPIKIYEDNQGCISIANNPSCHKRAKH 1350

BLAST of CSPI05G01350 vs. ExPASy Swiss-Prot
Match: P92519 (Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 GN=AtMg00810 PE=4 SV=1)

HSP 1 Score: 57.0 bits (136), Expect = 7.9e-07
Identity = 24/57 (42.11%), Positives = 36/57 (63.16%), Query Frame = 0

Query: 385 DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIW 442
           D  WAGC+  R++T   C FLG N+ISW +K+Q ++S S +E EY  +    ++L W
Sbjct: 169 DSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

BLAST of CSPI05G01350 vs. ExPASy TrEMBL
Match: A0A5A7SYL2 (Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold285G003730 PE=4 SV=1)

HSP 1 Score: 218.4 bits (555), Expect = 7.5e-53
Identity = 196/703 (27.88%), Positives = 286/703 (40.68%), Query Frame = 0

Query: 9   RKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEES---DVSNEE 68
           R  D+ K      +SF+ + C    +YQAECLTYLRRQKK++ ATL +E+S   +V +  
Sbjct: 36  RNSDHSKKKEDIGRSFRCRECEEFSHYQAECLTYLRRQKKNYYATLFDEDSNDDEVDHSM 95

Query: 69  AEYTNVVISITSKDES---------------LQRQWKENSPARAIQKENIQKLLDENQQL 128
             +T  +  I S+ ES               L+   KE+S ARAIQKE IQ L++EN++L
Sbjct: 96  NAFTACITEINSEVESECSDNDEDEELTLEKLKMLRKEDSEARAIQKERIQDLMEENERL 155

Query: 129 --------------------------LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIP 188
                                     L +  ++L  +KLGH SL ++   ++++A++ IP
Sbjct: 156 MRVISFLKVKLREGRMVQVSIVSGLMLQQGVSKLHLKKLGHISLISLDKVIRNEAVVDIP 215

Query: 189 NLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFIC 248
           +L+IN + F  DCQ GK+ +TSH+S+ ECYT RVLE LH+DLMGPMQ +SL  KKYV + 
Sbjct: 216 SLNINDKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLMGPMQIESLGGKKYVLVV 275

Query: 249 VDDYSWFTWIEII---------IENEMSSQI-------------------------KEFS 308
           VDDYS  TW++ +         I +E ++ I                         K   
Sbjct: 276 VDDYSRITWVQFLKGKSDTSEGIHHEFAAPITPQQNGVVEQKNGTLQEMVRVMIHAKSLP 335

Query: 309 WDILKIAVHTDIHAKYKYSSEPVQSV----------------------DSLSSTNEDDKN 368
            +    A++T  H   K+  +  Q +                      ++++    D ++
Sbjct: 336 LNFWAEAINTACHIHNKWDVKSDQGIFLGYSLNSRAYKVFNIKSGTVMETINVVVNDFES 395

Query: 369 DNNNF-----------DPLQRTLETETETETPSKHVAPSS------------HAKKNHVY 428
           + N F           D     L+   + ++    + P+S              KK++  
Sbjct: 396 NVNQFNIEDDETSVTPDVTSTLLKEMPKDDSQPNTIEPTSVGNVLKDEYWIMPCKKSYYS 455

Query: 429 A-----KDQIISNG-------------------------------CQKCLLKL------- 435
           +     K ++++ G                                 K L  L       
Sbjct: 456 SSITTNKARLVAQGYAQVEGVDFDETFALVARLEAIRLLLSYIYKLNKALYGLKQPPRDW 515

BLAST of CSPI05G01350 vs. ExPASy TrEMBL
Match: A0A2K3MSR1 (Gag-protease polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g017007 PE=4 SV=1)

HSP 1 Score: 202.2 bits (513), Expect = 5.6e-48
Identity = 147/477 (30.82%), Positives = 208/477 (43.61%), Query Frame = 0

Query: 113  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISE 172
            E++ + WH+KLGH +  ++   +  +AI  +PNL I       +CQ GK+ +  H  +  
Sbjct: 591  EDEVRPWHQKLGHLNPRSMKKAISEEAIRGLPNLKIEEGSICGECQIGKQTKMPHPKLQH 650

Query: 173  CYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII-IENEMSSQIKEFSW 232
              T RV+E LHMDLMGP+QT+SL  K+Y ++ VD +S +TWI  I  ++E     K+   
Sbjct: 651  LTTTRVIELLHMDLMGPVQTESLGGKRYAYVVVDGFSRYTWINFIRKKSETFDVFKDLVI 710

Query: 233  DILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSK 292
             + +   + D   K++ S         L S+       ++   P Q  +  E +  T +K
Sbjct: 711  QLQREKNNVDHGKKFENS-----KFSDLCSSEGIIHEFSSPITPQQNGV-VERKNRTITK 770

Query: 293  HVAPSSHAKK---------------NHVYAKDQIISNGCQ--------------KCLLK- 352
                  HAKK               NHV  +    S   +              KC +  
Sbjct: 771  SARVMIHAKKLPQGFWAKAMNTACYNHVTLRSGTTSTLYELWKGRKPTVNVFGSKCYILS 830

Query: 353  ----------------LQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDL 412
                              GY        ++ S T   +    + +DD+     +DD+ D 
Sbjct: 831  DREPRSKMDPKNDEGIFLGYSTNSRAYRVYNSRTKTMMESINVVIDDVSSEAVEDDVEDA 890

Query: 413  --------------------------------------------SNSHLMAAKIIIKYVH 472
                                                          SHL   K I+KYV+
Sbjct: 891  VASIPVVKGSKTVEENKISTDTTSPDPTSAMPKKGVCVRYQAEPRMSHLAQVKEILKYVN 950

Query: 473  GTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLS 481
            GT D+G                  W GC+DDRK+T   CFFLGNNLISWFSKKQN +SLS
Sbjct: 951  GTSDYGILYTHGENSMLIGHYDAVWEGCADDRKSTSGACFFLGNNLISWFSKKQNCVSLS 1010

BLAST of CSPI05G01350 vs. ExPASy TrEMBL
Match: A0A5A7SLH7 (F5J5.1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold134G001330 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 2.3e-46
Identity = 170/589 (28.86%), Positives = 231/589 (39.22%), Query Frame = 0

Query: 109 QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHK 168
           QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HK
Sbjct: 248 QLTRSDQTWVWHKKLGHVSMRGLEKIIKNEAIMGIPDLDVNGKFFCRDCQIGKQTRSTHK 307

Query: 169 SISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII----------- 228
           S+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDYS +TW+  +           
Sbjct: 308 SLKECYTNRVLELLHMDLMGPMQTKSLRGKRYVMVVVDDYSRYTWVCFLKGKTDTVEICK 367

Query: 229 ----------------IENEMSSQIKEFSWDILK-------------------------- 288
                           I N+ + +     WD                             
Sbjct: 368 NLCLKLQREKGKKIARIRNDHADREYRQKWDARSEQGIFLGNSQNSRAFRVFNNRSESVM 427

Query: 289 ---IAVHTDIHAKYKYSSE-----PVQSVDSLSSTNEDDKNDNNNFDP------------ 348
                V  D+++  K  ++     P  S    +ST E  K DN++ DP            
Sbjct: 428 ETINVVINDLNSAIKQMNDKEDETPKMSEARTTSTVEGSKVDNSSDDPGKSDSSAGMQTR 487

Query: 349 ---------------------LQR-----------TLETETETE---------------- 408
                                +Q            TL ++TE                  
Sbjct: 488 RKEKIDYMKMVTDLCQYWLNAMQEELLQFKRNNVWTLVSKTEARLVAQGYTQVEGVDFDE 547

Query: 409 -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCL 468
                                 K    S H K  HVY  ++ +    Q            
Sbjct: 548 TFAPIARLEAIQLLLEVYVAQPKDFVDSKHPK--HVYKLNKALYGLKQAPRAWYERLTVY 607

Query: 469 LKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVD--------------- 481
            + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF  D+V+               
Sbjct: 608 SRGKGYSRGEIDKTLFIHRKSDQILVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMV 667

BLAST of CSPI05G01350 vs. ExPASy TrEMBL
Match: A0A5D3D0U1 (F5J5.1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold96G001300 PE=4 SV=1)

HSP 1 Score: 196.8 bits (499), Expect = 2.3e-46
Identity = 170/589 (28.86%), Positives = 231/589 (39.22%), Query Frame = 0

Query: 109 QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHK 168
           QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HK
Sbjct: 248 QLTRSDQTWVWHKKLGHVSMRGLEKIIKNEAIMGIPDLDVNGKFFCRDCQIGKQTRSTHK 307

Query: 169 SISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII----------- 228
           S+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDYS +TW+  +           
Sbjct: 308 SLKECYTNRVLELLHMDLMGPMQTKSLRGKRYVMVVVDDYSRYTWVCFLKGKTDTVEICK 367

Query: 229 ----------------IENEMSSQIKEFSWDILK-------------------------- 288
                           I N+ + +     WD                             
Sbjct: 368 NLCLKLQREKGKKIARIRNDHADREYRQKWDARSEQGIFLGNSQNSRAFRVFNNRSESVM 427

Query: 289 ---IAVHTDIHAKYKYSSE-----PVQSVDSLSSTNEDDKNDNNNFDP------------ 348
                V  D+++  K  ++     P  S    +ST E  K DN++ DP            
Sbjct: 428 ETINVVINDLNSAIKQMNDKEDETPKMSEARTTSTVEGSKVDNSSDDPGKSDSSAGMQTR 487

Query: 349 ---------------------LQR-----------TLETETETE---------------- 408
                                +Q            TL ++TE                  
Sbjct: 488 RKEKIDYMKMVTDLCQYWLNAMQEELLQFKRNNVWTLVSKTEARLVAQGYTQVEGVDFDE 547

Query: 409 -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCL 468
                                 K    S H K  HVY  ++ +    Q            
Sbjct: 548 TFAPIARLEAIQLLLEVYVAQPKDFVDSKHPK--HVYKLNKALYGLKQAPRAWYERLTVY 607

Query: 469 LKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVD--------------- 481
            + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF  D+V+               
Sbjct: 608 SRGKGYSRGEIDKTLFIHRKSDQILVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMV 667

BLAST of CSPI05G01350 vs. ExPASy TrEMBL
Match: A0A2Z6MGE8 (Uncharacterized protein OS=Trifolium subterraneum OX=3900 GN=TSUD_63730 PE=4 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 2.5e-40
Identity = 173/616 (28.08%), Positives = 245/616 (39.77%), Query Frame = 0

Query: 65   EAEYTNVVISITSK-DESLQR--QWKENSPARAIQKE-NIQKLLDENQQLLTEEQTQLWH 124
            +  +T     +TS+ DE L +  + K+N      Q+E N+   L     +  E++  LWH
Sbjct: 541  KVNFTKTECLVTSESDELLMKGVRSKDNCYLWVSQEEANLSTCL-----IAKEDEVTLWH 600

Query: 125  RKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLE 184
            +KLGH +L ++   +  +AI  +P L I       +CQ GK+ +  HK +    T RV E
Sbjct: 601  QKLGHLNLRSMKKVISEEAIRGLPQLKIVEGNICGECQIGKQTKMPHKMLQHSTTTRVFE 660

Query: 185  FLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEIIIENEMSSQIKEFSWDILKIAVHT 244
             LHMDLMGPMQ +SL  KKY  + VDD+S +TWI  I E       K  ++DI K     
Sbjct: 661  LLHMDLMGPMQVESLGGKKYADVVVDDFSRYTWINFIKE-------KSDTFDIFK----- 720

Query: 245  DIHAKYKYSSEPVQSVDS-LSSTNEDDKND----------------------NNNFDPLQ 304
            D+  + +   + V  V +     N+ D+N                       +  F P+ 
Sbjct: 721  DLCVQLQREKDNVNVVGTKWVYKNKSDENGVVTRNKVRLVAQGYAQIEGIDFDETFAPVA 780

Query: 305  R-----------------TLETETETETPSKHVAPSSHAKK----------NHVYAKDQI 364
            R                   + + ++   + ++      ++          NHVY K + 
Sbjct: 781  RLESIRLLLGVACILKFKLFQMDVKSAFLNGYLNEEVFVEQTKGFNETTLLNHVY-KLKK 840

Query: 365  ISNGCQKC----------LLKLQGYVGGGTDKTLFIS----------------------- 424
               G ++            L  QGY  GG DK LF+                        
Sbjct: 841  APYGLKQAPRAWYERLTEFLLSQGYRKGGNDKILFVKEEEEKSALLFEVVPLQIPMFDEL 900

Query: 425  ----WTNKNIIMAQ---------------------LYVDDIVFG-----------GFQDD 481
                W N + +  Q                     LY+     G           G +  
Sbjct: 901  WRCRWYNSSSVSEQGRNPCCSLSEHLANVQPELQMLYMSVERAGGCLMYTTAGIRGIRSK 960

BLAST of CSPI05G01350 vs. NCBI nr
Match: KAE8648228.1 (hypothetical protein Csa_018353 [Cucumis sativus])

HSP 1 Score: 218.4 bits (555), Expect = 1.6e-52
Identity = 102/108 (94.44%), Positives = 104/108 (96.30%), Query Frame = 0

Query: 110 LLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKS 169
           L  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKS
Sbjct: 41  LSKEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKS 100

Query: 170 ISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII 218
           ISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTW+  +
Sbjct: 101 ISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWVRFL 148

BLAST of CSPI05G01350 vs. NCBI nr
Match: KAA0035673.1 (gag-pol polyprotein [Cucumis melo var. makuwa])

HSP 1 Score: 218.4 bits (555), Expect = 1.6e-52
Identity = 196/703 (27.88%), Positives = 286/703 (40.68%), Query Frame = 0

Query: 9   RKRDYDKPNGRYSKSFKYKRCGGHGYYQAECLTYLRRQKKSFGATLSNEES---DVSNEE 68
           R  D+ K      +SF+ + C    +YQAECLTYLRRQKK++ ATL +E+S   +V +  
Sbjct: 36  RNSDHSKKKEDIGRSFRCRECEEFSHYQAECLTYLRRQKKNYYATLFDEDSNDDEVDHSM 95

Query: 69  AEYTNVVISITSKDES---------------LQRQWKENSPARAIQKENIQKLLDENQQL 128
             +T  +  I S+ ES               L+   KE+S ARAIQKE IQ L++EN++L
Sbjct: 96  NAFTACITEINSEVESECSDNDEDEELTLEKLKMLRKEDSEARAIQKERIQDLMEENERL 155

Query: 129 --------------------------LTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIP 188
                                     L +  ++L  +KLGH SL ++   ++++A++ IP
Sbjct: 156 MRVISFLKVKLREGRMVQVSIVSGLMLQQGVSKLHLKKLGHISLISLDKVIRNEAVVDIP 215

Query: 189 NLDINSQLFYKDCQRGKKIRTSHKSISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFIC 248
           +L+IN + F  DCQ GK+ +TSH+S+ ECYT RVLE LH+DLMGPMQ +SL  KKYV + 
Sbjct: 216 SLNINDKFFCGDCQVGKQTKTSHRSLKECYTIRVLELLHLDLMGPMQIESLGGKKYVLVV 275

Query: 249 VDDYSWFTWIEII---------IENEMSSQI-------------------------KEFS 308
           VDDYS  TW++ +         I +E ++ I                         K   
Sbjct: 276 VDDYSRITWVQFLKGKSDTSEGIHHEFAAPITPQQNGVVEQKNGTLQEMVRVMIHAKSLP 335

Query: 309 WDILKIAVHTDIHAKYKYSSEPVQSV----------------------DSLSSTNEDDKN 368
            +    A++T  H   K+  +  Q +                      ++++    D ++
Sbjct: 336 LNFWAEAINTACHIHNKWDVKSDQGIFLGYSLNSRAYKVFNIKSGTVMETINVVVNDFES 395

Query: 369 DNNNF-----------DPLQRTLETETETETPSKHVAPSS------------HAKKNHVY 428
           + N F           D     L+   + ++    + P+S              KK++  
Sbjct: 396 NVNQFNIEDDETSVTPDVTSTLLKEMPKDDSQPNTIEPTSVGNVLKDEYWIMPCKKSYYS 455

Query: 429 A-----KDQIISNG-------------------------------CQKCLLKL------- 435
           +     K ++++ G                                 K L  L       
Sbjct: 456 SSITTNKARLVAQGYAQVEGVDFDETFALVARLEAIRLLLSYIYKLNKALYGLKQPPRDW 515

BLAST of CSPI05G01350 vs. NCBI nr
Match: PNX93845.1 (gag-protease polyprotein, partial [Trifolium pratense])

HSP 1 Score: 202.2 bits (513), Expect = 1.2e-47
Identity = 147/477 (30.82%), Positives = 208/477 (43.61%), Query Frame = 0

Query: 113  EEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHKSISE 172
            E++ + WH+KLGH +  ++   +  +AI  +PNL I       +CQ GK+ +  H  +  
Sbjct: 591  EDEVRPWHQKLGHLNPRSMKKAISEEAIRGLPNLKIEEGSICGECQIGKQTKMPHPKLQH 650

Query: 173  CYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII-IENEMSSQIKEFSW 232
              T RV+E LHMDLMGP+QT+SL  K+Y ++ VD +S +TWI  I  ++E     K+   
Sbjct: 651  LTTTRVIELLHMDLMGPVQTESLGGKRYAYVVVDGFSRYTWINFIRKKSETFDVFKDLVI 710

Query: 233  DILKIAVHTDIHAKYKYSSEPVQSVDSLSSTNEDDKNDNNNFDPLQRTLETETETETPSK 292
             + +   + D   K++ S         L S+       ++   P Q  +  E +  T +K
Sbjct: 711  QLQREKNNVDHGKKFENS-----KFSDLCSSEGIIHEFSSPITPQQNGV-VERKNRTITK 770

Query: 293  HVAPSSHAKK---------------NHVYAKDQIISNGCQ--------------KCLLK- 352
                  HAKK               NHV  +    S   +              KC +  
Sbjct: 771  SARVMIHAKKLPQGFWAKAMNTACYNHVTLRSGTTSTLYELWKGRKPTVNVFGSKCYILS 830

Query: 353  ----------------LQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVDL 412
                              GY        ++ S T   +    + +DD+     +DD+ D 
Sbjct: 831  DREPRSKMDPKNDEGIFLGYSTNSRAYRVYNSRTKTMMESINVVIDDVSSEAVEDDVEDA 890

Query: 413  --------------------------------------------SNSHLMAAKIIIKYVH 472
                                                          SHL   K I+KYV+
Sbjct: 891  VASIPVVKGSKTVEENKISTDTTSPDPTSAMPKKGVCVRYQAEPRMSHLAQVKEILKYVN 950

Query: 473  GTYDFG------------------WAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLS 481
            GT D+G                  W GC+DDRK+T   CFFLGNNLISWFSKKQN +SLS
Sbjct: 951  GTSDYGILYTHGENSMLIGHYDAVWEGCADDRKSTSGACFFLGNNLISWFSKKQNCVSLS 1010

BLAST of CSPI05G01350 vs. NCBI nr
Match: TYK16854.1 (F5J5.1 [Cucumis melo var. makuwa])

HSP 1 Score: 196.8 bits (499), Expect = 4.8e-46
Identity = 170/589 (28.86%), Positives = 231/589 (39.22%), Query Frame = 0

Query: 109 QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHK 168
           QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HK
Sbjct: 248 QLTRSDQTWVWHKKLGHVSMRGLEKIIKNEAIMGIPDLDVNGKFFCRDCQIGKQTRSTHK 307

Query: 169 SISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII----------- 228
           S+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDYS +TW+  +           
Sbjct: 308 SLKECYTNRVLELLHMDLMGPMQTKSLRGKRYVMVVVDDYSRYTWVCFLKGKTDTVEICK 367

Query: 229 ----------------IENEMSSQIKEFSWDILK-------------------------- 288
                           I N+ + +     WD                             
Sbjct: 368 NLCLKLQREKGKKIARIRNDHADREYRQKWDARSEQGIFLGNSQNSRAFRVFNNRSESVM 427

Query: 289 ---IAVHTDIHAKYKYSSE-----PVQSVDSLSSTNEDDKNDNNNFDP------------ 348
                V  D+++  K  ++     P  S    +ST E  K DN++ DP            
Sbjct: 428 ETINVVINDLNSAIKQMNDKEDETPKMSEARTTSTVEGSKVDNSSDDPGKSDSSAGMQTR 487

Query: 349 ---------------------LQR-----------TLETETETE---------------- 408
                                +Q            TL ++TE                  
Sbjct: 488 RKEKIDYMKMVTDLCQYWLNAMQEELLQFKRNNVWTLVSKTEARLVAQGYTQVEGVDFDE 547

Query: 409 -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCL 468
                                 K    S H K  HVY  ++ +    Q            
Sbjct: 548 TFAPIARLEAIQLLLEVYVAQPKDFVDSKHPK--HVYKLNKALYGLKQAPRAWYERLTVY 607

Query: 469 LKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVD--------------- 481
            + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF  D+V+               
Sbjct: 608 SRGKGYSRGEIDKTLFIHRKSDQILVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMV 667

BLAST of CSPI05G01350 vs. NCBI nr
Match: KAA0032034.1 (F5J5.1 [Cucumis melo var. makuwa])

HSP 1 Score: 196.8 bits (499), Expect = 4.8e-46
Identity = 170/589 (28.86%), Positives = 231/589 (39.22%), Query Frame = 0

Query: 109 QLLTEEQTQLWHRKLGHASLSTVSNDLKHDAILRIPNLDINSQLFYKDCQRGKKIRTSHK 168
           QL   +QT +WH+KLGH S+  +   +K++AI+ IP+LD+N + F +DCQ GK+ R++HK
Sbjct: 248 QLTRSDQTWVWHKKLGHVSMRGLEKIIKNEAIMGIPDLDVNGKFFCRDCQIGKQTRSTHK 307

Query: 169 SISECYTNRVLEFLHMDLMGPMQTKSLDRKKYVFICVDDYSWFTWIEII----------- 228
           S+ ECYTNRVLE LHMDLMGPMQTKSL  K+YV + VDDYS +TW+  +           
Sbjct: 308 SLKECYTNRVLELLHMDLMGPMQTKSLRGKRYVMVVVDDYSRYTWVCFLKGKTDTVEICK 367

Query: 229 ----------------IENEMSSQIKEFSWDILK-------------------------- 288
                           I N+ + +     WD                             
Sbjct: 368 NLCLKLQREKGKKIARIRNDHADREYRQKWDARSEQGIFLGNSQNSRAFRVFNNRSESVM 427

Query: 289 ---IAVHTDIHAKYKYSSE-----PVQSVDSLSSTNEDDKNDNNNFDP------------ 348
                V  D+++  K  ++     P  S    +ST E  K DN++ DP            
Sbjct: 428 ETINVVINDLNSAIKQMNDKEDETPKMSEARTTSTVEGSKVDNSSDDPGKSDSSAGMQTR 487

Query: 349 ---------------------LQR-----------TLETETETE---------------- 408
                                +Q            TL ++TE                  
Sbjct: 488 RKEKIDYMKMVTDLCQYWLNAMQEELLQFKRNNVWTLVSKTEARLVAQGYTQVEGVDFDE 547

Query: 409 -------------------TPSKHVAPSSHAKKNHVYAKDQIISNGCQ---------KCL 468
                                 K    S H K  HVY  ++ +    Q            
Sbjct: 548 TFAPIARLEAIQLLLEVYVAQPKDFVDSKHPK--HVYKLNKALYGLKQAPRAWYERLTVY 607

Query: 469 LKLQGYVGGGTDKTLFISWTNKNIIMAQLYVDDIVFGGFQDDIVD--------------- 481
            + +GY  G  DKTLFI   +  I++AQ+YVDDI+FGGF  D+V+               
Sbjct: 608 SRGKGYSRGEIDKTLFIHRKSDQILVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMV 667

BLAST of CSPI05G01350 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 74.7 bits (182), Expect = 2.6e-13
Identity = 45/133 (33.83%), Positives = 66/133 (49.62%), Query Frame = 0

Query: 368 SHLMAAKIIIKYVHGTY------------------DFGWAGCSDDRKNTLEGCFFLGNNL 427
           +H  A   I+ Y+ GT                   D  +  C D R++T   C FLG +L
Sbjct: 412 AHQQAVMKILHYIKGTVGQGLFYSSQAEMQLQVFSDASFQSCKDTRRSTNGYCMFLGTSL 471

Query: 428 ISWFSKKQNSISLSKSEVEYIVVGSARSQLIWMKQMLCE--YGISRDTMILYSDSISVID 481
           ISW SKKQ  +S S +E EY  +  A  +++W+ Q   E    +S+ T +L+ D+ + I 
Sbjct: 472 ISWKSKKQQVVSKSSAEAEYRALSFATDEMMWLAQFFRELQLPLSKPT-LLFCDNTAAIH 531

BLAST of CSPI05G01350 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 57.0 bits (136), Expect = 5.6e-08
Identity = 24/57 (42.11%), Positives = 36/57 (63.16%), Query Frame = 0

Query: 385 DFGWAGCSDDRKNTLEGCFFLGNNLISWFSKKQNSISLSKSEVEYIVVGSARSQLIW 442
           D  WAGC+  R++T   C FLG N+ISW +K+Q ++S S +E EY  +    ++L W
Sbjct: 169 DSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTVSRSSTETEYRALALTAAELTW 225

The following BLAST results are available for this feature:

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P10978 | 1.3e-14 | 35.57 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q9ZT94 | 3.1e-11 | 32.84 | Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2 | 5.3e-11 | 32.84 | Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P04146 | 6.4e-09 | 36.63 | Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P92519 | 7.9e-07 | 42.11 | Uncharacterized mitochondrial protein AtMg00810 OS=Arabidopsis thaliana OX=3702 ...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7SYL2 | 7.5e-53 | 27.88 | Gag-pol polyprotein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold285G...
A0A2K3MSR1 | 5.6e-48 | 30.82 | Gag-protease polyprotein (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g0170...
A0A5A7SLH7 | 2.3e-46 | 28.86 | F5J5.1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold134G001330 PE=4 S...
A0A5D3D0U1 | 2.3e-46 | 28.86 | F5J5.1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold96G001300 PE=4 SV...
A0A2Z6MGE8 | 2.5e-40 | 28.08 | Uncharacterized protein OS=Trifolium subterraneum OX=3900 GN=TSUD_63730 PE=4 SV=...

NCBI nr
Match Name | E-value | Identity (%) | Description
KAE8648228.1 | 1.6e-52 | 94.44 | hypothetical protein Csa_018353 [Cucumis sativus]
KAA0035673.1 | 1.6e-52 | 27.88 | gag-pol polyprotein [Cucumis melo var. makuwa]
PNX93845.1 | 1.2e-47 | 30.82 | gag-protease polyprotein, partial [Trifolium pratense]
TYK16854.1 | 4.8e-46 | 28.86 | F5J5.1 [Cucumis melo var. makuwa]
KAA0032034.1 | 4.8e-46 | 28.86 | F5J5.1 [Cucumis melo var. makuwa]

TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G23160.1 | 2.6e-13 | 33.83 | cysteine-rich RLK (RECEPTOR-like protein kinase) 8
ATMG00810.1 | 5.6e-08 | 42.11 | DNA/RNA polymerases superfamily protein

InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 93..117
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 281..300
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 256..275
None | No IPR available | CDD | cd09272 | RNase_HI_RT_Ty1 | coord: 385..484; e-value: 4.20986E-35; score: 126.814
IPR025724 | GAG-pre-integrase domain | PFAM | PF13976 | gag_pre-integrs | coord: 112..161; e-value: 5.3E-7; score: 29.4
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 173..244; e-value: 5.5E-7; score: 31.0
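
The InterPro hits are given as coordinate ranges on the protein. As a rough sketch, assuming those ranges are 1-based and inclusive residue positions (a common convention, not stated on this page), the hits can be ordered along the protein and checked for overlap as follows. The helper names `span_length` and `overlap` are hypothetical.

```python
# (source, source term, start, end) copied from the InterPro table above.
FEATURES = [
    ("COILS", "Coil", 93, 117),
    ("MOBIDB_LITE", "mobidb-lite", 281, 300),
    ("MOBIDB_LITE", "mobidb-lite", 256, 275),
    ("CDD", "cd09272", 385, 484),
    ("PFAM", "PF13976", 112, 161),
    ("GENE3D", "3.30.420.10", 173, 244),
]

def span_length(start, end):
    """Number of residues in a 1-based inclusive range (assumed convention)."""
    return end - start + 1

def overlap(a, b):
    """Residues shared by two (start, end) ranges; 0 if they are disjoint."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)

# List the hits from N-terminus to C-terminus with their lengths.
ordered = sorted(FEATURES, key=lambda f: f[2])
for source, term, start, end in ordered:
    print(f"{start:>3}..{end:<3} {span_length(start, end):>3} aa  {source} {term}")

# Report adjacent hits that share residues.
for left, right in zip(ordered, ordered[1:]):
    shared = overlap(left[2:], right[2:])
    if shared:
        print(f"{left[1]} and {right[1]} overlap by {shared} residues")
```

Under that reading, the only adjacent hits that overlap are the coiled-coil prediction (93..117) and the GAG-pre-integrase domain (112..161), which share 6 residues.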

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
CSPI05G01350.1 | CSPI05G01350.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003676 | nucleic acid binding