CSPI03G23600.1 (mRNA) Cucumber (PI 183967) v1

Overview
NameCSPI03G23600.1
TypemRNA
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionGag/pol protein
LocationChr3: 20563838 .. 20568793 (-)
Sequence length3426
RNA-Seq ExpressionCSPI03G23600.1
SyntenyCSPI03G23600.1
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATTTGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTCTTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGACACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGGTCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAACCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCTAAAACAAAAGCTAGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAAGGAAAGACTCACAAACAGAACAAGGGTAAGAAAGTTGCATAAAAAGGTAAGTGTTATCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGGGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGAACACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCAGTGGGAGATTTAAAGTTGTTTTTACTTTTTAAAGATAGATATGTGATACTTAAGAATGTGTTATGTGTACCTCAAATGAGAAGAAATTTAATATCTATCTCTTGTATTTTCGAACAAATGAGAAGATATTCTAGTTTGTTATGTCATACTTGAAGACAACTTATATAAGTTAAGACCAACCAGAGCAACTTTTGTCTTAAATACTGAAATATTCAGAACAGTAGAAACTCAGAATAAAAGACAAAAGGTTTCTTCTAATGCCTATTTATGACACTTAAGACTTTGTCACATAAATCTCAATAGAATTGGGAGATTGGTTAAAAAAAAGTGGGCTTCTAAGTCAGTTAGAAAATAATACTTTACCTCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAGTAGTCTTGAAAAGTTCAAACAATATAAGGCTGAAGTAGAAAACAAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTTTGCATCTAGTACACCTCAACAGAACAGTGTATCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTTGCTCTATGATGAGTTTTTCTCAGATGCCAGATTCTTTTTGGAGATATGCTTTATAAACAGCTCCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAATCTAAAAGTGTTTTCAGAAAAACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCTAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACTAATGCCACATTCTTAGAGGAAGACCATATCAGTGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATCGCTATTTGGGTTTAATTGAAACTCAAGTCATCATTCCTAATGATGGCATAGAGGATCCATTAACCTATAAATAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCAAGCTGGTAAAGTACAGACTTTTAAGGCTCGACTTGTTGCAAAGGGTTATACTCAGAGAGAGGGAGTAGACTATGAGGAAACTCTATCTCCCGTTGCCATGCTAAAGTCAATTAGAATACTCTTATAAGTATATGGACTTACGATTCAAAGACTATTTGATAGAAAATGGAATCCAGTCACAACTCTCTGCACCTAGTACACCTCAACAGAACGGTGTAGCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTGCTCAGATGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCATATCAGTGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATAACTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGAGACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTTTTAGTTTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATGTTGTCTAGATATAAAATGCAGAATTCCAAAAAGGGTCTGCTGCCGTACAGATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATGTTATTACTAGACCTGACATTTGCTATTCAGTAGAGATGGTGAGTAGGTATCAATCCAATCCTGGACGTGATCACTGGACAGTCGTTAAAAATATTCTAAAATATCTTTGAAGAACAAAAGACTACATGCTCGTGTATGGTACAAAGGATCTTATCCTTATTGGATACACTGATTCAGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTCGAAGTCGTTTCAAATATGCATCTACCAATCACTTTATACTATGACAACAGTGGTGCGGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTACATCGTGGTGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGTTGTTCCTTTTACAAAAGTCATCACGGCTAAAGTGTTTGAGGGTCACCTACAAAGTTTAGGTCTACGATGTTTGTAA

mRNA sequence

ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATTTGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTCTTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGACACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGGTCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAACCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCTAAAACAAAAGCTAGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAAGGAAAGACTCACAAACAGAACAAGGGTAAGTGTTATCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGGGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGAACACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCATTAGAAAATAATACTTTACCTCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAGTAGTCTTGAAAAGTTCAAACAATATAAGGCTGAAGTAGAAAACAAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTTTGCATCTAGTACACCTCAACAGAACAGTAACGGTGTAGCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTGCTCAGATGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCATATCAGTGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATAACTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGAGACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTTTTAGTTTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATAACAAAAGACTACATGCTCGTGTATGGTACAAAGGATCTTATCCTTATTGGATACACTGATTCAGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTCGAAGTCGTTTCAAATATGCATCTACCAATCACTTTATACTATGACAACAGTGGTGCGGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTACATCGTGGTGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGTTGTTCCTTTTACAAAAGTCATCACGGCTAAAGTGTTTGAGGGTCACCTACAAAGTTTAGGTCTACGATGTTTGTAA

Coding sequence (CDS)

ATGAATAGCTCAATAGTTCAACTTTTAGCTTCCGAAAAACTTAATGGCGATAATTATGCGGCTTGGAAATCAAATCTTAACACAATACTAGTGGTTGACGATTTAAGATTTGTCTTAACTGAGGAATGTCCTCAAAACCCTGCCTCTAATGCTAACCGAACTAGTCGGGATGCATATGATCGATGGATAAAAGCTAATGAAAAAGCCCGTGTCTACATTCTTGCCAGCATGTCTGATGTATTGGCAAAGAAACATGAATCCTTAGCCACGGCTAAAGAGATTATGGATTCATTAAGGGGAATGTTTGGGCAACCAGAATGGTCCTTAAGACACGAGGCAGTCAAATACATTTACACTAAGCGTATGAAGGAAGGGACCTCTGTTAGAGAACATGTTCTGGACATGATGATGCACTTCAACATCGCTGAAGTGAATGGTGGTCCCATCGAAGAGGTTAATCAAGTTAGTTTTATCTTAGAGTCTCTTCCGAAGAGCTTCATTCCATTCCAAACGAATGCGTCTTTGAACAAGATAGAATTTAACTTGACAACCCTTCTGAATGAACTCCAGCGATTCCAAAACCTAACTATGGGTAAAGGAAAACAAGTGGAAGCAAATGTTGCTACCACAAAAAGAAAATTTATAAGAGGATCGTCCTCTAAAACAAAAGCTAGACCCTCAAAACCTAATGCTCAAATAAAAAAGAAGGGAAAAGGAAAGACTCACAAACAGAACAAGGGTAAGTGTTATCATTGTGGCCAAAACGGGCACTGGTTAAGAAACTGCCCAAAATACCTTGCAGAAAAAAAGGCAGAGAAGGAAACACAAGGTAAATATGATTTACTAGTTGTAGAAACATGTTTAGGGGAATATGAAAATTGTACCTGGATACTAGATTCAGGAGCCACTAACCATATTTGCTTCTCATTTCAGGAAAATAGTTCTTGGAAAAAGCTTTCAGAAGGCGAGAACACTCTCAAGGTTGGAACAGGAGAGATGGTCTCAGCTTCAGCATTAGAAAATAATACTTTACCTCCTTGTGAATCTTGTCTTGAAGGAAAAATGACCAAGAGATCTTTTACTGGAAAAGGTCTAAGAGCCAAAGGACCCTTAGAGCTCGTACATTCGGACCTTTGTGGACCAATGAATGTCAAAGCTCGAGGTGGATATGAATATTTCATTAGCTTCATTGATGATTATTCAAGGTATGGTCATATTTACCTAATACATCATAAGTCTAGTAGTCTTGAAAAGTTCAAACAATATAAGGCTGAAGTAGAAAACAAATTAGGTAAAACAATAAAAATACTTCGATCAGATCGAGGTGGAGAGTATATGGACTTACGATTCCGAGACTATTTAATAGAAAATGGAATCCAGTCACAACTCTTTGCATCTAGTACACCTCAACAGAACAGTAACGGTGTAGCAGAAAGAAGAAACCGGACCTTGTTAGACATGGTTCGCTCTATGATGAGTTTTGCTCAGATGCCAGATTCTTTTTGGGGATATGCTTTAGAAACAGCTGCTTATATTTTGAATAATGTTCCCTCTAAAAGTGTTTCAGAAACACCTTATGAGCTATGGAAAGGGCGTAAAGGAAGTTTACGTCATTTTAGAATTTGGGGTTGTCCAGCACACGTGTTGGTACAAAATCCAAAGAAATTGGAACATCGTTCAAAATTATGCTTTTTCATAGGTTATCCAAAAGAATCAAGAGGTGGTTTGTTTTATGATCCTCAAGAAAATAAAATATTTGTGTCAACAAATGCCACATTCTTAGAGGAAGACCATATCAGTGATCATCAACCTCGTAGTAAACTAGTATTAAAAGAAATTTCCAAAAGTGCTATAGATAAACCTAGTTCATCCACTAAGGTAGTTGATAAGACTAGGAAATCTGGTCAATCACATCCTTCTCAACAGTTGAGAGAGCCTCGACGTAGTGGGAGGGTTGTTCATCAGCCTGATAACTATTTGGGTTTAATTGAAACTCAAGTCGTCATACCTGACGATGGCATAGAGGATCCATTAACCTATAAACAGGCAATGAAAGATGTAGATCGTGACCAATGGATCAAAGCCATGGACCTCGAAATGGAGTCTATGTACTTTAATTCTGTCTGGACTCTAGTAGATCAACCAAATGACGTAAAACCTATTGGTTGTAAATGGATCTACAAGAGAAAACGAGACCATGCCGGTAAAGTACAGACTTTTAAGGCTCGACTTGTGGCAAAGGGTTATACCCAGAGAGAGGGAGTAGACTATGAGGAAACTTTCTCTCCTGTTGCCATGTTAAAGTCAATTAGAATACTCTTATCCATCGCCACTTTTTATGATTATGAAATTTGGCAGATGGATGTCAAGACAGCTTTTTTGAATGGTAATCTTGAAGAGAGTATCTATATGTCTCAACCAGAGGGGTTTATAGAACAAGATCAAGAACAAAAGGTTTGTAAGCTTAAAAAATCCATTTATGGATTAAAACAAGCTTCTAGATCCTGGAATATAAGATTTGAGACTGCGATCAAATCTTATGGCTTTGAACAAAATGTTGACGAGCCTTGTGTTTACAAAAAGGTCGTCAATTCCATTGTAGCATTTTTAGTTTTATATGTAGATGATATTCTACTTATTGGAAATGACGTAGGATATCTTACTGATATCAAGAAATGGCTAGCTATGCAATTTCAAATGAAAGATCTGGGAGATGCACAATACGTTCTCGGAATCCAAATTGTTCGAAACCGTAAGAACAAAACACTAGCCATGTCTCAAGCATCTTACATAGACAAAATATATGGAATTCATTTGTCAAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGAAATATTCCCTATGCTTCCGCTGTTGGAAGTTTAATGTATGCAATAACAAAAGACTACATGCTCGTGTATGGTACAAAGGATCTTATCCTTATTGGATACACTGATTCAGATTTTCAAACTGATAAAGATGCTAGAAAGTCTACATCAGGATCAGTATTTACTCTAAATGGAGGAGCAGTAGTTTGGAGAAGCATAAAGCAAACTTGTATAGCTGATTCCACAATGGAAGCTGAATACGTAGCGGCTTGTGAAGCAGCAAAAGAAGCAGTATGGCTAAGAAAATTCTTGACAGATTTCGAAGTCGTTTCAAATATGCATCTACCAATCACTTTATACTATGACAACAGTGGTGCGGTTGCAAATTCAAGAGAACCTAGAAGCCATAAAAGAGGAAAGCATATTGAACGCAAGTACCATCTTATCAGGGAAATTGTACATCGTGGTGACGTTGTAGTAACCCAGATTTCTTCTCAGCAAAACATTGTTGTTCCTTTTACAAAAGTCATCACGGCTAAAGTGTTTGAGGGTCACCTACAAAGTTTAGGTCTACGATGTTTGTAA

Protein sequence

MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKTHKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASALENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKIYGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAITKDYMLVYGTKDLILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKEAVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVVVTQISSQQNIVVPFTKVITAKVFEGHLQSLGLRCL*
Homology
BLAST of CSPI03G23600.1 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 578.6 bits (1490), Expect = 1.6e-163
Identity = 410/1336 (30.69%), Postives = 633/1336 (47.38%), Query Frame = 0

Query: 13   KLNGDN-YAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARV 72
            K NGDN ++ W+  +  +L+   L  VL  +  +     A        + W   +E+A  
Sbjct: 10   KFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKA--------EDWADLDERAAS 69

Query: 73   YILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVREH 132
             I   +SD +        TA+ I   L  ++     + +    K +Y   M EGT+   H
Sbjct: 70   AIRLHLSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSH 129

Query: 133  VLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQR 192
            +             G  IEE ++   +L SLP S+    T     K    L  + + L  
Sbjct: 130  LNVFNGLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILHGKTTIELKDVTSALLL 189

Query: 193  FQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGKTHKQNKGKCYH 252
             + +      Q +A +   + +  + SS+       +  A+ K K + K+  +N   CY+
Sbjct: 190  NEKMRKKPENQGQALITEGRGRSYQRSSNNY----GRSGARGKSKNRSKSRVRN---CYN 249

Query: 253  CGQNGHWLRNCPKYLAEKKAEKETQGKYD-----LLV-----VETCLGEYENC------- 312
            C Q GH+ R+CP     +K + ET G+ +      +V     V   + E E C       
Sbjct: 250  CNQPGHFKRDCPN---PRKGKGETSGQKNDDNTAAMVQNNDNVVLFINEEEECMHLSGPE 309

Query: 313  -TWILDSGATNH----------------------------------ICF----------- 372
              W++D+ A++H                                  IC            
Sbjct: 310  SEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGCTLVLK 369

Query: 373  ------------------------SFQENSSWKKLSEGENTLKVGT-------------- 432
                                    S+  N  W +L++G   +  G               
Sbjct: 370  DVRHVPDLRMNLISGIALDRDGYESYFANQKW-RLTKGSLVIAKGVARGTLYRTNAEICQ 429

Query: 433  GEMVSA------------------------------SALENNTLPPCESCLEGKMTKRSF 492
            GE+ +A                              S  +  T+ PC+ CL GK  + SF
Sbjct: 430  GELNAAQDEISVDLWHKRMGHMSEKGLQILAKKSLISYAKGTTVKPCDYCLFGKQHRVSF 489

Query: 493  TGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKFK 552
                 R    L+LV+SD+CGPM +++ GG +YF++FIDD SR   +Y++  K    + F+
Sbjct: 490  QTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQ 549

Query: 553  QYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAERR 612
            ++ A VE + G+ +K LRSD GGEY    F +Y   +GI+ +     TPQ   NGVAER 
Sbjct: 550  KFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQH--NGVAERM 609

Query: 613  NRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHF 672
            NRT+++ VRSM+  A++P SFWG A++TA Y++N  PS  ++ E P  +W  ++ S  H 
Sbjct: 610  NRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHL 669

Query: 673  RIWGCP--AHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLEED 732
            +++GC   AHV  +   KL+ +S  C FIGY  E  G   +DP + K+  S +  F E +
Sbjct: 670  KVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESE 729

Query: 733  -----HISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQ---------------- 792
                  +S+      +       S  + P+S+    D+  + G+                
Sbjct: 730  VRTAADMSEKVKNGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQLDEGV 789

Query: 793  ---SHPSQ---QLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRD 852
                HP+Q   Q +  RRS R   +   Y       V+I DD   +P + K+ +   +++
Sbjct: 790  EEVEHPTQGEEQHQPLRRSERPRVESRRYPS--TEYVLISDD--REPESLKEVLSHPEKN 849

Query: 853  QWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYT 912
            Q +KAM  EMES+  N  + LV+ P   +P+ CKW++K K+D   K+  +KARLV KG+ 
Sbjct: 850  QLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKGFE 909

Query: 913  QREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGF 972
            Q++G+D++E FSPV  + SIR +LS+A   D E+ Q+DVKTAFL+G+LEE IYM QPEGF
Sbjct: 910  QKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFLHGDLEEEIYMEQPEGF 969

Query: 973  IEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVY-KKVVNSIVAF 1032
                ++  VCKL KS+YGLKQA R W ++F++ +KS  + +   +PCVY K+   +    
Sbjct: 970  EVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFII 1029

Query: 1033 LVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQAS 1092
            L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LG++IVR R ++ L +SQ  
Sbjct: 1030 LLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEK 1089

Query: 1093 YIDKIY-----------------GIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI-- 1131
            YI+++                   + LSK+ CP T +E  +M  +PY+SAVGSLMYA+  
Sbjct: 1090 YIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVC 1149

BLAST of CSPI03G23600.1 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 408.3 bits (1048), Expect = 2.9e-112
Identity = 365/1424 (25.63%), Postives = 612/1424 (42.98%), Query Frame = 0

Query: 15   NGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYDRWIKANEKARVYIL 74
            +G+ YA WK  +  +L   D+  V+    P            +  D W KA   A+  I+
Sbjct: 12   DGEKYAIWKFRIRALLAEQDVLKVVDGLMP-----------NEVDDSWKKAERCAKSTII 71

Query: 75   ASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTKRMKEGTSVRE--HV 134
              +SD       S  TA++I+++L  ++ +   + +    K + + ++    S+    H+
Sbjct: 72   EYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLLSLKLSSEMSLLSHFHI 131

Query: 135  LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEFNLTTLLNELQRF 194
             D ++   +A   G  IEE++++S +L +LP  +    T A     E NLT     L   
Sbjct: 132  FDELISELLAA--GAKIEEMDKISHLLITLPSCYDGIIT-AIETLSEENLT-----LAFV 191

Query: 195  QNLTMGKGKQVEANVATTKRKFIRG---SSSKTKARPSKPNAQIKKKGKGKTHKQNKGKC 254
            +N  + +  +++ +   T +K +     +++ T       N   K K   K + + K KC
Sbjct: 192  KNRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKIFKGNSKYKVKC 251

Query: 255  YHCGQNGHWLRNCPKY---LAEKKAEKETQ-------GKYDLLVVETCLGEYENCTWILD 314
            +HCG+ GH  ++C  Y   L  K  E E Q       G   ++         +NC ++LD
Sbjct: 252  HHCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNTSVMDNCGFVLD 311

Query: 315  SGATNHI----------------------------------------------------- 374
            SGA++H+                                                     
Sbjct: 312  SGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDVLFCK 371

Query: 375  ----------------------------------------------CFSFQENS------ 434
                                                            +FQ  S      
Sbjct: 372  EAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPVINFQAYSINAKHK 431

Query: 435  ----SWKK----LSEGENTLKVGTGEMVSASALENN---TLPPCESCLEGKMTKRSFTGK 494
                 W +    +S+G+  L++    M S  +L NN   +   CE CL GK  +  F  K
Sbjct: 432  NNFRLWHERFGHISDGK-LLEIKRKNMFSDQSLLNNLELSCEICEPCLNGKQARLPF--K 491

Query: 495  GLR----AKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYLIHHKSSSLEKF 554
             L+     K PL +VHSD+CGP+         YF+ F+D ++ Y   YLI +KS     F
Sbjct: 492  QLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMF 551

Query: 555  KQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASSTPQQNSNGVAER 614
            + + A+ E      +  L  D G EY+    R + ++ GI   L    TPQ   NGV+ER
Sbjct: 552  QDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQ--LNGVSER 611

Query: 615  RNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSV---SETPYELWKGRKGSL 674
              RT+ +  R+M+S A++  SFWG A+ TA Y++N +PS+++   S+TPYE+W  +K  L
Sbjct: 612  MIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYL 671

Query: 675  RHFRIWGCPAHVLVQNPK-KLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVSTNATFLE 734
            +H R++G   +V ++N + K + +S    F+GY  E  G   +D    K  V+ +    E
Sbjct: 672  KHLRVFGATVYVHIKNKQGKFDDKSFKSIFVGY--EPNGFKLWDAVNEKFIVARDVVVDE 731

Query: 735  EDHISDHQPRSKLVLKEISKSAIDK--PSSSTKVV------------------DKTRKSG 794
             + ++    + + V  + SK + +K  P+ S K++                  D      
Sbjct: 732  TNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESEN 791

Query: 795  QSHPS--------------------QQLREPRRSGRVV------HQPDNYLG-------- 854
            ++ P+                    Q L++ + S +         + D++L         
Sbjct: 792  KNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKRDDHLNESKGSGNP 851

Query: 855  ----LIETQVVIPDDGIEDP---------------------LTYKQ-------------- 914
                  ET   + + GI++P                     ++Y +              
Sbjct: 852  NESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNEEDNSLNKVVLNAHT 911

Query: 915  AMKDV-----------DRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKR 974
               DV           D+  W +A++ E+ +   N+ WT+  +P +   +  +W++  K 
Sbjct: 912  IFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVKY 971

Query: 975  DHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKT 1034
            +  G    +KARLVA+G+TQ+  +DYEETF+PVA + S R +LS+   Y+ ++ QMDVKT
Sbjct: 972  NELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILSLVIQYNLKVHQMDVKT 1031

Query: 1035 AFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQ 1094
            AFLNG L+E IYM  P+G         VCKL K+IYGLKQA+R W   FE A+K   F  
Sbjct: 1032 AFLNGTLKEEIYMRLPQGI--SCNSDNVCKLNKAIYGLKQAARCWFEVFEQALKECEFVN 1091

Query: 1095 NVDEPCVY---KKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQY 1139
            +  + C+Y   K  +N  + +++LYVDD+++   D+  + + K++L  +F+M DL + ++
Sbjct: 1092 SSVDRCIYILDKGNINENI-YVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKH 1151

BLAST of CSPI03G23600.1 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 351.7 bits (901), Expect = 3.2e-95
Identity = 346/1467 (23.59%), Postives = 567/1467 (38.65%), Query Frame = 0

Query: 1    MNSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPA---SNANRTS 60
            +N++I+ +  S   KL   NY  W   ++ +    +L   L    P  PA   ++A    
Sbjct: 11   VNTNILNVNMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTPMPPATIGTDAVPRV 70

Query: 61   RDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVK 120
               Y RW + ++     IL ++S  +        TA +I ++LR ++  P +        
Sbjct: 71   NPDYTRWRRQDKLIYSAILGAISMSVQPAVSRATTAAQIWETLRKIYANPSYG------- 130

Query: 121  YIYTKRMKEGTSVREHV--LDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNA 180
                           HV  L  +  F+   + G P++   QV  +LE+LP  + P     
Sbjct: 131  ---------------HVTQLRFITRFDQLALLGKPMDHDEQVERVLENLPDDYKPVIDQI 190

Query: 181  SLNKIEFNLT----TLLNELQRFQNLTMGKGKQVEANVA------TTKRKFIRGSSSKTK 240
            +      +LT     L+N   +   L   +   + ANV       T + +  RG +    
Sbjct: 191  AAKDTPPSLTEIHERLINRESKLLALNSAEVVPITANVVTHRNTNTNRNQNNRGDNRNYN 250

Query: 241  ARPSKPNA-QIKKKGKGKTHKQNK---GKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKY 300
               ++ N+ Q    G    ++Q K   G+C  C   GH  + CP+    +    + Q   
Sbjct: 251  NNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQLHQFQSTTNQQQSTS 310

Query: 301  DLLVVE-----TCLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTL-------- 360
                 +          Y    W+LDSGAT+HI   F   S  +  + G++ +        
Sbjct: 311  PFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGDDVMIADGSTIP 370

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 371  ITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFPASFQVKDLNTG 430

Query: 421  ---------------KVGTGEMVSA--------------------------SALENNTLP 480
                            + + + VS                           S + N++LP
Sbjct: 431  VPLLQGKTKDELYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPSLAILNSVISNHSLP 490

Query: 481  ---------PCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISF 540
                      C  C   K  K  F+   + +  PLE ++SD+     + +   Y Y++ F
Sbjct: 491  VLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWS-SPILSIDNYRYYVIF 550

Query: 541  IDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIE 600
            +D ++RY  +Y +  KS   + F  +K+ VEN+    I  L SD GGE++ L  RDYL +
Sbjct: 551  VDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDNGGEFVVL--RDYLSQ 610

Query: 601  NGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNV 660
            +GI        TP+   NG++ER++R +++M  +++S A +P ++W YA   A Y++N +
Sbjct: 611  HGISHFTSPPHTPEH--NGLSERKHRHIVEMGLTLLSHASVPKTYWPYAFSVAVYLINRL 670

Query: 661  PSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIGYPKESR 720
            P+  +  ++P++   G+  +    +++GC  +  ++  N  KLE +SK C F+GY     
Sbjct: 671  PTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQCAFMGYSLTQS 730

Query: 721  GGLFYDPQENKIFVSTNATFLE---------------EDHISDHQPR---------SKLV 780
              L       +++ S +  F E               ++  SD  P          + LV
Sbjct: 731  AYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVSTSQEQRSDSAPNWPSHTTLPTTPLV 790

Query: 781  L------------------------------KEISKSAIDKPSSSTKVV---DKTRKSGQ 840
            L                                +  S+I  PSSS       +  + + Q
Sbjct: 791  LPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLPSSSISSPSSSEPTAPSHNGPQPTAQ 850

Query: 841  SHPSQQ-------LREPRRSGRVVHQPDNYLGLIETQVVIP------------------- 900
             H +Q        L  P  +    + P+    L ++ +  P                   
Sbjct: 851  PHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQSPISSPHIPTPSTSISEPNSPSSSS 910

Query: 901  ---------------------------------DDGI----------------EDPLTYK 960
                                              DGI                 +P T  
Sbjct: 911  TSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKDGIRKPNQKYSYATSLAANSEPRTAI 970

Query: 961  QAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQTF 1020
            QAMKD   D+W +AM  E+ +   N  W LV   P  V  +GC+WI+ +K +  G +  +
Sbjct: 971  QAMKD---DRWRQAMGSEINAQIGNHTWDLVPPPPPSVTIVGCRWIFTKKFNSDGSLNRY 1030

Query: 1021 KARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEE 1080
            KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L +
Sbjct: 1031 KARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTLTD 1090

Query: 1081 SIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYK 1139
             +YMSQP GF+++D+   VC+L+K+IYGLKQA R+W +   T + + GF  ++ +  ++ 
Sbjct: 1091 EVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLTVGFVNSISDTSLFV 1150

BLAST of CSPI03G23600.1 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 1.5e-92
Identity = 341/1469 (23.21%), Postives = 570/1469 (38.80%), Query Frame = 0

Query: 2    NSSIVQLLASE--KLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPAS----NANRTS 61
            N+SI+ +  S   KL   NY  W   ++ +    +L   L       PA+     A R +
Sbjct: 12   NTSILNVNMSNVTKLTSTNYLMWSRQVHALFDGYELAGFLDGSTTMPPATIGTDAAPRVN 71

Query: 62   RDAYDRWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVK 121
             D Y RW + ++     +L ++S  +        TA +I ++LR ++  P +   H    
Sbjct: 72   PD-YTRWKRQDKLIYSAVLGAISMSVQPAVSRATTAAQIWETLRKIYANPSYG--HVTQL 131

Query: 122  YIYTKRMKEGT-SVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNAS 181
                K+  +GT ++ +++  ++  F+   + G P++   QV  +LE+LP+ + P     +
Sbjct: 132  RTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPMDHDEQVERVLENLPEEYKPVIDQIA 191

Query: 182  LNKIEFNLTTLLNELQRFQNLTMGKGKQV----------EANVATTKRKFIRGSSSKTKA 241
                   LT +   L   ++  +                  N  TT        +++   
Sbjct: 192  AKDTPPTLTEIHERLLNHESKILAVSSATVIPITANAVSHRNTTTTNNNNNGNRNNRYDN 251

Query: 242  RPSKPNAQIKKKGKGKTHKQNK------GKCYHCGQNGHWLRNCPK---YLAEKKAEKET 301
            R +  N++  ++     H  N       GKC  CG  GH  + C +   +L+   +++  
Sbjct: 252  RNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFLSSVNSQQPP 311

Query: 302  ------QGKYDLLVVETCLGE-YENCTWILDSGATNHICFSFQENSSWKKLSEGENTL-- 361
                  Q + +L      LG  Y +  W+LDSGAT+HI   F   S  +  + G++ +  
Sbjct: 312  SPFTPWQPRANL-----ALGSPYSSNNWLLDSGATHHITSDFNNLSLHQPYTGGDDVMVA 371

Query: 362  ------------------------------------------------------------ 421
                                                                        
Sbjct: 372  DGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQV 431

Query: 422  ---------------------KVGTGEMVSA--------------------------SAL 481
                                  + + + VS                           S +
Sbjct: 432  KDLNTGVPLLQGKTKDELYEWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILNSVI 491

Query: 482  ENNTLP---------PCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGY 541
             N +L           C  CL  K  K  F+   + +  PLE ++SD+     + +   Y
Sbjct: 492  SNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWS-SPILSHDNY 551

Query: 542  EYFISFIDDYSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRF 601
             Y++ F+D ++RY  +Y +  KS   E F  +K  +EN+    I    SD GGE++ L  
Sbjct: 552  RYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVAL-- 611

Query: 602  RDYLIENGIQSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAA 661
             +Y  ++GI        TP+   NG++ER++R +++   +++S A +P ++W YA   A 
Sbjct: 612  WEYFSQHGISHLTSPPHTPEH--NGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAV 671

Query: 662  YILNNVPSKSVS-ETPYELWKGRKGSLRHFRIWGCPAHVLVQ--NPKKLEHRSKLCFFIG 721
            Y++N +P+  +  E+P++   G   +    R++GC  +  ++  N  KL+ +S+ C F+G
Sbjct: 672  YLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLG 731

Query: 722  YPKESRGGLFYDPQENKIFVSTNATF---------------------------------- 781
            Y       L    Q +++++S +  F                                  
Sbjct: 732  YSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLSPVQEQRRESSCVWSPHTTL 791

Query: 782  ------LEEDHISD--------HQPRSKLVLKEISKSAIDKPSSST-------------- 841
                  L     SD          P +     ++S S +D   SS+              
Sbjct: 792  PTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDSSFSSSFPSSPEPTAPRQNG 851

Query: 842  --KVVDKTRKSGQSHPS-----------------QQLREPRRSGR--------------- 901
                   T+   Q+H S                 Q L  P +S                 
Sbjct: 852  PQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPAQSSSSSPSPTTSASSSSTS 911

Query: 902  ------VVHQPDNYLGLIETQVVIP----------DDGI----------------EDPLT 961
                  ++H P     ++      P            GI                 +P T
Sbjct: 912  PTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIKPNPKYSLAVSLAAESEPRT 971

Query: 962  YKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLV-DQPNDVKPIGCKWIYKRKRDHAGKVQ 1021
              QA+KD   ++W  AM  E+ +   N  W LV   P+ V  +GC+WI+ +K +  G + 
Sbjct: 972  AIQALKD---ERWRNAMGSEINAQIGNHTWDLVPPPPSHVTIVGCRWIFTKKYNSDGSLN 1031

Query: 1022 TFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNL 1081
             +KARLVAKGY QR G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L
Sbjct: 1032 RYKARLVAKGYNQRPGLDYAETFSPVIKSTSIRIVLGVAVDRSWPIRQLDVNNAFLQGTL 1091

Query: 1082 EESIYMSQPEGFIEQDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCV 1139
             + +YMSQP GFI++D+   VCKL+K++YGLKQA R+W +     + + GF  +V +  +
Sbjct: 1092 TDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLTIGFVNSVSDTSL 1151

BLAST of CSPI03G23600.1 vs. ExPASy Swiss-Prot
Match: Q12490 (Transposon Ty1-BL Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY1B-BL PE=1 SV=1)

HSP 1 Score: 107.1 bits (266), Expect = 1.4e-21
Identity = 74/266 (27.82%), Postives = 128/266 (48.12%), Query Frame = 0

Query: 347 CESCLEGKMTK-RSFTGKGLRAKG---PLELVHSDLCGPMNVKARGGYEYFISFIDDYSR 406
           C  CL GK TK R   G  L+ +    P + +H+D+ GP++   +    YFISF D+ ++
Sbjct: 637 CPDCLIGKSTKHRHIKGSRLKYQNSYEPFQYLHTDIFGPVHNLPKSAPSYFISFTDETTK 696

Query: 407 YGHIYLIH--HKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQ 466
           +  +Y +H   + S L+ F    A ++N+   ++ +++ DRG EY +     +L +NGI 
Sbjct: 697 FRWVYPLHDRREDSILDVFTTILAFIKNQFQASVLVIQMDRGSEYTNRTLHKFLEKNGIT 756

Query: 467 SQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKS 526
                ++T    ++GVAER NRTLLD  R+ +  + +P+  W  A+E +  + N++ S  
Sbjct: 757 P--CYTTTADSRAHGVAERLNRTLLDDCRTQLQCSGLPNHLWFSAIEFSTIVRNSLASP- 816

Query: 527 VSETPYELWKGRKGSLRHFRI----------WGCPAHVLVQNPKKLEHRSKLCFFIGYP- 586
                    K +K + +H  +          +G P  V   NP    H   +  +  +P 
Sbjct: 817 ---------KSKKSARQHAGLAGLDISTLLPFGQPVIVNDHNPNSKIHPRGIPGYALHPS 876

Query: 587 KESRGGLFYDPQENKIFVSTNATFLE 596
           + S G + Y P   K   +TN   L+
Sbjct: 877 RNSYGYIIYLPSLKKTVDTTNYVILQ 890

BLAST of CSPI03G23600.1 vs. ExPASy TrEMBL
Match: E2GK51 (Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1)

HSP 1 Score: 1884.0 bits (4879), Expect = 0.0e+00
Identity = 971/1296 (74.92%), Postives = 1040/1296 (80.25%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYD
Sbjct: 1    MNTSIVQLLASEKLNGDNYSAWKSNLNTILVVDDLRFVLTEECPQAPALNANRTVREAYD 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR MFGQP WSLRHEA+K+IYTK
Sbjct: 61   RWVKANDKARVYILASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLRHEAIKHIYTK 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEF
Sbjct: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQTNASLNKIEF 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGK 240
            NLTTLLNELQRFQNLT+ KGK+VEANVA TKRKFIRGSSSK K  PSK  AQ+KKKGKGK
Sbjct: 181  NLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSSKNKVGPSK--AQMKKKGKGK 240

Query: 241  T-------HKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYEN 300
                       +KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCL E + 
Sbjct: 241  APNTSKVKKNADKGKCFHCNQDGHWKRNCPKYLAEKKAEKATQGKYDLLVVETCLVECDA 300

Query: 301  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASA--------------- 360
             TWILDSGATNHICFSFQE SSWKKL EGE TLKVGTGE+VSA A               
Sbjct: 301  STWILDSGATNHICFSFQETSSWKKLKEGEITLKVGTGEVVSAEAVGDLTLFFQDRYLIL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  KDVLYVPLMKRNLISIACILEHIYTISFEVNEVFILCKGIQICSAIRENNLYKLRPTRAN 420

Query: 421  --------------------------------------------------LENNTLPPCE 480
                                                              LE+N+LPPCE
Sbjct: 421  VVLNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPPCE 480

Query: 481  SCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYL 540
            SCLEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL
Sbjct: 481  SCLEGKMTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYL 540

Query: 541  IHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASST 600
            +HHKS S EKFK+YKAEVEN++GKTIK LRSDRGGEYMD +F+DYLIE GIQSQL A ST
Sbjct: 541  LHHKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPST 600

Query: 601  PQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYEL 660
            PQQ  NGV+ERRNRTLLDMVRSMMS+AQ+PDSFWGYALETA +ILNNVPSKSV ETPYEL
Sbjct: 601  PQQ--NGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYEL 660

Query: 661  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVS 720
            WKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVS
Sbjct: 661  WKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVS 720

Query: 721  TNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRR 780
            TNATFLEEDH  +HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRR
Sbjct: 721  TNATFLEEDHXRNHQPRSKIVLKEMFKNATDKPSSSTKVVDKANISDQSHTSQELRVPRR 780

Query: 781  SGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSV 840
            SGRVVHQP+ YLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSV
Sbjct: 781  SGRVVHQPNRYLGLVETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSV 840

Query: 841  WTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLK 900
            WTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLK
Sbjct: 841  WTLVDLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLK 900

Query: 901  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYG 960
            SIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYG
Sbjct: 901  SIRILLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYG 960

Query: 961  LKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLT 1020
            LKQASRSWNIRF+TAIKSYGFEQNVDEPCVYKK+VNS+VAFL+LYVDDILLIGNDV YLT
Sbjct: 961  LKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLT 1020

Query: 1021 DIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-------------- 1080
            D+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+              
Sbjct: 1021 DVKKWLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQL 1080

Query: 1081 ---YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------- 1110
               +GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYA+                     
Sbjct: 1081 PFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNP 1140

BLAST of CSPI03G23600.1 vs. ExPASy TrEMBL
Match: A0A5A7SMH8 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G002560 PE=4 SV=1)

HSP 1 Score: 1630.2 bits (4220), Expect = 0.0e+00
Identity = 851/1342 (63.41%), Postives = 985/1342 (73.40%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1140
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. ExPASy TrEMBL
Match: A0A5D3CPJ6 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G00040 PE=4 SV=1)

HSP 1 Score: 1630.2 bits (4220), Expect = 0.0e+00
Identity = 851/1342 (63.41%), Postives = 985/1342 (73.40%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1140
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. ExPASy TrEMBL
Match: A0A5A7TZD7 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G001300 PE=4 SV=1)

HSP 1 Score: 1626.3 bits (4210), Expect = 0.0e+00
Identity = 852/1342 (63.49%), Postives = 984/1342 (73.32%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK N NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWK-NTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1140
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. ExPASy TrEMBL
Match: A0A5A7TWB9 (Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G00310 PE=4 SV=1)

HSP 1 Score: 1606.7 bits (4159), Expect = 0.0e+00
Identity = 840/1330 (63.16%), Postives = 973/1330 (73.16%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1128
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. NCBI nr
Match: ADJ18449.1 (gag/pol protein, partial [Bryonia dioica])

HSP 1 Score: 1884.0 bits (4879), Expect = 0.0e+00
Identity = 971/1296 (74.92%), Postives = 1040/1296 (80.25%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            MN+SIVQLLASEKLNGDNY+AWKSNLNTILVVDDLRFVLTEECPQ PA NANRT R+AYD
Sbjct: 1    MNTSIVQLLASEKLNGDNYSAWKSNLNTILVVDDLRFVLTEECPQAPALNANRTVREAYD 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW+KAN+KARVYILASM+DVLAKKH+S+ATAK IMDSLR MFGQP WSLRHEA+K+IYTK
Sbjct: 61   RWVKANDKARVYILASMTDVLAKKHDSIATAKGIMDSLREMFGQPSWSLRHEAIKHIYTK 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RMKEGTSVREHVLDMMMHFNIAEVNGGPI+E NQVSFIL+SLPKSF+PFQTNASLNKIEF
Sbjct: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIDEANQVSFILQSLPKSFVPFQTNASLNKIEF 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKPNAQIKKKGKGK 240
            NLTTLLNELQRFQNLT+ KGK+VEANVA TKRKFIRGSSSK K  PSK  AQ+KKKGKGK
Sbjct: 181  NLTTLLNELQRFQNLTLSKGKEVEANVAVTKRKFIRGSSSKNKVGPSK--AQMKKKGKGK 240

Query: 241  T-------HKQNKGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVETCLGEYEN 300
                       +KGKC+HC Q+GHW RNCPKYLAEKKAEK TQGKYDLLVVETCL E + 
Sbjct: 241  APNTSKVKKNADKGKCFHCNQDGHWKRNCPKYLAEKKAEKATQGKYDLLVVETCLVECDA 300

Query: 301  CTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSASA--------------- 360
             TWILDSGATNHICFSFQE SSWKKL EGE TLKVGTGE+VSA A               
Sbjct: 301  STWILDSGATNHICFSFQETSSWKKLKEGEITLKVGTGEVVSAEAVGDLTLFFQDRYLIL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  KDVLYVPLMKRNLISIACILEHIYTISFEVNEVFILCKGIQICSAIRENNLYKLRPTRAN 420

Query: 421  --------------------------------------------------LENNTLPPCE 480
                                                              LE+N+LPPCE
Sbjct: 421  VVLNTEMFRTLETQNKKQKVSSNAYLWHLRLGHINLNRIERLVKSGILNQLEDNSLPPCE 480

Query: 481  SCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDDYSRYGHIYL 540
            SCLEGKMTKRSFTGKGLRAK PLELVHSDLCGPMNVKARGGYEYFISFIDD+SRYGH+YL
Sbjct: 481  SCLEGKMTKRSFTGKGLRAKVPLELVHSDLCGPMNVKARGGYEYFISFIDDFSRYGHVYL 540

Query: 541  IHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGIQSQLFASST 600
            +HHKS S EKFK+YKAEVEN++GKTIK LRSDRGGEYMD +F+DYLIE GIQSQL A ST
Sbjct: 541  LHHKSESFEKFKEYKAEVENEIGKTIKTLRSDRGGEYMDSKFQDYLIEFGIQSQLSAPST 600

Query: 601  PQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVSETPYEL 660
            PQQ  NGV+ERRNRTLLDMVRSMMS+AQ+PDSFWGYALETA +ILNNVPSKSV ETPYEL
Sbjct: 601  PQQ--NGVSERRNRTLLDMVRSMMSYAQLPDSFWGYALETAIHILNNVPSKSVLETPYEL 660

Query: 661  WKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYDPQENKIFVS 720
            WKGRK SLR+FRIWGCPAHVLVQNPKKLE RSKLC F+GYPKESRGGLFY PQENK+FVS
Sbjct: 661  WKGRKSSLRYFRIWGCPAHVLVQNPKKLEPRSKLCLFVGYPKESRGGLFYHPQENKVFVS 720

Query: 721  TNATFLEEDHISDHQPRSKLVLKEISKSAIDKPSSSTKVVDKTRKSGQSHPSQQLREPRR 780
            TNATFLEEDH  +HQPRSK+VLKE+ K+A DKPSSSTKVVDK   S QSH SQ+LR PRR
Sbjct: 721  TNATFLEEDHXRNHQPRSKIVLKEMFKNATDKPSSSTKVVDKANISDQSHTSQELRVPRR 780

Query: 781  SGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSV 840
            SGRVVHQP+ YLGL+ETQ++IPDDG+EDPLTYKQAM DVDRDQWIKAM+LEMESMYFNSV
Sbjct: 781  SGRVVHQPNRYLGLVETQIIIPDDGVEDPLTYKQAMNDVDRDQWIKAMNLEMESMYFNSV 840

Query: 841  WTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQREGVDYEETFSPVAMLK 900
            WTLVD P+DVKPIGCKWIYKRKRD AGKVQTFKARLVAKGYTQ+EGVDYEETFSPVAMLK
Sbjct: 841  WTLVDLPSDVKPIGCKWIYKRKRDQAGKVQTFKARLVAKGYTQKEGVDYEETFSPVAMLK 900

Query: 901  SIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIEQDQEQKVCKLKKSIYG 960
            SIRILLSIATFY+YEIWQMDVKTAFLNGNLEESIYM QPEGFI QDQEQKVCKL+KSIYG
Sbjct: 901  SIRILLSIATFYNYEIWQMDVKTAFLNGNLEESIYMVQPEGFIAQDQEQKVCKLQKSIYG 960

Query: 961  LKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLT 1020
            LKQASRSWNIRF+TAIKSYGFEQNVDEPCVYKK+VNS+VAFL+LYVDDILLIGNDV YLT
Sbjct: 961  LKQASRSWNIRFDTAIKSYGFEQNVDEPCVYKKIVNSVVAFLILYVDDILLIGNDVEYLT 1020

Query: 1021 DIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYIDKI-------------- 1080
            D+KKWL  QFQMKDLG+AQY+LGIQIVRNRKNKTLAMSQASYIDK+              
Sbjct: 1021 DVKKWLNTQFQMKDLGEAQYILGIQIVRNRKNKTLAMSQASYIDKVLSRYKMQNSKKGQL 1080

Query: 1081 ---YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI--------------------- 1110
               +GIHLSKEQCPKTPQEVEDMRNIPY+SAVGSLMYA+                     
Sbjct: 1081 PFRHGIHLSKEQCPKTPQEVEDMRNIPYSSAVGSLMYAMLCTRPDICYSVGIVSRYQSNP 1140

BLAST of CSPI03G23600.1 vs. NCBI nr
Match: KAA0035879.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051221.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0051893.1 gag/pol protein [Cucumis melo var. makuwa] >TYK00551.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1630.2 bits (4220), Expect = 0.0e+00
Identity = 851/1342 (63.41%), Postives = 985/1342 (73.40%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1140
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. NCBI nr
Match: KAA0031826.1 (gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0039313.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0043789.1 gag/pol protein [Cucumis melo var. makuwa] >KAA0048789.1 gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1630.2 bits (4220), Expect = 0.0e+00
Identity = 851/1342 (63.41%), Postives = 985/1342 (73.40%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1140
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. NCBI nr
Match: KAA0048404.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1626.3 bits (4210), Expect = 0.0e+00
Identity = 852/1342 (63.49%), Postives = 984/1342 (73.32%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK N NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWK-NTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1140
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. NCBI nr
Match: TYK14550.1 (gag/pol protein [Cucumis melo var. makuwa])

HSP 1 Score: 1606.7 bits (4159), Expect = 0.0e+00
Identity = 840/1330 (63.16%), Postives = 973/1330 (73.16%), Query Frame = 0

Query: 1    MNSSIVQLLASEKLNGDNYAAWKSNLNTILVVDDLRFVLTEECPQNPASNANRTSRDAYD 60
            M S+ + +LA++KLNG+NYA+WK+ +NT+L++DDLRFVL EECPQ PA+NA RT R+ Y+
Sbjct: 1    MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPAANATRTVREPYE 60

Query: 61   RWIKANEKARVYILASMSDVLAKKHESLATAKEIMDSLRGMFGQPEWSLRHEAVKYIYTK 120
            RW KANEKAR YILAS+S+VLAKKHES+ TA+EIMDSL+ MFGQ  + ++H+A+KYIY  
Sbjct: 61   RWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNA 120

Query: 121  RMKEGTSVREHVLDMMMHFNIAEVNGGPIEEVNQVSFILESLPKSFIPFQTNASLNKIEF 180
            RM EG SVREHVL+MM+HFN+AE+NG  I+E +QVSFILESLP+SF+ F++NA +NKI +
Sbjct: 121  RMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAY 180

Query: 181  NLTTLLNELQRFQNLTMGKGKQVEANVATTKRKFIRGSSSKTKARPSKP-NAQIKKKGKG 240
             LTTLLNELQ F++L   KG++ EANVAT+ RKF RGS+S TK+ PS   N + KKK  G
Sbjct: 181  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGNKKWKKKKGG 240

Query: 241  KTHKQN-------------KGKCYHCGQNGHWLRNCPKYLAEKKAEKETQGKYDLLVVET 300
            + +K N             KG C+HC Q GHW RNCPKYLAEKK  K  QGKYDLLV+ET
Sbjct: 241  QGNKANLAAAKTTKKTKAAKGICFHCNQEGHWKRNCPKYLAEKK--KAKQGKYDLLVLET 300

Query: 301  CLGEYENCTWILDSGATNHICFSFQENSSWKKLSEGENTLKVGTGEMVSA---------- 360
            CL E ++  WI+DSGATNH+C SFQ  SSW++L  GE T++VGTG +VSA          
Sbjct: 301  CLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCL 360

Query: 361  ------------------------------------------------------------ 420
                                                                        
Sbjct: 361  QKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYV 420

Query: 421  ---------------------------------------------------------SAL 480
                                                                     S L
Sbjct: 421  LRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL 480

Query: 481  ENNTLPPCESCLEGKMTKRSFTGKGLRAKGPLELVHSDLCGPMNVKARGGYEYFISFIDD 540
            E N+LP CESCLEGKMTKR FTGKG RAK PLELVHSDLCGPMNVKARGG+EYFI+F DD
Sbjct: 481  EENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDD 540

Query: 541  YSRYGHIYLIHHKSSSLEKFKQYKAEVENKLGKTIKILRSDRGGEYMDLRFRDYLIENGI 600
            YSRYG++YL+ HKS +LEKFK+YKAEVEN L KTIK  RSDRGGEYMDL+F++YL+E GI
Sbjct: 541  YSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGI 600

Query: 601  QSQLFASSTPQQNSNGVAERRNRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSK 660
             SQL A  TPQQ  NGV+ERRNRTLLDMVRSMMS+A +P+SFWGYA++TA YILN VPSK
Sbjct: 601  VSQLSAPGTPQQ--NGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSK 660

Query: 661  SVSETPYELWKGRKGSLRHFRIWGCPAHVLVQNPKKLEHRSKLCFFIGYPKESRGGLFYD 720
            SVSETP +LW GRKGSLRHFRIWGCPAHVL  NPKKLE RSKLC F+GYPK +RGG FYD
Sbjct: 661  SVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYD 720

Query: 721  PQENKIFVSTNATFLEEDHISDHQPRSKLVLKEISKS-------AIDKPSSSTKVVDKTR 780
            P++NK+FVSTNATFLEEDHI +H+PRSK+VL E+SK         +++PS+ T+VV    
Sbjct: 721  PKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVV-HVG 780

Query: 781  KSGQSHPSQQLREPRRSGRVVHQPDNYLGLIETQVVIPDDGIEDPLTYKQAMKDVDRDQW 840
             S ++H  Q LREPRRSGRV + P  Y+ L ET  VI D  IEDPLT+K+AM+DVD+D+W
Sbjct: 781  SSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEW 840

Query: 841  IKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHAGKVQTFKARLVAKGYTQR 900
            IKAM+LE+ESMYFNSVW LVDQP+ VKPIGCKWIYKRKR   GKVQTFKARLVAKGYTQ 
Sbjct: 841  IKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQV 900

Query: 901  EGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFLNGNLEESIYMSQPEGFIE 960
            EGVDYEETFSPVAMLKSIRILLSIA ++DYEIWQMDVKTAFLNGNLEE+IYM QPEGFI 
Sbjct: 901  EGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFII 960

Query: 961  QDQEQKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFEQNVDEPCVYKKVVNSIVAFLVL 1020
              QEQK+CKL +SIYGLKQASRSWNIRF+TAIKSYGF+Q VDEPCVYK+++N  VAFLVL
Sbjct: 961  PGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVL 1020

Query: 1021 YVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQASYID 1080
            YVDDILLIGND+G LTDIK+WLA QFQMKDLG+AQ+VLGIQI R+RKNK LA+SQASYID
Sbjct: 1021 YVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYID 1080

Query: 1081 KI-----------------YGIHLSKEQCPKTPQEVEDMRNIPYASAVGSLMYAI----- 1128
            KI                 +G+ LSKEQCPKTPQ+VE+MR+IPYASAVGSLMYA+     
Sbjct: 1081 KIVVKYSMQNSKRGLLPFRHGVTLSKEQCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRP 1140

BLAST of CSPI03G23600.1 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 257.3 bits (656), Expect = 5.8e-68
Identity = 152/475 (32.00%), Postives = 245/475 (51.58%), Query Frame = 0

Query: 675  EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 734
            ++P TY +A + +    W  AMD E+ +M     W +   P + KPIGCKW+YK K +  
Sbjct: 84   KEPSTYNEAKEFL---VWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSD 143

Query: 735  GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIATFYDYEIWQMDVKTAFL 794
            G ++ +KARLVAKGYTQ+EG+D+ ETFSPV  L S++++L+I+  Y++ + Q+D+  AFL
Sbjct: 144  GTIERYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLDISNAFL 203

Query: 795  NGNLEESIYMSQPEGFIEQDQE----QKVCKLKKSIYGLKQASRSWNIRFETAIKSYGFE 854
            NG+L+E IYM  P G+  +  +      VC LKKSIYGLKQASR W ++F   +  +GF 
Sbjct: 204  NGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSVTLIGFGFV 263

Query: 855  QNVDEPCVYKKVVNSIVAFLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVL 914
            Q+  +   + K+  ++   +++YVDDI++  N+   + ++K  L   F+++DLG  +Y L
Sbjct: 264  QSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELKSQLKSCFKLRDLGPLKYFL 323

Query: 915  GIQIVRNRKNKTLAMSQ--ASYIDKIYGIHLSKEQCPKTPQEVEDMRN-------IPYAS 974
            G++I R+     +   +     +D+   +       P  P       +         Y  
Sbjct: 324  GLEIARSAAGINICQRKYALDLLDETGLLGCKPSSVPMDPSVTFSAHSGGDFVDAKAYRR 383

Query: 975  AVGSLMY----------------------------AITKDYMLVYGT----------KDL 1034
             +G LMY                            A+ K    + GT           ++
Sbjct: 384  LIGRLMYLQITRLDISFAVNKLSQFSEAPRLAHQQAVMKILHYIKGTVGQGLFYSSQAEM 443

Query: 1035 ILIGYTDSDFQTDKDARKSTSGSVFTLNGGAVVWRSIKQTCIADSTMEAEYVAACEAAKE 1094
             L  ++D+ FQ+ KD R+ST+G    L    + W+S KQ  ++ S+ EAEY A   A  E
Sbjct: 444  QLQVFSDASFQSCKDTRRSTNGYCMFLGTSLISWKSKKQQVVSKSSAEAEYRALSFATDE 503

Query: 1095 AVWLRKFLTDFEVVSNMHLPITLYYDNSGAVANSREPRSHKRGKHIERKYHLIRE 1099
             +WL +F  + ++   +  P  L+ DN+ A+  +     H+R KHIE   H +RE
Sbjct: 504  MMWLAQFFRELQL--PLSKPTLLFCDNTAAIHIATNAVFHERTKHIESDCHSVRE 553

BLAST of CSPI03G23600.1 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase) )

HSP 1 Score: 82.0 bits (201), Expect = 3.3e-15
Identity = 40/103 (38.83%), Postives = 62/103 (60.19%), Query Frame = 0

Query: 675 EDPLTYKQAMKDVDRDQWIKAMDLEMESMYFNSVWTLVDQPNDVKPIGCKWIYKRKRDHA 734
           ++P +   A+KD     W +AM  E++++  N  W LV  P +   +GCKW++K K    
Sbjct: 26  KEPKSVIFALKD---PGWCQAMQEELDALSRNKTWILVPPPVNQNILGCKWVFKTKLHSD 85

Query: 735 GKVQTFKARLVAKGYTQREGVDYEETFSPVAMLKSIRILLSIA 778
           G +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L++A
Sbjct: 86  GTLDRLKARLVAKGFHQEEGIYFVETYSPVVRTATIRTILNVA 125

BLAST of CSPI03G23600.1 vs. TAIR 10
Match: ATMG00810.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 56.2 bits (134), Expect = 2.0e-07
Identity = 58/226 (25.66%), Postives = 90/226 (39.82%), Query Frame = 0

Query: 869  FLVLYVDDILLIGNDVGYLTDIKKWLAMQFQMKDLGDAQYVLGIQIVRNRKNKTLAMSQA 928
            +L+LYVDDILL G+    L  +   L+  F MKDLG   Y LGIQI  +     L +SQ 
Sbjct: 2    YLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPSG--LFLSQT 61

Query: 929  SYIDKIY---GIHLSKEQCPKTPQEVEDMRNI-------PYASAVGSLMY-AITK----- 988
             Y ++I    G+   K      P ++    +         + S VG+L Y  +T+     
Sbjct: 62   KYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYPDPSDFRSIVGALQYLTLTRPDISY 121

Query: 989  ---------------DY-----------------MLVYGTKDLILIGYTDSDFQTDKDAR 1047
                           D+                 + ++    L +  + DSD+      R
Sbjct: 122  AVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQAFCDSDWAGCTSTR 181

BLAST of CSPI03G23600.1 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 50.4 bits (119), Expect = 1.1e-05
Identity = 27/82 (32.93%), Postives = 47/82 (57.32%), Query Frame = 0

Query: 481 NRTLLDMVRSMMSFAQMPDSFWGYALETAAYILNNVPSKSVS-ETPYELWKGRKGSLRHF 540
           NRT+++ VRSM+    +P +F   A  TA +I+N  PS +++   P E+W     +  + 
Sbjct: 2   NRTIIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYL 61

Query: 541 RIWGCPAHVLVQNPKKLEHRSK 562
           R +GC A++   +  KL+ R+K
Sbjct: 62  RRFGCVAYIHC-DEGKLKPRAK 82

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P109781.6e-16330.69Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
P041462.9e-11225.63Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
Q9ZT943.2e-9523.59Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
Q94HW21.5e-9223.21Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q124901.4e-2127.82Transposon Ty1-BL Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 2... [more]
Match NameE-valueIdentityDescription
E2GK510.0e+0074.92Gag/pol protein (Fragment) OS=Bryonia dioica OX=3652 PE=4 SV=1[more]
A0A5A7SMH80.0e+0063.41Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold219G0025... [more]
A0A5D3CPJ60.0e+0063.41Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold119G0004... [more]
A0A5A7TZD70.0e+0063.49Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold264G0013... [more]
A0A5A7TWB90.0e+0063.16Gag/pol protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold133G0031... [more]
Match NameE-valueIdentityDescription
ADJ18449.10.0e+0074.92gag/pol protein, partial [Bryonia dioica][more]
KAA0035879.10.0e+0063.41gag/pol protein [Cucumis melo var. makuwa] >KAA0044276.1 gag/pol protein [Cucumi... [more]
KAA0031826.10.0e+0063.41gag/pol protein [Cucumis melo var. makuwa] >KAA0032384.1 gag/pol protein [Cucumi... [more]
KAA0048404.10.0e+0063.49gag/pol protein [Cucumis melo var. makuwa][more]
TYK14550.10.0e+0063.16gag/pol protein [Cucumis melo var. makuwa][more]
Match NameE-valueIdentityDescription
AT4G23160.15.8e-6832.00cysteine-rich RLK (RECEPTOR-like protein kinase) 8 [more]
ATMG00820.13.3e-1538.83Reverse transcriptase (RNA-dependent DNA polymerase) [more]
ATMG00810.12.0e-0725.66DNA/RNA polymerases superfamily protein [more]
ATMG00710.11.1e-0532.93Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 175..195
NoneNo IPR availablePFAMPF14223Retrotran_gag_2coord: 62..191
e-value: 7.3E-17
score: 61.4
NoneNo IPR availableGENE3D4.10.60.10coord: 241..285
e-value: 3.5E-7
score: 32.4
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 215..246
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 620..651
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 228..246
NoneNo IPR availablePANTHERPTHR45895FAMILY NOT NAMEDcoord: 336..649
coord: 675..962
NoneNo IPR availableCDDcd09272RNase_HI_RT_Ty1coord: 986..1127
e-value: 7.96112E-58
score: 193.454
IPR001878Zinc finger, CCHC-typeSMARTSM00343c2hcfinal6coord: 248..264
e-value: 4.8E-4
score: 29.5
IPR001878Zinc finger, CCHC-typePFAMPF00098zf-CCHCcoord: 247..264
e-value: 1.5E-5
score: 24.9
IPR001878Zinc finger, CCHC-typePROSITEPS50158ZF_CCHCcoord: 248..264
score: 10.542789
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 364..542
e-value: 4.4E-37
score: 129.3
IPR013103Reverse transcriptase, RNA-dependent DNA polymerasePFAMPF07727RVT_2coord: 706..934
e-value: 3.0E-72
score: 243.2
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 369..462
e-value: 4.1E-10
score: 39.9
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 366..533
score: 20.106445
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 706..1094
IPR036875Zinc finger, CCHC-type superfamilySUPERFAMILY57756Retrovirus zinc finger-like domainscoord: 236..267
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 365..527

Relationships

This mRNA is a part of the following gene feature(s):

Feature NameUnique NameType
CSPI03G23600CSPI03G23600gene


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameType
CSPI03G23600.1.cds6CSPI03G23600.1.cds6CDS
CSPI03G23600.1.cds5CSPI03G23600.1.cds5CDS
CSPI03G23600.1.cds4CSPI03G23600.1.cds4CDS
CSPI03G23600.1.cds3CSPI03G23600.1.cds3CDS
CSPI03G23600.1.cds2CSPI03G23600.1.cds2CDS
CSPI03G23600.1.cds1CSPI03G23600.1.cds1CDS


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameType
CSPI03G23600.1CSPI03G23600.1-proteinpolypeptide


GO Annotation
GO Assignments
This mRNA is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0008270 zinc ion binding