CSPI03G23920 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI03G23920
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
LocationChr3: 21018664 .. 21021915 (+)
RNA-Seq ExpressionCSPI03G23920
SyntenyCSPI03G23920
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAGTGATGGGAAGTGCTTCCAAAGGATCCACGTCTAGCGAAAGAACAATTGCAGACAAGGGAAAGGGTATAAAGGATGAAGAACAAGCTCATGAAGAAAAAAGCGAAGTGAAAACTAACCCGGTTACGAATGAAGGATTTGGAGTACCTTGTGCACCAAACCAGCGGGAGATACCCCTCTTCGACATGAGATTAAGGAAATTAGAGGTACCAATATTTAAGGGAGAAGAAGAAGAAAATGTGGATGGATGGTTACATCGCGTTGAGCGTTATTTTGTGGTGAATAGATTGACGGAGAGAGATAAACTGGATGCTGCGGTTCTTTGCTTGTAGGGAGAGGCGTTGGACTGGTATAAGTGGAAAGATGATCGAACGAAGATTAATAGTTGGGAGGAATTCCGGCAGCTTTTATGGGCAAGATTCAAGCCATCTGGTCAGGGGGATAAGCACGCGAGGTCGATGAAACTGCAGCAAGAAACCACCGTAAGGGAATATCGGCGACGAGTTGAGCAGTTTTCGACCAGCCTTAAGGATATGAGCGATGCAGCCCTAGAAAGTAAATTTGTGTGTGGGCTAAGGGAAGAAATCCAAAGTGAGATTCGTAAATTGAATCCAGTGGGCCTGGAAGCAAAAATGTTAATGGCCCAAGTAATAGAAGATGACCAAGTAGTCCAATTAAAAAGAATACATGGGGTGGGTGCAAACCCTAGTTTGTTAAATAAAACATCTGGTAATGGACCAAATGGGTCAGGAAATCAAACCGGGTCAAAGGTAATGGATCGGGTTGCAACATCAAGAACCATTACGATTAATCCTAGCCGAAATTCGTCTTCATCATCGACAACAATCACTTCCCCTCACGATTTTAACGTGAAAAACTCGATGGCTCAACCTTATCGCCGAATGACTGACAGTGAAATGAGAATGAAGAAGGAAAAGGGGCTATGTTTTCGATGTGACGAGAAATTCAATCCGGGGCCCCGTTGCAAAAGATGAGAATTAAATATCATTGCTATTCAAGAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAGAGGAGACTGAAGATAAGAATGAGCAAATCAATACTGAGATTGCAAATTTGTCTTTACATTCGTTGGTAGGTTTTAGTTCTCCTAAAACCATAAAAATAAAAGGCGAAATCAGGAATTGCGAAGTTGTCGTGTTAGTCGATGGGGGAGCTATACATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAGTGGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACAGGAATGTGTAAGAGTGTGAATCTGACAATCGTTAATTTATCGATCACTCATGATTTCCTTCCTCTACCACTTGGGAGTGCAGATGTTAATTTAGGAGTTACATGGTTGGAAACTTTGGGGAAAGTAATTTTCAATTACAAGTTATCAGAGATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGAACAAAAGCCTCGTAAAATCACAGGTGTCACTAAAATCAATGATTTTTAAGAAGGAAGATCAAGGAGTGTTAATCGAGTTGAGTACAGTCGAACAAGGGGGAGCGGAAGAGTCAAAGGATAATTTAGCGGACTGTCTGAGTAATCTAAAACCCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCATAAATCAGTTGCCACCGCCTCGCGATCATGATCATGCTATTGAATTAGAGTCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCGCTATCCCCAATTCCAGAAAGATGAAATTGAGAAATTAGTAAAGAAAATGTTGTTAGCCAAAATTATTCAGCCAAGCAAAAGTGCATTTTCAAGTCCGGTGCTCCTTGTAAAGAAGAAAGATGGTAGCTGGCGATTCTGCGTGGATTACCGAGCACTGAATCTTGCCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACAATATTTTCAAAAATTGATCTAAAGTCAGGCTATCACCATATTAGAGTGCGAGCCACCGATGTTCACAAAACGGCGTTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAATGCTCCGACCACTTTTCAATCAGTAATGAATGATATTCTCCGCCCGTACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAGCTCACTAGAAGAACATTTGCACCAATTGGCCATGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTAAAAAAATGTCAATTCGCAGTAGATCAAATTGAATATTTGGGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAGCGGCTATGGTTAAATGGCCAGCACCCAAGAATGTGAAGGAATTACGAGGTTTCTTGGGTCTTCCAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTGCCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAGAAAAAGCGTTCCAGCAGTTAAAATCTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTACCCAAGGTTTTGTGCTGAAAACTGATGCTTCAGGTGTTGGTATTGGGGCTGTCCTAATGCAGCATCAACGTCCAGTGGCTTTTTTCAGTCAAGCTTTGCCCATTACCCATAGGTTTAAAGCTGTGTATGAACGGAAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGCGCCCACATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATCAGAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCCATTGGGGGAGAATATCAAAGATGGATCGCTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAACAAAGCGGCTGATGCTTTATCACGATTACCACCGCTATTTGAATTGGGTCTCATAAGTGTTGTGGGCAGGCTCAATCCTTTGATTTTCATTGATCAAGTGACTGGGAATGAAGCTCTGAATAGTATTCGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCGCTACAAGGAGAGGTTCTATGCTATCACGACCGGTTGGTTCTTCCGGAGGATTCCCCTACTATCCCCTTATTATTAGCGGAGTTTCATAACTGCCCGATAGGATGA

mRNA sequence

ATGGGAGTGATGGGAAGTGCTTCCAAAGGATCCACGTCTAGCGAAAGAACAATTGCAGACAAGGGAAAGGGTATAAAGGATGAAGAACAAGCTCATGAAGAAAAAAGCGAAGTGAAAACTAACCCGGTTACGAATGAAGGATTTGGAGTACCTTGTGCACCAAACCAGCGGGAGATACCCCTCTTCGACATGAGATTAAGGAAATTAGAGGGAGAGGCGTTGGACTGGTATAAGTGGAAAGATGATCGAACGAAGATTAATAGTTGGGAGGAATTCCGGCAGCTTTTATGGGCAAGATTCAAGCCATCTGGTCAGGGGGATAAGCACGCGAGGTCGATGAAACTGCAGCAAGAAACCACCGTAAGGGAATATCGGCGACGAGTTGAGCAGTTTTCGACCAGCCTTAAGGATATGAGCGATGCAGCCCTAGAAAGTAAATTTGTGTGTGGGCTAAGGGAAGAAATCCAAAGTGAGATTCGTAAATTGAATCCAGTGGGCCTGGAAGCAAAAATGTTAATGGCCCAAGTAATAGAAGATGACCAAGTAGTCCAATTAAAAAGAATACATGGGGTGGGTGCAAACCCTAGTTTGTTAAATAAAACATCTGGTAATGGACCAAATGGGTCAGGAAATCAAACCGGGTCAAAGGTAATGGATCGGGTTGCAACATCAAGAACCATTACGATTAATCCTAGCCGAAATTCGTCTTCATCATCGACAACAATCACTTCCCCTCACGATTTTAACGTGAAAAACTCGATGGCTCAACCTTATCGCCGAATGACTGACAGTGAAATGAGAATGAAGAAGGAAAAGGGGCTATGTTTTCGATGTGACGAGAAATTCAATCCGGGGCCCCAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAGAGGAGACTGAAGATAAGAATGAGCAAATCAATACTGAGATTGCAAATTTGTCTTTACATTCGTTGGTAGGTTTTAGTTCTCCTAAAACCATAAAAATAAAAGGCGAAATCAGGAATTGCGAAGTTGTCGTGTTAGTCGATGGGGGAGCTATACATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAGTGGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACAGGAATGTGTAAGAGTGTGAATCTGACAATCGTTAATTTATCGATCACTCATGATTTCCTTCCTCTACCACTTGGGAGTGCAGATGTTAATTTAGGAGTTACATGGTTGGAAACTTTGGGGAAAGTAATTTTCAATTACAAGTTATCAGAGATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGAACAAAAGCCTCGTAAAATCACAGGTGTCACTAAAATCAATGATTTTTAAGAAGGAAGATCAAGGAGTGTTAATCGAGTTGAGTACAGTCGAACAAGGGGGAGCGGAAGAGTCAAAGGATAATTTAGCGGACTGTCTGAGTAATCTAAAACCCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCATAAATCAGTTGCCACCGCCTCGCGATCATGATCATGCTATTGAATTAGAGTCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCGCTATCCCCAATTCCAGAAAGATGAAATTGAGAAATTAGTAAAGAAAATGTTGTTAGCCAAAATTATTCAGCCAAGCAAAAGTGCATTTTCAAGTCCGGTGCTCCTTGTAAAGAAGAAAGATGGTAGCTGGCGATTCTGCGTGGATTACCGAGCACTGAATCTTGCCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACAATATTTTCAAAAATTGATCTAAAGTCAGGCTATCACCATATTAGAGTGCGAGCCACCGATGTTCACAAAACGGCGTTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAATGCTCCGACCACTTTTCAATCAGTAATGAATGATATTCTCCGCCCGTACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAGCTCACTAGAAGAACATTTGCACCAATTGGCCATGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTAAAAAAATGTCAATTCGCAGTAGATCAAATTGAATATTTGGGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAGCGGCTATGGTTAAATGGCCAGCACCCAAGAATGTGAAGGAATTACGAGGTTTCTTGGGTCTTCCAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTGCCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAGAAAAAGCGTTCCAGCAGTTAAAATCTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTACCCAAGGTTTTGTGCTGAAAACTGATGCTTCAGGTGTTGGTATTGGGGCTGTCCTAATGCAGCATCAACGTCCAGTGGCTTTTTTCAGTCAAGCTTTGCCCATTACCCATAGGTTTAAAGCTGTGTATGAACGGAAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGCGCCCACATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATCAGAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCCATTGGGGGAGAATATCAAAGATGGATCGCTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAACAAAGCGGCTGATGCTTTATCACGATTACCACCGCTATTTGAATTGGGTCTCATAAGTGTTGTGGGCAGGCTCAATCCTTTGATTTTCATTGATCAAGTGACTGGGAATGAAGCTCTGAATAGTATTCGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCGCTACAAGGAGAGGTTCTATGCTATCACGACCGGTTGGTTCTTCCGGAGGATTCCCCTACTATCCCCTTATTATTAGCGGAGTTTCATAACTGCCCGATAGGATGA

Coding sequence (CDS)

ATGGGAGTGATGGGAAGTGCTTCCAAAGGATCCACGTCTAGCGAAAGAACAATTGCAGACAAGGGAAAGGGTATAAAGGATGAAGAACAAGCTCATGAAGAAAAAAGCGAAGTGAAAACTAACCCGGTTACGAATGAAGGATTTGGAGTACCTTGTGCACCAAACCAGCGGGAGATACCCCTCTTCGACATGAGATTAAGGAAATTAGAGGGAGAGGCGTTGGACTGGTATAAGTGGAAAGATGATCGAACGAAGATTAATAGTTGGGAGGAATTCCGGCAGCTTTTATGGGCAAGATTCAAGCCATCTGGTCAGGGGGATAAGCACGCGAGGTCGATGAAACTGCAGCAAGAAACCACCGTAAGGGAATATCGGCGACGAGTTGAGCAGTTTTCGACCAGCCTTAAGGATATGAGCGATGCAGCCCTAGAAAGTAAATTTGTGTGTGGGCTAAGGGAAGAAATCCAAAGTGAGATTCGTAAATTGAATCCAGTGGGCCTGGAAGCAAAAATGTTAATGGCCCAAGTAATAGAAGATGACCAAGTAGTCCAATTAAAAAGAATACATGGGGTGGGTGCAAACCCTAGTTTGTTAAATAAAACATCTGGTAATGGACCAAATGGGTCAGGAAATCAAACCGGGTCAAAGGTAATGGATCGGGTTGCAACATCAAGAACCATTACGATTAATCCTAGCCGAAATTCGTCTTCATCATCGACAACAATCACTTCCCCTCACGATTTTAACGTGAAAAACTCGATGGCTCAACCTTATCGCCGAATGACTGACAGTGAAATGAGAATGAAGAAGGAAAAGGGGCTATGTTTTCGATGTGACGAGAAATTCAATCCGGGGCCCCAAGGGGAGGACTTAAGTGGGGAAATTGATAAAGTAGCAGAGGAGACTGAAGATAAGAATGAGCAAATCAATACTGAGATTGCAAATTTGTCTTTACATTCGTTGGTAGGTTTTAGTTCTCCTAAAACCATAAAAATAAAAGGCGAAATCAGGAATTGCGAAGTTGTCGTGTTAGTCGATGGGGGAGCTATACATAACTTTATTTCGGAGGAGGTGGTCAAGGAATTAAAAATTCCAGTGGAAACTTTAGATGCTTATGGCGTTGTTTTGGGAACCGGGGGTGTAGTTCGAGCAACAGGAATGTGTAAGAGTGTGAATCTGACAATCGTTAATTTATCGATCACTCATGATTTCCTTCCTCTACCACTTGGGAGTGCAGATGTTAATTTAGGAGTTACATGGTTGGAAACTTTGGGGAAAGTAATTTTCAATTACAAGTTATCAGAGATGGAATTTTCATTGGGAGAATTTTTGGTGATTCTACAAGGGAACAAAAGCCTCGTAAAATCACAGGTGTCACTAAAATCAATGATTTTTAAGAAGGAAGATCAAGGAGTGTTAATCGAGTTGAGTACAGTCGAACAAGGGGGAGCGGAAGAGTCAAAGGATAATTTAGCGGACTGTCTGAGTAATCTAAAACCCGAAGTTCAAAGAATTCTTTTGTCTTTTGGTAGTGTGTTTGAATCCATAAATCAGTTGCCACCGCCTCGCGATCATGATCATGCTATTGAATTAGAGTCAGGGGCCCGGGCGGTGAATGTGCGGCCTTATCGCTATCCCCAATTCCAGAAAGATGAAATTGAGAAATTAGTAAAGAAAATGTTGTTAGCCAAAATTATTCAGCCAAGCAAAAGTGCATTTTCAAGTCCGGTGCTCCTTGTAAAGAAGAAAGATGGTAGCTGGCGATTCTGCGTGGATTACCGAGCACTGAATCTTGCCACCATACCAGATAAGTATCCAATTCCCGTTGTTGATGAACTACTAGATGAATTATTCGGGGCAACAATATTTTCAAAAATTGATCTAAAGTCAGGCTATCACCATATTAGAGTGCGAGCCACCGATGTTCACAAAACGGCGTTTCGTACCCATGAAGGGCACTACGAATTTCTCGTGATGCCATTTGGGTTGAAGAATGCTCCGACCACTTTTCAATCAGTAATGAATGATATTCTCCGCCCGTACTTACGCAAATTTGTTTTGGTTTTCTTCGACGATATCCTAATCTACAGCTCACTAGAAGAACATTTGCACCAATTGGCCATGGTTTTAGAAACCTTAGTAGTTCATAAACTGGTGGCGAATTTAAAAAAATGTCAATTCGCAGTAGATCAAATTGAATATTTGGGTCACATTATCTCATCTGATGGGGTGGCGGCGGATCCAACGAAAATAGCGGCTATGGTTAAATGGCCAGCACCCAAGAATGTGAAGGAATTACGAGGTTTCTTGGGTCTTCCAGGTTATTATCGGAAATTTGTGGCAAACTATGGTTCTATTGCACTGCCCTTGACACAATTGTTAAAGAAAGGAAAGTTTCAGTGGAATGAGACAGCAGAAAAAGCGTTCCAGCAGTTAAAATCTGCTATGATGTCGGTACCGGTGTTAGGTATTCCAGATTTTACCCAAGGTTTTGTGCTGAAAACTGATGCTTCAGGTGTTGGTATTGGGGCTGTCCTAATGCAGCATCAACGTCCAGTGGCTTTTTTCAGTCAAGCTTTGCCCATTACCCATAGGTTTAAAGCTGTGTATGAACGGAAACTTATGGCAATCGTGCGGGCTGTTCAAAAATGGCGCCCACATCTGTTAGGTAAGCCTTTTGTGGTGCGTACAGATCAGAAAAGTTTGAAGTTTCTTCTTGAACAACGAGCCATTGGGGGAGAATATCAAAGATGGATCGCTAAACTATTGGGGTATGATTTTGTGATTGAATACAAGAAAGGAATGGAAAACAAAGCGGCTGATGCTTTATCACGATTACCACCGCTATTTGAATTGGGTCTCATAAGTGTTGTGGGCAGGCTCAATCCTTTGATTTTCATTGATCAAGTGACTGGGAATGAAGCTCTGAATAGTATTCGGTTATCCTTGATAAATGGACAGCCAACCCCGGAGGGATATTCGCTACAAGGAGAGGTTCTATGCTATCACGACCGGTTGGTTCTTCCGGAGGATTCCCCTACTATCCCCTTATTATTAGCGGAGTTTCATAACTGCCCGATAGGATGA

Protein sequence

MGVMGSASKGSTSSERTIADKGKGIKDEEQAHEEKSEVKTNPVTNEGFGVPCAPNQREIPLFDMRLRKLEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDDQVVQLKRIHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDFNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGEDLSGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMIFKKEDQGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYSSLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPLLLAEFHNCPIG*
Homology
BLAST of CSPI03G23920 vs. ExPASy Swiss-Prot
Match: P04323 (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 342.8 bits (878), Expect = 1.3e-92
Identity = 183/412 (44.42%), Postives = 256/412 (62.14%), Query Frame = 0

Query: 543 YRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLV-KKKDGS----WRFCVDYRALN 602
           Y YPQ  + E+E  ++ ML   II+ S S ++SP+ +V KK+D S    +R  +DYR LN
Sbjct: 213 YSYPQAYEQEVESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLN 272

Query: 603 LATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFL 662
             T+ D++PIP +DE+L +L     F+ IDL  G+H I +    V KTAF T  GHYE+L
Sbjct: 273 EITVGDRHPIPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYL 332

Query: 663 VMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVH 722
            MPFGLKNAP TFQ  MNDILRP L K  LV+ DDI+++S SL+EHL  L +V E L   
Sbjct: 333 RMPFGLKNAPATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKA 392

Query: 723 KLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYY 782
            L   L KC+F   +  +LGH+++ DG+  +P KI A+ K+P P   KE++ FLGL GYY
Sbjct: 393 NLKLQLDKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYY 452

Query: 783 RKFVANYGSIALPLTQLLKKGK--FQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLK 842
           RKF+ N+  IA P+T+ LKK       N   + AF++LK  +   P+L +PDFT+ F L 
Sbjct: 453 RKFIPNFADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLT 512

Query: 843 TDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFV 902
           TDAS V +GAVL Q   P+++ S+ L       +  E++L+AIV A + +R +LLG+ F 
Sbjct: 513 TDASDVALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFE 572

Query: 903 VRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRL 947
           + +D + L +L   +    +  RW  KL  +DF I+Y KG EN  ADALSR+
Sbjct: 573 ISSDHQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRI 624

BLAST of CSPI03G23920 vs. ExPASy Swiss-Prot
Match: P20825 (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 340.5 bits (872), Expect = 6.6e-92
Identity = 180/414 (43.48%), Postives = 258/414 (62.32%), Query Frame = 0

Query: 541 RPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKD-----GSWRFCVDYRA 600
           + Y   Q  + E+E  V++ML   +I+ S S ++SP  +V KK        +R  +DYR 
Sbjct: 210 KQYPLAQTHEIEVENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRK 269

Query: 601 LNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYE 660
           LN  TIPD+YPIP +DE+L +L     F+ IDL  G+H I +    + KTAF T  GHYE
Sbjct: 270 LNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYE 329

Query: 661 FLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLV 720
           +L MPFGL+NAP TFQ  MN+ILRP L K  LV+ DDI+I+S SL EHL+ + +V   L 
Sbjct: 330 YLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLA 389

Query: 721 VHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPG 780
              L   L KC+F   +  +LGHI++ DG+  +P K+ A+V +P P   KE+R FLGL G
Sbjct: 390 DANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTG 449

Query: 781 YYRKFVANYGSIALPLTQLLKKGKFQWNETAE--KAFQQLKSAMMSVPVLGIPDFTQGFV 840
           YYRKF+ NY  IA P+T  LKK      +  E  +AF++LK+ ++  P+L +PDF + FV
Sbjct: 450 YYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFV 509

Query: 841 LKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKP 900
           L TDAS + +GAVL Q+  P++F S+ L       +  E++L+AIV A + +R +LLG+ 
Sbjct: 510 LTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQ 569

Query: 901 FVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRL 947
           F++ +D + L++L   +  G + +RW  +L  Y F I+Y KG EN  ADALSR+
Sbjct: 570 FLIASDHQPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRI 623

BLAST of CSPI03G23920 vs. ExPASy Swiss-Prot
Match: Q7LHG5 (Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-I PE=1 SV=2)

HSP 1 Score: 330.9 bits (847), Expect = 5.2e-89
Identity = 211/529 (39.89%), Postives = 289/529 (54.63%), Query Frame = 0

Query: 517  NQLPP-PRDHD-----HAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSK 576
            N LPP P D +     H IE++ GAR   ++PY   +  + EI K+V+K+L  K I PSK
Sbjct: 596  NDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSK 655

Query: 577  SAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKS 636
            S  SSPV+LV KKDG++R CVDYR LN ATI D +P+P +D LL  +  A IF+ +DL S
Sbjct: 656  SPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHS 715

Query: 637  GYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFF 696
            GYH I +   D +KTAF T  G YE+ VMPFGL NAP+TF   M D  R    +FV V+ 
Sbjct: 716  GYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYL 775

Query: 697  DDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPT 756
            DDILI+S S EEH   L  VLE L    L+   KKC+FA ++ E+LG+ I    +A    
Sbjct: 776  DDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQH 835

Query: 757  KIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAF 816
            K AA+  +P PK VK+ + FLG+  YYR+F+ N   IA P+ QL    K QW E  +KA 
Sbjct: 836  KCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAI 895

Query: 817  QQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP------VAFFSQALPIT 876
            ++LK+A+ + PVL   +    + L TDAS  GIGAVL +          V +FS++L   
Sbjct: 896  EKLKAALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESA 955

Query: 877  HRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLL 936
             +     E +L+ I++A+  +R  L GK F +RTD  SL  L  +       QRW+  L 
Sbjct: 956  QKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLA 1015

Query: 937  GYDFVIEYKKGMENKAADALSRL-----------------------PPLFELGLISVVGR 996
             YDF +EY  G +N  ADA+SR                         PL    LI +   
Sbjct: 1016 TYDFTLEYLAGPKNVVADAISRAIYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKEL 1075

Query: 997  LNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLP 1010
                +  + ++   A  S +  L   +   + YSL+ E++ Y DRLV+P
Sbjct: 1076 TQHNVTPEDMS---AFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP 1118

BLAST of CSPI03G23920 vs. ExPASy Swiss-Prot
Match: Q99315 (Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY3B-G PE=1 SV=3)

HSP 1 Score: 329.7 bits (844), Expect = 1.2e-88
Identity = 211/529 (39.89%), Postives = 287/529 (54.25%), Query Frame = 0

Query: 517  NQLPP-PRDHD-----HAIELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSK 576
            N LPP P D +     H IE++ GAR   ++PY   +  + EI K+V+K+L  K I PSK
Sbjct: 570  NDLPPRPADINNIPVKHDIEIKPGARLPRLQPYHVTEKNEQEINKIVQKLLDNKFIVPSK 629

Query: 577  SAFSSPVLLVKKKDGSWRFCVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKS 636
            S  SSPV+LV KKDG++R CVDYR LN ATI D +P+P +D LL  +  A IF+ +DL S
Sbjct: 630  SPCSSPVVLVPKKDGTFRLCVDYRTLNKATISDPFPLPRIDNLLSRIGNAQIFTTLDLHS 689

Query: 637  GYHHIRVRATDVHKTAFRTHEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFF 696
            GYH I +   D +KTAF T  G YE+ VMPFGL NAP+TF   M D  R    +FV V+ 
Sbjct: 690  GYHQIPMEPKDRYKTAFVTPSGKYEYTVMPFGLVNAPSTFARYMADTFRDL--RFVNVYL 749

Query: 697  DDILIYS-SLEEHLHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPT 756
            DDILI+S S EEH   L  VLE L    L+   KKC+FA ++ E+LG+ I    +A    
Sbjct: 750  DDILIFSESPEEHWKHLDTVLERLKNENLIVKKKKCKFASEETEFLGYSIGIQKIAPLQH 809

Query: 757  KIAAMVKWPAPKNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAF 816
            K AA+  +P PK VK+ + FLG+  YYR+F+ N   IA P+ QL    K QW E  +KA 
Sbjct: 810  KCAAIRDFPTPKTVKQAQRFLGMINYYRRFIPNCSKIAQPI-QLFICDKSQWTEKQDKAI 869

Query: 817  QQLKSAMMSVPVLGIPDFTQGFVLKTDASGVGIGAVLMQHQRP------VAFFSQALPIT 876
             +LK A+ + PVL   +    + L TDAS  GIGAVL +          V +FS++L   
Sbjct: 870  DKLKDALCNSPVLVPFNNKANYRLTTDASKDGIGAVLEEVDNKNKLVGVVGYFSKSLESA 929

Query: 877  HRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLL 936
             +     E +L+ I++A+  +R  L GK F +RTD  SL  L  +       QRW+  L 
Sbjct: 930  QKNYPAGELELLGIIKALHHFRYMLHGKHFTLRTDHISLLSLQNKNEPARRVQRWLDDLA 989

Query: 937  GYDFVIEYKKGMENKAADALSRL-----------------------PPLFELGLISVVGR 996
             YDF +EY  G +N  ADA+SR                         PL    LI +   
Sbjct: 990  TYDFTLEYLAGPKNVVADAISRAVYTITPETSRPIDTESWKSYYKSDPLCSAVLIHMKEL 1049

Query: 997  LNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLP 1010
                +  + ++   A  S +  L   +   + YSL+ E++ Y DRLV+P
Sbjct: 1050 TQHNVTPEDMS---AFRSYQKKLELSETFRKNYSLEDEMIYYQDRLVVP 1092

BLAST of CSPI03G23920 vs. ExPASy Swiss-Prot
Match: Q8I7P9 (Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogaster OX=7227 GN=pol PE=4 SV=1)

HSP 1 Score: 315.1 bits (806), Expect = 3.0e-84
Identity = 173/434 (39.86%), Postives = 257/434 (59.22%), Query Frame = 0

Query: 538 VNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKK-----DGSWRFCVD 597
           +  + Y YP   + E+E+ + ++L   II+PS S ++SP+ +V KK     +  +R  VD
Sbjct: 124 IYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYNSPIWIVPKKPKPNGEKQYRMVVD 183

Query: 598 YRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEG 657
           ++ LN  TIPD YPIP ++  L  L  A  F+ +DL SG+H I ++ +D+ KTAF T  G
Sbjct: 184 FKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLDLTSGFHQIHMKESDIPKTAFSTLNG 243

Query: 658 HYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLE 717
            YEFL +PFGLKNAP  FQ +++DILR ++ K   V+ DDI+++S   + H   L +VL 
Sbjct: 244 KYEFLRLPFGLKNAPAIFQRMIDDILREHIGKVCYVYIDDIIVFSEDYDTHWKNLRLVLA 303

Query: 718 TLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLG 777
           +L    L  NL+K  F   Q+E+LG+I+++DG+ ADP K+ A+ + P P +VKEL+ FLG
Sbjct: 304 SLSKANLQVNLEKSHFLDTQVEFLGYIVTADGIKADPKKVRAISEMPPPTSVKELKRFLG 363

Query: 778 LPGYYRKFVANYGSIALPLTQLLK------------KGKFQWNETAEKAFQQLKSAMMSV 837
           +  YYRKF+ +Y  +A PLT L +            K     +ETA ++F  LKS + S 
Sbjct: 364 MTSYYRKFIQDYAKVAKPLTNLTRGLYANIKSSQSSKVPITLDETALQSFNDLKSILCSS 423

Query: 838 PVLGIPDFTQGFVLKTDASGVGIGAVLMQ----HQRPVAFFSQALPITHRFKAVYERKLM 897
            +L  P FT+ F L TDAS   IGAVL Q      RP+A+ S++L  T    A  E++++
Sbjct: 424 EILAFPCFTKPFHLTTDASNWAIGAVLSQDDQGRDRPIAYISRSLNKTEENYATIEKEML 483

Query: 898 AIVRAVQKWRPHLLGKPFV-VRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKG 949
           AI+ ++   R +L G   + V TD + L F L  R    + +RW A++  Y+  + YK G
Sbjct: 484 AIIWSLDNLRAYLYGAGTIKVYTDHQPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPG 543

BLAST of CSPI03G23920 vs. ExPASy TrEMBL
Match: J3SDF5 (Ty3/gypsy retrotransposon protein OS=Beta vulgaris subsp. vulgaris OX=3555 PE=4 SV=1)

HSP 1 Score: 871.7 bits (2251), Expect = 3.0e-249
Identity = 475/981 (48.42%), Postives = 646/981 (65.85%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            +EG+AL WY+W++ R    +WE  +  +  +F+P   G  H + +   Q  +V EYRR+ 
Sbjct: 216  MEGDALRWYQWENKRRPFRNWESMKSFVLTQFRPLNVGSLHEQWLSTTQTASVWEYRRKF 275

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIED-DQVVQLKR 188
             + +  L  + +  L  KF+ GL  E+QSEIR LNP  L+  M +A  +E+ ++V   +R
Sbjct: 276  VETAAPLDGIPEEILMGKFIHGLNPELQSEIRVLNPYNLDQAMELALKLEERNRVNGARR 335

Query: 189  IHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHD 248
                  + S+ N+    GPN   N +   V      S   T + + NS++S T++ +   
Sbjct: 336  TGPRSGSFSIYNR----GPN--SNPSLPSVYGSQGGSNASTKSWAINSNASQTSVNNAKP 395

Query: 249  FNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQ---------------GEDLS 308
              + +      RR+T+ E++ K+ KGLCF+CDEK+  G Q                ++L 
Sbjct: 396  PPLSSRGFGEMRRLTEKELQEKRAKGLCFKCDEKWGVGHQCRRKELSVLFMEDNEEDELE 455

Query: 309  GEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHN 368
            G +   +E      E+I  E+   SL+S++G S+PKT+K+ G I N EVVV++D GA HN
Sbjct: 456  GALSG-SEAPPSPTEEIPPEV---SLNSVIGLSNPKTMKLSGLIDNHEVVVMIDPGATHN 515

Query: 369  FISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTI-VNLSITHDFLPLPLGS 428
            F+S + + +L IPV   + +GV LG G  VR TG+C++V L +   L +  DFLPL LG+
Sbjct: 516  FLSLKAIDKLGIPVTESEEFGVSLGDGQAVRGTGICRAVALYLDGGLVVVEDFLPLGLGN 575

Query: 429  ADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKED 488
            +DV LGV WLETLG V+ N+K  +M F LG     L G+ +L +S+VSLK+M+   +KE 
Sbjct: 576  SDVILGVQWLETLGTVVSNWKTQKMSFQLGGVPYTLTGDPTLARSKVSLKAMLRTLRKEG 635

Query: 489  QGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAI 548
             G+ +E + VE GGA   +D+  +    + P +Q ++  F  VFE+   LPP R H+HAI
Sbjct: 636  GGLWLECNQVEAGGAGSIRDSKVE--QEIPPFLQELMRRFEGVFETPVGLPPRRGHEHAI 695

Query: 549  ELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRF 608
             L+ G+  V VRPYRYPQFQKDEIE+L+K+ML A IIQPS S FSSPV+LVKKKDGSWRF
Sbjct: 696  VLKEGSNPVGVRPYRYPQFQKDEIERLIKEMLAAGIIQPSTSPFSSPVILVKKKDGSWRF 755

Query: 609  CVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT 668
            CVDYRALN  T+PDKYPIPV+DELLDEL GAT+FSK+DL++GYH I VR  D HKTAFRT
Sbjct: 756  CVDYRALNKETVPDKYPIPVIDELLDELHGATVFSKLDLRAGYHQILVRPEDTHKTAFRT 815

Query: 669  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAM 728
            HEGHYEFLVMPFGL NAP TFQS+MN++ RP+LR+FVLVF DDILIYS S EEH+  L M
Sbjct: 816  HEGHYEFLVMPFGLTNAPATFQSLMNEVFRPFLRRFVLVFLDDILIYSRSDEEHVGHLEM 875

Query: 729  VLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRG 788
            VL  L  H L  N KKC+F   ++ YLGH+IS  GVA D  K+ A+++W  PKN++ELRG
Sbjct: 876  VLGMLAQHALFVNKKKCEFGKREVAYLGHVISEGGVAMDTEKVKAVLEWEVPKNLRELRG 935

Query: 789  FLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFT 848
            FLGL GYYRKFVANY  IA PLT+ LKK  F+W+ TA +AF+QLKSAM+S PVL +P+F 
Sbjct: 936  FLGLTGYYRKFVANYAHIARPLTEQLKKDNFKWSATATEAFKQLKSAMVSAPVLAMPNFQ 995

Query: 849  QGFVLKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHL 908
              FV++TDASG G+GAVLMQ  RP+A++S+ L    + K+VYE++LMAI  AVQKW+ +L
Sbjct: 996  LTFVVETDASGYGMGAVLMQDNRPIAYYSKLLGTRAQLKSVYEKELMAICFAVQKWKYYL 1055

Query: 909  LGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR-LP 968
            LG+ FVVRTDQ+SL+++ +QR IG E+Q+W++KL+GYDF I YK G+ N+ ADALSR   
Sbjct: 1056 LGRHFVVRTDQQSLRYITQQREIGAEFQKWVSKLMGYDFEIHYKPGLSNRVADALSRKTV 1115

Query: 969  PLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLV 1028
               ELG I  V  +       ++TG+  L  +R  L  G+ TP  ++L    L +  R V
Sbjct: 1116 GEVELGAIVAVQGVEWAELRREITGDSFLTQVRKELQEGR-TPSHFTLVDGNLLFKGRYV 1175

BLAST of CSPI03G23920 vs. ExPASy TrEMBL
Match: A0A2I0X132 (Putative mitochondrial protein OS=Dendrobium catenatum OX=906689 GN=MA16_Dca013044 PE=4 SV=1)

HSP 1 Score: 867.5 bits (2240), Expect = 5.7e-248
Identity = 475/976 (48.67%), Postives = 632/976 (64.75%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R 
Sbjct: 276  LEARALAWYQWTEERQRFRCWAEFREMCLDRFRPPKEGTHHEQFFALTQTGTVSAYRDRF 335

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDD-QVVQLKR 188
            E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L   + MA++IED     Q +R
Sbjct: 336  ELLSSRLRGMTDEVLEGNFMKGLKPHIRSAVRAAKPRSLRETLEMAELIEDRLSSDQYRR 395

Query: 189  IHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPH 248
                G    + N T G  G        G K  DR A ++                     
Sbjct: 396  PTFFGGGQKVANPTGGAKGAYLGAGGEGQKERDRSAPAK--------------------- 455

Query: 249  DFNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKV 308
                       +RR+T++E++ K+ KGLC+RCDEKF PG + +D           E ++V
Sbjct: 456  ---------GEFRRLTEAELKDKRAKGLCYRCDEKFGPGHRCKDKLLQVLLVEDPEEEEV 515

Query: 309  AEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS 368
             EE    E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA HNFI+
Sbjct: 516  EEELGGEEEGVDHLHLDMIEVSLNSVAGLTAHSTMKMEGRIGSFTVTVLIDSGATHNFIA 575

Query: 369  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVN 428
              +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV 
Sbjct: 576  CRLVEELGIPMIQGRGVGVSLGTGQREQCAGRCSGVTLSIQGEDITQDFLVLELGNTDVI 635

Query: 429  LGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKEDQGVL 488
            LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +S+V+LK+++   + E +G L
Sbjct: 636  LGIQWLQTLGEMKVNWKTLMLEYGEGEHRVTLHGDPKLCRSKVALKTILKSLRSEGEGFL 695

Query: 489  IELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELES 548
            IEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ 
Sbjct: 696  IELWRLE--GSEPVEE------QNIPEEVGELLEDFTPVFQMPAGLPPQRSKEHAIVLKG 755

Query: 549  GARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDY 608
            GA  V+VRPYRYP  QK+EIEKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDY
Sbjct: 756  GADPVSVRPYRYPHAQKEEIEKLVREMMEAGVIQPSVSPFSSPVLLVKKKDGSWRFCVDY 815

Query: 609  RALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH 668
            RALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRTHEGH
Sbjct: 816  RALNKETVLDKFPIPVIDELLDELGGATMFSKLDLKSGYHQIRMRREDIPKTAFRTHEGH 875

Query: 669  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLET 728
            YEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDD LIYS SL+EHL  L  VL T
Sbjct: 876  YEFLVMPFGLTNAPSTFQALMNQVFQPMLRRFVLVFFDDFLIYSRSLQEHLEHLRKVLNT 935

Query: 729  LVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGL 788
            L  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ AM  WP PKN++ LRGFLGL
Sbjct: 936  LQHHQLYVNQKKCSFAQRSVEYLGHIISAEGVAADPSKVEAMTSWPTPKNLRALRGFLGL 995

Query: 789  PGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFV 848
             GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +P+F Q  V
Sbjct: 996  TGYYRKFIKGYGSIAAPLTEQLKKDSFNWGPEADSAMEALKRAMVSAPVLALPNFKQQLV 1055

Query: 849  LKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKP 908
            ++TDASG+G+GAVLMQ  RP+AF+SQ L    + K+VYER+LMAIV+A+QKWRP+LLG+ 
Sbjct: 1056 VETDASGLGLGAVLMQQGRPIAFYSQVLSGRAKLKSVYERELMAIVKAIQKWRPYLLGRR 1115

Query: 909  FVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL 968
            F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR     +L
Sbjct: 1116 FLVRTDQRSLKYLLEQRMVTEEHQRWLSKLLGYDFEIQYKPGLENKAADALSRKVECSQL 1175

Query: 969  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDS 1028
               SV   ++      +   +E L S+R ++  G+  P GY+++ ++L +  RLVLP+ S
Sbjct: 1176 IATSVPQLVDWGNLRTENMTSEELGSLREAIRKGKEIPSGYTVEDQLLLHRGRLVLPKTS 1213

BLAST of CSPI03G23920 vs. ExPASy TrEMBL
Match: A0A2I0WN12 (Putative mitochondrial protein OS=Dendrobium catenatum OX=906689 GN=MA16_Dca001655 PE=4 SV=1)

HSP 1 Score: 860.5 bits (2222), Expect = 7.0e-246
Identity = 473/971 (48.71%), Postives = 629/971 (64.78%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            LEG A  W+K+ D    + +W +F++ +  RF+ S   +   +   L QE TV EYR++ 
Sbjct: 211  LEGAAFAWFKYMDKWDPVRTWRDFKEAIRERFRGSSPWEISEQFYALTQEGTVEEYRKKF 270

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDDQVVQLKRI 188
            E     ++ +S++ L   F+ GL+ EI+  ++ + P  L   M +AQ++E+ +       
Sbjct: 271  ESLVGDMEGLSNSTLGGNFMKGLKPEIRDAVKVMRPRDLREAMELAQLVENQKT------ 330

Query: 189  HGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDF 248
                           N  +   N +G         +RT T   +    S S    S  + 
Sbjct: 331  ---------------NARSWRNNHSG-------GPTRTTTTYLAPKGPSPSVPRESGKEK 390

Query: 249  NVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGED-----LSGEIDKVAEETE 308
            +     A  ++++T+ EM+ K+ KGLCFRC+EKF PG + +D     L+  ID+  EE +
Sbjct: 391  SGVTRTAGSFKKLTEEEMQEKRAKGLCFRCEEKFVPGHRCKDRALRALTVYIDEAPEEGD 450

Query: 309  DKNEQIN---TEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVK 368
            D  E+ +    E+A +SL+S++GF+   T+K+KG+I   EVVVL+D GA HNFIS +V +
Sbjct: 451  DSEEEQSDPQLEVAEVSLNSVMGFTPSHTMKVKGKIHGREVVVLIDSGATHNFISTQVAE 510

Query: 369  ELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVTW 428
            EL I      +YGV++GTG +  +TG+CK V +++  + +  DFLPL LGS DV LG+ W
Sbjct: 511  ELGIEPTETGSYGVMMGTGKIESSTGICKGVEMSLQEIRVVEDFLPLRLGSTDVILGMKW 570

Query: 429  LETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSM--IFKKEDQGVLIELST 488
            L+TLG+   N+    ME  +    V L+G   L ++ VSL+SM  I  +E  G L+EL +
Sbjct: 571  LQTLGETKVNWGTMVMELMVEGKRVKLRGEPGLSRAGVSLRSMVKIIHEEGGGFLVELQS 630

Query: 489  VEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARAV 548
            +E     E K        ++   VQ  L  F  VF+    LPP R+ +H I L+ G   +
Sbjct: 631  LEDQQEGEEK--------HIPALVQPFLQEFEDVFQPPVGLPPDREREHQIILKEGVSPI 690

Query: 549  NVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALNL 608
            +VRPYRYPQ QKDEIEKLV +ML   IIQPS S FSSPVLLVKKKDGSWRFCVDYRALN 
Sbjct: 691  SVRPYRYPQVQKDEIEKLVGEMLEGGIIQPSVSPFSSPVLLVKKKDGSWRFCVDYRALNK 750

Query: 609  ATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFLV 668
             T+PDK+PIPV+DEL+DEL GA +F+KIDLKSGYH IR+R  DV KTAFRTHEGHYEFLV
Sbjct: 751  ETVPDKFPIPVIDELMDELHGAALFTKIDLKSGYHQIRMRKEDVQKTAFRTHEGHYEFLV 810

Query: 669  MPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVHK 728
            MPFGL NAP TFQ++MN I RP+LR+FVLVFFDDILIYS + EEHL  L +VL  L  H+
Sbjct: 811  MPFGLTNAPATFQALMNRIFRPHLRRFVLVFFDDILIYSRTEEEHLEHLRVVLGVLREHQ 870

Query: 729  LVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYYR 788
            L AN KKC FA  Q+EYLGH+IS +GVAAD +KI AM+ WP PK +K LRGFLGL GYYR
Sbjct: 871  LKANFKKCDFAQAQVEYLGHVISQEGVAADQSKIEAMLAWPQPKTLKGLRGFLGLTGYYR 930

Query: 789  KFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTDA 848
            +FV  Y +IA PLT+LLKK  F W E A +AF++LK AM +VPVL +PDF Q FVL+TDA
Sbjct: 931  RFVRGYSTIAGPLTELLKKDNFLWGEAASEAFEKLKKAMTTVPVLALPDFNQVFVLETDA 990

Query: 849  SGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVRT 908
            SG G+GAVLMQ+ R +A+FSQ L    R K+VYER+LMAIV A+QKWRP+LLG+ F+VRT
Sbjct: 991  SGYGLGAVLMQNHRAIAYFSQILSPRARLKSVYERELMAIVLAIQKWRPYLLGRHFIVRT 1050

Query: 909  DQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLISV 968
            DQ+SLK+LLEQR +  E+QRW++KLLGYDF I Y+ G+ENKAADALSR    F+   +SV
Sbjct: 1051 DQRSLKYLLEQRMVTEEHQRWLSKLLGYDFEIHYRPGLENKAADALSRCMEEFQGMAVSV 1110

Query: 969  VGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIPL 1028
               ++     ++   NE L  I+  +++ + +  GYS+ GE L Y  R V+P  S  IP 
Sbjct: 1111 PVLIDWGAIKEESIQNEELRRIKEEVLSDENSHPGYSVVGERLYYQGRSVIPRSSIHIPQ 1145

BLAST of CSPI03G23920 vs. ExPASy TrEMBL
Match: A0A087GEK8 (Uncharacterized protein OS=Arabis alpina OX=50452 GN=AALP_AA8G499800 PE=4 SV=1)

HSP 1 Score: 850.9 bits (2197), Expect = 5.6e-243
Identity = 476/975 (48.82%), Postives = 632/975 (64.82%), Query Frame = 0

Query: 73   ALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRVEQFS 132
            ALDWY+W+ DR    SW + R  + A++          R + L+Q+  V ++ R     +
Sbjct: 146  ALDWYRWERDRHPFRSWPDPRLRIVAQYASDNNSCAGKRLLVLKQDGAVADFCRDFIGLA 205

Query: 133  TSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDDQVVQLKRIHGVG 192
            T+  ++ +  LE  F+ GL+  I+S ++   P  LE  M +A++++              
Sbjct: 206  TNAPEVPEFILEWTFMNGLKPHIRSRVQTFEPQTLEKMMSVAKLVDG---WSESAFGSSV 265

Query: 193  ANPSLLNKTSGNGPN---GSGNQTGSKVMDRVATSR-TITINPSRNSSSSSTTITSPHDF 252
            A+    +KT+ +GP    G  N TG      +A ++    + PS N+ S S   T   + 
Sbjct: 266  ASYFPTSKTARDGPTRGLGFSNNTGPTSTTGLALNKPNSQLTPSDNTQSFSQ--TEKRNP 325

Query: 253  NVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQ--------------GEDLSGE 312
               N +  PYRR+T  EM  +K  GLCFRCDEK++   Q              G D+  E
Sbjct: 326  TTHNRVKPPYRRLTPIEMAQRKADGLCFRCDEKWHIRHQCPKKEVNVLLVQEDGPDILWE 385

Query: 313  IDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFI 372
             D   ++  D  +Q  TE+A LSL+S+VG SSP T+K+ G I+  EVVVL+D GA HNF+
Sbjct: 386  AD---DDFTDATDQAITELAELSLNSMVGISSPSTMKLMGTIQTTEVVVLIDSGASHNFV 445

Query: 373  SEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADV 432
            SE++V  L +      +YGV+ G G  VR  G+C+ + L +  L I  DFLPL LGSADV
Sbjct: 446  SEQLVHRLGLQSAKTGSYGVLTGGGMTVRGAGVCRGLVLLLQGLRIRDDFLPLELGSADV 505

Query: 433  NLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMIFKKEDQGVLI 492
             LG+ WL +LG++  N+    M FSLG    +LQG+     S +SLKS++   +DQGV +
Sbjct: 506  ILGIKWLSSLGEMKVNWGRQYMRFSLGGETAVLQGDPGQGCSAISLKSLMRAVKDQGVGL 565

Query: 493  ELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESG 552
                VE  G  +S D +A   + +   +  ++  F  VFE    LPP R   H I LESG
Sbjct: 566  ---LVEYNGL-QSLDQVAGFTTEVPQALVSVMDQFPQVFEDPQGLPPTRGRAHEINLESG 625

Query: 553  ARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYR 612
            A+AV+VRP+RYPQ QK EIEK V  ML A IIQ S S FSSPVLLVKKKDGSWRFC+DYR
Sbjct: 626  AKAVSVRPFRYPQTQKAEIEKQVTAMLAAGIIQESTSTFSSPVLLVKKKDGSWRFCIDYR 685

Query: 613  ALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHY 672
            ALN  TIPD +PIP++D+LLDEL GAT+FSK+DLKSGYH I V+  +V KTAFRTH+GHY
Sbjct: 686  ALNKVTIPDSFPIPMIDQLLDELHGATVFSKLDLKSGYHQILVKPQNVPKTAFRTHDGHY 745

Query: 673  EFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIY-SSLEEHLHQLAMVLETL 732
            EFLVMPFGL NAPTTFQ++MN++ R +LRKFVLVFFDDIL+Y SSL+EH   L +VL+ L
Sbjct: 746  EFLVMPFGLTNAPTTFQALMNEVFRAHLRKFVLVFFDDILVYSSSLQEHQEHLRVVLQIL 805

Query: 733  VVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLP 792
               +L AN KKCQF    IEYLGH+IS +GV+ADP+K+ AMV WP PKN+K LRGFLGL 
Sbjct: 806  FQQQLFANKKKCQFGSSSIEYLGHVISGEGVSADPSKLQAMVSWPLPKNIKALRGFLGLT 865

Query: 793  GYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVL 852
            GYYR+FV  YGSIA PLT LLKK KFQW+E A  AF++LK AM +VPVL + DF++ FV+
Sbjct: 866  GYYRRFVQGYGSIAKPLTSLLKKDKFQWSEEATVAFEKLKVAMSTVPVLALVDFSELFVV 925

Query: 853  KTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPF 912
            ++DASG+G+GAVL+Q Q+PVA+FSQAL    + K+VYER+LMAIV A+QKWR +LLG+ F
Sbjct: 926  ESDASGIGLGAVLLQKQKPVAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKF 985

Query: 913  VVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELG 972
            +VRTDQKSLKFLLEQR +  EYQ+W+ K+LG++F I YK G+ENKAADALSR+  L +L 
Sbjct: 986  LVRTDQKSLKFLLEQREVNLEYQQWLTKILGFNFDIHYKPGLENKAADALSRVEGLPQLY 1045

Query: 973  LISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSP 1029
             +SV   +      ++V  N     I+  ++    T  GYS+    L Y+ +LVLP++S 
Sbjct: 1046 ALSVPAAIQLEEINEEVDRNPVSKKIKEEVLLDASTHSGYSVVQGRLLYNGKLVLPKESY 1105

BLAST of CSPI03G23920 vs. ExPASy TrEMBL
Match: A0A5D3BD16 (Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold453G001350 PE=4 SV=1)

HSP 1 Score: 850.1 bits (2195), Expect = 9.5e-243
Identity = 455/972 (46.81%), Postives = 640/972 (65.84%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            +EG+ L W++W ++R +  SW+E ++ L+ RF+    G   AR + ++QE +V EY +R 
Sbjct: 138  MEGKGLCWFRWAENRKRFRSWKELKERLYTRFRNREYGTGCARFLAIKQEGSVGEYLQRF 197

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDDQVVQLKRI 188
            E+ S  L +M++  L   F  GL   I++E+  +  VGLE  M  A++ E+   +  +  
Sbjct: 198  EELSAPLPEMAEDVLVGAFTNGLDPVIRTEVFAMRAVGLEDMMDAARLAEEKLEI-ARAS 257

Query: 189  HGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDF 248
            HG    P   +  S   P     +T S  +  +A     ++N + NS + +T +    D 
Sbjct: 258  HG----PYAKDFKSAQKPAPKNVETPSTKIVTLAERIPASVNQANNSQNGATGMGGRRDT 317

Query: 249  NVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGEDLSGEIDKVAEETED---- 308
                     +RR TDSE++ +++KGLC+RC+E F+ G + ++    +  VA++ ED    
Sbjct: 318  G--------FRRWTDSELQARRDKGLCYRCEEPFSKGHRCKNRELRLCVVADDLEDVEMV 377

Query: 309  ----KNEQIN-TEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVV 368
                + E +  + +  LSL+S+VG ++P T K+KG + N E+V++VD GA HNFIS ++V
Sbjct: 378  DSACEGEMVEVSPVVELSLNSVVGLTAPGTFKLKGTVENQEIVIMVDCGATHNFISLKLV 437

Query: 369  KELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLGVT 428
            + LK+P+     YGV++G+G  V+  G+CK + + +  +SI  DFLPL LG+ D+ LG+ 
Sbjct: 438  ENLKLPMAETTNYGVIMGSGKAVQGRGICKGITVGLPVISIVEDFLPLELGNIDMVLGMQ 497

Query: 429  WLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKEDQGVLIELS 488
            WL+  G +  ++K   M F +G+  VIL+G+ SL + ++SLK ++  ++ +DQG L+   
Sbjct: 498  WLQKQGAMTVDWKALTMTFVVGDTKVILKGDPSLTRMEISLKVLVKTWQPDDQGFLVNFR 557

Query: 489  TVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGARA 548
             +    A+     + D +   + E  ++   FG VFE  + LPP R  DH I+L+ G   
Sbjct: 558  AMGIPKADREL-VVTDAVEEYQSEFAQLQQEFGDVFEMPDGLPPMRRIDHKIQLKEGTDP 617

Query: 549  VNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRALN 608
            +NVRPYRYP  QK+EIE+LV  ML + II+PS S FSSPV+LVKKKDG WRFCVDYRALN
Sbjct: 618  INVRPYRYPHAQKNEIERLVNDMLASGIIRPSTSPFSSPVILVKKKDGGWRFCVDYRALN 677

Query: 609  LATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYEFL 668
             AT+PDK+PIP++DELLDEL GA+IFSKIDLKSGYH IRVR  D+ KTAFRTHEGHYEFL
Sbjct: 678  RATVPDKFPIPMIDELLDELSGASIFSKIDLKSGYHQIRVRDEDISKTAFRTHEGHYEFL 737

Query: 669  VMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLVVH 728
            VMPFGL NAP TFQ++MN + RPYLRKF+LVFFDDIL+YS  +E HL  L MV + L  H
Sbjct: 738  VMPFGLTNAPATFQALMNQVFRPYLRKFLLVFFDDILVYSRDVETHLEHLTMVFQLLRQH 797

Query: 729  KLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPGYY 788
             L AN +KC FA D+IEYLGH +S+ GV AD  KI AM++WP PKN++ELRGFLGL GYY
Sbjct: 798  CLFANRQKCHFAKDRIEYLGHWVSAKGVEADQEKIKAMIEWPIPKNIRELRGFLGLTGYY 857

Query: 789  RKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLKTD 848
            R+FVANYG+IA PLT+L KK  F+W+E A KAF+QLK AM+++PVL +PDF   F ++TD
Sbjct: 858  RRFVANYGAIATPLTKLTKKNNFRWSEEATKAFEQLKRAMVTLPVLALPDFQLPFEVETD 917

Query: 849  ASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFVVR 908
            ASG+G+GAVL Q++RP+A+FSQ L  T R K+VYER+LMAIV AV+KWR +LLG  FVV 
Sbjct: 918  ASGIGLGAVLTQNKRPIAYFSQKLSETAREKSVYERELMAIVLAVEKWRHYLLGHRFVVY 977

Query: 909  TDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGLIS 968
            TDQK+L+ +LEQR I    Q+W+ KL+G+DF I Y+ G ENKAADALSR+P   EL  I+
Sbjct: 978  TDQKALRHILEQREIVPGVQKWLMKLIGFDFEIRYRAGPENKAADALSRMPFETELNAIT 1037

Query: 969  VVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPTIP 1028
            V   L+  +   +V  +E L +I   ++        Y+++   L Y  RLV+   S  IP
Sbjct: 1038 VPSLLDITVIEKEVQADEKLKAIFDRIVADPDCVPRYTIRQGKLFYKGRLVISRTSSFIP 1095

BLAST of CSPI03G23920 vs. NCBI nr
Match: XP_028552250.1 (uncharacterized protein LOC114580023 [Dendrobium catenatum])

HSP 1 Score: 884.8 bits (2285), Expect = 7.2e-253
Identity = 485/974 (49.79%), Postives = 646/974 (66.32%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            LEG+AL W++W + R  + SWEEF+ LL  RF+ S +G  + + + L QE TV EYR+  
Sbjct: 258  LEGKALAWFQWLEGRQHVRSWEEFKDLLLHRFRMSSEGTHYEQFVALVQEGTVAEYRKHF 317

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDDQVVQLKRI 188
            E  S+ L+ ++D  LE  F+ GL+  I++ IR ++P GL   M  AQ++ED   V+  R 
Sbjct: 318  ELLSSRLRGITDDLLEGNFMKGLKPHIRAAIRVVDPHGLVKIMETAQLVEDKLKVEPLRR 377

Query: 189  HGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHDF 248
             GV + P+        GP                   TI +     +   +TT  +   F
Sbjct: 378  SGV-SYPTYRAPFMSGGPKA-----------------TIILPHKEVAKERTTTAVNLGGF 437

Query: 249  NVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQ-----------GEDLSGEIDK 308
                      +R+TDSE++ K+ KGLC+RCDEKF  G +           G+    E + 
Sbjct: 438  ----------KRLTDSELKEKRAKGLCYRCDEKFTLGHRCKERTLHVIIVGDSEEEENEG 497

Query: 309  VAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEE 368
             A+E  ++ E  +  +  +SL+S+ G +S  T+K++GEI   +V+VL+D GA HNFI+  
Sbjct: 498  AAKEDGEEGEHPHLAMVEISLNSIAGLTSHSTMKLEGEIAGYKVMVLIDSGATHNFIACR 557

Query: 369  VVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNLG 428
             V+++ +PV      GV+LGTG   R  G CK V LT+       DFL L LGS DV LG
Sbjct: 558  FVEKVGLPVTQGRGVGVILGTGKKERCKGHCKGVTLTLQGEKTEQDFLLLDLGSTDVILG 617

Query: 429  VTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKEDQGVLIE 488
            + WL++LG++  N+K   ME+  G   V LQG+ SL +++V+LKS+    + E +G LIE
Sbjct: 618  MQWLQSLGEMKVNWKKLYMEYGQGGRKVTLQGDPSLCRARVALKSIFKTLRDEGEGYLIE 677

Query: 489  LSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELESGA 548
            L   +Q G E  ++        +  E+Q +L    +VF+    LPP R  +HAI L++G 
Sbjct: 678  L---QQVGTEAEREP-----EVIPEEIQELLQQHSTVFQMPQGLPPVRTREHAIVLKTGV 737

Query: 549  RAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDYRA 608
              ++V+PYRYP  QK+EIE+LV +ML A++IQPS S FSSPVLLVKKKDGSWRFCVDYRA
Sbjct: 738  APISVQPYRYPHAQKEEIERLVTEMLEAQVIQPSVSPFSSPVLLVKKKDGSWRFCVDYRA 797

Query: 609  LNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGHYE 668
            LN  T+ DK+PIPVVDELLDEL GATIFSK+DLKSGYH IR+RA D+ KTAFRTHEGHYE
Sbjct: 798  LNKETVLDKFPIPVVDELLDELGGATIFSKVDLKSGYHQIRMRAEDIQKTAFRTHEGHYE 857

Query: 669  FLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLETLV 728
            FLVMPFGL NAP+TFQ++MN + +PYLR+FVLVFFDDIL+Y+ SL++HLH L +VL TL+
Sbjct: 858  FLVMPFGLTNAPSTFQALMNQVFQPYLRRFVLVFFDDILVYNKSLQDHLHHLGVVLSTLL 917

Query: 729  VHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGLPG 788
             H+L AN KKC FA  ++EYLGHIIS+DGVAADPTKI AMV WP PK++K LRGFLGL G
Sbjct: 918  EHQLYANHKKCSFAQKEVEYLGHIISNDGVAADPTKIEAMVNWPTPKSLKGLRGFLGLTG 977

Query: 789  YYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFVLK 848
            YYR+F+  YGSIA PLT  LKK  FQW E AE A + LK AM S PVL +PDFTQ FV++
Sbjct: 978  YYRRFIKGYGSIASPLTDQLKKDNFQWGEKAEGAMKNLKEAMTSAPVLALPDFTQQFVVE 1037

Query: 849  TDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKPFV 908
            TDAS VG+GAVLMQ+ RP+AFFS+ L    R K+VYER+LMAIV A+QKWRP+LLG+ F+
Sbjct: 1038 TDASRVGLGAVLMQNHRPIAFFSRILSARARLKSVYERELMAIVLAIQKWRPYLLGQRFI 1097

Query: 909  VRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFELGL 968
            VRTDQ+SLK+ LEQR +  E+QRW+AKLLGY+F I+YK G++NKAADALSR+    +L  
Sbjct: 1098 VRTDQRSLKYFLEQRLVAEEHQRWLAKLLGYEFEIQYKPGVQNKAADALSRV-NCSQLLA 1157

Query: 969  ISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDSPT 1028
            +SV   ++    + +    E L  IR ++  G+   +GY L+  +L Y  RLVL  +S  
Sbjct: 1158 LSVPQWVDWGELVKENQQAEELEGIRATIQKGEGGVKGYHLENSLLLYKWRLVLHRESAF 1194

BLAST of CSPI03G23920 vs. NCBI nr
Match: AFK13856.1 (Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris])

HSP 1 Score: 871.7 bits (2251), Expect = 6.3e-249
Identity = 475/981 (48.42%), Postives = 646/981 (65.85%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            +EG+AL WY+W++ R    +WE  +  +  +F+P   G  H + +   Q  +V EYRR+ 
Sbjct: 216  MEGDALRWYQWENKRRPFRNWESMKSFVLTQFRPLNVGSLHEQWLSTTQTASVWEYRRKF 275

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIED-DQVVQLKR 188
             + +  L  + +  L  KF+ GL  E+QSEIR LNP  L+  M +A  +E+ ++V   +R
Sbjct: 276  VETAAPLDGIPEEILMGKFIHGLNPELQSEIRVLNPYNLDQAMELALKLEERNRVNGARR 335

Query: 189  IHGVGANPSLLNKTSGNGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPHD 248
                  + S+ N+    GPN   N +   V      S   T + + NS++S T++ +   
Sbjct: 336  TGPRSGSFSIYNR----GPN--SNPSLPSVYGSQGGSNASTKSWAINSNASQTSVNNAKP 395

Query: 249  FNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQ---------------GEDLS 308
              + +      RR+T+ E++ K+ KGLCF+CDEK+  G Q                ++L 
Sbjct: 396  PPLSSRGFGEMRRLTEKELQEKRAKGLCFKCDEKWGVGHQCRRKELSVLFMEDNEEDELE 455

Query: 309  GEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHN 368
            G +   +E      E+I  E+   SL+S++G S+PKT+K+ G I N EVVV++D GA HN
Sbjct: 456  GALSG-SEAPPSPTEEIPPEV---SLNSVIGLSNPKTMKLSGLIDNHEVVVMIDPGATHN 515

Query: 369  FISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTI-VNLSITHDFLPLPLGS 428
            F+S + + +L IPV   + +GV LG G  VR TG+C++V L +   L +  DFLPL LG+
Sbjct: 516  FLSLKAIDKLGIPVTESEEFGVSLGDGQAVRGTGICRAVALYLDGGLVVVEDFLPLGLGN 575

Query: 429  ADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKED 488
            +DV LGV WLETLG V+ N+K  +M F LG     L G+ +L +S+VSLK+M+   +KE 
Sbjct: 576  SDVILGVQWLETLGTVVSNWKTQKMSFQLGGVPYTLTGDPTLARSKVSLKAMLRTLRKEG 635

Query: 489  QGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAI 548
             G+ +E + VE GGA   +D+  +    + P +Q ++  F  VFE+   LPP R H+HAI
Sbjct: 636  GGLWLECNQVEAGGAGSIRDSKVE--QEIPPFLQELMRRFEGVFETPVGLPPRRGHEHAI 695

Query: 549  ELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRF 608
             L+ G+  V VRPYRYPQFQKDEIE+L+K+ML A IIQPS S FSSPV+LVKKKDGSWRF
Sbjct: 696  VLKEGSNPVGVRPYRYPQFQKDEIERLIKEMLAAGIIQPSTSPFSSPVILVKKKDGSWRF 755

Query: 609  CVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT 668
            CVDYRALN  T+PDKYPIPV+DELLDEL GAT+FSK+DL++GYH I VR  D HKTAFRT
Sbjct: 756  CVDYRALNKETVPDKYPIPVIDELLDELHGATVFSKLDLRAGYHQILVRPEDTHKTAFRT 815

Query: 669  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAM 728
            HEGHYEFLVMPFGL NAP TFQS+MN++ RP+LR+FVLVF DDILIYS S EEH+  L M
Sbjct: 816  HEGHYEFLVMPFGLTNAPATFQSLMNEVFRPFLRRFVLVFLDDILIYSRSDEEHVGHLEM 875

Query: 729  VLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRG 788
            VL  L  H L  N KKC+F   ++ YLGH+IS  GVA D  K+ A+++W  PKN++ELRG
Sbjct: 876  VLGMLAQHALFVNKKKCEFGKREVAYLGHVISEGGVAMDTEKVKAVLEWEVPKNLRELRG 935

Query: 789  FLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFT 848
            FLGL GYYRKFVANY  IA PLT+ LKK  F+W+ TA +AF+QLKSAM+S PVL +P+F 
Sbjct: 936  FLGLTGYYRKFVANYAHIARPLTEQLKKDNFKWSATATEAFKQLKSAMVSAPVLAMPNFQ 995

Query: 849  QGFVLKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHL 908
              FV++TDASG G+GAVLMQ  RP+A++S+ L    + K+VYE++LMAI  AVQKW+ +L
Sbjct: 996  LTFVVETDASGYGMGAVLMQDNRPIAYYSKLLGTRAQLKSVYEKELMAICFAVQKWKYYL 1055

Query: 909  LGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSR-LP 968
            LG+ FVVRTDQ+SL+++ +QR IG E+Q+W++KL+GYDF I YK G+ N+ ADALSR   
Sbjct: 1056 LGRHFVVRTDQQSLRYITQQREIGAEFQKWVSKLMGYDFEIHYKPGLSNRVADALSRKTV 1115

Query: 969  PLFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLV 1028
               ELG I  V  +       ++TG+  L  +R  L  G+ TP  ++L    L +  R V
Sbjct: 1116 GEVELGAIVAVQGVEWAELRREITGDSFLTQVRKELQEGR-TPSHFTLVDGNLLFKGRYV 1175

BLAST of CSPI03G23920 vs. NCBI nr
Match: XP_028552383.1 (uncharacterized protein LOC114580110 [Dendrobium catenatum])

HSP 1 Score: 870.9 bits (2249), Expect = 1.1e-248
Identity = 477/980 (48.67%), Postives = 630/980 (64.29%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R 
Sbjct: 276  LEARALAWYQWTEERQRFRCWAEFREMCLDRFRPPKEGTHHEQFFALTQTGTVSAYRDRF 335

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDD-QVVQLKR 188
            E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L   + MA++IED     Q +R
Sbjct: 336  ELLSSRLRGMTDEVLEGNFMKGLKPHIRSAVRAAKPRSLRETLEMAELIEDRLSSDQYRR 395

Query: 189  IHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPH 248
                G    + N T G  G        G K  DR A ++                     
Sbjct: 396  PTFFGGGQKVANPTGGAKGAYLGAGGEGQKERDRSAPAK--------------------- 455

Query: 249  DFNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPG---------------PQGEDL 308
                       +RR+T++E++ K+ KGLC+RCDEKF PG               P+GE++
Sbjct: 456  ---------GEFRRLTEAELKDKRAKGLCYRCDEKFGPGHRCKDKLLQVLLVEDPEGEEV 515

Query: 309  SGEIDKVAEETEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIH 368
              E+       E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA H
Sbjct: 516  EEELG----GEEEGVDHLHLDMIEVSLNSVAGLTAHSTMKMEGRIGSFTVTVLIDSGATH 575

Query: 369  NFISEEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGS 428
            NFI+  +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+
Sbjct: 576  NFIACRLVEELGIPMIQGRGVGVSLGTGQKEQCAGRCSGVTLSIQGEDITQDFLVLELGN 635

Query: 429  ADVNLGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKED 488
             DV LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +S+V+LK+++   + E 
Sbjct: 636  TDVILGIQWLQTLGEMKVNWKTLMLEYGEGEHRVTLHGDPKLCRSKVALKTILKSLRSEG 695

Query: 489  QGVLIELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAI 548
            +G LIEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI
Sbjct: 696  EGFLIELWRLE--GSEPVEE------QNIPEEVGELLEEFTPVFQMPAGLPPQRSKEHAI 755

Query: 549  ELESGARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRF 608
             L+ GA  V+VRPYRYP  QK+EIEKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRF
Sbjct: 756  VLKGGADPVSVRPYRYPHAQKEEIEKLVREMMEAGVIQPSVSPFSSPVLLVKKKDGSWRF 815

Query: 609  CVDYRALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRT 668
            CVDYRALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRT
Sbjct: 816  CVDYRALNKETVLDKFPIPVIDELLDELGGATMFSKLDLKSGYHQIRMRREDIPKTAFRT 875

Query: 669  HEGHYEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAM 728
            HEGHYEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDDILIYS SL+EHL  L  
Sbjct: 876  HEGHYEFLVMPFGLTNAPSTFQALMNQVFQPMLRRFVLVFFDDILIYSRSLQEHLEHLRK 935

Query: 729  VLETLVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRG 788
            VL TL  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ AM  WP PKN++ LRG
Sbjct: 936  VLNTLQHHQLYVNQKKCSFAQRSVEYLGHIISAEGVAADPSKVEAMTSWPTPKNLRALRG 995

Query: 789  FLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFT 848
            FLGL GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +PDF 
Sbjct: 996  FLGLTGYYRKFIKGYGSIAAPLTEQLKKDSFNWGPEADSAMEALKRAMVSAPVLALPDFK 1055

Query: 849  QGFVLKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHL 908
            Q  V++TDASG+G+GAVLMQ  RP+AF+SQ L    R K+VYER+LMAIV+A+QKWRP+L
Sbjct: 1056 QQLVVETDASGLGLGAVLMQQGRPIAFYSQVLSGRARLKSVYERELMAIVKAIQKWRPYL 1115

Query: 909  LGKPFVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPP 968
            LG+ F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR   
Sbjct: 1116 LGRRFLVRTDQRSLKYLLEQRMVTEEHQRWLSKLLGYDFEIQYKPGLENKAADALSRKVE 1175

Query: 969  LFELGLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVL 1028
              +L   SV   ++      +   +E L S R ++  G+  P GY+++ ++L +  RLVL
Sbjct: 1176 CSQLIATSVPQLVDWGKLRTENMTSEELGSFREAIRKGEEIPAGYTVEDQLLLHKGRLVL 1213

BLAST of CSPI03G23920 vs. NCBI nr
Match: XP_028552640.1 (uncharacterized protein LOC114580166 [Dendrobium catenatum])

HSP 1 Score: 870.5 bits (2248), Expect = 1.4e-248
Identity = 478/976 (48.98%), Postives = 631/976 (64.65%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R 
Sbjct: 276  LEARALAWYQWTEERQRFRCWAEFREMCLDRFRPPKEGTHHEQFFALTQTGTVSAYRDRF 335

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDD-QVVQLKR 188
            E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L   + MA++IED     Q +R
Sbjct: 336  ELLSSRLRGMTDEVLEGNFMKGLKPHIRSAVRAAKPRSLRETLEMAELIEDRLSSDQYRR 395

Query: 189  IHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPH 248
                G    + N T G  G        G K  DR A ++                     
Sbjct: 396  PTFFGGGQKVANPTGGAKGAYLGAGGEGQKERDRSAPAK--------------------- 455

Query: 249  DFNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKV 308
                       +RR+T++E++ K+ KGLC+RCDEKF PG + +D           E ++V
Sbjct: 456  ---------GEFRRLTEAELKDKRAKGLCYRCDEKFGPGHRCKDKLLQVLLVEDPEEEEV 515

Query: 309  AEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS 368
             EE    E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA HNFI+
Sbjct: 516  EEELGGEEEGVDHLHLDMIEVSLNSVAGLTAHSTMKMEGRIGSFTVTVLIDSGATHNFIA 575

Query: 369  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVN 428
              +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV 
Sbjct: 576  CRLVEELGIPMIQGRGVGVSLGTGQKEQCAGRCSGVTLSIQGEDITQDFLVLELGNTDVI 635

Query: 429  LGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKEDQGVL 488
            LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +S+V+LK+++   + E +G L
Sbjct: 636  LGIQWLQTLGEMKVNWKTLMLEYGEGEHRVTLHGDPKLCRSKVALKTILKSLRSEGEGFL 695

Query: 489  IELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELES 548
            IEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ 
Sbjct: 696  IELWRLE--GSEPVEE------QNIPEEVGELLEEFTPVFQMPAGLPPQRSKEHAIVLKG 755

Query: 549  GARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDY 608
            GA  V+VRPYRYP  QK+EIEKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDY
Sbjct: 756  GADPVSVRPYRYPHAQKEEIEKLVREMMEAGVIQPSVSPFSSPVLLVKKKDGSWRFCVDY 815

Query: 609  RALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH 668
            RALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRTHEGH
Sbjct: 816  RALNKETVLDKFPIPVIDELLDELGGATMFSKLDLKSGYHQIRMRREDIPKTAFRTHEGH 875

Query: 669  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLET 728
            YEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDDILIYS SL+EHL  L  VL T
Sbjct: 876  YEFLVMPFGLTNAPSTFQALMNQVFQPMLRRFVLVFFDDILIYSRSLQEHLEHLRKVLNT 935

Query: 729  LVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGL 788
            L  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ AM  WP PKN++ LRGFLGL
Sbjct: 936  LQHHQLYVNQKKCSFAQRSVEYLGHIISAEGVAADPSKVEAMTSWPTPKNLRALRGFLGL 995

Query: 789  PGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFV 848
             GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +PDF Q  V
Sbjct: 996  TGYYRKFIKGYGSIAAPLTEQLKKDSFNWGPEADSAMEALKRAMVSAPVLALPDFKQQLV 1055

Query: 849  LKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKP 908
            ++TDASG+G+GAVLMQ  RP+AF+SQ L    R K+VYER+LMAIV+A+QKWRP+LLG+ 
Sbjct: 1056 VETDASGLGLGAVLMQQGRPIAFYSQVLSGRARLKSVYERELMAIVKAIQKWRPYLLGRR 1115

Query: 909  FVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL 968
            F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR     +L
Sbjct: 1116 FLVRTDQRSLKYLLEQRMVTEEHQRWLSKLLGYDFEIQYKPGLENKAADALSRKVECSQL 1175

Query: 969  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDS 1028
               SV   ++      +   +E L S R ++  G+  P GY+++ ++L +  RLVLP  S
Sbjct: 1176 IATSVPQLVDWGKLRTENMTSEELGSFREAIRKGEEIPAGYTVEDQLLLHKGRLVLPRTS 1213

BLAST of CSPI03G23920 vs. NCBI nr
Match: XP_028548251.1 (uncharacterized protein LOC110111203 [Dendrobium catenatum])

HSP 1 Score: 870.5 bits (2248), Expect = 1.4e-248
Identity = 478/976 (48.98%), Postives = 631/976 (64.65%), Query Frame = 0

Query: 69   LEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGDKHARSMKLQQETTVREYRRRV 128
            LE  AL WY+W ++R +   W EFR++   RF+P  +G  H +   L Q  TV  YR R 
Sbjct: 276  LEARALAWYQWTEERQRFRCWAEFREMCLDRFRPPKEGTHHEQFFALTQTGTVSAYRDRF 335

Query: 129  EQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGLEAKMLMAQVIEDD-QVVQLKR 188
            E  S+ L+ M+D  LE  F+ GL+  I+S +R   P  L   + MA++IED     Q +R
Sbjct: 336  ELLSSRLRGMTDEVLEGNFMKGLKPHIRSAVRAAKPRSLRETLEMAELIEDRLSSDQYRR 395

Query: 189  IHGVGANPSLLNKTSG-NGPNGSGNQTGSKVMDRVATSRTITINPSRNSSSSSTTITSPH 248
                G    + N T G  G        G K  DR A ++                     
Sbjct: 396  PTFFGGGQKVANPTGGAKGAYLGAGGEGQKERDRSAPAK--------------------- 455

Query: 249  DFNVKNSMAQPYRRMTDSEMRMKKEKGLCFRCDEKFNPGPQGEDL--------SGEIDKV 308
                       +RR+T++E++ K+ KGLC+RCDEKF PG + +D           E ++V
Sbjct: 456  ---------GEFRRLTEAELKDKRAKGLCYRCDEKFGPGHRCKDKLLQVLLVEDPEEEEV 515

Query: 309  AEE---TEDKNEQINTEIANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFIS 368
             EE    E+  + ++ ++  +SL+S+ G ++  T+K++G I +  V VL+D GA HNFI+
Sbjct: 516  EEELGGEEEGVDHLHLDMIEVSLNSVAGLTAHSTMKMEGRIGSFTVTVLIDSGATHNFIA 575

Query: 369  EEVVKELKIPVETLDAYGVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVN 428
              +V+EL IP+      GV LGTG   +  G C  V L+I    IT DFL L LG+ DV 
Sbjct: 576  CRLVEELGIPMIQGRGVGVSLGTGQKEQCAGRCSGVTLSIQGEDITQDFLVLELGNTDVI 635

Query: 429  LGVTWLETLGKVIFNYKLSEMEFSLGEFLVILQGNKSLVKSQVSLKSMI--FKKEDQGVL 488
            LG+ WL+TLG++  N+K   +E+  GE  V L G+  L +S+V+LK+++   + E +G L
Sbjct: 636  LGIQWLQTLGEMKVNWKTLMLEYGEGEHRVTLHGDPKLCRSKVALKTILKSLRSEGEGFL 695

Query: 489  IELSTVEQGGAEESKDNLADCLSNLKPEVQRILLSFGSVFESINQLPPPRDHDHAIELES 548
            IEL  +E  G+E  ++       N+  EV  +L  F  VF+    LPP R  +HAI L+ 
Sbjct: 696  IELWRLE--GSEPVEE------QNIPEEVGELLEEFTPVFQMPAGLPPQRSKEHAIVLKG 755

Query: 549  GARAVNVRPYRYPQFQKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSWRFCVDY 608
            GA  V+VRPYRYP  QK+EIEKLV++M+ A +IQPS S FSSPVLLVKKKDGSWRFCVDY
Sbjct: 756  GADPVSVRPYRYPHAQKEEIEKLVREMMEAGVIQPSVSPFSSPVLLVKKKDGSWRFCVDY 815

Query: 609  RALNLATIPDKYPIPVVDELLDELFGATIFSKIDLKSGYHHIRVRATDVHKTAFRTHEGH 668
            RALN  T+ DK+PIPV+DELLDEL GAT+FSK+DLKSGYH IR+R  D+ KTAFRTHEGH
Sbjct: 816  RALNKETVLDKFPIPVIDELLDELGGATMFSKLDLKSGYHQIRMRREDIPKTAFRTHEGH 875

Query: 669  YEFLVMPFGLKNAPTTFQSVMNDILRPYLRKFVLVFFDDILIYS-SLEEHLHQLAMVLET 728
            YEFLVMPFGL NAP+TFQ++MN + +P LR+FVLVFFDDILIYS SL+EHL  L  VL T
Sbjct: 876  YEFLVMPFGLTNAPSTFQALMNQVFQPMLRRFVLVFFDDILIYSRSLQEHLEHLRKVLNT 935

Query: 729  LVVHKLVANLKKCQFAVDQIEYLGHIISSDGVAADPTKIAAMVKWPAPKNVKELRGFLGL 788
            L  H+L  N KKC FA   +EYLGHIIS++GVAADP+K+ AM  WP PKN++ LRGFLGL
Sbjct: 936  LQHHQLYVNQKKCSFAQRSVEYLGHIISAEGVAADPSKVEAMTSWPTPKNLRALRGFLGL 995

Query: 789  PGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVPVLGIPDFTQGFV 848
             GYYRKF+  YGSIA PLT+ LKK  F W   A+ A + LK AM+S PVL +PDF Q  V
Sbjct: 996  TGYYRKFIKGYGSIAAPLTEQLKKDSFNWGPEADSAMEALKRAMVSAPVLALPDFKQQLV 1055

Query: 849  LKTDASGVGIGAVLMQHQRPVAFFSQALPITHRFKAVYERKLMAIVRAVQKWRPHLLGKP 908
            ++TDASG+G+GAVLMQ  RP+AF+SQ L    R K+VYER+LMAIV+A+QKWRP+LLG+ 
Sbjct: 1056 VETDASGLGLGAVLMQQGRPIAFYSQVLSGRARLKSVYERELMAIVKAIQKWRPYLLGRR 1115

Query: 909  FVVRTDQKSLKFLLEQRAIGGEYQRWIAKLLGYDFVIEYKKGMENKAADALSRLPPLFEL 968
            F+VRTDQ+SLK+LLEQR +  E+QRW++KLLGYDF I+YK G+ENKAADALSR     +L
Sbjct: 1116 FLVRTDQRSLKYLLEQRMVTEEHQRWLSKLLGYDFEIQYKPGLENKAADALSRKVECSQL 1175

Query: 969  GLISVVGRLNPLIFIDQVTGNEALNSIRLSLINGQPTPEGYSLQGEVLCYHDRLVLPEDS 1028
               SV   ++      +   +E L S R ++  G+  P GY+++ ++L +  RLVLP  S
Sbjct: 1176 IATSVPQLVDWGKLRTENMTSEELGSFREAIRKGEEIPAGYTVEDQLLLHKGRLVLPRTS 1213

BLAST of CSPI03G23920 vs. TAIR 10
Match: ATMG00860.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 149.1 bits (375), Expect = 2.0e-35
Identity = 74/134 (55.22%), Postives = 90/134 (67.16%), Query Frame = 0

Query: 703 LHQLAMVLETLVVHKLVANLKKCQFAVDQIEYLG--HIISSDGVAADPTKIAAMVKWPAP 762
           ++ L MVL+    H+  AN KKC F   QI YLG  HIIS +GV+ADP K+ AMV WP P
Sbjct: 1   MNHLGMVLQIWEQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEP 60

Query: 763 KNVKELRGFLGLPGYYRKFVANYGSIALPLTQLLKKGKFQWNETAEKAFQQLKSAMMSVP 822
           KN  ELRGFLGL GYYR+FV NYG I  PLT+LLKK   +W E A  AF+ LK A+ ++P
Sbjct: 61  KNTTELRGFLGLTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLP 120

Query: 823 VLGIPDFTQGFVLK 835
           VL +PD    FV +
Sbjct: 121 VLALPDLKLPFVTR 134

BLAST of CSPI03G23920 vs. TAIR 10
Match: AT3G29750.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 70.9 bits (172), Expect = 6.9e-12
Identity = 44/147 (29.93%), Postives = 78/147 (53.06%), Query Frame = 0

Query: 321 LVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAYGVVLGTGG 380
           ++  +  K ++  G I + +VVV +D GA  NFI  E+   LK+P    +   V+LG   
Sbjct: 115 VIDLTRNKGMRFYGFILDHKVVVAIDSGATDNFILVELAFSLKLPTSITNQASVLLGQRQ 174

Query: 381 VVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVN--LGVTWLETLGKVIFNYKLSEMEF 440
            +++ G C  + L +  + IT +FL L L   DV+  LG  WL  LG+ + N++  +  F
Sbjct: 175 CIQSVGTCLGIRLWVQEVEITENFLLLDLAKTDVDVILGYEWLSKLGETMVNWQNQDFSF 234

Query: 441 SLG-EFLVILQGNKSL--VKSQVSLKS 463
           S   +++ +   ++ L  V ++V +KS
Sbjct: 235 SHNQQWITLCAEHEELEQVTTKVKMKS 261

BLAST of CSPI03G23920 vs. TAIR 10
Match: AT3G30770.1 (Eukaryotic aspartyl protease family protein )

HSP 1 Score: 64.3 bits (155), Expect = 6.5e-10
Identity = 52/180 (28.89%), Postives = 89/180 (49.44%), Query Frame = 0

Query: 313 IANLSLHSLVGFSSPKTIKIKGEIRNCEVVVLVDGGAIHNFISEEVVKELKIPVETLDAY 372
           I  +   S   F+  K ++  G I   +VVV++D GA +NFIS+E+   LK+P  T +  
Sbjct: 267 IRQVKRQSTTEFTKGKDMRFYGFISCHKVVVVIDSGATNNFISDELALVLKLPTSTTNQA 326

Query: 373 GVVLGTGGVVRATGMCKSVNLTIVNLSITHDFLPLPLGSADVNL-----GVTWLETLGKV 432
            V+LG    ++  G C  +NL +  + I  +FL L L   DV++     G   LE    +
Sbjct: 327 SVLLGQRQCIQTIGTCFGINLLVQEVEINENFLLLDLTKTDVDVILGYGGSQNLERQWLI 386

Query: 433 IFNYKLSEMEFSLGEFLVILQGNKSL--VKSQVSLKSMIFKKEDQGVLIELSTVEQGGAE 486
             N   S   F   +++ +   +K L  V ++V +KS  +++E     +E   V +GG++
Sbjct: 387 WLNQDFS--FFHNQQWVTLCAKDKELEQVTTKVKMKSE-YEQEKIDHYLEDKVVLKGGSK 443

BLAST of CSPI03G23920 vs. TAIR 10
Match: AT3G42723.1 (aminoacyl-tRNA ligases;ATP binding;nucleotide binding )

HSP 1 Score: 58.2 bits (139), Expect = 4.7e-08
Identity = 38/132 (28.79%), Postives = 62/132 (46.97%), Query Frame = 0

Query: 48  FGVPCAPNQREIPLFDMRLRKLEGEALDWYKWKDDRTKINSWEEFRQLLWARFKPSGQGD 107
           FG    P Q  + +       LEG+   W K    +    SW+EF+ ++    K + + +
Sbjct: 282 FGENNIPEQERLQIV---YSNLEGDIGQWIKHLWKKNSPTSWKEFKCMMARETKTTMKVN 341

Query: 108 KHARSMKLQQETTVREYRRRVEQFSTSLKDMSDAALESKFVCGLREEIQSEIRKLNPVGL 167
                  +QQE +VREYR R E        +    LE+ F+ GL+  +Q+ +R+L P G+
Sbjct: 342 HQPHYSGIQQEGSVREYRERFEALCLGSVILPGQGLEALFLQGLQPSLQTAVRELKPNGI 401

Query: 168 EAKMLMAQVIED 180
              M  AQ +E+
Sbjct: 402 VQMMDTAQWLEE 410

BLAST of CSPI03G23920 vs. TAIR 10
Match: ATMG00850.1 (DNA/RNA polymerases superfamily protein )

HSP 1 Score: 44.7 bits (104), Expect = 5.3e-04
Identity = 21/39 (53.85%), Postives = 30/39 (76.92%), Query Frame = 0

Query: 549 QKDEIEKLVKKMLLAKIIQPSKSAFSSPVLLVKKKDGSW 588
           ++  ++  + +ML A+IIQPS S +SSPVLLV+KKDG W
Sbjct: 41  RRTRLKNWLGEMLEARIIQPSISPYSSPVLLVQKKDGGW 79

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
P043231.3e-9244.42Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast... [more]
P208256.6e-9243.48Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste... [more]
Q7LHG55.2e-8939.89Transposon Ty3-I Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q993151.2e-8839.89Transposon Ty3-G Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Q8I7P93.0e-8439.86Retrovirus-related Pol polyprotein from transposon opus OS=Drosophila melanogast... [more]
Match NameE-valueIdentityDescription
J3SDF53.0e-24948.42Ty3/gypsy retrotransposon protein OS=Beta vulgaris subsp. vulgaris OX=3555 PE=4 ... [more]
A0A2I0X1325.7e-24848.67Putative mitochondrial protein OS=Dendrobium catenatum OX=906689 GN=MA16_Dca0130... [more]
A0A2I0WN127.0e-24648.71Putative mitochondrial protein OS=Dendrobium catenatum OX=906689 GN=MA16_Dca0016... [more]
A0A087GEK85.6e-24348.82Uncharacterized protein OS=Arabis alpina OX=50452 GN=AALP_AA8G499800 PE=4 SV=1[more]
A0A5D3BD169.5e-24346.81Ty3/gypsy retrotransposon protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567... [more]
Match NameE-valueIdentityDescription
XP_028552250.17.2e-25349.79uncharacterized protein LOC114580023 [Dendrobium catenatum][more]
AFK13856.16.3e-24948.42Ty3/gypsy retrotransposon protein [Beta vulgaris subsp. vulgaris][more]
XP_028552383.11.1e-24848.67uncharacterized protein LOC114580110 [Dendrobium catenatum][more]
XP_028552640.11.4e-24848.98uncharacterized protein LOC114580166 [Dendrobium catenatum][more]
XP_028548251.11.4e-24848.98uncharacterized protein LOC110111203 [Dendrobium catenatum][more]
Match NameE-valueIdentityDescription
ATMG00860.12.0e-3555.22DNA/RNA polymerases superfamily protein [more]
AT3G29750.16.9e-1229.93Eukaryotic aspartyl protease family protein [more]
AT3G30770.16.5e-1028.89Eukaryotic aspartyl protease family protein [more]
AT3G42723.14.7e-0828.79aminoacyl-tRNA ligases;ATP binding;nucleotide binding [more]
ATMG00850.15.3e-0453.85DNA/RNA polymerases superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 299..319
NoneNo IPR availableGENE3D3.10.10.10HIV Type 1 Reverse Transcriptase, subunit A, domain 1coord: 526..665
e-value: 5.1E-86
score: 289.4
NoneNo IPR availablePFAMPF08284RVP_2coord: 335..424
e-value: 9.1E-14
score: 51.4
NoneNo IPR availableGENE3D3.10.20.370coord: 830..896
e-value: 1.1E-6
score: 30.5
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 19..40
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 196..216
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 229..250
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..18
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..54
NoneNo IPR availablePANTHERPTHR24559TRANSPOSON TY3-I GAG-POL POLYPROTEINcoord: 777..900
coord: 310..691
NoneNo IPR availablePANTHERPTHR24559:SF319SUBFAMILY NOT NAMEDcoord: 777..900
NoneNo IPR availablePANTHERPTHR24559:SF319SUBFAMILY NOT NAMEDcoord: 310..691
NoneNo IPR availableCDDcd00303retropepsin_likecoord: 332..422
e-value: 7.79342E-14
score: 66.206
NoneNo IPR availableCDDcd09274RNase_HI_RT_Ty3coord: 832..947
e-value: 5.24688E-49
score: 167.67
NoneNo IPR availableCDDcd01647RT_LTRcoord: 564..739
e-value: 1.29195E-84
score: 268.695
IPR041577Reverse transcriptase/retrotransposon-derived protein, RNase H-like domainPFAMPF17919RT_RNaseH_2coord: 801..895
e-value: 4.4E-29
score: 100.4
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 605..739
e-value: 5.1E-86
score: 289.4
IPR043128Reverse transcriptase/Diguanylate cyclase domainGENE3D3.30.70.270coord: 749..829
e-value: 7.9E-28
score: 98.3
IPR000477Reverse transcriptase domainPFAMPF00078RVT_1coord: 580..739
e-value: 2.3E-28
score: 99.2
IPR000477Reverse transcriptase domainPROSITEPS50878RT_POLcoord: 561..739
score: 14.184858
IPR005162Retrotransposon gag domainPFAMPF03732Retrotrans_gagcoord: 68..153
e-value: 4.5E-10
score: 39.7
IPR021109Aspartic peptidase domain superfamilyGENE3D2.40.70.10Acid Proteasescoord: 311..444
e-value: 3.7E-16
score: 61.0
IPR021109Aspartic peptidase domain superfamilySUPERFAMILY50630Acid proteasescoord: 324..428
IPR043502DNA/RNA polymerase superfamilySUPERFAMILY56672DNA/RNA polymerasescoord: 509..932

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI03G23920.1CSPI03G23920.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0090304 nucleic acid metabolic process
molecular_function GO:0016787 hydrolase activity
molecular_function GO:0016740 transferase activity