CSPI05G18750 (gene) Wild cucumber (PI 183967)

Name: CSPI05G18750
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative
Location: Chr5 : 19846048 .. 19847529 (-)
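
The location above can be read as a chromosome, a 1-based inclusive start and end, and a strand. As a minimal sketch, assuming that inclusive-coordinate convention (the record does not state it explicitly), the span covered by the gene could be computed as follows. The function name `parse_location` is hypothetical and is not part of any database API.

```python
import re


def parse_location(text):
    """Parse a location such as 'Chr5 : 19846048 .. 19847529 (-)'.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.fullmatch(
        r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "chrom": chrom,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both end points count towards the length.
        "length": end - start + 1,
    }


loc = parse_location("Chr5 : 19846048 .. 19847529 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that convention the feature spans 1482 bases on the minus strand of Chr5.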
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGATTAAGAAGTTACAAGTTTATGGGGATTCTCTCTTGCTAATACATCAACTCAATGGGGAGTGGGAAACCAGAGACTTTAAATTGATTCCATATAACAAGTACATTCGAGAATTGGCTCAAACATTTGAGTCAATTACATTCAAGCATGTCCCACGTGAAAGTAATCAATTAGTATATGCATTGGCTACTTATTCTGCCATGTTTAGTGTGGCCTACAACGAGGAAATTCAGCCTATAAGAATTGAAAAGCGTGAAACACCAGTGTATTGCATGAACGTTGAGCAAGAGTTTGACAGAAAACAATGGTACCACAAAATTAAGCATTACATTAGATGTCGAGAATATCCTTTAGGAGCATTTGAAAATAGTAGACGTACCCTTAGAAAGTTGGTCATGAATTTTTTTCTTAACGGAGAAGTGTTGTACAACAAAAATTATGATATGACTCTTTTAAGATGTGTGGATGCATCAGAGGCTAAAATAATTCTGCAAGAAGTTCATGAGGGAGTTTGCGGAACGCATGCAAATGGACACATGATGGCAAGACAAATTATGCGTGCTGGTTATTATTGGTCGACTATGGGGTCAGATTGCATAAAATACGCAAGAAAATGTCATAAATGTCAAATATATGCTGATAAAATTCATGCTTTGGCTTCACCCTCGCATGTATTAACAGCCACATGGCCTTTCTCTATGTGGAGAATGGATGTAATTGGACCAATTGAACCAAAGGCGTCAAATGGTCATCGATTCATTTTGGTAGCCATAGATTATTTTACTAAATGGGTGGAAGCTACATCGTACAAGAGTGTCTCCAAGCAAGCTGTTGTCAAATTCATACAGAAAGATATTATATGTCGGCACAACCTTCCTGAACGTATAATCACCAATAATGCCAAGAACTTAAACAATAAGTTAATGGAGGATTTATGTAATCAGCTTAAAATTATGCACTTCAATTCTACCCTGTACCACCCTAAGATGAATAGAGCAATAGAGATGGCAAATAAAAATATCAAAAGAATTATTGAAAAGATGACTGTCACGTATAAAGATTGGCATGAGATGTTACCATTTGCATTACACGGTTACCGAACGTCAGTTCACACATCAACTGGGGGAACTCCGTTCTCTTTGATGTATGGCGTAGAAGTTGTTTTGCCTATTGAGGCCGAGGTGCCATCCTTCAGGGTAATCAAAGAAGGAGAACTATAAGAAGCTGAGTGGACACGAGCAAGGCACGAGGAATTAAACCTCATAGAAGAGAAAAGGTTAACTGCGTTGTGTAGAGGACAACTTTATCATAAGAAAATGACTCATGCATACGTTAGGAAAGTTTGAAAATGTTGTTTTCAGGAAGGGGATTTGGTCTTAAAAAAGATGCTGCCATTTCAAAAAGATCATAGAGGAAAGTGGACCCCTAATTATGAGGGACCATACGTAGTAAAGCGACATTCTCTGGTGGAGCTTTGA

mRNA sequence

ATGAAGATTAAGAAGTTACAAGTTTATGGGGATTCTCTCTTGCTAATACATCAACTCAATGGGGAGTGGGAAACCAGAGACTTTAAATTGATTCCATATAACAAGTACATTCGAGAATTGGCTCAAACATTTGAGTCAATTACATTCAAGCATGTCCCACGTGAAAGTAATCAATTAGTATATGCATTGGCTACTTATTCTGCCATGTTTAGTGTGGCCTACAACGAGGAAATTCAGCCTATAAGAATTGAAAAGCGTGAAACACCAGTGTATTGCATGAACGTTGAGCAAGAGTTTGACAGAAAACAATGGTACCACAAAATTAAGCATTACATTAGATGTCGAGAATATCCTTTAGGAGCATTTGAAAATAGTAGACGTACCCTTAGAAAGTTGGTCATGAATTTTTTTCTTAACGGAGAAGTGTTGTACAACAAAAATTATGATATGACTCTTTTAAGATGTGTGGATGCATCAGAGGCTAAAATAATTCTGCAAGAAGTTCATGAGGGAGTTTGCGGAACGCATGCAAATGGACACATGATGGCAAGACAAATTATGCGTGCTGGTTATTATTGGTCGACTATGGGGTCAGATTGCATAAAATACGCAAGAAAATGTCATAAATGTCAAATATATGCTGATAAAATTCATGCTTTGGCTTCACCCTCGCATGTATTAACAGCCACATGGCCTTTCTCTATGTGGAGAATGGATGTAATTGGACCAATTGAACCAAAGGCGTCAAATGGTCATCGATTCATTTTGGTAGCCATAGATTATTTTACTAAATGGGTGGAAGCTACATCGTACAAGAGTGTCTCCAAGCAAGCTGTTGTCAAATTCATACAGAAAGATATTATATGTCGGCACAACCTTCCTGAACGTATAATCACCAATAATGCCAAGAACTTAAACAATAAGTTAATGGAGGATTTATGTAATCAGCTTAAAATTATGCACTTCAATTCTACCCTGTACCACCCTAAGATGAATAGAGCAATAGAGATGGCAAATAAAAATATCAAAAGAATTATTGAAAAGATGACTGTCACGTATAAAGATTGGCATGAGATGTTACCATTTGCATTACACGGTTACCGAACGTCAGTTCACACATCAACTGGGGGAACTCCGTTCTCTTTGATGTATGGCGTAGAAGTTGTTTTGCCTATTGAGGCCGAGGTGCCATCCTTCAGGGAAGGGGATTTGGTCTTAAAAAAGATGCTGCCATTTCAAAAAGATCATAGAGGAAAGTGGACCCCTAATTATGAGGGACCATACGTAGTAAAGCGACATTCTCTGGTGGAGCTTTGA

Coding sequence (CDS)

ATGAAGATTAAGAAGTTACAAGTTTATGGGGATTCTCTCTTGCTAATACATCAACTCAATGGGGAGTGGGAAACCAGAGACTTTAAATTGATTCCATATAACAAGTACATTCGAGAATTGGCTCAAACATTTGAGTCAATTACATTCAAGCATGTCCCACGTGAAAGTAATCAATTAGTATATGCATTGGCTACTTATTCTGCCATGTTTAGTGTGGCCTACAACGAGGAAATTCAGCCTATAAGAATTGAAAAGCGTGAAACACCAGTGTATTGCATGAACGTTGAGCAAGAGTTTGACAGAAAACAATGGTACCACAAAATTAAGCATTACATTAGATGTCGAGAATATCCTTTAGGAGCATTTGAAAATAGTAGACGTACCCTTAGAAAGTTGGTCATGAATTTTTTTCTTAACGGAGAAGTGTTGTACAACAAAAATTATGATATGACTCTTTTAAGATGTGTGGATGCATCAGAGGCTAAAATAATTCTGCAAGAAGTTCATGAGGGAGTTTGCGGAACGCATGCAAATGGACACATGATGGCAAGACAAATTATGCGTGCTGGTTATTATTGGTCGACTATGGGGTCAGATTGCATAAAATACGCAAGAAAATGTCATAAATGTCAAATATATGCTGATAAAATTCATGCTTTGGCTTCACCCTCGCATGTATTAACAGCCACATGGCCTTTCTCTATGTGGAGAATGGATGTAATTGGACCAATTGAACCAAAGGCGTCAAATGGTCATCGATTCATTTTGGTAGCCATAGATTATTTTACTAAATGGGTGGAAGCTACATCGTACAAGAGTGTCTCCAAGCAAGCTGTTGTCAAATTCATACAGAAAGATATTATATGTCGGCACAACCTTCCTGAACGTATAATCACCAATAATGCCAAGAACTTAAACAATAAGTTAATGGAGGATTTATGTAATCAGCTTAAAATTATGCACTTCAATTCTACCCTGTACCACCCTAAGATGAATAGAGCAATAGAGATGGCAAATAAAAATATCAAAAGAATTATTGAAAAGATGACTGTCACGTATAAAGATTGGCATGAGATGTTACCATTTGCATTACACGGTTACCGAACGTCAGTTCACACATCAACTGGGGGAACTCCGTTCTCTTTGATGTATGGCGTAGAAGTTGTTTTGCCTATTGAGGCCGAGGTGCCATCCTTCAGGGAAGGGGATTTGGTCTTAAAAAAGATGCTGCCATTTCAAAAAGATCATAGAGGAAAGTGGACCCCTAATTATGAGGGACCATACGTAGTAAAGCGACATTCTCTGGTGGAGCTTTGA
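
The protein used as the BLAST query below is the conceptual translation of this CDS. A minimal sketch of that translation with the standard genetic code, using only the Python standard library, is shown here. The function name `translate` and the choice to stop at the first stop codon are illustrative assumptions, not the pipeline the database used.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, to_stop=True):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    cds = cds.upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGAAGATTAAGAAGTTACAAGTTTATGGG"))
```

The first ten codons translate to MKIKKLQVYG, which agrees with the start of the query sequence shown in the alignments that begin at query position 1.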
BLAST of CSPI05G18750 vs. Swiss-Prot
Match: YRD6_CAEEL (Uncharacterized protein K02A2.6 OS=Caenorhabditis elegans GN=K02A2.6 PE=3 SV=1)

HSP 1 Score: 85.9 bits (211), Expect = 1.2e-15
Identity = 71/262 (27.10%), Positives = 121/262 (46.18%), Query Frame = 1

Query: 156  VDASEAKIILQEVHEGVCGTHANGHMMARQIMRAGYYWSTMGSDCIKYARKCHKCQIYAD 215
            V  S  KI+L+++HEG  G      +  +Q  R+  +W  + SD     R C+ CQ  + 
Sbjct: 778  VPKSLQKIVLKQLHEGHPGI-----VQMKQKARSFVFWRGLDSDIENMVRHCNNCQENSK 837

Query: 216  KIHALA-SPSHVLTATWPFSMWRMDVIGPIEPKASNGHRFILVAIDYFTKWVEATSYKSV 275
                +  +P  V  A W      +D  GP+     NG  ++LV +D  TK+ E    +S+
Sbjct: 838  MPRVVPLNPWPVPEAPW--KRIHIDFAGPL-----NGC-YLLVVVDAKTKYAEVKLTRSI 897

Query: 276  SKQAVVKFIQKDIICRHNLPERIITNNAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRA 335
            S    +  ++ +I   H  PE II++N   L + L   +C    I H  S +Y+P+ N A
Sbjct: 898  SAVTTIDLLE-EIFSIHGYPETIISDNGTQLTSHLFAQMCQSHGIEHKTSAVYYPRSNGA 957

Query: 336  IEMANKNIKRIIEKMTVTYKDWHEMLPFALHGYRTSVHTS-TGGTPFSLMYGVEVVLPIE 395
             E     +KR I K+        ++L   L  YR + H++  G TP    +G ++   + 
Sbjct: 958  AERFVDTLKRGIAKIKGEGSVNQQILNKFLISYRNTPHSALNGSTPAECHFGRKIRTTMS 1017

Query: 396  AEVPSFREGDLVLKKMLPFQKD 416
              +P+ R   L + K+  +Q++
Sbjct: 1018 LLMPTDRV--LKVPKLTQYQQN 1023
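
The percentages in each HSP header are the matching (or similar) positions divided by the alignment length. A minimal sketch that recomputes them from a header line, assuming the `label = count/length` layout used in this record, follows. The helper name `hsp_percentages` is hypothetical.

```python
import re


def hsp_percentages(header_line):
    """Return the percentage for every 'label = count/length' pair, in order."""
    pairs = re.findall(r"\w+ = (\d+)/(\d+)", header_line)
    return [round(100.0 * int(count) / int(length), 2) for count, length in pairs]


# Counts taken from the first Swiss-Prot HSP above: 71 identical and
# 121 similar positions over an alignment length of 262.
print(hsp_percentages("Identity = 71/262, Similar = 121/262, Query Frame = 1"))
```

For the first Swiss-Prot hit this gives 27.1 and 46.18, matching the values reported in the header.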

BLAST of CSPI05G18750 vs. Swiss-Prot
Match: POL_KORV (Pro-Pol polyprotein OS=Koala retrovirus GN=pro-pol PE=3 SV=1)

HSP 1 Score: 73.2 bits (178), Expect = 8.1e-12
Identity = 56/228 (24.56%), Positives = 104/228 (45.61%), Query Frame = 1

Query: 160 EAKIILQEVHEGVCGTHANGHMMARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHA 219
           + +  +Q +H+    TH     + + + R  ++   + S   +   KC  C +  + +  
Sbjct: 770 QGREFIQRLHQL---THLGPDKLLQLVGRTSFHIPNLQSVVREITSKCQVCAV-TNAVTT 829

Query: 220 LASPSHVLTATWPFSMWRMDVIGPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAV 279
              P        P   W +D    ++P    G+R++LV ID F+ WVEA   K+ +   V
Sbjct: 830 YREPGRRQRGDRPGVYWEVDFT-EVKP-GRYGNRYLLVFIDTFSGWVEAFPTKTETALTV 889

Query: 280 VKFIQKDIICRHNLPERIITNNAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMAN 339
            K I ++I+ R  +P+ + ++N      ++ + L  QL I       Y P+ +  +E  N
Sbjct: 890 CKKILEEILPRFGIPKVLGSDNGPAFVAQVSQGLATQLGIDWKLHCAYRPQSSGQVERMN 949

Query: 340 KNIKRIIEKMTVTY--KDWHEMLPFALHGYRTSVHTSTGGTPFSLMYG 386
           + IK  + K+ +    KDW  +LP AL   R +     G TP+ +++G
Sbjct: 950 RTIKETLTKLALETGGKDWVTLLPLALLRAR-NTPGQFGLTPYEILHG 990

BLAST of CSPI05G18750 vs. Swiss-Prot
Match: POL_MLVMS (Gag-Pol polyprotein OS=Moloney murine leukemia virus (isolate Shinnick) GN=gag-pol PE=1 SV=4)

HSP 1 Score: 70.1 bits (170), Expect = 6.9e-11
Identity = 45/156 (28.85%), Positives = 82/156 (52.56%), Query Frame = 1

Query: 232  PFSMWRMDVIGPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRH 291
            P + W +D    I+P    G++++LV ID F+ W+EA   K  + + V K + ++I  R 
Sbjct: 1448 PGTHWEIDFT-EIKP-GLYGYKYLLVFIDTFSGWIEAFPTKKETAKVVTKKLLEEIFPRF 1507

Query: 292  NLPERIITNNAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTV 351
             +P+ + T+N     +K+ + + + L I       Y P+ +  +E  N+ IK  + K+T+
Sbjct: 1508 GMPQVLGTDNGPAFVSKVSQTVADLLGIDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 1567

Query: 352  T--YKDWHEMLPFALHGYRTSVHTSTGGTPFSLMYG 386
                +DW  +LP AL+  R +     G TP+ ++YG
Sbjct: 1568 ATGSRDWVLLLPLALYRARNTPGPH-GLTPYEILYG 1600

BLAST of CSPI05G18750 vs. Swiss-Prot
Match: POL_FENV1 (Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 GN=pol PE=3 SV=1)

HSP 1 Score: 70.1 bits (170), Expect = 6.9e-11
Identity = 60/231 (25.97%), Positives = 108/231 (46.75%), Query Frame = 1

Query: 160 EAKIILQEVHEGVCGTHANGHMMARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHA 219
           EA  ++Q++H     TH +   +   I +  +     G+   +    C  CQ    +++A
Sbjct: 693 EALAMIQQMH---AWTHLSNQKLKLLIEKTDFLIPKAGTLIEQVTSACKVCQ----QVNA 752

Query: 220 LAS--PSHVLT-ATWPFSMWRMDVIGPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSK 279
            A+  P    T    P   W +D    ++P  + G++++LV +D F+ WVEA   +  + 
Sbjct: 753 GATRVPEGKRTRGNRPGVYWEIDFT-EVKPHYA-GYKYLLVFVDTFSGWVEAYPTRQETA 812

Query: 280 QAVVKFIQKDIICRHNLPERIITNNAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIE 339
             V K I ++I  R  LP+ I ++N     +++ + L   L I       Y P+ +  +E
Sbjct: 813 HMVAKKILEEIFPRFGLPKVIGSDNGPAFVSQVSQGLARTLGINWKLHCAYRPQSSGQVE 872

Query: 340 MANKNIKRIIEKMTVT--YKDWHEMLPFALHGYRTSVHTSTGGTPFSLMYG 386
             N+ IK  + K+T+    KDW  +L  AL   R + +   G TP+ ++YG
Sbjct: 873 RMNRTIKETLTKLTLETGLKDWRRLLSLALLRARNTPN-RFGLTPYEILYG 913

BLAST of CSPI05G18750 vs. Swiss-Prot
Match: POL_MLVFF (Pol polyprotein OS=Friend murine leukemia virus (isolate FB29) GN=pol PE=3 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 9.0e-11
Identity = 44/156 (28.21%), Positives = 82/156 (52.56%), Query Frame = 1

Query: 232  PFSMWRMDVIGPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRH 291
            P + W +D    ++P    G++++LV ID F+ WVEA   K  + + V K + ++I  R 
Sbjct: 914  PGTHWEIDFT-EVKP-GLYGYKYLLVFIDTFSGWVEAFPTKKETAKVVTKKLLEEIFPRF 973

Query: 292  NLPERIITNNAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTV 351
             +P+ + T+N     +K+ + + + L +       Y P+ +  +E  N+ IK  + K+T+
Sbjct: 974  GMPQVLGTDNGPAFVSKVSQTVADLLGVDWKLHCAYRPQSSGQVERMNRTIKETLTKLTL 1033

Query: 352  T--YKDWHEMLPFALHGYRTSVHTSTGGTPFSLMYG 386
                +DW  +LP AL+  R +     G TP+ ++YG
Sbjct: 1034 ATGSRDWVLLLPLALYRARNTPGPH-GLTPYEILYG 1066

BLAST of CSPI05G18750 vs. TrEMBL
Match: A0A061EXZ3_THECC (RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative OS=Theobroma cacao GN=TCM_024700 PE=4 SV=1)

HSP 1 Score: 550.8 bits (1418), Expect = 1.5e-153
Identity = 252/400 (63.00%), Positives = 311/400 (77.75%), Query Frame = 1

Query: 1    MKIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLV 60
            MK   + VYGDS L+I Q+ GEWETRD KL+PY K + EL++ F+ I+F H+PRE N++ 
Sbjct: 1041 MKADAIDVYGDSALVICQMKGEWETRDPKLVPYKKLVIELSKQFKEISFNHLPREENRIA 1100

Query: 61   YALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLG 120
             ALAT +AMF +    +++P  +E RE   +C+NVE+E D + WYH I+ YI+ + YP  
Sbjct: 1101 DALATLAAMFKIKEAADVRPFDLEVREVSAHCLNVEEEVDGRPWYHDIRQYIKHQAYPEN 1160

Query: 121  AFENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGH 180
              +N +RTLR+L M FFL+GEVLY ++ D  LLRCVD +EA  I++EVHEG CG HANGH
Sbjct: 1161 VTDNDKRTLRRLAMGFFLSGEVLYKRSRDQVLLRCVDVAEANKIMKEVHEGTCGAHANGH 1220

Query: 181  MMARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDV 240
            M+ARQIMRAGYYW T+ SDCI +ARKCHKCQ+YAD+IHA  +P HV TA WPFSMW MDV
Sbjct: 1221 MLARQIMRAGYYWLTLESDCINFARKCHKCQVYADRIHAPPAPLHVFTAPWPFSMWGMDV 1280

Query: 241  IGPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITN 300
            IG I PKASNGHRFILVAIDYFTKWVEA SY +V+++ V KFIQK+IICR+ LPERIIT+
Sbjct: 1281 IGLITPKASNGHRFILVAIDYFTKWVEAASYANVTQKVVCKFIQKEIICRYGLPERIITD 1340

Query: 301  NAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEML 360
            NA NLN  +++D+C + KI H NST Y PKMN A+E ANKNIK+I+EKMT  YKDWHE L
Sbjct: 1341 NASNLNGAMVKDVCAKFKIKHHNSTTYRPKMNGAVEAANKNIKKIVEKMTEVYKDWHEKL 1400

Query: 361  PFALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR 401
            PFALH YRTSV TSTG TP+SL+YG E VLP+E E+PS R
Sbjct: 1401 PFALHAYRTSVRTSTGATPYSLVYGAEAVLPVEVEIPSLR 1440

BLAST of CSPI05G18750 vs. TrEMBL
Match: A0A151T1W1_CAJCA (Gypsy retrotransposon integrase-like protein 1 OS=Cajanus cajan GN=KK1_023440 PE=4 SV=1)

HSP 1 Score: 545.8 bits (1405), Expect = 4.8e-152
Identity = 264/441 (59.86%), Positives = 325/441 (73.70%), Query Frame = 1

Query: 2   KIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVY 61
           K K L+VYGDS L+IHQL  EWETRD KLIPY  YI+EL + FE ITF H+PRE NQL  
Sbjct: 334 KAKILEVYGDSALVIHQLKEEWETRDTKLIPYQAYIKELIRKFEKITFNHIPREDNQLAD 393

Query: 62  ALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGA 121
           ALAT S+MF+++ +E++  I+I+ R+ P YC  VE+E D   WY+ IK YI+ R+YP  +
Sbjct: 394 ALATLSSMFTLSNDEDMPLIKIQCRDQPAYCQLVEEEPDGNPWYYDIKKYIKSRQYPPNS 453

Query: 122 FENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHM 181
            EN +RTLR+L M+FFLNGEVLY +N+DM LLRC+DA EA+ I++EVHEG  G H   H 
Sbjct: 454 SENDKRTLRRLAMSFFLNGEVLYKRNHDMVLLRCLDAVEAQQIIKEVHEGSFGNHTKRHA 513

Query: 182 MARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVI 241
           MAR+I+RAGYYW TM +DC KY +K HKCQ YAD I+AL +  +VL++ WPFSMW MDVI
Sbjct: 514 MARKILRAGYYWLTMENDCFKYVKKFHKCQTYADNINALPTSLNVLSSPWPFSMWGMDVI 573

Query: 242 GPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNN 301
           GPIEPKASNGHRFILVAIDYFTKWVEATSY  V++  VVKFI++++ICR+++P RIIT+N
Sbjct: 574 GPIEPKASNGHRFILVAIDYFTKWVEATSYAHVTQNVVVKFIKRELICRYSVPSRIITDN 633

Query: 302 AKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLP 361
             NLNNK+M +LC   KI H NS+ Y PKMN A+E ANKNIK+II+KM VTYKDWHEMLP
Sbjct: 634 GTNLNNKMMTELCVDFKIQHHNSSPYRPKMNGAVEAANKNIKKIIQKMVVTYKDWHEMLP 693

Query: 362 FALHGYRTS--------VHTSTGGTPFSLMYGVEVVLPIEAEV--PSFREGDLVLKKMLP 421
           FALHGY  +                    +Y   +    E +V    FREGDLVLKK+L 
Sbjct: 694 FALHGYMQTRLDQLNLIKEKRLNAICHGQLYQKRLKKAFEKKVHPREFREGDLVLKKILQ 753

Query: 422 FQKDHRGKWTPNYEGPYVVKR 433
            QKD  GKW PNYEGP+VVK+
Sbjct: 754 VQKDRLGKWAPNYEGPFVVKK 774

BLAST of CSPI05G18750 vs. TrEMBL
Match: A0A061EFI0_THECC (RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H OS=Theobroma cacao GN=TCM_010987 PE=4 SV=1)

HSP 1 Score: 538.9 bits (1387), Expect = 5.8e-150
Identity = 269/514 (52.33%), Positives = 335/514 (65.18%), Query Frame = 1

Query: 2    KIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVY 61
            KI  L+VYGDS L+I+QL GEWETRD KL+ Y+KY+ +L + F+ I F H+PRE NQ+  
Sbjct: 503  KIHILEVYGDSALVIYQLRGEWETRDSKLVRYHKYVSKLIENFDEICFNHLPREENQMAD 562

Query: 62   ALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGA 121
            ALA  +AMF V  N +IQPI I  RE P +C +VE+E D K WYH I HY++ ++YP  +
Sbjct: 563  ALAMLAAMFKVGTNVKIQPIMINLRECPAHCFSVEEEIDGKPWYHDIVHYLKFQQYPDQS 622

Query: 122  FENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHM 181
             EN ++T+R+L MNFFL+G +LY ++ D TLLRCVD++EA+ I++EVHEGVCG HA+GH 
Sbjct: 623  SENDKKTIRRLAMNFFLDGNILYKRSRDQTLLRCVDSTEARRIVEEVHEGVCGAHASGHK 682

Query: 182  MARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALA-------------------- 241
            +ARQ+MRAGYYW T+  DCI +ARKCHKCQIYAD+IH  A                    
Sbjct: 683  LARQVMRAGYYWLTLEKDCIDFARKCHKCQIYADRIHTPAXXXXXXXXXXXXXXXXXXXX 742

Query: 242  --------SPSHVLTATWPFSMWRMDVIGPIEPKASNGHRFILVAIDYFTKWVEATSYKS 301
                           + WPFSMW MDVIG I PKASNGHRFILVAIDYFTKWVEA SY +
Sbjct: 743  XXXXXXXXXXXXXXASPWPFSMWGMDVIGLITPKASNGHRFILVAIDYFTKWVEAASYAN 802

Query: 302  VSKQAVVKFIQKDIICRHNLPERIITNNAKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNR 361
            V+++ V KFIQK+IICR+   E IIT+N  NLN  +M+++C + KI H NST Y PKMN 
Sbjct: 803  VTQKVVCKFIQKEIICRYGFSEMIITDNTSNLNGSMMKEVCAKFKIKHHNSTPYRPKMNG 862

Query: 362  AIEMANKNIKRIIEKMTVTYKDWHEMLPFALHGYRTSVHTSTGGTPFSLMYGVEVVLPIE 421
            A+E ANKNIKRIIEKMT  YKDWHE LPFALH YRT+V TSTG TPFSL+YG+E VLPIE
Sbjct: 863  AVEAANKNIKRIIEKMTDIYKDWHEKLPFALHAYRTTVRTSTGATPFSLVYGMEAVLPIE 922

Query: 422  AEVPSFR------------------EGDLVLKK--------------------------- 433
             E+PS R                  + +L+ +K                           
Sbjct: 923  VEIPSLRVLKEVQLEEAEWVNARYEQLNLIEEKRLTALCHGQLYQKRMMRAYDKKAHSRQ 982

BLAST of CSPI05G18750 vs. TrEMBL
Match: A0A061EKZ1_THECC (RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative OS=Theobroma cacao GN=TCM_020600 PE=4 SV=1)

HSP 1 Score: 534.6 bits (1376), Expect = 1.1e-148
Identity = 247/395 (62.53%), Positives = 303/395 (76.71%), Query Frame = 1

Query: 6   LQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVYALAT 65
           + VYGDS L+I Q+ GEWETRD KL+PY K + EL++ F+ I+F H+PRE NQ+  ALAT
Sbjct: 461 IDVYGDSTLVICQMKGEWETRDLKLVPYKKLVTELSKQFKEISFNHLPREDNQIADALAT 520

Query: 66  YSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGAFENS 125
            +AMF +    ++ P  +E RE   +C+NVE+E D K WYH I  YI+ + YP    +N 
Sbjct: 521 LAAMFKIKEATDVLPFDLEVREVSAHCLNVEEEVDGKPWYHDIMQYIKHQTYPENVTDND 580

Query: 126 RRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHMMARQ 185
           +RTLR L M+FFL+ EVLY  + D  LLRCVD +EA  I++EVHEG CG HANGHM+ARQ
Sbjct: 581 KRTLRILAMSFFLSREVLYKTSRDQVLLRCVDIAEANKIMKEVHEGTCGAHANGHMLARQ 640

Query: 186 IMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVIGPIE 245
           IMRAGYYW T+ SDCI +ARKCHKCQ+YAD+IHA  +P HV +A WPFSMW MDVIG I 
Sbjct: 641 IMRAGYYWLTLESDCINFARKCHKCQVYADRIHAPPAPLHVFSAPWPFSMWGMDVIGLIT 700

Query: 246 PKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNNAKNL 305
           PKASNGHRFILVAIDYFTKWVEA SY +V+++ V +FIQK+IICR+ LPERIIT+NA NL
Sbjct: 701 PKASNGHRFILVAIDYFTKWVEAASYANVTQKVVCRFIQKEIICRYGLPERIITDNASNL 760

Query: 306 NNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLPFALH 365
           N  +++++C + KI H NST Y  KMN AIE ANKNIK+I+EKMT  YKDWHE LPFALH
Sbjct: 761 NGAMVKEVCTKFKIKHHNSTTYRLKMNGAIETANKNIKKIVEKMTEVYKDWHEKLPFALH 820

Query: 366 GYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR 401
            YRTSV TSTG TP+SL+YG E VLP+E E+ S R
Sbjct: 821 AYRTSVRTSTGATPYSLVYGAEAVLPVEVEISSLR 855

BLAST of CSPI05G18750 vs. TrEMBL
Match: A0A151QLV6_CAJCA (Retrotransposable element Tf2 OS=Cajanus cajan GN=KK1_048524 PE=4 SV=1)

HSP 1 Score: 510.4 bits (1313), Expect = 2.2e-141
Identity = 234/398 (58.79%), Positives = 302/398 (75.88%), Query Frame = 1

Query: 3   IKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVYA 62
           I KL+V+GDS L+I+QL G+WE R+ KLIPY ++I++L Q F+S+ F+++PRE NQ   A
Sbjct: 19  ILKLKVFGDSALVIYQLRGDWEVRNPKLIPYVEHIKDLTQLFQSVVFEYIPREENQFADA 78

Query: 63  LATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGAF 122
           LAT S+MF++    E+  I I + +   +C+ +++  D   W+H IK Y+   EYP  A 
Sbjct: 79  LATLSSMFALDSEHEMPIINIRRHDKQAFCLLIDEAVDDHPWFHDIKQYLTKGEYPDMAS 138

Query: 123 ENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHMM 182
           +N ++ +R+  M F L GE LY KNYD  LLRCVD  EA+ +++EVHEGV GTH  G  M
Sbjct: 139 DNDKKHIRRRAMGFLLVGETLYKKNYDSVLLRCVDRVEAQNMIKEVHEGVFGTHIPGPAM 198

Query: 183 ARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVIG 242
           A++I+RAGYYWSTM  DC +YARKCHKCQ YAD IH   +P +VL + WPFSMW MDVIG
Sbjct: 199 AKKILRAGYYWSTMEKDCYQYARKCHKCQAYADNIHVPPTPLNVLASPWPFSMWGMDVIG 258

Query: 243 PIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNNA 302
           PIEPKASNGHRFILVAIDYFTKWVEA SY  V+++ V  FI+K+IICR+ +P +IIT+N 
Sbjct: 259 PIEPKASNGHRFILVAIDYFTKWVEAASYAHVTRKVVTSFIRKNIICRYGIPNKIITDNG 318

Query: 303 KNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLPF 362
            NLNNK+ME+LC + KI H NS+ Y PKMN A+E +NKNIK+I++KM V++KDWHEMLPF
Sbjct: 319 SNLNNKIMEELCQEFKIQHHNSSPYRPKMNGAVEASNKNIKKIVQKMVVSFKDWHEMLPF 378

Query: 363 ALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR 401
           ALHGYRTSV TST  TPFSL+YG+E VLP+E E+PS R
Sbjct: 379 ALHGYRTSVRTSTRATPFSLVYGMEAVLPVEVEIPSLR 416

BLAST of CSPI05G18750 vs. NCBI nr
Match: gi|828327848|ref|XP_012573958.1| (PREDICTED: uncharacterized protein LOC101510858 [Cicer arietinum])

HSP 1 Score: 577.0 bits (1486), Expect = 2.8e-161
Identity = 281/486 (57.82%), Positives = 343/486 (70.58%), Query Frame = 1

Query: 2    KIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVY 61
            K K L+VYGDS L+I+QLN EWETRD KLIPY  YI+EL+  F+ ITF HVPRE NQL  
Sbjct: 1689 KAKVLEVYGDSALVINQLNQEWETRDKKLIPYFTYIKELSLEFDKITFHHVPREDNQLAD 1748

Query: 62   ALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGA 121
            ALAT S+MF +  N+EI  I++E R+ P YC  +E+E D K WYH IKHY+  REYP G 
Sbjct: 1749 ALATLSSMFQINRNDEIPSIKMESRDYPAYCHVMEEETDGKPWYHDIKHYLINREYPPGI 1808

Query: 122  FENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHM 181
             EN +RTLR+L  +FF+N  +LY +N+DM LLRCVD +EAK ILQ++H+G  G H NGH 
Sbjct: 1809 SENEKRTLRRLSASFFVNENILYKRNHDMVLLRCVDVNEAKEILQDIHDGSYGIHMNGHA 1868

Query: 182  MARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVI 241
            M+R+I+RAGYYW T+  DC  Y +KC+KCQIYAD IHA   P + L+A WPFSMW +DVI
Sbjct: 1869 MSRKILRAGYYWLTLEKDCYNYVKKCYKCQIYADNIHAPPVPLNTLSAPWPFSMWGIDVI 1928

Query: 242  GPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNN 301
            G IEPKASNGHRFILVAIDYFTKWVEA SY +V+K  VVKFI++++ICR+ LP +IIT+N
Sbjct: 1929 GMIEPKASNGHRFILVAIDYFTKWVEAASYANVTKNVVVKFIKRELICRYGLPSKIITDN 1988

Query: 302  AKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLP 361
            A NLNNK+M++LC   KI H NS+ Y PKMN A+E ANKNIK+II+KM +TYKDWHEMLP
Sbjct: 1989 ATNLNNKMMDELCATFKIQHHNSSPYRPKMNGAVEAANKNIKKIIQKMVITYKDWHEMLP 2048

Query: 362  FALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR------------------EGD 421
            FALHGYRTSV TSTG TPFSL+YG+E VLPIE E+PS R                  + +
Sbjct: 2049 FALHGYRTSVRTSTGATPFSLVYGMEAVLPIEVEIPSLRVLMEAELEESEWIQTCFYQLN 2108

Query: 422  LVLKK-------------------------------------MLPFQKDHRGKWTPNYEG 433
            L+ +K                                     +LP QKD+RGKWTPNYEG
Sbjct: 2109 LIEEKRLAALCHGQLYQKRLKKAYEKKIRPREFQEGDLVLKKILPIQKDYRGKWTPNYEG 2168

BLAST of CSPI05G18750 vs. NCBI nr
Match: gi|828329454|ref|XP_012574247.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101497477 [Cicer arietinum])

HSP 1 Score: 575.5 bits (1482), Expect = 8.0e-161
Identity = 280/486 (57.61%), Positives = 343/486 (70.58%), Query Frame = 1

Query: 2    KIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVY 61
            K+K L+VYGDS L+I+QLN EWETRD KLIPY  YI+EL+  F+ ITF HVPRE NQL  
Sbjct: 1605 KVKVLEVYGDSALVINQLNQEWETRDKKLIPYFTYIKELSLEFDKITFHHVPREDNQLAD 1664

Query: 62   ALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGA 121
            ALAT S+MF +  N+EI  I++E R+ P YC  +E+E D K WYH IKHY+  REYP G 
Sbjct: 1665 ALATLSSMFQINRNDEIPXIKMESRDYPAYCHVMEEETDGKPWYHDIKHYLINREYPPGI 1724

Query: 122  FENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHM 181
             EN +RTLR+L   FF+N  +LY +N DM LL+CVD +EAK ILQ++H+G  G H NGH 
Sbjct: 1725 SENEKRTLRRLSARFFVNENILYKRNNDMVLLKCVDVNEAKXILQDIHDGSYGIHMNGHA 1784

Query: 182  MARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVI 241
            M+R+I+RAGYYW T+  DC  Y +KC+KCQIYAD IHA   P + L+A WPFSMW +DVI
Sbjct: 1785 MSRKILRAGYYWLTLEKDCFNYVKKCYKCQIYADNIHAPLVPLNTLSAPWPFSMWGIDVI 1844

Query: 242  GPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNN 301
            G IEPKASNGHRFILVA+DYFTKWVEATSY +V+K  VVKFI++++ICR+ L  +IIT+N
Sbjct: 1845 GMIEPKASNGHRFILVAVDYFTKWVEATSYANVTKNVVVKFIKRELICRYGLLSKIITDN 1904

Query: 302  AKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLP 361
            A NLNNK+M++LC   KI H NS+ Y PKMN A+E ANKNIK+II+KM +TYKDWHEMLP
Sbjct: 1905 ATNLNNKMMDELCVTFKIQHHNSSPYRPKMNGAVEAANKNIKKIIQKMVITYKDWHEMLP 1964

Query: 362  FALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR------------------EGD 421
            FALHGYRTSV TSTG TPFSL+YG+EVVLPIE E+PS R                  + +
Sbjct: 1965 FALHGYRTSVRTSTGATPFSLVYGMEVVLPIEVEIPSLRVLMESKLEESEWVQTRFDQLN 2024

Query: 422  LVLKK-------------------------------------MLPFQKDHRGKWTPNYEG 433
            L+ +K                                     +LP QKD+RGKWTPNYEG
Sbjct: 2025 LIEEKRLAALCHGQLYQKRLKKAYEKKIYPREFQEGDLVLKKILPIQKDYRGKWTPNYEG 2084

BLAST of CSPI05G18750 vs. NCBI nr
Match: gi|828334754|ref|XP_012575240.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105852935 [Cicer arietinum])

HSP 1 Score: 572.4 bits (1474), Expect = 6.8e-160
Identity = 260/399 (65.16%), Positives = 323/399 (80.95%), Query Frame = 1

Query: 2    KIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVY 61
            KIK L+VYGDS L+IHQ  GEWETRD KL+PY+ YI+EL + FE ITF H+PRE NQL  
Sbjct: 1007 KIKVLEVYGDSSLVIHQTKGEWETRDSKLVPYHTYIKELVENFEHITFHHIPREENQLAD 1066

Query: 62   ALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGA 121
            ALAT S+MF ++  +++  I+I++++ P YC+++E+E D K W++ IK Y++ REYP G 
Sbjct: 1067 ALATLSSMFKISVGQDVPVIKIQQKDKPAYCLSIEEELDSKPWFYDIKSYVKNREYPSGI 1126

Query: 122  FENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHM 181
             EN +R LR+L MNFFLNG+VLY +N+DM LLRCVD +EA+ I+QEVHEG  GTHANGH 
Sbjct: 1127 SENDKRVLRRLSMNFFLNGDVLYKRNHDMVLLRCVDQAEAEKIIQEVHEGSFGTHANGHA 1186

Query: 182  MARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVI 241
            MAR+I+RAGYYW TM SDC  + +KCHKCQIYAD++H   +  +VLT+ WPFSMW MDVI
Sbjct: 1187 MARKILRAGYYWLTMKSDCFSHVKKCHKCQIYADRVHVSPTSLNVLTSPWPFSMWGMDVI 1246

Query: 242  GPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNN 301
            G IEPKASNGHRFILVAIDYFTKWVEA SY +V++  VVKFI++ +ICR+ LP +IIT+N
Sbjct: 1247 GLIEPKASNGHRFILVAIDYFTKWVEAASYANVTRSVVVKFIKRKLICRYGLPNKIITDN 1306

Query: 302  AKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLP 361
            A NLNNK+M++LC+  KI H NS+ Y PKMN A+E ANKNIK+II+KM  TYKDWHEMLP
Sbjct: 1307 ATNLNNKMMKELCDSFKIQHHNSSPYRPKMNGAVEAANKNIKKIIQKMVETYKDWHEMLP 1366

Query: 362  FALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR 401
            FALHGYRTSV TSTG TPFSL+YG+E VLPIE E+PS R
Sbjct: 1367 FALHGYRTSVRTSTGATPFSLVYGMEAVLPIEVEIPSIR 1405

BLAST of CSPI05G18750 vs. NCBI nr
Match: gi|828327855|ref|XP_012573961.1| (PREDICTED: uncharacterized protein LOC101511496 [Cicer arietinum])

HSP 1 Score: 571.6 bits (1472), Expect = 1.2e-159
Identity = 260/398 (65.33%), Positives = 324/398 (81.41%), Query Frame = 1

Query: 3    IKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVYA 62
            +K L+VYGDSLL+IHQ  G+WETRD KLIPY+ +I+EL + FE ITF H+PRE NQL  A
Sbjct: 1758 VKFLEVYGDSLLVIHQTKGDWETRDSKLIPYHTHIKELTEQFEKITFHHIPREENQLADA 1817

Query: 63   LATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGAF 122
            LAT S+MF +  N+++  I+I++R+ P YC+++E+E D K W++ IK Y++ +EYPLG  
Sbjct: 1818 LATLSSMFKITTNQDVPVIKIQQRDKPAYCLSIEEELDGKPWFYDIKSYVKNKEYPLGIS 1877

Query: 123  ENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHMM 182
            EN +R LR+L MNFFLNG+VLY +N+DM LLRCVD +EA  I+QEVHEG  GTHANGH M
Sbjct: 1878 ENDKRVLRRLSMNFFLNGDVLYKRNHDMVLLRCVDKAEAGKIIQEVHEGSFGTHANGHTM 1937

Query: 183  ARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVIG 242
            AR+I+RAGYYW TM SDC  + +KCHKCQIYADKIH   +  +VLT+ WPFSMW MDVIG
Sbjct: 1938 ARKILRAGYYWLTMESDCFSHVKKCHKCQIYADKIHVPPTSLNVLTSPWPFSMWGMDVIG 1997

Query: 243  PIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNNA 302
             IEPKASNGHRFILVAIDYFTKWVEA SY +V++  VV+FI++++ICR+ LP +IIT+NA
Sbjct: 1998 LIEPKASNGHRFILVAIDYFTKWVEAASYANVTRSVVVRFIKRELICRYGLPNKIITDNA 2057

Query: 303  KNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLPF 362
             NLNNK+M++LC+  KI H NS+ Y PKMN A+E ANKNIK+II+KM  TYKDWHEMLPF
Sbjct: 2058 TNLNNKMMKELCDNFKIQHHNSSPYRPKMNGAVEAANKNIKKIIQKMVETYKDWHEMLPF 2117

Query: 363  ALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPSFR 401
            ALHGYRTSV TSTG TPFSL+YG+E VLPIE E+PS +
Sbjct: 2118 ALHGYRTSVRTSTGATPFSLVYGMEAVLPIEVEIPSIK 2155

BLAST of CSPI05G18750 vs. NCBI nr
Match: gi|828335798|ref|XP_012575508.1| (PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105853036 [Cicer arietinum])

HSP 1 Score: 570.5 bits (1469), Expect = 2.6e-159
Identity = 280/486 (57.61%), Positives = 341/486 (70.16%), Query Frame = 1

Query: 2   KIKKLQVYGDSLLLIHQLNGEWETRDFKLIPYNKYIRELAQTFESITFKHVPRESNQLVY 61
           K K L+VYGDS L+I+QLN EWETRD KLIPY  YI+EL+  F+ ITF HVPRE NQL  
Sbjct: 499 KAKVLEVYGDSALVINQLNQEWETRDKKLIPYFTYIKELSLEFDKITFHHVPREDNQLAD 558

Query: 62  ALATYSAMFSVAYNEEIQPIRIEKRETPVYCMNVEQEFDRKQWYHKIKHYIRCREYPLGA 121
           ALAT S+MF +  N+EI  I++E R+ P YC  +E+E D K WYH IKHY+  REYP G 
Sbjct: 559 ALATLSSMFQINRNDEIPSIKMESRDYPAYCHVMEEETDGKPWYHDIKHYLINREYPPGI 618

Query: 122 FENSRRTLRKLVMNFFLNGEVLYNKNYDMTLLRCVDASEAKIILQEVHEGVCGTHANGHM 181
            EN +RTLR+L  +FF+N  +LY +N+DM LLRCVD +EAK ILQ++H+G  G H NGH 
Sbjct: 619 SENEKRTLRRLSASFFVNENILYKRNHDMVLLRCVDVNEAKEILQDIHDGSYGIHMNGHA 678

Query: 182 MARQIMRAGYYWSTMGSDCIKYARKCHKCQIYADKIHALASPSHVLTATWPFSMWRMDVI 241
           M+R+I+RAGYYW T+  DC  Y +KC+KCQIYAD IHA   P + L+A W FSMW +DVI
Sbjct: 679 MSRKILRAGYYWLTLEKDCYNYVKKCYKCQIYADNIHAPPVPLNTLSAPWTFSMWGIDVI 738

Query: 242 GPIEPKASNGHRFILVAIDYFTKWVEATSYKSVSKQAVVKFIQKDIICRHNLPERIITNN 301
           G IEPKASNGHRFILVAIDYFTKWVEA SY +V+K  VVKFI++++ICR+ LP +IIT+N
Sbjct: 739 GMIEPKASNGHRFILVAIDYFTKWVEAASYANVTKNVVVKFIKRELICRYGLPSKIITDN 798

Query: 302 AKNLNNKLMEDLCNQLKIMHFNSTLYHPKMNRAIEMANKNIKRIIEKMTVTYKDWHEMLP 361
           A NLNNK+M++LC   KI H NS+ Y PKMN AIE ANKNIK+II+KM +TYKDWHEMLP
Sbjct: 799 ATNLNNKMMDELCATFKIQHHNSSPYRPKMNGAIEAANKNIKKIIQKMVITYKDWHEMLP 858

Query: 362 FALHGYRTSVHTSTGGTPFSLMYGVEVVLPIEAEVPS------------------FREGD 421
           FALHGYRTSV TSTG TPFSL+YG+E VLPIE E+PS                  F + +
Sbjct: 859 FALHGYRTSVRTSTGATPFSLVYGMEAVLPIEVEIPSLKVLMEAKLEESEWIQTRFDQLN 918

Query: 422 LVLKK-------------------------------------MLPFQKDHRGKWTPNYEG 433
           L+ +K                                     +LP QKD+RGKWTPNYEG
Sbjct: 919 LIEEKRLATLCHGQLDQKRLKKAYEKKIRPREFQEGDQVLKKILPIQKDYRGKWTPNYEG 978

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name    E-value    Identity (%)    Description
YRD6_CAEEL    1.2e-15    27.10    Uncharacterized protein K02A2.6 OS=Caenorhabditis elegans GN=K02A2.6 PE=3 SV=1
POL_KORV    8.1e-12    24.56    Pro-Pol polyprotein OS=Koala retrovirus GN=pro-pol PE=3 SV=1
POL_MLVMS    6.9e-11    28.85    Gag-Pol polyprotein OS=Moloney murine leukemia virus (isolate Shinnick) GN=gag-p...
POL_FENV1    6.9e-11    25.97    Pol polyprotein (Fragment) OS=Feline endogenous virus ECE1 GN=pol PE=3 SV=1
POL_MLVFF    9.0e-11    28.21    Pol polyprotein OS=Friend murine leukemia virus (isolate FB29) GN=pol PE=3 SV=1

vs. TrEMBL
Match Name    E-value    Identity (%)    Description
A0A061EXZ3_THECC    1.5e-153    63.00    RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative OS...
A0A151T1W1_CAJCA    4.8e-152    59.86    Gypsy retrotransposon integrase-like protein 1 OS=Cajanus cajan GN=KK1_023440 PE...
A0A061EFI0_THECC    5.8e-150    52.33    RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H OS=Theobroma...
A0A061EKZ1_THECC    1.1e-148    62.53    RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative OS...
A0A151QLV6_CAJCA    2.2e-141    58.79    Retrotransposable element Tf2 OS=Cajanus cajan GN=KK1_048524 PE=4 SV=1

vs. NCBI nr
Match Name    E-value    Identity (%)    Description
gi|828327848|ref|XP_012573958.1|    2.8e-161    57.82    PREDICTED: uncharacterized protein LOC101510858 [Cicer arietinum]
gi|828329454|ref|XP_012574247.1|    8.0e-161    57.61    PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101497477 [Cicer arie...
gi|828334754|ref|XP_012575240.1|    6.8e-160    65.16    PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105852935 [Cicer arie...
gi|828327855|ref|XP_012573961.1|    1.2e-159    65.33    PREDICTED: uncharacterized protein LOC101511496 [Cicer arietinum]
gi|828335798|ref|XP_012575508.1|    2.6e-159    57.61    PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC105853036 [Cicer arie...

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term    Definition
IPR001584    Integrase_cat-core
IPR012337    RNaseH-like_sf
Vocabulary: Biological Process
Term    Definition
GO:0015074    DNA integration
Vocabulary: Molecular Function
Term    Definition
GO:0003676    nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
biological_process GO:0090304 nucleic acid metabolic process
biological_process GO:0051252 regulation of RNA metabolic process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003824 catalytic activity
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name    Type
CSPI05G18750.1    CSPI05G18750.1    mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term    IPR Description    Source    Source Term    Source Description    Alignment
IPR001584    Integrase, catalytic core    PFAM    PF00665    rve    coord: 232..344, score: 3.4
IPR001584    Integrase, catalytic core    PROFILE    PS50994    INTEGRASE    coord: 228..387, score: 20
IPR012337    Ribonuclease H-like domain    GENE3D    G3DSA:3.30.420.10    (none)    coord: 231..391, score: 2.9
IPR012337    Ribonuclease H-like domain    unknown    SSF53098    Ribonuclease H-like    coord: 3..69, score: 3.22E-7; coord: 230..392, score: 2.43
None    No IPR available    PANTHER    PTHR24559    FAMILY NOT NAMED    coord: 142..430, score: 2.8
None    No IPR available    PFAM    PF13456    RVT_3    coord: 2..65, score: 6.2

The following gene(s) are orthologous to this gene:
Gene    Orthologue    Organism    Block
CSPI05G18750    Cucsa.011650    Cucumber (Gy14) v1    cgycpiB296
The following gene(s) are paralogous to this gene:

None