CSPI05G14890 (gene) Wild cucumber (PI 183967)

Name: CSPI05G14890
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Transposon Ty3-I Gag-Pol polyprotein
Location: Chr5 : 15622569 .. 15626029 (-)
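
The location value above can be split into chromosome, coordinates, and strand for downstream use. The following is a minimal sketch, not part of the original record: the names `GeneLocation` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GeneLocation(NamedTuple):
    """Parsed pieces of a 'Chr : start .. end (strand)' location string."""
    chromosome: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GeneLocation:
    """Parse a location such as 'Chr5 : 15622569 .. 15626029 (-)'."""
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = match.groups()
    return GeneLocation(chrom, int(start), int(end), strand)


loc = parse_location("Chr5 : 15622569 .. 15626029 (-)")
print(loc.chromosome, loc.start, loc.end, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3461 bp on the minus strand of Chr5.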
The following sequences are available for this feature:

Gene sequence (with intron)

ATGAAGGTACATTGGTCATCTTTAACCATGACTTTTTGGGCAAAAGGCAAGCAAATTGTGCTTAAGGGAGATCCATCTCTGATTAAGGCAGAGTGTTTCTTGACGACATTAGAGAAAACTTGGGATATAGAAGATCAAGGATTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAGGATAATTATGAAAAAGAAACAGAGGAAAAGAGAGATGAGGAAGAATTACCCATGACTCAATCCTTGCTCAAGTATTATGTTGAGATATTTGAGACACCAAAGAGCCTACCTCCAAAAAGGGCTATTGATCACCGTATACTAACTCTACTGAAACATAGACCTATCAACGTTAGACCTTACAAATATGGTCATGTGTAGAAGGAAGAAATCGAAAAGCTAGTAATCGAAATACTTCAGGTAGGGGTGATCAAACCTAGTCATAGGCCTACAACCCTTACTTTAGTCCGGTGTTGTTAGTAAAGAAGGATGGGGGATGGAGATTTTGTGTCGATTATCACAAGCTAAACCAAGTAACTGTGTCTGATAAGTTCCCAATCCCGGTGATTGAAGAGTTATTAGATGAATTGAATGGAGCAACAGTTTTTTTCCAAATTAGATTTGAAATCTGGTTATCACCAGATAAGAATGAAAGAGGAAAGATGTGGAGAAAATAGCGTTCCGCACACATGAGGGACACTACGAGTTCCTAGTTATGTCGTTCGCCCTCACAAATGCCCCCGCCACCTTCCAGTCATTGATGAATCAGGTATTTAAGTCTTTTCTTAGATGCTGTGTTTTGATCTTTTTTAATGACATCCTAGTTTATAGTGCGGATATAGACGAACATGCAAAGCATCTAGGCATGGTCTTTAACGTATTGAAGGATAACCAATTATTTGCTAACAAGAATAAGTGTGTGATAGCCCATTCTCAAATCCAATATTTGGGACATATGATATCCAAGAGGGGAGTAGAGGCCGATGAGGATAAAATTAGGAGTATGATAAATTGACCACCACTAAAAGACGTCTCTAGGTTAAGGGGTTTCTTGGGACTGACGGGCTATTATAGGAGGTTTGTTAAGGGCTATGAGGAGATAGCTGCACCACTGACCAAGTTATTACAAAAAAATGCCTTCAAATGGAGTGAGGAATCAACAACTGCCTTCGAAAATCTTAAGCTAGCAATGAATACCTTACTGATTTTGGCTTTACCAAATTGGAACATACCTTTCACTATTGAAACTGATGCGTCTGGTGTGAGATTAGAAGCTGTGATTTCTCAAAATGGCCATCCAATTGCTTTATTCAGCAAAATATTATCTACAAGAGCCCAAAACAAATTTATTTATGCAAGGGAATTGATGACTGTAGTTCTTTCCGTACAAAAGTGGAGACATCACTTGCTTGGGAGAAAGTTCACCATAATTTCGGATCAAACTCTCAAATTCTTATTAGAACAAAGAGAGGTGCAGCCCCAATTCCTGGAATGTTTGACAAAGCTCCTTGGCTATGATTTTGAGATACTATACCAGCCAGAACTTCAAAACAAGATTGGTGATGCTCTATCAAGAATGGAACAACCTTTCGAACTGAACAGTATGACAACTACCGAAATTATTGATGTGGAACTGATCTGAAGGAAGTTGGTATGGATGAAGAACTTAAAACAATTATAGAGGAATTGAAGAAGAACTTTTCGGAAGGAGGTAAGTTTCAGGTGGTGAATGGGAGGCTACTTTATAAAGGAAGACTGGTAGGATTTAAAACCTCTTCTCTTATACCAAAGATACTACATACTTTTAATGATTCTATTCTTGGGGGTCACTCCGAATTTTTAGGAACTTATAAAAGGATCAGTGGAGAACTATACTGGAAGCATATGAAGGCTGGTGTAAAGAAATATGTCGAACAATGCGATATCTGCCAACAAAACAAATATGAAGCGACCAAGCCTGCTGGGCTACTTCAATCAATTCCCATTTCGGATAGAATATTAGA
AGATTGGACAATGGACTTTATTAAGGGGTTACCCCCGGTCGGAGGAGTTGATGTGATCATGGTTGTAGTTGACCGATTCAGTAAGTATTCTTACTTGATACTGCTGAGACATCCATTTTCGGCTAAGCAAGTTGCTTCCATCCATTGACAGGGTAATAAGAAAGCATGACATTCCCAAGTCAATTATCACAGACAGGGATAAAAGCTTTCTTAGTTACTTCTGGAAAGAACTATTCGCGACTATGGGGACGATTCTAAAAAGGAGTACGACGTTTCATCCACAAACAGATTGACAGACGGAGAGGGTGAACAGATGCCTTGAAACTTACTTAAGGTGCTTTTGTAATGAACAGGCTAGAAAATGGGATAAATTGATTCCTTGGGCAGAATTGTGGTACAATACCACCTTCCATGCGTACACCAAGATCACCCCCTTCCAGATTGTATATGGCAGTTCCCCTCCTCCCCTGTTATCCTATGGTCATAAGAAGATTCCTAACAATGGGGTAGAAACAATGGGGTAGAAACAAGCTGAAAGAAAGGGACCTAGTCATCAATGCTTTGAAGGAAAACCTGTGTGCAGCACAGAACATAATGAAAAAAATGGCTGATTGGAAACGAAGGGAGCTGAAGTTTCAAGTAGGACATGAAGTCTACTTAAAATTGAGACCCTATCGACAACGTTCATTAGCTTGGAAGAAGTGTGAAAAGTTGGCTCCAAAATATTACGAACCTTACAAAATTATTGAAGAAATTGGTGAAGTTGCTTATAAGCTGATGTTACCCCCTGAAGCCATTTCTCAGTTAAAACTAAAGCCGGGAAAACAACAAGCCGCCAACATCAGCAACCTATCCTAACGAAAGAGTTTGAGCTGCAATTATGGCCCGAAACAGTGTTGGGAGTTCGCTGGAACCAGGAACTGGGAGGAAATGAATGGTTTATTAAATGGAAAGGGCTACCGGACAATGAAGCAACATGGGAATCGGTCTTTCAAACGAACCAACAATTCCATAATTTTCCCCTTGAGGACAAGTTGAACTTGGAGCCGAGGGGTATTGTAAGGCCCCCTATTGTCCATACATACAAAAGGAGGGGCAAAAACGTAAAAGCTCACGCAATAAAAGATCAGGGAATGTTAGGAAAAGAACATGAGAGTGGGGCCCGCGGGGAGGAATGAGAGGTGAGATATAAATAGGCCTTTATCGGGATTGGTAGCTAGGTTAATTTTTGTCATCATTTTGATAGGAGAGGGTGCTGCAGCCTGGGGAATGGACGAGCTTGACCTAAAGAGCTGGGTATATTTGTTATTATCGTTGCTTGTTATTTCCTTATTTCTACTGTTGTAATCCTTTGACATATATATAAATAAAGTGTATCAAGTGATGCCTCTGTAGTGTATTTTAGTATTATATCTGTAGAACCTCTACAGAGATACAGGTAATGAGGCAGTTGTTTGTTGA

mRNA sequence

ATGAAGGTACATTGGTCATCTTTAACCATGACTTTTTGGGCAAAAGGCAAGCAAATTGTGCTTAAGGGAGATCCATCTCTGATTAAGGCAGAGTGTTTCTTGACGACATTAGAGAAAACTTGGGATATAGAAGATCAAGGATTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAGGATAATTATGAAAAAGAAACAGAGGAAAAGAGAGATGAGGAAGAATTACCCATGACTCAATCCTTGCTCAAGTATTATGTTGAGATATTTGAGACACCAAAGAGCCTACCTCCAAAAAGGGCTATTGATCACCGTATACTAACTCTACTGAAACATAGACCTATCAACGTTAGACCTTACAAATATGGTCATGTTCCGGTGTTGTTAGTAAAGAAGGATGGGGGATGGAGATTTTGTGTCGATTATCACAAGCTAAACCAAGTAACTGTGTCTGATAAGTTCCCAATCCCGGTGATTGAAGAGTTATTAGATGAATTGAATGGAGCAACAGGACACTACGAGTTCCTAGTTATGTCGTTCGCCCTCACAAATGCCCCCGCCACCTTCCAGTCATTGATGAATCAGGATAACCAATTATTTGCTAACAAGAATAAGTGTGTGATAGCCCATTCTCAAATCCAATATTTGGGACATATGATATCCAAGAGGGGAGTAGAGGCCGATGAGGATAAAATTAGGAACGTCTCTAGGTTAAGGGGTTTCTTGGGACTGACGGGCTATTATAGGAGGTTTGTTAAGGGCTATGAGGAGATAGCTGCACCACTGACCAAGTTATTACAAAAAAATGCCTTCAAATGGAGTGAGGAATCAACAACTGCCTTCGAAAATCTTAAGCTAGCAATGAATACCTTACTGATTTTGGCTTTACCAAATTGGAACATACCTTTCACTATTGAAACTGATGCGTCTGGTGTGAGATTAGAAGCTGTGATTTCTCAAAATGGCCATCCAATTGCTTTATTCAGCAAAATATTATCTACAAGAGCCCAAAACAAATTTATTTATGCAAGGGAATTGATGACTGTAGTTCTTTCCGTACAAAAGTGGAGACATCACTTGCTTGGGAGAAAGTTCACCATAATTTCGGATCAAACTCTCAAATTCTTATTAGAACAAAGAGAGGTGCAGCCCCAATTCCTGGAATGTTTGACAAAGCTCCTTGGCTATGATTTTGAGATACTATACCAGCCAGAACTTCAAAACAAGATTGGTGATGCTCTATCAAGAATGGAACAACCTTTCGAACTGAACAAGGAATTGAAGAAGAACTTTTCGGAAGGAGGTAAGTTTCAGGTGGTGAATGGGAGGCTACTTTATAAAGGAAGACTGGTAGGATTTAAAACCTCTTCTCTTATACCAAAGATACTACATACTTTTAATGATTCTATTCTTGGGGGTCACTCCGAATTTTTAGGAACTTATAAAAGGATCAGTGGAGAACTATACTGGAAGCATATGAAGGCTGGTGTAAAGAAATATGTCGAACAATGCGATATCTGCCAACAAAACAAATATGAAGCGACCAAGCCTGCTGGGCTACTTCAATCAATTCCCATTTCGGATAGAATATTAGAAGATTGGACAATGGACTTTATTAAGGGGTTACCCCCGGTCGGAGGAGTTGATGTGATCATGGTTGTAGTTGACCGATTCAGTAAGTATTCTTACTTGATACTGCTGAGACATCCATTTTCGGCTAAGCAAGCTAGAAAATGGGATAAATTGATTCCTTGGGCAGAATTGTGGTACAATACCACCTTCCATGCGTACACCAAGATCACCCCCTTCCAGATTGTATATGGCAGTTCCCCTCCTCCCCTGTTATCCTATGGTCATAAGAAGATTCCTAACAATGGGAACATAATGAAAAAAATGGCTGATTGGAAACGAAGGGAGCTGAAGTTTCAAGTAGGACATGAAGTCTACTTAAAATTGAGACCCTATCGACAACGTTCATTAGCTTGGAAGAAGTGTGAAAA
CCATTTCTCAGTTAAAACTAAAGCCGGGAAAACAACAAGCCGCCAACATCAGCAACCTATCCTAACGAAAGAGTTTGAGCTGCAATTATGGCCCGAAACAGTGTTGGGAGTTCGCTGGAACCAGGAACTGGGAGGAAATGAATGGTTTATTAAATGGAAAGGGCTACCGGACAATGAAGCAACATGGGAATCGGTCTTTCAAACGAACCAACAATTCCATAATTTTCCCCTTGAGGACAAGTTGAACTTGGAGCCGAGGGGAGAGGGTGCTGCAGCCTGGGGAATGGACGAGCTTGACCTAAAGAGCTGGAACCTCTACAGAGATACAGGTAATGAGGCAGTTGTTTGTTGA

Coding sequence (CDS)

ATGAAGGTACATTGGTCATCTTTAACCATGACTTTTTGGGCAAAAGGCAAGCAAATTGTGCTTAAGGGAGATCCATCTCTGATTAAGGCAGAGTGTTTCTTGACGACATTAGAGAAAACTTGGGATATAGAAGATCAAGGATTCCTCTTAGAATTCCAGAATTATGAGATGGAAGTAGAGGATAATTATGAAAAAGAAACAGAGGAAAAGAGAGATGAGGAAGAATTACCCATGACTCAATCCTTGCTCAAGTATTATGTTGAGATATTTGAGACACCAAAGAGCCTACCTCCAAAAAGGGCTATTGATCACCGTATACTAACTCTACTGAAACATAGACCTATCAACGTTAGACCTTACAAATATGGTCATGTTCCGGTGTTGTTAGTAAAGAAGGATGGGGGATGGAGATTTTGTGTCGATTATCACAAGCTAAACCAAGTAACTGTGTCTGATAAGTTCCCAATCCCGGTGATTGAAGAGTTATTAGATGAATTGAATGGAGCAACAGGACACTACGAGTTCCTAGTTATGTCGTTCGCCCTCACAAATGCCCCCGCCACCTTCCAGTCATTGATGAATCAGGATAACCAATTATTTGCTAACAAGAATAAGTGTGTGATAGCCCATTCTCAAATCCAATATTTGGGACATATGATATCCAAGAGGGGAGTAGAGGCCGATGAGGATAAAATTAGGAACGTCTCTAGGTTAAGGGGTTTCTTGGGACTGACGGGCTATTATAGGAGGTTTGTTAAGGGCTATGAGGAGATAGCTGCACCACTGACCAAGTTATTACAAAAAAATGCCTTCAAATGGAGTGAGGAATCAACAACTGCCTTCGAAAATCTTAAGCTAGCAATGAATACCTTACTGATTTTGGCTTTACCAAATTGGAACATACCTTTCACTATTGAAACTGATGCGTCTGGTGTGAGATTAGAAGCTGTGATTTCTCAAAATGGCCATCCAATTGCTTTATTCAGCAAAATATTATCTACAAGAGCCCAAAACAAATTTATTTATGCAAGGGAATTGATGACTGTAGTTCTTTCCGTACAAAAGTGGAGACATCACTTGCTTGGGAGAAAGTTCACCATAATTTCGGATCAAACTCTCAAATTCTTATTAGAACAAAGAGAGGTGCAGCCCCAATTCCTGGAATGTTTGACAAAGCTCCTTGGCTATGATTTTGAGATACTATACCAGCCAGAACTTCAAAACAAGATTGGTGATGCTCTATCAAGAATGGAACAACCTTTCGAACTGAACAAGGAATTGAAGAAGAACTTTTCGGAAGGAGGTAAGTTTCAGGTGGTGAATGGGAGGCTACTTTATAAAGGAAGACTGGTAGGATTTAAAACCTCTTCTCTTATACCAAAGATACTACATACTTTTAATGATTCTATTCTTGGGGGTCACTCCGAATTTTTAGGAACTTATAAAAGGATCAGTGGAGAACTATACTGGAAGCATATGAAGGCTGGTGTAAAGAAATATGTCGAACAATGCGATATCTGCCAACAAAACAAATATGAAGCGACCAAGCCTGCTGGGCTACTTCAATCAATTCCCATTTCGGATAGAATATTAGAAGATTGGACAATGGACTTTATTAAGGGGTTACCCCCGGTCGGAGGAGTTGATGTGATCATGGTTGTAGTTGACCGATTCAGTAAGTATTCTTACTTGATACTGCTGAGACATCCATTTTCGGCTAAGCAAGCTAGAAAATGGGATAAATTGATTCCTTGGGCAGAATTGTGGTACAATACCACCTTCCATGCGTACACCAAGATCACCCCCTTCCAGATTGTATATGGCAGTTCCCCTCCTCCCCTGTTATCCTATGGTCATAAGAAGATTCCTAACAATGGGAACATAATGAAAAAAATGGCTGATTGGAAACGAAGGGAGCTGAAGTTTCAAGTAGGACATGAAGTCTACTTAAAATTGAGACCCTATCGACAACGTTCATTAGCTTGGAAGAAGTGTGAAAA
CCATTTCTCAGTTAAAACTAAAGCCGGGAAAACAACAAGCCGCCAACATCAGCAACCTATCCTAACGAAAGAGTTTGAGCTGCAATTATGGCCCGAAACAGTGTTGGGAGTTCGCTGGAACCAGGAACTGGGAGGAAATGAATGGTTTATTAAATGGAAAGGGCTACCGGACAATGAAGCAACATGGGAATCGGTCTTTCAAACGAACCAACAATTCCATAATTTTCCCCTTGAGGACAAGTTGAACTTGGAGCCGAGGGGAGAGGGTGCTGCAGCCTGGGGAATGGACGAGCTTGACCTAAAGAGCTGGAACCTCTACAGAGATACAGGTAATGAGGCAGTTGTTTGTTGA
BLAST of CSPI05G14890 vs. Swiss-Prot
Match: POL2_DROME (Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 171.8 bits (434), Expect = 3.0e-41
Identity = 115/364 (31.59%), Positives = 178/364 (48.90%), Query Frame = 1

Query: 136 WRFCVDYHKLNQVTVSDKFPIPVIEELLDELNGA-------------------------- 195
           +R  +DY KLN++T+ D++PIP ++E+L +L                             
Sbjct: 261 YRVVIDYRKLNEITIPDRYPIPNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTA 320

Query: 196 ----TGHYEFLVMSFALTNAPATFQSLMNQDNQLFANKN--------------------- 255
               +GHYE+L M F L NAPATFQ  MN   +   NK+                     
Sbjct: 321 FSTKSGHYEYLRMPFGLRNAPATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNS 380

Query: 256 ------KCVIAHSQIQ------------YLGHMISKRGVEADEDKIRNV---------SR 315
                 K   A+ ++Q            +LGH+++  G++ +  K++ +           
Sbjct: 381 IQLVFTKLADANLKLQLDKCEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKE 440

Query: 316 LRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSE--ESTTAFENLKLAMNTLLILA 375
           +R FLGLTGYYR+F+  Y +IA P+T  L+K     ++  E   AFE LK  +    IL 
Sbjct: 441 IRAFLGLTGYYRKFIPNYADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQ 500

Query: 376 LPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQK 419
           LP++   F + TDAS + L AV+SQNGHPI+  S+ L+    N     +EL+ +V + + 
Sbjct: 501 LPDFEKKFVLTTDASNLALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKT 560
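
The percentages in each HSP summary line follow directly from the match counts and the alignment length. The sketch below is an illustration rather than part of the BLAST report: the helper name `hsp_percentages` is hypothetical, and the regular expression assumes the summary line has the layout used on this page. It recomputes both percentages from the counts of the first Swiss-Prot hit.

```python
import re

# Matches the "Identity = a/b (x%), Positives = c/d (y%)" summary line.
# "Posi?tives" also tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = (int(g) for g in match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


summary = "Identity = 115/364 (31.59%), Positives = 178/364 (48.90%), Query Frame = 1"
identity_pct, positives_pct = hsp_percentages(summary)
print(identity_pct, positives_pct)
```

Recomputing from the counts is a cheap consistency check when these reports are copied between tools, since the printed percentages are rounded.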

BLAST of CSPI05G14890 vs. Swiss-Prot
Match: POL3_DROME (Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 142.9 bits (359), Expect = 1.5e-32
Identity = 80/227 (35.24%), Positives = 132/227 (58.15%), Query Frame = 1

Query: 204 NKCVIAHSQIQYLGHMISKRGVEADEDKIRNVSR---------LRGFLGLTGYYRRFVKG 263
           +KC     +  +LGH+++  G++ + +KI  + +         ++ FLGLTGYYR+F+  
Sbjct: 399 DKCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPN 458

Query: 264 YEEIAAPLTKLLQKNAF--KWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGV 323
           + +IA P+TK L+KN      + E  +AF+ LK  ++   IL +P++   FT+ TDAS V
Sbjct: 459 FADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASDV 518

Query: 324 RLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISD-Q 383
            L AV+SQ+GHP++  S+ L+    N     +EL+ +V + + +RH+LLGR F I SD Q
Sbjct: 519 ALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSDHQ 578

Query: 384 TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME 419
            L +L   ++   +      KL  +DF+I Y    +N + DALSR++
Sbjct: 579 PLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIK 625

BLAST of CSPI05G14890 vs. Swiss-Prot
Match: M860_ARATH (Uncharacterized mitochondrial protein AtMg00860 OS=Arabidopsis thaliana GN=AtMg00860 PE=4 SV=1)

HSP 1 Score: 116.3 bits (290), Expect = 1.5e-24
Identity = 60/120 (50.00%), Positives = 79/120 (65.83%), Query Frame = 1

Query: 195 QDNQLFANKNKCVIAHSQIQYLGH--MISKRGVEADEDKI---------RNVSRLRGFLG 254
           + +Q +AN+ KC     QI YLGH  +IS  GV AD  K+         +N + LRGFLG
Sbjct: 12  EQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLG 71

Query: 255 LTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPF 304
           LTGYYRRFVK Y +I  PLT+LL+KN+ KW+E +  AF+ LK A+ TL +LALP+  +PF
Sbjct: 72  LTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVLALPDLKLPF 131

BLAST of CSPI05G14890 vs. Swiss-Prot
Match: POL4_DROME (Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaster GN=POL PE=3 SV=1)

HSP 1 Score: 112.8 bits (281), Expect = 1.7e-23
Identity = 113/478 (23.64%), Positives = 203/478 (42.47%), Query Frame = 1

Query: 31  ECFLTTLEKTWDIEDQGFLLEFQNYEMEVEDNYEKETEEKRDEEELPMTQSLLKYYVEIF 90
           E F + LE         F LE +   + V + Y+++   K DE            Y + +
Sbjct: 273 ELFKSQLENICSEYIDIFALESE--PITVNNLYKQQLRLKDDEP----------VYTKNY 332

Query: 91  ETPKSLPPKRAIDHRILTLLKHRPINVRPYKYGHVPVLLVKKDGG-------WRFCVDYH 150
            +P S   +  I  ++  L+K + +     +Y   P+LLV K          WR  +DY 
Sbjct: 333 RSPHSQVEE--IQAQVQKLIKDKIVEPSVSQYNS-PLLLVPKKSSPNSDKKKWRLVIDYR 392

Query: 151 KLNQVTVSDKFPIPVIEELLDELNGA---------TGHYEFL-------VMSFALTN--- 210
           ++N+  ++DKFP+P I+++LD+L  A         +G ++         + SF+ +N   
Sbjct: 393 QINKKLLADKFPLPRIDDILDQLGRAKYFSCLDLMSGFHQIELDEGSRDITSFSTSNGSY 452

Query: 211 -----------APATFQSLMN------QDNQLFANKNKCVIAHS---------------- 270
                      AP +FQ +M       + +Q F   +  ++                   
Sbjct: 453 RFTRLPFGLKIAPNSFQRMMTIAFSGIEPSQAFLYMDDLIVIGCSEKHMLKNLTEVFGKC 512

Query: 271 -----------------QIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLT 330
                            ++ +LGH  + +G+  D+ K           +    R F+   
Sbjct: 513 REYNLKLHPEKCSFFMHEVTFLGHKCTDKGILPDDKKYDVIQNYPVPHDADSARRFVAFC 572

Query: 331 GYYRRFVKGYEEIAAPLTKLLQKNA-FKWSEESTTAFENLKLAMNTLLILALPNWNIPFT 390
            YYRRF+K + + +  +T+L +KN  F+W++E   AF +LK  +    +L  P+++  F 
Sbjct: 573 NYYRRFIKNFADYSRHITRLCKKNVPFEWTDECQKAFIHLKSQLINPTLLQYPDFSKEFC 632

Query: 391 IETDASGVRLEAVISQ--NGH--PIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHL 418
           I TDAS     AV++Q  NGH  P+A  S+  +    NK    +EL  +  ++  +R ++
Sbjct: 633 ITTDASKQACGAVLTQNHNGHQLPVAYASRAFTKGESNKSTTEQELAAIHWAIIHFRPYI 692

BLAST of CSPI05G14890 vs. Swiss-Prot
Match: POLY_DROME (Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogaster GN=pol PE=3 SV=1)

HSP 1 Score: 112.1 bits (279), Expect = 2.8e-23
Identity = 72/245 (29.39%), Positives = 126/245 (51.43%), Query Frame = 1

Query: 196 DNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKIRNVS---------RLRGFLGLTG 255
           D  +  ++ K       ++YLG ++SK G ++D +K++ +          ++R FLGL  
Sbjct: 366 DANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLAS 425

Query: 256 YYRRFVKGYEEIAAPLTKLLQ------------KNAFKWSEESTTAFENLK-LAMNTLLI 315
           YYR F+K +  IA P+T +L+            K   +++E    AF+ L+ +  +  +I
Sbjct: 426 YYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASEDVI 485

Query: 316 LALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSV 375
           L  P++  PF + TDAS   + AV+SQ G PI + S+ L    QN     REL+ +V ++
Sbjct: 486 LKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIVWAL 545

Query: 376 QKWRHHLLG-RKFTIISD-QTLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIG 417
            K ++ L G R+  I +D Q L F +  R    +     + +  ++ ++ Y+P  +N + 
Sbjct: 546 GKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKENFVA 605

BLAST of CSPI05G14890 vs. TrEMBL
Match: A0A087FZI0_ARAAL (Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs71112U000100 PE=4 SV=1)

HSP 1 Score: 488.8 bits (1257), Expect = 1.2e-134
Identity = 282/710 (39.72%), Positives = 396/710 (55.77%), Query Frame = 1

Query: 1    MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVE 60
            M  +W    ++F  +G+Q+ L+G+P +  +   L  L K  D E QG ++E+        
Sbjct: 337  MVCNWKLQKLSFKVEGRQVELQGNPGICCSPVTLKGLWKALDQEGQGVIIEYGG------ 396

Query: 61   DNYEKETEEKRDEEELPMT---QSLLKYYVEIFETPKSLPPKRAIDHRILTLLKHRPINV 120
                   +  + EEE+P+T   QS+L+ +  +F  P+ LPP R  +H I       P++V
Sbjct: 397  ------VQGPKPEEEVPVTEGIQSVLREFQSVFNEPQGLPPTRGREHAIELTPGAAPVSV 456

Query: 121  RPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVT 180
            RP++Y  +                           PVLLV KK+G WRFCVDY  LN+VT
Sbjct: 457  RPFRYPQIQREELEKLVATMLAAGIIQESTSPFSSPVLLVKKKNGSWRFCVDYRALNKVT 516

Query: 181  VSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMS 240
            V D +PIP+I++LLDEL+GA                               GHYEFLVM 
Sbjct: 517  VGDSYPIPMIDQLLDELHGAVIFSKLDLRAGYHQIRVRAEDVPKTAFRTHDGHYEFLVMP 576

Query: 241  FALTNAPATFQSLMNQ--------------DN-------------------------QLF 300
            F LTNAP+TFQSLMN               D+                         QLF
Sbjct: 577  FGLTNAPSTFQSLMNDLFRPYLRRFVLMLFDDILVYSKSEAEHQGHLRTVLQVLTDNQLF 636

Query: 301  ANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRF 360
            AN  KC     ++ YLGH+IS  GV AD  K+         RNV  LRGFLGLTGYYR+F
Sbjct: 637  ANSKKCQFGSQKVDYLGHVISAEGVAADPAKVQAMVDWPVPRNVKALRGFLGLTGYYRKF 696

Query: 361  VKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASG 420
            VKGY EIA PLT LL+K+ F+WS++   AF++LK+AM+T+ +LAL ++++PF IE+DASG
Sbjct: 697  VKGYGEIARPLTALLKKDQFQWSQKVEDAFQSLKVAMSTVPVLALVDFSLPFVIESDASG 756

Query: 421  VRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ 480
            V L AV+ Q   PIA FS+  + R + K +Y RELM +V ++QKW+H+LLGRKF + +DQ
Sbjct: 757  VGLGAVLMQQKQPIAYFSQAQTERQRLKSVYERELMAIVFAIQKWKHYLLGRKFLVRTDQ 816

Query: 481  -TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRM------------- 540
             +LKFLLEQRE+  ++   LTK+LG+DFEI Y+P L+NK  DALSR+             
Sbjct: 817  KSLKFLLEQREINLEYQRWLTKILGFDFEIQYKPGLENKAADALSRIDAVPQLCALSMHV 876

Query: 541  --------------EQPFELNKELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKIL 574
                          E+  +L +E+  + +    + VV GRL  KGRLV    S L+  +L
Sbjct: 877  AIQLSEIDEAIEKDEELSKLKQEVVTDATSHPDYSVVQGRLFMKGRLVLPAASPLVKLVL 936

BLAST of CSPI05G14890 vs. TrEMBL
Match: A0A087H8D5_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G106900 PE=4 SV=1)

HSP 1 Score: 460.7 bits (1184), Expect = 3.6e-126
Identity = 284/712 (39.89%), Positives = 386/712 (54.21%), Query Frame = 1

Query: 1    MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEM--- 60
            M+V+W    + F  + ++++L+GDP L      L    K    E +G ++E  N +    
Sbjct: 463  MRVNWKLQRIRFHNEDREVLLQGDPGLCCTPVSLKAFWKVVSTEGEGMIVELNNCQQAGA 522

Query: 61   EVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKRAIDHRILTLLKHRPINV 120
            E + N  K+ +E            +L  + ++FE P+ LPP R  +H I      RP+ V
Sbjct: 523  EQQHNIPKDLQE------------VLVVFDQVFEEPQGLPPSRGREHSITLEPGSRPVTV 582

Query: 121  RPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVT 180
            RP++Y  V                           PVLLV KKDG WRFCVDY  LN+ T
Sbjct: 583  RPFRYPQVQKAEIEKQVAVMLAAGIIRESTSPYSSPVLLVRKKDGSWRFCVDYRALNKAT 642

Query: 181  VSDKFPIPVIEELLDELNGA------------------------------TGHYEFLVMS 240
            V D +PIP+I++LLDEL+GA                               GHYEFLVM 
Sbjct: 643  VGDSYPIPMIDQLLDELHGACVFSKLDLRSGYHQIRVRAEDVPKTAFRTHDGHYEFLVMP 702

Query: 241  FALTNAPATFQSLMNQ--------------DNQLFANKN--------KCVIAHSQ----- 300
            F LTNAPATFQ+LMN               D+ L  +K+        + V+   Q     
Sbjct: 703  FGLTNAPATFQALMNDVFRQHLRKFVLVFFDDILVYSKSASEHRNHLQLVLQLLQDHQLY 762

Query: 301  ------------IQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRF 360
                        I+YLGH+I+  GV AD  KI         RNV  LRGFLGLTGYYR+F
Sbjct: 763  ANKRKCQFGSRSIEYLGHVITAEGVSADASKIQAMVDWPEPRNVKALRGFLGLTGYYRKF 822

Query: 361  VKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASG 420
            V+GY  IA PLT LLQK+ F+WS E++TAF NLK AM T+ +L + +++  F +E+DASG
Sbjct: 823  VRGYGSIAKPLTSLLQKDQFRWSPEASTAFNNLKQAMVTVPVLTMADFDAQFVVESDASG 882

Query: 421  VRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ 480
              L AV+ Q+  P+A FS+ L+ R + K +Y RELM +V ++QKWRH+LLGRKF + +DQ
Sbjct: 883  TGLGAVLMQHQKPLAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQ 942

Query: 481  -TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME---QPF------ 540
             +LKFLLEQR++  ++ + LTK+LG+DF I Y+  L+NK  DALSR +   Q F      
Sbjct: 943  KSLKFLLEQRQINMEYQKWLTKILGFDFNIQYKSGLENKAADALSRRDAIPQLFALSIPA 1002

Query: 541  ---------ELNKELK---------KNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKIL 576
                     E++K+LK          +      F VV GRLL +G+LV    S L+  IL
Sbjct: 1003 AIQLEDISSEVDKDLKLQKIKAEVLADPKSHAGFTVVQGRLLRQGKLVVPAQSHLVELIL 1062

BLAST of CSPI05G14890 vs. TrEMBL
Match: A0A087GW89_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G106200 PE=4 SV=1)

HSP 1 Score: 428.7 bits (1101), Expect = 1.5e-116
Identity = 267/705 (37.87%), Positives = 363/705 (51.49%), Query Frame = 1

Query: 195  QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLT 254
            +++QL+ N+ KC      ++YLGH+IS  GV AD +KI         RNV  LRG LGLT
Sbjct: 815  EEHQLYENRKKCYFGCESVEYLGHLISAEGVSADPEKINAMEKWPVPRNVKALRGILGLT 874

Query: 255  GYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTI 314
            GYY++FV+ Y EIA PLT LL+ N F W  E+  AF  LK AM T+ +LA+ ++   F +
Sbjct: 875  GYYKKFVQRYGEIARPLTALLKNNKFSWGPEADEAFLKLKRAMVTVPVLAMADFTALFVV 934

Query: 315  ETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKF 374
            E+DASGV L AV+ QN  P+A F   L+ R   K IY RELM +V ++QKWRH+LLGR+F
Sbjct: 935  ESDASGVGLGAVLMQNQRPVACFRHALTERQMLKSIYERELMAIVFAIQKWRHYLLGRRF 994

Query: 375  TIISDQ-TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME------ 434
             + +DQ +LKFLLEQRE+  ++   LTK+LG+DFEI Y+P L+NK  DALSR E      
Sbjct: 995  VVRTDQKSLKFLLEQREINVEYQRWLTKILGFDFEIHYKPRLENKAADALSRREAMPQLF 1054

Query: 435  ---------------------QPFELNKELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSS 494
                                 Q  +L +E+ ++ S    + VV GRLL +G+LV  +TS 
Sbjct: 1055 ALSVPAAIQLEDICSEVDKDPQLKKLKEEVLRDPSTHPDYAVVQGRLLRQGKLVLPRTSQ 1114

Query: 495  LIPKILHTFNDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKP 554
            L+  IL  F+D  +GGH   L T +RI    YW+ M   +++YV +C +C +NKY    P
Sbjct: 1115 LVWVILREFHDGKVGGHGGVLKTQRRIGDLFYWQGMMTEIREYVAECVVCHKNKYSTLVP 1174

Query: 555  AGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQAR 614
            AGLLQ +P+ ++I ED ++DFI+GLP   G DVIMVVVDR +K ++   L+HPF A +  
Sbjct: 1175 AGLLQPLPVPEQIWEDISLDFIEGLPKSEGYDVIMVVVDRLTKSAHFNRLKHPFVASEVA 1234

Query: 615  K----------------------------W----------------DKLIPWAE------ 674
                                         W                DK   W++      
Sbjct: 1235 LLFIQEVVRLHGFPKTLVSDRDKVFTGMFWGELFRGLETYLRCFASDKPKSWSQYLAWAE 1294

Query: 675  LWYNTTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN---------------------- 734
            L YNT++H+  ++TPF+ V+G  PP L+ + +    N                       
Sbjct: 1295 LCYNTSYHSTIQMTPFKAVFGRDPPALVKFENGSTTNAKLETYLRDRDVVIILLRQHILK 1354

Query: 735  -GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN----------------- 755
               +MK+ AD  RREL FQVG  VYLKL+PYRQ+SLA +  E                  
Sbjct: 1355 AQQVMKRQADKHRRELDFQVGDMVYLKLKPYRQKSLARRSNEKLSARYYGPYEVLARVGE 1414

BLAST of CSPI05G14890 vs. TrEMBL
Match: A0A087HBU4_ARAAL (Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G264600 PE=4 SV=1)

HSP 1 Score: 415.6 bits (1067), Expect = 1.3e-112
Identity = 262/702 (37.32%), Positives = 364/702 (51.85%), Query Frame = 1

Query: 196  DNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTG 255
            DNQ+FANKNKC    ++++YLGH+I+++GV AD  KI+          +  LRGFLGLTG
Sbjct: 684  DNQMFANKNKCQFGSAEVEYLGHVITQQGVAADPSKIKAMTDWPVPKTIKALRGFLGLTG 743

Query: 256  YYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIE 315
            YYR+FV+GY  I  PLT LL+K+ F WSEE+  AFE LK AM+T+ +LAL +++  F +E
Sbjct: 744  YYRKFVRGYGNIVKPLTSLLKKDKFGWSEEAEQAFEALKPAMSTVPVLALADFSELFVVE 803

Query: 316  TDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFT 375
            +DASG+ L AV+ Q   PIALFS+ L+   + K +Y RELM +V ++QKWRH+LLG KF 
Sbjct: 804  SDASGIGLGAVLMQQQKPIALFSQALTDIQKLKSVYERELMAIVFAIQKWRHYLLGCKFL 863

Query: 376  IISDQ-TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME------- 435
            +I+DQ +LKFLLEQREV  ++ + LTK+LG+DF+I Y+P L+NK  DALSR+E       
Sbjct: 864  VITDQKSLKFLLEQREVNLEYQKWLTKILGFDFDIHYKPRLENKAADALSRVEAVPHLFA 923

Query: 436  -------QPFELNKELKKNFSEG-------------GKFQVVNGRLLYKGRLVGFKTSSL 495
                   Q  E+++E+++N   G              +F VVNGRLL  GRLV  K S +
Sbjct: 924  LSVPEALQLKEIDREVEQNPELGKLKLEVIADPTAHDEFTVVNGRLLRNGRLVLPKESPM 983

Query: 496  IPKILHTFNDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPA 555
            +  IL  F+D  +GGH     T KRI    +W+ M   +K+YV  C +CQ++KY    PA
Sbjct: 984  VKLILQEFHDGKVGGHGGIHKTQKRIGDMFFWRGMMTDIKEYVAACQVCQRHKYSTLAPA 1043

Query: 556  GLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQ--- 615
            GLLQ +PI   + ED +MDFIKGLP   G  VIMVVV+R +KYS+ I L+HP+ A     
Sbjct: 1044 GLLQPLPIPADVWEDTSMDFIKGLPKSEGFSVIMVVVERITKYSHFISLKHPYEASMVVQ 1103

Query: 616  -------------------------ARKWDKLI--------------------------- 675
                                      R W ++                            
Sbjct: 1104 IFIQEIVRLHGFPKTIVSDRDKTFTGRLWKEVFRLSGTKLNFNTAYHPQSDGQTEVTNMS 1163

Query: 676  --------------PWAEL--W----YNTTFHAYTKITPFQIVYGSSPPPLLSYGHKKIP 735
                           W +   W    YN++FH+ TK++PF++VYG     LL + +    
Sbjct: 1164 VETFLRCFCSEKPNKWVQFLAWAEMSYNSSFHSATKMSPFKVVYGREAHTLLKFENGSTD 1223

Query: 736  NNGNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCENHFSV------------ 751
            N        AD         +G  V+LKLRPY Q+SLA +  +  F+             
Sbjct: 1224 N--------AD-------LDLGDLVFLKLRPYIQQSLARRVNDKLFARFLGPFAVEARVG 1283

BLAST of CSPI05G14890 vs. TrEMBL
Match: A0A087FX63_ARAAL (Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs67613U000200 PE=4 SV=1)

HSP 1 Score: 401.0 bits (1029), Expect = 3.4e-108
Identity = 242/652 (37.12%), Positives = 371/652 (56.90%), Query Frame = 1

Query: 1   MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVE 60
           MKV+W    M F  +G ++ L+GDP+L  +E  L    K  + ++ G ++E+   +    
Sbjct: 10  MKVNWGLQWMRFRVQGTEVTLQGDPTLCCSEFSLKAWLKAVEHDELGVIVEYNGLQ---- 69

Query: 61  DNYEKETEEKRDEEEL--PMTQSLLKYYVEIFETPKSLPP------------KRAIDHRI 120
                 +    D+  +  P+ Q +L+ +  +F  P+   P            K  I+ ++
Sbjct: 70  ------SVSPHDQSSVISPLLQQVLEKHPTVFSDPEGSKPVSVCPFRYPHAQKAEIERQV 129

Query: 121 LTLLKHRPINVRPYKYGHVPVLLVKKDGGWRFCVDYHKLNQVTVSDKFPIPVIEELLDEL 180
            ++L    I      +    +L+ KKDG WRFCVDY  LN+VT+   FPIP+I++LLDEL
Sbjct: 130 SSMLATGIIEESGSPFSRPVLLVKKKDGSWRFCVDYRALNKVTIPHSFPIPMIDQLLDEL 189

Query: 181 NGAT---------GHYEFLVMSFALTNAP--------------ATFQSLMN--QDNQLFA 240
           +GAT         G+++ LV +  + N                +  +++++  QD++L+A
Sbjct: 190 HGATVFSKLDLKSGYHQILVKATDVPNTAFMTHDGHKDLQDHQSHLETVLSVLQDHKLYA 249

Query: 241 NKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRFV 300
           N+ KC    S+I+YLGH+IS  GV AD  KI         +N+  LRGFLGLTGYYR+FV
Sbjct: 250 NQKKCQFGCSEIEYLGHIISGEGVAADPQKIQAMVSWPEPKNIKALRGFLGLTGYYRKFV 309

Query: 301 KGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGV 360
           +GY +IA PLT LL+K+ F+WSE ++ AF+ LK AM T+ +LAL +++  F +E+DASG+
Sbjct: 310 RGYGDIAKPLTSLLKKDQFQWSEAASVAFQQLKHAMITVPVLALADFSQLFVVESDASGI 369

Query: 361 RLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ- 420
            L AV+ QN  PIA +S+ L+ R + K +Y RELM +V ++Q+WRH+LLG KF + +DQ 
Sbjct: 370 GLGAVLMQNQRPIAYYSQALTDRKKLKSVYERELMAIVFAIQRWRHYLLGMKFLVKTDQK 429

Query: 421 TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME---QPFELN---- 480
           +LKFLLEQ EV  ++ + LTK+LG++F+I+Y+P L+NK+ DALSR E   Q F L+    
Sbjct: 430 SLKFLLEQHEVNAEYQQWLTKILGFNFDIMYKPGLENKVADALSRKELLPQLFALSIPAA 489

Query: 481 ---KELKKNFSEGGKFQVVNGRLLY-----------KGRLVGFKTSSL-IPKILHTFNDS 540
                +++        + +   +L            +GRL+  K   L IPK        
Sbjct: 490 IQLDMIQEAVERDADLKKIKEEVLLNVGLHPEFSVVQGRLL--KQGKLVIPKTSPLVGVL 549

Query: 541 ILGGHSEFLG-------TYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKPAGLLQ 575
           +   HS  +G       T KR+    YW  M A +K++V  C +CQ +KY    PAGLLQ
Sbjct: 550 LQEFHSSKMGGHGGILKTQKRLGALFYWAGMMANIKEFVAACLVCQTHKYSTLTPAGLLQ 609

BLAST of CSPI05G14890 vs. TAIR10
Match: ATMG00860.1 (ATMG00860.1 DNA/RNA polymerases superfamily protein)

HSP 1 Score: 116.3 bits (290), Expect = 8.4e-26
Identity = 60/120 (50.00%), Positives = 79/120 (65.83%), Query Frame = 1

Query: 195 QDNQLFANKNKCVIAHSQIQYLGH--MISKRGVEADEDKI---------RNVSRLRGFLG 254
           + +Q +AN+ KC     QI YLGH  +IS  GV AD  K+         +N + LRGFLG
Sbjct: 12  EQHQFYANRKKCAFGQPQIAYLGHRHIISGEGVSADPAKLEAMVGWPEPKNTTELRGFLG 71

Query: 255 LTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPF 304
           LTGYYRRFVK Y +I  PLT+LL+KN+ KW+E +  AF+ LK A+ TL +LALP+  +PF
Sbjct: 72  LTGYYRRFVKNYGKIVRPLTELLKKNSLKWTEMAALAFKALKGAVTTLPVLALPDLKLPF 131

BLAST of CSPI05G14890 vs. NCBI nr
Match: gi|674229247|gb|KFK23032.1| (hypothetical protein AALP_AAs71112U000100, partial [Arabis alpina])

HSP 1 Score: 488.8 bits (1257), Expect = 1.8e-134
Identity = 282/710 (39.72%), Positives = 396/710 (55.77%), Query Frame = 1

Query: 1    MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVE 60
            M  +W    ++F  +G+Q+ L+G+P +  +   L  L K  D E QG ++E+        
Sbjct: 337  MVCNWKLQKLSFKVEGRQVELQGNPGICCSPVTLKGLWKALDQEGQGVIIEYGG------ 396

Query: 61   DNYEKETEEKRDEEELPMT---QSLLKYYVEIFETPKSLPPKRAIDHRILTLLKHRPINV 120
                   +  + EEE+P+T   QS+L+ +  +F  P+ LPP R  +H I       P++V
Sbjct: 397  ------VQGPKPEEEVPVTEGIQSVLREFQSVFNEPQGLPPTRGREHAIELTPGAAPVSV 456

Query: 121  RPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVT 180
            RP++Y  +                           PVLLV KK+G WRFCVDY  LN+VT
Sbjct: 457  RPFRYPQIQREELEKLVATMLAAGIIQESTSPFSSPVLLVKKKNGSWRFCVDYRALNKVT 516

Query: 181  VSDKFPIPVIEELLDELNGAT------------------------------GHYEFLVMS 240
            V D +PIP+I++LLDEL+GA                               GHYEFLVM 
Sbjct: 517  VGDSYPIPMIDQLLDELHGAVIFSKLDLRAGYHQIRVRAEDVPKTAFRTHDGHYEFLVMP 576

Query: 241  FALTNAPATFQSLMNQ--------------DN-------------------------QLF 300
            F LTNAP+TFQSLMN               D+                         QLF
Sbjct: 577  FGLTNAPSTFQSLMNDLFRPYLRRFVLMLFDDILVYSKSEAEHQGHLRTVLQVLTDNQLF 636

Query: 301  ANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRF 360
            AN  KC     ++ YLGH+IS  GV AD  K+         RNV  LRGFLGLTGYYR+F
Sbjct: 637  ANSKKCQFGSQKVDYLGHVISAEGVAADPAKVQAMVDWPVPRNVKALRGFLGLTGYYRKF 696

Query: 361  VKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASG 420
            VKGY EIA PLT LL+K+ F+WS++   AF++LK+AM+T+ +LAL ++++PF IE+DASG
Sbjct: 697  VKGYGEIARPLTALLKKDQFQWSQKVEDAFQSLKVAMSTVPVLALVDFSLPFVIESDASG 756

Query: 421  VRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ 480
            V L AV+ Q   PIA FS+  + R + K +Y RELM +V ++QKW+H+LLGRKF + +DQ
Sbjct: 757  VGLGAVLMQQKQPIAYFSQAQTERQRLKSVYERELMAIVFAIQKWKHYLLGRKFLVRTDQ 816

Query: 481  -TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRM------------- 540
             +LKFLLEQRE+  ++   LTK+LG+DFEI Y+P L+NK  DALSR+             
Sbjct: 817  KSLKFLLEQREINLEYQRWLTKILGFDFEIQYKPGLENKAADALSRIDAVPQLCALSMHV 876

Query: 541  --------------EQPFELNKELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKIL 574
                          E+  +L +E+  + +    + VV GRL  KGRLV    S L+  +L
Sbjct: 877  AIQLSEIDEAIEKDEELSKLKQEVVTDATSHPDYSVVQGRLFMKGRLVLPAASPLVKLVL 936

BLAST of CSPI05G14890 vs. NCBI nr
Match: gi|729375164|ref|XP_010548864.1| (PREDICTED: uncharacterized protein LOC104820194 [Tarenaya hassleriana])

HSP 1 Score: 483.8 bits (1244), Expect = 5.7e-133
Identity = 349/945 (36.93%), Positives = 469/945 (49.63%), Query Frame = 1

Query: 1    MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVE 60
            +++ +  L + F      I + GDP+L  +   L +L K+    DQ +L++    E    
Sbjct: 571  VQMDFQDLELKFNQGTSWITVTGDPTLHNSLVTLRSLIKSVCEGDQSYLVKLGTIE---- 630

Query: 61   DNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKRAIDHRILTLLKHRPINVRPY 120
               E    + +  E L   Q++L+ +  +FE P  LPP R  +H I       P++VRPY
Sbjct: 631  ---ELVGADSKLPERL---QAVLEEFGPVFEVPTELPPIRGREHPINLKEGTDPVSVRPY 690

Query: 121  KYGHV---------------------------PVLLVKK-DGGWRFCVDYHKLNQVTVSD 180
            +Y H                            PVLLVKK DG WRFC+DY  LN+VTV D
Sbjct: 691  RYPHAHKEEIEKLVKDMLKAGIVRPSQSPFSSPVLLVKKKDGSWRFCIDYIALNKVTVLD 750

Query: 181  KF--PI--PVIEEL--------LDELNGA------------------TGHYEFLVMSFA- 240
            KF  P+   +++EL        LD  +G                    GHYEFLVM FA 
Sbjct: 751  KFPIPMIDQLLDELHGERVFSKLDLRSGYHQIRMKTEDIHKTAFRTHDGHYEFLVMPFAC 810

Query: 241  -LTNAPATFQSLMN--QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKIR----- 300
             L +     Q ++   Q  QL+ANK KC     QI YLGH+IS+ GV  D  K       
Sbjct: 811  SLKDHATLLQMVLAVLQKQQLYANKKKCEFGKQQIDYLGHIISQEGVSTDPAKTAAMQKW 870

Query: 301  ----NVSRLRGFLGLTGYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMN 360
                NV  LRGFLGLTGYYRRFV+ Y  IA PL  LL+K+ F WSE++++AF  LK AM 
Sbjct: 871  PTPSNVKELRGFLGLTGYYRRFVQNYGTIARPLIDLLKKDGFNWSEDASSAFRKLKQAMT 930

Query: 361  TLLILALPNWNIPFTIETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTV 420
            +  IL L ++   F +ETDASG  + AV+ Q   PIA FS+ LS R + K +Y RELMTV
Sbjct: 931  SAPILGLLDFRDEFVVETDASGFGIGAVLMQKHRPIAFFSQALSERERLKPVYERELMTV 990

Query: 421  VLSVQKWRHHLLGRKFTIISDQ-TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQN 480
            VLS+Q+WRH+LLGR F + +DQ  LKFLLEQREV  ++   LTKLLGYDF+I+Y+P ++N
Sbjct: 991  VLSIQRWRHYLLGRSFLVCTDQKALKFLLEQREVSMEYQRWLTKLLGYDFQIVYRPGMEN 1050

Query: 481  KIGDALSRME--------------------QPFELNKE-------------LKKNFSEGG 540
            K  D LSRM                     Q  E+ KE             LK   ++ G
Sbjct: 1051 KAADGLSRMPHNAILEPTCMGLAITIPRNIQLVEIEKEIRQDKDLKEITDKLKDGETKVG 1110

Query: 541  KFQVVNGRLLYKGRLVGFKTSSLIPKILHTFNDSILGGHSEFLGTYKRISGELYWKHMKA 600
            K+ ++ G L YK RLV  + SS IP IL  F+DS +GGHS  L T KRI    +W  MK 
Sbjct: 1111 KYHLLQGMLRYKNRLVVSRHSSFIPTILAEFHDSKMGGHSGVLRTLKRIHEIFHWLGMKT 1170

Query: 601  GVKKYVEQCDICQQNKYEATKPAGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVV 660
             +KKYV +C +CQ  KY    PAGLLQ +PI ++I ED +MD I+GLP   G +V++VVV
Sbjct: 1171 DIKKYVAECAVCQSQKYSTLAPAGLLQPLPIPEQIWEDISMDLIEGLPRSAGYNVVLVVV 1230

Query: 661  DRFSKYSYLILLRHPFSAKQARK-------------------WDKLIP---WAEL----- 720
            DR SKY++ I L+H FSA    K                    DK+     W+EL     
Sbjct: 1231 DRLSKYAHFIALKHLFSAMVVVKAFVQEVVKLHGFPKSIVSDRDKIFLSNFWSELFRIAG 1290

Query: 721  ------------------------------------------------WYNTTFHAYTKI 749
                                                            WYNT+FH   + 
Sbjct: 1291 TKLKFSTSYHPQTDGQTEVLNRCLETYLRCYANAHPRKWIQFLSWAEFWYNTSFHTALQS 1350

BLAST of CSPI05G14890 vs. NCBI nr
Match: gi|923614274|ref|XP_013745228.1| (PREDICTED: uncharacterized protein LOC106447810 [Brassica napus])

HSP 1 Score: 483.4 bits (1243), Expect = 7.4e-133
Identity = 293/707 (41.44%), Positives = 388/707 (54.88%), Query Frame = 1

Query: 1    MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEMEVE 60
            MKV+W    + F     + VL+GDP L  +   L ++ KT     +  L+E+   ++E  
Sbjct: 540  MKVNWKLQILRFKIGDNKYVLQGDPGLCCSAASLKSIWKTVQQGGEAMLIEYNGLQLE-- 599

Query: 61   DNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKRAIDHRILTLLKHRPINVRPY 120
                   EEK         Q++LK Y E+F  P+ LPP R  +H I+      P++VRP+
Sbjct: 600  -------EEKGGGSIPQPLQNILKEYEEVFAEPQGLPPSRGKEHAIVLKTDASPVSVRPF 659

Query: 121  KYGHV---------------------------PVLLVKK-DGGWRFCVDYHKLNQVTVSD 180
            +Y                              PVLLVKK DG WRFCVDY  LN+VT++D
Sbjct: 660  RYPQAQREEIEKQVALMLSAGIIRDSSSPFSSPVLLVKKKDGSWRFCVDYRALNKVTIAD 719

Query: 181  KFPIPVIEELLDELNGA------------------------------TGHYEFLVMSFAL 240
             +PIP+I++LLDEL GA                               GHYEFLVM F L
Sbjct: 720  SYPIPMIDQLLDELQGAKVFSKLDLKSGYHQILVKAEDVQKTAFRTHDGHYEFLVMPFGL 779

Query: 241  TNAPATFQSLMNQDNQLFANKNKCV-----IAHSQIQ----------------------- 300
            +NAPATFQSLMN+  + +  K   V     + +SQ Q                       
Sbjct: 780  SNAPATFQSLMNEIFRSYLRKFVLVFFDDILVYSQTQSEHEEHLRLVLEVLKEQGLYANR 839

Query: 301  -----------YLGHMISKRGVEADEDKIR---------NVSRLRGFLGLTGYYRRFVKG 360
                       YLGH+IS  GV ADE K+R          V  LRGFLGLTGYYR+FV+G
Sbjct: 840  KKCEFGSSRIEYLGHVISAEGVAADEGKVRAMLDWMEPKAVKELRGFLGLTGYYRKFVQG 899

Query: 361  YEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASGVRL 420
            Y +IA PLT LL+K+ FKWS E+  AF+ LK AM T+ +LALP++N  F IE+DASGV L
Sbjct: 900  YGDIARPLTSLLRKDQFKWSGEAALAFQKLKQAMATVPVLALPDFNEQFVIESDASGVGL 959

Query: 421  EAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ-TL 480
             AV+ Q   PIA FS+ L+ R Q K +Y RELM +V ++QKWRH+LLGRKF + +DQ +L
Sbjct: 960  GAVLMQRQRPIAYFSQALTERQQMKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQKSL 1019

Query: 481  KFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSR----------------- 540
            KFLLEQRE+  ++   LTK+LG+DF+I Y+P L+NK  DALSR                 
Sbjct: 1020 KFLLEQREINMEYQRWLTKILGFDFDIHYKPGLENKAADALSRKSPVTELFAVSVPVSIQ 1079

Query: 541  -------MEQPFELNK---ELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKILHTF 574
                   +E+  EL+K   EL ++ S    + +V GRLL  G+LV  KTS LI  IL  +
Sbjct: 1080 LEEVGSEVERDSELSKLIQELTQDPSSHPDYTLVQGRLLRHGKLVLPKTSKLIELILKEY 1139

BLAST of CSPI05G14890 vs. NCBI nr
Match: gi|674245622|gb|KFK38387.1| (hypothetical protein AALP_AA3G106900 [Arabis alpina])

HSP 1 Score: 460.7 bits (1184), Expect = 5.2e-126
Identity = 284/712 (39.89%), Positives = 386/712 (54.21%), Query Frame = 1

Query: 1    MKVHWSSLTMTFWAKGKQIVLKGDPSLIKAECFLTTLEKTWDIEDQGFLLEFQNYEM--- 60
            M+V+W    + F  + ++++L+GDP L      L    K    E +G ++E  N +    
Sbjct: 463  MRVNWKLQRIRFHNEDREVLLQGDPGLCCTPVSLKAFWKVVSTEGEGMIVELNNCQQAGA 522

Query: 61   EVEDNYEKETEEKRDEEELPMTQSLLKYYVEIFETPKSLPPKRAIDHRILTLLKHRPINV 120
            E + N  K+ +E            +L  + ++FE P+ LPP R  +H I      RP+ V
Sbjct: 523  EQQHNIPKDLQE------------VLVVFDQVFEEPQGLPPSRGREHSITLEPGSRPVTV 582

Query: 121  RPYKYGHV---------------------------PVLLV-KKDGGWRFCVDYHKLNQVT 180
            RP++Y  V                           PVLLV KKDG WRFCVDY  LN+ T
Sbjct: 583  RPFRYPQVQKAEIEKQVAVMLAAGIIRESTSPYSSPVLLVRKKDGSWRFCVDYRALNKAT 642

Query: 181  VSDKFPIPVIEELLDELNGA------------------------------TGHYEFLVMS 240
            V D +PIP+I++LLDEL+GA                               GHYEFLVM 
Sbjct: 643  VGDSYPIPMIDQLLDELHGACVFSKLDLRSGYHQIRVRAEDVPKTAFRTHDGHYEFLVMP 702

Query: 241  FALTNAPATFQSLMNQ--------------DNQLFANKN--------KCVIAHSQ----- 300
            F LTNAPATFQ+LMN               D+ L  +K+        + V+   Q     
Sbjct: 703  FGLTNAPATFQALMNDVFRQHLRKFVLVFFDDILVYSKSASEHRNHLQLVLQLLQDHQLY 762

Query: 301  ------------IQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLTGYYRRF 360
                        I+YLGH+I+  GV AD  KI         RNV  LRGFLGLTGYYR+F
Sbjct: 763  ANKRKCQFGSRSIEYLGHVITAEGVSADASKIQAMVDWPEPRNVKALRGFLGLTGYYRKF 822

Query: 361  VKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTIETDASG 420
            V+GY  IA PLT LLQK+ F+WS E++TAF NLK AM T+ +L + +++  F +E+DASG
Sbjct: 823  VRGYGSIAKPLTSLLQKDQFRWSPEASTAFNNLKQAMVTVPVLTMADFDAQFVVESDASG 882

Query: 421  VRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKFTIISDQ 480
              L AV+ Q+  P+A FS+ L+ R + K +Y RELM +V ++QKWRH+LLGRKF + +DQ
Sbjct: 883  TGLGAVLMQHQKPLAYFSQALTDRQKLKSVYERELMAIVFAIQKWRHYLLGRKFVVRTDQ 942

Query: 481  -TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME---QPF------ 540
             +LKFLLEQR++  ++ + LTK+LG+DF I Y+  L+NK  DALSR +   Q F      
Sbjct: 943  KSLKFLLEQRQINMEYQKWLTKILGFDFNIQYKSGLENKAADALSRRDAIPQLFALSIPA 1002

Query: 541  ---------ELNKELK---------KNFSEGGKFQVVNGRLLYKGRLVGFKTSSLIPKIL 576
                     E++K+LK          +      F VV GRLL +G+LV    S L+  IL
Sbjct: 1003 AIQLEDISSEVDKDLKLQKIKAEVLADPKSHAGFTVVQGRLLRQGKLVVPAQSHLVELIL 1062

BLAST of CSPI05G14890 vs. NCBI nr
Match: gi|674241376|gb|KFK34141.1| (hypothetical protein AALP_AA5G106200 [Arabis alpina])

HSP 1 Score: 428.7 bits (1101), Expect = 2.2e-116
Identity = 267/705 (37.87%), Positives = 363/705 (51.49%), Query Frame = 1

Query: 195  QDNQLFANKNKCVIAHSQIQYLGHMISKRGVEADEDKI---------RNVSRLRGFLGLT 254
            +++QL+ N+ KC      ++YLGH+IS  GV AD +KI         RNV  LRG LGLT
Sbjct: 815  EEHQLYENRKKCYFGCESVEYLGHLISAEGVSADPEKINAMEKWPVPRNVKALRGILGLT 874

Query: 255  GYYRRFVKGYEEIAAPLTKLLQKNAFKWSEESTTAFENLKLAMNTLLILALPNWNIPFTI 314
            GYY++FV+ Y EIA PLT LL+ N F W  E+  AF  LK AM T+ +LA+ ++   F +
Sbjct: 875  GYYKKFVQRYGEIARPLTALLKNNKFSWGPEADEAFLKLKRAMVTVPVLAMADFTALFVV 934

Query: 315  ETDASGVRLEAVISQNGHPIALFSKILSTRAQNKFIYARELMTVVLSVQKWRHHLLGRKF 374
            E+DASGV L AV+ QN  P+A F   L+ R   K IY RELM +V ++QKWRH+LLGR+F
Sbjct: 935  ESDASGVGLGAVLMQNQRPVACFRHALTERQMLKSIYERELMAIVFAIQKWRHYLLGRRF 994

Query: 375  TIISDQ-TLKFLLEQREVQPQFLECLTKLLGYDFEILYQPELQNKIGDALSRME------ 434
             + +DQ +LKFLLEQRE+  ++   LTK+LG+DFEI Y+P L+NK  DALSR E      
Sbjct: 995  VVRTDQKSLKFLLEQREINVEYQRWLTKILGFDFEIHYKPRLENKAADALSRREAMPQLF 1054

Query: 435  ---------------------QPFELNKELKKNFSEGGKFQVVNGRLLYKGRLVGFKTSS 494
                                 Q  +L +E+ ++ S    + VV GRLL +G+LV  +TS 
Sbjct: 1055 ALSVPAAIQLEDICSEVDKDPQLKKLKEEVLRDPSTHPDYAVVQGRLLRQGKLVLPRTSQ 1114

Query: 495  LIPKILHTFNDSILGGHSEFLGTYKRISGELYWKHMKAGVKKYVEQCDICQQNKYEATKP 554
            L+  IL  F+D  +GGH   L T +RI    YW+ M   +++YV +C +C +NKY    P
Sbjct: 1115 LVWVILREFHDGKVGGHGGVLKTQRRIGDLFYWQGMMTEIREYVAECVVCHKNKYSTLVP 1174

Query: 555  AGLLQSIPISDRILEDWTMDFIKGLPPVGGVDVIMVVVDRFSKYSYLILLRHPFSAKQAR 614
            AGLLQ +P+ ++I ED ++DFI+GLP   G DVIMVVVDR +K ++   L+HPF A +  
Sbjct: 1175 AGLLQPLPVPEQIWEDISLDFIEGLPKSEGYDVIMVVVDRLTKSAHFNRLKHPFVASEVA 1234

Query: 615  K----------------------------W----------------DKLIPWAE------ 674
                                         W                DK   W++      
Sbjct: 1235 LLFIQEVVRLHGFPKTLVSDRDKVFTGMFWGELFRGLETYLRCFASDKPKSWSQYLAWAE 1294

Query: 675  LWYNTTFHAYTKITPFQIVYGSSPPPLLSYGHKKIPNN---------------------- 734
            L YNT++H+  ++TPF+ V+G  PP L+ + +    N                       
Sbjct: 1295 LCYNTSYHSTIQMTPFKAVFGRDPPALVKFENGSTTNAKLETYLRDRDVVIILLRQHILK 1354

Query: 735  -GNIMKKMADWKRRELKFQVGHEVYLKLRPYRQRSLAWKKCEN----------------- 755
               +MK+ AD  RREL FQVG  VYLKL+PYRQ+SLA +  E                  
Sbjct: 1355 AQQVMKRQADKHRRELDFQVGDMVYLKLKPYRQKSLARRSNEKLSARYYGPYEVLARVGE 1414

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
POL2_DROME	3.0e-41	31.59	Retrovirus-related Pol polyprotein from transposon 297 OS=Drosophila melanogaste...
POL3_DROME	1.5e-32	35.24	Retrovirus-related Pol polyprotein from transposon 17.6 OS=Drosophila melanogast...
M860_ARATH	1.5e-24	50.00	Uncharacterized mitochondrial protein AtMg00860 OS=Arabidopsis thaliana GN=AtMg0...
POL4_DROME	1.7e-23	23.64	Retrovirus-related Pol polyprotein from transposon 412 OS=Drosophila melanogaste...
POLY_DROME	2.8e-23	29.39	Retrovirus-related Pol polyprotein from transposon gypsy OS=Drosophila melanogas...

Match Name	E-value	Identity	Description
A0A087FZI0_ARAAL	1.2e-134	39.72	Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs71112U000100 PE=4...
A0A087H8D5_ARAAL	3.6e-126	39.89	Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G106900 PE=4 SV=1
A0A087GW89_ARAAL	1.5e-116	37.87	Uncharacterized protein OS=Arabis alpina GN=AALP_AA5G106200 PE=4 SV=1
A0A087HBU4_ARAAL	1.3e-112	37.32	Uncharacterized protein OS=Arabis alpina GN=AALP_AA3G264600 PE=4 SV=1
A0A087FX63_ARAAL	3.4e-108	37.12	Uncharacterized protein (Fragment) OS=Arabis alpina GN=AALP_AAs67613U000200 PE=4...

Match Name	E-value	Identity	Description
ATMG00860.1	8.4e-26	50.00	ATMG00860.1 DNA/RNA polymerases superfamily protein

Match Name	E-value	Identity	Description
gi|674229247|gb|KFK23032.1|	1.8e-134	39.72	hypothetical protein AALP_AAs71112U000100, partial [Arabis alpina]
gi|729375164|ref|XP_010548864.1|	5.7e-133	36.93	PREDICTED: uncharacterized protein LOC104820194 [Tarenaya hassleriana]
gi|923614274|ref|XP_013745228.1|	7.4e-133	41.44	PREDICTED: uncharacterized protein LOC106447810 [Brassica napus]
gi|674245622|gb|KFK38387.1|	5.2e-126	39.89	hypothetical protein AALP_AA3G106900 [Arabis alpina]
gi|674241376|gb|KFK34141.1|	2.2e-116	37.87	hypothetical protein AALP_AA5G106200 [Arabis alpina]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term	Definition
IPR000953	Chromo/chromo_shadow_dom
IPR012337	RNaseH-like_sf
IPR016197	Chromo-like_dom_sf
Vocabulary: Molecular Function
Term	Definition
GO:0003676	nucleic acid binding
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003676 nucleic acid binding

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CSPI05G14890.1	CSPI05G14890.1	mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR000953	Chromo/chromo shadow domain	PROFILE	PS50013	CHROMO_2	coord: 696..731; score: 10
IPR012337	Ribonuclease H-like domain	GENE3D	G3DSA:3.30.420.10		coord: 528..584; score: 3.
IPR012337	Ribonuclease H-like domain	unknown	SSF53098	Ribonuclease H-like	coord: 525..604; score: 1.0
IPR016197	Chromo domain-like	unknown	SSF54160	Chromo domain-like	coord: 685..737; score: 7.3
None	No IPR available	unknown	Coil	Coil	coord: 56..76; scor
None	No IPR available	GENE3D	G3DSA:2.40.50.40		coord: 699..731; score: 1.
None	No IPR available	GENE3D	G3DSA:3.10.10.10		coord: 101..193; score: 6.7
None	No IPR available	PANTHER	PTHR24559	FAMILY NOT NAMED	coord: 126..743; score: 6.9E
None	No IPR available	PANTHER	PTHR24559:SF186	SUBFAMILY NOT NAMED	coord: 126..743; score: 6.9E
None	No IPR available	unknown	SSF56672	DNA/RNA polymerases	coord: 85..402; score: 2.22

The following gene(s) are orthologous to this gene:

None

The following gene(s) are paralogous to this gene:

None