Sed0019163 (gene) Chayote v1

Overview
NameSed0019163
Typegene
OrganismSechium edule (Chayote v1)
DescriptionIntegrase catalytic domain-containing protein
LocationLG04: 28821474 .. 28823432 (-)
RNA-Seq ExpressionSed0019163
SyntenySed0019163
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSexonpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGTCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAAGGGAGATACAAATATTAAGAGTAATATTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGACACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGGTATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAGCAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGGCAGGTATGACACAAACATATGATTATTTTAAAGATAGATGGATACTAGATTCAGGTGCAGCAGCCCATATATGTCATAACAAAGATATATTCATGAATTTAAAAAGGATTGATACCTCTGTGATATTACCTAATAAGGATAGGATTATAATCACTCATGCTGGATCTATATTGTTGTGTGGATCTATCATTTTAGATAGAGTCTTGTATGTTCCAAGTTTTAAATACAATTTATTGTCTATTAGTGCACTAACCTTGAATGATGCAGTGTTAGTAAATTTCACAACTAATGCTTGTATTATCCAGGACAAGCGCACTTTGAAGATGATTGGGAAGGGTAATCTTGAGCAAGGATTATATGTGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATATTGTTTGTAGTGTAAGGAGTGCCTCACCATCCCTATGGCATAGGAGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGTAAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACAAAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTGATCTCATACATGTTGATATATGGGGGCCTTTTGCTCATACCTCACATTTAGGATACAGATATTTTCTAACAATAGTAGATGATTTTAGTAGATACACTTGGATATTTCTTTTGAAAAATAAATCAGATGTTTTAACTGTTATTCCACACTTCTTTAAACTAGTGCACACACAATATTCAAAAGTGATAAAGTGTTTTAGATCTGATAATGCGCCAGAACTTAAATTTACTGAATTCTTTAAGAAAAACGGAGTAGAACATCAATACTCTTGTGTAGCTCGTCCGGAACAAAACTCAGTAGTTGAGCGAAAACACCAACACATCCTCAATGTAGGAAGAGCCATCTATTTCCAATCCAAAATACCTTTAGATTATTGGGGAGAGTGTATACTAACTGCTGTTCATTTAATAAATAGGACACCTTCTAAGAACCTAGATTGGAAAACTCCCTATGAATTACTAAAGAAAGAAATGCCAAACTACCAAACCTTAAAAGTTTTTGGGTGTTTGGCATATGCATCTACCATTCGTGAACATAGAAATAAATTTTCATCTAGAGCAATTCCAAGTGTTTTTGTAGGCTATCCACAAGGCATGAAAGGTTTTAAACTTCTGGATTTAGAAAATAACAATATTTTTGTTTCAAGGGATGTAGTTTTCCATGAGGAAGTTTTCCCTTTTCAAAGTAAAAATACCATAGAAAACATGCCAGATTTTATTATGAACCAAGTATTGCCGAAAGCTTGTGATATATCTCTAGAATATAAAGATAGCATACATGACAATTATGGTGATGAAGTACAAAAGTAA

mRNA sequence

ATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGTCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAAGGGAGATACAAATATTAAGAGTAATATTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGACACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGGTATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAGCAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGGCAGGTATGACACAAACATATGATTATTTTAAAGATAGATGGATACTAGATTCAGGTGCAGCAGCCCATATATGTCATAACAAAGATATATTCATGAATTTAAAAAGGATTGATACCTCTGTGATATTACCTAATAAGGATAGGATTATAATCACTCATGCTGGATCTATATTGTTGTGTGGATCTATCATTTTAGATAGAGTCTTGTATGTTCCAAGTTTTAAATACAATTTATTGTCTATTAGTGCACTAACCTTGAATGATGCAGTGTTAGTAAATTTCACAACTAATGCTTGTATTATCCAGGACAAGCGCACTTTGAAGATGATTGGGAAGGGTAATCTTGAGCAAGGATTATATGTGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATATTGTTTGTAGTGTAAGGAGTGCCTCACCATCCCTATGGCATAGGAGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGTAAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACAAAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTGATCTCATACATGTTGATATATGGGGGCCTTTTGCTCATACCTCACATTTAGGATACAGATATTTTCTAACAATAGTAGATGATTTTAGTAGATACACTTGGATATTTCTTTTGAAAAATAAATCAGATGTTTTAACTGTTATTCCACACTTCTTTAAACTAGTGCACACACAATATTCAAAAGTGATAAAGTGTTTTAGATCTGATAATGCGCCAGAACTTAAATTTACTGAATTCTTTAAGAAAAACGGAGTAGAACATCAATACTCTTGTGTAGCTCGTCCGGAACAAAACTCAGTAGTTGAGCGAAAACACCAACACATCCTCAATGTAGGAAGAGCCATCTATTTCCAATCCAAAATACCTTTAGATTATTGGGGAGAGTGTATACTAACTGCTGTTCATTTAATAAATAGGACACCTTCTAAGAACCTAGATTGGAAAACTCCCTATGAATTACTAAAGAAAGAAATGCCAAACTACCAAACCTTAAAAGTTTTTGGGTGTTTGGCATATGCATCTACCATTCGTGAACATAGAAATAAATTTTCATCTAGAGCAATTCCAAGTGTTTTTGTAGGCTATCCACAAGGCATGAAAGGTTTTAAACTTCTGGATTTAGAAAATAACAATATTTTTGTTTCAAGGGATGTAGTTTTCCATGAGGAAGTTTTCCCTTTTCAAAGTAAAAATACCATAGAAAACATGCCAGATTTTATTATGAACCAAGTATTGCCGAAAGCTTGTGATATATCTCTAGAATATAAAGATAGCATACATGACAATTATGGTGATGAAGTACAAAAGTAA

Coding sequence (CDS)

ATGGGTCTAAATGATGAGTTTTCTACTACTAGATCACAAATACTTCTCATGGATCCATTGCCACCAGTCAATAAGGCTTTTTCTTTAATCGTACAAGAAGAGGAACATAAGGGAGATACAAATATTAAGAGTAATATTACCTTAGCTGCCACTCAGTCTAAAACCACATACAAAGGGAAGGATTCTAAGCCAGTATGCAAGCATTGTGGTCTCATAGGACACACAATTGATGTTTGCTATAGAATACATGGATATCCGGATAATAGACCTGTGTGCAAGCATTGTGGGTTACAAGGACACACCATCGATGTATGTTATAAAATACATGGGTATCCACCTAGTAACAAGCAAAGGAAAAATAACTACAAGCAAACCAATGATAACCAAGGTTCTGTACAACCTGAAAACAAATCTTGCAAATCAGCAACAGTTGCAGCTAGCAATATTGAAAGTGATCCTTTTCAACAATGTCATGATATATTGACTCTTCTTCAATCCAAGTTAGCTGGCATCAAGAATGACAATGGAGCGAACCTAACGCAACATATGGCAGGTATGACACAAACATATGATTATTTTAAAGATAGATGGATACTAGATTCAGGTGCAGCAGCCCATATATGTCATAACAAAGATATATTCATGAATTTAAAAAGGATTGATACCTCTGTGATATTACCTAATAAGGATAGGATTATAATCACTCATGCTGGATCTATATTGTTGTGTGGATCTATCATTTTAGATAGAGTCTTGTATGTTCCAAGTTTTAAATACAATTTATTGTCTATTAGTGCACTAACCTTGAATGATGCAGTGTTAGTAAATTTCACAACTAATGCTTGTATTATCCAGGACAAGCGCACTTTGAAGATGATTGGGAAGGGTAATCTTGAGCAAGGATTATATGTGTTAGAGGAGGTACCTTTATCTGCAGCATTGAATATTGTTTGTAGTGTAAGGAGTGCCTCACCATCCCTATGGCATAGGAGATTAGGTCATCCAGCTGATTTACCTTTAGTTGCTTTAAAAAATGTACTTTCTTTTGATGCAAATTGTAAAGGGGCTGAAAATTGTACTATATGCCCTTTGGCTAAACAAAACAGATTGAGATTCATTTCAAATAATAATAAATCAGATGCTATTTTTGATCTCATACATGTTGATATATGGGGGCCTTTTGCTCATACCTCACATTTAGGATACAGATATTTTCTAACAATAGTAGATGATTTTAGTAGATACACTTGGATATTTCTTTTGAAAAATAAATCAGATGTTTTAACTGTTATTCCACACTTCTTTAAACTAGTGCACACACAATATTCAAAAGTGATAAAGTGTTTTAGATCTGATAATGCGCCAGAACTTAAATTTACTGAATTCTTTAAGAAAAACGGAGTAGAACATCAATACTCTTGTGTAGCTCGTCCGGAACAAAACTCAGTAGTTGAGCGAAAACACCAACACATCCTCAATGTAGGAAGAGCCATCTATTTCCAATCCAAAATACCTTTAGATTATTGGGGAGAGTGTATACTAACTGCTGTTCATTTAATAAATAGGACACCTTCTAAGAACCTAGATTGGAAAACTCCCTATGAATTACTAAAGAAAGAAATGCCAAACTACCAAACCTTAAAAGTTTTTGGGTGTTTGGCATATGCATCTACCATTCGTGAACATAGAAATAAATTTTCATCTAGAGCAATTCCAAGTGTTTTTGTAGGCTATCCACAAGGCATGAAAGGTTTTAAACTTCTGGATTTAGAAAATAACAATATTTTTGTTTCAAGGGATGTAGTTTTCCATGAGGAAGTTTTCCCTTTTCAAAGTAAAAATACCATAGAAAACATGCCAGATTTTATTATGAACCAAGTATTGCCGAAAGCTTGTGATATATCTCTAGAATATAAAGATAGCATACATGACAATTATGGTGATGAAGTACAAAAGTAA

Protein sequence

MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVLPKACDISLEYKDSIHDNYGDEVQK
Homology
BLAST of Sed0019163 vs. NCBI nr
Match: KZV25004.1 (Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum])

HSP 1 Score: 529.6 bits (1363), Expect = 3.7e-146
Identity = 280/641 (43.68%), Postives = 392/641 (61.15%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-----------DTNIKSNITLA 60
           MGLND ++  R+Q+L+++PLP + K F+L++QEE  +             + I SN+  +
Sbjct: 196 MGLNDSYAQVRAQVLMIEPLPTIAKVFALVIQEERQRSIHYDVSKAGVDHSGILSNVNSS 255

Query: 61  ATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIH 120
           A  + +    ++SK                    G   +R +C HC  + HT+D CYK+H
Sbjct: 256 ANTATSLRTSQNSK--------------------GGRGDRIICSHCHFRNHTVDKCYKLH 315

Query: 121 GYPPSNKQRKNNYKQTNDNQGSVQPENKSCKS----ATVAASNIESDPFQQCHDILTLLQ 180
           GYPP + + K+       +QGS      S  S     T    + +S    QC  ++  L 
Sbjct: 316 GYPPGHPKFKSQI-----SQGSAHAHQASSSSETHQETQQIDHSDSLTQSQCKQLIEFLS 375

Query: 181 SKLAGIKN--------------DNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNK 240
           SKL   +N                  + T H+  +T+     KD WI+D+GA  HIC + 
Sbjct: 376 SKLQTRQNLLMEHQPETTVSCLTGICSATSHIPAITR-----KD-WIMDTGATHHICCSL 435

Query: 241 DIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLND 300
            +F + + I + V+LPN   I +T AG++ +  +++L  VLYVP F++NLLS+S+LT N 
Sbjct: 436 SMFKSSRAIQSKVVLPNTLTIPVTIAGTVAVTSNLVLQNVLYVPVFQFNLLSVSSLTDNH 495

Query: 301 AVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRR 360
              V+F +++C IQD   ++MIG G     LYVL++ P     + +C+   ++  LWHRR
Sbjct: 496 NCSVSFMSDSCKIQDISQIRMIGMGKRIGNLYVLQQ-PDRFLPSYICNTFVSNSELWHRR 555

Query: 361 LGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIW 420
           +GHP+   L +LKNVL+ + N      C  C L+KQ RL   S NN S  IF+L+H+D W
Sbjct: 556 MGHPSFNKLSSLKNVLNIE-NTDIVNICHSCHLSKQRRLPLASRNNISARIFELLHIDTW 615

Query: 421 GPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRS 480
           GPF+ TS  G+R+F TIVDD SRYTW+++LK+KSDVL++ P F ++V TQ+   +K  RS
Sbjct: 616 GPFSQTSVDGFRFFFTIVDDHSRYTWVYMLKSKSDVLSIFPDFCRMVSTQFGVTVKSVRS 675

Query: 481 DNAPELKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGE 540
           DNAPEL F +FF K G+ H +SCV RP+QNSVVERKHQHILNV RA+ FQS IPLDYW +
Sbjct: 676 DNAPELGFADFFAKAGITHYHSCVERPQQNSVVERKHQHILNVARALLFQSHIPLDYWCD 735

Query: 541 CILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAI 600
           CI T+V+LINRTPS  L  KTP+ELL  ++P+Y  LKVFGCL YAST+   R+KFS RAI
Sbjct: 736 CINTSVYLINRTPSPILAHKTPFELLHGKLPSYSHLKVFGCLCYASTLLSSRHKFSPRAI 795

Query: 601 PSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS 613
             VF+GYP G KG+KLL+LE N IF+SRDV+FHE  FP+Q+
Sbjct: 796 RCVFIGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPYQN 803

BLAST of Sed0019163 vs. NCBI nr
Match: KZV39348.1 (hypothetical protein F511_17540 [Dorcoceras hygrometricum])

HSP 1 Score: 517.7 bits (1332), Expect = 1.5e-142
Identity = 278/633 (43.92%), Postives = 383/633 (60.51%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGK 60
           MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE                 + ++ ++G 
Sbjct: 102 MGLNESYAQIRAQILLMDPLPVISKIFSLVVQEE-----------------RQRSIHQGV 161

Query: 61  DSK----PVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNK 120
             K    P+  + G     +   Y   G   ++  C HC L  HT+D CYK+HGYPP + 
Sbjct: 162 GGKLLDQPLVMNYGANVAAVKGTYNPKGIKSDKVTCTHCHLPNHTVDKCYKLHGYPPGHP 221

Query: 121 QRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNG 180
           + K   KQ++     +Q + ++  +A+V    ++    + C  ++  L S+L  + N   
Sbjct: 222 RYK--LKQSDKKSHMIQSQPQADGTASVVGDILKP---EHCRQLIAFLSSQLQ-LGNGTT 281

Query: 181 ANLTQHMAGMTQTYDYFKD--------------RWILDSGAAAHICHNKDIFMNLKRIDT 240
             L Q       +   F D               WI+D+GA  HIC +   F++ K  ++
Sbjct: 282 MALQQPQQPPESSTSCFNDTYSLSTSHTAFPTFSWIIDTGATHHICCSLHHFVSFKPFNS 341

Query: 241 SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC 300
           +V LPN   I +TH GS++L   IIL  VL+VP FK+NLLSIS+LT      V+F++  C
Sbjct: 342 NVTLPNSLNIPVTHIGSVMLLPEIILQNVLFVPQFKFNLLSISSLTKQIPCSVSFSSELC 401

Query: 301 IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVA 360
            IQ     + IG G     LY+L   P S     VC+   +   LWH RLGH +   L  
Sbjct: 402 QIQVLNQARTIGTGRRIGDLYLLTSPPSSRM--EVCATVHSKTQLWHYRLGHISLPRLSI 461

Query: 361 LKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHL 420
           L +V+  SF +N +    C IC L+KQ RL FISNN   D+ FDL+H+DIWGPF   +  
Sbjct: 462 LGDVIQESF-SNNEALSACEICHLSKQKRLPFISNNTVVDSCFDLVHIDIWGPFNPMNVD 521

Query: 421 GYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFT 480
           G++YFLTIVDD SRYTW+ LLK+KSDV  + P F +++ TQ+ K IK  RSDNAPEL+F+
Sbjct: 522 GFKYFLTIVDDHSRYTWVQLLKSKSDVTIIFPAFCRMIRTQFGKSIKAVRSDNAPELQFS 581

Query: 481 EFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLI 540
           EFFK  G+   +SCV RP+QNS+VERKHQHILNV RA+ FQS IPL YW +CILT+V+LI
Sbjct: 582 EFFKAEGIVSYHSCVERPQQNSIVERKHQHILNVARALLFQSNIPLVYWSDCILTSVYLI 641

Query: 541 NRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQ 600
           NR P+  L  KTP+E++  ++PN+  L+VFGCL Y ST+  HR KFS RAI S+F+GYP 
Sbjct: 642 NRVPAPILSNKTPFEVMHTKIPNFSHLRVFGCLCYGSTLLSHRTKFSPRAIRSIFLGYPP 701

Query: 601 GMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSK 614
           G KG+KLL+L+ N I++SRDV FHE VFPF++K
Sbjct: 702 GYKGYKLLNLDTNEIYISRDVTFHETVFPFRNK 708

BLAST of Sed0019163 vs. NCBI nr
Match: XP_012857659.1 (PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata])

HSP 1 Score: 515.4 bits (1326), Expect = 7.2e-142
Identity = 284/689 (41.22%), Postives = 403/689 (58.49%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHK-----GDTNIKSNITLAATQSKT 60
           MGLND  ++TR QILLMDPLPP+NK F+L+ QEE H+       ++++ ++  AA   +T
Sbjct: 187 MGLNDSLASTRGQILLMDPLPPINKVFALVSQEERHRSVAVTSSSDVQHSLAFAARGIQT 246

Query: 61  --------------TYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHT 120
                         T   +  K  C HC   GHT++ CYR+HG+P               
Sbjct: 247 NQFVRRPQNNQFYGTTSQRKDKIYCTHCHKTGHTVEKCYRLHGFPPG------------- 306

Query: 121 IDVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPF------Q 180
               Y+    P     + +  K   +    +   + S  S +++ S   SD F       
Sbjct: 307 ----YQPRQKPGMTSNQSSQTKFAVNQVSDIVHSDASLNSGSLSQSLPSSDNFLDAMTAS 366

Query: 181 QCHDILTLLQSKLAGIKN----DNGANL--TQHMAGMT--------QTYDYFKDRWILDS 240
           QC  +L+ + S LA   N    D  + +  T H++ +T         T  +    WILDS
Sbjct: 367 QCQQLLSYVSSHLANKANQPPHDKNSEIFDTSHISRVTGICLFNALHTPSFMPHHWILDS 426

Query: 241 GAAAHICHNKDIFMNLKRIDTS-VILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYN 300
           GA+ HICHNK +F+N+K +  + V+LP+   +++   G + L   ++L  V YVP FK+N
Sbjct: 427 GASRHICHNKSLFLNMKSVSNARVVLPDSSMVLVNCIGDVQLTTHLVLHNVFYVPEFKFN 486

Query: 301 LLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSV 360
           L+S+SAL    + +V F   +  IQD R +  IGKGN  QGLYVL+ V  S   +  C+ 
Sbjct: 487 LVSVSALLHGSSYVVIFDEFSFSIQD-RLMTQIGKGNKVQGLYVLDPVSASPIEHAFCNK 546

Query: 361 RSASPSLWHRRLGH--PADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNK 420
            SA  ++WH RLGH     L  +A K  LS D     +  C +CPLAKQ RL F ++++ 
Sbjct: 547 ISA--TVWHHRLGHIPQPKLAFLAKKFSLSVD-KISESSCCYVCPLAKQKRLHFSNSSSV 606

Query: 421 SDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLV 480
           S A+FDLIH DIWGPF   S+ G+ YF+T+VDD+SR+TW+ LLK KS+V+TV+P F K+V
Sbjct: 607 STAMFDLIHCDIWGPFKVPSYSGFHYFVTLVDDYSRFTWVHLLKTKSEVITVVPRFLKMV 666

Query: 481 HTQYSKVIKCFRSDNAPELKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAI 540
             Q+ K IK FRSDNA EL+F   F + GV HQ+SCV  P+QN++VERKHQHILNV R++
Sbjct: 667 LNQFGKSIKVFRSDNAYELQFKSLFDELGVIHQFSCVYTPQQNAIVERKHQHILNVARSL 726

Query: 541 YFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMP-NYQTLKVFGCLAYAS 600
           +FQS IP+ YW ECILTAV LINR P+ NL+  +PYELL    P +Y +LK FGCL +A+
Sbjct: 727 FFQSHIPITYWSECILTAVFLINRIPAHNLNDLSPYELLYPAKPFDYHSLKSFGCLVFAT 786

Query: 601 TIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIE 647
            +  H++KF  RA   VF+GYP G+KG+KLLDL ++ +F+SRDV+FHE ++PF +K++  
Sbjct: 787 DVSGHKSKFDPRANACVFLGYPSGIKGYKLLDLVSHKVFISRDVIFHENIYPFTNKSSSS 846

BLAST of Sed0019163 vs. NCBI nr
Match: KZV17946.1 (hypothetical protein F511_10775 [Dorcoceras hygrometricum])

HSP 1 Score: 514.6 bits (1324), Expect = 1.2e-141
Identity = 280/644 (43.48%), Postives = 382/644 (59.32%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-DTNIKSNITLAATQSKTTYKG 60
           MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE  +  +  ++  I             
Sbjct: 141 MGLNESYAQIRAQILLMDPLPTISKIFSLVVQEERQRSINQGVEGRIL------------ 200

Query: 61  KDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRK 120
              +P+    G     +   Y   G   ++  C HC L  HT+D CYK+HGYPP + +  
Sbjct: 201 --EQPLIMSHGANVAAVKGSYNSKGTKTDKVTCSHCHLPNHTVDKCYKLHGYPPGHPK-- 260

Query: 121 NNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANL 180
             YK    ++ S   ++ S      +  N    P + C  ++  L S+L   +  NG  +
Sbjct: 261 --YKVKQSDKKSHMTQSHSIADGVASTVNDFLKP-EHCRQLIAFLSSQL---QIGNGTTM 320

Query: 181 TQHMAGMTQ------TYDYFKDR-------WILDSGAAAHICHNKDIFMNLKRIDTSVIL 240
           T      +       TY             WI+D+GA  HIC +   F++ +  +++V L
Sbjct: 321 TLQQTPESSASCFNGTYSLATSHTILPPSSWIVDTGATHHICCSPHHFVSFEPFNSNVTL 380

Query: 241 PNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQD 300
           PN   I +TH GS++L   I L  VL+VP FK+NLLSIS+LT     LV+F++ +C IQ 
Sbjct: 381 PNNLNIPVTHIGSVILSSEITLHNVLFVPQFKFNLLSISSLTKQIPCLVSFSSESCQIQV 440

Query: 301 KRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNV 360
               K IG G     LY+L     S+    VC+   +   LWH RLGH     L  L + 
Sbjct: 441 LNQAKTIGTGRRVGDLYILTG---SSPKIEVCTAAQSKTQLWHFRLGHIPLPKLSILGDT 500

Query: 361 L--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRY 420
           L  SF  N      C IC L+KQ RL FISNN+  D  FDL+H+DIWGPF   +  G++Y
Sbjct: 501 LQNSF-INNDELSTCEICHLSKQKRLPFISNNSIVDCCFDLVHIDIWGPFNPMNVDGFKY 560

Query: 421 FLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFK 480
           FLTIVDD SRYTW+ LLK+KS+V+ + P F +++H Q+ K IK  RSDNAPELKF+EFFK
Sbjct: 561 FLTIVDDHSRYTWVQLLKSKSEVIDIFPTFCRMIHKQFGKSIKSVRSDNAPELKFSEFFK 620

Query: 481 KNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTP 540
             G+   +SCV RP+QNSVVERKHQHILNV RA+ FQS IPL YW ECILTAV+LINRTP
Sbjct: 621 AEGIVAFHSCVERPQQNSVVERKHQHILNVARALLFQSGIPLVYWSECILTAVYLINRTP 680

Query: 541 SKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKG 600
           +  L  KTP+EL+  + P Y  L+VFGCL Y ST+   R KFS RA  S+F+GYP G KG
Sbjct: 681 APLLSNKTPFELMHNKPPTYSHLRVFGCLCYGSTLLNQRTKFSPRATRSIFLGYPPGYKG 740

Query: 601 FKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVL 629
           +KLL+L+ N +++SRDV+FHE VFPF++K+T  + P+  ++ ++
Sbjct: 741 YKLLNLDTNEVYISRDVIFHETVFPFKNKST--SSPEHCLDNII 756

BLAST of Sed0019163 vs. NCBI nr
Match: RVW82526.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera])

HSP 1 Score: 510.8 bits (1314), Expect = 1.8e-140
Identity = 278/649 (42.84%), Postives = 375/649 (57.78%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGK 60
           +GLN+ F+  ++QILLM+P PP+NK FSL+VQEE  +  T   S        S+     +
Sbjct: 193 LGLNESFAPIQAQILLMEPTPPLNKVFSLVVQEEWQRSLTTSNSPAFTTPVSSRFQAASR 252

Query: 61  DSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKN 120
            S P                       +RP+C HC + GHT+D CYKIHGY P  + R  
Sbjct: 253 ASSPT---------------NSSRSRKDRPLCTHCNILGHTVDRCYKIHGYTPGFRNRP- 312

Query: 121 NYKQTNDNQGSVQPENKSCKSATVAASNIES--------DPFQQCHDILTLLQSKLAGIK 180
           N++        + P +      T+   +I S        D   Q   +L+L  S  +   
Sbjct: 313 NFRPNGSRPNQMLPNSLHTNQLTLTDGSIASASPPPLTHDQHNQLLALLSLHSSSGSSAS 372

Query: 181 NDNGANLTQHMAGMTQTYDYFKDR-------WILDSGAAAHICHNKDIFMNLKRIDT-SV 240
             +   L Q ++  T                WILDSGA  H+C N  +F ++    + +V
Sbjct: 373 FGDSNPLQQSISNFTGILSLSPSSSTLNPSIWILDSGATHHVCTNSSMFHSIHSFSSNTV 432

Query: 241 ILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII 300
            LP   +I IT  G+I L   ++L+ VLY+P+F++NL+SISALT  +    +FT + C I
Sbjct: 433 TLPTGTKIPITGIGTIHLSPHLVLEHVLYIPTFQFNLISISALTQTNCFSFDFTAHFCFI 492

Query: 301 QDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHPADLPLV 360
           QD    K+IG G  +  LY+L+     +  ++     + S     LWH RL HP+++ L 
Sbjct: 493 QDHSQGKLIGMGRRQGNLYLLDSSVFRSISSVFVVDNNTSAHVNKLWHFRLSHPSNVKLS 552

Query: 361 ALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLG 420
            LK  L   +N     +C+ICPLAKQ RL F  +NN S + FDLIH DIWGPF   +H G
Sbjct: 553 VLKPHLQLQSNGNTNLSCSICPLAKQKRLPFDCHNNLSSSPFDLIHCDIWGPFHIPTHDG 612

Query: 421 YRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTE 480
           +RYFLTIVDD +R TW+ LL+ KSDV T+ P FF +V T++   IK  RSDNAPEL  + 
Sbjct: 613 FRYFLTIVDDCTRNTWVHLLRAKSDVKTIFPQFFSMVKTKFGLTIKAVRSDNAPELNLSN 672

Query: 481 FFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLIN 540
            F +  V H +SCV  P+QNSVVERKHQHILNV RA+YFQS IP+ YWG+C+LT+V+LIN
Sbjct: 673 LFTQLDVLHFFSCVETPQQNSVVERKHQHILNVARALYFQSNIPIGYWGDCVLTSVYLIN 732

Query: 541 RTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQG 600
           R PS  L+ KTP+ELL  + P+Y  LK FGCL Y+ST+   R+KFS RA+P VF+GYP G
Sbjct: 733 RIPSPLLNNKTPFELLHHKSPSYSHLKSFGCLCYSSTLPSTRHKFSPRALPCVFLGYPFG 792

Query: 601 MKGFKLLDLENNNIFVSRDVVFHEEVFPFQ-SKNTIENMPDFIMNQVLP 630
            KG+K+LDLE N I VSR+V F E VFPF+ S+N      DF   +VLP
Sbjct: 793 YKGYKILDLETNRISVSRNVTFQESVFPFKLSQNNNSVASDFFSKKVLP 825

BLAST of Sed0019163 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 216.9 bits (551), Expect = 7.0e-55
Identity = 136/447 (30.43%), Postives = 224/447 (50.11%), Query Frame = 0

Query: 194 KDRWILDSGAAAHICHNKDIFMNLKRIDTSVI-LPNKDRIIITHAGSILL-----CGSII 253
           +  W++D+ A+ H    +D+F      D   + + N     I   G I +     C +++
Sbjct: 291 ESEWVVDTAASHHATPVRDLFCRYVAGDFGTVKMGNTSYSKIAGIGDICIKTNVGC-TLV 350

Query: 254 LDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEE 313
           L  V +VP  + NL  IS + L+     ++  N      K +L +I KG     LY    
Sbjct: 351 LKDVRHVPDLRMNL--ISGIALDRDGYESYFANQKWRLTKGSL-VIAKGVARGTLYRTNA 410

Query: 314 VPLSAALNIVCSVRSASPSLWHRRLGHPAD--LPLVALKNVLSFDANCKGAENCTICPLA 373
                 LN   +    S  LWH+R+GH ++  L ++A K+++S+ A     + C  C   
Sbjct: 411 EICQGELN--AAQDEISVDLWHKRMGHMSEKGLQILAKKSLISY-AKGTTVKPCDYCLFG 470

Query: 374 KQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKS 433
           KQ+R+ F +++ +   I DL++ D+ GP    S  G +YF+T +DD SR  W+++LK K 
Sbjct: 471 KQHRVSFQTSSERKLNILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKD 530

Query: 434 DVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPEQNS 493
            V  V   F  LV  +  + +K  RSDN  E    +F E+   +G+ H+ +    P+ N 
Sbjct: 531 QVFQVFQKFHALVERETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNG 590

Query: 494 VVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMP 553
           V ER ++ I+   R++   +K+P  +WGE + TA +LINR+PS  L ++ P  +   +  
Sbjct: 591 VAERMNRTIVEKVRSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEV 650

Query: 554 NYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFVSRDVV 613
           +Y  LKVFGC A+A   +E R K   ++IP +F+GY     G++L D     +  SRDVV
Sbjct: 651 SYSHLKVFGCRAFAHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVV 710

Query: 614 FHEEVFPFQSKNTIENMPDFIMNQVLP 630
           F E         T  +M + + N ++P
Sbjct: 711 FRE-----SEVRTAADMSEKVKNGIIP 725

BLAST of Sed0019163 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 9.1e-55
Identity = 151/514 (29.38%), Postives = 241/514 (46.89%), Query Frame = 0

Query: 115 NKQRKNNYKQTNDNQGSVQPENKSCKS--ATVAASNIESDPFQQCHDILTLLQSKLAGIK 174
           N+   NN K    +  +  P N   K          ++    ++C  +   L S +   +
Sbjct: 248 NRNNNNNSKPWQQSSTNFHPNNNQSKPYLGKCQICGVQGHSAKRCSQLQHFL-SSVNSQQ 307

Query: 175 NDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRID-----TSVILP 234
             +     Q  A +     Y  + W+LDSGA  HI  +   F NL           V++ 
Sbjct: 308 PPSPFTPWQPRANLALGSPYSSNNWLLDSGATHHITSD---FNNLSLHQPYTGGDDVMVA 367

Query: 235 NKDRIIITHAGSILLCGS---IILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII 294
           +   I I+H GS  L      + L  +LYVP+   NL+S+  L   + V V F   +  +
Sbjct: 368 DGSTIPISHTGSTSLSTKSRPLNLHNILYVPNIHKNLISVYRLCNANGVSVEFFPASFQV 427

Query: 295 QDKRTLKMIGKGNLEQGLYVLEEVPLSAA--LNIVCSVRS-ASPSLWHRRLGHPADLPLV 354
           +D  T   + +G  +  LY   E P++++  +++  S  S A+ S WH RLGHPA   L 
Sbjct: 428 KDLNTGVPLLQGKTKDELY---EWPIASSQPVSLFASPSSKATHSSWHARLGHPAPSILN 487

Query: 355 ALKNVLSFDANCKGAE--NCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSH 414
           ++ +  S        +  +C+ C + K N++ F  +   S    + I+ D+W      SH
Sbjct: 488 SVISNYSLSVLNPSHKFLSCSDCLINKSNKVPFSQSTINSTRPLEYIYSDVWSS-PILSH 547

Query: 415 LGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPE-LK 474
             YRY++  VD F+RYTW++ LK KS V      F  L+  ++   I  F SDN  E + 
Sbjct: 548 DNYRYYVIFVDHFTRYTWLYPLKQKSQVKETFITFKNLLENRFQTRIGTFYSDNGGEFVA 607

Query: 475 FTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVH 534
             E+F ++G+ H  S    PE N + ERKH+HI+  G  +   + IP  YW      AV+
Sbjct: 608 LWEYFSQHGISHLTSPPHTPEHNGLSERKHRHIVETGLTLLSHASIPKTYWPYAFAVAVY 667

Query: 535 LINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGY 594
           LINR P+  L  ++P++ L    PNY  L+VFGC  Y      +++K   ++   VF+GY
Sbjct: 668 LINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWLRPYNQHKLDDKSRQCVFLGY 727

Query: 595 PQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS 613
                 +  L L+ + +++SR V F E  FPF +
Sbjct: 728 SLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSN 753

BLAST of Sed0019163 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 212.6 bits (540), Expect = 1.3e-53
Identity = 149/522 (28.54%), Postives = 236/522 (45.21%), Query Frame = 0

Query: 114 SNKQRKNNYKQTNDNQGSVQPENKSCKS---------ATVAASNIESDPFQQCHDILTLL 173
           +N+    NY   N+   S QP +   +S               +++    ++C   L   
Sbjct: 219 NNRGDNRNYNNNNNRSNSWQPSSSGSRSDNRQPKPYLGRCQICSVQGHSAKRCPQ-LHQF 278

Query: 174 QSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHIC--HNKDIFMNLKRIDT 233
           QS     ++ +     Q  A +     Y  + W+LDSGA  HI    N   F        
Sbjct: 279 QSTTNQQQSTSPFTPWQPRANLAVNSPYNANNWLLDSGATHHITSDFNNLSFHQPYTGGD 338

Query: 234 SVILPNKDRIIITHAGSILL---CGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTT 293
            V++ +   I ITH GS  L     S+ L++VLYVP+   NL+S+  L   + V V F  
Sbjct: 339 DVMIADGSTIPITHTGSASLPTSSRSLDLNKVLYVPNIHKNLISVYRLCNTNRVSVEFFP 398

Query: 294 NACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLP 353
            +  ++D  T   + +G  +  LY        A          A+ S WH RLGHP+   
Sbjct: 399 ASFQVKDLNTGVPLLQGKTKDELYEWPIASSQAVSMFASPCSKATHSSWHSRLGHPS--- 458

Query: 354 LVALKNVLSFDA-----NCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIW-GP 413
           L  L +V+S  +           +C+ C + K +++ F ++   S    + I+ D+W  P
Sbjct: 459 LAILNSVISNHSLPVLNPSHKLLSCSDCFINKSHKVPFSNSTITSSKPLEYIYSDVWSSP 518

Query: 414 FAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDN 473
                +  YRY++  VD F+RYTW++ LK KS V      F  LV  ++   I    SDN
Sbjct: 519 ILSIDN--YRYYVIFVDHFTRYTWLYPLKQKSQVKDTFIIFKSLVENRFQTRIGTLYSDN 578

Query: 474 APE-LKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGEC 533
             E +   ++  ++G+ H  S    PE N + ERKH+HI+ +G  +   + +P  YW   
Sbjct: 579 GGEFVVLRDYLSQHGISHFTSPPHTPEHNGLSERKHRHIVEMGLTLLSHASVPKTYWPYA 638

Query: 534 ILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIP 593
              AV+LINR P+  L  ++P++ L  + PNY+ LKVFGC  Y      +R+K   ++  
Sbjct: 639 FSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWLRPYNRHKLEDKSKQ 698

Query: 594 SVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSKN 615
             F+GY      +  L +    ++ SR V F E  FPF + N
Sbjct: 699 CAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTN 734

BLAST of Sed0019163 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 156.8 bits (395), Expect = 8.6e-37
Identity = 119/428 (27.80%), Postives = 190/428 (44.39%), Query Frame = 0

Query: 197 WILDSGAAAHICHNKDIFMNLKRIDTSV---ILPNKDRIIITHAGSILLCG--SIILDRV 256
           ++LDSGA+ H+ +++ ++ +   +   +   +    + I  T  G + L     I L+ V
Sbjct: 289 FVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRNDHEITLEDV 348

Query: 257 LYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVP-L 316
           L+      NL+S+  L     + + F  +   I     + +   G       +L  VP +
Sbjct: 349 LFCKEAAGNLMSVKRLQ-EAGMSIEFDKSGVTISKNGLMVVKNSG-------MLNNVPVI 408

Query: 317 SAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNVLSFDANC------KGAENCTICPL 376
           +     + +    +  LWH R GH +D  L+ +K    F             E C  C  
Sbjct: 409 NFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPCLN 468

Query: 377 AKQNRLRFISNNNKSDAIFDL--IHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLLK 436
            KQ RL F    +K+     L  +H D+ GP    +     YF+  VD F+ Y   +L+K
Sbjct: 469 GKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIK 528

Query: 437 NKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPE 496
            KSDV ++   F       ++  +     DN  E    +  +F  K G+ +  +    P+
Sbjct: 529 YKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQ 588

Query: 497 QNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNL--DWKTPYELL 556
            N V ER  + I    R +   +K+   +WGE +LTA +LINR PS+ L    KTPYE+ 
Sbjct: 589 LNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMW 648

Query: 557 KKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKGFKLLDLENNNIFV 606
             + P  + L+VFG   Y   I+  + KF  ++  S+FVGY     GFKL D  N    V
Sbjct: 649 HNKKPYLKHLRVFGATVYVH-IKNKQGKFDDKSFKSIFVGYEP--NGFKLWDAVNEKFIV 705

BLAST of Sed0019163 vs. ExPASy Swiss-Prot
Match: P25384 (Transposon Ty2-C Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY2B-C PE=3 SV=2)

HSP 1 Score: 81.3 bits (199), Expect = 4.6e-14
Identity = 101/442 (22.85%), Postives = 167/442 (37.78%), Query Frame = 0

Query: 106 YKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQC-HDILTLL 165
           YK H    +  +   N   T     + Q  N S   A  A +   S  F +  +D +   
Sbjct: 357 YKQHSEYKNVSRTSPNTTNTKVTTRNYQRTNSSKPRAAKAHNIATSSKFSRVNNDHINES 416

Query: 166 QSKLAGIKNDNGANLTQHMAGMTQTY-----DYFKDRWILDSGA------AAHICHNKDI 225
                 + +DN  +L Q       T+     D   D  ++DSGA      +AH  H+   
Sbjct: 417 TVSSQYLSDDNELSLGQQQKESKPTHTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATP 476

Query: 226 FMNLKRIDTSVILPNKDRIIITHAGSI---LLCGSIILDRVLYVPSFKYNLLSISALTLN 285
              +  +D       K  I I   G++      G+    + L+ P+  Y+LLS+S L  N
Sbjct: 477 NSEINIVDA-----QKQDIPINAIGNLHFNFQNGTKTSIKALHTPNIAYDLLSLSELA-N 536

Query: 286 DAVLVNFTTNACIIQDKRTLKMIGKGNLEQGL---YVLEEVPLSAALNIVCSVRSASP-- 345
             +   FT N     D   L  I K      L   Y++        +N V   +S +   
Sbjct: 537 QNITACFTRNTLERSDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYP 596

Query: 346 -SLWHRRLGHP--ADLPLVALKNVLSF----DANCKGAE--NCTICPLAKQNRLRFISNN 405
             L HR LGH     +     KN +++    D     A    C  C + K  + R +  +
Sbjct: 597 YPLIHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGS 656

Query: 406 ----NKSDAIFDLIHVDIWGPFAHTSHLGYRYFLTIVDDFSRYTWIFLL--KNKSDVLTV 465
                +S   F  +H DI+GP  H       YF++  D+ +R+ W++ L  + +  +L V
Sbjct: 657 RLKYQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNV 716

Query: 466 IPHFFKLVHTQYSKVIKCFRSDNAPEL---KFTEFFKKNGVEHQYSCVARPEQNSVVERK 510
                  +  Q++  +   + D   E       +FF   G+   Y+  A    + V ER 
Sbjct: 717 FTSILAFIKNQFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERL 776

BLAST of Sed0019163 vs. ExPASy TrEMBL
Match: A0A2Z7AT15 (Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Dorcoceras hygrometricum OX=472368 GN=F511_01974 PE=4 SV=1)

HSP 1 Score: 529.6 bits (1363), Expect = 1.8e-146
Identity = 280/641 (43.68%), Postives = 392/641 (61.15%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-----------DTNIKSNITLA 60
           MGLND ++  R+Q+L+++PLP + K F+L++QEE  +             + I SN+  +
Sbjct: 196 MGLNDSYAQVRAQVLMIEPLPTIAKVFALVIQEERQRSIHYDVSKAGVDHSGILSNVNSS 255

Query: 61  ATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIH 120
           A  + +    ++SK                    G   +R +C HC  + HT+D CYK+H
Sbjct: 256 ANTATSLRTSQNSK--------------------GGRGDRIICSHCHFRNHTVDKCYKLH 315

Query: 121 GYPPSNKQRKNNYKQTNDNQGSVQPENKSCKS----ATVAASNIESDPFQQCHDILTLLQ 180
           GYPP + + K+       +QGS      S  S     T    + +S    QC  ++  L 
Sbjct: 316 GYPPGHPKFKSQI-----SQGSAHAHQASSSSETHQETQQIDHSDSLTQSQCKQLIEFLS 375

Query: 181 SKLAGIKN--------------DNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNK 240
           SKL   +N                  + T H+  +T+     KD WI+D+GA  HIC + 
Sbjct: 376 SKLQTRQNLLMEHQPETTVSCLTGICSATSHIPAITR-----KD-WIMDTGATHHICCSL 435

Query: 241 DIFMNLKRIDTSVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLND 300
            +F + + I + V+LPN   I +T AG++ +  +++L  VLYVP F++NLLS+S+LT N 
Sbjct: 436 SMFKSSRAIQSKVVLPNTLTIPVTIAGTVAVTSNLVLQNVLYVPVFQFNLLSVSSLTDNH 495

Query: 301 AVLVNFTTNACIIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRR 360
              V+F +++C IQD   ++MIG G     LYVL++ P     + +C+   ++  LWHRR
Sbjct: 496 NCSVSFMSDSCKIQDISQIRMIGMGKRIGNLYVLQQ-PDRFLPSYICNTFVSNSELWHRR 555

Query: 361 LGHPADLPLVALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIW 420
           +GHP+   L +LKNVL+ + N      C  C L+KQ RL   S NN S  IF+L+H+D W
Sbjct: 556 MGHPSFNKLSSLKNVLNIE-NTDIVNICHSCHLSKQRRLPLASRNNISARIFELLHIDTW 615

Query: 421 GPFAHTSHLGYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRS 480
           GPF+ TS  G+R+F TIVDD SRYTW+++LK+KSDVL++ P F ++V TQ+   +K  RS
Sbjct: 616 GPFSQTSVDGFRFFFTIVDDHSRYTWVYMLKSKSDVLSIFPDFCRMVSTQFGVTVKSVRS 675

Query: 481 DNAPELKFTEFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGE 540
           DNAPEL F +FF K G+ H +SCV RP+QNSVVERKHQHILNV RA+ FQS IPLDYW +
Sbjct: 676 DNAPELGFADFFAKAGITHYHSCVERPQQNSVVERKHQHILNVARALLFQSHIPLDYWCD 735

Query: 541 CILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAI 600
           CI T+V+LINRTPS  L  KTP+ELL  ++P+Y  LKVFGCL YAST+   R+KFS RAI
Sbjct: 736 CINTSVYLINRTPSPILAHKTPFELLHGKLPSYSHLKVFGCLCYASTLLSSRHKFSPRAI 795

Query: 601 PSVFVGYPQGMKGFKLLDLENNNIFVSRDVVFHEEVFPFQS 613
             VF+GYP G KG+KLL+LE N IF+SRDV+FHE  FP+Q+
Sbjct: 796 RCVFIGYPPGYKGYKLLNLETNEIFISRDVIFHENTFPYQN 803

BLAST of Sed0019163 vs. ExPASy TrEMBL
Match: A0A2Z7C0E8 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_17540 PE=4 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 7.1e-143
Identity = 278/633 (43.92%), Postives = 383/633 (60.51%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGK 60
           MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE                 + ++ ++G 
Sbjct: 102 MGLNESYAQIRAQILLMDPLPVISKIFSLVVQEE-----------------RQRSIHQGV 161

Query: 61  DSK----PVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNK 120
             K    P+  + G     +   Y   G   ++  C HC L  HT+D CYK+HGYPP + 
Sbjct: 162 GGKLLDQPLVMNYGANVAAVKGTYNPKGIKSDKVTCTHCHLPNHTVDKCYKLHGYPPGHP 221

Query: 121 QRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNG 180
           + K   KQ++     +Q + ++  +A+V    ++    + C  ++  L S+L  + N   
Sbjct: 222 RYK--LKQSDKKSHMIQSQPQADGTASVVGDILKP---EHCRQLIAFLSSQLQ-LGNGTT 281

Query: 181 ANLTQHMAGMTQTYDYFKD--------------RWILDSGAAAHICHNKDIFMNLKRIDT 240
             L Q       +   F D               WI+D+GA  HIC +   F++ K  ++
Sbjct: 282 MALQQPQQPPESSTSCFNDTYSLSTSHTAFPTFSWIIDTGATHHICCSLHHFVSFKPFNS 341

Query: 241 SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC 300
           +V LPN   I +TH GS++L   IIL  VL+VP FK+NLLSIS+LT      V+F++  C
Sbjct: 342 NVTLPNSLNIPVTHIGSVMLLPEIILQNVLFVPQFKFNLLSISSLTKQIPCSVSFSSELC 401

Query: 301 IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVA 360
            IQ     + IG G     LY+L   P S     VC+   +   LWH RLGH +   L  
Sbjct: 402 QIQVLNQARTIGTGRRIGDLYLLTSPPSSRM--EVCATVHSKTQLWHYRLGHISLPRLSI 461

Query: 361 LKNVL--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHL 420
           L +V+  SF +N +    C IC L+KQ RL FISNN   D+ FDL+H+DIWGPF   +  
Sbjct: 462 LGDVIQESF-SNNEALSACEICHLSKQKRLPFISNNTVVDSCFDLVHIDIWGPFNPMNVD 521

Query: 421 GYRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFT 480
           G++YFLTIVDD SRYTW+ LLK+KSDV  + P F +++ TQ+ K IK  RSDNAPEL+F+
Sbjct: 522 GFKYFLTIVDDHSRYTWVQLLKSKSDVTIIFPAFCRMIRTQFGKSIKAVRSDNAPELQFS 581

Query: 481 EFFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLI 540
           EFFK  G+   +SCV RP+QNS+VERKHQHILNV RA+ FQS IPL YW +CILT+V+LI
Sbjct: 582 EFFKAEGIVSYHSCVERPQQNSIVERKHQHILNVARALLFQSNIPLVYWSDCILTSVYLI 641

Query: 541 NRTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQ 600
           NR P+  L  KTP+E++  ++PN+  L+VFGCL Y ST+  HR KFS RAI S+F+GYP 
Sbjct: 642 NRVPAPILSNKTPFEVMHTKIPNFSHLRVFGCLCYGSTLLSHRTKFSPRAIRSIFLGYPP 701

Query: 601 GMKGFKLLDLENNNIFVSRDVVFHEEVFPFQSK 614
           G KG+KLL+L+ N I++SRDV FHE VFPF++K
Sbjct: 702 GYKGYKLLNLDTNEIYISRDVTFHETVFPFRNK 708

BLAST of Sed0019163 vs. ExPASy TrEMBL
Match: A0A2Z7AFV2 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_10775 PE=4 SV=1)

HSP 1 Score: 514.6 bits (1324), Expect = 6.0e-142
Identity = 280/644 (43.48%), Postives = 382/644 (59.32%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKG-DTNIKSNITLAATQSKTTYKG 60
           MGLN+ ++  R+QILLMDPLP ++K FSL+VQEE  +  +  ++  I             
Sbjct: 141 MGLNESYAQIRAQILLMDPLPTISKIFSLVVQEERQRSINQGVEGRIL------------ 200

Query: 61  KDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRK 120
              +P+    G     +   Y   G   ++  C HC L  HT+D CYK+HGYPP + +  
Sbjct: 201 --EQPLIMSHGANVAAVKGSYNSKGTKTDKVTCSHCHLPNHTVDKCYKLHGYPPGHPK-- 260

Query: 121 NNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILTLLQSKLAGIKNDNGANL 180
             YK    ++ S   ++ S      +  N    P + C  ++  L S+L   +  NG  +
Sbjct: 261 --YKVKQSDKKSHMTQSHSIADGVASTVNDFLKP-EHCRQLIAFLSSQL---QIGNGTTM 320

Query: 181 TQHMAGMTQ------TYDYFKDR-------WILDSGAAAHICHNKDIFMNLKRIDTSVIL 240
           T      +       TY             WI+D+GA  HIC +   F++ +  +++V L
Sbjct: 321 TLQQTPESSASCFNGTYSLATSHTILPPSSWIVDTGATHHICCSPHHFVSFEPFNSNVTL 380

Query: 241 PNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACIIQD 300
           PN   I +TH GS++L   I L  VL+VP FK+NLLSIS+LT     LV+F++ +C IQ 
Sbjct: 381 PNNLNIPVTHIGSVILSSEITLHNVLFVPQFKFNLLSISSLTKQIPCLVSFSSESCQIQV 440

Query: 301 KRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVALKNV 360
               K IG G     LY+L     S+    VC+   +   LWH RLGH     L  L + 
Sbjct: 441 LNQAKTIGTGRRVGDLYILTG---SSPKIEVCTAAQSKTQLWHFRLGHIPLPKLSILGDT 500

Query: 361 L--SFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGYRY 420
           L  SF  N      C IC L+KQ RL FISNN+  D  FDL+H+DIWGPF   +  G++Y
Sbjct: 501 LQNSF-INNDELSTCEICHLSKQKRLPFISNNSIVDCCFDLVHIDIWGPFNPMNVDGFKY 560

Query: 421 FLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEFFK 480
           FLTIVDD SRYTW+ LLK+KS+V+ + P F +++H Q+ K IK  RSDNAPELKF+EFFK
Sbjct: 561 FLTIVDDHSRYTWVQLLKSKSEVIDIFPTFCRMIHKQFGKSIKSVRSDNAPELKFSEFFK 620

Query: 481 KNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINRTP 540
             G+   +SCV RP+QNSVVERKHQHILNV RA+ FQS IPL YW ECILTAV+LINRTP
Sbjct: 621 AEGIVAFHSCVERPQQNSVVERKHQHILNVARALLFQSGIPLVYWSECILTAVYLINRTP 680

Query: 541 SKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGMKG 600
           +  L  KTP+EL+  + P Y  L+VFGCL Y ST+   R KFS RA  S+F+GYP G KG
Sbjct: 681 APLLSNKTPFELMHNKPPTYSHLRVFGCLCYGSTLLNQRTKFSPRATRSIFLGYPPGYKG 740

Query: 601 FKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMPDFIMNQVL 629
           +KLL+L+ N +++SRDV+FHE VFPF++K+T  + P+  ++ ++
Sbjct: 741 YKLLNLDTNEVYISRDVIFHETVFPFKNKST--SSPEHCLDNII 756

BLAST of Sed0019163 vs. ExPASy TrEMBL
Match: A0A438HDI8 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX=29760 GN=POLX_2781 PE=4 SV=1)

HSP 1 Score: 510.8 bits (1314), Expect = 8.6e-141
Identity = 278/649 (42.84%), Postives = 375/649 (57.78%), Query Frame = 0

Query: 1   MGLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEEEHKGDTNIKSNITLAATQSKTTYKGK 60
           +GLN+ F+  ++QILLM+P PP+NK FSL+VQEE  +  T   S        S+     +
Sbjct: 193 LGLNESFAPIQAQILLMEPTPPLNKVFSLVVQEEWQRSLTTSNSPAFTTPVSSRFQAASR 252

Query: 61  DSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCKHCGLQGHTIDVCYKIHGYPPSNKQRKN 120
            S P                       +RP+C HC + GHT+D CYKIHGY P  + R  
Sbjct: 253 ASSPT---------------NSSRSRKDRPLCTHCNILGHTVDRCYKIHGYTPGFRNRP- 312

Query: 121 NYKQTNDNQGSVQPENKSCKSATVAASNIES--------DPFQQCHDILTLLQSKLAGIK 180
           N++        + P +      T+   +I S        D   Q   +L+L  S  +   
Sbjct: 313 NFRPNGSRPNQMLPNSLHTNQLTLTDGSIASASPPPLTHDQHNQLLALLSLHSSSGSSAS 372

Query: 181 NDNGANLTQHMAGMTQTYDYFKDR-------WILDSGAAAHICHNKDIFMNLKRIDT-SV 240
             +   L Q ++  T                WILDSGA  H+C N  +F ++    + +V
Sbjct: 373 FGDSNPLQQSISNFTGILSLSPSSSTLNPSIWILDSGATHHVCTNSSMFHSIHSFSSNTV 432

Query: 241 ILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNACII 300
            LP   +I IT  G+I L   ++L+ VLY+P+F++NL+SISALT  +    +FT + C I
Sbjct: 433 TLPTGTKIPITGIGTIHLSPHLVLEHVLYIPTFQFNLISISALTQTNCFSFDFTAHFCFI 492

Query: 301 QDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASP---SLWHRRLGHPADLPLV 360
           QD    K+IG G  +  LY+L+     +  ++     + S     LWH RL HP+++ L 
Sbjct: 493 QDHSQGKLIGMGRRQGNLYLLDSSVFRSISSVFVVDNNTSAHVNKLWHFRLSHPSNVKLS 552

Query: 361 ALKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLG 420
            LK  L   +N     +C+ICPLAKQ RL F  +NN S + FDLIH DIWGPF   +H G
Sbjct: 553 VLKPHLQLQSNGNTNLSCSICPLAKQKRLPFDCHNNLSSSPFDLIHCDIWGPFHIPTHDG 612

Query: 421 YRYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTE 480
           +RYFLTIVDD +R TW+ LL+ KSDV T+ P FF +V T++   IK  RSDNAPEL  + 
Sbjct: 613 FRYFLTIVDDCTRNTWVHLLRAKSDVKTIFPQFFSMVKTKFGLTIKAVRSDNAPELNLSN 672

Query: 481 FFKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLIN 540
            F +  V H +SCV  P+QNSVVERKHQHILNV RA+YFQS IP+ YWG+C+LT+V+LIN
Sbjct: 673 LFTQLDVLHFFSCVETPQQNSVVERKHQHILNVARALYFQSNIPIGYWGDCVLTSVYLIN 732

Query: 541 RTPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQG 600
           R PS  L+ KTP+ELL  + P+Y  LK FGCL Y+ST+   R+KFS RA+P VF+GYP G
Sbjct: 733 RIPSPLLNNKTPFELLHHKSPSYSHLKSFGCLCYSSTLPSTRHKFSPRALPCVFLGYPFG 792

Query: 601 MKGFKLLDLENNNIFVSRDVVFHEEVFPFQ-SKNTIENMPDFIMNQVLP 630
            KG+K+LDLE N I VSR+V F E VFPF+ S+N      DF   +VLP
Sbjct: 793 YKGYKILDLETNRISVSRNVTFQESVFPFKLSQNNNSVASDFFSKKVLP 825

BLAST of Sed0019163 vs. ExPASy TrEMBL
Match: A0A2Z7D0U1 (Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472368 GN=F511_19388 PE=4 SV=1)

HSP 1 Score: 507.7 bits (1306), Expect = 7.3e-140
Identity = 272/648 (41.98%), Postives = 384/648 (59.26%), Query Frame = 0

Query: 2   GLNDEFSTTRSQILLMDPLPPVNKAFSLIVQEE---------------EHKGDTNIKSNI 61
           GLN+ ++  R+Q+L+M+P P +   F+L+VQEE                H    N+ SNI
Sbjct: 142 GLNESYAQIRAQVLMMEPFPII--VFALVVQEERQRSIHHGTAKISIDHHVSLNNVNSNI 201

Query: 62  TLAATQSKTTYKGKDSKPVCKHCGLIGHTIDVCYRIHGYPDNRPVCK----HCGLQGHTI 121
             + T  +    GK  K VC HC    HT+D CY++HGYP   P  K        Q H I
Sbjct: 202 VNSTTTPRVQRSGKGDKVVCSHCHFRNHTVDKCYKLHGYPPGHPKLKQQLPQSNAQVHQI 261

Query: 122 DVCYKIHGYPPSNKQRKNNYKQTNDNQGSVQPENKSCKSATVAASNIESDPFQQCHDILT 181
               + +   P +   +N  KQ          E  S K     +S +E     Q H+  T
Sbjct: 262 SSIMQDNSSAPGDSLTQNQCKQL--------IEFLSSKLHFGHSSQVE----PQQHESST 321

Query: 182 LLQSKLAGIKNDNGANLTQHMAGMTQTYDYFKDRWILDSGAAAHICHNKDIFMNLKRIDT 241
              S   GI      +   H + +T T       W+LD+GA  HIC +  +F + K +++
Sbjct: 322 ---SCFTGI-----CSTVSHNSSITHT------DWVLDTGATHHICCSLSMFHSSKLVNS 381

Query: 242 SVILPNKDRIIITHAGSILLCGSIILDRVLYVPSFKYNLLSISALTLNDAVLVNFTTNAC 301
            ++LPN   I +T   S+ L   +IL  VLYVP F++NLLSIS+LT N A  V+F +++C
Sbjct: 382 KIMLPNTLTIQVTTTSSVFLTNDLILHDVLYVPEFQFNLLSISSLTKNLACSVSFMSDSC 441

Query: 302 IIQDKRTLKMIGKGNLEQGLYVLEEVPLSAALNIVCSVRSASPSLWHRRLGHPADLPLVA 361
            IQD +  K IG G     LYVL +  +++  + VC+V    P L H R+GHP+   L +
Sbjct: 442 HIQDFKRTKTIGMGKRLGNLYVLIKSSITSP-SYVCNVSVPKPELLHCRMGHPSPNKLSS 501

Query: 362 LKNVLSFDANCKGAENCTICPLAKQNRLRFISNNNKSDAIFDLIHVDIWGPFAHTSHLGY 421
           L N+L FD+       C +C ++KQ RL F S+N  +   F+L+H+D+WGPF+  S  GY
Sbjct: 502 LHNILHFDSTDVDINLCHVCHMSKQKRLPFESHNKTAAHSFELLHIDVWGPFSMYSIDGY 561

Query: 422 RYFLTIVDDFSRYTWIFLLKNKSDVLTVIPHFFKLVHTQYSKVIKCFRSDNAPELKFTEF 481
           R+FLTIVDD + +TW+++L++KS+V +++P F ++V TQ+   IK FRSDNAPEL F   
Sbjct: 562 RFFLTIVDDHTHFTWVYMLRSKSEVSSILPLFCRMVDTQFGAKIKSFRSDNAPELGFINL 621

Query: 482 FKKNGVEHQYSCVARPEQNSVVERKHQHILNVGRAIYFQSKIPLDYWGECILTAVHLINR 541
           F + G+ H YSCV RP+QNS+VERKHQHILNV RA+ FQS +P+DYW +CI+T+V+LINR
Sbjct: 622 FSELGIVHTYSCVERPQQNSIVERKHQHILNVSRALMFQSSVPIDYWSDCIVTSVYLINR 681

Query: 542 TPSKNLDWKTPYELLKKEMPNYQTLKVFGCLAYASTIREHRNKFSSRAIPSVFVGYPQGM 601
           TPS +L  KTP+ELL  + P Y  LK+FGCL YAST+   R+K S RAI  VF GYP G 
Sbjct: 682 TPSSSLHHKTPFELLHGKPPAYSHLKIFGCLCYASTLMSSRHKVSPRAIKCVFRGYPPGY 741

Query: 602 KGFKLLDLENNNIFVSRDVVFHEEVFPFQSKNTIENMP-DFIMNQVLP 630
           +G+KLL+L+ N I +SRDV+FHE  FPFQ+ +  ++ P D   + +LP
Sbjct: 742 RGYKLLNLDTNEILISRDVIFHEHEFPFQNTSNSDSQPSDIFSDNLLP 760

BLAST of Sed0019163 vs. TAIR 10
Match: ATMG00710.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 48.9 bits (115), Expect = 1.8e-05
Identity = 21/65 (32.31%), Postives = 38/65 (58.46%), Query Frame = 0

Query: 491 ILNVGRAIYFQSKIPLDYWGECILTAVHLINRTPSKNLDWKTPYELLKKEMPNYQTLKVF 550
           I+   R++  +  +P  +  +   TAVH+IN+ PS  +++  P E+  + +P Y  L+ F
Sbjct: 5   IIEKVRSMLCECGLPKTFRADAANTAVHIINKYPSTAINFHVPDEVWFQSVPTYSYLRRF 64

Query: 551 GCLAY 556
           GC+AY
Sbjct: 65  GCVAY 69

BLAST of Sed0019163 vs. TAIR 10
Match: AT4G05360.1 (Zinc knuckle (CCHC-type) family protein )

HSP 1 Score: 43.5 bits (101), Expect = 7.5e-04
Identity = 18/55 (32.73%), Postives = 28/55 (50.91%), Query Frame = 0

Query: 63  KPVCKHCGLIGHTIDVCYRI----------HGYPDNRPVCKHCGLQGHTIDVCYK 108
           +PVC HCG++GH    C+R+          +    + P C H G+QGH    C++
Sbjct: 625 RPVCHHCGVVGHIRPRCFRLLREKNRLMNAYDVRFHGPKCYHYGVQGHIKRNCFR 679

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
KZV25004.13.7e-14643.68Cysteine-rich RLK (receptor-like protein kinase) 8 [Dorcoceras hygrometricum][more]
KZV39348.11.5e-14243.92hypothetical protein F511_17540 [Dorcoceras hygrometricum][more]
XP_012857659.17.2e-14241.22PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata][more]
KZV17946.11.2e-14143.48hypothetical protein F511_10775 [Dorcoceras hygrometricum][more]
RVW82526.11.8e-14042.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera][more]
Match NameE-valueIdentityDescription
P109787.0e-5530.43Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... [more]
Q94HW29.1e-5529.38Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O... [more]
Q9ZT941.3e-5328.54Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O... [more]
P041468.6e-3727.80Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3[more]
P253844.6e-1422.85Transposon Ty2-C Gag-Pol polyprotein OS=Saccharomyces cerevisiae (strain ATCC 20... [more]
Match NameE-valueIdentityDescription
A0A2Z7AT151.8e-14643.68Cysteine-rich RLK (Receptor-like protein kinase) 8 OS=Dorcoceras hygrometricum O... [more]
A0A2Z7C0E87.1e-14343.92Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
A0A2Z7AFV26.0e-14243.48Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
A0A438HDI88.6e-14142.84Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Vitis vinifera OX... [more]
A0A2Z7D0U17.3e-14041.98Integrase catalytic domain-containing protein OS=Dorcoceras hygrometricum OX=472... [more]
Match NameE-valueIdentityDescription
ATMG00710.11.8e-0532.31Polynucleotidyl transferase, ribonuclease H-like superfamily protein [more]
AT4G05360.17.5e-0432.73Zinc knuckle (CCHC-type) family protein [more]
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 372..564
e-value: 1.2E-38
score: 134.4
IPR025724GAG-pre-integrase domainPFAMPF13976gag_pre-integrscoord: 301..367
e-value: 2.1E-8
score: 33.9
IPR001584Integrase, catalytic corePFAMPF00665rvecoord: 384..478
e-value: 4.1E-9
score: 36.7
IPR001584Integrase, catalytic corePROSITEPS50994INTEGRASEcoord: 378..541
score: 16.592196
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 116..137
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 1..89
NoneNo IPR availablePANTHERPTHR34222:SF6OS02G0671800 PROTEINcoord: 1..89
NoneNo IPR availablePANTHERPTHR34222:SF6OS02G0671800 PROTEINcoord: 88..167
NoneNo IPR availablePANTHERPTHR34222FAMILY NOT NAMEDcoord: 88..167
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 378..539

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sed0019163.1Sed0019163.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0015074 DNA integration
molecular_function GO:0003676 nucleic acid binding