Sed0023410 (gene) Chayote v1

Overview
Name: Sed0023410
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Location: LG02: 35352543 .. 35357209 (-)
RNA-Seq Expression: Sed0023410
Synteny: Sed0023410
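The length of the annotated locus follows directly from the Location field. The sketch below is a minimal illustration in Python, assuming the coordinates are 1-based and inclusive (a common convention for genome feature pages, but not stated here); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(text):
    """Split a 'SEQ: START .. END (STRAND)' string into its four parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

seqid, start, end, strand = parse_location("LG02: 35352543 .. 35357209 (-)")
locus_length = span_length(start, end)
```

Under that assumption the locus covers 4,667 positions on the minus strand of LG02.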
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAGTTTTTCATCTCTTTATATCTGCTTTTCGTGCTCACTACTCCATTGATATTCTGTAGAATGAGCTTCTCAGTGTTATTCTTCAATGAGTAAATACAAAGAGGTTTTCTCCTGGAAATTCTTGTGTTCTTGGTTTATTGCAGCTGGTTTCATGGTATCCGAGCTCTAAAGTTTAACGACGAATTTTTTTTTTTTCAAATGGAAAGCTTCAAAATGGCCTCCGATCTACAAAGCGTCTTGCCCGCTCCAACGACGATTGTTAATCCGGGAAACAAAGTTTCCACTGTTCTTCTGAGTAGTGAGAATTTCTTGTTGTGGAAATTTTAGGTGGAGTTTGCTCTAGAGGGATATGGTCTATTCTCGGCACACATCGATCCTGACTCGATTTCTCCACCAGAGAAAATTCAAGTTCGAGACGACTTCGATAAGCCTAATCCCGAGTATACAGCATGGAAGAAGCAAGATAGACTGATTTGTTCATGGTTGCTTGGGTCAATGTCTGAAGATATACTCCAACAAATGCTCCACTGTACTTCGGCAAAGGAAATATAGGCGTGCCTTCTACAAATCTTCAATTCCAGAAGCCTTGCTCAGGTCATGAAGCTAAAGTCAACTCTCTAAAATATGAAGAAAGGTAATTCCTCACTAAGTGATTATTTCGCCAAGATTAAACGAATAGTTGACTCTTTAGCTGCAGTTGATAAATCAATTTCATATGAGGACCATATACTGTACATCTTAGCTGGTTTGGGATCTGAGTATGAATCCATGATTTCGGTCATCACGGCTAAGGTTAATACAGACTCTGTTCAAGATATCATGGCTCTCCTACTCACACACGAGACACGACTTGAGTCGAAAACTGTTAATGCTGATGGCAGTGTTCCTTCGGCTAATGTGGTGCAACAACAGCCTACATACAGACCCTCCTCTGAATCTAAACCCCAGCGCTATCAAAATTTTAACTCGGGAAATGGCAGGGGACGGGGAAAAAACAATGGACGAGGAGGTAGTCGCTCGGGCGGCAGAAACAAGTTTTTCTGCACCATTTGCAACAAACATGGTCATACATCGAGCAGATGTTACTACCGGAATGATGCTCCCTCACATCATGCTCGCCCGATGTTTGCTAATCAGAGTATTGGACCAATGTTCCCATCTAATTTTCAGCAACCACCAATGTCATATCAAATGGCACCTAGTTATGGATATCAGTACTCACCACAGCCACATGGATACTATGGAATGCAGGCTTCTTCCACTTTCAATTCTGACAATAATTGGTATCCGAATTCGGGAGCAACAAATCATTTAACAAACAACTTTGGCAATCTATCAATGGGCTCGGAGTTTGGTGGTTCTAGTCAAGTTCATGTTGGTAATGGTGCAAGTCTGCCTATTTCTCACACTGGTTATGGTTCTTTGCACTCTTCTACTAGTGATAGAACATTTCATTTACATAACCTTCTACATGTCCCACAAATTACTAAAAACTTACTCAGTGTTAGTCAATTTGCTCGAGATAATAATGTGTTTTTGAATTTCATCCTACTTTTTGTCTTGTGAAGGATCTAGCAACTGGTCGGGCTCTGCTCCAAGGGACTCTACATGAAGGACTATATCGATTTCCTATGACTTTCTCCTTATTCAAAGTCTTTAGCTGTTAATTCTGTTCAAACTCATTTCAGTACTGTGCATCCTGCTTGTTTTAGTTCTGTTGTTCCTCATACGAAACTTTATTTGTGGCAACAACGGCTTGGTCATCCTGCTTTTCTTATTGTTCAAAACATTGTTAAAAGTAGTATGCATGCTGCTTTGTCTAAAAATAATTCAAGTTCTTTTTGCAATGCATGTGCTCTTGGTAAAATACATGCCGCCCCTTATTCTAAATCATTAACTGTGTATACCCGTCCCTTACAACTTGTTGTTATTGATTTATGGGGCCCGGCTTATACTGTTTCTAGGAATGGTTTCAAGTATTACATGAGTTTTATTGATG
TTTTCTCTAGATTCACCTGGATTTATTTTCTGGAATCTAAATCTGATGCCTCTTCTATGCTTCATACTTTTAAAACACATGTTGAAAAACTTTGGGTGCACCCATTGTTCGTGTCCAAACTGATGGTGGTTCTGAGTTTAAGCCTCTTATCCCTGTTTTCGAATCCAATGGTATCACTCATCGGTTAGCGTGTCCTTACACCTCAAAACAAAACGGTATAGTTGAACGCAAACACAGACATATAGTTGAAACAGGCTTAACCCTTATGTCTCATGCTTCTATGCCTCTTGCTTTTTGGGATGATGCTTTTTCTACTGCTATTCATTTGATTAACAGGCTATCTACTCCGGTTCTTCATGGTGTTAGTCCCTTGCAGAAGCTTTTTGATCGGGTGCCAGATTACTCACAACTTTGTGTTTTTGGATTTAAGTGTTTTCCCTATCTTCGTCCATATAAGCTTGACTTTCGTTCTAAACCTTGCACTTTTCTTGGGTATAGTTCCATGCATAAAGGGTATAAATGTCTTGATGAGAATGGAAAACTCTATGTATCTAGGAACATTCTATTTGATGAACTTACATTTTCTTTTGCTACTAAAGGCCATTCACCCAAAACCACTTCCAAATCAGTTGTTACTTCTTCTATTCCTATTCTCTCTCAACCACCTCAGTCCTCTTCTCAGTCTCTCTCCACTTTACAACCTCCCAATGTTTCAGTAAATGATTCTTCCACTTTACAAAATTCAAGCCCATTGTTGAGCTCTCCACCTAGCCAGCCTGTTGCCTCGGACACTCATTCTCTTCAAAGTAATGGTGCAAATGACATAGCACCCATTACATCTGTCATACCTTGTCCTATTGTTAATGCCCATCCTATGCAGACAAGGGGGAAGAGTGGCATTTTTCGACCAAAGGCTCTTATTTCTAAAACCTATGATGAAATTGAACCACCAAATGCCAAGGTAGCCCTCAAATGTGCCCATTGGAGAAAAGCAATGCAAGATGAGTATGATGATTTGATGCAAAATAAAACATGGACCTTAGTCTCCAAACCTATAAATCAGAAAGTGGTTGGCTGCAAGTGGGTTTTCAAAGTCAAGAAACATTCGGATGGTACTATTGCTCGATACAAGGCACGTCTTGTTGCTAAGGGGTTTCATCAAACTCTTGATGTAAATTTTTTTTTTAGACTTTTAGCCCGGTTGTTAAACCAATTACTATTCGTGTTCTTCTTACTCTTGCTCTTACATACAACTAGACCATTAGACAAATAGATGTTAATAATTCTTTTTTACATGGTATTCTAACCGAAACTGTTTACATGGAACAACCGATCGGTTTTATCTCTCCAAATGGAAGAAATGAAGTATGCAAGTTACATAAGGCTTTGTATGGCTTGAAACAAGCCCCAAGAGCCTGGTTTGAGAGATTAACTGCTTGCTTGAAGAGCTTGGGTTTTGAACATTCTCATGCAGATACGTCTTTGCTGTTTAGACATACATTGAATAGTTGTTGTTATGTTCTCGTTTATGTTGATGATATCTTAGTGATGGGGAACTCTGCTTCTGTGGTCTCTGACTTGATCTCTAGACTCAATGCTTCGTTTTCGCTTAAGGATCTAGGACCATTAAACTATTTCATTGGCATTGAAGTATCCTACCCACCTACAGGAGGAATTTTCTTATCGCAGTCGAAATATGTTCTAGATTTACTTCGCAAAACCAATATGAGTGATGCTAATGCTATGAATACTCCCATGGTAAGTGGGAGCCTTCCGTCTGCTATTGGGGGAGAAATGTTTTCTGATGTCACATTATACAAGAGTGTTGTAGGAGCATTGCAGTATGTTCTCCTTACTCGGCCTGAACTCTCTTTTAGTGTAAACAAGGCTTGTCAATTCATGCACTCTCCAAAGGCTATTCACTAGAAATTGGTCAAACGCATTTTACGATACTTACAAGGCACTCGCTCTTCTGGTCTATTACTGACAAAACCTACATC
TTTAACCTTACAGGGGTATGCAGATTCTAATTGGGCGTCTGATCCTGATGATAGGAAATCTACCTCGGGTCATTGTATTTATTTTGGTGGAAATTTAATTTCATGGGGATCTAAGAAGCAAACTATTATTTCTCGTTCTAGTACTGAGGCAGAGTATAGATGTCTAGCTACTGCTGCTACTGAACTTATCTGGTTGAATTCTCTGTTTGCTGACTTGAGAATATCTTATGCTGGTCCTCCTATTCTGTGGTGTGATAATTTAGGTGATGTCCACTTAAGTATGAATCCTGTTTTACATTCTAAAACTAAGCATGTGGAGTTAGATATCTATTATGTGCGTGACTTAGTTCATAACAGGAAACTTGTTGTTCGCCATCTTCCCATGACTATGCAGATTGCTGATATATTTATGAAGCCATTGTCTGCTCATACTTTCCTTCCTCTTCGATTCAAGCTCAATGTTCGTGATCCTCCAACCATAGGCTTGCGGGGGGTATTAGGAAAGACTCTCATTGACACATCAGCCCAAGCCCATGTAGTTTAAGTGGGCCGTTTATTGTGTTATTTCTTTGTAATGACTATTTAGCCCATGTGAGACGATACTTCTCTGCTTGTTGATGTATGTTTAATTCTAGGAGTTTTTCATCTCTTTGTATCTGCTTTTCGT

mRNA sequence

GGAGTTTTTCATCTCTTTATATCTGCTTTTCGTGCTCACTACTCCATTGATATTCTGTAGAATGAGCTTCTCAGTGTTATTCTTCAATGAGTAAATACAAAGAGGTTTTCTCCTGGAAATTCTTGTGTTCTTGGTTTATTGCAGCTGGTTTCATGGTATCCGAGCTCTAAAGTTTAACGACGAATTTTTTTTTTTTCAAATGGAAAGCTTCAAAATGGCCTCCGATCTACAAAGCGTCTTGCCCGCTCCAACGACGATTGTTAATCCGGGAAACAAAGTTTCCACTGTTCTTCTGAGTAGTGAGAATTTCTTGTTGTGGAAATTTTAGGTGGAGTTTGCTCTAGAGGGATATGGTCTATTCTCGGCACACATCGATCCTGACTCGATTTCTCCACCAGAGAAAATTCAAGTTCGAGACGACTTCGATAAGCCTAATCCCGAGTATACAGCATGGAAGAAGCAAGATAGACTGATTTGTTCATGGTTGCTTGGGTCAATGTCTGAAGATATACTCCAACAAATGCTCCACTGTACTTCGGCAAAGGAAATATAGGCGTGCCTTCTACAAATCTTCAATTCCAGAAGCCTTGCTCAGGTCATGAAGCTAAAGTCAACTCTCTAAAATATGAAGAAAGGTAATTCCTCACTAAGTGATTATTTCGCCAAGATTAAACGAATAGTTGACTCTTTAGCTGCAGTTGATAAATCAATTTCATATGAGGACCATATACTGTACATCTTAGCTGGTTTGGGATCTGAGTATGAATCCATGATTTCGGTCATCACGGCTAAGGTTAATACAGACTCTGTTCAAGATATCATGGCTCTCCTACTCACACACGAGACACGACTTGAGTCGAAAACTGTTAATGCTGATGGCAGTGTTCCTTCGGCTAATGTGGTGCAACAACAGCCTACATACAGACCCTCCTCTGAATCTAAACCCCAGCGCTATCAAAATTTTAACTCGGGAAATGGCAGGGGACGGGGAAAAAACAATGGACGAGGAGGTAGTCGCTCGGGCGGCAGAAACAAGTTTTTCTGCACCATTTGCAACAAACATGGTCATACATCGAGCAGATGTTACTACCGGAATGATGCTCCCTCACATCATGCTCGCCCGATGTTTGCTAATCAGAGTATTGGACCAATGTTCCCATCTAATTTTCAGCAACCACCAATGTCATATCAAATGGCACCTAGTTATGGATATCAGTACTCACCACAGCCACATGGATACTATGGAATGCAGGCTTCTTCCACTTTCAATTCTGACAATAATTGGTATCCGAATTCGGGAGCAACAAATCATTTAACAAACAACTTTGGCAATCTATCAATGGGCTCGGAGTTTGGTGGTTCTAGTCAAGTTCATGTTGGTAATGGTGCAAGTCTGCCTATTTCTCACACTGGATCTAGCAACTGGTCGGGCTCTGCTCCAAGGGACTCTACATGAAGGACTATATCGATTTCCTATGACTTTCTCCTTATTCAAAGTCTTTAGCTGTTAATTCTGTTCAAACTCATTTCAGTACTGTGCATCCTGCTTGTTTTAGTTCTGTTGTTCCTCATACGAAACTTTATTTGTGGCAACAACGGCTTGGTCATCCTGCTTTTCTTATTGTTCAAAACATTGTTAAAAGTAGTATGCATGCTGCTTTGTCTAAAAATAATTCAAGTTCTTTTTGCAATGCATGTGCTCTTGGTAAAATACATGCCGCCCCTTATTCTAAATCATTAACTGTGTATACCCGTCCCTTACAACTTGTTGTTATTGATTTATGGGGCCCGGCTTATACTGTTTCTAGGAATGGTTTCAAGTATTACATGAGTTTTATTGATGTTTTCTCTAGATTCACCTGGATTTATTTTCTGGAATCTAAATCTGATGCCTCTTCTATGCTTCATACTTTTAAAACACATGTTGAAAAACTTTGGGTGCACCCATTGTTCGTGTCCAAACTGATGGTGGTTCTGAGTTTAAGCCTCTTATCCCTGTT
TTCGAATCCAATGGTATCACTCATCGGTTAGCGTGTCCTTACACCTCAAAACAAAACGGTATAGTTGAACGCAAACACAGACATATAGTTGAAACAGGCTTAACCCTTATGTCTCATGCTTCTATGCCTCTTGCTTTTTGGGATGATGCTTTTTCTACTGCTATTCATTTGATTAACAGGCTATCTACTCCGGTTCTTCATGGTGTTAGTCCCTTGCAGAAGCTTTTTGATCGGGTGCCAGATTACTCACAACTTTGTGTTTTTGGATTTAAGTGTTTTCCCTATCTTCGTCCATATAAGCTTGACTTTCGTTCTAAACCTTGCACTTTTCTTGGGTATAGTTCCATGCATAAAGGGTATAAATGTCTTGATGAGAATGGAAAACTCTATGTATCTAGGAACATTCTATTTGATGAACTTACATTTTCTTTTGCTACTAAAGGCCATTCACCCAAAACCACTTCCAAATCAGTTGTTACTTCTTCTATTCCTATTCTCTCTCAACCACCTCAGTCCTCTTCTCAGTCTCTCTCCACTTTACAACCTCCCAATGTTTCAGTAAATGATTCTTCCACTTTACAAAATTCAAGCCCATTGTTGAGCTCTCCACCTAGCCAGCCTGTTGCCTCGGACACTCATTCTCTTCAAAGTAATGGTGCAAATGACATAGCACCCATTACATCTGTCATACCTTGTCCTATTGTTAATGCCCATCCTATGCAGACAAGGGGGAAGAGTGGCATTTTTCGACCAAAGGCTCTTATTTCTAAAACCTATGATGAAATTGAACCACCAAATGCCAAGGTAGCCCTCAAATGTGCCCATTGGAGAAAAGCAATGCAAGATGAGTATGATGATTTGATGCAAAATAAAACATGGACCTTAGTCTCCAAACCTATAAATCAGAAAGTGGTTGGCTGCAAGTGGGTTTTCAAAGTCAAGAAACATTCGGATGGTACTATTGCTCGATACAAGGCACGTCTTGTTGCTAAGGGGTTTCATCAAACTCTTGATGTAAATTTTTTTTTTAGACTTTTAGCCCGGTTGTTAAACCAATTACTATTCGTGTTCTTCTTACTCTTGCTCTTACATACAACTAGACCATTAGACAAATAGATGTTAATAATTCTTTTTTACATGGTATTCTAACCGAAACTGTTTACATGGAACAACCGATCGGTTTTATCTCTCCAAATGGAAGAAATGAAGTATGCAAGTTACATAAGGCTTTGTATGGCTTGAAACAAGCCCCAAGAGCCTGGTTTGAGAGATTAACTGCTTGCTTGAAGAGCTTGGGTTTTGAACATTCTCATGCAGATACGTCTTTGCTGTTTAGACATACATTGAATAGTTGTTGTTATGTTCTCGTTTATGTTGATGATATCTTAGTGATGGGGAACTCTGCTTCTGTGGTCTCTGACTTGATCTCTAGACTCAATGCTTCGTTTTCGCTTAAGGATCTAGGACCATTAAACTATTTCATTGGCATTGAAGTATCCTACCCACCTACAGGAGGAATTTTCTTATCGCAGTCGAAATATGTTCTAGATTTACTTCGCAAAACCAATATGAGTGATGCTAATGCTATGAATACTCCCATGGTAAGTGGGAGCCTTCCGTCTGCTATTGGGGGAGAAATGTTTTCTGATGTCACATTATACAAGAGTGTTGTAGGAGCATTGCAGTATGTTCTCCTTACTCGGCCTGAACTCTCTTTTAGTGTAAACAAGGCTTGTCAATTCATGCACTCTCCAAAGGCTATTCACTAGAAATTGGTCAAACGCATTTTACGATACTTACAAGGCACTCGCTCTTCTGGTCTATTACTGACAAAACCTACATCTTTAACCTTACAGGGGTATGCAGATTCTAATTGGGCGTCTGATCCTGATGATAGGAAATCTACCTCGGGTCATTGTATTTATTTTGGTGGAAATTTAATTTCATGGGGATCTAAGAAGCAAACTATTATTTCTCGTTCTAGTACTGAGGCAGAGTAT
AGATGTCTAGCTACTGCTGCTACTGAACTTATCTGGTTGAATTCTCTGTTTGCTGACTTGAGAATATCTTATGCTGGTCCTCCTATTCTGTGGTGTGATAATTTAGGTGATGTCCACTTAAGTATGAATCCTGTTTTACATTCTAAAACTAAGCATGTGGAGTTAGATATCTATTATGTGCGTGACTTAGTTCATAACAGGAAACTTGTTGTTCGCCATCTTCCCATGACTATGCAGATTGCTGATATATTTATGAAGCCATTGTCTGCTCATACTTTCCTTCCTCTTCGATTCAAGCTCAATGTTCGTGATCCTCCAACCATAGGCTTGCGGGGGGTATTAGGAAAGACTCTCATTGACACATCAGCCCAAGCCCATGTAGTTTAAGTGGGCCGTTTATTGTGTTATTTCTTTGTAATGACTATTTAGCCCATGTGAGACGATACTTCTCTGCTTGTTGATGTATGTTTAATTCTAGGAGTTTTTCATCTCTTTGTATCTGCTTTTCGT

Coding sequence (CDS)

ATGTCTCATGCTTCTATGCCTCTTGCTTTTTGGGATGATGCTTTTTCTACTGCTATTCATTTGATTAACAGGCTATCTACTCCGGTTCTTCATGGTGTTAGTCCCTTGCAGAAGCTTTTTGATCGGGTGCCAGATTACTCACAACTTTGTGTTTTTGGATTTAAGTGTTTTCCCTATCTTCGTCCATATAAGCTTGACTTTCGTTCTAAACCTTGCACTTTTCTTGGGTATAGTTCCATGCATAAAGGGTATAAATGTCTTGATGAGAATGGAAAACTCTATGTATCTAGGAACATTCTATTTGATGAACTTACATTTTCTTTTGCTACTAAAGGCCATTCACCCAAAACCACTTCCAAATCAGTTGTTACTTCTTCTATTCCTATTCTCTCTCAACCACCTCAGTCCTCTTCTCAGTCTCTCTCCACTTTACAACCTCCCAATGTTTCAGTAAATGATTCTTCCACTTTACAAAATTCAAGCCCATTGTTGAGCTCTCCACCTAGCCAGCCTGTTGCCTCGGACACTCATTCTCTTCAAAGTAATGGTGCAAATGACATAGCACCCATTACATCTGTCATACCTTGTCCTATTGTTAATGCCCATCCTATGCAGACAAGGGGGAAGAGTGGCATTTTTCGACCAAAGGCTCTTATTTCTAAAACCTATGATGAAATTGAACCACCAAATGCCAAGGTAGCCCTCAAATGTGCCCATTGGAGAAAAGCAATGCAAGATGAGTATGATGATTTGATGCAAAATAAAACATGGACCTTAGTCTCCAAACCTATAAATCAGAAAGTGGTTGGCTGCAAGTGGGTTTTCAAAGTCAAGAAACATTCGGATGGTACTATTGCTCGATACAAGGCACGTCTTGTTGCTAAGGGGTTTCATCAAACTCTTGATGTAAATTTTTTTTTTAGACTTTTAGCCCGGTTGTTAAACCAATTACTATTCGTGTTCTTCTTACTCTTGCTCTTACATACAACTAGACCATTAGACAAATAG

Protein sequence

MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYLRPYKLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNFFFRLLARLLNQLLFVFFLLLLLHTTRPLDK
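The protein sequence is the conceptual translation of the CDS. The sketch below uses the standard genetic code, which is an assumption, since the page does not say which translation table was applied. It is checked only against the first and last six codons of the CDS shown above; `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTCTCATGCTTCTATG"  # first six codons of the CDS
cds_end = "AGACCATTAGACAAATAG"    # last six codons, ending in the TAG stop
n_term = translate(cds_start)
c_term = translate(cds_end)
```

The two fragments translate to `MSHASM` and `RPLDK`, matching the first six and last five residues of the listed protein.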
Homology
BLAST of Sed0023410 vs. NCBI nr
Match: TYK18915.1 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa])

HSP 1 Score: 255.8 bits (652), Expect = 5.3e-64
Identity = 150/329 (45.59%), Positives = 206/329 (62.61%), Query Frame = 0

Query: 1   MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
           +S A++PL+FWD+AFST+++LIN L TPVL  +SPL+K+F R P++  L VFG KC+PYL
Sbjct: 8   LSQATLPLSFWDEAFSTSVYLINLLPTPVLDNISPLEKVFFRKPNFPFLRVFGCKCYPYL 67

Query: 61  RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFAT-KGHSPK 120
           RPY   KL  RS PCTFLGYS+ HKGYKCL  +G+L++SR++LFDE +F +A+   HS  
Sbjct: 68  RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSI 127

Query: 121 TTSKSVVTSSIPILSQPPQS-------------SSQSLSTLQPPNVSVNDSSTLQ----- 180
             SK+V+  S P+ S  P S              S +   L P  V   ++ T +     
Sbjct: 128 PKSKNVL--SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNPTIVYPLETGTQESSRDD 187

Query: 181 -NSSPLLSSP-PSQPVASDTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPK 240
            NS  +  SP P +P     H   S G N     TS+        HPM T+ K  IF+PK
Sbjct: 188 GNSGGITQSPSPMEP----PHQTDS-GMNTQLQSTSI--------HPMITQSKHDIFKPK 247

Query: 241 ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFK 300
           A +   Y + E  NAK A    HW+KAM++E+  L +N TW+L+ +  NQK+VGCKWVFK
Sbjct: 248 AFLI-DYTQTETCNAKEAFNHPHWKKAMEEEFKALQKNGTWSLIPQNPNQKIVGCKWVFK 307

Query: 301 VKKHSDGTIARYKARLVAKGFHQTLDVNF 306
           +K++S G+I+RYKARLVAKGFHQT ++++
Sbjct: 308 IKRNSYGSISRYKARLVAKGFHQTHNIDY 320
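The percentages in each HSP header are the reported counts divided by the alignment length. The short sketch below reproduces the two figures for the TYK18915.1 hit above; the function name `percent` is illustrative.

```python
def percent(count, alignment_length):
    """Return count as a percentage of the alignment length, rounded to two decimals."""
    return round(100.0 * count / alignment_length, 2)

identity_pct = percent(150, 329)   # identical residue pairs / aligned columns
positives_pct = percent(206, 329)  # identical or positively scoring pairs / aligned columns
```

The same calculation applies to every HSP header on this page. The denominator is the alignment length including gap columns, not the query length.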

BLAST of Sed0023410 vs. NCBI nr
Match: KAG8473223.1 (hypothetical protein CXB51_035172 [Gossypium anomalum])

HSP 1 Score: 255.0 bits (650), Expect = 9.1e-64
Identity = 142/320 (44.38%), Positives = 193/320 (60.31%), Query Frame = 0

Query: 1   MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
           ++ A +PL FW  AF +A++LINRL T VL G SP + L    P Y  L +FG +C+PYL
Sbjct: 522 LAQAQVPLRFWVHAFISAVYLINRLPTSVLGGKSPYEVLHKAPPPYMHLRIFGCRCYPYL 581

Query: 61  RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKT 120
           RP+   KL  RS+PC FLGYSS+HKGYKC+D+ GKL+VSR+++FDE  F FAT   SP  
Sbjct: 582 RPFNTHKLQCRSRPCVFLGYSSIHKGYKCMDDTGKLFVSRHVVFDEAVFPFATLA-SP-D 641

Query: 121 TSKSVVTSS--------IPILSQP----PQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLS 180
           +S S +TSS        +P++  P       S+ S S    P ++    S +  SS  +S
Sbjct: 642 SSVSAITSSQFQHHESLVPVIRYPSDLHSSGSANSRSATIQPGLTTTMLSPIPASSTQVS 701

Query: 181 SPPSQPVASDTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDE 240
             P++     +  + S+      P+   IP P VN HPMQTR KSGIF+P+ + S     
Sbjct: 702 PSPAEVSPQGSQMVSSS-----VPVAKSIPAPPVNTHPMQTRSKSGIFKPR-VFSAEVGV 761

Query: 241 IEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTI 300
            EP   + AL    W  A Q E+D L++N+TW LV  P N + VGCKWVFK+K+H+DG+I
Sbjct: 762 SEPTTIEEALSSKEWALAAQQEFDALLRNQTWDLVPLPTNWRAVGCKWVFKLKRHADGSI 821

Query: 301 ARYKARLVAKGFHQTLDVNF 306
           ARYK RLV KG+ Q   ++F
Sbjct: 822 ARYKRRLVVKGYLQEAGIDF 833

BLAST of Sed0023410 vs. NCBI nr
Match: KAG8502419.1 (hypothetical protein CXB51_000456 [Gossypium anomalum])

HSP 1 Score: 251.9 bits (642), Expect = 7.7e-63
Identity = 140/314 (44.59%), Positives = 196/314 (62.42%), Query Frame = 0

Query: 1   MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
           ++HASMPL++W+DAF+T I+LINRL +  L  + P +KLF   P Y  L VFG  CFP L
Sbjct: 486 LAHASMPLSYWNDAFATTIYLINRLPSASLGSLLPYEKLFHNKPCYLSLKVFGCLCFPNL 545

Query: 61  RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKT 120
           RPY   KL FRS PCTFLGYSS+HKGY+CL  +GK+YVSR++ F E TF F T   +PK+
Sbjct: 546 RPYNKHKLQFRSLPCTFLGYSSLHKGYRCLAPDGKIYVSRHVTFHETTFPFQTLCFNPKS 605

Query: 121 TSKSVVTSSIPILSQPPQSS--SQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASD 180
           T+   V++ + +LS   +S+  SQS+     PN +V+       ++   +S P+    S 
Sbjct: 606 TTVPSVSTKLLVLSPSVRSTIPSQSVPINFSPNPTVSPEPIPDPTNIRSTSSPTNSPVSS 665

Query: 181 TH--SLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALIS--KTYDEIEPPNA 240
           TH     S   + ++  T  +P   +N+HPM TRGK+ IF+PK  +S         P + 
Sbjct: 666 THISPFSSPPLSHLSSTTHSLP---LNSHPMITRGKANIFKPKVFLSSASALSSETPFDI 725

Query: 241 KVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKAR 300
             A++  HW+  + +E   L+ N TW+L + P N+K +GCKW+FKVKK +DGTI RYKAR
Sbjct: 726 HEAMQHEHWKNTVHNELQALLANGTWSLCTLPSNRKAIGCKWLFKVKKKADGTIERYKAR 785

Query: 301 LVAKGFHQTLDVNF 306
           LVAKGF Q   ++F
Sbjct: 786 LVAKGFLQHAGLDF 796

BLAST of Sed0023410 vs. NCBI nr
Match: KAG8479334.1 (hypothetical protein CXB51_029681 [Gossypium anomalum])

HSP 1 Score: 251.1 bits (640), Expect = 1.3e-62
Identity = 136/321 (42.37%), Positives = 199/321 (61.99%), Query Frame = 0

Query: 1    MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
            ++ AS+P+++W DAF+TA++++NRL T  L GVSP ++LF   PDY QL VFG  C+P L
Sbjct: 711  LAQASLPISYWADAFATAVYIMNRLPTKSLPGVSPCEQLFGHKPDYQQLRVFGCLCYPLL 770

Query: 61   RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKT 120
            RPY   KL +RS PCTFLGY++ H+GYKC+D  G++Y+SR++ FDE T+ FA        
Sbjct: 771  RPYNRHKLQYRSAPCTFLGYATNHRGYKCVDRYGRVYISRHVRFDEDTYPFA-------Q 830

Query: 121  TSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTH 180
             SKSVV       S P   S Q +  +    + +   + +  SSP+ S+P S     DT 
Sbjct: 831  LSKSVV-------SVPDSRSGQFMRDVTSLPIFMTSPANISESSPVDSTPASNSSPVDTS 890

Query: 181  SL---QSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVA 240
             +    SN  + +A I       ++N HPM TR K GI++PK  ++   D +EP     A
Sbjct: 891  LMDPTSSNSEDHLALIVEQQGSSLINRHPMMTRSKMGIYKPKTYMAVVSD-VEPLTIHEA 950

Query: 241  LKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA 300
            +    W++A+ DE   L++N+TW LVS P+NQ +VGCKW+FK+K++SDG++AR K RLVA
Sbjct: 951  MAIPSWKQAVNDELQALIRNRTWDLVSVPVNQSLVGCKWLFKIKRNSDGSVARNKVRLVA 1010

Query: 301  KGFHQT--LDVNFFFRLLARL 314
            +GF Q   LD +  F L+ ++
Sbjct: 1011 QGFSQAAGLDYHETFSLVVKI 1016

BLAST of Sed0023410 vs. NCBI nr
Match: GAU32278.1 (hypothetical protein TSUD_62940 [Trifolium subterraneum])

HSP 1 Score: 249.2 bits (635), Expect = 5.0e-62
Identity = 134/311 (43.09%), Positives = 187/311 (60.13%), Query Frame = 0

Query: 1    MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
            +S A++PL +WD AF TA+HLINRL T  L+   P   LF + PDY+ L VFG  CFP +
Sbjct: 693  LSQANLPLTYWDHAFLTAVHLINRLPTASLNFKVPYTTLFQKDPDYNSLKVFGSACFPLI 752

Query: 61   RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKT 120
            RPY   K DFRS  C FLGYS+ HKGYKCL   G++YVS++++F+E  F + +   +P  
Sbjct: 753  RPYNSHKFDFRSHECIFLGYSTTHKGYKCLSPTGRIYVSKDVMFNESRFPYESLFPTP-N 812

Query: 121  TSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTH 180
            ++ S  T  IP+ + P  + ++++ T        + ++   ++SP  +  P+QP     H
Sbjct: 813  SALSNPTPDIPLTTLPIGTQNENILTNIQNPTDQSSNTNQPSTSPPTTLQPTQPDTELPH 872

Query: 181  SLQSNGAND---IAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVA 240
            S  SN  +D   ++P    +P    N HPMQTR KSG+  PK          EP   K A
Sbjct: 873  STSSNPNSDPLTLSPSFHPMPSKTTNTHPMQTRVKSGLILPKINPKLLLTHTEPRTTKQA 932

Query: 241  LKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA 300
            L+   W  AM +E++ L +NKTWTLV  P N+ V+GCKWVF+ K++ DGTI +YKARLVA
Sbjct: 933  LQDRKWLSAMTEEFEALKRNKTWTLVPLPNNRDVIGCKWVFRTKENPDGTINKYKARLVA 992

Query: 301  KGFHQTLDVNF 306
            KGFHQ    +F
Sbjct: 993  KGFHQVQGFDF 1002

BLAST of Sed0023410 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 180.3 bits (456), Expect = 3.7e-44
Identity = 131/393 (33.33%), Positives = 181/393 (46.06%), Query Frame = 0

Query: 1    MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
            +SHAS+P  +W  AFS A++LINRL TP+L   SP QKLF + P+Y +L VFG  C+P+L
Sbjct: 619  LSHASVPKTYWPYAFSVAVYLINRLPTPLLQLQSPFQKLFGQPPNYEKLKVFGCACYPWL 678

Query: 61   RPY---KLDFRSKPCTFLGYSSMHKGYKCLD-ENGKLYVSRNILFDELTFSFATKGHSPK 120
            RPY   KL+ +SK C F+GYS     Y CL    G+LY SR++ FDE  F F+T      
Sbjct: 679  RPYNRHKLEDKSKQCAFMGYSLTQSAYLCLHIPTGRLYTSRHVQFDERCFPFSTTNFGVS 738

Query: 121  T------------------------------------TSKSVVTSSIPIL------SQPP 180
            T                                    TS    +S  P+       S  P
Sbjct: 739  TSQEQRSDSAPNWPSHTTLPTTPLVLPAPPCLGPHLDTSPRPPSSPSPLCTTQVSSSNLP 798

Query: 181  QSSSQSLSTLQP---------PNVSVNDSSTLQNSSPLLSSP----------------PS 240
             SS  S S+ +P         P    + +    ++SP+L++P                P 
Sbjct: 799  SSSISSPSSSEPTAPSHNGPQPTAQPHQTQNSNSNSPILNNPNPNSPSPNSPNQNSPLPQ 858

Query: 241  QPVAS----------DTHSLQSNGANDIAPITSVIPCP---------IVNAHPMQTRGKS 300
             P++S             +  S+ +    P+  V+P P          VN H M TR K 
Sbjct: 859  SPISSPHIPTPSTSISEPNSPSSSSTSTPPLPPVLPAPPIIQVNAQAPVNTHSMATRAKD 918

BLAST of Sed0023410 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 175.3 bits (443), Expect = 1.2e-42
Identity = 135/389 (34.70%), Positives = 179/389 (46.02%), Query Frame = 0

Query: 1    MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
            +SHAS+P  +W  AF+ A++LINRL TP+L   SP QKLF   P+Y +L VFG  C+P+L
Sbjct: 640  LSHASIPKTYWPYAFAVAVYLINRLPTPLLQLESPFQKLFGTSPNYDKLRVFGCACYPWL 699

Query: 61   RPY---KLDFRSKPCTFLGYSSMHKGYKCLD-ENGKLYVSRNILFDELTFSF-------- 120
            RPY   KLD +S+ C FLGYS     Y CL  +  +LY+SR++ FDE  F F        
Sbjct: 700  RPYNQHKLDDKSRQCVFLGYSLTQSAYLCLHLQTSRLYISRHVRFDENCFPFSNYLATLS 759

Query: 121  -------------------------------------ATKGHSPKTTSK----------S 180
                                                 AT   SP    +          S
Sbjct: 760  PVQEQRRESSCVWSPHTTLPTRTPVLPAPSCSDPHHAATPPSSPSAPFRNSQVSSSNLDS 819

Query: 181  VVTSSIPILSQP-------PQSSSQSLSTLQPPNVSVNDS-STLQNSSP-----LLSSP- 240
              +SS P   +P       PQ ++Q   T    + S N S +   N SP      LS+P 
Sbjct: 820  SFSSSFPSSPEPTAPRQNGPQPTTQPTQTQTQTHSSQNTSQNNPTNESPSQLAQSLSTPA 879

Query: 241  -PSQPVASDTHSLQSNGANDIAPITSVIPCP------------IVNAHPMQTRGKSGIFR 300
              S    S T S  S+  +   P   + P P             +N H M TR K+GI +
Sbjct: 880  QSSSSSPSPTTSASSSSTSPTPPSILIHPPPPLAQIVNNNNQAPLNTHSMGTRAKAGIIK 939

BLAST of Sed0023410 vs. ExPASy Swiss-Prot
Match: P92520 (Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 GN=AtMg00820 PE=4 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 1.4e-22
Identity = 57/105 (54.29%), Positives = 69/105 (65.71%), Query Frame = 0

Query: 204 MQTRGKSGI--FRPK-ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLV 263
           M TR K+GI    PK +L   T  + EP +   ALK   W +AMQ+E D L +NKTW LV
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 264 SKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNF 306
             P+NQ ++GCKWVFK K HSDGT+ R KARLVAKGFHQ   + F
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYF 105

BLAST of Sed0023410 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 104.0 bits (258), Expect = 3.4e-21
Identity = 88/309 (28.48%), Positives = 135/309 (43.69%), Query Frame = 0

Query: 4   ASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCF---PYL 63
           A +P +FW +A  TA +LINR  +  L    P +   ++   YS L VFG + F   P  
Sbjct: 603 AKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAFAHVPKE 662

Query: 64  RPYKLDFRSKPCTFLGYSSMHKGYKCLDE-NGKLYVSRNILFDELTFSFATKGHSPKTTS 123
           +  KLD +S PC F+GY     GY+  D    K+  SR+++F E      T     +   
Sbjct: 663 QRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRE--SEVRTAADMSEKVK 722

Query: 124 KSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTHSL 183
             ++ + + I S     +S   +T +           ++    L      + V    H  
Sbjct: 723 NGIIPNFVTIPSTSNNPTSAESTTDEVSEQGEQPGEVIEQGEQL-----DEGVEEVEHPT 782

Query: 184 QSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVAL---K 243
           Q    +           P+  +   +   +        LIS   D+ EP + K  L   +
Sbjct: 783 QGEEQHQ----------PLRRSERPRVESRRYPSTEYVLIS---DDREPESLKEVLSHPE 842

Query: 244 CAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKG 303
                KAMQ+E + L +N T+ LV  P  ++ + CKWVFK+KK  D  + RYKARLV KG
Sbjct: 843 KNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLKKDGDCKLVRYKARLVVKG 891

Query: 304 FHQTLDVNF 306
           F Q   ++F
Sbjct: 903 FEQKKGIDF 891

BLAST of Sed0023410 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 59.3 bits (142), Expect = 9.6e-08
Identity = 38/111 (34.23%), Positives = 62/111 (55.86%), Query Frame = 0

Query: 219 ISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVK 278
           +  ++DEI+  + K     + W +A+  E +    N TWT+  +P N+ +V  +WVF VK
Sbjct: 890 VPNSFDEIQYRDDK-----SSWEEAINTELNAHKINNTWTITKRPENKNIVDSRWVFSVK 949

Query: 279 KHSDGTIARYKARLVAKGFHQTLDVNF--FFRLLARLLNQLLFVFFLLLLL 328
            +  G   RYKARLVA+GF Q   +++   F  +AR+ +   F F L L++
Sbjct: 950 YNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISS---FRFILSLVI 992

BLAST of Sed0023410 vs. ExPASy TrEMBL
Match: A0A5D3D5W0 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold204G001310 PE=4 SV=1)

HSP 1 Score: 255.8 bits (652), Expect = 2.6e-64
Identity = 150/329 (45.59%), Positives = 206/329 (62.61%), Query Frame = 0

Query: 1   MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
           +S A++PL+FWD+AFST+++LIN L TPVL  +SPL+K+F R P++  L VFG KC+PYL
Sbjct: 8   LSQATLPLSFWDEAFSTSVYLINLLPTPVLDNISPLEKVFFRKPNFPFLRVFGCKCYPYL 67

Query: 61  RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFAT-KGHSPK 120
           RPY   KL  RS PCTFLGYS+ HKGYKCL  +G+L++SR++LFDE +F +A+   HS  
Sbjct: 68  RPYQSHKLSLRSTPCTFLGYSTSHKGYKCLASDGRLFISRHVLFDENSFPYASFASHSSI 127

Query: 121 TTSKSVVTSSIPILSQPPQS-------------SSQSLSTLQPPNVSVNDSSTLQ----- 180
             SK+V+  S P+ S  P S              S +   L P  V   ++ T +     
Sbjct: 128 PKSKNVL--SPPLHSIIPSSLMNHNEDRRHTDTVSDNTDYLNPTIVYPLETGTQESSRDD 187

Query: 181 -NSSPLLSSP-PSQPVASDTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPK 240
            NS  +  SP P +P     H   S G N     TS+        HPM T+ K  IF+PK
Sbjct: 188 GNSGGITQSPSPMEP----PHQTDS-GMNTQLQSTSI--------HPMITQSKHDIFKPK 247

Query: 241 ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFK 300
           A +   Y + E  NAK A    HW+KAM++E+  L +N TW+L+ +  NQK+VGCKWVFK
Sbjct: 248 AFLI-DYTQTETCNAKEAFNHPHWKKAMEEEFKALQKNGTWSLIPQNPNQKIVGCKWVFK 307

Query: 301 VKKHSDGTIARYKARLVAKGFHQTLDVNF 306
           +K++S G+I+RYKARLVAKGFHQT ++++
Sbjct: 308 IKRNSYGSISRYKARLVAKGFHQTHNIDY 320

BLAST of Sed0023410 vs. ExPASy TrEMBL
Match: A0A2Z6MJI3 (Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 GN=TSUD_62940 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 2.4e-62
Identity = 134/311 (43.09%), Positives = 187/311 (60.13%), Query Frame = 0

Query: 1    MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
            +S A++PL +WD AF TA+HLINRL T  L+   P   LF + PDY+ L VFG  CFP +
Sbjct: 693  LSQANLPLTYWDHAFLTAVHLINRLPTASLNFKVPYTTLFQKDPDYNSLKVFGSACFPLI 752

Query: 61   RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKT 120
            RPY   K DFRS  C FLGYS+ HKGYKCL   G++YVS++++F+E  F + +   +P  
Sbjct: 753  RPYNSHKFDFRSHECIFLGYSTTHKGYKCLSPTGRIYVSKDVMFNESRFPYESLFPTP-N 812

Query: 121  TSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVASDTH 180
            ++ S  T  IP+ + P  + ++++ T        + ++   ++SP  +  P+QP     H
Sbjct: 813  SALSNPTPDIPLTTLPIGTQNENILTNIQNPTDQSSNTNQPSTSPPTTLQPTQPDTELPH 872

Query: 181  SLQSNGAND---IAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVA 240
            S  SN  +D   ++P    +P    N HPMQTR KSG+  PK          EP   K A
Sbjct: 873  STSSNPNSDPLTLSPSFHPMPSKTTNTHPMQTRVKSGLILPKINPKLLLTHTEPRTTKQA 932

Query: 241  LKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA 300
            L+   W  AM +E++ L +NKTWTLV  P N+ V+GCKWVF+ K++ DGTI +YKARLVA
Sbjct: 933  LQDRKWLSAMTEEFEALKRNKTWTLVPLPNNRDVIGCKWVFRTKENPDGTINKYKARLVA 992

Query: 301  KGFHQTLDVNF 306
            KGFHQ    +F
Sbjct: 993  KGFHQVQGFDF 1002

BLAST of Sed0023410 vs. ExPASy TrEMBL
Match: A0A803NRU8 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 249.2 bits (635), Expect = 2.4e-62
Identity = 142/311 (45.66%), Positives = 192/311 (61.74%), Query Frame = 0

Query: 1   MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
           ++ ASMPL FWD+AF  A++L NRL TPVL+ +SP++ LF+  PDY  L +FG  CFP +
Sbjct: 515 LAQASMPLKFWDEAFRCAVYLHNRLPTPVLNQLSPIEILFNNKPDYGNLKIFGCLCFPNI 574

Query: 61  RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFAT-KGHSPK 120
           RPY   KLDFRS PCTFLG S  HKGYKCLD +G+LY+SR+++FDE  FS+A+    + +
Sbjct: 575 RPYNKHKLDFRSSPCTFLGCSLNHKGYKCLDSHGRLYISRDVIFDESNFSYASISTDASE 634

Query: 121 TTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVS--VNDSSTLQNSSPLLSSPPSQPVAS 180
           + S+S + S+IP+   P   + +SL +L   +V+  V D++ + +S   L  P    V S
Sbjct: 635 SVSESHIPSAIPLNHLP--YTVESLFSLPSASVTNRVTDAAVVASSGMYLQVPLKTIVPS 694

Query: 181 DTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVA 240
              SLQ              P P +N H M+TR KSGI++PKAL+       EP N K A
Sbjct: 695 QNLSLQ-----------HPTPSPPINNHSMKTRAKSGIYKPKALLVSQ----EPSNVKAA 754

Query: 241 LKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA 300
           LK   W  AM +E   L +N TWT V  P  +  +GCKWV+K K ++DG + R KARLVA
Sbjct: 755 LKEEKWCNAMSEEMVALKKNGTWTYVPLPSGRTPIGCKWVYKEKLNADGNVNRNKARLVA 808

Query: 301 KGFHQTLDVNF 306
           KGFHQ    +F
Sbjct: 815 KGFHQQAGFDF 808

BLAST of Sed0023410 vs. ExPASy TrEMBL
Match: A0A438H844 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=29760 GN=RE1_3152 PE=4 SV=1)

HSP 1 Score: 248.8 bits (634), Expect = 3.1e-62
Identity = 143/312 (45.83%), Positives = 192/312 (61.54%), Query Frame = 0

Query: 1   MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
           ++ AS+P  +WD+AF T+++LINRL TPVL   SPL+ LF + P YSQL VFG  C+P L
Sbjct: 680 LAQASLPFKYWDEAFRTSVYLINRLPTPVLKNKSPLEVLFHQKPSYSQLKVFGCMCYPNL 739

Query: 61  RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFATKGHSPKT 120
           RP+   KL FRS PCTFLGYS  HKGYKCL  NG + +SR+++FDE  F FA +    +T
Sbjct: 740 RPFNHHKLQFRSIPCTFLGYSLNHKGYKCLSPNGNILISRDVIFDEHAFPFAQRQSQKQT 799

Query: 121 TSK-SVVTSSIPILSQPP---QSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVA 180
           TS  S  ++S+P  +  P     SS S ST  P N S+  +++  N +       SQP  
Sbjct: 800 TSSFSSSSTSLPCQTSLPLMVLPSSTSCSTSSPTNPSIFPATSNHNVA-------SQP-- 859

Query: 181 SDTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKV 240
                          P +S  P P   +H M TR K+GIF+PKA +  T     P +   
Sbjct: 860 ---------------PPSSAPPFP---SHHMITRSKNGIFKPKAYLIST----TPTSVPE 919

Query: 241 ALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLV 300
           AL+ +HW++AM DEY  L++N TW LV  P ++K++G KWVFKVK++ DGTI +YKARLV
Sbjct: 920 ALQLSHWKQAMTDEYLALLRNNTWDLVPLPTDRKLIGYKWVFKVKENPDGTINKYKARLV 960

Query: 301 AKGFHQTLDVNF 306
           AKGFHQ +  +F
Sbjct: 980 AKGFHQIVGFDF 960

BLAST of Sed0023410 vs. ExPASy TrEMBL
Match: A0A2K3NHD4 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Trifolium pratense OX=57577 GN=L195_g025737 PE=4 SV=1)

HSP 1 Score: 248.4 bits (633), Expect = 4.1e-62
Identity = 140/311 (45.02%), Positives = 186/311 (59.81%), Query Frame = 0

Query: 1    MSHASMPLAFWDDAFSTAIHLINRLSTPVLHGVSPLQKLFDRVPDYSQLCVFGFKCFPYL 60
            +S AS+PL++WD AF TA++LINRL +  L    P   LF + PDY  L VFG  CFP L
Sbjct: 732  LSQASLPLSYWDYAFLTAVYLINRLPSAPLDFKIPYTLLFHQDPDYKFLKVFGCACFPLL 791

Query: 61   RPY---KLDFRSKPCTFLGYSSMHKGYKCLDENGKLYVSRNILFDELTFSFA---TKGHS 120
            RPY   KLD+RS  C FLGYS  HKGY+CL  NG+L++S++++F+E  F F    T  HS
Sbjct: 792  RPYNTHKLDYRSHECLFLGYSPSHKGYRCLSPNGRLFISKDVIFNESRFPFIDLFTSPHS 851

Query: 121  PKTTSKSVVTSSIPILSQPPQSSSQSLSTLQPPNVSVNDSSTLQNSSPLLSSPPSQPVAS 180
               +  S VT S P++  P  S S + S    P+ S+ +S     SSP++S  P+ P  +
Sbjct: 852  VVPSKSSAVTLS-PLVHHPSHSPSPTSS----PSPSIPNSPPPSASSPVVS--PTSPPVT 911

Query: 181  DTHSLQSNGANDIAPITSVIPCPIVNAHPMQTRGKSGIFRPKALISKTYDEIEPPNAKVA 240
               SL    A  I P    +     N HPM TR K G  +P+         IEP + K A
Sbjct: 912  SPESLSPPNAPPIPPKAKPVVHKPSNLHPMLTRAKDGFTQPRLEPRLLLTHIEPASVKQA 971

Query: 241  LKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIARYKARLVA 300
            L+   W+ AMQ EYD L+ N TWTLV  P ++  +GCKWVF+VK++SDGT+ +YKARLVA
Sbjct: 972  LQVPEWKTAMQAEYDALLANNTWTLVPLPSDRSSIGCKWVFRVKQNSDGTLNKYKARLVA 1031

Query: 301  KGFHQTLDVNF 306
            KGFHQ   ++F
Sbjct: 1032 KGFHQRHGIDF 1035

BLAST of Sed0023410 vs. TAIR 10
Match: ATMG00820.1 (Reverse transcriptase (RNA-dependent DNA polymerase))

HSP 1 Score: 108.6 bits (270), Expect = 9.8e-24
Identity = 57/105 (54.29%), Positives = 69/105 (65.71%), Query Frame = 0

Query: 204 MQTRGKSGI--FRPK-ALISKTYDEIEPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLV 263
           M TR K+GI    PK +L   T  + EP +   ALK   W +AMQ+E D L +NKTW LV
Sbjct: 1   MLTRSKAGINKLNPKYSLTITTTIKKEPKSVIFALKDPGWCQAMQEELDALSRNKTWILV 60

Query: 264 SKPINQKVVGCKWVFKVKKHSDGTIARYKARLVAKGFHQTLDVNF 306
             P+NQ ++GCKWVFK K HSDGT+ R KARLVAKGFHQ   + F
Sbjct: 61  PPPVNQNILGCKWVFKTKLHSDGTLDRLKARLVAKGFHQEEGIYF 105

BLAST of Sed0023410 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8)

HSP 1 Score: 72.8 bits (177), Expect = 5.9e-13
Identity = 43/110 (39.09%), Positives = 60/110 (54.55%), Query Frame = 0

Query: 227 EPPNAKVALKCAHWRKAMQDEYDDLMQNKTWTLVSKPINQKVVGCKWVFKVKKHSDGTIA 286
           EP     A +   W  AM DE   +    TW + + P N+K +GCKWV+K+K +SDGTI 
Sbjct: 85  EPSTYNEAKEFLVWCGAMDDEIGAMETTHTWEICTLPPNKKPIGCKWVYKIKYNSDGTIE 144

Query: 287 RYKARLVAKGFHQTLDVNFF--FRLLARLLNQLLFVFFLLLLLHTTRPLD 335
           RYKARLVAKG+ Q   ++F   F  + +L +  L +    +   T   LD
Sbjct: 145 RYKARLVAKGYTQQEGIDFIETFSPVCKLTSVKLILAISAIYNFTLHQLD 194

The following BLAST results are available for this feature:

vs. NCBI nr
Match Name      E-value   Identity (%)  Description
TYK18915.1      5.3e-64   45.59         Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. m...
KAG8473223.1    9.1e-64   44.38         hypothetical protein CXB51_035172 [Gossypium anomalum]
KAG8502419.1    7.7e-63   44.59         hypothetical protein CXB51_000456 [Gossypium anomalum]
KAG8479334.1    1.3e-62   42.37         hypothetical protein CXB51_029681 [Gossypium anomalum]
GAU32278.1      5.0e-62   43.09         hypothetical protein TSUD_62940 [Trifolium subterraneum]

vs. ExPASy Swiss-Prot
Match Name      E-value   Identity (%)  Description
Q9ZT94          3.7e-44   33.33         Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
Q94HW2          1.2e-42   34.70         Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
P92520          1.4e-22   54.29         Uncharacterized mitochondrial protein AtMg00820 OS=Arabidopsis thaliana OX=3702 ...
P10978          3.4e-21   28.48         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
P04146          9.6e-08   34.23         Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3

vs. ExPASy TrEMBL
Match Name      E-value   Identity (%)  Description
A0A5D3D5W0      2.6e-64   45.59         Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Cucumis melo var....
A0A2Z6MJI3      2.4e-62   43.09         Integrase catalytic domain-containing protein OS=Trifolium subterraneum OX=3900 ...
A0A803NRU8      2.4e-62   45.66         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A438H844      3.1e-62   45.83         Retrovirus-related Pol polyprotein from transposon RE1 OS=Vitis vinifera OX=2976...
A0A2K3NHD4      4.1e-62   45.02         Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) OS=Trifol...

vs. TAIR 10
Match Name      E-value   Identity (%)  Description
ATMG00820.1     9.8e-24   54.29         Reverse transcriptase (RNA-dependent DNA polymerase)
AT4G23160.1     5.9e-13   39.09         cysteine-rich RLK (RECEPTOR-like protein kinase) 8

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term   IPR Description                                      Source       Source Term  Source Description   Alignment
IPR013103  Reverse transcriptase, RNA-dependent DNA polymerase  PFAM         PF07727      RVT_2                coord: 254..305; e-value: 5.0E-12; score: 45.9
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction  coord: 128..181
None       No IPR available                                     PANTHER      PTHR45895    FAMILY NOT NAMED     coord: 238..302; coord: 2..116

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Sed0023410.1   Sed0023410.1   mRNA