Sed0020888 (gene) Chayote v1

Overview
Name: Sed0020888
Type: gene
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase Ty1/copia-type domain-containing protein
Location: LG03: 16910163 .. 16914994 (+)
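
As a small illustration of how the Location field can be read, the sketch below splits the string into linkage group, start, end and strand, then computes the span of the gene. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_location(text):
    """Split a location string like 'LG03: 16910163 .. 16914994 (+)' into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

seqid, start, end, strand = parse_location("LG03: 16910163 .. 16914994 (+)")
# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, strand, span)
```

Under that assumption the gene spans 4832 bp on the plus strand of LG03.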
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

CGAACTTGAAATAATGCAATTTTATATATTATTTTCTAAAAAATCTATCCTAAAGAGAAATGAAACATTGTGCTTTCCAACTTCAAGGTCTCTCTCTGCTTCATTAACCCTACCTCATCAAATCCTCTTCTTGAATGTCAAGGAAACAACCTCCGGTATCACAACGGTAGGGGAATTGATGGAATCTTCGACTAGGAGTTCGCTGCCCACTATTGTATTCGACCTTTAAATTTCCTCCCTCGCATCCCGACCCATTTCACTCTTTTTTACCTTCATTTTCATTCATCGCGATTTTACACACATTTCACCCTTTTCTCTCCAACTCCAACCCCCTTCATCGCCATTGCCACTACCGAAGCACCTAAAGTCCTCATCATCTCGATCACCGCCAGCTTCATCTGCCTGTCGCAACCTATATTTGAAAGGTTAGGGTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTCATTTCAATGGAAATTTCTCATTTTTTGCACTCATATGTTTGATTATATTAGGTATAATGTGAATTTAGAATTAATTTTTTGGTTAAGTTTTAGATTTAGATTTAAATAAGAAATTTAAAATTTTAGATTTAGAATTGAAATTAGATTTAGGTTTAGTGTAATTTCAGGATTGAAAATACATTAGGTTTATTTTAAGTTTAGGATTGTATTTAGATGTAGCGTTTATTGTAATTGTAGGATTGAAAATAGATTTAGGTTTAGTTTAAGTTTAAGATTGCAATTAGATGTTTATGCATAGGTTGTAATTAGTTTTTTAGACTAGTTTTTTACTCAATCAATTTCACGGTTGAAGAATGGGTGAAAAGGCATGTGAAGAAGGTGACCAAAATGTTTCGGTCATTGAAACTAATTCTTTTAGTGATAAATCAATCAAAATATTCAATGTTCCATTTCTCCCAAAAAATGAAGGAATTTCTAGTAACCATAATTTAGATAGTGCTATGGCATGTATGGCACCAGAGCAAGTAGATGAAACATTTAAAAGGGAAATTAATGGAAGAAGGAATCAAACTTCAAGACAAGTGAATATATTTTTTAACTTTCAAATGTATTTTTTAGTTGTTAAAACTTTATATTAAAGACTATTTTAGTACTTATAATTTATTCAATCATTTCTCCTATCTAACATGCTTAATATAACTTCAAATACTTGTCCATATTGATATAATGGATCAGCCTGGTGGTCAAATAAATAATTTGCGTAAAATAAACTGGATTGAGGTTATAAGTTCAATCTATAGTAACCATTAAGTTTCTCAGGCTATCAAGTATAATTAGGTCAAATGGTTGTTTCGTGAAGGTCTCGTAGATTCTAGTTTTCATTATATGAATCAATTGTTTTTTTTTCACTTTACAATTGTGATTTTCTATAATATATTTTGCCTTACTCGTCATAAGCCTTATTGGAGTAGAAATATATGCAATCAAGCAGAAGAAACGAACTTTTTGAAGCATTAAATGACAATACTTTGAGCTTTTTGATGCATTAAATGGCAATACTTTGTCAGAGTTAGGTTCTATCACTAAATATTGCTATCACTACAGTATGGTCATTTTAAACTATATGTCCATATCTCAAAATAAAAGTAGAGTTAAAAGAAGTGTTTTTTAAAGCATGTGTCTTGAGAAACCATGTATCTTAGCCAAAAGTAAGAAATTAGGAAATTTTCTTTCTTGGGGATTTTATTTAATTTTAAAAAATAATATATATGTGTGTTAAGTAGATGACATGTTTAAAGGCACAATTGAAGTGTGGTGGGCTTCAAGATTGCCCTTGCCTTGCCTCACATTAGACAAGGTGCCTTTGTTGCGGTCTCCTCGAGTGTTTTCTAAATGCATTCTATACGTGATTCTCTCTGTGCATGAAATGGGAGAGTAATATTAATATTTTTAAAGAAGTTGAGGATGGGTGTAGGTGCCCTCAGAGATAGTAGATAGTAGAGTGAAACTTTGATTTCCCAGTTTGAGT
AAAAAATATTCTTAATGGAGTTTTTTTTATAAGCTTATGAATGTGGAGGCCAACAACCATCATTTTTTGTTGCATGATAAAAAATCTGCTACTTTTTGATAATCATTTGGTTTTTTGTTTCCAATTTTCGGTTTGGTTTCTAAGAATATATGTAGGATGTAGATATCCAAATAAGAAAAAACACAAGGTAAGAGAGTTGTTGTAGGCTTAAATTTTAAAAACAAAAAACTTAAAAAAAAATGGTTATCAAACATGGCCTATGTTAGTAGATTTCTATAACTTTTAATCTATTTTGTAAGTGAATGTCAGTGTTGTCTTGTGTACCTATTATTTTTTAAAAATGATTGCTTTTCTTTCTTTTAAAAGAAACTCCATTTGAAATGTATTTAGGGTCATTATAGAATCTTTGAAGTTTTGTTGAGAAGTAATAGATTTTCTTCGAATTCTGTTGGTGGCTGCTGTTAGTAAAAGGTCAATATATTTATCAACTCCAAATCCTCAGGTATACCAAAAAGTTCCCACCATTATTTTTTGGTTGAGTCTTTCCTTTATCGAAGTATTTGCATTTGCTTGATAGATTCAACTCTATGTGGAGACTTCACAGTTAATTTAACTATTGAGAGTGACCAATAATACCCATTTGTTTTTCAACTAATTTGTCATCTAGTATGTTTGCAAGCACTTGGACAGAATTTTAAAAAGAAAGTATGACTTATTCCCTTGACAATTTAGTAGTGTGGAAAAAGAAAGTGGAAGGTAAAGTCTTAGATACAAAATTTTAAAAAGTAAGTATCAAGACATTGCACAAGAAATAGATAACTTTGATTGACTGCCTTATGTTTTCATTTTATTTATATCTTAGGTAAACGAGAGGCTTTAGAAATTATTCAAACTTATAAAGTGAACCATTATAAGTAGATAACTTTGGTTGCCTTAATTTTTTAGAATTAGAATTGAAGCTGAAGTATATATTGGATTGATGCTTATAGGGAATATGGAAAGGGAAATAAAGAGGCTAGGGAAAAGACCTATGTTGTAGGCATTTAAATTTTAATGTAAGTTTCTCCATTGTTCTCATCATTTTCTTTTCTTGCTTAAGATTCTGAAATATTTATTGATGCATATTCGTTAGAATTACTTATTTACTCTTCTCTTCTCGATTTCAGTAATTCTTCACATTTCCTTGTGATTGAAAGTAGTAACGAAAATAGTAAGAGATACCAAACAAGCTCAAGGGGTCTAAATCTAAGTATTATAATAAGCAAAAATTGTTGAACATTGAGGTAATTTCTTTCGAGAAGTCATCACTTGAGATATTACAATTTTCAAATTCAACCATTGTATATATTAGGAGTATTGAACGTTTTTCTTTTCATTTGTTTGTTGCTGATTAATTTGATTAATTGTAGTGCATCTCTATTTTTCCTTTCTGATTAGGTGTATCTTCTAACAATTTGTTGGTTGTTACAAAGCTGTGGGCCATGTTACTTTATTTAGGCAAGTTTCTTTTTGGTGTGGTTGTATGGAATAAACAACCCCTAAGGATGGCACAATGGTTGAAGACTTGGTTTTTTAGGGTATGCTCCCCTCAAGGTCTCAGGTTCGAGACTAAGTTGTGACATTACTTCTTCGATGTAACTTAAGTGCAACTTCGCACAGGGGATAGTATATCATGCTTTTTTGTTATATTGTGTAAACTAACTAGTTTTCACACCCCTCTTAAACATTTTAAGAATTATAAGTTTATTGATATTGATTTTTTCTTTTTGGGCAAGTGTAATGTACAAATTATTTAGTCATATATGAAATAATATTTGTTGAATAAGACTTGTGGGCGCATAATATAAATGTTTCTTGACAGGTAAAGACAAACAAAACTGGTTATAAATAGGAAGCCCACAGTAGCAATGCTAAAATCCATAAGGATAAGCTCCTTGCGATTGTTGCCTATCATGATTATGAAATATGGCAGATGGATGTTAAAACCGCTTTCTTGAATG
CGTTCTTGAAAGAGGATGTGTATATGACACACCCTGAAGGTTTTCAAGATCCAGCCAATCCTGGGAGAGTATACAGGCTTCAAAAATCTATTTATGGATTGAAAAAAGTATCTAGGAGTTGGAACCTCAGATTTGATGAGGCATTCAAACAGTTTAGGTTCCTTAAAAATGAAGAGGAATTTTGTGTATACAAGAAGTTAAGTGGGATCAGTGTTACCTTCCTTGTTCTGTATGTAGATGACATACTACTCATAGGAAACGATAGAACCATGCTTGAATCAGTCAAAGATTGACTTAAAAATTGTTCCTCTATGAAAGACATTGGAGAGGCTGAGTACATTCTAGGAATAAGAATCTATAGAGATAGATCCAAAAGAATGATTGGACTTAGTCAGGAAACTTATATTGATAAGGTTCTTACTAAGTTCAATATGGAAACTCTAAGAGAGGTTTCATTCCCATGCAACATGGCATATCGATTAGCAAGACTCAATGTCCTACAAGTCCTATTGAGGCTAAGCGTATGAGTATTGTTCCTTATGCTTTGGCAATAGGTTCAATCATGTATGCCATGATTTGTACTCGACCGGACGATGTGCTCATGCTTTTAGCATATGTAACAGATACCAGTCTAATCCTAGTGATACACACTGGATAGTAGTGAAAAATATTCTTAAGTACTTGAGAAAAACGAAGGATAATTTTATGGTTTATGGTGGAGTTAATGAGTTGGTTGTTACTGGATAAACTGATGCAAACATTGCAGATCCACTGACTAAACCATTACCGCAACCCAAACATGAGAGTCATACTAGGACTATGAGTATTAGAC

mRNA sequence

CGAACTTGAAATAATGCAATTTTATATATTATTTTCTAAAAAATCTATCCTAAAGAGAAATGAAACATTGTGCTTTCCAACTTCAAGGTCTCTCTCTGCTTCATTAACCCTACCTCATCAAATCCTCTTCTTGAATGTCAAGGAAACAACCTCCGGTATCACAACGGTAGGGGAATTGATGGAATCTTCGACTAGGAGTTCGCTGCCCACTATTGTATTCGACCTTTAAATTTCCTCCCTCGCATCCCGACCCATTTCACTCTTTTTTACCTTCATTTTCATTCATCGCGATTTTACACACATTTCACCCTTTTCTCTCCAACTCCAACCCCCTTCATCGCCATTGCCACTACCGAAGCACCTAAAGTCCTCATCATCTCGATCACCGCCAGCTTCATCTGCCTGTCGCAACCTATATTTGAAAGGGAATATGGAAAGGGAAATAAAGAGGCTAGGGAAAAGACCTATGTTGTAGGCATTTAAATTTTAATTAATTCTTCACATTTCCTTGTGATTGAAAGTAGTAACGAAAATAGTAAGAGATACCAAACAAGCTCAAGGGGTCTAAATCTAAGTATTATAATAAGCAAAAATTGTTGAACATTGAGGTGTATCTTCTAACAATTTGTTGGTTGTTACAAAGCTGTGGGCCATGTTACTTTATTTAGGCAAGTTTCTTTTTGGTGTGGTTGTATGGAATAAACAACCCCTAAGGATGGCACAATGGTTGAAGACTTGGTTTTTTAGGGTATGCTCCCCTCAAGGTCTCAGGTTCGAGACTAAGTTGTGACATTACTTCTTCGATGTAACTTAAGTGCAACTTCGCACAGGGGATAGTATATCATGCTTTTTTGTTATATTGTGTAAACTAACTAGTTTTCACACCCCTCTTAAACATTTTAAGAATTATAAGTTTATTGATATTGATTTTTTCTTTTTGGGCAAGTGTAATGTACAAATTATTTAGTCATATATGAAATAATATTTGTTGAATAAGACTTGTGGGCGCATAATATAAATGTTTCTTGACAGGTAAAGACAAACAAAACTGGTTATAAATAGGAAGCCCACAGTAGCAATGCTAAAATCCATAAGGATAAGCTCCTTGCGATTGTTGCCTATCATGATTATGAAATATGGCAGATGGATGTTAAAACCGCTTTCTTGAATGCGTTCTTGAAAGAGGATGTGTATATGACACACCCTGAAGGTTTTCAAGATCCAGCCAATCCTGGGAGAGTATACAGGCTTCAAAAATCTATTTATGGATTGAAAAAAGTATCTAGGAGTTGGAACCTCAGATTTGATGAGGCATTCAAACAGTTTAGGTTCCTTAAAAATGAAGAGGAATTTTGTGTATACAAGAAGTTAAGTGGGATCAGTGTTACCTTCCTTGTTCTGTATGTAGATGACATACTACTCATAGGAAACGATAGAACCATGCTTGAATCAGTCAAAGATTGACTTAAAAATTGTTCCTCTATGAAAGACATTGGAGAGGCTGAGTACATTCTAGGAATAAGAATCTATAGAGATAGATCCAAAAGAATGATTGGACTTAGTCAGGAAACTTATATTGATAAGGTTCTTACTAAGTTCAATATGGAAACTCTAAGAGAGGTTTCATTCCCATGCAACATGGCATATCGATTAGCAAGACTCAATGTCCTACAAGTCCTATTGAGGCTAAGCGTATGAGTATTGTTCCTTATGCTTTGGCAATAGGTTCAATCATGTATGCCATGATTTGTACTCGACCGGACGATGTGCTCATGCTTTTAGCATATGTAACAGATACCAGTCTAATCCTAGTGATACACACTGGATAGTAGTGAAAAATATTCTTAAGTACTTGAGAAAAACGAAGGATAATTTTATGGTTTATGGTGGAGTTAATGAGTTGGTTGTTACTGGATAAACTGATGCAAACATTGCAGATCCACTGACTAAACCATTACCGCAACCCAAACATGAGAGTCATACTAGGACTATGAGTATTA
GAC

Coding sequence (CDS)

ATGGATGTTAAAACCGCTTTCTTGAATGCGTTCTTGAAAGAGGATGTGTATATGACACACCCTGAAGGTTTTCAAGATCCAGCCAATCCTGGGAGAGTATACAGGCTTCAAAAATCTATTTATGGATTGAAAAAAGTATCTAGGAGTTGGAACCTCAGATTTGATGAGGCATTCAAACAGTTTAGGTTCCTTAAAAATGAAGAGGAATTTTGTGTATACAAGAAGTTAAGTGGGATCAGTGTTACCTTCCTTGTTCTGTATGTAGATGACATACTACTCATAGGAAACGATAGAACCATGCTTGAATCAGTCAAAGATTGA

Protein sequence

MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVKD
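
The CDS and protein sequences above should correspond codon by codon. The following is a minimal sketch of that translation, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, and only short excerpts of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons of the CDS listed above.
print(translate("ATGGATGTTAAAACCGCTTTC"))  # MDVKTAF
# Last three codons of the CDS, ending in the TGA stop.
print(translate("AAAGATTGA"))  # KD
```

Passing the full CDS string to `translate` would be the way to check the complete protein sequence against the one listed.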
Homology
BLAST of Sed0020888 vs. NCBI nr
Match: BAA22288.1 (polyprotein [Oryza australiensis])

HSP 1 Score: 159.8 bits (403), Expect = 1.3e-35
Identity = 75/105 (71.43%), Positives = 87/105 (82.86%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            MDVKTAFLN  L EDVYM  P+GF DP +PG++ +LQKSIYGLK+ SRSWN+RFDE  K 
Sbjct: 905  MDVKTAFLNGNLSEDVYMIQPQGFVDPESPGKICKLQKSIYGLKQASRSWNIRFDEVIKG 964

Query: 61   FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
            F F+KNEEE CVYKK+SG ++ FL+LYVDDILLIGND  MLESVK
Sbjct: 965  FGFIKNEEEACVYKKVSGSAIVFLILYVDDILLIGNDIPMLESVK 1009
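
The percentages in each HSP summary line are the ratio of matching positions to aligned length. The sketch below parses such a line and recomputes them as a consistency check; `hsp_fractions` is a hypothetical helper, and the regular expression is an assumption about the line layout shown in this report rather than a general BLAST parser.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, aligned_length, reported_percent)} from an HSP summary line."""
    pattern = r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)"
    return {label: (int(n), int(total), float(pct))
            for label, n, total, pct in re.findall(pattern, line)}

line = "Identity = 75/105 (71.43%), Positives = 87/105 (82.86%), Query Frame = 0"
for label, (n, total, reported) in hsp_fractions(line).items():
    # Recompute the percentage from the raw counts.
    recomputed = round(100 * n / total, 2)
    print(label, recomputed, reported)
```

For the first hit, 75 identical residues over 105 aligned positions gives 71.43%, matching the reported value.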

BLAST of Sed0020888 vs. NCBI nr
Match: KAG7543183.1 (Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis thaliana x Arabidopsis arenosa] >KAG7547824.1 Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis suecica])

HSP 1 Score: 159.5 bits (402), Expect = 1.6e-35
Identity = 77/105 (73.33%), Positives = 88/105 (83.81%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKTAFLN  L+EDVYMT PEGF  P N G+V +LQ+SIYGL+K SRSWNLRFDEA K+
Sbjct: 88  MDVKTAFLNGNLEEDVYMTQPEGFTSPQNAGKVCKLQRSIYGLQKASRSWNLRFDEAIKE 147

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
           F F++NEEE CVYKK SG +V FLVLYVDDILLIGND  +L+SVK
Sbjct: 148 FDFIRNEEEPCVYKKTSGSAVAFLVLYVDDILLIGNDIPLLQSVK 192

BLAST of Sed0020888 vs. NCBI nr
Match: KAF5783325.1 (putative RNA-directed DNA polymerase [Helianthus annuus])

HSP 1 Score: 157.5 bits (397), Expect = 6.2e-35
Identity = 76/105 (72.38%), Positives = 84/105 (80.00%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKTAFLN  L EDVYM  PEGF DP NP +V +L KSIYGLK+ SRSWNLRFD+  KQ
Sbjct: 882 MDVKTAFLNGHLTEDVYMEQPEGFVDPKNPKKVCKLNKSIYGLKQASRSWNLRFDQKIKQ 941

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
           F F+KNE+E CVYKK SG S+TFL+LYVDDILLIGND  ML  VK
Sbjct: 942 FGFIKNEDEPCVYKKASGSSITFLILYVDDILLIGNDIPMLRDVK 986

BLAST of Sed0020888 vs. NCBI nr
Match: KAF5788164.1 (putative RNA-directed DNA polymerase [Helianthus annuus])

HSP 1 Score: 157.5 bits (397), Expect = 6.2e-35
Identity = 76/105 (72.38%), Positives = 85/105 (80.95%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKTAFLN  L EDVYM  PEGF DP NP +V +L KSIYGLK+ SRSWNLRFD+  KQ
Sbjct: 858 MDVKTAFLNGHLIEDVYMEQPEGFVDPKNPKKVCKLNKSIYGLKQASRSWNLRFDQKIKQ 917

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
           F F+KNE+E CVYKK SG S+TFL+LYVDDILLIGND  ML+ VK
Sbjct: 918 FGFIKNEDEPCVYKKASGSSITFLILYVDDILLIGNDIPMLQDVK 962

BLAST of Sed0020888 vs. NCBI nr
Match: KAF5757832.1 (putative RNA-directed DNA polymerase [Helianthus annuus])

HSP 1 Score: 155.2 bits (391), Expect = 3.1e-34
Identity = 74/105 (70.48%), Positives = 85/105 (80.95%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKTAFLN  L EDVYM  PEGF DP NP +V +L KSIYGLK+ SR+WNLRFD+  KQ
Sbjct: 882 MDVKTAFLNGHLTEDVYMEQPEGFVDPKNPKKVCKLNKSIYGLKQASRTWNLRFDQKIKQ 941

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
           F F+KNE+E CVYKK SG S+TFL+LYVDDILLIGN+  ML+ VK
Sbjct: 942 FGFIKNEDEPCVYKKASGSSITFLILYVDDILLIGNNIPMLQDVK 986

BLAST of Sed0020888 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1)

HSP 1 Score: 95.1 bits (235), Expect = 5.0e-19
Identity = 45/106 (42.45%), Positives = 68/106 (64.15%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            +DVKTAFL+  L+E++YM  PEGF+       V +L KS+YGLK+  R W ++FD   K 
Sbjct: 921  LDVKTAFLHGDLEEEIYMEQPEGFEVAGKKHMVCKLNKSLYGLKQAPRQWYMKFDSFMKS 980

Query: 61   FRFLKNEEEFCVY-KKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
              +LK   + CVY K+ S  +   L+LYVDD+L++G D+ ++  +K
Sbjct: 981  QTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLK 1026

BLAST of Sed0020888 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 84.3 bits (207), Expect = 8.8e-16
Identity = 39/106 (36.79%), Positives = 62/106 (58.49%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            +DV  AFL   L +DVYM+ P GF D   P  V +L+K++YGLK+  R+W +        
Sbjct: 1064 LDVNNAFLQGTLTDDVYMSQPPGFIDKDRPNYVCKLRKALYGLKQAPRAWYVELRNYLLT 1123

Query: 61   FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVKD 107
              F+ +  +  ++    G S+ ++++YVDDIL+ GND T+L +  D
Sbjct: 1124 IGFVNSVSDTSLFVLQRGKSIVYMLVYVDDILITGNDPTLLHNTLD 1169

BLAST of Sed0020888 vs. ExPASy Swiss-Prot
Match: Q9ZT94 (Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana OX=3702 GN=RE2 PE=4 SV=1)

HSP 1 Score: 81.3 bits (199), Expect = 7.4e-15
Identity = 39/106 (36.79%), Positives = 61/106 (57.55%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            +DV  AFL   L ++VYM+ P GF D   P  V RL+K+IYGLK+  R+W +        
Sbjct: 1047 LDVNNAFLQGTLTDEVYMSQPPGFVDKDRPDYVCRLRKAIYGLKQAPRAWYVELRTYLLT 1106

Query: 61   FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVKD 107
              F+ +  +  ++    G S+ ++++YVDDIL+ GND  +L+   D
Sbjct: 1107 VGFVNSISDTSLFVLQRGRSIIYMLVYVDDILITGNDTVLLKHTLD 1152

BLAST of Sed0020888 vs. ExPASy Swiss-Prot
Match: P04146 (Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3)

HSP 1 Score: 80.5 bits (197), Expect = 1.3e-14
Identity = 42/107 (39.25%), Positives = 65/107 (60.75%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            MDVKTAFLN  LKE++YM  P+G     N   V +L K+IYGLK+ +R W   F++A K+
Sbjct: 1001 MDVKTAFLNGTLKEEIYMRLPQGIS--CNSDNVCKLNKAIYGLKQAARCWFEVFEQALKE 1060

Query: 61   FRFLKNEEEFCVY--KKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
              F+ +  + C+Y   K +     +++LYVDD+++   D T + + K
Sbjct: 1061 CEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFK 1105

BLAST of Sed0020888 vs. ExPASy Swiss-Prot
Match: P25600 (Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c) OX=559292 GN=TY5A PE=5 SV=2)

HSP 1 Score: 73.2 bits (178), Expect = 2.0e-12
Identity = 35/105 (33.33%), Positives = 57/105 (54.29%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDV TAFLN+ + E +Y+  P GF +  NP  V+ L   +YGLK+    WN   +   K+
Sbjct: 1   MDVDTAFLNSTMDEPIYVKQPPGFVNERNPDYVWELYGGMYGLKQAPLLWNEHINNTLKK 60

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
             F ++E E  +Y + +     ++ +YVDD+L+      + + VK
Sbjct: 61  IGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAPSPKIYDRVK 105

BLAST of Sed0020888 vs. ExPASy TrEMBL
Match: O23864 (Polyprotein OS=Oryza australiensis OX=4532 PE=4 SV=1)

HSP 1 Score: 159.8 bits (403), Expect = 6.1e-36
Identity = 75/105 (71.43%), Positives = 87/105 (82.86%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            MDVKTAFLN  L EDVYM  P+GF DP +PG++ +LQKSIYGLK+ SRSWN+RFDE  K 
Sbjct: 905  MDVKTAFLNGNLSEDVYMIQPQGFVDPESPGKICKLQKSIYGLKQASRSWNIRFDEVIKG 964

Query: 61   FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
            F F+KNEEE CVYKK+SG ++ FL+LYVDDILLIGND  MLESVK
Sbjct: 965  FGFIKNEEEACVYKKVSGSAIVFLILYVDDILLIGNDIPMLESVK 1009

BLAST of Sed0020888 vs. ExPASy TrEMBL
Match: A0A6D2HJE2 (CCHC-type domain-containing protein OS=Microthlaspi erraticum OX=1685480 GN=MERR_LOCUS1945 PE=4 SV=1)

HSP 1 Score: 154.1 bits (388), Expect = 3.3e-34
Identity = 74/105 (70.48%), Positives = 86/105 (81.90%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKTAFLN  L+EDVYM  PEGF  P N G+V +LQ++IYGLK+ SRSWNLRFDEA K+
Sbjct: 626 MDVKTAFLNGKLEEDVYMIQPEGFTSPENAGKVCKLQRAIYGLKQASRSWNLRFDEAIKE 685

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
           F F++NEEE CVYKK SG +V FLVLYVDDI LIGND  +L+SVK
Sbjct: 686 FDFIRNEEEPCVYKKTSGSAVAFLVLYVDDIFLIGNDIPLLKSVK 730

BLAST of Sed0020888 vs. ExPASy TrEMBL
Match: D3IVT9 (Putative retrotransposon protein OS=Phyllostachys edulis OX=38705 PE=4 SV=1)

HSP 1 Score: 152.5 bits (384), Expect = 9.7e-34
Identity = 74/105 (70.48%), Positives = 85/105 (80.95%), Query Frame = 0

Query: 1    MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
            MDVKTAFLN  L EDVYMT PEGF DP N  +V +LQKSIYGLK+ SRSWN+RFDE  K+
Sbjct: 901  MDVKTAFLNGKLSEDVYMTQPEGFVDPNNASKVCKLQKSIYGLKQASRSWNIRFDEEIKR 960

Query: 61   FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
            F F+KN+EE CVY K+SG ++  L+LYVDDILLIGND  MLESVK
Sbjct: 961  FGFVKNKEEPCVYMKVSGSTLVILILYVDDILLIGNDIPMLESVK 1005

BLAST of Sed0020888 vs. ExPASy TrEMBL
Match: Q2QQ28 (Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. japonica OX=39947 GN=LOC_Os12g32440 PE=4 SV=2)

HSP 1 Score: 152.5 bits (384), Expect = 9.7e-34
Identity = 73/105 (69.52%), Positives = 85/105 (80.95%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKT FLN  L EDVYMT P+GF DP +  ++ +LQKSIYGLK+ SRSWN+RFDE  K 
Sbjct: 690 MDVKTTFLNGNLDEDVYMTQPKGFVDPQSAKKICKLQKSIYGLKQASRSWNIRFDEVVKA 749

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
            RF+KNEEE CVYKK+SG ++ FL+LYVDDILLIGND  MLESVK
Sbjct: 750 LRFVKNEEEPCVYKKISGSALVFLILYVDDILLIGNDILMLESVK 794

BLAST of Sed0020888 vs. ExPASy TrEMBL
Match: D3IVU0 (Putative retrotransposon protein OS=Phyllostachys edulis OX=38705 PE=4 SV=1)

HSP 1 Score: 152.1 bits (383), Expect = 1.3e-33
Identity = 73/105 (69.52%), Positives = 85/105 (80.95%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQDPANPGRVYRLQKSIYGLKKVSRSWNLRFDEAFKQ 60
           MDVKTAFLN  L EDVYMT PEGF DP N  +V +LQKSIYGLK+ SRSWN+RFDE  K+
Sbjct: 485 MDVKTAFLNGNLHEDVYMTQPEGFVDPNNASKVCKLQKSIYGLKQASRSWNIRFDEEIKR 544

Query: 61  FRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
           F F+KN+EE CVY K+SG ++  L+LYVDDILL+GND  MLESVK
Sbjct: 545 FGFIKNKEEPCVYMKVSGSTLVILILYVDDILLVGNDIPMLESVK 589

BLAST of Sed0020888 vs. TAIR 10
Match: AT4G23160.1 (cysteine-rich RLK (RECEPTOR-like protein kinase) 8 )

HSP 1 Score: 74.7 bits (182), Expect = 5.0e-14
Identity = 37/109 (33.94%), Positives = 61/109 (55.96%), Query Frame = 0

Query: 1   MDVKTAFLNAFLKEDVYMTHPEGFQ----DPANPGRVYRLQKSIYGLKKVSRSWNLRFDE 60
           +D+  AFLN  L E++YM  P G+     D   P  V  L+KSIYGLK+ SR W L+F  
Sbjct: 193 LDISNAFLNGDLDEEIYMKLPPGYAARQGDSLPPNAVCYLKKSIYGLKQASRQWFLKFSV 252

Query: 61  AFKQFRFLKNEEEFCVYKKLSGISVTFLVLYVDDILLIGNDRTMLESVK 106
               F F+++  +   + K++      +++YVDDI++  N+   ++ +K
Sbjct: 253 TLIGFGFVQSHSDHTYFLKITATLFLCVLVYVDDIIICSNNDAAVDELK 301

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value    Identity (%)   Description
BAA22288.1      1.3e-35    71.43          polyprotein [Oryza australiensis]
KAG7543183.1    1.6e-35    73.33          Reverse transcriptase RNA-dependent DNA polymerase [Arabidopsis thaliana x Arabi...
KAF5783325.1    6.2e-35    72.38          putative RNA-directed DNA polymerase [Helianthus annuus]
KAF5788164.1    6.2e-35    72.38          putative RNA-directed DNA polymerase [Helianthus annuus]
KAF5757832.1    3.1e-34    70.48          putative RNA-directed DNA polymerase [Helianthus annuus]

ExPASy Swiss-Prot
Match Name      E-value    Identity (%)   Description
P10978          5.0e-19    42.45          Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum...
Q94HW2          8.8e-16    36.79          Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
Q9ZT94          7.4e-15    36.79          Retrovirus-related Pol polyprotein from transposon RE2 OS=Arabidopsis thaliana O...
P04146          1.3e-14    39.25          Copia protein OS=Drosophila melanogaster OX=7227 GN=GIP PE=1 SV=3
P25600          2.0e-12    33.33          Putative transposon Ty5-1 protein YCL074W OS=Saccharomyces cerevisiae (strain AT...

ExPASy TrEMBL
Match Name      E-value    Identity (%)   Description
O23864          6.1e-36    71.43          Polyprotein OS=Oryza australiensis OX=4532 PE=4 SV=1
A0A6D2HJE2      3.3e-34    70.48          CCHC-type domain-containing protein OS=Microthlaspi erraticum OX=1685480 GN=MERR...
D3IVT9          9.7e-34    70.48          Putative retrotransposon protein OS=Phyllostachys edulis OX=38705 PE=4 SV=1
Q2QQ28          9.7e-34    69.52          Retrotransposon protein, putative, Ty1-copia subclass OS=Oryza sativa subsp. jap...
D3IVU0          1.3e-33    69.52          Putative retrotransposon protein OS=Phyllostachys edulis OX=38705 PE=4 SV=1

TAIR 10
Match Name      E-value    Identity (%)   Description
AT4G23160.1     5.0e-14    33.94          cysteine-rich RLK (RECEPTOR-like protein kinase) 8

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term    IPR Description                                       Source        Source Term   Source Description    Alignment
IPR013103   Reverse transcriptase, RNA-dependent DNA polymerase   PFAM          PF07727       RVT_2                 coord: 1..105; e-value: 6.9E-27; score: 94.6
None        No IPR available                                      PANTHER       PTHR45895     FAMILY NOT NAMED      coord: 1..103
IPR043502   DNA/RNA polymerase superfamily                        SUPERFAMILY   56672         DNA/RNA polymerases   coord: 1..104

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Sed0020888.1    Sed0020888.1    mRNA
Sed0020888.2    Sed0020888.2    mRNA
Sed0020888.3    Sed0020888.3    mRNA
Sed0020888.4    Sed0020888.4    mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0015074       DNA integration
molecular_function   GO:0003676       nucleic acid binding
molecular_function   GO:0008270       zinc ion binding