Sed0026194 (gene) Chayote v1

Overview
NameSed0026194
Typegene
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase domain-containing protein
LocationLG02: 32365663 .. 32370277 (+)
RNA-Seq ExpressionSed0026194
SyntenySed0026194
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCATTTTCGACAATTTTGCTCCAGTGATGGCAAAGCCGCGGTCAATTAGGGGTGAGGTGGAAGTCACCGAAGATGGGAAGATGGTTGGAAAAATTAAAACGGCCGATTTAAATGGGGTAGGAGGGTTATTGAAGGAGATGGAAATTAACATAAATGAGGTAATGAGGGTTATGGAAAGGTTGGAAGAAGGAGAGGTTTTTAAAGGTGTGGTCCCTCCCTCTTCATCAATGCAAAAAAAGGGGATTTTTTTGGAAAATAACGTTGGCTTTTTTATTTTATTGGTGGGACCCATCATTAAGAATTTTATTTTTGAAAACTATTTTGCTGCAGGATATTACTTATCCTTTTGGGATGGCAAAAGTTTGGAATTGGAAACATAAAAATGAGCTATTGGACAAGAATTCTTCCCAATCAACAAAGAGAAGCAAAAAGGTTACAAATAGTTGTACTATCGAAGTTGTGGAAACAGATTTTGTATCAAGTTCAATGGCGGCACGTGCACCCTGATATGCAGGGTCACCGAGCATTATGAAATTCTTATTGTGGAATGTTCGAGGGTTGGGCAACTCTTGGGCAATCTGTAATCTCAAGAAAGTTATTGCACTTTCTAAGTCCAATGTGTTATTTGTTTCAGAAAGCAAATGTGGTGGTACTCGAGCTAAGTTTGTTAAGACTCAAATAGGTTTTGATAATGTGTTTCATGTTGACTGCAATGGAAAAAGTGGTGGTCTTTGTCTTTTTTGGAATAATACTACGAATGTCTCTATTCTATCGTATAACAATTGTTATATTGATGTGATGATAAACAGTGTTGATAGACATTAGCGACTAACTGGTTTTTATGGAGACCCTTGTTCTACTAAACGGAGGTCTTCATGGGAACTATTACAACGTCTTAGTGGAATGTATAATGGACCATGGGTAATAGCAGGTGATTTCATTGAAATATTGTACGAACATGAGAAATGGGGAGGTTCAAAAAAATCACAAAATCAGATGGAAGATTTTCGAGAGGCAATTCAGTCTTGTGAATTGTTCGATATGGGTTTTAAGAGATCTTATTACACCTGGTATAGAACTGTGAATAAGCAAGTTGTTTTGATGGAACGCTTGGATAGAGCGTTATGTAACTCAGCCTTTTGGGATTTGTACCTTTTTTCGGTTGTGCAACATCTTGATTTTTTGGGGTCAGATCATTGTCCAATTAAGATTAGTGTTTCTAAATACCCTAGAAATTTTGGTGGTAAGAAGAATAAAATTTTTCGATTTGAAAAGGTTTGGACTATGCATGAAGATTGTTATTCCATTGTGAAAGAGGGTGGATGGTTATGACTGATAAGAGGTCTGTTGACTATACGAGAGACTTTTTGAAAAAACTCCATTCGTGTAAGTAGGCATTATCTAGATGGGGTGTGCAAAAGAAACTGGATTATGGAAGAAAAATTAATAGAGCAAGAATGGATTTGCAAACTGCAATTGATTCTAATAGAATGCTCAAGGCCTAAGGCAAAGACTTGAAAGTCTTTTGAATGAGGAGGAAATTTACTAGCGCCAACATTCTAGGGTCGACTGGCTAAAATGGGGTGACAGAAACTCTAGGTGGTTTCATTTATGTGCTTCTCAAAGGCGCATAACAAATATGCTTGACAAGATTTGTCGGGAGGATGGCAGCTAGGTGATTAATGAGGCTGAATTGCAAGATGAAGTGTGCGAGTACTTTCCTAAGTTGTTTACGTCGAATAGGGATGCCCATAAGGAGTCTCATCAGCCTTTCTTTGATGAGGTTATTTCTCTTTTTAATTATGATCAAATTTTTTACATGTCTCGACCTTGTTCTGAAGAGGAGGTGCTTGTTTCTTTAAAAAAATATGAATCCCTTGAAAGCTCCGGGTATTTATGGATTTACAGTTTCGTTTTTCAAAAAATATTGGTCCATTGTTGGGAACGACGTGTTAACATTTTGCCTAGGTGTTCTGAATGAGGGAAACCCGTTAAATGACATCAATTGAACAGTTATTACGTTAATCCCTAAGAGTTCAACTGCTTGCTATATAAAAGATTATCGCCCTATCAGCTTGTGTAATGTGATTTATAAGTTGATTTCAAAAACAGTGGCAAATAGGCTTAGAGAACTTTTGAACGATGTTATTTCTCCATGTCAAAGCGCCTTTGTCCCAGGTAGAAATATTTCTTATAATGCATTGTTGGTGTTCGAATATATTCATTCTCTAAAGAGTAGAAGGAAAGGAAATATTGGGATGGCTGCACTAAAGCTTGATATGAGTAAGGCCGATGATAGAGTGGAGTGGAGTTTTATTTCGAGGTTTAGGACTGCTTTAGGGTTTGCTCCAAACTTTATTAAGCTGATTATGAAGTGTATTATGTCAGTCACTTATACTTTTAATATTAATGGGGGTAGATGTGGTACAGTTATACCTTCACGAAGATTGAGGCAAGGTGACCCATTATCCGCTTATTTGTTCTTGCTTTGCTCGGAAGGATTATCTCATATTCTAAGTTGATGGAAGAAAAGAAAGATATTGTTGGTTTGAAGGTGGCACGACGATCACCTTCAATTATGCATTTATTCTTTGCAGATGATTTCCTTTTATTCTTTAAGGCAAATCAAAAGGAAACGAATAATATTCAAGATGCTTTGAAATTATACTCCGTATTGACAGGTCATGAGATAAATTTTGAAAAGTCTGCTTTCTATTTTAGTCCAAATGTTAAAAGCTCAGACATCGATGGGTTGATTAGTCAGCTGAACGTTAGTAAGGTGGATTGTCATACCAAGTACTTGAGGATTCTGGTATCTTTTTCATCCTCCAAGAAGTTTTCATTTAAGTTCTTGAAGGAAAAATTATGGAAGTTCTTATTTAAATGGAAGCACAAAAAAATTTCAAGTGGAGGGAAGGAAGTTCTAATCAAGGTTGTTTTTCAAGCCTTGCCAACATATTCTATGAATTGTTTTAAGATTCCCAAATCGATAATCAAGGATTGTCATCGAATTATTTCTCAATTTTGGTGGGGTGATGATGATACACAAAAAATAATTCATTTGGTGACGTGGAAGAAGGCGTGGACTCCAAAACGGAATAGGGACTTGGATTTAGAGATCTTGAGTTTTTCAACCAAGCTCTTTTGGCAAAACAAGGGTGGCATTTAATTATGGATCCCGAGTCTCTTCTGGCAAGGGTGCTTCGAGGTAAATATTATCATTCCTCTAATTTCCTTGATGCTAGAATGTTTCAAAATGCTTCGTATACTTGGAAGAGTATATTGTGGGGACGAGAGCTATTGTTGAAAGGCATTAGGTGGCGAATTGGAAATGGGAAAAATGTTAAAATTCTTTCAGATAATTGGCTTTGTCGTGTTCCATCTATGAGGCCGATCATTACAACAGGCATTGATCCAAATTTGAAGGTTAGTTCCTTAATATTGTCAAATGGAAATTGGGATGTTGAAAAAATTAATTTTGATGTTTGGGAAGGACGATGCGAGTTTAATCAAGCAACTTCCCTGATCGAGACAGAATGGAAGTGATGTGCTACTCTAGAATTATGAGAAGAATGGAATTTTCAGAGTTAAATCGGTGCATTGGCTTGCAAATTTGCTTCGTAGTGGTCCTTCATGTTCTAATAGTGAATTGATTGAACAATGGTGGAATGCGTTGTGGAGGTGCAAGTTACCAAATAAGATACAATTTTGGTTTGGCGTATCTTTAATAATTTTGTCCCTACTAGAGTGAATCTCAATAAACGTGGAATTATGAATGATGTGTTGTGTCCTCGTTGTAAAAATTTTTCGAAGACTACTTTACATGCAATCTGGCAATGTAAAAAAGTGAAGTCTAATTGGGAGGAAACCCTTGGTAATTTAAATGAGGCTGCTGATTTGATTGGATGGTTCTTTGAAAAATTGCCAATTACTAAATTGGAGGAGTTCTTATTTATGTGTTGGTGTGTGTGGAAAAAAAGAAATAGGGAGGTGGTTGGTTTCAAGGGTGGAGGCATCAATGCAGAAATACACTTAAATTTTAATTGGGAGTATTGTTGTCAGTTTTTTGCTGAGTTCAGGCAAACAAAAGTGAGTTTGTGTGACGGAGTTACAAATCACAGTGGGTTTTCTCTAAGTCAGAATTTTTGGTCTCCTCCAAGAAGTGATAACTATAAATTAAATACTGATGCCTTTATAAATTTAAAGGAAGGAAGAAGTGGATATGGGGCTATTATTCAAAATTACAAGGGTGAGGTAATGTTCTCAATGTCTCAACCAGTCGAGTGGATTGTTGATCCAGAAATTATGGAAGCATTAGCTATTAGAGAAGGAGTGGAAATGGCCTTTGAACTTGGTTTCCAACGTATTGAGGTGGAGTTCGATGCCTTACGTGTGATTAATCTGTTACAAAAACATTGCAAGAATCAAACAGAGGTTGGAAGAATCATTGAAGAAATGCTGCAAATGGCAAGGAATTTCAAGTTTATCTCTTTCAAATGGTGTAATCGGGAGACAAATATTCTTGCCCATAAACTAGCACACATGGCTAGTATTGACAATCAAGAAGGAAGATGGATGGAAAAATGTCCAGATATTCTAAATGGCCTTTAG

mRNA sequence

ATGGCCATTTTCGACAATTTTGCTCCAGTGATGGCAAAGCCGCGGTCAATTAGGGGTGAGGTGGAAGTCACCGAAGATGGGAAGATGGTTGGAAAAATTAAAACGGCCGATTTAAATGGGGTAGGAGGGTTATTGAAGGAGATGGAAATTAACATAAATGAGGTAATGAGGGTTATGGAAAGGTTGGAAGAAGGAGAGGATATTACTTATCCTTTTGGGATGGCAAAAGTTTGGAATTGGAAACATAAAAATGAGCTATTGGACAAGAATTCTTCCCAATCAACAAAGAGAAGCAAAAAGCGACTAACTGGTTTTTATGGAGACCCTTGTTCTACTAAACGGAGGTCTTCATGGGAACTATTACAACGTCTTAGTGGAATGTATAATGGACCATGGGTAATAGCAGGTGATTTCATTGAAATATTGTACGAACATGAGAAATGGGGAGGTTCAAAAAAATCACAAAATCAGATGGAAGATTTTCGAGAGGCAATTCAGTCTTGTGAATTGTTCGATATGGGTTTTAAGAGATCTTATTACACCTGGTATAGAACTGTGAATAAGCAAGTTGTTTTGATGGAACGCTTGGATAGAGCGTTATGTAACTCAGCCTTTTGGGATTTGTACCTTTTTTCGGTTGTGCAACATCTTGATTTTTTGGGGTCAGATCATTGTCCAATTAAGATTAGTGTTTCTAAATACCCTAGAAATTTTGGTGGTAAGAAGAATAAAATTTTTCGATTTGAAAAGGTTTGGACTATGCATGAAGATTGTTATTCCATTACTACTTTACATGCAATCTGGCAATGTAAAAAAGTGAAGTCTAATTGGGAGGAAACCCTTGGTAATTTAAATGAGGCTGCTGATTTGATTGGATGGTTCTTTGAAAAATTGCCAATTACTAAATTGGAGGAGTTCTTATTTATGTGTTGGTGTGTGTGGAAAAAAAGAAATAGGGAGGTGGTTGGTTTCAAGGGTGGAGGCATCAATGCAGAAATACACTTAAATTTTAATTGGGAGTATTGTTGTCAGTTTTTTGCTGAGTTCAGGCAAACAAAAGTGAGTTTGTGTGACGGAGTTACAAATCACAGTGGGTTTTCTCTAAGTCAGAATTTTTGGTCTCCTCCAAGAAGTGATAACTATAAATTAAATACTGATGCCTTTATAAATTTAAAGGAAGGAAGAAGTGGATATGGGGCTATTATTCAAAATTACAAGGGTGAGGTAATGTTCTCAATGTCTCAACCAGTCGAGTGGATTGTTGATCCAGAAATTATGGAAGCATTAGCTATTAGAGAAGGAGTGGAAATGGCCTTTGAACTTGGTTTCCAACGTATTGAGGTGGAGTTCGATGCCTTACGTGTGATTAATCTGTTACAAAAACATTGCAAGAATCAAACAGAGGTTGGAAGAATCATTGAAGAAATGCTGCAAATGGCAAGGAATTTCAAGTTTATCTCTTTCAAATGGTGTAATCGGGAGACAAATATTCTTGCCCATAAACTAGCACACATGGCTAGTATTGACAATCAAGAAGGAAGATGGATGGAAAAATGTCCAGATATTCTAAATGGCCTTTAG

Coding sequence (CDS)

ATGGCCATTTTCGACAATTTTGCTCCAGTGATGGCAAAGCCGCGGTCAATTAGGGGTGAGGTGGAAGTCACCGAAGATGGGAAGATGGTTGGAAAAATTAAAACGGCCGATTTAAATGGGGTAGGAGGGTTATTGAAGGAGATGGAAATTAACATAAATGAGGTAATGAGGGTTATGGAAAGGTTGGAAGAAGGAGAGGATATTACTTATCCTTTTGGGATGGCAAAAGTTTGGAATTGGAAACATAAAAATGAGCTATTGGACAAGAATTCTTCCCAATCAACAAAGAGAAGCAAAAAGCGACTAACTGGTTTTTATGGAGACCCTTGTTCTACTAAACGGAGGTCTTCATGGGAACTATTACAACGTCTTAGTGGAATGTATAATGGACCATGGGTAATAGCAGGTGATTTCATTGAAATATTGTACGAACATGAGAAATGGGGAGGTTCAAAAAAATCACAAAATCAGATGGAAGATTTTCGAGAGGCAATTCAGTCTTGTGAATTGTTCGATATGGGTTTTAAGAGATCTTATTACACCTGGTATAGAACTGTGAATAAGCAAGTTGTTTTGATGGAACGCTTGGATAGAGCGTTATGTAACTCAGCCTTTTGGGATTTGTACCTTTTTTCGGTTGTGCAACATCTTGATTTTTTGGGGTCAGATCATTGTCCAATTAAGATTAGTGTTTCTAAATACCCTAGAAATTTTGGTGGTAAGAAGAATAAAATTTTTCGATTTGAAAAGGTTTGGACTATGCATGAAGATTGTTATTCCATTACTACTTTACATGCAATCTGGCAATGTAAAAAAGTGAAGTCTAATTGGGAGGAAACCCTTGGTAATTTAAATGAGGCTGCTGATTTGATTGGATGGTTCTTTGAAAAATTGCCAATTACTAAATTGGAGGAGTTCTTATTTATGTGTTGGTGTGTGTGGAAAAAAAGAAATAGGGAGGTGGTTGGTTTCAAGGGTGGAGGCATCAATGCAGAAATACACTTAAATTTTAATTGGGAGTATTGTTGTCAGTTTTTTGCTGAGTTCAGGCAAACAAAAGTGAGTTTGTGTGACGGAGTTACAAATCACAGTGGGTTTTCTCTAAGTCAGAATTTTTGGTCTCCTCCAAGAAGTGATAACTATAAATTAAATACTGATGCCTTTATAAATTTAAAGGAAGGAAGAAGTGGATATGGGGCTATTATTCAAAATTACAAGGGTGAGGTAATGTTCTCAATGTCTCAACCAGTCGAGTGGATTGTTGATCCAGAAATTATGGAAGCATTAGCTATTAGAGAAGGAGTGGAAATGGCCTTTGAACTTGGTTTCCAACGTATTGAGGTGGAGTTCGATGCCTTACGTGTGATTAATCTGTTACAAAAACATTGCAAGAATCAAACAGAGGTTGGAAGAATCATTGAAGAAATGCTGCAAATGGCAAGGAATTTCAAGTTTATCTCTTTCAAATGGTGTAATCGGGAGACAAATATTCTTGCCCATAAACTAGCACACATGGCTAGTATTGACAATCAAGAAGGAAGATGGATGGAAAAATGTCCAGATATTCTAAATGGCCTTTAG

Protein sequence

MAIFDNFAPVMAKPRSIRGEVEVTEDGKMVGKIKTADLNGVGGLLKEMEININEVMRVMERLEEGEDITYPFGMAKVWNWKHKNELLDKNSSQSTKRSKKRLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNTDAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASIDNQEGRWMEKCPDILNGL
Homology
BLAST of Sed0026194 vs. NCBI nr
Match: TXG69190.1 (hypothetical protein EZV62_004125 [Acer yangbiense])

HSP 1 Score: 216.5 bits (550), Expect = 5.6e-52
Identity = 131/423 (30.97%), Postives = 210/423 (49.65%), Query Frame = 0

Query: 101 RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMED 160
           RLTGFYG P  T+R   W LL+RL+GM   PW + GDF EI+   EK GG+ + +  M +
Sbjct: 459 RLTGFYGHPNLTQRIHGWNLLRRLAGMSPLPWFVGGDFNEIVGLSEKVGGNTRHECFMGN 518

Query: 161 FREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL 220
           F+EA++ C L D+GF    +TW    + +  + ERLDR + N+ + DL+    ++HLDF 
Sbjct: 519 FQEALEDCGLRDLGFLGPRFTWSNRRDSEHAIQERLDRCVGNTGWLDLFSCFSIKHLDFW 578

Query: 221 GSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEET 280
            SDH PI + +S      GG+    F +++ W   +D               V+ ++ + 
Sbjct: 579 KSDHRPILLEISDKKEEAGGRHR--FYYDRCWAERDDF--------------VRLDFSQL 638

Query: 281 LGNLNEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWE 340
                   D + +   KL I   E    + W VW +RN+ V        +  +H     +
Sbjct: 639 -----PFIDFVLFCKGKLDIMYFEFLCVVWWRVWYRRNQLVYEKS----SQTVHDFDVLD 698

Query: 341 YCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNF---WSPPRSDNYKLNTDAFINLKEGRSG 400
           +   F  +F+  K       T  +G  + Q     W P  S +YK+NTDA ++ +   +G
Sbjct: 699 WAASFIQDFKAAK-------TVDTGSVVKQRVAPKWKPSPSGSYKINTDATLDCRAKVTG 758

Query: 401 YGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVIN 460
            G +I++  G VM S+ Q    ++ P+ +EA+A+  G  +A E G     +E D+L V+N
Sbjct: 759 IGVVIRDCYGHVMASLCQSFPGLLQPQTVEAVAVLRGFRLALEAGLCPASIESDSLSVVN 818

Query: 461 LLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASIDNQEGRWME 520
           L+      + E+G ++ ++L M  N  F S  +  R TN +AH LA ++     E  W+E
Sbjct: 819 LINSMDILRAEIGVVLHDILAMGSNSFFSSVSFVPRLTNCVAHSLAKLSLSFEGEHVWLE 849

BLAST of Sed0026194 vs. NCBI nr
Match: RYR02999.1 (hypothetical protein Ahy_B06g081839 [Arachis hypogaea])

HSP 1 Score: 161.4 bits (407), Expect = 2.1e-35
Identity = 116/432 (26.85%), Postives = 192/432 (44.44%), Query Frame = 0

Query: 106  YGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAI 165
            YG+P   KRR  W+ L   +     P    GDF +IL + EK G   + +N +E FR+ +
Sbjct: 616  YGNPVFQKRRKLWQELTISNMNKEEPQAYMGDFNDILSQDEKVGVHPQPKNCLETFRKFV 675

Query: 166  QSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLGSDHC 225
                L D+  K S YTW+  +    V  ERLDR L N  +  +Y   +++    + SDHC
Sbjct: 676  DDNGLMDVDLKGSRYTWFSNLRNNFVTRERLDRVLVNWKWLQMYQNVILKAAPAMSSDHC 735

Query: 226  PIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSN-WEETLGNL 285
             + +     PR   G+  K F+FE  W  HE+C  +  +   WQ  +   N W + +   
Sbjct: 736  ALILETQ--PR---GRIKKEFKFEAFWADHEECKEV--IRNSWQQDEGNRNCWNQFIRKR 795

Query: 286  NEA-ADLIGWFFEKLPITKLEEFLFM------CWCVWKKRNREV-----VGFKGGGINAE 345
            N    +LI W  +    T  E+   +      CWC+WK RN+ +     +  K   INAE
Sbjct: 796  NRCKRELIEWIRKIKSGTGKEQDRILCKLGSVCWCIWKARNQHIYQQIRINPKQAIINAE 855

Query: 346  IHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQN--FWSPPRSDNYKLNTDAFIN 405
                       Q   ++  T  S     T+ +  S  +    W PP  +  K NTDA  +
Sbjct: 856  -----------QLATDYHNTTRSRSTDNTSRADRSGERKRITWRPPPQNRLKANTDAAFH 915

Query: 406  LKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEI-MEALAIREGVEMAFELGFQRIEVE 465
             + G +    ++++++G+++   +    +I +  I  EA A RE + +   L      +E
Sbjct: 916  RESGIAAAAVVVRDWQGKIITGTTS--RFITNSAIAAEAQAYREALILIRNLQMDNCLIE 975

Query: 466  FDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASID 522
             D L ++  ++       E   II ++ Q+      +   W  RE N +AH+LA MA+ +
Sbjct: 976  TDCLPLVQAIKARMP-IAEADAIIRDIFQLLDETPDVGATWTPREGNSVAHQLAAMAAGN 1026

BLAST of Sed0026194 vs. NCBI nr
Match: KAG6624235.1 (hypothetical protein CIPAW_16G012000 [Carya illinoinensis])

HSP 1 Score: 155.6 bits (392), Expect = 1.2e-33
Identity = 122/476 (25.63%), Postives = 197/476 (41.39%), Query Frame = 0

Query: 120 LLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSY 179
           +L+ +    N  W+I GDF EIL   EKWGG  + ++QME FRE ++   L D+G++   
Sbjct: 1   MLKAMKPRGNEGWLIVGDFNEILTNDEKWGGKARPESQMELFREVLREGNLNDLGWRGDK 60

Query: 180 YTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFG 239
           YTW  +        ERLDRA+ N  + DL+    V+ +    SDH PI + ++       
Sbjct: 61  YTWSNSHTDDTFTKERLDRAIANPQWRDLFSEVWVEIMVARTSDHKPILVHLNTQIYGNE 120

Query: 240 GKKNKIFRFEKVWTMHEDCYSITTLHAIW---------------QCKKVKSNW------- 299
             K   F++E  W + EDC     L  IW               Q ++V  +W       
Sbjct: 121 PLKRMGFKYEASWALDEDCGGF--LSEIWRGKEGEPKSIINLLNQSRRVLQSWSKHMRQR 180

Query: 300 -----------------EETLGNL-------------------------------NEAAD 359
                            EE   N+                               N+   
Sbjct: 181 ERKEMEEKTKQLKALQAEECSSNVEELRKVTWECPAANDIWGQDESGVKKWDRSENDFLS 240

Query: 360 LIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEF 419
           + G    ++P   LEE   +   VW +RN  +   +      EI    +     Q   EF
Sbjct: 241 MWGKLMNRVPKNLLEEVAVLFRKVWLRRNDWLFEGRKACPRKEI---ISTRAALQ---EF 300

Query: 420 RQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNTDAFINLKEGRSGYGAIIQNYKGEV 479
           +  + +    V   SG + S   W  P  D  K+N DA   +K+ R G G +I++  GE 
Sbjct: 301 KDLQGNNSKQVAVQSGLTGSLQ-WEKPAPDYVKVNWDASTEIKQNRMGIGIMIRDEHGEA 360

Query: 480 MFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEV 526
           + ++    E ++D  + E +A+R+ VE+   L  ++   E DA  V+  +Q+  ++    
Sbjct: 361 LVAVCDRRENVMDAAVAECVALRKAVELCIALNIRKAIFEGDAKVVVKAVQEAEEDLPLF 420

BLAST of Sed0026194 vs. NCBI nr
Match: XP_042942839.1 (uncharacterized protein LOC122277021 [Carya illinoinensis])

HSP 1 Score: 153.3 bits (386), Expect = 5.8e-33
Identity = 78/194 (40.21%), Postives = 116/194 (59.79%), Query Frame = 0

Query: 102 LTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDF 161
           LTGFYG+P +TKRR SW+LLQ L       W+  GDF E+L   +K  G ++  NQ+E F
Sbjct: 69  LTGFYGNPDTTKRRESWQLLQALKPTMGMGWMCIGDFNEVLSSGDKSRGRQRPFNQVEAF 128

Query: 162 REAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLG 221
           R A +SC LFDMG+  + +TW    + Q  + ER+DRALCN  + +L+ FS V +L  L 
Sbjct: 129 RAATESCSLFDMGYVGNKFTWTNGRSGQAFIKERIDRALCNVEWSELFPFSKVYNLPILS 188

Query: 222 SDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIW-QCKKVKSNWEET 281
           SDHCPI +SV ++  +   +K++ FR+E  W + ED Y +  L   W + ++V     + 
Sbjct: 189 SDHCPILVSVEQFELD-STRKDRPFRYEASWALKEDFYEV--LEKAWSKPRRVDGKLCQV 248

Query: 282 LGNLNEA-ADLIGW 294
           +  LN+   +L+ W
Sbjct: 249 IEGLNQCRVELVRW 259

BLAST of Sed0026194 vs. NCBI nr
Match: KAF4381998.1 (hypothetical protein G4B88_006630 [Cannabis sativa])

HSP 1 Score: 146.7 bits (369), Expect = 5.5e-31
Identity = 95/312 (30.45%), Postives = 137/312 (43.91%), Query Frame = 0

Query: 101 RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMED 160
           R TGFYG+P ++ R  SW LL RL  +++ PW+  GDF EIL  +EK GGS +S + M +
Sbjct: 543 RFTGFYGNPKASCRSESWRLLCRLKDLFDLPWICGGDFNEILSINEKKGGSDRSMSAMTE 602

Query: 161 FREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL 220
           F+ A+  C L D+GF+   +TW         + ERLDR  CN  + DL+ F  V + DFL
Sbjct: 603 FQNALDRCSLADLGFEGQCFTWLNKRQGGAHVQERLDRYFCNQRWHDLFPFVKVLNGDFL 662

Query: 221 GSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVK-SNWEE 280
            SDH PI  ++    R     K + FRFE  W    +C  I  ++  W       +N + 
Sbjct: 663 NSDHRPIVATLENVSRRQRYDKKRCFRFETHWLKDPECQEI--INRSWLSLDCPLANQDS 722

Query: 281 TLGNLNEAADLIG-WFFEK----------------------LPITKLEEFL--------- 340
            +      AD +G W   K                       P+ ++EE L         
Sbjct: 723 LIDIFGLCADQLGMWNKSKYGSLPRQVRETQKQLDDLLSVSAPLVRMEEVLSKDEFELVS 782

Query: 341 FMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFS 377
            + W VW  RN  + G K   +N  +      E+  +   EF+  +V +  G     G S
Sbjct: 783 MVWWWVWYDRNSVLFGKKQSRLNGVV------EFAREALVEFQGARVGVIGGGAVVRGGS 842

BLAST of Sed0026194 vs. ExPASy TrEMBL
Match: A0A5C7IIT4 (Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_004125 PE=4 SV=1)

HSP 1 Score: 216.5 bits (550), Expect = 2.7e-52
Identity = 131/423 (30.97%), Postives = 210/423 (49.65%), Query Frame = 0

Query: 101 RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMED 160
           RLTGFYG P  T+R   W LL+RL+GM   PW + GDF EI+   EK GG+ + +  M +
Sbjct: 459 RLTGFYGHPNLTQRIHGWNLLRRLAGMSPLPWFVGGDFNEIVGLSEKVGGNTRHECFMGN 518

Query: 161 FREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL 220
           F+EA++ C L D+GF    +TW    + +  + ERLDR + N+ + DL+    ++HLDF 
Sbjct: 519 FQEALEDCGLRDLGFLGPRFTWSNRRDSEHAIQERLDRCVGNTGWLDLFSCFSIKHLDFW 578

Query: 221 GSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEET 280
            SDH PI + +S      GG+    F +++ W   +D               V+ ++ + 
Sbjct: 579 KSDHRPILLEISDKKEEAGGRHR--FYYDRCWAERDDF--------------VRLDFSQL 638

Query: 281 LGNLNEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWE 340
                   D + +   KL I   E    + W VW +RN+ V        +  +H     +
Sbjct: 639 -----PFIDFVLFCKGKLDIMYFEFLCVVWWRVWYRRNQLVYEKS----SQTVHDFDVLD 698

Query: 341 YCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNF---WSPPRSDNYKLNTDAFINLKEGRSG 400
           +   F  +F+  K       T  +G  + Q     W P  S +YK+NTDA ++ +   +G
Sbjct: 699 WAASFIQDFKAAK-------TVDTGSVVKQRVAPKWKPSPSGSYKINTDATLDCRAKVTG 758

Query: 401 YGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVIN 460
            G +I++  G VM S+ Q    ++ P+ +EA+A+  G  +A E G     +E D+L V+N
Sbjct: 759 IGVVIRDCYGHVMASLCQSFPGLLQPQTVEAVAVLRGFRLALEAGLCPASIESDSLSVVN 818

Query: 461 LLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASIDNQEGRWME 520
           L+      + E+G ++ ++L M  N  F S  +  R TN +AH LA ++     E  W+E
Sbjct: 819 LINSMDILRAEIGVVLHDILAMGSNSFFSSVSFVPRLTNCVAHSLAKLSLSFEGEHVWLE 849

BLAST of Sed0026194 vs. ExPASy TrEMBL
Match: A0A7J6GGL8 (CCHC-type domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_006630 PE=4 SV=1)

HSP 1 Score: 146.7 bits (369), Expect = 2.6e-31
Identity = 95/312 (30.45%), Postives = 137/312 (43.91%), Query Frame = 0

Query: 101 RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMED 160
           R TGFYG+P ++ R  SW LL RL  +++ PW+  GDF EIL  +EK GGS +S + M +
Sbjct: 543 RFTGFYGNPKASCRSESWRLLCRLKDLFDLPWICGGDFNEILSINEKKGGSDRSMSAMTE 602

Query: 161 FREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL 220
           F+ A+  C L D+GF+   +TW         + ERLDR  CN  + DL+ F  V + DFL
Sbjct: 603 FQNALDRCSLADLGFEGQCFTWLNKRQGGAHVQERLDRYFCNQRWHDLFPFVKVLNGDFL 662

Query: 221 GSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVK-SNWEE 280
            SDH PI  ++    R     K + FRFE  W    +C  I  ++  W       +N + 
Sbjct: 663 NSDHRPIVATLENVSRRQRYDKKRCFRFETHWLKDPECQEI--INRSWLSLDCPLANQDS 722

Query: 281 TLGNLNEAADLIG-WFFEK----------------------LPITKLEEFL--------- 340
            +      AD +G W   K                       P+ ++EE L         
Sbjct: 723 LIDIFGLCADQLGMWNKSKYGSLPRQVRETQKQLDDLLSVSAPLVRMEEVLSKDEFELVS 782

Query: 341 FMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFS 377
            + W VW  RN  + G K   +N  +      E+  +   EF+  +V +  G     G S
Sbjct: 783 MVWWWVWYDRNSVLFGKKQSRLNGVV------EFAREALVEFQGARVGVIGGGAVVRGGS 842

BLAST of Sed0026194 vs. ExPASy TrEMBL
Match: A0A1U8M810 (uncharacterized protein LOC107933986 OS=Gossypium hirsutum OX=3635 GN=LOC107933986 PE=4 SV=1)

HSP 1 Score: 146.4 bits (368), Expect = 3.5e-31
Identity = 72/193 (37.31%), Postives = 105/193 (54.40%), Query Frame = 0

Query: 73  GMAKVWN-------WKHKNELLDKNSSQSTKRSKKRLTGFYGDPCSTKRRSSWELLQRLS 132
           G++  WN       + +    +D   +    ++K R TGFYG+P    ++ SW LL++L 
Sbjct: 67  GLSLAWNGNNLIQVYSYSTYHIDVGINDKDNQNKWRFTGFYGNPRQANKQESWNLLRQLK 126

Query: 133 GMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRT 192
            MY+ PW + GDF EI+Y +EK GG  K + QM++FR  ++ C+L DMGF    +TW R 
Sbjct: 127 NMYSLPWCVCGDFNEIMYAYEKIGGRVKDERQMDEFRRVLEECDLVDMGFHGQKFTWERG 186

Query: 193 VNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKI 252
                 + ERLDR L N  + +L+    VQ+L    SDHCPI I ++   ++FG   N  
Sbjct: 187 NFADTNIRERLDRGLANLEWLNLFNDYFVQNLPHSFSDHCPILIKITPKMKHFG---NHF 246

Query: 253 FRFEKVWTMHEDC 259
           FRFE  W     C
Sbjct: 247 FRFESWWITESSC 256

BLAST of Sed0026194 vs. ExPASy TrEMBL
Match: A0A803QD63 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 145.6 bits (366), Expect = 5.9e-31
Identity = 75/189 (39.68%), Postives = 112/189 (59.26%), Query Frame = 0

Query: 101 RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMED 160
           R TGFYGDP  T+R  SW+LL+RLS MY GPW + G+F EIL + EK GGS K    + +
Sbjct: 294 RFTGFYGDPDPTQRIHSWKLLKRLSRMYLGPWAVGGNFNEILSQREKMGGSSKLSYLINN 353

Query: 161 FREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL 220
           FR+A+ SC+L D+GF+ S YTW     KQ ++ ERLD+   NS +++ +  ++V+HLD +
Sbjct: 354 FRKALDSCQLRDVGFEGSDYTWCNG-RKQNLIFERLDQVCGNSDWFEKFSQAIVKHLDCI 413

Query: 221 GSDHCPIKISVSKYPRN---FGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNW 280
            SDHCP+ ++  K P +      +    F FE  W   E+C  I  + ++W   +   + 
Sbjct: 414 NSDHCPLLLT-EKDPSSRMQHMARWRSRFHFESAWVDDEECTEI--VQSVWLMNEPIKHT 473

Query: 281 EETLGNLNE 287
           +E    L +
Sbjct: 474 KEVKNRLGK 478

BLAST of Sed0026194 vs. ExPASy TrEMBL
Match: A0A5C7I4W9 (RNase H domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_010516 PE=4 SV=1)

HSP 1 Score: 145.2 bits (365), Expect = 7.7e-31
Identity = 103/374 (27.54%), Postives = 169/374 (45.19%), Query Frame = 0

Query: 101 RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMED 160
           R +G YGDP  + R ++W L++RL  + N PWV  GDF E+L   EK GGS+K+   +  
Sbjct: 114 RFSGIYGDPNPSNRMNTWTLMRRLKEVDNLPWVCGGDFNELLSMSEKLGGSEKAIRDIIR 173

Query: 161 FREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL 220
           FR+ +  CE  D+GF    +TW      +  + ERLDR L ++ + ++Y    ++HL F 
Sbjct: 174 FRQVVDDCEFIDLGFSGPKFTWNNMREGRDNVQERLDRILASTNWRNMYQQITIEHLGFN 233

Query: 221 GSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEET 280
            SDH P+   +  + RN     N+    +K             +  +   + +K +  E 
Sbjct: 234 TSDHRPL---LMDWDRNTAYLHNRALVRKKK----------NYIPFLLDTRGIKQDSNEG 293

Query: 281 LGNL--------NEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGI--- 340
           + N+        N   D++  FF    + +L  F  + W +W+ RN  +V   G G+   
Sbjct: 294 MANVFFFYDLTYNYVIDILLLFFSTHSLDELNLFCMIMWAIWEHRN--LVSINGKGLLPV 353

Query: 341 ----NAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNT 400
                AE+ L+           EF QT +S    V   S   L    W  P     KLN+
Sbjct: 354 QVVSKAEVLLD-----------EF-QTSISAVILVARPSPRPLPCGDWLAPPPGRLKLNS 413

Query: 401 DAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQR 460
               N     +  G++I + KG+++ + ++    +   E    LA+ EG+ +A  LG   
Sbjct: 414 AVVTNNSYRNTALGSVICDDKGKIIAARARKFLGVFSKETSVLLALMEGLMLAKFLGLVV 460

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
TXG69190.15.6e-5230.97hypothetical protein EZV62_004125 [Acer yangbiense][more]
RYR02999.12.1e-3526.85hypothetical protein Ahy_B06g081839 [Arachis hypogaea][more]
KAG6624235.11.2e-3325.63hypothetical protein CIPAW_16G012000 [Carya illinoinensis][more]
XP_042942839.15.8e-3340.21uncharacterized protein LOC122277021 [Carya illinoinensis][more]
KAF4381998.15.5e-3130.45hypothetical protein G4B88_006630 [Cannabis sativa][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A5C7IIT42.7e-5230.97Uncharacterized protein OS=Acer yangbiense OX=1000413 GN=EZV62_004125 PE=4 SV=1[more]
A0A7J6GGL82.6e-3130.45CCHC-type domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_006630 P... [more]
A0A1U8M8103.5e-3137.31uncharacterized protein LOC107933986 OS=Gossypium hirsutum OX=3635 GN=LOC1079339... [more]
A0A803QD635.9e-3139.68Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A5C7I4W97.7e-3127.54RNase H domain-containing protein OS=Acer yangbiense OX=1000413 GN=EZV62_010516 ... [more]
Match NameE-valueIdentityDescription
InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 45..65
NoneNo IPR availablePANTHERPTHR33116REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATEDcoord: 108..261
NoneNo IPR availablePANTHERPTHR33116:SF39RETROTRANSPOSON, UNCLASSIFIED-LIKE PROTEINcoord: 108..261
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 380..511
e-value: 5.5E-16
score: 60.7
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 384..506
e-value: 1.4E-23
score: 83.1
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 73..231
e-value: 7.5E-16
score: 60.6
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 102..230
IPR044730Ribonuclease H-like domain, plant typeCDDcd06222RNase_H_likecoord: 383..504
e-value: 1.48266E-19
score: 82.362
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 380..504

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sed0026194.1Sed0026194.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity