Sed0018752 (gene) Chayote v1

Overview
Name: Sed0018752
Type: gene
Organism: Sechium edule (Chayote v1)
Description: RNase H domain-containing protein
Location: LG03: 47480564 .. 47482168 (-)
RNA-Seq Expression: Sed0018752
Synteny: Sed0018752
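
The location string in this record packs the linkage group, the start and end coordinates, and the strand into one field. The sketch below shows one way to split it apart in Python. It assumes the coordinates are 1-based and inclusive, which this page does not state explicitly; the names `Location` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed feature location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are inclusive, so both endpoints count.
        return self.end - self.start + 1


# Matches strings shaped like "LG03: 47480564 .. 47482168 (-)".
_LOCATION_RE = re.compile(r"^\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start, end and strand."""
    match = _LOCATION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


loc = parse_location("LG03: 47480564 .. 47482168 (-)")
print(loc.seqid, loc.strand, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1,605 bp on the minus strand of LG03.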
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGTTTAAGGAAGCGCATCGGTAATGGCCAGAACTCCTTTGTGTTTAAATCTCCTTGGATTCCAAGAGAGAACACTTTTCGGGCAATTTCACCTCATTGCCCAAGCATTAGTGGCATTCATGTTTGTGAATTCATTACTACAGATGGAGATTGGGATCAGGAGCTCCTTTCTGCTTTCCTTTGGAGGGAAGATGTAGAGTGCATTTGTTCAATTTCTATTTGCAAATCCTATGAAGAGGATAAATGGATTTGGCACTACTCAAAGGATGGAGATTTCTCGGTTAGAAGTGCTCACTGGCTGCTTAGACATAATTTTTTTGAAGCCTCTAGGGTAAATGGTGGGTCAAGTGAAGCTTGGTGGAAGAGTCTTTGGAACCTTAAAGTTCCAAATAAGTTGAAGCTTTTCATGTGGAGACTCTATCATGAATTTCTACCAACCAACCTTGTGTTATAAAAAAGAAAGGTTCCTGTTCAGCCTTGGTGTGTGATGTGTGGAAATTTTTATGAATCATTTTTCCATATATTTTATGCCTCTGGTCAATTTGGATCAATCAAAATAAAGTAAATTTTAATGAGGTTGTCCCCCCAGTTAATGTTCGAATTGAATGGATTCGTCAGTATTTGGAGGATTTTTTGAAATCTAATGATATGGAATCCCATAAGCATGCTGAGTATGATTCTTGCTCTGTTGTTGCTCCTATTCGTTGGATTGCGCCATGGCCGGGATTTCTCAAGCTTAATGTCGATGCTTCATGTTCCCCTTTAGCGCCGAAATTGGGGTTTGAATTAATTTTCAGAGATCACATGGGATTATGCAAATTTGCTTCTTCCATTTTCAAACCTGTGTTTTGTGATATCCTCTTAGCGGAAGCTTTGGCTTTGTTGGAGGGATTGAAGGTGGCTGATAATCTTGGATTCCATAATTTGATTATTGAATCTGATTCAAAGACACTTGTCGATGCTATTCTAGGGAATCATTTATCTCTTTCTCCTCAAGGTATTATCCTTGATGAAATTCGTCTGCTGCTAAAGAAATATGGCTCAAAAACTGTTTGTTTCATCCCTCGAAAATGTAATAGTGTTGCTCATAACTTAGCATCTCGGGCAGTTTTTGTAGGCCTTAATGGTTGCTGGAGTAGTATTTTGCCACATTGGCTTTCTGATTTGGTCAATCTGGACCTTTCTAATTGTAGCCCTTTGGGTTTTTCTGATATTGGTTAATAAAAATCCTGCTTTCTTTCAAAAAAATTCTAGCCACGACGTTATTTTCTCCAGAATCCCTCTTCGTCAAGAACACCATCGCCTCGCACCAGGCCATTATCTTCTCCAAGTCCTCTTTGGCAGGTTTTGCAAAAATGACAAGGTTTACTGAAGTTACACCTAGGCTTGGACAACTAAAACAACATCAATCCGGGCAATTATGAAACTGTGCAGTAATATACTCTGAAAGATCCAATTTTTCACCACGAACACATGCTAGGCTGCCATAGATACCGATAACTACGCTCGTACGTATGAACAGAATACCTAACATAAATACTAATGATTTGATTATAGACTTGTTGACAGGGTGACAAATATAAATGCTTGGCCAAAAAAGCAC

mRNA sequence

ATGGGTTTAAGGAAGCGCATCGGTAATGGCCAGAACTCCTTTGTGTTTAAATCTCCTTGGATTCCAAGAGAGAACACTTTTCGGGCAATTTCACCTCATTGCCCAAGCATTAGTGGCATTCATGTTTGTGAATTCATTACTACAGATGGAGATTGGGATCAGGAGCTCCTTTCTGCTTTCCTTTGGAGGGAAGATGTAGAGTGCATTTGTTCAATTTCTATTTGCAAATCCTATGAAGAGGATAAATGGATTTGGCACTACTCAAAGGATGGAGATTTCTCGGTTAGAAGTGCTCACTGGCTGCTTAGACATAATTTTTTTGAAGCCTCTAGGGTAAATGGTGGGTCAAGTGAAGCTTGGTGGAAGAGTCTTTGGAACCTTAAAGTTGTCCCCCCAGTTAATGTTCGAATTGAATGGATTCGTCAGTATTTGGAGGATTTTTTGAAATCTAATGATATGGAATCCCATAAGCATGCTGAGTATGATTCTTGCTCTGTTGTTGCTCCTATTCGTTGGATTGCGCCATGGCCGGGATTTCTCAAGCTTAATGTCGATGCTTCATGTTCCCCTTTAGCGCCGAAATTGGGGTTTGAATTAATTTTCAGAGATCACATGGGATTATGCAAATTTGCTTCTTCCATTTTCAAACCTGTGTTTTGTGATATCCTCTTAGCGGAAGCTTTGGCTTTGTTGGAGGGATTGAAGGTGGCTGATAATCTTGGATTCCATAATTTGATTATTGAATCTGATTCAAAGACACTTGTCGATGCTATTCTAGGGAATCATTTATCTCTTTCTCCTCAAGGTATTATCCTTGATGAAATTCGTCTGCTGCTAAAGAAATATGGCTCAAAAACTGTTTGTTTCATCCCTCGAAAATGTAATAGTGTTGCTCATAACTTAGCATCTCGGGCAGTTTTTGTAGGCCTTAATGGTTGCTGGAGTAGTATTTTGCCACATTGGCTTTCTGATTTGGTCAATCTGGACCTTTCTAATTGTAGCCCTTTGGGTTTTTCTGATATTGGTTAATAAAAATCCTGCTTTCTTTCAAAAAAATTCTAGCCACGACGTTATTTTCTCCAGAATCCCTCTTCGTCAAGAACACCATCGCCTCGCACCAGGCCATTATCTTCTCCAAGTCCTCTTTGGCAGGTTTTGCAAAAATGACAAGGTTTACTGAAGTTACACCTAGGCTTGGACAACTAAAACAACATCAATCCGGGCAATTATGAAACTGTGCAGTAATATACTCTGAAAGATCCAATTTTTCACCACGAACACATGCTAGGCTGCCATAGATACCGATAACTACGCTCGTACGTATGAACAGAATACCTAACATAAATACTAATGATTTGATTATAGACTTGTTGACAGGGTGACAAATATAAATGCTTGGCCAAAAAAGCAC

Coding sequence (CDS)

ATGGGTTTAAGGAAGCGCATCGGTAATGGCCAGAACTCCTTTGTGTTTAAATCTCCTTGGATTCCAAGAGAGAACACTTTTCGGGCAATTTCACCTCATTGCCCAAGCATTAGTGGCATTCATGTTTGTGAATTCATTACTACAGATGGAGATTGGGATCAGGAGCTCCTTTCTGCTTTCCTTTGGAGGGAAGATGTAGAGTGCATTTGTTCAATTTCTATTTGCAAATCCTATGAAGAGGATAAATGGATTTGGCACTACTCAAAGGATGGAGATTTCTCGGTTAGAAGTGCTCACTGGCTGCTTAGACATAATTTTTTTGAAGCCTCTAGGGTAAATGGTGGGTCAAGTGAAGCTTGGTGGAAGAGTCTTTGGAACCTTAAAGTTGTCCCCCCAGTTAATGTTCGAATTGAATGGATTCGTCAGTATTTGGAGGATTTTTTGAAATCTAATGATATGGAATCCCATAAGCATGCTGAGTATGATTCTTGCTCTGTTGTTGCTCCTATTCGTTGGATTGCGCCATGGCCGGGATTTCTCAAGCTTAATGTCGATGCTTCATGTTCCCCTTTAGCGCCGAAATTGGGGTTTGAATTAATTTTCAGAGATCACATGGGATTATGCAAATTTGCTTCTTCCATTTTCAAACCTGTGTTTTGTGATATCCTCTTAGCGGAAGCTTTGGCTTTGTTGGAGGGATTGAAGGTGGCTGATAATCTTGGATTCCATAATTTGATTATTGAATCTGATTCAAAGACACTTGTCGATGCTATTCTAGGGAATCATTTATCTCTTTCTCCTCAAGGTATTATCCTTGATGAAATTCGTCTGCTGCTAAAGAAATATGGCTCAAAAACTGTTTGTTTCATCCCTCGAAAATGTAATAGTGTTGCTCATAACTTAGCATCTCGGGCAGTTTTTGTAGGCCTTAATGGTTGCTGGAGTAGTATTTTGCCACATTGGCTTTCTGATTTGGTCAATCTGGACCTTTCTAATTGTAGCCCTTTGGGTTTTTCTGATATTGGTTAA

Protein sequence

MGLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFITTDGDWDQELLSAFLWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGSSEAWWKSLWNLKVVPPVNVRIEWIRQYLEDFLKSNDMESHKHAEYDSCSVVAPIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSIFKPVFCDILLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKYGSKTVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLSDLVNLDLSNCSPLGFSDIG
Homology
BLAST of Sed0018752 vs. NCBI nr
Match: XP_012836236.1 (PREDICTED: uncharacterized protein LOC105956874 [Erythranthe guttata])

HSP 1 Score: 147.9 bits (372), Expect = 1.6e-31
Identity = 107/352 (30.40%), Positives = 172/352 (48.86%), Query Frame = 0

Query: 2   GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFIT-TDGDWDQELLSAF 61
           G R RIGNG    +++  W+PRE TF+  +P       + V   I    G W   ++++ 
Sbjct: 342 GSRWRIGNGAKVTIWRDKWLPRERTFKVFTPRENWPIDMKVQSLIDGNTGLWKVNMIASM 401

Query: 62  LWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGS---- 121
            + ED + I SI I  +  ED+ IWHY+K+G FSVRSA+    H   E + VN  S    
Sbjct: 402 FYEEDRKSILSIPIGSAINEDRLIWHYNKNGLFSVRSAYHTAVHLEQEKAGVNIASGSSL 461

Query: 122 -SEAWWKSLWNLKVVPPVNVRIEW--------IRQYLED--FLKSNDMESHKHAEYD--- 181
            S   WK LW+L +   + + + W         RQ L +   L+++  E    AE D   
Sbjct: 462 VSSRSWKWLWDLNLPNKIKIFL-WRCCNNLLPTRQNLTNRKILENSLCEICGAAEEDVLH 521

Query: 182 ---SCSVV-----APIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSI 241
               CS       + I+W  P+   +K+NVDAS + +    G   + R   G C    S 
Sbjct: 522 CLSLCSFARQMQRSSIKWEVPFRDEVKINVDASLASVEHGCGLGGLGRTSDGNCIAWFST 581

Query: 242 FKPVFCDILLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDE 301
             P+F D   AEA+A L+ ++ A N  +  +++E DS T+V AI G   S +  G ++D+
Sbjct: 582 HCPLFIDPTSAEAMAALKAMEFAQNHRWSRVVLECDSSTIVAAIAGEFGSRTIYGNVIDD 641

Query: 302 IRLLLKKYGSKTVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLSDLV 327
           I+ L   +    +  + R+ N  AH +A  +    ++   S++LP+++ D+V
Sbjct: 642 IKRLASTFEVFKIRHVKREANRAAHEIARLS---SIDSFMSNVLPNFIIDIV 689
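
Each HSP header reports identity and positives as a fraction of the alignment length, followed by the same value as a percentage. As a rough illustration, the percentages can be recomputed from the fractions. The helper name `hsp_percentages` is hypothetical, and the regular expression is written against the header layout shown on this page rather than against any formal BLAST output specification.

```python
import re

# Captures "Identity = 107/352" style fractions.
# Tolerates a missing 'i' in "Positives".
_FRACTION_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(header: str) -> dict:
    """Recompute identity/positives percentages from an HSP header line."""
    result = {}
    for label, matched, aligned in _FRACTION_RE.findall(header):
        key = "identity" if label == "Identity" else "positives"
        result[key] = round(100 * int(matched) / int(aligned), 2)
    return result


header = "Identity = 107/352 (30.40%), Positives = 172/352 (48.86%), Query Frame = 0"
print(hsp_percentages(header))
```

Note that the denominator is the alignment length including gaps, not the query length, so percentages from different hits are not directly comparable as measures of coverage.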

BLAST of Sed0018752 vs. NCBI nr
Match: XP_028125150.1 (uncharacterized protein LOC114322087 [Camellia sinensis])

HSP 1 Score: 144.4 bits (363), Expect = 1.8e-30
Identity = 103/351 (29.34%), Positives = 162/351 (46.15%), Query Frame = 0

Query: 2   GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFITTD-GDWDQELLSAF 61
           GLR R+GNG++  V    W+PR +TF+ I P         V E I +  G W+ ELL+  
Sbjct: 96  GLRWRVGNGESISVKSDKWVPRPHTFKVILPPSTLPDNAKVSELIDSQLGIWNSELLNQQ 155

Query: 62  LWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLL----RHNFFEASRVNGGS 121
               D E I  I IC +   DK +WH++++G+ SVRSA+ L       +    +  +G S
Sbjct: 156 FLGVDGEAIRCIPICPTLPPDKLVWHFTRNGEISVRSAYHLCVDMGCSDGLGVTSDDGLS 215

Query: 122 SEAWWKSLWNLKV---------------VPPVNVRIEWIRQYLEDFLKSNDMESHKHAEY 181
            + +W +LW L++               +P  ++  E   + ++D +   +    ++A+ 
Sbjct: 216 KKQFWTALWQLQIPTKIKIFGWKMCLGLLPRNSILFEHKVRTIDDVVAFAEKYVTENAQV 275

Query: 182 DSCSVVAPI------RWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSIF 241
             C+ + P+      RW+AP  G  KLN D S        G  +I  D  GL   A    
Sbjct: 276 HQCNQI-PMHHLRVRRWVAPQEGVFKLNFDGSVRKTQALAGIGIIVPDSNGLVIAAMVEQ 335

Query: 242 KPVFCDILLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEI 301
                D+   EAL  ++ L+   +LG  N+ +E DS  +V AI G   +LS  G  +D  
Sbjct: 336 IHYLEDVDCVEALGAVKALQFGYDLGLDNIQLEGDSFNVVSAIQGKTENLSCYGHFVDVA 395

Query: 302 RLLLKKYGSKTVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLSDLV 327
           + LL ++ S ++  + R  NSVAH L+  A        W   +P  L  LV
Sbjct: 396 QALLLQFHSSSISHVYRDGNSVAHCLSRLAFDFDSRRIWMEEVPPSLMALV 445

BLAST of Sed0018752 vs. NCBI nr
Match: XP_012847426.1 (PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata])

HSP 1 Score: 143.3 bits (360), Expect = 3.9e-30
Identity = 114/410 (27.80%), Positives = 174/410 (42.44%), Query Frame = 0

Query: 2    GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFI-TTDGDWDQELLSAF 61
            G R RIGNG    ++   W+PR +TF+  +P     S + V   I +  G WD  +LS  
Sbjct: 1360 GTRWRIGNGDKVQIWGDRWLPRGSTFKPFTPRGQWPSDMKVSSLIDSVTGQWDPHILSQI 1419

Query: 62   LWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGSSEA- 121
               ED+ CI SI +  S  EDK +WHY+++G FSVRSA+++      E    N  SS + 
Sbjct: 1420 FVEEDINCILSIPLGSSINEDKLMWHYNRNGLFSVRSAYYIAVQMEKEKDGSNSASSSSS 1479

Query: 122  ----WWKSLWNLKVVPPVNV----------RIEWIRQ---YLEDFLKSND-------MES 181
                 WK LW LK+    +V          R  W      YL  + K          M+ 
Sbjct: 1480 TLSGSWKWLWTLKLPSDEDVLHCLALCTFARQVWALSGVPYLIHWPKDKSVIEWVLWMKQ 1539

Query: 182  HK-HAEYDSCSVV----------------------------------------------- 241
            H+  A+++ C V+                                               
Sbjct: 1540 HQDSAQFEYCVVICWAIWNARNKKLFEDMDKSAMDIILFAKKFTSDMRGLSSVVLSPRPL 1599

Query: 242  -----APIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSIFKPVFCDI 301
                 + IRW AP  G +K+N DAS   +    G   + RD  G C    SI    + D 
Sbjct: 1600 YSSKRSTIRWEAPPRGVVKINFDASLCSIDNGCGLGGLARDFDGRCVGWYSISCKQYFDP 1659

Query: 302  LLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKY 333
            + AEA+A L+ L+ A +  F  + +E DS  +V AI G   S +  G ++++I+ L   +
Sbjct: 1660 VTAEAMAALKALEFARDHDFRRVALEGDSSVIVAAIRGEDDSYTSYGNLINDIKRLATTF 1719

BLAST of Sed0018752 vs. NCBI nr
Match: KAF4371092.1 (hypothetical protein F8388_020819 [Cannabis sativa])

HSP 1 Score: 135.6 bits (340), Expect = 8.2e-28
Identity = 89/294 (30.27%), Positives = 146/294 (49.66%), Query Frame = 0

Query: 46  ITTDGDWDQELLSAFLWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHN 105
           I  +G+W  E +++ L +EDV  +  I+  K    D   W  + +G ++V S + L   +
Sbjct: 34  IIEEGEWKTEEIASCLHKEDVPWVLGITPSKE-TNDTIGWSRTINGQYTVASGYKLRFRD 93

Query: 106 FFEASRVNGGSSEAWWKSLWNLKVVPPVNVRIEWIRQYLEDFLKSNDMESHKHAEYDSCS 165
              A   +  +++AWWK + +       ++ I W  + LE  L     +SH+    D  S
Sbjct: 94  PNIAEYSDNSANKAWWKDIED-------SIWIPWAMEMLELHLAFAHKDSHQKPSKDKVS 153

Query: 166 VVAPIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSIFKPVFCDILLA 225
                 W +P  G   +N DAS     P  G  +I RDH+G    A++ + P    +L+A
Sbjct: 154 ------WSSPPLGSFMINTDASLIDGQPGCGLGVIIRDHLGALVTAATDYIPGCLSVLVA 213

Query: 226 EALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKYGSK 285
           E LA+   LK+A      N+ I SDS++++ A+ G     +  GII+++  L  K + + 
Sbjct: 214 ETLAIRLALKLAATRSMQNIFIASDSQSVITALKGQTRINTDWGIIIEDCILASKNFNNL 273

Query: 286 TVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLSDLVNLDLSNCSPLGFS 340
           +  FIPRKCN+VAH LA+ +  V ++  W+S LP   +  +  DL    PLG S
Sbjct: 274 SFIFIPRKCNNVAHCLANWSRLVHVSDVWTSFLPDCAAASLKADL----PLGAS 309

BLAST of Sed0018752 vs. NCBI nr
Match: XP_023897447.1 (uncharacterized protein LOC112009345 [Quercus suber])

HSP 1 Score: 131.3 bits (329), Expect = 1.5e-26
Identity = 95/371 (25.61%), Positives = 163/371 (43.94%), Query Frame = 0

Query: 2    GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFITTDG-DWDQELLSAF 61
            G R R+G+G N  V+   W+PR  ++  ISP         V EFI  +   W +E++   
Sbjct: 781  GCRWRVGSGSNIKVWGDKWLPRAASYEVISPRLFLHPETKVSEFICQERCCWKEEIIRQI 840

Query: 62   LWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGSS--- 121
             +  D+E I  I +   + ED+ IW  + +G FSVRSA+ +    +   + V+  S+   
Sbjct: 841  FFPVDMEVILGIPLSTRFPEDRVIWAETSNGGFSVRSAYRVAMGLYQAENAVSASSNSQL 900

Query: 122  EAWWKSLWNL----------KVVPPVNVR----------------------IEWIRQYLE 181
            +++WK LW+L          K++  V V                       ++W+ +YL 
Sbjct: 901  QSFWKKLWHLPVPHKSFDEEKIILVVTVAWAFWCNRNEIRHGAEKKSPEAIVQWVNRYLL 960

Query: 182  DFLKSNDMESHKHAEYDSCSVVAPIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHM 241
            ++  + +       E         + W  P P  LK+NVD + +     +G   + RD  
Sbjct: 961  EYSAATESVPAVREE-------VSVTWNPPPPSILKVNVDGATTKNLNFVGVGAVVRDEQ 1020

Query: 242  GLCKFASSIFKPVFCDILLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSL 301
            G    A S   P     L  E  A   GL++A ++G+ N+I+E DS  +V A+ G  LS 
Sbjct: 1021 GRVVAAMSRKIPAPLGPLEVEVKAFEAGLQLAKDMGYQNIILEGDSLIIVRALCGISLSS 1080

Query: 302  SPQGIILDEIRLLLKKYGSKTVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLSDL 337
            S    ++  ++L    + +  V  + R+ N  AH LA  A+ +  +  W    P  +  +
Sbjct: 1081 STIDSMIVGMQLFCSDFCTVYVSHVKRQENKPAHVLAKYALSINDSVVWIEETPCCIQQV 1140

BLAST of Sed0018752 vs. ExPASy TrEMBL
Match: A0A7J6FK63 (RNase H domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_020819 PE=4 SV=1)

HSP 1 Score: 135.6 bits (340), Expect = 4.0e-28
Identity = 89/294 (30.27%), Positives = 146/294 (49.66%), Query Frame = 0

Query: 46  ITTDGDWDQELLSAFLWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHN 105
           I  +G+W  E +++ L +EDV  +  I+  K    D   W  + +G ++V S + L   +
Sbjct: 34  IIEEGEWKTEEIASCLHKEDVPWVLGITPSKE-TNDTIGWSRTINGQYTVASGYKLRFRD 93

Query: 106 FFEASRVNGGSSEAWWKSLWNLKVVPPVNVRIEWIRQYLEDFLKSNDMESHKHAEYDSCS 165
              A   +  +++AWWK + +       ++ I W  + LE  L     +SH+    D  S
Sbjct: 94  PNIAEYSDNSANKAWWKDIED-------SIWIPWAMEMLELHLAFAHKDSHQKPSKDKVS 153

Query: 166 VVAPIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSIFKPVFCDILLA 225
                 W +P  G   +N DAS     P  G  +I RDH+G    A++ + P    +L+A
Sbjct: 154 ------WSSPPLGSFMINTDASLIDGQPGCGLGVIIRDHLGALVTAATDYIPGCLSVLVA 213

Query: 226 EALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKYGSK 285
           E LA+   LK+A      N+ I SDS++++ A+ G     +  GII+++  L  K + + 
Sbjct: 214 ETLAIRLALKLAATRSMQNIFIASDSQSVITALKGQTRINTDWGIIIEDCILASKNFNNL 273

Query: 286 TVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLSDLVNLDLSNCSPLGFS 340
           +  FIPRKCN+VAH LA+ +  V ++  W+S LP   +  +  DL    PLG S
Sbjct: 274 SFIFIPRKCNNVAHCLANWSRLVHVSDVWTSFLPDCAAASLKADL----PLGAS 309

BLAST of Sed0018752 vs. ExPASy TrEMBL
Match: M5VP36 (Uncharacterized protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa014760mg PE=4 SV=1)

HSP 1 Score: 133.3 bits (334), Expect = 2.0e-27
Identity = 116/429 (27.04%), Positives = 173/429 (40.33%), Query Frame = 0

Query: 2   GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFITTDGDWDQELLSAFL 61
           GLRKRIG+GQ + V+   WIPR N+FR +SP         V +FI   G W+ +LL+   
Sbjct: 20  GLRKRIGDGQETLVYGDAWIPRPNSFRLVSPQVLD-QETKVSDFIFPTGIWNVDLLNLCF 79

Query: 62  WREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGSSE--- 121
              DV+ I SI +  +Y +D+W+WHY+ +G +SV+S + L      + S   G SSE   
Sbjct: 80  HEGDVKAIKSIPLSVNYHKDRWMWHYTTNGIYSVKSGYRLEISKKKDCSGAVGSSSEPRV 139

Query: 122 --AWWKSLWNLKVVPPVNVRIEW--IRQYL---EDFLKSN------------DMESHKHA 181
             A+W+ +W+ + VP   +   W  I+ YL    + LK              + ES  HA
Sbjct: 140 QNAFWRKVWS-QEVPQKILYFTWRAIKNYLPCRSNLLKRKIISEDSCPICNVNSESVIHA 199

Query: 182 ---------------------------------------------------EYDSC---- 241
                                                              E  SC    
Sbjct: 200 LWSCPNAQKVWKKVLFRGVFTGLKFFNYGDLFEIATMYLSSLWQNRNKAVIEGRSCAQDV 259

Query: 242 -------------SVV---------APIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFR 301
                        S+V          P +W  P  G  KLNVD +  P     G   + R
Sbjct: 260 IFNRAHHLAGEYGSLVEGEPKLFEEKPTKWSPPLAGKYKLNVDTAFIPETGVGGIGAVVR 319

Query: 302 DHMGLCKFASSIFKPVFCDILLAEALALLEGLKVADNLGFHNLIIESDSKTLV-DAILGN 331
           +  G    A ++          AE +A L  +K A + GF +++IESDS+  V D     
Sbjct: 320 NDKGEVMAAMALPLASATSSKHAEIMAFLFWMKFARDAGFSSILIESDSQGAVNDVKKDE 379

BLAST of Sed0018752 vs. ExPASy TrEMBL
Match: A0A803NS66 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 130.2 bits (326), Expect = 1.7e-26
Identity = 105/386 (27.20%), Positives = 170/386 (44.04%), Query Frame = 0

Query: 2   GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPS-ISGIHVCEFITTDGDWDQELLSAF 61
           G R RIGNG +  V + PW+P+  TF+      PS +  ++V +    +G+WD E + A 
Sbjct: 77  GYRWRIGNGNSVRVLEDPWLPKPVTFKVYDK--PSLLDQLYVVDLKKGNGEWDAEFIRAV 136

Query: 62  LWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGSSEAW 121
               DVE I S++  +   EDK +WHYSK+G++S RS + +         + N  ++E W
Sbjct: 137 FNPTDVELILSMATSEWEIEDKILWHYSKNGEYSFRSGYRMAAALQVHDIQSNTEATEKW 196

Query: 122 WKSLWNLKVVP--------------PVNVRIEWIRQYLEDF---LKSNDMESHKHAEYD- 181
           W+ LW LK++P              P N  +   + ++E +     S   E+  HA +  
Sbjct: 197 WRQLWKLKILPKVKHFVWKMAHSWIPTNSALAHWKIHVEPYCIRCSSGAYENVFHALWGC 256

Query: 182 --SCSV------------------------VAPIRWIAP-----WP---GFLKLNVDASC 241
             +C V                         A  RW +      WP   G  K+NVD S 
Sbjct: 257 RVNCDVWKLTGFHGRIKRQGKEDVLAFLMECAEERWGSKRSCCWWPPVRGSFKINVDQSQ 316

Query: 242 SP-LAPKLGFELIFRDHMG-LCKFASSIFKPVFCDILLAEALALLEGLKVADNLGFHNLI 301
           S  +   L    + RDH G +C  A+++ +  +   L AE  A+L GL+        +  
Sbjct: 317 SGWVGVWLVSASVVRDHDGRVCVAAATVVRKEYSP-LQAELAAILAGLQTGIQRRLPSFT 376

Query: 302 IESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKYGSKTVCFIPRKCNSVAHNLASRAV 333
           +E+D    V+ +L +         ++  I+ LL+      + F+ R+ N VAH LA+ A+
Sbjct: 377 METDCLQAVNLVLQDEDGCRDVDGLIAHIKCLLQDVRVSGISFVYREANQVAHVLANEAL 436

BLAST of Sed0018752 vs. ExPASy TrEMBL
Match: A0A7J6GEF1 (RNase H domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_009591 PE=4 SV=1)

HSP 1 Score: 128.3 bits (321), Expect = 6.3e-26
Identity = 76/278 (27.34%), Positives = 136/278 (48.92%), Query Frame = 0

Query: 51  DWDQELLSAFLWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEAS 110
           DW  E ++ +   +D+ C+  I+   +  ED   W  + +G ++V S + L   +   A 
Sbjct: 226 DWKTEEIAGWFHEDDIPCVLGITPSIN-REDGIGWSLTTNGQYTVASGYKLRFKDPNIAE 285

Query: 111 RVNGGSSEAWWKSLWNLKVVPPV---------NVRIEWIRQYLEDFLKSNDMESHKHAEY 170
             +  + +AWWK +W  ++ P +         ++ I W  + LE  L +   +SH+    
Sbjct: 286 CSDNSAIKAWWKVVWGSRLTPKMKIFIWHIDDSIWIPWAMEMLEMHLAAAPKDSHQKPTQ 345

Query: 171 DSCSVVAPIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHMGLCKFASSIFKPVFCD 230
           D       + W  P  G   +N DAS     P  G  +I RDH+G    A++ + P +  
Sbjct: 346 DK------VCWSPPPLGSFMINTDASLIEGKPGCGLGVIIRDHLGELVTAATDYIPGYLS 405

Query: 231 ILLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKK 290
           +L+AE LA+   LK+A +    N+ I SD+++++ A+ G     +  G IL++  +  K 
Sbjct: 406 VLMAETLAIRLALKLAASRSLQNIFIASDNQSVITALKGQTRFNTDWGKILEDCIIAAKN 465

Query: 291 YGSKTVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILP 320
           + + +  F PRKCN+VAH  A+ +    ++  W+S LP
Sbjct: 466 FNTLSFNFTPRKCNNVAHCFANWSRVAHISDVWTSFLP 496

BLAST of Sed0018752 vs. ExPASy TrEMBL
Match: A0A803NG99 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 124.4 bits (311), Expect = 9.1e-25
Identity = 96/369 (26.02%), Positives = 154/369 (41.73%), Query Frame = 0

Query: 2    GLRKRIGNGQNSFVFKSPWIPRENTFRAISPHCPSISGIHVCEFITTDGDWDQELLSAFL 61
            GLR ++G+G +      PWIP +N F  +       +   V ++IT +  WD   L    
Sbjct: 811  GLRLQVGSGLSIRTATDPWIPAQNRFTPV--RFTGFADSCVADYITPNKVWDVTQLHNDF 870

Query: 62   WREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHNFFEASRVNGGSSEAWW 121
               DV+ I  I +  + ++D +IWHY+  G ++V+S + L      +       S E WW
Sbjct: 871  SSIDVDNILKIPLSLAAQDDNFIWHYTPTGVYTVQSGYHLAYSLATQNQTTGSNSQEKWW 930

Query: 122  KSLWNLKVVPPVNVRIEWIR------------------------------------QYLE 181
            K  W+L++   +   + ++                                      YL 
Sbjct: 931  KYFWSLQLPSKIPALVVYVHVRGNLLDMRYSVVNTLNRVVHHKPCKKPAEIFAGSMAYLS 990

Query: 182  DFLKSNDMESHKHAEYDSCSVVAPIRWIAPWPGFLKLNVDASCSPLAPKLGFELIFRDHM 241
             F +++   S  H   +  +V     WI P PG  KLNVDA+      KLG+  + RDH 
Sbjct: 991  HFSQASHTSSIVH---NDPTVADHSPWIPPPPGLCKLNVDAAVISAQNKLGYGAVIRDHR 1050

Query: 242  GLCKFASSIFKPVFCDILLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSL 301
            G+   A S            EA AL   L+ +  +      IE+DS  LV AI  N  S 
Sbjct: 1051 GVVLAALSRPDVGNLKSQEMEAKALFNALQWSLLMNLQIDEIETDSLMLVQAI-SNFASC 1110

Query: 302  SP--QGIILDEIRLLLKKYGSKTVCFIPRKCNSVAHNLASRAVFVGLNGCWSSILPHWLS 333
            SP  + ++LD ++ LL  + +  V  + R  N  AH LA +A+ +  +  W   +P  + 
Sbjct: 1111 SPTFKALVLD-VKNLLSNFPNVCVSHVRRDANQAAHGLAKQALAMDSDSIWLGEIPPTIF 1170

BLAST of Sed0018752 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 53.9 bits (128), Expect = 2.9e-07
Identity = 29/83 (34.94%), Positives = 46/83 (55.42%), Query Frame = 0

Query: 52  WDQELLSAFLWREDVECICSISICKSYEEDKWIWHYSKDGDFSVRSAHWLLRHN---FFE 111
           WD   +S F+ + D   I  I + KS + DK IW+Y+  G+++VRS +WLL H+      
Sbjct: 88  WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIP 147

Query: 112 ASRVNGGSSEAWWKSLWNLKVVP 132
           A     GS +   + +WNL ++P
Sbjct: 148 AINPPHGSIDLKTR-IWNLPIMP 169

BLAST of Sed0018752 vs. TAIR 10
Match: AT4G09490.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 48.1 bits (113), Expect = 1.6e-05
Identity = 31/84 (36.90%), Positives = 45/84 (53.57%), Query Frame = 0

Query: 223 LLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKY 282
           L+AEA+AL   L+ A ++G   L + SDS+ L+ AI     S    GII D + L L  +
Sbjct: 74  LMAEAIALFLALQYAQSIGITKLSMASDSQQLITAITSESPSTEFYGIIFDILNLSL-GF 133

Query: 283 GSKTVCFIPRKCNSVAHNLASRAV 307
              +  F+PR  N VA  LA  ++
Sbjct: 134 ADVSFSFVPRSENRVADELAKSSL 156

BLAST of Sed0018752 vs. TAIR 10
Match: AT2G22440.1 (BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily protein (TAIR:AT4G29090.1); Has 208 Blast hits to 191 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 208; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 46.6 bits (109), Expect = 4.7e-05
Identity = 26/90 (28.89%), Positives = 46/90 (51.11%), Query Frame = 0

Query: 15  VFKSPWIPRENTFRAISPHCPSISGIHVCEFITTDGD-WDQELLSAFLWREDVECICSIS 74
           V+K PWIP      A S      S ++V + I  + + W  + L A +   D+  I  I 
Sbjct: 5   VWKDPWIPTILARPAKSILNIRDSLLYVNDLIDQNTNLWKLDRLQALIDPVDIPLILGIR 64

Query: 75  ICKSYEEDKWIWHYSKDGDFSVRSAHWLLR 104
             ++Y  D + W ++K G+++V+S +W+ R
Sbjct: 65  PSRTYLSDGFSWSHTKSGNYTVKSGYWVAR 94

BLAST of Sed0018752 vs. TAIR 10
Match: AT5G38920.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 45.4 bits (106), Expect = 1.0e-04
Identity = 34/110 (30.91%), Positives = 56/110 (50.91%), Query Frame = 0

Query: 223 LLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAIL--GNHLSLSPQGIILDEIRLLLK 282
           L+ EA A+   +     L +  +I ESDS+ LV  ++  G+   L P   IL +I+LLL+
Sbjct: 84  LVTEAEAMRWAVLSLSRLNYRKVIFESDSQQLVSILVDNGDMPILDP---ILQDIKLLLQ 143

Query: 283 KYGSKTVCFIPRKCNSVAHNLASRAVFV-GLNGCWSSILPHWLSDLVNLD 330
            +      F+ R  N VA  +A  ++ +   +    SI+P+W+   V LD
Sbjct: 144 HFEETKFVFMHRGGNGVADRIAKESLSLENYDPKLYSIVPYWVKSFVELD 190

BLAST of Sed0018752 vs. TAIR 10
Match: AT1G10000.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 44.7 bits (104), Expect = 1.8e-04
Identity = 32/89 (35.96%), Positives = 52/89 (58.43%), Query Frame = 0

Query: 223 LLAEALALLEGLKVADNLGFHNLIIESDSKTLVDAILGNHLSLSPQGIILDEIRLLLKKY 282
           L AEA A+   +  A  L   +L++ SDSK++VDA L +++SL+    +L EIR +  ++
Sbjct: 205 LAAEAWAIKSAMLHALQLERSDLLVLSDSKSIVDA-LNSNVSLNEIFGLLVEIRSIRNRF 264

Query: 283 GSKTVCFIPRKCNSVAHNLASRAVFVGLN 312
            S +  FIPR  NS+A   A  ++ +  N
Sbjct: 265 RSISFQFIPRLVNSIADAAAKLSLCISGN 292

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_012836236.1 | 1.6e-31 | 30.40 | PREDICTED: uncharacterized protein LOC105956874 [Erythranthe guttata]
XP_028125150.1 | 1.8e-30 | 29.34 | uncharacterized protein LOC114322087 [Camellia sinensis]
XP_012847426.1 | 3.9e-30 | 27.80 | PREDICTED: uncharacterized protein LOC105967373 [Erythranthe guttata]
KAF4371092.1 | 8.2e-28 | 30.27 | hypothetical protein F8388_020819 [Cannabis sativa]
XP_023897447.1 | 1.5e-26 | 25.61 | uncharacterized protein LOC112009345 [Quercus suber]

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A7J6FK63 | 4.0e-28 | 30.27 | RNase H domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_020819 PE=...
M5VP36 | 2.0e-27 | 27.04 | Uncharacterized protein (Fragment) OS=Prunus persica OX=3760 GN=PRUPE_ppa014760m...
A0A803NS66 | 1.7e-26 | 27.20 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A7J6GEF1 | 6.3e-26 | 27.34 | RNase H domain-containing protein OS=Cannabis sativa OX=3483 GN=G4B88_009591 PE=...
A0A803NG99 | 9.1e-25 | 26.02 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G09510.1 | 2.9e-07 | 34.94 | Ribonuclease H-like superfamily protein
AT4G09490.1 | 1.6e-05 | 36.90 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT2G22440.1 | 4.7e-05 | 28.89 | BEST Arabidopsis thaliana protein match is: Ribonuclease H-like superfamily prot...
AT5G38920.1 | 1.0e-04 | 30.91 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
AT1G10000.1 | 1.8e-04 | 35.96 | Ribonuclease H-like superfamily protein

InterPro
Analysis Name: InterPro Annotations of Chayote (edule) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR036397 | Ribonuclease H superfamily | GENE3D | 3.30.420.10 | - | coord: 179..306; e-value: 9.7E-13; score: 50.3
IPR002156 | Ribonuclease H domain | PFAM | PF13456 | RVT_3 | coord: 183..305; e-value: 2.3E-22; score: 79.1
None | No IPR available | PANTHER | PTHR47723 | OS05G0353850 PROTEIN | coord: 166..329
IPR044730 | Ribonuclease H-like domain, plant type | CDD | cd06222 | RNase_H_like | coord: 182..302; e-value: 6.12286E-14; score: 65.7984
IPR012337 | Ribonuclease H-like superfamily | SUPERFAMILY | 53098 | Ribonuclease H-like | coord: 179..307
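
The `coord` entries give the span of each domain hit on the protein. A small sketch of how such a span could be cut out of the protein sequence follows. It assumes the coordinates are 1-based, inclusive residue positions, which is the usual convention for domain annotations but is not stated on this page. `parse_coord` and `slice_domain` are hypothetical helper names.

```python
import re


def parse_coord(text: str) -> tuple:
    """Extract (start, end) from a string such as 'coord: 183..305'."""
    match = re.search(r"coord:\s*(\d+)\.\.(\d+)", text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))


def slice_domain(protein: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive positions."""
    if not (1 <= start <= end <= len(protein)):
        raise ValueError(f"span {start}..{end} is outside the protein")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]


start, end = parse_coord("coord: 183..305")
print(start, end, end - start + 1)
```

Calling `slice_domain` with the protein sequence from the Sequences section and a parsed span returns the corresponding domain region, for example the Pfam RVT_3 hit.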

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Sed0018752.1 | Sed0018752.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0004523 | RNA-DNA hybrid ribonuclease activity