HG10014678 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10014678
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionReverse transcriptase domain-containing protein
LocationChr02: 17749523 .. 17750617 (-)
RNA-Seq ExpressionHG10014678
SyntenyHG10014678
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAGGATTATATGTTGGAATGCTAGGGGTCTGGGCAATCCTCGAGCATTCCGTTCGCTGTGTGACCTTGTCCGTTCTTCAATTCCTGACATTTTGTTCATATCAGAGACAAAGGGTGGTTTAGACTTATGTAATAGAGTTAAGTTTCGTTGCAACTTCACTGGTTGCTTACCGGTAAATAGTGTGTGTGCTAAGGAAGGCTTGTGTATCTTCTGGAAGGATTATGTGGTAACCGAGATCAAATCGTTCTCAGACCATCATATTGATACTCTGATTACTTGGAATAATAGGCAATGGAGGTTTACAGGATTGTATGGGTATCCAGATACTGCAAGGAAGCATCTGACATGGGAATTGTTAAAAAGACTGAATCTTGATAAGAATACCCCGTGGCTGATAGGAGGTGATTTTAACGAAATCTTGAATGAGGATGAAAAAGTTGGTGGGATTCCTCGAGCTGTTCAGGCTCTAAGTGCTTTTAAAGAAGTTATGGACAACTGCCAATTAATGGATATGGGTTTCAAAGGTTCACTTTTCACATGGTACGGAAAAAGAAATGGGGATTTGATTAGGGAGAGGCTGGATAGGTTTTTATGTAATGAAGCTCTTTGGAACCTCTTCAATCAGGCTACCGTCAACCATTTAGGGTGGCTCTTTTCAGATCATTGTCCTATTATGATCTCCTTAGACTTTAACCAGTTAAGGAAAAGTTGGAGGAAGAAACCGTTTAGATTTGAGGAAGTTTGGACTCTTAATCTTGATTGCAAGAATATTATATCTGAAAGAGGTAATTGGAACTATCCGTGCTATGTGGTTTCTCTCAATGATAGTTTGTCGAGATGCTCCAGATCCTTATCGCTGTGGGGGAAAGAGTCCTTTAAGGATTTAAAAAACAAAACTGCTGATTGTAAAAGAGTTCTTCCGGCTGCGATTGATAATCCTTCAGCTACTGACTTTGACAGAATTCATGGGATAGAATTTGAGCTTGACAGACTTATGGAATATGAGGAGATTTATTGGAAGCAAAGATCTAGGGAGAATTGGCTTAGATGGGGGATAAAAATACTAGATGGTTCCACAAAAAGGCCTCTTTAA

mRNA sequence

ATGAGGATTATATGTTGGAATGCTAGGGGTCTGGGCAATCCTCGAGCATTCCGTTCGCTGTGTGACCTTGTCCGTTCTTCAATTCCTGACATTTTGTTCATATCAGAGACAAAGGGTGGTTTAGACTTATGTAATAGAGTTAAGTTTCGTTGCAACTTCACTGGTTGCTTACCGGTAAATAGTGTGTGTGCTAAGGAAGGCTTGTGTATCTTCTGGAAGGATTATGTGGTAACCGAGATCAAATCGTTCTCAGACCATCATATTGATACTCTGATTACTTGGAATAATAGGCAATGGAGGTTTACAGGATTGTATGGGTATCCAGATACTGCAAGGAAGCATCTGACATGGGAATTGTTAAAAAGACTGAATCTTGATAAGAATACCCCGTGGCTGATAGGAGGTGATTTTAACGAAATCTTGAATGAGGATGAAAAAGTTGGTGGGATTCCTCGAGCTGTTCAGGCTCTAAGTGCTTTTAAAGAAGTTATGGACAACTGCCAATTAATGGATATGGGTTTCAAAGGTTCACTTTTCACATGGTACGGAAAAAGAAATGGGGATTTGATTAGGGAGAGGCTGGATAGGTTTTTATGTAATGAAGCTCTTTGGAACCTCTTCAATCAGGCTACCGTCAACCATTTAGGGTGGCTCTTTTCAGATCATTGTCCTATTATGATCTCCTTAGACTTTAACCAGTTAAGGAAAAGTTGGAGGAAGAAACCGTTTAGATTTGAGGAAGTTTGGACTCTTAATCTTGATTGCAAGAATATTATATCTGAAAGAGGTAATTGGAACTATCCGTGCTATGTGGTTTCTCTCAATGATAGTTTGTCGAGATGCTCCAGATCCTTATCGCTGTGGGGGAAAGAGTCCTTTAAGGATTTAAAAAACAAAACTGCTGATTGTAAAAGAGTTCTTCCGGCTGCGATTGATAATCCTTCAGCTACTGACTTTGACAGAATTCATGGGATAGAATTTGAGCTTGACAGACTTATGGAATATGAGGAGATTTATTGGAAGCAAAGATCTAGGGAGAATTGGCTTAGATGGGGGATAAAAATACTAGATGGTTCCACAAAAAGGCCTCTTTAA

Coding sequence (CDS)

ATGAGGATTATATGTTGGAATGCTAGGGGTCTGGGCAATCCTCGAGCATTCCGTTCGCTGTGTGACCTTGTCCGTTCTTCAATTCCTGACATTTTGTTCATATCAGAGACAAAGGGTGGTTTAGACTTATGTAATAGAGTTAAGTTTCGTTGCAACTTCACTGGTTGCTTACCGGTAAATAGTGTGTGTGCTAAGGAAGGCTTGTGTATCTTCTGGAAGGATTATGTGGTAACCGAGATCAAATCGTTCTCAGACCATCATATTGATACTCTGATTACTTGGAATAATAGGCAATGGAGGTTTACAGGATTGTATGGGTATCCAGATACTGCAAGGAAGCATCTGACATGGGAATTGTTAAAAAGACTGAATCTTGATAAGAATACCCCGTGGCTGATAGGAGGTGATTTTAACGAAATCTTGAATGAGGATGAAAAAGTTGGTGGGATTCCTCGAGCTGTTCAGGCTCTAAGTGCTTTTAAAGAAGTTATGGACAACTGCCAATTAATGGATATGGGTTTCAAAGGTTCACTTTTCACATGGTACGGAAAAAGAAATGGGGATTTGATTAGGGAGAGGCTGGATAGGTTTTTATGTAATGAAGCTCTTTGGAACCTCTTCAATCAGGCTACCGTCAACCATTTAGGGTGGCTCTTTTCAGATCATTGTCCTATTATGATCTCCTTAGACTTTAACCAGTTAAGGAAAAGTTGGAGGAAGAAACCGTTTAGATTTGAGGAAGTTTGGACTCTTAATCTTGATTGCAAGAATATTATATCTGAAAGAGGTAATTGGAACTATCCGTGCTATGTGGTTTCTCTCAATGATAGTTTGTCGAGATGCTCCAGATCCTTATCGCTGTGGGGGAAAGAGTCCTTTAAGGATTTAAAAAACAAAACTGCTGATTGTAAAAGAGTTCTTCCGGCTGCGATTGATAATCCTTCAGCTACTGACTTTGACAGAATTCATGGGATAGAATTTGAGCTTGACAGACTTATGGAATATGAGGAGATTTATTGGAAGCAAAGATCTAGGGAGAATTGGCTTAGATGGGGGATAAAAATACTAGATGGTTCCACAAAAAGGCCTCTTTAA

Protein sequence

MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVNSVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLITWNNRQWRFTGLYGYPDTARKHLTWELLKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLFTWYGKRNGDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRKSWRKKPFRFEEVWTLNLDCKNIISERGNWNYPCYVVSLNDSLSRCSRSLSLWGKESFKDLKNKTADCKRVLPAAIDNPSATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWGIKILDGSTKRPL
Homology
BLAST of HG10014678 vs. NCBI nr
Match: XP_018816246.1 (uncharacterized protein LOC108987722 [Juglans regia] >KAF5463276.1 hypothetical protein F2P56_019199 [Juglans regia])

HSP 1 Score: 261.5 bits (667), Expect = 1.1e-65
Identity = 135/369 (36.59%), Postives = 203/369 (55.01%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+I  WNARGLGNPR  R+LCDL++  +PD+LF+ ET+         K++  F  CL ++
Sbjct: 1   MKICSWNARGLGNPRGIRTLCDLIQRELPDVLFLQETRLSTREVESCKYKLGFQNCLAIS 60

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLI---TWNNRQWRFTGLYGYPDTARKHLTW 120
           S   K G+ + W   +   + ++S +H+D +I         W  T LYG+P+T  +H +W
Sbjct: 61  SDGRKGGIALLWDVEIDLSVINYSSNHVDAVIKDLRLRKGHWFLTALYGFPETHLRHQSW 120

Query: 121 ELLKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGS 180
            LLK L    + PWL+ GDFNE+L+  EK GG PR  + LSAF+EV+D C+L D+GF G 
Sbjct: 121 SLLKSLCRAPDEPWLVLGDFNELLSAHEKSGGNPRPEKQLSAFREVVDVCRLRDLGFSGP 180

Query: 181 LFTWYGKRNGD-LIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRK 240
           + TW  +R GD  IRERLD  L N   W  F  A V H    +SDH PI ++L+      
Sbjct: 181 MLTWSNRRAGDKCIRERLDHCLVNSMWWACFPNARVTHGVVAYSDHLPIWLNLEGASASH 240

Query: 241 SWRKKPFRFEEVWTLNLDCKNIISERGNWN---YPCYVVSLNDSLSRCSRSLSLWGKESF 300
           +  +K F+FE +W   ++C+ II  +G W     P  +  L+  +  C   L  W K+ F
Sbjct: 241 N-SQKSFKFEAMWVGEVECEEII--KGVWERCAAPASMNVLSGLIKECGDQLQGWNKQGF 300

Query: 301 KDLKNKTADCKRVLPAAID-NPSATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWG 360
            +++ +    +R L    D +P     D ++    ++   +E +EI W+QRS+  WL+ G
Sbjct: 301 GNVQTQLNKAQRSLCNLQDMDPGLVSNDALNAARSKVQLWLERKEIMWRQRSKALWLKEG 360

Query: 361 IKILDGSTK 362
               D +TK
Sbjct: 361 ----DSNTK 362

BLAST of HG10014678 vs. NCBI nr
Match: KAF5443558.1 (hypothetical protein F2P56_036105, partial [Juglans regia])

HSP 1 Score: 261.2 bits (666), Expect = 1.4e-65
Identity = 143/359 (39.83%), Postives = 197/359 (54.87%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+++CWN+RGLGNP+  R L DL+ +  P ++F+ ETK         KFR + T C  V+
Sbjct: 1   MKLLCWNSRGLGNPQGIRVLRDLITNEDPSLVFLQETKLKARAMENCKFRLHLTHCFTVD 60

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLI-TWNNRQWRFTGLYGYPDTARKHLTWEL 120
            V    GL + WK  +   ++SFS HHID LI   +  +WRFTG+YG P+   ++LTW L
Sbjct: 61  CVGRSGGLSLLWKGDLRVRVQSFSLHHIDALIQDGDGPEWRFTGVYGNPEVVNRYLTWNL 120

Query: 121 LKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLF 180
           L+RLN   + PWL+GGDFNE+L+ +EK GG PR+   + AF+ V+ +C L D+GF+G  +
Sbjct: 121 LRRLNSGVDGPWLVGGDFNELLHFNEKRGGRPRSENQMEAFRNVIFDCSLRDLGFRGPKY 180

Query: 181 TWYGKRNGD-LIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRK-S 240
           TW   R G   I ERLDRFL N     LF Q  V H    +SDH P+    D  +L K +
Sbjct: 181 TWCNGRAGSRAISERLDRFLGNNQFCALFPQFVVRHGMAAYSDHLPVW--FDSEELEKRN 240

Query: 241 WRKKPFRFEEVWTLNLDCKNIISERGNWNYPCYVVSLND---SLSRCSRSLSLWGKESFK 300
              K FRFE +W     C  II+    W+       + D   S+ +C   LS W K+SF 
Sbjct: 241 KAPKLFRFEAMWVGVEQCSQIINR--VWHTDGNGGRMEDVLRSMKKCGEQLSEWNKKSFG 300

Query: 301 DLKNKTADCKRVLPAAIDNPSA-TDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWG 353
           ++  K    K  L    D  S   D   +     E+   +E EE+ WKQRSR  WL+ G
Sbjct: 301 NVSRKLNMAKHYLKQIQDRDSLHPDSVVVTKARREVQVWLEREEVMWKQRSRIQWLQEG 355

BLAST of HG10014678 vs. NCBI nr
Match: KAF5471209.1 (hypothetical protein F2P56_011662 [Juglans regia])

HSP 1 Score: 260.0 bits (663), Expect = 3.1e-65
Identity = 136/364 (37.36%), Postives = 203/364 (55.77%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+I  WNARGLGNPR  R+LCDL++   PD+LF+ ET+         K++  F  CL ++
Sbjct: 1   MKICSWNARGLGNPRGIRTLCDLIQREGPDVLFLQETRLSTREMESCKYKLGFQNCLGIS 60

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLI---TWNNRQWRFTGLYGYPDTARKHLTW 120
           S   K G+ + W   +   + ++S +H+D +I         W  T +YG+P+T  +H +W
Sbjct: 61  SQGRKGGIALLWDAEIDLSVINYSSNHVDAIIKDSCLRQGHWFLTAIYGFPETHLRHHSW 120

Query: 121 ELLKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGS 180
            L+K L  D + PWL+ GDFNEIL+  EK GG PR  + L  F+EV+D C+L D+G+ G 
Sbjct: 121 NLIKSLCRDNDKPWLVLGDFNEILHAHEKSGGNPRPERQLRDFREVVDVCRLRDLGYLGP 180

Query: 181 LFTWYGKRNGD-LIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRK 240
            FTW  +R GD  IRERLDR L N   W  F +A V H    +SDH PI ++L+  ++  
Sbjct: 181 KFTWSNRRAGDKCIRERLDRCLVNSEWWASFPRARVTHGVAAYSDHLPIWLNLE-GEVDS 240

Query: 241 SWRKKPFRFEEVWTLNLDCKNIISERGNWNYPCYVVSLNDSL---SRCSRSLSLWGKESF 300
            + KK F+FE +W    +C++II  +G W       ++N++L     C   L  W K+SF
Sbjct: 241 HFVKKSFKFEAMWVGEAECEDII--KGVWGRSEGPATMNEALGLIKECGNQLQGWNKKSF 300

Query: 301 KDLKNKTADCKRVLPAAIDNPSATDFDRIHGIEFELDR-----LMEYEEIYWKQRSRENW 353
            +++ K  + ++ L     N    D D +   E  L R      +E +EI W+QRS+  W
Sbjct: 301 GNVQAKLNNAQKFL----HNLQERDSDMVPIEELNLARSQVQIWLERKEIMWRQRSKALW 357

BLAST of HG10014678 vs. NCBI nr
Match: XP_024172304.2 (uncharacterized protein LOC112178381 [Rosa chinensis])

HSP 1 Score: 256.1 bits (653), Expect = 4.4e-64
Identity = 131/369 (35.50%), Postives = 199/369 (53.93%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M ++CWN +G+GNP     L  LV  + PD++F+SETK      ++++F+  +     V+
Sbjct: 1   MNVLCWNCQGIGNPWTVNGLKGLVTLNFPDVVFLSETKCKTQEMDKIRFQLGYRNAFAVD 60

Query: 61  SVCAKE---------GLCIFWKDYVVTEIKSFSDHHIDTLI--TWNNRQWRFTGLYGYPD 120
               K          GLC+ WK+ +   + +FSD+HID LI    +  +WRFTG+YG+  
Sbjct: 61  CQVVKNPNGRVSRAGGLCLLWKEGIDVALSTFSDNHIDVLIGGVGDKNRWRFTGVYGHSK 120

Query: 121 TARKHLTWELLKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQL 180
              +HLTW L+ ++  + + PWLIGGDFNEIL   EK GG PR  + + AF+  ++ C L
Sbjct: 121 VELRHLTWALITKIGYNNHWPWLIGGDFNEILKACEKEGGPPRCTRQMEAFRRCVEGCCL 180

Query: 181 MDMGFKGSLFTWYGKRNGDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISL 240
            D+ F G  FTW GKR G+ I+ RLDRF+   +  +LF  + V HL    SDH PI++ +
Sbjct: 181 NDLNFVGPCFTWRGKRGGEEIKVRLDRFMATRSWSDLFPTSRVTHLKPSKSDHLPILVEV 240

Query: 241 DFNQLRKSWRKKPFRFEEVWTLNLDCKNIISERGNW-----NYPCYVVSLNDSLSRCSRS 300
                RK  RK+ FRFEE W    +C N++ +   W     N P   + +   + +  ++
Sbjct: 241 RSTIPRKRRRKRRFRFEEHWLHEAECANVVKD--GWESVAGNDPFQTICMR--IEQTRKA 300

Query: 301 LSLWGKESFKDLKNKTADCKRVLPAAIDNP-SATDFDRIHGIEFELDRLMEYEEIYWKQR 353
           L +W  + F  LK +    +  L    D   SA   +    +E +L+ L+ +E  YW+QR
Sbjct: 301 LWVWSDQKFGHLKAEIERIRAKLAVFYDKSLSAYPEEERLELETKLNDLLYHEHNYWQQR 360

BLAST of HG10014678 vs. NCBI nr
Match: XP_042962672.1 (uncharacterized protein LOC122296942 [Carya illinoinensis])

HSP 1 Score: 255.4 bits (651), Expect = 7.5e-64
Identity = 136/356 (38.20%), Postives = 195/356 (54.78%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+++ WN RGLGNPR+ RSL DL+ S +P+ILF+ ETK         K R  F  C  V+
Sbjct: 1   MKLLSWNLRGLGNPRSIRSLRDLLTSEVPEILFLQETKLSSRRLEFCKLRLGFRCCFGVD 60

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLIT-WNNRQWRFTGLYGYPDTARKHLTWEL 120
           SV    GL + WKD +   I ++S HHI   IT  +  +W  TG+YG+ D+ ++   W L
Sbjct: 61  SVGRSGGLALLWKDDINLRIINYSSHHIHASITNCDGVEWLLTGVYGHHDSGQRSEVWRL 120

Query: 121 LKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLF 180
           LK L      PW++ GDFNEIL+  EK+GG  R+   +  F+EV+ +C L D+G+ GS F
Sbjct: 121 LKFLGRGVVLPWIVFGDFNEILDHSEKLGGNIRSDIQMREFREVLSDCYLRDLGYVGSRF 180

Query: 181 TWYGKR-NGDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRKSW 240
           TW  +R   DL++ERLDRFL N    ++F    V H    +SDH P+ +  +   +R+  
Sbjct: 181 TWSNRRGEEDLVKERLDRFLANSLWCDMFPNLRVTHGVAAYSDHIPLWLDTEGALVRRRS 240

Query: 241 RKKPFRFEEVWTLNLDCKNIISE-RGNWNYPCYVVSLNDSLSRCSRSLSLWGKESFKDLK 300
           R+  FRFE +W    +C +II    G  + P  +  +   +S C+  L  W K SF  ++
Sbjct: 241 RRL-FRFEAMWVGETECSSIIERVWGRRHGPISLDQIMGRISSCATELGRWNKASFGHVQ 300

Query: 301 NKTADCKRVLPAAIDNPSATDFDRIHGIE-FELDRLMEYEEIYWKQRSRENWLRWG 353
              A  KR L    +N S       H     E+ + +E +E+ WKQRSR  WLR G
Sbjct: 301 KNLATAKRRLQCLEENDSGQHCLEEHKQACLEVQKWLERDELMWKQRSRVKWLREG 355

BLAST of HG10014678 vs. ExPASy TrEMBL
Match: A0A2I4EA22 (uncharacterized protein LOC108987722 OS=Juglans regia OX=51240 GN=LOC108987722 PE=4 SV=1)

HSP 1 Score: 261.5 bits (667), Expect = 5.1e-66
Identity = 135/369 (36.59%), Postives = 203/369 (55.01%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+I  WNARGLGNPR  R+LCDL++  +PD+LF+ ET+         K++  F  CL ++
Sbjct: 1   MKICSWNARGLGNPRGIRTLCDLIQRELPDVLFLQETRLSTREVESCKYKLGFQNCLAIS 60

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLI---TWNNRQWRFTGLYGYPDTARKHLTW 120
           S   K G+ + W   +   + ++S +H+D +I         W  T LYG+P+T  +H +W
Sbjct: 61  SDGRKGGIALLWDVEIDLSVINYSSNHVDAVIKDLRLRKGHWFLTALYGFPETHLRHQSW 120

Query: 121 ELLKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGS 180
            LLK L    + PWL+ GDFNE+L+  EK GG PR  + LSAF+EV+D C+L D+GF G 
Sbjct: 121 SLLKSLCRAPDEPWLVLGDFNELLSAHEKSGGNPRPEKQLSAFREVVDVCRLRDLGFSGP 180

Query: 181 LFTWYGKRNGD-LIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRK 240
           + TW  +R GD  IRERLD  L N   W  F  A V H    +SDH PI ++L+      
Sbjct: 181 MLTWSNRRAGDKCIRERLDHCLVNSMWWACFPNARVTHGVVAYSDHLPIWLNLEGASASH 240

Query: 241 SWRKKPFRFEEVWTLNLDCKNIISERGNWN---YPCYVVSLNDSLSRCSRSLSLWGKESF 300
           +  +K F+FE +W   ++C+ II  +G W     P  +  L+  +  C   L  W K+ F
Sbjct: 241 N-SQKSFKFEAMWVGEVECEEII--KGVWERCAAPASMNVLSGLIKECGDQLQGWNKQGF 300

Query: 301 KDLKNKTADCKRVLPAAID-NPSATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWG 360
            +++ +    +R L    D +P     D ++    ++   +E +EI W+QRS+  WL+ G
Sbjct: 301 GNVQTQLNKAQRSLCNLQDMDPGLVSNDALNAARSKVQLWLERKEIMWRQRSKALWLKEG 360

Query: 361 IKILDGSTK 362
               D +TK
Sbjct: 361 ----DSNTK 362

BLAST of HG10014678 vs. ExPASy TrEMBL
Match: A0A2N9HYE3 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS44563 PE=4 SV=1)

HSP 1 Score: 253.8 bits (647), Expect = 1.1e-63
Identity = 132/359 (36.77%), Postives = 193/359 (53.76%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M  + WN RGLGNPR  + +  L R+  P ++F+ ET        +++ +  F     VN
Sbjct: 459 MNCLAWNCRGLGNPRTVQDIARLTRAQDPSVMFLIETWQDEGPLEKLRCQLQFDSKFIVN 518

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLITWNNRQ-WRFTGLYGYPDTARKHLTWEL 120
                 GLC+FWK  V   ++SFS  HID L+  N    WRFTG YG P+T ++  +W+L
Sbjct: 519 RRNKGGGLCLFWKKDVKLSVQSFSHSHIDALVNDNQPDTWRFTGFYGAPETHKREESWDL 578

Query: 121 LKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLF 180
           L+RLN     PW   GDFNE++  +EK G   R+   +  F++V+D C  +D+GF G  F
Sbjct: 579 LRRLNAQLKLPWCCMGDFNELVRIEEKQGRHTRSESQMQLFRDVLDECGFVDLGFTGPKF 638

Query: 181 TWYGKRNGDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRKSWR 240
           TW   R GD+  ERLDR +        F  A V+HL   +SDH PI +S +   + K   
Sbjct: 639 TWTNNRPGDMTWERLDRVVATPDWLLRFPSARVSHLEGRWSDHKPIWVSTETAVIPK--- 698

Query: 241 KKPFRFEEVWTLNLDCKNIISERGNW-----NYPCYVVSLNDSLSRCSRSLSLWGKESFK 300
           +KPFRFEEVWT +  C+ +I +  +W       P Y V     +  C R L LW + +F 
Sbjct: 699 RKPFRFEEVWTSDQGCEAVIED--SWKQDLTGVPMYTVW--QKIHACRRGLRLWSRTTFG 758

Query: 301 DLKNKTADCKRVLPAAIDNP-SATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWG 353
           ++ ++  + +R+L  A +N     D  R++ ++ EL  L+  EE  W+QRSR  WL  G
Sbjct: 759 NITSRIKEVERLLKIAEENSMQGRDHHRVNQLKRELHSLLAKEERLWRQRSRAEWLHAG 810

BLAST of HG10014678 vs. ExPASy TrEMBL
Match: A0A2N9F086 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS8394 PE=4 SV=1)

HSP 1 Score: 252.3 bits (643), Expect = 3.1e-63
Identity = 137/361 (37.95%), Postives = 200/361 (55.40%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           MRII WN RGLGNP A RSL  LV++  P++LF+ ETK       R +    F     V 
Sbjct: 30  MRIISWNCRGLGNPDAVRSLHMLVKTQGPEVLFLMETKLETSSMERFRVSLGFNSVFVVP 89

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDTLITW-NNRQWRFTGLYGYPDTARKHLTWEL 120
           S+    GL +FWKD +  EIK+++ HHID  I   N+  WR TG YG P+  R+  +W L
Sbjct: 90  SLGRSGGLAMFWKDGINLEIKNYTTHHIDCYIRQRNDMGWRLTGFYGRPEDFRRWESWAL 149

Query: 121 LKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLF 180
           + +LN     PWL  GDFNEI+ ++EK G  PR ++ +  F+EV+  C L+DMG++G  F
Sbjct: 150 MDQLNGLGQNPWLCCGDFNEIMYQNEKRGMHPRPLRRMWEFREVLSRCNLIDMGYRGYDF 209

Query: 181 TWYGKRNGDL-IRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRKSW 240
           TW   R G   ++ERLDR L + A  +LF  +TV+H+    SDH PI+I +         
Sbjct: 210 TWDNNRRGVANVQERLDRALSSPAWTDLFPNSTVSHIWSSTSDHMPILIEVGQPITTSDR 269

Query: 241 RKKPFRFEEVWTLNLDC----KNIISERGNWNYPCYVVSLNDSLSRCSRSLSLWGKESFK 300
           +K+  RFEE W L+  C    K + SE      P Y V+  + +  C   L  W +  F 
Sbjct: 270 KKRHHRFEEKWILDSSCEDEVKRLWSEAAVQGSPMYCVT--EKIKHCRMGLVQWSRRKFG 329

Query: 301 DLKNK-TADCKRVLPAAIDNPSATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWGI 355
            ++++  A  + +    +DN      +RI G++ E++ L+  +E +W+QRSR  WL+ G 
Sbjct: 330 GVQSQIKARFEMIEAHTLDNREGQHQERIKGLKGEINSLLLADECHWRQRSRAVWLKVGD 388

BLAST of HG10014678 vs. ExPASy TrEMBL
Match: A0A2N9HE04 (Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS37711 PE=4 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 7.6e-62
Identity = 126/358 (35.20%), Postives = 197/358 (55.03%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+++ WN +GLGNP    SLC LV+S  P +LF+ ETK G      ++ +  F     V 
Sbjct: 249 MKLLSWNCQGLGNPCTVLSLCRLVKSQDPQVLFLMETKLGKKKMEGIRLKLGFQNAFVVP 308

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDT-LITWNNRQWRFTGLYGYPDTARKHLTWEL 120
           S+    GL + W+  V  E+K+F+ HHID+ ++  N+  WR  G YG P+  RK  +W L
Sbjct: 309 SIGRSRGLALLWQGEVALEVKNFTTHHIDSHILHGNDSGWRLIGFYGRPEEQRKWESWAL 368

Query: 121 LKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLF 180
           L++LN   + PWL  GDFNEIL ++EK G   R  + +  F+EV++ CQ +D+G+KG  F
Sbjct: 369 LEQLNKCCSLPWLCYGDFNEILEQNEKRGKRLRPWRRMCEFREVVNRCQFVDLGYKGYKF 428

Query: 181 TWYGKRN-GDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRKSW 240
           TW   R+    ++ERLDR +   +  NLFN  +V HL    SDH PI++     + +   
Sbjct: 429 TWNNNRDVRAFVKERLDRVVATLSWTNLFNIISVTHLQISKSDHIPILVEAANQRSQTRN 488

Query: 241 RKKPFRFEEVWTLNLDCKNIISERGNWNYP----CYVVSLNDSLSRCSRSLSLWGKESFK 300
           +++  RFEE W  + DC+ +I  RG W         +  L + + RC   L+ W K+ F 
Sbjct: 489 KRRLTRFEEKWATHPDCETVI--RGLWEEEVGEGSPMFRLTEKIKRCRMGLAQWSKQIFG 548

Query: 301 DLKNKTADCKRVLPAAIDNPSATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWG 353
             +++       + A  ++    +   I  ++ E++ L+  +EI+WKQRSR  WL+ G
Sbjct: 549 GSQHQIRARFEAMEALTNDDGGQNRSLISDLKEEINSLLLSDEIHWKQRSRNTWLKEG 604

BLAST of HG10014678 vs. ExPASy TrEMBL
Match: A0A2N9IJF6 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS52874 PE=4 SV=1)

HSP 1 Score: 247.7 bits (631), Expect = 7.6e-62
Identity = 126/358 (35.20%), Postives = 197/358 (55.03%), Query Frame = 0

Query: 1   MRIICWNARGLGNPRAFRSLCDLVRSSIPDILFISETKGGLDLCNRVKFRCNFTGCLPVN 60
           M+++ WN +GLGNP    SLC LV+S  P +LF+ ETK G      ++ +  F     V 
Sbjct: 286 MKLLSWNCQGLGNPCTVLSLCRLVKSQDPQVLFLMETKLGKKKMEGIRLKLGFQNAFVVP 345

Query: 61  SVCAKEGLCIFWKDYVVTEIKSFSDHHIDT-LITWNNRQWRFTGLYGYPDTARKHLTWEL 120
           S+    GL + W+  V  E+K+F+ HHID+ ++  N+  WR  G YG P+  RK  +W L
Sbjct: 346 SIGRSRGLALLWQGEVALEVKNFTTHHIDSHILHGNDSGWRLIGFYGRPEEQRKWESWAL 405

Query: 121 LKRLNLDKNTPWLIGGDFNEILNEDEKVGGIPRAVQALSAFKEVMDNCQLMDMGFKGSLF 180
           L++LN   + PWL  GDFNEIL ++EK G   R  + +  F+EV++ CQ +D+G+KG  F
Sbjct: 406 LEQLNKCCSLPWLCYGDFNEILEQNEKRGKRLRPWRRMCEFREVVNRCQFVDLGYKGYKF 465

Query: 181 TWYGKRN-GDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPIMISLDFNQLRKSW 240
           TW   R+    ++ERLDR +   +  NLFN  +V HL    SDH PI++     + +   
Sbjct: 466 TWNNNRDVRAFVKERLDRVVATLSWTNLFNIISVTHLQISKSDHIPILVEAANQRSQTRN 525

Query: 241 RKKPFRFEEVWTLNLDCKNIISERGNWNYP----CYVVSLNDSLSRCSRSLSLWGKESFK 300
           +++  RFEE W  + DC+ +I  RG W         +  L + + RC   L+ W K+ F 
Sbjct: 526 KRRLTRFEEKWATHPDCETVI--RGLWEEEVGEGSPMFRLTEKIKRCRMGLAQWSKQIFG 585

Query: 301 DLKNKTADCKRVLPAAIDNPSATDFDRIHGIEFELDRLMEYEEIYWKQRSRENWLRWG 353
             +++       + A  ++    +   I  ++ E++ L+  +EI+WKQRSR  WL+ G
Sbjct: 586 GSQHQIRARFEAMEALTNDDGGQNRSLISDLKEEINSLLLSDEIHWKQRSRNTWLKEG 641

BLAST of HG10014678 vs. TAIR 10
Match: AT1G40390.1 (DNAse I-like superfamily protein )

HSP 1 Score: 72.4 bits (176), Expect = 8.4e-13
Identity = 63/247 (25.51%), Postives = 103/247 (41.70%), Query Frame = 0

Query: 111 ARKHLTWELLKRLNLDK---NTPWLIGGDFNEILNEDEKVGGIPR--AVQALSAFKEVMD 170
           A +   W+ + RL+      N+PWL+ GDFN+I +  E    +P   ++Q L   +  M 
Sbjct: 99  AERRSLWDDITRLSASSPLCNSPWLVVGDFNQIASVTEHYSLMPSNISLQGLEDLQACMR 158

Query: 171 NCQLMDMGFKGSLFTWYGKRNGDLIRERLDRFLCNEALWNLFNQATVNHLGWLFSDHCPI 230
           +  L+D+  +G L+TW   +  + I  +LDR + N      F  A+        SDH   
Sbjct: 159 DSDLVDLPCRGVLYTWSNHQQDNPILRKLDRAIVNGCWLATFPTASAIFDPPSDSDHAAC 218

Query: 231 MISLDFNQLRKSWRKKPFRFEEVWTLNLDCKNIISERGNWNYPCYVVSLNDSLSRCSRSL 290
           M+ L  N      +KK F++    + + D   I S    W     V S   SL    +  
Sbjct: 219 MVIL--NNSPPLSKKKSFKYFSFLSTHPDF--ISSILAAWQKEIAVGSFMFSLGELLKE- 278

Query: 291 SLWGKESFKDLKNKTADCKRVLPAAIDNPSATDFDRIHGIEFELDRLMEYEEIYWKQRSR 350
               K++ + L  +      +    + NPS   F   H      +      E ++KQ+SR
Sbjct: 279 ---AKKACRGLNRR--GFSNIQAQLMSNPSDFLFRAEHVARKNWNFFAAALESFYKQKSR 335

Query: 351 ENWLRWG 353
             WL+ G
Sbjct: 339 IKWLKEG 335

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_018816246.11.1e-6536.59uncharacterized protein LOC108987722 [Juglans regia] >KAF5463276.1 hypothetical ... [more]
KAF5443558.11.4e-6539.83hypothetical protein F2P56_036105, partial [Juglans regia][more]
KAF5471209.13.1e-6537.36hypothetical protein F2P56_011662 [Juglans regia][more]
XP_024172304.24.4e-6435.50uncharacterized protein LOC112178381 [Rosa chinensis][more]
XP_042962672.17.5e-6438.20uncharacterized protein LOC122296942 [Carya illinoinensis][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A2I4EA225.1e-6636.59uncharacterized protein LOC108987722 OS=Juglans regia OX=51240 GN=LOC108987722 P... [more]
A0A2N9HYE31.1e-6336.77Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9F0863.1e-6337.95Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9HE047.6e-6235.20Reverse transcriptase domain-containing protein OS=Fagus sylvatica OX=28930 GN=F... [more]
A0A2N9IJF67.6e-6235.20Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS52874 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT1G40390.18.4e-1325.51DNAse I-like superfamily protein [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR036691Endonuclease/exonuclease/phosphatase superfamilyGENE3D3.60.10.10Endonuclease/exonuclease/phosphatasecoord: 1..229
e-value: 4.0E-36
score: 127.0
IPR036691Endonuclease/exonuclease/phosphatase superfamilySUPERFAMILY56219DNase I-likecoord: 1..230
IPR005135Endonuclease/exonuclease/phosphatasePFAMPF03372Exo_endo_phoscoord: 6..222
e-value: 2.1E-11
score: 43.9
NoneNo IPR availablePANTHERPTHR33710BNAC02G09200D PROTEINcoord: 83..300
NoneNo IPR availablePANTHERPTHR33710:SF41BNAC03G48920D PROTEINcoord: 83..300

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10014678.1HG10014678.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003824 catalytic activity