Lag0019044 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0019044
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Ribonuclease H-like superfamily protein
Location: chr5: 37959554 .. 37960840 (+)
RNA-Seq Expression: Lag0019044
Synteny: Lag0019044
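
As a minimal sketch, the location string from the overview can be split into sequence ID, start, end, and strand, and the span length derived from it. The helper name `parse_location` and its regular expression are illustrative assumptions rather than anything provided by the database, and the coordinates are assumed to be 1-based and inclusive.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr5: 37959554 .. 37960840 (+)'.

    Hypothetical helper; assumes 1-based, inclusive coordinates.
    """
    pattern = r"\s*([^:\s]+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Inclusive span: both endpoints count as bases.
        "length": end - start + 1,
    }

loc = parse_location("chr5: 37959554 .. 37960840 (+)")
print(loc["seqid"], loc["strand"], loc["length"])  # chr5 + 1287
```

Under the inclusive-coordinate assumption the gene spans 1287 bp on the plus strand of chr5.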
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGACGACATCGGGGCAGTGGAACGAAGGGCTCATCCAGCAGCATTTTAGCCCTCAGGAGGTCGGTCTAATTTTGTCAATTCCTGTGTGGGTTGGGGCAGAGGATAAGTTTGTGTGGCATTATGAGAAGTCAGGCCTGTTTTCGGTTAAAAGTGGATATCGGTTGGGGCAGTCAGCTTGGCTTGCGCAGTTTCCATCTTCTTCTTCGAATGAGTCGATAATGGGTTGGTGGAAGGGGGTTTGGAAGATGCTTATCCCGAATAAGATCAAGATTTTTCTTTGGAGACTTTCTTTGGACCGCTTGCCCACGATTGATAATTTGGGCATTCGGGGCTGTGACGTTCTGAACGTTTGTGGCCTCTGTGGGCAAGGTGGGGAGTCCAGTCTGCATGTCTTCTGGCATTGCAAGTTTGTGAGGGCAGTTTTGATGGGATCCAAGTTTGGAAGTTTGATACAGAAGGTGCAGGGCTGGGTCTATGTTTGATCTTCTTAGGGAAGTGAGGGATGAGGTTGGTTGGGAAAGATTTGGGTTGTTTGTGGTGGTGTTGTGGCCTGTGTGGAACTGTCGAAATCAGCGGAAATTTAGGGGCCTAGAACCAGTGGTGGGACTCGTAGAGTGGGCGACGAGTTATATCTCGTCGTTCCAGCAGGCGATCTCAGCTTGTGGAGTGGGGGTGCAGAGTGTTGCTGCAAGGGAAGATGTGAGATGGAGTCCCCCGGAGGCTGGGTGGTATAAGGTGAATGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGAGTGGTGGTTCGGGACTCCTCTGGTCGGGTTATGCTGTCAGCGTCTTTGGTGCAGCGACATGTGCGAAGCCCGAAGATGGCTGAAGGTTGGGCCGCAGTTAAGGGAACGAGATTGGCAGTGGAGATGGGTTTGGGCCCTTGGTGTTGGAGACTAACTCCAGTCGGGTGGCTAGTTTCTTCCCAGATGAGGCAGGGGATGACTTCTCATATGTGGGTGCCCTGGTGGCGGACTTACGAAAGGACATGCCATGTCCTTCTTTTTTCTGCTGCAGGTTCACTCGAAGAGAGGGTAATGAGGTGGCTCACTAGTTGGCTTTTATGGCAGGGAGGGACGGAGAATCTAGGGTGTGGGTTGAGTCTGTACCCCAGTGTGTTGAGGGTTGATCCTTTCTGATATGGCACTGTTGTGATGTTTTTCTTGCAGGTTATGACGGAGGGCATGGAGCTAGTGAGGATCGAGAATGGATAGCAAACCAAAGGCCCTTCAAATTAGAAGGGTGA

mRNA sequence

ATGACGACATCGGGGCAGTGGAACGAAGGGCTCATCCAGCAGCATTTTAGCCCTCAGGAGGTCGGTCTAATTTTGTCAATTCCTGTGTGGGTTGGGGCAGAGGATAAGTTTGTGTGGCATTATGAGAAGTCAGGCCTGTTTTCGGTTAAAAGTGGATATCGGTTGGGGCAGTCAGCTTGGCTTGCGCAGTTTCCATCTTCTTCTTCGAATGAGTCGATAATGGGTTGGTGGAAGGGGGTTTGGAAGATGCTTATCCCGAATAAGATCAAGATTTTTCTTTGGAGACTTTCTTTGGACCGCTTGCCCACGATTGATAATTTGGGCATTCGGGGCTGTGACGTTCTGAACGTTTGTGGCCTCTGTGGGCAAGGTGGGGAGTCCAGTCTGCATGTCTTCTGGCATTGCAAAAGGTGCAGGGCTGGGTCTATGTTTGATCTTCTTAGGGAAGTGAGGGATGAGGTTGGTTGGGAAAGATTTGGGTTGTTTGTGGTGGTGTTGTGGCCTGTGTGGAACTGTCGAAATCAGCGGAAATTTAGGGGCCTAGAACCAGTGGTGGGACTCGTAGAGTGGGCGACGAGTTATATCTCGTCGTTCCAGCAGGCGATCTCAGCTTGTGGAGTGGGGGTGCAGAGTGTTGCTGCAAGGGAAGATGTGAGATGGAGTCCCCCGGAGGCTGGGTGGTATAAGGTGAATGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGAGTGGTGGTTCGGGACTCCTCTGGTCGGGTTATGCTGTCAGCGTCTTTGGTGCAGCGACATGTGCGAAGCCCGAAGATGGCTGAAGGTTGGGCCGCAGTTAAGGGAACGAGATTGGCAGTGGAGATGGGTTTGGGCCCTTGGTGTTGGAGACTAACTCCAGTCGGGTGGCTAGTTTCTTCCCAGATGAGGCAGGGGATGACTTCTCATATGTGGGTGCCCTGGTGGCGGACTTACGAAAGGACATGCCATGTCCTTCTTTTTTCTGCTGCAGGTTATGACGGAGGGCATGGAGCTAGTGAGGATCGAGAATGGATAGCAAACCAAAGGCCCTTCAAATTAGAAGGGTGA

Coding sequence (CDS)

ATGACGACATCGGGGCAGTGGAACGAAGGGCTCATCCAGCAGCATTTTAGCCCTCAGGAGGTCGGTCTAATTTTGTCAATTCCTGTGTGGGTTGGGGCAGAGGATAAGTTTGTGTGGCATTATGAGAAGTCAGGCCTGTTTTCGGTTAAAAGTGGATATCGGTTGGGGCAGTCAGCTTGGCTTGCGCAGTTTCCATCTTCTTCTTCGAATGAGTCGATAATGGGTTGGTGGAAGGGGGTTTGGAAGATGCTTATCCCGAATAAGATCAAGATTTTTCTTTGGAGACTTTCTTTGGACCGCTTGCCCACGATTGATAATTTGGGCATTCGGGGCTGTGACGTTCTGAACGTTTGTGGCCTCTGTGGGCAAGGTGGGGAGTCCAGTCTGCATGTCTTCTGGCATTGCAAAAGGTGCAGGGCTGGGTCTATGTTTGATCTTCTTAGGGAAGTGAGGGATGAGGTTGGTTGGGAAAGATTTGGGTTGTTTGTGGTGGTGTTGTGGCCTGTGTGGAACTGTCGAAATCAGCGGAAATTTAGGGGCCTAGAACCAGTGGTGGGACTCGTAGAGTGGGCGACGAGTTATATCTCGTCGTTCCAGCAGGCGATCTCAGCTTGTGGAGTGGGGGTGCAGAGTGTTGCTGCAAGGGAAGATGTGAGATGGAGTCCCCCGGAGGCTGGGTGGTATAAGGTGAATGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGAGTGGTGGTTCGGGACTCCTCTGGTCGGGTTATGCTGTCAGCGTCTTTGGTGCAGCGACATGTGCGAAGCCCGAAGATGGCTGAAGGTTGGGCCGCAGTTAAGGGAACGAGATTGGCAGTGGAGATGGGTTTGGGCCCTTGGTGTTGGAGACTAACTCCAGTCGGGTGGCTAGTTTCTTCCCAGATGAGGCAGGGGATGACTTCTCATATGTGGGTGCCCTGGTGGCGGACTTACGAAAGGACATGCCATGTCCTTCTTTTTTCTGCTGCAGGTTATGACGGAGGGCATGGAGCTAGTGAGGATCGAGAATGGATAGCAAACCAAAGGCCCTTCAAATTAGAAGGGTGA

Protein sequence

MTTSGQWNEGLIQQHFSPQEVGLILSIPVWVGAEDKFVWHYEKSGLFSVKSGYRLGQSAWLAQFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQGGESSLHVFWHCKRCRAGSMFDLLREVRDEVGWERFGLFVVVLWPVWNCRNQRKFRGLEPVVGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRLAVEMGLGPWCWRLTPVGWLVSSQMRQGMTSHMWVPWWRTYERTCHVLLFSAAGYDGGHGASEDREWIANQRPFKLEG
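
The relationship between the coding sequence and the protein sequence can be checked with a small translation routine. The sketch below assumes the standard genetic code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are illustrative, and only the first and last few codons of the CDS are used so the example stays short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Hypothetical helper; unknown codons are rendered as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS listed above.
cds_start = "ATGACGACATCGGGGCAGTGGAACGAAGGG"
# Last sixteen codons of the CDS, including the terminal TGA stop.
cds_end = "GATCGAGAATGGATAGCAAACCAAAGGCCCTTCAAATTAGAAGGGTGA"

print(translate(cds_start))  # MTTSGQWNEG
print(translate(cds_end))    # DREWIANQRPFKLEG
```

Both fragments agree with the corresponding ends of the protein sequence; translating the full CDS the same way would be the natural consistency check.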
Homology
BLAST of Lag0019044 vs. NCBI nr
Match: XP_022150918.1 (uncharacterized protein LOC111018954 [Momordica charantia])

HSP 1 Score: 246.5 bits (628), Expect = 3.5e-61
Identity = 133/316 (42.09%), Positives = 183/316 (57.91%), Query Frame = 0

Query: 5    GQWNEGLIQQHFSPQEVGLILSIPVWVGA-EDKFVWHYEKSGLFSVKSGYRLG-QSAWLA 64
            G W   +++  F+P E   ILSIP+  GA ED+ +W+YEK+G++SV+SGY++   +    
Sbjct: 762  GGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLNNPCV 821

Query: 65   QFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCG 124
            Q PSSSS+E +  WW G WKM IPNKIK+FLWRL LDRLPT  NL  RG ++ N C  CG
Sbjct: 822  QAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCG 881

Query: 125  QGGESSLHVFWHCKRCRA---------GSMFDLLREVRDEVGWERFGLFVVVLWPVWNCR 184
            + GE S+H+FW CK   A          S F +LRE  + +    F    VV+W +WN R
Sbjct: 882  RNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWGLWNQR 941

Query: 185  NQRKFRGLEPVV-----GLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGWY 244
            N R F      V      LVEWA  Y   F++A S    G   V    ++ W PP+ G Y
Sbjct: 942  NARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITG--RVTNTAEILWQPPDEGIY 1001

Query: 245  KVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRLAVEM 304
            K+N DASF      AGLG+++ +  G+VM +A+    +++S  MAE  AAV+G +LA E+
Sbjct: 1002 KINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEI 1061
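
The percentages in an HSP header follow directly from the match counts and the alignment length. As a small illustration using the figures reported for this hit (133 identical and 183 positive positions over 316 aligned columns), with the function name `percent` chosen here purely for illustration:

```python
def percent(matches, aligned_length):
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

identity = percent(133, 316)   # identical residues
positives = percent(183, 316)  # identical or conservatively substituted residues
print(identity, positives)     # 42.09 57.91
```

The denominator is the number of alignment columns, gaps included, not the length of either protein.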

BLAST of Lag0019044 vs. NCBI nr
Match: XP_024037590.1 (uncharacterized protein LOC112097210 [Citrus clementina])

HSP 1 Score: 166.0 bits (419), Expect = 6.0e-37
Identity = 112/334 (33.53%), Positives = 159/334 (47.60%), Query Frame = 0

Query: 6    QWNEGLIQQHFSPQEVGLILSIPV-WVGAEDKFVWHYEKSGLFSVKSGYRLGQSAWLAQF 65
            QW E LI QHF P++   I+ IP+     ED+ +WHY+K G +SVKSGY++       + 
Sbjct: 768  QWREDLILQHFRPEDAEAIMQIPLPKRPKEDQLIWHYDKKGYYSVKSGYQVAMRIKFPED 827

Query: 66   PSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQG 125
            PS S+++  +  W+ +WK+ IP K+KIFLWR + D LPT +NL  +      +C  C   
Sbjct: 828  PSCSNHDQNL--WRFIWKLAIPEKVKIFLWRAAHDLLPTAENLWKKKVLQEPMCQSCHCH 887

Query: 126  GESSLHVFWHCKRCRA----GSMFDLLREV-RDEVGW-----------ERFGLFVVVLWP 185
             E+  H    C R R      ++ + LR V R ++ W                   +LW 
Sbjct: 888  VETVSHALVECNRARKIWRYSNLAEELRGVYRCDIVWMLQFWPRQHAKVEGAEVAALLWA 947

Query: 186  VWNCRNQRKFRG-LEPVVGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGW 245
            +W  RN+  F G  E  + +V  A + + SF++      V      A    +WSPP  GW
Sbjct: 948  IWKARNKWLFEGKKENPLRVVANAEAIVESFKKIRQPEMVYKTKGNAERQKQWSPPPNGW 1007

Query: 246  YKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRLA-- 305
             KVNVDA+   E   AGLGVVVRDS G    +A    R   S  MAE  A   G ++A  
Sbjct: 1008 QKVNVDAAVDVENQMAGLGVVVRDSDGNCRAAAIKSLRLPGSVAMAEATAMEWGLKVAEK 1067

BLAST of Lag0019044 vs. NCBI nr
Match: KAF4401718.1 (hypothetical protein G4B88_000766 [Cannabis sativa])

HSP 1 Score: 159.1 bits (401), Expect = 7.3e-35
Identity = 91/304 (29.93%), Positives = 147/304 (48.36%), Query Frame = 0

Query: 1   MTTSGQWNEGLIQQHFSPQEVGLILSIPVWVGAEDKFVWHYEKSGLFSVKSGYRLGQSAW 60
           + + GQW    +Q+HF  +++  +  IP+ +  ED   W Y  +G + VKSGYR+G+   
Sbjct: 442 INSDGQWQIDKLQKHFHEEDIPWVQGIPIDLYVEDTLTWPYTPNGQYMVKSGYRIGREIN 501

Query: 61  LAQFPSSSSN-ESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCG 120
           L   P+ SSN E I  WWK +W M +P ++K+F WR+  + LP   NL  RG DV   C 
Sbjct: 502 L--HPTRSSNMEDIHKWWKMLWSMSLPPRMKLFGWRVCHNWLPAKINLAHRGMDVNLNCD 561

Query: 121 LCGQGGESSLHVFWHCKRCRA----------------GSMFDLLREVRDEVGWERFGLFV 180
           LCG   E+  H  W C + +                 GSMFD++  ++D +    F   +
Sbjct: 562 LCGHQAETLTHALWGCAKVKTIWKLVPWYHKCAHFKNGSMFDIMVTLKDHLHKSEFEEAI 621

Query: 181 VVLWPVWNCRNQRKFRGLEPV---VGLVEW-ATSYISSFQQAISACGVGVQSVAAREDVR 240
            ++W +W  RN  K+    PV   + L++W +T+Y  S         + ++    +   +
Sbjct: 622 KIMWAIWENRN--KYWNKLPVMNGIQLLDWISTAYPDSRNNKEQPMNIDMKHQQLK---K 681

Query: 241 WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAV 284
           W  P  G   VN DA+        G G + RD  G ++L+  +  +   S +MAE WA +
Sbjct: 682 WIRPPTGNISVNCDAAMNNGTAGVGTGFIWRDYEGNLLLAGMVYHQSCCSVEMAEAWAIL 738

BLAST of Lag0019044 vs. NCBI nr
Match: KAF8408042.1 (hypothetical protein HHK36_007182 [Tetracentron sinense])

HSP 1 Score: 158.3 bits (399), Expect = 1.2e-34
Identity = 103/309 (33.33%), Positives = 153/309 (49.51%), Query Frame = 0

Query: 7   WNEGLIQQHFSPQEVGLILSIPVWVG-AEDKFVWHYEKSGLFSVKSGYRL------GQSA 66
           WN  L+   F P E  LI SIP+      DK VWH+   G FSV+S Y L       +SA
Sbjct: 294 WNHTLLMTVFMPHEAELISSIPLSERLPPDKRVWHFTSKG-FSVRSAYHLTSTLRDRESA 353

Query: 67  WLAQFPSSSSNESIMG-WWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVC 126
             +   S S N S+ G  W  VW++ IP K+KIF+W+++L+ LP   NL  R   V NVC
Sbjct: 354 TSSSTSSLSWNGSLSGIKWSQVWQLAIPPKVKIFIWKVALNILPVRANLCKRKIPVENVC 413

Query: 127 GLCGQGGESSLHVFWHCKRCR----------------AGSMFDLLREVRDEVGWERFGLF 186
           G+CG+ GE+ LHV  +C   R                A S+   + E+    G E    F
Sbjct: 414 GVCGEEGETILHVLKNCHYARQVWLLSQLGLRSDATSADSLSSWVEEIMKSHGEEGLSAF 473

Query: 187 VVVLWPVWNCRNQRKFRGLEPV-VGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWS 246
            ++ W +W  RN+  F G++      V+ A   ++ F  A        +S++A     W 
Sbjct: 474 FMIAWSIWKHRNEYIFSGVKMTPFNCVQRANKLLADFHNANDR--AAPESISAARS--WL 533

Query: 247 PPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKG 291
            P    +KVN+D +   E   AG+GVVVRD +G ++ + S    + +S  + E  AA +G
Sbjct: 534 APPGDLFKVNIDGALHLEDRSAGVGVVVRDHNGDLIAAMSKRISNTQSAAVIEAIAAREG 593

BLAST of Lag0019044 vs. NCBI nr
Match: KAF4364303.1 (hypothetical protein G4B88_028423 [Cannabis sativa])

HSP 1 Score: 157.9 bits (398), Expect = 1.6e-34
Identity = 89/290 (30.69%), Positives = 146/290 (50.34%), Query Frame = 0

Query: 4   SGQWNEGLIQQHFSPQEVGLILSIPVWVGAE-DKFVWHYEKSGLFSVKSGYRLGQSAWLA 63
           +G WN  L++  F    V  ILS+P    ++ D + W +  SG +SV++GY + + A   
Sbjct: 356 NGDWNIPLLRASFQQDTVNDILSLPPPDPSKPDTYFWQHSTSGHYSVRTGYHVAKQAINR 415

Query: 64  QFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCG 123
             PSSS+ E++  WWK +W++ IP KI+ F++RL+   LPT +NL  R C    +C  C 
Sbjct: 416 VQPSSSNTETLTRWWKSLWRLPIPPKIRHFVYRLAQHSLPTTNNLYNRHCISSPICPKCS 475

Query: 124 QGGESSLHVFWHCKRCRAGSMFDLLREVRDEVGW-ERFGLFVVVLWPVWNCRNQRKFRGL 183
              ES  H  + C+  +                W + F LF+ +LW  WN RN   FR  
Sbjct: 476 LCFESVQHALFECQEMKK--------------AWSDEFNLFLCMLWKCWNARNASVFRNQ 535

Query: 184 EPVVGLVEW-ATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGWYKVNVDASFRRER 243
                 +E  A  Y++ +Q A        QS + R+ + W PP  G+ K+N DA+    +
Sbjct: 536 VSRPETIEQEAQDYLAFYQAAQDKRWNHSQSTSDRDLLVWEPPPVGFLKLNTDAAISSHQ 595

Query: 244 WQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRLAVEMGL 291
            + G G +VRD +G+++ + +  +     P+ AEGWA ++  +   + G+
Sbjct: 596 NRTGGGALVRDHTGKIIAATAFNRIGQLHPQAAEGWALLEALKWCQDKGI 631

BLAST of Lag0019044 vs. ExPASy Swiss-Prot
Match: P0C2F6 (Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1g65750 PE=3 SV=1)

HSP 1 Score: 90.9 bits (224), Expect = 3.2e-17
Identity = 74/282 (26.24%), Positives = 116/282 (41.13%), Query Frame = 0

Query: 32  GAEDKFVWHYEKSGLFSVKSGYRLGQSAWLAQFPSSSSNESIMGWWKGVWKMLIPNKIKI 91
           GA D+  W + + G FSV+S Y +     + + P      ++  ++  +WK+ +P ++K 
Sbjct: 250 GARDRLSWKFSQDGQFSVRSAYEM---LTVDEVP----RPNMASFFNCLWKVRVPERVKT 309

Query: 92  FLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQGGESSLHVFWHC------------KRCR 151
           FLW +    + T +    R     NVC +C  G ES LHV   C            +R +
Sbjct: 310 FLWLVGNQAVMTEEERHRRHLSASNVCQVCKGGVESMLHVLRDCPAQLGIWVRVVPQRRQ 369

Query: 152 AG----SMFDLL------REVRDEVGWERFGLFVVVLWPVWNCRNQRKF----RGLEPVV 211
            G    S+F+ L      R   +++ W    +F V++W  W  R    F    +  + V 
Sbjct: 370 QGFFSKSLFEWLYDNLGDRSGCEDIPWST--IFAVIIWWGWKWRCGNIFGENTKCRDRVK 429

Query: 212 GLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGWYKVNVDASFRRERWQAGL 271
            + EWA     +    +    VG+        + W  P  GW KVN D + R     A  
Sbjct: 430 FVKEWAVEVYRAHSGNVL---VGITQPRVERMIGWVSPCVGWVKVNTDGASRGNPGLASA 489

Query: 272 GVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRLAVE 288
           G V+RD +G      SL      +P+ AE W    G   A E
Sbjct: 490 GGVLRDCTGAWCGGFSLNIGRCSAPQ-AELWGVYYGLYFAWE 518

BLAST of Lag0019044 vs. ExPASy TrEMBL
Match: A0A6J1DAR4 (uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018954 PE=4 SV=1)

HSP 1 Score: 246.5 bits (628), Expect = 1.7e-61
Identity = 133/316 (42.09%), Positives = 183/316 (57.91%), Query Frame = 0

Query: 5    GQWNEGLIQQHFSPQEVGLILSIPVWVGA-EDKFVWHYEKSGLFSVKSGYRLG-QSAWLA 64
            G W   +++  F+P E   ILSIP+  GA ED+ +W+YEK+G++SV+SGY++   +    
Sbjct: 762  GGWQGDVVRDEFTPDEAKGILSIPIGRGAEEDRLIWNYEKTGVYSVRSGYKVALLNNPCV 821

Query: 65   QFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCG 124
            Q PSSSS+E +  WW G WKM IPNKIK+FLWRL LDRLPT  NL  RG ++ N C  CG
Sbjct: 822  QAPSSSSSEEVRCWWNGFWKMHIPNKIKVFLWRLCLDRLPTGCNLSKRGVEITNCCYFCG 881

Query: 125  QGGESSLHVFWHCKRCRA---------GSMFDLLREVRDEVGWERFGLFVVVLWPVWNCR 184
            + GE S+H+FW CK   A          S F +LRE  + +    F    VV+W +WN R
Sbjct: 882  RNGEDSIHLFWICKFAEALWINSKFGKLSPFLILRESHESLSKADFEELCVVIWGLWNQR 941

Query: 185  NQRKFRGLEPVV-----GLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGWY 244
            N R F      V      LVEWA  Y   F++A S    G   V    ++ W PP+ G Y
Sbjct: 942  NARAFNDSTKTVFKIGMELVEWANKYAMEFREAKSNPITG--RVTNTAEILWQPPDEGIY 1001

Query: 245  KVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRLAVEM 304
            K+N DASF      AGLG+++ +  G+VM +A+    +++S  MAE  AAV+G +LA E+
Sbjct: 1002 KINTDASFLASDQHAGLGIIIHNDRGQVMAAATKYLENIQSVDMAEAIAAVEGLQLASEI 1061

BLAST of Lag0019044 vs. ExPASy TrEMBL
Match: A0A803QQT2 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 176.8 bits (447), Expect = 1.6e-40
Identity = 106/306 (34.64%), Positives = 156/306 (50.98%), Query Frame = 0

Query: 5   GQWNEGLIQQHFSPQEVGLILSIPV--WVGAEDKFVWHYEKSGLFSVKSGYRLGQSAWLA 64
           GQW+EG I+  F+P +V LIL IP   W   EDK +WHY K G +SVKSGYR+  S    
Sbjct: 315 GQWDEGFIRSIFNPTDVDLILGIPCSDW-DFEDKILWHYSKYGEYSVKSGYRMAASFTTE 374

Query: 65  QFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCG 124
           Q    S+  SI+ WWK +W++ IP K+K F+W+++ + LP   NL  RG     VC  C 
Sbjct: 375 Q--HQSNEHSIVQWWKKLWRLKIPPKVKHFVWKVAHNWLPANVNLAKRGIASSVVCSRCS 434

Query: 125 QG-GESSLHVFWHCKRC----RAGSMFDLLREVRDE----------VGW--ERFGLFVVV 184
               ES  H  W CK      R   ++D L+++  E            W  E+   F++V
Sbjct: 435 SHVDESVAHALWECKASKGYWRVSGLYDDLKQMLGEDNLTMLMRIAAEWDKEKLEFFLLV 494

Query: 185 LWPVWNCRNQRKFRGLEP-VVGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPE 244
            W +WN RN     G  P    ++EW  ++++ F+          +S  + ED RW PP 
Sbjct: 495 SWNIWNVRNTVVHGGYHPKPEEMIEWCGNFLADFRGDTGR----ERSQRSSEDSRWVPPA 554

Query: 245 AGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRL 291
                +NVDA  ++    +GLG VVRD++G V+ +A+ V +    P   E  A  KG ++
Sbjct: 555 RDQVTINVDAGVKQGGLISGLGYVVRDAAGVVLSAAATVLQQELPPLQLELMAIKKGIQV 613

BLAST of Lag0019044 vs. ExPASy TrEMBL
Match: A0A803PEK8 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 176.0 bits (445), Expect = 2.8e-40
Identity = 95/272 (34.93%), Positives = 147/272 (54.04%), Query Frame = 0

Query: 5    GQWNEGLIQQHFSPQEVGLILSIPV--WVGAEDKFVWHYEKSGLFSVKSGYRLGQSAWLA 64
            G+W+E  ++  F+ ++  LILS P   W   EDK +WHY K+G ++VKSGY++  S    
Sbjct: 1220 GRWDENFVRSVFNMEDAELILSTPSTGW-DLEDKIMWHYSKNGEYTVKSGYKMASSLATE 1279

Query: 65   QFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLC- 124
            Q+   S ++  + WWK +W + IP KIK F+W+L+ + +PT  NL  RG  + N+C  C 
Sbjct: 1280 QY--QSDDQLYVDWWKTLWHLKIPPKIKHFVWKLAYNWIPTSANLAKRGVALDNICERCS 1339

Query: 125  GQGGESSLHVFWHCKRCR----AGSMFDLLREVRDE----------VGWE--RFGLFVVV 184
            G   E++ H  W CKR +       + D +++++ E            W+  RF  F+V+
Sbjct: 1340 GHVVETTAHALWECKRSKELWAVSGLKDDMKQIKGEDLLSFLMRMARLWDKTRFEFFLVI 1399

Query: 185  LWPVWNCRNQRKFRGLEPVVG-LVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPE 244
             W +WN RN     G  P+ G +V+W  ++++ FQ      G  V S   RE+ +W  P+
Sbjct: 1400 TWNIWNVRNNVVHGGKAPIAGDMVDWYRTFLTEFQGE----GAAVGSRVRRENAKWGAPD 1459

Query: 245  AGWYKVNVDASFRRERWQAGLGVVVRDSSGRV 257
             G  K+NVDA  +     +GLG V RD  GRV
Sbjct: 1460 MGQMKLNVDARVKGGGGVSGLGCVARDHGGRV 1484

BLAST of Lag0019044 vs. ExPASy TrEMBL
Match: A0A803QE56 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 169.1 bits (427), Expect = 3.4e-38
Identity = 101/307 (32.90%), Positives = 153/307 (49.84%), Query Frame = 0

Query: 4    SGQWNEGLIQQHFSPQEVGLILSIPV--WVGAEDKFVWHYEKSGLFSVKSGYRLGQSAWL 63
            +G W+E  ++  F+ ++  +IL +P   W   EDK +WHY K+G +SVKSGY +     L
Sbjct: 1143 NGCWDEEFVRVVFNEEDADIILKLPSTGW-DIEDKIMWHYTKNGEYSVKSGYCMAME--L 1202

Query: 64   AQFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLC 123
             +  + S+ + +M WW+G+WK+ +P K+K F+W+++   LPT   L  RG DV   C  C
Sbjct: 1203 RKEVTQSNEKDMMAWWRGMWKLKLPPKVKHFVWKMANSWLPTHSALTSRGMDVDPRCSRC 1262

Query: 124  GQGG-ESSLHVFWHC-------KRC-------RAGSMFDLLREVRDEVGWER--FGLFVV 183
              GG E+  H  W C       KR        R GS   L   +R    WE+  F LF+V
Sbjct: 1263 SNGGRENIFHALWRCHANKDVWKRFGIQHQIKRQGSEDVLAFFMRISKAWEKETFELFLV 1322

Query: 184  VLWPVWNCRNQRKFRGLEP-VVGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPP 243
            V W +W  RN  K  G++P    + EW   Y+  ++      G G      R       P
Sbjct: 1323 VSWQLWYIRNNTKHGGIQPKATEVFEWCVQYLEEYR------GHGPTVTTGRGRGAQRVP 1382

Query: 244  EAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTR 291
              G +K+NVDA  +R R  +G+  V++DSSG V  ++S +      P  AE  A   G +
Sbjct: 1383 HTGVWKINVDAGVKRGRGWSGVSCVIQDSSGCVSYASSTILHREYQPLHAELMAIHGGLQ 1440

BLAST of Lag0019044 vs. ExPASy TrEMBL
Match: A0A803PRL1 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 167.9 bits (424), Expect = 7.6e-38
Identity = 93/306 (30.39%), Positives = 157/306 (51.31%), Query Frame = 0

Query: 4    SGQWNEGLIQQHFSPQEVGLILSIPVWVGAE-DKFVWHYEKSGLFSVKSGYRLGQSAWLA 63
            +G WN  L++ +F    V  ILS+P    ++ D + W +  SG +SV++GY + + A   
Sbjct: 1674 NGDWNIPLLRAYFQQDTVNDILSLPPPDPSKPDTYFWQHSTSGHYSVRTGYHVAKQAINR 1733

Query: 64   QFPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCG 123
              PSSS+ E++  WWK +W++ IP KI+ F++RL+   LPT +NL  R C    +C  C 
Sbjct: 1734 VQPSSSNTETLTRWWKSLWRLPIPPKIRHFVYRLAQHSLPTTNNLYNRHCISSPICPRCS 1793

Query: 124  QGGESSLHVFWHCKRCRAG----------------SMFDLLREVRDEVGWERFGLFVVVL 183
               ES  H  + C+  +                  ++FD+L  ++     + F LF+ +L
Sbjct: 1794 LCFESVQHALFECQEMKKAWSGTIFISIIKTTKHMNIFDILLLMQLNFSKDEFNLFLCML 1853

Query: 184  WPVWNCRNQRKFRG--LEPVVGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPE 243
            W  WN RN   FR   L P   + + A  Y++ +Q A+   G   QS + R+ + W PP 
Sbjct: 1854 WKCWNARNASIFRNQTLRPET-IEQEAQDYLAFYQAALDKRGNHSQSTSDRDLLVWEPPP 1913

Query: 244  AGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAEGWAAVKGTRL 291
             G  K+N DA+    + + G G +VRD +G+++ + +  +     P+ AEGWA ++  + 
Sbjct: 1914 VGLLKLNTDAAISSHQNRTGGGALVRDHTGKIIAATAFNRIGQLQPQAAEGWALLEALKW 1973

BLAST of Lag0019044 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 91.7 bits (226), Expect = 1.3e-18
Identity = 81/307 (26.38%), Positives = 124/307 (40.39%), Query Frame = 0

Query: 6   QWNEGLIQQHFSPQEVGLILSI-PVWVGAEDKFVWHYEKSGLFSVKSGY-RLGQSAWLAQ 65
           +W + +I+  F   E  LI  + P      D + W Y  SG ++VKSGY  L Q      
Sbjct: 181 EWRKDVIEMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKRS 240

Query: 66  FPSSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQ 125
            P   S  S+   ++ +WK     KI+ FLW+   + LP    L  R     + C  C  
Sbjct: 241 SPQEVSEPSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACIRCPS 300

Query: 126 GGESSLHVFWHCKRCR----------------AGSMFDLLREV----RDEVGWERFGLFV 185
             E+  H+ + C   R                A S++  L  V         WE+    V
Sbjct: 301 CKETVNHLLFKCTFARLTWAISSIPIPLGGEWADSIYVNLYWVFNLGNGNPQWEKASQLV 360

Query: 186 V-VLWPVWNCRNQRKFRGLE-PVVGLVEWATSYISSFQQAISACGVGVQSVAAREDV-RW 245
             +LW +W  RN+  FRG E     ++  A   +  ++    A   G +    R    RW
Sbjct: 361 PWLLWRLWKNRNELVFRGREFNAQEVLRRAEDDLEEWRIRTEAESCGTKPQVNRSSCGRW 420

Query: 246 SPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAE----GW 284
            PP   W K N DA++ R+  + G+G V+R+  G V    +     ++S   AE     W
Sbjct: 421 RPPPHQWVKCNTDATWNRDNERCGIGWVLRNEKGEVKWMGARALPKLKSVLEAELEAMRW 480

BLAST of Lag0019044 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 80.9 bits (198), Expect = 2.4e-15
Identity = 71/295 (24.07%), Positives = 117/295 (39.66%), Query Frame = 0

Query: 7   WNEGLIQQHFSPQEVGLILSIPVWVGAE-DKFVWHYEKSGLFSVKSGYRLGQSAWLAQFP 66
           W++  I Q     + G I  I +    + DK +W+Y  +G ++V+SGY L         P
Sbjct: 88  WDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWNYNTTGEYTVRSGYWLLTHDPSTNIP 147

Query: 67  SSSSNESIMGWWKGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQGG 126
           + +     +     +W + I  K+K FLWR     L T + L  RG  +   C  C +  
Sbjct: 148 AINPPHGSIDLKTRIWNLPIMPKLKHFLWRALSQALATTERLTTRGMRIDPSCPRCHREN 207

Query: 127 ESSLHVFWHCK------RCRAGSMF--------------DLLREVRDEVGWERFGLFVV- 186
           ES  H  + C       R    S+               ++L  V+D    +   L  V 
Sbjct: 208 ESINHALFTCPFATMAWRLSDSSLIRNQLMSNDFEENISNILNFVQDTTMSDFHKLLPVW 267

Query: 187 VLWPVWNCRNQ---RKFR--GLEPVVGLVEWATSYISSFQQAISACGVGVQSVAAREDVR 246
           ++W +W  RN     KFR    + V+        ++++ Q          Q   A   + 
Sbjct: 268 LIWRIWKARNNVVFNKFRESPSKTVLSAKAETHDWLNATQSHKKTPSPTRQ--IAENKIE 327

Query: 247 WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPKMAE 275
           W  P A + K N DA F  ++ +A  G ++R+  G  +   S+   H  +P  AE
Sbjct: 328 WRNPPATYVKCNFDAGFDVQKLEATGGWIIRNHYGTPISWGSMKLAHTSNPLEAE 380

BLAST of Lag0019044 vs. TAIR 10
Match: AT3G25270.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 67.8 bits (164), Expect = 2.1e-11
Identity = 61/215 (28.37%), Positives = 89/215 (41.40%), Query Frame = 0

Query: 80  VWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQGGESSLHVFWHCKRCR 139
           +WK+    KIK FLW+L    L T DNL  R       C  C Q  E+S H+F+ C   +
Sbjct: 18  IWKLKTAPKIKHFLWKLLSGALATGDNLKRRHIRNHPQCHRCCQEDETSQHLFFDCFYAQ 77

Query: 140 -----AGSMFDLLR------EVRDEV---------GWERFGLFVVVLWPVWNCRNQRKFR 199
                +G     LR      E + E+           + F L + +LW +W  RNQ  F+
Sbjct: 78  QVWRASGIPHQELRTTGITMETKMELLLSSCLANRQPQLFNLAIWILWRLWKSRNQLVFQ 137

Query: 200 --------GLEPVVGLV-EW--ATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAGWY 259
                    L+     V EW    +Y+ S  Q + +     Q   AR   +W  P + W 
Sbjct: 138 QKSISWQNTLQRARNDVQEWEDTNTYVQSLNQQVHS-SRHQQPTMAR--TKWQRPPSTWI 197

Query: 260 KVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLV 264
           K N D +F  +   A  G ++RD +G  M S   +
Sbjct: 198 KYNYDGAFNHQTRNAKAGWLMRDENGVYMGSGQAI 229

BLAST of Lag0019044 vs. TAIR 10
Match: AT2G02650.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 62.8 bits (151), Expect = 6.6e-10
Identity = 47/212 (22.17%), Positives = 87/212 (41.04%), Query Frame = 0

Query: 78  KGVWKMLIPNKIKIFLWRLSLDRLPTIDNLGIRGCDVLNVCGLCGQGGESSLHVFWHCKR 137
           + +WK+ +  KIK FLWR     L T   L  R  D   +C  C    E+  H+ ++C  
Sbjct: 35  QAIWKLHVAPKIKHFLWRCVTGALATNTRLRSRNIDADPICQRCCIEEETIHHIMFNCPY 94

Query: 138 CRA---------------GSMFD-------LLREVRDEVGWERFGLFVVVLWPVWNCRN- 197
            ++                S F+        L + +     +RF L   ++W +W  RN 
Sbjct: 95  TQSVWRSANIIIGNQWGPPSSFEDNLNRLIQLSKTQTTNSLDRF-LPFWIMWRLWKSRNV 154

Query: 198 ---QRK-----FRGLEPVVGLVEWATSYISSFQQAISACGVGVQSVAAREDVRWSPPEAG 257
              Q+K     +   + +    EW  +  ++    +      +Q+ + R+  +W+PP  G
Sbjct: 155 FLFQQKCQSPDYEARKGIQDATEWLNANETTENTNVHVATNPIQT-SRRDSSQWNPPPEG 214

Query: 258 WYKVNVDASFRRERWQAGLGVVVRDSSGRVML 259
           W K N D+ + +       G  +R+ +G ++L
Sbjct: 215 WVKCNFDSGYTQGSPYTRSGWTIRECNGHIVL 244

BLAST of Lag0019044 vs. TAIR 10
Match: AT2G34320.1 (Polynucleotidyl transferase, ribonuclease H-like superfamily protein )

HSP 1 Score: 49.3 bits (116), Expect = 7.6e-06
Identity = 29/102 (28.43%), Positives = 48/102 (47.06%), Query Frame = 0

Query: 165 VLWPVWNCRNQRKFRGLE---------PVVGLVEWATSYISSFQQAISACGVGVQSVAAR 224
           +LW +W  RN+  F+G E          +    EW+T      ++ +     G Q V   
Sbjct: 80  LLWRLWKSRNELMFKGKEYDAPEVLRRAMEDFEEWST------RRELEGKASGPQ-VERN 139

Query: 225 EDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVM 258
             V+W  P   W K N DA+++ E  + G+G ++R+ SG V+
Sbjct: 140 LSVQWKAPPYQWVKCNTDATWQLENPRCGIGWILRNESGGVL 174

The following BLAST results are available for this feature:

NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022150918.1 | 3.5e-61 | 42.09 | uncharacterized protein LOC111018954 [Momordica charantia]
XP_024037590.1 | 6.0e-37 | 33.53 | uncharacterized protein LOC112097210 [Citrus clementina]
KAF4401718.1 | 7.3e-35 | 29.93 | hypothetical protein G4B88_000766 [Cannabis sativa]
KAF8408042.1 | 1.2e-34 | 33.33 | hypothetical protein HHK36_007182 [Tetracentron sinense]
KAF4364303.1 | 1.6e-34 | 30.69 | hypothetical protein G4B88_028423 [Cannabis sativa]

ExPASy Swiss-Prot
Match Name | E-value | Identity (%) | Description
P0C2F6 | 3.2e-17 | 26.24 | Putative ribonuclease H protein At1g65750 OS=Arabidopsis thaliana OX=3702 GN=At1...

ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DAR4 | 1.7e-61 | 42.09 | uncharacterized protein LOC111018954 OS=Momordica charantia OX=3673 GN=LOC111018...
A0A803QQT2 | 1.6e-40 | 34.64 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803PEK8 | 2.8e-40 | 34.93 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803QE56 | 3.4e-38 | 32.90 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803PRL1 | 7.6e-38 | 30.39 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1

TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G29090.1 | 1.3e-18 | 26.38 | Ribonuclease H-like superfamily protein
AT3G09510.1 | 2.4e-15 | 24.07 | Ribonuclease H-like superfamily protein
AT3G25270.1 | 2.1e-11 | 28.37 | Ribonuclease H-like superfamily protein
AT2G02650.1 | 6.6e-10 | 22.17 | Ribonuclease H-like superfamily protein
AT2G34320.1 | 7.6e-06 | 28.43 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR026960 | Reverse transcriptase zinc-binding domain | PFAM | PF13966 | zf-RVT | coord: 58..136; e-value: 1.7E-16; score: 60.7
IPR002156 | Ribonuclease H domain | PFAM | PF13456 | RVT_3 | coord: 231..290; e-value: 7.0E-8; score: 32.3
None | No IPR available | PANTHER | PTHR46736 | FAMILY NOT NAMED | coord: 10..290
None | No IPR available | PANTHER | PTHR46736:SF6 | SUBFAMILY NOT NAMED | coord: 10..290
IPR044730 | Ribonuclease H-like domain, plant type | CDD | cd06222 | RNase_H_like | coord: 230..290; e-value: 3.4132E-8; score: 49.62

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
Lag0019044.1 | Lag0019044.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0003676 | nucleic acid binding
molecular_function | GO:0004523 | RNA-DNA hybrid ribonuclease activity