Lag0040851 (gene) Sponge gourd (AG‐4) v1

Overview
NameLag0040851
Typegene
OrganismLuffa acutangula (Sponge gourd (AG‐4) v1)
DescriptionReverse transcriptase domain-containing protein
Locationchr13: 8982489 .. 8983965 (-)
RNA-Seq ExpressionLag0040851
SyntenyLag0040851
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGTGTTTTCAGATTCCCAAAGGTATATTGACTAAAATCTCGGCTCTCTGTGCCAAGTTTTGGTGGGGCTCGAATGGAGATCACTGTCGAATGCATTGGCAACGATGGGAGAACTTATGTAGGCCAAAGGAGATTGGAGGTTTAAACTTTAGAGATCTTGTCAATTTCAATCAGGCAATGTTGGCGAAGCAAGCATGGCGAGTTTTAACTAATCCAAATCTGACGGTTTCAAGAGTTTTATGTGGGAAATATTTCCTGTCGAAATCAGTCCTATCGGGGTCAGTTAAACCATCATCATCTTTTTTTTGGGAAAGGTTTGGTTTGGGGGATGGATCTTTTAAAGTGTGGGATCAGGAAGAACCTAGGGAATGGGCAATCAATTTATATGTTCCAGGACCCGTGGCTCCCTCGACCTACTACCTTTAAGGTGGTCTCTCGTATGGATCCAAGGATGAAGGACGCGACAGTGGCTGATTTTATTACCCCATCTCTTCATTGGGATATACCCAAACTTAACAAGCATTTGGTGCCTCTTGATGTGGAGGTTATAAAAGGCTTGTCGATTAGTGGTACGACACCAGATAGATGGATATGGCATTATGATGGTAGAAGGGTGTATTCTGTTAAGAGTGGGTATAAGCTCTCGATGCTAACCAGCCAGAGGGGACATTTGTCAGATATGGGAAGGAATAGCACTTGGTGGAAGAAGGTGTGGAAGATGATAGTTCCTAGCAAAGTAAAAGTTTTTGTTTGGAAATCTTTTCATAACTCAATTCCCGCTATGGTTAACCTCTGGAATCATCATGTGCCTGTCTTGGGAAATTGTCTGGTGAGCCATGAAGAGATGGAAACTACGGACCATGCTCTGTTTCAGTGTTCGAGGGTTAGAGAGATTTAGACCCTTCTTCATCCACATATTTTGAGAAATCTTTGGGATCAAATGGATATCAAAGATCAATGGCAAGGATTTTCTCATGAGCCATTGCAGGTTTTTTAGAGTATCTGTGTGGGTGTCTGGTCGATATGGAATGATAGGAATAACGTGGTTCATAATCGTCCAATTCCGGATCCAAGGATTAGATGTGAATGGATCAATGATTATCTTTCGAAGTTCCGGATGGCTAATCCAAATGGCGGTTCGGTTGTTCAGTCAATGGCAGATATTGTTAATATTATATCAAAGGGGGAAGAGTTTATAATGCATATGGACGCTTGTGTAATGGGTAAGCAGAGTAACGCTGGCATTGGTATTGTTCTGCATGATAAAGACGGTGTGCTAATGGCGGTGCAGAACTTATCGACTATGGCGAACAATTCTCCTTTGGAAGCAAAAGCAGTGGCGGTCCTTGAAGGGCTACGTTTGGCTAGGAGATTGAATGTGGAGAGACTATCTATTTTGTCAGATTCACTATTGTTGATAAAATCCATTAATGAGGAAACGCAAGTGGAGACCTGTATAGCTGTGACTATCTAG

mRNA sequence

ATGGGGTGTTTTCAGATTCCCAAAGGTATATTGACTAAAATCTCGGCTCTCTGTGCCAAGTTTTGGTGGGGCTCGAATGGAGATCACTGTCGAATGCATTGGCAACGATGGGAGAACTTATGTAGGCCAAAGGAGATTGGAGGTTTAAACTTTAGAGATCTTGTCAATTTCAATCAGGCAATGTTGGCGAAGCAAGCATGGCGAGTTTTAACTAATCCAAATCTGACGGTTTCAAGAGTTTTATGTGGGAAATATTTCCTGTCGAAATCAGTCCTATCGGGGAAGAACCTAGGGAATGGGCAATCAATTTATATGTTCCAGGACCCGTGGCTCCCTCGACCTACTACCTTTAAGGTGGTCTCTCGTATGGATCCAAGGATGAAGGACGCGACAGTGGCTGATTTTATTACCCCATCTCTTCATTGGGATATACCCAAACTTAACAAGCATTTGGTGCCTCTTGATGTGGAGGTTATAAAAGGCTTGTCGATTAGTGGTACGACACCAGATAGATGGATATGGCATTATGATGGTAGAAGGGTGTATTCTGTTAAGAGTGGGTATAAGCTCTCGATGCTAACCAGCCAGAGGGGACATTTGTCAGATATGGGAAGGAATAGCACTTGGTGGAAGAAGGTGTGGAAGATGATAGTTCCTAGCAAAGTAAAAGTTTTTGTTTGGAAATCTTTTCATAACTCAATTCCCGCTATGGTTAACCTCTGGAATCATCATGTGCCTGTCTTGGGAAATTGTCTGAGTATCTGTGTGGGTGTCTGGTCGATATGGAATGATAGGAATAACGTGGTTCATAATCGTCCAATTCCGGATCCAAGGATTAGATGTGAATGGATCAATGATTATCTTTCGAAGTTCCGGATGGCTAATCCAAATGGCGGTTCGGTTGTTCAGTCAATGGCAGATATTGTTAATATTATATCAAAGGGGGAAGAGTTTATAATGCATATGGACGCTTGTGTAATGGGTAAGCAGAGTAACGCTGGCATTGGTATTGTTCTGCATGATAAAGACGGTGTGCTAATGGCGGTGCAGAACTTATCGACTATGGCGAACAATTCTCCTTTGGAAGCAAAAGCAGTGGCGGTCCTTGAAGGGCTACGTTTGGCTAGGAGATTGAATGTGGAGAGACTATCTATTTTGTCAGATTCACTATTGTTGATAAAATCCATTAATGAGGAAACGCAAGTGGAGACCTGTATAGCTGTGACTATCTAG

Coding sequence (CDS)

ATGGGGTGTTTTCAGATTCCCAAAGGTATATTGACTAAAATCTCGGCTCTCTGTGCCAAGTTTTGGTGGGGCTCGAATGGAGATCACTGTCGAATGCATTGGCAACGATGGGAGAACTTATGTAGGCCAAAGGAGATTGGAGGTTTAAACTTTAGAGATCTTGTCAATTTCAATCAGGCAATGTTGGCGAAGCAAGCATGGCGAGTTTTAACTAATCCAAATCTGACGGTTTCAAGAGTTTTATGTGGGAAATATTTCCTGTCGAAATCAGTCCTATCGGGGAAGAACCTAGGGAATGGGCAATCAATTTATATGTTCCAGGACCCGTGGCTCCCTCGACCTACTACCTTTAAGGTGGTCTCTCGTATGGATCCAAGGATGAAGGACGCGACAGTGGCTGATTTTATTACCCCATCTCTTCATTGGGATATACCCAAACTTAACAAGCATTTGGTGCCTCTTGATGTGGAGGTTATAAAAGGCTTGTCGATTAGTGGTACGACACCAGATAGATGGATATGGCATTATGATGGTAGAAGGGTGTATTCTGTTAAGAGTGGGTATAAGCTCTCGATGCTAACCAGCCAGAGGGGACATTTGTCAGATATGGGAAGGAATAGCACTTGGTGGAAGAAGGTGTGGAAGATGATAGTTCCTAGCAAAGTAAAAGTTTTTGTTTGGAAATCTTTTCATAACTCAATTCCCGCTATGGTTAACCTCTGGAATCATCATGTGCCTGTCTTGGGAAATTGTCTGAGTATCTGTGTGGGTGTCTGGTCGATATGGAATGATAGGAATAACGTGGTTCATAATCGTCCAATTCCGGATCCAAGGATTAGATGTGAATGGATCAATGATTATCTTTCGAAGTTCCGGATGGCTAATCCAAATGGCGGTTCGGTTGTTCAGTCAATGGCAGATATTGTTAATATTATATCAAAGGGGGAAGAGTTTATAATGCATATGGACGCTTGTGTAATGGGTAAGCAGAGTAACGCTGGCATTGGTATTGTTCTGCATGATAAAGACGGTGTGCTAATGGCGGTGCAGAACTTATCGACTATGGCGAACAATTCTCCTTTGGAAGCAAAAGCAGTGGCGGTCCTTGAAGGGCTACGTTTGGCTAGGAGATTGAATGTGGAGAGACTATCTATTTTGTCAGATTCACTATTGTTGATAAAATCCATTAATGAGGAAACGCAAGTGGAGACCTGTATAGCTGTGACTATCTAG

Protein sequence

MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGKNLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISGTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEETQVETCIAVTI
Homology
BLAST of Lag0040851 vs. NCBI nr
Match: XP_022158377.1 (uncharacterized protein LOC111024874 [Momordica charantia])

HSP 1 Score: 243.4 bits (620), Expect = 3.3e-60
Identity = 152/467 (32.55%), Postives = 211/467 (45.18%), Query Frame = 0

Query: 27   GDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYF 86
            G+  ++HW +W  +C PKE GGLNFRDL  FNQA++AK  WR L +PNL VS+VL  KYF
Sbjct: 924  GESRKLHWMKWGRMCYPKECGGLNFRDLEGFNQALVAKHVWRFLQHPNLLVSKVLKHKYF 983

Query: 87   LSKSVLSGKN-------------------------LGNGQSIYMFQDPWLPRPTTFKVVS 146
               S+L   N                         +GNG +I  F DPWLPRPTTFK + 
Sbjct: 984  KDTSLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDPWLPRPTTFKPL- 1043

Query: 147  RMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRR 206
            R +    D TVA FIT   +WD+  ++      D ++I  + IS     D W+WHYD R 
Sbjct: 1044 RFNNGALDTTVASFITADGNWDVTSISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRG 1103

Query: 207  VYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNL 266
             YSV+SGYKL M        +      T W  +WK+ VP+K+K+F+W+S H  IP   NL
Sbjct: 1104 NYSVRSGYKLYMHLKCNATSASTNYRGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNL 1163

Query: 267  ----------------------------------WNHHVPVLGNCLS------------- 326
                                              W    P L  CLS             
Sbjct: 1164 LLRGIGELPACTICGDRRESIIHAFFHCKRARQIWRTLFPFL-TCLSAEDNISFLELWSS 1223

Query: 327  ------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYL---SKFRMAN--- 386
                          +  W IWNDRN+++H + +     +CEW+  +L   S+ +M+N   
Sbjct: 1224 LTEQLEPKDLNLAAITGWGIWNDRNSLIHGKQVSPVEFKCEWLTPFLDSHSQAQMSNYSP 1283

Query: 387  ---PNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNL 400
                N   VVQ      ++  K     ++ DA   G  ++   G ++ D    L+A  ++
Sbjct: 1284 RTQSNHRPVVQYWRPSSSVSLK-----LNTDAACRG--ASTSFGCIIRDSSCSLVAATSI 1343

BLAST of Lag0040851 vs. NCBI nr
Match: XP_030502555.1 (uncharacterized protein LOC115717715 [Cannabis sativa])

HSP 1 Score: 233.8 bits (595), Expect = 2.6e-57
Identity = 138/485 (28.45%), Postives = 215/485 (44.33%), Query Frame = 0

Query: 1    MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
            M CF++      +I ++ A+FWWGS+ D+ ++HW+ W++LC  K  G L FR  V+FNQA
Sbjct: 819  MSCFRLSTKFCKQIESMMARFWWGSSTDNKKIHWKGWKSLCTSKGDGRLGFRSFVHFNQA 878

Query: 61   MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK------------------------- 120
             LAKQAWRV  NP+  +SRVL G+Y+     LS K                         
Sbjct: 879  FLAKQAWRVFQNPHSLLSRVLQGRYYHHTDFLSAKASGLSSLTWQGILWGRELLQQGLRI 938

Query: 121  NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
             +G G ++    D W+P    FK      P   +  VA +IT +  W+   L++   P D
Sbjct: 939  KIGTGTTVSCANDSWIPGHDFFKPFRFTGP--SNNLVAHYITDAREWNWELLHRDFSPAD 998

Query: 181  VEVIKGLSIS-GTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
            VE I  + +S  +  D WIWHY+    Y+VKS Y L+     R   S  G   TWWK+ W
Sbjct: 999  VEKILTIPLSYNSMSDCWIWHYECSGEYTVKSRYTLACSLENRDQTSSSGSQETWWKRFW 1058

Query: 241  KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVWS-------------- 300
             + +PSKV++F WK  ++++P   NL++  V     C S+C  VW               
Sbjct: 1059 GLTLPSKVRIFGWKVINSALPVATNLFHRKVITSATC-SLCSRVWESIGHALFSCCHAKA 1118

Query: 301  --------------------------------------------IWNDRNNVVHNRPIPD 360
                                                        IW+DRNN +H + +  
Sbjct: 1119 VWQNTGFQLDFQKASYMKDGDYLMFLSTILTNSELERLFCTMWFIWSDRNNFIHGKQLKQ 1178

Query: 361  PRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIIS----KGEEFIMHMDACVMGKQSN 398
            P         YL+ F             +A  VN +        +  M++DA +   +S 
Sbjct: 1179 PMAISSQAEAYLANFNSVQLQATPATFRVAADVNRVKWTPPPETKLKMNVDAAIDSSRSK 1238

BLAST of Lag0040851 vs. NCBI nr
Match: XP_030497600.1 (uncharacterized protein LOC115713257 [Cannabis sativa])

HSP 1 Score: 228.4 bits (581), Expect = 1.1e-55
Identity = 135/484 (27.89%), Postives = 221/484 (45.66%), Query Frame = 0

Query: 1    MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
            M CF++      +I  + A+FWWGS+ D+ ++HW+ W++LC  K  GGL FR  V+FNQA
Sbjct: 819  MSCFRLSAKFCKQIETMMARFWWGSSTDNKKIHWKNWKSLCTSKRDGGLGFRSFVHFNQA 878

Query: 61   MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK------------------------- 120
             LAKQAWR+   PN  +SRVL G+Y+     ++ K                         
Sbjct: 879  FLAKQAWRIFQTPNSLLSRVLKGRYYHQNDFMTAKVSGLSSLTWQGIVWGRELLSKGLII 938

Query: 121  NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
             +G+G  +    D W+P    FK + R      +  VAD+IT +  WD+  L+    P D
Sbjct: 939  KIGDGTGVNCAHDSWIPGNEYFKPL-RFTGSCSN-LVADYITDTREWDLELLHNDFSPAD 998

Query: 181  VEVIKGLSIS-GTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
            ++ I  + +S  +T DRW WHYD    Y+VKSGY L+     + H S       WW+  W
Sbjct: 999  IDRILTIPLSYNSTRDRWRWHYDSSGDYTVKSGYNLACSLENKDHSSSSTSQEAWWQLFW 1058

Query: 241  KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG----------- 300
             + +PSKV++F W+  ++++P   NL++  V     C S+C      +G           
Sbjct: 1059 GLNLPSKVRIFGWRVINSALPVAQNLFHRKVITSATC-SLCSRAWESIGHALFSCCHAKS 1118

Query: 301  -----------------------------------------VWSIWNDRNNVVHNRPIPD 360
                                                     +W IW+DRNN +H + +  
Sbjct: 1119 VWQHTSFQLDFTKASFMKDGDYLLFLSTILTKSELEKLFCTMWFIWSDRNNYIHCKQLKH 1178

Query: 361  PRIRCEWINDYLSKF---RMANPNGGSVVQSMADIVNIISKGEEFI-MHMDACVMGKQSN 397
            P         YL+ F   + A     S V + A  V  +   E  + M++DA +   ++ 
Sbjct: 1179 PMAISSQAEAYLANFHSVKSATAPAVSCVAADARTVKWVPPTESNLKMNVDAALDSSRNK 1238

BLAST of Lag0040851 vs. NCBI nr
Match: XP_030483769.1 (uncharacterized protein LOC115700339 [Cannabis sativa])

HSP 1 Score: 226.5 bits (576), Expect = 4.2e-55
Identity = 139/448 (31.03%), Postives = 215/448 (47.99%), Query Frame = 0

Query: 1    MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
            M CF++ K    ++ A+ A+FWWGS  D+ ++HW++W+ LC+ K  GG+ FR  V+FNQA
Sbjct: 783  MSCFRLSKKFCNEVEAMMARFWWGSATDNKKIHWKKWKFLCKSKGDGGMGFRSFVHFNQA 842

Query: 61   MLAKQAWRVLTNPNLTVSRVLCGKY-----FLSKS-----------VLSGKNL------- 120
            +LAKQAWR+   PN  +SRVL G+Y     F++ S           ++ G+ L       
Sbjct: 843  LLAKQAWRIFQQPNSLLSRVLKGRYYPHSDFMTASANGLCSLTWQGIVWGRELLAKGLRL 902

Query: 121  --GNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
              G+G SI    D W+P    FK    + P      V+D+ITP   W+I  L     P D
Sbjct: 903  KVGSGLSIACGTDSWIPGHDNFKAFCYLGPSSNH--VSDYITPDREWNIDLLQADFSPPD 962

Query: 181  VEVIKGLSIS-GTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
            V+ I  + +S     DRWIWH+D    YSV +GY  +    +R   +     +TWWK  W
Sbjct: 963  VDRILTIPLSYNAVQDRWIWHHDVSGDYSVSTGYHFASSLEEREISTGSNTQNTWWKTFW 1022

Query: 241  KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVW--------------S 300
               +P+KVK+F W+   +SIP   +L++  V     C S+C   W               
Sbjct: 1023 SYNLPAKVKIFGWRVIQSSIPVAKSLFHKKVLTSATC-SLCQTAWETIGHALFSCCHAKE 1082

Query: 301  IWNDRN-----NVVHNRPIPDPRIRCEWINDY--LSKFRMANPNGGSVVQSMADIVNIIS 360
            +W            H     D  IR   I      S    ++PN G    S A +V   +
Sbjct: 1083 VWKLSGFCFNFQSAHRMQDGDYLIRITKIPGVHTSSVISASSPNIGFHTNSQAAVVKWHA 1142

Query: 361  KGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGL 401
              E  I M++DA +   +S  GIG+++ + +G ++A  +     N    E +A A+  GL
Sbjct: 1143 PPENKIKMNVDAAIDSSRSKIGIGVIIRNSNGQVLAAMSKPARGNFKSQEMEAKALFFGL 1202

BLAST of Lag0040851 vs. NCBI nr
Match: XP_024950112.1 (uncharacterized protein LOC112496847 [Citrus sinensis])

HSP 1 Score: 224.9 bits (572), Expect = 1.2e-54
Identity = 148/489 (30.27%), Postives = 221/489 (45.19%), Query Frame = 0

Query: 1    MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
            M  F++P+G    I    AKFWWGS GD   +HW++WE L + K  GGL FR+   FNQA
Sbjct: 840  MSVFKLPRGFCDDIQRAIAKFWWGSKGDKRGIHWRKWEKLSQAKIRGGLGFREFSCFNQA 899

Query: 61   MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK------------------------- 120
            ++AKQAWR+L  PN  VSRVL  +YF + S L  K                         
Sbjct: 900  LVAKQAWRLLQYPNSLVSRVLQARYFRNSSFLCAKAGANASYIWRSIMWGRQVIKKGMRW 959

Query: 121  NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
             +GNG+ I +F D WLPRP TF+ +  +   +  + VAD I     WD  KL +H + +D
Sbjct: 960  RIGNGKKIAIFSDNWLPRPETFRPIFPLSLPV-SSVVADLIKADNQWDEIKLRQHFLDVD 1019

Query: 181  -VEVIKGLSISGTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
              E++K    +    D  +WHYD R  YSVKSGY+L++ +      S    +  +W  +W
Sbjct: 1020 TAEILKIPLPAEKAEDEVLWHYDKRGNYSVKSGYQLALRSKFPDSTSCTEASHKYWSALW 1079

Query: 241  KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHV---PVLGNC-------------------- 300
             + +P K+K+F+W++ +N +P+  NLW   V   P    C                    
Sbjct: 1080 TLELPEKLKIFMWRASNNLLPSAENLWKRKVVEEPTCKRCKLSVETISHALLECKAARKI 1139

Query: 301  ---------------------------------LSICVGV-WSIWNDRNNVVHNRPIPDP 360
                                             L + V + WS W  RN  + +    +P
Sbjct: 1140 WLQSPFSAPRLEANSQDIFSTLQNMAKELRKSDLELMVALCWSAWYARNKCIFDGRELNP 1199

Query: 361  RIRCEWINDYLSKF-RMANPNGGSVVQSMADIVNIISKGEE--------FIMHMDACVMG 398
             I        L+ F R+  P    +       ++I  K +E        F +++DA    
Sbjct: 1200 IISAAKAESVLTAFQRVRKPQQSHI------SISIKEKQQEWLPPPQNVFKVNVDAAFNS 1259

BLAST of Lag0040851 vs. ExPASy Swiss-Prot
Match: P93295 (Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 GN=AtMg00310 PE=4 SV=1)

HSP 1 Score: 86.7 bits (213), Expect = 6.9e-16
Identity = 38/93 (40.86%), Postives = 61/93 (65.59%), Query Frame = 0

Query: 1   MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKE-IGGLNFRDLVNFNQ 60
           M CF++ K +  K+++   +FWW S  +  ++ W  W+ LC+ KE  GGL FRDL  FNQ
Sbjct: 8   MSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDLGWFNQ 67

Query: 61  AMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVL 93
           A+LAKQ++R++  P+  +SR+L  +YF   S++
Sbjct: 68  ALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMM 100

BLAST of Lag0040851 vs. ExPASy TrEMBL
Match: A0A6J1DX30 (uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024874 PE=4 SV=1)

HSP 1 Score: 243.4 bits (620), Expect = 1.6e-60
Identity = 152/467 (32.55%), Postives = 211/467 (45.18%), Query Frame = 0

Query: 27   GDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYF 86
            G+  ++HW +W  +C PKE GGLNFRDL  FNQA++AK  WR L +PNL VS+VL  KYF
Sbjct: 924  GESRKLHWMKWGRMCYPKECGGLNFRDLEGFNQALVAKHVWRFLQHPNLLVSKVLKHKYF 983

Query: 87   LSKSVLSGKN-------------------------LGNGQSIYMFQDPWLPRPTTFKVVS 146
               S+L   N                         +GNG +I  F DPWLPRPTTFK + 
Sbjct: 984  KDTSLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDPWLPRPTTFKPL- 1043

Query: 147  RMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRR 206
            R +    D TVA FIT   +WD+  ++      D ++I  + IS     D W+WHYD R 
Sbjct: 1044 RFNNGALDTTVASFITADGNWDVTSISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRG 1103

Query: 207  VYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNL 266
             YSV+SGYKL M        +      T W  +WK+ VP+K+K+F+W+S H  IP   NL
Sbjct: 1104 NYSVRSGYKLYMHLKCNATSASTNYRGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNL 1163

Query: 267  ----------------------------------WNHHVPVLGNCLS------------- 326
                                              W    P L  CLS             
Sbjct: 1164 LLRGIGELPACTICGDRRESIIHAFFHCKRARQIWRTLFPFL-TCLSAEDNISFLELWSS 1223

Query: 327  ------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYL---SKFRMAN--- 386
                          +  W IWNDRN+++H + +     +CEW+  +L   S+ +M+N   
Sbjct: 1224 LTEQLEPKDLNLAAITGWGIWNDRNSLIHGKQVSPVEFKCEWLTPFLDSHSQAQMSNYSP 1283

Query: 387  ---PNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNL 400
                N   VVQ      ++  K     ++ DA   G  ++   G ++ D    L+A  ++
Sbjct: 1284 RTQSNHRPVVQYWRPSSSVSLK-----LNTDAACRG--ASTSFGCIIRDSSCSLVAATSI 1343

BLAST of Lag0040851 vs. ExPASy TrEMBL
Match: A0A803PKJ2 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 242.7 bits (618), Expect = 2.8e-60
Identity = 142/485 (29.28%), Postives = 222/485 (45.77%), Query Frame = 0

Query: 1   MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
           M CF++      +I ++ A+FWWGS+ D+ ++HW+ W++LC  K  GGL FR  V+FNQA
Sbjct: 316 MSCFRLSTKFCKQIESMMARFWWGSSTDNKKIHWKSWKSLCTSKGDGGLGFRSFVHFNQA 375

Query: 61  MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK------------------------- 120
            LAKQAWRV  NP+  +SRVL G+Y+     LS K                         
Sbjct: 376 FLAKQAWRVFQNPHSLLSRVLKGRYYHHNDFLSAKASGLSSLTWQGIIWGRELLQQGLRI 435

Query: 121 NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
            +G G ++    D W+P    FK      P   +  VA +IT +  W+   L++   P+D
Sbjct: 436 KIGTGTAVSCANDSWIPGHDFFKPFQFTGP--SNNWVAQYITDAREWNWELLHRDFSPVD 495

Query: 181 VEVIKGLSIS-GTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
           VE I  + +S  +  D WIWHYD    Y+VKSGY L+     R   S  G   TWWK+ W
Sbjct: 496 VEKILTIPLSYSSMTDCWIWHYDCSGEYTVKSGYTLACSLENRDQTSSSGSQETWWKRFW 555

Query: 241 KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG----------- 300
            + +PSKV++F WK  ++++P   NL++  V     C S+C      +G           
Sbjct: 556 GLTLPSKVRIFGWKVINSALPVATNLFHQKVITSATC-SLCSRAWESIGHALFSCCHAKA 615

Query: 301 -----------------------------------------VWSIWNDRNNVVHNRPIPD 360
                                                    +W IW+DRNN +H + +  
Sbjct: 616 VWQNTSFHLDFQKASYMKDGDYLMFLSTILTKSELERLFCTMWFIWSDRNNFIHGKQLKQ 675

Query: 361 PRIRCEWINDYLSKFRMANPNGGSVVQSMADIVN----IISKGEEFIMHMDACVMGKQSN 398
           P         YL+ F+    +   V   +A  VN    I     +  M++DA +   +S 
Sbjct: 676 PMAISSQAEVYLANFKSVQLHTTPVAFCVAADVNQMKWIPPPETKLKMNVDAAIDSSRSK 735

BLAST of Lag0040851 vs. ExPASy TrEMBL
Match: A0A803Q185 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 236.1 bits (601), Expect = 2.6e-58
Identity = 140/483 (28.99%), Postives = 216/483 (44.72%), Query Frame = 0

Query: 1   MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
           M CF++ KG ++ +  + A+FWWGS+    ++HW +WE+LC+PK+ GG+ FRDL  FNQA
Sbjct: 30  MSCFRLTKGTISNLHRMAARFWWGSSEKDKKIHWCKWEHLCKPKDKGGMGFRDLGMFNQA 89

Query: 61  MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK------------------------- 120
           +LAKQ WR +  PN   +RVL   YF + SVL  K                         
Sbjct: 90  LLAKQIWRCIRYPNALCNRVLKASYFPTNSVLEAKCGTHASFVWRSLMWGKKLILKGYRW 149

Query: 121 NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
            +G+G +I + +DPWLPRP TFK+  +  P  +   V D       WD P ++      D
Sbjct: 150 RVGDGSNIRVLEDPWLPRPVTFKIYDK-PPLPEHLWVVDLKLGDGTWDKPFISAVFNKDD 209

Query: 181 VEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
            E+I  L  SG    D+ +WHY     YSVKSGY+++         SD      WW+K+W
Sbjct: 210 AEMILALPNSGWDLDDKILWHYCKNGEYSVKSGYRMACELKAERQQSDDHLAVQWWRKLW 269

Query: 241 KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNC--------------------LSI 300
           ++ +P K+KVFVWK  H  +P    L   HV     C                      +
Sbjct: 270 RLKIPPKIKVFVWKLAHGWLPTSTVLAKRHVSTTDGCSRCSLHNSETIFHALWECKKSKV 329

Query: 301 C--------------------------------------VGVWSIWNDRNNVVHNRPIPD 360
           C                                      V  WS+WN RN   H+  +P 
Sbjct: 330 CWKLCDFQLEIKRHGQEDELAFLMRLSTVMSKDQFEMFLVITWSLWNTRNACTHDGFVPQ 389

Query: 361 PRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSNAGIG 400
           P    EW    L  F+        V++    +  +   GE  I ++DA V      +G+G
Sbjct: 390 PAEMVEWCYKLLEDFQGGRSRPQEVMRREEGVWKVPRHGEVKI-NVDASVKTGAGYSGLG 449

BLAST of Lag0040851 vs. ExPASy TrEMBL
Match: A0A803QQT2 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 235.7 bits (600), Expect = 3.4e-58
Identity = 148/484 (30.58%), Postives = 220/484 (45.45%), Query Frame = 0

Query: 1   MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
           M CF++PK  ++ +  + ++FWWGS     ++HW +W  LCRPK+ GGL FRDL  FNQA
Sbjct: 152 MSCFKLPKKTISSLHRMASRFWWGSLDKEKKIHWCKWRYLCRPKDKGGLGFRDLGMFNQA 211

Query: 61  MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLS----------------GKNL------- 120
           +LAKQ WR L +P L  SRVL   YF  K VL                 GK L       
Sbjct: 212 LLAKQIWRCLRHPQLLCSRVLKASYFPRKGVLEAGCGANASFVWRSLVWGKKLILKGYRW 271

Query: 121 --GNGQSIYMFQDPWLPRPTTFKVVSRMDPRM-KDATVADFITPSLHWDIPKLNKHLVPL 180
             GNG+S+ + +DPWLPRP TFKV  +  P +  +  V D       WD   +     P 
Sbjct: 272 RVGNGESVRVLEDPWLPRPVTFKVYDQ--PSLPANLYVTDLKLADGQWDEGFIRSIFNPT 331

Query: 181 DVEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKV 240
           DV++I G+  S     D+ +WHY     YSVKSGY+++   +   H S+      WWKK+
Sbjct: 332 DVDLILGIPCSDWDFEDKILWHYSKYGEYSVKSGYRMAASFTTEQHQSNEHSIVQWWKKL 391

Query: 241 WKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVP---VLGNCLS----------------- 300
           W++ +P KVK FVWK  HN +PA VNL    +    V   C S                 
Sbjct: 392 WRLKIPPKVKHFVWKVAHNWLPANVNLAKRGIASSVVCSRCSSHVDESVAHALWECKASK 451

Query: 301 --------------------------------------ICVGVWSIWNDRNNVVHNRPIP 360
                                                   +  W+IWN RN VVH    P
Sbjct: 452 GYWRVSGLYDDLKQMLGEDNLTMLMRIAAEWDKEKLEFFLLVSWNIWNVRNTVVHGGYHP 511

Query: 361 DPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSNAGI 400
            P    EW  ++L+ FR  +       +S  D   +    ++  +++DA V      +G+
Sbjct: 512 KPEEMIEWCGNFLADFR-GDTGRERSQRSSEDSRWVPPARDQVTINVDAGVKQGGLISGL 571

BLAST of Lag0040851 vs. ExPASy TrEMBL
Match: A0A803PIB6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 228.4 bits (581), Expect = 5.4e-56
Identity = 135/484 (27.89%), Postives = 221/484 (45.66%), Query Frame = 0

Query: 1    MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
            M CF++      +I  + A+FWWGS+ D+ ++HW+ W++LC  K  GGL FR  V+FNQA
Sbjct: 1279 MSCFRLSAKFCKQIETMMARFWWGSSTDNKKIHWKNWKSLCTSKRDGGLGFRSFVHFNQA 1338

Query: 61   MLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK------------------------- 120
             LAKQAWR+   PN  +SRVL G+Y+     ++ K                         
Sbjct: 1339 FLAKQAWRIFQTPNSLLSRVLKGRYYHQNDFMTAKVSGLSSLTWQGIVWGRELLSKGLII 1398

Query: 121  NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD 180
             +G+G  +    D W+P    FK + R      +  VAD+IT +  WD+  L+    P D
Sbjct: 1399 KIGDGTGVNCAHDSWIPGNEYFKPL-RFTGSCSN-LVADYITDTREWDLELLHNDFSPAD 1458

Query: 181  VEVIKGLSIS-GTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVW 240
            ++ I  + +S  +T DRW WHYD    Y+VKSGY L+     + H S       WW+  W
Sbjct: 1459 IDRILTIPLSYNSTRDRWRWHYDSSGDYTVKSGYNLACSLENKDHSSSSTSQEAWWQLFW 1518

Query: 241  KMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG----------- 300
             + +PSKV++F W+  ++++P   NL++  V     C S+C      +G           
Sbjct: 1519 GLNLPSKVRIFGWRVINSALPVAQNLFHRKVITSATC-SLCSRAWESIGHALFSCCHAKS 1578

Query: 301  -----------------------------------------VWSIWNDRNNVVHNRPIPD 360
                                                     +W IW+DRNN +H + +  
Sbjct: 1579 VWQHTSFQLDFTKASFMKDGDYLLFLSTILTKSELEKLFCTMWFIWSDRNNYIHCKQLKH 1638

Query: 361  PRIRCEWINDYLSKF---RMANPNGGSVVQSMADIVNIISKGEEFI-MHMDACVMGKQSN 397
            P         YL+ F   + A     S V + A  V  +   E  + M++DA +   ++ 
Sbjct: 1639 PMAISSQAEAYLANFHSVKSATAPAVSCVAADARTVKWVPPTESNLKMNVDAALDSSRNK 1698

BLAST of Lag0040851 vs. TAIR 10
Match: AT4G29090.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 119.4 bits (298), Expect = 6.8e-27
Identity = 75/289 (25.95%), Postives = 134/289 (46.37%), Query Frame = 0

Query: 1   MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQA 60
           M CF +PK +  +I ++ A FWW +  +   MHW+ W++L   K  GG+ F+D+  FN A
Sbjct: 8   MACFLLPKTVCKQIISVLADFWWRNKQEAKGMHWKAWDHLSCYKAEGGIGFKDIEAFNLA 67

Query: 61  MLAKQAWRVLTNPNLTVSRVLCGKY----------------FLSKSVLSGKNL------- 120
           +L KQ WR+L+ P   +++V   +Y                F+ KS+ + + +       
Sbjct: 68  LLGKQMWRMLSRPESLMAKVFKSRYFHKSDPLNAPLGSRPSFVWKSIHASQEILRQGARA 127

Query: 121 --GNGQSIYMFQDPWL-PRPTTFKVVSRMDPRMKDATVADFITPS-------LHWDIPKL 180
             GNG+ I +++  WL  +P +  +  +  P  + A+V+  +  S         W    +
Sbjct: 128 VVGNGEDIIIWRHKWLDSKPASAALRMQRVPPQEYASVSSILKVSDLIDESGREWRKDVI 187

Query: 181 NKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGY-KLSMLTSQRGHLSDMGR 240
                 ++ ++I  L   G    D + W Y     Y+VKSGY  L+ + ++R    ++  
Sbjct: 188 EMLFPEVERKLIGELRPGGRRILDSYTWDYTSSGDYTVKSGYWVLTQIINKRSSPQEVSE 247

Query: 241 NS--TWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCL 253
            S    ++K+WK     K++ F+WK   NS+P    L   H+     C+
Sbjct: 248 PSLNPIYQKIWKSQTSPKIQHFLWKCLSNSLPVAGALAYRHLSKESACI 296

BLAST of Lag0040851 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 86.7 bits (213), Expect = 4.9e-17
Identity = 38/93 (40.86%), Postives = 61/93 (65.59%), Query Frame = 0

Query: 1   MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKE-IGGLNFRDLVNFNQ 60
           M CF++ K +  K+++   +FWW S  +  ++ W  W+ LC+ KE  GGL FRDL  FNQ
Sbjct: 8   MSCFRLSKLLCKKLTSAMTEFWWSSCENKRKISWVAWQKLCKSKEDDGGLGFRDLGWFNQ 67

Query: 61  AMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVL 93
           A+LAKQ++R++  P+  +SR+L  +YF   S++
Sbjct: 68  ALLAKQSFRIIHQPHTLLSRLLRSRYFPHSSMM 100

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022158377.13.3e-6032.55uncharacterized protein LOC111024874 [Momordica charantia][more]
XP_030502555.12.6e-5728.45uncharacterized protein LOC115717715 [Cannabis sativa][more]
XP_030497600.11.1e-5527.89uncharacterized protein LOC115713257 [Cannabis sativa][more]
XP_030483769.14.2e-5531.03uncharacterized protein LOC115700339 [Cannabis sativa][more]
XP_024950112.11.2e-5430.27uncharacterized protein LOC112496847 [Citrus sinensis][more]
Match NameE-valueIdentityDescription
P932956.9e-1640.86Uncharacterized mitochondrial protein AtMg00310 OS=Arabidopsis thaliana OX=3702 ... [more]
Match NameE-valueIdentityDescription
A0A6J1DX301.6e-6032.55uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024... [more]
A0A803PKJ22.8e-6029.28Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803Q1852.6e-5828.99Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803QQT23.4e-5830.58Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
A0A803PIB65.4e-5627.89Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT4G29090.16.8e-2725.95Ribonuclease H-like superfamily protein [more]
ATMG00310.14.9e-1740.86RNA-directed DNA polymerase (reverse transcriptase)-related family protein [more]
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR002156Ribonuclease H domainPFAMPF13456RVT_3coord: 322..403
e-value: 8.5E-11
score: 41.7
IPR026960Reverse transcriptase zinc-binding domainPFAMPF13966zf-RVTcoord: 182..255
e-value: 7.7E-9
score: 36.1
IPR036397Ribonuclease H superfamilyGENE3D3.30.420.10coord: 317..406
e-value: 8.0E-10
score: 40.9
NoneNo IPR availablePANTHERPTHR33116REVERSE TRANSCRIPTASE ZINC-BINDING DOMAIN-CONTAINING PROTEIN-RELATED-RELATEDcoord: 1..197
NoneNo IPR availablePANTHERPTHR33116:SF41SUBFAMILY NOT NAMEDcoord: 1..197
IPR012337Ribonuclease H-like superfamilySUPERFAMILY53098Ribonuclease H-likecoord: 318..402

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Lag0040851.1Lag0040851.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0003676 nucleic acid binding
molecular_function GO:0004523 RNA-DNA hybrid ribonuclease activity