Lag0006229 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0006229
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Reverse transcriptase domain-containing protein
Location: chr6: 39658649 .. 39660055 (+)
RNA-Seq Expression: Lag0006229
Synteny: Lag0006229
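
The span of the gene on chr6 can be derived from the location coordinates. The sketch below assumes the coordinates are 1-based and inclusive, as is usual for GFF-style annotation; the page itself does not state the convention, and the function name `feature_length` is purely illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates.

    The 1-based inclusive convention is an assumption, not stated on the page.
    """
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates copied from the Location field: chr6: 39658649 .. 39660055 (+)
gene_length = feature_length(39658649, 39660055)
print(gene_length)  # 1407
```

Under a 0-based, half-open convention the same numbers would instead give a length of 1406, so it is worth confirming the convention before comparing against the gene sequence length.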
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTGTTTTGGATGTTTCGCCTTCTTCATCATCCTCTTATTTTTGGAAGGGCTTTATCTGGGGGATGGACCTTTTGAAAAGAGGTATCAGAAGAAATCTAGGTAACGACATCTCAATAAAAATGTTTAGTGATCCTTGGATTCCTCGTCCATCAACCTTTAAGATTCTCTCACATCCTAGATCAGAGAATGCAGATATGGTGGTGGCAGACTTTATTACAGAGACAAACCAGTGGGACATTGCCAAATTACATCAAGTTTTGGGAAAAGAAGATGTGGATGAGATTGTCCGACTTCCCATCAGCACATCCGCGTCGGACAAATGGGTGTGGCATTATGATAAAATGGGGAAGTATACTGTCAAAAGTGGTTATAAACTATGTATTAAGCATAGTCAAGAGGCCTCTGCCTCTTCAGCAGAGGTAGAATCGAGATGGTGAGCGAATGTTTGGAAACTTAAAGTTCCAAATAAGGTGAAACACTTTGTTTGGAAATCTTTTCATGAAGTCATTCCCACTATAGCTAACCTTAAGAGGCATCATGTTCCTGTTAGTGGAAGTTGCCCAGTGTGTAGGGAGGCGATGGAGACTACTGATCCTGCTCTGTTCTTATGTTCTCGTGCTCGTGAGGTGTGGGAAGGTATTCTCCCATGGATGAATGAGGAGTTTTGGATACCCATGGATATCCAAGACAGATGGTTGAGCCTTGGTGACTGTCAAAGCCAACGGTTAGATCTTATCAGTATTGGGGCTTGGGCAATTTGGAATGATAGGAACAATATTCATCACCAAAGGTTAGTTCCTAATGTCCAAACTCGAAGTGAATGGATTCTTGAGTATTTGGAGGAATTCCAAAATGCAAATCCAGTTCGTGGAATTGTTAATCAAGGAGTGGATGATGTTAGAAGAATACTACAAGGCGTTGAAGAGATAATCATGCATTGTGATGCGGCTTACGATGAGATTAATGGTAGTGTGGGTATTGGACTGGTGTTTCAGGACAAACAAGGGAATCTTAAGGTCGTGAAGGCTTTGTCTACAATTAGTGGTATATCACCGTTGGGAGCGGAAGCGGAAGTAGTCTTACAAGGGCTTTGTTTCGCTCGATCTTTAAAAATGCAGTGTTTGTCAGTTCTGTCTGATTCTTTAACATTCATAAAGACAGTCAGGAAAAAAGTGCAATGTGAAACTTGTTTGGCCACTACGATATGGGATATTAAGGAGATTCATCAGTCTTTCAGGACCATCAGATTCGAACATGCTCTTCGCCATTATAATCGATTTGCTCATAAGTTAGCCCATGTGGGCCTTCATTATCAATCACAGTCGTGGTTAGGAAACTATCCTAATTGGATACATAGTATGTCGAAAGAGCGATACATGTTTGTACCCCTAGGGGAATTTTGA

mRNA sequence

ATGTCTGTTTTGGATGTTTCGCCTTCTTCATCATCCTCTTATTTTTGGAAGGGCTTTATCTGGGGGATGGACCTTTTGAAAAGAGGTATCAGAAGAAATCTAGGTAACGACATCTCAATAAAAATGTTTAGTGATCCTTGGATTCCTCGTCCATCAACCTTTAAGATTCTCTCACATCCTAGATCAGAGAATGCAGATATGGTGGTGGCAGACTTTATTACAGAGACAAACCAGTGGGACATTGCCAAATTACATCAAGTTTTGGGAAAAGAAGATGTGGATGAGATTGTCCGACTTCCCATCAGCACATCCGCGTCGGACAAATGGGTGTGGCATTATGATAAAATGGGGAAGTATACTGTCAAAAGTGGTTATAAACTATGTATTAAGCATAGTCAAGAGGCCTCTGCCTCTTCAGCAGAGGTAGAATCGAGATGTGGAAGTTGCCCAGTGTGTAGGGAGGCGATGGAGACTACTGATCCTGCTCTGTTCTTATGTTCTCGTGCTCGTGAGGTGTGGGAAGGTATTCTCCCATGGATGAATGAGGAGTTTTGGATACCCATGGATATCCAAGACAGATGGTTGAGCCTTGGTGACTGTCAAAGCCAACGGTTAGATCTTATCAGTATTGGGGCTTGGGCAATTTGGAATGATAGGAACAATATTCATCACCAAAGGTTAGTTCCTAATGTCCAAACTCGAAGTGAATGGATTCTTGAGTATTTGGAGGAATTCCAAAATGCAAATCCAGTTCGTGGAATTGTTAATCAAGGAGTGGATGATGTTAGAAGAATACTACAAGGCGTTGAAGAGATAATCATGCATTGTGATGCGGCTTACGATGAGATTAATGGTAGTGTGGGTATTGGACTGGTGTTTCAGGACAAACAAGGGAATCTTAAGGTCGTGAAGGCTTTGTCTACAATTAGTGGTATATCACCGTTGGGAGCGGAAGCGGAAGTAGTCTTACAAGGGCTTTGTTTCGCTCGATCTTTAAAAATGCAGTGTTTGTCAGTTCTGTCTGATTCTTTAACATTCATAAAGACAGTCAGGAAAAAAGTGCAATGTGAAACTTGTTTGGCCACTACGATATGGGATATTAAGGAGATTCATCAGTCTTTCAGGACCATCAGATTCGAACATGCTCTTCGCCATTATAATCGATTTGCTCATAAGTTAGCCCATGTGGGCCTTCATTATCAATCACAGTCGTGGTTAGGAAACTATCCTAATTGGATACATAGTATGTCGAAAGAGCGATACATGTTTGTACCCCTAGGGGAATTTTGA

Coding sequence (CDS)

ATGTCTGTTTTGGATGTTTCGCCTTCTTCATCATCCTCTTATTTTTGGAAGGGCTTTATCTGGGGGATGGACCTTTTGAAAAGAGGTATCAGAAGAAATCTAGGTAACGACATCTCAATAAAAATGTTTAGTGATCCTTGGATTCCTCGTCCATCAACCTTTAAGATTCTCTCACATCCTAGATCAGAGAATGCAGATATGGTGGTGGCAGACTTTATTACAGAGACAAACCAGTGGGACATTGCCAAATTACATCAAGTTTTGGGAAAAGAAGATGTGGATGAGATTGTCCGACTTCCCATCAGCACATCCGCGTCGGACAAATGGGTGTGGCATTATGATAAAATGGGGAAGTATACTGTCAAAAGTGGTTATAAACTATGTATTAAGCATAGTCAAGAGGCCTCTGCCTCTTCAGCAGAGGTAGAATCGAGATGTGGAAGTTGCCCAGTGTGTAGGGAGGCGATGGAGACTACTGATCCTGCTCTGTTCTTATGTTCTCGTGCTCGTGAGGTGTGGGAAGGTATTCTCCCATGGATGAATGAGGAGTTTTGGATACCCATGGATATCCAAGACAGATGGTTGAGCCTTGGTGACTGTCAAAGCCAACGGTTAGATCTTATCAGTATTGGGGCTTGGGCAATTTGGAATGATAGGAACAATATTCATCACCAAAGGTTAGTTCCTAATGTCCAAACTCGAAGTGAATGGATTCTTGAGTATTTGGAGGAATTCCAAAATGCAAATCCAGTTCGTGGAATTGTTAATCAAGGAGTGGATGATGTTAGAAGAATACTACAAGGCGTTGAAGAGATAATCATGCATTGTGATGCGGCTTACGATGAGATTAATGGTAGTGTGGGTATTGGACTGGTGTTTCAGGACAAACAAGGGAATCTTAAGGTCGTGAAGGCTTTGTCTACAATTAGTGGTATATCACCGTTGGGAGCGGAAGCGGAAGTAGTCTTACAAGGGCTTTGTTTCGCTCGATCTTTAAAAATGCAGTGTTTGTCAGTTCTGTCTGATTCTTTAACATTCATAAAGACAGTCAGGAAAAAAGTGCAATGTGAAACTTGTTTGGCCACTACGATATGGGATATTAAGGAGATTCATCAGTCTTTCAGGACCATCAGATTCGAACATGCTCTTCGCCATTATAATCGATTTGCTCATAAGTTAGCCCATGTGGGCCTTCATTATCAATCACAGTCGTGGTTAGGAAACTATCCTAATTGGATACATAGTATGTCGAAAGAGCGATACATGTTTGTACCCCTAGGGGAATTTTGA

Protein sequence

MSVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSENADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPISTSASDKWVWHYDKMGKYTVKSGYKLCIKHSQEASASSAEVESRCGSCPVCREAMETTDPALFLCSRAREVWEGILPWMNEEFWIPMDIQDRWLSLGDCQSQRLDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLEEFQNANPVRGIVNQGVDDVRRILQGVEEIIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISGISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIHQSFRTIRFEHALRHYNRFAHKLAHVGLHYQSQSWLGNYPNWIHSMSKERYMFVPLGEF
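
The protein sequence is the conceptual translation of the CDS. As a rough consistency check, the sketch below translates the first and last codons of the CDS with the standard genetic code and compares them with the ends of the protein sequence. It assumes the standard nuclear code applies, and the names `CODON_TABLE` and `translate` are illustrative only; the two short fragments are copied from the start and end of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCTGTTTTGGATGTTTCGCCT"  # first 24 nt of the CDS
cds_end = "GTACCCCTAGGGGAATTTTGA"       # last 21 nt of the CDS, including the stop codon

print(translate(cds_start))  # MSVLDVSP
print(translate(cds_end))    # VPLGEF
```

Running `translate` on the full CDS string in place of the fragments gives a complete check against the protein sequence.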
Homology
BLAST of Lag0006229 vs. NCBI nr
Match: XP_022158377.1 (uncharacterized protein LOC111024874 [Momordica charantia])

HSP 1 Score: 228.8 bits (582), Expect = 8.9e-56
Identity = 147/464 (31.68%), Positives = 224/464 (48.28%), Query Frame = 0

Query: 2    SVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPR 61
            S+L  S +S SSYFWKGF+WG DLL +G+R  +GN  +IK FSDPW+PRP+TFK L    
Sbjct: 987  SLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDPWLPRPTTFKPLRF-N 1046

Query: 62   SENADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPIST-SASDKWVWHYDKMGKYT 121
            +   D  VA FIT    WD+  +      ED D I+ +PIS+ +  D W+WHYDK G Y+
Sbjct: 1047 NGALDTTVASFITADGNWDVTSISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRGNYS 1106

Query: 122  VKSGYKLCIKHSQEASASSAEVE------------------------------------- 181
            V+SGYKL +     A+++S                                         
Sbjct: 1107 VRSGYKLYMHLKCNATSASTNYRGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNLLLR 1166

Query: 182  --SRCGSCPVCREAMETTDPALFLCSRAREVWEGILPWMN----EEFWIPMDIQDRWLSL 241
                  +C +C +  E+   A F C RAR++W  + P++     E+    +   + W SL
Sbjct: 1167 GIGELPACTICGDRRESIIHAFFHCKRARQIWRTLFPFLTCLSAED---NISFLELWSSL 1226

Query: 242  GD-CQSQRLDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLE-----EFQNANPV 301
             +  + + L+L +I  W IWNDRN++ H + V  V+ + EW+  +L+     +  N +P 
Sbjct: 1227 TEQLEPKDLNLAAITGWGIWNDRNSLIHGKQVSPVEFKCEWLTPFLDSHSQAQMSNYSPR 1286

Query: 302  RGIVNQGVDDVRRILQGVEEIIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISG 361
                ++ V    R    V  + ++ DAA      S   G + +D   +L    ++     
Sbjct: 1287 TQSNHRPVVQYWRPSSSV-SLKLNTDAACR--GASTSFGCIIRDSSCSLVAATSIRVPFP 1346

Query: 362  ISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIH 414
            +SPL AE   +L+GL FA +     L V SDSL  I+ +R ++         + +I+ + 
Sbjct: 1347 LSPLLAEIRCILEGLKFAAASNFTHLEVESDSLLAIQLIRNEIHTRGDEQNWVMEIQALT 1406

BLAST of Lag0006229 vs. NCBI nr
Match: XP_030497600.1 (uncharacterized protein LOC115713257 [Cannabis sativa])

HSP 1 Score: 169.9 bits (429), Expect = 4.9e-38
Identity = 125/457 (27.35%), Positives = 191/457 (41.79%), Query Frame = 0

Query: 9    SSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSENADMV 68
            S  SS  W+G +WG +LL +G+   +G+   +    D WIP    FK L    S     +
Sbjct: 915  SGLSSLTWQGIVWGRELLSKGLIIKIGDGTGVNCAHDSWIPGNEYFKPLRFTGS--CSNL 974

Query: 69   VADFITETNQWDIAKLHQVLGKEDVDEIVRLPIS-TSASDKWVWHYDKMGKYTVKSGYKL 128
            VAD+IT+T +WD+  LH      D+D I+ +P+S  S  D+W WHYD  G YTVKSGY L
Sbjct: 975  VADYITDTREWDLELLHNDFSPADIDRILTIPLSYNSTRDRWRWHYDSSGDYTVKSGYNL 1034

Query: 129  -CIKHSQEASASSAEVES--------------------------------------RCGS 188
             C   +++ S+SS   E+                                         +
Sbjct: 1035 ACSLENKDHSSSSTSQEAWWQLFWGLNLPSKVRIFGWRVINSALPVAQNLFHRKVITSAT 1094

Query: 189  CPVCREAMETTDPALFLCSRAREVWEGI---LPWMNEEFWIPMDIQDRWLSLGDCQSQ-R 248
            C +C  A E+   ALF C  A+ VW+     L +    F   M   D  L L    ++  
Sbjct: 1095 CSLCSRAWESIGHALFSCCHAKSVWQHTSFQLDFTKASF---MKDGDYLLFLSTILTKSE 1154

Query: 249  LDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLEEFQNAN----PVRGIVNQGVD 308
            L+ +    W IW+DRNN  H + + +    S     YL  F +      P    V     
Sbjct: 1155 LEKLFCTMWFIWSDRNNYIHCKQLKHPMAISSQAEAYLANFHSVKSATAPAVSCVAADAR 1214

Query: 309  DVRRILQGVEEIIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISGISPLGAEAE 368
             V+ +      + M+ DAA D     +GIG++ +D  G +    +   +        EA+
Sbjct: 1215 TVKWVPPTESNLKMNVDAALDSSRNKIGIGVIIRDSTGRVIAAMSKPVVGNFKSQEMEAK 1274

Query: 369  VVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIHQSFRTIRFE 417
             +  GL +A+ L++Q   V +D L  +  ++ K    +     + DI     SF      
Sbjct: 1275 AMFWGLQWAKQLQLQPHCVETDCLMLVHALQGKQSQLSSFHDLVEDISYHLSSFSNACIS 1334

BLAST of Lag0006229 vs. NCBI nr
Match: XP_024950112.1 (uncharacterized protein LOC112496847 [Citrus sinensis])

HSP 1 Score: 168.3 bits (425), Expect = 1.4e-37
Identity = 132/470 (28.09%), Positives = 210/470 (44.68%), Query Frame = 0

Query: 2    SVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPR 61
            S L     +++SY W+  +WG  ++K+G+R  +GN   I +FSD W+PRP TF+ +  P 
Sbjct: 929  SFLCAKAGANASYIWRSIMWGRQVIKKGMRWRIGNGKKIAIFSDNWLPRPETFRPI-FPL 988

Query: 62   SENADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPI-STSASDKWVWHYDKMGKYT 121
            S     VVAD I   NQWD  KL Q     D  EI+++P+ +  A D+ +WHYDK G Y+
Sbjct: 989  SLPVSSVVADLIKADNQWDEIKLRQHFLDVDTAEILKIPLPAEKAEDEVLWHYDKRGNYS 1048

Query: 122  VKSGYKLCIKHSQEASASSAEVESRCGS-------------------------------- 181
            VKSGY+L ++     S S  E   +  S                                
Sbjct: 1049 VKSGYQLALRSKFPDSTSCTEASHKYWSALWTLELPEKLKIFMWRASNNLLPSAENLWKR 1108

Query: 182  -------CPVCREAMETTDPALFLCSRAREVWEGILPWMNEEFWIP---MDIQDRWLSL- 241
                   C  C+ ++ET   AL  C  AR++      W+   F  P    + QD + +L 
Sbjct: 1109 KVVEEPTCKRCKLSVETISHALLECKAARKI------WLQSPFSAPRLEANSQDIFSTLQ 1168

Query: 242  ---GDCQSQRLDLISIGAWAIWNDRNN--IHHQRLVPNVQ-TRSEWILEYLEEFQNANPV 301
                + +   L+L+    W+ W  RN      + L P +   ++E +L   +  +   P 
Sbjct: 1169 NMAKELRKSDLELMVALCWSAWYARNKCIFDGRELNPIISAAKAESVLTAFQRVR--KPQ 1228

Query: 302  RGIVNQGVDDVRR-ILQGVEEII-MHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTI 361
            +  ++  + + ++  L   + +  ++ DAA++  N S G+G V +D  G +        +
Sbjct: 1229 QSHISISIKEKQQEWLPPPQNVFKVNVDAAFNSKNLSAGVGAVIRDSNGKIVAAGVNQNL 1288

Query: 362  SGISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKE 419
               S   AEAE VL GL  AR+  +  L + SD L  ++ V       + +  TI  I+ 
Sbjct: 1289 LKGSASLAEAEAVLWGLQLARNADVSSLIIESDCLEVVQLVNNTKGSRSEIFWTILAIQN 1348

BLAST of Lag0006229 vs. NCBI nr
Match: XP_017250619.1 (PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus])

HSP 1 Score: 163.7 bits (413), Expect = 3.5e-36
Identity = 127/464 (27.37%), Positives = 203/464 (43.75%), Query Frame = 0

Query: 2    SVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPR 61
            S LD       S  W+  +WG  LL  G+RR +GN  S + F DPW+ RP +F  L   R
Sbjct: 1109 SFLDSKEGRCPSLTWRSIVWGKTLLIEGLRRRIGNGQSTRAFKDPWLARPPSF--LPITR 1168

Query: 62   SENADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPIST-SASDKWVWHYDKMGKYT 121
                ++ V ++IT    W+   + Q     D+  I+ +P+S    +D W WHY+  G YT
Sbjct: 1169 GSEEEVKVVEYIT-GGTWNRELIRQTFLSPDIQLILEIPLSRFDHADSWFWHYNSQGNYT 1228

Query: 122  VKSGYKLCIKHSQEASASSAEVESRC---------------------------------- 181
            VKSGYKL    +++ S+SS +V  +                                   
Sbjct: 1229 VKSGYKLVTNLNKDVSSSSEQVMGKWWKYFWANKIPRKILIFAWRGYHEILPTTKGLHIR 1288

Query: 182  -----GSCPVCREAMETTDPALFLCSRAREVWE----GILPWMNEEFWIPMDIQDRWLSL 241
                  +CP+C  A ++   A+F C  A+EVWE      L    EE    +  +D  L +
Sbjct: 1289 KISLHSNCPLCGYADDSNAHAVFWCPFAQEVWELMTYPFLVGRKEE----ISFKDVLLYI 1348

Query: 242  GD-CQSQRLDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLEEFQNANPVRGIVN 301
             +  +   +D++ +  W IW +RN + HQ+          W+  Y EE +NA        
Sbjct: 1349 TELLEKDDVDMMLLTTWGIWTERNKLIHQQKRRTPSQIKVWLSAYYEEIKNAY------- 1408

Query: 302  QGVDDVRRILQG-------VEEI----IMHCDAAYDEINGSVGIG--LVFQDKQGNLKVV 361
              V + R I +G       VEE+     +  DAA  +    +G+G  ++  + +    + 
Sbjct: 1409 --VSENRAISRGDPINAHQVEEVSFGSTLFVDAALSKNTERIGLGAAIIASNNKTQATLS 1468

Query: 362  KALSTISGISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATT 407
            K L  I  +S L AEA  ++ GL +A++       VL+DS + ++ +  + +    L   
Sbjct: 1469 KPLEGI--LSVLHAEALALVVGLQWAQTTGYTLKKVLTDSQSLVQALNSEAEYHNELGLL 1528

BLAST of Lag0006229 vs. NCBI nr
Match: XP_006491472.1 (uncharacterized protein LOC102626455 [Citrus sinensis])

HSP 1 Score: 161.0 bits (406), Expect = 2.3e-35
Identity = 110/452 (24.34%), Positives = 202/452 (44.69%), Query Frame = 0

Query: 10   SSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSENADMVV 69
            S+ S+ W+  +WG  ++K+G+R  +G+   + ++ D WIPRP+TF+ +S P++   + VV
Sbjct: 996  SNPSFIWRSILWGSQVIKKGVRWRIGDGKKVLVYKDKWIPRPATFQPIS-PKTLPHETVV 1055

Query: 70   ADFITETNQWDIAKLHQVLGKEDVDEIVRLPI-STSASDKWVWHYDKMGKYTVKSGYKLC 129
            AD I   N+W + +L Q   KED++ I+++ + S    D+ +WH+DK G+Y+VKSGY+L 
Sbjct: 1056 ADLIDSENKWRVDRLEQHFMKEDIEAILKILLPSGKEEDEVLWHFDKKGEYSVKSGYQLA 1115

Query: 130  IKHSQEASASSAEVESRCGS---------------------------------------C 189
            +  +      S+   SR                                          C
Sbjct: 1116 LNQNFPNEPESSNSSSRLWKIPWMLDLPEKVKIFMWRALKNILPTAENLWKRRSLQEPIC 1175

Query: 190  PVCREAMETTDPALFLCSRAREVWEGILPWM-------NEEFWIPMDIQDRWLSLGDCQS 249
              C+  +ET    L  C  AR++W+ + P +       N++F+    IQ+ W      ++
Sbjct: 1176 QRCKLQVETVSHVLIECKAARKIWD-LAPLIVQPSKDHNQDFF--SAIQEMWSRSSTAEA 1235

Query: 250  QRLDLISIGAWAIWNDRNNIHHQRLVPN---VQTRSEWILEYLEEFQNANPVRGIVNQGV 309
               +L+ +  W IW+ RN    +    +   +  +++ +L+  +       V G  ++G+
Sbjct: 1236 ---ELMIVYCWVIWSARNKFIFEGKKSDSRFLAAKADSVLKAYQRVSKPGNVHGAKDRGI 1295

Query: 310  DDVRRILQGVEEIIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISGISPLGAEA 369
            D  +        + ++ DAA    +  VG+G + +D +G +  V             AEA
Sbjct: 1296 DQQKWKPPSQNVLKLNVDAAVSTKDQKVGLGAIVRDAEGKILAVGIKQAQFRERVSLAEA 1355

Query: 370  EVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIHQSFRTIRF 411
            E +  GL  A  +    L V SD    ++ +       T +   + D++   + F+ ++F
Sbjct: 1356 EAIHWGLQVANQISSSSLIVESDCKEVVELLNNTKGSRTEIHWILSDVRRESKEFKQVQF 1415

BLAST of Lag0006229 vs. ExPASy TrEMBL
Match: A0A6J1DX30 (uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024874 PE=4 SV=1)

HSP 1 Score: 228.8 bits (582), Expect = 4.3e-56
Identity = 147/464 (31.68%), Positives = 224/464 (48.28%), Query Frame = 0

Query: 2    SVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPR 61
            S+L  S +S SSYFWKGF+WG DLL +G+R  +GN  +IK FSDPW+PRP+TFK L    
Sbjct: 987  SLLQASNNSKSSYFWKGFLWGRDLLVKGLRLRVGNGSTIKAFSDPWLPRPTTFKPLRF-N 1046

Query: 62   SENADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPIST-SASDKWVWHYDKMGKYT 121
            +   D  VA FIT    WD+  +      ED D I+ +PIS+ +  D W+WHYDK G Y+
Sbjct: 1047 NGALDTTVASFITADGNWDVTSISHSFCNEDRDLILSMPISSYNLQDSWLWHYDKRGNYS 1106

Query: 122  VKSGYKLCIKHSQEASASSAEVE------------------------------------- 181
            V+SGYKL +     A+++S                                         
Sbjct: 1107 VRSGYKLYMHLKCNATSASTNYRGTQWNSIWKLTVPTKIKIFIWRSAHEHIPTAQNLLLR 1166

Query: 182  --SRCGSCPVCREAMETTDPALFLCSRAREVWEGILPWMN----EEFWIPMDIQDRWLSL 241
                  +C +C +  E+   A F C RAR++W  + P++     E+    +   + W SL
Sbjct: 1167 GIGELPACTICGDRRESIIHAFFHCKRARQIWRTLFPFLTCLSAED---NISFLELWSSL 1226

Query: 242  GD-CQSQRLDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLE-----EFQNANPV 301
             +  + + L+L +I  W IWNDRN++ H + V  V+ + EW+  +L+     +  N +P 
Sbjct: 1227 TEQLEPKDLNLAAITGWGIWNDRNSLIHGKQVSPVEFKCEWLTPFLDSHSQAQMSNYSPR 1286

Query: 302  RGIVNQGVDDVRRILQGVEEIIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISG 361
                ++ V    R    V  + ++ DAA      S   G + +D   +L    ++     
Sbjct: 1287 TQSNHRPVVQYWRPSSSV-SLKLNTDAACR--GASTSFGCIIRDSSCSLVAATSIRVPFP 1346

Query: 362  ISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIH 414
            +SPL AE   +L+GL FA +     L V SDSL  I+ +R ++         + +I+ + 
Sbjct: 1347 LSPLLAEIRCILEGLKFAAASNFTHLEVESDSLLAIQLIRNEIHTRGDEQNWVMEIQALT 1406

BLAST of Lag0006229 vs. ExPASy TrEMBL
Match: A0A803NM27 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 178.7 bits (452), Expect = 5.1e-41
Identity = 130/467 (27.84%), Positives = 204/467 (43.68%), Query Frame = 0

Query: 4    LDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSE 63
            L  +    SS  W+GF WG +LLK+G+R  +GN   I   +DPWIP  S F  +    + 
Sbjct: 1318 LSAATCGVSSLTWQGFCWGRELLKKGLRLQVGNGFDIACATDPWIPGNSIFTPIYFTGAP 1377

Query: 64   NADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPISTSA-SDKWVWHYDKMGKYTVK 123
                 VAD+IT   +W+++KL+      DV+ I+ LP+S  A SD WVWH    G+Y VK
Sbjct: 1378 T--NTVADYITPEKEWNVSKLNADFSSADVERILSLPLSHHAHSDYWVWHATG-GQYEVK 1437

Query: 124  SGYKLCIKHSQE--ASASSAEV-------------------------------------E 183
            SGY +    + E   S SS  V                                      
Sbjct: 1438 SGYHVACLLADENPVSVSSPNVSWWKSFWQLKLPPKVKLFAWKAIHNALPVASELYKRKS 1497

Query: 184  SRCGSCPVCREAMETTDPALFLCSRAREVWEGILPWMNEEFWIPMDIQDRWLSLGDCQSQ 243
                SC +C  A E+   A+F C  AR VW+      N +  + M I+D    + +C ++
Sbjct: 1498 LTSASCSLCLNAWESVGHAMFACKHARHVWKIAGFSFNNKAAVSMKIEDFLFQISECYTK 1557

Query: 244  -RLDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLEEFQNANPV---RGIVNQGV 303
              L++I    W+IW+DRNN+ H ++       S     +L  FQ+A  +    G+     
Sbjct: 1558 SELEMIFCTMWSIWSDRNNVLHGKIPQQPSVISAKAASFLSSFQSAQQLSLHAGLAPADT 1617

Query: 304  DDVRRILQGVEE--IIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISGISPLGA 363
                R         + ++ DAA+D+    +G G + +D  GN+K   +        P   
Sbjct: 1618 PTAHRAWTPPPPNLLKLNVDAAFDDTRKRIGFGAIIRDSTGNVKAAMSHPIDGCCRPQDM 1677

Query: 364  EAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIHQSFRTI 423
            EA+ +   L +AR L  +   V +DSL  +  +RK     +     I+D++       T+
Sbjct: 1678 EAKGLFYSLKWARQLNFKVDLVETDSLILVNALRKSTSKSSSFQDLIFDVQTQLSYLPTV 1737

BLAST of Lag0006229 vs. ExPASy TrEMBL
Match: A0A803NGI9 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 172.9 bits (437), Expect = 2.8e-39
Identity = 117/432 (27.08%), Positives = 202/432 (46.76%), Query Frame = 0

Query: 4    LDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSE 63
            L  S +  SS  W+G  WG DLL +G+R  +G+  S++  SDPWIPR S F  +      
Sbjct: 692  LQASKTGCSSLTWQGICWGRDLLVKGLRLKIGDGSSVQCSSDPWIPRHSEFTPICF--LG 751

Query: 64   NADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPISTSA-SDKWVWHYDKMGKYTVK 123
            +   +V+ +IT+ N+W++  L +     DVD I+ +P+S+S+ SD+W+WH+    +YTV+
Sbjct: 752  DTQNLVSYYITDDNEWNLPLLARDFSAVDVDYILSIPLSSSSVSDQWIWHFTNSNEYTVQ 811

Query: 124  SGYKLC--IKHSQEASASSAE---VE------SRCGSCPVCREAMETTDPALFLCSRARE 183
            SGY L   ++ S  +++S+ +   VE      +   +C +C  A E+ + ALFLC  A++
Sbjct: 812  SGYHLANDLEDSDLSNSSNNQRYLVEILWNPATPIQACSLCSNAWESVEHALFLCKHAKK 871

Query: 184  VWEGILPWMNEEFWIPMDIQDRWLSLGDCQS-QRLDLISIGAWAIWNDRNNIHHQRLVPN 243
            VW      +N +    M   D  + L   +S   ++ +    W +WNDRNN  H +  P 
Sbjct: 872  VWRVAGISLNFDLVSRMSFADFLMLLSTLKSTSEMEQLLCTLWFLWNDRNNFIHGK--PG 931

Query: 244  VQTRSEWI--LEYLEEFQNANPVRGIVNQGVDDVRR---ILQGVEEIIMHCDAAYDEING 303
            +     W   + Y   FQ  +     +        +   I   ++++ M+ DAA D    
Sbjct: 932  LSPTQLWAKSVAYFCNFQQQSTSSKSIRDSTGAAAQPHWISPPLDKLKMNVDAACDISRN 991

Query: 304  SVGIGLVFQDKQGNLKVVKALSTISGISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLT 363
             +G+G++ ++  G +    +      + P   EA+ +L G+ +A    +      SDSL 
Sbjct: 992  KIGVGIIIRNSSGQVVAAYSKPLTGRLKPQEMEAKALLIGINWAARCNLSINLFESDSLI 1051

Query: 364  FIKTVRKKVQCETCLATTIWDIKEIHQSFRTIRFEHALRHYNRFAHKLAHVGLHYQSQ-S 417
             + ++       +     + DIK       ++   H  R  N+ AH LA   L       
Sbjct: 1052 LVNSINSISNAISSFGDLVLDIKNRLSYLSSVCVSHVKRDANQAAHGLAKHALELDDDCM 1111

BLAST of Lag0006229 vs. ExPASy TrEMBL
Match: A0A803PIB6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)

HSP 1 Score: 169.9 bits (429), Expect = 2.4e-38
Identity = 125/457 (27.35%), Positives = 191/457 (41.79%), Query Frame = 0

Query: 9    SSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSENADMV 68
            S  SS  W+G +WG +LL +G+   +G+   +    D WIP    FK L    S     +
Sbjct: 1375 SGLSSLTWQGIVWGRELLSKGLIIKIGDGTGVNCAHDSWIPGNEYFKPLRFTGS--CSNL 1434

Query: 69   VADFITETNQWDIAKLHQVLGKEDVDEIVRLPIS-TSASDKWVWHYDKMGKYTVKSGYKL 128
            VAD+IT+T +WD+  LH      D+D I+ +P+S  S  D+W WHYD  G YTVKSGY L
Sbjct: 1435 VADYITDTREWDLELLHNDFSPADIDRILTIPLSYNSTRDRWRWHYDSSGDYTVKSGYNL 1494

Query: 129  -CIKHSQEASASSAEVES--------------------------------------RCGS 188
             C   +++ S+SS   E+                                         +
Sbjct: 1495 ACSLENKDHSSSSTSQEAWWQLFWGLNLPSKVRIFGWRVINSALPVAQNLFHRKVITSAT 1554

Query: 189  CPVCREAMETTDPALFLCSRAREVWEGI---LPWMNEEFWIPMDIQDRWLSLGDCQSQ-R 248
            C +C  A E+   ALF C  A+ VW+     L +    F   M   D  L L    ++  
Sbjct: 1555 CSLCSRAWESIGHALFSCCHAKSVWQHTSFQLDFTKASF---MKDGDYLLFLSTILTKSE 1614

Query: 249  LDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLEEFQNAN----PVRGIVNQGVD 308
            L+ +    W IW+DRNN  H + + +    S     YL  F +      P    V     
Sbjct: 1615 LEKLFCTMWFIWSDRNNYIHCKQLKHPMAISSQAEAYLANFHSVKSATAPAVSCVAADAR 1674

Query: 309  DVRRILQGVEEIIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISGISPLGAEAE 368
             V+ +      + M+ DAA D     +GIG++ +D  G +    +   +        EA+
Sbjct: 1675 TVKWVPPTESNLKMNVDAALDSSRNKIGIGVIIRDSTGRVIAAMSKPVVGNFKSQEMEAK 1734

Query: 369  VVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIHQSFRTIRFE 417
             +  GL +A+ L++Q   V +D L  +  ++ K    +     + DI     SF      
Sbjct: 1735 AMFWGLQWAKQLQLQPHCVETDCLMLVHALQGKQSQLSSFHDLVEDISYHLSSFSNACIS 1794

BLAST of Lag0006229 vs. ExPASy TrEMBL
Match: A0A5E4FZN9 (PREDICTED: retrotransposon OS=Prunus dulcis OX=3755 GN=ALMOND_2B007697 PE=4 SV=1)

HSP 1 Score: 153.3 bits (386), Expect = 2.3e-33
Identity = 107/461 (23.21%), Positives = 193/461 (41.87%), Query Frame = 0

Query: 4    LDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWIPRPSTFKILSHPRSE 63
            L+    ++ S+ W+   WG +LL +G+R  +GN +SI++++D W+P PS FKI+S P+  
Sbjct: 828  LEAEVGTNPSFIWRSLQWGKELLNKGLRWRVGNGVSIQVYTDKWLPAPSFFKIMSPPQLP 887

Query: 64   NADMVVADFITETNQWDIAKLHQVLGKEDVDEIVRLPISTSAS-DKWVWHYDKMGKYTVK 123
                +V D  T + QW++  L  +   ++VD  +++P+++ A  D  +WHY++ G Y+VK
Sbjct: 888  -LSTLVCDLFTSSGQWNVPLLKDIFWDQEVDAKLQIPLASLAGHDCLIWHYERNGMYSVK 947

Query: 124  SGYKL-CIKHSQEASASSAEVESR-----------------------------CGS---- 183
            SGY+L C++  + +   S  V+                               CG     
Sbjct: 948  SGYRLACLEKDKMSGEPSVRVDLNSKFWKKIWALKIPNKIKFFLWRCAWDFLPCGQILFN 1007

Query: 184  --------CPVCREAMETTDPALFLCSRAREVWEGILPWMNE-EFWIPMDIQDRWLSLG- 243
                    CP C    E+   A++LC  A+EVW     W N  E W     ++ W +L  
Sbjct: 1008 RKIAPTPICPNCHRKAESVLHAVWLCETAKEVWRN-SAWGNVCEEWRVNSFRELWHALQL 1067

Query: 244  DCQSQRLDLISIGAWAIWNDRNNIHHQRLVPNVQTRSEWILEYLEEFQNANPVRGIVNQG 303
                +   L +   W +WN RN+   +            + +  +EF NAN +   ++  
Sbjct: 1068 SSSGEEQGLFAYLCWGLWNRRNSFIFEGKSETATQLLHRMTKLAQEFSNANNLSHTIHGR 1127

Query: 304  VDDVRRILQGVEE-----IIMHCDAAYDEINGSVGIGLVFQDKQGNLKVVKALSTISGIS 363
                +  L G          ++ D A    +   G+G+V ++  G           +   
Sbjct: 1128 QSSPQAPLHGWRPPPAGIYKINVDGAVKSGDSVRGVGVVVRNANGEFMAACVRRIQASYG 1187

Query: 364  PLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKVQCETCLATTIWDIKEIHQS 414
                E    ++GL FA  +      +  D+   I ++    +C       I ++  +  +
Sbjct: 1188 ARQTELMATIEGLRFAIDMGFTAAVLEMDAQDCINSILSTEECNGIDGLLIEEVNYLLHN 1247

BLAST of Lag0006229 vs. TAIR 10
Match: AT3G09510.1 (Ribonuclease H-like superfamily protein )

HSP 1 Score: 75.1 bits (183), Expect = 1.5e-13
Identity = 104/480 (21.67%), Positives = 179/480 (37.29%), Query Frame = 0

Query: 1   MSVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWI----PRPSTFKI 60
           +S+LD       SY W   + G+ LLK+G R  +G+  +I++  D  +    PRP     
Sbjct: 9   VSILDAKVRKQQSYGWASLLDGIALLKKGTRHLIGDGQNIRIGLDNIVDSHPPRP----- 68

Query: 61  LSHPRSENADMVVADFITETNQ---WDIAKLHQVLGKEDVDEIVRLPISTSAS-DKWVWH 120
             +      +M + +          WD +K+ Q + + D   I R+ ++ S   DK +W+
Sbjct: 69  -LNTEETYKEMTINNLFERKGSYYFWDDSKISQFVDQSDHGFIHRIYLAKSKKPDKIIWN 128

Query: 121 YDKMGKYTVKSGYKLC--------------------------------IKH-----SQEA 180
           Y+  G+YTV+SGY L                                 +KH       +A
Sbjct: 129 YNTTGEYTVRSGYWLLTHDPSTNIPAINPPHGSIDLKTRIWNLPIMPKLKHFLWRALSQA 188

Query: 181 SASSAEVESR----CGSCPVCREAMETTDPALFLCSRAREVWEGILPWMNEEFWIPMDIQ 240
            A++  + +R      SCP C    E+ + ALF C  A   W      +     +  D +
Sbjct: 189 LATTERLTTRGMRIDPSCPRCHRENESINHALFTCPFATMAWRLSDSSLIRNQLMSNDFE 248

Query: 241 DRWLSL-----GDCQSQRLDLISIG-AWAIWNDRNNIHHQR---------LVPNVQTRSE 300
           +   ++         S    L+ +   W IW  RNN+   +         L    +T  +
Sbjct: 249 ENISNILNFVQDTTMSDFHKLLPVWLIWRIWKARNNVVFNKFRESPSKTVLSAKAETH-D 308

Query: 301 WILEYLEEFQNANPVRGIVNQGVDDVRRILQGVEEIIMHC--DAAYDEINGSVGIGLVFQ 360
           W+       +  +P R I    ++      +      + C  DA +D        G + +
Sbjct: 309 WLNATQSHKKTPSPTRQIAENKIE-----WRNPPATYVKCNFDAGFDVQKLEATGGWIIR 368

Query: 361 DKQGNLKVVKALSTISGISPLGAEAEVVLQGLCFARSLKMQCLSVLSDSLTFIKTVRKKV 414
           +  G      ++      +PL AE + +L  L          + +  D  T I  +   +
Sbjct: 369 NHYGTPISWGSMKLAHTSNPLEAETKALLAALQQTWIRGYTQVFMEGDCQTLINLI-NGI 428

BLAST of Lag0006229 vs. TAIR 10
Match: ATMG00310.1 (RNA-directed DNA polymerase (reverse transcriptase)-related family protein )

HSP 1 Score: 44.3 bits (103), Expect = 2.9e-04
Identity = 18/47 (38.30%), Positives = 29/47 (61.70%), Query Frame = 0

Query: 2   SVLDVSPSSSSSYFWKGFIWGMDLLKRGIRRNLGNDISIKMFSDPWI 49
           S+++ S  +  SY W+  I G +LL RG+ R +G+ I  K++ D WI
Sbjct: 98  SMMECSVGTRPSYAWRSIIHGRELLSRGLLRTIGDGIHTKVWLDRWI 144
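
Each HSP header above reports identical and similar positions as a fraction of the alignment length, followed by the rounded percentage. The following is a minimal sketch of how such a header line could be parsed and the percentages recomputed; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tooling.

```python
import re

# Matches fragments such as "Identity = 147/464 (31.68%)".
STAT_PATTERN = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> list:
    """Return one record per 'name = matched/length (pct%)' fragment in the line."""
    records = []
    for name, matched, length, reported in STAT_PATTERN.findall(line):
        matched, length = int(matched), int(length)
        records.append({
            "name": name,
            "matched": matched,
            "length": length,
            "reported_pct": float(reported),
            # Percentage recomputed from the fraction, rounded like the report.
            "computed_pct": round(100 * matched / length, 2),
        })
    return records


# Header of the best hit (Momordica charantia, XP_022158377.1).
header = "Identity = 147/464 (31.68%), Positives = 224/464 (48.28%), Query Frame = 0"
for record in parse_hsp_stats(header):
    print(record["name"], record["computed_pct"], record["reported_pct"])
```

Note that the denominator is the alignment length including gaps (464 columns for the best hit), not the length of the query protein.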

The following BLAST results are available for this feature:
vs. NCBI nr
Match Name      E-value  Identity (%)  Description
XP_022158377.1  8.9e-56  31.68         uncharacterized protein LOC111024874 [Momordica charantia]
XP_030497600.1  4.9e-38  27.35         uncharacterized protein LOC115713257 [Cannabis sativa]
XP_024950112.1  1.4e-37  28.09         uncharacterized protein LOC112496847 [Citrus sinensis]
XP_017250619.1  3.5e-36  27.37         PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]
XP_006491472.1  2.3e-35  24.34         uncharacterized protein LOC102626455 [Citrus sinensis]

vs. ExPASy TrEMBL
Match Name      E-value  Identity (%)  Description
A0A6J1DX30      4.3e-56  31.68         uncharacterized protein LOC111024874 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A803NM27      5.1e-41  27.84         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803NGI9      2.8e-39  27.08         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A803PIB6      2.4e-38  27.35         Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1
A0A5E4FZN9      2.3e-33  23.21         PREDICTED: retrotransposon OS=Prunus dulcis OX=3755 GN=ALMOND_2B007697 PE=4 SV=1

vs. TAIR 10
Match Name      E-value  Identity (%)  Description
AT3G09510.1     1.5e-13  21.67         Ribonuclease H-like superfamily protein
ATMG00310.1     2.9e-04  38.30         RNA-directed DNA polymerase (reverse transcriptase)-related family protein
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                         Source       Source Term    Source Description   Alignment
IPR036397  Ribonuclease H superfamily              GENE3D       3.30.420.10    -                    coord: 270..399; e-value: 7.0E-15; score: 57.2
IPR002156  Ribonuclease H domain                   PFAM         PF13456        RVT_3                coord: 276..395; e-value: 5.2E-19; score: 68.3
None       No IPR available                        PIRSR        PIRSR037839-1  PIRSR037839-1        coord: 277..370; e-value: 0.0024; score: 15.6
None       No IPR available                        PANTHER      PTHR46736      FAMILY NOT NAMED     coord: 25..344
IPR044730  Ribonuclease H-like domain, plant type  CDD          cd06222        RNase_H_like         coord: 276..395; e-value: 2.60276E-12; score: 61.5612
IPR012337  Ribonuclease H-like superfamily         SUPERFAMILY  53098          Ribonuclease H-like  coord: 271..395
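
The InterPro hits above largely cover the same stretch of the protein. As a rough illustration, the sketch below parses the `coord` values and computes the residue range shared by all six hits. It assumes the coordinates are 1-based, inclusive residue positions; the helper names `parse_coord` and `shared_region` are hypothetical.

```python
import re


def parse_coord(text: str) -> tuple:
    """Parse 'coord: 276..395' (or '276..395') into a (start, end) tuple."""
    match = re.fullmatch(r"(?:coord:\s*)?(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised coordinate string: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return start, end


def shared_region(spans: list):
    """Return the (start, end) range common to all spans, or None if there is none."""
    start = max(s for s, _ in spans)
    end = min(e for _, e in spans)
    return (start, end) if start <= end else None


# Coordinates copied from the Alignment column, in table order.
coords = [
    "coord: 270..399",  # GENE3D 3.30.420.10
    "coord: 276..395",  # PFAM PF13456
    "coord: 277..370",  # PIRSR PIRSR037839-1
    "coord: 25..344",   # PANTHER PTHR46736
    "coord: 276..395",  # CDD cd06222
    "coord: 271..395",  # SUPERFAMILY 53098
]
spans = [parse_coord(c) for c in coords]
for (start, end) in spans:
    print(start, end, end - start + 1)  # start, end, span length in residues
print(shared_region(spans))  # (277, 344)
```

The shared range is narrowed mainly by the PIRSR start and the PANTHER end; excluding the broad PANTHER family hit leaves 277..370 as the region common to the five Ribonuclease H-related hits.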

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0006229.1  Lag0006229.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0003676      nucleic acid binding
molecular_function  GO:0004523      RNA-DNA hybrid ribonuclease activity