Lag0022159 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0022159
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Location: chr7: 19870832 .. 19871697 (+)
RNA-Seq Expression: Lag0022159
Synteny: Lag0022159
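
The Location field gives a chromosome, a start, an end, and a strand. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common GFF-style convention that this page does not state), the span of the feature could be computed as follows. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location string such as 'chr7: 19870832 .. 19871697 (+)'."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        # Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
        "length": end - start + 1,
    }

loc = parse_location("chr7: 19870832 .. 19871697 (+)")
print(loc["seqid"], loc["strand"], loc["length"])
```

Under a 0-based, half-open convention the length would instead be `end - start`, so the convention should be checked before comparing this value with a sequence length.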
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTTCATCCCTATTCCTCGACTACGACTCGTTTTCGATACCCTCCACAGGTTACTACATCACTTCCCTTTTTCCCTTCATATAACTTTACTCTGCTGTTCTTTCCTCCTAACCAGCAACAGTCGTCTCCTTACCCAACTTTAACCCCTCCCTTGAATATTATCTGACTCTAACTATCTGTTGTGGAAGAATCAATTGCTCAATCATATAATCACATTCAATATGGAAAACTTCATCAATGGAACTCTGGCTCTACCAAAACACTTGGATGCTACTCAAACCCACTAAACCTTCAATTTCTTTTTTGGCAAAAGTACAATATAACTCTTATGAGATGAATTTATTCCTTCCAAAACGAAGATAGACTTGGTGAGATAATTGAGTATTCCTCTGCTTATGAAATTTGGGAGAATTTGCGTGTTGTCTATGAATCATCTTCTATAGCTCATATAATGGTTCTTAGATCTCAACTACAGAAAATTAGAAAGGATGTTATCTCAATTACACAATACTTGACTCATATCAAAGACGTTGACGACAAGTTCTCAGCCATCGATGAGCCTCTTTCCTATATGGACCATCATGGTTACATTCTTGAAGGACTTGATTCGGAATACAATCCTTTCGTTACCTCCATTCAAAATTGCACTGATCGCCACTCCCTTGCTGATGTTCGCAGTCTTCTTCTTGCATATGCAGCTCGTCTGGAAAAGAAAACCTCTGTTGATACGTTAAATATGGTGCAAGCCAATCTCGCCAATCTTTCGATAAGTTCTAATCAAAAGCAGTTCCAACACCCTTCCCAAAATCTCAAACCAAAATTCTTTTCTAGACCTTCTTCCCCTTTTTCATTTCCATTTCCCTAG

mRNA sequence

ATGTTCATCCCTATTCCTCGACTACGACTCGTTTTCGATACCCTCCACAGACTTGGTGAGATAATTGAGTATTCCTCTGCTTATGAAATTTGGGAGAATTTGCGTGTTGTCTATGAATCATCTTCTATAGCTCATATAATGGTTCTTAGATCTCAACTACAGAAAATTAGAAAGGATGTTATCTCAATTACACAATACTTGACTCATATCAAAGACGTTGACGACAAGTTCTCAGCCATCGATGAGCCTCTTTCCTATATGGACCATCATGGTTACATTCTTGAAGGACTTGATTCGGAATACAATCCTTTCGTTACCTCCATTCAAAATTGCACTGATCGCCACTCCCTTGCTGATGTTCGCAGTCTTCTTCTTGCATATGCAGCTCGTCTGGAAAAGAAAACCTCTGTTGATACGTTAAATATGGTGCAAGCCAATCTCGCCAATCTTTCGATAAGTTCTAATCAAAAGCAGTTCCAACACCCTTCCCAAAATCTCAAACCAAAATTCTTTTCTAGACCTTCTTCCCCTTTTTCATTTCCATTTCCCTAG

Coding sequence (CDS)

ATGTTCATCCCTATTCCTCGACTACGACTCGTTTTCGATACCCTCCACAGACTTGGTGAGATAATTGAGTATTCCTCTGCTTATGAAATTTGGGAGAATTTGCGTGTTGTCTATGAATCATCTTCTATAGCTCATATAATGGTTCTTAGATCTCAACTACAGAAAATTAGAAAGGATGTTATCTCAATTACACAATACTTGACTCATATCAAAGACGTTGACGACAAGTTCTCAGCCATCGATGAGCCTCTTTCCTATATGGACCATCATGGTTACATTCTTGAAGGACTTGATTCGGAATACAATCCTTTCGTTACCTCCATTCAAAATTGCACTGATCGCCACTCCCTTGCTGATGTTCGCAGTCTTCTTCTTGCATATGCAGCTCGTCTGGAAAAGAAAACCTCTGTTGATACGTTAAATATGGTGCAAGCCAATCTCGCCAATCTTTCGATAAGTTCTAATCAAAAGCAGTTCCAACACCCTTCCCAAAATCTCAAACCAAAATTCTTTTCTAGACCTTCTTCCCCTTTTTCATTTCCATTTCCCTAG

Protein sequence

MFIPIPRLRLVFDTLHRLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKFFSRPSSPFSFPFP
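
The protein sequence above corresponds to a translation of the CDS. The sketch below illustrates this on the first 30 and the final 18 bases of the CDS, assuming the standard genetic code (the page does not say which translation table was used). The function name `translate` is illustrative, not part of any tool referenced here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGTTCATCCCTATTCCTCGACTACGACTC"  # first 30 bases of the CDS
cds_end = "TCATTTCCATTTCCCTAG"                # final 18 bases of the CDS

print(translate(cds_start))  # N-terminal residues
print(translate(cds_end))    # C-terminal residues before the stop codon
```

The two results can be compared by eye with the beginning and end of the protein sequence listed above.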
Homology
BLAST of Lag0022159 vs. NCBI nr
Match: XP_038887133.1 (uncharacterized protein LOC120077323 [Benincasa hispida])

HSP 1 Score: 172.9 bits (437), Expect = 2.5e-39
Identity = 96/166 (57.83%), Positives = 122/166 (73.49%), Query Frame = 0

Query: 18  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKF 77
           +GEI+ Y SA++IWE LR VYESSSIA IM   SQLQKI+KD ++++QYL  IKDV D F
Sbjct: 1   MGEIVGYESAFDIWEALRTVYESSSIAPIMGFCSQLQKIKKDGLTVSQYLAQIKDVLDNF 60

Query: 78  SAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSV 137
           +AI EPLSY DH  YILEGL SEYNPFV+SI N T+R S+ADVR+LL+ Y +RLEK+T+ 
Sbjct: 61  AAIGEPLSYRDHLSYILEGLGSEYNPFVSSIHNRTNRPSIADVRNLLITYDSRLEKQTAT 120

Query: 138 DTLNMVQANLANLSISSNQKQFQHPSQNLKPKFFSRPSSPFSFPFP 184
           D L ++QAN+A+LSI+S   Q +HP      +   R S+P    FP
Sbjct: 121 DHLQLIQANVAHLSINS---QNRHPQWQQHNRSSIRSSTPSVGSFP 163

BLAST of Lag0022159 vs. NCBI nr
Match: XP_022155181.1 (uncharacterized protein LOC111022315 [Momordica charantia])

HSP 1 Score: 155.6 bits (392), Expect = 4.1e-34
Identity = 85/170 (50.00%), Positives = 115/170 (67.65%), Query Frame = 0

Query: 17  RLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDK 76
           ++GE++   + ++IW +L  VY+S + A IM L+++LQ +RKD  S++QYL  IK++ DK
Sbjct: 103 KMGEVVSLETTHDIWSSLTRVYDSKTTARIMGLKTELQNLRKDGSSVSQYLAKIKEIADK 162

Query: 77  FSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTS 136
           F+A+ EPLSY DH  ++L+GL SEYN FVTSI N  D  SL DVRSLLLAY ARL+K+ +
Sbjct: 163 FAAVGEPLSYRDHLAHVLDGLGSEYNAFVTSIHNRADSPSLEDVRSLLLAYEARLDKQNT 222

Query: 137 VDTLNMVQANLANLSISSNQKQFQHPSQNLKPKF-------FSRPSSPFS 180
           VD LN+ QANL NLS+       QH S+   PKF        S P+SP S
Sbjct: 223 VDQLNIAQANLVNLSL-------QHNSKRPPPKFSFPNHYKHSFPNSPIS 265

BLAST of Lag0022159 vs. NCBI nr
Match: GFS33695.1 (hypothetical protein Acr_00g0030110 [Actinidia rufa])

HSP 1 Score: 130.2 bits (326), Expect = 1.8e-26
Identity = 73/173 (42.20%), Positives = 107/173 (61.85%), Query Frame = 0

Query: 18  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKF 77
           LG+I+ Y+SA +IWE L  +Y ++S AH+  LR+ LQ I+KD ++   Y+   + + +  
Sbjct: 13  LGQIVGYTSASQIWEALERLYAAASFAHLTELRTALQTIKKDGLTALAYIQKFRHLCNSL 72

Query: 78  SAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSV 137
           ++I EP++Y DH  Y L GL  +YNPFVTSIQ+   R S+ +V SLLL+Y ARLE++++ 
Sbjct: 73  ASIGEPVTYTDHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQSAT 132

Query: 138 DTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP 178
           DTL+ +QANLANL+               N   + HP  QN  P +   PSSP
Sbjct: 133 DTLSSLQANLANLTYQKPKFKNPSTNSFPNSNSYSHPRGQNRNPSYSPNPSSP 185

BLAST of Lag0022159 vs. NCBI nr
Match: GFY82848.1 (hypothetical protein Acr_02g0010880 [Actinidia rufa])

HSP 1 Score: 130.2 bits (326), Expect = 1.8e-26
Identity = 73/173 (42.20%), Positives = 107/173 (61.85%), Query Frame = 0

Query: 18  LGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKF 77
           LG+I+ Y+SA +IWE L  +Y ++S AH+  LR+ LQ I+KD ++   Y+   + + +  
Sbjct: 13  LGQIVGYTSASQIWEALERLYAAASFAHLTELRTALQTIKKDGLTALAYIQKFRHLCNSL 72

Query: 78  SAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSV 137
           ++I EP++Y DH  Y L GL  +YNPFVTSIQ+   R S+ +V SLLL+Y ARLE++++ 
Sbjct: 73  ASIGEPVTYTDHLIYFLGGLGRDYNPFVTSIQSQAIRPSIEEVHSLLLSYDARLERQSAT 132

Query: 138 DTLNMVQANLANLSISS------------NQKQFQHP-SQNLKPKFFSRPSSP 178
           DTL+ +QANLANL+               N   + HP  QN  P +   PSSP
Sbjct: 133 DTLSSLQANLANLTYQKPKFKNPSTNSFPNSNSYSHPRGQNRNPSYSPNPSSP 185

BLAST of Lag0022159 vs. NCBI nr
Match: XP_038891713.1 (uncharacterized protein LOC120081111 [Benincasa hispida])

HSP 1 Score: 121.3 bits (303), Expect = 8.6e-24
Identity = 64/112 (57.14%), Positives = 84/112 (75.00%), Query Frame = 0

Query: 47  MVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVT 106
           M L+++LQKIRKD +S++QYL+ IKDV DKFS + E +SY DH  +IL+GL SEYN FVT
Sbjct: 1   MSLKARLQKIRKDNLSLSQYLSQIKDVADKFSVVGESISYRDHLTHILDGLGSEYNAFVT 60

Query: 107 SIQNCTDRHSLADVRSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQ 159
           SIQN  D  S+ DV SLLL+Y A+LEK+ ++D LN+ QA L+ LS   N K+
Sbjct: 61  SIQNHVDNLSVEDVWSLLLSYEAQLEKQNAIDHLNIAQAYLSKLSFQHNSKR 112

BLAST of Lag0022159 vs. ExPASy Swiss-Prot
Match: Q94HW2 (Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana OX=3702 GN=RE1 PE=2 SV=1)

HSP 1 Score: 52.0 bits (123), Expect = 8.4e-06
Identity = 36/154 (23.38%), Positives = 71/154 (46.10%), Query Frame = 0

Query: 25  SSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDVISITQYLTHIKDVDDKFSAIDEPL 84
           ++A +IWE LR +Y + S  H+  LR+QL++  K   +I  Y+  +    D+ + + +P+
Sbjct: 104 TTAAQIWETLRKIYANPSYGHVTQLRTQLKQWTKGTKTIDDYMQGLVTRFDQLALLGKPM 163

Query: 85  SYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADVRSLLLAYAARLEKKTSVDTL---- 144
            + +    +LE L  EY P +  I       +L ++   LL + +++   +S   +    
Sbjct: 164 DHDEQVERVLENLPEEYKPVIDQIAAKDTPPTLTEIHERLLNHESKILAVSSATVIPITA 223

Query: 145 ------NMVQANLANLSISSNQKQFQHPSQNLKP 169
                 N    N  N    +N+   ++ + N KP
Sbjct: 224 NAVSHRNTTTTNNNNNGNRNNRYDNRNNNNNSKP 257
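
Each HSP header above reports identity and similarity as a count over the alignment length, followed by a percentage. The sketch below recomputes those percentages from the counts listed for each hit, assuming the percentage is simply count divided by alignment length, formatted to two decimals. The `percent` helper and the `hsps` dictionary are illustrative names.

```python
def percent(matches, alignment_length):
    """Return matches / alignment_length as a percentage string with two decimals."""
    return f"{100 * matches / alignment_length:.2f}"

# (identical positions, similar positions, alignment length), copied from the HSP headers.
hsps = {
    "XP_038887133.1": (96, 122, 166),
    "XP_022155181.1": (85, 115, 170),
    "GFS33695.1": (73, 107, 173),
    "GFY82848.1": (73, 107, 173),
    "XP_038891713.1": (64, 84, 112),
    "Q94HW2": (36, 71, 154),
}

for name, (identical, similar, length) in hsps.items():
    print(name, percent(identical, length), percent(similar, length))
```

The alignment length includes gap columns, which is why it can exceed the number of query residues covered by the HSP.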

The following BLAST results are available for this feature:

Match Name      E-value  Identity  Description
XP_038887133.1  2.5e-39  57.83     uncharacterized protein LOC120077323 [Benincasa hispida]
XP_022155181.1  4.1e-34  50.00     uncharacterized protein LOC111022315 [Momordica charantia]
GFS33695.1      1.8e-26  42.20     hypothetical protein Acr_00g0030110 [Actinidia rufa]
GFY82848.1      1.8e-26  42.20     hypothetical protein Acr_02g0010880 [Actinidia rufa]
XP_038891713.1  8.6e-24  57.14     uncharacterized protein LOC120081111 [Benincasa hispida]

Match Name      E-value  Identity  Description
Q94HW2          8.4e-06  23.38     Retrovirus-related Pol polyprotein from transposon RE1 OS=Arabidopsis thaliana O...
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source   Source Term     Source Description   Alignment
None      No IPR available  PFAM     PF14223         Retrotran_gag_2      coord: 18..132; e-value: 5.7E-16; score: 58.5
None      No IPR available  PANTHER  PTHR34222:SF48  SUBFAMILY NOT NAMED  coord: 21..160
None      No IPR available  PANTHER  PTHR34222       FAMILY NOT NAMED     coord: 21..160
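
The alignment coordinates in the InterPro results refer to positions on the protein sequence. As a minimal sketch, assuming those coordinates are 1-based and inclusive (not stated on this page), the region matched by PF14223 (Retrotran_gag_2, coord 18..132) could be extracted as below. The helper name `subsequence` is hypothetical.

```python
# Protein sequence of Lag0022159, copied from the Sequences section.
PROTEIN = (
    "MFIPIPRLRLVFDTLHRLGEIIEYSSAYEIWENLRVVYESSSIAHIMVLRSQLQKIRKDV"
    "ISITQYLTHIKDVDDKFSAIDEPLSYMDHHGYILEGLDSEYNPFVTSIQNCTDRHSLADV"
    "RSLLLAYAARLEKKTSVDTLNMVQANLANLSISSNQKQFQHPSQNLKPKFFSRPSSPFSF"
    "PFP"
)

def subsequence(seq, start, end):
    """Return seq[start..end], treating both coordinates as 1-based and inclusive (assumed)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]

pfam_region = subsequence(PROTEIN, 18, 132)
print(len(pfam_region))
print(pfam_region)
```

The same helper applies to the PANTHER matches by passing 21 and 160 as the coordinates.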

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0022159.1  Lag0022159.1  mRNA