|
Sequences
The following sequences are available for this feature:
Gene sequence (with intron) Legend: exonCDSpolypeptide Hold the cursor over a type above to highlight its positions in the sequence below. ATGGGTTTTGTAGAGCCAAGAAACTTCGATGGGAAAAATTTTGAGTGTTGGAAAATGCAAGTCAATGATTACCTAACTTGCAAGAAAATACATAAGGCATTGAAGGAGAGACCGAAAGGGATGACGAACGAAGATTGGGAAGCTTTGGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTGTCAATGGACGTGGCAAGTCTAGTGGCCCATGAGACAACTGCAGTTAAATTGATGGAAGCGCTTACAAATAGGTATGAAAAACCCTCGGCTAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGTTTAATCGGTTAAAATCTGTTAAGATAGAATTTATTGATGAGGTGAATGCTATTCAGTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAACATTGTCTAATTCGACTGGAAATAATACTTTAAAATTTTTAAAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGGTAG mRNA sequence ATGGGTTTTGTAGAGCCAAGAAACTTCGATGGGAAAAATTTTGAGTGTTGGAAAATGCAAGTCAATGATTACCTAACTTGCAAGAAAATACATAAGGCATTGAAGGAGAGACCGAAAGGGATGACGAACGAAGATTGGGAAGCTTTGGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTGTCAATGGACGTGGCAAGTCTAGTGGCCCATGAGACAACTGCAGTTAAATTGATGGAAGCGCTTACAAATAGGTATGAAAAACCCTCGGCTAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGTTTAATCGGTTAAAATCTGTTAAGATAGAATTTATTGATGAGGTGAATGCTATTCAGTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAACATTGTCTAATTCGACTGGAAATAATACTTTAAAATTTTTAAAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGGTAG Coding sequence (CDS) ATGGGTTTTGTAGAGCCAAGAAACTTCGATGGGAAAAATTTTGAGTGTTGGAAAATGCAAGTCAATGATTACCTAACTTGCAAGAAAATACATAAGGCATTGAAGGAGAGACCGAAAGGGATGACGAACGAAGATTGGGAAGCTTTGGATGAAGAGGCAGTTGCAAGCATAAGGATGTGTTTGTCAATGGACGTGGCAAGTCTAGTGGCCCATGAGACAACTGCAGTTAAATTGATGGAAGCGCTTACAAATAGGTATGAAAAACCCTCGGCTAATAATAAGGTCTACCTAGTTAAGAAGTTTTTCAACATGCAAATGTCTGAGGATGCTTCTGTGAATTCCTATATTAATGAGGTTACCACTTTGTTTAATCGGTTAAAATCTGTTAAGATAGAATTTATTGATGAGGTGAATGCTATTCAGTTGTTAACGTCTTTACCTGATAGTTGGGAAACGATGAAGACAACATTGTCTAATTCGACTGGAAATAATACTTTAAAATTTTTAAAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGGTAG Protein sequence MGFVEPRNFDGKNFECWKMQVNDYLTCKKIHKALKERPKGMTNEDWEALDEEAVASIRMCLSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLFNRLKSVKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEIRR
Homology
BLAST of Lag0040829 vs. NCBI nr
Match: KAE8691144.1 (hypothetical protein F3Y22_tig00110893pilonHSYRG01319 [Hibiscus syriacus]) HSP 1 Score: 176.8 bits (447), Expect = 1.7e-40 Identity = 92/182 (50.55%), Postives = 130/182 (71.43%), Query Frame = 0 Query: 2 GFVEPRNFDGKNFECWKMQVNDYLTCKKIHKALK-ERPKGMTNEDWEALDEEAVASIRMC 61 G V+ FDG NF WKMQ+ D+L K +++ L ++P+GM NEDW LD +A+ IR+ Sbjct: 6 GKVKIEKFDGANFGFWKMQIEDFLYQKNLYQPLSGKQPEGMKNEDWVLLDRQALGVIRLT 65
Query: 62 LSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVT 121 LS +VA +A E T LM AL++ YEKPSA+NKV+L+++ FN++M+E ASV ++NE+ Sbjct: 66 LSCNVAFNIAKEKTTAGLMAALSSMYEKPSASNKVHLMRRLFNLRMAEVASVAQHLNELN 125
Query: 122 TLFNRLKSVKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEI 181 T+ +L SV+IEF DEV A+ LL+SLPDSW T +S+S+GN+ LKF V DL ++EEI Sbjct: 126 TITTQLSSVEIEFDDEVRALILLSSLPDSWNATATAVSSSSGNSKLKFDDVRDLVLSEEI 185
Query: 182 RR 183 RR Sbjct: 186 RR 187
BLAST of Lag0040829 vs. NCBI nr
Match: KAG7593230.1 (Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa]) HSP 1 Score: 176.4 bits (446), Expect = 2.2e-40 Identity = 87/174 (50.00%), Postives = 121/174 (69.54%), Query Frame = 0 Query: 9 FDGKNFECWKMQVNDYLTCKKIHKALKERPKGMTNEDWEALDEEAVASIRMCLSMDVASL 68 FDG +F W+MQ+ DYL KK+H+ L +P+ M E+W+ LD + + IR+ LS +VA Sbjct: 14 FDGTDFAFWRMQIEDYLYGKKLHQPLSMKPEKMAQEEWDLLDRQVLGVIRLTLSKNVAHN 73
Query: 69 VAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLFNRLKS 128 VA E T LM+ L++ YEKPSANNKV+L+KK F+++M E V +++NE T+ N+L S Sbjct: 74 VAKEKTTEGLMKVLSDMYEKPSANNKVFLMKKLFHLKMEEGGPVATHVNEFNTIVNQLSS 133
Query: 129 VKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEIRR 183 V+IEF DEV A+ LL SLP+SWE M+ +SNS GN LKF+ V D + EE+RR Sbjct: 134 VEIEFDDEVRALILLASLPNSWEPMRAAVSNSVGNQKLKFVDVRDRILGEEVRR 187
BLAST of Lag0040829 vs. NCBI nr
Match: KAG7584790.1 (Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa]) HSP 1 Score: 176.4 bits (446), Expect = 2.2e-40 Identity = 87/174 (50.00%), Postives = 121/174 (69.54%), Query Frame = 0 Query: 9 FDGKNFECWKMQVNDYLTCKKIHKALKERPKGMTNEDWEALDEEAVASIRMCLSMDVASL 68 FDG +F W+MQ+ DYL KK+H+ L +P+ M E+W+ LD + + IR+ LS +VA Sbjct: 14 FDGTDFAFWRMQIEDYLYGKKLHQPLSMKPEKMAQEEWDLLDRQVLGVIRLTLSKNVAHN 73
Query: 69 VAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLFNRLKS 128 VA E T LM+ L++ YEKPSANNKV+L+KK F+++M E V +++NE T+ N+L S Sbjct: 74 VAKEKTTEGLMKVLSDMYEKPSANNKVFLMKKLFHLKMEEGGPVATHVNEFNTIVNQLSS 133
Query: 129 VKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEIRR 183 V+IEF DEV A+ LL SLP+SWE M+ +SNS GN LKF+ V D + EE+RR Sbjct: 134 VEIEFDDEVRALILLASLPNSWEPMRAAVSNSVGNQKLKFVDVRDRILGEEVRR 187
BLAST of Lag0040829 vs. NCBI nr
Match: TKR90717.1 (hypothetical protein D5086_0000230290 [Populus alba]) HSP 1 Score: 176.4 bits (446), Expect = 2.2e-40 Identity = 91/175 (52.00%), Postives = 120/175 (68.57%), Query Frame = 0 Query: 9 FDGKNFECWKMQVNDYLTCKKIH-KALKERPKGMTNEDWEALDEEAVASIRMCLSMDVAS 68 FDG NF WKMQ+ DYL KK+H L +P+ M E+W+ LD + + IR+ LS VA Sbjct: 13 FDGTNFGYWKMQIEDYLYGKKLHLPLLGSKPEKMEEEEWQLLDRQVLGIIRLSLSRRVAH 72
Query: 69 LVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVTTLFNRLK 128 V E + KLMEAL+ YEKPSANNKV+L+KK FN++M E+ASV ++N T+ N+L Sbjct: 73 NVTKEKSTAKLMEALSGMYEKPSANNKVHLMKKLFNLKMVENASVTQHLNNFNTITNQLS 132
Query: 129 SVKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEIRR 183 SV IEF DE+ A+ LL SLP SWE M+T +SNS G + LK+ + DL +AEE+RR Sbjct: 133 SVAIEFNDEIRALILLASLPSSWEGMRTAVSNSAGKSKLKYDDIRDLILAEEVRR 187
BLAST of Lag0040829 vs. NCBI nr
Match: KAE8678064.1 (Glycosyltransferase, CAZy family GT8 [Hibiscus syriacus]) HSP 1 Score: 176.4 bits (446), Expect = 2.2e-40 Identity = 91/182 (50.00%), Postives = 130/182 (71.43%), Query Frame = 0 Query: 2 GFVEPRNFDGKNFECWKMQVNDYLTCKKIHKALK-ERPKGMTNEDWEALDEEAVASIRMC 61 G V+ FDG +F WKMQ+ D+L K +++ L ++P+GM NEDW LD +A+ IR+ Sbjct: 122 GKVKIEKFDGADFGFWKMQIEDFLYQKNLYQPLSGKQPEGMKNEDWALLDRQALGVIRLT 181
Query: 62 LSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVT 121 LS +VA +A E T LM AL++ YEKPSA+NKV+L+++ FN++M+E ASV ++NE+ Sbjct: 182 LSRNVAFNIAKEKTTAGLMAALSSMYEKPSASNKVHLIRRLFNLRMTEGASVAQHLNELN 241
Query: 122 TLFNRLKSVKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEI 181 T+ +L SV+IEF DEV A+ LL+SLPDSW T +S+S+GN+ LKF V DL ++EEI Sbjct: 242 TITTQLSSVEIEFDDEVRALILLSSLPDSWNATVTAVSSSSGNSKLKFDDVRDLVLSEEI 301
Query: 182 RR 183 RR Sbjct: 302 RR 303
BLAST of Lag0040829 vs. ExPASy Swiss-Prot
Match: P10978 (Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum OX=4097 PE=2 SV=1) HSP 1 Score: 92.4 bits (228), Expect = 5.5e-18 Identity = 59/182 (32.42%), Postives = 96/182 (52.75%), Query Frame = 0 Query: 5 EPRNFDGKN-FECWKMQVNDYLTCKKIHKAL---KERPKGMTNEDWEALDEEAVASIRMC 64 E F+G N F W+ ++ D L + +HK L ++P M EDW LDE A ++IR+ Sbjct: 7 EVAKFNGDNGFSTWQRRMRDLLIQQGLHKVLDVDSKKPDTMKAEDWADLDERAASAIRLH 66
Query: 65 LSMDVASLVAHETTAVKLMEALTNRYEKPSANNKVYLVKKFFNMQMSEDASVNSYINEVT 124 LS DV + + E TA + L + Y + NK+YL K+ + + MSE + S++N Sbjct: 67 LSDDVVNNIIDEDTARGIWTRLESLYMSKTLTNKLYLKKQLYALHMSEGTNFLSHLNVFN 126
Query: 125 TLFNRLKSVKIEFIDEVNAIQLLTSLPDSWETMKTTLSNSTGNNTLKFLKVCDLAIAEEI 183 L +L ++ ++ +E AI LL SLP S++ + TT+ + G T++ V + E Sbjct: 127 GLITQLANLGVKIEEEDKAILLLNSLPSSYDNLATTILH--GKTTIELKDVTSALLLNEK 186
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAE8691144.1 | 1.7e-40 | 50.55 | hypothetical protein F3Y22_tig00110893pilonHSYRG01319 [Hibiscus syriacus] | [more] |
KAG7593230.1 | 2.2e-40 | 50.00 | Pentatricopeptide repeat [Arabidopsis thaliana x Arabidopsis arenosa] | [more] |
KAG7584790.1 | 2.2e-40 | 50.00 | Zinc finger CCHC-type superfamily [Arabidopsis thaliana x Arabidopsis arenosa] | [more] |
TKR90717.1 | 2.2e-40 | 52.00 | hypothetical protein D5086_0000230290 [Populus alba] | [more] |
KAE8678064.1 | 2.2e-40 | 50.00 | Glycosyltransferase, CAZy family GT8 [Hibiscus syriacus] | [more] |
Match Name | E-value | Identity | Description | |
P10978 | 5.5e-18 | 32.42 | Retrovirus-related Pol polyprotein from transposon TNT 1-94 OS=Nicotiana tabacum... | [more] |
InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment |
None | No IPR available | PFAM | PF14223 | Retrotran_gag_2 | coord: 46..170 e-value: 1.6E-26 score: 92.7 |
None | No IPR available | PANTHER | PTHR34676:SF1 | ZINC FINGER, CCHC-TYPE, TUBBY C-TERMINAL-LIKE DOMAIN PROTEIN-RELATED | coord: 9..182 |
None | No IPR available | PANTHER | PTHR34676 | FAMILY NOT NAMED | coord: 9..182 |
Relationships
The following mRNA feature(s) are a part of this gene:
|