Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTCTCCTACTGTTAAAGAAGTCAATCTGAAGTACGACGACTGGATTGTTAAAGATCACGCCTTAATGACCTCGATCAATGCGACGCTTTTGCCAGCTGCTCTCACCTACGTTGTTGGTTGTGATTCTTCGCAACAGGTTTGGAACATCCTTGTCAAGCACTATTCTTCTAGTTCGCGCTCCAATGTGGTCAACTTGAAAACTGATCTGTGGTCAATTTTGAAAAAATCCTCTGAATCAGTGGGTCAATCAATATATTCAGCGTGTTAAAGAGTTGAAAGACAAATTGGCAAATGTTTCTGTGATTATTGAGGATGAAGATCTAATTATCTACACTCTTAATGGCTTGCCGTCGGAATTTAATACCTTTCAAACTTTTATGCGTACTTGTTCGCATCATGTATCCTTTGCTGAACTTCATGTTCTTTTGGTATCCGAAGAAGCTGCCATCGAAAAACAGTCTAAATGCGATGATCTTTTCGCTTCGCTAACTGCCTTATTTGCTAACTCTGATTCTCAACCTCGAAATCAGAATTTTAATCCGAATGTTCCTTGTGGTAGAGGTTATTCTAGTGGAAAGGCAAAGGCTTCTGGTAATAGTGGCCATGGAAAAATTACTAGTGATCGTGGTCGATTTCCATCTGTTGGTAGTAATCAGTTTGGTTTTCAGTTTTCTCGTCCTGTTTCTGACTCTTCCTCTTAG
mRNA sequence
ATGTCTCCTACTGTTAAAGAAGTCAATCTGAAGTACGACGACTGGATTGTTAAAGATCACGCCTTAATGACCTCGATCAATGCGACGCTTTTGCCAGCTGCTCTCACCTACGTTGTTGGTTGTGATTCTTCGCAACAGTGGGTCAATCAATATATTCAGCGTGTTAAAGAGTTGAAAGACAAATTGGCAAATGTTTCTGTGATTATTGAGGATGAAGATCTAATTATCTACACTCTTAATGGCTTGCCGTCGGAATTTAATACCTTTCAAACTTTTATGCGTACTTGTTCGCATCATGTATCCTTTGCTGAACTTCATGTTCTTTTGGTATCCGAAGAAGCTGCCATCGAAAAACAGTCTAAATGCGATGATCTTTTCGCTTCGCTAACTGCCTTATTTGCTAACTCTGATTCTCAACCTCGAAATCAGAATTTTAATCCGAATGTTCCTTGTGGTAGAGGTTATTCTAGTGGAAAGGCAAAGGCTTCTGGTAATAGTGGCCATGGAAAAATTACTAGTGATCGTGGTCGATTTCCATCTGTTGGTAGTAATCAGTTTGGTTTTCAGTTTTCTCGTCCTGTTTCTGACTCTTCCTCTTAG
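The gene sequence differs from the mRNA sequence only by the intron it contains, so the intron can be recovered by comparing the two strings. The following is a minimal sketch, assuming a single intron and an mRNA that otherwise matches the gene exactly; the function name `locate_single_intron` and the toy sequences are hypothetical and are not part of this record.

```python
def locate_single_intron(gene: str, mrna: str) -> tuple[int, int, str]:
    """Return (start, end, intron) using 0-based, end-exclusive gene coordinates.

    Assumes exactly one intron and no other differences between the sequences.
    If the intron starts with the same bases as the downstream exon, the
    reported boundary may be shifted by a few bases; the excised length is
    still correct.
    """
    if len(mrna) >= len(gene):
        raise ValueError("mRNA must be shorter than the gene sequence")

    # Length of the shared 5' part (upstream exon).
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Whatever remains of the mRNA must be the 3' end of the gene.
    tail = mrna[prefix:]
    if not gene.endswith(tail):
        raise ValueError("sequences differ by more than one contiguous insertion")

    end = len(gene) - len(tail)
    return prefix, end, gene[prefix:end]


# Toy example (not the sequences of this feature).
start, end, intron = locate_single_intron("ATGAAAGTCCCAGTTTTAA", "ATGAAATTTTAA")
print(start, end, intron)
```

Applied to the two sequences listed above, the same comparison should isolate the single stretch that is present in the gene sequence but absent from the mRNA.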
Coding sequence (CDS)
ATGTCTCCTACTGTTAAAGAAGTCAATCTGAAGTACGACGACTGGATTGTTAAAGATCACGCCTTAATGACCTCGATCAATGCGACGCTTTTGCCAGCTGCTCTCACCTACGTTGTTGGTTGTGATTCTTCGCAACAGTGGGTCAATCAATATATTCAGCGTGTTAAAGAGTTGAAAGACAAATTGGCAAATGTTTCTGTGATTATTGAGGATGAAGATCTAATTATCTACACTCTTAATGGCTTGCCGTCGGAATTTAATACCTTTCAAACTTTTATGCGTACTTGTTCGCATCATGTATCCTTTGCTGAACTTCATGTTCTTTTGGTATCCGAAGAAGCTGCCATCGAAAAACAGTCTAAATGCGATGATCTTTTCGCTTCGCTAACTGCCTTATTTGCTAACTCTGATTCTCAACCTCGAAATCAGAATTTTAATCCGAATGTTCCTTGTGGTAGAGGTTATTCTAGTGGAAAGGCAAAGGCTTCTGGTAATAGTGGCCATGGAAAAATTACTAGTGATCGTGGTCGATTTCCATCTGTTGGTAGTAATCAGTTTGGTTTTCAGTTTTCTCGTCCTGTTTCTGACTCTTCCTCTTAG
Protein sequence
MSPTVKEVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQWVNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRTCSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDSQPRNQNFNPNVPCGRGYSSGKAKASGNSGHGKITSDRGRFPSVGSNQFGFQFSRPVSDSSS
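The protein sequence corresponds to the CDS read codon by codon. As a rough illustration, the sketch below translates a CDS with the standard genetic code; it assumes an unambiguous, in-frame DNA string, and the names `CODON_TABLE` and `translate` are illustrative rather than taken from any tool used to build this page.

```python
# Standard genetic code, written in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame CDS; stop at the first stop codon if to_stop is set."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGTCTCCTACTGTTAAAGAA"))  # MSPTVKE
print(translate("TCTTCCTCTTAG"))           # SSS
```

The two fragments agree with the start (MSPTVKE) and the end (SSS) of the protein sequence shown above.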
Homology
BLAST of Moc04g25900 vs. NCBI nr
Match:
XP_022158378.1 (uncharacterized protein LOC111024876 [Momordica charantia])
HSP 1 Score: 199.1 bits (505), Expect = 3.5e-47
Identity = 112/175 (64.00%), Positives = 129/175 (73.71%), Query Frame = 0
Query: 2 SPTVKEVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-WVNQYIQRVKELKD 61
S + ++ +DDWI KDH+LMT INATL AAL YVVGC SSQQ W YIQR+KELKD
Sbjct: 47 STITQTISPAFDDWIAKDHSLMTLINATLSSAALAYVVGCKSSQQVWETLYIQRIKELKD 106
Query: 62 KLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRTCSHHVSFAELHVLLVSEEAAIEKQS 121
KLANVSV+++DEDL+IYTLNGLPSEFNTF+T MRT S +SFAELHVLL SE AI+KQS
Sbjct: 107 KLANVSVLVDDEDLVIYTLNGLPSEFNTFRTSMRTRSRLISFAELHVLLNSEVVAIDKQS 166
Query: 122 KCDDLFASLTALFAN--SDSQPRNQNFNPNVPCGRGYSSGKAK--ASGNSGHGKI 172
K DDLF AL N S+SQ RNQN NPN GR + GK K ASG+S +I
Sbjct: 167 KSDDLFVQPAALVVNSGSNSQVRNQNLNPNYTKGRTSNGGKPKFAASGDSAATRI 221
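The percentages reported for each HSP are the match counts divided by the alignment length, which includes gap columns. A small sketch of that arithmetic follows; the helper name `percent` is hypothetical.

```python
def percent(matches: int, aligned_length: int) -> str:
    """Format matches / aligned_length as a percentage with two decimals."""
    return f"{100 * matches / aligned_length:.2f}"


# Counts from the first NCBI nr hit above: 112 identical and 129 similar
# positions over 175 aligned columns.
print(percent(112, 175))  # 64.00
print(percent(129, 175))  # 73.71
```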
BLAST of Moc04g25900 vs. NCBI nr
Match:
XP_022159298.1 (uncharacterized protein LOC111025709 [Momordica charantia])
HSP 1 Score: 169.1 bits (427), Expect = 3.9e-38
Identity = 101/207 (48.79%), Positives = 128/207 (61.84%), Query Frame = 0
Query: 1 MSPTVKEVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W------------ 60
+S T VN + DWI KDHALMT +NATL P+AL Y+VGCDSSQQ W
Sbjct: 37 LSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSR 96
Query: 61 --------------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFN 120
++ Y+QR+KELKDKLANVSV++++EDL+IYTLNGLP EFN
Sbjct: 97 TNVVNLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFN 156
Query: 121 TFQTFMRTCSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDSQPRNQNFNP 174
F T M T S VSF EL+VLLV EEAAI+KQ+K D++F + L AN + + QN NP
Sbjct: 157 AFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSSTLLANMAT--KGQNSNP 216
BLAST of Moc04g25900 vs. NCBI nr
Match:
KAE8645659.1 (hypothetical protein Csa_020439 [Cucumis sativus])
HSP 1 Score: 149.4 bits (376), Expect = 3.2e-32
Identity = 94/202 (46.53%), Positives = 117/202 (57.92%), Query Frame = 0
Query: 7 EVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W------------------ 66
+ N Y+DWI KD ALMT INATL P AL YVVG SS+Q W
Sbjct: 79 QTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNL 138
Query: 67 --------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFM 126
++ YI+R+KE+KDKLANVS I +EDL+IY LNGLP+E+NTF+T M
Sbjct: 139 KSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM 198
Query: 127 RTCSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDS-QPRNQNFNPNVPCG 175
RT S V+F ELHVLL +EE+A+ KQSKCDD + T L ++S S FN N G
Sbjct: 199 RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRG 258
BLAST of Moc04g25900 vs. NCBI nr
Match:
XP_011658579.1 (uncharacterized protein LOC105436058 [Cucumis sativus])
HSP 1 Score: 149.4 bits (376), Expect = 3.2e-32
Identity = 94/202 (46.53%), Positives = 117/202 (57.92%), Query Frame = 0
Query: 7 EVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W------------------ 66
+ N Y+DWI KD ALMT INATL P AL YVVG SS+Q W
Sbjct: 79 QTNPLYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNL 138
Query: 67 --------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFM 126
++ YI+R+KE+KDKLANVS I +EDL+IY LNGLP+E+NTF+T M
Sbjct: 139 KSDLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSM 198
Query: 127 RTCSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDS-QPRNQNFNPNVPCG 175
RT S V+F ELHVLL +EE+A+ KQSKCDD + T L ++S S FN N G
Sbjct: 199 RTRSQPVTFEELHVLLRAEESALAKQSKCDDSYNQPTVLLSSSQSLLSCAPTFNNNFVRG 258
BLAST of Moc04g25900 vs. NCBI nr
Match:
KAG6588985.1 (Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 147.9 bits (372), Expect = 9.3e-32
Identity = 93/195 (47.69%), Positives = 111/195 (56.92%), Query Frame = 0
Query: 9 NLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W-------------------- 68
N YDDW KD ALMT INATL P AL YVVG +S+Q W
Sbjct: 69 NPSYDDWFAKDQALMTVINATLSPEALAYVVGSTTSKQVWNVLAKLYSSSSRSNVVNLKS 128
Query: 69 ------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRT 128
++ YI+R+KE+KDKLANVS ++ DEDL+IY LNGLP+E+NTF+T MRT
Sbjct: 129 DLQTISKKSDESIDAYIKRIKEIKDKLANVSTVVNDEDLLIYALNGLPTEYNTFRTSMRT 188
Query: 129 CSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDS-QPRNQNFNPNVPCGRG 170
S V+F ELHVLL +EE+A+ KQSK DDL TAL A+S S FN N GRG
Sbjct: 189 RSTPVTFEELHVLLKAEESALAKQSKRDDLCKQPTALLASSQSLMSYASTFNNNFVRGRG 248
BLAST of Moc04g25900 vs. ExPASy TrEMBL
Match:
A0A6J1DVX4 (uncharacterized protein LOC111024876 OS=Momordica charantia OX=3673 GN=LOC111024876 PE=4 SV=1)
HSP 1 Score: 199.1 bits (505), Expect = 1.7e-47
Identity = 112/175 (64.00%), Positives = 129/175 (73.71%), Query Frame = 0
Query: 2 SPTVKEVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-WVNQYIQRVKELKD 61
S + ++ +DDWI KDH+LMT INATL AAL YVVGC SSQQ W YIQR+KELKD
Sbjct: 47 STITQTISPAFDDWIAKDHSLMTLINATLSSAALAYVVGCKSSQQVWETLYIQRIKELKD 106
Query: 62 KLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRTCSHHVSFAELHVLLVSEEAAIEKQS 121
KLANVSV+++DEDL+IYTLNGLPSEFNTF+T MRT S +SFAELHVLL SE AI+KQS
Sbjct: 107 KLANVSVLVDDEDLVIYTLNGLPSEFNTFRTSMRTRSRLISFAELHVLLNSEVVAIDKQS 166
Query: 122 KCDDLFASLTALFAN--SDSQPRNQNFNPNVPCGRGYSSGKAK--ASGNSGHGKI 172
K DDLF AL N S+SQ RNQN NPN GR + GK K ASG+S +I
Sbjct: 167 KSDDLFVQPAALVVNSGSNSQVRNQNLNPNYTKGRTSNGGKPKFAASGDSAATRI 221
BLAST of Moc04g25900 vs. ExPASy TrEMBL
Match:
A0A6J1DYF1 (uncharacterized protein LOC111025709 OS=Momordica charantia OX=3673 GN=LOC111025709 PE=4 SV=1)
HSP 1 Score: 169.1 bits (427), Expect = 1.9e-38
Identity = 101/207 (48.79%), Positives = 128/207 (61.84%), Query Frame = 0
Query: 1 MSPTVKEVNLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W------------ 60
+S T VN + DWI KDHALMT +NATL P+AL Y+VGCDSSQQ W
Sbjct: 37 LSSTSPIVNPAFSDWIAKDHALMTLLNATLSPSALAYMVGCDSSQQVWQTLVKYYSSSSR 96
Query: 61 --------------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFN 120
++ Y+QR+KELKDKLANVSV++++EDL+IYTLNGLP EFN
Sbjct: 97 TNVVNLKSNLQSISKKPGESIDLYMQRIKELKDKLANVSVLVDNEDLLIYTLNGLPPEFN 156
Query: 121 TFQTFMRTCSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDSQPRNQNFNP 174
F T M T S VSF EL+VLLV EEAAI+KQ+K D++F + L AN + + QN NP
Sbjct: 157 AFCTSMCTRSQSVSFEELYVLLVYEEAAIDKQTKHDEVFVQSSTLLANMAT--KGQNSNP 216
BLAST of Moc04g25900 vs. ExPASy TrEMBL
Match:
A0A5D3CLI6 (T4.5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold106G001120 PE=4 SV=1)
HSP 1 Score: 141.4 bits (355), Expect = 4.2e-30
Identity = 87/202 (43.07%), Positives = 114/202 (56.44%), Query Frame = 0
Query: 9 NLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W-------------------- 68
N Y+DWI KD ALMT INATL P AL YVVG SS+Q W
Sbjct: 83 NPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKS 142
Query: 69 ------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRT 128
++ YI+R+KE+KDKLANVS I +EDL+IY LNGLP+E+NTF+T MRT
Sbjct: 143 DLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT 202
Query: 129 CSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDSQPRNQNFNPNVPCGRGY 175
S V+F ELHVLL +EE+A+ KQSK DD + T L ++S S + C +
Sbjct: 203 RSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSL---------LSCAPTF 262
BLAST of Moc04g25900 vs. ExPASy TrEMBL
Match:
A0A1S3BI58 (uncharacterized protein LOC103490319 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)
HSP 1 Score: 141.4 bits (355), Expect = 4.2e-30
Identity = 87/202 (43.07%), Positives = 114/202 (56.44%), Query Frame = 0
Query: 9 NLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W-------------------- 68
N Y+DWI KD ALMT INATL P AL YVVG SS+Q W
Sbjct: 83 NPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKS 142
Query: 69 ------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRT 128
++ YI+R+KE+KDKLANVS I +EDL+IY LNGLP+E+NTF+T MRT
Sbjct: 143 DLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT 202
Query: 129 CSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDSQPRNQNFNPNVPCGRGY 175
S V+F ELHVLL +EE+A+ KQSK DD + T L ++S S + C +
Sbjct: 203 RSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSL---------LSCAPTF 262
BLAST of Moc04g25900 vs. ExPASy TrEMBL
Match:
A0A1S4DWT9 (uncharacterized protein LOC103490319 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103490319 PE=4 SV=1)
HSP 1 Score: 141.4 bits (355), Expect = 4.2e-30
Identity = 87/202 (43.07%), Positives = 114/202 (56.44%), Query Frame = 0
Query: 9 NLKYDDWIVKDHALMTSINATLLPAALTYVVGCDSSQQ-W-------------------- 68
N Y+DWI KD ALMT INATL P AL YVVG SS+Q W
Sbjct: 83 NPSYEDWIAKDQALMTVINATLSPEALAYVVGSTSSKQVWDVLAKLYSSGSRSNVVNLKS 142
Query: 69 ------------VNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRT 128
++ YI+R+KE+KDKLANVS I +EDL+IY LNGLP+E+NTF+T MRT
Sbjct: 143 DLQTIYKKPDESIDAYIKRIKEIKDKLANVSTFINEEDLLIYALNGLPNEYNTFRTSMRT 202
Query: 129 CSHHVSFAELHVLLVSEEAAIEKQSKCDDLFASLTALFANSDSQPRNQNFNPNVPCGRGY 175
S V+F ELHVLL +EE+A+ KQSK DD + T L ++S S + C +
Sbjct: 203 RSQPVTFEELHVLLRAEESALAKQSKGDDSYNQPTVLLSSSQSL---------LSCAPTF 262
BLAST of Moc04g25900 vs. TAIR 10
Match:
AT1G34070.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G48050.1); Has 648 Blast hits to 647 proteins in 29 species: Archae - 0; Bacteria - 0; Metazoa - 16; Fungi - 25; Plants - 607; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 43.9 bits (102), Expect = 1.8e-04
Identity = 24/80 (30.00%), Positives = 41/80 (51.25%), Query Frame = 0
Query: 42 DSSQQWVNQYIQRVKELKDKLANVSVIIEDEDLIIYTLNGLPSEFNTFQTFMRTCSHHVS 101
D V Y +++K+L D L NV V + D +L++Y LNGL +F+ ++ S
Sbjct: 125 DIGDMRVADYYRKMKKLADSLRNVDVPVTDRNLVMYVLNGLNPKFDNIINVIKHRQPFPS 184
Query: 102 FAELHVLLVSEEAAIEKQSK 122
F + +L EE +++ K
Sbjct: 185 FDDAATMLQEEEDRLKRAIK 204
The following BLAST results are available for this feature:
Database | Match Name | E-value | Identity (%) | Description
NCBI nr | XP_022158378.1 | 3.5e-47 | 64.00 | uncharacterized protein LOC111024876 [Momordica charantia]
NCBI nr | XP_022159298.1 | 3.9e-38 | 48.79 | uncharacterized protein LOC111025709 [Momordica charantia]
NCBI nr | KAE8645659.1 | 3.2e-32 | 46.53 | hypothetical protein Csa_020439 [Cucumis sativus]
NCBI nr | XP_011658579.1 | 3.2e-32 | 46.53 | uncharacterized protein LOC105436058 [Cucumis sativus]
NCBI nr | KAG6588985.1 | 9.3e-32 | 47.69 | Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyr...
ExPASy TrEMBL | A0A6J1DVX4 | 1.7e-47 | 64.00 | uncharacterized protein LOC111024876 OS=Momordica charantia OX=3673 GN=LOC111024...
ExPASy TrEMBL | A0A6J1DYF1 | 1.9e-38 | 48.79 | uncharacterized protein LOC111025709 OS=Momordica charantia OX=3673 GN=LOC111025...
ExPASy TrEMBL | A0A5D3CLI6 | 4.2e-30 | 43.07 | T4.5 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold106G001120 PE=4 SV=...
ExPASy TrEMBL | A0A1S3BI58 | 4.2e-30 | 43.07 | uncharacterized protein LOC103490319 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
ExPASy TrEMBL | A0A1S4DWT9 | 4.2e-30 | 43.07 | uncharacterized protein LOC103490319 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
TAIR 10 | AT1G34070.1 | 1.8e-04 | 30.00 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); BE...