Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGATTATTGCGCTTTCCATTAATAATAAGCTTGGATTTACCAATGGATCTTTATCGAAGCCTACTGGTACTCTTATTGCTTCTTGGACTCGTAATAATTGTGTTTTAATTACCTGTATTTTGAACTTTGTTTCAAAATCAATTTCTGCTAGCCTCATCTTCTCCGATTCGGCACACGCTATTTGGCTTGATCTGAAGGAGGGATTTCAGCGTAAGAATGGCCCTAGAATTTTTAAACTTAAGCAAGATTTGGCAACGATAACGCAAGATCAACAATCTGTTTCCATGTATTTTACTCGGCTTAAAAGTGTTTGGGATGAATACATGACTTATCGACCTGCTTGGTCATGTGGCAAATGCTCTTGTGAAGGAAATCAATCTATTGAAGAATTTGTTCAATATGAATATCTCATGAGTTTTCTCATAGGTTTAAATGAGTCTTTCACTTCTACCAGGGCTCAAATTTTGTTGATTGATCCGACTCCTAGCATCAACAAGGCTTTTTCTCTCGTATCTCAGTAG
mRNA sequence
ATGATTATTGCGCTTTCCATTAATAATAAGCTTGGATTTACCAATGGATCTTTATCGAAGCCTACTGGTACTCTTATTGCTTCTTGGACTCGTAATAATTGTGTTTTAATTACCTGTATTTTGAACTTTGTTTCAAAATCAATTTCTGCTAGCCTCATCTTCTCCGATTCGGCACACGCTATTTGGCTTGATCTGAAGGAGGGATTTCAGCGTAAGAATGGCCCTAGAATTTTTAAACTTAAGCAAGATTTGGCAACGATAACGCAAGATCAACAATCTGTTTCCATGTATTTTACTCGGCTTAAAAGTGTTTGGGATGAATACATGACTTATCGACCTGCTTGGTCATGTGGCAAATGCTCTTGTGAAGGAAATCAATCTATTGAAGAATTTGTTCAATATGAATATCTCATGAGTTTTCTCATAGGTTTAAATGAGTCTTTCACTTCTACCAGGGCTCAAATTTTGTTGATTGATCCGACTCCTAGCATCAACAAGGCTTTTTCTCTCGTATCTCAGTAG
Coding sequence (CDS)
ATGATTATTGCGCTTTCCATTAATAATAAGCTTGGATTTACCAATGGATCTTTATCGAAGCCTACTGGTACTCTTATTGCTTCTTGGACTCGTAATAATTGTGTTTTAATTACCTGTATTTTGAACTTTGTTTCAAAATCAATTTCTGCTAGCCTCATCTTCTCCGATTCGGCACACGCTATTTGGCTTGATCTGAAGGAGGGATTTCAGCGTAAGAATGGCCCTAGAATTTTTAAACTTAAGCAAGATTTGGCAACGATAACGCAAGATCAACAATCTGTTTCCATGTATTTTACTCGGCTTAAAAGTGTTTGGGATGAATACATGACTTATCGACCTGCTTGGTCATGTGGCAAATGCTCTTGTGAAGGAAATCAATCTATTGAAGAATTTGTTCAATATGAATATCTCATGAGTTTTCTCATAGGTTTAAATGAGTCTTTCACTTCTACCAGGGCTCAAATTTTGTTGATTGATCCGACTCCTAGCATCAACAAGGCTTTTTCTCTCGTATCTCAGTAG
Protein sequence
MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCGKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ
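The protein sequence above is the translation of the coding sequence. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene, the relationship can be checked with a short Python function. The names `translate` and `CODON_TABLE` are illustrative and are not part of the database; only the first ten and last four codons of the CDS are used here to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing ambiguous bases are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGATTATTGCGCTTTCCATTAATAATAAG"))  # MIIALSINNK
# Last four codons of the CDS, including the TAG stop codon.
print(translate("GTATCTCAGTAG"))  # VSQ
```

Passing the full CDS string to `translate` should reproduce the protein sequence shown above.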
Homology
BLAST of Moc08g16940 vs. NCBI nr
Match:
XP_022154973.1 (uncharacterized protein LOC111022117 [Momordica charantia])
HSP 1 Score: 234.6 bits (597), Expect = 6.6e-58
Identity = 113/173 (65.32%), Positives = 140/173 (80.92%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHA 60
M IALSI NKLGF NGSL KP G L+ W RN V+I LN VSK ISASLIF++S H
Sbjct: 28 MTIALSIKNKLGFINGSLPKPAGDLLPVWIRNKHVVIAWFLNSVSKPISASLIFTNSTHE 87
Query: 61 IWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCGKC 120
IWLDLK+ FQ +NGP+IF+L++DLAT+TQDQ SV+MY+T+LK++WDEY++YRP +CG C
Sbjct: 88 IWLDLKDRFQLQNGPQIFQLRRDLATLTQDQLSVTMYYTKLKALWDEYVSYRPGCTCGSC 147
Query: 121 SCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
SC G + +E+FVQ+E+LM FL+GLNESF RAQILL+DP PSI KAFSL+SQ
Sbjct: 148 SCGGYRLVEKFVQFEHLMKFLMGLNESFAHIRAQILLMDPPPSIGKAFSLISQ 200
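The Identity and Positives percentages in each HSP header are the match counts divided by the alignment length. A minimal sketch of that arithmetic, using a hypothetical helper named `percent` (not a function of BLAST or of this database):

```python
def percent(count, alignment_length):
    """Return count / alignment_length as a percentage, rounded to two decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Values taken from the HSP header of the XP_022154973.1 hit above.
print(percent(113, 173))  # 65.32 (identical residues)
print(percent(140, 173))  # 80.92 (identical plus conservative substitutions)
```

Positives are always at least as numerous as identities, because every identical pair also counts as a positive-scoring pair.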
BLAST of Moc08g16940 vs. NCBI nr
Match:
XP_038895765.1 (uncharacterized protein LOC120083929 [Benincasa hispida])
HSP 1 Score: 216.5 bits (550), Expect = 1.8e-52
Identity = 99/173 (57.23%), Positives = 140/173 (80.92%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHA 60
M++ L I NKLGF +GSL +PTG L+ W NN V+++ IL VSKSIS+S++F++SA A
Sbjct: 65 MVLTLFIQNKLGFIDGSLPRPTGDLLHLWIHNNNVVVSWILKSVSKSISSSILFTESAQA 124
Query: 61 IWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCGKC 120
IWLDL++ FQR+NGPRIF LK++L+++ QDQ SV+MYFT++KS DEY++YRP +CG+C
Sbjct: 125 IWLDLQDCFQRRNGPRIFHLKRELSSLKQDQDSVTMYFTKMKSFCDEYVSYRPGCTCGQC 184
Query: 121 SCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
+C G +S+E+F+Q+EYL+ F +GLN+SF TR+Q+LL+DP P +NKAFS V Q
Sbjct: 185 TCGGIKSMEDFLQFEYLLCFFMGLNDSFNHTRSQLLLMDPPPPLNKAFSFVFQ 237
BLAST of Moc08g16940 vs. NCBI nr
Match:
XP_022856063.1 (uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris])
HSP 1 Score: 210.7 bits (535), Expect = 1.0e-50
Identity = 99/175 (56.57%), Positives = 136/175 (77.71%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGT--LIASWTRNNCVLITCILNFVSKSISASLIFSDSA 60
M+I L + NK+GF +GS++KP + +++W RNN ++I+ ILN VSK ISAS+I+S+SA
Sbjct: 1 MMIPLFVKNKIGFIDGSIAKPDNSDDQVSNWIRNNNIVISWILNSVSKEISASVIYSESA 60
Query: 61 HAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCG 120
H IW+DLKE FQ++NGPRIF+L+++L +TQ Q SV +YFT+LK++W+E YRP SCG
Sbjct: 61 HDIWIDLKERFQQRNGPRIFQLRRELMNLTQGQLSVGVYFTKLKTIWEELSNYRPICSCG 120
Query: 121 KCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
KCSC N+ + E Q EY+MSFL+GLN++F R Q+LL+DP PSINK FSLVSQ
Sbjct: 121 KCSCGENKKLSEHYQMEYVMSFLMGLNDTFAQGRGQLLLMDPMPSINKVFSLVSQ 175
BLAST of Moc08g16940 vs. NCBI nr
Match:
KAA8542446.1 (hypothetical protein F0562_023418 [Nyssa sinensis])
HSP 1 Score: 209.9 bits (533), Expect = 1.7e-50
Identity = 100/176 (56.82%), Positives = 137/176 (77.84%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDS 60
M+IALS+ NKLGF +GS+ +P GT LI SW RNN ++I+ ILN VSK ISAS+IF+ S
Sbjct: 1 MLIALSVKNKLGFVDGSIPEPQGTDNDLINSWIRNNNIVISWILNSVSKEISASIIFAAS 60
Query: 61 AHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSC 120
A IWLDL++ FQ++NGPRIF+LK++L + Q+Q SVS+YFT+LK++W+E YRP SC
Sbjct: 61 AREIWLDLRDRFQQRNGPRIFQLKRELMNLRQEQSSVSIYFTKLKTIWEELSNYRPNCSC 120
Query: 121 GKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
GKCSC G +++ + Q EY+MSFL+GL++SF+ R Q+LL+DP P IN+ FSL+ Q
Sbjct: 121 GKCSCGGVKNLNDHHQMEYIMSFLMGLDDSFSQVRGQLLLMDPIPPINRVFSLIVQ 176
BLAST of Moc08g16940 vs. NCBI nr
Match:
XP_022154919.1 (uncharacterized protein LOC111022065 [Momordica charantia])
HSP 1 Score: 208.0 bits (528), Expect = 6.6e-50
Identity = 100/173 (57.80%), Positives = 137/173 (79.19%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHA 60
++IAL++ NK+GF +GS+S+PT + SW N V+I+ I N +SK ISAS++FSDSAH
Sbjct: 58 IVIALTVKNKIGFVDGSISRPTDGRLHSWIICNNVVISWIFNSLSKKISASVLFSDSAHE 117
Query: 61 IWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCGKC 120
IWLDLKE FQR+N PRIF+L+++L+ +TQDQ SV+ YFTRLK++W E YRPA SCG+C
Sbjct: 118 IWLDLKERFQRQNRPRIFQLRRELSNLTQDQLSVTAYFTRLKTLWSELALYRPACSCGRC 177
Query: 121 SCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
S G +SIE Q EY+M+FL+GLN SF+ RAQ+LL++P P+IN+AF+LV+Q
Sbjct: 178 SYGGVKSIEAHYQQEYVMAFLMGLNVSFSQIRAQLLLMEPAPTINRAFALVAQ 230
BLAST of Moc08g16940 vs. ExPASy TrEMBL
Match:
A0A6J1DLQ9 (uncharacterized protein LOC111022117 OS=Momordica charantia OX=3673 GN=LOC111022117 PE=4 SV=1)
HSP 1 Score: 234.6 bits (597), Expect = 3.2e-58
Identity = 113/173 (65.32%), Positives = 140/173 (80.92%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHA 60
M IALSI NKLGF NGSL KP G L+ W RN V+I LN VSK ISASLIF++S H
Sbjct: 28 MTIALSIKNKLGFINGSLPKPAGDLLPVWIRNKHVVIAWFLNSVSKPISASLIFTNSTHE 87
Query: 61 IWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCGKC 120
IWLDLK+ FQ +NGP+IF+L++DLAT+TQDQ SV+MY+T+LK++WDEY++YRP +CG C
Sbjct: 88 IWLDLKDRFQLQNGPQIFQLRRDLATLTQDQLSVTMYYTKLKALWDEYVSYRPGCTCGSC 147
Query: 121 SCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
SC G + +E+FVQ+E+LM FL+GLNESF RAQILL+DP PSI KAFSL+SQ
Sbjct: 148 SCGGYRLVEKFVQFEHLMKFLMGLNESFAHIRAQILLMDPPPSIGKAFSLISQ 200
BLAST of Moc08g16940 vs. ExPASy TrEMBL
Match:
A0A5J5BIH5 (Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_023418 PE=4 SV=1)
HSP 1 Score: 209.9 bits (533), Expect = 8.4e-51
Identity = 100/176 (56.82%), Positives = 137/176 (77.84%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDS 60
M+IALS+ NKLGF +GS+ +P GT LI SW RNN ++I+ ILN VSK ISAS+IF+ S
Sbjct: 1 MLIALSVKNKLGFVDGSIPEPQGTDNDLINSWIRNNNIVISWILNSVSKEISASIIFAAS 60
Query: 61 AHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSC 120
A IWLDL++ FQ++NGPRIF+LK++L + Q+Q SVS+YFT+LK++W+E YRP SC
Sbjct: 61 AREIWLDLRDRFQQRNGPRIFQLKRELMNLRQEQSSVSIYFTKLKTIWEELSNYRPNCSC 120
Query: 121 GKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
GKCSC G +++ + Q EY+MSFL+GL++SF+ R Q+LL+DP P IN+ FSL+ Q
Sbjct: 121 GKCSCGGVKNLNDHHQMEYIMSFLMGLDDSFSQVRGQLLLMDPIPPINRVFSLIVQ 176
BLAST of Moc08g16940 vs. ExPASy TrEMBL
Match:
A0A6J1DNP7 (uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022065 PE=4 SV=1)
HSP 1 Score: 208.0 bits (528), Expect = 3.2e-50
Identity = 100/173 (57.80%), Positives = 137/173 (79.19%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHA 60
++IAL++ NK+GF +GS+S+PT + SW N V+I+ I N +SK ISAS++FSDSAH
Sbjct: 58 IVIALTVKNKIGFVDGSISRPTDGRLHSWIICNNVVISWIFNSLSKKISASVLFSDSAHE 117
Query: 61 IWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSCGKC 120
IWLDLKE FQR+N PRIF+L+++L+ +TQDQ SV+ YFTRLK++W E YRPA SCG+C
Sbjct: 118 IWLDLKERFQRQNRPRIFQLRRELSNLTQDQLSVTAYFTRLKTLWSELALYRPACSCGRC 177
Query: 121 SCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
S G +SIE Q EY+M+FL+GLN SF+ RAQ+LL++P P+IN+AF+LV+Q
Sbjct: 178 SYGGVKSIEAHYQQEYVMAFLMGLNVSFSQIRAQLLLMEPAPTINRAFALVAQ 230
BLAST of Moc08g16940 vs. ExPASy TrEMBL
Match:
A0A7J0FKC9 (Haloacid dehalogenase-like hydrolase (HAD) superfamily protein OS=Actinidia rufa OX=165716 GN=Acr_13g0000100 PE=4 SV=1)
HSP 1 Score: 204.5 bits (519), Expect = 3.5e-49
Identity = 101/176 (57.39%), Positives = 131/176 (74.43%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTG---TLIASWTRNNCVLITCILNFVSKSISASLIFSDS 60
MIIALS+ NKLGF +GS++KP G L+ SW RNN V+I+ ILN VSK ISAS+IFS S
Sbjct: 300 MIIALSVKNKLGFIDGSITKPEGNDTNLLNSWIRNNNVVISWILNSVSKEISASIIFSAS 359
Query: 61 AHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSC 120
A+ IW+DLK+ FQ+ NGPRIF+L+++L QDQ VS+YFT+LK++W+E YRPA SC
Sbjct: 360 ANEIWIDLKDRFQQSNGPRIFQLRRELMNHVQDQSPVSVYFTKLKTIWEELNNYRPACSC 419
Query: 121 GKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
G C+C G + + Q EY+MSFL+ L+ SF R Q+LL+DP P INK FSL+SQ
Sbjct: 420 GNCTCGGVKKLNSHYQMEYIMSFLMVLHYSFAQIRGQLLLMDPLPPINKVFSLISQ 475
BLAST of Moc08g16940 vs. ExPASy TrEMBL
Match:
A0A5J5A1K4 (Retrotrans_gag domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_010359 PE=4 SV=1)
HSP 1 Score: 202.6 bits (514), Expect = 1.3e-48
Identity = 97/176 (55.11%), Positives = 134/176 (76.14%), Query Frame = 0
Query: 1 MIIALSINNKLGFTNGSLSKPTGT---LIASWTRNNCVLITCILNFVSKSISASLIFSDS 60
M+IAL + NKLGF +GS+ +P GT L SW RNN ++I+ ILN VSK ISAS+IF+ S
Sbjct: 1 MLIALFVKNKLGFVDGSIPEPQGTDTDLFNSWIRNNNIVISWILNSVSKEISASIIFAAS 60
Query: 61 AHAIWLDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYRPAWSC 120
A IWLDL++ FQ++NGPRIF+LK++L + Q+Q SVS+YFT+LK++W+E RP SC
Sbjct: 61 AREIWLDLRDRFQQRNGPRIFQLKRELMNLRQEQSSVSIYFTKLKTIWEELSNSRPNCSC 120
Query: 121 GKCSCEGNQSIEEFVQYEYLMSFLIGLNESFTSTRAQILLIDPTPSINKAFSLVSQ 174
GKCSC G +++ + Q EY+MSFL+GL++SF+ R Q+LL+DP P IN+ FSL+ Q
Sbjct: 121 GKCSCGGVKNLNDHHQMEYIMSFLMGLDDSFSQVRGQLLLMDPMPPINRVFSLIVQ 176
BLAST of Moc08g16940 vs. TAIR 10
Match:
AT1G21280.1 (CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 100.5 bits (249), Expect = 1.4e-21
Identity = 56/173 (32.37%), Positives = 96/173 (55.49%), Query Frame = 0
Query: 5 LSINNKLGFTNGSLSKPT--GTLIASWTRNNCVLITCILNFVSKSISASLIFSDSAHAIW 64
L + K GF +G+L KP L W + N +++ ++N ++ + S++++++AH +W
Sbjct: 53 LRVTKKFGFIDGTLPKPDPFSPLYQPWEQCNAMVMYWLMNSMTDKLLESVMYAETAHKMW 112
Query: 65 LDLKEGFQRKNGPRIFKLKQDLATITQDQQSVSMYFTRLKSVWDEYMTYR--PAWSCGKC 124
DL+ F +I++L++ LAT+ Q SV YF +L VW E Y P CG C
Sbjct: 113 EDLRRVFVPCVDLKIYQLRRRLATLRQGGDSVEEYFGKLSKVWMELSEYAPIPECKCGGC 172
Query: 125 SCEGNQSIEEFVQYEYLMSFLIG--LNESFTSTRAQILLIDPTPSINKAFSLV 172
+CE + EE + E FL+G LN+ F + +I+ P PS+++AF++V
Sbjct: 173 NCECTKRAEEAREKEQRYEFLMGLKLNQGFEAVTTKIMFQKPPPSLHEAFAMV 225
The following BLAST results are available for this feature:
BLAST of Moc08g16940 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022154973.1 | 6.6e-58 | 65.32 | uncharacterized protein LOC111022117 [Momordica charantia]
XP_038895765.1 | 1.8e-52 | 57.23 | uncharacterized protein LOC120083929 [Benincasa hispida]
XP_022856063.1 | 1.0e-50 | 56.57 | uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]
KAA8542446.1 | 1.7e-50 | 56.82 | hypothetical protein F0562_023418 [Nyssa sinensis]
XP_022154919.1 | 6.6e-50 | 57.80 | uncharacterized protein LOC111022065 [Momordica charantia]
BLAST of Moc08g16940 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1DLQ9 | 3.2e-58 | 65.32 | uncharacterized protein LOC111022117 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A5J5BIH5 | 8.4e-51 | 56.82 | Uncharacterized protein OS=Nyssa sinensis OX=561372 GN=F0562_023418 PE=4 SV=1
A0A6J1DNP7 | 3.2e-50 | 57.80 | uncharacterized protein LOC111022065 OS=Momordica charantia OX=3673 GN=LOC111022...
A0A7J0FKC9 | 3.5e-49 | 57.39 | Haloacid dehalogenase-like hydrolase (HAD) superfamily protein OS=Actinidia rufa...
A0A5J5A1K4 | 1.3e-48 | 55.11 | Retrotrans_gag domain-containing protein OS=Nyssa sinensis OX=561372 GN=F0562_01...
BLAST of Moc08g16940 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT1G21280.1 | 1.4e-21 | 32.37 | CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Ha...