Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGATGGCTATAGAAGACTATGCTGGTCCAGGGTCTAATCCAAAACACACACCACGGCGACCACCAGTGGTGGCTACAGAAGACTATGATGGTCCAGGGTCTAATCCAAAGCACACGCCACGACCACCGCCAATGGTGGCTACAAAAGACTATGCTGGTCCAGGATCTAATCCAAAGCACACACCGCGGCGACAACCAATGGTGGCTACAGAAGACTATGCTGGTCTAGGATCTAATCCAAAGCACACACCGTGGCGACCACCAATGGTGGCTACAGAAGACTGCTCTAGTCCAGGGTCTAATCCAAAGCACACACTACTGCGACCATCGATGGTGGCTACAGAAGACTATGCTGGTCCAGGGTCTAATCCAAAGCACACACTACTGCGACCATCGATGGTGGCTACAGAAGACTACGCTAGTCCAGGGTCTAATCCGAAACACACACTGCGGCGACCGCTGATGGTGGCTACAGAAGATTATGCTGGTCCAGGGTCTAATCCAAAGCACACACCACGACGACCGCCATTTCAGGTTTTAACCACTACTGGTGGCCGTCGCCAGCAGCCAGAGGGCCACGTCAGCAAGGAGCCCTGA
mRNA sequence
ATGATGGCTATAGAAGACTATGCTGGTCCAGGGTCTAATCCAAAACACACACCACGGCGACCACCAGTGGTGGCTACAGAAGACTATGATGGTCCAGGGTCTAATCCAAAGCACACGCCACGACCACCGCCAATGGTGGCTACAAAAGACTATGCTGGTCCAGGATCTAATCCAAAGCACACACCGCGGCGACAACCAATGGTGGCTACAGAAGACTATGCTGGTCTAGGATCTAATCCAAAGCACACACCGTGGCGACCACCAATGGTGGCTACAGAAGACTGCTCTAGTCCAGGGTCTAATCCAAAGCACACACTACTGCGACCATCGATGGTGGCTACAGAAGACTATGCTGGTCCAGGGTCTAATCCAAAGCACACACTACTGCGACCATCGATGGTGGCTACAGAAGACTACGCTAGTCCAGGGTCTAATCCGAAACACACACTGCGGCGACCGCTGATGGTGGCTACAGAAGATTATGCTGGTCCAGGGTCTAATCCAAAGCACACACCACGACGACCGCCATTTCAGGTTTTAACCACTACTGGTGGCCGTCGCCAGCAGCCAGAGGGCCACGTCAGCAAGGAGCCCTGA
Coding sequence (CDS)
ATGATGGCTATAGAAGACTATGCTGGTCCAGGGTCTAATCCAAAACACACACCACGGCGACCACCAGTGGTGGCTACAGAAGACTATGATGGTCCAGGGTCTAATCCAAAGCACACGCCACGACCACCGCCAATGGTGGCTACAAAAGACTATGCTGGTCCAGGATCTAATCCAAAGCACACACCGCGGCGACAACCAATGGTGGCTACAGAAGACTATGCTGGTCTAGGATCTAATCCAAAGCACACACCGTGGCGACCACCAATGGTGGCTACAGAAGACTGCTCTAGTCCAGGGTCTAATCCAAAGCACACACTACTGCGACCATCGATGGTGGCTACAGAAGACTATGCTGGTCCAGGGTCTAATCCAAAGCACACACTACTGCGACCATCGATGGTGGCTACAGAAGACTACGCTAGTCCAGGGTCTAATCCGAAACACACACTGCGGCGACCGCTGATGGTGGCTACAGAAGATTATGCTGGTCCAGGGTCTAATCCAAAGCACACACCACGACGACCGCCATTTCAGGTTTTAACCACTACTGGTGGCCGTCGCCAGCAGCCAGAGGGCCACGTCAGCAAGGAGCCCTGA
Protein sequence
MMAIEDYAGPGSNPKHTPRRPPVVATEDYDGPGSNPKHTPRPPPMVATKDYAGPGSNPKHTPRRQPMVATEDYAGLGSNPKHTPWRPPMVATEDCSSPGSNPKHTLLRPSMVATEDYAGPGSNPKHTLLRPSMVATEDYASPGSNPKHTLRRPLMVATEDYAGPGSNPKHTPRRPPFQVLTTTGGRRQQPEGHVSKEP
Homology
BLAST of Moc03g01470 vs. NCBI nr
Match:
KAF4369213.1 (hypothetical protein G4B88_009511 [Cannabis sativa] >KAF4387559.1 hypothetical protein F8388_011707 [Cannabis sativa])
HSP 1 Score: 120.2 bits (300), Expect = 2.1e-23
Identity = 74/207 (35.75%), Postives = 105/207 (50.72%), Query Frame = 0
Query: 1 MMAIEDYAGPGSNPKHTPRRPPVVATEDYDGPGSNPKHTPRPPP-----MVATKDYAGPG 60
M+ E+Y GP SNP T + P + +DY G NP+H P PP ++ +DYA P
Sbjct: 40 MLIGEEYKGPQSNPSDTHDQLPPLILQDYPSVGPNPRHKPHLPPPANENKLSVQDYADPE 99
Query: 61 SNPKH-TPRRQP----MVATEDYAGLGSNPKHTPWRPPMVATE------DCSSPGSNPKH 120
NP+H P +QP ++ +DYA GSNP+H PP++ E D G NP+H
Sbjct: 100 PNPRHKLPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNENELILQDYPPVGPNPRH 159
Query: 121 -----TLLRPSMVATEDYAGPGSNPKH-----TLLRPSMVATEDYASPGSNPKHTLRRPL 177
L + ++ +DYA P NP+H + ++ +DYA PGSNP+H PL
Sbjct: 160 KPHHPPLANENKLSVQDYADPEPNPRHKPPPQQPTNENKLSIQDYADPGSNPRHKHPPPL 219
BLAST of Moc03g01470 vs. NCBI nr
Match:
XP_030505386.1 (sporozoite surface protein 2-like [Cannabis sativa])
HSP 1 Score: 117.1 bits (292), Expect = 1.7e-22
Identity = 73/208 (35.10%), Postives = 105/208 (50.48%), Query Frame = 0
Query: 1 MMAIEDYAGPGSNPKHTPRRPPVVATEDYDGPGSNPKHTPRPPP-----MVATKDYAGPG 60
M+ E+Y GP SNP T + P + +DY G NP+H P PP ++ +DYA P
Sbjct: 1 MLIGEEYKGPQSNPSDTHDQLPPLILQDYPPVGPNPRHKPHLPPPANENKLSVQDYADPE 60
Query: 61 SNPKH-TPRRQP----MVATEDYAGLGSNPKHTPWRPPMVAT-------EDCSSPGSNPK 120
NP+H P +QP ++ +DYA GSNP+H PP + T +D G NP+
Sbjct: 61 PNPRHKPPPQQPTNENKLSIQDYADPGSNPRHK--HPPPLMTNKNELILQDYPPVGPNPR 120
Query: 121 H-----TLLRPSMVATEDYAGPGSNPKH-----TLLRPSMVATEDYASPGSNPKHTLRRP 177
H L + ++ +DYA P NP+H + ++ +DYA PGSNP+H P
Sbjct: 121 HKPHHPPLANENKLSVQDYADPEPNPRHKPPPQQPTNENKLSIQDYADPGSNPRHKHPPP 180
BLAST of Moc03g01470 vs. NCBI nr
Match:
QCD98561.1 (hypothetical protein DEO72_LG6g3283 [Vigna unguiculata])
HSP 1 Score: 114.0 bits (284), Expect = 1.5e-21
Identity = 76/194 (39.18%), Postives = 87/194 (44.85%), Query Frame = 0
Query: 4 IEDYAGPGSNPKHTPRRPPVV----ATEDYDGPGSNPKHTPRPP----PMVATKDYAGPG 63
++DY G GSNPKH P P + +DY G SNPKH P PP DY G G
Sbjct: 37 VDDYPGTGSNPKHKPPPPRNLNNHYEVDDYPGTRSNPKHKPPPPRDLNDHYEIDDYPGTG 96
Query: 64 SNPKHTP----RRQPMVATEDYAGLGSNPKHTPWRPPMVATEDCSSPGSNPKHTLLRPSM 123
SNPKH P +DY G G+NPKH P PP D
Sbjct: 97 SNPKHKPPPPRDLNNHYEVDDYPGTGANPKHKP--PPSRDLND----------------H 156
Query: 124 VATEDYAGPGSNPKHTLLRP----SMVATEDYASPGSNPKHTLRRPLMV----ATEDYAG 178
+DY G GSNPKH P + +DYA GSNPKH P + +DY G
Sbjct: 157 FEVDDYPGTGSNPKHKPPPPRDLNNHYEVDDYAGTGSNPKHKPPPPRDLNDHYEVDDYPG 212
BLAST of Moc03g01470 vs. NCBI nr
Match:
KAF4369216.1 (hypothetical protein G4B88_009514 [Cannabis sativa])
HSP 1 Score: 108.2 bits (269), Expect = 8.1e-20
Identity = 76/226 (33.63%), Postives = 109/226 (48.23%), Query Frame = 0
Query: 2 MAIEDYAGPGSNPKHTPRRPPVVAT-------EDYDGPGSNPKHTPRPPPM--------- 61
++++DYA P NP+H P PP T +DY PGSNP+H PP M
Sbjct: 85 LSVQDYADPKPNPRHKP--PPQQPTNENKLSIQDYADPGSNPRHQHPPPLMTNENEHKPH 144
Query: 62 ---------VATKDYAGPGSNPKH-TPRRQP----MVATEDYAGLGSNPKHTPWRPPMVA 121
++ +DYA P NP+H P +QP ++ +DY GSNP+H PP++
Sbjct: 145 HPPLANENQLSVQDYANPEPNPRHKPPPQQPTNENKLSIQDYVDPGSNPRH-KHPPPLMT 204
Query: 122 TE------DCSSPGSNPKH-----TLLRPSMVATEDYAGPGSNPKH-----TLLRPSMVA 177
E D G NP+H L ++++ +DYA P NPK+ + ++
Sbjct: 205 NENELILQDYPPVGPNPRHKPHHPPLANENILSVQDYADPEPNPKYKPPPQQPTNENKLS 264
BLAST of Moc03g01470 vs. NCBI nr
Match:
XP_038895781.1 (circumsporozoite protein-like [Benincasa hispida])
HSP 1 Score: 107.1 bits (266), Expect = 1.8e-19
Identity = 53/125 (42.40%), Postives = 77/125 (61.60%), Query Frame = 0
Query: 2 MAIEDYAGPGSNPKHTPRRPPVVATEDYDGPGSNPKHTPRPPPMVATKDYAGPGSNPKHT 61
+ I DYA PG+NP+H P +PP++ DY PG+NP+H P PM+ DY PG+NP+H
Sbjct: 35 ITINDYADPGANPRHDPNQPPMM-INDYADPGANPRHDPNQTPMM-INDYTDPGANPRHD 94
Query: 62 PRRQPMVATEDYAGLGSNPKHTPWRPPMVATEDCSSPGSNPKHTLLRPSMVATEDYAGPG 121
P + PM+ +NP+H P +PPM+ D + P +NP+H +P M+ DYA G
Sbjct: 95 PNQPPMMINR------ANPRHDPNQPPMM-INDYTDPRANPRHDPNQPPMM-INDYADAG 149
Query: 122 SNPKH 127
+NP+H
Sbjct: 155 ANPRH 149
BLAST of Moc03g01470 vs. ExPASy TrEMBL
Match:
A0A7J6FET4 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_011707 PE=4 SV=1)
HSP 1 Score: 120.2 bits (300), Expect = 1.0e-23
Identity = 74/207 (35.75%), Postives = 105/207 (50.72%), Query Frame = 0
Query: 1 MMAIEDYAGPGSNPKHTPRRPPVVATEDYDGPGSNPKHTPRPPP-----MVATKDYAGPG 60
M+ E+Y GP SNP T + P + +DY G NP+H P PP ++ +DYA P
Sbjct: 40 MLIGEEYKGPQSNPSDTHDQLPPLILQDYPSVGPNPRHKPHLPPPANENKLSVQDYADPE 99
Query: 61 SNPKH-TPRRQP----MVATEDYAGLGSNPKHTPWRPPMVATE------DCSSPGSNPKH 120
NP+H P +QP ++ +DYA GSNP+H PP++ E D G NP+H
Sbjct: 100 PNPRHKLPPQQPTNENKLSIQDYADPGSNPRH-KHPPPLMTNENELILQDYPPVGPNPRH 159
Query: 121 -----TLLRPSMVATEDYAGPGSNPKH-----TLLRPSMVATEDYASPGSNPKHTLRRPL 177
L + ++ +DYA P NP+H + ++ +DYA PGSNP+H PL
Sbjct: 160 KPHHPPLANENKLSVQDYADPEPNPRHKPPPQQPTNENKLSIQDYADPGSNPRHKHPPPL 219
BLAST of Moc03g01470 vs. ExPASy TrEMBL
Match:
A0A4D6MDP1 (Uncharacterized protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG6g3283 PE=4 SV=1)
HSP 1 Score: 114.0 bits (284), Expect = 7.2e-22
Identity = 76/194 (39.18%), Postives = 87/194 (44.85%), Query Frame = 0
Query: 4 IEDYAGPGSNPKHTPRRPPVV----ATEDYDGPGSNPKHTPRPP----PMVATKDYAGPG 63
++DY G GSNPKH P P + +DY G SNPKH P PP DY G G
Sbjct: 37 VDDYPGTGSNPKHKPPPPRNLNNHYEVDDYPGTRSNPKHKPPPPRDLNDHYEIDDYPGTG 96
Query: 64 SNPKHTP----RRQPMVATEDYAGLGSNPKHTPWRPPMVATEDCSSPGSNPKHTLLRPSM 123
SNPKH P +DY G G+NPKH P PP D
Sbjct: 97 SNPKHKPPPPRDLNNHYEVDDYPGTGANPKHKP--PPSRDLND----------------H 156
Query: 124 VATEDYAGPGSNPKHTLLRP----SMVATEDYASPGSNPKHTLRRPLMV----ATEDYAG 178
+DY G GSNPKH P + +DYA GSNPKH P + +DY G
Sbjct: 157 FEVDDYPGTGSNPKHKPPPPRDLNNHYEVDDYAGTGSNPKHKPPPPRDLNDHYEVDDYPG 212
BLAST of Moc03g01470 vs. ExPASy TrEMBL
Match:
A0A7J6FEU1 (Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_009514 PE=4 SV=1)
HSP 1 Score: 108.2 bits (269), Expect = 3.9e-20
Identity = 76/226 (33.63%), Postives = 109/226 (48.23%), Query Frame = 0
Query: 2 MAIEDYAGPGSNPKHTPRRPPVVAT-------EDYDGPGSNPKHTPRPPPM--------- 61
++++DYA P NP+H P PP T +DY PGSNP+H PP M
Sbjct: 85 LSVQDYADPKPNPRHKP--PPQQPTNENKLSIQDYADPGSNPRHQHPPPLMTNENEHKPH 144
Query: 62 ---------VATKDYAGPGSNPKH-TPRRQP----MVATEDYAGLGSNPKHTPWRPPMVA 121
++ +DYA P NP+H P +QP ++ +DY GSNP+H PP++
Sbjct: 145 HPPLANENQLSVQDYANPEPNPRHKPPPQQPTNENKLSIQDYVDPGSNPRH-KHPPPLMT 204
Query: 122 TE------DCSSPGSNPKH-----TLLRPSMVATEDYAGPGSNPKH-----TLLRPSMVA 177
E D G NP+H L ++++ +DYA P NPK+ + ++
Sbjct: 205 NENELILQDYPPVGPNPRHKPHHPPLANENILSVQDYADPEPNPKYKPPPQQPTNENKLS 264
BLAST of Moc03g01470 vs. ExPASy TrEMBL
Match:
A0A0E0DTA0 (Uncharacterized protein OS=Oryza meridionalis OX=40149 PE=4 SV=1)
HSP 1 Score: 83.6 bits (205), Expect = 1.0e-12
Identity = 46/127 (36.22%), Postives = 60/127 (47.24%), Query Frame = 0
Query: 4 IEDYAGPGSNPKHTPRRPP----------VVATE----DYDGPGSNPKHTPRPPP----- 63
+ DY PG+NP+H P+RPP +AT+ DY PG+NP+H P+ PP
Sbjct: 59 VNDYPAPGANPRHNPKRPPGREMSVQGMVAMATDVEVNDYPAPGANPRHNPKRPPGREMS 118
Query: 64 ---------MVATKDYAGPGSNPKHTPRRQP--------------MVATEDYAGLGSNPK 89
V DY PG+NP+H P+R P V DY G G+NP+
Sbjct: 119 VLGTVAATTNVEVNDYPAPGANPRHNPKRPPGREMFAQGMAAATTNVEVNDYPGPGANPR 178
BLAST of Moc03g01470 vs. ExPASy TrEMBL
Match:
Q75KZ2 (Uncharacterized protein OS=Oryza sativa subsp. japonica OX=39947 GN=OJ1004_E02.10 PE=4 SV=1)
HSP 1 Score: 81.6 bits (200), Expect = 3.9e-12
Identity = 48/128 (37.50%), Postives = 60/128 (46.88%), Query Frame = 0
Query: 4 IEDYAGPGSNPKHTPRRPP---------------VVATEDYDGPGSNPKHTPRPPP---- 63
+ DY PG+NP+H P+RPP V DY PG+NP+H P+ PP
Sbjct: 60 VNDYPAPGANPRHNPKRPPGREMSVQGMVAAATNNVEVNDYPAPGANPRHNPKSPPGREM 119
Query: 64 ----MVA------TKDYAGPGSNPKHTPRRQP--------MVA------TEDYAGLGSNP 89
MVA DY PG+NP+H P+R P MVA DY G+NP
Sbjct: 120 SVQGMVAAATDVEVNDYPAPGANPRHNPKRPPGREMSVQGMVAATTDVEVNDYLAPGANP 179
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAF4369213.1 | 2.1e-23 | 35.75 | hypothetical protein G4B88_009511 [Cannabis sativa] >KAF4387559.1 hypothetical p... | [more] |
XP_030505386.1 | 1.7e-22 | 35.10 | sporozoite surface protein 2-like [Cannabis sativa] | [more] |
QCD98561.1 | 1.5e-21 | 39.18 | hypothetical protein DEO72_LG6g3283 [Vigna unguiculata] | [more] |
KAF4369216.1 | 8.1e-20 | 33.63 | hypothetical protein G4B88_009514 [Cannabis sativa] | [more] |
XP_038895781.1 | 1.8e-19 | 42.40 | circumsporozoite protein-like [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A7J6FET4 | 1.0e-23 | 35.75 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=F8388_011707 PE=4 SV=1 | [more] |
A0A4D6MDP1 | 7.2e-22 | 39.18 | Uncharacterized protein OS=Vigna unguiculata OX=3917 GN=DEO72_LG6g3283 PE=4 SV=1 | [more] |
A0A7J6FEU1 | 3.9e-20 | 33.63 | Uncharacterized protein OS=Cannabis sativa OX=3483 GN=G4B88_009514 PE=4 SV=1 | [more] |
A0A0E0DTA0 | 1.0e-12 | 36.22 | Uncharacterized protein OS=Oryza meridionalis OX=40149 PE=4 SV=1 | [more] |
Q75KZ2 | 3.9e-12 | 37.50 | Uncharacterized protein OS=Oryza sativa subsp. japonica OX=39947 GN=OJ1004_E02.1... | [more] |
Match Name | E-value | Identity | Description | |