Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGAATAGGTTGGGAGATCTCTTCCATTACCCTTGGGTTATAGGTGGAGATTTTAATGAGATTCTCGGAATAGAAGAAAAGGAATGCGGCAGATTAAGAAATCAAGCCCAAATTGAAGCTTTTTCCGAGGTAGTAGATCATTGTCTCCTAAAAGACCTGGGCTGGGTGGGGGACAAGTTTACCTGGAGGAGAAGCAAAAGGAAAGATAGTTGGATCAGAGAAAGATTAGACCGTATTTTCGCTAATCAGGACATGATAAACATGTGCAAGGATATCTCCATCCAGCACCACGGCTACCATCACTCAGACCACAGAATTCTTTCGGCAGAGATTACCTTTGAAGATAAGAAATTCAGGGGCAGGAAGAGCAGGGAGTTGAAGTTCGAAGAAAGCTGGTTAAAACACCCTGATAGCTTTAAGATTATCGAGAAAAATTGGAATAGCCTCCGGGGGACTGATATAAACTCCTTCAACAACAAAATCAAGAATTGTCTTCAGAGCTTAGATAAGTGGCATAAGGACAGGCTCCAAGGTTCGATTAAAGGGGTTGTGGAAAGAAAGCTCAAAGAAATTCAGAACCTAGTGTCGCTACGGGTTTGCGACGCGGAGCGGCCGACTTTGTTCCCAAGAGGAGGAAAATGGAGTCGCCACCAGTCTTGA
mRNA sequence
ATGAATAGGTTGGGAGATCTCTTCCATTACCCTTGGGTTATAGGTGGAGATTTTAATGAGATTCTCGGAATAGAAGAAAAGGAATGCGGCAGATTAAGAAATCAAGCCCAAATTGAAGCTTTTTCCGAGGTAGTAGATCATTGTCTCCTAAAAGACCTGGGCTGGGTGGGGGACAAGTTTACCTGGAGGAGAAGCAAAAGGAAAGATAGTTGGATCAGAGAAAGATTAGACCGTATTTTCGCTAATCAGGACATGATAAACATGTGCAAGGATATCTCCATCCAGCACCACGGCTACCATCACTCAGACCACAGAATTCTTTCGGCAGAGATTACCTTTGAAGATAAGAAATTCAGGGGCAGGAAGAGCAGGGAGTTGAAGTTCGAAGAAAGCTGGTTAAAACACCCTGATAGCTTTAAGATTATCGAGAAAAATTGGAATAGCCTCCGGGGGACTGATATAAACTCCTTCAACAACAAAATCAAGAATTGTCTTCAGAGCTTAGATAAGTGGCATAAGGACAGGCTCCAAGGTTCGATTAAAGGGGTTGTGGAAAGAAAGCTCAAAGAAATTCAGAACCTAGTGTCGCTACGGGTTTGCGACGCGGAGCGGCCGACTTTGTTCCCAAGAGGAGGAAAATGGAGTCGCCACCAGTCTTGA
Coding sequence (CDS)
ATGAATAGGTTGGGAGATCTCTTCCATTACCCTTGGGTTATAGGTGGAGATTTTAATGAGATTCTCGGAATAGAAGAAAAGGAATGCGGCAGATTAAGAAATCAAGCCCAAATTGAAGCTTTTTCCGAGGTAGTAGATCATTGTCTCCTAAAAGACCTGGGCTGGGTGGGGGACAAGTTTACCTGGAGGAGAAGCAAAAGGAAAGATAGTTGGATCAGAGAAAGATTAGACCGTATTTTCGCTAATCAGGACATGATAAACATGTGCAAGGATATCTCCATCCAGCACCACGGCTACCATCACTCAGACCACAGAATTCTTTCGGCAGAGATTACCTTTGAAGATAAGAAATTCAGGGGCAGGAAGAGCAGGGAGTTGAAGTTCGAAGAAAGCTGGTTAAAACACCCTGATAGCTTTAAGATTATCGAGAAAAATTGGAATAGCCTCCGGGGGACTGATATAAACTCCTTCAACAACAAAATCAAGAATTGTCTTCAGAGCTTAGATAAGTGGCATAAGGACAGGCTCCAAGGTTCGATTAAAGGGGTTGTGGAAAGAAAGCTCAAAGAAATTCAGAACCTAGTGTCGCTACGGGTTTGCGACGCGGAGCGGCCGACTTTGTTCCCAAGAGGAGGAAAATGGAGTCGCCACCAGTCTTGA
Protein sequence
MNRLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKFTWRRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRGRKSRELKFEESWLKHPDSFKIIEKNWNSLRGTDINSFNNKIKNCLQSLDKWHKDRLQGSIKGVVERKLKEIQNLVSLRVCDAERPTLFPRGGKWSRHQS
Homology
BLAST of Lag0038052 vs. NCBI nr
Match:
XP_022155286.1 (uncharacterized protein LOC111022423 [Momordica charantia])
HSP 1 Score: 125.2 bits (313), Expect = 7.1e-25
Identity = 76/200 (38.00%), Postives = 108/200 (54.00%), Query Frame = 0
Query: 1 MNRLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKF 60
+ RL +F PW++GGDFNEIL EK G R Q+ ++ F + +D C L D G+VGD F
Sbjct: 30 IGRLHSIFDLPWILGGDFNEILFNSEKLEGVPRRQSLMQNFKDTLDLCGLLDPGFVGDIF 89
Query: 61 TWRRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRG 120
TW + I ERLDR N + + + + I+H + SDHR + AE + G
Sbjct: 90 TWCDGHKYRQPIWERLDRFLINTAIFQIQETLEIRHLEFLASDHRPILAEWLGVGEATVG 149
Query: 121 RKS--RELKFEESWLKHPDSFKIIEKNW---NSLRGTDINSFNNKIKNCLQSLDKWHKDR 180
R+ R +FEE W + +I+ + W L GT F KI +CL+ L KW+ R
Sbjct: 150 RRKGRRPHRFEEQWFSFQECKEIVRRVWAVQGDLCGT---VFQGKINSCLEELIKWNVGR 209
Query: 181 LQGSIKGVVERKLKEIQNLV 196
L GS++G + RK EIQ +V
Sbjct: 210 LGGSLRGAIMRKEVEIQRMV 226
BLAST of Lag0038052 vs. NCBI nr
Match:
KAF4381998.1 (hypothetical protein G4B88_006630 [Cannabis sativa])
HSP 1 Score: 124.0 bits (310), Expect = 1.6e-24
Identity = 70/197 (35.53%), Postives = 107/197 (54.31%), Query Frame = 0
Query: 3 RLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKFTW 62
RL DLF PW+ GGDFNEIL I EK+ G R+ + + F +D C L DLG+ G FTW
Sbjct: 565 RLKDLFDLPWICGGDFNEILSINEKKGGSDRSMSAMTEFQNALDRCSLADLGFEGQCFTW 624
Query: 63 RRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRGRK 122
++ + ++ERLDR F NQ ++ + + + + +SDHR + A + ++ R K
Sbjct: 625 LNKRQGGAHVQERLDRYFCNQRWHDLFPFVKVLNGDFLNSDHRPIVATLENVSRRQRYDK 684
Query: 123 SRELKFEESWLKHPDSFKIIEKNWNSLRGTDIN--SFNNKIKNCLQSLDKWHKDRLQGSI 182
R +FE WLK P+ +II ++W SL N S + C L W+K + GS+
Sbjct: 685 KRCFRFETHWLKDPECQEIINRSWLSLDCPLANQDSLIDIFGLCADQLGMWNKSK-YGSL 744
Query: 183 KGVVERKLKEIQNLVSL 198
V K++ +L+S+
Sbjct: 745 PRQVRETQKQLDDLLSV 760
BLAST of Lag0038052 vs. NCBI nr
Match:
KAF4383622.1 (hypothetical protein F8388_014122 [Cannabis sativa])
HSP 1 Score: 124.0 bits (310), Expect = 1.6e-24
Identity = 70/197 (35.53%), Postives = 107/197 (54.31%), Query Frame = 0
Query: 3 RLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKFTW 62
RL DLF PW+ GGDFNEIL I EK+ G R+ + + F +D C L DLG+ G FTW
Sbjct: 589 RLKDLFDLPWICGGDFNEILSINEKKGGSDRSMSAMTEFQNALDRCSLADLGFEGQCFTW 648
Query: 63 RRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRGRK 122
++ + ++ERLDR F NQ ++ + + + + +SDHR + A + ++ R K
Sbjct: 649 LNKRQGGAHVQERLDRYFCNQRWHDLFPFVKVLNGDFLNSDHRPIVATLENVSRRQRYDK 708
Query: 123 SRELKFEESWLKHPDSFKIIEKNWNSLRGTDIN--SFNNKIKNCLQSLDKWHKDRLQGSI 182
R +FE WLK P+ +II ++W SL N S + C L W+K + GS+
Sbjct: 709 KRCFRFETHWLKDPECQEIINRSWLSLDCPLANQDSLIDIFGLCADQLGMWNKSK-YGSL 768
Query: 183 KGVVERKLKEIQNLVSL 198
V K++ +L+S+
Sbjct: 769 PRQVRETQKQLDDLLSV 784
BLAST of Lag0038052 vs. NCBI nr
Match:
TXG57064.1 (hypothetical protein EZV62_018377 [Acer yangbiense])
HSP 1 Score: 122.9 bits (307), Expect = 3.5e-24
Identity = 68/197 (34.52%), Postives = 103/197 (52.28%), Query Frame = 0
Query: 1 MNRLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKF 60
+ RL + + W+ GGDFNEIL ++EK G ++ I F EV+D C L DLG+ G K
Sbjct: 111 LRRLRSVDDFSWLCGGDFNEILRVKEKSGGSNKSILGICQFREVIDDCNLVDLGFEGPKM 170
Query: 61 TWRRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRG 120
TW + D+ ++ER+DR+ A+ I++ +QH GY+ SDHR L + R
Sbjct: 171 TWNNRRDGDNNVQERIDRMLADTAWIDLFPGARVQHLGYNSSDHRPLLLSFADGFQGARK 230
Query: 121 RKSRELKFEESWLKHPDSFKIIEKNWNSLR-GTDINSFNNKIKNCLQSLDKWHKDRLQGS 180
+ KFE WLK + K+I + WN L + K+ C L W + GS
Sbjct: 231 TNRKPFKFEHFWLKEEECSKVIREAWNLLEVPHSLGDLERKLMFCAGKLSFWSLAKF-GS 290
Query: 181 IKGVVERKLKEIQNLVS 197
++ +E K KE++ L+S
Sbjct: 291 LRKRIEEKQKEVEELLS 306
BLAST of Lag0038052 vs. NCBI nr
Match:
CCA66040.1 (hypothetical protein [Beta vulgaris subsp. vulgaris])
HSP 1 Score: 122.1 bits (305), Expect = 6.0e-24
Identity = 76/207 (36.71%), Postives = 108/207 (52.17%), Query Frame = 0
Query: 1 MNRLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKF 60
+ RL P + GDFNEI IEEKE G R + ++AF EV+D C +KDLG+VG++F
Sbjct: 121 LRRLKQQCSLPVLFFGDFNEITSIEEKEGGAPRCERVMDAFREVIDDCAVKDLGYVGNRF 180
Query: 61 TWRRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRG 120
TW+R + IRERLDR+ AN + + + H + SDH L + D RG
Sbjct: 181 TWQRGNSPSTLIRERLDRMLANDEWCDNFPSWEVVHLPRYRSDHAPLLLKTGVNDSFRRG 240
Query: 121 RKSRELKFEESWLKHPDSFKIIEKNWNSLRGTDINSFNNKIKNCLQSLDKWHKDRLQGSI 180
K KFE WL + KI+E+ WN G DI N++ +SL W + G++
Sbjct: 241 NKL--FKFEAMWLSKEECGKIVEEAWNGSAGEDI---TNRLDEVSRSLSTW-ATKTFGNL 300
Query: 181 KGVVERKLKEIQNLVSLRVCDAERPTL 208
K +RK + + L L+ D + TL
Sbjct: 301 K---KRKKEALTLLNGLQQRDPDASTL 318
BLAST of Lag0038052 vs. ExPASy TrEMBL
Match:
A0A803QED6 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 129.4 bits (324), Expect = 1.8e-26
Identity = 67/190 (35.26%), Postives = 113/190 (59.47%), Query Frame = 0
Query: 11 PWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKFTWRRSKRKDS 70
PW++ GDFNEIL K G LRN+AQ++ F +V+D C L + + GD+FTW + +
Sbjct: 123 PWIVIGDFNEILSNRNKSGGALRNEAQMDMFRKVLDLCQLHEQAFDGDQFTWIKGRHTID 182
Query: 71 WIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEI-----TFEDKKFRGRKSRE 130
I+ERLD FAN + + I H Y+ SDHR ++ I T + ++ R R+SR
Sbjct: 183 TIKERLDWCFANTMWEGIFQPIITHHLDYYRSDHRAIAVHILPLNSTHQQQQQRSRRSR- 242
Query: 131 LKFEESWLKHPDSFKIIEKNWNSLRGTDI-NSFNNKIKNCLQSLDKWHKDRLQGSIKGVV 190
+FE+ WL+ ++ +I++NW+ + ++ + F N ++ C +SL +WH+ + GS K +
Sbjct: 243 FRFEKLWLQEEEATNLIQQNWHQVTTENVADVFVNNLQRCAESLQQWHQQKF-GSFKKNI 302
Query: 191 ERKLKEIQNL 195
R K++++L
Sbjct: 303 SRAQKKVKDL 310
BLAST of Lag0038052 vs. ExPASy TrEMBL
Match:
A0A803PJK4 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 126.7 bits (317), Expect = 1.2e-25
Identity = 66/181 (36.46%), Postives = 109/181 (60.22%), Query Frame = 0
Query: 11 PWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKFTWRRSKRKDS 70
PW++ GDFNEI+ ++K G +RN++QI+AF E VDHC L +L + G++FTW + + +
Sbjct: 595 PWLVMGDFNEIISHDDKLGGAIRNESQIDAFRETVDHCNLTELNFEGERFTWHNNNSRGA 654
Query: 71 WIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFED-KKFRGRKSRELKFE 130
++ERLD F N ++ ++H ++ SDHR+L A+I+ + ++ R KSR +FE
Sbjct: 655 NVKERLDYGFINDKWTDLMATPVLKHLDFYASDHRVLLADISQSNIQQVRPFKSR-FRFE 714
Query: 131 ESWLKHPDSFKIIEKNWNSLRGTDINSFN-NKIKNCLQSLDKWHKDRLQGSIKGVVERKL 190
+ WLK D +II K WN+L +D + + I +C L WH+ + G + +K+
Sbjct: 715 KIWLKEDDCLEIISKFWNNLTVSDPTALTLDNISSCAFHLQSWHRHKF-----GDIPKKI 769
BLAST of Lag0038052 vs. ExPASy TrEMBL
Match:
A0A6J1DRA0 (uncharacterized protein LOC111022423 OS=Momordica charantia OX=3673 GN=LOC111022423 PE=4 SV=1)
HSP 1 Score: 125.2 bits (313), Expect = 3.4e-25
Identity = 76/200 (38.00%), Postives = 108/200 (54.00%), Query Frame = 0
Query: 1 MNRLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKF 60
+ RL +F PW++GGDFNEIL EK G R Q+ ++ F + +D C L D G+VGD F
Sbjct: 30 IGRLHSIFDLPWILGGDFNEILFNSEKLEGVPRRQSLMQNFKDTLDLCGLLDPGFVGDIF 89
Query: 61 TWRRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRG 120
TW + I ERLDR N + + + + I+H + SDHR + AE + G
Sbjct: 90 TWCDGHKYRQPIWERLDRFLINTAIFQIQETLEIRHLEFLASDHRPILAEWLGVGEATVG 149
Query: 121 RKS--RELKFEESWLKHPDSFKIIEKNW---NSLRGTDINSFNNKIKNCLQSLDKWHKDR 180
R+ R +FEE W + +I+ + W L GT F KI +CL+ L KW+ R
Sbjct: 150 RRKGRRPHRFEEQWFSFQECKEIVRRVWAVQGDLCGT---VFQGKINSCLEELIKWNVGR 209
Query: 181 LQGSIKGVVERKLKEIQNLV 196
L GS++G + RK EIQ +V
Sbjct: 210 LGGSLRGAIMRKEVEIQRMV 226
BLAST of Lag0038052 vs. ExPASy TrEMBL
Match:
A0A803QG78 (Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1)
HSP 1 Score: 125.2 bits (313), Expect = 3.4e-25
Identity = 70/184 (38.04%), Postives = 104/184 (56.52%), Query Frame = 0
Query: 1 MNRLGDLF-HYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDK 60
+ RL D+ H PW+ GDFNEI+ K G LRN++Q+E F +V+DHC L + + G+
Sbjct: 56 LKRLADVAPHLPWLAIGDFNEIISNANKSGGGLRNESQMETFRKVLDHCSLHETPFEGEP 115
Query: 61 FTWRRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFR 120
FTW ++ + I+ERLD F N + ++QHH Y+ SDHR +SA IT D +
Sbjct: 116 FTWIKNHFATNTIKERLDWCFVNSLWGSTFSIPTVQHHDYYSSDHRAISATITPMDSTVQ 175
Query: 121 GRKSR-ELKFEESWLKHPDSFKIIEKNWNSLRGTD-INSFNNKIKNCLQSLDKWHKDRLQ 180
K R +FE+ WL P+ +I +WN+ D I + + +C SL KWH+ +
Sbjct: 176 QEKRRSRFRFEKLWLSDPECKDLIINSWNTHSHADPIQTVLLNLDSCATSLQKWHQHK-Y 235
Query: 181 GSIK 182
GS+K
Sbjct: 236 GSMK 238
BLAST of Lag0038052 vs. ExPASy TrEMBL
Match:
A0A7J6GL46 (CCHC-type domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_014122 PE=4 SV=1)
HSP 1 Score: 124.0 bits (310), Expect = 7.6e-25
Identity = 70/197 (35.53%), Postives = 107/197 (54.31%), Query Frame = 0
Query: 3 RLGDLFHYPWVIGGDFNEILGIEEKECGRLRNQAQIEAFSEVVDHCLLKDLGWVGDKFTW 62
RL DLF PW+ GGDFNEIL I EK+ G R+ + + F +D C L DLG+ G FTW
Sbjct: 589 RLKDLFDLPWICGGDFNEILSINEKKGGSDRSMSAMTEFQNALDRCSLADLGFEGQCFTW 648
Query: 63 RRSKRKDSWIRERLDRIFANQDMINMCKDISIQHHGYHHSDHRILSAEITFEDKKFRGRK 122
++ + ++ERLDR F NQ ++ + + + + +SDHR + A + ++ R K
Sbjct: 649 LNKRQGGAHVQERLDRYFCNQRWHDLFPFVKVLNGDFLNSDHRPIVATLENVSRRQRYDK 708
Query: 123 SRELKFEESWLKHPDSFKIIEKNWNSLRGTDIN--SFNNKIKNCLQSLDKWHKDRLQGSI 182
R +FE WLK P+ +II ++W SL N S + C L W+K + GS+
Sbjct: 709 KRCFRFETHWLKDPECQEIINRSWLSLDCPLANQDSLIDIFGLCADQLGMWNKSK-YGSL 768
Query: 183 KGVVERKLKEIQNLVSL 198
V K++ +L+S+
Sbjct: 769 PRQVRETQKQLDDLLSV 784
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022155286.1 | 7.1e-25 | 38.00 | uncharacterized protein LOC111022423 [Momordica charantia] | [more] |
KAF4381998.1 | 1.6e-24 | 35.53 | hypothetical protein G4B88_006630 [Cannabis sativa] | [more] |
KAF4383622.1 | 1.6e-24 | 35.53 | hypothetical protein F8388_014122 [Cannabis sativa] | [more] |
TXG57064.1 | 3.5e-24 | 34.52 | hypothetical protein EZV62_018377 [Acer yangbiense] | [more] |
CCA66040.1 | 6.0e-24 | 36.71 | hypothetical protein [Beta vulgaris subsp. vulgaris] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A803QED6 | 1.8e-26 | 35.26 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A803PJK4 | 1.2e-25 | 36.46 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A6J1DRA0 | 3.4e-25 | 38.00 | uncharacterized protein LOC111022423 OS=Momordica charantia OX=3673 GN=LOC111022... | [more] |
A0A803QG78 | 3.4e-25 | 38.04 | Uncharacterized protein OS=Cannabis sativa OX=3483 PE=4 SV=1 | [more] |
A0A7J6GL46 | 7.6e-25 | 35.53 | CCHC-type domain-containing protein OS=Cannabis sativa OX=3483 GN=F8388_014122 P... | [more] |
Match Name | E-value | Identity | Description | |