Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAACACCGCCTATATAGCACATGACTTGGATAGACCGATTAGATCTTATGCGGCGCCCAACCTCTATAACTTCAACCCAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCTCTTTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGTGAATGCCCTAGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAAAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTACTTTAGACTAAACAAGGCCACACAGCAGACTGTTGATGCTGTGTTTGTAGACGATATGCTGAAAAGTACATACAACCAGATTAAGACGACGCAGGACACGATGGCCAGCAATAATGAAGAATGGGACGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGCATGGATAAGAACGTCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAATGCCGCAGGAAGCTATGTGCTCGCGGCTAATCAAATTGATGACATGGGATGTGTGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCGTTCATAAGGAACGACCCCTTCTCCAATACCTACAACCCTGGTTGGAGGAACCATCTCAACTTTGGATGGGGAGGATCGAGTCAACAACAAAGGCGACATGGTGGTCAAAGTGACCATCGCGGGGAAGCACCTGGCTCCCACGCGAGGTACCAAAACAATAGACTCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAACCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTTCCGAGCAATACAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAATAG
mRNA sequence
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAACACCGCCTATATAGCACATGACTTGGATAGACCGATTAGATCTTATGCGGCGCCCAACCTCTATAACTTCAACCCAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCTCTTTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGTGAATGCCCTAGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAAAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTACTTTAGACTAAACAAGGCCACACAGCAGACTGTTGATGCTGTGTTTGTAGACGATATGCTGAAAAGTACATACAACCAGATTAAGACGACGCAGGACACGATGGCCAGCAATAATGAAGAATGGGACGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGCATGGATAAGAACGTCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAATGCCGCAGGAAGCTATGTGCTCGCGGCTAATCAAATTGATGACATGGGATGTGTGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCACTCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAACCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTTCCGAGCAATACAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAATAG
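The gene sequence differs from the mRNA sequence only by the intron that splicing removes. As a minimal sketch, assuming a single contiguous intron (the heading says "with intron"), its position can be located by comparing the shared prefix and suffix of the two strings. The function name `find_single_intron` is hypothetical and not part of any tool referenced on this page.

```python
def find_single_intron(gene: str, mrna: str) -> tuple[int, int]:
    """Return (start, end) such that gene[:start] + gene[end:] == mrna.

    Assumes the mRNA is the gene with exactly one contiguous segment removed.
    """
    if len(mrna) > len(gene):
        raise ValueError("mRNA is longer than the gene sequence")

    # Length of the shared prefix (upstream exon).
    prefix = 0
    while prefix < len(mrna) and gene[prefix] == mrna[prefix]:
        prefix += 1

    # Length of the shared suffix (downstream exon), never overlapping the prefix.
    suffix = 0
    while suffix < len(mrna) - prefix and gene[-1 - suffix] == mrna[-1 - suffix]:
        suffix += 1

    start, end = prefix, len(gene) - suffix
    if gene[:start] + gene[end:] != mrna:
        raise ValueError("sequences differ by more than one contiguous gap")
    return start, end


# Toy example (not the sequences from this page):
gene = "AAACGTTTAGCCC"
mrna = "AAACCCC"
start, end = find_single_intron(gene, mrna)
intron = gene[start:end]
```

Because the prefix is matched greedily, the reported boundaries can shift by a few bases when the intron ends repeat the flanking exon bases, so the coordinates are best treated as approximate.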
Coding sequence (CDS)
ATGGAAAATAACAACAGAAATGCCCCTCCGCCGCAAGCTGACCCAGAACCAAACACCGCCTATATAGCACATGACTTGGATAGACCGATTAGATCTTATGCGGCGCCCAACCTCTATAACTTCAACCCAGGAATCGCCTACCCTGTATTTGGCGAGAACGCCAGGTTTGAAATCAAACCTGTTATGCTTCAAATGATTCAGAACGCCGGACAATTCGGCGGACATCCTGGGGAAGATCCACACGAACATATAAGGAGTTTCTACTCCATCTGCGCTTCCTTCCATATGCCAGGCATCTCACCTGAAGAATTAAGATTCGCTCTTTTCCCGTTAACTCTGAGGGATGAGGCGAAGAGGTGGGTGAATGCCCTAGAAGATGGCGAGGTGGGAACATGGGATCAATTAATAGAAAAATTTATGAAGAAATTTTTCCCACCTCACGAAAATGCTAGAAGGAGGAAGGAACTTATGAGCTTCCAGCAGAAGGATAAAGAAAACCTACATGACGCGTGGAGTAGGTTCAAACGGATGGTCAAAGCATGCCCCCACAATGGCATTCCTAAATGCATATTGATGGAGGTTTTCTACTTTAGACTAAACAAGGCCACACAGCAGACTGTTGATGCTGTGTTTGTAGACGATATGCTGAAAAGTACATACAACCAGATTAAGACGACGCAGGACACGATGGCCAGCAATAATGAAGAATGGGACGAAGATGATTTCGGCAATCGCCGAGGAGGACGAGCAAAAGATGATGGCATGGATAAGAACGTCGTGGTGGCATTGCAGGGACAAATGACTGCGATGAACAATTTACTTAAATCAATGGCAATATCGCAAGTCAATGCCGCAGGAAGCTATGTGCTCGCGGCTAATCAAATTGATGACATGGGATGTGTGGGATGCGACGGTCATCATAACACTGACGCATGCCCACTCAATACTGAAACCGTCGCACTCCAACAATCCCATCATCAACAGCATCCCACCACCACCGCCTCGTCCACCTCTCCCATGGAAAACCTCCTCCGCGAATACATGCAGAAAAATGATGCTCTTCTGCAAAGCCAAGCTTCATCAATTCGTAATCTGGAGGTACAGTTAGGTCAGCTCGCTAGTGATTTCTCCGGAAGACAGCAAGGATCCCTTCCGAGCAATACAGAAACGCCAAATCAGGCGGGAGGATCTGGTAAATAG
Protein sequence
MENNNRNAPPPQADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDGMDKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLNTETVALQQSHHQQHPTTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTETPNQAGGSGK
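The protein sequence is the translation of the coding sequence. A minimal sketch of that translation follows, assuming the standard genetic code (NCBI translation table 1) and an uppercase DNA string read in frame from the first base. The `translate` helper is written here for illustration and is not taken from the source page.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First eight and last five codons of the coding sequence shown above.
n_terminus = translate("ATGGAAAATAACAACAGAAATGCC")
c_terminus = translate("GGATCTGGTAAATAG")
```

The two fragments translate to `MENNNRNA` and `GSGK`, matching the start and end of the protein sequence listed above.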
Homology
BLAST of PI0023494 vs. ExPASy TrEMBL
Match:
A0A6J1G7Q6 (uncharacterized protein LOC111451598 OS=Cucurbita moschata OX=3662 GN=LOC111451598 PE=4 SV=1)
HSP 1 Score: 253.8 bits (647), Expect = 1.2e-63
Identity = 152/444 (34.23%), Positives = 227/444 (51.13%), Query Frame = 0
Query: 18 NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
N ++A D +R IR+YA P + NP I P + FE+KPVM QM+Q GQF G
Sbjct: 10 NAIHVADDRERAIRAYAHPAVEELNPCIIRPEM-QATTFELKPVMFQMLQTIGQFHGLSS 69
Query: 78 EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIE 137
+DPH H++SF + SF G+ + +R + F +LRD AK W+N L G + +W+ L E
Sbjct: 70 KDPHLHLKSFLGVSDSFRFQGVDKDVIRLSFFSYSLRDGAKSWLNILALGIIDSWNSLAE 129
Query: 138 KFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYF 197
KF+ K+FPP +AR R E+++FQ+ + E L +AW RFK ++ CPH+G+P CI +E FY
Sbjct: 130 KFLFKYFPPTRSARFRNEIVAFQKFENETLSEAWERFKETLRKCPHHGLPHCIQIETFYN 189
Query: 198 RLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDGMDK 257
LN AT+Q VDA D+L TYN+ + +ASNN +W + G+ + ++
Sbjct: 190 GLNTATKQVVDASANGDILSKTYNEAYEILERIASNNCQWVD---VRSNPGKKTREVLEV 249
Query: 258 NVVVALQGQMTAMNNLLKSMAISQ---VNAAGSYVLAANQIDDMGCVGCDGHHNTDACPL 317
+ + ++ Q+ +M N+L+++A Q + A Q CV C H D CP
Sbjct: 250 DALSSINAQLASMTNILQNLAFGQGSMIKAPAHTATVMIQTATESCVYCGEKHTFDQCPS 309
Query: 318 NTETVAL---------------------------------QQSHHQQHP----------- 377
N ++ Q S++QQ P
Sbjct: 310 NPASIFYVGNQASQGNPKTNPSSNTYNPGWRNHPNFLCKGQGSYNQQMPPKANYPPGFGL 369
Query: 378 -----------TTTASSTSP--------MENLLREYMQKNDALLQSQASSIRNLEVQLGQ 396
TT TS +E+L++EYM +NDA++QSQ S+RNLEVQ+GQ
Sbjct: 370 QNQLTYDSQQATTQGEGTSQAQHISGTLLESLIKEYMARNDAVIQSQQVSLRNLEVQVGQ 429
BLAST of PI0023494 vs. ExPASy TrEMBL
Match:
U5CUI2 (Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMTR_s04947p00003620 PE=4 SV=1)
HSP 1 Score: 252.3 bits (643), Expect = 3.4e-63
Identity = 134/304 (44.08%), Positives = 179/304 (58.88%), Query Frame = 0
Query: 18 NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
N +A D R IR YAAP NPGI P + +FE+KPVM QM+Q GQF G P
Sbjct: 11 NPIILADDRARAIREYAAPMFNELNPGIVRPEI-QAPQFELKPVMFQMLQTVGQFSGMPT 70
Query: 78 EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIE 137
EDPH H+RSF + SF + G+S E LR LFP +LRD A+ W+N L V W+ L E
Sbjct: 71 EDPHLHLRSFLEVSDSFKIQGVSEEVLRLKLFPFSLRDRARSWLNTLPPDSVTNWNDLAE 130
Query: 138 KFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYF 197
KF++K+FPP NA+ R E+MSFQQ + E+ DAW RFK +++ CPH+GIP CI ME FY
Sbjct: 131 KFLRKYFPPTRNAKFRSEIMSFQQLEDESTSDAWERFKELLRKCPHHGIPHCIQMETFYN 190
Query: 198 RLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRG--GRAKDDGM 257
LN A++ +DA +L +YN+ +T+ASNN +W N R R +
Sbjct: 191 GLNAASRMVLDASANGAILSKSYNEAFEILETIASNNYQW-----SNTRAPTSRKVAGVL 250
Query: 258 DKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLN 317
+ + + AL QM +M N+LK+++I NA AA Q DD+ CV C H + CP N
Sbjct: 251 EVDAITALTAQMASMTNVLKNLSIG--NAKNIQPAAAIQSDDVSCVFCGEGHVFEKCPSN 306
Query: 318 TETV 320
E+V
Sbjct: 311 PESV 306
BLAST of PI0023494 vs. ExPASy TrEMBL
Match:
A0A6J1EEI2 (uncharacterized protein LOC111433394 OS=Cucurbita moschata OX=3662 GN=LOC111433394 PE=4 SV=1)
HSP 1 Score: 239.6 bits (610), Expect = 2.3e-59
Identity = 144/417 (34.53%), Positives = 211/417 (50.60%), Query Frame = 0
Query: 18 NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
N ++A D +R IR+YA P + NP I P + FE+KPVM QM+Q GQF G P
Sbjct: 40 NAIHLADDRERAIRAYAHPAVEELNPCIIRPEM-QATTFELKPVMFQMLQTIGQFHGLPS 99
Query: 78 EDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIE 137
EDPH H++SF + SF + + +R +LFP +LRD AK W+N L G + +W+ L+E
Sbjct: 100 EDPHLHLKSFLGVSDSFRFQRVDKDVIRLSLFPYSLRDGAKSWLNTLALGTIDSWNSLVE 159
Query: 138 KFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYF 197
KF+ K+FPP NAR R E++ FQQ + + L +AW RFK M++ CPH+G+P CI ME FY
Sbjct: 160 KFLIKYFPPTRNARFRNEIVVFQQFEDDTLSEAWERFKEMLRKCPHHGLPHCIQMETFYN 219
Query: 198 RLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDGMDK 257
LN AT+Q VDA +L TYN+ + +ASNN +W + GR ++
Sbjct: 220 GLNIATKQVVDASANGAILSKTYNEAYEILERIASNNCQWAD---VRSNPGRKTRGVLEV 279
Query: 258 NVVVALQGQMTAMNNLLKSMAISQ---VNAAGSYVLAANQIDDMGCVGCDGHHNTDACPL 317
+ + ++ Q+ ++ N+L+++A+ Q + A V NQ CV C H D CP
Sbjct: 280 DALSSINAQLASVTNILQNLALGQDSMIKAPVHTVAVINQTAAESCVYCGEEHTFDQCPS 339
Query: 318 NTETVAL---------------------------------QQSHHQQHP----------- 369
N ++ Q S++QQ P
Sbjct: 340 NPASIFYVGNQASQGNPKNNPFSNTYNPGWRNHPNFSWKGQGSYNQQMPPKANYPPGFGL 399
BLAST of PI0023494 vs. ExPASy TrEMBL
Match:
A0A6J1DW02 (uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024897 PE=4 SV=1)
HSP 1 Score: 234.6 bits (597), Expect = 7.3e-58
Identity = 140/424 (33.02%), Positives = 225/424 (53.07%), Query Frame = 0
Query: 5 NRNAPPPQADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQ 64
N N + E N +A + D +R YAA NF+ GI P+ + FE+KP+M Q
Sbjct: 69 NGNMRDHARNDEFNYIQMADNRDVAMREYAATAFQNFDSGIVNPI-PAHXNFELKPMMFQ 128
Query: 65 MIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNAL 124
M+Q G FGG EDPH+H++SF I +F +PGI+ + LFP +L+D+A+ +NA
Sbjct: 129 MLQTIGHFGGQEHEDPHDHLKSFIQIANAFRLPGITDDAXXLTLFPFSLKDQARXXLNAF 188
Query: 125 EDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHN 184
G + TW L+EKF+ KFFPP +A R+E++SF+Q D+E +H+AW RFK +++ C ++
Sbjct: 189 PXGSITTWGSLVEKFLTKFFPPTRHADIREEIISFRQYDREPVHEAWERFKELIRKCXNH 248
Query: 185 GIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGN 244
G+P C +E F+ L+ T+ ++ K T+N+I + +AS+NE W +
Sbjct: 249 GLPACXQIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQ--RS 308
Query: 245 RRGGRAKDDG--MDKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAAN--------- 304
R + +D + ++ ++Q +M MN LK MA+ N + +
Sbjct: 309 RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLATXIQPVQSDYCTHAPV 368
Query: 305 -QIDDMGC------------VGCDGHHNTDA--------CPLNTETVALQQSHHQQHPT- 364
Q++D+ C G G + + P QQ ++Q+ T
Sbjct: 369 CQVNDLICWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQHIPPPQQQYNQRTQTP 428
Query: 365 TTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTETP 396
++ S +EN+++EYM + DA++QSQA+S+RN QLG LA++ R QGS P +TE P
Sbjct: 429 PIQNNNSNLENMMKEYMARTDAVIQSQAASMRNFGTQLGHLANELKNRPQGSFPGHTELP 488
BLAST of PI0023494 vs. ExPASy TrEMBL
Match:
A0A6J1EQ90 (uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC111436411 PE=4 SV=1)
HSP 1 Score: 234.2 bits (596), Expect = 9.6e-58
Identity = 150/449 (33.41%), Positives = 221/449 (49.22%), Query Frame = 0
Query: 18 NTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPG 77
N ++A D +R IR+YA P + NP I P + FE+KPVM QM+Q GQF G P
Sbjct: 62 NPIHLADDRERAIRAYAHPAVEELNPCIIRPEI-QGTTFELKPVMFQMLQTIGQFHGLPL 121
Query: 78 EDPHEHIRSFYSI-------CASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVG 137
EDPH H++SF + SF G+ + +R +LFP LRD AK W+N L G +
Sbjct: 122 EDPHLHLKSFLGVSDSFRFHSDSFRFQGVDKDMIRLSLFPYLLRDGAKSWLNTLAPGTID 181
Query: 138 TWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCI 197
+W+ L E F+ K+FPP NAR + E+++FQQ + E L +A RFK M++ CPH+G+P CI
Sbjct: 182 SWNSLAENFLIKYFPPTRNARFKNEIVTFQQFEDETLSEACERFKEMLRKCPHHGLPHCI 241
Query: 198 LMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRA 257
ME FY LN T+Q VDA +L TYN+ + +ASNN +W + GR
Sbjct: 242 QMETFYNGLNIVTKQVVDASANGAILSKTYNEAYEILERIASNNCQWAD---VRSNPGRK 301
Query: 258 KDDGMDKNVVVALQGQMTAMNNLLKSMAISQ---VNAAGSYVLAANQIDDMGCVGCDGHH 317
++ + + ++ Q+ ++ N+L+++A+ Q + A A NQ CV C H
Sbjct: 302 TRGVLEVDALSSINAQLASVTNILQNLALGQDSMIKAPVHTAAAINQTAAESCVYCGEEH 361
Query: 318 NTDACPLNTETVAL---------------------------------QQSHHQQHP---- 377
D CP N ++ Q ++QQ P
Sbjct: 362 TFDQCPSNPASIFYVGNQASQGNLKNNPFSNTYNPGWRNHPNFSWKGQSLYNQQMPPKAN 421
Query: 378 ------------------------TTTASSTS--PMENLLREYMQKNDALLQSQASSIRN 394
TT A TS +E+L++EYM KNDA++QSQ +S+RN
Sbjct: 422 YPSGFRLQNQLAYSSQQVNTQGKGTTQAQYTSETSIESLIKEYMAKNDAVIQSQQASLRN 481
BLAST of PI0023494 vs. NCBI nr
Match:
XP_030497803.1 (uncharacterized protein LOC115713460 [Cannabis sativa])
HSP 1 Score: 297.4 bits (760), Expect = 1.9e-76
Identity = 177/427 (41.45%), Positives = 229/427 (53.63%), Query Frame = 0
Query: 13 ADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQF 72
A E N +A D R IR YAAP NPGI P + FE+KPVM QM+Q GQF
Sbjct: 10 AHNEANPIALADDRTRAIREYAAPMFNELNPGIVRPEI-QAPHFELKPVMFQMLQTVGQF 69
Query: 73 GGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTW 132
GG P EDPH HIRSF + SF + G+S E LR LFP +LRD A+ W+N L V W
Sbjct: 70 GGSPTEDPHLHIRSFLEVSDSFKLQGVSEEALRLKLFPFSLRDRARAWLNTLPPDSVTNW 129
Query: 133 DQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILM 192
+ L EKF++K+FPP NA+ R E+MSFQQ + E DAW RFK +++ CPH+GIP CI +
Sbjct: 130 NDLAEKFLRKYFPPTRNAKFRSEIMSFQQSEDETTSDAWERFKELLRKCPHHGIPHCIQL 189
Query: 193 EVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKD 252
E FY LN A++ +DA +L +YN+ + +ASNN +W NR K
Sbjct: 190 ETFYNGLNAASRMVLDASANGAILSKSYNEAFEILERIASNNYQWST----NRAHTSRKV 249
Query: 253 DG-MDKNVVVALQGQMTAMNNLLKSMAISQVNAAGS-YVLAANQIDDMGCVGCDGHHNTD 312
G ++ + + AL QM +M N+LK+M N GS AA Q CV C H +
Sbjct: 250 AGVLEVDALTALTAQMASMTNILKNM-----NMGGSVQPAAAIQRAKNSCVYCGDGHTFE 309
Query: 313 ACPLNTETVAL------------------------------------------QQSHHQQ 372
CP N +V QQ QQ
Sbjct: 310 NCPSNLASVCYVGNQNFNRNNNPYSNSYNPAWKHHPNFSWGGQGKQSFPPGFSQQPRPQQ 369
Query: 373 HPTTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNT 396
S TS +E+L+R+YM KND ++QSQA+S+RNLEVQLGQLA+D R QG+LPS+T
Sbjct: 370 PHQPQGSQTSSLESLMRDYMAKNDTVIQSQAASLRNLEVQLGQLANDLKNRPQGTLPSDT 426
BLAST of PI0023494 vs. NCBI nr
Match:
XP_030505184.1 (uncharacterized protein LOC115720166 [Cannabis sativa])
HSP 1 Score: 286.6 bits (732), Expect = 3.4e-73
Identity = 168/425 (39.53%), Positives = 226/425 (53.18%), Query Frame = 0
Query: 22 IAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPGEDPH 81
+ D R IR YAAP NPGI P + +FE+KPVM QM+Q GQF P EDPH
Sbjct: 20 LVDDRARAIREYAAPMFNELNPGIVRPEI-QAPQFELKPVMFQMLQTVGQFSEMPTEDPH 79
Query: 82 EHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIEKFMK 141
H+RSF + SF + G+S E R LFP +LRD A+ W+N L V W+ EKF++
Sbjct: 80 LHLRSFLEMSDSFKIQGVSEEVRRLKLFPFSLRDRARSWLNTLSPDSVTNWNDFAEKFLR 139
Query: 142 KFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNK 201
K+FPP NA+ R E+MSF Q + E+ DAW RFK +++ CPH+GIP CI ME FY LN
Sbjct: 140 KYFPPTRNAKFRSEIMSFHQLEDESASDAWERFKELLRKCPHHGIPHCIQMETFYNGLNA 199
Query: 202 ATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDEDDFGNRRGGRAKDDG-MDKNVV 261
+Q +DA +L +YN+ +T+ASNN +W R G K G ++ + +
Sbjct: 200 TSQMVLDASANGAILSKSYNEAFEILETIASNNYQWS----NTRAPGSRKVAGVLEVDAI 259
Query: 262 VALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLNTETVA 321
AL QM +M N+LK+++I N+ AA Q DD+ CV C H + CP N E+V
Sbjct: 260 TALTTQMASMTNVLKNLSIG--NSKNIQPAAAIQSDDVSCVFCREGHAFEKCPSNPESVC 319
Query: 322 L--------------------------------------------------QQSHHQQHP 381
QQ H QH
Sbjct: 320 YMGNQNFNRNNGAFSNSYNQAWKNHPNLSWGSRSKLKHFDQGRQAYPPGFSQQLRHPQHA 379
Query: 382 TTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPSNTET 396
S S +E+L+R+YM KNDA++QSQA+ +RNLE+QLG LA++ R QGSLPS+TE
Sbjct: 380 QN--SQPSSLESLMRDYMAKNDAVIQSQAAFLRNLELQLGHLANELKARPQGSLPSDTEN 435
BLAST of PI0023494 vs. NCBI nr
Match:
XP_030483210.1 (uncharacterized protein LOC115699807 [Cannabis sativa])
HSP 1 Score: 280.0 bits (715), Expect = 3.1e-71
Identity = 160/429 (37.30%), Positives = 237/429 (55.24%), Query Frame = 0
Query: 22 IAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKPVMLQMIQNAGQFGGHPGEDPH 81
+A D D+ IR YAAP NPGI P + +FE+KPVM QM+Q GQF G P EDPH
Sbjct: 1 MADDRDQIIRQYAAPLFNELNPGIVRPEI-QAPQFELKPVMFQMLQTVGQFSGIPTEDPH 60
Query: 82 EHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRWVNALEDGEVGTWDQLIEKFMK 141
H+R F + SF +PG++ + LR LFP +LRD+A+ W+N+L V TW +L E+F+
Sbjct: 61 LHLRLFMEVSDSFKLPGVTEDALRLKLFPYSLRDQARAWLNSLPSASVTTWQELAERFLM 120
Query: 142 KFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKACPHNGIPKCILMEVFYFRLNK 201
K+FPP +NA+ RKE+ SFQQ + E+L++AW RFK +++ CPH+GIP CI ME FY LN
Sbjct: 121 KYFPPTKNAKLRKEITSFQQFEDESLYEAWERFKELLRKCPHHGIPHCIQMETFYNGLNA 180
Query: 202 ATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDED--DFGNRRGGRAKDDGMDKNV 261
T+ VDA +L +YN+ + +++NN +W G + G ++ +
Sbjct: 181 HTRMVVDASANGALLAKSYNEAYDIIERISNNNYQWPTTRVPLGKKVAG-----VLEVDA 240
Query: 262 VVALQGQMTAMNNLLKSMAISQVNAAGSYVLAANQIDDMGCVGCDGHHNTDACPLNTETV 321
+ AL Q+ +M+N++K+M++ Q + Q++++ CV C H D CP N +V
Sbjct: 241 ITALSAQVASMSNMIKNMSMGQQMGQQNVSSPVGQLEEVSCVFCSEAHTFDNCPFNPASV 300
Query: 322 ALQQSH------HQQHPT---------TTASS------------------------TSPM 381
S ++QHP T+ SS TS +
Sbjct: 301 FYMGSQNAYNQTYKQHPNLAYRNQGAGTSNSSMLPRSNFPPGFSQAFQQRQQQGVQTSSL 360
Query: 382 ENLLRE--------------YMQKNDALLQSQASSIRNLEVQLGQLASDFSGRQQGSLPS 396
E+++R+ YM KND +QSQA+S+R LE Q+GQLA++ R QG+LPS
Sbjct: 361 ESMMRDFMAKTENFMTRTESYMAKNDTAIQSQATSMRTLENQVGQLANELRNRPQGTLPS 420
BLAST of PI0023494 vs. NCBI nr
Match:
XP_017233063.1 (PREDICTED: uncharacterized protein LOC108207110 [Daucus carota subsp. sativus])
HSP 1 Score: 278.5 bits (711), Expect = 9.1e-71
Identity = 168/443 (37.92%), Positives = 244/443 (55.08%), Query Frame = 0
Query: 1 MENNNRNAPPPQADPEPNTAYIAHDLDRPIRSYAAPNLYNFNPGIAYPVFGENARFEIKP 60
M++N N P P A+I D DR IR YAAP N GI P + +FE+KP
Sbjct: 36 MDDNVNNGDIPIV---PRGAFIVDDKDRAIRQYAAPRFEELNSGIIRPNI-QATQFELKP 95
Query: 61 VMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPLTLRDEAKRW 120
VM QM+Q GQF G P EDPH H+R F I SF G+ + LR LFP ++RD A+ W
Sbjct: 96 VMFQMLQTIGQFSGMPTEDPHLHLRLFMEISDSFKFQGVPEDALRLKLFPYSVRDRARTW 155
Query: 121 VNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAWSRFKRMVKA 180
+N+L G V TW+ L EKF+ K+FPP+ NA+ R E+ SFQQ+D E+L+DAW RFK +++
Sbjct: 156 LNSLPAGSVTTWNDLTEKFLSKYFPPNMNAKLRNEINSFQQQDDESLYDAWERFKELLRK 215
Query: 181 CPHNGIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMASNNEEWDED 240
CPH+GI CI ME FY LN T+ VDA +L +YNQ +T+A+ N +W
Sbjct: 216 CPHHGILHCIQMETFYNGLNAQTKMVVDASANGALLSKSYNQAYEILETIATKNYQWSS- 275
Query: 241 DFGNRRGGRAKDDGMDKNVVVALQGQMTAMNNLLKSMAISQVNAAGSYVLAA--NQIDDM 300
+ G+ D + + +++ Q+ +M ++LK++++ N + L++ NQ ++
Sbjct: 276 --SRAQTGKKVAGIYDVDSITSMKAQLASMEHMLKNLSMGN-NQSKEQSLSSQINQTKNV 335
Query: 301 GCVGCDGHHNTDACPLNTETVALQQSHH-------------QQHP------------TTT 360
CV C H D+CP N E+V + + +QHP T+T
Sbjct: 336 SCVFCGEAHTYDSCPSNPESVFYMGNQNKAGPYSNTYNQSWRQHPNFSWSNQGANSGTST 395
Query: 361 --------------ASSTSPMENLLREYMQKN-------DALLQSQASSIRNLEVQLGQL 396
A ++ +EN+L+EY+ KN +AL+QSQA+S+RNLE Q+GQL
Sbjct: 396 GNVKSNYPPGFSQQAPQSNSLENMLKEYIIKNEASRSQTEALVQSQAASLRNLENQVGQL 455
BLAST of PI0023494 vs. NCBI nr
Match:
XP_038889363.1 (uncharacterized protein LOC120079279 [Benincasa hispida])
HSP 1 Score: 277.3 bits (708), Expect = 2.0e-70
Identity = 151/377 (40.05%), Positives = 224/377 (59.42%), Query Frame = 0
Query: 52 ENARFEIKPVMLQMIQNAGQFGGHPGEDPHEHIRSFYSICASFHMPGISPEELRFALFPL 111
EN RF+IK VMLQM+QN GQFGG GED H H+ SF +C++F + G++PE +R LFP
Sbjct: 4 ENTRFKIKSVMLQMVQNTGQFGGLQGEDLHAHLTSFVEMCSTFSISGVTPEGIRLYLFPY 63
Query: 112 TLRDEAKRWVNALEDGEVGTWDQLIEKFMKKFFPPHENARRRKELMSFQQKDKENLHDAW 171
TLRDEA W ++LE E+ +WDQL+E FMKKFFPP NARRRK++++F+Q + E L W
Sbjct: 64 TLRDEANIWAHSLEPNEITSWDQLVEWFMKKFFPPTVNARRRKDVLNFEQMNNETLSTTW 123
Query: 172 SRFKRMVKACPHNGIPKCILMEVFYFRLNKATQQTVDAVFVDDMLKSTYNQIKTTQDTMA 231
+R+VK C H GIP C+LM+ FY LN++TQ DA + TY + K ++
Sbjct: 124 VHLRRLVKNCLHIGIPDCVLMKTFYNGLNRSTQVVADASVARGFMDKTYTEAKVILHRIS 183
Query: 232 SNNEEWDEDDFGNRRGGRAKDDG--MDKNVVVALQGQMTAMNNLLKSMAISQ--VNAAGS 291
N ++ +D +G R R ++D + + + L QM A+ +LL++MA++Q ++ +
Sbjct: 184 RNTDDCVDDGYGGRGSERRRNDNAIVPLDTMTTLAAQMAAVTSLLQTMALNQGALSQISA 243
Query: 292 YVLAANQIDDMGCVGCDGHH-----------NTDACPLNTETVAL------------QQS 351
A Q+ + CV C G H + P N ++ Q
Sbjct: 244 QPNAPAQVAAISCVQCGGGHANHPNFGWGGNHNQGGPSNHQSNNFENRGNSPPFHQNQNQ 303
Query: 352 HHQQHP------TTTASSTSPMENLLREYMQKNDALLQSQASSIRNLEVQLGQLASDFSG 396
HQ P + T++++S +E+LL++Y++KND ++QSQ SSIRNLE+Q+GQLA++
Sbjct: 304 GHQPQPQNLPSSSNTSANSSSLESLLKQYIEKNDVVMQSQVSSIRNLEIQVGQLATELRN 363
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
A0A6J1G7Q6 | 1.2e-63 | 34.23 | uncharacterized protein LOC111451598 OS=Cucurbita moschata OX=3662 GN=LOC1114515...
U5CUI2 | 3.4e-63 | 44.08 | Retrotrans_gag domain-containing protein OS=Amborella trichopoda OX=13333 GN=AMT...
A0A6J1EEI2 | 2.3e-59 | 34.53 | uncharacterized protein LOC111433394 OS=Cucurbita moschata OX=3662 GN=LOC1114333...
A0A6J1DW02 | 7.3e-58 | 33.02 | uncharacterized protein LOC111024897 OS=Momordica charantia OX=3673 GN=LOC111024...
A0A6J1EQ90 | 9.6e-58 | 33.41 | uncharacterized protein LOC111436411 OS=Cucurbita moschata OX=3662 GN=LOC1114364...
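The identity values in the summary table are the percentages reported in each HSP header: the number of identical aligned positions divided by the alignment length. A minimal sketch that recomputes these percentages from a header line follows. The function name `hsp_percentages` and the regular expression are assumptions made for illustration, not part of BLAST itself.

```python
import re

# Matches fields of the form "<Label> = <matches>/<alignment length>".
RATIO = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(header: str) -> dict[str, float]:
    """Map each ratio field in an HSP header line to its percentage, rounded to 2 places."""
    return {
        label: round(100 * int(matches) / int(length), 2)
        for label, matches, length in RATIO.findall(header)
    }


# Header line of the first TrEMBL hit (A0A6J1G7Q6).
header = "Identity = 152/444 (34.23%), Positives = 227/444 (51.13%), Query Frame = 0"
percentages = hsp_percentages(header)
```

For this header the recomputed values are 34.23 for identity and 51.13 for positives, in agreement with the figures printed by BLAST. The `Query Frame = 0` field is skipped because it is not a ratio.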