Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGGTACTTTGTTTTTCTCTATCTCTCTCTCTCTCTCTATCGCTGAATTTTTTCTCTTTCGTGATGTTTCAATTGGGTTTGGCGCTATTTTGTCGGATGTTCTGTTTGGTCGCTGGGTAGTGTTCCATGAATACGCTGTAATCTGGCTTATGATTTGTCGAATATATCTGGGAAATCCTCTCTCGGTCGATTTTTATGCTTGAATTTTAAGTTTGTGTGTATGAGTTTCTTCTTAGTTGGCAACTGGAAAATGTGATATTGGTATTGTGGTTGCGGTTAAAGATATGAATTTTTTTCTCCCTCTTTTCCCCCTTTTACCTCGTATAAGATTGTTTTGATGCCGATGTAAATTTCCTTTGCATCCATTTTCTTTTTGTTTTGTTTTTATCTTTTGGCTCAAACCATGAAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGCAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTTAATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCAGGTGAGCATATCGACTACAGACTAAATTTATATATTCAGCAGTGAGTATAATGAATACTAGTTTTATTAAAAACTTCTTTGCATGGATCTGCAATTATTCATTTCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA
mRNA sequence
ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGCAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTTAATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA
Coding sequence (CDS)
ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGCAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTTAATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA
Protein sequence
MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Homology
BLAST of Lag0037879 vs. NCBI nr
Match:
XP_022139776.1 (uncharacterized protein LOC111010601 [Momordica charantia])
HSP 1 Score: 844.7 bits (2181), Expect = 4.9e-241
Identity = 488/610 (80.00%), Postives = 525/610 (86.07%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK SRKKERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS
Sbjct: 1 MGK---SRKKERSKTSSSQRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE DEK+GRSRS KNAKP KK+AKKRS D + RD SPHPRKRK+ KR D CEVKKTNK
Sbjct: 61 SEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTNK 120
Query: 121 -KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSK 180
KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSK
Sbjct: 121 RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSK 180
Query: 181 SHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQP 240
SHS CSLCSEGSD+QNEVEDGSYVENNFRRLRSVI+VVGEENKL+TFD N Q+EEV+H P
Sbjct: 181 SHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVMHHP 240
Query: 241 DDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN 300
DDDH SFG M S DG SKRELD VTS EA EVENKKEVVIPD RN +VVKD GVQNEGSN
Sbjct: 241 DDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSN 300
Query: 301 NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNC 360
NNHG V +D LNE NG SGN D INCIDLESILRQRALENLRKFK VP +NV+T NC
Sbjct: 301 NNHGGVTNDHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANC 360
Query: 361 KVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAI 420
+VDN+NDAKQL+SPVS SV + SPRDDA+ING G S QGGGN VN MIVEENGV ST+AI
Sbjct: 361 QVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI 420
Query: 421 DSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNR 480
DSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQEA+N+NICQK DADIC TT+R
Sbjct: 421 DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSR 480
Query: 481 SNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNG 540
SNL+ AA R +SKVD L+KQASA QE IQTKPSISD+GVDE A+ Q Q RNNDDQNI NG
Sbjct: 481 SNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNG 540
Query: 541 FGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
F SSAHK SSLN SGE S NK HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Sbjct: 541 FDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
Query: 601 PALTRRQLKR 609
PAL RRQLKR
Sbjct: 601 PALARRQLKR 606
BLAST of Lag0037879 vs. NCBI nr
Match:
XP_038897880.1 (histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida])
HSP 1 Score: 840.9 bits (2171), Expect = 7.0e-240
Identity = 482/610 (79.02%), Postives = 529/610 (86.72%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SS+KLKSKKLRYRHDSPSCSDTDFESSTSVSSS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSKKLKSKKLRYRHDSPSCSDTDFESSTSVSSS- 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+++ RSRSKTRKNAKPSKK++K++S DR+SR+ SPHPRKRK+ KR+DHCE KK K
Sbjct: 61 SEDDKRVRRSRSKTRKNAKPSKKRSKRQSHDRQSRERSPHPRKRKHSKRNDHCEAKKATK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRD S+ A SDS SCSTCG GSTTSNE E+ R RGRS K+KGNMGKTER RYRSKS
Sbjct: 121 KKRRRDASVGA-YSDSSSCSTCGNGSTTSNESEVVRRRGRSGKRKGNMGKTERGRYRSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
S CSL S+ SD+QNEV+D SYV NNFRRLRS+I++ GEENKL+TF N Q+E HQP+
Sbjct: 181 RSPCSLSSKDSDYQNEVDDDSYVRNNFRRLRSIIVIAGEENKLKTFAGNEQQEGATHQPN 240
Query: 241 --DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGS 300
DDH S G M SKD SKRELDYV SKE P VE KKEV +P+NRNSMVVKDDGVQNEGS
Sbjct: 241 DVDDHPSLGDMDSKDATSKRELDYVISKEVPVVEKKKEVDVPNNRNSMVVKDDGVQNEGS 300
Query: 301 NNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDN 360
N N G V +D SL+ERKNGCSG DS+N IDLESILRQRALENLRKFK P RNV+T N
Sbjct: 301 NKNLGGVTNDHSLDERKNGCSGKTDSVNGIDLESILRQRALENLRKFKGAPPRNVETIAN 360
Query: 361 CKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDA 420
CKVD+NNDAKQL SPVSKSVHVTSPRDDA+IN G SRQGGGN VNSMIV+ENGV STDA
Sbjct: 361 CKVDHNNDAKQLSSPVSKSVHVTSPRDDAEINSKGFSRQGGGNAVNSMIVKENGVKSTDA 420
Query: 421 IDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTN 480
IDS+V SMHDPVYSSQNLG+ SNGSNGMNELKQ++SS+DQE IN+NICQKADADIC TTN
Sbjct: 421 IDSSVPSMHDPVYSSQNLGKISNGSNGMNELKQNISSLDQEVINDNICQKADADICSTTN 480
Query: 481 RSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGN 540
RSNL+IAA R ESKVDSL+KQA A+QESIQTKPSISDIGVDETA+TQTQMRNNDDQNI N
Sbjct: 481 RSNLVIAALRPESKVDSLIKQAPAAQESIQTKPSISDIGVDETAQTQTQMRNNDDQNIRN 540
Query: 541 GFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
G SSAHK SSLNSISGE S + S HESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Sbjct: 541 GLDSSAHKPSSLNSISGENSLSTSRHESGDSSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
Query: 601 PALTRRQLKR 609
PALTRRQLKR
Sbjct: 601 PALTRRQLKR 608
BLAST of Lag0037879 vs. NCBI nr
Match:
XP_023535556.1 (transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 825.9 bits (2132), Expect = 2.3e-235
Identity = 471/609 (77.34%), Postives = 533/609 (87.52%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ PHPRKRK+ KR D E KKTNK
Sbjct: 61 SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECPPHPRKRKHSKRSDRYEAKKTNK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE RYRSKS
Sbjct: 121 KKRRRDVSVGATNSDSLSRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
S CSLCS+G D QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGGDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
++H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN
Sbjct: 241 ENHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +D SL+ERKNGCSGN DSINCI+LESILRQ+ALENLRKFK V RNV+ NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTDSINCINLESILRQKALENLRKFKGVSPRNVEIISNCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
V+NNNDAKQL SPVSKSVHVT PRDDA+ING G SRQ GG+ VNSMIV+ENG STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTPPRDDAEINGKGFSRQDGGDAVNSMIVKENGFKSTDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADADI TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNELKQDISSLDQEVINDNICLKADADIYSTTNRS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R ESKVDSL+++ASA+QE I+TKPSISDI VDETA+T+TQM+NN+DQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSLIEKASAAQECIETKPSISDIVVDETAQTETQMKNNEDQNIRNGF 540
Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
GSSA+K SSLNSISGE S +KS HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
Query: 601 ALTRRQLKR 609
ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609
BLAST of Lag0037879 vs. NCBI nr
Match:
KAG7024815.1 (hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 819.3 bits (2115), Expect = 2.2e-233
Identity = 472/609 (77.50%), Postives = 527/609 (86.54%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSK SSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS
Sbjct: 1 MGKASSSRKKERSKASSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR D E KKTNK
Sbjct: 61 SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRSDRYEAKKTNK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRDVS+ AT+SDSL STCG+GS+TS++ EIDR RGRS K+K NM KTE RYRSKS
Sbjct: 121 KKRRRDVSVGATNSDSLGRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
HS CSLCS+GSD QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF N Q+E V HQ D
Sbjct: 181 HSPCSLCSDGSDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
D+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN
Sbjct: 241 DNHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V RNV+ NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTESINCIDLESILRQKALENLRKFKGVSPRNVEIIANCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
V+NNNDAKQL SPVSKSVHVTSPRDDA+IN G SRQ GG+ VNSMIV+ NG STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTSPRDDAEINVKGFSRQDGGDAVNSMIVKGNGFKSTDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADADI TTN S
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNELKQDISSLDQEVINDNICLKADADIYSTTNGS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R ESKVDS +K+ASA QE IQTK SISDI VDETA+TQTQM NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSFIKKASADQECIQTKSSISDIVVDETAQTQTQMTNNDDQNIRNGF 540
Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
GSSA+K SSLNSISGE S +KS ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKPSSSLNSISGENSLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
Query: 601 ALTRRQLKR 609
ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609
BLAST of Lag0037879 vs. NCBI nr
Match:
XP_022935712.1 (uncharacterized protein LOC111442542 isoform X1 [Cucurbita moschata])
HSP 1 Score: 807.7 bits (2085), Expect = 6.6e-230
Identity = 466/609 (76.52%), Postives = 522/609 (85.71%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSRRLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR D E KKTNK
Sbjct: 61 SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRSDRYEAKKTNK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRD S+ AT+SDSL STCG+GS+TS++ EIDR RGRS K+K NM KTE RYRSKS
Sbjct: 121 KKRRRDASVGATNSDSLGRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
S CSLCS+GSD QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGSDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
D+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN
Sbjct: 241 DNHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V RNV+ NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTESINCIDLESILRQKALENLRKFKGVSPRNVEIIANCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
V+NNNDAKQL SPVSKSVHVT PRDDA+IN G SRQ GG+ VNSMIV+ NG STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTFPRDDAEINVKGFSRQDGGDAVNSMIVKGNGFKSTDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
+AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NIC KADADI TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNEPKQDISSLDQEVINDNICLKADADIYSTTNRS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R ESKVDS +K+AS QE IQTK SISDI VDETA+TQTQM NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSFIKKASVDQECIQTKSSISDIVVDETAQTQTQMTNNDDQNIRNGF 540
Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
GSSA+K SSLN ISGE +KS ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKSSSSLNPISGENRLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
Query: 601 ALTRRQLKR 609
ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609
BLAST of Lag0037879 vs. ExPASy TrEMBL
Match:
A0A6J1CDR0 (uncharacterized protein LOC111010601 OS=Momordica charantia OX=3673 GN=LOC111010601 PE=4 SV=1)
HSP 1 Score: 844.7 bits (2181), Expect = 2.4e-241
Identity = 488/610 (80.00%), Postives = 525/610 (86.07%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK SRKKERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS
Sbjct: 1 MGK---SRKKERSKTSSSQRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE DEK+GRSRS KNAKP KK+AKKRS D + RD SPHPRKRK+ KR D CEVKKTNK
Sbjct: 61 SEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTNK 120
Query: 121 -KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSK 180
KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSK
Sbjct: 121 RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSK 180
Query: 181 SHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQP 240
SHS CSLCSEGSD+QNEVEDGSYVENNFRRLRSVI+VVGEENKL+TFD N Q+EEV+H P
Sbjct: 181 SHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVMHHP 240
Query: 241 DDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN 300
DDDH SFG M S DG SKRELD VTS EA EVENKKEVVIPD RN +VVKD GVQNEGSN
Sbjct: 241 DDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSN 300
Query: 301 NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNC 360
NNHG V +D LNE NG SGN D INCIDLESILRQRALENLRKFK VP +NV+T NC
Sbjct: 301 NNHGGVTNDHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANC 360
Query: 361 KVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAI 420
+VDN+NDAKQL+SPVS SV + SPRDDA+ING G S QGGGN VN MIVEENGV ST+AI
Sbjct: 361 QVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI 420
Query: 421 DSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNR 480
DSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQEA+N+NICQK DADIC TT+R
Sbjct: 421 DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSR 480
Query: 481 SNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNG 540
SNL+ AA R +SKVD L+KQASA QE IQTKPSISD+GVDE A+ Q Q RNNDDQNI NG
Sbjct: 481 SNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNG 540
Query: 541 FGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
F SSAHK SSLN SGE S NK HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Sbjct: 541 FDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
Query: 601 PALTRRQLKR 609
PAL RRQLKR
Sbjct: 601 PALARRQLKR 606
BLAST of Lag0037879 vs. ExPASy TrEMBL
Match:
A0A6J1F6D1 (uncharacterized protein LOC111442542 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442542 PE=4 SV=1)
HSP 1 Score: 807.7 bits (2085), Expect = 3.2e-230
Identity = 466/609 (76.52%), Postives = 522/609 (85.71%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSRRLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR D E KKTNK
Sbjct: 61 SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRSDRYEAKKTNK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRD S+ AT+SDSL STCG+GS+TS++ EIDR RGRS K+K NM KTE RYRSKS
Sbjct: 121 KKRRRDASVGATNSDSLGRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
S CSLCS+GSD QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGSDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
D+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN
Sbjct: 241 DNHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V RNV+ NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTESINCIDLESILRQKALENLRKFKGVSPRNVEIIANCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
V+NNNDAKQL SPVSKSVHVT PRDDA+IN G SRQ GG+ VNSMIV+ NG STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTFPRDDAEINVKGFSRQDGGDAVNSMIVKGNGFKSTDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
+AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NIC KADADI TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNEPKQDISSLDQEVINDNICLKADADIYSTTNRS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R ESKVDS +K+AS QE IQTK SISDI VDETA+TQTQM NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSFIKKASVDQECIQTKSSISDIVVDETAQTQTQMTNNDDQNIRNGF 540
Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
GSSA+K SSLN ISGE +KS ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKSSSSLNPISGENRLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
Query: 601 ALTRRQLKR 609
ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609
BLAST of Lag0037879 vs. ExPASy TrEMBL
Match:
A0A6J1IGY0 (uncharacterized protein LOC111476850 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476850 PE=4 SV=1)
HSP 1 Score: 803.5 bits (2074), Expect = 6.0e-229
Identity = 462/609 (75.86%), Postives = 524/609 (86.04%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR+D E KKTNK
Sbjct: 61 SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRYEAKKTNK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE RYRSKS
Sbjct: 121 KKRRRDVSVGATNSDSLSRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEERRYRSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
S CSLCS+GSD QNEVED SYV+N RRL+S+I+VVGEE++L+TF N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGSDFQNEVEDDSYVKNCCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
D+H F M SKDG KRELDYV SKEAPEVE+K ++ PDNRNS+++ +DGV+NEGSN
Sbjct: 241 DNHPLFEDMNSKDGTCKRELDYVISKEAPEVESKNKMATPDNRNSLILNNDGVRNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +D SL+ERKNGCSGN D+INCIDLESILRQ+ALENLRKFK RNV+ NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTDNINCIDLESILRQKALENLRKFKGASPRNVEIIANCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
V+NNNDAKQL SPVSKSVHV SPRDDA+ NG G SRQ GG+ VNSMI++ NG STDAID
Sbjct: 361 VENNNDAKQLFSPVSKSVHVASPRDDAETNGKGFSRQDGGDAVNSMIIKGNGFKSTDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
+AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADA+I TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNELKQDISSLDQEVINDNICLKADANIYSTTNRS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R ESKVDSL+++ASA+QE IQTKPSISDI VDE ++TQTQ NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSLIEKASAAQECIQTKPSISDIVVDEISQTQTQKTNNDDQNIRNGF 540
Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
GSSA+K SSLNSISGE S +KS ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKPSSSLNSISGENSLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
Query: 601 ALTRRQLKR 609
ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609
BLAST of Lag0037879 vs. ExPASy TrEMBL
Match:
A0A0A0L248 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G630530 PE=4 SV=1)
HSP 1 Score: 803.5 bits (2074), Expect = 6.0e-229
Identity = 466/608 (76.64%), Postives = 520/608 (85.53%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSV SS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVPSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE +++ RSRSKT+KNAKPSKK++KK+S DR+SR+ SP+PRKRK+ KR+D EV K NK
Sbjct: 61 SEHHKRVRRSRSKTQKNAKPSKKRSKKQSHDRQSRECSPNPRKRKHSKRNDRREVNKANK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRDVS+ S+SLSCSTCG GSTTSNE E+ R RGRS K+K NM KTE RY SKS
Sbjct: 121 KKRRRDVSVG--HSNSLSCSTCGNGSTTSNESEVVRRRGRSGKRKENMRKTESGRYMSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
HS CSL SEGSD+QNEV+D SYVENNFRRLRS+I+VVGEENKL +E +E V +QP
Sbjct: 181 HSPCSLRSEGSDYQNEVDDESYVENNFRRLRSIIVVVGEENKLYVGNE---QEGVTNQPS 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
DDH SFG M SKD SKRELDYV +KEAP VEN+KEV +P+ RNSMVV+DDGVQNEGSN
Sbjct: 241 DDHPSFGDMDSKDATSKRELDYVITKEAPVVENEKEVDVPNFRNSMVVEDDGVQNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +DRS +E KNGCS N DSINCIDLES+LRQRALENLRKFK P RNV+T NCK
Sbjct: 301 NHGGVTNDRSSDEIKNGCSDNTDSINCIDLESMLRQRALENLRKFKGAPPRNVETIANCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
V +NN AKQL SP+SKSVHVTSPR+DA+IN SRQGGGN VNSMIV+ENGV S DAID
Sbjct: 361 VSHNNAAKQLCSPISKSVHVTSPRNDAEINSEQFSRQGGGNAVNSMIVKENGVNSMDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
SAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NICQKA+ADIC TTNRS
Sbjct: 421 SAVATMHDPVYSSQNLGKISNGSNGMNEQKQDISSLDQELINDNICQKANADICSTTNRS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R + KVDSL+KQ SA+QES+QTKPSISD+ V ETA+TQTQMRNN+D NI NG
Sbjct: 481 NLVIAALRPKPKVDSLIKQTSAAQESVQTKPSISDVAVGETAQTQTQMRNNNDLNIRNGL 540
Query: 541 GSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600
GSSAHK SSLNSISGE S + S HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
Sbjct: 541 GSSAHKPSSLNSISGENSLHMSNHESGESSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600
Query: 601 LTRRQLKR 609
LTRRQLKR
Sbjct: 601 LTRRQLKR 603
BLAST of Lag0037879 vs. ExPASy TrEMBL
Match:
A0A1S3CJV0 (uncharacterized protein LOC103501777 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501777 PE=4 SV=1)
HSP 1 Score: 785.4 bits (2027), Expect = 1.7e-223
Identity = 458/608 (75.33%), Postives = 510/608 (83.88%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKK RYRHDSPSCSDTDFESSTSV SS
Sbjct: 1 MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKQRYRHDSPSCSDTDFESSTSVPSSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
SE D+K+ RSRSKTRKN KPSKK+ KK+S DR+SR+ SP+PRKRK+ KR+D EV K NK
Sbjct: 61 SEDDKKVRRSRSKTRKNVKPSKKRFKKQSHDRQSRECSPNPRKRKHSKRNDRREVNKANK 120
Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
KKRRRDVS+ SDSLSCSTCG G+TTSNE E+ R RGR K+KGNM KT RY SKS
Sbjct: 121 KKRRRDVSVG--HSDSLSCSTCGNGTTTSNESEVVRRRGRFAKRKGNMRKTGSGRYMSKS 180
Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
S CSL SEGSD+QNEV+D SYVE NFRRLRS+I+VVGEENKL+TF N +++EV +Q
Sbjct: 181 RSPCSLHSEGSDYQNEVDDDSYVEKNFRRLRSIIVVVGEENKLKTFVGN-EQQEVTNQLS 240
Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
DDH S G M SKD KR LDYV +KEA VEN+KEV +P+ RNSMVVKD GVQNEGSN
Sbjct: 241 DDHPSSGEMDSKDATGKRGLDYVVTKEALVVENEKEVDVPNYRNSMVVKDGGVQNEGSNK 300
Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
NHG V +D S +E KNGCS N DSINCIDLES+LRQRALENLRKFK RNV+T NCK
Sbjct: 301 NHGGVTNDHSSDEIKNGCSDNTDSINCIDLESMLRQRALENLRKFKGASPRNVETIANCK 360
Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
VD+NN AKQL SPVS SVHVTSPR++A+IN SRQGGGN +NSMI++ENGV S DAID
Sbjct: 361 VDHNNAAKQLRSPVSDSVHVTSPRNNAEINSKRFSRQGGGNAINSMILKENGVKSMDAID 420
Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
SAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NICQKADADIC TTNRS
Sbjct: 421 SAVATMHDPVYSSQNLGKISNGSNGMNEQKQDISSLDQEVINDNICQKADADICSTTNRS 480
Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
NL+IAA R E KVDSL+KQ SA+QES+QTKPSISD+GV ETA+ QTQMRNNDD NI NG
Sbjct: 481 NLVIAALRPEPKVDSLIKQTSAAQESVQTKPSISDVGVGETAQIQTQMRNNDDLNIRNGL 540
Query: 541 GSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600
GSSA++ SSLNSISGE S N S ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
Sbjct: 541 GSSAYEPSSLNSISGEDSLNMSNQESGESSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600
Query: 601 LTRRQLKR 609
LTRRQLKR
Sbjct: 601 LTRRQLKR 605
BLAST of Lag0037879 vs. TAIR 10
Match:
AT5G53930.1 (unknown protein; LOCATED IN: chloroplast; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )
HSP 1 Score: 73.2 bits (178), Expect = 8.3e-13
Identity = 178/624 (28.53%), Postives = 277/624 (44.39%), Query Frame = 0
Query: 1 MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
MGKSSSS K K SS++ K + S++ KSKK+R D S +D S S
Sbjct: 1 MGKSSSSSK--IIKDSSNKLRSVKKKKSKRNKSKKIRRIKDESESSGSD-----SSLYSS 60
Query: 61 SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPR---KRKNLKRDDHCEVKK 120
SE D R K ++ +K SKK+++KR S D S R K+K KR D KK
Sbjct: 61 SEDD-----YRRKKKRRSKLSKKRSRKRYSSSESDDDSDDDRLLKKKKRSKRKDENVGKK 120
Query: 121 TNK----KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTER 180
K K+R+RD+S S+TSS+ + +GS + + R RGR +GK +
Sbjct: 121 KKKVVSRKRRKRDLSSSSTSSE----QSDNDGSESDDGKRWSRDRGR------RLGKVKD 180
Query: 181 SRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEE 240
SR RS+ SE D + ED E N RRL+S+++V NG+ +
Sbjct: 181 SRSRSRDELEGE--SEEPDECWQGEDEVIPEKNPRRLKSIVVVSYNYG-------NGERK 240
Query: 241 EVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGV 300
E +DD + G REL Y S+++ E++ + D+ + + D+G
Sbjct: 241 E-----EDDRDVY----MTRGGGNRELGY--SEDSDEMDGES----IDSYSRIRADDNGF 300
Query: 301 QNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNV 360
+ V D SL + DLE+IL++RALENL++F+ V
Sbjct: 301 GEYNKSETSKVSHTDNSLKDD--------------DLEAILKKRALENLKRFRGV----- 360
Query: 361 KTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGV 420
+ AK+ S VS+ G Q +V S +++ +
Sbjct: 361 -------TQKSGIAKKEVSSVSE----------------GEPMQIESEKVES---QDHDL 420
Query: 421 ISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADI 480
+ DSAV+ + +S+ + N L S DQ++ + K + +
Sbjct: 421 MEQKLCDSAVSK---DLENSEKILHVINVKESGTALANSASQQDQQS-GDTAKVKVSSGL 480
Query: 481 CPTTNRSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDD 540
T + L+ +S + K+AS SQ++ S + + T + N+
Sbjct: 481 SSCTTKRKLVRPVLSKDSLNLASKKEASGSQDAEAESIDGSTVDKNCLESTLALVTKNEG 529
Query: 541 QNI-----GNGFGSSAHKHSSLNSI----SGEPSSNKSGHESGEGSQFEQKTMSVMRGGE 600
++I + + + H+ + G S K+ E+ + SQ+EQKTM+VMRGGE
Sbjct: 541 EHIEPTKVSSTLNAESSSHADTEEVDEVKGGSQSEQKTIDETKDESQYEQKTMTVMRGGE 529
Query: 601 MVQVNYKVYIPKRAPALTRRQLKR 609
MVQV+YKVYIPK+A +L RR+L R
Sbjct: 601 MVQVSYKVYIPKKASSLGRRKLNR 529
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022139776.1 | 4.9e-241 | 80.00 | uncharacterized protein LOC111010601 [Momordica charantia] | [more] |
XP_038897880.1 | 7.0e-240 | 79.02 | histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida] | [more] |
XP_023535556.1 | 2.3e-235 | 77.34 | transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
KAG7024815.1 | 2.2e-233 | 77.50 | hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_022935712.1 | 6.6e-230 | 76.52 | uncharacterized protein LOC111442542 isoform X1 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CDR0 | 2.4e-241 | 80.00 | uncharacterized protein LOC111010601 OS=Momordica charantia OX=3673 GN=LOC111010... | [more] |
A0A6J1F6D1 | 3.2e-230 | 76.52 | uncharacterized protein LOC111442542 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1IGY0 | 6.0e-229 | 75.86 | uncharacterized protein LOC111476850 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A0A0L248 | 6.0e-229 | 76.64 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G630530 PE=4 SV=1 | [more] |
A0A1S3CJV0 | 1.7e-223 | 75.33 | uncharacterized protein LOC103501777 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
Match Name | E-value | Identity | Description | |
AT5G53930.1 | 8.3e-13 | 28.53 | unknown protein; LOCATED IN: chloroplast; Has 1807 Blast hits to 1807 proteins i... | [more] |