Lag0037879 (gene) Sponge gourd (AG‐4) v1

Overview
Name: Lag0037879
Type: gene
Organism: Luffa acutangula (Sponge gourd (AG‐4) v1)
Description: pre-mRNA-splicing factor CWC22 homolog isoform X2
Location: chr2: 10130877 .. 10133215 (+)
RNA-Seq Expression: Lag0037879
Synteny: Lag0037879
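
The Location field above can be turned into a genomic span with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF3 convention, not stated on this page); the helper name `parse_location` is hypothetical and is not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'seqid: start .. end (strand)' string into its parts.

    Hypothetical helper written for this record's Location format only.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

seqid, start, end, strand = parse_location("chr2: 10130877 .. 10133215 (+)")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(seqid, strand, span)
```

Under that assumption the gene spans 2339 bp on the plus strand of chr2, introns included.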
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGGTACTTTGTTTTTCTCTATCTCTCTCTCTCTCTCTATCGCTGAATTTTTTCTCTTTCGTGATGTTTCAATTGGGTTTGGCGCTATTTTGTCGGATGTTCTGTTTGGTCGCTGGGTAGTGTTCCATGAATACGCTGTAATCTGGCTTATGATTTGTCGAATATATCTGGGAAATCCTCTCTCGGTCGATTTTTATGCTTGAATTTTAAGTTTGTGTGTATGAGTTTCTTCTTAGTTGGCAACTGGAAAATGTGATATTGGTATTGTGGTTGCGGTTAAAGATATGAATTTTTTTCTCCCTCTTTTCCCCCTTTTACCTCGTATAAGATTGTTTTGATGCCGATGTAAATTTCCTTTGCATCCATTTTCTTTTTGTTTTGTTTTTATCTTTTGGCTCAAACCATGAAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGCAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATG
ATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTTAATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCAGGTGAGCATATCGACTACAGACTAAATTTATATATTCAGCAGTGAGTATAATGAATACTAGTTTTATTAAAAACTTCTTTGCATGGATCTGCAATTATTCATTTCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA

mRNA sequence

ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGCAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTTAATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA

Coding sequence (CDS)

ATGGGAAAGTCCTCTTCTTCTCGCAAGAAGGAGCGTTCCAAGACTTCTTCATCCCAGAGAAGTAGGAGAAAGAGTAGGAGTAGTAGGAAATTGAAGTCCAAGAAGCTGCGGTATCGCCACGATTCTCCATCTTGCTCTGACACTGATTTTGAAAGTTCAACTTCAGTGTCTTCTTCTGGCTCGGAGGGTGATGAAAAATTGGGAAGATCTCGATCCAAGACGCGGAAGAATGCGAAGCCTAGTAAAAAGAAAGCTAAGAAGCGATCTGATGACCGTCGAAGTAGGGATTATTCTCCTCATCCCAGAAAGAGGAAGAATTTGAAGAGAGACGACCATTGCGAGGTGAAGAAGACAAACAAAAAGAAGCGTAGAAGAGATGTGAGTATTAGTGCCACAAGTAGTGACTCTTTGAGCTGCTCAACTTGTGGAGAGGGGAGTACAACCAGCAATGAGGGTGAAATCGATAGGCATAGGGGCAGGTCTAGAAAGAAGAAAGGAAATATGGGGAAGACTGAAAGAAGTAGATACAGGTCAAAGAGTCATTCAGCATGCTCTTTATGTAGTGAAGGTAGTGATCATCAGAATGAGGTTGAGGATGGCAGTTATGTTGAAAACAACTTTAGACGATTAAGATCTGTAATTATTGTAGTAGGAGAGGAAAATAAATTAGAGACGTTTGATGAGAATGGACAAGAAGAAGAGGTCGTGCATCAGCCTGATGATGACCACCTGTCTTTTGGAGTTATGGGCAGTAAGGATGGGGCAAGTAAAAGAGAATTAGACTATGTTACATCGAAAGAGGCACCAGAGGTAGAAAACAAAAAAGAAGTGGTTATACCTGACAATAGAAACTCTATGGTTGTAAAGGATGATGGAGTTCAAAATGAGGGAAGCAACAATAACCATGGAGTAGTAATTCATGACCGTTCTTTAAATGAAAGAAAGAATGGCTGTTCTGGAAATAATGACAGCATAAATTGTATCGATTTAGAGTCAATTTTGAGACAGAGGGCTTTGGAAAACCTAAGAAAGTTCAAACGGGTGCCCCTAAGGAATGTGAAAACTCCTGATAATTGCAAAGTTGACAATAATAATGATGCAAAGCAATTGCACTCTCCCGTCTCTAAGTCAGTTCACGTGACTTCCCCTAGGGATGATGCCAAGATAAATGGTAACGGGCTCTCTAGACAAGGTGGAGGGAATGAAGTAAATTCAATGATAGTTGAAGAGAATGGTGTCATATCTACTGATGCAATAGATTCAGCAGTTGCATCTATGCATGATCCTGTCTATTCTTCACAGAATCTGGGTAGGACTTCCAATGGAAGCAATGGTATGAATGAACTGAAGCAGGATGTTTCTTCAGTAGACCAGGAGGCTATAAATAATAATATTTGCCAGAAGGCAGATGCAGATATTTGTCCTACAACTAACAGAAGCAATTTGATTATTGCAGCTTCGAGGGCTGAGTCCAAAGTTGATTCTCTTATGAAGCAGGCATCTGCTTCTCAGGAATCTATCCAAACAAAGCCATCTATATCTGACATTGGTGTTGATGAGACAGCTGAAACTCAGACCCAGATGAGGAATAATGATGATCAAAACATTGGTAATGGTTTTGGTTCTTCAGCTCACAAGCATTCTTCCCTTAATTCTATTTCAGGAGAACCCAGCTCTAACAAGTCTGGACACGAGAGTGGCGAAGGCTCGCAGTTTGAACAGAAAACCATGTCCGTGATGCGGGGTGGTGAAATGGTGCAGGTGAACTACAAGGTCTACATCCCAAAGAGAGCTCCTGCTTTGACTAGGAGGCAACTCAAGCGGTGA

Protein sequence

MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSGSEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNKKKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPALTRRQLKR
Homology
BLAST of Lag0037879 vs. NCBI nr
Match: XP_022139776.1 (uncharacterized protein LOC111010601 [Momordica charantia])

HSP 1 Score: 844.7 bits (2181), Expect = 4.9e-241
Identity = 488/610 (80.00%), Positives = 525/610 (86.07%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK   SRKKERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS 
Sbjct: 1   MGK---SRKKERSKTSSSQRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE DEK+GRSRS   KNAKP KK+AKKRS D + RD SPHPRKRK+ KR D CEVKKTNK
Sbjct: 61  SEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTNK 120

Query: 121 -KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSK 180
            KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSK
Sbjct: 121 RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSK 180

Query: 181 SHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQP 240
           SHS CSLCSEGSD+QNEVEDGSYVENNFRRLRSVI+VVGEENKL+TFD N Q+EEV+H P
Sbjct: 181 SHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVMHHP 240

Query: 241 DDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN 300
           DDDH SFG M S DG SKRELD VTS EA EVENKKEVVIPD RN +VVKD GVQNEGSN
Sbjct: 241 DDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSN 300

Query: 301 NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNC 360
           NNHG V +D  LNE  NG SGN D INCIDLESILRQRALENLRKFK VP +NV+T  NC
Sbjct: 301 NNHGGVTNDHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANC 360

Query: 361 KVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAI 420
           +VDN+NDAKQL+SPVS SV + SPRDDA+ING G S QGGGN VN MIVEENGV ST+AI
Sbjct: 361 QVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI 420

Query: 421 DSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNR 480
           DSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQEA+N+NICQK DADIC TT+R
Sbjct: 421 DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSR 480

Query: 481 SNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNG 540
           SNL+ AA R +SKVD L+KQASA QE IQTKPSISD+GVDE A+ Q Q RNNDDQNI NG
Sbjct: 481 SNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNG 540

Query: 541 FGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
           F SSAHK  SSLN  SGE S NK  HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Sbjct: 541 FDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600

Query: 601 PALTRRQLKR 609
           PAL RRQLKR
Sbjct: 601 PALARRQLKR 606

BLAST of Lag0037879 vs. NCBI nr
Match: XP_038897880.1 (histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida])

HSP 1 Score: 840.9 bits (2171), Expect = 7.0e-240
Identity = 482/610 (79.02%), Positives = 529/610 (86.72%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SS+KLKSKKLRYRHDSPSCSDTDFESSTSVSSS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSKKLKSKKLRYRHDSPSCSDTDFESSTSVSSS- 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+++ RSRSKTRKNAKPSKK++K++S DR+SR+ SPHPRKRK+ KR+DHCE KK  K
Sbjct: 61  SEDDKRVRRSRSKTRKNAKPSKKRSKRQSHDRQSRERSPHPRKRKHSKRNDHCEAKKATK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRD S+ A  SDS SCSTCG GSTTSNE E+ R RGRS K+KGNMGKTER RYRSKS
Sbjct: 121 KKRRRDASVGA-YSDSSSCSTCGNGSTTSNESEVVRRRGRSGKRKGNMGKTERGRYRSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
            S CSL S+ SD+QNEV+D SYV NNFRRLRS+I++ GEENKL+TF  N Q+E   HQP+
Sbjct: 181 RSPCSLSSKDSDYQNEVDDDSYVRNNFRRLRSIIVIAGEENKLKTFAGNEQQEGATHQPN 240

Query: 241 --DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGS 300
             DDH S G M SKD  SKRELDYV SKE P VE KKEV +P+NRNSMVVKDDGVQNEGS
Sbjct: 241 DVDDHPSLGDMDSKDATSKRELDYVISKEVPVVEKKKEVDVPNNRNSMVVKDDGVQNEGS 300

Query: 301 NNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDN 360
           N N G V +D SL+ERKNGCSG  DS+N IDLESILRQRALENLRKFK  P RNV+T  N
Sbjct: 301 NKNLGGVTNDHSLDERKNGCSGKTDSVNGIDLESILRQRALENLRKFKGAPPRNVETIAN 360

Query: 361 CKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDA 420
           CKVD+NNDAKQL SPVSKSVHVTSPRDDA+IN  G SRQGGGN VNSMIV+ENGV STDA
Sbjct: 361 CKVDHNNDAKQLSSPVSKSVHVTSPRDDAEINSKGFSRQGGGNAVNSMIVKENGVKSTDA 420

Query: 421 IDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTN 480
           IDS+V SMHDPVYSSQNLG+ SNGSNGMNELKQ++SS+DQE IN+NICQKADADIC TTN
Sbjct: 421 IDSSVPSMHDPVYSSQNLGKISNGSNGMNELKQNISSLDQEVINDNICQKADADICSTTN 480

Query: 481 RSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGN 540
           RSNL+IAA R ESKVDSL+KQA A+QESIQTKPSISDIGVDETA+TQTQMRNNDDQNI N
Sbjct: 481 RSNLVIAALRPESKVDSLIKQAPAAQESIQTKPSISDIGVDETAQTQTQMRNNDDQNIRN 540

Query: 541 GFGSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
           G  SSAHK SSLNSISGE S + S HESG+ SQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Sbjct: 541 GLDSSAHKPSSLNSISGENSLSTSRHESGDSSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600

Query: 601 PALTRRQLKR 609
           PALTRRQLKR
Sbjct: 601 PALTRRQLKR 608

BLAST of Lag0037879 vs. NCBI nr
Match: XP_023535556.1 (transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 825.9 bits (2132), Expect = 2.3e-235
Identity = 471/609 (77.34%), Positives = 533/609 (87.52%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+  PHPRKRK+ KR D  E KKTNK
Sbjct: 61  SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECPPHPRKRKHSKRSDRYEAKKTNK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS
Sbjct: 121 KKRRRDVSVGATNSDSLSRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
            S CSLCS+G D QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGGDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           ++H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Sbjct: 241 ENHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +D SL+ERKNGCSGN DSINCI+LESILRQ+ALENLRKFK V  RNV+   NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTDSINCINLESILRQKALENLRKFKGVSPRNVEIISNCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           V+NNNDAKQL SPVSKSVHVT PRDDA+ING G SRQ GG+ VNSMIV+ENG  STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTPPRDDAEINGKGFSRQDGGDAVNSMIVKENGFKSTDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           +AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADADI  TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNELKQDISSLDQEVINDNICLKADADIYSTTNRS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R ESKVDSL+++ASA+QE I+TKPSISDI VDETA+T+TQM+NN+DQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSLIEKASAAQECIETKPSISDIVVDETAQTETQMKNNEDQNIRNGF 540

Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
           GSSA+K  SSLNSISGE S +KS HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKPSSSLNSISGENSLDKSRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600

Query: 601 ALTRRQLKR 609
           ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609

BLAST of Lag0037879 vs. NCBI nr
Match: KAG7024815.1 (hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 819.3 bits (2115), Expect = 2.2e-233
Identity = 472/609 (77.50%), Positives = 527/609 (86.54%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSK SSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS 
Sbjct: 1   MGKASSSRKKERSKASSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR D  E KKTNK
Sbjct: 61  SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRSDRYEAKKTNK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRDVS+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS
Sbjct: 121 KKRRRDVSVGATNSDSLGRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
           HS CSLCS+GSD QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ D
Sbjct: 181 HSPCSLCSDGSDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           D+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Sbjct: 241 DNHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V  RNV+   NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTESINCIDLESILRQKALENLRKFKGVSPRNVEIIANCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           V+NNNDAKQL SPVSKSVHVTSPRDDA+IN  G SRQ GG+ VNSMIV+ NG  STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTSPRDDAEINVKGFSRQDGGDAVNSMIVKGNGFKSTDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           +AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADADI  TTN S
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNELKQDISSLDQEVINDNICLKADADIYSTTNGS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R ESKVDS +K+ASA QE IQTK SISDI VDETA+TQTQM NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSFIKKASADQECIQTKSSISDIVVDETAQTQTQMTNNDDQNIRNGF 540

Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
           GSSA+K  SSLNSISGE S +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKPSSSLNSISGENSLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600

Query: 601 ALTRRQLKR 609
           ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609

BLAST of Lag0037879 vs. NCBI nr
Match: XP_022935712.1 (uncharacterized protein LOC111442542 isoform X1 [Cucurbita moschata])

HSP 1 Score: 807.7 bits (2085), Expect = 6.6e-230
Identity = 466/609 (76.52%), Positives = 522/609 (85.71%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSRRLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR D  E KKTNK
Sbjct: 61  SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRSDRYEAKKTNK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRD S+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS
Sbjct: 121 KKRRRDASVGATNSDSLGRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
            S CSLCS+GSD QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGSDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           D+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Sbjct: 241 DNHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V  RNV+   NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTESINCIDLESILRQKALENLRKFKGVSPRNVEIIANCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           V+NNNDAKQL SPVSKSVHVT PRDDA+IN  G SRQ GG+ VNSMIV+ NG  STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTFPRDDAEINVKGFSRQDGGDAVNSMIVKGNGFKSTDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           +AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NIC KADADI  TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNEPKQDISSLDQEVINDNICLKADADIYSTTNRS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R ESKVDS +K+AS  QE IQTK SISDI VDETA+TQTQM NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSFIKKASVDQECIQTKSSISDIVVDETAQTQTQMTNNDDQNIRNGF 540

Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
           GSSA+K  SSLN ISGE   +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKSSSSLNPISGENRLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600

Query: 601 ALTRRQLKR 609
           ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609

BLAST of Lag0037879 vs. ExPASy TrEMBL
Match: A0A6J1CDR0 (uncharacterized protein LOC111010601 OS=Momordica charantia OX=3673 GN=LOC111010601 PE=4 SV=1)

HSP 1 Score: 844.7 bits (2181), Expect = 2.4e-241
Identity = 488/610 (80.00%), Positives = 525/610 (86.07%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK   SRKKERSKTSSSQRSRRK+RSSRKLKSKKLRYRHDSPSCSDTDFESSTS+SSS 
Sbjct: 1   MGK---SRKKERSKTSSSQRSRRKNRSSRKLKSKKLRYRHDSPSCSDTDFESSTSLSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE DEK+GRSRS   KNAKP KK+AKKRS D + RD SPHPRKRK+ KR D CEVKKTNK
Sbjct: 61  SEDDEKVGRSRSNKLKNAKPGKKRAKKRSRDDQIRDLSPHPRKRKHSKRHDRCEVKKTNK 120

Query: 121 -KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSK 180
            KKRRRDVS+SATS DSLSCSTCG+GSTTSNE EIDRHRGRS K+K N GKTERSRYRSK
Sbjct: 121 RKKRRRDVSVSATSRDSLSCSTCGDGSTTSNESEIDRHRGRSGKRKRNRGKTERSRYRSK 180

Query: 181 SHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQP 240
           SHS CSLCSEGSD+QNEVEDGSYVENNFRRLRSVI+VVGEENKL+TFD N Q+EEV+H P
Sbjct: 181 SHSPCSLCSEGSDYQNEVEDGSYVENNFRRLRSVIVVVGEENKLKTFDGNEQQEEVMHHP 240

Query: 241 DDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSN 300
           DDDH SFG M S DG SKRELD VTS EA EVENKKEVVIPD RN +VVKD GVQNEGSN
Sbjct: 241 DDDHPSFGDMDSNDGMSKRELDRVTSNEASEVENKKEVVIPDIRNFLVVKDGGVQNEGSN 300

Query: 301 NNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNC 360
           NNHG V +D  LNE  NG SGN D INCIDLESILRQRALENLRKFK VP +NV+T  NC
Sbjct: 301 NNHGGVTNDHPLNEGNNG-SGNTDRINCIDLESILRQRALENLRKFKGVPPKNVETSANC 360

Query: 361 KVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAI 420
           +VDN+NDAKQL+SPVS SV + SPRDDA+ING G S QGGGN VN MIVEENGV ST+AI
Sbjct: 361 QVDNSNDAKQLYSPVSNSVRIKSPRDDAEINGKGFSGQGGGNAVNPMIVEENGVESTNAI 420

Query: 421 DSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNR 480
           DSAVAS HDP+YSSQNLG+ S+ SNGMNELKQD+SS+DQEA+N+NICQK DADIC TT+R
Sbjct: 421 DSAVASTHDPIYSSQNLGKISSTSNGMNELKQDISSLDQEAVNDNICQKVDADICSTTSR 480

Query: 481 SNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNG 540
           SNL+ AA R +SKVD L+KQASA QE IQTKPSISD+GVDE A+ Q Q RNNDDQNI NG
Sbjct: 481 SNLVYAALRPDSKVDFLVKQASAPQEYIQTKPSISDMGVDEIAQMQIQTRNNDDQNIVNG 540

Query: 541 FGSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600
           F SSAHK  SSLN  SGE S NK  HESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA
Sbjct: 541 FDSSAHKPSSSLNYFSGENSLNKPRHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRA 600

Query: 601 PALTRRQLKR 609
           PAL RRQLKR
Sbjct: 601 PALARRQLKR 606

BLAST of Lag0037879 vs. ExPASy TrEMBL
Match: A0A6J1F6D1 (uncharacterized protein LOC111442542 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442542 PE=4 SV=1)

HSP 1 Score: 807.7 bits (2085), Expect = 3.2e-230
Identity = 466/609 (76.52%), Positives = 522/609 (85.71%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SSR+LKSKKLRYRHDSPSCSDTDFESSTSVSSS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSRRLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR D  E KKTNK
Sbjct: 61  SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRSDRYEAKKTNK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRD S+ AT+SDSL  STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS
Sbjct: 121 KKRRRDASVGATNSDSLGRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEGRRYRSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
            S CSLCS+GSD QNEVED SYVEN+ RRL+S+I+VVGEE++L+TF  N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGSDFQNEVEDDSYVENSCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           D+H SFG M SKDG SKRELDYV SKEAPEVE+K ++V PDNRNS+++ DDGV+NEGSN 
Sbjct: 241 DNHPSFGDMNSKDGTSKRELDYVISKEAPEVESKNKIVTPDNRNSLILNDDGVRNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +D SL+ERKNGCSGN +SINCIDLESILRQ+ALENLRKFK V  RNV+   NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTESINCIDLESILRQKALENLRKFKGVSPRNVEIIANCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           V+NNNDAKQL SPVSKSVHVT PRDDA+IN  G SRQ GG+ VNSMIV+ NG  STDAID
Sbjct: 361 VENNNDAKQLISPVSKSVHVTFPRDDAEINVKGFSRQDGGDAVNSMIVKGNGFKSTDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           +AVASMHDPV SSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NIC KADADI  TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNEPKQDISSLDQEVINDNICLKADADIYSTTNRS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R ESKVDS +K+AS  QE IQTK SISDI VDETA+TQTQM NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSFIKKASVDQECIQTKSSISDIVVDETAQTQTQMTNNDDQNIRNGF 540

Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
           GSSA+K  SSLN ISGE   +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKSSSSLNPISGENRLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600

Query: 601 ALTRRQLKR 609
           ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609

BLAST of Lag0037879 vs. ExPASy TrEMBL
Match: A0A6J1IGY0 (uncharacterized protein LOC111476850 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111476850 PE=4 SV=1)

HSP 1 Score: 803.5 bits (2074), Expect = 6.0e-229
Identity = 462/609 (75.86%), Positives = 524/609 (86.04%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+K+ RSRSKTRKN+KPSKK++KK+S D +SR+ SPHPRKRK+ KR+D  E KKTNK
Sbjct: 61  SEDDKKVRRSRSKTRKNSKPSKKRSKKQSHDHQSRECSPHPRKRKHSKRNDRYEAKKTNK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRDVS+ AT+SDSLS STCG+GS+TS++ EIDR RGRS K+K NM KTE  RYRSKS
Sbjct: 121 KKRRRDVSVGATNSDSLSRSTCGDGSSTSSDSEIDRRRGRSGKRKRNMVKTEERRYRSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
            S CSLCS+GSD QNEVED SYV+N  RRL+S+I+VVGEE++L+TF  N Q+E V HQ D
Sbjct: 181 RSPCSLCSDGSDFQNEVEDDSYVKNCCRRLKSIIVVVGEEDELKTFVGNEQQEAVTHQLD 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           D+H  F  M SKDG  KRELDYV SKEAPEVE+K ++  PDNRNS+++ +DGV+NEGSN 
Sbjct: 241 DNHPLFEDMNSKDGTCKRELDYVISKEAPEVESKNKMATPDNRNSLILNNDGVRNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +D SL+ERKNGCSGN D+INCIDLESILRQ+ALENLRKFK    RNV+   NCK
Sbjct: 301 NHGGVTNDHSLDERKNGCSGNTDNINCIDLESILRQKALENLRKFKGASPRNVEIIANCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           V+NNNDAKQL SPVSKSVHV SPRDDA+ NG G SRQ GG+ VNSMI++ NG  STDAID
Sbjct: 361 VENNNDAKQLFSPVSKSVHVASPRDDAETNGKGFSRQDGGDAVNSMIIKGNGFKSTDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           +AVASMHDPV SSQNLG+ SNGSNGMNELKQD+SS+DQE IN+NIC KADA+I  TTNRS
Sbjct: 421 TAVASMHDPVCSSQNLGKISNGSNGMNELKQDISSLDQEVINDNICLKADANIYSTTNRS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R ESKVDSL+++ASA+QE IQTKPSISDI VDE ++TQTQ  NNDDQNI NGF
Sbjct: 481 NLVIAAFRPESKVDSLIEKASAAQECIQTKPSISDIVVDEISQTQTQKTNNDDQNIRNGF 540

Query: 541 GSSAHK-HSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600
           GSSA+K  SSLNSISGE S +KS  ESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP
Sbjct: 541 GSSAYKPSSSLNSISGENSLDKSRQESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAP 600

Query: 601 ALTRRQLKR 609
           ALTRRQLKR
Sbjct: 601 ALTRRQLKR 609

BLAST of Lag0037879 vs. ExPASy TrEMBL
Match: A0A0A0L248 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G630530 PE=4 SV=1)

HSP 1 Score: 803.5 bits (2074), Expect = 6.0e-229
Identity = 466/608 (76.64%), Positives = 520/608 (85.53%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKKLRYRHDSPSCSDTDFESSTSV SS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKLRYRHDSPSCSDTDFESSTSVPSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE  +++ RSRSKT+KNAKPSKK++KK+S DR+SR+ SP+PRKRK+ KR+D  EV K NK
Sbjct: 61  SEHHKRVRRSRSKTQKNAKPSKKRSKKQSHDRQSRECSPNPRKRKHSKRNDRREVNKANK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRDVS+    S+SLSCSTCG GSTTSNE E+ R RGRS K+K NM KTE  RY SKS
Sbjct: 121 KKRRRDVSVG--HSNSLSCSTCGNGSTTSNESEVVRRRGRSGKRKENMRKTESGRYMSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
           HS CSL SEGSD+QNEV+D SYVENNFRRLRS+I+VVGEENKL   +E   +E V +QP 
Sbjct: 181 HSPCSLRSEGSDYQNEVDDESYVENNFRRLRSIIVVVGEENKLYVGNE---QEGVTNQPS 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           DDH SFG M SKD  SKRELDYV +KEAP VEN+KEV +P+ RNSMVV+DDGVQNEGSN 
Sbjct: 241 DDHPSFGDMDSKDATSKRELDYVITKEAPVVENEKEVDVPNFRNSMVVEDDGVQNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +DRS +E KNGCS N DSINCIDLES+LRQRALENLRKFK  P RNV+T  NCK
Sbjct: 301 NHGGVTNDRSSDEIKNGCSDNTDSINCIDLESMLRQRALENLRKFKGAPPRNVETIANCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           V +NN AKQL SP+SKSVHVTSPR+DA+IN    SRQGGGN VNSMIV+ENGV S DAID
Sbjct: 361 VSHNNAAKQLCSPISKSVHVTSPRNDAEINSEQFSRQGGGNAVNSMIVKENGVNSMDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           SAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NICQKA+ADIC TTNRS
Sbjct: 421 SAVATMHDPVYSSQNLGKISNGSNGMNEQKQDISSLDQELINDNICQKANADICSTTNRS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R + KVDSL+KQ SA+QES+QTKPSISD+ V ETA+TQTQMRNN+D NI NG 
Sbjct: 481 NLVIAALRPKPKVDSLIKQTSAAQESVQTKPSISDVAVGETAQTQTQMRNNNDLNIRNGL 540

Query: 541 GSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600
           GSSAHK SSLNSISGE S + S HESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
Sbjct: 541 GSSAHKPSSLNSISGENSLHMSNHESGESSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600

Query: 601 LTRRQLKR 609
           LTRRQLKR
Sbjct: 601 LTRRQLKR 603

BLAST of Lag0037879 vs. ExPASy TrEMBL
Match: A0A1S3CJV0 (uncharacterized protein LOC103501777 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103501777 PE=4 SV=1)

HSP 1 Score: 785.4 bits (2027), Expect = 1.7e-223
Identity = 458/608 (75.33%), Positives = 510/608 (83.88%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGK+SSSRKKERSKTSSSQRSRRKS+SSRKLKSKK RYRHDSPSCSDTDFESSTSV SS 
Sbjct: 1   MGKASSSRKKERSKTSSSQRSRRKSKSSRKLKSKKQRYRHDSPSCSDTDFESSTSVPSSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPRKRKNLKRDDHCEVKKTNK 120
           SE D+K+ RSRSKTRKN KPSKK+ KK+S DR+SR+ SP+PRKRK+ KR+D  EV K NK
Sbjct: 61  SEDDKKVRRSRSKTRKNVKPSKKRFKKQSHDRQSRECSPNPRKRKHSKRNDRREVNKANK 120

Query: 121 KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTERSRYRSKS 180
           KKRRRDVS+    SDSLSCSTCG G+TTSNE E+ R RGR  K+KGNM KT   RY SKS
Sbjct: 121 KKRRRDVSVG--HSDSLSCSTCGNGTTTSNESEVVRRRGRFAKRKGNMRKTGSGRYMSKS 180

Query: 181 HSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEEEVVHQPD 240
            S CSL SEGSD+QNEV+D SYVE NFRRLRS+I+VVGEENKL+TF  N +++EV +Q  
Sbjct: 181 RSPCSLHSEGSDYQNEVDDDSYVEKNFRRLRSIIVVVGEENKLKTFVGN-EQQEVTNQLS 240

Query: 241 DDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGVQNEGSNN 300
           DDH S G M SKD   KR LDYV +KEA  VEN+KEV +P+ RNSMVVKD GVQNEGSN 
Sbjct: 241 DDHPSSGEMDSKDATGKRGLDYVVTKEALVVENEKEVDVPNYRNSMVVKDGGVQNEGSNK 300

Query: 301 NHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNVKTPDNCK 360
           NHG V +D S +E KNGCS N DSINCIDLES+LRQRALENLRKFK    RNV+T  NCK
Sbjct: 301 NHGGVTNDHSSDEIKNGCSDNTDSINCIDLESMLRQRALENLRKFKGASPRNVETIANCK 360

Query: 361 VDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGVISTDAID 420
           VD+NN AKQL SPVS SVHVTSPR++A+IN    SRQGGGN +NSMI++ENGV S DAID
Sbjct: 361 VDHNNAAKQLRSPVSDSVHVTSPRNNAEINSKRFSRQGGGNAINSMILKENGVKSMDAID 420

Query: 421 SAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADICPTTNRS 480
           SAVA+MHDPVYSSQNLG+ SNGSNGMNE KQD+SS+DQE IN+NICQKADADIC TTNRS
Sbjct: 421 SAVATMHDPVYSSQNLGKISNGSNGMNEQKQDISSLDQEVINDNICQKADADICSTTNRS 480

Query: 481 NLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDDQNIGNGF 540
           NL+IAA R E KVDSL+KQ SA+QES+QTKPSISD+GV ETA+ QTQMRNNDD NI NG 
Sbjct: 481 NLVIAALRPEPKVDSLIKQTSAAQESVQTKPSISDVGVGETAQIQTQMRNNDDLNIRNGL 540

Query: 541 GSSAHKHSSLNSISGEPSSNKSGHESGEGSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600
           GSSA++ SSLNSISGE S N S  ESGE SQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA
Sbjct: 541 GSSAYEPSSLNSISGEDSLNMSNQESGESSQFEQKTMSVMRGGEMVQVNYKVYIPKRAPA 600

Query: 601 LTRRQLKR 609
           LTRRQLKR
Sbjct: 601 LTRRQLKR 605
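
The percentages in an HSP summary line are the match counts divided by the alignment length. A minimal Python sketch of that arithmetic, using the counts reported on this page for the TrEMBL hit (458 and 510 of 608) and the TAIR hit (178 and 277 of 624). The helper name `percent` is hypothetical and is not part of any BLAST tool.

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)


# Counts copied from the HSP summary lines on this page.
trembl_identity = percent(458, 608)   # A0A1S3CJV0, identical columns
trembl_positives = percent(510, 608)  # A0A1S3CJV0, positive-scoring columns
tair_identity = percent(178, 624)     # AT5G53930.1, identical columns
tair_positives = percent(277, 624)    # AT5G53930.1, positive-scoring columns

print(trembl_identity, trembl_positives, tair_identity, tair_positives)
# → 75.33 83.88 28.53 44.39
```

Both percentages share the alignment length as denominator, gaps included, which is why the TAIR hit is reported over 624 columns.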

BLAST of Lag0037879 vs. TAIR 10
Match: AT5G53930.1 (unknown protein; LOCATED IN: chloroplast; Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 73.2 bits (178), Expect = 8.3e-13
Identity = 178/624 (28.53%), Positives = 277/624 (44.39%), Query Frame = 0

Query: 1   MGKSSSSRKKERSKTSSSQRSRRKSRSSRKLKSKKLRYRHDSPSCSDTDFESSTSVSSSG 60
           MGKSSSS K    K SS++    K + S++ KSKK+R   D    S +D     S   S 
Sbjct: 1   MGKSSSSSK--IIKDSSNKLRSVKKKKSKRNKSKKIRRIKDESESSGSD-----SSLYSS 60

Query: 61  SEGDEKLGRSRSKTRKNAKPSKKKAKKRSDDRRSRDYSPHPR---KRKNLKRDDHCEVKK 120
           SE D      R K ++ +K SKK+++KR     S D S   R   K+K  KR D    KK
Sbjct: 61  SEDD-----YRRKKKRRSKLSKKRSRKRYSSSESDDDSDDDRLLKKKKRSKRKDENVGKK 120

Query: 121 TNK----KKRRRDVSISATSSDSLSCSTCGEGSTTSNEGEIDRHRGRSRKKKGNMGKTER 180
             K    K+R+RD+S S+TSS+     +  +GS + +     R RGR       +GK + 
Sbjct: 121 KKKVVSRKRRKRDLSSSSTSSE----QSDNDGSESDDGKRWSRDRGR------RLGKVKD 180

Query: 181 SRYRSKSHSACSLCSEGSDHQNEVEDGSYVENNFRRLRSVIIVVGEENKLETFDENGQEE 240
           SR RS+        SE  D   + ED    E N RRL+S+++V            NG+ +
Sbjct: 181 SRSRSRDELEGE--SEEPDECWQGEDEVIPEKNPRRLKSIVVVSYNYG-------NGERK 240

Query: 241 EVVHQPDDDHLSFGVMGSKDGASKRELDYVTSKEAPEVENKKEVVIPDNRNSMVVKDDGV 300
           E     +DD   +       G   REL Y  S+++ E++ +      D+ + +   D+G 
Sbjct: 241 E-----EDDRDVY----MTRGGGNRELGY--SEDSDEMDGES----IDSYSRIRADDNGF 300

Query: 301 QNEGSNNNHGVVIHDRSLNERKNGCSGNNDSINCIDLESILRQRALENLRKFKRVPLRNV 360
                +    V   D SL +               DLE+IL++RALENL++F+ V     
Sbjct: 301 GEYNKSETSKVSHTDNSLKDD--------------DLEAILKKRALENLKRFRGV----- 360

Query: 361 KTPDNCKVDNNNDAKQLHSPVSKSVHVTSPRDDAKINGNGLSRQGGGNEVNSMIVEENGV 420
                     +  AK+  S VS+                G   Q    +V S   +++ +
Sbjct: 361 -------TQKSGIAKKEVSSVSE----------------GEPMQIESEKVES---QDHDL 420

Query: 421 ISTDAIDSAVASMHDPVYSSQNLGRTSNGSNGMNELKQDVSSVDQEAINNNICQKADADI 480
           +     DSAV+     + +S+ +    N       L    S  DQ++  +    K  + +
Sbjct: 421 MEQKLCDSAVSK---DLENSEKILHVINVKESGTALANSASQQDQQS-GDTAKVKVSSGL 480

Query: 481 CPTTNRSNLIIAASRAESKVDSLMKQASASQESIQTKPSISDIGVDETAETQTQMRNNDD 540
              T +  L+      +S   +  K+AS SQ++       S +  +    T   +  N+ 
Sbjct: 481 SSCTTKRKLVRPVLSKDSLNLASKKEASGSQDAEAESIDGSTVDKNCLESTLALVTKNEG 529

Query: 541 QNI-----GNGFGSSAHKHSSLNSI----SGEPSSNKSGHESGEGSQFEQKTMSVMRGGE 600
           ++I      +   + +  H+    +     G  S  K+  E+ + SQ+EQKTM+VMRGGE
Sbjct: 541 EHIEPTKVSSTLNAESSSHADTEEVDEVKGGSQSEQKTIDETKDESQYEQKTMTVMRGGE 529

Query: 601 MVQVNYKVYIPKRAPALTRRQLKR 609
           MVQV+YKVYIPK+A +L RR+L R
Sbjct: 601 MVQVSYKVYIPKKASSLGRRKLNR 529
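
In each alignment block, the middle line shows a residue letter where Query and Sbjct are identical, a `+` where the substitution scores positively, and a space otherwise. The sketch below counts both quantities for the final block of the TAIR alignment above. The function name `count_matches` is an assumption made for illustration. It handles a single block, so totals for a whole HSP would need the blocks summed.

```python
from typing import Tuple


def count_matches(query: str, midline: str, sbjct: str) -> Tuple[int, int]:
    """Return (identities, positives) for one aligned block.

    Identities: columns where query and subject carry the same residue.
    Positives: columns the midline marks with a letter or '+'.
    """
    if not (len(query) == len(midline) == len(sbjct)):
        raise ValueError("aligned strings must have equal length")
    # A gap character never counts as an identity.
    identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))
    # Every non-space midline character is an identity or a '+' column.
    positives = sum(m != " " for m in midline)
    return identities, positives


# Last block of the AT5G53930.1 alignment, copied from this page.
query = "MVQVNYKVYIPKRAPALTRRQLKR"
midline = "MVQV+YKVYIPK+A +L RR+L R"
sbjct = "MVQVSYKVYIPKKASSLGRRKLNR"

identities, positives = count_matches(query, midline, sbjct)
print(identities, positives, len(query))
# → 17 21 24
```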

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_022139776.1  4.9e-241  80.00     uncharacterized protein LOC111010601 [Momordica charantia]
XP_038897880.1  7.0e-240  79.02     histone-lysine N-methyltransferase SETD2 isoform X1 [Benincasa hispida]
XP_023535556.1  2.3e-235  77.34     transcriptional regulator ATRX isoform X1 [Cucurbita pepo subsp. pepo]
KAG7024815.1    2.2e-233  77.50     hypothetical protein SDJN02_13634, partial [Cucurbita argyrosperma subsp. argyro...
XP_022935712.1  6.6e-230  76.52     uncharacterized protein LOC111442542 isoform X1 [Cucurbita moschata]

Match Name      E-value   Identity  Description
A0A6J1CDR0      2.4e-241  80.00     uncharacterized protein LOC111010601 OS=Momordica charantia OX=3673 GN=LOC111010...
A0A6J1F6D1      3.2e-230  76.52     uncharacterized protein LOC111442542 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1IGY0      6.0e-229  75.86     uncharacterized protein LOC111476850 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A0A0L248      6.0e-229  76.64     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G630530 PE=4 SV=1
A0A1S3CJV0      1.7e-223  75.33     uncharacterized protein LOC103501777 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...

Match Name      E-value   Identity  Description
AT5G53930.1     8.3e-13   28.53     unknown protein; LOCATED IN: chloroplast; Has 1807 Blast hits to 1807 proteins i...

InterPro
Analysis Name: InterPro Annotations of Sponge gourd (AG-4) v1
Date Performed: 2022-08-01
IPR Term  IPR Description   Source       Source Term  Source Description                           Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 90..117
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 529..575
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 357..379
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 356..402
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 15..37
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 130..149
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 47..63
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 1..177
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 72..89
None      No IPR available  MOBIDB_LITE  mobidb-lite  disorder_prediction                          coord: 496..517
None      No IPR available  PANTHER      PTHR36808    TRANSCRIPTIONAL REGULATOR ATRX-LIKE PROTEIN  coord: 1..608
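
Several of the MobiDB-lite coordinate ranges listed above overlap or nest (for example 357..379 lies inside 356..402, and 1..177 contains five shorter ranges). A minimal Python sketch, assuming the coordinates are 1-based and inclusive, that merges them into non-redundant regions and totals the residues covered. The function name `merge_intervals` is hypothetical and not part of InterProScan or MobiDB-lite.

```python
from typing import List, Tuple

Interval = Tuple[int, int]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or directly adjacent inclusive intervals."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        # Extend the last region if this interval overlaps or abuts it.
        if merged and start <= merged[-1][1] + 1:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# MobiDB-lite coordinates copied from the InterPro table on this page.
disorder = [
    (90, 117), (529, 575), (357, 379), (356, 402), (15, 37),
    (130, 149), (47, 63), (1, 177), (72, 89), (496, 517),
]

regions = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in regions)

print(regions)
# → [(1, 177), (356, 402), (496, 517), (529, 575)]
print(covered)
# → 293
```

Under these assumptions the ten predictions collapse to four regions covering 293 residues.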

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Lag0037879.1  Lag0037879.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane