Sgr022861 (gene) Monk fruit (Qingpiguo) v1

Overview
NameSgr022861
Typegene
OrganismSiraitia grosvenorii (Monk fruit (Qingpiguo) v1)
DescriptionDUF4378 domain-containing protein
Locationtig00000589: 2514473 .. 2517808 (-)
RNA-Seq ExpressionSgr022861
SyntenySgr022861
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCAAAAGCATTTACACGAGTTATTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCCGACAGACGATCTCTTCTCAAACGCCCTTCCCCCAAAACCCATTTACAGCTCAAGAAACGAAAACCCATCTCCGAAACCTCTGATTTTCCCGGAAACTTCTGCAAGAACGCTTGCTTTTTCTCTTTCCATGACTCCCCCGATCTCAGAAAGTCGCCGCTCATTGAATTTCAGTCTCCGGTGAAAAGCCCCTGCAGAAACTCCAATGCCATTTTCCTCCATGTTCCGGCCAGGACGGCCGGGCTTCTGTTGGAAGCTGCTCTTCGGATTCAGAAACAGTCAACGTCCTCGAGATCCAAAGCCCACGGCAAAACTAATGGTTTGGGGCTTTTGGGTTCTTTTCTGAAGCGGCTGACTCATCGCGGTCGTGCTCGGAAGCGAGAGATCGAAGGCGATGGCCAGAGAAACGACCACGGCGGCAGCCCGGAGAAAATGACGATTGAGGAGAACGAAAACGAGAACGAGAACGACTCTGTTTCTCAGCAGAGTAATGTAACAAGCTTTGATTTCTGTGAGAGTAACTTCTGCGATAGCCCTTTTCGATTTGTACTTCAATCGAGCCCCTCGCCAGGCCACCGGACGCCGGAGTTCTCTTCACCGGTGTCTTCTCCGACTCACCACGACCACCAGGTTTGCCTCTTCTCCAAAACTCATTTTATTACTATTTTTACTTTTATCTTTCAATCTCTTAGCCCAGAAACACCATCTTTATGAACTGGGGTTCACCGGAAAAAGCCCACTGTCGGCAGTGTAAATTCACTGCAATTACCGCCGGAATGTTGCAGTTTCAGGCGTCAAAACATGTCAACCCCACCAGAACAACTTGTCAAAAGCTTCTGAAATTTTCAACGCCACCGTGAATTTTGACGAACAAACTGATAAATCTCTCATTTTCCATTCCACACTCCTTTTCCCGGAAAATTATGGCCCTCCAAGAAAAAGACAAACCCTCTTTTGTATTTTCCCTTGTATTCTAATTCCCATGAGTACCGACATCCCTATTCTATAATTAACATTTTTCCTTTTGGTCTTTGGCATCATTATAATAATTATAAGAGAAAAACCAATAAAGTAGAGATGGAAACCAACAACTTTACCAAAACCAACTAAATCAAAGATCAATTTTTACACACATTTTGTTCTCTTGTCAACTTTTCTTTGCCTTGTTTTTACAAATTTACAGGACAATGATGTAGAAAGCTTGAAGAAATTGCCAGTTGAGGATGAAGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCTGTATTGGATCCTCCGTTCGAGGACGACGACGAAGGGCATTATGAAGATGGTGAGGATGAGGACGATTGCAATTTGGAATGCAGCTACGCCATTGTACAAAGTATGTAACCATGATTCAACTCAACTGCTAATCTCCTAAAGAAAACTTATATTAGTTTTGCTTTTTTGCACTGATCCGAGTTTGTTGTGTGGTCGTTTTCAATGACATTTGCATATAGGTACTGTCTGACTTGAGATTTGTTACTGATACAAATGTCTGTTACACTTTATAGAAATGGTTAATAGAGCTTCACAACTCATAAACATGTTGCTGTACTAGATATGAACACTTGTATCTTTGGTGGGTTCCATTGGGGAGATTTCAATTATCAAAAGGGTGGAGGGTGAAATCAAAACAAGCTTTTGAATTTAGGTAAATCTGTACTAGAATAAAGGGAAAGAGGGAGAAAATGTTTCGTTGTCAGATAAATATAAGACGATGACCTCACGAGATATTTCAGCGCAAACGGTATTATTAGACACTTCTTTCTGGCCTGGAAGGCCTCTTCTGTGGAGAAATGTTTTGGAGGTCTGGCCCTTTTCTTTTTCTTGTTCTTCAAAACTTCAAAACTTGACCCACATTTTGCTGTAAGAGATGTGTCATAATTGTGGGAGTCCTCTTGAATTTATTAGGCTATTCAATCATTGAACCAATTTATTCATCGTTTCGAACGGCATCATCATTGTCTGTACTTCAGAACAAGGCCCCCCCTCCCCTCCCATGCCCGCCCGTCCAAATTTTTTACTGTTTGAGCATTTAATAGGGATTTTGAAGCATATCAGTGACTACTCTGTGAAAGCATGGCAATGAGTTGTTCTAGTCATTAGGCATATGCACTTCATCTGTAAAAGCTGTCTTTGTAACACTTTCTATTAATTATCATGTACTTTTTACTTATGAAATACCTAAATTGTCTAGTTCATTCTCTTCATTAGGCATGTGATTGTATTTGAGCTAGAGATTCCAAATTCAACTCTTTGCATTGTCATTTCATTCTCTTTTCTCGAATTTAACAGTTCGCTCTTACTGAAAATCGAACATTTAGTTTTCTTCCATAAGTCAACAATTTTGAAGTTCAAAGACAGAAAATATGTTTGTCTTCTTCTGCAATTAAAAGGAGTTGCACGATAATCACAACCCCCAAAAGTTTTCTACCTTGTTAGTACAGTGTTCAGTAGCTTGAGCTTGGTTGGTACAAACCCTGCATTGAATTACCTTCATGGCCATGGTCCTACTTGTTTGTTGTGCAATCGAAAGCGGTATAAATGTCAAATTTTTCGATCGGGATTTCAAATTTCTATTTTACCTTCTTCCAGGAGTTTAATTTCCACTGTACTTGGATTCTTCTTGATCCTCATTTAATGGCTGTAAATTTAAGCCTGTTTCTAAACTCTCCATGTAAATTTTTGTGATATTTTCAGAGGCGAAGCATCAGCTATTAAAAAAGCTTCGGAGATTCGAGAAACTAGCAGAACTAGATCCAGTAGAACTTGAGACGTTTCTACTAAAGGGCGAGGAAGATGAACTCGACGACAACGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACAAAAGCCATAACTTTGATCTACCTAATAACGAAAAGGACATCAAACAACATGACGTAGAGGCCAATGGTAGTTCAAGCTTCAAAATTCCTCACCGACCCACGAAAGATATGAAGAGACTGGTCTGCAATCTCGTTACTGAGGACGAGAGAGATCGGTTTGTGATAGACAACAAAGAAGAGAAGATGAAGAGGGTCTACGTGAGATCGGATTTGTCGAAACGGGTGGACATGAACGCCATAGACGTGACGGTGGGGCAAGAGTTGAAGGGAGAACTTGATGGGTGGAACAGAAATGGGAAGCAGAGAGGAGAGATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTGGTTGAGGAAATGCAAACTGAACTAGATTGCTCAACTCATTAA

mRNA sequence

ATGGCTCAAAAGCATTTACACGAGTTATTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCCGACAGACGATCTCTTCTCAAACGCCCTTCCCCCAAAACCCATTTACAGCTCAAGAAACGAAAACCCATCTCCGAAACCTCTGATTTTCCCGGAAACTTCTGCAAGAACGCTTGCTTTTTCTCTTTCCATGACTCCCCCGATCTCAGAAAGTCGCCGCTCATTGAATTTCAGTCTCCGGTGAAAAGCCCCTGCAGAAACTCCAATGCCATTTTCCTCCATGTTCCGGCCAGGACGGCCGGGCTTCTGTTGGAAGCTGCTCTTCGGATTCAGAAACAGTCAACGTCCTCGAGATCCAAAGCCCACGGCAAAACTAATGGTTTGGGGCTTTTGGGTTCTTTTCTGAAGCGGCTGACTCATCGCGGTCGTGCTCGGAAGCGAGAGATCGAAGGCGATGGCCAGAGAAACGACCACGGCGGCAGCCCGGAGAAAATGACGATTGAGGAGAACGAAAACGAGAACGAGAACGACTCTGTTTCTCAGCAGAGTAATGTAACAAGCTTTGATTTCTGTGAGAGTAACTTCTGCGATAGCCCTTTTCGATTTGTACTTCAATCGAGCCCCTCGCCAGGCCACCGGACGCCGGAGTTCTCTTCACCGGTGTCTTCTCCGACTCACCACGACCACCAGGACAATGATGTAGAAAGCTTGAAGAAATTGCCAGTTGAGGATGAAGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCTGTATTGGATCCTCCGTTCGAGGACGACGACGAAGGGCATTATGAAGATGGTGAGGATGAGGACGATTGCAATTTGGAATGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTATTAAAAAAGCTTCGGAGATTCGAGAAACTAGCAGAACTAGATCCAGTAGAACTTGAGACGTTTCTACTAAAGGGCGAGGAAGATGAACTCGACGACAACGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACAAAAGCCATAACTTTGATCTACCTAATAACGAAAAGGACATCAAACAACATGACGTAGAGGCCAATGGTAGTTCAAGCTTCAAAATTCCTCACCGACCCACGAAAGATATGAAGAGACTGGTCTGCAATCTCGTTACTGAGGACGAGAGAGATCGGTTTGTGATAGACAACAAAGAAGAGAAGATGAAGAGGGTCTACGTGAGATCGGATTTGTCGAAACGGGTGGACATGAACGCCATAGACGTGACGGTGGGGCAAGAGTTGAAGGGAGAACTTGATGGGTGGAACAGAAATGGGAAGCAGAGAGGAGAGATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTGGTTGAGGAAATGCAAACTGAACTAGATTGCTCAACTCATTAA

Coding sequence (CDS)

ATGGCTCAAAAGCATTTACACGAGTTATTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCCGACAGACGATCTCTTCTCAAACGCCCTTCCCCCAAAACCCATTTACAGCTCAAGAAACGAAAACCCATCTCCGAAACCTCTGATTTTCCCGGAAACTTCTGCAAGAACGCTTGCTTTTTCTCTTTCCATGACTCCCCCGATCTCAGAAAGTCGCCGCTCATTGAATTTCAGTCTCCGGTGAAAAGCCCCTGCAGAAACTCCAATGCCATTTTCCTCCATGTTCCGGCCAGGACGGCCGGGCTTCTGTTGGAAGCTGCTCTTCGGATTCAGAAACAGTCAACGTCCTCGAGATCCAAAGCCCACGGCAAAACTAATGGTTTGGGGCTTTTGGGTTCTTTTCTGAAGCGGCTGACTCATCGCGGTCGTGCTCGGAAGCGAGAGATCGAAGGCGATGGCCAGAGAAACGACCACGGCGGCAGCCCGGAGAAAATGACGATTGAGGAGAACGAAAACGAGAACGAGAACGACTCTGTTTCTCAGCAGAGTAATGTAACAAGCTTTGATTTCTGTGAGAGTAACTTCTGCGATAGCCCTTTTCGATTTGTACTTCAATCGAGCCCCTCGCCAGGCCACCGGACGCCGGAGTTCTCTTCACCGGTGTCTTCTCCGACTCACCACGACCACCAGGACAATGATGTAGAAAGCTTGAAGAAATTGCCAGTTGAGGATGAAGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCTGTATTGGATCCTCCGTTCGAGGACGACGACGAAGGGCATTATGAAGATGGTGAGGATGAGGACGATTGCAATTTGGAATGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTATTAAAAAAGCTTCGGAGATTCGAGAAACTAGCAGAACTAGATCCAGTAGAACTTGAGACGTTTCTACTAAAGGGCGAGGAAGATGAACTCGACGACAACGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACAAAAGCCATAACTTTGATCTACCTAATAACGAAAAGGACATCAAACAACATGACGTAGAGGCCAATGGTAGTTCAAGCTTCAAAATTCCTCACCGACCCACGAAAGATATGAAGAGACTGGTCTGCAATCTCGTTACTGAGGACGAGAGAGATCGGTTTGTGATAGACAACAAAGAAGAGAAGATGAAGAGGGTCTACGTGAGATCGGATTTGTCGAAACGGGTGGACATGAACGCCATAGACGTGACGGTGGGGCAAGAGTTGAAGGGAGAACTTGATGGGTGGAACAGAAATGGGAAGCAGAGAGGAGAGATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTGGTTGAGGAAATGCAAACTGAACTAGATTGCTCAACTCATTAA

Protein sequence

MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKNACFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTSSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH
Homology
BLAST of Sgr022861 vs. NCBI nr
Match: XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])

HSP 1 Score: 708.4 bits (1827), Expect = 4.2e-200
Identity = 384/478 (80.33%), Postives = 413/478 (86.40%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
           M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPK++L LK+RKPISET DFPG FCK+
Sbjct: 2   MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCKS 61

Query: 61  ACFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTSS 120
           ACFFSFH+SPDLRKSPL EFQSPV    RN NAIFLHVPARTAG+LLEAALRIQKQST++
Sbjct: 62  ACFFSFHESPDLRKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQSTAA 121

Query: 121 RSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENENE- 180
           RSK HGKTNGLGLLGSFLKRLTHRGRARKREI+GDG+RND GG    P KM IEENE+E 
Sbjct: 122 RSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDEN 181

Query: 181 -NENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
            NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS GHRTPEFSSP +SP   DHQDN
Sbjct: 182 VNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQDN 241

Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
           DVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDDDEGHYEDGEDED  +LE SY IVQKAK
Sbjct: 242 DVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAK 301

Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEK 360
           HQLLKKLRRFEKLAELDPVELE+FLLKGEEDELDD+DDIDHLK EEEY+SHNF+      
Sbjct: 302 HQLLKKLRRFEKLAELDPVELESFLLKGEEDELDDDDDIDHLK-EEEYESHNFE------ 361

Query: 361 DIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLS 420
              QHDVEANGSSSF+IPH       RLV N +T ++RD+ V DN+EE  K VYVRSDL 
Sbjct: 362 ---QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRSDLW 421

Query: 421 KRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH 474
           KRVD NAID TVGQ+LK ELDGWNRN  QRGE+AIEIELAIFSLLV EMQTELDC TH
Sbjct: 422 KRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458

BLAST of Sgr022861 vs. NCBI nr
Match: XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])

HSP 1 Score: 670.6 bits (1729), Expect = 9.8e-189
Identity = 364/473 (76.96%), Postives = 400/473 (84.57%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
           MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPS K+H  L   KPIS +SDFP  FC++
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNPKPISHSSDFPAKFCRS 60

Query: 61  ACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTS 120
           ACFFSF+ SPDL   SPL  FQSPVK+PCRN N IFLHVPARTAGLLLEAALRIQKQST 
Sbjct: 61  ACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTV 120

Query: 121 SRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENENE 180
           +RSK+ GK+NGLG+LGSFLKRLTHRGRARKREI+GDG++ND       P KM IE  ENE
Sbjct: 121 ARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIE--ENE 180

Query: 181 NENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDND 240
           NENDSVS+ SNVT FDFC+SN CDSPFRFVLQSSPSPGH+TPE +SP SSP   DHQ ND
Sbjct: 181 NENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQAND 240

Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKH 300
           VE LKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDD NLE S+AIVQ+AKH
Sbjct: 241 VEGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKH 300

Query: 301 QLLKKLRRFEKLAELDPVELETFLLKGE-EDELDDNDDIDHLKEEEEYKSHNFDLPNNEK 360
           QLLKKLRRFE+LAELDPVELETFLLK E EDE +D+DDIDHLKEEE+YK          K
Sbjct: 301 QLLKKLRRFERLAELDPVELETFLLKDEDEDEDEDDDDIDHLKEEEDYK----------K 360

Query: 361 DIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLS 420
           DIK+HD+EAN SS F+IPHRP +DM  LVCNLVTE+ERD  VI+ +EE MK +YVRSDL 
Sbjct: 361 DIKEHDIEANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLW 420

Query: 421 KRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTEL 469
           KRVD NAI+V VGQ+LK E+DGW RN +QR EIAIEIE+AIFSLLVEEMQ EL
Sbjct: 421 KRVDSNAINVMVGQDLKEEVDGWKRNKEQRREIAIEIEVAIFSLLVEEMQPEL 461

BLAST of Sgr022861 vs. NCBI nr
Match: XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])

HSP 1 Score: 629.8 bits (1623), Expect = 1.9e-176
Identity = 348/483 (72.05%), Postives = 388/483 (80.33%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H  LK  KPI  +SDF   FC+
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60

Query: 61  NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
           + CFFSF+ SPDL   SP   FQSPVK+PCRN N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
           ++RSK+ GK+NGLGLLGSFLKRLTHR RARKREI GDG+ ND       P KM IE  EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIE--EN 180

Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
           E ENDSV + SNVT FDFCESN CDSPFRFVLQSSPSPGHRTPE SSP SSP   DHQ N
Sbjct: 181 ETENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAN 240

Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
           DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEGH+EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAK 300

Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELD--DNDDIDHLKEEEEYKSHNFDL 360
           HQLLKKLRRFE+LAELDP+ELETFLL  E   EDEL   D DDIDHLKEE E        
Sbjct: 301 HQLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE-------- 360

Query: 361 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYV 420
              EKDIKQH+ E N SS F+IP+RP++D K LVCNL+T++ER+  VI+  EE MKRVY+
Sbjct: 361 -QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYM 420

Query: 421 RSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 474
           R DL KRVD NAID+ VG++LK E+DGWN N + RGEIA+EIE+AIFSLLVEEMQ+EL C
Sbjct: 421 RQDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHC 472

BLAST of Sgr022861 vs. NCBI nr
Match: KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])

HSP 1 Score: 620.5 bits (1599), Expect = 1.2e-173
Identity = 347/482 (71.99%), Postives = 388/482 (80.50%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H  LK  KPIS + DF   FC+
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60

Query: 61  NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
           + CFFSF+ SPDL   SPL  FQSPVK+PCR+ N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
           ++RSK+ GK+NGLGLLGSFLKRLTHR R+RKREI GDG+ ND       P KM IE  EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIE--EN 180

Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
           E ENDSV + SNVT FDFCESN CDSPFRFVLQSS SPGHRTPE SSPVSSP   DHQ N
Sbjct: 181 EKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAN 240

Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
           DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAK 300

Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELDDNDDIDHLKEE-EEYKSHNFDLP 360
           HQLLKKLRRFE+LAELDP+ELETFLL  E   EDEL D DDIDHLKEE EEY        
Sbjct: 301 HQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY-------- 360

Query: 361 NNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVR 420
             EKDIKQH+ E N SS F+  +RP++D K LVCNL+TE+ER+   I+ +EE MKRVY+R
Sbjct: 361 --EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMR 420

Query: 421 SDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCS 474
            DL KRVD NAIDV VG++LK E+DGWNRN + RGEI IEIE+AIFSLLVEEMQ+EL C 
Sbjct: 421 PDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCL 468

BLAST of Sgr022861 vs. NCBI nr
Match: XP_023526007.1 (uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 560.8 bits (1444), Expect = 1.1e-155
Identity = 327/475 (68.84%), Postives = 365/475 (76.84%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
           MAQKHLHELLKEDQ PFLL NFIADRRSLLKRPSPK+  QL + KPIS++SDF  NFC++
Sbjct: 1   MAQKHLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRPKPISDSSDFHRNFCRS 60

Query: 61  ACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTS 120
           ACFFSF  SPDL   SPL EFQSPVK+PCRN N IFLHVPA TAGLLLEAALRIQKQST+
Sbjct: 61  ACFFSFTHSPDLITSSPLFEFQSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQSTA 120

Query: 121 SRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENEN 180
           ++S++ GK+NGLG LGSFLKRLTHRGR RKREI  DG++N   GSP    +  NENENEN
Sbjct: 121 AKSRSLGKSNGLGFLGSFLKRLTHRGRIRKREICSDGRKNGDRGSP---PLPANENENEN 180

Query: 181 DSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVES 240
           DSVS+QSN+          C SPFRFVLQSSPSPGHRTPEFSSP SSP   +HQ  D ES
Sbjct: 181 DSVSRQSNL----------CHSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQVKDAES 240

Query: 241 LKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQLL 300
           LKK  VEDEEEEKEQSSPVSVLDPPFE+ DEGHY     EDD NL+ SYAIVQKAKHQLL
Sbjct: 241 LKKFAVEDEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQLL 300

Query: 301 KKLRRFEKLAELDPVELETFLLKGE-EDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIK 360
           KKLRRFE+LAELD VELETFLLK E EDEL+D+ +I HL ++E +            DI 
Sbjct: 301 KKLRRFERLAELDVVELETFLLKDEDEDELNDDANIAHLDDDESH------------DII 360

Query: 361 QHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRV 420
           +H+   NGSS F+IP       KRL+ NLVT+DERD  VI+      KRV VRS L K V
Sbjct: 361 EHN---NGSSRFQIPR------KRLIYNLVTKDERDVVVIE------KRVLVRSKLWKGV 420

Query: 421 DMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH 474
           D NAIDV   Q+LKGE+DGW+RNG+QRGEIAIEIELAIFSLLVEEMQTEL C  H
Sbjct: 421 DTNAIDVITRQDLKGEVDGWSRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLAH 430

BLAST of Sgr022861 vs. ExPASy TrEMBL
Match: A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)

HSP 1 Score: 708.4 bits (1827), Expect = 2.0e-200
Identity = 384/478 (80.33%), Postives = 413/478 (86.40%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
           M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPK++L LK+RKPISET DFPG FCK+
Sbjct: 2   MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCKS 61

Query: 61  ACFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTSS 120
           ACFFSFH+SPDLRKSPL EFQSPV    RN NAIFLHVPARTAG+LLEAALRIQKQST++
Sbjct: 62  ACFFSFHESPDLRKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQSTAA 121

Query: 121 RSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENENE- 180
           RSK HGKTNGLGLLGSFLKRLTHRGRARKREI+GDG+RND GG    P KM IEENE+E 
Sbjct: 122 RSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDEN 181

Query: 181 -NENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
            NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS GHRTPEFSSP +SP   DHQDN
Sbjct: 182 VNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQDN 241

Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
           DVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDDDEGHYEDGEDED  +LE SY IVQKAK
Sbjct: 242 DVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAK 301

Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEK 360
           HQLLKKLRRFEKLAELDPVELE+FLLKGEEDELDD+DDIDHLK EEEY+SHNF+      
Sbjct: 302 HQLLKKLRRFEKLAELDPVELESFLLKGEEDELDDDDDIDHLK-EEEYESHNFE------ 361

Query: 361 DIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLS 420
              QHDVEANGSSSF+IPH       RLV N +T ++RD+ V DN+EE  K VYVRSDL 
Sbjct: 362 ---QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRSDLW 421

Query: 421 KRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH 474
           KRVD NAID TVGQ+LK ELDGWNRN  QRGE+AIEIELAIFSLLV EMQTELDC TH
Sbjct: 422 KRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458

BLAST of Sgr022861 vs. ExPASy TrEMBL
Match: A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)

HSP 1 Score: 629.8 bits (1623), Expect = 9.3e-177
Identity = 348/483 (72.05%), Postives = 388/483 (80.33%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H  LK  KPI  +SDF   FC+
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60

Query: 61  NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
           + CFFSF+ SPDL   SP   FQSPVK+PCRN N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
           ++RSK+ GK+NGLGLLGSFLKRLTHR RARKREI GDG+ ND       P KM IE  EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIE--EN 180

Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
           E ENDSV + SNVT FDFCESN CDSPFRFVLQSSPSPGHRTPE SSP SSP   DHQ N
Sbjct: 181 ETENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAN 240

Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
           DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEGH+EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAK 300

Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELD--DNDDIDHLKEEEEYKSHNFDL 360
           HQLLKKLRRFE+LAELDP+ELETFLL  E   EDEL   D DDIDHLKEE E        
Sbjct: 301 HQLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE-------- 360

Query: 361 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYV 420
              EKDIKQH+ E N SS F+IP+RP++D K LVCNL+T++ER+  VI+  EE MKRVY+
Sbjct: 361 -QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYM 420

Query: 421 RSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 474
           R DL KRVD NAID+ VG++LK E+DGWN N + RGEIA+EIE+AIFSLLVEEMQ+EL C
Sbjct: 421 RQDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHC 472

BLAST of Sgr022861 vs. ExPASy TrEMBL
Match: A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)

HSP 1 Score: 620.5 bits (1599), Expect = 5.6e-174
Identity = 347/482 (71.99%), Postives = 388/482 (80.50%), Query Frame = 0

Query: 1   MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
           MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H  LK  KPIS + DF   FC+
Sbjct: 1   MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60

Query: 61  NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
           + CFFSF+ SPDL   SPL  FQSPVK+PCR+ N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61  STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120

Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
           ++RSK+ GK+NGLGLLGSFLKRLTHR R+RKREI GDG+ ND       P KM IE  EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIE--EN 180

Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
           E ENDSV + SNVT FDFCESN CDSPFRFVLQSS SPGHRTPE SSPVSSP   DHQ N
Sbjct: 181 EKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAN 240

Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
           DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAK 300

Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELDDNDDIDHLKEE-EEYKSHNFDLP 360
           HQLLKKLRRFE+LAELDP+ELETFLL  E   EDEL D DDIDHLKEE EEY        
Sbjct: 301 HQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY-------- 360

Query: 361 NNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVR 420
             EKDIKQH+ E N SS F+  +RP++D K LVCNL+TE+ER+   I+ +EE MKRVY+R
Sbjct: 361 --EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMR 420

Query: 421 SDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCS 474
            DL KRVD NAIDV VG++LK E+DGWNRN + RGEI IEIE+AIFSLLVEEMQ+EL C 
Sbjct: 421 PDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCL 468

BLAST of Sgr022861 vs. ExPASy TrEMBL
Match: A0A6J1G0G0 (uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC111449564 PE=4 SV=1)

HSP 1 Score: 558.5 bits (1438), Expect = 2.6e-155
Identity = 324/473 (68.50%), Postives = 359/473 (75.90%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTH-LQLKKRKPISETSDFPGNFCK 60
           MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPK+H L L KRKPIS  SDFP +FCK
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCK 60

Query: 61  NACFFSFHDSPDLRK-SPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
            ACF SF+DSPDLR  SPL +FQSPVKSPCRNSNA+FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61  GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120

Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENE 180
           ++RS      NG GLLGSFLKR THRGR+RKREI+G  +RND         I      NE
Sbjct: 121 AARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPI------NE 180

Query: 181 NDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVE 240
            DSVS+QSNVTS DFCE     SPFRFVLQSSPS GHRTPEFSSP SSP  HDHQ NDVE
Sbjct: 181 KDSVSRQSNVTSSDFCE-----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVE 240

Query: 241 SLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQL 300
           SLKKLPV+DEEEEKEQSSPVSVLDPPFEDD+EG YEDGED+DD  +E SYAIV+KAKHQL
Sbjct: 241 SLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQL 300

Query: 301 LKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIK 360
           LKKLRRFE+LAELDPVELETFLLK EE ELDD DDIDHLK EEE +SHNFD  NNEKD+K
Sbjct: 301 LKKLRRFERLAELDPVELETFLLKDEEGELDD-DDIDHLK-EEECESHNFDRSNNEKDMK 360

Query: 361 QHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRV 420
           QH ++ N                                       ++RVY+R DL K V
Sbjct: 361 QHGIDGN---------------------------------------VERVYMRWDLWKEV 414

Query: 421 DMNAIDVTVGQELKGEL-DGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 471
           + +AIDV  G++L+ E+ DGW RNG+ RG+IAIEIE+ IF LLVEEMQTE+DC
Sbjct: 421 ESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC 414

BLAST of Sgr022861 vs. ExPASy TrEMBL
Match: A0A6J1L3C1 (uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735 PE=4 SV=1)

HSP 1 Score: 548.1 bits (1411), Expect = 3.5e-152
Identity = 323/473 (68.29%), Postives = 361/473 (76.32%), Query Frame = 0

Query: 1   MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTH-LQLKKRKPISETSDFPGNFCK 60
           MAQKHLHELLKEDQEPFLLTNFIA+RR +LKRPSPK+H L L K KPIS  +DFP +FCK
Sbjct: 1   MAQKHLHELLKEDQEPFLLTNFIANRR-VLKRPSPKSHLLHLNKPKPISHFADFPASFCK 60

Query: 61  NACFFSFHDSPDLRK-SPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
            ACF SF+ SPDLR  SPL +FQSPVKSPCRNSNA+FLHVPA TA LLLEAALRIQKQST
Sbjct: 61  GACFLSFNHSPDLRNPSPLFQFQSPVKSPCRNSNAMFLHVPATTARLLLEAALRIQKQST 120

Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENE 180
            +RS      NG GLLGSFLKR T+RGR+RKREI+G  +RND   S  KM I  NENEN 
Sbjct: 121 PARS------NGFGLLGSFLKRFTYRGRSRKREIDGGCRRND--PSTAKMAI--NENENG 180

Query: 181 NDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVE 240
           NDSVS+QSNVTS     S+FCDSPFRFVLQSSPS GHRTPEFSSP SSP   DHQ NDVE
Sbjct: 181 NDSVSRQSNVTS-----SDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARDDHQVNDVE 240

Query: 241 SLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQL 300
           SLKKLPV+DEEEEKEQSSPVSVLDPPFEDD+EG YEDGED+DD  +E SYAIVQKAKHQL
Sbjct: 241 SLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVQKAKHQL 300

Query: 301 LKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIK 360
           LKKLRRFE+LAELDPVELETFLLK EE +LD  DD DHL EEEE KSHNFD  NNEKD+K
Sbjct: 301 LKKLRRFERLAELDPVELETFLLKDEEGKLD--DDGDHL-EEEECKSHNFDRSNNEKDMK 360

Query: 361 QHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRV 420
           QH +E+N                                       ++RVY+R DL K V
Sbjct: 361 QHGIESN---------------------------------------VERVYMRWDLWKEV 415

Query: 421 DMNAIDVTVGQELKGELD-GWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 471
           + +AIDV   ++L+ E+D GW RNG++RG+IAIEIE+ IF LLVEEMQTE+DC
Sbjct: 421 ESSAIDVMAEEDLRAEVDVGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDC 415

BLAST of Sgr022861 vs. TAIR 10
Match: AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )

HSP 1 Score: 260.8 bits (665), Expect = 2.2e-69
Identity = 210/543 (38.67%), Postives = 292/543 (53.78%), Query Frame = 0

Query: 2   AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKNA 61
           +Q+HL +LL+EDQEPF L ++I+DRR  +   +  THLQ+KKR+PIS+ +  P  FC+NA
Sbjct: 3   SQRHLKDLLEEDQEPFQLQSYISDRRCQIN--AHVTHLQVKKRRPISQNAGLPSRFCRNA 62

Query: 62  CFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTS-S 121
           CFFS  +SPD +KSPL E    +KSP R+ NAIF+++PARTA +LLEAA+RIQKQS+  S
Sbjct: 63  CFFSLRESPDPKKSPLFE----LKSPNRSQNAIFVNIPARTASILLEAAVRIQKQSSEVS 122

Query: 122 RSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEK------------- 181
           +++     N  G+ GS LK+LT+R   +KREI G  +      S  K             
Sbjct: 123 KTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPVVRKI 182

Query: 182 MTIEENENENEN-------------------------DSVSQQSNVTSFDFCES------ 241
           +T +   NE EN                         +SV+        DF  S      
Sbjct: 183 VTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSISTSSR 242

Query: 242 --------------------NFCDSPFRFVLQSSPS-PGHRTPEFSSPVSSPTHHDH--- 301
                                FC+SPF FVLQ+ PS  G RTP FSSP +SP H  H   
Sbjct: 243 SNGSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDCHEME 302

Query: 302 -QDNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIV 361
            +  +VE LKKL +E+EEEEKEQSSPVSVLDPPF+DDDE  +      DD N+  S+  V
Sbjct: 303 KESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIH-----MDDNNIPSSFRSV 362

Query: 362 QKAKHQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKS-HNFDL 421
           QKAKH LL+KL RFE+LA LDP+ELE  +   E +E ++       +EEEE KS ++ ++
Sbjct: 363 QKAKHLLLQKLCRFEQLAGLDPMELEKRMSDQETEEEEE-------EEEEEMKSLYHCEI 422

Query: 422 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEK---MKR 469
                 I Q  ++       ++P    + ++ L+ +L  E+      ID + E     KR
Sbjct: 423 ------ITQRVLKTYFEEMVEVP----EGVEALISDLAAEELPSD--IDGEAEAAIVAKR 482

BLAST of Sgr022861 vs. TAIR 10
Match: AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )

HSP 1 Score: 228.8 bits (582), Expect = 9.2e-60
Identity = 183/478 (38.28%), Postives = 265/478 (55.44%), Query Frame = 0

Query: 3   QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNF-CKNA 62
           +KHLHE L++DQEPF L ++I + RS +      + +++KKRK  +  +  PG F C+N+
Sbjct: 7   KKHLHEFLEDDQEPFHLNHYIGNLRSQM----GCSDMRVKKRKSDNVATFPPGLFSCENS 66

Query: 63  CFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST--S 122
           CFF+ H SPD RKSPL E +SP K   R+   +FL +PARTA +LL+AA RIQKQ +  +
Sbjct: 67  CFFAAHKSPDPRKSPLFELRSPGKKKIRDGR-VFLQIPARTAAILLDAAARIQKQQSEKA 126

Query: 123 SRSKAHGKTNGLGLLGSFLKRLTHR-GRARKREIEGDGQRNDHGGSPEKMTIEENENENE 182
             +KA  + NG G+ GS LK LT+R  + R    +G+    + G  P             
Sbjct: 127 KTNKARTRGNGFGMFGSVLKLLTYRITKPRLDNADGNAVSLERGSEP------------- 186

Query: 183 NDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSP-SPGHRTPEFSSPVSSPTHHDHQDND- 242
             S  ++  V   D C   FC+SPF FVLQ++P S GH+TP F+S  +SP     +D D 
Sbjct: 187 TSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTEDEDS 246

Query: 243 --VESLKKLPVED----EEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAI 302
              ESL+K+  ++    EEE+KEQ SPVSVLDP  E++++  +   E +   NL CS+ I
Sbjct: 247 DETESLEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFEI 306

Query: 303 VQKAKHQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDL 362
           VQ+AK +LLKKLRRFEKLA LDPVELE     G+  E +D ++ ++ + EE+     +D 
Sbjct: 307 VQRAKRRLLKKLRRFEKLAGLDPVELE-----GKMSEEEDEEEEEYEESEEDDNIRIYDS 366

Query: 363 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYV 422
               +D+     EA    S     R  +D KR        DER       K+ +M   + 
Sbjct: 367 DEEYEDVD----EAMARES-----RCAEDEKR-----KKNDER------QKKWRMMNAW- 426

Query: 423 RSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTEL 469
           R  L    D   +D  V ++L+ E   W R+G +  E   ++E +IF +L++E   EL
Sbjct: 427 RVGLGAEED---VDAVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVLIDEFSREL 434

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_022144766.14.2e-20080.33uncharacterized protein LOC111014376 [Momordica charantia][more]
XP_038903007.19.8e-18976.96uncharacterized protein LOC120089713 [Benincasa hispida][more]
XP_011651995.11.9e-17672.05uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ... [more]
KAA0043909.11.2e-17371.99histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak... [more]
XP_023526007.11.1e-15568.84uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo][more]
Match NameE-valueIdentityDescription
Match NameE-valueIdentityDescription
A0A6J1CUE02.0e-20080.33uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014... [more]
A0A0A0LAR89.3e-17772.05Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1[more]
A0A5D3DNQ55.6e-17471.99Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m... [more]
A0A6J1G0G02.6e-15568.50uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC1114495... [more]
A0A6J1L3C13.5e-15268.29uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735... [more]
Match NameE-valueIdentityDescription
AT5G03670.12.2e-6938.67unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT2G36420.19.2e-6038.28unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Monk fruit (Qingpiguo) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 229..249
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 209..228
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 209..280
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 147..174
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 263..280
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 145..184
NoneNo IPR availablePANTHERPTHR33623:SF5HISTONE-LYSINE N-METHYLTRANSFERASE SETD1B-LIKE PROTEINcoord: 1..159
NoneNo IPR availablePANTHERPTHR33623OS04G0572500 PROTEINcoord: 173..471
NoneNo IPR availablePANTHERPTHR33623:SF5HISTONE-LYSINE N-METHYLTRANSFERASE SETD1B-LIKE PROTEINcoord: 173..471
NoneNo IPR availablePANTHERPTHR33623OS04G0572500 PROTEINcoord: 1..159

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Sgr022861.1Sgr022861.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0032259 methylation
molecular_function GO:0008168 methyltransferase activity