Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCTCAAAAGCATTTACACGAGTTATTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCCGACAGACGATCTCTTCTCAAACGCCCTTCCCCCAAAACCCATTTACAGCTCAAGAAACGAAAACCCATCTCCGAAACCTCTGATTTTCCCGGAAACTTCTGCAAGAACGCTTGCTTTTTCTCTTTCCATGACTCCCCCGATCTCAGAAAGTCGCCGCTCATTGAATTTCAGTCTCCGGTGAAAAGCCCCTGCAGAAACTCCAATGCCATTTTCCTCCATGTTCCGGCCAGGACGGCCGGGCTTCTGTTGGAAGCTGCTCTTCGGATTCAGAAACAGTCAACGTCCTCGAGATCCAAAGCCCACGGCAAAACTAATGGTTTGGGGCTTTTGGGTTCTTTTCTGAAGCGGCTGACTCATCGCGGTCGTGCTCGGAAGCGAGAGATCGAAGGCGATGGCCAGAGAAACGACCACGGCGGCAGCCCGGAGAAAATGACGATTGAGGAGAACGAAAACGAGAACGAGAACGACTCTGTTTCTCAGCAGAGTAATGTAACAAGCTTTGATTTCTGTGAGAGTAACTTCTGCGATAGCCCTTTTCGATTTGTACTTCAATCGAGCCCCTCGCCAGGCCACCGGACGCCGGAGTTCTCTTCACCGGTGTCTTCTCCGACTCACCACGACCACCAGGTTTGCCTCTTCTCCAAAACTCATTTTATTACTATTTTTACTTTTATCTTTCAATCTCTTAGCCCAGAAACACCATCTTTATGAACTGGGGTTCACCGGAAAAAGCCCACTGTCGGCAGTGTAAATTCACTGCAATTACCGCCGGAATGTTGCAGTTTCAGGCGTCAAAACATGTCAACCCCACCAGAACAACTTGTCAAAAGCTTCTGAAATTTTCAACGCCACCGTGAATTTTGACGAACAAACTGATAAATCTCTCATTTTCCATTCCACACTCCTTTTCCCGGAAAATTATGGCCCTCCAAGAAAAAGACAAACCCTCTTTTGTATTTTCCCTTGTATTCTAATTCCCATGAGTACCGACATCCCTATTCTATAATTAACATTTTTCCTTTTGGTCTTTGGCATCATTATAATAATTATAAGAGAAAAACCAATAAAGTAGAGATGGAAACCAACAACTTTACCAAAACCAACTAAATCAAAGATCAATTTTTACACACATTTTGTTCTCTTGTCAACTTTTCTTTGCCTTGTTTTTACAAATTTACAGGACAATGATGTAGAAAGCTTGAAGAAATTGCCAGTTGAGGATGAAGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCTGTATTGGATCCTCCGTTCGAGGACGACGACGAAGGGCATTATGAAGATGGTGAGGATGAGGACGATTGCAATTTGGAATGCAGCTACGCCATTGTACAAAGTATGTAACCATGATTCAACTCAACTGCTAATCTCCTAAAGAAAACTTATATTAGTTTTGCTTTTTTGCACTGATCCGAGTTTGTTGTGTGGTCGTTTTCAATGACATTTGCATATAGGTACTGTCTGACTTGAGATTTGTTACTGATACAAATGTCTGTTACACTTTATAGAAATGGTTAATAGAGCTTCACAACTCATAAACATGTTGCTGTACTAGATATGAACACTTGTATCTTTGGTGGGTTCCATTGGGGAGATTTCAATTATCAAAAGGGTGGAGGGTGAAATCAAAACAAGCTTTTGAATTTAGGTAAATCTGTACTAGAATAAAGGGAAAGAGGGAGAAAATGTTTCGTTGTCAGATAAATATAAGACGATGACCTCACGAGATATTTCAGCGCAAACGGTATTATTAGACACTTCTTTCTGGCCTGGAAGGCCTCTTCTGTGGAGAAATGTTTTGGAGGTCTGGCCCTTTTCTTTTTCTTGTTCTTCAAAACTTCAAAACTTGACCCACATTTTGCTGTAAGAGATGTGTCATAATTGTGGGAGTCCTCTTGAATTTATT
AGGCTATTCAATCATTGAACCAATTTATTCATCGTTTCGAACGGCATCATCATTGTCTGTACTTCAGAACAAGGCCCCCCCTCCCCTCCCATGCCCGCCCGTCCAAATTTTTTACTGTTTGAGCATTTAATAGGGATTTTGAAGCATATCAGTGACTACTCTGTGAAAGCATGGCAATGAGTTGTTCTAGTCATTAGGCATATGCACTTCATCTGTAAAAGCTGTCTTTGTAACACTTTCTATTAATTATCATGTACTTTTTACTTATGAAATACCTAAATTGTCTAGTTCATTCTCTTCATTAGGCATGTGATTGTATTTGAGCTAGAGATTCCAAATTCAACTCTTTGCATTGTCATTTCATTCTCTTTTCTCGAATTTAACAGTTCGCTCTTACTGAAAATCGAACATTTAGTTTTCTTCCATAAGTCAACAATTTTGAAGTTCAAAGACAGAAAATATGTTTGTCTTCTTCTGCAATTAAAAGGAGTTGCACGATAATCACAACCCCCAAAAGTTTTCTACCTTGTTAGTACAGTGTTCAGTAGCTTGAGCTTGGTTGGTACAAACCCTGCATTGAATTACCTTCATGGCCATGGTCCTACTTGTTTGTTGTGCAATCGAAAGCGGTATAAATGTCAAATTTTTCGATCGGGATTTCAAATTTCTATTTTACCTTCTTCCAGGAGTTTAATTTCCACTGTACTTGGATTCTTCTTGATCCTCATTTAATGGCTGTAAATTTAAGCCTGTTTCTAAACTCTCCATGTAAATTTTTGTGATATTTTCAGAGGCGAAGCATCAGCTATTAAAAAAGCTTCGGAGATTCGAGAAACTAGCAGAACTAGATCCAGTAGAACTTGAGACGTTTCTACTAAAGGGCGAGGAAGATGAACTCGACGACAACGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACAAAAGCCATAACTTTGATCTACCTAATAACGAAAAGGACATCAAACAACATGACGTAGAGGCCAATGGTAGTTCAAGCTTCAAAATTCCTCACCGACCCACGAAAGATATGAAGAGACTGGTCTGCAATCTCGTTACTGAGGACGAGAGAGATCGGTTTGTGATAGACAACAAAGAAGAGAAGATGAAGAGGGTCTACGTGAGATCGGATTTGTCGAAACGGGTGGACATGAACGCCATAGACGTGACGGTGGGGCAAGAGTTGAAGGGAGAACTTGATGGGTGGAACAGAAATGGGAAGCAGAGAGGAGAGATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTGGTTGAGGAAATGCAAACTGAACTAGATTGCTCAACTCATTAA
mRNA sequence
ATGGCTCAAAAGCATTTACACGAGTTATTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCCGACAGACGATCTCTTCTCAAACGCCCTTCCCCCAAAACCCATTTACAGCTCAAGAAACGAAAACCCATCTCCGAAACCTCTGATTTTCCCGGAAACTTCTGCAAGAACGCTTGCTTTTTCTCTTTCCATGACTCCCCCGATCTCAGAAAGTCGCCGCTCATTGAATTTCAGTCTCCGGTGAAAAGCCCCTGCAGAAACTCCAATGCCATTTTCCTCCATGTTCCGGCCAGGACGGCCGGGCTTCTGTTGGAAGCTGCTCTTCGGATTCAGAAACAGTCAACGTCCTCGAGATCCAAAGCCCACGGCAAAACTAATGGTTTGGGGCTTTTGGGTTCTTTTCTGAAGCGGCTGACTCATCGCGGTCGTGCTCGGAAGCGAGAGATCGAAGGCGATGGCCAGAGAAACGACCACGGCGGCAGCCCGGAGAAAATGACGATTGAGGAGAACGAAAACGAGAACGAGAACGACTCTGTTTCTCAGCAGAGTAATGTAACAAGCTTTGATTTCTGTGAGAGTAACTTCTGCGATAGCCCTTTTCGATTTGTACTTCAATCGAGCCCCTCGCCAGGCCACCGGACGCCGGAGTTCTCTTCACCGGTGTCTTCTCCGACTCACCACGACCACCAGGACAATGATGTAGAAAGCTTGAAGAAATTGCCAGTTGAGGATGAAGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCTGTATTGGATCCTCCGTTCGAGGACGACGACGAAGGGCATTATGAAGATGGTGAGGATGAGGACGATTGCAATTTGGAATGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTATTAAAAAAGCTTCGGAGATTCGAGAAACTAGCAGAACTAGATCCAGTAGAACTTGAGACGTTTCTACTAAAGGGCGAGGAAGATGAACTCGACGACAACGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACAAAAGCCATAACTTTGATCTACCTAATAACGAAAAGGACATCAAACAACATGACGTAGAGGCCAATGGTAGTTCAAGCTTCAAAATTCCTCACCGACCCACGAAAGATATGAAGAGACTGGTCTGCAATCTCGTTACTGAGGACGAGAGAGATCGGTTTGTGATAGACAACAAAGAAGAGAAGATGAAGAGGGTCTACGTGAGATCGGATTTGTCGAAACGGGTGGACATGAACGCCATAGACGTGACGGTGGGGCAAGAGTTGAAGGGAGAACTTGATGGGTGGAACAGAAATGGGAAGCAGAGAGGAGAGATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTGGTTGAGGAAATGCAAACTGAACTAGATTGCTCAACTCATTAA
Coding sequence (CDS)
ATGGCTCAAAAGCATTTACACGAGTTATTGAAAGAGGATCAAGAGCCCTTTCTTCTCACCAACTTCATCGCCGACAGACGATCTCTTCTCAAACGCCCTTCCCCCAAAACCCATTTACAGCTCAAGAAACGAAAACCCATCTCCGAAACCTCTGATTTTCCCGGAAACTTCTGCAAGAACGCTTGCTTTTTCTCTTTCCATGACTCCCCCGATCTCAGAAAGTCGCCGCTCATTGAATTTCAGTCTCCGGTGAAAAGCCCCTGCAGAAACTCCAATGCCATTTTCCTCCATGTTCCGGCCAGGACGGCCGGGCTTCTGTTGGAAGCTGCTCTTCGGATTCAGAAACAGTCAACGTCCTCGAGATCCAAAGCCCACGGCAAAACTAATGGTTTGGGGCTTTTGGGTTCTTTTCTGAAGCGGCTGACTCATCGCGGTCGTGCTCGGAAGCGAGAGATCGAAGGCGATGGCCAGAGAAACGACCACGGCGGCAGCCCGGAGAAAATGACGATTGAGGAGAACGAAAACGAGAACGAGAACGACTCTGTTTCTCAGCAGAGTAATGTAACAAGCTTTGATTTCTGTGAGAGTAACTTCTGCGATAGCCCTTTTCGATTTGTACTTCAATCGAGCCCCTCGCCAGGCCACCGGACGCCGGAGTTCTCTTCACCGGTGTCTTCTCCGACTCACCACGACCACCAGGACAATGATGTAGAAAGCTTGAAGAAATTGCCAGTTGAGGATGAAGAGGAAGAGAAAGAACAGAGCAGTCCTGTGTCTGTATTGGATCCTCCGTTCGAGGACGACGACGAAGGGCATTATGAAGATGGTGAGGATGAGGACGATTGCAATTTGGAATGCAGCTACGCCATTGTACAAAAGGCGAAGCATCAGCTATTAAAAAAGCTTCGGAGATTCGAGAAACTAGCAGAACTAGATCCAGTAGAACTTGAGACGTTTCTACTAAAGGGCGAGGAAGATGAACTCGACGACAACGATGACATTGATCATCTCAAGGAAGAAGAAGAGTACAAAAGCCATAACTTTGATCTACCTAATAACGAAAAGGACATCAAACAACATGACGTAGAGGCCAATGGTAGTTCAAGCTTCAAAATTCCTCACCGACCCACGAAAGATATGAAGAGACTGGTCTGCAATCTCGTTACTGAGGACGAGAGAGATCGGTTTGTGATAGACAACAAAGAAGAGAAGATGAAGAGGGTCTACGTGAGATCGGATTTGTCGAAACGGGTGGACATGAACGCCATAGACGTGACGGTGGGGCAAGAGTTGAAGGGAGAACTTGATGGGTGGAACAGAAATGGGAAGCAGAGAGGAGAGATAGCCATAGAAATAGAGCTTGCAATCTTCAGCTTGCTGGTTGAGGAAATGCAAACTGAACTAGATTGCTCAACTCATTAA
Protein sequence
MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKNACFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTSSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH
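The protein sequence above corresponds to the CDS read in frame 0 with the standard genetic code. A minimal sketch of that translation follows, assuming NCBI translation table 1 and an unambiguous A/C/G/T alphabet; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. The two test strings are the first 24 and the last 12 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_codons = "ATGGCTCAAAAGCATTTACACGAG"  # start of the CDS
last_codons = "TCAACTCATTAA"               # end of the CDS, including the stop codon

print(translate(first_codons))  # MAQKHLHE
print(translate(last_codons))   # STH
```

Passing the full CDS string to `translate` should reproduce the 474-residue protein, since the sequence ends in the stop codon TAA.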
Homology
BLAST of Sgr022861 vs. NCBI nr
Match:
XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])
HSP 1 Score: 708.4 bits (1827), Expect = 4.2e-200
Identity = 384/478 (80.33%), Positives = 413/478 (86.40%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPK++L LK+RKPISET DFPG FCK+
Sbjct: 2 MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCKS 61
Query: 61 ACFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTSS 120
ACFFSFH+SPDLRKSPL EFQSPV RN NAIFLHVPARTAG+LLEAALRIQKQST++
Sbjct: 62 ACFFSFHESPDLRKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQSTAA 121
Query: 121 RSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENENE- 180
RSK HGKTNGLGLLGSFLKRLTHRGRARKREI+GDG+RND GG P KM IEENE+E
Sbjct: 122 RSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDEN 181
Query: 181 -NENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS GHRTPEFSSP +SP DHQDN
Sbjct: 182 VNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQDN 241
Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
DVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDDDEGHYEDGEDED +LE SY IVQKAK
Sbjct: 242 DVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAK 301
Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEK 360
HQLLKKLRRFEKLAELDPVELE+FLLKGEEDELDD+DDIDHLK EEEY+SHNF+
Sbjct: 302 HQLLKKLRRFEKLAELDPVELESFLLKGEEDELDDDDDIDHLK-EEEYESHNFE------ 361
Query: 361 DIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLS 420
QHDVEANGSSSF+IPH RLV N +T ++RD+ V DN+EE K VYVRSDL
Sbjct: 362 ---QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRSDLW 421
Query: 421 KRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH 474
KRVD NAID TVGQ+LK ELDGWNRN QRGE+AIEIELAIFSLLV EMQTELDC TH
Sbjct: 422 KRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458
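The percentages in each HSP header are the matched-column counts divided by the alignment length (gaps included), so they can be recomputed from the fractions. The sketch below parses one statistics line and checks the reported values; the regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tool.

```python
import re

# Matches fragments such as "Identity = 384/478 (80.33%)".
STAT = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line: str) -> dict:
    """Return reported and recomputed percentages for each count on the line."""
    stats = {}
    for label, matched, aligned, reported in STAT.findall(line):
        matched, aligned = int(matched), int(aligned)
        stats[label] = {
            "matched": matched,
            "aligned": aligned,
            "reported": float(reported),
            "recomputed": round(100 * matched / aligned, 2),
        }
    return stats


line = "Identity = 384/478 (80.33%), Positives = 413/478 (86.40%), Query Frame = 0"
for label, values in parse_hsp_stats(line).items():
    print(label, values["reported"], values["recomputed"])
```

The denominator is the number of alignment columns, not the query length, which is why it differs between hits against the same 474-residue query.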
BLAST of Sgr022861 vs. NCBI nr
Match:
XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])
HSP 1 Score: 670.6 bits (1729), Expect = 9.8e-189
Identity = 364/473 (76.96%), Positives = 400/473 (84.57%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPS K+H L KPIS +SDFP FC++
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNPKPISHSSDFPAKFCRS 60
Query: 61 ACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTS 120
ACFFSF+ SPDL SPL FQSPVK+PCRN N IFLHVPARTAGLLLEAALRIQKQST
Sbjct: 61 ACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQSTV 120
Query: 121 SRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENENE 180
+RSK+ GK+NGLG+LGSFLKRLTHRGRARKREI+GDG++ND P KM IE ENE
Sbjct: 121 ARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIE--ENE 180
Query: 181 NENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDND 240
NENDSVS+ SNVT FDFC+SN CDSPFRFVLQSSPSPGH+TPE +SP SSP DHQ ND
Sbjct: 181 NENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQAND 240
Query: 241 VESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKH 300
VE LKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDD NLE S+AIVQ+AKH
Sbjct: 241 VEGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKH 300
Query: 301 QLLKKLRRFEKLAELDPVELETFLLKGE-EDELDDNDDIDHLKEEEEYKSHNFDLPNNEK 360
QLLKKLRRFE+LAELDPVELETFLLK E EDE +D+DDIDHLKEEE+YK K
Sbjct: 301 QLLKKLRRFERLAELDPVELETFLLKDEDEDEDEDDDDIDHLKEEEDYK----------K 360
Query: 361 DIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLS 420
DIK+HD+EAN SS F+IPHRP +DM LVCNLVTE+ERD VI+ +EE MK +YVRSDL
Sbjct: 361 DIKEHDIEANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLW 420
Query: 421 KRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTEL 469
KRVD NAI+V VGQ+LK E+DGW RN +QR EIAIEIE+AIFSLLVEEMQ EL
Sbjct: 421 KRVDSNAINVMVGQDLKEEVDGWKRNKEQRREIAIEIEVAIFSLLVEEMQPEL 461
BLAST of Sgr022861 vs. NCBI nr
Match:
XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])
HSP 1 Score: 629.8 bits (1623), Expect = 1.9e-176
Identity = 348/483 (72.05%), Positives = 388/483 (80.33%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H LK KPI +SDF FC+
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60
Query: 61 NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
+ CFFSF+ SPDL SP FQSPVK+PCRN N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
++RSK+ GK+NGLGLLGSFLKRLTHR RARKREI GDG+ ND P KM IE EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIE--EN 180
Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
E ENDSV + SNVT FDFCESN CDSPFRFVLQSSPSPGHRTPE SSP SSP DHQ N
Sbjct: 181 ETENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAN 240
Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEGH+EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAK 300
Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELD--DNDDIDHLKEEEEYKSHNFDL 360
HQLLKKLRRFE+LAELDP+ELETFLL E EDEL D DDIDHLKEE E
Sbjct: 301 HQLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE-------- 360
Query: 361 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYV 420
EKDIKQH+ E N SS F+IP+RP++D K LVCNL+T++ER+ VI+ EE MKRVY+
Sbjct: 361 -QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYM 420
Query: 421 RSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 474
R DL KRVD NAID+ VG++LK E+DGWN N + RGEIA+EIE+AIFSLLVEEMQ+EL C
Sbjct: 421 RQDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHC 472
BLAST of Sgr022861 vs. NCBI nr
Match:
KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])
HSP 1 Score: 620.5 bits (1599), Expect = 1.2e-173
Identity = 347/482 (71.99%), Positives = 388/482 (80.50%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H LK KPIS + DF FC+
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
Query: 61 NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
+ CFFSF+ SPDL SPL FQSPVK+PCR+ N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
++RSK+ GK+NGLGLLGSFLKRLTHR R+RKREI GDG+ ND P KM IE EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIE--EN 180
Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
E ENDSV + SNVT FDFCESN CDSPFRFVLQSS SPGHRTPE SSPVSSP DHQ N
Sbjct: 181 EKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAN 240
Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAK 300
Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELDDNDDIDHLKEE-EEYKSHNFDLP 360
HQLLKKLRRFE+LAELDP+ELETFLL E EDEL D DDIDHLKEE EEY
Sbjct: 301 HQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY-------- 360
Query: 361 NNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVR 420
EKDIKQH+ E N SS F+ +RP++D K LVCNL+TE+ER+ I+ +EE MKRVY+R
Sbjct: 361 --EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMR 420
Query: 421 SDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCS 474
DL KRVD NAIDV VG++LK E+DGWNRN + RGEI IEIE+AIFSLLVEEMQ+EL C
Sbjct: 421 PDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCL 468
BLAST of Sgr022861 vs. NCBI nr
Match:
XP_023526007.1 (uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 560.8 bits (1444), Expect = 1.1e-155
Identity = 327/475 (68.84%), Positives = 365/475 (76.84%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
MAQKHLHELLKEDQ PFLL NFIADRRSLLKRPSPK+ QL + KPIS++SDF NFC++
Sbjct: 1 MAQKHLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRPKPISDSSDFHRNFCRS 60
Query: 61 ACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTS 120
ACFFSF SPDL SPL EFQSPVK+PCRN N IFLHVPA TAGLLLEAALRIQKQST+
Sbjct: 61 ACFFSFTHSPDLITSSPLFEFQSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQSTA 120
Query: 121 SRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENEN 180
++S++ GK+NGLG LGSFLKRLTHRGR RKREI DG++N GSP + NENENEN
Sbjct: 121 AKSRSLGKSNGLGFLGSFLKRLTHRGRIRKREICSDGRKNGDRGSP---PLPANENENEN 180
Query: 181 DSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVES 240
DSVS+QSN+ C SPFRFVLQSSPSPGHRTPEFSSP SSP +HQ D ES
Sbjct: 181 DSVSRQSNL----------CHSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQVKDAES 240
Query: 241 LKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQLL 300
LKK VEDEEEEKEQSSPVSVLDPPFE+ DEGHY EDD NL+ SYAIVQKAKHQLL
Sbjct: 241 LKKFAVEDEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQLL 300
Query: 301 KKLRRFEKLAELDPVELETFLLKGE-EDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIK 360
KKLRRFE+LAELD VELETFLLK E EDEL+D+ +I HL ++E + DI
Sbjct: 301 KKLRRFERLAELDVVELETFLLKDEDEDELNDDANIAHLDDDESH------------DII 360
Query: 361 QHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRV 420
+H+ NGSS F+IP KRL+ NLVT+DERD VI+ KRV VRS L K V
Sbjct: 361 EHN---NGSSRFQIPR------KRLIYNLVTKDERDVVVIE------KRVLVRSKLWKGV 420
Query: 421 DMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH 474
D NAIDV Q+LKGE+DGW+RNG+QRGEIAIEIELAIFSLLVEEMQTEL C H
Sbjct: 421 DTNAIDVITRQDLKGEVDGWSRNGEQRGEIAIEIELAIFSLLVEEMQTELHCLAH 430
BLAST of Sgr022861 vs. ExPASy TrEMBL
Match:
A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)
HSP 1 Score: 708.4 bits (1827), Expect = 2.0e-200
Identity = 384/478 (80.33%), Positives = 413/478 (86.40%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKN 60
M QKHLHELLKEDQEPF+LTNFIADRRSLLKRPSPK++L LK+RKPISET DFPG FCK+
Sbjct: 2 MPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCKS 61
Query: 61 ACFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTSS 120
ACFFSFH+SPDLRKSPL EFQSPV RN NAIFLHVPARTAG+LLEAALRIQKQST++
Sbjct: 62 ACFFSFHESPDLRKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQSTAA 121
Query: 121 RSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENENE- 180
RSK HGKTNGLGLLGSFLKRLTHRGRARKREI+GDG+RND GG P KM IEENE+E
Sbjct: 122 RSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDEN 181
Query: 181 -NENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
NEN SVS Q+N+TSF FCESNFCDSPFRFVLQSSPS GHRTPEFSSP +SP DHQDN
Sbjct: 182 VNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQDN 241
Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
DVESLKKLPVEDEEEEKEQSSPVS+LDPPFEDDDEGHYEDGEDED +LE SY IVQKAK
Sbjct: 242 DVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAK 301
Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEK 360
HQLLKKLRRFEKLAELDPVELE+FLLKGEEDELDD+DDIDHLK EEEY+SHNF+
Sbjct: 302 HQLLKKLRRFEKLAELDPVELESFLLKGEEDELDDDDDIDHLK-EEEYESHNFE------ 361
Query: 361 DIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLS 420
QHDVEANGSSSF+IPH RLV N +T ++RD+ V DN+EE K VYVRSDL
Sbjct: 362 ---QHDVEANGSSSFQIPH-------RLVRNRITGEQRDQAVTDNREEMTKGVYVRSDLW 421
Query: 421 KRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCSTH 474
KRVD NAID TVGQ+LK ELDGWNRN QRGE+AIEIELAIFSLLV EMQTELDC TH
Sbjct: 422 KRVDSNAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458
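The Momordica charantia hit has the same bit score (708.4) against NCBI nr and ExPASy TrEMBL but different E-values (4.2e-200 and 2.0e-200). Under the Karlin-Altschul relation E = N * 2^(-S'), where S' is the bit score and N the effective search space, a fixed score yields an E-value proportional to N, so the difference reflects database size rather than alignment quality. The sketch below back-calculates N from the two reported values; the function name is illustrative, and because the report rounds its E-values the results are only approximate.

```python
import math


def implied_search_space(bit_score: float, e_value: float) -> float:
    """Solve E = N * 2**(-S') for N, working in log10 to avoid underflow."""
    log10_n = math.log10(e_value) + bit_score * math.log10(2)
    return 10 ** log10_n


nr = implied_search_space(708.4, 4.2e-200)      # hit against NCBI nr
trembl = implied_search_space(708.4, 2.0e-200)  # same hit against ExPASy TrEMBL

print(f"implied search space, nr:     {nr:.3g}")
print(f"implied search space, TrEMBL: {trembl:.3g}")
print(f"ratio nr / TrEMBL:            {nr / trembl:.2f}")
```

Since the bit score is identical, the ratio of the two search spaces equals the ratio of the two E-values.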
BLAST of Sgr022861 vs. ExPASy TrEMBL
Match:
A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)
HSP 1 Score: 629.8 bits (1623), Expect = 9.3e-177
Identity = 348/483 (72.05%), Positives = 388/483 (80.33%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H LK KPI +SDF FC+
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60
Query: 61 NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
+ CFFSF+ SPDL SP FQSPVK+PCRN N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
++RSK+ GK+NGLGLLGSFLKRLTHR RARKREI GDG+ ND P KM IE EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIE--EN 180
Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
E ENDSV + SNVT FDFCESN CDSPFRFVLQSSPSPGHRTPE SSP SSP DHQ N
Sbjct: 181 ETENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQAN 240
Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEGH+EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAK 300
Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELD--DNDDIDHLKEEEEYKSHNFDL 360
HQLLKKLRRFE+LAELDP+ELETFLL E EDEL D DDIDHLKEE E
Sbjct: 301 HQLLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVE-------- 360
Query: 361 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYV 420
EKDIKQH+ E N SS F+IP+RP++D K LVCNL+T++ER+ VI+ EE MKRVY+
Sbjct: 361 -QYEKDIKQHNKEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYM 420
Query: 421 RSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 474
R DL KRVD NAID+ VG++LK E+DGWN N + RGEIA+EIE+AIFSLLVEEMQ+EL C
Sbjct: 421 RQDLWKRVDSNAIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHC 472
BLAST of Sgr022861 vs. ExPASy TrEMBL
Match:
A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)
HSP 1 Score: 620.5 bits (1599), Expect = 5.6e-174
Identity = 347/482 (71.99%), Positives = 388/482 (80.50%), Query Frame = 0
Query: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCK 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S K+H LK KPIS + DF FC+
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
Query: 61 NACFFSFHDSPDL-RKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
+ CFFSF+ SPDL SPL FQSPVK+PCR+ N +F HVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGS---PEKMTIEENEN 180
++RSK+ GK+NGLGLLGSFLKRLTHR R+RKREI GDG+ ND P KM IE EN
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIE--EN 180
Query: 181 ENENDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDN 240
E ENDSV + SNVT FDFCESN CDSPFRFVLQSS SPGHRTPE SSPVSSP DHQ N
Sbjct: 181 EKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQAN 240
Query: 241 DVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAK 300
DVESL+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDD NLE S+AIVQKAK
Sbjct: 241 DVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAK 300
Query: 301 HQLLKKLRRFEKLAELDPVELETFLLKGE---EDELDDNDDIDHLKEE-EEYKSHNFDLP 360
HQLLKKLRRFE+LAELDP+ELETFLL E EDEL D DDIDHLKEE EEY
Sbjct: 301 HQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEY-------- 360
Query: 361 NNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVR 420
EKDIKQH+ E N SS F+ +RP++D K LVCNL+TE+ER+ I+ +EE MKRVY+R
Sbjct: 361 --EKDIKQHNKEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMR 420
Query: 421 SDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDCS 474
DL KRVD NAIDV VG++LK E+DGWNRN + RGEI IEIE+AIFSLLVEEMQ+EL C
Sbjct: 421 PDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCL 468
BLAST of Sgr022861 vs. ExPASy TrEMBL
Match:
A0A6J1G0G0 (uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC111449564 PE=4 SV=1)
HSP 1 Score: 558.5 bits (1438), Expect = 2.6e-155
Identity = 324/473 (68.50%), Positives = 359/473 (75.90%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTH-LQLKKRKPISETSDFPGNFCK 60
MAQKHLHELLKEDQEPFLLTNFIADRR +LKRPSPK+H L L KRKPIS SDFP +FCK
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIADRR-VLKRPSPKSHLLHLNKRKPISHFSDFPASFCK 60
Query: 61 NACFFSFHDSPDLRK-SPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
ACF SF+DSPDLR SPL +FQSPVKSPCRNSNA+FLHVPA TAGLLLEAALRIQKQST
Sbjct: 61 GACFLSFNDSPDLRNPSPLFQFQSPVKSPCRNSNAVFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENE 180
++RS NG GLLGSFLKR THRGR+RKREI+G +RND I NE
Sbjct: 121 AARS------NGFGLLGSFLKRFTHRGRSRKREIDGGCRRNDPRDDHLLPPI------NE 180
Query: 181 NDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVE 240
DSVS+QSNVTS DFCE SPFRFVLQSSPS GHRTPEFSSP SSP HDHQ NDVE
Sbjct: 181 KDSVSRQSNVTSSDFCE-----SPFRFVLQSSPSAGHRTPEFSSPPSSPARHDHQVNDVE 240
Query: 241 SLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQL 300
SLKKLPV+DEEEEKEQSSPVSVLDPPFEDD+EG YEDGED+DD +E SYAIV+KAKHQL
Sbjct: 241 SLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYEMERSYAIVEKAKHQL 300
Query: 301 LKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIK 360
LKKLRRFE+LAELDPVELETFLLK EE ELDD DDIDHLK EEE +SHNFD NNEKD+K
Sbjct: 301 LKKLRRFERLAELDPVELETFLLKDEEGELDD-DDIDHLK-EEECESHNFDRSNNEKDMK 360
Query: 361 QHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRV 420
QH ++ N ++RVY+R DL K V
Sbjct: 361 QHGIDGN---------------------------------------VERVYMRWDLWKEV 414
Query: 421 DMNAIDVTVGQELKGEL-DGWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 471
+ +AIDV G++L+ E+ DGW RNG+ RG+IAIEIE+ IF LLVEEMQTE+DC
Sbjct: 421 ESSAIDVMAGEDLRAEVDDGWKRNGEARGDIAIEIEVEIFRLLVEEMQTEVDC 414
BLAST of Sgr022861 vs. ExPASy TrEMBL
Match:
A0A6J1L3C1 (uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735 PE=4 SV=1)
HSP 1 Score: 548.1 bits (1411), Expect = 3.5e-152
Identity = 323/473 (68.29%), Positives = 361/473 (76.32%), Query Frame = 0
Query: 1 MAQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTH-LQLKKRKPISETSDFPGNFCK 60
MAQKHLHELLKEDQEPFLLTNFIA+RR +LKRPSPK+H L L K KPIS +DFP +FCK
Sbjct: 1 MAQKHLHELLKEDQEPFLLTNFIANRR-VLKRPSPKSHLLHLNKPKPISHFADFPASFCK 60
Query: 61 NACFFSFHDSPDLRK-SPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST 120
ACF SF+ SPDLR SPL +FQSPVKSPCRNSNA+FLHVPA TA LLLEAALRIQKQST
Sbjct: 61 GACFLSFNHSPDLRNPSPLFQFQSPVKSPCRNSNAMFLHVPATTARLLLEAALRIQKQST 120
Query: 121 SSRSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEKMTIEENENENE 180
+RS NG GLLGSFLKR T+RGR+RKREI+G +RND S KM I NENEN
Sbjct: 121 PARS------NGFGLLGSFLKRFTYRGRSRKREIDGGCRRND--PSTAKMAI--NENENG 180
Query: 181 NDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSPSPGHRTPEFSSPVSSPTHHDHQDNDVE 240
NDSVS+QSNVTS S+FCDSPFRFVLQSSPS GHRTPEFSSP SSP DHQ NDVE
Sbjct: 181 NDSVSRQSNVTS-----SDFCDSPFRFVLQSSPSAGHRTPEFSSPPSSPARDDHQVNDVE 240
Query: 241 SLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIVQKAKHQL 300
SLKKLPV+DEEEEKEQSSPVSVLDPPFEDD+EG YEDGED+DD +E SYAIVQKAKHQL
Sbjct: 241 SLKKLPVQDEEEEKEQSSPVSVLDPPFEDDEEGRYEDGEDDDDYKMERSYAIVQKAKHQL 300
Query: 301 LKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDLPNNEKDIK 360
LKKLRRFE+LAELDPVELETFLLK EE +LD DD DHL EEEE KSHNFD NNEKD+K
Sbjct: 301 LKKLRRFERLAELDPVELETFLLKDEEGKLD--DDGDHL-EEEECKSHNFDRSNNEKDMK 360
Query: 361 QHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYVRSDLSKRV 420
QH +E+N ++RVY+R DL K V
Sbjct: 361 QHGIESN---------------------------------------VERVYMRWDLWKEV 415
Query: 421 DMNAIDVTVGQELKGELD-GWNRNGKQRGEIAIEIELAIFSLLVEEMQTELDC 471
+ +AIDV ++L+ E+D GW RNG++RG+IAIEIE+ IF LLVEEMQTE+DC
Sbjct: 421 ESSAIDVMAEEDLRAEVDVGWKRNGEERGDIAIEIEVEIFRLLVEEMQTEVDC 415
BLAST of Sgr022861 vs. TAIR 10
Match:
AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )
HSP 1 Score: 260.8 bits (665), Expect = 2.2e-69
Identity = 210/543 (38.67%), Positives = 292/543 (53.78%), Query Frame = 0
Query: 2 AQKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNFCKNA 61
+Q+HL +LL+EDQEPF L ++I+DRR + + THLQ+KKR+PIS+ + P FC+NA
Sbjct: 3 SQRHLKDLLEEDQEPFQLQSYISDRRCQIN--AHVTHLQVKKRRPISQNAGLPSRFCRNA 62
Query: 62 CFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQSTS-S 121
CFFS +SPD +KSPL E +KSP R+ NAIF+++PARTA +LLEAA+RIQKQS+ S
Sbjct: 63 CFFSLRESPDPKKSPLFE----LKSPNRSQNAIFVNIPARTASILLEAAVRIQKQSSEVS 122
Query: 122 RSKAHGKTNGLGLLGSFLKRLTHRGRARKREIEGDGQRNDHGGSPEK------------- 181
+++ N G+ GS LK+LT+R +KREI G + S K
Sbjct: 123 KTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPVVRKI 182
Query: 182 MTIEENENENEN-------------------------DSVSQQSNVTSFDFCES------ 241
+T + NE EN +SV+ DF S
Sbjct: 183 VTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSISTSSR 242
Query: 242 --------------------NFCDSPFRFVLQSSPS-PGHRTPEFSSPVSSPTHHDH--- 301
FC+SPF FVLQ+ PS G RTP FSSP +SP H H
Sbjct: 243 SNGSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDCHEME 302
Query: 302 -QDNDVESLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAIV 361
+ +VE LKKL +E+EEEEKEQSSPVSVLDPPF+DDDE + DD N+ S+ V
Sbjct: 303 KESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDEDIH-----MDDNNIPSSFRSV 362
Query: 362 QKAKHQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKS-HNFDL 421
QKAKH LL+KL RFE+LA LDP+ELE + E +E ++ +EEEE KS ++ ++
Sbjct: 363 QKAKHLLLQKLCRFEQLAGLDPMELEKRMSDQETEEEEE-------EEEEEMKSLYHCEI 422
Query: 422 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEK---MKR 469
I Q ++ ++P + ++ L+ +L E+ ID + E KR
Sbjct: 423 ------ITQRVLKTYFEEMVEVP----EGVEALISDLAAEELPSD--IDGEAEAAIVAKR 482
BLAST of Sgr022861 vs. TAIR 10
Match:
AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )
HSP 1 Score: 228.8 bits (582), Expect = 9.2e-60
Identity = 183/478 (38.28%), Positives = 265/478 (55.44%), Query Frame = 0
Query: 3 QKHLHELLKEDQEPFLLTNFIADRRSLLKRPSPKTHLQLKKRKPISETSDFPGNF-CKNA 62
+KHLHE L++DQEPF L ++I + RS + + +++KKRK + + PG F C+N+
Sbjct: 7 KKHLHEFLEDDQEPFHLNHYIGNLRSQM----GCSDMRVKKRKSDNVATFPPGLFSCENS 66
Query: 63 CFFSFHDSPDLRKSPLIEFQSPVKSPCRNSNAIFLHVPARTAGLLLEAALRIQKQST--S 122
CFF+ H SPD RKSPL E +SP K R+ +FL +PARTA +LL+AA RIQKQ + +
Sbjct: 67 CFFAAHKSPDPRKSPLFELRSPGKKKIRDGR-VFLQIPARTAAILLDAAARIQKQQSEKA 126
Query: 123 SRSKAHGKTNGLGLLGSFLKRLTHR-GRARKREIEGDGQRNDHGGSPEKMTIEENENENE 182
+KA + NG G+ GS LK LT+R + R +G+ + G P
Sbjct: 127 KTNKARTRGNGFGMFGSVLKLLTYRITKPRLDNADGNAVSLERGSEP------------- 186
Query: 183 NDSVSQQSNVTSFDFCESNFCDSPFRFVLQSSP-SPGHRTPEFSSPVSSPTHHDHQDND- 242
S ++ V D C FC+SPF FVLQ++P S GH+TP F+S +SP +D D
Sbjct: 187 TSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTEDEDS 246
Query: 243 --VESLKKLPVED----EEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDCNLECSYAI 302
ESL+K+ ++ EEE+KEQ SPVSVLDP E++++ + E + NL CS+ I
Sbjct: 247 DETESLEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFEI 306
Query: 303 VQKAKHQLLKKLRRFEKLAELDPVELETFLLKGEEDELDDNDDIDHLKEEEEYKSHNFDL 362
VQ+AK +LLKKLRRFEKLA LDPVELE G+ E +D ++ ++ + EE+ +D
Sbjct: 307 VQRAKRRLLKKLRRFEKLAGLDPVELE-----GKMSEEEDEEEEEYEESEEDDNIRIYDS 366
Query: 363 PNNEKDIKQHDVEANGSSSFKIPHRPTKDMKRLVCNLVTEDERDRFVIDNKEEKMKRVYV 422
+D+ EA S R +D KR DER K+ +M +
Sbjct: 367 DEEYEDVD----EAMARES-----RCAEDEKR-----KKNDER------QKKWRMMNAW- 426
Query: 423 RSDLSKRVDMNAIDVTVGQELKGELDGWNRNGKQRGEIAIEIELAIFSLLVEEMQTEL 469
R L D +D V ++L+ E W R+G + E ++E +IF +L++E EL
Sbjct: 427 RVGLGAEED---VDAVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVLIDEFSREL 434
The following BLAST results are available for this feature:
BLAST of Sgr022861 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_022144766.1 | 4.2e-200 | 80.33 | uncharacterized protein LOC111014376 [Momordica charantia]
XP_038903007.1 | 9.8e-189 | 76.96 | uncharacterized protein LOC120089713 [Benincasa hispida]
XP_011651995.1 | 1.9e-176 | 72.05 | uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ...
KAA0043909.1 | 1.2e-173 | 71.99 | histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak...
XP_023526007.1 | 1.1e-155 | 68.84 | uncharacterized protein LOC111789613 [Cucurbita pepo subsp. pepo]
BLAST of Sgr022861 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A6J1CUE0 | 2.0e-200 | 80.33 | uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014...
A0A0A0LAR8 | 9.3e-177 | 72.05 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1
A0A5D3DNQ5 | 5.6e-174 | 71.99 | Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m...
A0A6J1G0G0 | 2.6e-155 | 68.50 | uncharacterized protein LOC111449564 OS=Cucurbita moschata OX=3662 GN=LOC1114495...
A0A6J1L3C1 | 3.5e-152 | 68.29 | uncharacterized protein LOC111498735 OS=Cucurbita maxima OX=3661 GN=LOC111498735...
BLAST of Sgr022861 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G03670.1 | 2.2e-69 | 38.67 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...
AT2G36420.1 | 9.2e-60 | 38.28 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA...