Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
ATTTGGCGCAACCCTGATTTTGATTCAGCCAAGTCCTTCTCTCCACATTCCCCTCTCTCTGTTTCTTTTGCTCTGTTCTATTTGTGTCAACTTCTTCCACACTCACCTTTTTCTCCACTGAAACTTCAAAAATTTCTTCAACTTCTTCTTACTCCAAATTCATTCCTTTAACCCCAATCCACAATTCTCCATTTAGCTTTCCTCTACTCTTTTGCAGACATTCTGCCACTCCCATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTTCAGCTAAATTTTGCAGGAGCACTTGTTTTTTCTCTTTCAACCATTCCCCTGATCTTGCTAACTCATCCCCGCTTTTTGGGTTTCAGTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTACTCATTTTTATTACTGCTTTTGGGTTTCATCCTAAAATGTCTTGGCTTAGGAATTGCATTTTTATGAACTGGGGTTCACCCGAAACAACCAACCGTCGGCAGTTTTGTGTAAATTCTCTGAAATTACCGTCGGAATGTTGCAGTTTCAGGCGTCAAAACACGTCAACCCCACCAAAATGACTCTGAATTTTGACAAGCAAACCGATAAATCTCTCATTTTCAATTCCACGCTTTCCTTTTTCCCGGAAAATTATGACCCTCCAAGAAAAAGACAAACCCCCCTTTTGTATTTTCCTTTGTATTCTTACCCTATTCTTACAATTAAACATTTTCTCTTTTGGCCTTGGCTTCATTATAATAATTAGAAGAGAAAAAAGCAGGGGAAAAAAACCCAGTAAAGTGAAGATTATGTAAATCAACAACTTTACTAAAAACCAACTAAATTAAAGATCCCTTTTCACACTCATTTCCTTTTGTTAGTTTCTGAACTTTTCTTTGCCTTGTTTTCACACAAATTTACAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTGTACAAAGTATGTAAGCATGATTCAACCCAACTGCTAATTCTCTGTAGGAAGCATATATTAATTTTGTTTTTCTGTCTTGATTCGAGTTTGTTGTGCAGTCATAATCGGTGGCATTTGTATTTAGGTTCTGTCAAACTTGATATCGGTTATTGATACAAATGTATGTCTGTTATGCTCTATAGAAATGGCTAATAGAGCTTCACAACTCATAAGCTTGTTGGTGTGCTAAGTTATGAAAACTTGCATCTTTGGTAGGGTAGGGGTGAAACCAGAACAAACTTTTGAATTTAGGTAAATATGTACTAGAATGAAGGGAAAGAGTGCAAAAATGTTTGATGGCAACATCACGATAAATATCAGCACAAAGGTATTGTTGAACACTCGTTTTCGCCTGGATGGTCTCTTCTGTAGGAGAAATGCTTTGGAGGTCTGGTCCTTTTATTTTTCTTGTTCTTCAAAACTTCAAATCTGACCCACATTTTGCTGTAAGAGAGGTGCTATAATTGTGGGAGTCCTCTTCTATTTATTAGGCTGTTCAATCATTGAACCAATTTATTCATTGTTTTCGAATGCTATCATCACTGTCTGTACTTCAGAACAAGGTCCCCCCAACCCCCGCCCTGCTCGTCCGGTTTTCTTTCTTACTGTTTTGTGTATTTATTAGGGATTTTGAAGCATATCACTGACCACTCTGTGAATGCATGATAATGAGTTGTTCTAGTCACTAGGCATATGCACTTCATCAGTCAAAGTTGTCTTTGTTGCATTAAGAAATAGCTAGACTGTCTAGTTCATCCTTTTCATTTGGCATATGATTGCATTTGAGTTTGAGATTCCAAATTCTATTCTTTGCATTGTCGCTTCTTCAAACTCTTTTTAAATTTACTGATCTGCTCTTTACCGAAAATGAAACACTCAGTCGATTTCGAAATTTGACGTGACAAAGATGTCTGTCTTTGTCTGCAATTAGAAGGTGTTCGATAATAATCACAACCACAAAGTTTTCTACCTTGTTAGTACATTTCAGTAACTTGAGCTTGGCACAAAGTCTACTATAAATTATTGTCATAGACGCGGCCCTACTTGTCTTTGTGTGCAGTTAAAACGGTATAATTGTCGAATTTTCGACCAATATTTCAAATTTTCATTTCACCTTCTTCCAGGGTTATTTAAACTGATCTTACATTTTTGGTAAAAGTAAAACTTGAAGTTGATGTCAAGAAGGCATAAGTTCACGATGAATGCGAAGTTAATCCTGATCTGTTCATGTAATATTTATTTTCAGAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAAACTGCAAGCAATTGGAGTGAGTCCACAAAAAAAAATTTAAAATTTCCACAAAATAATCTCTAGATTTTTTAATCACTCTTAGGAATATATAATCTGACTTCAAGAGCATAGGGTTTAAGTTTAACACCTCTGAAGTAGAAAGATTAGGATGAAAGGGACATCACTATGATTTGTAAATTCATTATCTCTCCATATATTATCATCATCTTATAATATCTATAATTTTAATTTTTGAAATGCTTATCTTCCCCCCTTTTCCTTTTTTATTTTTTGGGAAAAATAAAATGAAAAAGACATGTCAGTTAAAAAGTGAAGGGATGGTTAATGGCAG
mRNA sequence
ATTTGGCGCAACCCTGATTTTGATTCAGCCAAGTCCTTCTCTCCACATTCCCCTCTCTCTGTTTCTTTTGCTCTGTTCTATTTGTGTCAACTTCTTCCACACTCACCTTTTTCTCCACTGAAACTTCAAAAATTTCTTCAACTTCTTCTTACTCCAAATTCATTCCTTTAACCCCAATCCACAATTCTCCATTTAGCTTTCCTCTACTCTTTTGCAGACATTCTGCCACTCCCATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTTCAGCTAAATTTTGCAGGAGCACTTGTTTTTTCTCTTTCAACCATTCCCCTGATCTTGCTAACTCATCCCCGCTTTTTGGGTTTCAGTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTGTACAAAAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAAACTGCAAGCAATTGGAGTGAGTCCACAAAAAAAAATTTAAAATTTCCACAAAATAATCTCTAGATTTTTTAATCACTCTTAGGAATATATAATCTGACTTCAAGAGCATAGGGTTTAAGTTTAACACCTCTGAAGTAGAAAGATTAGGATGAAAGGGACATCACTATGATTTGTAAATTCATTATCTCTCCATATATTATCATCATCTTATAATATCTATAATTTTAATTTTTGAAATGCTTATCTTCCCCCCTTTTCCTTTTTTATTTTTTGGGAAAAATAAAATGAAAAAGACATGTCAGTTAAAAAGTGAAGGGATGGTTAATGGCAG
Coding sequence (CDS)
ATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTTCAGCTAAATTTTGCAGGAGCACTTGTTTTTTCTCTTTCAACCATTCCCCTGATCTTGCTAACTCATCCCCGCTTTTTGGGTTTCAGTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTGTACAAAAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAA
Protein sequence
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRSTCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Homology
BLAST of Pay0012399 vs. ExPASy TrEMBL
Match:
A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)
HSP 1 Score: 911.4 bits (2354), Expect = 1.6e-261
Identity = 468/468 (100.00%), Postives = 468/468 (100.00%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE
Sbjct: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV
Sbjct: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Sbjct: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 468
BLAST of Pay0012399 vs. ExPASy TrEMBL
Match:
A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)
HSP 1 Score: 857.1 bits (2213), Expect = 3.5e-245
Identity = 443/472 (93.86%), Postives = 454/472 (96.19%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPI HS DFSAKFCR
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
STCFFSFNHSPDLANSSP FGFQSPVKTPCR+PNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENET 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHN 360
LLKKLRRFERLAELDP+ELETFLL+DEDQDEDELS DGDDIDHLKEEVE+YEKDIKQHN
Sbjct: 301 LLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVEQYEKDIKQHN 360
Query: 361 KEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSN 420
KEGNDSSRFQ RPSRDTK LVCNLIT+EERN+V IEK EETMKRVYMR DLWKRVDSN
Sbjct: 361 KEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQDLWKRVDSN 420
Query: 421 AIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
AID+MVGKDLKEEVDGWN NKEPRGEI +EIEVAIFSLLVEEMQSELHCL H
Sbjct: 421 AIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCLTH 472
BLAST of Pay0012399 vs. ExPASy TrEMBL
Match:
A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)
HSP 1 Score: 597.0 bits (1538), Expect = 6.6e-167
Identity = 335/473 (70.82%), Postives = 375/473 (79.28%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
M ++HLHELLK+DQEPF+L+NFI DRRSLLKR S KS+ HLK KPIS + DF KFC+
Sbjct: 1 MMPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCK 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
S CFFSF+ SPDL SPLF FQSPV R+PN +F HVPARTAG+LLEAALRIQKQST
Sbjct: 61 SACFFSFHESPDL-RKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE- 180
AARSK GK+NGLGLLGSFLKRLTHR R+RKREI GDGR ND G PLPAKMAIEENE
Sbjct: 121 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 180
Query: 181 ---KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQ 240
EN SV +N+T F FCESN CDSPFRFVLQSS S GHRTPE SSP +SP R DHQ
Sbjct: 181 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 240
Query: 241 ANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK 300
NDVESL+KLP EDEEEEKEQSSPVS+LDPPFEDDDEG++EDGEDED Y+LERS+ IVQK
Sbjct: 241 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 300
Query: 301 AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIK 360
AKHQLLKKLRRFE+LAELDP+ELE+FLL E EDEL D DDIDHLKE EEYE + +
Sbjct: 301 AKHQLLKKLRRFEKLAELDPVELESFLLKGE---EDELDDDDDIDHLKE--EEYESHNFE 360
Query: 361 QHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDS 420
QH+ E N SS FQ P R LV N IT E+R+ + REE K VY+R DLWKRVDS
Sbjct: 361 QHDVEANGSSSFQ-IPHR----LVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDS 420
Query: 421 NAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
NAID VG+DLK E+DGWNRN++ RGE+ IEIE+AIFSLLV EMQ+EL CL H
Sbjct: 421 NAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458
BLAST of Pay0012399 vs. ExPASy TrEMBL
Match:
A0A6J1FAX4 (uncharacterized protein LOC111442411 OS=Cucurbita moschata OX=3662 GN=LOC111442411 PE=4 SV=1)
HSP 1 Score: 548.1 bits (1411), Expect = 3.5e-152
Identity = 318/468 (67.95%), Postives = 354/468 (75.64%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MA+K HLHELLK+DQ PFLL+NFI DRRSLLKR S KS F L KPIS S D FCR
Sbjct: 1 MAQK-HLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRSKPISDSSD----FCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
S CFFSF HSPDL SSPLF F SPVKTPCR+ N +F HVPA TAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFTHSPDLTTSSPLFEFHSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AA+SKS GKSN LG LGSFLKRLTHR R RKREI DGR N R PPLP NE
Sbjct: 121 AAKSKSLGKSNALGFLGSFLKRLTHRGRIRKREICSDGRKNGYRGSPPLPT------NEN 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV R +SNLC+SPFRFVLQSS SPGHRTPE SSP SSPAR +HQ D
Sbjct: 181 ENDSVSR----------QSNLCNSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQVKDA 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESL+KL EDEEEEKEQSSPVSVLDPPFE+ DEG++ EDDYNL+RS+AIVQKAKHQ
Sbjct: 241 ESLKKLAVEDEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELD +ELETFLL DED+DEDEL D DI HL ++ DI +HN
Sbjct: 301 LLKKLRRFERLAELDVVELETFLLKDEDEDEDELDDDADIAHLDDDESH---DIIEHN-- 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SSRFQ P R L+ NL+T+EER++V IE KRV +R +LWK VD+NAID+
Sbjct: 361 -NGSSRFQIPPKR----LIYNLVTKEERDVVVIE------KRVLVRSELWKGVDTNAIDM 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
+ +DLK EVDGW+RN E RGEI I++E+AIFSLLVEEMQ+ELHCLAH
Sbjct: 421 ITRQDLKGEVDGWSRNGEQRGEIAIDVELAIFSLLVEEMQTELHCLAH 426
BLAST of Pay0012399 vs. ExPASy TrEMBL
Match:
A0A6J1J5Y5 (uncharacterized protein LOC111481647 OS=Cucurbita maxima OX=3661 GN=LOC111481647 PE=4 SV=1)
HSP 1 Score: 537.0 bits (1382), Expect = 8.1e-149
Identity = 313/468 (66.88%), Postives = 350/468 (74.79%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MA+K HLHELLK+DQ PFLL+NFI DRRSLLK + KS F L KPIS S DF FCR
Sbjct: 1 MAQK-HLHELLKEDQHPFLLANFIADRRSLLKLPTPKSLFQLNRSKPISDSSDFRRNFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
S CFFSF HSPDL SSPLF F SPVKTPC + N F HVPA TAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFTHSPDLITSSPLFEFHSPVKTPCPNHNGTFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AA SKS GKSNGLG LGSFLKRLTHR R RKREI DGR N R PPLPA
Sbjct: 121 AANSKSLGKSNGLGFLGSFLKRLTHRGRIRKREICSDGRKNGYRGSPPLPA--------N 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV R +SNLC+SPFRFVLQSS S GHRTPE SSP SSPAR +HQ D
Sbjct: 181 ENDSVSR----------QSNLCNSPFRFVLQSSPSSGHRTPEFSSPTSSPARRNHQVKDA 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESL+KL EDEEEEKEQSSPVSVLDPPFE+ +EG++ EDDYNL+RS+AIVQKAKHQ
Sbjct: 241 ESLKKLAVEDEEEEKEQSSPVSVLDPPFEEYEEGHY-----EDDYNLDRSYAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELD +ELETFLL DED+DEDEL+D DI HL ++ DI +H
Sbjct: 301 LLKKLRRFERLAELDVVELETFLLKDEDEDEDELNDDADIAHLDDDESH---DIMEHK-- 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SSRFQ P R L+ NL+T++ER++V IE KRV +R +LWK VD+NAIDV
Sbjct: 361 -NGSSRFQIPPKR----LISNLVTKDERDVVVIE------KRVLVRSELWKGVDTNAIDV 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
++ +DLK EVDGW+RN E RGEI I+IE+AIFSLLVEEMQ+ELH LAH
Sbjct: 421 IMKQDLKGEVDGWSRNGEQRGEIAIDIELAIFSLLVEEMQTELHFLAH 428
BLAST of Pay0012399 vs. NCBI nr
Match:
KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])
HSP 1 Score: 911.4 bits (2354), Expect = 3.3e-261
Identity = 468/468 (100.00%), Postives = 468/468 (100.00%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE
Sbjct: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV
Sbjct: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Sbjct: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 468
BLAST of Pay0012399 vs. NCBI nr
Match:
XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])
HSP 1 Score: 857.1 bits (2213), Expect = 7.3e-245
Identity = 443/472 (93.86%), Postives = 454/472 (96.19%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPI HS DFSAKFCR
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
STCFFSFNHSPDLANSSP FGFQSPVKTPCR+PNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENET 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHN 360
LLKKLRRFERLAELDP+ELETFLL+DEDQDEDELS DGDDIDHLKEEVE+YEKDIKQHN
Sbjct: 301 LLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVEQYEKDIKQHN 360
Query: 361 KEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSN 420
KEGNDSSRFQ RPSRDTK LVCNLIT+EERN+V IEK EETMKRVYMR DLWKRVDSN
Sbjct: 361 KEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQDLWKRVDSN 420
Query: 421 AIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
AID+MVGKDLKEEVDGWN NKEPRGEI +EIEVAIFSLLVEEMQSELHCL H
Sbjct: 421 AIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCLTH 472
BLAST of Pay0012399 vs. NCBI nr
Match:
XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])
HSP 1 Score: 754.6 bits (1947), Expect = 5.1e-214
Identity = 395/466 (84.76%), Postives = 424/466 (90.99%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL NPKPISHS DF AKFCR
Sbjct: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNPKPISHSSDFPAKFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
S CFFSFNHSPDL NSSPLFGFQSPVKTPCR+PNP+F HVPARTAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
ARSKS GKSNGLG+LGSFLKRLTHR R+RKREI GDGR NDPRDGPPLPAKMAIEENE
Sbjct: 121 VARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIEENEN 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV RLSNVTGFDFC+SNLCDSPFRFVLQSS SPGH+TPEL+SP SSPARLDHQANDV
Sbjct: 181 ENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
E L+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDDYNLERSFAIVQ+AKHQ
Sbjct: 241 EGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELDP+ELETFLL DED+DEDE D DDIDHLKEE E+Y+KDIK+H+ E
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEDEDEDE--DDDDIDHLKEE-EDYKKDIKEHDIE 360
Query: 361 GNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAI 420
NDSSRFQ +RP+RD LVCNL+TEEER++V IEKREE MK +Y+R DLWKRVDSNAI
Sbjct: 361 ANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLWKRVDSNAI 420
Query: 421 DVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELH 465
+VMVG+DLKEEVDGW RNKE R EI IEIEVAIFSLLVEEMQ ELH
Sbjct: 421 NVMVGQDLKEEVDGWKRNKEQRREIAIEIEVAIFSLLVEEMQPELH 462
BLAST of Pay0012399 vs. NCBI nr
Match:
XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])
HSP 1 Score: 597.0 bits (1538), Expect = 1.4e-166
Identity = 335/473 (70.82%), Postives = 375/473 (79.28%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
M ++HLHELLK+DQEPF+L+NFI DRRSLLKR S KS+ HLK KPIS + DF KFC+
Sbjct: 1 MMPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCK 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
S CFFSF+ SPDL SPLF FQSPV R+PN +F HVPARTAG+LLEAALRIQKQST
Sbjct: 61 SACFFSFHESPDL-RKSPLFEFQSPV----RNPNAIFLHVPARTAGILLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE- 180
AARSK GK+NGLGLLGSFLKRLTHR R+RKREI GDGR ND G PLPAKMAIEENE
Sbjct: 121 AARSKPHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENED 180
Query: 181 ---KENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQ 240
EN SV +N+T F FCESN CDSPFRFVLQSS S GHRTPE SSP +SP R DHQ
Sbjct: 181 ENVNENGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQ 240
Query: 241 ANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK 300
NDVESL+KLP EDEEEEKEQSSPVS+LDPPFEDDDEG++EDGEDED Y+LERS+ IVQK
Sbjct: 241 DNDVESLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQK 300
Query: 301 AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIK 360
AKHQLLKKLRRFE+LAELDP+ELE+FLL E EDEL D DDIDHLKE EEYE + +
Sbjct: 301 AKHQLLKKLRRFEKLAELDPVELESFLLKGE---EDELDDDDDIDHLKE--EEYESHNFE 360
Query: 361 QHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDS 420
QH+ E N SS FQ P R LV N IT E+R+ + REE K VY+R DLWKRVDS
Sbjct: 361 QHDVEANGSSSFQ-IPHR----LVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDS 420
Query: 421 NAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
NAID VG+DLK E+DGWNRN++ RGE+ IEIE+AIFSLLV EMQ+EL CL H
Sbjct: 421 NAIDATVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458
BLAST of Pay0012399 vs. NCBI nr
Match:
KAG6580678.1 (hypothetical protein SDJN03_20680, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 553.5 bits (1425), Expect = 1.7e-153
Identity = 319/468 (68.16%), Postives = 355/468 (75.85%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MA+K HLHELLK+DQ PFLL+NFI DRRSLLKR S KS F L KPIS S DF FCR
Sbjct: 1 MAQK-HLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRSKPISDSSDFRRNFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
S CFFSF HSPDL SSPLF F SPVKTPCR+ N +F HVPA TAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFTHSPDLTTSSPLFEFHSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AA+SKS GKSN LG LGSFLKRLTHR R RKREI D R N R PPLPA NE
Sbjct: 121 AAKSKSLGKSNALGFLGSFLKRLTHRGRIRKREICSDSRKNGYRGSPPLPA------NEN 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV R +SNLC+SPFRFVLQSS SPGHRTPE SSP SSPAR +HQ D
Sbjct: 181 ENDSVSR----------QSNLCNSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQVKDA 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESL+KL EDEEEEKEQSSPVSVLDPPFE+ DEG++ EDDYNL+RS+AIVQKAKHQ
Sbjct: 241 ESLKKLAVEDEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAEL+ +ELETFLL DED+DEDEL D DI HL ++ DI +HN
Sbjct: 301 LLKKLRRFERLAELEVVELETFLLKDEDEDEDELDDDADIAHLDDDESH---DIIEHN-- 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SSRFQ P R L+ NL+T+EER++V IE KRV +R +LWK VD+NAID+
Sbjct: 361 -NGSSRFQIPPKR----LIYNLVTKEERDVVVIE------KRVLVRSELWKGVDTNAIDM 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 469
+ +DLK EVDGW+RN E RGEI I+IE+AIFSLLVEEMQ+ELHCLAH
Sbjct: 421 ITRQDLKGEVDGWSRNGEQRGEIAIDIELAIFSLLVEEMQTELHCLAH 430
BLAST of Pay0012399 vs. TAIR 10
Match:
AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )
HSP 1 Score: 229.6 bits (584), Expect = 5.3e-60
Identity = 195/535 (36.45%), Postives = 288/535 (53.83%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
MA ++HL +LL++DQEPF L ++I+DRR + ++ +H +K +PIS + ++FCR
Sbjct: 1 MASQRHLKDLLEEDQEPFQLQSYISDRRCQI--NAHVTHLQVKKRRPISQNAGLPSRFCR 60
Query: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
+ CFFS SPD SPLF +K+P RS N +F ++PARTA +LLEAA+RIQKQS+
Sbjct: 61 NACFFSLRESPD-PKKSPLF----ELKSPNRSQNAIFVNIPARTASILLEAAVRIQKQSS 120
Query: 121 -AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGD---GRINDP------RDGPPLP 180
+++++ N G+ GS LK+LT+R +KREI G GR++ R P+
Sbjct: 121 EVSKTRTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPVV 180
Query: 181 AKMAI---EENEKENDS--VFRLSNVTGF-----------------------DFCES--- 240
K+ + NE+EN S ++++ T F DF S
Sbjct: 181 RKIVTRKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSIST 240
Query: 241 -----------------------NLCDSPFRFVLQS-SSSPGHRTPELSSPVSSPA---- 300
C+SPF FVLQ+ S+ G RTP SSP +SP
Sbjct: 241 SSRSNGSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDCH 300
Query: 301 RLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSF 360
++ ++ +VE L+KL E+EEEEKEQSSPVSVLDPPF+DDDE DD N+ SF
Sbjct: 301 EMEKESYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDE-----DIHMDDNNIPSSF 360
Query: 361 AIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYE 420
VQKAKH LL+KL RFE+LA LDP+ELE ++D++ +E+E + +++ L +
Sbjct: 361 RSVQKAKHLLLQKLCRFEQLAGLDPMELEK-RMSDQETEEEEEEEEEEMKSLYHCEIITQ 420
Query: 421 KDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM-KRVYMRPDLW 464
+ +K + +E + L+ +L EE + + E + KRV R W
Sbjct: 421 RVLKTYFEE-------MVEVPEGVEALISDLAAEELPSDIDGEAEAAIVAKRVCERLRSW 480
BLAST of Pay0012399 vs. TAIR 10
Match:
AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )
HSP 1 Score: 209.5 bits (532), Expect = 5.7e-54
Identity = 176/476 (36.97%), Postives = 261/476 (54.83%), Query Frame = 0
Query: 3 RKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCRST 62
+K+HLHE L+ DQEPF L+++I + RS + S + K+ + P + C ++
Sbjct: 6 KKKHLHEFLEDDQEPFHLNHYIGNLRSQMGCSDMRVK-KRKSDNVATFPPGLFS--CENS 65
Query: 63 CFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST-- 122
CFF+ + SPD SPLF +SP K R VF +PARTA +LL+AA RIQKQ +
Sbjct: 66 CFFAAHKSPD-PRKSPLFELRSPGKKKIRD-GRVFLQIPARTAAILLDAAARIQKQQSEK 125
Query: 123 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 182
A +K+ + NG G+ GS LK LT+R ++ R + DG +++E +
Sbjct: 126 AKTNKARTRGNGFGMFGSVLKLLTYRI-TKPRLDNADGN------------AVSLERGSE 185
Query: 183 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSS-SSPGHRTPELSSPVSSPARL---DHQ 242
S R V D C C+SPF FVLQ++ SS GH+TP +S +SPAR D
Sbjct: 186 PTSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRSTEDED 245
Query: 243 ANDVESLQKLPAED----EEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFA 302
+++ ESL+K+ ++ EEE+KEQ SPVSVLDP E++++ + E + NL SF
Sbjct: 246 SDETESLEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNLSCSFE 305
Query: 303 IVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLK--EEVEEY 362
IVQ+AK +LLKKLRRFE+LA LDP+ELE + +ED++E+E + ++ D+++ + EEY
Sbjct: 306 IVQRAKRRLLKKLRRFEKLAGLDPVELEGKMSEEEDEEEEEYEESEEDDNIRIYDSDEEY 365
Query: 363 EKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLW 422
E D R SR E+E+ +K +E K+ M + W
Sbjct: 366 E-----------DVDEAMARESR---------CAEDEKR----KKNDERQKKWRMM-NAW 425
Query: 423 KRVDSNA---IDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSEL 464
RV A +D +V KDL+EE W R+ E ++E +IF +L++E EL
Sbjct: 426 -RVGLGAEEDVDAVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVLIDEFSREL 434
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3DNQ5 | 1.6e-261 | 100.00 | Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m... | [more] |
A0A0A0LAR8 | 3.5e-245 | 93.86 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1 | [more] |
A0A6J1CUE0 | 6.6e-167 | 70.82 | uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014... | [more] |
A0A6J1FAX4 | 3.5e-152 | 67.95 | uncharacterized protein LOC111442411 OS=Cucurbita moschata OX=3662 GN=LOC1114424... | [more] |
A0A6J1J5Y5 | 8.1e-149 | 66.88 | uncharacterized protein LOC111481647 OS=Cucurbita maxima OX=3661 GN=LOC111481647... | [more] |
Match Name | E-value | Identity | Description | |
KAA0043909.1 | 3.3e-261 | 100.00 | histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak... | [more] |
XP_011651995.1 | 7.3e-245 | 93.86 | uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ... | [more] |
XP_038903007.1 | 5.1e-214 | 84.76 | uncharacterized protein LOC120089713 [Benincasa hispida] | [more] |
XP_022144766.1 | 1.4e-166 | 70.82 | uncharacterized protein LOC111014376 [Momordica charantia] | [more] |
KAG6580678.1 | 1.7e-153 | 68.16 | hypothetical protein SDJN03_20680, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
AT5G03670.1 | 5.3e-60 | 36.45 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT2G36420.1 | 5.7e-54 | 36.97 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |