Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTCAGCTAAATTTTGCAGGAGCACTTGTTTTTTCTCTTTCAACCATTCCCCTGATCTTGCTAACTCATCCCCGCTTTTTGGGTTTCAGTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGTTTTACTCATTTTTATTACTGCTTTTGGGTTTCATCCTAAAATGTCTTGGCTTAGGAATTGCATTTTATGAACTGGGGTTCACCCGAAACAACCAACCGTCGGCAGTTTTGTGTAATTCTCTGAAATTACCGTCGGAATGTTGCAGTTTCAGGCGTCAAAACACGTCAACCCCACCAAAATGACTCTGAATTTTGACAAGCAAACCGATAAATCTCTCATTTTCAATTCCACGCTTTCCTTTTTCCCGGAAAATTATGACCCTCCAAGAAAAAGACAAACCCCCCTTTGTATTTTCCTTTGTATTCTTACCCTATTCTTACAATTAAACATTTTCTCTTTTGGCCTTGGCTTCATTATAATAATTAGAAGAGAAAAAGCAGGGGAAAAAAACCCAGTAAAGTGAAGATTATGTAAATCAACAACTTTACTAAAAACCAACTAAATTAAAGATCCCTTTTCACACTCATTTCCTTTTGTTAGTTTCTGAACTTTTCTTTGCCTTGTTTTCACACAAATTTACAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTGTACAAAGTATGTAAGCATGATTCAACCCAACTGCTAATTCTCTGTAGGAAGCATATTAATTTTGTTTTTCTGTCTTGATTCGAGTTTGTTGTGCAGTCATAATCGGTGGCATTTGTATTTAGGTTCTGTCAAACTTGATATCGGTTATTGATACAAATGTATGTCTGTTATGCTCTATAGAAATGGCTAATAGAGCTTCACAACTCATAAGCTTGTTGGTGTGCTAAGTTATGAAAACTTGCATCTTTGGTAGGGTAGGGGTGAAACCAGAACAAACTTTTGAATTTAGGTAAATATGTACTAGAATGAAGGGAAAGAGTGCAAAAATGTTTGATGGCAACATCACGATAAATATCAGCACAAAGGTATTGTTGAACACTCGTTTTCGCCTGGATGGTCTCTTCTGTAGGAGAAATGCTTTGGAGGTCTGGTCCTTTTATTTTTCTTGTTCTTCAAAACTTCAAATCTGACCCACATTTTGCTGTAAGAGAGGTGCTATAATTGTGGAGTCCTCTTCTATTTATTAGGCTGTTCAATCATTGAACCAATTTATTCATTGTTTTCGAATGCTATCATCACTGTCTGTACTTCAGAACAAGGTCCCCCCAACCCCGCCCTGCTCGTCCGGTTTTCTTTCTTACTGTTTTGTGTATTTATTAGGGATTTTGAAGCATATCACTGACCACTCTGTGAATGCATGATAATGAGTTGTTCTAGTCACTAGGCATATGCACTTCATCAGTCAAAGTTGTCTTTGTTGCATTAAGAAATAGCTAGACTGTCTAGTTCATCCTTTTCATTTGGCATATGATTGCATTTGAGTTTGAGATTCCAAATTCTATTCTTTGCATTGTCGCTTCTTCAAACTCTTTTTAAATTTACTGATCTGCTCTTTACCGAAAATGAAACACTCAGTCGATTTCGAAATTTGACGTGACAAAGATGTCTGTCTTTGTCTGCAATTAGAAGGTGTTCGATAATAATCACAACCACAAAGTTTTCTACCTTGTTAGTACATTTCAGTAACTTGAGCTTGGCACAAAGTCTACTATAAATTATTGTCATAGACGCGGCCCTACTTGTCTTTGTGTGCAGTTAAAACGGTATAATTGTCGAATTTTCGACCAATATTTCAAATTTTCATTTCACCTTCTTCCAGGGTTATTTAAACTGATCTTACATTTTGGTAAAAGTAAAACTTGAAGTTGATGTCAAGAAGGCATAAGTTCACGATGAATGCGAAGTTAATCCTGATCTGTTCATGTAATATTTATTTTCAGAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAA
mRNA sequence
ATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTCAGCTAAATTTTGCAGGAGCACTTTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTGTACAAAAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAA
Coding sequence (CDS)
ATGGCTCGAAAGCAACACTTACACGAGCTTTTGAAACAGGATCAAGAACCCTTTCTTCTCTCCAATTTCATCAATGACAGACGCTCTCTTCTCAAGCGCTCTTCCTTCAAATCCCATTTCCATCTCAAAAACCCAAAACCCATTTCCCATTCCCCTGATTTTCAGCTAAATTTTGCAGGAGCACTTTCTCCGGTTAAAACCCCTTGTCGAAGCCCCAATCCTGTTTTCTTTCATGTTCCGGCTAGAACGGCTGGACTTCTTTTGGAAGCTGCTTTGAGGATTCAGAAACAGTCAACGGCTGCGAGATCCAAATCCTTTGGAAAATCGAATGGATTAGGCCTTTTGGGTTCTTTTCTTAAGCGTTTGACTCATCGGAGCCGTTCTCGGAAGCGAGAGATCCACGGCGATGGTCGGATAAACGACCCCCGTGATGGCCCGCCATTGCCGGCGAAAATGGCGATCGAGGAGAACGAGAAAGAGAACGACTCTGTTTTTCGGCTGAGTAATGTAACAGGCTTTGATTTCTGTGAGAGTAATTTATGCGATAGTCCTTTTCGGTTCGTGCTTCAATCGAGCTCTTCACCCGGTCACCGGACGCCAGAGCTCTCTTCACCGGTGTCTTCTCCGGCTCGCCTAGACCATCAGGCCAATGATGTAGAGAGCTTGCAAAAATTGCCGGCTGAGGATGAGGAGGAAGAGAAAGAACAGAGCAGTCCCGTGTCGGTATTGGATCCTCCGTTTGAGGACGACGACGAAGGAAATTTCGAGGATGGCGAGGACGAGGATGATTACAATTTGGAACGCAGCTTCGCCATTGTACAAAAGGCAAAGCATCAGCTACTGAAAAAACTTCGAAGATTCGAGAGGCTAGCAGAACTAGACCCCTTAGAACTCGAGACATTTCTACTAAACGACGAAGACCAAGATGAAGACGAACTCAGTGATGGCGATGACATTGATCATCTCAAGGAAGAAGTAGAAGAATACGAAAAGGACATCAAACAACACAACAAAGAGGGCAATGACAGTTCAAGGTTCCAAAATCGACCCTCAAGAGATACAAAGATACTCGTCTGCAATCTCATTACTGAGGAAGAGAGGAACATAGTTGCGATAGAGAAGAGAGAAGAGACAATGAAGAGGGTGTACATGAGACCAGATTTGTGGAAACGGGTAGACTCGAATGCCATCGACGTGATGGTGGGGAAAGATTTGAAAGAAGAAGTTGATGGATGGAACAGAAATAAGGAGCCGAGAGGAGAAATAGGCATTGAAATAGAGGTTGCAATCTTCAGCTTGCTGGTGGAAGAAATGCAAAGTGAACTACATTGCTTAGCTCATTAA
Protein sequence
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFAGALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQSTAARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Homology
BLAST of IVF0023206 vs. ExPASy TrEMBL
Match:
A0A5D3DNQ5 (Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold352G003580 PE=4 SV=1)
HSP 1 Score: 842.0 bits (2174), Expect = 1.1e-240
Identity = 441/468 (94.23%), Postives = 441/468 (94.23%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDF F
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
Query: 61 --------------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
G SPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE
Sbjct: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV
Sbjct: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 448
MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Sbjct: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 468
BLAST of IVF0023206 vs. ExPASy TrEMBL
Match:
A0A0A0LAR8 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1)
HSP 1 Score: 789.3 bits (2037), Expect = 8.6e-225
Identity = 417/472 (88.35%), Postives = 428/472 (90.68%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPI HS DF F
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60
Query: 61 --------------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
G SPVKTPCR+PNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENET 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELS--DGDDIDHLKEEVEEYEKDIKQHN 360
LLKKLRRFERLAELDP+ELETFLL+DEDQDEDELS DGDDIDHLKEEVE+YEKDIKQHN
Sbjct: 301 LLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVEQYEKDIKQHN 360
Query: 361 KEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSN 420
KEGNDSSRFQ RPSRDTK LVCNLIT+EERN+V IEK EETMKRVYMR DLWKRVDSN
Sbjct: 361 KEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQDLWKRVDSN 420
Query: 421 AIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 448
AID+MVGKDLKEEVDGWN NKEPRGEI +EIEVAIFSLLVEEMQSELHCL H
Sbjct: 421 AIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCLTH 472
BLAST of IVF0023206 vs. ExPASy TrEMBL
Match:
A0A6J1CUE0 (uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014376 PE=4 SV=1)
HSP 1 Score: 554.7 bits (1428), Expect = 3.6e-154
Identity = 316/468 (67.52%), Postives = 356/468 (76.07%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
M ++HLHELLK+DQEPF+L+NFI DRRSLLKR S KS+ HLK KPIS + DF F
Sbjct: 1 MMPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCK 60
Query: 61 ---------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQSTAARSK 120
L ++P R+PN +F HVPARTAG+LLEAALRIQKQSTAARSK
Sbjct: 61 SACFFSFHESPDLRKSPLFEFQSPVRNPNAIFLHVPARTAGILLEAALRIQKQSTAARSK 120
Query: 121 SFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE----KE 180
GK+NGLGLLGSFLKRLTHR R+RKREI GDGR ND G PLPAKMAIEENE E
Sbjct: 121 PHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDENVNE 180
Query: 181 NDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVE 240
N SV +N+T F FCESN CDSPFRFVLQSS S GHRTPE SSP +SP R DHQ NDVE
Sbjct: 181 NGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQDNDVE 240
Query: 241 SLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQL 300
SL+KLP EDEEEEKEQSSPVS+LDPPFEDDDEG++EDGEDED Y+LERS+ IVQKAKHQL
Sbjct: 241 SLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAKHQL 300
Query: 301 LKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIKQHNKE 360
LKKLRRFE+LAELDP+ELE+FLL E EDEL D DDIDHLKE EEYE + +QH+ E
Sbjct: 301 LKKLRRFEKLAELDPVELESFLLKGE---EDELDDDDDIDHLKE--EEYESHNFEQHDVE 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SS FQ P R LV N IT E+R+ + REE K VY+R DLWKRVDSNAID
Sbjct: 361 ANGSSSFQ-IPHR----LVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDSNAIDA 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 448
VG+DLK E+DGWNRN++ RGE+ IEIE+AIFSLLV EMQ+EL CL H
Sbjct: 421 TVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458
BLAST of IVF0023206 vs. ExPASy TrEMBL
Match:
A0A6J1FAX4 (uncharacterized protein LOC111442411 OS=Cucurbita moschata OX=3662 GN=LOC111442411 PE=4 SV=1)
HSP 1 Score: 501.9 bits (1291), Expect = 2.8e-138
Identity = 301/464 (64.87%), Postives = 338/464 (72.84%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDF------ 60
MA+K HLHELLK+DQ PFLL+NFI DRRSLLKR S KS F L KPIS S DF
Sbjct: 1 MAQK-HLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRSKPISDSSDFCRSACF 60
Query: 61 -------QLNFAGAL----SPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQSTAARS 120
L + L SPVKTPCR+ N +F HVPA TAGLLLEAALRIQKQSTAA+S
Sbjct: 61 FSFTHSPDLTTSSPLFEFHSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQSTAAKS 120
Query: 121 KSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEKENDS 180
KS GKSN LG LGSFLKRLTHR R RKREI DGR N R PPLP NE ENDS
Sbjct: 121 KSLGKSNALGFLGSFLKRLTHRGRIRKREICSDGRKNGYRGSPPLPT------NENENDS 180
Query: 181 VFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVESLQ 240
V R +SNLC+SPFRFVLQSS SPGHRTPE SSP SSPAR +HQ D ESL+
Sbjct: 181 VSR----------QSNLCNSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQVKDAESLK 240
Query: 241 KLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQLLKK 300
KL EDEEEEKEQSSPVSVLDPPFE+ DEG++ EDDYNL+RS+AIVQKAKHQLLKK
Sbjct: 241 KLAVEDEEEEKEQSSPVSVLDPPFEEYDEGHY-----EDDYNLDRSYAIVQKAKHQLLKK 300
Query: 301 LRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKEGNDS 360
LRRFERLAELD +ELETFLL DED+DEDEL D DI HL ++ DI +HN N S
Sbjct: 301 LRRFERLAELDVVELETFLLKDEDEDEDELDDDADIAHLDDDESH---DIIEHN---NGS 360
Query: 361 SRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDVMVGK 420
SRFQ P R L+ NL+T+EER++V IE KRV +R +LWK VD+NAID++ +
Sbjct: 361 SRFQIPPKR----LIYNLVTKEERDVVVIE------KRVLVRSELWKGVDTNAIDMITRQ 420
Query: 421 DLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 448
DLK EVDGW+RN E RGEI I++E+AIFSLLVEEMQ+ELHCLAH
Sbjct: 421 DLKGEVDGWSRNGEQRGEIAIDVELAIFSLLVEEMQTELHCLAH 426
BLAST of IVF0023206 vs. ExPASy TrEMBL
Match:
A0A6J1J5Y5 (uncharacterized protein LOC111481647 OS=Cucurbita maxima OX=3661 GN=LOC111481647 PE=4 SV=1)
HSP 1 Score: 487.3 bits (1253), Expect = 7.0e-134
Identity = 295/468 (63.03%), Postives = 334/468 (71.37%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFAG 60
MA+K HLHELLK+DQ PFLL+NFI DRRSLLK + KS F L KPIS S DF+ NF
Sbjct: 1 MAQK-HLHELLKEDQHPFLLANFIADRRSLLKLPTPKSLFQLNRSKPISDSSDFRRNFCR 60
Query: 61 AL---------------------SPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
+ SPVKTPC + N F HVPA TAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFTHSPDLITSSPLFEFHSPVKTPCPNHNGTFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AA SKS GKSNGLG LGSFLKRLTHR R RKREI DGR N R PPLPA
Sbjct: 121 AANSKSLGKSNGLGFLGSFLKRLTHRGRIRKREICSDGRKNGYRGSPPLPA--------N 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV R +SNLC+SPFRFVLQSS S GHRTPE SSP SSPAR +HQ D
Sbjct: 181 ENDSVSR----------QSNLCNSPFRFVLQSSPSSGHRTPEFSSPTSSPARRNHQVKDA 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESL+KL EDEEEEKEQSSPVSVLDPPFE+ +EG++ EDDYNL+RS+AIVQKAKHQ
Sbjct: 241 ESLKKLAVEDEEEEKEQSSPVSVLDPPFEEYEEGHY-----EDDYNLDRSYAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELD +ELETFLL DED+DEDEL+D DI HL ++ DI +H
Sbjct: 301 LLKKLRRFERLAELDVVELETFLLKDEDEDEDELNDDADIAHLDDDESH---DIMEHK-- 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SSRFQ P R L+ NL+T++ER++V IE KRV +R +LWK VD+NAIDV
Sbjct: 361 -NGSSRFQIPPKR----LISNLVTKDERDVVVIE------KRVLVRSELWKGVDTNAIDV 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 448
++ +DLK EVDGW+RN E RGEI I+IE+AIFSLLVEEMQ+ELH LAH
Sbjct: 421 IMKQDLKGEVDGWSRNGEQRGEIAIDIELAIFSLLVEEMQTELHFLAH 428
BLAST of IVF0023206 vs. NCBI nr
Match:
KAA0043909.1 (histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa] >TYK25228.1 histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. makuwa])
HSP 1 Score: 845 bits (2184), Expect = 3.68e-308
Identity = 441/468 (94.23%), Postives = 441/468 (94.23%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDF F
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFSAKFCR 60
Query: 61 --------------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
G SPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPLFGFQSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE
Sbjct: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV
Sbjct: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 447
MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH
Sbjct: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 468
BLAST of IVF0023206 vs. NCBI nr
Match:
XP_011651995.1 (uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical protein Csa_002656 [Cucumis sativus])
HSP 1 Score: 793 bits (2047), Expect = 3.18e-287
Identity = 417/472 (88.35%), Postives = 428/472 (90.68%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPI HS DF F
Sbjct: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPIPHSSDFSAKFCR 60
Query: 61 --------------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
G SPVKTPCR+PNPVFFHVPARTAGLLLEAALRIQKQST
Sbjct: 61 STCFFSFNHSPDLANSSPFFGFQSPVKTPCRNPNPVFFHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AARSKSFGKSNGLGLLGSFLKRLTHRSR+RKREIHGDGR+NDPRDGPPLPAKMAIEENE
Sbjct: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRARKREIHGDGRMNDPRDGPPLPAKMAIEENET 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSS SPGHRTPELSSP SSPARLDHQANDV
Sbjct: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSPSPGHRTPELSSPASSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEG+FEDGEDEDDYNLERSFAIVQKAKHQ
Sbjct: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGHFEDGEDEDDYNLERSFAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGD--DIDHLKEEVEEYEKDIKQHN 360
LLKKLRRFERLAELDP+ELETFLL+DEDQDEDELSDGD DIDHLKEEVE+YEKDIKQHN
Sbjct: 301 LLKKLRRFERLAELDPIELETFLLHDEDQDEDELSDGDGDDIDHLKEEVEQYEKDIKQHN 360
Query: 361 KEGNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSN 420
KEGNDSSRFQ RPSRDTK LVCNLIT+EERN+V IEK EETMKRVYMR DLWKRVDSN
Sbjct: 361 KEGNDSSRFQIPYRPSRDTKTLVCNLITKEERNLVVIEKSEETMKRVYMRQDLWKRVDSN 420
Query: 421 AIDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 447
AID+MVGKDLKEEVDGWN NKEPRGEI +EIEVAIFSLLVEEMQSELHCL H
Sbjct: 421 AIDLMVGKDLKEEVDGWNINKEPRGEIAVEIEVAIFSLLVEEMQSELHCLTH 472
BLAST of IVF0023206 vs. NCBI nr
Match:
XP_038903007.1 (uncharacterized protein LOC120089713 [Benincasa hispida])
HSP 1 Score: 694 bits (1791), Expect = 2.09e-248
Identity = 371/466 (79.61%), Postives = 400/466 (85.84%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
MA+K HLHELLK+DQEPFLL+NFI DRRSLLKR S KSHFHL NPKPISHS DF F
Sbjct: 1 MAQK-HLHELLKEDQEPFLLTNFIADRRSLLKRPSLKSHFHLNNPKPISHSSDFPAKFCR 60
Query: 61 --------------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
G SPVKTPCR+PNP+F HVPARTAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFNHSPDLINSSPLFGFQSPVKTPCRNPNPIFLHVPARTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
ARSKS GKSNGLG+LGSFLKRLTHR R+RKREI GDGR NDPRDGPPLPAKMAIEENE
Sbjct: 121 VARSKSLGKSNGLGVLGSFLKRLTHRGRARKREIDGDGRKNDPRDGPPLPAKMAIEENEN 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV RLSNVTGFDFC+SNLCDSPFRFVLQSS SPGH+TPEL+SP SSPARLDHQANDV
Sbjct: 181 ENDSVSRLSNVTGFDFCDSNLCDSPFRFVLQSSPSPGHQTPELASPASSPARLDHQANDV 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
E L+KLP EDEEEEKEQSSPVSVLDPPFEDDDEG++EDGEDEDDYNLERSFAIVQ+AKHQ
Sbjct: 241 EGLKKLPVEDEEEEKEQSSPVSVLDPPFEDDDEGHYEDGEDEDDYNLERSFAIVQQAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAELDP+ELETFLL DED+DEDE D DDIDHLKEE E+Y+KDIK+H+ E
Sbjct: 301 LLKKLRRFERLAELDPVELETFLLKDEDEDEDE--DDDDIDHLKEE-EDYKKDIKEHDIE 360
Query: 361 GNDSSRFQ--NRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAI 420
NDSSRFQ +RP+RD LVCNL+TEEER++V IEKREE MK +Y+R DLWKRVDSNAI
Sbjct: 361 ANDSSRFQIPHRPARDMTTLVCNLVTEEERDLVVIEKREEMMKGMYVRSDLWKRVDSNAI 420
Query: 421 DVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELH 443
+VMVG+DLKEEVDGW RNKE R EI IEIEVAIFSLLVEEMQ ELH
Sbjct: 421 NVMVGQDLKEEVDGWKRNKEQRREIAIEIEVAIFSLLVEEMQPELH 462
BLAST of IVF0023206 vs. NCBI nr
Match:
XP_022144766.1 (uncharacterized protein LOC111014376 [Momordica charantia])
HSP 1 Score: 558 bits (1438), Expect = 8.61e-195
Identity = 317/468 (67.74%), Postives = 359/468 (76.71%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFAG 60
M ++HLHELLK+DQEPF+L+NFI DRRSLLKR S KS+ HLK KPIS + DF F
Sbjct: 1 MMPQKHLHELLKEDQEPFVLTNFIADRRSLLKRPSPKSNLHLKRRKPISETLDFPGKFCK 60
Query: 61 AL-------------SPV---KTPCRSPNPVFFHVPARTAGLLLEAALRIQKQSTAARSK 120
+ SP+ ++P R+PN +F HVPARTAG+LLEAALRIQKQSTAARSK
Sbjct: 61 SACFFSFHESPDLRKSPLFEFQSPVRNPNAIFLHVPARTAGILLEAALRIQKQSTAARSK 120
Query: 121 SFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENE----KE 180
GK+NGLGLLGSFLKRLTHR R+RKREI GDGR ND G PLPAKMAIEENE E
Sbjct: 121 PHGKTNGLGLLGSFLKRLTHRGRARKREIDGDGRRNDLGGGRPLPAKMAIEENEDENVNE 180
Query: 181 NDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDVE 240
N SV +N+T F FCESN CDSPFRFVLQSS S GHRTPE SSP +SP R DHQ NDVE
Sbjct: 181 NGSVSGQTNLTSFAFCESNFCDSPFRFVLQSSPSSGHRTPEFSSPAASPVRRDHQDNDVE 240
Query: 241 SLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQL 300
SL+KLP EDEEEEKEQSSPVS+LDPPFEDDDEG++EDGEDED Y+LERS+ IVQKAKHQL
Sbjct: 241 SLKKLPVEDEEEEKEQSSPVSILDPPFEDDDEGHYEDGEDEDGYDLERSYTIVQKAKHQL 300
Query: 301 LKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEK-DIKQHNKE 360
LKKLRRFE+LAELDP+ELE+FLL E EDEL D DDIDHLKEE EYE + +QH+ E
Sbjct: 301 LKKLRRFEKLAELDPVELESFLLKGE---EDELDDDDDIDHLKEE--EYESHNFEQHDVE 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SS FQ P R LV N IT E+R+ + REE K VY+R DLWKRVDSNAID
Sbjct: 361 ANGSSSFQI-PHR----LVRNRITGEQRDQAVTDNREEMTKGVYVRSDLWKRVDSNAIDA 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 447
VG+DLK E+DGWNRN++ RGE+ IEIE+AIFSLLV EMQ+EL CL H
Sbjct: 421 TVGQDLKTELDGWNRNEDQRGEVAIEIELAIFSLLVGEMQTELDCLTH 458
BLAST of IVF0023206 vs. NCBI nr
Match:
KAG6580678.1 (hypothetical protein SDJN03_20680, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 507 bits (1305), Expect = 5.01e-175
Identity = 301/468 (64.32%), Postives = 339/468 (72.44%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFAG 60
MA+K HLHELLK+DQ PFLL+NFI DRRSLLKR S KS F L KPIS S DF+ NF
Sbjct: 1 MAQK-HLHELLKEDQHPFLLANFIADRRSLLKRPSPKSLFQLNRSKPISDSSDFRRNFCR 60
Query: 61 AL---------------------SPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST 120
+ SPVKTPCR+ N +F HVPA TAGLLLEAALRIQKQST
Sbjct: 61 SACFFSFTHSPDLTTSSPLFEFHSPVKTPCRNHNGIFLHVPATTAGLLLEAALRIQKQST 120
Query: 121 AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAIEENEK 180
AA+SKS GKSN LG LGSFLKRLTHR R RKREI D R N R PPLPA NE
Sbjct: 121 AAKSKSLGKSNALGFLGSFLKRLTHRGRIRKREICSDSRKNGYRGSPPLPA------NEN 180
Query: 181 ENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSSSSPGHRTPELSSPVSSPARLDHQANDV 240
ENDSV R +SNLC+SPFRFVLQSS SPGHRTPE SSP SSPAR +HQ D
Sbjct: 181 ENDSVSR----------QSNLCNSPFRFVLQSSPSPGHRTPEFSSPTSSPARRNHQVKDA 240
Query: 241 ESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQKAKHQ 300
ESL+KL EDEEEEKEQSSPVSVLDPPFE+ DEG++ED DYNL+RS+AIVQKAKHQ
Sbjct: 241 ESLKKLAVEDEEEEKEQSSPVSVLDPPFEEYDEGHYED-----DYNLDRSYAIVQKAKHQ 300
Query: 301 LLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQHNKE 360
LLKKLRRFERLAEL+ +ELETFLL DED+DEDEL D DI HL ++ DI +HN
Sbjct: 301 LLKKLRRFERLAELEVVELETFLLKDEDEDEDELDDDADIAHLDDDE---SHDIIEHN-- 360
Query: 361 GNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYMRPDLWKRVDSNAIDV 420
N SSRFQ P R L+ NL+T+EER++V IEKR V +R +LWK VD+NAID+
Sbjct: 361 -NGSSRFQIPPKR----LIYNLVTKEERDVVVIEKR------VLVRSELWKGVDTNAIDM 420
Query: 421 MVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSELHCLAH 447
+ +DLK EVDGW+RN E RGEI I+IE+AIFSLLVEEMQ+ELHCLAH
Sbjct: 421 ITRQDLKGEVDGWSRNGEQRGEIAIDIELAIFSLLVEEMQTELHCLAH 430
BLAST of IVF0023206 vs. TAIR 10
Match:
AT5G03670.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G36420.1); Has 700 Blast hits to 624 proteins in 104 species: Archae - 0; Bacteria - 18; Metazoa - 333; Fungi - 60; Plants - 73; Viruses - 24; Other Eukaryotes - 192 (source: NCBI BLink). )
HSP 1 Score: 200.7 bits (509), Expect = 2.5e-51
Identity = 183/530 (34.53%), Postives = 273/530 (51.51%), Query Frame = 0
Query: 1 MARKQHLHELLKQDQEPFLLSNFINDRRSLLKRSSFKSHFHLKNPKPISHSPDFQLNFA- 60
MA ++HL +LL++DQEPF L ++I+DRR + ++ +H +K +PIS + F
Sbjct: 1 MASQRHLKDLLEEDQEPFQLQSYISDRRCQI--NAHVTHLQVKKRRPISQNAGLPSRFCR 60
Query: 61 ---------------GALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQKQST-AARS 120
L +K+P RS N +F ++PARTA +LLEAA+RIQKQS+ +++
Sbjct: 61 NACFFSLRESPDPKKSPLFELKSPNRSQNAIFVNIPARTASILLEAAVRIQKQSSEVSKT 120
Query: 121 KSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGD---GRINDP------RDGPPLPAKMAI 180
++ N G+ GS LK+LT+R +KREI G GR++ R P+ K+
Sbjct: 121 RTRNAGNAFGIFGSVLKKLTNR---KKREISGGKEAGRVSSSSVKDMLRWESPVVRKIVT 180
Query: 181 ---EENEKENDS--VFRLSNVTGF-----------------------DFCES-------- 240
+ NE+EN S ++++ T F DF S
Sbjct: 181 RKSKRNEEENASSQTHKIASETHFSRRSSSSGVWSESVTNGERSWDVDFETSISTSSRSN 240
Query: 241 ------------------NLCDSPFRFVLQS-SSSPGHRTPELSSPVSSPA----RLDHQ 300
C+SPF FVLQ+ S+ G RTP SSP +SP ++ +
Sbjct: 241 GSDEFAMMMNGQDLSEDKRFCESPFHFVLQTMPSNGGFRTPNFSSPAASPRHDCHEMEKE 300
Query: 301 ANDVESLQKLPAEDEEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNLERSFAIVQK 360
+ +VE L+KL E+EEEEKEQSSPVSVLDPPF+DDDE DD N+ SF VQK
Sbjct: 301 SYEVEKLKKLEMEEEEEEKEQSSPVSVLDPPFQDDDE-----DIHMDDNNIPSSFRSVQK 360
Query: 361 AKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLKEEVEEYEKDIKQ 420
AKH LL+KL RFE+LA LDP+ELE ++D++ +E+E + +++ L ++ +K
Sbjct: 361 AKHLLLQKLCRFEQLAGLDPMELEK-RMSDQETEEEEEEEEEEMKSLYHCEIITQRVLKT 420
Query: 421 HNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETM-KRVYMRPDLWKRVDS 443
+ +E + L+ +L EE + + E + KRV R W+ V+S
Sbjct: 421 YFEE-------MVEVPEGVEALISDLAAEELPSDIDGEAEAAIVAKRVCERLRSWRDVES 480
BLAST of IVF0023206 vs. TAIR 10
Match:
AT2G36420.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G03670.1); Has 10588 Blast hits to 6606 proteins in 440 species: Archae - 8; Bacteria - 365; Metazoa - 4146; Fungi - 1198; Plants - 483; Viruses - 212; Other Eukaryotes - 4176 (source: NCBI BLink). )
HSP 1 Score: 188.7 bits (478), Expect = 9.9e-48
Identity = 174/481 (36.17%), Postives = 253/481 (52.60%), Query Frame = 0
Query: 3 RKQHLHELLKQDQEPFLLSNFINDRRSLL--------KRSS------------------F 62
+K+HLHE L+ DQEPF L+++I + RS + KR S F
Sbjct: 6 KKKHLHEFLEDDQEPFHLNHYIGNLRSQMGCSDMRVKKRKSDNVATFPPGLFSCENSCFF 65
Query: 63 KSHFHLKNPKPISHSPDFQLNFAGALSPVKTPCRSPNPVFFHVPARTAGLLLEAALRIQK 122
+H K+P P SP F+L SP K R VF +PARTA +LL+AA RIQK
Sbjct: 66 AAH---KSPDP-RKSPLFELR-----SPGKKKIRD-GRVFLQIPARTAAILLDAAARIQK 125
Query: 123 QST--AARSKSFGKSNGLGLLGSFLKRLTHRSRSRKREIHGDGRINDPRDGPPLPAKMAI 182
Q + A +K+ + NG G+ GS LK LT+R ++ R + DG +++
Sbjct: 126 QQSEKAKTNKARTRGNGFGMFGSVLKLLTYRI-TKPRLDNADGN------------AVSL 185
Query: 183 EENEKENDSVFRLSNVTGFDFCESNLCDSPFRFVLQSS-SSPGHRTPELSSPVSSPARL- 242
E + S R V D C C+SPF FVLQ++ SS GH+TP +S +SPAR
Sbjct: 186 ERGSEPTSSSRRERIVEISDKC---FCESPFHFVLQTTPSSSGHQTPHFTSTATSPARRS 245
Query: 243 --DHQANDVESLQKLPAED----EEEEKEQSSPVSVLDPPFEDDDEGNFEDGEDEDDYNL 302
D +++ ESL+K+ ++ EEE+KEQ SPVSVLDP E++++ + E + NL
Sbjct: 246 TEDEDSDETESLEKVRGQEEEDKEEEDKEQCSPVSVLDPLEEEEEDEDHHQHEPDPPNNL 305
Query: 303 ERSFAIVQKAKHQLLKKLRRFERLAELDPLELETFLLNDEDQDEDELSDGDDIDHLK--E 362
SF IVQ+AK +LLKKLRRFE+LA LDP+ELE + +ED++E+E + ++ D+++ +
Sbjct: 306 SCSFEIVQRAKRRLLKKLRRFEKLAGLDPVELEGKMSEEEDEEEEEYEESEEDDNIRIYD 365
Query: 363 EVEEYEKDIKQHNKEGNDSSRFQNRPSRDTKILVCNLITEEERNIVAIEKREETMKRVYM 422
EEYE D R SR E+E+ +K +E K+ M
Sbjct: 366 SDEEYE-----------DVDEAMARESR---------CAEDEKR----KKNDERQKKWRM 425
Query: 423 RPDLWKRVDSNA---IDVMVGKDLKEEVDGWNRNKEPRGEIGIEIEVAIFSLLVEEMQSE 443
+ W RV A +D +V KDL+EE W R+ E ++E +IF +L++E E
Sbjct: 426 M-NAW-RVGLGAEEDVDAVVRKDLREEAGEWTRHGGEVEEAVSDLEHSIFFVLIDEFSRE 434
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A5D3DNQ5 | 1.1e-240 | 94.23 | Histone-lysine N-methyltransferase SETD1B-like isoform X2 OS=Cucumis melo var. m... | [more] |
A0A0A0LAR8 | 8.6e-225 | 88.35 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G751450 PE=4 SV=1 | [more] |
A0A6J1CUE0 | 3.6e-154 | 67.52 | uncharacterized protein LOC111014376 OS=Momordica charantia OX=3673 GN=LOC111014... | [more] |
A0A6J1FAX4 | 2.8e-138 | 64.87 | uncharacterized protein LOC111442411 OS=Cucurbita moschata OX=3662 GN=LOC1114424... | [more] |
A0A6J1J5Y5 | 7.0e-134 | 63.03 | uncharacterized protein LOC111481647 OS=Cucurbita maxima OX=3661 GN=LOC111481647... | [more] |
Match Name | E-value | Identity | Description | |
KAA0043909.1 | 3.68e-308 | 94.23 | histone-lysine N-methyltransferase SETD1B-like isoform X2 [Cucumis melo var. mak... | [more] |
XP_011651995.1 | 3.18e-287 | 88.35 | uncharacterized protein LOC105434967 [Cucumis sativus] >KGN59070.1 hypothetical ... | [more] |
XP_038903007.1 | 2.09e-248 | 79.61 | uncharacterized protein LOC120089713 [Benincasa hispida] | [more] |
XP_022144766.1 | 8.61e-195 | 67.74 | uncharacterized protein LOC111014376 [Momordica charantia] | [more] |
KAG6580678.1 | 5.01e-175 | 64.32 | hypothetical protein SDJN03_20680, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
Match Name | E-value | Identity | Description | |
AT5G03670.1 | 2.5e-51 | 34.53 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |
AT2G36420.1 | 9.9e-48 | 36.17 | unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... | [more] |