Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCTCTGCGAAGAAGAAGGAGACGAAGCGGCTGATCGGAACCTGTATCCTCGTAAAACTCTGACCATGGATTCAAATCCGCCCCGTAACCGTTCCAACCAGGTAGCCTATACTCGCCTCGGTTATTATATTCCGCGTGAATGTTCTAAAGCTCAAATGTGCCTTCAAAAGATTCTGCATATAATTTCGTCCGTTCCTCGCCAAGAATCATGCGCGGATTCCCAAATCAGAAAATTAGAATTCGATGGAAGCGTAAGTGGTGGTGGAGGGTTTTGTACCGATTTCAATCTTAGATTCGGCGCGTCCATGGCGGAAGAAGACCACGAATCGGTGGCTTTTGTGGAGAATGGAGGAGGCGGCGGAAATGGTTCGGTAGGCGTCGTGTCTGTTGGTGGTTCGGCGACTGAGAAATGCTCGGAAGGCGACGGTGACGACGACGGCGAGAACGCGAAATGCTCTGAATCTCCTGAAACTAAAACTAACAAATATGAAGTCGAAGAACACATGGAGAAAATTCCAGAACATTCGAAGATCGCCAAATTTCATATTCAAAGAGGTGAAGCTGAAAGCAGCAGATTTAATGAAAACGAAAGCGAAAAGACAGTCGAAGAAGAAGAAGGTAAAAAGGAAAAACTCACGGGGAGTTCCGTTGGTGGAAGCGACGACGGCAAAGTGGAAACGGCACCGTATCAAGCGGTGGAATGCAAGAGCGGTGGCGATTGTCTAGGTTTGCTCATCGAAGCGGCGAGACTGATACTGGGAGACATCGGTGAGAACGAGTTTGAGACAGAGTCAACTCCCGAGGACGGCGAGTCGAACAGCGAGTTAGATGCCAAAGAACCGAGTCAACTCGAGAAGGTAATTTCAGAGTCGCATTCAAGCGGGTCGAAGAGGGAGCTCGAAGGAAGTTGGATGGTGATGAATTTAGTTAGAGATATCGACGACAGGTCGCCATTAGTAAGGTCAAAGCGAGGAAGAAGCCAGGTCTTACCGTGTCGTTACAAAGACTCTGTTCTTGAGCCATGGCGATCTCAGCCATTGCCGAGCAAGGTCAAGGTCTCGAGAAGACAACGGCGATCGAGGTACCCCACTTAG
mRNA sequence
ATGCTCTGCGAAGAAGAAGGAGACGAAGCGGCTGATCGGAACCTGTATCCTCGTAAAACTCTGACCATGGATTCAAATCCGCCCCGTAACCGTTCCAACCAGGTAGCCTATACTCGCCTCGGTTATTATATTCCGCGTGAATGTTCTAAAGCTCAAATGTGCCTTCAAAAGATTCTGCATATAATTTCGTCCGTTCCTCGCCAAGAATCATGCGCGGATTCCCAAATCAGAAAATTAGAATTCGATGGAAGCGTAAGTGGTGGTGGAGGGTTTTGTACCGATTTCAATCTTAGATTCGGCGCGTCCATGGCGGAAGAAGACCACGAATCGGTGGCTTTTGTGGAGAATGGAGGAGGCGGCGGAAATGGTTCGGTAGGCGTCGTGTCTGTTGGTGGTTCGGCGACTGAGAAATGCTCGGAAGGCGACGGTGACGACGACGGCGAGAACGCGAAATGCTCTGAATCTCCTGAAACTAAAACTAACAAATATGAAGTCGAAGAACACATGGAGAAAATTCCAGAACATTCGAAGATCGCCAAATTTCATATTCAAAGAGGTGAAGCTGAAAGCAGCAGATTTAATGAAAACGAAAGCGAAAAGACAGTCGAAGAAGAAGAAGGTAAAAAGGAAAAACTCACGGGGAGTTCCGTTGGTGGAAGCGACGACGGCAAAGTGGAAACGGCACCGTATCAAGCGGTGGAATGCAAGAGCGGTGGCGATTGTCTAGGTTTGCTCATCGAAGCGGCGAGACTGATACTGGGAGACATCGGTGAGAACGAGTTTGAGACAGAGTCAACTCCCGAGGACGGCGAGTCGAACAGCGAGTTAGATGCCAAAGAACCGAGTCAACTCGAGAAGGTAATTTCAGAGTCGCATTCAAGCGGGTCGAAGAGGGAGCTCGAAGGAAGTTGGATGGTGATGAATTTAGTTAGAGATATCGACGACAGGTCGCCATTAGTAAGGTCAAAGCGAGGAAGAAGCCAGGTCTTACCGTGTCGTTACAAAGACTCTGTTCTTGAGCCATGGCGATCTCAGCCATTGCCGAGCAAGGTCAAGGTCTCGAGAAGACAACGGCGATCGAGGTACCCCACTTAG
Coding sequence (CDS)
ATGCTCTGCGAAGAAGAAGGAGACGAAGCGGCTGATCGGAACCTGTATCCTCGTAAAACTCTGACCATGGATTCAAATCCGCCCCGTAACCGTTCCAACCAGGTAGCCTATACTCGCCTCGGTTATTATATTCCGCGTGAATGTTCTAAAGCTCAAATGTGCCTTCAAAAGATTCTGCATATAATTTCGTCCGTTCCTCGCCAAGAATCATGCGCGGATTCCCAAATCAGAAAATTAGAATTCGATGGAAGCGTAAGTGGTGGTGGAGGGTTTTGTACCGATTTCAATCTTAGATTCGGCGCGTCCATGGCGGAAGAAGACCACGAATCGGTGGCTTTTGTGGAGAATGGAGGAGGCGGCGGAAATGGTTCGGTAGGCGTCGTGTCTGTTGGTGGTTCGGCGACTGAGAAATGCTCGGAAGGCGACGGTGACGACGACGGCGAGAACGCGAAATGCTCTGAATCTCCTGAAACTAAAACTAACAAATATGAAGTCGAAGAACACATGGAGAAAATTCCAGAACATTCGAAGATCGCCAAATTTCATATTCAAAGAGGTGAAGCTGAAAGCAGCAGATTTAATGAAAACGAAAGCGAAAAGACAGTCGAAGAAGAAGAAGGTAAAAAGGAAAAACTCACGGGGAGTTCCGTTGGTGGAAGCGACGACGGCAAAGTGGAAACGGCACCGTATCAAGCGGTGGAATGCAAGAGCGGTGGCGATTGTCTAGGTTTGCTCATCGAAGCGGCGAGACTGATACTGGGAGACATCGGTGAGAACGAGTTTGAGACAGAGTCAACTCCCGAGGACGGCGAGTCGAACAGCGAGTTAGATGCCAAAGAACCGAGTCAACTCGAGAAGGTAATTTCAGAGTCGCATTCAAGCGGGTCGAAGAGGGAGCTCGAAGGAAGTTGGATGGTGATGAATTTAGTTAGAGATATCGACGACAGGTCGCCATTAGTAAGGTCAAAGCGAGGAAGAAGCCAGGTCTTACCGTGTCGTTACAAAGACTCTGTTCTTGAGCCATGGCGATCTCAGCCATTGCCGAGCAAGGTCAAGGTCTCGAGAAGACAACGGCGATCGAGGTACCCCACTTAG
Protein sequence
MLCEEEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISSVPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNGSVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSRYPT
Homology
BLAST of Sgr023301 vs. NCBI nr
Match:
XP_022140313.1 (uncharacterized protein LOC111011014 isoform X1 [Momordica charantia])
HSP 1 Score: 526.6 bits (1355), Expect = 1.8e-145
Identity = 295/364 (81.04%), Postives = 312/364 (85.71%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEG+EAADRNLYP +T +MDSN RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4 EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
VPRQESCADSQI KLEFDG+VSG GGFC DFN R GAS EED ESVA VEN GGGGNG
Sbjct: 64 VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
V VVSVGGS TEKC +G DGENAK SESP T+T+KYEVEEHMEK EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183
Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
HIQRG+ RF+ENESEKT EEEGKKEKLTGSS GGSDDG VETAP++ + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243
Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
LGLLIEAARLILGD GENEFETEST E ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303
Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 364
EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 362
BLAST of Sgr023301 vs. NCBI nr
Match:
XP_022140314.1 (uncharacterized protein LOC111011014 isoform X2 [Momordica charantia])
HSP 1 Score: 522.7 bits (1345), Expect = 2.5e-144
Identity = 293/361 (81.16%), Postives = 310/361 (85.87%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEG+EAADRNLYP +T +MDSN RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4 EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
VPRQESCADSQI KLEFDG+VSG GGFC DFN R GAS EED ESVA VEN GGGGNG
Sbjct: 64 VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
V VVSVGGS TEKC +G DGENAK SESP T+T+KYEVEEHMEK EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183
Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
HIQRG+ RF+ENESEKT EEEGKKEKLTGSS GGSDDG VETAP++ + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243
Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
LGLLIEAARLILGD GENEFETEST E ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303
Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 362
EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 359
BLAST of Sgr023301 vs. NCBI nr
Match:
XP_022985998.1 (uncharacterized protein LOC111483864 isoform X2 [Cucurbita maxima])
HSP 1 Score: 517.3 bits (1331), Expect = 1.1e-142
Identity = 281/361 (77.84%), Postives = 308/361 (85.32%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEGDEAADR LYP KTL M+S+P NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4 EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA EED ESVAFVEN GGGG+G
Sbjct: 64 VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
SV V V GSATEKCS G DGENAKCSESP +T KYEVEEH +IPE S+IA+FH
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183
Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV SDD K ET P++ +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243
Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303
Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSRYP 364
WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSRY
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSRYR 360
BLAST of Sgr023301 vs. NCBI nr
Match:
XP_022985999.1 (uncharacterized protein LOC111483864 isoform X3 [Cucurbita maxima])
HSP 1 Score: 513.5 bits (1321), Expect = 1.5e-141
Identity = 279/358 (77.93%), Postives = 306/358 (85.47%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEGDEAADR LYP KTL M+S+P NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4 EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA EED ESVAFVEN GGGG+G
Sbjct: 64 VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
SV V V GSATEKCS G DGENAKCSESP +T KYEVEEH +IPE S+IA+FH
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183
Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV SDD K ET P++ +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243
Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303
Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSR 362
WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSR
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSR 357
BLAST of Sgr023301 vs. NCBI nr
Match:
XP_022985997.1 (uncharacterized protein LOC111483864 isoform X1 [Cucurbita maxima])
HSP 1 Score: 512.3 bits (1318), Expect = 3.4e-141
Identity = 278/357 (77.87%), Postives = 305/357 (85.43%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEGDEAADR LYP KTL M+S+P NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4 EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA EED ESVAFVEN GGGG+G
Sbjct: 64 VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
SV V V GSATEKCS G DGENAKCSESP +T KYEVEEH +IPE S+IA+FH
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183
Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV SDD K ET P++ +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243
Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303
Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 361
WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRS
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRS 356
BLAST of Sgr023301 vs. ExPASy TrEMBL
Match:
A0A6J1CHQ9 (uncharacterized protein LOC111011014 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111011014 PE=4 SV=1)
HSP 1 Score: 526.6 bits (1355), Expect = 8.5e-146
Identity = 295/364 (81.04%), Postives = 312/364 (85.71%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEG+EAADRNLYP +T +MDSN RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4 EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
VPRQESCADSQI KLEFDG+VSG GGFC DFN R GAS EED ESVA VEN GGGGNG
Sbjct: 64 VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
V VVSVGGS TEKC +G DGENAK SESP T+T+KYEVEEHMEK EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183
Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
HIQRG+ RF+ENESEKT EEEGKKEKLTGSS GGSDDG VETAP++ + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243
Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
LGLLIEAARLILGD GENEFETEST E ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303
Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 364
EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 362
BLAST of Sgr023301 vs. ExPASy TrEMBL
Match:
A0A6J1CFC4 (uncharacterized protein LOC111011014 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111011014 PE=4 SV=1)
HSP 1 Score: 522.7 bits (1345), Expect = 1.2e-144
Identity = 293/361 (81.16%), Postives = 310/361 (85.87%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEG+EAADRNLYP +T +MDSN RSNQV +TRLGY IPRECSKAQMCLQKILHIISS
Sbjct: 4 EEGEEAADRNLYPCETPSMDSNLLCKRSNQVGHTRLGYCIPRECSKAQMCLQKILHIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSG-GGGFCTDFNLRFGASMAEEDHESVAFVENGGGGGNG 124
VPRQESCADSQI KLEFDG+VSG GGFC DFN R GAS EED ESVA VEN GGGGNG
Sbjct: 64 VPRQESCADSQIGKLEFDGNVSGDAGGFCPDFNPRVGASAEEEDQESVALVENEGGGGNG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESP--ETKTNKYEVEEHMEKIPEHSKIAKF 184
V VVSVGGS TEKC +G DGENAK SESP T+T+KYEVEEHMEK EHS IA+F
Sbjct: 124 LVDVVSVGGSVTEKCLDG----DGENAKFSESPATATETDKYEVEEHMEKTLEHSNIAEF 183
Query: 185 HIQRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDC 244
HIQRG+ RF+ENESEKT EEEGKKEKLTGSS GGSDDG VETAP++ + KSGGDC
Sbjct: 184 HIQRGDDRIGRFDENESEKTA-EEEGKKEKLTGSSDGGSDDGIVETAPFRGAKXKSGGDC 243
Query: 245 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESH-SSGSKREL 304
LGLLIEAARLILGD GENEFETEST E ESNSELDAKEPSQLE+VISESH SSGSKR+L
Sbjct: 244 LGLLIEAARLILGDFGENEFETESTHEXHESNSELDAKEPSQLEQVISESHSSSGSKRKL 303
Query: 305 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 362
EG+WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQ LPSKVKVSRRQRRS
Sbjct: 304 EGNWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQSLPSKVKVSRRQRRS 359
BLAST of Sgr023301 vs. ExPASy TrEMBL
Match:
A0A6J1JF75 (uncharacterized protein LOC111483864 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111483864 PE=4 SV=1)
HSP 1 Score: 517.3 bits (1331), Expect = 5.2e-143
Identity = 281/361 (77.84%), Postives = 308/361 (85.32%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEGDEAADR LYP KTL M+S+P NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4 EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA EED ESVAFVEN GGGG+G
Sbjct: 64 VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
SV V V GSATEKCS G DGENAKCSESP +T KYEVEEH +IPE S+IA+FH
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183
Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV SDD K ET P++ +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243
Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303
Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSRYP 364
WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSRY
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSRYR 360
BLAST of Sgr023301 vs. ExPASy TrEMBL
Match:
A0A6J1JEV5 (uncharacterized protein LOC111483864 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111483864 PE=4 SV=1)
HSP 1 Score: 513.5 bits (1321), Expect = 7.4e-142
Identity = 279/358 (77.93%), Postives = 306/358 (85.47%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEGDEAADR LYP KTL M+S+P NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4 EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA EED ESVAFVEN GGGG+G
Sbjct: 64 VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
SV V V GSATEKCS G DGENAKCSESP +T KYEVEEH +IPE S+IA+FH
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183
Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV SDD K ET P++ +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243
Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303
Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRSR 362
WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRSR
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRSR 357
BLAST of Sgr023301 vs. ExPASy TrEMBL
Match:
A0A6J1J6D7 (uncharacterized protein LOC111483864 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111483864 PE=4 SV=1)
HSP 1 Score: 512.3 bits (1318), Expect = 1.7e-141
Identity = 278/357 (77.87%), Postives = 305/357 (85.43%), Query Frame = 0
Query: 5 EEGDEAADRNLYPRKTLTMDSNPPRNRSNQVAYTRLGYYIPRECSKAQMCLQKILHIISS 64
EEGDEAADR LYP KTL M+S+P NRSNQV Y+R G+YIPRECSKAQMCLQKIL+IISS
Sbjct: 4 EEGDEAADRKLYPCKTLNMNSSPLCNRSNQVGYSRTGHYIPRECSKAQMCLQKILYIISS 63
Query: 65 VPRQESCADSQIRKLEFDGSVSGGGGFCTDFNLRFGASMAEEDHESVAFVEN-GGGGGNG 124
VPRQESCAD QI KL+ + +VSGGGGF TDFNLRFGA EED ESVAFVEN GGGG+G
Sbjct: 64 VPRQESCADLQIGKLDLNRNVSGGGGFRTDFNLRFGAFTEEEDQESVAFVENREGGGGSG 123
Query: 125 SVGVVSVGGSATEKCSEGDGDDDGENAKCSESPETKTNKYEVEEHMEKIPEHSKIAKFHI 184
SV V V GSATEKCS G DGENAKCSESP +T KYEVEEH +IPE S+IA+FH
Sbjct: 124 SVDAVPVTGSATEKCSVG----DGENAKCSESPVAETEKYEVEEHAMEIPERSQIAEFHT 183
Query: 185 QRGEAESSRFNENESEKTVEEEEGKKEKLTGSSVGGSDDGKVETAPYQAVECKSGGDCLG 244
QRG+AE+ R +EN+SEKTVEEEEGK+ K TGSSV SDD K ET P++ +C+SGGDCLG
Sbjct: 184 QRGDAENIRLDENKSEKTVEEEEGKELKPTGSSVDRSDDMKAETVPFRGAKCQSGGDCLG 243
Query: 245 LLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHSSGSKRELEGS 304
LLIEAARLI GDIGENEF+TEST E+ ESNSELD K+PSQLEKVISESHSS KR+LEG+
Sbjct: 244 LLIEAARLIFGDIGENEFDTESTQEENESNSELDIKDPSQLEKVISESHSSEPKRKLEGN 303
Query: 305 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLPSKVKVSRRQRRS 361
WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPL SKVKVSRRQRRS
Sbjct: 304 WMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWRSQPLSSKVKVSRRQRRS 356
BLAST of Sgr023301 vs. TAIR 10
Match:
AT4G27910.1 (SET domain protein 16 )
HSP 1 Score: 42.7 bits (99), Expect = 7.2e-04
Identity = 36/103 (34.95%), Postives = 50/103 (48.54%), Query Frame = 0
Query: 242 LGLLIEAARLILGDIGENEFETESTPEDGESNSELDAKEPSQLEKVISESHS-SGSKREL 301
L LL E A I+ G N F E +P ++E+ +S+ S SG+ R+
Sbjct: 41 LNLLGEIAAGIVPGNGRNGFSASWCTE---------VTKPVEVEESLSKRRSDSGTVRDS 100
Query: 302 EGSWMVMNLVRDIDDRSPLVRSKRGRSQVLPCRYKDSVLEPWR 344
+ + R PLVR+ RGR QVLP R+ DSVL+ WR
Sbjct: 101 PPAEV---------SRPPLVRTSRGRIQVLPSRFNDSVLDNWR 125
BLAST of Sgr023301 vs. TAIR 10
Match:
AT5G53430.1 (SET domain group 29 )
HSP 1 Score: 42.4 bits (98), Expect = 9.4e-04
Identity = 18/28 (64.29%), Postives = 22/28 (78.57%), Query Frame = 0
Query: 316 RSPLVRSKRGRSQVLPCRYKDSVLEPWR 344
R PLV++ RGR QVLP R+ DSV+E WR
Sbjct: 95 RPPLVKTSRGRVQVLPSRFNDSVIENWR 122
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_022140313.1 | 1.8e-145 | 81.04 | uncharacterized protein LOC111011014 isoform X1 [Momordica charantia] | [more] |
XP_022140314.1 | 2.5e-144 | 81.16 | uncharacterized protein LOC111011014 isoform X2 [Momordica charantia] | [more] |
XP_022985998.1 | 1.1e-142 | 77.84 | uncharacterized protein LOC111483864 isoform X2 [Cucurbita maxima] | [more] |
XP_022985999.1 | 1.5e-141 | 77.93 | uncharacterized protein LOC111483864 isoform X3 [Cucurbita maxima] | [more] |
XP_022985997.1 | 3.4e-141 | 77.87 | uncharacterized protein LOC111483864 isoform X1 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1CHQ9 | 8.5e-146 | 81.04 | uncharacterized protein LOC111011014 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1CFC4 | 1.2e-144 | 81.16 | uncharacterized protein LOC111011014 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1JF75 | 5.2e-143 | 77.84 | uncharacterized protein LOC111483864 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1JEV5 | 7.4e-142 | 77.93 | uncharacterized protein LOC111483864 isoform X3 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1J6D7 | 1.7e-141 | 77.87 | uncharacterized protein LOC111483864 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |