Carg28094 (gene) Silver-seed gourd (SMH-JMG-627) v2

Overview
Name: Carg28094
Type: gene
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: protein SET DOMAIN GROUP 41 isoform X1
Location: Carg_Chr09: 8904508 .. 8913576 (-)
RNA-Seq Expression: Carg28094
Synteny: Carg28094
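
The Location value can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF convention, which this page does not state); `parse_location` and `span_length` are hypothetical helper names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str   # chromosome or scaffold name
    start: int   # leftmost coordinate (assumed 1-based, inclusive)
    end: int     # rightmost coordinate (assumed inclusive)
    strand: str  # "+" or "-"


def parse_location(text: str) -> Location:
    """Parse a string such as 'Carg_Chr09: 8904508 .. 8913576 (-)'."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the genomic span, assuming inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("Carg_Chr09: 8904508 .. 8913576 (-)")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that assumption the gene spans 9069 bp on the minus strand of Carg_Chr09.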
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCTCCTCTCACCGCCGCTCTCCACGATGCCTTCTTCCTCACTCACTGCTCCTCCTGTTTCTCCACTCTCCCAAATTCCTCAATTTCTCACTCTAATCTCCTCCGCTACTGCTCCCCTATATGCTCCCGTTCCGATTCCCTCACCGCCGCCGTCTTCTCCACCGATCACTTCCCCTTCTCCGACACATCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCTTGGCGCTCTGCTCCTCCCGAGCGTATCTTTGGCCTTCTCACCAATCGGGAGAAATTGATGCTTGCTGAAGACGATTCCGAGGTTTTCGTCAAGATTCGGAAAGGGGCCGACGCCATGGCCGCTTCCAGACGGACGAACTCTGCCGATATTCGCTATGACAACGCCTTGGAAGAGGCTATCCTGTGCCTCGTATTGACCAACGCCGTCGAGGTTCAGGATTCGGTTGGCCAAACCATTGGGATTGCTGTGTACCATCCAACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCCGTCGGATTCCATCAATACGAGGCTACGGATTTCCCCCTTCTGTACTGACATTGGGACTGGTGAAGGAAGTTGTAATCAAGTAATTGTTTCAACTACGAATGATTTTCCTTCTCTTTATGGGCGTTTCTCTCCCTTTCTCGTGAAATTAGTGAGAGCTTTTGCATGATTGTTTATGGCAGATGAGTACTGTTCGTAGAAACTTTTCGCATTTCATTACAAAAGGTGTGTCTCTTCACGTTTGCAATCAATTAAGTGCTGTTGTTGTTGTCCCTGTTTGTATATATTCATGTGAATTTTGATGAAGATTTTCAGGGTTATGGTCCAAGAGTCATGGTTAGGAGTATAAAGAGTATGAGGAAAGGAGAAGCAGTCACGATTGCATACTGTGACTTGTTGCAACCTAAGGTATTATATATTATTTTTGTTATCTTCCCTTCATAATATTTCACAAGTAACTCACGTATGGAACAAAATGGTATGCAAAGTTTGCCTTTTTGAAACAAAAATCCATCTAACAACGAGTAATTATCCACAATGAGTCTTTTTTCACAAGATTCAACAAAAGGAGCAATGTCATGTTGGTACAAATCCTTTATGTGTTCAAAACCCAACAACTTAGCATTCAAAGTATCGAGGAGGACATACCTTCGTGATAAAAGCATCTGCAACAATGTTCTCCTTACCTTGTTTATATTTTATGATGTGAGAGAATGTTTCAATAAATTCCAACCACTTAGCATGTGGTCTATTGAGTTTATTTTGTGCTCTCAAATGCTTTAAACTTTCATGATCCGTATGAATAATGAACTCCTCAGGCTAAAGATAATGTTGCCAGGTTTGCATTGCACGCAACAAAGAATAAAGCTCTTTGTCATAAATTGGATACCTCAAAGATGCACCTGTCAACTTCTCATAAGTTCTACATAAGCATGCCAAATTTCATTATGACCTTGGTATATATTTCAACTCAAAGATTTGCAGAGTAAATGAATGTGGTGCTTGTGGAATATCGTTGTTAATTTCATCTTCTTGTTGAGGGATTATGCTGAAATGACTAGCTATAGATTTGTGAGAAATTTGTTGATTTTGTCTCAGGCAGTGAGGCAGTCAGAGTTGCTGTCAAGATATAAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGTAAGAAAAACTATCAAAATATATACATAAACTTGCTTTTGTTTTGGAAATTTGGTTATGAATTAAAATTTTTCTTTTAACAAAGATTAAAACAATGTAAAGAACATGTGAGAAAACAAGCACATTTTTCAAAAACCAAAAATCAAAAGCTAAAAAACTATGTAGTTATCCTTAGCGATTAGG
AGGAGTTTGGTATCACTTTCAAGATGTTTATGTAAGGTATCTCCTTGTAGTGATATGCGAAGGAAGAAAAGCCTTAAGGCAATATAAAATAATCCCTACCTATTTGACAGGGTTGGTAATCACTCCTTCCTCAGCTTCTTTGCTAAGAAATACGATTCGAAAAACATATAAAGAAAGCTATCGTCCTTTCCAAACAAGTATACTACTCAACATTCCCCCAATGGTCCCCTTTCCTTGTGTAACAAACTCGTAAACTGCCCTCATGTGACTAACTTGCAGGCTCCGTGTCTTTCTTGCCCCTCCTAGTGTACTAATTAGTAATTGGTGGCCTAATGTCACCCGCCGGGTCGGAGACATTGTTGAGTTGCTCCCATGTTGCTCCCAAGTTGCTTCATGATCTAGCAAGTTCTTCCAACTGACTAATAACTTAGTCATGCCCGTGGTGCTATTCTTTTGATAACTGAAAATTTCCTCACGGTCAACTTTCCATTCAAAATCACTGGTCTACACAAGCAGCTTGGGCTGTACTTGATGGTTTTGTTCCCAGAGCCTTCTCATCTGAGAATCATGGAATACTTGATGTATTGTTGCCCCAAGTGGAAGTTGGAGTTTATAAGCCACTGTCCCAATCTTGTATTCGATGAAATAAGGTTCAAAAAACTTCAGGGCTAACTTCTCATTACATTTCTTTGCTAGTGATGCCTGTCTATAAGGACGAATTTTGAAGAAAATCCAATCACCTACTGCAAATTCCACTTCCCTTTGTTTCATCCGAGGAGTTGTTAGGTGTTTTTTTGGATTCCATACAATAGCTAAGGAGGAGGTGCCCTGCCATTCACCACTTTGAAGATATGGAACGTGGTGTTATGCCTGCCCAACACAACTATTGTATAACGTTCACCATGAAAACACCTTAGGTAATTTTCTACACATTTATTTTATTTATTTTCCTTTGTTCATTTGCCTGAGGGTGGTATGGAGTGCTTTGCCAGAGTTGAGTCCCCTGTATCTGAAGAAGTTCAGTCCAGAAATGACTGATAAAGATCTTATCATAGTCTGAAATGATTGAGTTTGGAAACCTATGTAACTCTACTACCTTTCTAATAAATAGGGATGCTTATGTCTTAGCTGTAAATCGATGTTTGATGGCAAAAAAGTGAGCATATTTACCGAAATGGTCCACGACTACTAAGATGGTATCATGTCCTTCTGATCGTAGTAATCCTTCAATGAACCCCATAGCTGATTTGTTTCGTTGACAGACTACACAATCCTTCAATTAACCGCATGCCTTAACAATACAACTCACTAGTCAAATGTTTGTAGGACTGTAGGAAACCAGAGTGTCCTTCAATTACAGAGTCATGGTACATGTGTAGAGTAGATGGGATCAAAGTGGAAGTGTTTGATAATGGTAACCTCCCCTTGCATTTCAATGTTCCTTGATGAAAGGAGAATTTGGATAGCTTTTCCCTTTCGTCTTGCAACTGTTGGATGATCTTGCCGAGCTCGGGGTCATTTGCAACCTCTTCTTTAATCGTATCTATATCCAATATGGCAGGGGACAAATGGGTGAGACTCACTAATACGAGCTGAGATTTAGTGTTGGGAACTCAGTGGGTTGCATACGATCATGAAGAGGCCCAATTGGTGAAGGGACCGGAACAACCCTAAGCTGATGCAGGTAAGCAACCACTAAAATGGCCTGGTGTGTTAACCCTAAGGGGAACTCATCTTCATCAAAGGATAATAGCTAGGTGGCGAACTTGATCCCCACATGTACCATCACGATACAGAGTGAAGCAATGTAGCAATAGATACGTCAAGAATAGCCAACGAAAAGATTGCCGAAATTGAAATTTTTAGCTCAAAAGAAAAAGGACCTAGTTTTCAGTCGCATTAGTCATGCTAGGGTATTGTAAGTACTAAGCCTTTGCCATGGTATCAGAGCCAAATTAGTGAGGGAAGAGGAGTATTCAAATCTCTAAGAATCAAAGCCA
AATTAGTGAGGGAAGAGGAGTATTCAATCTCTAAGAATCAAGGTTGAATGCAGGAAGAAGATGAGAAATTAAGAGGAATTTGGAGTATGTGATGACTGGGAGAATAACCCTTGTGTTTCAGATGGTAAGACGCGGGAAATGCAGTAAGTCCATTGACAAGTTTGGGTGCTTAATCATACAAAAGATTCCAAAGAAAATTTTATATCAACAACCTGACATCTTTATTAAAGACACTCTAATAATGAAGAAAATGACAAACGAAGATTGACAACTTGTCGGCCACACAATGAAGGCTCACATGTGAATTATTAATCTTTTGTCTGGGTGAATTATTAGTCTTTTGTCTTTATGAATTATTAGTTTGCTTTAGGTCTTTTCTTAATTTTAGACTTTTAGAAAAACAAATCTGAAGTCTATAAAAAGACTTCTTGAAGGTATAGAGAGGCATTGGTCGATAGATATCTCTAAAATCAATCAGAAACCAAGAGAAGAAACCTTTCTACTCCACTTATCTTTTTTATTTTCGACATTTTAATGAACGAGTTCTTAAAAAAGTTTACCAAATCAAATCGTCTCAAATGTTCTAACCAAGCAGAAGAAAGTTCAGATGGGAGAGGGAAACCTCCACCATCTTGAATACTGTCAATCTAGAATACCATACTTCAAGAATTGAAGAAAAACTTCAAAATTGGACTCTTCCGAAGGTAGATCCTAATACTATTTGCCAATTTTCAACTTCAATTTTGCTCAAAGATCATGCATCAAATTTTCGGAGAAAAGTATCTCAATTCATAGCAATAGGGAATCTCTGAATCTTTCTTATACAACACCGTGAAGAGTTTAATTATCTTCACGTCGGTCTTGTTCAAGTTGCGATAAAACCGTTATTCAGACTTGGGCTAGATAGCCCTGTCCTTATCTCCTTTCGAGATAAGAGACATGAAGATTTCTCAAATTCCTTCCTGGGAATGGTTCAATCTAATTTAGAGAATGTACCTGTCTATTTCAATTGTTATCTAAATTTTACTTTGTCACTCAAAGATCCGCATATATTATCATCCCTTATGTTGGACCTATAATAAAAAAAACTTGAACATCAAGGCTGAAACACATTCTTCAGTCGTTATATTCAGAGTTTATTACAAGTTCATGAATACAAATATCTCTCCAAGAGCATTAAGATCCTCACCAAAAGGATCAACTATGCTCATAGAAGCAAATATTGGAAAGTCGTGACTGTTCCAAAACCCTCCCTTAGGATCAAATAACTAAAAATAATCTCTGGAAAATAGAAGATGCCCATTTTTTCAAAAGAAAAGAGTCTCGAAACCCTGTTCAAATCATCGAACATGATAACGGAAGCGTTGAAATAAAGTTCAGCGAAGAACCTTCGTCAAATCCAAAAGTGAAAAAATTCCTAAGTTCGAGACCGAGTATTTCAGGAATTTCAAGCTCAATATATGACCCTTTAAAAGTAAAGGATGTCAACTACGATCAAAGAAGAGCCTCGATCCACTATGAAGATGGCTCAAGATCTCCAACTCATACTGATATGGATACTCAATCTGTCTACAAAAGTCGCTAAACGTCATTAGATTGAATGATTGAACCATTCCCAGTAAGGAATTTGAGAAATCTTCATGTCTCTTGTCTTGAAAGGAGATAAAGACGGGGATATCTTCAGAGATTCCCTATTGCTATGAATTGGGATATTTTTCTCCGAGAATTTGATGCATGATCTTTGAGTAAAATTGAAGGTTGAAAATTGGTAAATAGTTTTAGTATCTACTTTCGGAATAGTCCAATTTGAAGTTTTTCTTCAATTCTGGCCATATGGTATTCTTGATTGACAGTATGGTGAGGCCTTTGTGCCCACTCTGTGTCCATCTGAACTTTTTTCTGCTTGGTTAGAAGATTTGAGACAATTTTATTTGGGGAACATTTCTAAGAACTTGTTCATTAAAGTGTCAAAAATAAGAAAGATAAGAGGAGTAGAAAGG
TTTCTTCTCTTGATTTCTGATTAAGTTTAGGGAGACCTATCGAACAATGCCTCTCTATACATTCAAGAAGTCTTTTTATAGACTTCAGATTTGAAAATAAACGTAAACTAAGAGATAAAGATTAAACCTAGCAGGATAGGACCCCTGCAAGATTTGTTTTTCTAAAAGTCTTAAAGTAAGAAAAAACCTAAAACAAACTAATAATTTACCGAGACAAAAGACTAATAATTCACAAAGACGAGATTAATAATTCACATGTGAGCCTTTACTGTGTGGCCTGTTGTCAATCTTCGTTTGTCATTTTCTTCTTTATTGAAGTGTCTTTAATAAAAATGTCAGATTGTTGATATAAAATTTTCTTTGAAATCTTTCGTATGATTGAGTACCCAAACTTGCCAACGACTTACTGCACTTCCCGCGTCTTACCATATGGAACACAAGGGTTATTCTCTCAGCTATCACATACTCCAAATTCCTCTAAATTTCTCTTATCTTCTTCCTACATTCAACCTTGATTCTTAGAGATTGAATACTCCTCTTCCTTTCACTAATTTGGCTCTGATACCACAGCGAAGGCTTAGTATTTACAATACCCTAGCATAACTAATGCGATTGAACATGACTTAAACTTGCCAACGACTTACTGCACTTCCCGCGTCTTACCATATGGAACACAAGGGTTATTCTCTCAGCTATCACATACTCCAAATTCCTCTAAATTTCTCTTATCTTCTTCCTACATTCAACCTTGATTCTTAGAGATTGAATACTCCTCTTCCTTTCACTAATTTGGCTCTGATACCACAGCGAAAGCTTGATACCACTGCATAACAATGCGATTGAACATGACTTTAAACTTGCCCAACGACTTACTGCACTTCCGCGTCTTACCATATGGAACACAAGGGTTATTGCTATCACATACTCCAAATTCCTCTAATTTCTCTAGTTCTTTCCTACATTCAACCTAATTTTAGAGATTGGTACTCCTCTTCTTTCACTTATGCTCTGATACCACAGCGAAGGCTTAGTGATACATACCCTAGCATAACTAATGCAAATGAACATGACAAAACTGCCAACGACTTACTGCATTTCCCGCGTCTTACCATCTGAAACACAAGGGTTATTCTCCCAGTCATCACATACTCCAAATTCCTCTGAATTTCTCATCTTCTTCCTGCATTCAACCTTGATTCTTAGAGACTGAATACTCCTCTTCCCTCACTAATTTGGCTCTGATACCATGGCAAAGGCTTAGTACTTACAATACCCTAGCATGACTAATGCGATTGACATGAACTTGCTCTATACAGACCATCTTTGATATCCTATTGCGCTTCCCGCTTCTTACCACTTGAGACGTAGTATTTACATTACTCCTTAATCTTACAGATTTGAAATACCCAGCGTGTCAGCTATCACATACTCCAAATTCCTCTGAATTTCTCCTATCTTCTTCCTTCAACCTTGATTTTTAGAGATTGAATACTCCCAAAGCTTGCCATGAAATTGTTCCATTATACTTTGTCTTAAGACCATCTTTGATCCTTCTTGAGAAGTGTTTTAGCAGTCCATTGAAAAAATTCTCCTTCCAGAATTCCAGCGAATTGCTTGTCCTAATGAGGACTTTGCTGAAGTACATGGCTTTGACCACCTAAAATCGCCTAAGGTTATATAGCATGCAAGCAAAAAGACGACAACAACGGCTTAAACAGTGTATAACTTTTCTGATAGGGAGAATTCGACAACTGGTCGCATCTCCCTTATAGTACATTCTGAGGCATTTCATGTCTACCAGACATAGTCATGATTTCGCCGAAACGTTACTCTTAGAAAAAACAAGGAAATAATACAACATGAAAGCAGCAAAAGAAAGCGATAAAGTAGTACAAGGCAAGAGTTTCAGCGTGTAACGGGCATGCGGCCATGGGCAACACGCTTTGAACAAGCATGACCATGACACTAAACAAATATATGAAAAACATTAAAAGAGCGCT
ATCGTCCTTCCCTAACACGTATACCGTGCAACATTCCCTCAGTGGTCTCATAATAATTGGTGGGTGATTCGATTTCTTGTGATTTCTGTTGTTCTTTCGAATAGTGTCGTAGTAAAATGATATGCATTTTAAATTTCTTCAGGAAATCTCTGCTTTCAATGTGGAATTGCTTGATTCAACTTCGATTAGCAACTTTGATTATGACACTGCAATGAGAAGAATAGATGATTATGTTAACAATGCCATCGCTGAGTACCTGTCTATTGGTTCCCCTGAATCATGTTGTGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTACAACGAGCAAGCAGAAGACGGGGACGGAAAACAGCTTCTTAACTTAAGGCTGCATCCCGTGCACTTCCTGTTGCTGAACGCGTACACTGCTCTAGCATCGGCTTACAAAGTCCGTTCATGGAATGACGATGAAAATCAATGCAACGCTACGATGAGCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTAATGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTCATTCTTGTTAAACACAGCTCATTATGGGGCTCCAACACTTCAAAATCGAGCTCCCCTATGGGCGAAATAACGTGTTTAAACTGCTCATGGGTCGATAAGTTCAATACGAATAGAATACATGGTCGATCTATAGAAGTAGATTTTCGGGAGTTTTCTATTGGTATTTCTAATTGCATTGCTGATATTTCACACAAATATTGGAGCTTTCTGGCTCATGAATGCTCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCGAAGACGATCACGACATGTTCGAATTACCGTGATCGTTCGTGTGATTGTAGTAAAATTCAAGATGTTTCTGAGCAAGACAGGCAATCTATCTTTGAGCTTGGTATCCATTGCTTATTCTATGGAGGTTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCCCAGATTGAATGTATTTTACATGACATGAACTGA

mRNA sequence

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCTCCTCTCACCGCCGCTCTCCACGATGCCTTCTTCCTCACTCACTGCTCCTCCTGTTTCTCCACTCTCCCAAATTCCTCAATTTCTCACTCTAATCTCCTCCGCTACTGCTCCCCTATATGCTCCCGTTCCGATTCCCTCACCGCCGCCGTCTTCTCCACCGATCACTTCCCCTTCTCCGACACATCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCTTGGCGCTCTGCTCCTCCCGAGCGTATCTTTGGCCTTCTCACCAATCGGGAGAAATTGATGCTTGCTGAAGACGATTCCGAGGTTTTCGTCAAGATTCGGAAAGGGGCCGACGCCATGGCCGCTTCCAGACGGACGAACTCTGCCGATATTCGCTATGACAACGCCTTGGAAGAGGCTATCCTGTGCCTCGTATTGACCAACGCCGTCGAGGTTCAGGATTCGGTTGGCCAAACCATTGGGATTGCTGTGTACCATCCAACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCCGTCGGATTCCATCAATACGAGGCTACGGATTTCCCCCTTCTGTACTGACATTGGGACTGGTGAAGGAAGTTGTAATCAAATGAGTACTGTTCGTAGAAACTTTTCGCATTTCATTACAAAAGGTGTGTCTCTTCACGGTTATGGTCCAAGAGTCATGGTTAGGAGTATAAAGAGTATGAGGAAAGGAGAAGCAGTCACGATTGCATACTGTGACTTGTTGCAACCTAAGGCTAAAGATAATGTTGCCAGGTTTGCATTGCACGCAACAAAGAATAAAGCTCTTTGTCATAAATTGGATACCTCAAAGATGCACCTGTCAACTTCTCATAAGTTCTACATAAGCATGCCAAATTTCATTATGACCTTGGCAGTGAGGCAGTCAGAGTTGCTGTCAAGATATAAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGACTGTAGGAAACCAGAGTGTCCTTCAATTACAGAGTCATGGTACATGTGTAGAGTAGATGGGATCAAAGTGGAAGTGTTTGATAATGAATTCCAGCGAATTGCTTGTCCTAATGAGGACTTTGCTGAAGTACATGGCTTTGACCACCTAAAATCGCCTAAGGAAATCTCTGCTTTCAATGTGGAATTGCTTGATTCAACTTCGATTAGCAACTTTGATTATGACACTGCAATGAGAAGAATAGATGATTATGTTAACAATGCCATCGCTGAGTACCTGTCTATTGGTTCCCCTGAATCATGTTGTGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTACAACGAGCAAGCAGAAGACGGGGACGGAAAACAGCTTCTTAACTTAAGGCTGCATCCCGTGCACTTCCTGTTGCTGAACGCGTACACTGCTCTAGCATCGGCTTACAAAGTCCGTTCATGGAATGACGATGAAAATCAATGCAACGCTACGATGAGCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTAATGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTCATTCTTGTTAAACACAGCTCATTATGGGGCTCCAACACTTCAAAATCGAGCTCCCCTATGGGCGAAATAACGTGTTTAAACTGCTCATGGGTCGATAAGTTCAATACGAATAGAATACATGGTCGATCTATAGAAGTAGATTTTCGGGAGTTTTCTATTGGTATTTCTAATTGCATTGCTGATATTTCACACAAATATTGGAGCTTTCTGGCTCATGAATGCTCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCGAAGACGATCACGACATGTTCGAATTA
CCGTGATCGTTCGTGTGATTGTAGTAAAATTCAAGATGTTTCTGAGCAAGACAGGCAATCTATCTTTGAGCTTGGTATCCATTGCTTATTCTATGGAGGTTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCCCAGATTGAATGTATTTTACATGACATGAACTGA

Coding sequence (CDS)

ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCTCCTCTCACCGCCGCTCTCCACGATGCCTTCTTCCTCACTCACTGCTCCTCCTGTTTCTCCACTCTCCCAAATTCCTCAATTTCTCACTCTAATCTCCTCCGCTACTGCTCCCCTATATGCTCCCGTTCCGATTCCCTCACCGCCGCCGTCTTCTCCACCGATCACTTCCCCTTCTCCGACACATCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCTTGGCGCTCTGCTCCTCCCGAGCGTATCTTTGGCCTTCTCACCAATCGGGAGAAATTGATGCTTGCTGAAGACGATTCCGAGGTTTTCGTCAAGATTCGGAAAGGGGCCGACGCCATGGCCGCTTCCAGACGGACGAACTCTGCCGATATTCGCTATGACAACGCCTTGGAAGAGGCTATCCTGTGCCTCGTATTGACCAACGCCGTCGAGGTTCAGGATTCGGTTGGCCAAACCATTGGGATTGCTGTGTACCATCCAACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCCGTCGGATTCCATCAATACGAGGCTACGGATTTCCCCCTTCTGTACTGACATTGGGACTGGTGAAGGAAGTTGTAATCAAATGAGTACTGTTCGTAGAAACTTTTCGCATTTCATTACAAAAGGTGTGTCTCTTCACGGTTATGGTCCAAGAGTCATGGTTAGGAGTATAAAGAGTATGAGGAAAGGAGAAGCAGTCACGATTGCATACTGTGACTTGTTGCAACCTAAGGCTAAAGATAATGTTGCCAGGTTTGCATTGCACGCAACAAAGAATAAAGCTCTTTGTCATAAATTGGATACCTCAAAGATGCACCTGTCAACTTCTCATAAGTTCTACATAAGCATGCCAAATTTCATTATGACCTTGGCAGTGAGGCAGTCAGAGTTGCTGTCAAGATATAAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGACTGTAGGAAACCAGAGTGTCCTTCAATTACAGAGTCATGGTACATGTGTAGAGTAGATGGGATCAAAGTGGAAGTGTTTGATAATGAATTCCAGCGAATTGCTTGTCCTAATGAGGACTTTGCTGAAGTACATGGCTTTGACCACCTAAAATCGCCTAAGGAAATCTCTGCTTTCAATGTGGAATTGCTTGATTCAACTTCGATTAGCAACTTTGATTATGACACTGCAATGAGAAGAATAGATGATTATGTTAACAATGCCATCGCTGAGTACCTGTCTATTGGTTCCCCTGAATCATGTTGTGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTACAACGAGCAAGCAGAAGACGGGGACGGAAAACAGCTTCTTAACTTAAGGCTGCATCCCGTGCACTTCCTGTTGCTGAACGCGTACACTGCTCTAGCATCGGCTTACAAAGTCCGTTCATGGAATGACGATGAAAATCAATGCAACGCTACGATGAGCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTAATGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTCATTCTTGTTAAACACAGCTCATTATGGGGCTCCAACACTTCAAAATCGAGCTCCCCTATGGGCGAAATAACGTGTTTAAACTGCTCATGGGTCGATAAGTTCAATACGAATAGAATACATGGTCGATCTATAGAAGTAGATTTTCGGGAGTTTTCTATTGGTATTTCTAATTGCATTGCTGATATTTCACACAAATATTGGAGCTTTCTGGCTCATGAATGCTCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCGAAGACGATCACGACATGTTCGAATTA
CCGTGATCGTTCGTGTGATTGTAGTAAAATTCAAGATGTTTCTGAGCAAGACAGGCAATCTATCTTTGAGCTTGGTATCCATTGCTTATTCTATGGAGGTTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCCCAGATTGAATGTATTTTACATGACATGAACTGA

Protein sequence

MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSPICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECILHDMN
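
The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. This is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, and only the first 48 and last 27 bases of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at a stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGAC"
cds_end = "GAATGTATTTTACATGACATGAACTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first 16 residues (MEMEMRAMEDIEMAED) and the last 8 residues (ECILHDMN) of the protein sequence listed above, with the final TGA read as the stop codon.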
Homology
BLAST of Carg28094 vs. NCBI nr
Match: KAG7025102.1 (Protein SET DOMAIN GROUP 41, partial [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1488.8 bits (3853), Expect = 0.0e+00
Identity = 725/725 (100.00%), Positives = 725/725 (100.00%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK
Sbjct: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
           ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS
Sbjct: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420

Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
           AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480

Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
           AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540

Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
           GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600

Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
           DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660

Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
           ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI
Sbjct: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720

Query: 721 LHDMN 726
           LHDMN
Sbjct: 721 LHDMN 725
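
The percentages in each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch that recomputes them from a summary line; `hsp_percentages` is a hypothetical helper, and the pattern deliberately tolerates both spellings of the second label that occur in BLAST-derived reports.

```python
import re
from typing import Dict

# Matches "Identity = 611/725" and "Positives = 612/725" (or "Postives").
HSP_STAT = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def hsp_percentages(summary: str) -> Dict[str, str]:
    """Return the identity and positives percentages, formatted to 2 decimals."""
    result = {}
    for label, matched, length in HSP_STAT.findall(summary):
        key = "Identity" if label == "Identity" else "Positives"
        result[key] = f"{100 * int(matched) / int(length):.2f}"
    return result


line = "Identity = 611/725 (84.28%), Positives = 612/725 (84.41%), Query Frame = 0"
print(hsp_percentages(line))
```

For the Cucurbita moschata isoform X1 hit, this reproduces the reported 84.28% identity and 84.41% positives.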

BLAST of Carg28094 vs. NCBI nr
Match: XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])

HSP 1 Score: 1196.0 bits (3093), Expect = 0.0e+00
Identity = 611/725 (84.28%), Positives = 612/725 (84.41%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420

Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
           AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480

Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
           AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540

Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
           GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600

Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
           DKFNTNRIHGRSIE DFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTNRIHGRSIEADFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 623

Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
           ITTC NY  RSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGH SHLASQIECI
Sbjct: 661 ITTCLNYHGRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECI 623

Query: 721 LHDMN 726
           LHDMN
Sbjct: 721 LHDMN 623

BLAST of Carg28094 vs. NCBI nr
Match: XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1161.0 bits (3002), Expect = 0.0e+00
Identity = 592/725 (81.66%), Positives = 606/725 (83.59%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPPLPPLTAALHDAF LTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICS SDSLTAAVFST  FPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLA+DDSEVFVKIR+G+DAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG+T
Sbjct: 121 LMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGRT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKS+R GEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420

Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
           A NVELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGS ESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AVNVELLDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGSSESCCEKLQNLLTLGFYDEQ 480

Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
           AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWN DENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNGDENQCNATMSKTSAAYSLFLA 540

Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
           GATHHLFL+EPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLSEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600

Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
           DKFNT+RIHGRSIE DFREFSIGISNCIA+IS KYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTSRIHGRSIEADFREFSIGISNCIANISQKYWSFLAHECSYLKAFTDPFDFSWPKT 623

Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
           ITTCSNYRDRSCDCSKIQDVS+QDRQSIFELGIHCLFYGGYLASICYGHHSHLASQI+CI
Sbjct: 661 ITTCSNYRDRSCDCSKIQDVSDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCI 623

Query: 721 LHDMN 726
           LHDMN
Sbjct: 721 LHDMN 623

BLAST of Carg28094 vs. NCBI nr
Match: XP_022974027.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima])

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 578/727 (79.50%), Positives = 598/727 (82.26%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEME+RAMEDIEMAEDITPPLPPLTAALHD+F LTHCSSCFS LPNS ISHSNLLRYCSP
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICS SDSLTAAVFSTDHF FSDTSDLRASLRLLHLLLSD SAWRS PPERIFGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLA+DDSEVF KIRKGADA+A SRRTNSADIRYDNALEEAI+CLVLTNAVEVQDSVGQT
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKS+RKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EI 
Sbjct: 361 ALQ------------------------------------------------------EIF 420

Query: 421 AFNV-ELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNE 480
           A NV ELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+E
Sbjct: 421 AVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDE 480

Query: 481 QAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCN-ATMSKTSAAYSLF 540
           QA+DGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWND+ENQCN +TMSKTSAAYSLF
Sbjct: 481 QADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSWNDNENQCNTSTMSKTSAAYSLF 540

Query: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCS 600
           LAGATHHLFLNEPSLIASAANCWVVAGESLL LV+HSSLWGSNTSKSSSPMGEITCLNCS
Sbjct: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLRLVRHSSLWGSNTSKSSSPMGEITCLNCS 600

Query: 601 WVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWP 660
           WVDKFNT+RIHGRSIEVDF+EFSIGISNCIA+ISHKYWSFL HEC YLKAFTDPFDFSWP
Sbjct: 601 WVDKFNTSRIHGRSIEVDFQEFSIGISNCIANISHKYWSFLTHECPYLKAFTDPFDFSWP 625

Query: 661 KTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIE 720
           KTITTCSNYRDR CD SKIQDVS+QDRQSIFELGIHCLFYGGYLASICYGH SHL+SQI+
Sbjct: 661 KTITTCSNYRDRLCDYSKIQDVSDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQ 625

Query: 721 CILHDMN 726
           CIL DMN
Sbjct: 721 CILQDMN 625

BLAST of Carg28094 vs. NCBI nr
Match: XP_022932825.1 (protein SET DOMAIN GROUP 41 isoform X2 [Cucurbita moschata])

HSP 1 Score: 982.2 bits (2538), Expect = 2.3e-282
Identity = 512/622 (82.32%), Positives = 513/622 (82.48%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420

Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
           AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480

Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
           AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 520

Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
           GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 520

Query: 601 DKFNTNRIHGRSIEVDFREFSI 623
           DKFNTNRIHGRSIE DFREFSI
Sbjct: 601 DKFNTNRIHGRSIEADFREFSI 520

BLAST of Carg28094 vs. ExPASy Swiss-Prot
Match: Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)

HSP 1 Score: 328.9 bits (842), Expect = 1.4e-88
Identity = 248/724 (34.25%), Positives = 341/724 (47.10%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSPIC 62
           ME+RA EDIE+  D+ PPL PL ++L+D+F  +HCSSCFS LP S         YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  SRSDSLTAAVFSTDHFPFSDT----SDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 122
           S +DS T    ++  FP   T    SD+R SL LL+    D     S+ P R+  LLTN 
Sbjct: 61  SLTDSFT----NSPQFPPEITPILPSDIRTSLHLLNSTAVD----TSSSPHRLNNLLTNH 120

Query: 123 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 182
             LM    D  + V I   A+ +A   R+N    R +  LEEA +C VLTNAVEV DS G
Sbjct: 121 HLLMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNG 180

Query: 183 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSI-NTRLRISPFCTDIGTGEGSCNQMST 242
             +GIA+Y+ +F WINHSCSPN+CYRF     S  +  +  +   +++   E  C     
Sbjct: 181 LALGIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQVCG---- 240

Query: 243 VRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHAT 302
                    T   S +G GP+++VRSIK ++ GE +T++Y DLLQP              
Sbjct: 241 ---------TSLNSGNGNGPKLIVRSIKRIKSGEEITVSYIDLLQP-------------- 300

Query: 303 KNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTY 362
                                             +RQS+L S+Y+F+C+C RC+A PP Y
Sbjct: 301 --------------------------------TGLRQSDLWSKYRFMCNCGRCAASPPAY 360

Query: 363 VDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPK 422
           VD  L+                                             G   L+S K
Sbjct: 361 VDSILE---------------------------------------------GVLTLESEK 420

Query: 423 EISAFNVELLDSTSISNFD----YDTAMRRIDDYVNNAIAEYLSIG-SPESCCEKLQNLL 482
                       T++ +FD     D A+ +++DY+  AI ++LS    P++CCE ++++L
Sbjct: 421 ------------TTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNIDPKTCCEMIESVL 480

Query: 483 TLGFYNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTS 542
             G      +  +  Q   LRLH  H++ LNAY  LA+AY++RS  D E      MS+ S
Sbjct: 481 HHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRS-IDSETGIVCDMSRIS 540

Query: 543 AAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEI 602
           AAYSLFLAG +HHLF  E S   SAA  W  AGE L  L     +         S   ++
Sbjct: 541 AAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGELLFDLAPKLLM-------ELSVESDV 555

Query: 603 TCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDP 662
            C  C  ++  N++R        D +E S  I +C+ DIS   WSFL   C YL+ F  P
Sbjct: 601 KCTKCLMLETSNSHR--------DIKEKSRQILSCVRDISQVTWSFLTRGCPYLEKFRSP 555

Query: 663 FDFSWPKTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSH 717
            DFS    +T  +  R+ S   SK Q V      ++  L  HCL Y   L  +CYG  SH
Sbjct: 661 VDFS----LTRTNGEREES---SKDQTV------NVLLLSSHCLLYADLLTDLCYGQKSH 555

BLAST of Carg28094 vs. ExPASy TrEMBL
Match: A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 1196.0 bits (3093), Expect = 0.0e+00
Identity = 611/725 (84.28%), Positives = 612/725 (84.41%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420

Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
           AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480

Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
           AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540

Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
           GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600

Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
           DKFNTNRIHGRSIE DFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTNRIHGRSIEADFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 623

Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
           ITTC NY  RSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGH SHLASQIECI
Sbjct: 661 ITTCLNYHGRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECI 623

Query: 721 LHDMN 726
           LHDMN
Sbjct: 721 LHDMN 623

BLAST of Carg28094 vs. ExPASy TrEMBL
Match: A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 578/727 (79.50%), Positives = 598/727 (82.26%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEME+RAMEDIEMAEDITPPLPPLTAALHD+F LTHCSSCFS LPNS ISHSNLLRYCSP
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICS SDSLTAAVFSTDHF FSDTSDLRASLRLLHLLLSD SAWRS PPERIFGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLA+DDSEVF KIRKGADA+A SRRTNSADIRYDNALEEAI+CLVLTNAVEVQDSVGQT
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKS+RKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EI 
Sbjct: 361 ALQ------------------------------------------------------EIF 420

Query: 421 AFNV-ELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNE 480
           A NV ELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+E
Sbjct: 421 AVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDE 480

Query: 481 QAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCN-ATMSKTSAAYSLF 540
           QA+DGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWND+ENQCN +TMSKTSAAYSLF
Sbjct: 481 QADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSWNDNENQCNTSTMSKTSAAYSLF 540

Query: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCS 600
           LAGATHHLFLNEPSLIASAANCWVVAGESLL LV+HSSLWGSNTSKSSSPMGEITCLNCS
Sbjct: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLRLVRHSSLWGSNTSKSSSPMGEITCLNCS 600

Query: 601 WVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWP 660
           WVDKFNT+RIHGRSIEVDF+EFSIGISNCIA+ISHKYWSFL HEC YLKAFTDPFDFSWP
Sbjct: 601 WVDKFNTSRIHGRSIEVDFQEFSIGISNCIANISHKYWSFLTHECPYLKAFTDPFDFSWP 625

Query: 661 KTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIE 720
           KTITTCSNYRDR CD SKIQDVS+QDRQSIFELGIHCLFYGGYLASICYGH SHL+SQI+
Sbjct: 661 KTITTCSNYRDRLCDYSKIQDVSDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQ 625

Query: 721 CILHDMN 726
           CIL DMN
Sbjct: 721 CILQDMN 625

BLAST of Carg28094 vs. ExPASy TrEMBL
Match: A0A6J1F365 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)

HSP 1 Score: 982.2 bits (2538), Expect = 1.1e-282
Identity = 512/622 (82.32%), Positives = 513/622 (82.48%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420

Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
           AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480

Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
           AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 520

Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
           GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 520

Query: 601 DKFNTNRIHGRSIEVDFREFSI 623
           DKFNTNRIHGRSIE DFREFSI
Sbjct: 601 DKFNTNRIHGRSIEADFREFSI 520

BLAST of Carg28094 vs. ExPASy TrEMBL
Match: A0A6J1IF01 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)

HSP 1 Score: 923.7 bits (2386), Expect = 4.8e-265
Identity = 485/624 (77.72%), Positives = 501/624 (80.29%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEME+RAMEDIEMAEDITPPLPPLTAALHD+F LTHCSSCFS LPNS ISHSNLLRYCSP
Sbjct: 1   MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60

Query: 61  ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
           ICS SDSLTAAVFSTDHF FSDTSDLRASLRLLHLLLSD SAWRS PPERIFGLLTNREK
Sbjct: 61  ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120

Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
           LMLA+DDSEVF KIRKGADA+A SRRTNSADIRYDNALEEAI+CLVLTNAVEVQDSVGQT
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180

Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
           IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240

Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
           NFSHFITK     GYGPRVMVRSIKS+RKGEAVTIAYCDLLQPK                
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPK---------------- 300

Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
                                         A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360

Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
           ALQ                                                      EI 
Sbjct: 361 ALQ------------------------------------------------------EIF 420

Query: 421 AFNV-ELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNE 480
           A NV ELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+E
Sbjct: 421 AVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDE 480

Query: 481 QAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCN-ATMSKTSAAYSLF 540
           QA+DGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWND+ENQCN +TMSKTSAAYSLF
Sbjct: 481 QADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSWNDNENQCNTSTMSKTSAAYSLF 522

Query: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCS 600
           LAGATHHLFLNEPSLIASAANCWVVAGESLL LV+HSSLWGSNTSKSSSPMGEITCLNCS
Sbjct: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLRLVRHSSLWGSNTSKSSSPMGEITCLNCS 522

Query: 601 WVDKFNTNRIHGRSIEVDFREFSI 623
           WVDKFNT+RIHGRSIEVDF+EFSI
Sbjct: 601 WVDKFNTSRIHGRSIEVDFQEFSI 522

BLAST of Carg28094 vs. ExPASy TrEMBL
Match: A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)

HSP 1 Score: 835.9 bits (2158), Expect = 1.3e-238
Identity = 459/752 (61.04%), Positives = 516/752 (68.62%), Query Frame = 0

Query: 1   MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
           MEMEM A+EDIEMAEDI+PPL PLT+ALHD+F  THCSSCFS LPN  ISHS  L YCS 
Sbjct: 1   MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60

Query: 61  IC--SRSDSLTAAVFSTDHFP--FSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLT 120
            C  S SD LT A FS   FP   SDTSDLRASLRLLHLLLS PS   S PP+RI+GLLT
Sbjct: 61  KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120

Query: 121 NREKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDS 180
           NR KLM  ++DSEVF+K+R+GA+A+AA RR N ADI    ALEEA+LCLVLTNAV+VQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180

Query: 181 VGQTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMS 240
           +GQTIGIAVY  TF WINHSCSPNACYRFETPSDS+ TR RI+P CTD  + EGSC QM 
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240

Query: 241 TVRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHA 300
            VR N   FI +G  L+G GPRV+VRSIK ++KGEAVTIAYCDLLQPKA+          
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKAR---------- 300

Query: 301 TKNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPT 360
                                               RQSEL SRY+FVCSCQRCSA P T
Sbjct: 301 ------------------------------------RQSELWSRYQFVCSCQRCSAVPLT 360

Query: 361 YVDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSP 420
           YVDHALQ                                                     
Sbjct: 361 YVDHALQ----------------------------------------------------- 420

Query: 421 KEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGF 480
            EIS+  VELLDST ISNFD+DTA+RRID+YV+NAI EYLS  SPESCCEKLQNLLT GF
Sbjct: 421 -EISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYLSTSSPESCCEKLQNLLTFGF 480

Query: 481 YNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWN----------DDENQCNA 540
           ++EQ EDG+GKQ ++LRLHP+HFLLLNAYTAL SAYKVRS +          D+ N+ NA
Sbjct: 481 HDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRSCDLVALSSEMDKDNGNRHNA 540

Query: 541 -TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWG--SNTS 600
            TM KTSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGESLLIL +HSSLW   +NTS
Sbjct: 541 LTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAGESLLILARHSSLWATTTNTS 600

Query: 601 KSSSPMGEITCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHEC 660
               P+G+  C NCSWVD+FN +RIHG+ ++ DFREFSIGISNCIA IS K WS L H C
Sbjct: 601 NWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIGISNCIASISQKCWSSLTHGC 652

Query: 661 SYLKAFTDPFDFSWPKT--ITTCSNYRDRSCDCSKIQDV--------SEQDRQSIFELGI 720
            YLKAFT PFDFSWPKT     C    D SC CSK QDV        S Q+R+SI  LGI
Sbjct: 661 PYLKAFTGPFDFSWPKTNEQDICGRGIDHSCACSKTQDVCLECKPQDSNQERESISGLGI 652

Query: 721 HCLFYGGYLASICYGHHSHLASQIECILHDMN 726
           HCL+YGGYLASICYGHHSHLASQI+ IL+D+N
Sbjct: 721 HCLYYGGYLASICYGHHSHLASQIQNILNDLN 652

BLAST of Carg28094 vs. TAIR 10
Match: AT1G43245.1 (SET domain-containing protein )

HSP 1 Score: 328.9 bits (842), Expect = 1.0e-89
Identity = 248/724 (34.25%), Positives = 341/724 (47.10%), Query Frame = 0

Query: 3   MEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSPIC 62
           ME+RA EDIE+  D+ PPL PL ++L+D+F  +HCSSCFS LP S         YCS  C
Sbjct: 1   MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60

Query: 63  SRSDSLTAAVFSTDHFPFSDT----SDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 122
           S +DS T    ++  FP   T    SD+R SL LL+    D     S+ P R+  LLTN 
Sbjct: 61  SLTDSFT----NSPQFPPEITPILPSDIRTSLHLLNSTAVD----TSSSPHRLNNLLTNH 120

Query: 123 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 182
             LM    D  + V I   A+ +A   R+N    R +  LEEA +C VLTNAVEV DS G
Sbjct: 121 HLLMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNG 180

Query: 183 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSI-NTRLRISPFCTDIGTGEGSCNQMST 242
             +GIA+Y+ +F WINHSCSPN+CYRF     S  +  +  +   +++   E  C     
Sbjct: 181 LALGIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQVCG---- 240

Query: 243 VRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHAT 302
                    T   S +G GP+++VRSIK ++ GE +T++Y DLLQP              
Sbjct: 241 ---------TSLNSGNGNGPKLIVRSIKRIKSGEEITVSYIDLLQP-------------- 300

Query: 303 KNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTY 362
                                             +RQS+L S+Y+F+C+C RC+A PP Y
Sbjct: 301 --------------------------------TGLRQSDLWSKYRFMCNCGRCAASPPAY 360

Query: 363 VDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPK 422
           VD  L+                                             G   L+S K
Sbjct: 361 VDSILE---------------------------------------------GVLTLESEK 420

Query: 423 EISAFNVELLDSTSISNFD----YDTAMRRIDDYVNNAIAEYLSIG-SPESCCEKLQNLL 482
                       T++ +FD     D A+ +++DY+  AI ++LS    P++CCE ++++L
Sbjct: 421 ------------TTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNIDPKTCCEMIESVL 480

Query: 483 TLGFYNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTS 542
             G      +  +  Q   LRLH  H++ LNAY  LA+AY++RS  D E      MS+ S
Sbjct: 481 HHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRS-IDSETGIVCDMSRIS 540

Query: 543 AAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEI 602
           AAYSLFLAG +HHLF  E S   SAA  W  AGE L  L     +         S   ++
Sbjct: 541 AAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGELLFDLAPKLLM-------ELSVESDV 555

Query: 603 TCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDP 662
            C  C  ++  N++R        D +E S  I +C+ DIS   WSFL   C YL+ F  P
Sbjct: 601 KCTKCLMLETSNSHR--------DIKEKSRQILSCVRDISQVTWSFLTRGCPYLEKFRSP 555

Query: 663 FDFSWPKTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSH 717
            DFS    +T  +  R+ S   SK Q V      ++  L  HCL Y   L  +CYG  SH
Sbjct: 661 VDFS----LTRTNGEREES---SKDQTV------NVLLLSSHCLLYADLLTDLCYGQKSH 555
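
The percentages on each "Identity = ..." line above can be re-derived from the reported counts. The following Python sketch is illustrative only and is not part of the database record. The names parse_hsp_stats and recomputed_percent are hypothetical, and the sketch assumes the denominator is the alignment length (gap columns included), as in standard BLAST reports.

```python
import re

# Matches fields such as "Identity = 611/725 (84.28%)".
# "Query Frame = 0" has no count/length pair, so it is skipped.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def parse_hsp_stats(line):
    """Return {label: (matches, alignment_length, reported_percent)}."""
    return {
        label: (int(n), int(total), float(pct))
        for label, n, total, pct in FIELD.findall(line)
    }


def recomputed_percent(matches, length):
    """Share of aligned columns as a percentage, rounded to two decimals."""
    return round(100.0 * matches / length, 2)


# Example line copied from the Cucurbita moschata isoform X1 hit.
line = "Identity = 611/725 (84.28%), Positives = 612/725 (84.41%), Query Frame = 0"
stats = parse_hsp_stats(line)
for label, (n, total, reported) in stats.items():
    print(label, recomputed_percent(n, total), reported)
```

The same check can be run on any other hit by pasting its statistics line into `line`.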

The following BLAST results are available for this feature:
Match Name      E-value    Identity  Description
KAG7025102.1    0.0e+00    100.00    Protein SET DOMAIN GROUP 41, partial [Cucurbita argyrosperma subsp. argyrosperma...
XP_022932824.1  0.0e+00    84.28     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata]
XP_023520942.1  0.0e+00    81.66     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022974027.1  0.0e+00    79.50     protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima]
XP_022932825.1  2.3e-282   82.32     protein SET DOMAIN GROUP 41 isoform X2 [Cucurbita moschata]

Match Name      E-value    Identity  Description
Q3ECY6          1.4e-88    34.25     Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1

Match Name      E-value    Identity  Description
A0A6J1EY39      0.0e+00    84.28     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143...
A0A6J1I954      0.0e+00    79.50     protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726...
A0A6J1F365      1.1e-282   82.32     protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11143...
A0A6J1IF01      4.8e-265   77.72     protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC1114726...
A0A0A0KAK3      1.3e-238   61.04     SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV...

Match Name      E-value    Identity  Description
AT1G43245.1     1.0e-89    34.25     SET domain-containing protein
InterPro
Analysis Name: InterPro Annotations of Silver-seed gourd (SMH-JMG-627) v2
Date Performed: 2021-10-25
IPR Term   IPR Description                                    Source       Source Term   Source Description               Alignment
IPR011990  Tetratricopeptide-like helical domain superfamily  GENE3D       1.25.40.10    Tetratricopeptide repeat domain  coord: 326..540; e-value: 7.0E-8; score: 34.3
None       No IPR available                                   GENE3D       2.170.270.10  SET domain                       coord: 136..277; e-value: 1.1E-16; score: 62.4
None       No IPR available                                   PANTHER      PTHR47780     PROTEIN SET DOMAIN GROUP 41      coord: 417..723; coord: 3..286; coord: 327..370
None       No IPR available                                   SUPERFAMILY  82199         SET domain                       coord: 182..348
None       No IPR available                                   SUPERFAMILY  82199         SET domain                       coord: 57..278
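
The coordinates in the InterPro table can be compared directly to see how the domain predictions relate along the protein. A minimal Python sketch, not part of the original record: the HITS list is copied by hand from the table, the coordinates are assumed to be 1-based inclusive residue positions, and the helper names overlap and outer_extent are hypothetical.

```python
# (source, description, start, end), copied from the InterPro table.
HITS = [
    ("GENE3D", "Tetratricopeptide repeat domain", 326, 540),
    ("GENE3D", "SET domain", 136, 277),
    ("SUPERFAMILY", "SET domain", 182, 348),
    ("SUPERFAMILY", "SET domain", 57, 278),
]


def overlap(a_start, a_end, b_start, b_end):
    """Number of residues shared by two inclusive intervals (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def outer_extent(hits, description):
    """Smallest start and largest end over all hits with this description."""
    selected = [(s, e) for _, d, s, e in hits if d == description]
    return min(s for s, _ in selected), max(e for _, e in selected)


# The two SUPERFAMILY SET domain hits overlap each other.
print(overlap(182, 348, 57, 278))
# Outer extent covered by all SET domain predictions.
print(outer_extent(HITS, "SET domain"))
```

The outer extent is only a bounding range; it equals the merged interval here because every SET domain hit overlaps at least one other.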

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
Carg28094-RA  Carg28094-RA  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
molecular_function  GO:0005515      protein binding