Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideexonCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCTCCTCTCACCGCCGCTCTCCACGATGCCTTCTTCCTCACTCACTGCTCCTCCTGTTTCTCCACTCTCCCAAATTCCTCAATTTCTCACTCTAATCTCCTCCGCTACTGCTCCCCTATATGCTCCCGTTCCGATTCCCTCACCGCCGCCGTCTTCTCCACCGATCACTTCCCCTTCTCCGACACATCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCTTGGCGCTCTGCTCCTCCCGAGCGTATCTTTGGCCTTCTCACCAATCGGGAGAAATTGATGCTTGCTGAAGACGATTCCGAGGTTTTCGTCAAGATTCGGAAAGGGGCCGACGCCATGGCCGCTTCCAGACGGACGAACTCTGCCGATATTCGCTATGACAACGCCTTGGAAGAGGCTATCCTGTGCCTCGTATTGACCAACGCCGTCGAGGTTCAGGATTCGGTTGGCCAAACCATTGGGATTGCTGTGTACCATCCAACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCCGTCGGATTCCATCAATACGAGGCTACGGATTTCCCCCTTCTGTACTGACATTGGGACTGGTGAAGGAAGTTGTAATCAAGTAATTGTTTCAACTACGAATGATTTTCCTTCTCTTTATGGGCGTTTCTCTCCCTTTCTCGTGAAATTAGTGAGAGCTTTTGCATGATTGTTTATGGCAGATGAGTACTGTTCGTAGAAACTTTTCGCATTTCATTACAAAAGGTGTGTCTCTTCACGTTTGCAATCAATTAAGTGCTGTTGTTGTTGTCCCTGTTTGTATATATTCATGTGAATTTTGATGAAGATTTTCAGGGTTATGGTCCAAGAGTCATGGTTAGGAGTATAAAGAGTATGAGGAAAGGAGAAGCAGTCACGATTGCATACTGTGACTTGTTGCAACCTAAGGTATTATATATTATTTTTGTTATCTTCCCTTCATAATATTTCACAAGTAACTCACGTATGGAACAAAATGGTATGCAAAGTTTGCCTTTTTGAAACAAAAATCCATCTAACAACGAGTAATTATCCACAATGAGTCTTTTTTCACAAGATTCAACAAAAGGAGCAATGTCATGTTGGTACAAATCCTTTATGTGTTCAAAACCCAACAACTTAGCATTCAAAGTATCGAGGAGGACATACCTTCGTGATAAAAGCATCTGCAACAATGTTCTCCTTACCTTGTTTATATTTTATGATGTGAGAGAATGTTTCAATAAATTCCAACCACTTAGCATGTGGTCTATTGAGTTTATTTTGTGCTCTCAAATGCTTTAAACTTTCATGATCCGTATGAATAATGAACTCCTCAGGCTAAAGATAATGTTGCCAGGTTTGCATTGCACGCAACAAAGAATAAAGCTCTTTGTCATAAATTGGATACCTCAAAGATGCACCTGTCAACTTCTCATAAGTTCTACATAAGCATGCCAAATTTCATTATGACCTTGGTATATATTTCAACTCAAAGATTTGCAGAGTAAATGAATGTGGTGCTTGTGGAATATCGTTGTTAATTTCATCTTCTTGTTGAGGGATTATGCTGAAATGACTAGCTATAGATTTGTGAGAAATTTGTTGATTTTGTCTCAGGCAGTGAGGCAGTCAGAGTTGCTGTCAAGATATAAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGTAAGAAAAACTATCAAAATATATACATAAACTTGCTTTTGTTTTGGAAATTTGGTTATGAATTAAAATTTTTCTTTTAACAAAGATTAAAACAATGTAAAGAACATGTGAGAAAACAAGCACATTTTTCAAAAACCAAAAATCAAAAGCTAAAAAACTATGTAGTTATCCTTAGCGATTAGGAGGAGTTTGGTATCACTTTCAAGATGTTTATGTAAGGTATCTCCTTGTAGTGATATGCGAAGGAAGAAAAGCCTTAAGGCAATATAAAATAATCCCTACCTATTTGACAGGGTTGGTAATCACTCCTTCCTCAGCTTCTTTGCTAAGAAATACGATTCGAAAAACATATAAAGAAAGCTATCGTCCTTTCCAAACAAGTATACTACTCAACATTCCCCCAATGGTCCCCTTTCCTTGTGTAACAAACTCGTAAACTGCCCTCATGTGACTAACTTGCAGGCTCCGTGTCTTTCTTGCCCCTCCTAGTGTACTAATTAGTAATTGGTGGCCTAATGTCACCCGCCGGGTCGGAGACATTGTTGAGTTGCTCCCATGTTGCTCCCAAGTTGCTTCATGATCTAGCAAGTTCTTCCAACTGACTAATAACTTAGTCATGCCCGTGGTGCTATTCTTTTGATAACTGAAAATTTCCTCACGGTCAACTTTCCATTCAAAATCACTGGTCTACACAAGCAGCTTGGGCTGTACTTGATGGTTTTGTTCCCAGAGCCTTCTCATCTGAGAATCATGGAATACTTGATGTATTGTTGCCCCAAGTGGAAGTTGGAGTTTATAAGCCACTGTCCCAATCTTGTATTCGATGAAATAAGGTTCAAAAAACTTCAGGGCTAACTTCTCATTACATTTCTTTGCTAGTGATGCCTGTCTATAAGGACGAATTTTGAAGAAAATCCAATCACCTACTGCAAATTCCACTTCCCTTTGTTTCATCCGAGGAGTTGTTAGGTGTTTTTTTGGATTCCATACAATAGCTAAGGAGGAGGTGCCCTGCCATTCACCACTTTGAAGATATGGAACGTGGTGTTATGCCTGCCCAACACAACTATTGTATAACGTTCACCATGAAAACACCTTAGGTAATTTTCTACACATTTATTTTATTTATTTTCCTTTGTTCATTTGCCTGAGGGTGGTATGGAGTGCTTTGCCAGAGTTGAGTCCCCTGTATCTGAAGAAGTTCAGTCCAGAAATGACTGATAAAGATCTTATCATAGTCTGAAATGATTGAGTTTGGAAACCTATGTAACTCTACTACCTTTCTAATAAATAGGGATGCTTATGTCTTAGCTGTAAATCGATGTTTGATGGCAAAAAAGTGAGCATATTTACCGAAATGGTCCACGACTACTAAGATGGTATCATGTCCTTCTGATCGTAGTAATCCTTCAATGAACCCCATAGCTGATTTGTTTCGTTGACAGACTACACAATCCTTCAATTAACCGCATGCCTTAACAATACAACTCACTAGTCAAATGTTTGTAGGACTGTAGGAAACCAGAGTGTCCTTCAATTACAGAGTCATGGTACATGTGTAGAGTAGATGGGATCAAAGTGGAAGTGTTTGATAATGGTAACCTCCCCTTGCATTTCAATGTTCCTTGATGAAAGGAGAATTTGGATAGCTTTTCCCTTTCGTCTTGCAACTGTTGGATGATCTTGCCGAGCTCGGGGTCATTTGCAACCTCTTCTTTAATCGTATCTATATCCAATATGGCAGGGGACAAATGGGTGAGACTCACTAATACGAGCTGAGATTTAGTGTTGGGAACTCAGTGGGTTGCATACGATCATGAAGAGGCCCAATTGGTGAAGGGACCGGAACAACCCTAAGCTGATGCAGGTAAGCAACCACTAAAATGGCCTGGTGTGTTAACCCTAAGGGGAACTCATCTTCATCAAAGGATAATAGCTAGGTGGCGAACTTGATCCCCACATGTACCATCACGATACAGAGTGAAGCAATGTAGCAATAGATACGTCAAGAATAGCCAACGAAAAGATTGCCGAAATTGAAATTTTTAGCTCAAAAGAAAAAGGACCTAGTTTTCAGTCGCATTAGTCATGCTAGGGTATTGTAAGTACTAAGCCTTTGCCATGGTATCAGAGCCAAATTAGTGAGGGAAGAGGAGTATTCAAATCTCTAAGAATCAAAGCCAAATTAGTGAGGGAAGAGGAGTATTCAATCTCTAAGAATCAAGGTTGAATGCAGGAAGAAGATGAGAAATTAAGAGGAATTTGGAGTATGTGATGACTGGGAGAATAACCCTTGTGTTTCAGATGGTAAGACGCGGGAAATGCAGTAAGTCCATTGACAAGTTTGGGTGCTTAATCATACAAAAGATTCCAAAGAAAATTTTATATCAACAACCTGACATCTTTATTAAAGACACTCTAATAATGAAGAAAATGACAAACGAAGATTGACAACTTGTCGGCCACACAATGAAGGCTCACATGTGAATTATTAATCTTTTGTCTGGGTGAATTATTAGTCTTTTGTCTTTATGAATTATTAGTTTGCTTTAGGTCTTTTCTTAATTTTAGACTTTTAGAAAAACAAATCTGAAGTCTATAAAAAGACTTCTTGAAGGTATAGAGAGGCATTGGTCGATAGATATCTCTAAAATCAATCAGAAACCAAGAGAAGAAACCTTTCTACTCCACTTATCTTTTTTATTTTCGACATTTTAATGAACGAGTTCTTAAAAAAGTTTACCAAATCAAATCGTCTCAAATGTTCTAACCAAGCAGAAGAAAGTTCAGATGGGAGAGGGAAACCTCCACCATCTTGAATACTGTCAATCTAGAATACCATACTTCAAGAATTGAAGAAAAACTTCAAAATTGGACTCTTCCGAAGGTAGATCCTAATACTATTTGCCAATTTTCAACTTCAATTTTGCTCAAAGATCATGCATCAAATTTTCGGAGAAAAGTATCTCAATTCATAGCAATAGGGAATCTCTGAATCTTTCTTATACAACACCGTGAAGAGTTTAATTATCTTCACGTCGGTCTTGTTCAAGTTGCGATAAAACCGTTATTCAGACTTGGGCTAGATAGCCCTGTCCTTATCTCCTTTCGAGATAAGAGACATGAAGATTTCTCAAATTCCTTCCTGGGAATGGTTCAATCTAATTTAGAGAATGTACCTGTCTATTTCAATTGTTATCTAAATTTTACTTTGTCACTCAAAGATCCGCATATATTATCATCCCTTATGTTGGACCTATAATAAAAAAAACTTGAACATCAAGGCTGAAACACATTCTTCAGTCGTTATATTCAGAGTTTATTACAAGTTCATGAATACAAATATCTCTCCAAGAGCATTAAGATCCTCACCAAAAGGATCAACTATGCTCATAGAAGCAAATATTGGAAAGTCGTGACTGTTCCAAAACCCTCCCTTAGGATCAAATAACTAAAAATAATCTCTGGAAAATAGAAGATGCCCATTTTTTCAAAAGAAAAGAGTCTCGAAACCCTGTTCAAATCATCGAACATGATAACGGAAGCGTTGAAATAAAGTTCAGCGAAGAACCTTCGTCAAATCCAAAAGTGAAAAAATTCCTAAGTTCGAGACCGAGTATTTCAGGAATTTCAAGCTCAATATATGACCCTTTAAAAGTAAAGGATGTCAACTACGATCAAAGAAGAGCCTCGATCCACTATGAAGATGGCTCAAGATCTCCAACTCATACTGATATGGATACTCAATCTGTCTACAAAAGTCGCTAAACGTCATTAGATTGAATGATTGAACCATTCCCAGTAAGGAATTTGAGAAATCTTCATGTCTCTTGTCTTGAAAGGAGATAAAGACGGGGATATCTTCAGAGATTCCCTATTGCTATGAATTGGGATATTTTTCTCCGAGAATTTGATGCATGATCTTTGAGTAAAATTGAAGGTTGAAAATTGGTAAATAGTTTTAGTATCTACTTTCGGAATAGTCCAATTTGAAGTTTTTCTTCAATTCTGGCCATATGGTATTCTTGATTGACAGTATGGTGAGGCCTTTGTGCCCACTCTGTGTCCATCTGAACTTTTTTCTGCTTGGTTAGAAGATTTGAGACAATTTTATTTGGGGAACATTTCTAAGAACTTGTTCATTAAAGTGTCAAAAATAAGAAAGATAAGAGGAGTAGAAAGGTTTCTTCTCTTGATTTCTGATTAAGTTTAGGGAGACCTATCGAACAATGCCTCTCTATACATTCAAGAAGTCTTTTTATAGACTTCAGATTTGAAAATAAACGTAAACTAAGAGATAAAGATTAAACCTAGCAGGATAGGACCCCTGCAAGATTTGTTTTTCTAAAAGTCTTAAAGTAAGAAAAAACCTAAAACAAACTAATAATTTACCGAGACAAAAGACTAATAATTCACAAAGACGAGATTAATAATTCACATGTGAGCCTTTACTGTGTGGCCTGTTGTCAATCTTCGTTTGTCATTTTCTTCTTTATTGAAGTGTCTTTAATAAAAATGTCAGATTGTTGATATAAAATTTTCTTTGAAATCTTTCGTATGATTGAGTACCCAAACTTGCCAACGACTTACTGCACTTCCCGCGTCTTACCATATGGAACACAAGGGTTATTCTCTCAGCTATCACATACTCCAAATTCCTCTAAATTTCTCTTATCTTCTTCCTACATTCAACCTTGATTCTTAGAGATTGAATACTCCTCTTCCTTTCACTAATTTGGCTCTGATACCACAGCGAAGGCTTAGTATTTACAATACCCTAGCATAACTAATGCGATTGAACATGACTTAAACTTGCCAACGACTTACTGCACTTCCCGCGTCTTACCATATGGAACACAAGGGTTATTCTCTCAGCTATCACATACTCCAAATTCCTCTAAATTTCTCTTATCTTCTTCCTACATTCAACCTTGATTCTTAGAGATTGAATACTCCTCTTCCTTTCACTAATTTGGCTCTGATACCACAGCGAAAGCTTGATACCACTGCATAACAATGCGATTGAACATGACTTTAAACTTGCCCAACGACTTACTGCACTTCCGCGTCTTACCATATGGAACACAAGGGTTATTGCTATCACATACTCCAAATTCCTCTAATTTCTCTAGTTCTTTCCTACATTCAACCTAATTTTAGAGATTGGTACTCCTCTTCTTTCACTTATGCTCTGATACCACAGCGAAGGCTTAGTGATACATACCCTAGCATAACTAATGCAAATGAACATGACAAAACTGCCAACGACTTACTGCATTTCCCGCGTCTTACCATCTGAAACACAAGGGTTATTCTCCCAGTCATCACATACTCCAAATTCCTCTGAATTTCTCATCTTCTTCCTGCATTCAACCTTGATTCTTAGAGACTGAATACTCCTCTTCCCTCACTAATTTGGCTCTGATACCATGGCAAAGGCTTAGTACTTACAATACCCTAGCATGACTAATGCGATTGACATGAACTTGCTCTATACAGACCATCTTTGATATCCTATTGCGCTTCCCGCTTCTTACCACTTGAGACGTAGTATTTACATTACTCCTTAATCTTACAGATTTGAAATACCCAGCGTGTCAGCTATCACATACTCCAAATTCCTCTGAATTTCTCCTATCTTCTTCCTTCAACCTTGATTTTTAGAGATTGAATACTCCCAAAGCTTGCCATGAAATTGTTCCATTATACTTTGTCTTAAGACCATCTTTGATCCTTCTTGAGAAGTGTTTTAGCAGTCCATTGAAAAAATTCTCCTTCCAGAATTCCAGCGAATTGCTTGTCCTAATGAGGACTTTGCTGAAGTACATGGCTTTGACCACCTAAAATCGCCTAAGGTTATATAGCATGCAAGCAAAAAGACGACAACAACGGCTTAAACAGTGTATAACTTTTCTGATAGGGAGAATTCGACAACTGGTCGCATCTCCCTTATAGTACATTCTGAGGCATTTCATGTCTACCAGACATAGTCATGATTTCGCCGAAACGTTACTCTTAGAAAAAACAAGGAAATAATACAACATGAAAGCAGCAAAAGAAAGCGATAAAGTAGTACAAGGCAAGAGTTTCAGCGTGTAACGGGCATGCGGCCATGGGCAACACGCTTTGAACAAGCATGACCATGACACTAAACAAATATATGAAAAACATTAAAAGAGCGCTATCGTCCTTCCCTAACACGTATACCGTGCAACATTCCCTCAGTGGTCTCATAATAATTGGTGGGTGATTCGATTTCTTGTGATTTCTGTTGTTCTTTCGAATAGTGTCGTAGTAAAATGATATGCATTTTAAATTTCTTCAGGAAATCTCTGCTTTCAATGTGGAATTGCTTGATTCAACTTCGATTAGCAACTTTGATTATGACACTGCAATGAGAAGAATAGATGATTATGTTAACAATGCCATCGCTGAGTACCTGTCTATTGGTTCCCCTGAATCATGTTGTGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTACAACGAGCAAGCAGAAGACGGGGACGGAAAACAGCTTCTTAACTTAAGGCTGCATCCCGTGCACTTCCTGTTGCTGAACGCGTACACTGCTCTAGCATCGGCTTACAAAGTCCGTTCATGGAATGACGATGAAAATCAATGCAACGCTACGATGAGCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTAATGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTCATTCTTGTTAAACACAGCTCATTATGGGGCTCCAACACTTCAAAATCGAGCTCCCCTATGGGCGAAATAACGTGTTTAAACTGCTCATGGGTCGATAAGTTCAATACGAATAGAATACATGGTCGATCTATAGAAGTAGATTTTCGGGAGTTTTCTATTGGTATTTCTAATTGCATTGCTGATATTTCACACAAATATTGGAGCTTTCTGGCTCATGAATGCTCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCGAAGACGATCACGACATGTTCGAATTACCGTGATCGTTCGTGTGATTGTAGTAAAATTCAAGATGTTTCTGAGCAAGACAGGCAATCTATCTTTGAGCTTGGTATCCATTGCTTATTCTATGGAGGTTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCCCAGATTGAATGTATTTTACATGACATGAACTGA
mRNA sequence
ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCTCCTCTCACCGCCGCTCTCCACGATGCCTTCTTCCTCACTCACTGCTCCTCCTGTTTCTCCACTCTCCCAAATTCCTCAATTTCTCACTCTAATCTCCTCCGCTACTGCTCCCCTATATGCTCCCGTTCCGATTCCCTCACCGCCGCCGTCTTCTCCACCGATCACTTCCCCTTCTCCGACACATCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCTTGGCGCTCTGCTCCTCCCGAGCGTATCTTTGGCCTTCTCACCAATCGGGAGAAATTGATGCTTGCTGAAGACGATTCCGAGGTTTTCGTCAAGATTCGGAAAGGGGCCGACGCCATGGCCGCTTCCAGACGGACGAACTCTGCCGATATTCGCTATGACAACGCCTTGGAAGAGGCTATCCTGTGCCTCGTATTGACCAACGCCGTCGAGGTTCAGGATTCGGTTGGCCAAACCATTGGGATTGCTGTGTACCATCCAACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCCGTCGGATTCCATCAATACGAGGCTACGGATTTCCCCCTTCTGTACTGACATTGGGACTGGTGAAGGAAGTTGTAATCAAATGAGTACTGTTCGTAGAAACTTTTCGCATTTCATTACAAAAGGTGTGTCTCTTCACGGTTATGGTCCAAGAGTCATGGTTAGGAGTATAAAGAGTATGAGGAAAGGAGAAGCAGTCACGATTGCATACTGTGACTTGTTGCAACCTAAGGCTAAAGATAATGTTGCCAGGTTTGCATTGCACGCAACAAAGAATAAAGCTCTTTGTCATAAATTGGATACCTCAAAGATGCACCTGTCAACTTCTCATAAGTTCTACATAAGCATGCCAAATTTCATTATGACCTTGGCAGTGAGGCAGTCAGAGTTGCTGTCAAGATATAAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGACTGTAGGAAACCAGAGTGTCCTTCAATTACAGAGTCATGGTACATGTGTAGAGTAGATGGGATCAAAGTGGAAGTGTTTGATAATGAATTCCAGCGAATTGCTTGTCCTAATGAGGACTTTGCTGAAGTACATGGCTTTGACCACCTAAAATCGCCTAAGGAAATCTCTGCTTTCAATGTGGAATTGCTTGATTCAACTTCGATTAGCAACTTTGATTATGACACTGCAATGAGAAGAATAGATGATTATGTTAACAATGCCATCGCTGAGTACCTGTCTATTGGTTCCCCTGAATCATGTTGTGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTACAACGAGCAAGCAGAAGACGGGGACGGAAAACAGCTTCTTAACTTAAGGCTGCATCCCGTGCACTTCCTGTTGCTGAACGCGTACACTGCTCTAGCATCGGCTTACAAAGTCCGTTCATGGAATGACGATGAAAATCAATGCAACGCTACGATGAGCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTAATGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTCATTCTTGTTAAACACAGCTCATTATGGGGCTCCAACACTTCAAAATCGAGCTCCCCTATGGGCGAAATAACGTGTTTAAACTGCTCATGGGTCGATAAGTTCAATACGAATAGAATACATGGTCGATCTATAGAAGTAGATTTTCGGGAGTTTTCTATTGGTATTTCTAATTGCATTGCTGATATTTCACACAAATATTGGAGCTTTCTGGCTCATGAATGCTCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCGAAGACGATCACGACATGTTCGAATTACCGTGATCGTTCGTGTGATTGTAGTAAAATTCAAGATGTTTCTGAGCAAGACAGGCAATCTATCTTTGAGCTTGGTATCCATTGCTTATTCTATGGAGGTTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCCCAGATTGAATGTATTTTACATGACATGAACTGA
Coding sequence (CDS)
ATGGAGATGGAAATGAGAGCAATGGAAGACATAGAAATGGCGGAAGACATTACTCCGCCGTTGCCTCCTCTCACCGCCGCTCTCCACGATGCCTTCTTCCTCACTCACTGCTCCTCCTGTTTCTCCACTCTCCCAAATTCCTCAATTTCTCACTCTAATCTCCTCCGCTACTGCTCCCCTATATGCTCCCGTTCCGATTCCCTCACCGCCGCCGTCTTCTCCACCGATCACTTCCCCTTCTCCGACACATCCGACCTCCGCGCCTCCCTCCGCCTCCTCCACCTCCTCCTCTCCGATCCCTCCGCTTGGCGCTCTGCTCCTCCCGAGCGTATCTTTGGCCTTCTCACCAATCGGGAGAAATTGATGCTTGCTGAAGACGATTCCGAGGTTTTCGTCAAGATTCGGAAAGGGGCCGACGCCATGGCCGCTTCCAGACGGACGAACTCTGCCGATATTCGCTATGACAACGCCTTGGAAGAGGCTATCCTGTGCCTCGTATTGACCAACGCCGTCGAGGTTCAGGATTCGGTTGGCCAAACCATTGGGATTGCTGTGTACCATCCAACCTTCTGCTGGATCAATCACAGTTGTTCTCCCAATGCTTGTTACAGATTTGAAACTCCGTCGGATTCCATCAATACGAGGCTACGGATTTCCCCCTTCTGTACTGACATTGGGACTGGTGAAGGAAGTTGTAATCAAATGAGTACTGTTCGTAGAAACTTTTCGCATTTCATTACAAAAGGTGTGTCTCTTCACGGTTATGGTCCAAGAGTCATGGTTAGGAGTATAAAGAGTATGAGGAAAGGAGAAGCAGTCACGATTGCATACTGTGACTTGTTGCAACCTAAGGCTAAAGATAATGTTGCCAGGTTTGCATTGCACGCAACAAAGAATAAAGCTCTTTGTCATAAATTGGATACCTCAAAGATGCACCTGTCAACTTCTCATAAGTTCTACATAAGCATGCCAAATTTCATTATGACCTTGGCAGTGAGGCAGTCAGAGTTGCTGTCAAGATATAAATTTGTCTGTAGTTGCCAGCGATGTAGTGCCAAGCCCCCAACTTATGTGGACCATGCTTTGCAAGACTGTAGGAAACCAGAGTGTCCTTCAATTACAGAGTCATGGTACATGTGTAGAGTAGATGGGATCAAAGTGGAAGTGTTTGATAATGAATTCCAGCGAATTGCTTGTCCTAATGAGGACTTTGCTGAAGTACATGGCTTTGACCACCTAAAATCGCCTAAGGAAATCTCTGCTTTCAATGTGGAATTGCTTGATTCAACTTCGATTAGCAACTTTGATTATGACACTGCAATGAGAAGAATAGATGATTATGTTAACAATGCCATCGCTGAGTACCTGTCTATTGGTTCCCCTGAATCATGTTGTGAGAAGCTTCAAAACTTGCTTACTTTAGGTTTCTACAACGAGCAAGCAGAAGACGGGGACGGAAAACAGCTTCTTAACTTAAGGCTGCATCCCGTGCACTTCCTGTTGCTGAACGCGTACACTGCTCTAGCATCGGCTTACAAAGTCCGTTCATGGAATGACGATGAAAATCAATGCAACGCTACGATGAGCAAAACAAGTGCAGCATACTCCTTGTTCCTTGCAGGTGCTACTCATCATCTTTTTCTTAATGAACCATCTTTGATTGCTTCTGCTGCAAATTGTTGGGTTGTTGCTGGAGAGTCTTTGCTCATTCTTGTTAAACACAGCTCATTATGGGGCTCCAACACTTCAAAATCGAGCTCCCCTATGGGCGAAATAACGTGTTTAAACTGCTCATGGGTCGATAAGTTCAATACGAATAGAATACATGGTCGATCTATAGAAGTAGATTTTCGGGAGTTTTCTATTGGTATTTCTAATTGCATTGCTGATATTTCACACAAATATTGGAGCTTTCTGGCTCATGAATGCTCATATTTGAAGGCTTTCACTGACCCCTTTGATTTCAGCTGGCCGAAGACGATCACGACATGTTCGAATTACCGTGATCGTTCGTGTGATTGTAGTAAAATTCAAGATGTTTCTGAGCAAGACAGGCAATCTATCTTTGAGCTTGGTATCCATTGCTTATTCTATGGAGGTTATTTAGCAAGTATTTGTTATGGTCACCATTCACATTTGGCATCCCAGATTGAATGTATTTTACATGACATGAACTGA
Protein sequence
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSPICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECILHDMN
Homology
BLAST of Carg28094 vs. NCBI nr
Match:
KAG7025102.1 (Protein SET DOMAIN GROUP 41, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1488.8 bits (3853), Expect = 0.0e+00
Identity = 725/725 (100.00%), Postives = 725/725 (100.00%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK
Sbjct: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK
Sbjct: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS
Sbjct: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI
Sbjct: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
Query: 721 LHDMN 726
LHDMN
Sbjct: 721 LHDMN 725
BLAST of Carg28094 vs. NCBI nr
Match:
XP_022932824.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1196.0 bits (3093), Expect = 0.0e+00
Identity = 611/725 (84.28%), Postives = 612/725 (84.41%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420
Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480
Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
DKFNTNRIHGRSIE DFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTNRIHGRSIEADFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 623
Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
ITTC NY RSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGH SHLASQIECI
Sbjct: 661 ITTCLNYHGRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECI 623
Query: 721 LHDMN 726
LHDMN
Sbjct: 721 LHDMN 623
BLAST of Carg28094 vs. NCBI nr
Match:
XP_023520942.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1161.0 bits (3002), Expect = 0.0e+00
Identity = 592/725 (81.66%), Postives = 606/725 (83.59%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAF LTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFLLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICS SDSLTAAVFST FPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK
Sbjct: 61 ICSHSDSLTAAVFSTGQFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLA+DDSEVFVKIR+G+DAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG+T
Sbjct: 121 LMLADDDSEVFVKIREGSDAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGRT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKS+R GEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRNGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420
Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
A NVELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGS ESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AVNVELLDSTSISNFDYDTAIARIDDYVNNAIAEYLSIGSSESCCEKLQNLLTLGFYDEQ 480
Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWN DENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNGDENQCNATMSKTSAAYSLFLA 540
Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
GATHHLFL+EPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLSEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
DKFNT+RIHGRSIE DFREFSIGISNCIA+IS KYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTSRIHGRSIEADFREFSIGISNCIANISQKYWSFLAHECSYLKAFTDPFDFSWPKT 623
Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
ITTCSNYRDRSCDCSKIQDVS+QDRQSIFELGIHCLFYGGYLASICYGHHSHLASQI+CI
Sbjct: 661 ITTCSNYRDRSCDCSKIQDVSDQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIQCI 623
Query: 721 LHDMN 726
LHDMN
Sbjct: 721 LHDMN 623
BLAST of Carg28094 vs. NCBI nr
Match:
XP_022974027.1 (protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 578/727 (79.50%), Postives = 598/727 (82.26%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEME+RAMEDIEMAEDITPPLPPLTAALHD+F LTHCSSCFS LPNS ISHSNLLRYCSP
Sbjct: 1 MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICS SDSLTAAVFSTDHF FSDTSDLRASLRLLHLLLSD SAWRS PPERIFGLLTNREK
Sbjct: 61 ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLA+DDSEVF KIRKGADA+A SRRTNSADIRYDNALEEAI+CLVLTNAVEVQDSVGQT
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKS+RKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EI
Sbjct: 361 ALQ------------------------------------------------------EIF 420
Query: 421 AFNV-ELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNE 480
A NV ELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+E
Sbjct: 421 AVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDE 480
Query: 481 QAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCN-ATMSKTSAAYSLF 540
QA+DGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWND+ENQCN +TMSKTSAAYSLF
Sbjct: 481 QADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSWNDNENQCNTSTMSKTSAAYSLF 540
Query: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCS 600
LAGATHHLFLNEPSLIASAANCWVVAGESLL LV+HSSLWGSNTSKSSSPMGEITCLNCS
Sbjct: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLRLVRHSSLWGSNTSKSSSPMGEITCLNCS 600
Query: 601 WVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWP 660
WVDKFNT+RIHGRSIEVDF+EFSIGISNCIA+ISHKYWSFL HEC YLKAFTDPFDFSWP
Sbjct: 601 WVDKFNTSRIHGRSIEVDFQEFSIGISNCIANISHKYWSFLTHECPYLKAFTDPFDFSWP 625
Query: 661 KTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIE 720
KTITTCSNYRDR CD SKIQDVS+QDRQSIFELGIHCLFYGGYLASICYGH SHL+SQI+
Sbjct: 661 KTITTCSNYRDRLCDYSKIQDVSDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQ 625
Query: 721 CILHDMN 726
CIL DMN
Sbjct: 721 CILQDMN 625
BLAST of Carg28094 vs. NCBI nr
Match:
XP_022932825.1 (protein SET DOMAIN GROUP 41 isoform X2 [Cucurbita moschata])
HSP 1 Score: 982.2 bits (2538), Expect = 2.3e-282
Identity = 512/622 (82.32%), Postives = 513/622 (82.48%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420
Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480
Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 520
Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 520
Query: 601 DKFNTNRIHGRSIEVDFREFSI 623
DKFNTNRIHGRSIE DFREFSI
Sbjct: 601 DKFNTNRIHGRSIEADFREFSI 520
BLAST of Carg28094 vs. ExPASy Swiss-Prot
Match:
Q3ECY6 (Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1)
HSP 1 Score: 328.9 bits (842), Expect = 1.4e-88
Identity = 248/724 (34.25%), Postives = 341/724 (47.10%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSPIC 62
ME+RA EDIE+ D+ PPL PL ++L+D+F +HCSSCFS LP S YCS C
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60
Query: 63 SRSDSLTAAVFSTDHFPFSDT----SDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 122
S +DS T ++ FP T SD+R SL LL+ D S+ P R+ LLTN
Sbjct: 61 SLTDSFT----NSPQFPPEITPILPSDIRTSLHLLNSTAVD----TSSSPHRLNNLLTNH 120
Query: 123 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 182
LM D + V I A+ +A R+N R + LEEA +C VLTNAVEV DS G
Sbjct: 121 HLLMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNG 180
Query: 183 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSI-NTRLRISPFCTDIGTGEGSCNQMST 242
+GIA+Y+ +F WINHSCSPN+CYRF S + + + +++ E C
Sbjct: 181 LALGIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQVCG---- 240
Query: 243 VRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHAT 302
T S +G GP+++VRSIK ++ GE +T++Y DLLQP
Sbjct: 241 ---------TSLNSGNGNGPKLIVRSIKRIKSGEEITVSYIDLLQP-------------- 300
Query: 303 KNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTY 362
+RQS+L S+Y+F+C+C RC+A PP Y
Sbjct: 301 --------------------------------TGLRQSDLWSKYRFMCNCGRCAASPPAY 360
Query: 363 VDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPK 422
VD L+ G L+S K
Sbjct: 361 VDSILE---------------------------------------------GVLTLESEK 420
Query: 423 EISAFNVELLDSTSISNFD----YDTAMRRIDDYVNNAIAEYLSIG-SPESCCEKLQNLL 482
T++ +FD D A+ +++DY+ AI ++LS P++CCE ++++L
Sbjct: 421 ------------TTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNIDPKTCCEMIESVL 480
Query: 483 TLGFYNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTS 542
G + + Q LRLH H++ LNAY LA+AY++RS D E MS+ S
Sbjct: 481 HHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRS-IDSETGIVCDMSRIS 540
Query: 543 AAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEI 602
AAYSLFLAG +HHLF E S SAA W AGE L L + S ++
Sbjct: 541 AAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGELLFDLAPKLLM-------ELSVESDV 555
Query: 603 TCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDP 662
C C ++ N++R D +E S I +C+ DIS WSFL C YL+ F P
Sbjct: 601 KCTKCLMLETSNSHR--------DIKEKSRQILSCVRDISQVTWSFLTRGCPYLEKFRSP 555
Query: 663 FDFSWPKTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSH 717
DFS +T + R+ S SK Q V ++ L HCL Y L +CYG SH
Sbjct: 661 VDFS----LTRTNGEREES---SKDQTV------NVLLLSSHCLLYADLLTDLCYGQKSH 555
BLAST of Carg28094 vs. ExPASy TrEMBL
Match:
A0A6J1EY39 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)
HSP 1 Score: 1196.0 bits (3093), Expect = 0.0e+00
Identity = 611/725 (84.28%), Postives = 612/725 (84.41%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420
Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480
Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
Query: 601 DKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 660
DKFNTNRIHGRSIE DFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT
Sbjct: 601 DKFNTNRIHGRSIEADFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWPKT 623
Query: 661 ITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIECI 720
ITTC NY RSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGH SHLASQIECI
Sbjct: 661 ITTCLNYHGRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHDSHLASQIECI 623
Query: 721 LHDMN 726
LHDMN
Sbjct: 721 LHDMN 623
BLAST of Carg28094 vs. ExPASy TrEMBL
Match:
A0A6J1I954 (protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)
HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 578/727 (79.50%), Postives = 598/727 (82.26%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEME+RAMEDIEMAEDITPPLPPLTAALHD+F LTHCSSCFS LPNS ISHSNLLRYCSP
Sbjct: 1 MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICS SDSLTAAVFSTDHF FSDTSDLRASLRLLHLLLSD SAWRS PPERIFGLLTNREK
Sbjct: 61 ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLA+DDSEVF KIRKGADA+A SRRTNSADIRYDNALEEAI+CLVLTNAVEVQDSVGQT
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKS+RKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EI
Sbjct: 361 ALQ------------------------------------------------------EIF 420
Query: 421 AFNV-ELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNE 480
A NV ELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+E
Sbjct: 421 AVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDE 480
Query: 481 QAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCN-ATMSKTSAAYSLF 540
QA+DGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWND+ENQCN +TMSKTSAAYSLF
Sbjct: 481 QADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSWNDNENQCNTSTMSKTSAAYSLF 540
Query: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCS 600
LAGATHHLFLNEPSLIASAANCWVVAGESLL LV+HSSLWGSNTSKSSSPMGEITCLNCS
Sbjct: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLRLVRHSSLWGSNTSKSSSPMGEITCLNCS 600
Query: 601 WVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDPFDFSWP 660
WVDKFNT+RIHGRSIEVDF+EFSIGISNCIA+ISHKYWSFL HEC YLKAFTDPFDFSWP
Sbjct: 601 WVDKFNTSRIHGRSIEVDFQEFSIGISNCIANISHKYWSFLTHECPYLKAFTDPFDFSWP 625
Query: 661 KTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSHLASQIE 720
KTITTCSNYRDR CD SKIQDVS+QDRQSIFELGIHCLFYGGYLASICYGH SHL+SQI+
Sbjct: 661 KTITTCSNYRDRLCDYSKIQDVSDQDRQSIFELGIHCLFYGGYLASICYGHPSHLSSQIQ 625
Query: 721 CILHDMN 726
CIL DMN
Sbjct: 721 CILQDMN 625
BLAST of Carg28094 vs. ExPASy TrEMBL
Match:
A0A6J1F365 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111439283 PE=4 SV=1)
HSP 1 Score: 982.2 bits (2538), Expect = 1.1e-282
Identity = 512/622 (82.32%), Postives = 513/622 (82.48%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFS LPNSSISHSNLLRYCSP
Sbjct: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSPLPNSSISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSD SAWRSAPPERIFGLLTNREK
Sbjct: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDSSAWRSAPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT
Sbjct: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
AVRQSELLSRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EIS
Sbjct: 361 ALQ------------------------------------------------------EIS 420
Query: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNEQ 480
AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+EQ
Sbjct: 421 AFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDEQ 480
Query: 481 AEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 540
AEDGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA
Sbjct: 481 AEDGDGKQLLNLRLHPVHFLLLNTYTALASAYKVRSWNDDENQCNATMSKTSAAYSLFLA 520
Query: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 600
GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV
Sbjct: 541 GATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCSWV 520
Query: 601 DKFNTNRIHGRSIEVDFREFSI 623
DKFNTNRIHGRSIE DFREFSI
Sbjct: 601 DKFNTNRIHGRSIEADFREFSI 520
BLAST of Carg28094 vs. ExPASy TrEMBL
Match:
A0A6J1IF01 (protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111472647 PE=4 SV=1)
HSP 1 Score: 923.7 bits (2386), Expect = 4.8e-265
Identity = 485/624 (77.72%), Postives = 501/624 (80.29%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEME+RAMEDIEMAEDITPPLPPLTAALHD+F LTHCSSCFS LPNS ISHSNLLRYCSP
Sbjct: 1 MEMELRAMEDIEMAEDITPPLPPLTAALHDSFLLTHCSSCFSPLPNSPISHSNLLRYCSP 60
Query: 61 ICSRSDSLTAAVFSTDHFPFSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNREK 120
ICS SDSLTAAVFSTDHF FSDTSDLRASLRLLHLLLSD SAWRS PPERIFGLLTNREK
Sbjct: 61 ICSYSDSLTAAVFSTDHFLFSDTSDLRASLRLLHLLLSDTSAWRSTPPERIFGLLTNREK 120
Query: 121 LMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVGQT 180
LMLA+DDSEVF KIRKGADA+A SRRTNSADIRYDNALEEAI+CLVLTNAVEVQDSVGQT
Sbjct: 121 LMLADDDSEVFAKIRKGADAIATSRRTNSADIRYDNALEEAIMCLVLTNAVEVQDSVGQT 180
Query: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMSTVRR 240
IGIAVYHPTFCWINHSCSPNACYRFETPSDSI TRLRISPFCTDIGTGEGSC+QMSTVRR
Sbjct: 181 IGIAVYHPTFCWINHSCSPNACYRFETPSDSIKTRLRISPFCTDIGTGEGSCSQMSTVRR 240
Query: 241 NFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHATKNK 300
NFSHFITK GYGPRVMVRSIKS+RKGEAVTIAYCDLLQPK
Sbjct: 241 NFSHFITK--DFQGYGPRVMVRSIKSIRKGEAVTIAYCDLLQPK---------------- 300
Query: 301 ALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTYVDH 360
A+RQSEL SRYKFVCSCQRCSAKPPTYVDH
Sbjct: 301 ------------------------------AMRQSELRSRYKFVCSCQRCSAKPPTYVDH 360
Query: 361 ALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPKEIS 420
ALQ EI
Sbjct: 361 ALQ------------------------------------------------------EIF 420
Query: 421 AFNV-ELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYNE 480
A NV ELLDSTSISNFDYDTA+ RIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFY+E
Sbjct: 421 AVNVEELLDSTSISNFDYDTAITRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGFYDE 480
Query: 481 QAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCN-ATMSKTSAAYSLF 540
QA+DGDGKQLLNLRLHPVHFLLLN YTALASAYKVRSWND+ENQCN +TMSKTSAAYSLF
Sbjct: 481 QADDGDGKQLLNLRLHPVHFLLLNVYTALASAYKVRSWNDNENQCNTSTMSKTSAAYSLF 522
Query: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEITCLNCS 600
LAGATHHLFLNEPSLIASAANCWVVAGESLL LV+HSSLWGSNTSKSSSPMGEITCLNCS
Sbjct: 541 LAGATHHLFLNEPSLIASAANCWVVAGESLLRLVRHSSLWGSNTSKSSSPMGEITCLNCS 522
Query: 601 WVDKFNTNRIHGRSIEVDFREFSI 623
WVDKFNT+RIHGRSIEVDF+EFSI
Sbjct: 601 WVDKFNTSRIHGRSIEVDFQEFSI 522
BLAST of Carg28094 vs. ExPASy TrEMBL
Match:
A0A0A0KAK3 (SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV=1)
HSP 1 Score: 835.9 bits (2158), Expect = 1.3e-238
Identity = 459/752 (61.04%), Postives = 516/752 (68.62%), Query Frame = 0
Query: 1 MEMEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSP 60
MEMEM A+EDIEMAEDI+PPL PLT+ALHD+F THCSSCFS LPN ISHS L YCS
Sbjct: 1 MEMEMIAVEDIEMAEDISPPLFPLTSALHDSFLFTHCSSCFSLLPNPPISHSIPLHYCSL 60
Query: 61 IC--SRSDSLTAAVFSTDHFP--FSDTSDLRASLRLLHLLLSDPSAWRSAPPERIFGLLT 120
C S SD LT A FS FP SDTSDLRASLRLLHLLLS PS S PP+RI+GLLT
Sbjct: 61 KCSLSHSDPLTDAFFSIHPFPDASSDTSDLRASLRLLHLLLSHPSPSLSPPPDRIYGLLT 120
Query: 121 NREKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDS 180
NR KLM ++DSEVF+K+R+GA+A+AA RR N ADI ALEEA+LCLVLTNAV+VQDS
Sbjct: 121 NRHKLMTPQNDSEVFLKLREGANAIAALRRKNYADIPPGTALEEAVLCLVLTNAVDVQDS 180
Query: 181 VGQTIGIAVYHPTFCWINHSCSPNACYRFETPSDSINTRLRISPFCTDIGTGEGSCNQMS 240
+GQTIGIAVY TF WINHSCSPNACYRFETPSDS+ TR RI+P CTD + EGSC QM
Sbjct: 181 IGQTIGIAVYASTFSWINHSCSPNACYRFETPSDSVTTRFRIAPSCTDFMSDEGSCRQMG 240
Query: 241 TVRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHA 300
VR N FI +G L+G GPRV+VRSIK ++KGEAVTIAYCDLLQPKA+
Sbjct: 241 NVRSNILDFIREGALLNGNGPRVVVRSIKRIKKGEAVTIAYCDLLQPKAR---------- 300
Query: 301 TKNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPT 360
RQSEL SRY+FVCSCQRCSA P T
Sbjct: 301 ------------------------------------RQSELWSRYQFVCSCQRCSAVPLT 360
Query: 361 YVDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSP 420
YVDHALQ
Sbjct: 361 YVDHALQ----------------------------------------------------- 420
Query: 421 KEISAFNVELLDSTSISNFDYDTAMRRIDDYVNNAIAEYLSIGSPESCCEKLQNLLTLGF 480
EIS+ VELLDST ISNFD+DTA+RRID+YV+NAI EYLS SPESCCEKLQNLLT GF
Sbjct: 421 -EISSVKVELLDSTPISNFDHDTAVRRIDEYVDNAITEYLSTSSPESCCEKLQNLLTFGF 480
Query: 481 YNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWN----------DDENQCNA 540
++EQ EDG+GKQ ++LRLHP+HFLLLNAYTAL SAYKVRS + D+ N+ NA
Sbjct: 481 HDEQVEDGEGKQHVSLRLHPLHFLLLNAYTALTSAYKVRSCDLVALSSEMDKDNGNRHNA 540
Query: 541 -TMSKTSAAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWG--SNTS 600
TM KTSAAY+LFLAGATH LFL EPSL+ASAANCWVVAGESLLIL +HSSLW +NTS
Sbjct: 541 LTMGKTSAAYALFLAGATHRLFLFEPSLVASAANCWVVAGESLLILARHSSLWATTTNTS 600
Query: 601 KSSSPMGEITCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHEC 660
P+G+ C NCSWVD+FN +RIHG+ ++ DFREFSIGISNCIA IS K WS L H C
Sbjct: 601 NWVFPLGKRMCYNCSWVDEFNASRIHGQPVQADFREFSIGISNCIASISQKCWSSLTHGC 652
Query: 661 SYLKAFTDPFDFSWPKT--ITTCSNYRDRSCDCSKIQDV--------SEQDRQSIFELGI 720
YLKAFT PFDFSWPKT C D SC CSK QDV S Q+R+SI LGI
Sbjct: 661 PYLKAFTGPFDFSWPKTNEQDICGRGIDHSCACSKTQDVCLECKPQDSNQERESISGLGI 652
Query: 721 HCLFYGGYLASICYGHHSHLASQIECILHDMN 726
HCL+YGGYLASICYGHHSHLASQI+ IL+D+N
Sbjct: 721 HCLYYGGYLASICYGHHSHLASQIQNILNDLN 652
BLAST of Carg28094 vs. TAIR 10
Match:
AT1G43245.1 (SET domain-containing protein )
HSP 1 Score: 328.9 bits (842), Expect = 1.0e-89
Identity = 248/724 (34.25%), Postives = 341/724 (47.10%), Query Frame = 0
Query: 3 MEMRAMEDIEMAEDITPPLPPLTAALHDAFFLTHCSSCFSTLPNSSISHSNLLRYCSPIC 62
ME+RA EDIE+ D+ PPL PL ++L+D+F +HCSSCFS LP S YCS C
Sbjct: 1 MEIRAAEDIEIRTDLFPPLSPLASSLYDSFLSSHCSSCFSLLPPSPPQP----LYCSAAC 60
Query: 63 SRSDSLTAAVFSTDHFPFSDT----SDLRASLRLLHLLLSDPSAWRSAPPERIFGLLTNR 122
S +DS T ++ FP T SD+R SL LL+ D S+ P R+ LLTN
Sbjct: 61 SLTDSFT----NSPQFPPEITPILPSDIRTSLHLLNSTAVD----TSSSPHRLNNLLTNH 120
Query: 123 EKLMLAEDDSEVFVKIRKGADAMAASRRTNSADIRYDNALEEAILCLVLTNAVEVQDSVG 182
LM D + V I A+ +A R+N R + LEEA +C VLTNAVEV DS G
Sbjct: 121 HLLMA---DPSISVAIHHAANFIATVIRSN----RKNTELEEAAICAVLTNAVEVHDSNG 180
Query: 183 QTIGIAVYHPTFCWINHSCSPNACYRFETPSDSI-NTRLRISPFCTDIGTGEGSCNQMST 242
+GIA+Y+ +F WINHSCSPN+CYRF S + + + +++ E C
Sbjct: 181 LALGIALYNSSFSWINHSCSPNSCYRFVNNRTSYHDVHVTNTETSSNLELQEQVCG---- 240
Query: 243 VRRNFSHFITKGVSLHGYGPRVMVRSIKSMRKGEAVTIAYCDLLQPKAKDNVARFALHAT 302
T S +G GP+++VRSIK ++ GE +T++Y DLLQP
Sbjct: 241 ---------TSLNSGNGNGPKLIVRSIKRIKSGEEITVSYIDLLQP-------------- 300
Query: 303 KNKALCHKLDTSKMHLSTSHKFYISMPNFIMTLAVRQSELLSRYKFVCSCQRCSAKPPTY 362
+RQS+L S+Y+F+C+C RC+A PP Y
Sbjct: 301 --------------------------------TGLRQSDLWSKYRFMCNCGRCAASPPAY 360
Query: 363 VDHALQDCRKPECPSITESWYMCRVDGIKVEVFDNEFQRIACPNEDFAEVHGFDHLKSPK 422
VD L+ G L+S K
Sbjct: 361 VDSILE---------------------------------------------GVLTLESEK 420
Query: 423 EISAFNVELLDSTSISNFD----YDTAMRRIDDYVNNAIAEYLSIG-SPESCCEKLQNLL 482
T++ +FD D A+ +++DY+ AI ++LS P++CCE ++++L
Sbjct: 421 ------------TTVGHFDGSTNKDEAVGKMNDYIQEAIDDFLSDNIDPKTCCEMIESVL 480
Query: 483 TLGFYNEQAEDGDGKQLLNLRLHPVHFLLLNAYTALASAYKVRSWNDDENQCNATMSKTS 542
G + + Q LRLH H++ LNAY LA+AY++RS D E MS+ S
Sbjct: 481 HHGI-----QFKEDSQPHCLRLHACHYVALNAYITLATAYRIRS-IDSETGIVCDMSRIS 540
Query: 543 AAYSLFLAGATHHLFLNEPSLIASAANCWVVAGESLLILVKHSSLWGSNTSKSSSPMGEI 602
AAYSLFLAG +HHLF E S SAA W AGE L L + S ++
Sbjct: 541 AAYSLFLAGVSHHLFCAERSFAISAAKFWKNAGELLFDLAPKLLM-------ELSVESDV 555
Query: 603 TCLNCSWVDKFNTNRIHGRSIEVDFREFSIGISNCIADISHKYWSFLAHECSYLKAFTDP 662
C C ++ N++R D +E S I +C+ DIS WSFL C YL+ F P
Sbjct: 601 KCTKCLMLETSNSHR--------DIKEKSRQILSCVRDISQVTWSFLTRGCPYLEKFRSP 555
Query: 663 FDFSWPKTITTCSNYRDRSCDCSKIQDVSEQDRQSIFELGIHCLFYGGYLASICYGHHSH 717
DFS +T + R+ S SK Q V ++ L HCL Y L +CYG SH
Sbjct: 661 VDFS----LTRTNGEREES---SKDQTV------NVLLLSSHCLLYADLLTDLCYGQKSH 555
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
KAG7025102.1 | 0.0e+00 | 100.00 | Protein SET DOMAIN GROUP 41, partial [Cucurbita argyrosperma subsp. argyrosperma... | [more] |
XP_022932824.1 | 0.0e+00 | 84.28 | protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita moschata] | [more] |
XP_023520942.1 | 0.0e+00 | 81.66 | protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022974027.1 | 0.0e+00 | 79.50 | protein SET DOMAIN GROUP 41 isoform X1 [Cucurbita maxima] | [more] |
XP_022932825.1 | 2.3e-282 | 82.32 | protein SET DOMAIN GROUP 41 isoform X2 [Cucurbita moschata] | [more] |
Match Name | E-value | Identity | Description | |
Q3ECY6 | 1.4e-88 | 34.25 | Protein SET DOMAIN GROUP 41 OS=Arabidopsis thaliana OX=3702 GN=SDG41 PE=2 SV=1 | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EY39 | 0.0e+00 | 84.28 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC11143... | [more] |
A0A6J1I954 | 0.0e+00 | 79.50 | protein SET DOMAIN GROUP 41 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC1114726... | [more] |
A0A6J1F365 | 1.1e-282 | 82.32 | protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC11143... | [more] |
A0A6J1IF01 | 4.8e-265 | 77.72 | protein SET DOMAIN GROUP 41 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC1114726... | [more] |
A0A0A0KAK3 | 1.3e-238 | 61.04 | SET domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_6G014840 PE=4 SV... | [more] |
Match Name | E-value | Identity | Description | |
AT1G43245.1 | 1.0e-89 | 34.25 | SET domain-containing protein | [more] |