Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: polypeptideCDSinitialstart_codonintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGAGAACTCCCAACCCCATCTTTCTACCATTCCCACGCCACCGGAAAATTCCCCGCCTCCGTCCTCTGTAACGCCGCATTCCGATCACCGATATTCACTTATTGCTGGAAGGTTCAGAGACGCCCTATTCTCCGCCGCCGCCGCCAAATATGCGACTAATGGCAGCGCCCACTCCTTGCCCTTCCCCTCCGAGCAGTTCAAGTCCGTTATCGAGTGCTGTCTTCATGAGAATTTCCCTTCCTTTCGCACTCCCACTCATCTTCCATATGCCTCGGTAACTTTCTTTCTCGTTGCTCGTCCTTTCTGTCGATTAAGATTTTGAGGTTATCTGTTCCTTTGAGTTTTTTTATTTTTTATTTTTTTTATTTCTGTAATTGATATTGTGAGGAAATTTTTTGTGCATTGTGGTTGCAATTGAAGCGATGGTTGTTGATGAATTGATATCTTTGGGAATGTAGATGATACAGAAGGCAATAGCTGAAATGGGAGAGGAAGATGGGTTGAGTGAGGAATTAATATCGGAGTTTATTGTGAATGAGTATAAGGATTTACCATGGGCTCACCCCGCATTTTTGCGTCGCCATTTGGGGAAGCTTTGTGAAAGTGGGGAGCTAGTGAAATCCAAATGTGGGAAGTATAACTTTAAGGTGGAGGGTAAGGAAGTAAAGAGGAAGAAGCGGAGGAGGAAGTCAGCAGGAAGGAGTCGTCGCCGAGAAGTGGAGAGTGATGATGAGATAGAAGAGGATTTTAATAGGATAAAGCGATCAAAGAAACTGAACATAAGAGGACCCCGTGCGGAGGAGGTAGTAACAAGTAAAGGGTCTAAAGAACAAAATAATTCATTGAGAGAAGTAATTATTGGGGCTGAAGATGGAGCTCACGCACATAGAGGCGAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCATCATAGAGAGGAAATCAAGTATAAATATGGGGCTAATGATTTTAATCTGCCAAAGAACTCACGGAATCTGGTGATTATAGGTCTTCAAGCTCCAGTAGCTATTAAGGAGATTGGAAGACAAAGTCGTTCATTGGGGGGAAAAGTTCATGAAGCTGAAGAAGGAGATCATGCGAAAGGAGGCCAAATTCAAGTGCTTGGTGATGTTAAAGAAGTTCAGGCAGATGTAATGATTGACCAACCTTGTGAAAAAGAAGTCAAGAGTAGACATGTTATTCAAGATATTGATGAGACAAGGCAATCACGGACCGTGACAGCTGCAAATCTCGGTGTACAGGAGGCACTAGCAATGACAGGGATTGAAGCAAAATGTGGTTCGTTGAGGGAAGAAATTGGTGGACTTATGGAAGTTAGAAAAGTTGAAATGATTAATGATCCTCATGACGTGGAAACTAAGAGTACAGATAGGGCTGAAGATTTTGGAGAGATAAAACAATCACAGGATCTAATGGTTGTTGGACTTCATGCAAAAAAAGCACTACCAACTAAAGGGACTGAAGACCAATGTAGTTCGTTAAGAAAAAATGTTGATGGGGCTGAAGGAGACTGTGAACAAGCAGGTCAAACTGAAGTGCTAGTTTCATTCAAAGGAGGTCAAGAAGTTGAAATGATCGACGAGCATCACGAAGAGGAAAGACAAGGAGAAATGATGGAAGAACCAAAAGAGGTACTGCATATTACTCCACCTTATGCTGCCTTGCGTGCTATCTATTTTGTTTCTTGCTTGTAGTATATGCATACAATAAAATATATCACATGGATAGCATTTGAGTGAAAAAATAGTCAAAAGATCACACAATAAAACATATCTTATAGATTGGCATATCTTAAAGGGTTCATATTAAAGCTCCAAGAGCAATCAATAGATAATCTTGAAAGGTCTTAAATCACCTATCAATCCAAAAAACTTAAAGCTGATGGGTTATATCAATACTTCCACACTTTTCCTCATTTATGGGCTTAGAAATTTGAACAAGGCCTAATAAGTGCAAATCATACTAATTGGGGAGGAAATAGCATTACAAAGGTTTGAACACAAAATCTCCTACTCTAATACCATGTTAAATCACCATTGAACTCAAAAACTTAAGCTATTGGGTTACGGTTAATTTAATTTTATCAATGTACTCCACAAAAGGAGATATGTTAAATTTTTGTATGTTGGTTGGCATCACATCACAGTGTGTATTGTGTATGTTCCCATTACATTCATTACACTAGTGATTGTGTAAGGTTCTTGGAGGAAAAACACAGAAGAGTAGAGAATAAATCCGGAGGCTTTAGTCTTCCTCATTTCTTTCTTCATTTTTTACCACCAGCTTTGACACTCTTCATTGTACGCTTCTATTCTGAGAGGTTAGGATTCCTAGTTTAGTGATAGAACTCAGCTGGATTATTTATATTGAAGATGAATTTTAGTACTCTGGCTTTTTCTGTAAGCCACTGGAAGAAATGAGCTTCAAATTACTCATCCTAAACATTTGAGGGAGACATAAGGATAGATTGAAAATGACAGAAGTTCAAGCCCTGCAGCACTCTGAGGGGGAAAACACCTAGCTTGGATTTTGCTTTTGACGACAATCCTAAGTGTTTTAAGTTTCTGTAGTCTACCAACCATATATAGAACTTTACTTTCGTTGCGCACAAGGGGGAGCTTGACTTTTATGGCAATAGAAATGAGCTTCAACACAGAAAATTTACAAGAAACAACTCAGCTGAAACGGCAGTGTGTCCAATAATAAGTGGCTCAATAAGTAAGTTAGGAAACATATATTTCATTGATGATATGAAATATTACGAAAGAAGTATCTTATAATAGGATGCCTCTCAAAGAGAAAGGAGGGAGGAAAAAGAAACTCTTAGCTGTAAGTCGGTTACAAAAAAACTCCTCCATTTGGTTAAAAGAGAAGAAAACAATAGCTGTCAGAGGGGAGGGAAAACATTTACATCAAGTACGGGACAAGTAAGATAGGTTCTCAAAAATAGCGTCGAAAGGCTTCTCATTATCCTTGAAAATTCCGTTGCTTCATTCCTTCCATGGTAGCCAAAGAGTCTTTTATAAAATTGGACTAAAGGAGCTTCTCTTCATGCAATGACTGATGTTAACCCTGCTAATAGGCTCACTAAAAGTTTACCATTAATTCAGTTTGATCTTTGTTTGTACTAGCAGTATAAAGATACTAGATGATTTGATTGTTTGAGATCTAAGACCCCATTTACTTCAGAATAGAATATTCATTTCTATTTTAATTCTTGATTGTCTAAACAATAGGAAGGACCTTCTGATTCAGATTCCAAATAGCTCATATTGAACAACAAAGAAATATCATTCACATTCCAATGGATTTTCATTCTGATTCTTAGTAAGTTGATTTGTAGTAAGTAAATGAGGCCTAAGTATTTTTTTTTTTTTTTATTAATTTATTGTTCTTTTTTCTGTTGGGAACAAGAATTAACTTCAAGGATCAAATTGACACAAAGTTTAATGTTCAGGGACTAAAATTGTAATGTAACGATTTGTTAATTATGAAGACCAAGTCAATCTATAACTGTGTCCCATGTCAAGATGGGGATGGAACCTTACAGTCAGAGCTAAAGCCACCATTCCAATATCTGATCGTGTTCCAAGATATTTATGTAACTAATCTTCTCAGTAATTCAGTTCTTCTCCAGTTTCCTTGATAATTCATTCTAAGGGGCGAGTCTTTTAGTAACTAAGTTGAGATATGGCCATCCATGAGATATATTTTATTGTTTCTTGATATTAACCTTGTGCCTATCCGTAAGATATACTTTAACACATGCTATCTATCATATTACTCCATTTCATTTCAAGATTATCATAAGATATATTTTACTGCTCTAGCTAAATTATTCCAGCTTATGGGACTCCTATTGGGAAACCCAAAAAATAATCAGACATTTCTTTTTCCATTCAATATAAAATAACTGTTGAAATTATACATTTTTGCATCTCTTTATCTTCTTTAGCTACTTTTTTCCTCACCCACTGCTTGTGGAGGTGGTGTCCTGCTGTTGTTGGCAGCTGGCTCTGAAAGAAAGGAAGACATAGGATTTCATCAGTGTTTGTTTGGAGATGAGGTAGAAGAATTCTCCTTTATGTTGTAGAGAGAGTCGTGGGTGGAGGCTAGAGGGATAGATTTGCTGATGATCTGTTGTCAAGGAGAGAAGCTTTGATATTTCAAAATATTAAGTGTAGTTGCGTGAAAACTAATATTGAGTCAACTTAGCAAGGTGTTTTAAGTTTCATTTCCATCGCCAGAGAAAGCGCAAGTTTCATTTCCACCGTCAGAAGGTGTTAATTACAGGCTTACATTACACTACATTCTCCGTAGTTGTAGACATTTTTTTCTATAACAATATTTTCTATGGAGTTGGCTTTTAAATTTTTTAATCTTATTTTAGCATTCATGTTGATCAAATTTCTGATTATAATGTAGAGAGCATCCAAGGTATCAAACGAAGAAGAGGGTCCTGGTGAAGAAGCCACTTTGGATTTCTTTGATGCTATGCCTAACGATGACGATGCTAAAGAAAATGGAGTGATTGATGCTCAAGGCTGCCAGAAGTTACAAGAGGAAAATGAAGATTTGGAGTTCTTTGATGCGAAGTCTGACCATGGAGACAATGAGGCGAATGAAATAACCGGTGCTCAAACTTCCAAGGGGAAGGTACTTGGTGAAGTTGGCAATAAACAAAATAGACTGGAAGAACAACGAATATCCAAGGTGAGCGATGATCAAACTGGAATAAGCAAGGGCTGTGAGGCTGAGAACCCTCAACTATCCAATAAGCATCCTCGAGTTAGATGGCCTTCTGAAATAACTGGAACTTGGAGAACTAGTATAGCAGCATCCCCACCTCTTGAGCATCAGACCATGGCGCCAAAGCATTCAGAGCAAGCGGTGCGTGGTACATCTGAGGCAGACAAAAATGAATATTCTGAGGCATTATGGACTAAAGATGTTATTTGTAGTCCTAAGAGTCAACCAAGGGGGCACCGTGGGCGAGGGAGGCCTCATAAGTTGAAGATTCAAGAAACTTTTGCGACTTCATTATCTTCGCCTGCTGGAGATTGTGACCAGCAATTTCTGGAATCAAAAGGGGAGGACAGAGAGACATCTGGCCCAGATATGTGCAAAGACACTCATCATATTGATCAGCAGCAACTCAAGCTGCCAAGAGGAAGGGGCAGAGGTAGAGGTCGGGGAAGGCCTCGAATAATGAGACAAGACTGGATTTCGGTGCCAGAGACGTTCTCACCTTCCCAGCATTTGCATCAGCAATCTCCTGCAAAGAGAGGCCGTGGTAGGCCTCCCAAACAAAAATTTGATGAAGATACTGTATCAAAGGACATCTCGACTTTAGAGAATGACCAGCAAGAACGAAAGGGTCGTGGTCGTGGTCGGGGCCGGGGCCGGGGCCGGGGCCGTGACGGTGAAAGACCTTCCAGAGGAAGAAAGAGAGAGAAGGAATCATTTGATACGTTCAATTGCTAAGTATATTAATTAGTATTTAGGAAAATCTAACATCTGATTTTGTTGCTTGTTTCACCGCGCGAGTTTGCCACTTTCCTTTTACCTTCACACCGTTATAAAATTTAAAATTCACACTGTAATTCATGAAGTTAGTTGTTTTTCAAGTCATGCAATTGCAGGTATTACAACGGGGAATAATAGGAGCAGGAAGTTGCACGAAGTGATGGTTGCTCTTGCACACAGGAAGATCTTCCCTGCATCATCACTCGCAGCTTCTCCATCCAACCTCATGTCTATCCTTGTGATCAGAGACACCGCCGCCCTGCTCATGCACATTTGGATCAAGCTTCTTTCATGTCCCAATATGTATAAAGGTAAGAACCTTACCTAA
mRNA sequence
ATGGAGAACTCCCAACCCCATCTTTCTACCATTCCCACGCCACCGGAAAATTCCCCGCCTCCGTCCTCTGTAACGCCGCATTCCGATCACCGATATTCACTTATTGCTGGAAGGTTCAGAGACGCCCTATTCTCCGCCGCCGCCGCCAAATATGCGACTAATGGCAGCGCCCACTCCTTGCCCTTCCCCTCCGAGCAGTTCAAGTCCGTTATCGAGTGCTGTCTTCATGAGAATTTCCCTTCCTTTCGCACTCCCACTCATCTTCCATATGCCTCGATGATACAGAAGGCAATAGCTGAAATGGGAGAGGAAGATGGGTTGAGTGAGGAATTAATATCGGAGTTTATTGTGAATGAGTATAAGGATTTACCATGGGCTCACCCCGCATTTTTGCGTCGCCATTTGGGGAAGCTTTGTGAAAGTGGGGAGCTAGTGAAATCCAAATGTGGGAAGTATAACTTTAAGGTGGAGGGTAAGGAAGTAAAGAGGAAGAAGCGGAGGAGGAAGTCAGCAGGAAGGAGTCGTCGCCGAGAAGTGGAGAGTGATGATGAGATAGAAGAGGATTTTAATAGGATAAAGCGATCAAAGAAACTGAACATAAGAGGACCCCGTGCGGAGGAGGTAGTAACAAGTAAAGGGTCTAAAGAACAAAATAATTCATTGAGAGAAGTAATTATTGGGGCTGAAGATGGAGCTCACGCACATAGAGGCGAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCATCATAGAGAGGAAATCAAGTATAAATATGGGGCTAATGATTTTAATCTGCCAAAGAACTCACGGAATCTGGTGATTATAGGTCTTCAAGCTCCAGTAGCTATTAAGGAGATTGGAAGACAAAGTCGTTCATTGGGGGGAAAAGTTCATGAAGCTGAAGAAGGAGATCATGCGAAAGGAGGCCAAATTCAAGTGCTTGGTGATGTTAAAGAAGTTCAGGCAGATGTAATGATTGACCAACCTTGTGAAAAAGAAGTCAAGAGTAGACATGTTATTCAAGATATTGATGAGACAAGGCAATCACGGACCGTGACAGCTGCAAATCTCGGTGTACAGGAGGCACTAGCAATGACAGGGATTGAAGCAAAATGTGGTTCGTTGAGGGAAGAAATTGGTGGACTTATGGAAGTTAGAAAAGTTGAAATGATTAATGATCCTCATGACGTGGAAACTAAGAGTACAGATAGGGCTGAAGATTTTGGAGAGATAAAACAATCACAGGATCTAATGGTTGTTGGACTTCATGCAAAAAAAGCACTACCAACTAAAGGGACTGAAGACCAATGTAGTTCGTTAAGAAAAAATGTTGATGGGGCTGAAGGAGACTGTGAACAAGCAGGTCAAACTGAAGTGCTAGTTTCATTCAAAGGAGGTCAAGAAGTTGAAATGATCGACGAGCATCACGAAGAGGAAAGACAAGGAGAAATGATGGAAGAACCAAAAGAGAGAGCATCCAAGGTATCAAACGAAGAAGAGGGTCCTGGTGAAGAAGCCACTTTGGATTTCTTTGATGCTATGCCTAACGATGACGATGCTAAAGAAAATGGAGTGATTGATGCTCAAGGCTGCCAGAAGTTACAAGAGGAAAATGAAGATTTGGAGTTCTTTGATGCGAAGTCTGACCATGGAGACAATGAGGCGAATGAAATAACCGGTGCTCAAACTTCCAAGGGGAAGGTACTTGGTGAAGTTGGCAATAAACAAAATAGACTGGAAGAACAACGAATATCCAAGGTGAGCGATGATCAAACTGGAATAAGCAAGGGCTGTGAGGCTGAGAACCCTCAACTATCCAATAAGCATCCTCGAGTTAGATGGCCTTCTGAAATAACTGGAACTTGGAGAACTAGTATAGCAGCATCCCCACCTCTTGAGCATCAGACCATGGCGCCAAAGCATTCAGAGCAAGCGGTGCGTGGTACATCTGAGGCAGACAAAAATGAATATTCTGAGGCATTATGGACTAAAGATGTTATTTGTAGTCCTAAGAGTCAACCAAGGGGGCACCGTGGGCGAGGGAGGCCTCATAAGTTGAAGATTCAAGAAACTTTTGCGACTTCATTATCTTCGCCTGCTGGAGATTGTGACCAGCAATTTCTGGAATCAAAAGGGGAGGACAGAGAGACATCTGGCCCAGATATGTGCAAAGACACTCATCATATTGATCAGCAGCAACTCAAGCTGCCAAGAGGAAGGGGCAGAGGTAGAGGTCGGGGAAGGCCTCGAATAATGAGACAAGACTGGATTTCGGTGCCAGAGACGTTCTCACCTTCCCAGCATTTGCATCAGCAATCTCCTGCAAAGAGAGGCCGTGGTAGGCCTCCCAAACAAAAATTTGATGAAGATACTGTATCAAAGGACATCTCGACTTTAGAGAATGACCAGCAAGAACGAAAGGGTCGTGGTCGTGGTCGGGGCCGGGGCCGGGGCCGGGGCCGTGACGGTATTACAACGGGGAATAATAGGAGCAGGAAGTTGCACGAAGTGATGGTTGCTCTTGCACACAGGAAGATCTTCCCTGCATCATCACTCGCAGCTTCTCCATCCAACCTCATGTCTATCCTTGTGATCAGAGACACCGCCGCCCTGCTCATGCACATTTGGATCAAGCTTCTTTCATGTCCCAATATGTATAAAGGTAAGAACCTTACCTAA
Coding sequence (CDS)
ATGGAGAACTCCCAACCCCATCTTTCTACCATTCCCACGCCACCGGAAAATTCCCCGCCTCCGTCCTCTGTAACGCCGCATTCCGATCACCGATATTCACTTATTGCTGGAAGGTTCAGAGACGCCCTATTCTCCGCCGCCGCCGCCAAATATGCGACTAATGGCAGCGCCCACTCCTTGCCCTTCCCCTCCGAGCAGTTCAAGTCCGTTATCGAGTGCTGTCTTCATGAGAATTTCCCTTCCTTTCGCACTCCCACTCATCTTCCATATGCCTCGATGATACAGAAGGCAATAGCTGAAATGGGAGAGGAAGATGGGTTGAGTGAGGAATTAATATCGGAGTTTATTGTGAATGAGTATAAGGATTTACCATGGGCTCACCCCGCATTTTTGCGTCGCCATTTGGGGAAGCTTTGTGAAAGTGGGGAGCTAGTGAAATCCAAATGTGGGAAGTATAACTTTAAGGTGGAGGGTAAGGAAGTAAAGAGGAAGAAGCGGAGGAGGAAGTCAGCAGGAAGGAGTCGTCGCCGAGAAGTGGAGAGTGATGATGAGATAGAAGAGGATTTTAATAGGATAAAGCGATCAAAGAAACTGAACATAAGAGGACCCCGTGCGGAGGAGGTAGTAACAAGTAAAGGGTCTAAAGAACAAAATAATTCATTGAGAGAAGTAATTATTGGGGCTGAAGATGGAGCTCACGCACATAGAGGCGAAGTTGTGCTGGATGAACTTGAAGAAGTTCAAGAAGATGAAATGATTGACAAGCATCATAGAGAGGAAATCAAGTATAAATATGGGGCTAATGATTTTAATCTGCCAAAGAACTCACGGAATCTGGTGATTATAGGTCTTCAAGCTCCAGTAGCTATTAAGGAGATTGGAAGACAAAGTCGTTCATTGGGGGGAAAAGTTCATGAAGCTGAAGAAGGAGATCATGCGAAAGGAGGCCAAATTCAAGTGCTTGGTGATGTTAAAGAAGTTCAGGCAGATGTAATGATTGACCAACCTTGTGAAAAAGAAGTCAAGAGTAGACATGTTATTCAAGATATTGATGAGACAAGGCAATCACGGACCGTGACAGCTGCAAATCTCGGTGTACAGGAGGCACTAGCAATGACAGGGATTGAAGCAAAATGTGGTTCGTTGAGGGAAGAAATTGGTGGACTTATGGAAGTTAGAAAAGTTGAAATGATTAATGATCCTCATGACGTGGAAACTAAGAGTACAGATAGGGCTGAAGATTTTGGAGAGATAAAACAATCACAGGATCTAATGGTTGTTGGACTTCATGCAAAAAAAGCACTACCAACTAAAGGGACTGAAGACCAATGTAGTTCGTTAAGAAAAAATGTTGATGGGGCTGAAGGAGACTGTGAACAAGCAGGTCAAACTGAAGTGCTAGTTTCATTCAAAGGAGGTCAAGAAGTTGAAATGATCGACGAGCATCACGAAGAGGAAAGACAAGGAGAAATGATGGAAGAACCAAAAGAGAGAGCATCCAAGGTATCAAACGAAGAAGAGGGTCCTGGTGAAGAAGCCACTTTGGATTTCTTTGATGCTATGCCTAACGATGACGATGCTAAAGAAAATGGAGTGATTGATGCTCAAGGCTGCCAGAAGTTACAAGAGGAAAATGAAGATTTGGAGTTCTTTGATGCGAAGTCTGACCATGGAGACAATGAGGCGAATGAAATAACCGGTGCTCAAACTTCCAAGGGGAAGGTACTTGGTGAAGTTGGCAATAAACAAAATAGACTGGAAGAACAACGAATATCCAAGGTGAGCGATGATCAAACTGGAATAAGCAAGGGCTGTGAGGCTGAGAACCCTCAACTATCCAATAAGCATCCTCGAGTTAGATGGCCTTCTGAAATAACTGGAACTTGGAGAACTAGTATAGCAGCATCCCCACCTCTTGAGCATCAGACCATGGCGCCAAAGCATTCAGAGCAAGCGGTGCGTGGTACATCTGAGGCAGACAAAAATGAATATTCTGAGGCATTATGGACTAAAGATGTTATTTGTAGTCCTAAGAGTCAACCAAGGGGGCACCGTGGGCGAGGGAGGCCTCATAAGTTGAAGATTCAAGAAACTTTTGCGACTTCATTATCTTCGCCTGCTGGAGATTGTGACCAGCAATTTCTGGAATCAAAAGGGGAGGACAGAGAGACATCTGGCCCAGATATGTGCAAAGACACTCATCATATTGATCAGCAGCAACTCAAGCTGCCAAGAGGAAGGGGCAGAGGTAGAGGTCGGGGAAGGCCTCGAATAATGAGACAAGACTGGATTTCGGTGCCAGAGACGTTCTCACCTTCCCAGCATTTGCATCAGCAATCTCCTGCAAAGAGAGGCCGTGGTAGGCCTCCCAAACAAAAATTTGATGAAGATACTGTATCAAAGGACATCTCGACTTTAGAGAATGACCAGCAAGAACGAAAGGGTCGTGGTCGTGGTCGGGGCCGGGGCCGGGGCCGGGGCCGTGACGGTATTACAACGGGGAATAATAGGAGCAGGAAGTTGCACGAAGTGATGGTTGCTCTTGCACACAGGAAGATCTTCCCTGCATCATCACTCGCAGCTTCTCCATCCAACCTCATGTCTATCCTTGTGATCAGAGACACCGCCGCCCTGCTCATGCACATTTGGATCAAGCTTCTTTCATGTCCCAATATGTATAAAGGTAAGAACCTTACCTAA
Protein sequence
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSLPFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVVLDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVTAANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQSQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMIDEHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQKLQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTGISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEADKNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGEDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQSPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRDGITTGNNRSRKLHEVMVALAHRKIFPASSLAASPSNLMSILVIRDTAALLMHIWIKLLSCPNMYKGKNLT
Homology
BLAST of Csor.00g068070 vs. ExPASy Swiss-Prot
Match:
P23444 (Histone H1 OS=Zea mays OX=4577 PE=2 SV=2)
HSP 1 Score: 47.8 bits (112), Expect = 7.7e-04
Identity = 26/66 (39.39%), Postives = 37/66 (56.06%), Query Frame = 0
Query: 84 TPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEYK-DLPWAHPAFLRRHLGKLCESG 143
+PTHLPYA M+ +AI + E G S I++F+ +++K LP L L KL G
Sbjct: 47 SPTHLPYAEMVSEAITSLKERTGSSSYAIAKFVEDKHKAKLPPNFRKLLNVQLKKLVAGG 106
Query: 144 ELVKSK 149
+L K K
Sbjct: 107 KLTKVK 112
BLAST of Csor.00g068070 vs. NCBI nr
Match:
KAG6579248.1 (hypothetical protein SDJN03_23696, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1758 bits (4554), Expect = 0.0
Identity = 899/899 (100.00%), Postives = 899/899 (100.00%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL
Sbjct: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY
Sbjct: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV
Sbjct: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL
Sbjct: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT
Sbjct: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ
Sbjct: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID
Sbjct: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK
Sbjct: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG
Sbjct: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD
Sbjct: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE
Sbjct: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS
Sbjct: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS 780
Query: 781 PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRDGITTGNNRSRK 840
PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRDGITTGNNRSRK
Sbjct: 781 PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRDGITTGNNRSRK 840
Query: 841 LHEVMVALAHRKIFPASSLAASPSNLMSILVIRDTAALLMHIWIKLLSCPNMYKGKNLT 899
LHEVMVALAHRKIFPASSLAASPSNLMSILVIRDTAALLMHIWIKLLSCPNMYKGKNLT
Sbjct: 841 LHEVMVALAHRKIFPASSLAASPSNLMSILVIRDTAALLMHIWIKLLSCPNMYKGKNLT 899
BLAST of Csor.00g068070 vs. NCBI nr
Match:
KAG7016763.1 (hypothetical protein SDJN02_21873, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1614 bits (4180), Expect = 0.0
Identity = 825/840 (98.21%), Postives = 828/840 (98.57%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL
Sbjct: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY
Sbjct: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIEEDFNRIKRSKKL IRGPRAEEVVTSKGSKEQNNSLREVIIGAEDG HAHRGEVV
Sbjct: 181 SDDEIEEDFNRIKRSKKLKIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGDHAHRGEVV 240
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL
Sbjct: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQS+TVT
Sbjct: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSQTVT 360
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ
Sbjct: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVL +FKGGQEVEMID
Sbjct: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLGTFKGGQEVEMID 480
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK
Sbjct: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG
Sbjct: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD
Sbjct: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLES GE
Sbjct: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESNGE 720
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS
Sbjct: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS 780
Query: 781 PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRDGITTGNNRSRK 840
PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGR G R R+
Sbjct: 781 PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRGGERPSRGRKRE 840
BLAST of Csor.00g068070 vs. NCBI nr
Match:
XP_022938936.1 (uncharacterized protein LOC111444998 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1575 bits (4078), Expect = 0.0
Identity = 809/842 (96.08%), Postives = 816/842 (96.91%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL
Sbjct: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLH+NFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY
Sbjct: 61 PFPSEQFKSVIECCLHQNFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIEEDFNRIKRSKKLNIRGP AE VVTSKGSKEQNNSLREVIIGAEDG HAHRGEVV
Sbjct: 181 SDDEIEEDFNRIKRSKKLNIRGPHAEAVVTSKGSKEQNNSLREVIIGAEDGDHAHRGEVV 240
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPK SRNLVIIGL APVAIKEIG+QSRSL
Sbjct: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKKSRNLVIIGLHAPVAIKEIGKQSRSL 300
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDE RQS+TVT
Sbjct: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDEKRQSQTVT 360
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLGVQEALAMTGIEAKCGS REEIGGLME+RKVEMINDPHDVE KSTDRAEDFGEIKQ
Sbjct: 361 AANLGVQEALAMTGIEAKCGSSREEIGGLMEIRKVEMINDPHDVEAKSTDRAEDFGEIKQ 420
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVL +FKG QEVEMID
Sbjct: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLGTFKGAQEVEMID 480
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGV+DAQGCQK
Sbjct: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVMDAQGCQK 540
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQN LEEQRISKVSDDQTG
Sbjct: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNSLEEQRISKVSDDQTG 600
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD
Sbjct: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNEYSEAL TKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLES E
Sbjct: 661 KNEYSEALLTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESNVE 720
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQ S
Sbjct: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQPS 780
Query: 781 PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRG--RGRGRGRGRGRDGITTGNNRS 840
PAKRGRGRPPKQKFDEDTVSKDI TLENDQQERKGRG RGRGRGRGRGR G R
Sbjct: 781 PAKRGRGRPPKQKFDEDTVSKDILTLENDQQERKGRGCGRGRGRGRGRGRGGERPSRGRK 840
BLAST of Csor.00g068070 vs. NCBI nr
Match:
XP_023549578.1 (uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1544 bits (3998), Expect = 0.0
Identity = 792/841 (94.17%), Postives = 809/841 (96.20%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLI GRFRDALFSAAAAKYATNGSAHSL
Sbjct: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIFGRFRDALFSAAAAKYATNGSAHSL 60
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLH+NFPSFRTPTHLPYASMIQKAI E+GEEDGLSEELISEFIVNEY
Sbjct: 61 PFPSEQFKSVIECCLHQNFPSFRTPTHLPYASMIQKAITEVGEEDGLSEELISEFIVNEY 120
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIEEDF+RIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVI+GAEDG HAHRG+VV
Sbjct: 181 SDDEIEEDFDRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIVGAEDGDHAHRGQVV 240
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEE QEDEMIDKHHREEIKYKY ANDFNLPK SRNLVIIGL APVAIKEI +QSRSL
Sbjct: 241 LDELEEFQEDEMIDKHHREEIKYKYAANDFNLPKKSRNLVIIGLHAPVAIKEIEKQSRSL 300
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
G KVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDE RQS+TV
Sbjct: 301 GRKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDEKRQSQTVA 360
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLG QEALAM GIEAKCGS REEIGGL EVRKVEMINDPHDVE KSTDRAEDFGEIKQ
Sbjct: 361 AANLGAQEALAMIGIEAKCGSSREEIGGLTEVRKVEMINDPHDVEAKSTDRAEDFGEIKQ 420
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQD+MVVGLHAKKAL KGTEDQCSSLRKNVDGAEGDCEQAGQTEVL +FKGGQEVEMID
Sbjct: 421 SQDVMVVGLHAKKALLIKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLGTFKGGQEVEMID 480
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASK SNEEEGPGEEATLDFFDAMPNDDDAKENGV+DAQGCQK
Sbjct: 481 EHHEEERQGEMMEEPKERASKGSNEEEGPGEEATLDFFDAMPNDDDAKENGVVDAQGCQK 540
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG
Sbjct: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQ VRGTSEAD
Sbjct: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQVVRGTSEAD 660
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNEYSEA+ TKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLES E
Sbjct: 661 KNEYSEAILTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESNVE 720
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLH-QQ 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQ+LH QQ
Sbjct: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQYLHHQQ 780
Query: 781 SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRGRGRDGITTGNNRSR 840
SPAKRGRGRPPKQKFDEDTVSKDIST+ENDQQERKGRGRGRGRGRGRG + + G R +
Sbjct: 781 SPAKRGRGRPPKQKFDEDTVSKDISTVENDQQERKGRGRGRGRGRGRGGERPSRGRKREK 840
BLAST of Csor.00g068070 vs. NCBI nr
Match:
XP_022993719.1 (uncharacterized protein LOC111489634 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1495 bits (3870), Expect = 0.0
Identity = 775/826 (93.83%), Postives = 791/826 (95.76%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL
Sbjct: 18 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 77
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAE+GEEDGLSEELISEFIVNEY
Sbjct: 78 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEVGEEDGLSEELISEFIVNEY 137
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 138 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 197
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIE D +RIKRSKKLNIRGP AEEVVTSKG+KE+N+SL EVI+GAEDG HA RG+V+
Sbjct: 198 SDDEIEGDIDRIKRSKKLNIRGPCAEEVVTSKGTKEKNDSLIEVIVGAEDGDHALRGQVL 257
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPK SRNLVIIGL APVAIK I +QSRSL
Sbjct: 258 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKKSRNLVIIGLHAPVAIKGIEKQSRSL 317
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQ CEK+VKSRHVIQDIDETRQS+TV
Sbjct: 318 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQLCEKKVKSRHVIQDIDETRQSQTVA 377
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLG QEALAMTGIEAKCG REEIGGLM+VRKV MINDPH VE KSTDRAEDFGEIKQ
Sbjct: 378 AANLGAQEALAMTGIEAKCGLSREEIGGLMKVRKVGMINDPHKVEVKSTDRAEDFGEIKQ 437
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQDLMVVGLHAKKAL TKGTEDQCSSLRKNV GAEG CEQAGQTEVL +FKGGQEVEMID
Sbjct: 438 SQDLMVVGLHAKKALTTKGTEDQCSSLRKNVVGAEGGCEQAGQTEVLGTFKGGQEVEMID 497
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASK SNEEEGPGEEATLDFFD MPNDDDAKENGVIDAQGCQK
Sbjct: 498 EHHEEERQGEMMEEPKERASKRSNEEEGPGEEATLDFFDDMPNDDDAKENGVIDAQGCQK 557
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDN+A EITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQT
Sbjct: 558 LQEENEDLEFFDAKSDHGDNKATEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTR 617
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAEN QLSNKHPRVRWPSEITGTWRTSI+ASPPLEHQT APKHSEQAV GTSEAD
Sbjct: 618 ISKGCEAENHQLSNKHPRVRWPSEITGTWRTSISASPPLEHQTTAPKHSEQAVLGTSEAD 677
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNE SEAL TKDVICSPKSQP+GHRGRGRPHKLKIQETFATSLSSPAGD DQQFLESK E
Sbjct: 678 KNENSEALLTKDVICSPKSQPKGHRGRGRPHKLKIQETFATSLSSPAGDYDQQFLESKVE 737
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLH-QQ 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLH QQ
Sbjct: 738 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHHQQ 797
Query: 781 SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRG 825
SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRG G
Sbjct: 798 SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGCG 843
BLAST of Csor.00g068070 vs. ExPASy TrEMBL
Match:
A0A6J1FEI4 (uncharacterized protein LOC111444998 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444998 PE=4 SV=1)
HSP 1 Score: 1575 bits (4078), Expect = 0.0
Identity = 809/842 (96.08%), Postives = 816/842 (96.91%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL
Sbjct: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLH+NFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY
Sbjct: 61 PFPSEQFKSVIECCLHQNFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIEEDFNRIKRSKKLNIRGP AE VVTSKGSKEQNNSLREVIIGAEDG HAHRGEVV
Sbjct: 181 SDDEIEEDFNRIKRSKKLNIRGPHAEAVVTSKGSKEQNNSLREVIIGAEDGDHAHRGEVV 240
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPK SRNLVIIGL APVAIKEIG+QSRSL
Sbjct: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKKSRNLVIIGLHAPVAIKEIGKQSRSL 300
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDE RQS+TVT
Sbjct: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDEKRQSQTVT 360
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLGVQEALAMTGIEAKCGS REEIGGLME+RKVEMINDPHDVE KSTDRAEDFGEIKQ
Sbjct: 361 AANLGVQEALAMTGIEAKCGSSREEIGGLMEIRKVEMINDPHDVEAKSTDRAEDFGEIKQ 420
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVL +FKG QEVEMID
Sbjct: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLGTFKGAQEVEMID 480
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGV+DAQGCQK
Sbjct: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVMDAQGCQK 540
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQN LEEQRISKVSDDQTG
Sbjct: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNSLEEQRISKVSDDQTG 600
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD
Sbjct: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNEYSEAL TKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLES E
Sbjct: 661 KNEYSEALLTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESNVE 720
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQQS 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQ S
Sbjct: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHQPS 780
Query: 781 PAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRG--RGRGRGRGRGRDGITTGNNRS 840
PAKRGRGRPPKQKFDEDTVSKDI TLENDQQERKGRG RGRGRGRGRGR G R
Sbjct: 781 PAKRGRGRPPKQKFDEDTVSKDILTLENDQQERKGRGCGRGRGRGRGRGRGGERPSRGRK 840
BLAST of Csor.00g068070 vs. ExPASy TrEMBL
Match:
A0A6J1K0W5 (uncharacterized protein LOC111489634 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489634 PE=4 SV=1)
HSP 1 Score: 1495 bits (3870), Expect = 0.0
Identity = 775/826 (93.83%), Postives = 791/826 (95.76%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL
Sbjct: 18 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 77
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAE+GEEDGLSEELISEFIVNEY
Sbjct: 78 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEVGEEDGLSEELISEFIVNEY 137
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE
Sbjct: 138 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 197
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
SDDEIE D +RIKRSKKLNIRGP AEEVVTSKG+KE+N+SL EVI+GAEDG HA RG+V+
Sbjct: 198 SDDEIEGDIDRIKRSKKLNIRGPCAEEVVTSKGTKEKNDSLIEVIVGAEDGDHALRGQVL 257
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPK SRNLVIIGL APVAIK I +QSRSL
Sbjct: 258 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKKSRNLVIIGLHAPVAIKGIEKQSRSL 317
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQ CEK+VKSRHVIQDIDETRQS+TV
Sbjct: 318 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQLCEKKVKSRHVIQDIDETRQSQTVA 377
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEVRKVEMINDPHDVETKSTDRAEDFGEIKQ 420
AANLG QEALAMTGIEAKCG REEIGGLM+VRKV MINDPH VE KSTDRAEDFGEIKQ
Sbjct: 378 AANLGAQEALAMTGIEAKCGLSREEIGGLMKVRKVGMINDPHKVEVKSTDRAEDFGEIKQ 437
Query: 421 SQDLMVVGLHAKKALPTKGTEDQCSSLRKNVDGAEGDCEQAGQTEVLVSFKGGQEVEMID 480
SQDLMVVGLHAKKAL TKGTEDQCSSLRKNV GAEG CEQAGQTEVL +FKGGQEVEMID
Sbjct: 438 SQDLMVVGLHAKKALTTKGTEDQCSSLRKNVVGAEGGCEQAGQTEVLGTFKGGQEVEMID 497
Query: 481 EHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVIDAQGCQK 540
EHHEEERQGEMMEEPKERASK SNEEEGPGEEATLDFFD MPNDDDAKENGVIDAQGCQK
Sbjct: 498 EHHEEERQGEMMEEPKERASKRSNEEEGPGEEATLDFFDDMPNDDDAKENGVIDAQGCQK 557
Query: 541 LQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTG 600
LQEENEDLEFFDAKSDHGDN+A EITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQT
Sbjct: 558 LQEENEDLEFFDAKSDHGDNKATEITGAQTSKGKVLGEVGNKQNRLEEQRISKVSDDQTR 617
Query: 601 ISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAVRGTSEAD 660
ISKGCEAEN QLSNKHPRVRWPSEITGTWRTSI+ASPPLEHQT APKHSEQAV GTSEAD
Sbjct: 618 ISKGCEAENHQLSNKHPRVRWPSEITGTWRTSISASPPLEHQTTAPKHSEQAVLGTSEAD 677
Query: 661 KNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQFLESKGE 720
KNE SEAL TKDVICSPKSQP+GHRGRGRPHKLKIQETFATSLSSPAGD DQQFLESK E
Sbjct: 678 KNENSEALLTKDVICSPKSQPKGHRGRGRPHKLKIQETFATSLSSPAGDYDQQFLESKVE 737
Query: 721 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLH-QQ 780
DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLH QQ
Sbjct: 738 DRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPSQHLHHQQ 797
Query: 781 SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGRG 825
SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRG G
Sbjct: 798 SPAKRGRGRPPKQKFDEDTVSKDISTLENDQQERKGRGRGRGRGCG 843
BLAST of Csor.00g068070 vs. ExPASy TrEMBL
Match:
A0A6J1FFG2 (eukaryotic translation initiation factor 5B-like isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444998 PE=4 SV=1)
HSP 1 Score: 1388 bits (3592), Expect = 0.0
Identity = 718/750 (95.73%), Postives = 724/750 (96.53%), Query Frame = 0
Query: 93 MIQKAIAEMGEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKY 152
MIQKAIAEMGEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKY
Sbjct: 1 MIQKAIAEMGEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKY 60
Query: 153 NFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSK 212
NFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIEEDFNRIKRSKKLNIRGP AE VVTSK
Sbjct: 61 NFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIEEDFNRIKRSKKLNIRGPHAEAVVTSK 120
Query: 213 GSKEQNNSLREVIIGAEDGAHAHRGEVVLDELEEVQEDEMIDKHHREEIKYKYGANDFNL 272
GSKEQNNSLREVIIGAEDG HAHRGEVVLDELEEVQEDEMIDKHHREEIKYKYGANDFNL
Sbjct: 121 GSKEQNNSLREVIIGAEDGDHAHRGEVVLDELEEVQEDEMIDKHHREEIKYKYGANDFNL 180
Query: 273 PKNSRNLVIIGLQAPVAIKEIGRQSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVM 332
PK SRNLVIIGL APVAIKEIG+QSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVM
Sbjct: 181 PKKSRNLVIIGLHAPVAIKEIGKQSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVM 240
Query: 333 IDQPCEKEVKSRHVIQDIDETRQSRTVTAANLGVQEALAMTGIEAKCGSLREEIGGLMEV 392
IDQPCEKEVKSRHVIQDIDE RQS+TVTAANLGVQEALAMTGIEAKCGS REEIGGLME+
Sbjct: 241 IDQPCEKEVKSRHVIQDIDEKRQSQTVTAANLGVQEALAMTGIEAKCGSSREEIGGLMEI 300
Query: 393 RKVEMINDPHDVETKSTDRAEDFGEIKQSQDLMVVGLHAKKALPTKGTEDQCSSLRKNVD 452
RKVEMINDPHDVE KSTDRAEDFGEIKQSQDLMVVGLHAKKALPTKGTEDQCSSLRKNVD
Sbjct: 301 RKVEMINDPHDVEAKSTDRAEDFGEIKQSQDLMVVGLHAKKALPTKGTEDQCSSLRKNVD 360
Query: 453 GAEGDCEQAGQTEVLVSFKGGQEVEMIDEHHEEERQGEMMEEPKERASKVSNEEEGPGEE 512
GAEGDCEQAGQTEVL +FKG QEVEMIDEHHEEERQGEMMEEPKERASKVSNEEEGPGEE
Sbjct: 361 GAEGDCEQAGQTEVLGTFKGAQEVEMIDEHHEEERQGEMMEEPKERASKVSNEEEGPGEE 420
Query: 513 ATLDFFDAMPNDDDAKENGVIDAQGCQKLQEENEDLEFFDAKSDHGDNEANEITGAQTSK 572
ATLDFFDAMPNDDDAKENGV+DAQGCQKLQEENEDLEFFDAKSDHGDNEANEITGAQTSK
Sbjct: 421 ATLDFFDAMPNDDDAKENGVMDAQGCQKLQEENEDLEFFDAKSDHGDNEANEITGAQTSK 480
Query: 573 GKVLGEVGNKQNRLEEQRISKVSDDQTGISKGCEAENPQLSNKHPRVRWPSEITGTWRTS 632
GKVLGEVGNKQN LEEQRISKVSDDQTGISKGCEAENPQLSNKHPRVRWPSEITGTWRTS
Sbjct: 481 GKVLGEVGNKQNSLEEQRISKVSDDQTGISKGCEAENPQLSNKHPRVRWPSEITGTWRTS 540
Query: 633 IAASPPLEHQTMAPKHSEQAVRGTSEADKNEYSEALWTKDVICSPKSQPRGHRGRGRPHK 692
IAASPPLEHQTMAPKHSEQAVRGTSEADKNEYSEAL TKDVICSPKSQPRGHRGRGRPHK
Sbjct: 541 IAASPPLEHQTMAPKHSEQAVRGTSEADKNEYSEALLTKDVICSPKSQPRGHRGRGRPHK 600
Query: 693 LKIQETFATSLSSPAGDCDQQFLESKGEDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRG 752
LKIQETFATSLSSPAGDCDQQFLES EDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRG
Sbjct: 601 LKIQETFATSLSSPAGDCDQQFLESNVEDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRG 660
Query: 753 RGRPRIMRQDWISVPETFSPSQHLHQQSPAKRGRGRPPKQKFDEDTVSKDISTLENDQQE 812
RGRPRIMRQDWISVPETFSPSQHLHQ SPAKRGRGRPPKQKFDEDTVSKDI TLENDQQE
Sbjct: 661 RGRPRIMRQDWISVPETFSPSQHLHQPSPAKRGRGRPPKQKFDEDTVSKDILTLENDQQE 720
Query: 813 RKGRG--RGRGRGRGRGRDGITTGNNRSRK 840
RKGRG RGRGRGRGRGR G R R+
Sbjct: 721 RKGRGCGRGRGRGRGRGRGGERPSRGRKRE 750
BLAST of Csor.00g068070 vs. ExPASy TrEMBL
Match:
A0A6J1JZB4 (uncharacterized protein LOC111489634 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489634 PE=4 SV=1)
HSP 1 Score: 1306 bits (3381), Expect = 0.0
Identity = 683/734 (93.05%), Postives = 699/734 (95.23%), Query Frame = 0
Query: 93 MIQKAIAEMGEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKY 152
MIQKAIAE+GEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKY
Sbjct: 1 MIQKAIAEVGEEDGLSEELISEFIVNEYKDLPWAHPAFLRRHLGKLCESGELVKSKCGKY 60
Query: 153 NFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSK 212
NFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIE D +RIKRSKKLNIRGP AEEVVTSK
Sbjct: 61 NFKVEGKEVKRKKRRRKSAGRSRRREVESDDEIEGDIDRIKRSKKLNIRGPCAEEVVTSK 120
Query: 213 GSKEQNNSLREVIIGAEDGAHAHRGEVVLDELEEVQEDEMIDKHHREEIKYKYGANDFNL 272
G+KE+N+SL EVI+GAEDG HA RG+V+LDELEEVQEDEMIDKHHREEIKYKYGANDFNL
Sbjct: 121 GTKEKNDSLIEVIVGAEDGDHALRGQVLLDELEEVQEDEMIDKHHREEIKYKYGANDFNL 180
Query: 273 PKNSRNLVIIGLQAPVAIKEIGRQSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVM 332
PK SRNLVIIGL APVAIK I +QSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVM
Sbjct: 181 PKKSRNLVIIGLHAPVAIKGIEKQSRSLGGKVHEAEEGDHAKGGQIQVLGDVKEVQADVM 240
Query: 333 IDQPCEKEVKSRHVIQDIDETRQSRTVTAANLGVQEALAMTGIEAKCGSLREEIGGLMEV 392
IDQ CEK+VKSRHVIQDIDETRQS+TV AANLG QEALAMTGIEAKCG REEIGGLM+V
Sbjct: 241 IDQLCEKKVKSRHVIQDIDETRQSQTVAAANLGAQEALAMTGIEAKCGLSREEIGGLMKV 300
Query: 393 RKVEMINDPHDVETKSTDRAEDFGEIKQSQDLMVVGLHAKKALPTKGTEDQCSSLRKNVD 452
RKV MINDPH VE KSTDRAEDFGEIKQSQDLMVVGLHAKKAL TKGTEDQCSSLRKNV
Sbjct: 301 RKVGMINDPHKVEVKSTDRAEDFGEIKQSQDLMVVGLHAKKALTTKGTEDQCSSLRKNVV 360
Query: 453 GAEGDCEQAGQTEVLVSFKGGQEVEMIDEHHEEERQGEMMEEPKERASKVSNEEEGPGEE 512
GAEG CEQAGQTEVL +FKGGQEVEMIDEHHEEERQGEMMEEPKERASK SNEEEGPGEE
Sbjct: 361 GAEGGCEQAGQTEVLGTFKGGQEVEMIDEHHEEERQGEMMEEPKERASKRSNEEEGPGEE 420
Query: 513 ATLDFFDAMPNDDDAKENGVIDAQGCQKLQEENEDLEFFDAKSDHGDNEANEITGAQTSK 572
ATLDFFD MPNDDDAKENGVIDAQGCQKLQEENEDLEFFDAKSDHGDN+A EITGAQTSK
Sbjct: 421 ATLDFFDDMPNDDDAKENGVIDAQGCQKLQEENEDLEFFDAKSDHGDNKATEITGAQTSK 480
Query: 573 GKVLGEVGNKQNRLEEQRISKVSDDQTGISKGCEAENPQLSNKHPRVRWPSEITGTWRTS 632
GKVLGEVGNKQNRLEEQRISKVSDDQT ISKGCEAEN QLSNKHPRVRWPSEITGTWRTS
Sbjct: 481 GKVLGEVGNKQNRLEEQRISKVSDDQTRISKGCEAENHQLSNKHPRVRWPSEITGTWRTS 540
Query: 633 IAASPPLEHQTMAPKHSEQAVRGTSEADKNEYSEALWTKDVICSPKSQPRGHRGRGRPHK 692
I+ASPPLEHQT APKHSEQAV GTSEADKNE SEAL TKDVICSPKSQP+GHRGRGRPHK
Sbjct: 541 ISASPPLEHQTTAPKHSEQAVLGTSEADKNENSEALLTKDVICSPKSQPKGHRGRGRPHK 600
Query: 693 LKIQETFATSLSSPAGDCDQQFLESKGEDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRG 752
LKIQETFATSLSSPAGD DQQFLESK EDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRG
Sbjct: 601 LKIQETFATSLSSPAGDYDQQFLESKVEDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRG 660
Query: 753 RGRPRIMRQDWISVPETFSPSQHLH-QQSPAKRGRGRPPKQKFDEDTVSKDISTLENDQQ 812
RGRPRIMRQDWISVPETFSPSQHLH QQSPAKRGRGRPPKQKFDEDTVSKDISTLENDQQ
Sbjct: 661 RGRPRIMRQDWISVPETFSPSQHLHHQQSPAKRGRGRPPKQKFDEDTVSKDISTLENDQQ 720
Query: 813 ERKGRGRGRGRGRG 825
ERKGRGRGRGRG G
Sbjct: 721 ERKGRGRGRGRGCG 734
BLAST of Csor.00g068070 vs. ExPASy TrEMBL
Match:
A0A5D3E3L6 (Transcription regulatory protein SNF2-like isoform X3 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold426G00270 PE=4 SV=1)
HSP 1 Score: 952 bits (2462), Expect = 0.0
Identity = 578/894 (64.65%), Postives = 658/894 (73.60%), Query Frame = 0
Query: 1 MENSQPHLSTIPTPPENSPPPSSVTPHSDHRYSLIAGRFRDALFSAAAAKYATNGSAHSL 60
ME S LS+I PPEN PSS PHSDHR+SLIAGR RDALFSA AAKY+TNG+AHSL
Sbjct: 1 MEISPSQLSSIRPPPENLSSPSSNAPHSDHRHSLIAGRLRDALFSAVAAKYSTNGTAHSL 60
Query: 61 PFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELISEFIVNEY 120
PF S+QFKSVI+C L ENFPSF+TPTHLPYASMIQ+AIAE+GEEDGLSEE ISEFIVNEY
Sbjct: 61 PFLSDQFKSVIDCRLRENFPSFQTPTHLPYASMIQRAIAEVGEEDGLSEESISEFIVNEY 120
Query: 121 KDLPWAHPAFLRRHLGKLCESGELVKSKCGKYNFKVEGKEVKRKKRRRKSAGRSRRREVE 180
+DLPWAH A+LRRHLGKLCE+GELVK KCG+YNFKVE K VKRKKRRRK+ GRSR REVE
Sbjct: 121 EDLPWAHSAYLRRHLGKLCENGELVKLKCGRYNFKVEDKGVKRKKRRRKTGGRSRYREVE 180
Query: 181 SDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLREVIIGAEDGAHAHRGEVV 240
S DEIEE F+R KRSKKL + GPR EEVVTSKGS+EQ++ REV +G E+ H G+VV
Sbjct: 181 SADEIEEGFDRKKRSKKLKVIGPRVEEVVTSKGSEEQSDFSREVTVGVENVDHVGEGQVV 240
Query: 241 LDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSRNLVIIGLQAPVAIKEIGRQSRSL 300
++E ++V+ DEM+DK H E+ K+ YGA FN SRNLVI+GL AP+A KE+ +QS S
Sbjct: 241 VNEQKKVEVDEMVDKQHGEKSKHIYGAKVFNRKNQSRNLVILGLHAPLANKEMEKQSGSF 300
Query: 301 GGKVHEAEEGDHAKGGQIQVLGDVKEVQADVMIDQPCEKEVKSRHVIQDIDETRQSRTVT 360
G +V E EEGDHAKGGQIQV G+V EVQADVMI QPCEKEVKSR QD D+ +QS+ V
Sbjct: 301 GEEVCEVEEGDHAKGGQIQVRGEVNEVQADVMIHQPCEKEVKSRGGFQDFDDKKQSQNVA 360
Query: 361 AANLGVQEALAMTGIEAKCGSLREEIGGLMEV-----RKVEMINDPHDVETKSTDRAEDF 420
A NLG QEAL MT E K GS REEI G E R+ MI + +V +D EDF
Sbjct: 361 AGNLGAQEALTMTWNEEKRGSPREEICGAKERGYDQDRQAIMIYELKEV--NGSDEVEDF 420
Query: 421 GEIKQSQDLMVVGLHAKKALPTKGTEDQCSSLRKNV-DGAEGDCEQAGQTEVLVSFKGGQ 480
G KQSQDLMVVGLHAK+AL TKGTED+CSS RKNV DG EG QAGQ EVL FK Q
Sbjct: 421 GGRKQSQDLMVVGLHAKEALMTKGTEDECSSFRKNVGDGVEGKHAQAGQIEVLDKFKEVQ 480
Query: 481 EVEMIDEHHEEERQGEMMEEPKERASKVSNEEEGPGEEATLDFFDAMPNDDDAKENGVID 540
VEMIDEH EEE+QGE MEEPKERAS S E P EEATL+FFDAM +A+ENGVID
Sbjct: 481 -VEMIDEHPEEEKQGERMEEPKERASLGSIRE--PVEEATLEFFDAMSYHSNAEENGVID 540
Query: 541 -AQGCQKLQEENEDLEFFDAKSDHGDNEANEITGAQTSKGKVLGEVGNKQNRLEEQRISK 600
A+GC+KL EENE+ EFFDAKSDHG + NEI GAQ+SK VLGEV NKQNRLEEQR SK
Sbjct: 541 DAEGCKKLLEENENFEFFDAKSDHGYDGVNEIIGAQSSKKTVLGEVSNKQNRLEEQRPSK 600
Query: 601 VSDDQTGISKGCEAENPQLSNKHPRVRWPSEITGTWRTSIAASPPLEHQTMAPKHSEQAV 660
SDDQT I GCEAE+ QL+ +H +VRWPSEITGT KHS+Q +
Sbjct: 601 FSDDQTEIRNGCEAEDLQLTKEHSQVRWPSEITGT----------------LAKHSKQEM 660
Query: 661 RGTSEADKNEYSEALWTKDVICSPKSQPRGHRGRGRPHKLKIQETFATSLSSPAGDCDQQ 720
TSEADKNE SEAL +D+ICSP SQP GHRG+GRP KLK+QE ATSLSS A D DQ+
Sbjct: 661 SRTSEADKNEKSEALSPEDIICSP-SQPWGHRGQGRPRKLKVQEILATSLSSFARDGDQR 720
Query: 721 FLESKGEDRETSGPDMCKDTHHIDQQQLKLPRGRGRGRGRGRPRIMRQDWISVPETFSPS 780
+L S D E S + THHIDQQ L LPRGRGRGRGR R++RQD S + SPS
Sbjct: 721 YLASNVVDGEASDSNTSYGTHHIDQQGLNLPRGRGRGRGR--LRVVRQDQNSRSQACSPS 780
Query: 781 QHL-HQQSPAKRGRGRPPKQKFDEDTVSKDIST-LENDQQERKGR-GRGRGRGR---GRG 840
+HL H+QSP K RGRP KQ FDED VSKDIST LEN QE KG GRG G G GR
Sbjct: 781 KHLNHRQSPGKI-RGRPLKQNFDEDIVSKDISTPLENKHQEDKGLLGRGHGIGSSSSGRM 840
Query: 841 RDGITTGN-----NRSRKLHEVMVALAH-RKIFPASSLAASPSNLMSILVIRDT 875
++ + N N+S+KL EVMV L + +FPAS LAASPSN SILVI D+
Sbjct: 841 KERGSFDNQYYTRNKSKKLLEVMVTLEYPHDLFPAS-LAASPSNRTSILVISDS 868
BLAST of Csor.00g068070 vs. TAIR 10
Match:
AT5G08780.1 (winged-helix DNA-binding transcription factor family protein )
HSP 1 Score: 56.2 bits (134), Expect = 1.5e-07
Identity = 71/273 (26.01%), Postives = 120/273 (43.96%), Query Frame = 0
Query: 54 NGSAHSLPFPSEQFKSVIECCLHENFPSFRTPTHLPYASMIQKAIAEMGEEDGLSEELIS 113
N SLP + ++ CL RTP H Y++MI AI ++ +E G SE+ IS
Sbjct: 28 NSRDFSLPETKLFQQKFLDLCLS------RTPDHPTYSAMIFIAIMDLNKEGGASEDAIS 87
Query: 114 EFIVNEYKDLPWAHPAFLRRHLGKLCESGELV---KSKCGKYNFKVEGKEVKRKKRRRKS 173
EFI ++YK+LP+AH L HL KL E E++ + C Y+ E K V +RKS
Sbjct: 88 EFIKSKYKNLPFAHTNLLSHHLAKLVEKREILCDCNNDC--YSLPGEKKTVASTDVQRKS 147
Query: 174 -AGRSRRREVESDDEIEEDFNRIKRSKKLNIRGPRAEEVVTSKGSKEQNNSLR------- 233
R + + DE+ N+ + + L P+ + +K + S R
Sbjct: 148 DLITVRTNDQRAADEVMTCQNKEESVEILKSGDPKVVLLEEQSLTKSRTGSKRKACCVIN 207
Query: 234 --EVIIGAEDGAHAHRGEVVLDELEEVQEDEMIDKHHREEIKYKYGANDFNLPKNSR--- 293
EV+ ++G A + + + E++D + E N+ + NSR
Sbjct: 208 VIEVMDTEDNGFKAGLRDSTVQIPRKEGVVEVVDVENSE--------NEARIEANSRGGE 267
Query: 294 --NLVIIGLQAPVAIKEIGRQSRSLGGKVHEAE 309
+ ++ Q V ++E G+++ V +A+
Sbjct: 268 LYEVAVLYKQNDVLMEESGKEAMETSSIVRKAK 284
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
P23444 | 7.7e-04 | 39.39 | Histone H1 OS=Zea mays OX=4577 PE=2 SV=2 | [more] |
Match Name | E-value | Identity | Description | |
KAG6579248.1 | 0.0 | 100.00 | hypothetical protein SDJN03_23696, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7016763.1 | 0.0 | 98.21 | hypothetical protein SDJN02_21873, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
XP_022938936.1 | 0.0 | 96.08 | uncharacterized protein LOC111444998 isoform X1 [Cucurbita moschata] | [more] |
XP_023549578.1 | 0.0 | 94.17 | uncharacterized protein LOC111808038 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022993719.1 | 0.0 | 93.83 | uncharacterized protein LOC111489634 isoform X1 [Cucurbita maxima] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FEI4 | 0.0 | 96.08 | uncharacterized protein LOC111444998 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1K0W5 | 0.0 | 93.83 | uncharacterized protein LOC111489634 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FFG2 | 0.0 | 95.73 | eukaryotic translation initiation factor 5B-like isoform X2 OS=Cucurbita moschat... | [more] |
A0A6J1JZB4 | 0.0 | 93.05 | uncharacterized protein LOC111489634 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A5D3E3L6 | 0.0 | 64.65 | Transcription regulatory protein SNF2-like isoform X3 OS=Cucumis melo var. makuw... | [more] |
Match Name | E-value | Identity | Description | |
AT5G08780.1 | 1.5e-07 | 26.01 | winged-helix DNA-binding transcription factor family protein | [more] |