Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAGCATGGGTTTTGCCTCTCTTGGTGTTGGAAATGGAGGATCTCCGTCGTCTTTTTCCAATTTGTCACCTTTGGCGCCGCCCTTCACTCTTGGGCGTTCCGTTACTAAACCTTTTCCGAGCCCGCCACTAGATATGACCGAACCTTCATTTGGGGTTGGGGCTGGGGCTGGGGCTGGGGCTGGGGCTGGGGTTCCCCTCAACTCTTCCCTGCACAATTGGCTCCCTTCCACCTCCAAAACCTCAGGCCTTGACTTCGTCTCCAGTTCCACCTCCGAATTTGATTGGTTCCCCTTCTCTTCTGGGTCAACATACCCCAGGTCGCAGCCTATGATGGAGCCTTCTGATAACCATGGACCTCTTTTGGGCCGTCTTACAATGTCTACAACTGACCGCTCCTTATACGGTCATTCCTCTGACGGACTAACAACTAGTATTGGTAAAGCAAAACCCTACTATCCTTCCTACGCCTCAACTTCATGTAACAAAGGTGGCCCTATGGTCCTTGTTGATCAACCAAGTTATAATTGGCCATTGCACTCGCATGTTGCTACATTCGATGTGCCCCCGTGCGCGGACCTCTCTTGGGGATCTTCAGGCTCTGAGAGATCAGTTGAAGAGGCTTCACATTCTATTGATATACCTGATCTGAATAAATGCAACGAGTTTGTGAGAGAATATCCAGACGAGGGATTGTTATTGGAGCAGAACCTTCACATGGATGCTCATTCTGCATTTCCTGGATGCCACCCCAAGACTAGGACACCGCCTTCAAATCCAGCGTCAAGTTCTCAGAACTATCAATTTCTGAAAAAGGCTCCATATCAGGAAATCTTAAGAGAGCAAGATGCTAGACTGAGTGTGGCTACTTTTTCCCTCAGACCACCTGTTGTCACTACTGATTCTTTTCTCAGGAATATCAGTCCATGTCATATTTCAGATTATGACCATGATTCCTTTGAAGGAAAACAAGGTGGCAACGACCTTTCAAATCTAAAGGAGTTTCTTCCAGTTCATTCTGATAGCAAGGAATTCTTTGGCACAGAGAACCATGGCACATGTATAGATAAAAATGATCCTATAGTTACTGAGTTCTCCTCAACCAAAATTCATGACTTACGAAGCAATATACATTCTGGTAAGGATTCACCAGACCGTACATTGAAGGCCGGAATGGGACTTTATATTCCTGATGCCAGTCCCAACTTTAGTTCGCACCTTAACCCAATTGAAACCGCCACAACAATTGAGAGTTCCTCTGAAAGTTTTGATCAGTACAACCTTGCAGCGGTAGACTCTCCTTGCTGGAAAGGAGCTCGAATTTGTCATACATCTCCATTTCAAGCTTTTGAAATTGTCACTCCGACTCGTATGAAGACCGAAGAAGTTTGCAACAGTGTGAATCTCTCATTGTCTCAAGTACCCCCTTCTACTGCCAAGGATACTGTTCATGAACCAAATGAAAGCACCATAGGCGGCATTCTGGAGAAGGGTGCAACATCTTCTCCAAAGATGCCTTCAGTTGCTGGTCCCTCCTTGCCTGCAGCACAGAAAACTAGCACTTCTGTGAAAGCAGGAGAATTTTGTTCTAAAATGGGCTGCTTCCATCCAGCTACCGGTAGCATCCATGACCCTGTAGAAGATAGTGGTGTCTCCTATTCTTCCTGTTCCATACCACTAAGTAAATATAAGCATAATTTAATGACTGGAAAAAGGATTGCAACTACAAGTTACATGAAGATGCATGCGGATGCAAGATTAAATAGTGACAACTCTTCTGAAAATGGTATGAATCATTTGTCATATGATGCCGCAAAACATATCCAGAATTTTCCTTCTGAGCTTGTGAAGGCATTTCCCAAAGAATCACTCTCAAAAATGGATATCCAGATTCTGGTTGATAAATTGCACGGTCTATCAGAATTGCTCCTTGCATATTGTTTAAATGGTTCAGCTGCATTACACTGGAAAGACGTGAAGTCTCTCAAGACTGTGATGAA
TAACCTTGATGTTTGTATAAATAGCTTTGAATCACAAGATTCTCTCTCACCTGAGCAACGAACTTCACAAAATCTTGAGCCGTTTCATCAACTTCATTCGGTATGTTATGTCTACTGTCTAGTTTCTTCTTCGTTAAGTTCAGGAAGTGATATTTTGAAAACCCCTGATGAAAGTTGAAGTACCATACGATCATTGTACACACGCACACATTGATGAAGATATTCTCTGGAGCAAATTTTCCCTTTTTCTGTTGAAGAAAAATGAAGATATTTTGCAGAACACTTGTACAATTTAACAGCCGCGTATATGTTATAGTTATTATTTTTGTATGAGAAACCAATCTTTCACTGAGAAAAATGAATGAATATTTTAAGGGTAGGCATAGAAATAGACCCATAAAAGTGGACGTCGAAAACTAAATATGAAACAGACTTCAATCCAGGGATCTACCCTTCCCTCCTTGTTTCATCTCTTATTTATTCAGAGAGCTACTCTGTTTGAGTTTGTATGTCATATTTGACAATGCTTAAAATTTTTCTCTGCATTGTGTGGGAATCAAATTATAATTGCTGTGATTGTGTAATGTACTTGTTCATCAGGATTTCCAGGATGTGAGAGTGCTCAAGTCCCAATCCCAGATGACAAAGATTGAAGGAAAAAATTTGGAGTGTTTATCAAATGATGGCAATGGTGTCGAGGAAACGAATCAATACATATTGTCTATCAAGAAAGACAAAGAAGCTGCGGACTCTCTTTATCTTAGGAATGGGATTGACTCGATGAAAGAAGACAGCATGACCAAGGTATCTATTAAGGATAGGAAGTGATATTTTTTTCATCTGTTGTGTGGAAGGCATGCTAAGGTATATTGTGTACTGGCTATATATAAAGGATGATATCGGCTTATATCGTTATCATATTTTTAGGCTCTTAAGAAGGTTCTGAGGGAGAATTTTCATGATGACAAAGAACATCCTCAATCTCTTTTGTACAAGAATCTATGGCTTGAAGCAGAAGCTGCATTATGTGCTTCCAAATTAATAGCTCGATTTAGTATAGCAAAGTCGGAAATGGAGAAACATGAACTACCAATAGTGAGAGGTAAGTTTTGATGTTAGAACTTTCCCCCCATATTGTATTCACTCGGCATGTAGTGTCAATGGTCTGTTCTCCTTACTCAGATATATTTATTTATGGGATAATTTTTATGGCTTAGAACATGCCGAAAATTGGGACGAACTACTCGTTTCTGGTGTATCTCCTGGTTCAAGCACCGTTGGGAAATTGGCACCTAAGACTAAAGTTGGTTCAACTTCATTTGTTCCCGTCCAGACTTCCCCTGCCGTGAGTGTCAGTAGTCATGCTGCAGATGATGTGATTACTAGATTCCATATTCTCAAATGCCGAGAGGATGAAGCAAAGGATAGGCATGCTGGATATTCCGGACAAGACATGGTTGAAAAATCAGCACTCGACAAGGAACAAACGGCAGTCCCTTATATCAATGACATGGATTCTTCCTTCCCCACGTCGAAGGTCAATGGGGATGACTCTAGGCCTGCTCTTCCATCAATTTCCCCTACCTTGACAAGGAACAGCCATACAGAAGATGTCATGTCTAGATTTCAAATTCTAAAATCTCGAGATGAGCACATAAGTTCTTTGAATGTGGGAAAGGTGCAGAAAATTAGAAGCTCCTGTTGCAGTGAGATCGACATGTTGGCACCTAAAGGTAATACTGTGCATAGCCTGGGTATCTCAACTATACATCATCGCTTTGCAGATAATAAAAGCGAAGTTGATGATTTAGATGCTTCAGCACCGGGCAGACTAGATGCCCCGAGGAGTCGTGGAAACCACATAAGCTTGACCTTGACCCCTGCGAGAGAACAGTTACAGGAGAGAGTAACTGTAAAAAAAGGGGGGTTGGGAGTTGAAACGGAACCTTTCTTGCGGTTTGAAGGTGAAGGAAGGTAG
mRNA sequence
ATGAGCATGGGTTTTGCCTCTCTTGGTGTTGGAAATGGAGGATCTCCGTCGTCTTTTTCCAATTTGTCACCTTTGGCGCCGCCCTTCACTCTTGGGCGTTCCGTTACTAAACCTTTTCCGAGCCCGCCACTAGATATGACCGAACCTTCATTTGGGGTTGGGGCTGGGGCTGGGGCTGGGGCTGGGGCTGGGGTTCCCCTCAACTCTTCCCTGCACAATTGGCTCCCTTCCACCTCCAAAACCTCAGGCCTTGACTTCGTCTCCAGTTCCACCTCCGAATTTGATTGGTTCCCCTTCTCTTCTGGGTCAACATACCCCAGGTCGCAGCCTATGATGGAGCCTTCTGATAACCATGGACCTCTTTTGGGCCGTCTTACAATGTCTACAACTGACCGCTCCTTATACGGTCATTCCTCTGACGGACTAACAACTAGTATTGGTAAAGCAAAACCCTACTATCCTTCCTACGCCTCAACTTCATGTAACAAAGGTGGCCCTATGGTCCTTGTTGATCAACCAAGTTATAATTGGCCATTGCACTCGCATGTTGCTACATTCGATGTGCCCCCGTGCGCGGACCTCTCTTGGGGATCTTCAGGCTCTGAGAGATCAGTTGAAGAGGCTTCACATTCTATTGATATACCTGATCTGAATAAATGCAACGAGTTTGTGAGAGAATATCCAGACGAGGGATTGTTATTGGAGCAGAACCTTCACATGGATGCTCATTCTGCATTTCCTGGATGCCACCCCAAGACTAGGACACCGCCTTCAAATCCAGCGTCAAGTTCTCAGAACTATCAATTTCTGAAAAAGGCTCCATATCAGGAAATCTTAAGAGAGCAAGATGCTAGACTGAGTGTGGCTACTTTTTCCCTCAGACCACCTGTTGTCACTACTGATTCTTTTCTCAGGAATATCAGTCCATGTCATATTTCAGATTATGACCATGATTCCTTTGAAGGAAAACAAGGTGGCAACGACCTTTCAAATCTAAAGGAGTTTCTTCCAGTTCATTCTGATAGCAAGGAATTCTTTGGCACAGAGAACCATGGCACATGTATAGATAAAAATGATCCTATAGTTACTGAGTTCTCCTCAACCAAAATTCATGACTTACGAAGCAATATACATTCTGGTAAGGATTCACCAGACCGTACATTGAAGGCCGGAATGGGACTTTATATTCCTGATGCCAGTCCCAACTTTAGTTCGCACCTTAACCCAATTGAAACCGCCACAACAATTGAGAGTTCCTCTGAAAGTTTTGATCAGTACAACCTTGCAGCGGTAGACTCTCCTTGCTGGAAAGGAGCTCGAATTTGTCATACATCTCCATTTCAAGCTTTTGAAATTGTCACTCCGACTCGTATGAAGACCGAAGAAGTTTGCAACAGTGTGAATCTCTCATTGTCTCAAGTACCCCCTTCTACTGCCAAGGATACTGTTCATGAACCAAATGAAAGCACCATAGGCGGCATTCTGGAGAAGGGTGCAACATCTTCTCCAAAGATGCCTTCAGTTGCTGGTCCCTCCTTGCCTGCAGCACAGAAAACTAGCACTTCTGTGAAAGCAGGAGAATTTTGTTCTAAAATGGGCTGCTTCCATCCAGCTACCGGTAGCATCCATGACCCTGTAGAAGATAGTGGTGTCTCCTATTCTTCCTGTTCCATACCACTAAGTAAATATAAGCATAATTTAATGACTGGAAAAAGGATTGCAACTACAAGTTACATGAAGATGCATGCGGATGCAAGATTAAATAGTGACAACTCTTCTGAAAATGGTATGAATCATTTGTCATATGATGCCGCAAAACATATCCAGAATTTTCCTTCTGAGCTTGTGAAGGCATTTCCCAAAGAATCACTCTCAAAAATGGATATCCAGATTCTGGTTGATAAATTGCACGGTCTATCAGAATTGCTCCTTGCATATTGTTTAAATGGTTCAGCTGCATTACACTGGAAAGACGTGAAGTCTCTCAAGACTGTGATGAA
TAACCTTGATGTTTGTATAAATAGCTTTGAATCACAAGATTCTCTCTCACCTGAGCAACGAACTTCACAAAATCTTGAGCCGTTTCATCAACTTCATTCGGATTTCCAGGATGTGAGAGTGCTCAAGTCCCAATCCCAGATGACAAAGATTGAAGGAAAAAATTTGGAGTGTTTATCAAATGATGGCAATGGTGTCGAGGAAACGAATCAATACATATTGTCTATCAAGAAAGACAAAGAAGCTGCGGACTCTCTTTATCTTAGGAATGGGATTGACTCGATGAAAGAAGACAGCATGACCAAGGCTCTTAAGAAGGTTCTGAGGGAGAATTTTCATGATGACAAAGAACATCCTCAATCTCTTTTGTACAAGAATCTATGGCTTGAAGCAGAAGCTGCATTATGTGCTTCCAAATTAATAGCTCGATTTAGTATAGCAAAGTCGGAAATGGAGAAACATGAACTACCAATAGTGAGAGAACATGCCGAAAATTGGGACGAACTACTCGTTTCTGGTGTATCTCCTGGTTCAAGCACCGTTGGGAAATTGGCACCTAAGACTAAAGTTGGTTCAACTTCATTTGTTCCCGTCCAGACTTCCCCTGCCGTGAGTGTCAGTAGTCATGCTGCAGATGATGTGATTACTAGATTCCATATTCTCAAATGCCGAGAGGATGAAGCAAAGGATAGGCATGCTGGATATTCCGGACAAGACATGGTTGAAAAATCAGCACTCGACAAGGAACAAACGGCAGTCCCTTATATCAATGACATGGATTCTTCCTTCCCCACGTCGAAGGTCAATGGGGATGACTCTAGGCCTGCTCTTCCATCAATTTCCCCTACCTTGACAAGGAACAGCCATACAGAAGATGTCATGTCTAGATTTCAAATTCTAAAATCTCGAGATGAGCACATAAGTTCTTTGAATGTGGGAAAGGTGCAGAAAATTAGAAGCTCCTGTTGCAGTGAGATCGACATGTTGGCACCTAAAGGTAATACTGTGCATAGCCTGGGTATCTCAACTATACATCATCGCTTTGCAGATAATAAAAGCGAAGTTGATGATTTAGATGCTTCAGCACCGGGCAGACTAGATGCCCCGAGGAGTCGTGGAAACCACATAAGCTTGACCTTGACCCCTGCGAGAGAACAGTTACAGGAGAGAGTAACTGTAAAAAAAGGGGGGTTGGGAGTTGAAACGGAACCTTTCTTGCGGTTTGAAGGTGAAGGAAGGTAG
Coding sequence (CDS)
ATGAGCATGGGTTTTGCCTCTCTTGGTGTTGGAAATGGAGGATCTCCGTCGTCTTTTTCCAATTTGTCACCTTTGGCGCCGCCCTTCACTCTTGGGCGTTCCGTTACTAAACCTTTTCCGAGCCCGCCACTAGATATGACCGAACCTTCATTTGGGGTTGGGGCTGGGGCTGGGGCTGGGGCTGGGGCTGGGGTTCCCCTCAACTCTTCCCTGCACAATTGGCTCCCTTCCACCTCCAAAACCTCAGGCCTTGACTTCGTCTCCAGTTCCACCTCCGAATTTGATTGGTTCCCCTTCTCTTCTGGGTCAACATACCCCAGGTCGCAGCCTATGATGGAGCCTTCTGATAACCATGGACCTCTTTTGGGCCGTCTTACAATGTCTACAACTGACCGCTCCTTATACGGTCATTCCTCTGACGGACTAACAACTAGTATTGGTAAAGCAAAACCCTACTATCCTTCCTACGCCTCAACTTCATGTAACAAAGGTGGCCCTATGGTCCTTGTTGATCAACCAAGTTATAATTGGCCATTGCACTCGCATGTTGCTACATTCGATGTGCCCCCGTGCGCGGACCTCTCTTGGGGATCTTCAGGCTCTGAGAGATCAGTTGAAGAGGCTTCACATTCTATTGATATACCTGATCTGAATAAATGCAACGAGTTTGTGAGAGAATATCCAGACGAGGGATTGTTATTGGAGCAGAACCTTCACATGGATGCTCATTCTGCATTTCCTGGATGCCACCCCAAGACTAGGACACCGCCTTCAAATCCAGCGTCAAGTTCTCAGAACTATCAATTTCTGAAAAAGGCTCCATATCAGGAAATCTTAAGAGAGCAAGATGCTAGACTGAGTGTGGCTACTTTTTCCCTCAGACCACCTGTTGTCACTACTGATTCTTTTCTCAGGAATATCAGTCCATGTCATATTTCAGATTATGACCATGATTCCTTTGAAGGAAAACAAGGTGGCAACGACCTTTCAAATCTAAAGGAGTTTCTTCCAGTTCATTCTGATAGCAAGGAATTCTTTGGCACAGAGAACCATGGCACATGTATAGATAAAAATGATCCTATAGTTACTGAGTTCTCCTCAACCAAAATTCATGACTTACGAAGCAATATACATTCTGGTAAGGATTCACCAGACCGTACATTGAAGGCCGGAATGGGACTTTATATTCCTGATGCCAGTCCCAACTTTAGTTCGCACCTTAACCCAATTGAAACCGCCACAACAATTGAGAGTTCCTCTGAAAGTTTTGATCAGTACAACCTTGCAGCGGTAGACTCTCCTTGCTGGAAAGGAGCTCGAATTTGTCATACATCTCCATTTCAAGCTTTTGAAATTGTCACTCCGACTCGTATGAAGACCGAAGAAGTTTGCAACAGTGTGAATCTCTCATTGTCTCAAGTACCCCCTTCTACTGCCAAGGATACTGTTCATGAACCAAATGAAAGCACCATAGGCGGCATTCTGGAGAAGGGTGCAACATCTTCTCCAAAGATGCCTTCAGTTGCTGGTCCCTCCTTGCCTGCAGCACAGAAAACTAGCACTTCTGTGAAAGCAGGAGAATTTTGTTCTAAAATGGGCTGCTTCCATCCAGCTACCGGTAGCATCCATGACCCTGTAGAAGATAGTGGTGTCTCCTATTCTTCCTGTTCCATACCACTAAGTAAATATAAGCATAATTTAATGACTGGAAAAAGGATTGCAACTACAAGTTACATGAAGATGCATGCGGATGCAAGATTAAATAGTGACAACTCTTCTGAAAATGGTATGAATCATTTGTCATATGATGCCGCAAAACATATCCAGAATTTTCCTTCTGAGCTTGTGAAGGCATTTCCCAAAGAATCACTCTCAAAAATGGATATCCAGATTCTGGTTGATAAATTGCACGGTCTATCAGAATTGCTCCTTGCATATTGTTTAAATGGTTCAGCTGCATTACACTGGAAAGACGTGAAGTCTCTCAAGACTGTGATGAA
TAACCTTGATGTTTGTATAAATAGCTTTGAATCACAAGATTCTCTCTCACCTGAGCAACGAACTTCACAAAATCTTGAGCCGTTTCATCAACTTCATTCGGATTTCCAGGATGTGAGAGTGCTCAAGTCCCAATCCCAGATGACAAAGATTGAAGGAAAAAATTTGGAGTGTTTATCAAATGATGGCAATGGTGTCGAGGAAACGAATCAATACATATTGTCTATCAAGAAAGACAAAGAAGCTGCGGACTCTCTTTATCTTAGGAATGGGATTGACTCGATGAAAGAAGACAGCATGACCAAGGCTCTTAAGAAGGTTCTGAGGGAGAATTTTCATGATGACAAAGAACATCCTCAATCTCTTTTGTACAAGAATCTATGGCTTGAAGCAGAAGCTGCATTATGTGCTTCCAAATTAATAGCTCGATTTAGTATAGCAAAGTCGGAAATGGAGAAACATGAACTACCAATAGTGAGAGAACATGCCGAAAATTGGGACGAACTACTCGTTTCTGGTGTATCTCCTGGTTCAAGCACCGTTGGGAAATTGGCACCTAAGACTAAAGTTGGTTCAACTTCATTTGTTCCCGTCCAGACTTCCCCTGCCGTGAGTGTCAGTAGTCATGCTGCAGATGATGTGATTACTAGATTCCATATTCTCAAATGCCGAGAGGATGAAGCAAAGGATAGGCATGCTGGATATTCCGGACAAGACATGGTTGAAAAATCAGCACTCGACAAGGAACAAACGGCAGTCCCTTATATCAATGACATGGATTCTTCCTTCCCCACGTCGAAGGTCAATGGGGATGACTCTAGGCCTGCTCTTCCATCAATTTCCCCTACCTTGACAAGGAACAGCCATACAGAAGATGTCATGTCTAGATTTCAAATTCTAAAATCTCGAGATGAGCACATAAGTTCTTTGAATGTGGGAAAGGTGCAGAAAATTAGAAGCTCCTGTTGCAGTGAGATCGACATGTTGGCACCTAAAGGTAATACTGTGCATAGCCTGGGTATCTCAACTATACATCATCGCTTTGCAGATAATAAAAGCGAAGTTGATGATTTAGATGCTTCAGCACCGGGCAGACTAGATGCCCCGAGGAGTCGTGGAAACCACATAAGCTTGACCTTGACCCCTGCGAGAGAACAGTTACAGGAGAGAGTAACTGTAAAAAAAGGGGGGTTGGGAGTTGAAACGGAACCTTTCTTGCGGTTTGAAGGTGAAGGAAGGTAG
Protein sequence
MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAGAGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLHMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIESSSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTEDVMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNKSEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGEGR
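The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and a hypothetical helper named `translate` (not part of this page or of any database tool), the first and last codons of the CDS can be checked against the protein like this:

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
# Assumption: this locus is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks inside the sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAGCATGGGTTTTGCC"))  # first six codons of the CDS
print(translate("GAAGGTGAAGGAAGGTAG"))  # last six codons, ending in the stop codon TAG
```

The two calls print MSMGFA and EGEGR, which match the start and end of the protein sequence listed above. Whitespace is stripped first because the sequences on this page are wrapped across lines.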
Homology
BLAST of Csor.00g283210 vs. NCBI nr
Match:
KAG6576619.1 (hypothetical protein SDJN03_24193, partial [Cucurbita argyrosperma subsp. sororia] >KAG7014671.1 hypothetical protein SDJN02_22300, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 2155 bits (5585), Expect = 0.0
Identity = 1079/1079 (100.00%), Positives = 1079/1079 (100.00%), Query Frame = 0
Query: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG 60
MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG
Sbjct: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG 60
Query: 61 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 120
AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP
Sbjct: 61 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 120
Query: 121 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 180
LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH
Sbjct: 121 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 180
Query: 181 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLHM 240
SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLHM
Sbjct: 181 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLHM 240
Query: 241 DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT 300
DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT
Sbjct: 241 DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT 300
Query: 301 DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP 360
DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP
Sbjct: 301 DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP 360
Query: 361 IVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIESSS 420
IVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIESSS
Sbjct: 361 IVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIESSS 420
Query: 421 ESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK 480
ESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK
Sbjct: 421 ESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK 480
Query: 481 DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG 540
DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG
Sbjct: 481 DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG 540
Query: 541 SIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL 600
SIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL
Sbjct: 541 SIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL 600
Query: 601 SYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKDVK 660
SYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKDVK
Sbjct: 601 SYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKDVK 660
Query: 661 SLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIEGK 720
SLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIEGK
Sbjct: 661 SLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIEGK 720
Query: 721 NLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD 780
NLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD
Sbjct: 721 NLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD 780
Query: 781 DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV 840
DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV
Sbjct: 781 DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV 840
Query: 841 SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG 900
SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG
Sbjct: 841 SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG 900
Query: 901 YSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTEDVM 960
YSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTEDVM
Sbjct: 901 YSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTEDVM 960
Query: 961 SRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNKSE 1020
SRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNKSE
Sbjct: 961 SRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNKSE 1020
Query: 1021 VDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGEGR 1079
VDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGEGR
Sbjct: 1021 VDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGEGR 1079
BLAST of Csor.00g283210 vs. NCBI nr
Match:
XP_022922596.1 (uncharacterized protein LOC111430557 [Cucurbita moschata])
HSP 1 Score: 2101 bits (5444), Expect = 0.0
Identity = 1061/1083 (97.97%), Positives = 1064/1083 (98.25%), Query Frame = 0
Query: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG 60
MSMGFASLGVGNGGSPSSFSNLSPLAPPFTL RSVTKPFPSPPLDMTEPSFGVG G G G
Sbjct: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLDRSVTKPFPSPPLDMTEPSFGVGVGVGVG 60
Query: 61 --AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNH 120
AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNH
Sbjct: 61 VGAGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNH 120
Query: 121 GPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWP 180
GPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWP
Sbjct: 121 GPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWP 180
Query: 181 LHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNL 240
LHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDE LLLEQNL
Sbjct: 181 LHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEELLLEQNL 240
Query: 241 HMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVV 300
HMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVV
Sbjct: 241 HMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVV 300
Query: 301 TTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKN 360
TTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKN
Sbjct: 301 TTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKN 360
Query: 361 DPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIES 420
DPIVTEFSSTKIHDLRSNIHS KDSPD TLKAGMGLYIPDASPNFSSHLNPIETATTIES
Sbjct: 361 DPIVTEFSSTKIHDLRSNIHSDKDSPDCTLKAGMGLYIPDASPNFSSHLNPIETATTIES 420
Query: 421 SSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPST 480
SSESFD YNLAAVDSPCWKGARIC TSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPST
Sbjct: 421 SSESFDPYNLAAVDSPCWKGARICRTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPST 480
Query: 481 AKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPA 540
AKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPA
Sbjct: 481 AKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPA 540
Query: 541 TGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMN 600
TGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMN
Sbjct: 541 TGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMN 600
Query: 601 HLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKD 660
HLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSE+LLAYC NGSAALH KD
Sbjct: 601 HLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSEMLLAYCSNGSAALHRKD 660
Query: 661 VKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIE 720
VKSLKTVMNNLDVCINSF SQDSLSPEQRTSQNLE FHQLHSDFQDVRVLKSQSQMTK+E
Sbjct: 661 VKSLKTVMNNLDVCINSFGSQDSLSPEQRTSQNLETFHQLHSDFQDVRVLKSQSQMTKME 720
Query: 721 GKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENF 780
GK LECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENF
Sbjct: 721 GKYLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENF 780
Query: 781 HDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVS 840
HDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVS
Sbjct: 781 HDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVS 840
Query: 841 GVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRH 900
GVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRH
Sbjct: 841 GVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRH 900
Query: 901 AGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTED 960
AGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTED
Sbjct: 901 AGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTED 960
Query: 961 VMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNK 1020
VMSRFQILKSRDE ISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNK
Sbjct: 961 VMSRFQILKSRDERISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNK 1020
Query: 1021 SEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG-- 1079
+EVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG
Sbjct: 1021 TEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGGK 1080
BLAST of Csor.00g283210 vs. NCBI nr
Match:
XP_022984354.1 (uncharacterized protein LOC111482682 [Cucurbita maxima])
HSP 1 Score: 2034 bits (5271), Expect = 0.0
Identity = 1036/1081 (95.84%), Positives = 1047/1081 (96.85%), Query Frame = 0
Query: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG 60
MSMGFASLGVGNGGSPSSFSNLSPLAPPFTL RSV+KPFP+P LDMTEPSFGVG GAGAG
Sbjct: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLDRSVSKPFPTPLLDMTEPSFGVGVGAGAG 60
Query: 61 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 120
AG V LNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP
Sbjct: 61 AG--VLLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 120
Query: 121 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 180
LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH
Sbjct: 121 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 180
Query: 181 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLHM 240
SHVATFDVPPCADLSWGSSGSERS EEASHSIDIPDLNKCNEFVREYPDE LLLEQNLHM
Sbjct: 181 SHVATFDVPPCADLSWGSSGSERSGEEASHSIDIPDLNKCNEFVREYPDEELLLEQNLHM 240
Query: 241 DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT 300
DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT
Sbjct: 241 DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT 300
Query: 301 DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP 360
DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP
Sbjct: 301 DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP 360
Query: 361 IVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIESSS 420
IVTEFSSTKIHD+RSNIHS KDSPD TLKAGMGLYIPDASPNFSS +TATTIESSS
Sbjct: 361 IVTEFSSTKIHDVRSNIHSDKDSPDCTLKAGMGLYIPDASPNFSS-----QTATTIESSS 420
Query: 421 ESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK 480
ESFDQYNLAAVDSPCWKGARIC TSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK
Sbjct: 421 ESFDQYNLAAVDSPCWKGARICRTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK 480
Query: 481 DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG 540
DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG
Sbjct: 481 DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG 540
Query: 541 SIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL 600
SIHDPVEDSGVSYSSCSIP SKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL
Sbjct: 541 SIHDPVEDSGVSYSSCSIPQSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL 600
Query: 601 SYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKDVK 660
SYDAAKHIQNFPSELVKAF +ESLSKMDIQILVDKLH LSELLLAYC NGSAALH KDVK
Sbjct: 601 SYDAAKHIQNFPSELVKAFHRESLSKMDIQILVDKLHSLSELLLAYCSNGSAALHRKDVK 660
Query: 661 SLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIEGK 720
SLKTVMNNLDVCINSF SQDSLSPEQR+SQNLE FHQLHS+FQDVRVLKSQSQ TKIEG+
Sbjct: 661 SLKTVMNNLDVCINSFGSQDSLSPEQRSSQNLEQFHQLHSEFQDVRVLKSQSQTTKIEGE 720
Query: 721 NLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD 780
+LECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD
Sbjct: 721 SLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD 780
Query: 781 DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV 840
DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV
Sbjct: 781 DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV 840
Query: 841 SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG 900
SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG
Sbjct: 841 SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG 900
Query: 901 YSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTEDVM 960
YSGQDMVEK ALDKEQTAVPYINDMDSSFPTS+VNGDDSRPALPSISPTLTR+ HTEDVM
Sbjct: 901 YSGQDMVEKLALDKEQTAVPYINDMDSSFPTSEVNGDDSRPALPSISPTLTRSCHTEDVM 960
Query: 961 SRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNKSE 1020
SRFQILKSRDE ISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGIS IHHR ADNKSE
Sbjct: 961 SRFQILKSRDERISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGIS-IHHRVADNKSE 1020
Query: 1021 VDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG--EG 1079
VDDLDAS PGRLD RSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG EG
Sbjct: 1021 VDDLDASVPGRLDVLRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGGKEG 1073
BLAST of Csor.00g283210 vs. NCBI nr
Match:
XP_022968240.1 (uncharacterized protein LOC111467537 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1467 bits (3797), Expect = 0.0
Identity = 788/1100 (71.64%), Positives = 876/1100 (79.64%), Query Frame = 0
Query: 3 MGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPS--FGVGAGAGAG 62
MGFA GVGNGGS SSFSNLSPLAPPFTL RSVTKP +P +D+TEP FGVG G
Sbjct: 1 MGFAPFGVGNGGSSSSFSNLSPLAPPFTLDRSVTKPLSTPLVDITEPEPEFGVGGG---- 60
Query: 63 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 122
VPLN HNWLPSTSKTS DF SS EFDW PFS+GS +PRSQ MM+PS NHGP
Sbjct: 61 ----VPLNPLQHNWLPSTSKTSAHDFFSS---EFDWLPFSTGSGFPRSQAMMDPSHNHGP 120
Query: 123 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 182
LLGRLT+++TD S Y SSDG+TTS+GK KPYYPSYA+TS NK GP V+VDQPSY+W +
Sbjct: 121 LLGRLTITSTDLSSYHGSSDGVTTSMGKPKPYYPSYAATSSNKAGPTVIVDQPSYDWLSN 180
Query: 183 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLH- 242
SHV TF+ PPC D S GSS SERS EEASHS+D+ DLNKCNEFVREYP+E L E+NL+
Sbjct: 181 SHVVTFEGPPCTDFSRGSSASERSTEEASHSVDVLDLNKCNEFVREYPNEELFSERNLNI 240
Query: 243 -----MDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSV------ 302
MDAHSAFPGCHPKTRTPPSNPASSSQN FLKK PY EI REQD+RL+V
Sbjct: 241 ERISNMDAHSAFPGCHPKTRTPPSNPASSSQNSPFLKKPPYLEISREQDSRLNVTASIVN 300
Query: 303 --ATFSLRPPVVTTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFF 362
ATFS+RP VV+TDSF N+ CH+SDY +DSFE KQGGN+LSNLKE LPV+S+SKEF
Sbjct: 301 SPATFSIRPSVVSTDSFAWNVGSCHVSDYGYDSFEAKQGGNNLSNLKELLPVNSESKEFV 360
Query: 363 GTENHGTCIDKNDPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSH 422
EN+ TCIDKNDP++TE SSTKIHDLR+NIHS KDSPDR LKAGM L+IPDASP+FS
Sbjct: 361 SAENYDTCIDKNDPVITEPSSTKIHDLRNNIHSAKDSPDRRLKAGMRLHIPDASPHFSLD 420
Query: 423 LNPIETATTIESSSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNS 482
IETATT ESSSESFDQYNLAAVDSPCWKG I SPFQAFEIVTP+R K EV NS
Sbjct: 421 PKGIETATTTESSSESFDQYNLAAVDSPCWKGVPINQISPFQAFEIVTPSRTKMLEVYNS 480
Query: 483 VNLSLSQVPPSTAKDTV----HEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTS 542
VNLSLSQVPPSTA+DTV HEPNESTIG ILEKGATSSPKMPSV G SLPA QK+S S
Sbjct: 481 VNLSLSQVPPSTAEDTVKVIVHEPNESTIGSILEKGATSSPKMPSVIGSSLPAEQKSSNS 540
Query: 543 VKAGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMH 602
VKAGEFCSKMGCFHPAT S+++ D G YSSCSIP +KYKHNL++GKRI TS + H
Sbjct: 541 VKAGEFCSKMGCFHPATSSVYEAFGDGGDFYSSCSIPQNKYKHNLVSGKRIGRTSCTEKH 600
Query: 603 ADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSEL 662
ADARLNSDNSS NG+NHLS+DAA+H+QN PSELVKAF ES SK+DI+ILVD LH LS L
Sbjct: 601 ADARLNSDNSSGNGLNHLSFDAAEHVQNLPSELVKAFHGESTSKVDIRILVDTLHSLSGL 660
Query: 663 LLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDF 722
LLA+C NG ALH KDV SL+TVMNNLDVCINS SQ SLSPEQRTSQ+LE FHQLH+ F
Sbjct: 661 LLAHCSNGLDALHQKDVMSLETVMNNLDVCINSVGSQGSLSPEQRTSQSLEQFHQLHAHF 720
Query: 723 QDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMK 782
QD+ VLKSQSQMTKIEG+NLECLSND NGVEETN+YILS+KKDKEAA S LRNGID MK
Sbjct: 721 QDLGVLKSQSQMTKIEGENLECLSNDRNGVEETNRYILSVKKDKEAASSHRLRNGIDLMK 780
Query: 783 EDSMTKALKKVLRENFHDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHEL 842
EDSMTKALKKVL ENFHDD+EHPQ+LLYKNLWL+AEAALCAS L ARFS AKSEMEKHE
Sbjct: 781 EDSMTKALKKVLSENFHDDEEHPQTLLYKNLWLQAEAALCASNLRARFSSAKSEMEKHES 840
Query: 843 PIVREHAENWDELLVSGVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVIT 902
P V+EHA+N D+L VSG SPGS+T+ ++A KTKVGSTSFV VQTSP VSV SHA+DDVIT
Sbjct: 841 PKVKEHAKNHDQLFVSGASPGSNTIAEVASKTKVGSTSFVSVQTSPTVSVRSHASDDVIT 900
Query: 903 RFHILKCREDEAKDRHAGYSG----------QDMVEKSALDKEQTAVPYINDMDSSFPTS 962
RF+ILK R+DEAK R A G Q MVEKSAL+KEQTA P++ DMDSSFP+S
Sbjct: 901 RFNILKHRDDEAKLRDAENLGTLSDFEVSVKQGMVEKSALEKEQTAGPHVKDMDSSFPSS 960
Query: 963 KVNGDDSRPALPSISPTLTRNSHTEDVMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEI 1022
KV G+DS PA S S LTR SH +DVMSRFQILKSRDEH+SSLNVGKVQK+ SS CSEI
Sbjct: 961 KVKGNDSGPAPQSTSLILTRTSHIDDVMSRFQILKSRDEHVSSLNVGKVQKVTSSHCSEI 1020
Query: 1023 DMLAPKGNTVHSLGISTIHHRFADNKSEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQ 1072
+ AP+G IS IHH ADNK+EVDDLD S GRLD RSRGN+IS T PA E
Sbjct: 1021 EKAAPEGV------ISMIHHPIADNKNEVDDLDGSVVGRLDVLRSRGNNISPT--PAGEN 1079
BLAST of Csor.00g283210 vs. NCBI nr
Match:
XP_038891692.1 (uncharacterized protein LOC120081084 [Benincasa hispida])
HSP 1 Score: 1464 bits (3790), Expect = 0.0
Identity = 779/1124 (69.31%), Positives = 886/1124 (78.83%), Query Frame = 0
Query: 3 MGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAGAG 62
MGF+S VGNG S SSFSNLS LAPPFTL RSVT+PF SP +DMTEPSFGVGAG
Sbjct: 1 MGFSS--VGNGASSSSFSNLSHLAPPFTLDRSVTRPFSSPLVDMTEPSFGVGAG------ 60
Query: 63 AGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGPLL 122
VPLNS+LHNWLPST+KTSGLDF SSST EFDW F++GS YPR QPMMEPSD H PLL
Sbjct: 61 --VPLNSTLHNWLPSTTKTSGLDFFSSSTPEFDWLSFATGSKYPRLQPMMEPSDKHEPLL 120
Query: 123 GRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLHSH 182
G LT+S+TD S+ G SS GLTTSIGK KPYYPSYASTSCNK P+V+ DQP+Y+WP +SH
Sbjct: 121 GSLTVSSTDPSVSGESSAGLTTSIGKEKPYYPSYASTSCNKAVPVVIFDQPTYDWPSNSH 180
Query: 183 VATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLH--- 242
V TF VPPC + S GSSG ERSVEE+SHS D+ DLN+CNEFVRE P E LLL+QNL+
Sbjct: 181 VVTFSVPPCTNFSHGSSGFERSVEESSHSTDMLDLNRCNEFVRECPSEELLLKQNLNIEQ 240
Query: 243 --------MDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVAT- 302
MDAHSAFPGCHPKTRTPPSNPAS N+Q+L+KAPYQEILREQDARLSV T
Sbjct: 241 ANDLRISDMDAHSAFPGCHPKTRTPPSNPASRFHNFQYLRKAPYQEILREQDARLSVTTS 300
Query: 303 --------FSLRPPVVTTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDS 362
FS+RPPV+ TDSF+ NI PCH+S SFE KQGG+DLSNLK+FLPV+SDS
Sbjct: 301 IVNPPNTNFSIRPPVLDTDSFVCNIGPCHMSGNGDQSFEAKQGGDDLSNLKKFLPVNSDS 360
Query: 363 KEFFGTENHGTCIDKNDPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPN 422
+EFF TENHGTC+DK+DPIVTEFSS K HDLR+NIH +DSPD TLKAGMGL++PD+SP
Sbjct: 361 QEFFRTENHGTCLDKHDPIVTEFSSIKTHDLRNNIHYAEDSPDHTLKAGMGLHVPDSSPQ 420
Query: 423 FSSHLNPIETATTIESSSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEE 482
FS L + ATTIESSSE+FDQYNLAAVDSPCWKGA IC SPFQAFE TP+ +K E
Sbjct: 421 FSLDLKT-KIATTIESSSENFDQYNLAAVDSPCWKGAPICRVSPFQAFETSTPSSVKMVE 480
Query: 483 VCNSVNLSLSQVPPSTAKDTV----HEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQK 542
V N VNLSLSQV PS+A++TV HEP+ESTIG ++EKGATS+ +MPS+AG SL A QK
Sbjct: 481 VNNDVNLSLSQVLPSSAENTVEVFVHEPSESTIGSVVEKGATSTTQMPSIAGSSLLATQK 540
Query: 543 TSTSVKAGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSY 602
TS SVKAGEF SKMG FHP TG IH+P ED G SYSSCS+P SKYK+NLM+GK+IA TSY
Sbjct: 541 TSNSVKAGEFYSKMGGFHPTTGCIHEPGEDVGGSYSSCSMPQSKYKNNLMSGKKIAPTSY 600
Query: 603 MKMHADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHG 662
MK HADA LN D+S ENG+NHL YD AKH+QN P ELVK F ES+SK+DI+ILVD LH
Sbjct: 601 MKKHADAELNCDDSFENGLNHLPYDVAKHVQNLPFELVKLFLGESISKIDIRILVDTLHS 660
Query: 663 LSELLLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQL 722
LSELLL LNG AALH KDVKSL+ V+NNLDVC+ S SQ SLSPEQRTSQNLE FHQL
Sbjct: 661 LSELLLVCHLNGLAALHQKDVKSLEAVINNLDVCLKSVGSQGSLSPEQRTSQNLEQFHQL 720
Query: 723 HSDFQDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGI 782
H D V VLKSQ QMTKIEG NLECLSNDGN V++ NQY+LS+KKD+EAADSLYLRN I
Sbjct: 721 HLD---VGVLKSQLQMTKIEGGNLECLSNDGNDVDKKNQYMLSVKKDREAADSLYLRNRI 780
Query: 783 DSMKEDSMTKALKKVLRENFHDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEME 842
DS+KEDSMTKALKK + ENFHDD+EHPQ+LLYKNLWLEAEAALCA+ L AR + A+SEME
Sbjct: 781 DSVKEDSMTKALKKAMSENFHDDEEHPQTLLYKNLWLEAEAALCANNLRARLNSARSEME 840
Query: 843 KHELPIVREHAENWDELLVSGVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAAD 902
KHE P VRE+ +N DE L+S SPGS+T+G LA KTKVGSTSFV QTSPAVSV+SHAAD
Sbjct: 841 KHESPKVRENVKNLDEALISDASPGSNTIGTLASKTKVGSTSFVSFQTSPAVSVTSHAAD 900
Query: 903 DVITRFHILKCREDEAKDRHAG----------YSGQDMVEKSALDKEQTAVPYINDMDSS 962
DVITRFHILKCRED + R G +D+ EKSALDK+QTAVPYI DMDSS
Sbjct: 901 DVITRFHILKCREDVVRHRDVGNLVTLSDFEVLGKKDVAEKSALDKKQTAVPYIKDMDSS 960
Query: 963 FPTSKVNGDDSRPALPSISPTLTRNSHTEDVMSRFQILKSRDEHISSLNVGKVQKIRSSC 1022
FPTSKV G+DS PA+PSISPTLTR+SH +DVMSRFQILKSR E +SSL+ GKVQKI +S
Sbjct: 961 FPTSKVKGNDSAPAVPSISPTLTRSSHVDDVMSRFQILKSRGERLSSLDTGKVQKITNSG 1020
Query: 1023 CSEIDMLAPKGNTVHSLGISTIHHRFADNKSEVDDLDASAPGRLDAPRSRGNHISLT--- 1075
C+EIDMLA +G+T+H LGIST+HH AD+K+EVD+LDAS R D R RGN+ISLT
Sbjct: 1021 CNEIDMLAHEGDTMHGLGISTMHHPIADDKNEVDNLDASVLARQDVLRRRGNNISLTPAG 1080
BLAST of Csor.00g283210 vs. ExPASy TrEMBL
Match:
A0A6J1E4K1 (uncharacterized protein LOC111430557 OS=Cucurbita moschata OX=3662 GN=LOC111430557 PE=4 SV=1)
HSP 1 Score: 2101 bits (5444), Expect = 0.0
Identity = 1061/1083 (97.97%), Positives = 1064/1083 (98.25%), Query Frame = 0
Query: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG 60
MSMGFASLGVGNGGSPSSFSNLSPLAPPFTL RSVTKPFPSPPLDMTEPSFGVG G G G
Sbjct: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLDRSVTKPFPSPPLDMTEPSFGVGVGVGVG 60
Query: 61 --AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNH 120
AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNH
Sbjct: 61 VGAGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNH 120
Query: 121 GPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWP 180
GPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWP
Sbjct: 121 GPLLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWP 180
Query: 181 LHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNL 240
LHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDE LLLEQNL
Sbjct: 181 LHSHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEELLLEQNL 240
Query: 241 HMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVV 300
HMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVV
Sbjct: 241 HMDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVV 300
Query: 301 TTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKN 360
TTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKN
Sbjct: 301 TTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKN 360
Query: 361 DPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIES 420
DPIVTEFSSTKIHDLRSNIHS KDSPD TLKAGMGLYIPDASPNFSSHLNPIETATTIES
Sbjct: 361 DPIVTEFSSTKIHDLRSNIHSDKDSPDCTLKAGMGLYIPDASPNFSSHLNPIETATTIES 420
Query: 421 SSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPST 480
SSESFD YNLAAVDSPCWKGARIC TSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPST
Sbjct: 421 SSESFDPYNLAAVDSPCWKGARICRTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPST 480
Query: 481 AKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPA 540
AKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPA
Sbjct: 481 AKDTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPA 540
Query: 541 TGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMN 600
TGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMN
Sbjct: 541 TGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMN 600
Query: 601 HLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKD 660
HLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSE+LLAYC NGSAALH KD
Sbjct: 601 HLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSEMLLAYCSNGSAALHRKD 660
Query: 661 VKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIE 720
VKSLKTVMNNLDVCINSF SQDSLSPEQRTSQNLE FHQLHSDFQDVRVLKSQSQMTK+E
Sbjct: 661 VKSLKTVMNNLDVCINSFGSQDSLSPEQRTSQNLETFHQLHSDFQDVRVLKSQSQMTKME 720
Query: 721 GKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENF 780
GK LECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENF
Sbjct: 721 GKYLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENF 780
Query: 781 HDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVS 840
HDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVS
Sbjct: 781 HDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVS 840
Query: 841 GVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRH 900
GVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRH
Sbjct: 841 GVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRH 900
Query: 901 AGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTED 960
AGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTED
Sbjct: 901 AGYSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTED 960
Query: 961 VMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNK 1020
VMSRFQILKSRDE ISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNK
Sbjct: 961 VMSRFQILKSRDERISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNK 1020
Query: 1021 SEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG-- 1079
+EVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG
Sbjct: 1021 TEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGGK 1080
BLAST of Csor.00g283210 vs. ExPASy TrEMBL
Match:
A0A6J1JA97 (uncharacterized protein LOC111482682 OS=Cucurbita maxima OX=3661 GN=LOC111482682 PE=4 SV=1)
HSP 1 Score: 2034 bits (5271), Expect = 0.0
Identity = 1036/1081 (95.84%), Positives = 1047/1081 (96.85%), Query Frame = 0
Query: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPSFGVGAGAGAG 60
MSMGFASLGVGNGGSPSSFSNLSPLAPPFTL RSV+KPFP+P LDMTEPSFGVG GAGAG
Sbjct: 1 MSMGFASLGVGNGGSPSSFSNLSPLAPPFTLDRSVSKPFPTPLLDMTEPSFGVGVGAGAG 60
Query: 61 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 120
AG V LNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP
Sbjct: 61 AG--VLLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 120
Query: 121 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 180
LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH
Sbjct: 121 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 180
Query: 181 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLHM 240
SHVATFDVPPCADLSWGSSGSERS EEASHSIDIPDLNKCNEFVREYPDE LLLEQNLHM
Sbjct: 181 SHVATFDVPPCADLSWGSSGSERSGEEASHSIDIPDLNKCNEFVREYPDEELLLEQNLHM 240
Query: 241 DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT 300
DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT
Sbjct: 241 DAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSVATFSLRPPVVTT 300
Query: 301 DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP 360
DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP
Sbjct: 301 DSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFFGTENHGTCIDKNDP 360
Query: 361 IVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSHLNPIETATTIESSS 420
IVTEFSSTKIHD+RSNIHS KDSPD TLKAGMGLYIPDASPNFSS +TATTIESSS
Sbjct: 361 IVTEFSSTKIHDVRSNIHSDKDSPDCTLKAGMGLYIPDASPNFSS-----QTATTIESSS 420
Query: 421 ESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK 480
ESFDQYNLAAVDSPCWKGARIC TSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK
Sbjct: 421 ESFDQYNLAAVDSPCWKGARICRTSPFQAFEIVTPTRMKTEEVCNSVNLSLSQVPPSTAK 480
Query: 481 DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG 540
DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG
Sbjct: 481 DTVHEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTSVKAGEFCSKMGCFHPATG 540
Query: 541 SIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL 600
SIHDPVEDSGVSYSSCSIP SKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL
Sbjct: 541 SIHDPVEDSGVSYSSCSIPQSKYKHNLMTGKRIATTSYMKMHADARLNSDNSSENGMNHL 600
Query: 601 SYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSELLLAYCLNGSAALHWKDVK 660
SYDAAKHIQNFPSELVKAF +ESLSKMDIQILVDKLH LSELLLAYC NGSAALH KDVK
Sbjct: 601 SYDAAKHIQNFPSELVKAFHRESLSKMDIQILVDKLHSLSELLLAYCSNGSAALHRKDVK 660
Query: 661 SLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDFQDVRVLKSQSQMTKIEGK 720
SLKTVMNNLDVCINSF SQDSLSPEQR+SQNLE FHQLHS+FQDVRVLKSQSQ TKIEG+
Sbjct: 661 SLKTVMNNLDVCINSFGSQDSLSPEQRSSQNLEQFHQLHSEFQDVRVLKSQSQTTKIEGE 720
Query: 721 NLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD 780
+LECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD
Sbjct: 721 SLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMKEDSMTKALKKVLRENFHD 780
Query: 781 DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV 840
DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV
Sbjct: 781 DKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHELPIVREHAENWDELLVSGV 840
Query: 841 SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG 900
SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG
Sbjct: 841 SPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVITRFHILKCREDEAKDRHAG 900
Query: 901 YSGQDMVEKSALDKEQTAVPYINDMDSSFPTSKVNGDDSRPALPSISPTLTRNSHTEDVM 960
YSGQDMVEK ALDKEQTAVPYINDMDSSFPTS+VNGDDSRPALPSISPTLTR+ HTEDVM
Sbjct: 901 YSGQDMVEKLALDKEQTAVPYINDMDSSFPTSEVNGDDSRPALPSISPTLTRSCHTEDVM 960
Query: 961 SRFQILKSRDEHISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGISTIHHRFADNKSE 1020
SRFQILKSRDE ISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGIS IHHR ADNKSE
Sbjct: 961 SRFQILKSRDERISSLNVGKVQKIRSSCCSEIDMLAPKGNTVHSLGIS-IHHRVADNKSE 1020
Query: 1021 VDDLDASAPGRLDAPRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG--EG 1079
VDDLDAS PGRLD RSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEG EG
Sbjct: 1021 VDDLDASVPGRLDVLRSRGNHISLTLTPAREQLQERVTVKKGGLGVETEPFLRFEGGKEG 1073
BLAST of Csor.00g283210 vs. ExPASy TrEMBL
Match:
A0A6J1HWP0 (uncharacterized protein LOC111467537 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111467537 PE=4 SV=1)
HSP 1 Score: 1467 bits (3797), Expect = 0.0
Identity = 788/1100 (71.64%), Positives = 876/1100 (79.64%), Query Frame = 0
Query: 3 MGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPS--FGVGAGAGAG 62
MGFA GVGNGGS SSFSNLSPLAPPFTL RSVTKP +P +D+TEP FGVG G
Sbjct: 1 MGFAPFGVGNGGSSSSFSNLSPLAPPFTLDRSVTKPLSTPLVDITEPEPEFGVGGG---- 60
Query: 63 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 122
VPLN HNWLPSTSKTS DF SS EFDW PFS+GS +PRSQ MM+PS NHGP
Sbjct: 61 ----VPLNPLQHNWLPSTSKTSAHDFFSS---EFDWLPFSTGSGFPRSQAMMDPSHNHGP 120
Query: 123 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 182
LLGRLT+++TD S Y SSDG+TTS+GK KPYYPSYA+TS NK GP V+VDQPSY+W +
Sbjct: 121 LLGRLTITSTDLSSYHGSSDGVTTSMGKPKPYYPSYAATSSNKAGPTVIVDQPSYDWLSN 180
Query: 183 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLH- 242
SHV TF+ PPC D S GSS SERS EEASHS+D+ DLNKCNEFVREYP+E L E+NL+
Sbjct: 181 SHVVTFEGPPCTDFSRGSSASERSTEEASHSVDVLDLNKCNEFVREYPNEELFSERNLNI 240
Query: 243 -----MDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSV------ 302
MDAHSAFPGCHPKTRTPPSNPASSSQN FLKK PY EI REQD+RL+V
Sbjct: 241 ERISNMDAHSAFPGCHPKTRTPPSNPASSSQNSPFLKKPPYLEISREQDSRLNVTASIVN 300
Query: 303 --ATFSLRPPVVTTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFF 362
ATFS+RP VV+TDSF N+ CH+SDY +DSFE KQGGN+LSNLKE LPV+S+SKEF
Sbjct: 301 SPATFSIRPSVVSTDSFAWNVGSCHVSDYGYDSFEAKQGGNNLSNLKELLPVNSESKEFV 360
Query: 363 GTENHGTCIDKNDPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSH 422
EN+ TCIDKNDP++TE SSTKIHDLR+NIHS KDSPDR LKAGM L+IPDASP+FS
Sbjct: 361 SAENYDTCIDKNDPVITEPSSTKIHDLRNNIHSAKDSPDRRLKAGMRLHIPDASPHFSLD 420
Query: 423 LNPIETATTIESSSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNS 482
IETATT ESSSESFDQYNLAAVDSPCWKG I SPFQAFEIVTP+R K EV NS
Sbjct: 421 PKGIETATTTESSSESFDQYNLAAVDSPCWKGVPINQISPFQAFEIVTPSRTKMLEVYNS 480
Query: 483 VNLSLSQVPPSTAKDTV----HEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTS 542
VNLSLSQVPPSTA+DTV HEPNESTIG ILEKGATSSPKMPSV G SLPA QK+S S
Sbjct: 481 VNLSLSQVPPSTAEDTVKVIVHEPNESTIGSILEKGATSSPKMPSVIGSSLPAEQKSSNS 540
Query: 543 VKAGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMH 602
VKAGEFCSKMGCFHPAT S+++ D G YSSCSIP +KYKHNL++GKRI TS + H
Sbjct: 541 VKAGEFCSKMGCFHPATSSVYEAFGDGGDFYSSCSIPQNKYKHNLVSGKRIGRTSCTEKH 600
Query: 603 ADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSEL 662
ADARLNSDNSS NG+NHLS+DAA+H+QN PSELVKAF ES SK+DI+ILVD LH LS L
Sbjct: 601 ADARLNSDNSSGNGLNHLSFDAAEHVQNLPSELVKAFHGESTSKVDIRILVDTLHSLSGL 660
Query: 663 LLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDF 722
LLA+C NG ALH KDV SL+TVMNNLDVCINS SQ SLSPEQRTSQ+LE FHQLH+ F
Sbjct: 661 LLAHCSNGLDALHQKDVMSLETVMNNLDVCINSVGSQGSLSPEQRTSQSLEQFHQLHAHF 720
Query: 723 QDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMK 782
QD+ VLKSQSQMTKIEG+NLECLSND NGVEETN+YILS+KKDKEAA S LRNGID MK
Sbjct: 721 QDLGVLKSQSQMTKIEGENLECLSNDRNGVEETNRYILSVKKDKEAASSHRLRNGIDLMK 780
Query: 783 EDSMTKALKKVLRENFHDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHEL 842
EDSMTKALKKVL ENFHDD+EHPQ+LLYKNLWL+AEAALCAS L ARFS AKSEMEKHE
Sbjct: 781 EDSMTKALKKVLSENFHDDEEHPQTLLYKNLWLQAEAALCASNLRARFSSAKSEMEKHES 840
Query: 843 PIVREHAENWDELLVSGVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVIT 902
P V+EHA+N D+L VSG SPGS+T+ ++A KTKVGSTSFV VQTSP VSV SHA+DDVIT
Sbjct: 841 PKVKEHAKNHDQLFVSGASPGSNTIAEVASKTKVGSTSFVSVQTSPTVSVRSHASDDVIT 900
Query: 903 RFHILKCREDEAKDRHAGYSG----------QDMVEKSALDKEQTAVPYINDMDSSFPTS 962
RF+ILK R+DEAK R A G Q MVEKSAL+KEQTA P++ DMDSSFP+S
Sbjct: 901 RFNILKHRDDEAKLRDAENLGTLSDFEVSVKQGMVEKSALEKEQTAGPHVKDMDSSFPSS 960
Query: 963 KVNGDDSRPALPSISPTLTRNSHTEDVMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEI 1022
KV G+DS PA S S LTR SH +DVMSRFQILKSRDEH+SSLNVGKVQK+ SS CSEI
Sbjct: 961 KVKGNDSGPAPQSTSLILTRTSHIDDVMSRFQILKSRDEHVSSLNVGKVQKVTSSHCSEI 1020
Query: 1023 DMLAPKGNTVHSLGISTIHHRFADNKSEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQ 1072
+ AP+G IS IHH ADNK+EVDDLD S GRLD RSRGN+IS T PA E
Sbjct: 1021 EKAAPEGV------ISMIHHPIADNKNEVDDLDGSVVGRLDVLRSRGNNISPT--PAGEN 1079
BLAST of Csor.00g283210 vs. ExPASy TrEMBL
Match:
A0A6J1HUB8 (uncharacterized protein LOC111467537 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111467537 PE=4 SV=1)
HSP 1 Score: 1458 bits (3774), Expect = 0.0
Identity = 786/1100 (71.45%), Positives = 874/1100 (79.45%), Query Frame = 0
Query: 3 MGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPS--FGVGAGAGAG 62
MGFA GVGNGGS SSFSNLSPLAPPFTL RSVTKP +P +D+TEP FGVG G
Sbjct: 1 MGFAPFGVGNGGSSSSFSNLSPLAPPFTLDRSVTKPLSTPLVDITEPEPEFGVGGG---- 60
Query: 63 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 122
VPLN HNWLPSTSKTS DF SS EFDW PFS+GS +PRSQ MM+PS NHGP
Sbjct: 61 ----VPLNPLQHNWLPSTSKTSAHDFFSS---EFDWLPFSTGSGFPRSQAMMDPSHNHGP 120
Query: 123 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 182
LLGRLT+++TD S Y SSDG+TTS+GK KPYYPSYA+TS NK GP V+VDQPSY+W +
Sbjct: 121 LLGRLTITSTDLSSYHGSSDGVTTSMGKPKPYYPSYAATSSNKAGPTVIVDQPSYDWLSN 180
Query: 183 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLH- 242
SHV TF+ PPC D S GSS SERS EEASHS+D+ DLNKCNEFVREYP+E L E+NL+
Sbjct: 181 SHVVTFEGPPCTDFSRGSSASERSTEEASHSVDVLDLNKCNEFVREYPNEELFSERNLNI 240
Query: 243 -----MDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSV------ 302
MDAHSAFPGCHPKTRTPPSNPASSSQN FLKK PY EI REQD+RL+V
Sbjct: 241 ERISNMDAHSAFPGCHPKTRTPPSNPASSSQNSPFLKKPPYLEISREQDSRLNVTASIVN 300
Query: 303 --ATFSLRPPVVTTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFF 362
ATFS+RP VV+TDSF N+ CH+SDY +DSFE KQGGN+LSNLKE LPV+S+SKEF
Sbjct: 301 SPATFSIRPSVVSTDSFAWNVGSCHVSDYGYDSFEAKQGGNNLSNLKELLPVNSESKEFV 360
Query: 363 GTENHGTCIDKNDPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSH 422
EN+ TCIDKNDP++TE SSTKIHDLR+NIHS KDSPDR LKAGM L+IPDASP+FS
Sbjct: 361 SAENYDTCIDKNDPVITEPSSTKIHDLRNNIHSAKDSPDRRLKAGMRLHIPDASPHFSLD 420
Query: 423 LNPIETATTIESSSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNS 482
IETATT ESSSESFDQYNLAAVDSPCWKG I SPFQAFEIVTP+R K EV NS
Sbjct: 421 PKGIETATTTESSSESFDQYNLAAVDSPCWKGVPINQISPFQAFEIVTPSRTKMLEVYNS 480
Query: 483 VNLSLSQVPPSTAKDTV----HEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTS 542
VNLSLSQVPPSTA+DTV HEPNESTIG ILEKGATSSPKMPSV G SLPA QK+S S
Sbjct: 481 VNLSLSQVPPSTAEDTVKVIVHEPNESTIGSILEKGATSSPKMPSVIGSSLPAEQKSSNS 540
Query: 543 VKAGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMH 602
VKAGEFCSKMGCFHPAT S+++ D G YSSCSIP +KYKHNL++GKRI TS + H
Sbjct: 541 VKAGEFCSKMGCFHPATSSVYEAFGDGGDFYSSCSIPQNKYKHNLVSGKRIGRTSCTEKH 600
Query: 603 ADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSEL 662
ADARLNSDNSS NG+NHLS+DAA+H+QN PSELVKAF ES SK+DI+ILVD LH LS L
Sbjct: 601 ADARLNSDNSSGNGLNHLSFDAAEHVQNLPSELVKAFHGESTSKVDIRILVDTLHSLSGL 660
Query: 663 LLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDF 722
LLA+C NG ALH KDV SL+TVMNNLDVCINS SQ SLSPEQRTSQ+LE FHQLH+D
Sbjct: 661 LLAHCSNGLDALHQKDVMSLETVMNNLDVCINSVGSQGSLSPEQRTSQSLEQFHQLHAD- 720
Query: 723 QDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMK 782
+ VLKSQSQMTKIEG+NLECLSND NGVEETN+YILS+KKDKEAA S LRNGID MK
Sbjct: 721 --LGVLKSQSQMTKIEGENLECLSNDRNGVEETNRYILSVKKDKEAASSHRLRNGIDLMK 780
Query: 783 EDSMTKALKKVLRENFHDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHEL 842
EDSMTKALKKVL ENFHDD+EHPQ+LLYKNLWL+AEAALCAS L ARFS AKSEMEKHE
Sbjct: 781 EDSMTKALKKVLSENFHDDEEHPQTLLYKNLWLQAEAALCASNLRARFSSAKSEMEKHES 840
Query: 843 PIVREHAENWDELLVSGVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVIT 902
P V+EHA+N D+L VSG SPGS+T+ ++A KTKVGSTSFV VQTSP VSV SHA+DDVIT
Sbjct: 841 PKVKEHAKNHDQLFVSGASPGSNTIAEVASKTKVGSTSFVSVQTSPTVSVRSHASDDVIT 900
Query: 903 RFHILKCREDEAKDRHAGYSG----------QDMVEKSALDKEQTAVPYINDMDSSFPTS 962
RF+ILK R+DEAK R A G Q MVEKSAL+KEQTA P++ DMDSSFP+S
Sbjct: 901 RFNILKHRDDEAKLRDAENLGTLSDFEVSVKQGMVEKSALEKEQTAGPHVKDMDSSFPSS 960
Query: 963 KVNGDDSRPALPSISPTLTRNSHTEDVMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEI 1022
KV G+DS PA S S LTR SH +DVMSRFQILKSRDEH+SSLNVGKVQK+ SS CSEI
Sbjct: 961 KVKGNDSGPAPQSTSLILTRTSHIDDVMSRFQILKSRDEHVSSLNVGKVQKVTSSHCSEI 1020
Query: 1023 DMLAPKGNTVHSLGISTIHHRFADNKSEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQ 1072
+ AP+G IS IHH ADNK+EVDDLD S GRLD RSRGN+IS T PA E
Sbjct: 1021 EKAAPEGV------ISMIHHPIADNKNEVDDLDGSVVGRLDVLRSRGNNISPT--PAGEN 1076
BLAST of Csor.00g283210 vs. ExPASy TrEMBL
Match:
A0A6J1HT35 (uncharacterized protein LOC111467537 isoform X3 OS=Cucurbita maxima OX=3661 GN=LOC111467537 PE=4 SV=1)
HSP 1 Score: 1411 bits (3652), Expect = 0.0
Identity = 768/1100 (69.82%), Positives = 853/1100 (77.55%), Query Frame = 0
Query: 3 MGFASLGVGNGGSPSSFSNLSPLAPPFTLGRSVTKPFPSPPLDMTEPS--FGVGAGAGAG 62
MGFA GVGNGGS SSFSNLSPLAPPFTL RSVTKP +P +D+TEP FGVG G
Sbjct: 1 MGFAPFGVGNGGSSSSFSNLSPLAPPFTLDRSVTKPLSTPLVDITEPEPEFGVGGG---- 60
Query: 63 AGAGVPLNSSLHNWLPSTSKTSGLDFVSSSTSEFDWFPFSSGSTYPRSQPMMEPSDNHGP 122
VPLN HNWLPSTSKTS DF SS EFDW PFS+GS +PRSQ MM+PS NHGP
Sbjct: 61 ----VPLNPLQHNWLPSTSKTSAHDFFSS---EFDWLPFSTGSGFPRSQAMMDPSHNHGP 120
Query: 123 LLGRLTMSTTDRSLYGHSSDGLTTSIGKAKPYYPSYASTSCNKGGPMVLVDQPSYNWPLH 182
LLGRLT+++TD S Y SSDG+TTS+GK KPYYPSYA+TS NK GP V+VDQPSY+W +
Sbjct: 121 LLGRLTITSTDLSSYHGSSDGVTTSMGKPKPYYPSYAATSSNKAGPTVIVDQPSYDWLSN 180
Query: 183 SHVATFDVPPCADLSWGSSGSERSVEEASHSIDIPDLNKCNEFVREYPDEGLLLEQNLH- 242
SHV TF+ PPC D S GSS SERS EEASHS+D+ DLNKCNEFVREYP+E L E+NL+
Sbjct: 181 SHVVTFEGPPCTDFSRGSSASERSTEEASHSVDVLDLNKCNEFVREYPNEELFSERNLNI 240
Query: 243 -----MDAHSAFPGCHPKTRTPPSNPASSSQNYQFLKKAPYQEILREQDARLSV------ 302
MDAHSAFPGCHPKTRTPPSNPASSSQN FLKK PY EI REQD+RL+V
Sbjct: 241 ERISNMDAHSAFPGCHPKTRTPPSNPASSSQNSPFLKKPPYLEISREQDSRLNVTASIVN 300
Query: 303 --ATFSLRPPVVTTDSFLRNISPCHISDYDHDSFEGKQGGNDLSNLKEFLPVHSDSKEFF 362
ATFS+RP VV+TDSF N+ CH V+S+SKEF
Sbjct: 301 SPATFSIRPSVVSTDSFAWNVGSCH--------------------------VNSESKEFV 360
Query: 363 GTENHGTCIDKNDPIVTEFSSTKIHDLRSNIHSGKDSPDRTLKAGMGLYIPDASPNFSSH 422
EN+ TCIDKNDP++TE SSTKIHDLR+NIHS KDSPDR LKAGM L+IPDASP+FS
Sbjct: 361 SAENYDTCIDKNDPVITEPSSTKIHDLRNNIHSAKDSPDRRLKAGMRLHIPDASPHFSLD 420
Query: 423 LNPIETATTIESSSESFDQYNLAAVDSPCWKGARICHTSPFQAFEIVTPTRMKTEEVCNS 482
IETATT ESSSESFDQYNLAAVDSPCWKG I SPFQAFEIVTP+R K EV NS
Sbjct: 421 PKGIETATTTESSSESFDQYNLAAVDSPCWKGVPINQISPFQAFEIVTPSRTKMLEVYNS 480
Query: 483 VNLSLSQVPPSTAKDTV----HEPNESTIGGILEKGATSSPKMPSVAGPSLPAAQKTSTS 542
VNLSLSQVPPSTA+DTV HEPNESTIG ILEKGATSSPKMPSV G SLPA QK+S S
Sbjct: 481 VNLSLSQVPPSTAEDTVKVIVHEPNESTIGSILEKGATSSPKMPSVIGSSLPAEQKSSNS 540
Query: 543 VKAGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLMTGKRIATTSYMKMH 602
VKAGEFCSKMGCFHPAT S+++ D G YSSCSIP +KYKHNL++GKRI TS + H
Sbjct: 541 VKAGEFCSKMGCFHPATSSVYEAFGDGGDFYSSCSIPQNKYKHNLVSGKRIGRTSCTEKH 600
Query: 603 ADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDKLHGLSEL 662
ADARLNSDNSS NG+NHLS+DAA+H+QN PSELVKAF ES SK+DI+ILVD LH LS L
Sbjct: 601 ADARLNSDNSSGNGLNHLSFDAAEHVQNLPSELVKAFHGESTSKVDIRILVDTLHSLSGL 660
Query: 663 LLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPFHQLHSDF 722
LLA+C NG ALH KDV SL+TVMNNLDVCINS SQ SLSPEQRTSQ+LE FHQLH+ F
Sbjct: 661 LLAHCSNGLDALHQKDVMSLETVMNNLDVCINSVGSQGSLSPEQRTSQSLEQFHQLHAHF 720
Query: 723 QDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLRNGIDSMK 782
QD+ VLKSQSQMTKIEG+NLECLSND NGVEETN+YILS+KKDKEAA S LRNGID MK
Sbjct: 721 QDLGVLKSQSQMTKIEGENLECLSNDRNGVEETNRYILSVKKDKEAASSHRLRNGIDLMK 780
Query: 783 EDSMTKALKKVLRENFHDDKEHPQSLLYKNLWLEAEAALCASKLIARFSIAKSEMEKHEL 842
EDSMTKALKKVL ENFHDD+EHPQ+LLYKNLWL+AEAALCAS L ARFS AKSEMEKHE
Sbjct: 781 EDSMTKALKKVLSENFHDDEEHPQTLLYKNLWLQAEAALCASNLRARFSSAKSEMEKHES 840
Query: 843 PIVREHAENWDELLVSGVSPGSSTVGKLAPKTKVGSTSFVPVQTSPAVSVSSHAADDVIT 902
P V+EHA+N D+L VSG SPGS+T+ ++A KTKVGSTSFV VQTSP VSV SHA+DDVIT
Sbjct: 841 PKVKEHAKNHDQLFVSGASPGSNTIAEVASKTKVGSTSFVSVQTSPTVSVRSHASDDVIT 900
Query: 903 RFHILKCREDEAKDRHAGYSG----------QDMVEKSALDKEQTAVPYINDMDSSFPTS 962
RF+ILK R+DEAK R A G Q MVEKSAL+KEQTA P++ DMDSSFP+S
Sbjct: 901 RFNILKHRDDEAKLRDAENLGTLSDFEVSVKQGMVEKSALEKEQTAGPHVKDMDSSFPSS 960
Query: 963 KVNGDDSRPALPSISPTLTRNSHTEDVMSRFQILKSRDEHISSLNVGKVQKIRSSCCSEI 1022
KV G+DS PA S S LTR SH +DVMSRFQILKSRDEH+SSLNVGKVQK+ SS CSEI
Sbjct: 961 KVKGNDSGPAPQSTSLILTRTSHIDDVMSRFQILKSRDEHVSSLNVGKVQKVTSSHCSEI 1020
Query: 1023 DMLAPKGNTVHSLGISTIHHRFADNKSEVDDLDASAPGRLDAPRSRGNHISLTLTPAREQ 1072
+ AP+G IS IHH ADNK+EVDDLD S GRLD RSRGN+IS T PA E
Sbjct: 1021 EKAAPEGV------ISMIHHPIADNKNEVDDLDGSVVGRLDVLRSRGNNISPT--PAGEN 1053
BLAST of Csor.00g283210 vs. TAIR 10
Match:
AT3G49490.1 (unknown protein; Has 722 Blast hits to 186 proteins in 64 species: Archae - 0; Bacteria - 30; Metazoa - 72; Fungi - 48; Plants - 38; Viruses - 0; Other Eukaryotes - 534 (source: NCBI BLink). )
HSP 1 Score: 82.4 bits (202), Expect = 2.4e-15
Identity = 137/581 (23.58%), Positives = 236/581 (40.62%), Query Frame = 0
Query: 463 VCNSVN-LSLSQVPPSTAKDTVHEPNESTIGGILEKGATSSPKMPSV----AGPSLPAAQ 522
V +S N +S S + +T HEP + + +G S+P M S+ GPS P +
Sbjct: 355 VADSENGVSESSLKNATEDLNCHEPRSWSHFMVTSEG-PSAPTMFSMGSESGGPSAPTMK 414
Query: 523 KTSTSVK-AGEFCSKMGCFHPATGSIHDPVEDSGVSYSSCSIPLSKYKHNLM-TGKRIAT 582
+ + + AG + P GS P ED + SC+ L K ++M K+I +
Sbjct: 415 ADNENAQSAGNYKP------PFEGSTTQPSEDVPTNQESCN--LQKQTFDIMDRDKKIRS 474
Query: 583 TSYMKMHADARLNSDNSSENGMNHLSYDAAKHIQNFPSELVKAFPKESLSKMDIQILVDK 642
+ + + +R N+D+ S + +FPS ++P+ + +V+
Sbjct: 475 LTDVGLDLSSRSNADDVSTGRSPERHFCDQ---GDFPSP--TSYPR-------VSSVVNA 534
Query: 643 LHGLSELLLAYCLNGSAALHWKDVKSLKTVMNNLDVCINSFESQDSLSPEQRTSQNLEPF 702
+H LSE+L+ C N + L + +++L V++NL C+ + + E P
Sbjct: 535 MHNLSEVLVYECFNNGSWLKLEQLENLDKVVDNLTKCLKKITDNKTTAGEATL-----PT 594
Query: 703 HQLHSDFQDVRVLKSQSQMTKIEGKNLECLSNDGNGVEETNQYILSIKKDKEAADSLYLR 762
+H + N+ L GV + Q S+K DS ++
Sbjct: 595 QSMH-----------------VTCPNVVDLHEAATGVAKDFQR-FSVK----PLDSFGVK 654
Query: 763 NGIDSMKEDSMTKALKKVLRENFHDDKE-HPQSLLYKNLWLEAEAALCASKLIARFSIAK 822
+D ++ MT+++K +L NF D +E HPQ+LLYKNLWLE EAALC++ +AR+ K
Sbjct: 655 EPVD---KNEMTQSIKNILASNFPDGEENHPQTLLYKNLWLETEAALCSTTCMARYHRIK 714
Query: 823 SEM------------------------EKHELPIVREHA--ENWDELLVSGVSPGSSTV- 882
+E+ + +PI+ +A + + ++ G + G +
Sbjct: 715 NEIGNLKLNNKEISADAVSFMQEPSLNTQKSVPIMNANADKDTPESIIKHGSNCGKNAAT 774
Query: 883 ------------------------------------GKLAPKTKVGSTSFVPVQTSPAVS 942
G L P + + + S
Sbjct: 775 MSHDASESSRINSDPVDAVLSVMSRSFTGGLEQTIRGNLRPDDATFAKIPDAIWQETSAS 834
Query: 943 VSSHAADDVITRFHILKCREDE--AKDRHAGYSGQDMVEKSALDKEQTAVPYINDMDSSF 971
+ + +VI RF ILK +E E K + S D++++ + K+Q +
Sbjct: 835 TTENKHREVIDRFQILKEQETERKLKSQKLPDSDIDVIDRFQILKQQETNRKLK--AQKC 881
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
KAG6576619.1 | 0.0 | 100.00 | hypothetical protein SDJN03_24193, partial [Cucurbita argyrosperma subsp. sorori...
XP_022922596.1 | 0.0 | 97.97 | uncharacterized protein LOC111430557 [Cucurbita moschata]
XP_022984354.1 | 0.0 | 95.84 | uncharacterized protein LOC111482682 [Cucurbita maxima]
XP_022968240.1 | 0.0 | 71.64 | uncharacterized protein LOC111467537 isoform X1 [Cucurbita maxima]
XP_038891692.1 | 0.0 | 69.31 | uncharacterized protein LOC120081084 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A0A6J1E4K1 | 0.0 | 97.97 | uncharacterized protein LOC111430557 OS=Cucurbita moschata OX=3662 GN=LOC1114305...
A0A6J1JA97 | 0.0 | 95.84 | uncharacterized protein LOC111482682 OS=Cucurbita maxima OX=3661 GN=LOC111482682...
A0A6J1HWP0 | 0.0 | 71.64 | uncharacterized protein LOC111467537 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HUB8 | 0.0 | 71.45 | uncharacterized protein LOC111467537 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1HT35 | 0.0 | 69.82 | uncharacterized protein LOC111467537 isoform X3 OS=Cucurbita maxima OX=3661 GN=L...
Match Name | E-value | Identity (%) | Description
AT3G49490.1 | 2.4e-15 | 23.58 | unknown protein; Has 722 Blast hits to 186 proteins in 64 species: Archae - 0; B...
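For downstream filtering, the pipe-delimited summary rows above can be loaded into records. The following is a minimal sketch, assuming each data row has at least the four fields Match Name, E-value, Identity, and Description, and that descriptions contain no `|` character; the function names `parse_summary_rows` and `strong_hits` are hypothetical.

```python
def parse_summary_rows(lines):
    """Turn 'name | e-value | identity | description' rows into dicts.

    Header rows (first field 'Match Name') and short rows are skipped;
    any trailing fields after the description are ignored.
    """
    hits = []
    for line in lines:
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 4 or fields[0] == "Match Name":
            continue
        name, evalue, identity, description = fields[:4]
        hits.append({
            "name": name,
            "evalue": float(evalue),      # accepts "0.0" and "2.4e-15"
            "identity": float(identity),  # percent identity as reported
            "description": description,
        })
    return hits


def strong_hits(hits, min_identity=90.0):
    """Keep hits at or above an illustrative percent-identity threshold."""
    return [h for h in hits if h["identity"] >= min_identity]


if __name__ == "__main__":
    rows = [
        "Match Name | E-value | Identity | Description",
        "A0A6J1E4K1 | 0.0 | 97.97 | uncharacterized protein LOC111430557",
        "AT3G49490.1 | 2.4e-15 | 23.58 | unknown protein",
    ]
    for hit in strong_hits(parse_summary_rows(rows)):
        print(hit["name"], hit["identity"])
```

The 90% threshold is only an example value; a suitable cutoff depends on the question being asked of the hits.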