Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
TCATCGGTTGGGTGAAACGGTTTTGTACAAAAACTGGGGTCGAAAATCGACCTAATTGGTTTATACTAAGGAAAACCGACAAGCAATGTTCAAGAATTGAATCGAGCGACCAAACCGCGGGTTGGTTTCAATCAATTTGGTTTTCTCATCAGTTTTTCCACCCCTAATGGTTCATTGGGCCTAACATTTACTTCAAATATAAATATTTGAAGTTCGACCAGAAATAATAAATATAAAATAAATATCTATATTCCATTTATATCTATATATTATTTTTGTATGATCAGTCCCTTTCGGGTAACAAGTCGGTTGGTTTTGCGAAACCTTTTAATTAAATCAACCACTGAACGTTTTTTTCTAATAAAACCGATCATTGGATGTCGGTATCGTTCAGATCGTTTTTTCGAAGGTTAATGTTTAACCATGAGAAAAACATAGATCTATCATCAACCCATGAATCTCTCACCGTCTAACTGTTATATCAACTAATTTGAGAAAAACCTGTTGAGTGTCAACTTTTAATTACACGTGCCACAAAAATCCATACCAAAATTGTATTTCAGCCATTATATGAACCCTAATTCCCAACTTCAATCTGAATAAAACTCAATCTTGACCTCAAATCCCACCTGATTCATCCATTTCTTCACTCATATTTCCTCTCAGCTTAAGCAGTGTAGTTAGTGCTTCATCAACCTCTGGATGAGACTGCAAACCAGAAGAAAATGATTGGATCTTTGATTTGGCTTCAATCCAACTTACACCAGGATCCTTTCCCACCATTTTCTCTCTTATTCTTCTCCTCATCTCAACAACACCATCCCATTTTCCTGCTGCAGCATAAAGATTTGATAGCAATATATGCGCTGCACTGTCTTCTGGATCTATTCTCAACACTTGTTTTGCTGCATTAACTCCTACTCTAAGGTTTTTTTTAACAACACATGAACTTAAGAGAGTTCTCCATAGCTCTGGATCATTATTTGCAAATGGTGATTTGGTAATCATTTCCTCTGCTTCGTCCATGAACCCAGCTCCACTTAACAAGCTTACCATACAAGAATAGTGCTTAGAATTCGGTATAATGTTGCATTCTTTCATATAGTTCCATAAAAACTGCCCTATTTCAACTGAGTTACTGTGATTACAAGCAGAAAGTAAGGAAAGAAATGTTACTTGATCTGGTTTGACACCATTGTTTTGAAGGTTGAAGAAGAGGTTCAATGCTTGTTCCATATTCCCATGGTGGCTGTACCCACCAAGCATGGAGTTCCAACACTTCAAATCTGGACATGGGACTTGAGAAAATATCAATTGAGCTGAACCCAAATCACCATTTTTAGCATACATGTTGATTAGACTCCCTAAGACATAGATTTCAGCTTCACTCCCTGTTTTTATTGCTAGAGAATGAAAAATCTCCCCTTGTTTTAGAGTGGCAAGATCAGCACATGAACTCAAAGCCAAACTTAGTGAAAAACTGTCAAGTTCATGGCCATTTTGATGCATTTGATGAAAGCATTTGATTGCTTTTTCCCCTTCACCAATTCTAGAATACCCACTGATCATTTCTGTCCATAGAACAACATCCTTCACTGCAACAGTCACAAAAACCCTCGCAGCAGCTTGAGATTCACCATTTCTAAATAACATGGATACAATAACACTGCTTATAAACACACTGCCCTCAAATCCATCCTTTATAACTTGAGCAATAAAAGACATTCCACTCAAAAGGTTGTCAATGGTGGAAATTACAGCTGCATATGTATAATCATCTGGTTTGGTAAGCGATGATTTCTTCAATTGTTGAAATAGCTTCATGGCCTTCTCGTCTTCTTCATTCTCAGAACATCCTGAGATGATCGTGTTCCAAGCAACCAAATCTGGGTTTTCATTTCTGTTAAAAATGCAAAATGCTGTATGGATATCCCCACAATTGCAGTACAAATCCAACAGAACATTTTGCAAAGTTCTATCAATAATGGCGTTTGAGGTGA
TGATACGGCCATGGATGAGGCGGCCAAACAGGTAGTCTCCATTTCTACAACATATATTCAAAATCATTGCATAGGTGAATTGAGTAGGAATTAGACCAATCCCCAGCATTTGATTGAACAAACGAAGCGCCTCGTTTAATTTATCGTGCTTCAAATTTCCAAAGATCATCGTATTCCAAGTCACTACATCCTTATCGATCGTCCACCGGAAAACTTTTCCAGCTGATTCCAAATCCAAGCAATGTGAGTAAGTCCCAATTAGTGCGGTTTGAACACGTACATCATGAACAAATCCACATTTCACTACTTGGGCATGAATTAAAGAGCTCCAAAATGGATCTTCAGTGAAAGAAGCTGCCTGCAATAAGCTCGTTATAGTAAAACTATTGGGTTTGAGGAATTCAAGCTCCATTTGTGAAAGCAGGTTGAAAGCCAATGGAGCATGACCGTGAGACCGAGAATAGGCTGCGATCAGAGCATTAAACGAAACGAGATTTCTTTGCGGCATTTTCTCGAACACTTTCTGTGATTCCCAAATTGCACCACATCGTGCGTACATGGATAAGATGTTGTTGCAGACGTAAGGAGATAGAGACGAGGCGGTGGCTATGGAGGTGAGAATCAGAGCGTGGAGCTGACGCGCCGCCTTCAGCGAGGTTACGGAAATGCATTTTTGGACAAGCGCTGATAAAGAAAGCGCTTCCGTCATGTGAAGAGGAACTCTTTTATTTTGGGTTTCAAAAAATGAACGAAAGGTTTCTCTTTTATACGAAATATCATTTTCACCCTTTAGAAAGACCCCCGGAGCCAATATCATTTCGTCTCACAATTCGCCATGGAAACTGAAAATTTCGAAGCTTCAGTAGCTGAACTCATCGACGGTGCCTGAGAATGACGCTTGGCGGTGAGAAGTGCTTGCGGCTTAGCCGGCTCGTCGATCATTGTCTTCATCCTTTTACAGTAAGGTTTCGCTGTTCTCTATTGTTTCCATTTCTTTTTCTCGATTACATATTGCGCGCCAAGTTTTCTGTGATTATCATTGAAGGTTCATGTTTTTCACTTTTGGTGAATAGGAAGAAGATGGGGTGGTGTTGGAGAGCAAAGGGAAGGAAAAGGAACTGCTTATTGCGCTCTCTCATGTTTGCTTCGCTCTTCTTTACTCTCTTGGTTCCTGTGATTCTGTTGTTGCTATTGGATTTGGAAATCTTCAGTATTTTACATTAGTTCGTTGCCTCAGTTTGTTGACTGGTCGTTCATTAGGTTGTAACTGAAGTTCAGCGCTGGGTTCGAGAAATTGATGGCGACTCTGATAATGTTAGTCTCTCTCACCTCTCAGACATTTGTCCTGTTCAGTAACACAAAATTCTTCATTGATATAGTATTGTCGATCAGGAAGGGATTCAAATGCCGAATTCTCATGAGAAAAGAAGTGATGAGCAGGACCAATCTTTGGAGAGCCACCATTATATGACGAAGATTGTTTCTGAATTGGTAATTTCTTACAATCAAGGACTTAGAGAGGAAGATCACAAAATTTTGGAGTTCTCTCTTCTGAAATATTTGTTCTTGCATTCTATTAACTATATACTTCTTCCGTAAATTAAACAAGGAAGTTTTTTATACATCTTTTTATTTCTCTGTACTGTGCATGCATAATGTATTTTACATTTCCCTTTGTGTGCAGGTACCTTTACTTGCTTTTGAGAATAAGTATGTCAAACATCTTGTGGGCAATGTACTAACAGCAGTTACAAAATTTATATTTCTAACTGTATGTAACTCTACTCATGTTTGAAATTTTAGTTTATCGCTATCACATAATTTTGGGTTAATAATAAACACATTTGATTATGCAGGGAAACGCAAGCGACTGGTGTGAATTAGTACACTCGTTATGTTTTAGCATGGAGTTAGTGCTTGCCAGAATCATATCTTCTCCTGCACCTTCGATCACTGGGTCTGAAAATTTAGACTTTTATTTATCAATCTTACAACCTAAG
CTGAAAAATGCCAATTTTTCCACGGTTGCTGGCCTTCTCCAAGTTTTGCGAAATACCTTGAAATTCTTGAAGCAAGAGCAGAGTGATCTTATTGGAGAGTTGTTTGATTCTGTTAATTCTTGTCTTTCAAAGATTCCTTGGGATTTATTAGGTAGGATCCTTACGGAGAAAATATGTAACATTGTAGAAGTCCAGAGCAACGATGATGCGTGTTCTGATAATTTGCATCAGAGGCAAGGATTAAAATTTCTGTTCCTAGGAAATTTTGTTCAATTTCTCTGTTCCTTGGCTGAGCCAAGTGACTTTGAGGAAGCTTCATGTGGTTCATTTAAGAGTCACCCCCTACTTGGCACAATCATCAACCTGATTCCTAACCTTTTTGATTGGTGCCTTAACAACCAAGTAGATCACTTTGATAGGTGCTTGTCCCGATATTTCAGTCACAAGCTGCTGGTATAGATCTACTCTTTTTCTCCCCCTCCTCAGAGTATACCTTAGATTAGCCTTCTTGAGATAAGGCAAAAATATAATAGGAATACCTGGTGGCTGGCTAGATTGTTCTATATGGTTTTTATGGTGCCTCAATTATTTATTTGGTTGGATGTTTTTGAGAAGCTTGATTCTCATAAGTGACAATGGCATCTACTTCCAACATCAAAGAACAGCGTAGATAGTGAAAGTTGAATTCTATTTAAAGAAATAGGTGCCTGGAACATATGTTTTATTGGAAGTTTTCCACTTTACATTTTTATTATAATTCTGATTCTTTTTTACTTTTAAACCCTATCAAGTTAAATCTATACCTTTAAATTAACTTTTGTATCTGGTTTAGTAATCTCCAATGTTTGTGCTAAAAGAATTTTAATTGGACAGAGGGTTTTGAGATCGATTTCAATTTCAATTGTGCTGTGAATGTGTCCGTACCATTTATGTGTTACCGACACTGTACATGTTATATAACTACCCTGTTTCTGCTTCCACAACTTTGGGTTTCATGGAAGGGTAGTTCAAGTGTCTGAGTGACGTTCTTCTAGTTGTCACTCAAATTTTTGCAATCCTGGATTGTACGTTTTTGTATGTGAACGATAGCTTGACTCTTGATCTTAGTAAAGTTCTACTTTTTATGAGGAAGAATGTGTTTCTTCTAGTATAACTTACTCCCCTCTTGGAAAGACATCTTTAGCTTTTTGATCCCTGCCCCACCCCACCTCTTCTTCCTCATAATATTTACATAGATACACTAATTATCATATCTAAGCTTAATAATTTATTTCTGGGTGCTAATTGCATTGTGTGAATTTGATTTACCTGCCAAAATCTTGGGCATCATTTGATAACCATTTGATTTTTGGCACTTGAAAATCAAGCTTATAAACACTACCAGGACTTATTTGTTTTGTTATCTACTTTTTATGAGTTTTAAAAAACTAAACGAAGTTTTGGAAACTAGAAAAACTAGCTTTTAAAAAGTTAATTTTGTTTATGAAATTTGATTGAAAATACAATTGTTTAGTTAGGTATAATGAAAACAATGATTAAGAAGTTGTCCGAAAACAAGTACAATTAACCGAAAGCAACAAATGATTAACAAACAGGATCTTCAGTTTTAAAAGATTGTTGATGATATGCTGCACTGCTCTTGAGTCATACAAGGTTTAATTCCTAGTGGTTCACTAAGTTCAGTAATTCTTCATTTTTCAGATATTAATGATCAGGCTTAGTTTCCATTGCCATCTTCAGTGTTCCACTCTTGTTCTATGGTTGCAACTTTGCAGAAATTGTTTCCAAAACCTCTTGTTGCTTCCAAAGCTTGAGCTGGAATCTACCGCTGATACTTCCCTTGAAGATTCTCCATTGATTGTGAGCTATTTTGGTGACAAACGCAGTCCATGTTCCTTGCATCTGCGAAGACTAGCTGTTTTTCTATTCCTCAGGTGTTCCTTAAGCTTCATATGTAAACAACCAACTGAAAAGTGTGATCCGTCTATAGCTATAAAATC
TCAGTTGATCTACACTACAACATTGGAAAGTAAATGTGATGACTGCACTTGTAGCAAGAAAGGTGTACTGGAGCTTTATAAGTGGCTTCTGGGGAACCTTCCAACAAATATTTTTCTGGATACTAATATGTATGCAAAAAACTGCACCAAGTTCGCATCATCATTTCTCCAGCTATATATGCACGAGGTTTGTACACTTTCTTCTCTGGTTTTCACTATGTCATATAAGGTCTTAGAATTATTTTGTGTTTTATCTTTTATTTTATTGACGCACATTTTCATTTTTTTGCATTGTCCAAACGATTGGGCTTTATGGTGCTCCTGCTTTGTACATCTTTCTCTCTTGATGTTTCTTATCAAACAATGCTAGTATGTTTTAAGCTCTATCAAACTAGAGCTAAAGTAATGTTTGAAAAGAAAAACATATTTAAATAACATACATTTAGGCTTATATCTAATAATTTTATTGTCTTGAACTCTAGGATGATTTATTATTCAAAGTGCTGTTGCAACTTCTTCGGCTGCCTTCTCATACAGAGCCATGGTTAGTAATAATCAGTTTGGTTTTACATATATTTTGATTAACTACCACTTGCAGCATATTTAAGAAGATCATTGCTTGTATTTAGTTCTAGTGAAGGACCATCCCAGGAGGTGAAGGAAGTTATACTTTTTCATGTTTCAAACATTTTTGATCCTCAGCACATGTTCCACATTTTTCTTAAGGAGGCAAGTAACATTACTCCAGCCAGGCGATACCTCCACCTAAGTTTCAATTCCTTCATTCCTCTATTTGCTTTGACTGCTTCCATTTTCCTTCCTAGAATTCTAGACGTTGTTTGACTAGTGTCTTTATGTGCAGTTAAACTATGATCATGAAATGCTCTTAGATTACCTCATGTCAAAAGATGCAGGAATATATTGTTTGGAATACCTACTAAGGTATGCATTGTTTTAAAATAATAACAATGATTTTCTTGGGTGTGCCCTACTGCTTTGTCACTTTTTAAAACTGAATTCAGAAGTACAGTTGACATTTAATTACCTTGGAGTCCACAACTTCCATGCTCCTTGTCTGCAATATGGTCTAAACTTTTGTTTTCTGGTAACTTTTTATTATGTGTGGGGCAGGTGAGTGGTAATTGGTGCCTTAGGATATGAGCATCGACCTTCCCTTCCTGATACCTCAGATTTTTCTTTTTGTAGATGCCTGCATATAATAAATGATTCCAGGCATGCGCTAGGGGATTCATCAACAATTTTGGATATCTTAACTGACTCTTCTGGCAAGAGAAGAAAGGTTATGCTGAACAGCTCAACTATTTCAGAAGAGCGATTGTCTGGTTCACTCAATCAAAGCAATGAAACACTTCCATCCTTTGAGGACACTGGAAATTATGATTATGGCTACAAGCCTCAAAGAGTTGGGGTAGAATCCCTGAAAAAATCTAAAAATTGTTTGCACTCGTTGAAAACATCCTTGGAAAATCTTCACAGAGAGAATCTCTTTCCATACAATCCCAAAGTGCTCATAAAACGGTATGCATGTCTGCACGATGCTTTATTGAGTAAGATTTTAACTCTGGTTTTAATAATTTATCAAAAAATTCTCTGTTTGTGTTGAGTAATTTACAACATCTCCTTTGATGTGTCCTAGAAATCTGTGAATATTTTATCTAAAGGCTATGTTAACAAACAAATTTAGCAAAATAATAGGCACTCTTGCCCATTACTTATGAAGGATTCTCACAAGTTCATTAGAAATTTTGTACATGAGGCATCGAGAAAGTTTTGTTTATTCATGTCGGATAGGTGTTCCATGCCAAATAAAATTATTTTTTTCCTAATTAGAGAACCTTTAAGTATGTCAAGAGTCTTCTTCAGAGTTTGGTTGGATCTATGATGGAATGAATGAACCTTCGGGGGATTTAAGAAAAGAAGTCACTTCTCTGCCGGGAAGAATTAGGAATTAAGGTAAAAGGAACAAGAAAAAGCTCGTA
TTTAGGTTGAATATCAGTTGGTTGGAGTCACTTATTAAAGAGTACAAACTGAAAACCAACCTACTGGCCTCACTTACTTGGAAACCAGAGAAACTGAAATTAACCTATTGACTAGTGGTATGGTTTGTTTAATTATAACTGGAATTGAGCTTTCTTTTAACTGGAGTAGTATGATTGTCTTCTTTTTACATGGCAGTTTGACGAAATTTTTGGAGCTTCCCATGGAGATTAAGTAATTAAGAGTCGTCCAGCCCCATGCAGAGGTCCTCTTCTTGAAGCATGGAGCTCCCATTTTGATTTAACCCTCAAGCGTGGAGATCCTTTCAATCTAAAAGCTTCACTTTCGAGTTCTTAAAATAAGACTACTAATGGTTTGATCAATTGACTTATCTTCAAGTTATCTCCTTCCTAGACGGGTAAATAGATTGACCTTAAAACAGCAACGCTTAATTACTATTGCTATAAAACAAGCTCGTATTTTATCTTTGTTACCTTTTCTTAATAATGATAAAAATGATAAACAATTTGAAAGAAGCGAGTCGACTCCTAGAACTATTGGTCTTAGAACCAGGAATAAATAGGCTTAGCTTACTCTTTAATTGAATTAAAATTCGGAGCTTATTTGAGAGTGAATCATCAATTGAGGTAATTCAAATGTATTGTATTTTTCAATTCAGAAATGCTTTTGCTCCCTTGGCTTACAGTGCGTTTGGATTGACTAAGTGTTTATAAACATTTTTTCACACTTAAAAAACTAATCCAAAATGTGCTATCTAATCCTGCTTAATTTGGGATTGTGGTGGGTTTTCCCATACCTGTAATGAGTCAACCCAGCAGTATACACCAACACACAGATATTTCTATACAGTAACTCAATACACAAATATCAAGAGGAACAGTTATACGATTTATAACTCTTAACACAAAAACAACAAGCAGTCAAAGTATGGTTCCAGCCAAATAAGAGGGTTGTTCCTCTCCCAATGAAACCCAGTTCATCAAAAAACAACTCATTACAGCCAACGGACTATAGGGCTTTGAATCTATGATTTCACGTTCTTATTTGGCTGTCTTGAAGGCTTTGAAATCAATAGTAGAATTTGGTGTTGATGTTAATTTGAATATGAAATGAACGAAAACGAAGTCAAGTTGCATCAACGGGATCCAAAGGAGCAAAAGAGGCTTATATGGCCTCTGCCTTGCAGCATCAGGGCACTCTGAAGTTTGTTTGTGCGACAATTCTCATCAATGTCACGATGTTGTCCCTAGCGCAATGATGTCGTTACATTTGTTTAAAAACATGTTTCTGCCGCCCAATTGACCTATTTCTGATTCCAACTGCTTTTTGTACGTCTTCCTCTCTACATTTTGAGTTCTTTTCATGAGTTTTATGCCTTTTATCATTTTCTCATCGTTCGGCGTGTGGATGAGAGGTTAAGATACTCGTTTCTAGCTTGGGTCACTAAGATAATTTATCTATTTCTTTGTATTTGTAACATTTGAATCAACTCCTTTATATTGTCAGTTTCTACTGAATATTGATCATGTTTTTGGATGTTTTTGGATGGGGATTGTATAATTCGTTGCACCTTTTCATATCTTCAGTTAAGAAGTTAAAGGTTAGAATAG
mRNA sequence
ATGACGCTTGGCGGTGAGAAGTGCTTGCGGCTTAGCCGGCTCGTCGATCATTGTCTTCATCCTTTTACAGAAGAAGATGGGGTGGTGTTGGAGAGCAAAGGGAAGGAAAAGGAACTGCTTATTGCGCTCTCTCATGTTGTAACTGAAGTTCAGCGCTGGGTTCGAGAAATTGATGGCGACTCTGATAATGAAGGGATTCAAATGCCGAATTCTCATGAGAAAAGAAGTGATGAGCAGGACCAATCTTTGGAGAGCCACCATTATATGACGAAGATTGTTTCTGAATTGGTACCTTTACTTGCTTTTGAGAATAAGTATGTCAAACATCTTGTGGGCAATGTACTAACAGCAGTTACAAAATTTATATTTCTAACTGGAAACGCAAGCGACTGGTGTGAATTAGTACACTCGTTATGTTTTAGCATGGAGTTAGTGCTTGCCAGAATCATATCTTCTCCTGCACCTTCGATCACTGGGTCTGAAAATTTAGACTTTTATTTATCAATCTTACAACCTAAGCTGAAAAATGCCAATTTTTCCACGGTTGCTGGCCTTCTCCAAGTTTTGCGAAATACCTTGAAATTCTTGAAGCAAGAGCAGAGTGATCTTATTGGAGAGTTGTTTGATTCTGTTAATTCTTGTCTTTCAAAGATTCCTTGGGATTTATTAGGTAGGATCCTTACGGAGAAAATATGTAACATTGTAGAAGTCCAGAGCAACGATGATGCGTGTTCTGATAATTTGCATCAGAGGCAAGGATTAAAATTTCTGTTCCTAGGAAATTTTGTTCAATTTCTCTGTTCCTTGGCTGAGCCAAGTGACTTTGAGGAAGCTTCATGTGGTTCATTTAAGAGTCACCCCCTACTTGGCACAATCATCAACCTGATTCCTAACCTTTTTGATTGGTGCCTTAACAACCAAGTAGATCACTTTGATAGGTGCTTGTCCCGATATTTCAGTCACAAGCTGCTGATATTAATGATCAGGCTTAGTTTCCATTGCCATCTTCAGTGTTCCACTCTTGTTCTATGGTTGCAACTTTGCAGAAATTGTTTCCAAAACCTCTTGTTGCTTCCAAAGCTTGAGCTGGAATCTACCGCTGATACTTCCCTTGAAGATTCTCCATTGATTGTGAGCTATTTTGGTGACAAACGCAGTCCATGTTCCTTGCATCTGCGAAGACTAGCTGTTTTTCTATTCCTCAGGTGTTCCTTAAGCTTCATATGTAAACAACCAACTGAAAAGTGTGATCCGTCTATAGCTATAAAATCTCAGTTGATCTACACTACAACATTGGAAAGTAAATGTGATGACTGCACTTGTAGCAAGAAAGGTGTACTGGAGCTTTATAAGTGGCTTCTGGGGAACCTTCCAACAAATATTTTTCTGGATACTAATATGTATGCAAAAAACTGCACCAAGTTCGCATCATCATTTCTCCAGCTATATATGCACGAGGATGATTTATTATTCAAAGTGCTGTTGCAACTTCTTCGGCTGCCTTCTCATACAGAGCCATGTTCTAGTGAAGGACCATCCCAGGAGGTGAAGGAAGTTATACTTTTTCATGTTTCAAACATTTTTGATCCTCAGCACATGTTCCACATTTTTCTTAAGGAGTTAAACTATGATCATGAAATGCTCTTAGATTACCTCATGTCAAAAGATGCAGGAATATATTGTTTGGAATACCTACTAAGATGCCTGCATATAATAAATGATTCCAGGCATGCGCTAGGGGATTCATCAACAATTTTGGATATCTTAACTGACTCTTCTGGCAAGAGAAGAAAGGTTATGCTGAACAGCTCAACTATTTCAGAAGAGCGATTGTCTGGTTCACTCAATCAAAGCAATGAAACACTTCCATCCTTTGAGGACACTGGAAATTATGATTATGGCTACAAGCCTCAAAGAGTTGGGGTAGAATCCCTGAAAAAATCTAAAAATTGTTTGCACTCGTTGAAAACATCCTTGGAAAATCTTCACAGAGAGAATCT
CTTTCCATACAATCCCAAAGTGCTCATAAAACGGTATGCATGTCTGCACGATGCTTTATTGATTTGA
Coding sequence (CDS)
ATGACGCTTGGCGGTGAGAAGTGCTTGCGGCTTAGCCGGCTCGTCGATCATTGTCTTCATCCTTTTACAGAAGAAGATGGGGTGGTGTTGGAGAGCAAAGGGAAGGAAAAGGAACTGCTTATTGCGCTCTCTCATGTTGTAACTGAAGTTCAGCGCTGGGTTCGAGAAATTGATGGCGACTCTGATAATGAAGGGATTCAAATGCCGAATTCTCATGAGAAAAGAAGTGATGAGCAGGACCAATCTTTGGAGAGCCACCATTATATGACGAAGATTGTTTCTGAATTGGTACCTTTACTTGCTTTTGAGAATAAGTATGTCAAACATCTTGTGGGCAATGTACTAACAGCAGTTACAAAATTTATATTTCTAACTGGAAACGCAAGCGACTGGTGTGAATTAGTACACTCGTTATGTTTTAGCATGGAGTTAGTGCTTGCCAGAATCATATCTTCTCCTGCACCTTCGATCACTGGGTCTGAAAATTTAGACTTTTATTTATCAATCTTACAACCTAAGCTGAAAAATGCCAATTTTTCCACGGTTGCTGGCCTTCTCCAAGTTTTGCGAAATACCTTGAAATTCTTGAAGCAAGAGCAGAGTGATCTTATTGGAGAGTTGTTTGATTCTGTTAATTCTTGTCTTTCAAAGATTCCTTGGGATTTATTAGGTAGGATCCTTACGGAGAAAATATGTAACATTGTAGAAGTCCAGAGCAACGATGATGCGTGTTCTGATAATTTGCATCAGAGGCAAGGATTAAAATTTCTGTTCCTAGGAAATTTTGTTCAATTTCTCTGTTCCTTGGCTGAGCCAAGTGACTTTGAGGAAGCTTCATGTGGTTCATTTAAGAGTCACCCCCTACTTGGCACAATCATCAACCTGATTCCTAACCTTTTTGATTGGTGCCTTAACAACCAAGTAGATCACTTTGATAGGTGCTTGTCCCGATATTTCAGTCACAAGCTGCTGATATTAATGATCAGGCTTAGTTTCCATTGCCATCTTCAGTGTTCCACTCTTGTTCTATGGTTGCAACTTTGCAGAAATTGTTTCCAAAACCTCTTGTTGCTTCCAAAGCTTGAGCTGGAATCTACCGCTGATACTTCCCTTGAAGATTCTCCATTGATTGTGAGCTATTTTGGTGACAAACGCAGTCCATGTTCCTTGCATCTGCGAAGACTAGCTGTTTTTCTATTCCTCAGGTGTTCCTTAAGCTTCATATGTAAACAACCAACTGAAAAGTGTGATCCGTCTATAGCTATAAAATCTCAGTTGATCTACACTACAACATTGGAAAGTAAATGTGATGACTGCACTTGTAGCAAGAAAGGTGTACTGGAGCTTTATAAGTGGCTTCTGGGGAACCTTCCAACAAATATTTTTCTGGATACTAATATGTATGCAAAAAACTGCACCAAGTTCGCATCATCATTTCTCCAGCTATATATGCACGAGGATGATTTATTATTCAAAGTGCTGTTGCAACTTCTTCGGCTGCCTTCTCATACAGAGCCATGTTCTAGTGAAGGACCATCCCAGGAGGTGAAGGAAGTTATACTTTTTCATGTTTCAAACATTTTTGATCCTCAGCACATGTTCCACATTTTTCTTAAGGAGTTAAACTATGATCATGAAATGCTCTTAGATTACCTCATGTCAAAAGATGCAGGAATATATTGTTTGGAATACCTACTAAGATGCCTGCATATAATAAATGATTCCAGGCATGCGCTAGGGGATTCATCAACAATTTTGGATATCTTAACTGACTCTTCTGGCAAGAGAAGAAAGGTTATGCTGAACAGCTCAACTATTTCAGAAGAGCGATTGTCTGGTTCACTCAATCAAAGCAATGAAACACTTCCATCCTTTGAGGACACTGGAAATTATGATTATGGCTACAAGCCTCAAAGAGTTGGGGTAGAATCCCTGAAAAAATCTAAAAATTGTTTGCACTCGTTGAAAACATCCTTGGAAAATCTTCACAGAGAGAATCT
CTTTCCATACAATCCCAAAGTGCTCATAAAACGGTATGCATGTCTGCACGATGCTTTATTGATTTGA
Protein sequence
MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGDSDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTKFIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFSTVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSNDDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLFDWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPKLELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSIAIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFLQLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKELNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVMLNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLENLHRENLFPYNPKVLIKRYACLHDALLI*
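As a quick consistency check between the coding sequence and the protein sequence listed above, the CDS can be translated with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the names `CODON_TABLE` and `translate` are illustrative, and only the first seven and last four codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence read in frame from its first base."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First seven codons and last four codons of the CDS shown above.
cds_start = "ATGACGCTTGGCGGTGAGAAG"
cds_end = "TTATTGATTTGA"

print(translate(cds_start))  # MTLGGEK
print(translate(cds_end))    # LLI*
```

The two fragments reproduce the start (MTLGGEK) and the end (LLI*) of the protein sequence; running the full CDS through the same function should give the complete polypeptide.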
Homology
BLAST of CsaV3_2G016900 vs. NCBI nr
Match:
XP_031736578.1 (uncharacterized protein LOC101211532 isoform X2 [Cucumis sativus] >KAE8651922.1 hypothetical protein Csa_006065 [Cucumis sativus])
HSP 1 Score: 1389.8 bits (3596), Expect = 0.0e+00
Identity = 688/688 (100.00%), Positives = 688/688 (100.00%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD
Sbjct: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK
Sbjct: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS
Sbjct: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI
Sbjct: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE
Sbjct: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
Query: 661 NLHRENLFPYNPKVLIKRYACLHDALLI 689
NLHRENLFPYNPKVLIKRYACLHDALLI
Sbjct: 661 NLHRENLFPYNPKVLIKRYACLHDALLI 688
BLAST of CsaV3_2G016900 vs. NCBI nr
Match:
XP_031736577.1 (uncharacterized protein LOC101211532 isoform X1 [Cucumis sativus])
HSP 1 Score: 1369.0 bits (3542), Expect = 0.0e+00
Identity = 678/678 (100.00%), Positives = 678/678 (100.00%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD
Sbjct: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK
Sbjct: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS
Sbjct: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI
Sbjct: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE
Sbjct: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
Query: 661 NLHRENLFPYNPKVLIKR 679
NLHRENLFPYNPKVLIKR
Sbjct: 661 NLHRENLFPYNPKVLIKR 678
BLAST of CsaV3_2G016900 vs. NCBI nr
Match:
XP_008457821.1 (PREDICTED: uncharacterized protein LOC103497413 isoform X2 [Cucumis melo])
HSP 1 Score: 1210.7 bits (3131), Expect = 0.0e+00
Identity = 615/688 (89.39%), Positives = 628/688 (91.28%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+LGGEK LRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRW REIDGD
Sbjct: 1 MSLGGEKRLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWAREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNE IQM NSHEK SDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLV NVLTAVTK
Sbjct: 61 SDNEAIQMKNSHEKSSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVANVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTG+ASDW ELVHSLCF MELVLARIISSPAPS GS+NL YLSIL PKLKNANFS
Sbjct: 121 FIFLTGSASDWYELVHSLCFGMELVLARIISSPAPSNAGSDNLHCYLSILLPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSD IGELFDSVNSCLSKIPWDLLGRILTEK CNIVE+QSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDFIGELFDSVNSCLSKIPWDLLGRILTEKSCNIVEIQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DD S+NLH+RQGLKFLFLGNFVQFLCSLAE SDFEEAS GSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDMRSNNLHRRQGLKFLFLGNFVQFLCSLAEQSDFEEASRGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLV+WLQLCR FQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVIWLQLCRKRFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELES++DTSLEDSPLIVSYFGDK SPCSLHLRRLAVFLFLRCSLSF CKQ TEKCDPS
Sbjct: 361 LELESSSDTSLEDSPLIVSYFGDKCSPCSLHLRRLAVFLFLRCSLSFTCKQTTEKCDPS- 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
T + DDCTCSKKG+LELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 ----------TFLATSDDCTCSKKGILELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLL+LPSH EPCS EGPSQEVKE ILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLQLPSHREPCSCEGPSQEVKEDILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAG CLEYLLRCLHIINDSRHA L DSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGTCCLEYLLRCLHIINDSRHA----------LVDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGS N+S ETLPSFEDTGN DYGYKPQRVGVESLKKSKNCLH LKTSLE
Sbjct: 601 LNSSTISEERLSGSPNRSKETLPSFEDTGNCDYGYKPQRVGVESLKKSKNCLHLLKTSLE 660
Query: 661 NLHRENLFPYNPKVLIKRYACLHDALLI 689
NLHRENLFPYNPKVLIKRYA LHDALLI
Sbjct: 661 NLHRENLFPYNPKVLIKRYAYLHDALLI 667
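The Identity and Positives percentages in an HSP header are counts divided by the alignment length (for this hit, 615/688 and 628/688). Below is a minimal sketch of that arithmetic, assuming two aligned strings of equal length with `-` as the gap character. The helper name `percent_identity` is hypothetical, and the short strings are the first ten residues of the first Query/Sbjct row of this hit. Positives are not recomputed here, since that would require the substitution matrix used for the search.

```python
def percent_identity(query, sbjct):
    """Return (identical columns, alignment length, percent identity)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    # A column counts as identical only if both residues match
    # and the column is not a gap.
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    length = len(query)
    return identical, length, round(100 * identical / length, 2)


# First ten aligned residues of the Query and Sbjct rows.
print(percent_identity("MTLGGEKCLR", "MSLGGEKRLR"))  # (8, 10, 80.0)

# The header figures follow from the reported counts.
print(round(100 * 615 / 688, 2))  # 89.39
print(round(100 * 628 / 688, 2))  # 91.28
```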
BLAST of CsaV3_2G016900 vs. NCBI nr
Match:
XP_008457820.1 (PREDICTED: uncharacterized protein LOC103497413 isoform X1 [Cucumis melo])
HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 606/678 (89.38%), Positives = 619/678 (91.30%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+LGGEK LRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRW REIDGD
Sbjct: 1 MSLGGEKRLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWAREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNE IQM NSHEK SDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLV NVLTAVTK
Sbjct: 61 SDNEAIQMKNSHEKSSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVANVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTG+ASDW ELVHSLCF MELVLARIISSPAPS GS+NL YLSIL PKLKNANFS
Sbjct: 121 FIFLTGSASDWYELVHSLCFGMELVLARIISSPAPSNAGSDNLHCYLSILLPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSD IGELFDSVNSCLSKIPWDLLGRILTEK CNIVE+QSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDFIGELFDSVNSCLSKIPWDLLGRILTEKSCNIVEIQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DD S+NLH+RQGLKFLFLGNFVQFLCSLAE SDFEEAS GSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDMRSNNLHRRQGLKFLFLGNFVQFLCSLAEQSDFEEASRGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLV+WLQLCR FQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVIWLQLCRKRFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELES++DTSLEDSPLIVSYFGDK SPCSLHLRRLAVFLFLRCSLSF CKQ TEKCDPS
Sbjct: 361 LELESSSDTSLEDSPLIVSYFGDKCSPCSLHLRRLAVFLFLRCSLSFTCKQTTEKCDPS- 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
T + DDCTCSKKG+LELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 ----------TFLATSDDCTCSKKGILELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLL+LPSH EPCS EGPSQEVKE ILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLQLPSHREPCSCEGPSQEVKEDILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAG CLEYLLRCLHIINDSRHA L DSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGTCCLEYLLRCLHIINDSRHA----------LVDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGS N+S ETLPSFEDTGN DYGYKPQRVGVESLKKSKNCLH LKTSLE
Sbjct: 601 LNSSTISEERLSGSPNRSKETLPSFEDTGNCDYGYKPQRVGVESLKKSKNCLHLLKTSLE 657
Query: 661 NLHRENLFPYNPKVLIKR 679
NLHRENLFPYNPKVLIKR
Sbjct: 661 NLHRENLFPYNPKVLIKR 657
BLAST of CsaV3_2G016900 vs. NCBI nr
Match:
XP_038901656.1 (uncharacterized protein LOC120088436 isoform X2 [Benincasa hispida])
HSP 1 Score: 1148.3 bits (2969), Expect = 0.0e+00
Identity = 579/682 (84.90%), Positives = 618/682 (90.62%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+ GGE+CL+L RLV HCL PFT EDGVVLESKGKEKELLIALSHVVTEVQRWV+EIDGD
Sbjct: 1 MSPGGERCLQLCRLVGHCLRPFT-EDGVVLESKGKEKELLIALSHVVTEVQRWVQEIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
S N+ IQM S EK SDE+DQS ESHH MTKIVSELVPLL+FEN+YVKHLVGNVLTA+TK
Sbjct: 61 SYNDAIQMLTSQEKSSDERDQSFESHHCMTKIVSELVPLLSFENQYVKHLVGNVLTAITK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQ----PKLKN 180
F+FLTG+ S+W ELVHSLCF MELVL RI+SSPAPSITGSENLD YLS L PKLKN
Sbjct: 121 FVFLTGSTSNWYELVHSLCFGMELVLDRIMSSPAPSITGSENLDCYLSTLSNTLLPKLKN 180
Query: 181 ANFSTVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVE 240
ANFSTVAG+LQVLRNTLK LKQEQSDL+G+ FDSVNSCLSKIPW+LLGRILTEK CN VE
Sbjct: 181 ANFSTVAGILQVLRNTLKSLKQEQSDLVGQFFDSVNSCLSKIPWNLLGRILTEKNCNSVE 240
Query: 241 VQSNDDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLI 300
VQ++DD +NLHQR+GLKFLFLGNFVQFLCSLAEPSDFEEASC S K+HPLLGT+INLI
Sbjct: 241 VQNDDDLRYNNLHQRKGLKFLFLGNFVQFLCSLAEPSDFEEASCDSLKTHPLLGTVINLI 300
Query: 301 PNLFDWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLL 360
PNL DWCLNNQVDHF+RCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRN FQ LL
Sbjct: 301 PNLLDWCLNNQVDHFNRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNRFQKLL 360
Query: 361 LLPKLELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKC 420
LLPKLELES TSLEDSPLIVSYFGDKRSPCSLHL+RLA+FLFLRCSLSFI +PTEK
Sbjct: 361 LLPKLELESAPGTSLEDSPLIVSYFGDKRSPCSLHLQRLAIFLFLRCSLSFIFTKPTEKY 420
Query: 421 DPSIAIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFA 480
D SIA+KSQLI+TTTLESKC DC+CSKKG+LELYKWL GNLPTNIFLDT MYAKNC KFA
Sbjct: 421 DASIALKSQLIFTTTLESKCHDCSCSKKGILELYKWLQGNLPTNIFLDTKMYAKNCIKFA 480
Query: 481 SSFLQLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHI 540
SSFLQLYMHEDDLLFKVLLQLLRLPSHT PCS EGPSQEVKE ILFHVSNIFDPQH+FH+
Sbjct: 481 SSFLQLYMHEDDLLFKVLLQLLRLPSHTGPCSCEGPSQEVKEDILFHVSNIFDPQHVFHV 540
Query: 541 FLKELNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKR 600
FLKELNYDHEMLLDYLMSKDAG YCLEYLLRCLHIINDSR+AL +SST DI T SS KR
Sbjct: 541 FLKELNYDHEMLLDYLMSKDAGTYCLEYLLRCLHIINDSRYALVNSSTEWDISTHSSCKR 600
Query: 601 RKVMLNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLK 660
RKV+LNSSTISEE SGS NQSNETLPSF DT N DYGYKPQRVGV+SLKKSKNCL SLK
Sbjct: 601 RKVLLNSSTISEELSSGSPNQSNETLPSFADTINCDYGYKPQRVGVKSLKKSKNCLQSLK 660
Query: 661 TSLENLHRENLFPYNPKVLIKR 679
TSLENLHRENLFPYNP+VL+KR
Sbjct: 661 TSLENLHRENLFPYNPEVLVKR 681
BLAST of CsaV3_2G016900 vs. ExPASy TrEMBL
Match:
A0A0A0LLZ2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G286460 PE=4 SV=1)
HSP 1 Score: 1369.0 bits (3542), Expect = 0.0e+00
Identity = 678/678 (100.00%), Positives = 678/678 (100.00%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD
Sbjct: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK
Sbjct: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS
Sbjct: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI
Sbjct: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE
Sbjct: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
Query: 661 NLHRENLFPYNPKVLIKR 679
NLHRENLFPYNPKVLIKR
Sbjct: 661 NLHRENLFPYNPKVLIKR 678
BLAST of CsaV3_2G016900 vs. ExPASy TrEMBL
Match:
A0A1S3C7P1 (uncharacterized protein LOC103497413 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103497413 PE=4 SV=1)
HSP 1 Score: 1210.7 bits (3131), Expect = 0.0e+00
Identity = 615/688 (89.39%), Positives = 628/688 (91.28%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+LGGEK LRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRW REIDGD
Sbjct: 1 MSLGGEKRLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWAREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNE IQM NSHEK SDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLV NVLTAVTK
Sbjct: 61 SDNEAIQMKNSHEKSSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVANVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTG+ASDW ELVHSLCF MELVLARIISSPAPS GS+NL YLSIL PKLKNANFS
Sbjct: 121 FIFLTGSASDWYELVHSLCFGMELVLARIISSPAPSNAGSDNLHCYLSILLPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSD IGELFDSVNSCLSKIPWDLLGRILTEK CNIVE+QSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDFIGELFDSVNSCLSKIPWDLLGRILTEKSCNIVEIQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DD S+NLH+RQGLKFLFLGNFVQFLCSLAE SDFEEAS GSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDMRSNNLHRRQGLKFLFLGNFVQFLCSLAEQSDFEEASRGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLV+WLQLCR FQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVIWLQLCRKRFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELES++DTSLEDSPLIVSYFGDK SPCSLHLRRLAVFLFLRCSLSF CKQ TEKCDPS
Sbjct: 361 LELESSSDTSLEDSPLIVSYFGDKCSPCSLHLRRLAVFLFLRCSLSFTCKQTTEKCDPS- 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
T + DDCTCSKKG+LELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 ----------TFLATSDDCTCSKKGILELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLL+LPSH EPCS EGPSQEVKE ILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLQLPSHREPCSCEGPSQEVKEDILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAG CLEYLLRCLHIINDSRHA L DSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGTCCLEYLLRCLHIINDSRHA----------LVDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGS N+S ETLPSFEDTGN DYGYKPQRVGVESLKKSKNCLH LKTSLE
Sbjct: 601 LNSSTISEERLSGSPNRSKETLPSFEDTGNCDYGYKPQRVGVESLKKSKNCLHLLKTSLE 660
Query: 661 NLHRENLFPYNPKVLIKRYACLHDALLI 689
NLHRENLFPYNPKVLIKRYA LHDALLI
Sbjct: 661 NLHRENLFPYNPKVLIKRYAYLHDALLI 667
BLAST of CsaV3_2G016900 vs. ExPASy TrEMBL
Match:
A0A1S3C723 (uncharacterized protein LOC103497413 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103497413 PE=4 SV=1)
HSP 1 Score: 1193.7 bits (3087), Expect = 0.0e+00
Identity = 606/678 (89.38%), Positives = 619/678 (91.30%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+LGGEK LRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRW REIDGD
Sbjct: 1 MSLGGEKRLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWAREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNE IQM NSHEK SDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLV NVLTAVTK
Sbjct: 61 SDNEAIQMKNSHEKSSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVANVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTG+ASDW ELVHSLCF MELVLARIISSPAPS GS+NL YLSIL PKLKNANFS
Sbjct: 121 FIFLTGSASDWYELVHSLCFGMELVLARIISSPAPSNAGSDNLHCYLSILLPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSD IGELFDSVNSCLSKIPWDLLGRILTEK CNIVE+QSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDFIGELFDSVNSCLSKIPWDLLGRILTEKSCNIVEIQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DD S+NLH+RQGLKFLFLGNFVQFLCSLAE SDFEEAS GSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDMRSNNLHRRQGLKFLFLGNFVQFLCSLAEQSDFEEASRGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLV+WLQLCR FQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVIWLQLCRKRFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELES++DTSLEDSPLIVSYFGDK SPCSLHLRRLAVFLFLRCSLSF CKQ TEKCDPS
Sbjct: 361 LELESSSDTSLEDSPLIVSYFGDKCSPCSLHLRRLAVFLFLRCSLSFTCKQTTEKCDPS- 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
T + DDCTCSKKG+LELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 ----------TFLATSDDCTCSKKGILELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLL+LPSH EPCS EGPSQEVKE ILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLQLPSHREPCSCEGPSQEVKEDILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAG CLEYLLRCLHIINDSRHA L DSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGTCCLEYLLRCLHIINDSRHA----------LVDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGS N+S ETLPSFEDTGN DYGYKPQRVGVESLKKSKNCLH LKTSLE
Sbjct: 601 LNSSTISEERLSGSPNRSKETLPSFEDTGNCDYGYKPQRVGVESLKKSKNCLHLLKTSLE 657
Query: 661 NLHRENLFPYNPKVLIKR 679
NLHRENLFPYNPKVLIKR
Sbjct: 661 NLHRENLFPYNPKVLIKR 657
BLAST of CsaV3_2G016900 vs. ExPASy TrEMBL
Match:
A0A1S3C5Z0 (uncharacterized protein LOC103497413 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103497413 PE=4 SV=1)
HSP 1 Score: 1125.5 bits (2910), Expect = 0.0e+00
Identity = 578/678 (85.25%), Positives = 591/678 (87.17%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+LGGEK LRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRW REIDGD
Sbjct: 1 MSLGGEKRLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWAREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNE IQM NSHEK SDEQDQSLESHHYMTKIVSEL
Sbjct: 61 SDNEAIQMKNSHEKSSDEQDQSLESHHYMTKIVSEL------------------------ 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
G+ASDW ELVHSLCF MELVLARIISSPAPS GS+NL YLSIL PKLKNANFS
Sbjct: 121 -----GSASDWYELVHSLCFGMELVLARIISSPAPSNAGSDNLHCYLSILLPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSD IGELFDSVNSCLSKIPWDLLGRILTEK CNIVE+QSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDFIGELFDSVNSCLSKIPWDLLGRILTEKSCNIVEIQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DD S+NLH+RQGLKFLFLGNFVQFLCSLAE SDFEEAS GSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDMRSNNLHRRQGLKFLFLGNFVQFLCSLAEQSDFEEASRGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLV+WLQLCR FQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVIWLQLCRKRFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELES++DTSLEDSPLIVSYFGDK SPCSLHLRRLAVFLFLRCSLSF CKQ TEKCDPS
Sbjct: 361 LELESSSDTSLEDSPLIVSYFGDKCSPCSLHLRRLAVFLFLRCSLSFTCKQTTEKCDPS- 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
T + DDCTCSKKG+LELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 ----------TFLATSDDCTCSKKGILELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLL+LPSH EPCS EGPSQEVKE ILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLQLPSHREPCSCEGPSQEVKEDILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAG CLEYLLRCLHIINDSRHA L DSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGTCCLEYLLRCLHIINDSRHA----------LVDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESLKKSKNCLHSLKTSLE 660
LNSSTISEERLSGS N+S ETLPSFEDTGN DYGYKPQRVGVESLKKSKNCLH LKTSLE
Sbjct: 601 LNSSTISEERLSGSPNRSKETLPSFEDTGNCDYGYKPQRVGVESLKKSKNCLHLLKTSLE 628
Query: 661 NLHRENLFPYNPKVLIKR 679
NLHRENLFPYNPKVLIKR
Sbjct: 661 NLHRENLFPYNPKVLIKR 628
BLAST of CsaV3_2G016900 vs. ExPASy TrEMBL
Match:
A0A5A7TVC1 (Golgin candidate 6 isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold243G003300 PE=4 SV=1)
HSP 1 Score: 1095.9 bits (2833), Expect = 0.0e+00
Identity = 561/641 (87.52%), Positives = 574/641 (89.55%), Query Frame = 0
Query: 1 MTLGGEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGD 60
M+LGGEK LRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRW REIDGD
Sbjct: 1 MSLGGEKRLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWAREIDGD 60
Query: 61 SDNEGIQMPNSHEKRSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVGNVLTAVTK 120
SDNE IQM NSHEK SDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLV NVLTAVTK
Sbjct: 61 SDNEAIQMKNSHEKSSDEQDQSLESHHYMTKIVSELVPLLAFENKYVKHLVANVLTAVTK 120
Query: 121 FIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSITGSENLDFYLSILQPKLKNANFS 180
FIFLTG+ASDW ELVHSLCF MELVLARIISSPAPS GS+NL YLSIL PKLKNANFS
Sbjct: 121 FIFLTGSASDWYELVHSLCFGMELVLARIISSPAPSNAGSDNLHCYLSILLPKLKNANFS 180
Query: 181 TVAGLLQVLRNTLKFLKQEQSDLIGELFDSVNSCLSKIPWDLLGRILTEKICNIVEVQSN 240
TVAGLLQVLRNTLKFLKQEQSD IGELFDSVNSCLSKIPWDLLGRILTEK CNIVE+QSN
Sbjct: 181 TVAGLLQVLRNTLKFLKQEQSDFIGELFDSVNSCLSKIPWDLLGRILTEKSCNIVEIQSN 240
Query: 241 DDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLLGTIINLIPNLF 300
DD S+NLH+RQGLKFLFLGNFVQFLCSLAE SDFEEAS GSFKSHPLLGTIINLIPNLF
Sbjct: 241 DDMRSNNLHRRQGLKFLFLGNFVQFLCSLAEQSDFEEASRGSFKSHPLLGTIINLIPNLF 300
Query: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCRNCFQNLLLLPK 360
DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLV+WLQLCR FQNLLLLPK
Sbjct: 301 DWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVIWLQLCRKRFQNLLLLPK 360
Query: 361 LELESTADTSLEDSPLIVSYFGDKRSPCSLHLRRLAVFLFLRCSLSFICKQPTEKCDPSI 420
LELES++DTSLEDSPLIVSYFGDK SPCSLHLRRLAVFLFLRCSLSF CKQ TEKCDPS
Sbjct: 361 LELESSSDTSLEDSPLIVSYFGDKCSPCSLHLRRLAVFLFLRCSLSFTCKQTTEKCDPS- 420
Query: 421 AIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
T + DDCTCSKKG+LELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL
Sbjct: 421 ----------TFLATSDDCTCSKKGILELYKWLLGNLPTNIFLDTNMYAKNCTKFASSFL 480
Query: 481 QLYMHEDDLLFKVLLQLLRLPSHTEPCSSEGPSQEVKEVILFHVSNIFDPQHMFHIFLKE 540
QLYMHEDDLLFKVLLQLL+LPSH EPCS EGPSQEVKE ILFHVSNIFDPQHMFHIFLKE
Sbjct: 481 QLYMHEDDLLFKVLLQLLQLPSHREPCSCEGPSQEVKEDILFHVSNIFDPQHMFHIFLKE 540
Query: 541 LNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTILDILTDSSGKRRKVM 600
LNYDHEMLLDYLMSKDAG CLEYLL RHA L DSSGKRRKVM
Sbjct: 541 LNYDHEMLLDYLMSKDAGTCCLEYLL---------RHA----------LVDSSGKRRKVM 600
Query: 601 LNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVG 642
LNSSTISEERLSGS N+S ETLPSFEDTGN DYGYKPQRVG
Sbjct: 601 LNSSTISEERLSGSPNRSKETLPSFEDTGNCDYGYKPQRVG 611
BLAST of CsaV3_2G016900 vs. TAIR 10
Match:
AT3G50430.1 (unknown protein; Has 54 Blast hits to 54 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 29; Fungi - 0; Plants - 24; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink). )
HSP 1 Score: 379.4 bits (973), Expect = 6.1e-105
Identity = 254/702 (36.18%), Positives = 380/702 (54.13%), Query Frame = 0
Query: 5 GEKCLRLSRLVDHCLHPFTEEDGVVLESKGKEKELLIALSHVVTEVQRWVREIDGDSDNE 64
G RL L+D C+ F E + V L +K EK+LL++LS V+ E+Q W EI D E
Sbjct: 4 GNLASRLHLLIDLCVRQFPEREEVTLPTKETEKDLLLSLSQVLREIQSWRSEISCDKAVE 63
Query: 65 GIQMPNSHEKRSDEQDQSLES--------HHYMTKIVSELVPLLAFENKYVKHLVGNVLT 124
S E+ + D+S ++ + + ++V++LV LL EN +VKHL GN+L
Sbjct: 64 ------SREETVVDHDESADALYGPDSIEYLCLERLVADLVCLLGMENVHVKHLAGNILV 123
Query: 125 AVTKFIFLTGNASDWCELVHSLCFSMELVLARIISSPAPSI---TGSENLD---FYLSIL 184
V+ + +G S W E + LC + LA I S P P++ TG +LD F +L
Sbjct: 124 EVSGCLVESG--SQWDEFIRLLCECLR--LAVIYSFPIPAVGSETGFGSLDQCFFGSDVL 183
Query: 185 QPKLKNANFSTVAGLLQVLRNTLKFLKQEQSDLIGELF-DSVNSCLSKIPWDLLGRILTE 244
+ KL+ AN+STV+ + +VLRN LK L QE ++ I +++ +SVNS L+K+PW L I +
Sbjct: 184 KCKLEKANWSTVSDIFRVLRNILKRLSQEDNEEIFDVYLESVNSTLAKVPWCRLDTIFSH 243
Query: 245 KICNIVEVQSNDDACSDNLHQRQGLKFLFLGNFVQFLCSLAEPSDFEEASCGSFKSHPLL 304
+ + + N S N + +FLG+FVQFLCS+ + E S S+ +L
Sbjct: 244 QHGS---GERNFQGQSGNSEEAT----VFLGSFVQFLCSMVQQVHVVEDSDDFEPSYLIL 303
Query: 305 GTIINLIPNLFDWCLNNQVDHFDRCLSRYFSHKLLILMIRLSFHCHLQCSTLVLWLQLCR 364
I LIP+L WC C+SRY HKLL+LMIRL+ ++C+ L+ WLQ +
Sbjct: 304 QKTIKLIPDLLRWCQPKLKSQSGSCMSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQ 363
Query: 365 NCFQNLLLLPKLELESTADTSLEDSPLIVSYFG-DKRSPCSLHLRRLAVFLFLRCSLSFI 424
Q L + + D LE SP VS + S HL+RL+VFLFLRCS +
Sbjct: 364 RDSQGFLQHTLTKFKPVQDNCLEGSPFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFT-- 423
Query: 425 CKQPTEKCDPSIAIKSQLIYTTTLESKCDDCTCSKKGVLELYKWLLGNLPTNIFLDTNMY 484
LIY++ K + C KKG+ E++KW+ +P N+F D +Y
Sbjct: 424 -----------------LIYSSRHNDKLCEFDCRKKGMAEMFKWIERQIPGNMFSDHRIY 483
Query: 485 AKNCTKFASSFLQLYMHEDDLLFKVLLQLLRLPSHTE--PCSSEGPSQEVKEVILFHVSN 544
+K +F++SF++L+MHEDDLLFKVLLQLL +P H + P G ++ +++ LF +S
Sbjct: 484 SKKNVEFSASFVRLFMHEDDLLFKVLLQLLSVPLHRQELPNVEGGSLEDEEQITLFRLST 543
Query: 545 IFDPQHMFHIFLKELNYDHEMLLDYLMSKDAGIYCLEYLLRCLHIINDSRHALGDSSTIL 604
+F+P +F IFL EL+YDH++LLDYL+SKD G C EYLLRCL + DS +
Sbjct: 544 LFNPVRLFCIFLSELHYDHQVLLDYLISKDIGASCAEYLLRCLRAVCDSWTLFVEFP--F 603
Query: 605 DILTDS-SGKRRKVMLNSSTISEERLSGSLNQSNETLPSFEDTGNYDYGYKPQRVGVESL 664
+ TD+ S KRRKV+ +S + + R+ ++
Sbjct: 604 EGSTDAPSPKRRKVLPETSEVEQN----------------------------WRLHAQAF 639
Query: 665 KKSKNCLHSLKTSLENLHRENLFPYNPKVLIKRYACLHDALL 688
+ +K+CL SL+ S+ LH++ LFPYNP+ L++R + H+ L
Sbjct: 664 EDAKDCLLSLQNSVVKLHQKKLFPYNPEALLRRLSRFHELCL 639
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_031736578.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC101211532 isoform X2 [Cucumis sativus] >KAE8651922.1 ...
XP_031736577.1 | 0.0e+00 | 100.00 | uncharacterized protein LOC101211532 isoform X1 [Cucumis sativus]
XP_008457821.1 | 0.0e+00 | 89.39 | PREDICTED: uncharacterized protein LOC103497413 isoform X2 [Cucumis melo]
XP_008457820.1 | 0.0e+00 | 89.38 | PREDICTED: uncharacterized protein LOC103497413 isoform X1 [Cucumis melo]
XP_038901656.1 | 0.0e+00 | 84.90 | uncharacterized protein LOC120088436 isoform X2 [Benincasa hispida]
Match Name | E-value | Identity (%) | Description
A0A0A0LLZ2 | 0.0e+00 | 100.00 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G286460 PE=4 SV=1
A0A1S3C7P1 | 0.0e+00 | 89.39 | uncharacterized protein LOC103497413 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3C723 | 0.0e+00 | 89.38 | uncharacterized protein LOC103497413 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10...
A0A1S3C5Z0 | 0.0e+00 | 85.25 | uncharacterized protein LOC103497413 isoform X3 OS=Cucumis melo OX=3656 GN=LOC10...
A0A5A7TVC1 | 0.0e+00 | 87.52 | Golgin candidate 6 isoform 2 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_sca...
Match Name | E-value | Identity (%) | Description
AT3G50430.1 | 6.1e-105 | 36.18 | unknown protein; Has 54 Blast hits to 54 proteins in 22 species: Archae - 0; Bac...
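The Identity column in these tables is the percentage reported on each HSP summary line, that is, the number of identical positions divided by the alignment length. As a rough illustration only (the function name `parse_hsp_stats` and the returned field names are hypothetical and not part of this database or of BLAST), a summary line could be parsed and the percentages recomputed like this:

```python
import re

# Matches an HSP summary line such as
#   "Identity = 615/688 (89.39%), Positives = 628/688 (91.28%), Query Frame = 0"
# The optional "i" also accepts the label when it is spelled "Postives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages for one HSP summary line.

    Hypothetical helper for illustration; the field names are assumptions.
    """
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, length = int(match.group(1)), int(match.group(2))
    positives = int(match.group(3))
    return {
        "identical": identical,
        "positives": positives,
        "aligned_length": length,
        # Percentages are counts over the alignment length (gaps included).
        "identity_pct": round(100 * identical / length, 2),
        "positives_pct": round(100 * positives / length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 254/702 (36.18%), Positives = 380/702 (54.13%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Recomputing the percentage from the raw counts gives a quick consistency check between an alignment block and its row in the summary table.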