Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
AAAAAAAACTCTCACTCTCTCTCTCCCTTTCACCATTACTACTTCTCACTCTCTTTCTCTCCCTTTCACCAGAGAGAAATCTCTTCACACTTCACTCCTTACTTCATCATCTTTCTCCTCTGTCTCTTCTCCTTTCAATTTGACACCGTTTCCCAATTTTAACCCCAACCCACTTCCCAAAATTTCATCTTTTTTACTTGGGTCTCTCATGGCTGATCCCACCTCCACTTTCTCAATCAGGTGAGTTCACAATGTCTTATCTTACTAATTAAGCTATATATGTTAAGAAAAAGGGGAAAAAAATATATAGAGAGTTCACATCTTACCAATGATCAAATTTCTCTGCTGTTTATTTATTTATTTTCTTTGTTTGTGGATTATATAACGGTTTTTGTTTAATTTTGGGTAGAGAGTATGCTTTGAATAAAAGAAGCATGGGTTTAACGACAATTAGTTGGCCATTTAGTGAGAAGGTGAAGAAAGAAGTGGCAGAATCTCTACTTCCACCAATGGATGTAAAGAAATTCCGTTGGTGGTCGTCGTTATGGTTGTCGTCACAGGAAGAAGAAGAAGGAGAAGAAGGAGAAGAAAAAGAAGTAATTACAGAGAGAATTAAAATGCAGAAGATTTGTCCGGTCTGCGGGGTTTTTGTTGCAGCCACGGTAGCAGCGGTAAATGCACATATTGATACTTGTTTAGCTCAAACAACAAGCAAGGAAATTAGGAGAAAGAACTATTTGAAGGCAAAATCAAGAACCCCAAAAAAGAGATCAATAGCCGAAATATTTGCAGTGGCTCCGCCAGTAAAAACAATGATTGTTGTTAATGATTGTTGTGAAGACGAAGAAGAAAAAAAGGCCGTTGGGAAACAAATTATTCATCACAACAAGAACCTCAAAACGACATCGTTGGCTACAAGTCTTGTCTCTGCAATCAAGACAATCAAAAACAAAATCGCTACGACAACAGAAGAACCAACAATTCTTGCCAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAATAAGGTAATTTTAAAATTTTTAATTTAGAGTAGATTTAATAGACTGCAAATATTCTCTCAATTGGGTGTGTGGGGTTTTTTTTTTTGGTAGCTCCTTTCTCTTTCTGCTCATAATTACTGCTGGAAAAAGCTGGTGAAAAGACTGCATTACGTTGGAGAGAAAAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAAAGAAAAACAGTACTTGTTTTTTGTGTGATTAAGAGCGGGGAAATAGGCTGATTCACAATTGGAAAACAAAAAAAAGGTCATTACTCACTACAAGACGAGTTTATATCGTTTGTGACGGCTTCATTTGAGGTGATTATGTGTATGTATTGGATGTATACATGTATGTATATATTATAATGCTCCAGCTGCAAGAGTACGGGCTTGCCAGTGAGTGACTTTATTTGGTGGAAGAGTTTTTGTCCTTTGTTTAACAAATTTATGCTACGTTAATGATGCTGTTTTTAACCTGTCAAATTATAATTTCTTACTTGAAACATGATTTTGGATTTGTATAGTAATAGTAAAGACGAGTGTTCCCGATCGTACCTCGTTTGGGGTTTTAGGGTTTATGATTTTAATGAAATAAATTTTGAGGGAGTTCGTATATATTTGAGTTTGTTAAAGTGGGTTTCGTTTTACTATTATATATCATCTTCCATTTTGATGTACTTGGATTTTGACAACTTTTTGACCAAATTTGCTACCCACCAACAGTTTTGCAACGCATTATCATATGGGAATTATTTTACAAGTTACGTTTGAGTTTTGCTTAAATGATCAAGTACTGAACCGCAAAACTTCAATCTTATTCTTTTTCTCCAATAATCTTTATGAAATGAGAGTTATAGTTCGATAGTTTTTGAGTTATTTTATGATGTCTTTCTGCTGATTAATTAGTAGAAGGCATTGAAGATACAATTAGTGGAATGCTGTGAATGAAAGAGAGAGTTGAAAGGGTAATCTCTCTTTTAGGATATAGTACATTTCATTATTATTTAAGTAGTTAGTTTTAGGGGTCCTCCCTTATCTATGACTAGTCTGCCTTTGTTCTTTAACAAATTGTCTTATTTGTTATATATGCCTTTTGTCTTATTGATGTAAGTTTTTGACCATGGAGAATAGAAGAATATATATCTATATATGATGAGGTTTTGTAGGTGAAGATTGCTAGGATAGATGATATCATTATTATATAATAATAACCAAATTAGTAGCAATATTTTCTCTACCATATAGCTAATAGACCATTCTTTATGATAGGATTTGAAATTTTATGTGTTCAAGATTTAGAGGGTTGAGGTGCTGCAATAACCCACTATAAATTCTCATTTTAAATTAAATAAATTTGATAGCTCTATGATCCATAACACATAGCTAGCCTTCTTAATTAGTTGACCAATCTTTGGAGTTTAATTCTCTCTCCCATAATGATGCATTATGTAATATATATATTAGTTCAATAACAAATAAGCTTAATCACTGAAACTGAAATATTTGCATATTTAACATTCAGTTTGGTTAGGTTTTTGAAAGCTGCCTCTTTAAGCAATAATATTTGGCTTGCTTATGCTTTAATTTTTTGTGGTTGTTCTTTTAACTAGGAATATAATTAGATTGTGTAAAAGCTACAACAGGTTATTATTTATCACAAGTATGATGGTGAAAATGACGACAAATACATCACATGCTGTAGTGTTTAGCCAGTTTTGAAAAATTATAATGAATGTTCTTTTTCCCATTTTGCTTATATGTATGTGTACGATCATATTCTCATTCTTAGAGATTAATTTGATCAACGTCTTGTTTCTTCTTTCATTTCATGTATTATAACAAAATCATTTTCAAGTTTAGTATCGAAGCCATTGGCTTCCCATGTCAACTGATTTGATCAGTGTTCCTTTGGCTTCCTTTAAGATATTGATGCTTTATAAACCTAGTTTCACCCTTATATTTGTGTAAGGAAACACACACTCAATAATATCTATGCACGTTTAACGATATTCATCAATATTAAATTGAGACTTGTTTAGATTGACTTTCTAAAATGAGTAAAAACACTTTTGTAAGCACTAAAAACCCAATTCGAACAAATTGTTTGATGTAATATGAAGGACATATATTGCTTGTAATCTTTATATTTACTTCTAATAATTCGTTTCTGAAATAATTTTGTATTGGTTATAGGACTTTTGTCATGGGAAACTTTGCAAGAAGGGAGATATCAGAAATCACAAGGATGTTTCTACTTTTTGTAAGAGACGGCCTTGTTTTAAACGATTGTCTAAACAAAAAAAGAAAAAACTAGCTAAAAAATCCACTGTGGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTTATCAATTTAAAAGGCAGCAATCAAGCTTTCAACAATGGCGGTCAAAAGTCCGATAGGCGTGTTAGCTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACCAGAACCATTTCTGATACATTTGAACAAAATGTTGGCAATCCATTTCAAGCTTCAGAAGTAAGTACTAATTCAGGTGAAAGTAATAAAGAAGTTCCTTCAATGGAGGCCAATTTAAACGATGATGTTGATTGCTTTAACAGTACCCGACACAAAGTCGATAGTCAACATGTCAAAGGAAAGATTCAATTGCCTAATTTTCATAATCAGGTCAATGCTCAAAGTTGGGAAAATCCGAAGCATTCAACCGAGAAGTTGATATTGGAAAGTCGAGATATTCCTCATGATCGAAATGATTTGCATTTGTTTGATCATGTCTATGTAGATGCACATCAGAAGCTGCCACCAGAACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCCATATGGCCATGTAAGAACTCAATGTGGTTTAAATGTTGTCCCACAAGCTCATTCTCTTTATGGAAAATCAGTTGATCATTTGATAAATAATAATAACCATTTCAATGGAGTAGCTGCCTTAGGCTCAGTTACGAGCAGAGTGCCTTCTTCTTCTTTAACTGAAAATCCTGTTAGCAGATTTTTAAATCTGGCTGAATCTTCTGCTAGAGACAGTAATAGATTTCAAATTTCGAATGGGGAACAAAGTGTGGTTACTTACAAAGAGAAAGGGGTAAATGATGGATTTTTCTGTCTGCCATTGAACTCAAGGGGTGAACTGATACAGCTAAATTCAGGTTTGACTGATAGGTTTGACCAAATGAATGAAGCTAGTACCACTATAGCCGGTTCTAGCAGGATACCAGTATGCAATTTCGTCGTGCCAAGGAGCAGGGATTATTTCGTCGACAATGAGAAGCTCTTTCTTGACACGAAACTTACAGGAAACCAGTTGACTTTATTTCCATTGCATAGTCATATGCAAGAAAATCAAAATCGATATTTGCCAGCTGGTTTCGACGTCCCCGAGCCTGGAACTTCAGAAACAGCTGATATTAGACTGATGAATTCAGAAAGGGGAACTGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTACGAAAAGTTTCAGAACCAAAATGTAAGTGCACAGTTTTATCCAGAAAATTCAAGTAGCATGTGTGCGAATCCTGGTCGACAAACAATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGAAAAGATGTCCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCACACTTAATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATTATCCATCAAGAGGAGAAACCTTGTTTTATCATCCTGCAGGCTTTCATGGTAATCAAGTGGCACAAGGCAATTTATTGGCAAATGCTCCACAAGCTGTTAGGTACCCACATCCGTGTACCAATCGAAAAAGCAGTTTATTGTATCCAAGACCTGAATCTGTCATCAACTTAAATGAAAGGTTCAACAACATCCATTCTTTTCCTACCTCGTCGACTGACACGCTTAATATGGCACGAAACTTTCAAGCACCCTTTGTTTCTGGTCTGGAAACACAAAGGTTTTGTTCACAGCCGTCAGCATTTTCTACTTCTCACCACGTGTGTCCAAATAGATATGAAAATTCTTTTGAACTTGGCTTTAACCAGAGTTTACATCCAGCAAAATTAGGAACCTTTAACTTCCCTTTCTTGCAGCCAGATGATGGAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAAGAGTCTTTCCCCATGGATATTACACGATCACCAACGGGAAGTGCCACCAACTGCAAATTCTAAACTCGCTGACGTAAATGGATACTATTGTCCATGTACTCCTGGCACAGATGTTCTCATTAGCCCATCTTCGATTCATCACCAGCTCGAAACTGCCTATCCTTGTAGTACAATGGCATATTCTCACTTACAGACGAAGAACCATATACCTGGCTCGACATCTTTGTTTCAACCAATTCCCATTGCTCCACGAGTACTTCATTCACCAATTGCCAACGCAGGCCATGAAATTAGAATGAGATCTGAGGATAGATTGAAGTTCAACTCTTTGAGTGTCAAGAACTCTGATTTTTCAAGTAAAAAACAACTGGCCGAAGAGTTTGTTGATTCGAGGAAGCGTCAAAAGACATTAAGTTTAGAAACGAACAATTCTGGTGTTGTGCCAGAATGGACAAGAGGAAAATACAGTGATGATCACCTGAAATCTAACCCCGGGACGGTGAAAATTCATGCTAACTGGGACAAAGCTGTTAACTCAGTAGGAAATATCCCAAATATGACTCAAACTACTGATGGAATAGTGATATCTGCCAACAATAATGAAGCTCATAGGGTTGAATGTATGGCAAGATCAGGCCCCATCAAGTTGACAGCAGGAGCAAAACACATCTTGAAACCAAGTCAGAGTATGGATGTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGATAGCTTGGCAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTTTAAAAGTAAGTATAATACTTCGTCATGTTCTGTAATTCTTTGCTATATCTAATGCTAATCTAATCATATTGTGCTGATATGATTAAGCAAATGATTTTATCTACATAACAGGAAGGAACTTAGCTCTAATCCATCCATGATGCAATCATCTCATCTTTGTAATCACTGACATGACAGAGTTGTTGTAATTTTAAGATGACCCTGTCATTCACCAAGCAAGTTTTTCGGCATGCGGAGGCATGTTATCTTAAACCGAAACTAATCATGCTGCTACAAGTTTTGACCTTGTTTCTTATTGTAAGTTCTAGATCAAATCTCCACTACAAAATCTCTCGTAGGCGATAAAATTACACAAGTGTTAAGATTCACTGTTTAAAAACGTCTTGTTAGTTCCTGTAATGAATGCCCGAGAGCACGGAGTTGTATTTATTTCCTTGAGGGCTAAGTTTAGTTAATATAGCTAACATTATATGTATTTAACATGAT
mRNA sequence
AAAAAAAACTCTCACTCTCTCTCTCCCTTTCACCATTACTACTTCTCACTCTCTTTCTCTCCCTTTCACCAGAGAGAAATCTCTTCACACTTCACTCCTTACTTCATCATCTTTCTCCTCTGTCTCTTCTCCTTTCAATTTGACACCGTTTCCCAATTTTAACCCCAACCCACTTCCCAAAATTTCATCTTTTTTACTTGGGTCTCTCATGGCTGATCCCACCTCCACTTTCTCAATCAGAGAGTATGCTTTGAATAAAAGAAGCATGGGTTTAACGACAATTAGTTGGCCATTTAGTGAGAAGGTGAAGAAAGAAGTGGCAGAATCTCTACTTCCACCAATGGATGTAAAGAAATTCCGTTGGTGGTCGTCGTTATGGTTGTCGTCACAGGAAGAAGAAGAAGGAGAAGAAGGAGAAGAAAAAGAAGTAATTACAGAGAGAATTAAAATGCAGAAGATTTGTCCGGTCTGCGGGGTTTTTGTTGCAGCCACGGTAGCAGCGGTAAATGCACATATTGATACTTGTTTAGCTCAAACAACAAGCAAGGAAATTAGGAGAAAGAACTATTTGAAGGCAAAATCAAGAACCCCAAAAAAGAGATCAATAGCCGAAATATTTGCAGTGGCTCCGCCAGTAAAAACAATGATTGTTGTTAATGATTGTTGTGAAGACGAAGAAGAAAAAAAGGCCGTTGGGAAACAAATTATTCATCACAACAAGAACCTCAAAACGACATCGTTGGCTACAAGTCTTGTCTCTGCAATCAAGACAATCAAAAACAAAATCGCTACGACAACAGAAGAACCAACAATTCTTGCCAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAATAAGGACTTTTGTCATGGGAAACTTTGCAAGAAGGGAGATATCAGAAATCACAAGGATGTTTCTACTTTTTGTAAGAGACGGCCTTGTTTTAAACGATTGTCTAAACAAAAAAAGAAAAAACTAGCTAAAAAATCCACTGTGGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTTATCAATTTAAAAGGCAGCAATCAAGCTTTCAACAATGGCGGTCAAAAGTCCGATAGGCGTGTTAGCTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACCAGAACCATTTCTGATACATTTGAACAAAATGTTGGCAATCCATTTCAAGCTTCAGAAGTAAGTACTAATTCAGGTGAAAGTAATAAAGAAGTTCCTTCAATGGAGGCCAATTTAAACGATGATGTTGATTGCTTTAACAGTACCCGACACAAAGTCGATAGTCAACATGTCAAAGGAAAGATTCAATTGCCTAATTTTCATAATCAGGTCAATGCTCAAAGTTGGGAAAATCCGAAGCATTCAACCGAGAAGTTGATATTGGAAAGTCGAGATATTCCTCATGATCGAAATGATTTGCATTTGTTTGATCATGTCTATGTAGATGCACATCAGAAGCTGCCACCAGAACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCCATATGGCCATGTAAGAACTCAATGTGGTTTAAATGTTGTCCCACAAGCTCATTCTCTTTATGGAAAATCAGTTGATCATTTGATAAATAATAATAACCATTTCAATGGAGTAGCTGCCTTAGGCTCAGTTACGAGCAGAGTGCCTTCTTCTTCTTTAACTGAAAATCCTGTTAGCAGATTTTTAAATCTGGCTGAATCTTCTGCTAGAGACAGTAATAGATTTCAAATTTCGAATGGGGAACAAAGTGTGGTTACTTACAAAGAGAAAGGGGTAAATGATGGATTTTTCTGTCTGCCATTGAACTCAAGGGGTGAACTGATACAGCTAAATTCAGGTTTGACTGATAGGTTTGACCAAATGAATGAAGCTAGTACCACTATAGCCGGTTCTAGCAGGATACCAGTATGCAATTTCGTCGTGCCAAGGAGCAGGGATTATTTCGTCGACAATGAGAAGCTCTTTCTTGACACGAAACTTACAGGAAACCAGTTGACTTTATTTCCATTGCATAGTCATATGCAAGAAAATCAAAATCGATATTTGCCAGCTGGTTTCGACGTCCCCGAGCCTGGAACTTCAGAAACAGCTGATATTAGACTGATGAATTCAGAAAGGGGAACTGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTACGAAAAGTTTCAGAACCAAAATGTAAGTGCACAGTTTTATCCAGAAAATTCAAGTAGCATGTGTGCGAATCCTGGTCGACAAACAATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGAAAAGATGTCCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCACACTTAATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATTATCCATCAAGAGGAGAAACCTTGTTTTATCATCCTGCAGGCTTTCATGGTAATCAAGTGGCACAAGGCAATTTATTGGCAAATGCTCCACAAGCTGTTAGGTACCCACATCCGTGTACCAATCGAAAAAGCAGTTTATTGTATCCAAGACCTGAATCTGTCATCAACTTAAATGAAAGGTTCAACAACATCCATTCTTTTCCTACCTCGTCGACTGACACGCTTAATATGGCACGAAACTTTCAAGCACCCTTTGTTTCTGGTCTGGAAACACAAAGGTTTTGTTCACAGCCGTCAGCATTTTCTACTTCTCACCACGTGTGTCCAAATAGATATGAAAATTCTTTTGAACTTGGCTTTAACCAGAGTTTACATCCAGCAAAATTAGGAACCTTTAACTTCCCTTTCTTGCAGCCAGATGATGGAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAAGAGTCTTTCCCCATGGATATTACACGATCACCAACGGGAAGTGCCACCAACTGCAAATTCTAAACTCGCTGACGTAAATGGATACTATTGTCCATGTACTCCTGGCACAGATGTTCTCATTAGCCCATCTTCGATTCATCACCAGCTCGAAACTGCCTATCCTTGTAGTACAATGGCATATTCTCACTTACAGACGAAGAACCATATACCTGGCTCGACATCTTTGTTTCAACCAATTCCCATTGCTCCACGAGTACTTCATTCACCAATTGCCAACGCAGGCCATGAAATTAGAATGAGATCTGAGGATAGATTGAAGTTCAACTCTTTGAGTGTCAAGAACTCTGATTTTTCAAGTAAAAAACAACTGGCCGAAGAGTTTGTTGATTCGAGGAAGCGTCAAAAGACATTAAGTTTAGAAACGAACAATTCTGGTGTTGTGCCAGAATGGACAAGAGGAAAATACAGTGATGATCACCTGAAATCTAACCCCGGGACGGTGAAAATTCATGCTAACTGGGACAAAGCTGTTAACTCAGTAGGAAATATCCCAAATATGACTCAAACTACTGATGGAATAGTGATATCTGCCAACAATAATGAAGCTCATAGGGTTGAATGTATGGCAAGATCAGGCCCCATCAAGTTGACAGCAGGAGCAAAACACATCTTGAAACCAAGTCAGAGTATGGATGTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGATAGCTTGGCAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTTTAAAAGAAGGAACTTAGCTCTAATCCATCCATGATGCAATCATCTCATCTTTGTAATCACTGACATGACAGAGTTGTTGTAATTTTAAGATGACCCTGTCATTCACCAAGCAAGTTTTTCGGCATGCGGAGGCATGTTATCTTAAACCGAAACTAATCATGCTGCTACAAGTTTTGACCTTGTTTCTTATTGTAAGTTCTAGATCAAATCTCCACTACAAAATCTCTCGTAGGCGATAAAATTACACAAGTGTTAAGATTCACTGTTTAAAAACGTCTTGTTAGTTCCTGTAATGAATGCCCGAGAGCACGGAGTTGTATTTATTTCCTTGAGGGCTAAGTTTAGTTAATATAGCTAACATTATATGTATTTAACATGAT
Coding sequence (CDS)
ATGGCTGATCCCACCTCCACTTTCTCAATCAGAGAGTATGCTTTGAATAAAAGAAGCATGGGTTTAACGACAATTAGTTGGCCATTTAGTGAGAAGGTGAAGAAAGAAGTGGCAGAATCTCTACTTCCACCAATGGATGTAAAGAAATTCCGTTGGTGGTCGTCGTTATGGTTGTCGTCACAGGAAGAAGAAGAAGGAGAAGAAGGAGAAGAAAAAGAAGTAATTACAGAGAGAATTAAAATGCAGAAGATTTGTCCGGTCTGCGGGGTTTTTGTTGCAGCCACGGTAGCAGCGGTAAATGCACATATTGATACTTGTTTAGCTCAAACAACAAGCAAGGAAATTAGGAGAAAGAACTATTTGAAGGCAAAATCAAGAACCCCAAAAAAGAGATCAATAGCCGAAATATTTGCAGTGGCTCCGCCAGTAAAAACAATGATTGTTGTTAATGATTGTTGTGAAGACGAAGAAGAAAAAAAGGCCGTTGGGAAACAAATTATTCATCACAACAAGAACCTCAAAACGACATCGTTGGCTACAAGTCTTGTCTCTGCAATCAAGACAATCAAAAACAAAATCGCTACGACAACAGAAGAACCAACAATTCTTGCCAAGAGGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAATAAGGACTTTTGTCATGGGAAACTTTGCAAGAAGGGAGATATCAGAAATCACAAGGATGTTTCTACTTTTTGTAAGAGACGGCCTTGTTTTAAACGATTGTCTAAACAAAAAAAGAAAAAACTAGCTAAAAAATCCACTGTGGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTTATCAATTTAAAAGGCAGCAATCAAGCTTTCAACAATGGCGGTCAAAAGTCCGATAGGCGTGTTAGCTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACCAGAACCATTTCTGATACATTTGAACAAAATGTTGGCAATCCATTTCAAGCTTCAGAAGTAAGTACTAATTCAGGTGAAAGTAATAAAGAAGTTCCTTCAATGGAGGCCAATTTAAACGATGATGTTGATTGCTTTAACAGTACCCGACACAAAGTCGATAGTCAACATGTCAAAGGAAAGATTCAATTGCCTAATTTTCATAATCAGGTCAATGCTCAAAGTTGGGAAAATCCGAAGCATTCAACCGAGAAGTTGATATTGGAAAGTCGAGATATTCCTCATGATCGAAATGATTTGCATTTGTTTGATCATGTCTATGTAGATGCACATCAGAAGCTGCCACCAGAACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCCATATGGCCATGTAAGAACTCAATGTGGTTTAAATGTTGTCCCACAAGCTCATTCTCTTTATGGAAAATCAGTTGATCATTTGATAAATAATAATAACCATTTCAATGGAGTAGCTGCCTTAGGCTCAGTTACGAGCAGAGTGCCTTCTTCTTCTTTAACTGAAAATCCTGTTAGCAGATTTTTAAATCTGGCTGAATCTTCTGCTAGAGACAGTAATAGATTTCAAATTTCGAATGGGGAACAAAGTGTGGTTACTTACAAAGAGAAAGGGGTAAATGATGGATTTTTCTGTCTGCCATTGAACTCAAGGGGTGAACTGATACAGCTAAATTCAGGTTTGACTGATAGGTTTGACCAAATGAATGAAGCTAGTACCACTATAGCCGGTTCTAGCAGGATACCAGTATGCAATTTCGTCGTGCCAAGGAGCAGGGATTATTTCGTCGACAATGAGAAGCTCTTTCTTGACACGAAACTTACAGGAAACCAGTTGACTTTATTTCCATTGCATAGTCATATGCAAGAAAATCAAAATCGATATTTGCCAGCTGGTTTCGACGTCCCCGAGCCTGGAACTTCAGAAACAGCTGATATTAGACTGATGAATTCAGAAAGGGGAACTGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTACGAAAAGTTTCAGAACCAAAATGTAAGTGCACAGTTTTATCCAGAAAATTCAAGTAGCATGTGTGCGAATCCTGGTCGACAAACAATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGAAAAGATGTCCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCACACTTAATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATTATCCATCAAGAGGAGAAACCTTGTTTTATCATCCTGCAGGCTTTCATGGTAATCAAGTGGCACAAGGCAATTTATTGGCAAATGCTCCACAAGCTGTTAGGTACCCACATCCGTGTACCAATCGAAAAAGCAGTTTATTGTATCCAAGACCTGAATCTGTCATCAACTTAAATGAAAGGTTCAACAACATCCATTCTTTTCCTACCTCGTCGACTGACACGCTTAATATGGCACGAAACTTTCAAGCACCCTTTGTTTCTGGTCTGGAAACACAAAGGTTTTGTTCACAGCCGTCAGCATTTTCTACTTCTCACCACGTGTGTCCAAATAGATATGAAAATTCTTTTGAACTTGGCTTTAACCAGAGTTTACATCCAGCAAAATTAGGAACCTTTAACTTCCCTTTCTTGCAGCCAGATGATGGAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAAGAGTCTTTCCCCATGGATATTACACGATCACCAACGGGAAGTGCCACCAACTGCAAATTCTAAACTCGCTGACGTAAATGGATACTATTGTCCATGTACTCCTGGCACAGATGTTCTCATTAGCCCATCTTCGATTCATCACCAGCTCGAAACTGCCTATCCTTGTAGTACAATGGCATATTCTCACTTACAGACGAAGAACCATATACCTGGCTCGACATCTTTGTTTCAACCAATTCCCATTGCTCCACGAGTACTTCATTCACCAATTGCCAACGCAGGCCATGAAATTAGAATGAGATCTGAGGATAGATTGAAGTTCAACTCTTTGAGTGTCAAGAACTCTGATTTTTCAAGTAAAAAACAACTGGCCGAAGAGTTTGTTGATTCGAGGAAGCGTCAAAAGACATTAAGTTTAGAAACGAACAATTCTGGTGTTGTGCCAGAATGGACAAGAGGAAAATACAGTGATGATCACCTGAAATCTAACCCCGGGACGGTGAAAATTCATGCTAACTGGGACAAAGCTGTTAACTCAGTAGGAAATATCCCAAATATGACTCAAACTACTGATGGAATAGTGATATCTGCCAACAATAATGAAGCTCATAGGGTTGAATGTATGGCAAGATCAGGCCCCATCAAGTTGACAGCAGGAGCAAAACACATCTTGAAACCAAGTCAGAGTATGGATGTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGATAGCTTGGCAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTTTAA
Protein sequence
MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSSQEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNYLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLATSLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEASTTIAGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPGTDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF*
Homology
BLAST of CSPI06G22850 vs. ExPASy TrEMBL
Match:
A0A0A0KJS6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1)
HSP 1 Score: 2384.0 bits (6177), Expect = 0.0e+00
Identity = 1189/1191 (99.83%), Postives = 1190/1191 (99.92%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS
Sbjct: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
Query: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNY 120
QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNY
Sbjct: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNY 120
Query: 121 LKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLAT 180
LKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLAT
Sbjct: 121 LKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLAT 180
Query: 181 SLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTF 240
SLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTF
Sbjct: 181 SLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTF 240
Query: 241 CKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSN 300
CKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSN
Sbjct: 241 CKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSN 300
Query: 301 QAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPS 360
QAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPS
Sbjct: 301 QAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPS 360
Query: 361 MEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIP 420
MEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIP
Sbjct: 361 MEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIP 420
Query: 421 HDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGK 480
HDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGK
Sbjct: 421 HDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGK 480
Query: 481 SVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQS 540
SVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQ
Sbjct: 481 SVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQG 540
Query: 541 VVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEASTTIAGSSRIPVCNFVVPR 600
VVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEA+TTIAGSSRIPVCNFVVPR
Sbjct: 541 VVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEANTTIAGSSRIPVCNFVVPR 600
Query: 601 SRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMN 660
SRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMN
Sbjct: 601 SRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMN 660
Query: 661 SERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGK 720
SERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGK
Sbjct: 661 SERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGK 720
Query: 721 DVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLF 780
DVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLF
Sbjct: 721 DVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLF 780
Query: 781 YHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPT 840
YHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPT
Sbjct: 781 YHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPT 840
Query: 841 SSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKL 900
SSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKL
Sbjct: 841 SSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKL 900
Query: 901 GTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPG 960
GTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPG
Sbjct: 901 GTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPG 960
Query: 961 TDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGH 1020
TDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGH
Sbjct: 961 TDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGH 1020
Query: 1021 EIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKY 1080
EIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKY
Sbjct: 1021 EIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKY 1080
Query: 1081 SDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIK 1140
SDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIK
Sbjct: 1081 SDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIK 1140
Query: 1141 LTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF 1192
LTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF
Sbjct: 1141 LTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF 1191
BLAST of CSPI06G22850 vs. ExPASy TrEMBL
Match:
A0A5D3DCZ7 (Putative Zinc finger, Rad18-type OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold75860G00170 PE=4 SV=1)
HSP 1 Score: 1759.2 bits (4555), Expect = 0.0e+00
Identity = 871/922 (94.47%), Postives = 888/922 (96.31%), Query Frame = 0
Query: 272 MPPLRSILKHSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTIS 331
MPPLRSILK SVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTR IS
Sbjct: 1 MPPLRSILKRSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRAIS 60
Query: 332 DTFEQNVGNPFQASEVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLP 391
DTFEQNVGNPFQASEV NSGESNK VPSMEANLNDDVDCFN TRHKVDSQHVKGKIQLP
Sbjct: 61 DTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFN-TRHKVDSQHVKGKIQLP 120
Query: 392 NFHNQVNAQSWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALL 451
NFHNQVNA+ WEN KHSTEKLILESRDIPHDRNDLH F HVYVDAHQKLP +HSAIPALL
Sbjct: 121 NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180
Query: 452 AAQEERPYGHVRTQCGL-NVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSL 511
A QEERPYGHVRTQCGL NVVPQAHSLYGKSVD+LINNNNHFNGVAALGSVTSRVPSSSL
Sbjct: 181 AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240
Query: 512 TENPVSRFLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 571
TENPVSR NLAESSARDSNRFQ NGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG
Sbjct: 241 TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300
Query: 572 LTDRFDQMNEASTTIAGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSH 631
LTDRFDQMNEAS T+AGSSRIPVCN VVPRSRDYFVDNEKLF+DTKLTGNQLTLFPLHSH
Sbjct: 301 LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360
Query: 632 MQENQNRYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQ 691
MQENQNRYLPAGFDVPEPGTSETADIRLM+SERGTETGRFFHP LMDSPFNRCRYYEKFQ
Sbjct: 361 MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420
Query: 692 NQNVSAQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCL 751
NQNVS QFYPENSSSMC NPGRQTMRLMGKDVAVGGNGKD QEPEVINF KNSHL+GNCL
Sbjct: 421 NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480
Query: 752 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 811
TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC
Sbjct: 481 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540
Query: 812 TNRKSSLLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPS 871
TNRKSS+LYPRPESVINLNERF++IHSFP SSTDTLNMARNFQAPFVSGLETQRFCSQPS
Sbjct: 541 TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600
Query: 872 AFSTSHHVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPW 931
AFSTSHH+CPNRYENSFELGFNQSLHPAKLGTFNFPFLQ DDGNHVQLPWSHTSKSLSPW
Sbjct: 601 AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSKSLSPW 660
Query: 932 ILHDHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQ 991
ILHDHQRE+PPTANSKLAD+NGYYCPCTP GTDVLISPSSIHH+LETAYPCSTMAYSHLQ
Sbjct: 661 ILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAYPCSTMAYSHLQ 720
Query: 992 TKNHIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLA 1051
TKNHI GSTS FQPIPIAPRVL SPIANAGHEIRMRSEDRLKFNSLSVK+SDFSSKKQLA
Sbjct: 721 TKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVKDSDFSSKKQLA 780
Query: 1052 EEFVDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIP 1111
EEFVDSRKRQKTLSLETNNSG+VPEWTRGKYSDDHLKSNPG KIHAN DKAVNSVGNIP
Sbjct: 781 EEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANRDKAVNSVGNIP 840
Query: 1112 NMTQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS 1171
NMTQTTDGIVISANNNEAH+VEC ARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS
Sbjct: 841 NMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS 900
Query: 1172 AGLVHSDSLAGSQKKSTKVYSF 1192
AGLVHSDSLAGSQKKSTKVYSF
Sbjct: 901 AGLVHSDSLAGSQKKSTKVYSF 920
BLAST of CSPI06G22850 vs. ExPASy TrEMBL
Match:
A0A1S3BM77 (uncharacterized protein LOC103491377 OS=Cucumis melo OX=3656 GN=LOC103491377 PE=4 SV=1)
HSP 1 Score: 1759.2 bits (4555), Expect = 0.0e+00
Identity = 871/922 (94.47%), Postives = 888/922 (96.31%), Query Frame = 0
Query: 272 MPPLRSILKHSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTIS 331
MPPLRSILK SVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTR IS
Sbjct: 1 MPPLRSILKRSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRAIS 60
Query: 332 DTFEQNVGNPFQASEVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLP 391
DTFEQNVGNPFQASEV NSGESNK VPSMEANLNDDVDCFN TRHKVDSQHVKGKIQLP
Sbjct: 61 DTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFN-TRHKVDSQHVKGKIQLP 120
Query: 392 NFHNQVNAQSWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALL 451
NFHNQVNA+ WEN KHSTEKLILESRDIPHDRNDLH F HVYVDAHQKLP +HSAIPALL
Sbjct: 121 NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180
Query: 452 AAQEERPYGHVRTQCGL-NVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSL 511
A QEERPYGHVRTQCGL NVVPQAHSLYGKSVD+LINNNNHFNGVAALGSVTSRVPSSSL
Sbjct: 181 AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240
Query: 512 TENPVSRFLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 571
TENPVSR NLAESSARDSNRFQ NGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG
Sbjct: 241 TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300
Query: 572 LTDRFDQMNEASTTIAGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSH 631
LTDRFDQMNEAS T+AGSSRIPVCN VVPRSRDYFVDNEKLF+DTKLTGNQLTLFPLHSH
Sbjct: 301 LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360
Query: 632 MQENQNRYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQ 691
MQENQNRYLPAGFDVPEPGTSETADIRLM+SERGTETGRFFHP LMDSPFNRCRYYEKFQ
Sbjct: 361 MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420
Query: 692 NQNVSAQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCL 751
NQNVS QFYPENSSSMC NPGRQTMRLMGKDVAVGGNGKD QEPEVINF KNSHL+GNCL
Sbjct: 421 NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480
Query: 752 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 811
TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC
Sbjct: 481 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540
Query: 812 TNRKSSLLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPS 871
TNRKSS+LYPRPESVINLNERF++IHSFP SSTDTLNMARNFQAPFVSGLETQRFCSQPS
Sbjct: 541 TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600
Query: 872 AFSTSHHVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPW 931
AFSTSHH+CPNRYENSFELGFNQSLHPAKLGTFNFPFLQ DDGNHVQLPWSHTSKSLSPW
Sbjct: 601 AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSKSLSPW 660
Query: 932 ILHDHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQ 991
ILHDHQRE+PPTANSKLAD+NGYYCPCTP GTDVLISPSSIHH+LETAYPCSTMAYSHLQ
Sbjct: 661 ILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAYPCSTMAYSHLQ 720
Query: 992 TKNHIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLA 1051
TKNHI GSTS FQPIPIAPRVL SPIANAGHEIRMRSEDRLKFNSLSVK+SDFSSKKQLA
Sbjct: 721 TKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVKDSDFSSKKQLA 780
Query: 1052 EEFVDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIP 1111
EEFVDSRKRQKTLSLETNNSG+VPEWTRGKYSDDHLKSNPG KIHAN DKAVNSVGNIP
Sbjct: 781 EEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANRDKAVNSVGNIP 840
Query: 1112 NMTQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS 1171
NMTQTTDGIVISANNNEAH+VEC ARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS
Sbjct: 841 NMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS 900
Query: 1172 AGLVHSDSLAGSQKKSTKVYSF 1192
AGLVHSDSLAGSQKKSTKVYSF
Sbjct: 901 AGLVHSDSLAGSQKKSTKVYSF 920
BLAST of CSPI06G22850 vs. ExPASy TrEMBL
Match:
A0A6J1D428 (uncharacterized protein LOC111016842 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016842 PE=4 SV=1)
HSP 1 Score: 1369.0 bits (3542), Expect = 0.0e+00
Identity = 784/1217 (64.42%), Postives = 899/1217 (73.87%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWS----SL 60
MA S FSIREYALN R L WPF + VKKEVAE++LPP+ V KFRWWS +L
Sbjct: 1 MAVAPSGFSIREYALNMRGRDLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 WLSSQEE-----EEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTT 120
S+ E ++ EE++VI M+KICPVCGVFV ATV A+NAHID+CLAQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCLAQTI 120
Query: 121 SKEIRRKN---YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIH 180
+ + R+ N +K KSRTPKKRSIAEIFAVAPPV+T++ ED G II
Sbjct: 121 TNQKRKNNSNGAVKPKSRTPKKRSIAEIFAVAPPVETVV------ED-------GGGIIR 180
Query: 181 HNKNLKTTSLATSLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKK--KNKDFCHGKLC 240
+ LK TSLA +LV+A+KTIK AKR K+ K K KNKDF H L
Sbjct: 181 QKQQLKATSLARTLVTAMKTIK-------------AKRNKQHKLKASVVKNKDFGHELLR 240
Query: 241 KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300
KKG+ RNHKDVS CK +PCFKRLS+QKKKKL KKS V AKQQRP+P +RSILK SVK +
Sbjct: 241 KKGE-RNHKDVSVRCK-KPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILKQSVKVV 300
Query: 301 SETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASE 360
SET+ S NLKGS Q NNGG++SDRRVSF DKDDVLGP TR SDTFEQ+VGNPFQ SE
Sbjct: 301 SETDPSG-NLKGSKQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVGNPFQDSE 360
Query: 361 VSTNSGESNKEVPSME-ANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQ----- 420
+T SGESNK V SME LNDD+ F STRH VDSQ +KGKIQLPN H+QVNAQ
Sbjct: 361 GNTMSGESNKGVASMEDVGLNDDIVSF-STRHGVDSQRIKGKIQLPNIHDQVNAQISSMR 420
Query: 421 ---SWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEER 480
W N KH E+ I +R +PH+ N HLFDHVY+DA Q+ PP HSAIPALLAAQ+ER
Sbjct: 421 PHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALLAAQDER 480
Query: 481 PYGHVRTQCGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSR 540
YG VRTQ G N P AH+ GKSVDHL+ N NGVA LGS+TS VP+ +LTEN V R
Sbjct: 481 QYGQVRTQXGSN-FPGAHTFNGKSVDHLV---NPINGVANLGSMTSTVPTFTLTENGVGR 540
Query: 541 FLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQ 600
NLAESSA+D NR N EQ V YKEKG+NDGFFCLPLNS+GELIQLNSGL +R+DQ
Sbjct: 541 LFNLAESSAKD-NRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRYDQ 600
Query: 601 MNEASTTIAGSSRIPVCNFVVPRS-RDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQN 660
MNEA +A SSRIPVC V PRS RDYF+DNEK+ +DT+LT NQLTLFPLHS MQEN+N
Sbjct: 601 MNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQENRN 660
Query: 661 RYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSA 720
+YL A FDV EPGTS DIRL+NSERGT++G H NLMD+PFNRCRYY K NQNVS
Sbjct: 661 QYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNVST 720
Query: 721 QFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQE 780
+ YPENSS+M ANP RQTMRLMGKDVAVGGNGK+VQEPE INFWKNS LI NCLTN IQE
Sbjct: 721 EIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSIQE 780
Query: 781 THMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSS 840
MRKRNFLQDR LHYPS+GETLFY PAGFH QVAQ NLL NAPQ VRYPHP NRK+
Sbjct: 781 NPMRKRNFLQDRVLHYPSKGETLFY-PAGFHSGQVAQSNLLPNAPQ-VRYPHPRLNRKNG 840
Query: 841 LLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSH 900
++Y R +SVINLNERF+NI++F SST+ NMA NFQAPF+SG T RF QP AFSTS
Sbjct: 841 VMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTSQ 900
Query: 901 HVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQ 960
H+C NRYE+SFELG+NQ+ HPAKLGTFNFPFLQPDD NHV W Q
Sbjct: 901 HMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHVPPSWL--------------Q 960
Query: 961 REVPPTANSKLADVNGYYCP-CTPGTDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIP 1020
++ PTA SKLAD+NG Y P + G DVL SP S+ + E A+PCSTM SH Q KN IP
Sbjct: 961 QDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP-SHRQVKN-IP 1020
Query: 1021 GSTSLFQPIPIAPRVLHSPIANAGHEIRMRS-EDRLKFNSLSVKNSDFSSKKQLAEEFVD 1080
GSTS+FQPIP+ PR I AGHE R+ EDRLKF +LSVK++D SKKQ E +D
Sbjct: 1021 GSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLLSKKQPVGELID 1080
Query: 1081 SRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQT 1140
SRKRQK LSLETNNSGVV EWT GK++D+ +SNPG+ KIH NWDKAVN N+PN+T+
Sbjct: 1081 SRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVNPTXNLPNVTE- 1140
Query: 1141 TDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVH 1192
TDG+++ + NE+ +VE MARSGP+KLTAGAKHILKPSQSMD+DNTKPTYSTIPS+GLVH
Sbjct: 1141 TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKPTYSTIPSSGLVH 1153
BLAST of CSPI06G22850 vs. ExPASy TrEMBL
Match:
A0A6J1D325 (uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016842 PE=4 SV=1)
HSP 1 Score: 1354.0 bits (3503), Expect = 0.0e+00
Identity = 777/1217 (63.85%), Postives = 893/1217 (73.38%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWS----SL 60
MA S FSIR+ WPF + VKKEVAE++LPP+ V KFRWWS +L
Sbjct: 1 MAVAPSGFSIRDLGR----------CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 WLSSQEE-----EEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTT 120
S+ E ++ EE++VI M+KICPVCGVFV ATV A+NAHID+CLAQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCLAQTI 120
Query: 121 SKEIRRKN---YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIH 180
+ + R+ N +K KSRTPKKRSIAEIFAVAPPV+T++ ED G II
Sbjct: 121 TNQKRKNNSNGAVKPKSRTPKKRSIAEIFAVAPPVETVV------ED-------GGGIIR 180
Query: 181 HNKNLKTTSLATSLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKK--KNKDFCHGKLC 240
+ LK TSLA +LV+A+KTIK AKR K+ K K KNKDF H L
Sbjct: 181 QKQQLKATSLARTLVTAMKTIK-------------AKRNKQHKLKASVVKNKDFGHELLR 240
Query: 241 KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300
KKG+ RNHKDVS CK +PCFKRLS+QKKKKL KKS V AKQQRP+P +RSILK SVK +
Sbjct: 241 KKGE-RNHKDVSVRCK-KPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILKQSVKVV 300
Query: 301 SETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASE 360
SET+ S NLKGS Q NNGG++SDRRVSF DKDDVLGP TR SDTFEQ+VGNPFQ SE
Sbjct: 301 SETDPSG-NLKGSKQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVGNPFQDSE 360
Query: 361 VSTNSGESNKEVPSME-ANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQ----- 420
+T SGESNK V SME LNDD+ F STRH VDSQ +KGKIQLPN H+QVNAQ
Sbjct: 361 GNTMSGESNKGVASMEDVGLNDDIVSF-STRHGVDSQRIKGKIQLPNIHDQVNAQISSMR 420
Query: 421 ---SWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEER 480
W N KH E+ I +R +PH+ N HLFDHVY+DA Q+ PP HSAIPALLAAQ+ER
Sbjct: 421 PHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALLAAQDER 480
Query: 481 PYGHVRTQCGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSR 540
YG VRTQ G N P AH+ GKSVDHL+ N NGVA LGS+TS VP+ +LTEN V R
Sbjct: 481 QYGQVRTQXGSN-FPGAHTFNGKSVDHLV---NPINGVANLGSMTSTVPTFTLTENGVGR 540
Query: 541 FLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQ 600
NLAESSA+D NR N EQ V YKEKG+NDGFFCLPLNS+GELIQLNSGL +R+DQ
Sbjct: 541 LFNLAESSAKD-NRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRYDQ 600
Query: 601 MNEASTTIAGSSRIPVCNFVVPRS-RDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQN 660
MNEA +A SSRIPVC V PRS RDYF+DNEK+ +DT+LT NQLTLFPLHS MQEN+N
Sbjct: 601 MNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQENRN 660
Query: 661 RYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSA 720
+YL A FDV EPGTS DIRL+NSERGT++G H NLMD+PFNRCRYY K NQNVS
Sbjct: 661 QYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNVST 720
Query: 721 QFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQE 780
+ YPENSS+M ANP RQTMRLMGKDVAVGGNGK+VQEPE INFWKNS LI NCLTN IQE
Sbjct: 721 EIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSIQE 780
Query: 781 THMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSS 840
MRKRNFLQDR LHYPS+GETLFY PAGFH QVAQ NLL NAPQ VRYPHP NRK+
Sbjct: 781 NPMRKRNFLQDRVLHYPSKGETLFY-PAGFHSGQVAQSNLLPNAPQ-VRYPHPRLNRKNG 840
Query: 841 LLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSH 900
++Y R +SVINLNERF+NI++F SST+ NMA NFQAPF+SG T RF QP AFSTS
Sbjct: 841 VMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTSQ 900
Query: 901 HVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQ 960
H+C NRYE+SFELG+NQ+ HPAKLGTFNFPFLQPDD NHV W Q
Sbjct: 901 HMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHVPPSWL--------------Q 960
Query: 961 REVPPTANSKLADVNGYYCP-CTPGTDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIP 1020
++ PTA SKLAD+NG Y P + G DVL SP S+ + E A+PCSTM SH Q KN IP
Sbjct: 961 QDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP-SHRQVKN-IP 1020
Query: 1021 GSTSLFQPIPIAPRVLHSPIANAGHEIRMRS-EDRLKFNSLSVKNSDFSSKKQLAEEFVD 1080
GSTS+FQPIP+ PR I AGHE R+ EDRLKF +LSVK++D SKKQ E +D
Sbjct: 1021 GSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLLSKKQPVGELID 1080
Query: 1081 SRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQT 1140
SRKRQK LSLETNNSGVV EWT GK++D+ +SNPG+ KIH NWDKAVN N+PN+T+
Sbjct: 1081 SRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVNPTXNLPNVTE- 1140
Query: 1141 TDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVH 1192
TDG+++ + NE+ +VE MARSGP+KLTAGAKHILKPSQSMD+DNTKPTYSTIPS+GLVH
Sbjct: 1141 TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKPTYSTIPSSGLVH 1144
BLAST of CSPI06G22850 vs. NCBI nr
Match:
XP_011657559.1 (uncharacterized protein LOC105435872 [Cucumis sativus] >KGN47991.1 hypothetical protein Csa_004444 [Cucumis sativus])
HSP 1 Score: 2384.0 bits (6177), Expect = 0.0e+00
Identity = 1189/1191 (99.83%), Postives = 1190/1191 (99.92%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS
Sbjct: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
Query: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNY 120
QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNY
Sbjct: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKNY 120
Query: 121 LKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLAT 180
LKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLAT
Sbjct: 121 LKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHNKNLKTTSLAT 180
Query: 181 SLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTF 240
SLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTF
Sbjct: 181 SLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKDVSTF 240
Query: 241 CKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSN 300
CKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSN
Sbjct: 241 CKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINLKGSN 300
Query: 301 QAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPS 360
QAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPS
Sbjct: 301 QAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGESNKEVPS 360
Query: 361 MEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIP 420
MEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIP
Sbjct: 361 MEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLILESRDIP 420
Query: 421 HDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGK 480
HDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGK
Sbjct: 421 HDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQAHSLYGK 480
Query: 481 SVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQS 540
SVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQ
Sbjct: 481 SVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQISNGEQG 540
Query: 541 VVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEASTTIAGSSRIPVCNFVVPR 600
VVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEA+TTIAGSSRIPVCNFVVPR
Sbjct: 541 VVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEANTTIAGSSRIPVCNFVVPR 600
Query: 601 SRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMN 660
SRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMN
Sbjct: 601 SRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPGTSETADIRLMN 660
Query: 661 SERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGK 720
SERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGK
Sbjct: 661 SERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQTMRLMGK 720
Query: 721 DVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLF 780
DVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLF
Sbjct: 721 DVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYPSRGETLF 780
Query: 781 YHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPT 840
YHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPT
Sbjct: 781 YHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERFNNIHSFPT 840
Query: 841 SSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKL 900
SSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKL
Sbjct: 841 SSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFNQSLHPAKL 900
Query: 901 GTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPG 960
GTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPG
Sbjct: 901 GTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNGYYCPCTPG 960
Query: 961 TDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGH 1020
TDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGH
Sbjct: 961 TDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVLHSPIANAGH 1020
Query: 1021 EIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKY 1080
EIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKY
Sbjct: 1021 EIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGVVPEWTRGKY 1080
Query: 1081 SDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIK 1140
SDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIK
Sbjct: 1081 SDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVECMARSGPIK 1140
Query: 1141 LTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF 1192
LTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF
Sbjct: 1141 LTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF 1191
BLAST of CSPI06G22850 vs. NCBI nr
Match:
XP_038888639.1 (uncharacterized protein LOC120078436 [Benincasa hispida])
HSP 1 Score: 1880.1 bits (4869), Expect = 0.0e+00
Identity = 972/1200 (81.00%), Postives = 1049/1200 (87.42%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
MA PTS FSIREYALNKRS LT ISWPFSEKVKKEVAE+LLPPMDVKKFRWWSS
Sbjct: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSS----- 60
Query: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCL-AQTTSKEIRRKN 120
E EE+EVI ERIKMQKICPVCGVFVAATV AVNAHID+CL +Q TSKEIR+K
Sbjct: 61 ----ERVISEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNSQITSKEIRKK- 120
Query: 121 YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHH---NKNLKTT 180
LKAKSRTPKKRSIA+IFAVAPPVKTMI+ NDCC++EEEKKAVGKQII H N NLKTT
Sbjct: 121 -LKAKSRTPKKRSIADIFAVAPPVKTMIIANDCCDEEEEKKAVGKQIIRHNNNNNNLKTT 180
Query: 181 SLATSLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKKKNKDFCHGKLCKKGDIRNHKD 240
SLATSLVS IKTI T E+P+IL K+KK KDF HG+LC+KG+IRNHKD
Sbjct: 181 SLATSLVSTIKTINT--TTEQEQPSILHKKKK---------KDFGHGQLCRKGEIRNHKD 240
Query: 241 VSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAISETNSSFINL 300
VST CK +PCFKRL +QK+KKL KKS VVAKQQRPMP LRSILKHSVKA SETN S INL
Sbjct: 241 VSTLCK-KPCFKRLCRQKRKKLVKKSNVVAKQQRPMPLLRSILKHSVKATSETNFSSINL 300
Query: 301 KG-SNQAFNN-GGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASEVSTNSGES 360
+G +NQ FNN GGQKSDRRVSFLDKDDVLG ST SDTFEQNVGNPFQASEVSTNSGES
Sbjct: 301 RGNNNQVFNNGGGQKSDRRVSFLDKDDVLGLSTEVFSDTFEQNVGNPFQASEVSTNSGES 360
Query: 361 NKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENPKHSTEKLIL 420
NKEV +EANLNDDV CF ST+H+VD QH KGKIQLPNFHNQVNA+SW+N KHSTE LI
Sbjct: 361 NKEVAPVEANLNDDV-CF-STQHEVDGQHAKGKIQLPNFHNQVNAESWDNAKHSTENLIS 420
Query: 421 ESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQCGLNVVPQA 480
+++DIPHD+NDL LFDHVYVD QKL P HSAIPALLAAQEER YGHVRTQCGLN + QA
Sbjct: 421 KNQDIPHDQNDLRLFDHVYVDGLQKLSPVHSAIPALLAAQEERQYGHVRTQCGLNSIRQA 480
Query: 481 HSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESSARDSNRFQI 540
HSLYGKS DHLIN N NGVAALGS+TSRVPSSSL+ENPVSRFLNLAESS +D+ F
Sbjct: 481 HSLYGKSTDHLINPFN--NGVAALGSITSRVPSSSLSENPVSRFLNLAESSIKDT-IFPF 540
Query: 541 SNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEASTTIAGSSRIPVC 600
SNGE+S+V+YKEKGVNDGFFCLPLNS+GELIQLNSGL +RFDQMNEAS TIA SSRIPVC
Sbjct: 541 SNGEESMVSYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNTIACSSRIPVC 600
Query: 601 NFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDVPEPG-TSET 660
+ V+PRSRDYF+DNEKL +DT+LTGNQLTLFPLHSH+ ENQNRY PAGFD+ EPG TSET
Sbjct: 601 SLVLPRSRDYFIDNEKLLVDTELTGNQLTLFPLHSHLPENQNRYFPAGFDISEPGITSET 660
Query: 661 ADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSSMCANPGRQ 720
ADIRLMNSERGTE+GRFFHPNLMDSP+NRCRYY KFQNQNVS QFYPENSSSMCANPG+Q
Sbjct: 661 ADIRLMNSERGTESGRFFHPNLMDSPYNRCRYYGKFQNQNVSTQFYPENSSSMCANPGQQ 720
Query: 721 TMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFLQDRELHYP 780
TMRLMGKDVAVGGN ++VQEPEVINFWKNS LIGNCLTNPIQETHMRKRNFLQDRELH+P
Sbjct: 721 TMRLMGKDVAVGGNRQEVQEPEVINFWKNSTLIGNCLTNPIQETHMRKRNFLQDRELHHP 780
Query: 781 SRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESVINLNERF- 840
S+GETLFYHPAGFHGNQVAQ N ANA Q VRYPHP NRKSS++Y RP+SVINLNE F
Sbjct: 781 SKGETLFYHPAGFHGNQVAQSNFFANASQ-VRYPHPHLNRKSSIMYQRPDSVINLNESFN 840
Query: 841 NNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYENSFELGFN 900
NNIH+F SSTDT NMA+NFQ PF+SG ET RF SQPSAFSTSHH CPNRYENSFELGFN
Sbjct: 841 NNIHAFSPSSTDTFNMAQNFQGPFISGPETLRFGSQPSAFSTSHHTCPNRYENSFELGFN 900
Query: 901 QSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQREVPPTANSKLADVNG 960
Q+LHPAKLGTFNFPFLQPDD HVQLPWSHTSKSL PW+LHDHQRE P T NSKLAD+NG
Sbjct: 901 QNLHPAKLGTFNFPFLQPDDETHVQLPWSHTSKSLPPWMLHDHQREAPQTTNSKLADLNG 960
Query: 961 YYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIPGSTSLFQPIPIAPRVL 1020
YYCPC P GTDVLI+PSS+HH+LETAYPCSTM YSHLQTKNHIPG TS FQP+P+APR+L
Sbjct: 961 YYCPCIPFGTDVLINPSSMHHRLETAYPCSTMPYSHLQTKNHIPGPTSFFQPMPVAPRIL 1020
Query: 1021 HSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGV 1080
SPIANAGHEIR+ SEDRLKFN+LSVK+ DFSSK LA E VDSRKRQK SLETNNSGV
Sbjct: 1021 QSPIANAGHEIRLSSEDRLKFNTLSVKDFDFSSKTLLAGELVDSRKRQKISSLETNNSGV 1080
Query: 1081 VPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQTTDGIVISANNNEAHRVE 1140
VP WTRGK+SDDHL+SNPGTVKIHANWDKAVNS GNIPNMTQTTDG+VIS NNE + E
Sbjct: 1081 VPGWTRGKFSDDHLESNPGTVKIHANWDKAVNSAGNIPNMTQTTDGVVISTKNNETPKFE 1140
Query: 1141 CMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVHSDSLAGSQKKSTKVYSF 1192
CMARSGPIKLTAGAKHILKPSQS+D+DNTKPTYSTIPSAGLVHS SLAGSQKKSTKVYSF
Sbjct: 1141 CMARSGPIKLTAGAKHILKPSQSVDIDNTKPTYSTIPSAGLVHSVSLAGSQKKSTKVYSF 1171
BLAST of CSPI06G22850 vs. NCBI nr
Match:
XP_008449514.1 (PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo] >KAA0061673.1 putative Zinc finger, Rad18-type [Cucumis melo var. makuwa] >TYK21149.1 putative Zinc finger, Rad18-type [Cucumis melo var. makuwa])
HSP 1 Score: 1759.2 bits (4555), Expect = 0.0e+00
Identity = 871/922 (94.47%), Postives = 888/922 (96.31%), Query Frame = 0
Query: 272 MPPLRSILKHSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTIS 331
MPPLRSILK SVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTR IS
Sbjct: 1 MPPLRSILKRSVKAISETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRAIS 60
Query: 332 DTFEQNVGNPFQASEVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLP 391
DTFEQNVGNPFQASEV NSGESNK VPSMEANLNDDVDCFN TRHKVDSQHVKGKIQLP
Sbjct: 61 DTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFN-TRHKVDSQHVKGKIQLP 120
Query: 392 NFHNQVNAQSWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALL 451
NFHNQVNA+ WEN KHSTEKLILESRDIPHDRNDLH F HVYVDAHQKLP +HSAIPALL
Sbjct: 121 NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180
Query: 452 AAQEERPYGHVRTQCGL-NVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSL 511
A QEERPYGHVRTQCGL NVVPQAHSLYGKSVD+LINNNNHFNGVAALGSVTSRVPSSSL
Sbjct: 181 AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240
Query: 512 TENPVSRFLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 571
TENPVSR NLAESSARDSNRFQ NGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG
Sbjct: 241 TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300
Query: 572 LTDRFDQMNEASTTIAGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSH 631
LTDRFDQMNEAS T+AGSSRIPVCN VVPRSRDYFVDNEKLF+DTKLTGNQLTLFPLHSH
Sbjct: 301 LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360
Query: 632 MQENQNRYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQ 691
MQENQNRYLPAGFDVPEPGTSETADIRLM+SERGTETGRFFHP LMDSPFNRCRYYEKFQ
Sbjct: 361 MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420
Query: 692 NQNVSAQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCL 751
NQNVS QFYPENSSSMC NPGRQTMRLMGKDVAVGGNGKD QEPEVINF KNSHL+GNCL
Sbjct: 421 NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480
Query: 752 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 811
TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC
Sbjct: 481 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540
Query: 812 TNRKSSLLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPS 871
TNRKSS+LYPRPESVINLNERF++IHSFP SSTDTLNMARNFQAPFVSGLETQRFCSQPS
Sbjct: 541 TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600
Query: 872 AFSTSHHVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPW 931
AFSTSHH+CPNRYENSFELGFNQSLHPAKLGTFNFPFLQ DDGNHVQLPWSHTSKSLSPW
Sbjct: 601 AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSKSLSPW 660
Query: 932 ILHDHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQ 991
ILHDHQRE+PPTANSKLAD+NGYYCPCTP GTDVLISPSSIHH+LETAYPCSTMAYSHLQ
Sbjct: 661 ILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAYPCSTMAYSHLQ 720
Query: 992 TKNHIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLA 1051
TKNHI GSTS FQPIPIAPRVL SPIANAGHEIRMRSEDRLKFNSLSVK+SDFSSKKQLA
Sbjct: 721 TKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVKDSDFSSKKQLA 780
Query: 1052 EEFVDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIP 1111
EEFVDSRKRQKTLSLETNNSG+VPEWTRGKYSDDHLKSNPG KIHAN DKAVNSVGNIP
Sbjct: 781 EEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANRDKAVNSVGNIP 840
Query: 1112 NMTQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS 1171
NMTQTTDGIVISANNNEAH+VEC ARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS
Sbjct: 841 NMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPS 900
Query: 1172 AGLVHSDSLAGSQKKSTKVYSF 1192
AGLVHSDSLAGSQKKSTKVYSF
Sbjct: 901 AGLVHSDSLAGSQKKSTKVYSF 920
BLAST of CSPI06G22850 vs. NCBI nr
Match:
XP_022148072.1 (uncharacterized protein LOC111016842 isoform X1 [Momordica charantia])
HSP 1 Score: 1370.1 bits (3545), Expect = 0.0e+00
Identity = 784/1217 (64.42%), Postives = 899/1217 (73.87%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWS----SL 60
MA S FSIREYALN R L WPF + VKKEVAE++LPP+ V KFRWWS +L
Sbjct: 1 MAVAPSGFSIREYALNMRGRDLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 WLSSQEE-----EEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTT 120
S+ E ++ EE++VI M+KICPVCGVFV ATV A+NAHID+CLAQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCLAQTI 120
Query: 121 SKEIRRKN---YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIH 180
+ + R+ N +K KSRTPKKRSIAEIFAVAPPV+T++ ED G II
Sbjct: 121 TNQKRKNNSNGAVKPKSRTPKKRSIAEIFAVAPPVETVV------ED-------GGGIIR 180
Query: 181 HNKNLKTTSLATSLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKK--KNKDFCHGKLC 240
+ LK TSLA +LV+A+KTIK AKR K+ K K KNKDF H L
Sbjct: 181 QKQQLKATSLARTLVTAMKTIK-------------AKRNKQHKLKASVVKNKDFGHELLR 240
Query: 241 KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300
KKG+ RNHKDVS CK +PCFKRLS+QKKKKL KKS V AKQQRP+P +RSILK SVK +
Sbjct: 241 KKGE-RNHKDVSVRCK-KPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILKQSVKVV 300
Query: 301 SETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASE 360
SET+ S NLKGS Q NNGG++SDRRVSF DKDDVLGP TR SDTFEQ+VGNPFQ SE
Sbjct: 301 SETBPSG-NLKGSKQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVGNPFQDSE 360
Query: 361 VSTNSGESNKEVPSME-ANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQ----- 420
+T SGESNK V SME LNDD+ F STRH VDSQ +KGKIQLPN H+QVNAQ
Sbjct: 361 GNTMSGESNKGVASMEDVGLNDDIVSF-STRHGVDSQRIKGKIQLPNIHDQVNAQISSMR 420
Query: 421 ---SWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEER 480
W N KH E+ I +R +PH+ N HLFDHVY+DA Q+ PP HSAIPALLAAQ+ER
Sbjct: 421 PHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALLAAQDER 480
Query: 481 PYGHVRTQCGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSR 540
YG VRTQ G N P AH+ GKSVDHL+ N NGVA LGS+TS VP+ +LTEN V R
Sbjct: 481 QYGQVRTQXGSN-FPGAHTFNGKSVDHLV---NPINGVANLGSMTSTVPTFTLTENGVGR 540
Query: 541 FLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQ 600
NLAESSA+D NR N EQ V YKEKG+NDGFFCLPLNS+GELIQLNSGL +R+DQ
Sbjct: 541 LFNLAESSAKD-NRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRYDQ 600
Query: 601 MNEASTTIAGSSRIPVCNFVVPRS-RDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQN 660
MNEA +A SSRIPVC V PRS RDYF+DNEK+ +DT+LT NQLTLFPLHS MQEN+N
Sbjct: 601 MNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQENRN 660
Query: 661 RYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSA 720
+YL A FDV EPGTS DIRL+NSERGT++G H NLMD+PFNRCRYY K NQNVS
Sbjct: 661 QYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNVST 720
Query: 721 QFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQE 780
+ YPENSS+M ANP RQTMRLMGKDVAVGGNGK+VQEPE INFWKNS LI NCLTN IQE
Sbjct: 721 EIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSIQE 780
Query: 781 THMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSS 840
MRKRNFLQDR LHYPS+GETLFY PAGFH QVAQ NLL NAPQ VRYPHP NRK+
Sbjct: 781 NPMRKRNFLQDRVLHYPSKGETLFY-PAGFHSGQVAQSNLLPNAPQ-VRYPHPRLNRKNG 840
Query: 841 LLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSH 900
++Y R +SVINLNERF+NI++F SST+ NMA NFQAPF+SG T RF QP AFSTS
Sbjct: 841 VMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTSQ 900
Query: 901 HVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQ 960
H+C NRYE+SFELG+NQ+ HPAKLGTFNFPFLQPDD NHV W Q
Sbjct: 901 HMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHVPPSWL--------------Q 960
Query: 961 REVPPTANSKLADVNGYYCP-CTPGTDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIP 1020
++ PTA SKLAD+NG Y P + G DVL SP S+ + E A+PCSTM SH Q KN IP
Sbjct: 961 QDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP-SHRQVKN-IP 1020
Query: 1021 GSTSLFQPIPIAPRVLHSPIANAGHEIRMRS-EDRLKFNSLSVKNSDFSSKKQLAEEFVD 1080
GSTS+FQPIP+ PR I AGHE R+ EDRLKF +LSVK++D SKKQ E +D
Sbjct: 1021 GSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLLSKKQPVGELID 1080
Query: 1081 SRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQT 1140
SRKRQK LSLETNNSGVV EWT GK++D+ +SNPG+ KIH NWDKAVN N+PN+T+
Sbjct: 1081 SRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVNPTXNLPNVTE- 1140
Query: 1141 TDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVH 1192
TDG+++ + NE+ +VE MARSGP+KLTAGAKHILKPSQSMD+DNTKPTYSTIPS+GLVH
Sbjct: 1141 TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKPTYSTIPSSGLVH 1153
BLAST of CSPI06G22850 vs. NCBI nr
Match:
XP_022148073.1 (uncharacterized protein LOC111016842 isoform X2 [Momordica charantia])
HSP 1 Score: 1355.1 bits (3506), Expect = 0.0e+00
Identity = 777/1217 (63.85%), Postives = 893/1217 (73.38%), Query Frame = 0
Query: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWS----SL 60
MA S FSIR+ WPF + VKKEVAE++LPP+ V KFRWWS +L
Sbjct: 1 MAVAPSGFSIRDLGR----------CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 WLSSQEE-----EEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTT 120
S+ E ++ EE++VI M+KICPVCGVFV ATV A+NAHID+CLAQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCLAQTI 120
Query: 121 SKEIRRKN---YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIH 180
+ + R+ N +K KSRTPKKRSIAEIFAVAPPV+T++ ED G II
Sbjct: 121 TNQKRKNNSNGAVKPKSRTPKKRSIAEIFAVAPPVETVV------ED-------GGGIIR 180
Query: 181 HNKNLKTTSLATSLVSAIKTIKNKIATTTEEPTILAKRKKKKKKKKK--KNKDFCHGKLC 240
+ LK TSLA +LV+A+KTIK AKR K+ K K KNKDF H L
Sbjct: 181 QKQQLKATSLARTLVTAMKTIK-------------AKRNKQHKLKASVVKNKDFGHELLR 240
Query: 241 KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300
KKG+ RNHKDVS CK +PCFKRLS+QKKKKL KKS V AKQQRP+P +RSILK SVK +
Sbjct: 241 KKGE-RNHKDVSVRCK-KPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILKQSVKVV 300
Query: 301 SETNSSFINLKGSNQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQASE 360
SET+ S NLKGS Q NNGG++SDRRVSF DKDDVLGP TR SDTFEQ+VGNPFQ SE
Sbjct: 301 SETBPSG-NLKGSKQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVGNPFQDSE 360
Query: 361 VSTNSGESNKEVPSME-ANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQ----- 420
+T SGESNK V SME LNDD+ F STRH VDSQ +KGKIQLPN H+QVNAQ
Sbjct: 361 GNTMSGESNKGVASMEDVGLNDDIVSF-STRHGVDSQRIKGKIQLPNIHDQVNAQISSMR 420
Query: 421 ---SWENPKHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEER 480
W N KH E+ I +R +PH+ N HLFDHVY+DA Q+ PP HSAIPALLAAQ+ER
Sbjct: 421 PHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALLAAQDER 480
Query: 481 PYGHVRTQCGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSR 540
YG VRTQ G N P AH+ GKSVDHL+ N NGVA LGS+TS VP+ +LTEN V R
Sbjct: 481 QYGQVRTQXGSN-FPGAHTFNGKSVDHLV---NPINGVANLGSMTSTVPTFTLTENGVGR 540
Query: 541 FLNLAESSARDSNRFQISNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQ 600
NLAESSA+D NR N EQ V YKEKG+NDGFFCLPLNS+GELIQLNSGL +R+DQ
Sbjct: 541 LFNLAESSAKD-NRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRYDQ 600
Query: 601 MNEASTTIAGSSRIPVCNFVVPRS-RDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQN 660
MNEA +A SSRIPVC V PRS RDYF+DNEK+ +DT+LT NQLTLFPLHS MQEN+N
Sbjct: 601 MNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQENRN 660
Query: 661 RYLPAGFDVPEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSA 720
+YL A FDV EPGTS DIRL+NSERGT++G H NLMD+PFNRCRYY K NQNVS
Sbjct: 661 QYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNVST 720
Query: 721 QFYPENSSSMCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQE 780
+ YPENSS+M ANP RQTMRLMGKDVAVGGNGK+VQEPE INFWKNS LI NCLTN IQE
Sbjct: 721 EIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSIQE 780
Query: 781 THMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSS 840
MRKRNFLQDR LHYPS+GETLFY PAGFH QVAQ NLL NAPQ VRYPHP NRK+
Sbjct: 781 NPMRKRNFLQDRVLHYPSKGETLFY-PAGFHSGQVAQSNLLPNAPQ-VRYPHPRLNRKNG 840
Query: 841 LLYPRPESVINLNERFNNIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSH 900
++Y R +SVINLNERF+NI++F SST+ NMA NFQAPF+SG T RF QP AFSTS
Sbjct: 841 VMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTSQ 900
Query: 901 HVCPNRYENSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSKSLSPWILHDHQ 960
H+C NRYE+SFELG+NQ+ HPAKLGTFNFPFLQPDD NHV W Q
Sbjct: 901 HMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHVPPSWL--------------Q 960
Query: 961 REVPPTANSKLADVNGYYCP-CTPGTDVLISPSSIHHQLETAYPCSTMAYSHLQTKNHIP 1020
++ PTA SKLAD+NG Y P + G DVL SP S+ + E A+PCSTM SH Q KN IP
Sbjct: 961 QDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP-SHRQVKN-IP 1020
Query: 1021 GSTSLFQPIPIAPRVLHSPIANAGHEIRMRS-EDRLKFNSLSVKNSDFSSKKQLAEEFVD 1080
GSTS+FQPIP+ PR I AGHE R+ EDRLKF +LSVK++D SKKQ E +D
Sbjct: 1021 GSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLLSKKQPVGELID 1080
Query: 1081 SRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVGNIPNMTQT 1140
SRKRQK LSLETNNSGVV EWT GK++D+ +SNPG+ KIH NWDKAVN N+PN+T+
Sbjct: 1081 SRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVNPTXNLPNVTE- 1140
Query: 1141 TDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAGLVH 1192
TDG+++ + NE+ +VE MARSGP+KLTAGAKHILKPSQSMD+DNTKPTYSTIPS+GLVH
Sbjct: 1141 TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKPTYSTIPSSGLVH 1144
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0KJS6 | 0.0e+00 | 99.83 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1 | [more] |
A0A5D3DCZ7 | 0.0e+00 | 94.47 | Putative Zinc finger, Rad18-type OS=Cucumis melo var. makuwa OX=1194695 GN=E5676... | [more] |
A0A1S3BM77 | 0.0e+00 | 94.47 | uncharacterized protein LOC103491377 OS=Cucumis melo OX=3656 GN=LOC103491377 PE=... | [more] |
A0A6J1D428 | 0.0e+00 | 64.42 | uncharacterized protein LOC111016842 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1D325 | 0.0e+00 | 63.85 | uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
XP_011657559.1 | 0.0e+00 | 99.83 | uncharacterized protein LOC105435872 [Cucumis sativus] >KGN47991.1 hypothetical ... | [more] |
XP_038888639.1 | 0.0e+00 | 81.00 | uncharacterized protein LOC120078436 [Benincasa hispida] | [more] |
XP_008449514.1 | 0.0e+00 | 94.47 | PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo] >KAA0061673.1 put... | [more] |
XP_022148072.1 | 0.0e+00 | 64.42 | uncharacterized protein LOC111016842 isoform X1 [Momordica charantia] | [more] |
XP_022148073.1 | 0.0e+00 | 63.85 | uncharacterized protein LOC111016842 isoform X2 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |