Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
TGTCGGTTTGTCGGTTTGTGTTCTTGTCTACCATAAATATGCCTTTGGAAAAAGGAAACACATCCTTCCATTGATTGGGATAATGAGAAGGGCAGACAGCTTTGTGACAGTGGGAATGACGTAATTTAATAGTTAACCAAGGACACGGATGTAATTTCACGTAAAAAATCAATCCTCACTTCGGTCTGTCCATTCTCTATTCCCGTGTCCAAACACCCCCAAATTCATCCCACCTTCTTCCTCAAACTTCCGTTTCCCTTAAACCGCTTCTTCAATCTCCCTCTTCACTGAGCGGGGAGAAGGAGATGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGACTCCGATGAGCTCACCCTCAAATCTATGCCCTCCAATTCAAAACCTACCAAAATCAAAGCCTCCGATGGCAAAGAAGAAGGCGAAGTATCTTCTTCCGACAACGATACACAAACACACGACGTACTTTCTTCTCATTCCCTTTTCTTTTTGTTTTTCTCTATAATCTGCATCCCATTCCTCTTCGTTTTCGTCTTCTCCGATGCTGAAAATGATTGTTCATCCCTTACCATTTTCTTGCTTCCTCTGTTTTTCTTGGAATTGAAGGCTTCCTATAGCTTGCTTCATACACTATTCTACTAATCAAAGTTCTAGGTCTCTGTTTATCTTTTAATTAATAGTTGTTTATGTTGTCGAGGTGAGAATAGAGGGACTGCTGTATTTGAAGTTCAAACGGTTTATCTTTATGCCGTAGGGTTGAAAGTTTTAGTTCTGACTTGTGAACTCTTGTGTTTTTAGTTATGATTATGTTTAAGAACACTTTGGACTTGTGAATTTGTTTTCATTCTTTATATTCGGATCGGTCAAGGATTAGCATTTTTTGTAGTATCGTTAATGGATTGGTTATCTTTTCTATTCGTGGTATAATTTTTGTATTTCTGTTTTATATGGCTCATGAGGGATTTTTACTTCTCAGGTACATCCTGTTTGTTCTACTGTGCCTGCTTCGATTGCGTCACGCATATCATCCATTCTTCCTCCTAAGAATAAATGTAATCCAGGGATCAAGACTGGTAAGTATTGCCTCTTATGAACGATTCTGGTTTTTTAAGGCATTGTCTATTAGTTTATTAACTCCATCCTTTCACTTTGAATTCTGCAACAGTATAAAAAGTTGTTTTGAAAGCTATTTTGGTTATGAAGCTTTTGGTATATCTGCAGTATAAAATACGTTGGATTTAGTGTATATGCATATTTTAAACTAAGGATATTGGACACCTACGGCTTTCTTTCGTGTGGCAGTTTACATGGATTTAGGTTTACTATGGAGAAATACTTTTCATCTTTACTAAAGTTGAAGTGTTCACACATTTCTTTTCTTTAAGGATCTAAAAGATGTGGATACTAGCTGGGGTTCTTTCTAGCTGTGATGTTTGCATTTTAGGTTTCAGTGCTTGAATATTTCCCATTTTTCTGGATGACGGGATTCTCAAATTATTGGTCTTATGAATGAATCTATCTGTGCACTAAGCTTTATGGTGGTTTTATATTAAAAGTACTTTATGATGACCAATATCTGCCACTGACACAGTGAGCTGTTTCTGGAAATAAATTTTCAGCTTCTGCTGATGTTTGCACAAGAACATCCATATCAACTATGTCTCAGAAGATACGTGATAATGCTCAAATTGTAAACAAAGCTAGTACTCCTTGGGTTGCTTCCAGAAAGGCCAATTCTAATCTTGTGATTAGCTTCTCCGATGACAGTGGCAGTGAATTGGAGGAATGTAGTAAAGTGAGAACTTCAAAATCTCATAGTGATGCTGTTAGGCATTTTAAGCCTCCAACTTCAATACTTGATAGATCAAACAAGTTACGGAGTATGACAAGGAACAAAGTTGTGGTAAACAAACTGCCTTTGAGTCAAGCGTTTATCCCTTCAATGACCAAGAATCACAAAGCATATTCCAAAGGTGCTGCTGGGCCCTCATTTGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGGAACCTACAGAGCCAAGGGCGTGGAAATGATCAAGGAATGAACGTAAACACTAGCAAGCTGCAGGATTTGCGGCAGCAGATAGCCATTCGTGAAAGCAAATTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGGGTTTTAGTTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAGGGGAATGCTACTATCTCGCAGTTTCCTCCACTAGGGCCTAAGGATCTAAATGCGAAGCGCATGAAAACCAGCGGGTCTTATTCTTCCAAGCTGAGTTTGAATGGGCAACAACTTCGTTCATTGATTGCTGCAAAATTTATTTGGCCTCAGGAGCCTGGAGAAGAGACACAGAACATTAAGGGCTCTTATAACCAGAAGGGGAAATCCTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAGCATGTGGCTGCTTCACCTTCACTTGGCATTGACCTTGGCAAAGTACAGGATGGTGAATAGTTTGTTCATACTGAATAGAAAATTAAAATTTTCAGTTTAGTTCAATCTTACTGTTTATGAGGTGCTGCCCTAACTTGTTATGTAAAATGCTGTGCAGATACTGACATTGTTGCTAATGGCAATCAATCAGACTTTATTGGCAATCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACAGCTCTACCAAATGTCGCATCCAATGTCCAATCTCAGTTTGTAAGTCAATAATTGTATATGCGGCATGATGTCTTTTCTCATTGATGTCTTTTTTTATCAAGTATCTGAAATTAACTATACTTCTCAGAAAATCTAGAAAAGTTGCTTCATTAAACCTTTGATGAATTTCCACTATCTTATGTTTGACGATATTGTTTGTGGTTTTTCCTAAAATCGTTAAGTTCATGGTGAAGTTAATTGAACCATATCTCATATCTGTATTATTAACAGTGGATGATCAGGATTTCTTGCATATTTAGGAAATAATAAAGTATTTGTATACATAAATGGATTTCTCTTCAAAATTATGTTACCGTTTATTATACACAGTTTTTCCTTCTCTCTCCTTGAAGCATAATTTGAATATTTGTCTCAACAGGATAATGTTGAGTTTCACCGTCAGAGTGATGGCCTTCAGCCATCTGCTTCAACTGCAAAATTTTTTGAAAGAACACCTCCTCAATCAGCATCCAATGTCAAGACACCAGAGCCATGCAGTAATTTCTTTAAGGTTTATCTTCAAGCTCAACTGTTCATTTTTGGTTTTTTGTTCTCGCTCTGTTTTTATAACATGGATTTCCTCTGGATTTATTATCATCGCAGTCATTGATCAACAGTAAAACTTCTGGGACTGCTTTTGGTAATCCATCAAGTTGCTTGGATTTTGGCAATTTTGATCTCCAATCATTGTTTGAAATTGAGGAGTCACTAGACAAGGATCTAGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTGTTGAACTTTATCACAAGAGAGAATTATTTTCAGTTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTCAGTTAGTTCTTCAAGACAGCAAGAAGACATGATAATTGACGTGGATCACTTAAATAGTATGTCTGGACATGCTAATATAGCTTCTCCTTTGTATCAGAAGCATTCTGAATATAACAGTTCTACTCGGCTACATAATGATTTAAATATGCAACTTGAAAATGCTGGTGCCATAAATACTTCCAATCTCCATGAGAATGGACAAAGTTTGGGGTCTGAACCTGGATCCTGCTCTGACCTAGGTGGTAATACATTGGATCCATTGCCTTTTAAGGGCAATAATATTGCAGATAGAATTTTCTCTCCTTCTGTTGATCCAAACGTTTCAATGGATGGAGATGAAGAGTCATTTCCATCTGACCATGAAATGATTGATTCCTATAATGAATGCTACATGAGAAAGAAACACTTTGAAAATGATCAAATGGAGGCATATAATACGTCGAAGAACAACCACTGCGACAATAATATTGAGGATTCTTTGCGCCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGGCTTGCAATCCATGCAATAACCTTCAAACTTCAGTCGAACAGGGGACTGAGAATGATGCTAGGGATGACATAACTCAGCAAAATAATACAGAACTTACAGTAGATCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGAATGAGAGTGCCTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGTAACTTTATCTTCAGGAATGAGACTTATGCTTCATCATATGTGATCTTGCAGGCATGTTTGAAGTGGCTTTTAAAGATACCTTTTCTACTGTTCCCATGTTATTCATGTTATTTTAGGTAAACATCTTTTACCTAAAATTTATCATTACCTCAGTCCTTTTCATGAAGGCCGGTTCTGGAGGTCTCTTTGAATTGAAAAATATGGCAATGTGTGGAAATATACTTTAGATTTAAATTGGGGCCATTTATTCTTTTGCTCATGTCCGAAGGAAACTTTTGTGAATTTAAATTCTAACGGCTGAGTCTAATTCTGAGTAAAAAATTAATAGAGATAAGAGCTTGTCAACTTGAAGTCTTTGAGTCTATTTTTTTTTCATATCGTTCTTTGAGTGTATTCCTAGCTCTTTAAAGCTTAGGGTGAAAGGGGCCTAAAAGTGCTAGTATTTTGTGGAATTGTGTTGTTAGGGCATTGTTGTGGAATATTTGGCTGGAAAGGAATTCGAGGACCACTTCTTGAGTATTGTTCTCTTCTTTTGATTTTGTTTTTGGAAAAAATTGCCTCTTCTCTCCCTTCCTACACCATCAGTTTAACTAAGAAGAATACATATCAATGAACATAATCTGACAGAGTAGGTAGTTTCTGGTTGGAAAATTATTCATAGCCCTCTTTGTCTTGGCACCATTGCCTTGCAAATACTTTTCCCACATTTCTAAGAGGGAATTTGCAAATGATTTTGGCCTTTGGTATCTACTCATGAAACCTCAGTGCATAATGCTTCAGTTTTTTTCTCTTAGCATCGAGCGTCATTTCAAACTGCTGATTGTGGATGCTCTGTTAACTTAAACCTCCATTACCTGCACATATTGCCATAAAATGAATCAAACCTTAAAGTTATGCATAGGGAATCATGGATTAAAGAGATAAGCCATAACTTGGTTGTTTTTGAGTTTATCAGTTCAAGATTTAGTCTCTTGCTGATTATGATATTGATCATTTTTTTCTCATTTCAAAATAATATAAATATTTTGAGCCTTTTTTACTCTTAAGCTGTGGTTCAGATGGCAGTTTTCTCAGGTTGGTTGAGTGTGAATTATGGTTAAAAATGATTTAGTGAAATACTGAGGATTTTGCTTAGAATATTTACTTCCTTATGCAGGCACAGACAGATGCAAAACCCCAGATGAGATCCATGGTCGTTATCATTTTGAAAACTTGCCATCAGAGGCTCCGGATTTAACAGACTCTGATGACAATGAACCATTCAGTAGAGAAGGATCTTGCTCCAAAACTACTAATAGTTTTACACCATTGACTATGAACAGTGTTCTGCAACACATGAAGGTCATATCCTCAGTTAGTATAGAAGTCTTACTCACTAGAACTCATGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGGAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGATTGCCAGACCCATGTTACAGAGTGATGGCTCTTATACTGACGATCTAGCAATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGTCCTTGGCAACACATGAAGGACTTCTCTTTTGCCAATAGAAGCCAGTGTCAGCATGGCCACATCAACTCTTCTGGTATGATGATTCTGATGATCTTTTAATTAGATGGTTCAAATTCTGTAGACTTTTCATCTGCATCTTTTTCAAATTCTTGCGATGGACTATCATTTTCTTCAGATGAAACAAAAGTCTTCAAGAATGAAGATCAAATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCCGATTCACGTTCATATGGCCATGTTTTAGCTCAGAGAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGTTATTTTTACATGATGCTCGTATAGAGGCCAAGGGAAGTTGGAATAGACCATCATCATACTTTCAGAGGGGAGGCTCTGTATTGGTTTGTTTATTACTCCTTATATATGCCCTTGGTTCAGTCTGCAGCTTAACATTTTTTTTCCTGTTCTTTTCCTCTTGACATGCGGCACACGATGGCTGTGTTGAACTAATTTGTATTTGAGTTCCTTTATCTCTCTCTTCCTTCTCTGTCATGCATGGCTGCATATCTTTTTATTGTTTTTGGATCAGAAATATCTATTTAACTCTCTTAGCATGCTGTGATCACAGTACTTGATCATTGCCACTTTCTTCAATTAAACTTCATTCTGAATGGAGTTTTGCCTAAATTATTATAATTCTTTTCTTGGTGTAATTTAAAACTGCGCCATCATATCTGAACTGTCTGATTTTGGGAAACAGGGTACTCTTCCCTTGCTCAAATCATGGCATTTAATTTGAGTATTTTAACCTGAGAAATATCGTATTTTCTTCCCCTCCTTTTTGCACACACACGCATGTATGTAGTCTGCACAGGAGTATTTCATTTCTCCTTTTTGCATATTGTTATTATATGAAGATGTTGCCTTTTATTTTTTATGCTTGTGCTTGATGTTACCTCTTTTTTTCCCCTTTTCTTTTTTACATGAAGATAAAATTATCCTTTTTTAGTTTTTGTGACGTTATGATTCTGGTCTTTGAAGGTTTCTATTACATATTACTGATAACTGATCCTTAACATCTTCCGGTTGTTTGGTCCATGATCTCACTTACACTGACTTTTCTTGCAACTGCTATAAAGTTTGCTGTCACTGATTCTGAAAAAAGAACATTAACCATCTTTCTGTTGTTTGATCCATGATCTCATTGTGGCAACTTCTTACAAAGTAGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAATCAGGAAATGAACAGTCGAGAGGGCATGAAAAAGGTACGTTTTTTTTGTTATTCATTCTTCTGTTGCTTCTTGCTTGTGTGAACCATGTTGCAATACAATTGGAAATATTTGTCTTGCAGGCTCTTCCTGTACTGTCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGGCTGTTTACCTTCTAATATTCTATAGCTATACTACAACTGGAGGGAAGGATGACATGTTCTCTTATGCGGTATGCATTATTATATTTGTTCCTGCTCTGCTGCGTAATATAATCTGGATGTAGAAAAGTACTGCAAAAACGTTTTTAAAGATGTTAATTAAATCTATGTATTAGAATATATATAGTAGAAAGATGAAACAAGATTTGCGATATCGAAGATTTTTGCTTTTATCAACATTTAGTAAACACTTTAGCATTTTGGTGATGCTGACTGTCAATCCAAGTTATTTTTTGGTTGTATCAACTTTATAAAGCGAGGACTAAAACATAATATGTTTTTTCATCTACTTGGTGATATACCACTACAATGTTGAAACTACAATAAAGTTTCTAGTATTTTGAAAACAGTCCATCTTTGAAGTAACATTGAAGTTACAAAAAAGCTGTCATTGCTTAACAAGGGAGAAAGAGAAACCAGATGTTTACTAGAATGTTAATGAAAAATCTCTAGAAAGCTTCTTCTTCTTCATCTGTTTTGTTTTGTTTTGTATTTTTTTAAAATAAAAAATAAAAAATAAAAATCTCTAAAAAGAAGAAATTAGAAGTATAAAAGATTTTAGACAATGGGCCAGAAGAGTTCATCTACTTTGCTAAGTGGTATGTTATGAACCAGAACTACTAGTGATCTCTGAGAACATTTTATTCCAGCTGATGTATGGTCTTTGGGTCCTGTCTAGTTCTTTTCCTTTTAAAAAACTTTCTCCTTGTACTCGTATCACCTTTATGATTCAGTGTTGGTCTTTGGTGGGCACTTCTGTCCATTTTTATTCTCACTCGCTTAATCAAACATTGTGTGGAAATCCTTTGTTCTTGTTGTTCAGGTCAAGCATAATGGGCAATCTTATGAACTTTGGCTCATGTACATTAACAGCCGCATGAATCTTGATGCTCGATTGGCTGCATATGATTCTGCAATTTCTGCACTTTGCCACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAATGTCGAGAAGGCTATTCAGAGGATTTTTGGACTTCTTCAAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCACATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTCTGTTGTGTATTTAGTTCTTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCTGTTCATTTGACAAATGGTGAGAAGCTAAGGGCTTCTAGGGTGGTCAAGAAAGCAGTAGATTTTGTCGATTCATGCTTGAACAATGAATCACTTGATAGTAAATGCTACCAAAAATCCATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGATATAGAATTCAGTAGGAACTTGTTGGATAAGTACGTTAAACTTTATCCATCTTGCCCAGAACTTCTTTTGCTTGATATACGGGCAAGGAAACATGATTTTGGGGATGCAACTGTTATGGCTTTTGAAAAAGTAATCAGGTACTGGCCAAAAGAAGTACCTGGAGTTCAGTGCATCTGGAATCAATATGCTGAATATTTACTTCGGAATGGGAGGATCAAATGTACTGAAGAACTAATGGCCCGTCGGTTTGACTCTACTTCAAAGATGGATTGTTCTAAAACTAGAACACCAGTTAATAGTGACTGCGACTCCTTGCACCTGCTAGATCATGCTTCAGGATCAATTGTACGTGCATTAGATTGCAGTCCCAATGAGGTGGATGTGGTGTTTTGGTATCTTAATCATTCAGTTCACAAATTACTACTGAATGATCAATTAGAAGCACGTTTGGCCTTCGAAAATGCTCTGAGGGCTGCTAGTTCTGAAACTTTTAGATATTGTATGAGAGAATATGCTATGTTTTTGCTTACCGACGAGTCTTTGTTGAATGAGGCTGCCTCTGTTGGTGGAATAAGAAGCATCTTAGAGGGTTATCTCAACGATGCTCGGGCTTTCCCAGTTCCCGAACCCTTGTCCAGAAGATTCATCAAGGATATCAGGAAGCCAAGAGTTCGACTTCTTGTCAGTAACATGCTGTCTCCAATTTCTCCGGATGTTTCACTAGTGAACTGTATTCTTGAAGTTTGGTATGGGCCATCTCTTTTACCTCAAAAATTTAACAAACCAAAGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATTTTGCCTTCTAATTACCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAACGACGACAACTACTCTTCCCAAGCTGCCTCTCCCAGCCTTATTTTCTGGGCCTGCTCGAATTTGATCATTGCAATTTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGCCAACGTTAAAGGTCTTGAAGCCATAACTGAGAGGTTTCACAAACGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATATTACAACATATGTAGAACTAGAGGAGATACAAGTGCTGTTCTGCAAGAAGTAAACGAAAGGGGAATCCAACTCAACGAGCCTTCGTGATAGAGTTTCACCTTGGTTTGGTAGGATCAGTAAAATTACAGGAAAACTAGGATCAGTTTTTTAGTTATGGGAGAAGCCTCAGTGTACAGATAGTTCACCTCATACCGTTTTATCATATACATGCTAGGCACGGTAGTTTCCCCAATTTTGTTAAGTGGTAGAGAGTTGATTGCGGCTAGGAATTAGATATTAATATAAATACTTAGCAGAAAGAAAAACTTGGTGGAAAAATAGCCTTTTGTTTCCCGTCTGTAAAGTCATCGAGACCATATCATCATCAGATTTCTCAGCCAGTTTCTTGCTCTGACCGTTGTTCGGCTATTTCGTTAAGGTGTGTCCTCGAATCTCAGGTTTTCCTCTGTCTTTTTTGGTAGAATTAATTTCTTTAATTGACCCTACCCATTTAAATGTAAGTACTAACCATTTTTAGGTTTAAATATGATATGGTATAAAATGAAATGCTAATACTCAATGTTCATCGTCATTGTTTTGGTTTTCCTACTATCTATGTTATTTTGTTTGTCAAACAGTGGTTTATTCTAGAATTTTGCTACAAAGCTTGCTTGGGTTGCTGACTTAAGTATTGCCCCTTCATCACAATTGTTGCTGAACTACTTTGTTGATGCAATGAGTTAAATATTTGTACAAGTTTTAAATGGTGATGAACATATGATTTACCTAGCTAGGTTGGAATATTGTTTTGAACTTTAAGAAAAGATTTACAATTGTGTTGGCTTGTATAATTTGAATTATTTTCTTAAAAAAGGGTGTTTTTGGAGACTAATTGATAAAACAAACTATTTATGGC
mRNA sequence
TGTCGGTTTGTCGGTTTGTGTTCTTGTCTACCATAAATATGCCTTTGGAAAAAGGAAACACATCCTTCCATTGATTGGGATAATGAGAAGGGCAGACAGCTTTGTGACAGTGGGAATGACGTAATTTAATAGTTAACCAAGGACACGGATGTAATTTCACGTAAAAAATCAATCCTCACTTCGGTCTGTCCATTCTCTATTCCCGTGTCCAAACACCCCCAAATTCATCCCACCTTCTTCCTCAAACTTCCGTTTCCCTTAAACCGCTTCTTCAATCTCCCTCTTCACTGAGCGGGGAGAAGGAGATGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGACTCCGATGAGCTCACCCTCAAATCTATGCCCTCCAATTCAAAACCTACCAAAATCAAAGCCTCCGATGGCAAAGAAGAAGGCGAAGTATCTTCTTCCGACAACGATACACAAACACACGACGTACATCCTGTTTGTTCTACTGTGCCTGCTTCGATTGCGTCACGCATATCATCCATTCTTCCTCCTAAGAATAAATGTAATCCAGGGATCAAGACTGCTTCTGCTGATGTTTGCACAAGAACATCCATATCAACTATGTCTCAGAAGATACGTGATAATGCTCAAATTGTAAACAAAGCTAGTACTCCTTGGGTTGCTTCCAGAAAGGCCAATTCTAATCTTGTGATTAGCTTCTCCGATGACAGTGGCAGTGAATTGGAGGAATGTAGTAAAGTGAGAACTTCAAAATCTCATAGTGATGCTGTTAGGCATTTTAAGCCTCCAACTTCAATACTTGATAGATCAAACAAGTTACGGAGTATGACAAGGAACAAAGTTGTGGTAAACAAACTGCCTTTGAGTCAAGCGTTTATCCCTTCAATGACCAAGAATCACAAAGCATATTCCAAAGGTGCTGCTGGGCCCTCATTTGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGGAACCTACAGAGCCAAGGGCGTGGAAATGATCAAGGAATGAACGTAAACACTAGCAAGCTGCAGGATTTGCGGCAGCAGATAGCCATTCGTGAAAGCAAATTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGGGTTTTAGTTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAGGGGAATGCTACTATCTCGCAGTTTCCTCCACTAGGGCCTAAGGATCTAAATGCGAAGCGCATGAAAACCAGCGGGTCTTATTCTTCCAAGCTGAGTTTGAATGGGCAACAACTTCGTTCATTGATTGCTGCAAAATTTATTTGGCCTCAGGAGCCTGGAGAAGAGACACAGAACATTAAGGGCTCTTATAACCAGAAGGGGAAATCCTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAGCATGTGGCTGCTTCACCTTCACTTGGCATTGACCTTGGCAAAGTACAGGATGATACTGACATTGTTGCTAATGGCAATCAATCAGACTTTATTGGCAATCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACAGCTCTACCAAATGTCGCATCCAATGTCCAATCTCAGTTTGATAATGTTGAGTTTCACCGTCAGAGTGATGGCCTTCAGCCATCTGCTTCAACTGCAAAATTTTTTGAAAGAACACCTCCTCAATCAGCATCCAATGTCAAGACACCAGAGCCATGCAGTAATTTCTTTAAGTCATTGATCAACAGTAAAACTTCTGGGACTGCTTTTGGTAATCCATCAAGTTGCTTGGATTTTGGCAATTTTGATCTCCAATCATTGTTTGAAATTGAGGAGTCACTAGACAAGGATCTAGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTGTTGAACTTTATCACAAGAGAGAATTATTTTCAGTTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTCAGTTAGTTCTTCAAGACAGCAAGAAGACATGATAATTGACGTGGATCACTTAAATAGTATGTCTGGACATGCTAATATAGCTTCTCCTTTGTATCAGAAGCATTCTGAATATAACAGTTCTACTCGGCTACATAATGATTTAAATATGCAACTTGAAAATGCTGGTGCCATAAATACTTCCAATCTCCATGAGAATGGACAAAGTTTGGGGTCTGAACCTGGATCCTGCTCTGACCTAGGTGGTAATACATTGGATCCATTGCCTTTTAAGGGCAATAATATTGCAGATAGAATTTTCTCTCCTTCTGTTGATCCAAACGTTTCAATGGATGGAGATGAAGAGTCATTTCCATCTGACCATGAAATGATTGATTCCTATAATGAATGCTACATGAGAAAGAAACACTTTGAAAATGATCAAATGGAGGCATATAATACGTCGAAGAACAACCACTGCGACAATAATATTGAGGATTCTTTGCGCCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGGCTTGCAATCCATGCAATAACCTTCAAACTTCAGTCGAACAGGGGACTGAGAATGATGCTAGGGATGACATAACTCAGCAAAATAATACAGAACTTACAGTAGATCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGAATGAGAGTGCCTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGCACAGACAGATGCAAAACCCCAGATGAGATCCATGGTCGTTATCATTTTGAAAACTTGCCATCAGAGGCTCCGGATTTAACAGACTCTGATGACAATGAACCATTCAGTAGAGAAGGATCTTGCTCCAAAACTACTAATAGTTTTACACCATTGACTATGAACAGTGTTCTGCAACACATGAAGGTCATATCCTCAGTTAGTATAGAAGTCTTACTCACTAGAACTCATGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGGAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGATTGCCAGACCCATGTTACAGAGTGATGGCTCTTATACTGACGATCTAGCAATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGTCCTTGGCAACACATGAAGGACTTCTCTTTTGCCAATAGAAGCCAGTGTCAGCATGGCCACATCAACTCTTCTGATGAAACAAAAGTCTTCAAGAATGAAGATCAAATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCCGATTCACGTTCATATGGCCATGTTTTAGCTCAGAGAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGTTATTTTTACATGATGCTCGTATAGAGGCCAAGGGAAGTTGGAATAGACCATCATCATACTTTCAGAGGGGAGGCTCTGTATTGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAATCAGGAAATGAACAGTCGAGAGGGCATGAAAAAGGCTCTTCCTGTACTGTCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGGCTGTTTACCTTCTAATATTCTATAGCTATACTACAACTGGAGGGAAGGATGACATGTTCTCTTATGCGGTCAAGCATAATGGGCAATCTTATGAACTTTGGCTCATGTACATTAACAGCCGCATGAATCTTGATGCTCGATTGGCTGCATATGATTCTGCAATTTCTGCACTTTGCCACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAATGTCGAGAAGGCTATTCAGAGGATTTTTGGACTTCTTCAAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCACATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTCTGTTGTGTATTTAGTTCTTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCTGTTCATTTGACAAATGGTGAGAAGCTAAGGGCTTCTAGGGTGGTCAAGAAAGCAGTAGATTTTGTCGATTCATGCTTGAACAATGAATCACTTGATAGTAAATGCTACCAAAAATCCATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGATATAGAATTCAGTAGGAACTTGTTGGATAAGTACGTTAAACTTTATCCATCTTGCCCAGAACTTCTTTTGCTTGATATACGGGCAAGGAAACATGATTTTGGGGATGCAACTGTTATGGCTTTTGAAAAAGTAATCAGGTACTGGCCAAAAGAAGTACCTGGAGTTCAGTGCATCTGGAATCAATATGCTGAATATTTACTTCGGAATGGGAGGATCAAATGTACTGAAGAACTAATGGCCCGTCGGTTTGACTCTACTTCAAAGATGGATTGTTCTAAAACTAGAACACCAGTTAATAGTGACTGCGACTCCTTGCACCTGCTAGATCATGCTTCAGGATCAATTGTACGTGCATTAGATTGCAGTCCCAATGAGGTGGATGTGGTGTTTTGGTATCTTAATCATTCAGTTCACAAATTACTACTGAATGATCAATTAGAAGCACGTTTGGCCTTCGAAAATGCTCTGAGGGCTGCTAGTTCTGAAACTTTTAGATATTGTATGAGAGAATATGCTATGTTTTTGCTTACCGACGAGTCTTTGTTGAATGAGGCTGCCTCTGTTGGTGGAATAAGAAGCATCTTAGAGGGTTATCTCAACGATGCTCGGGCTTTCCCAGTTCCCGAACCCTTGTCCAGAAGATTCATCAAGGATATCAGGAAGCCAAGAGTTCGACTTCTTGTCAGTAACATGCTGTCTCCAATTTCTCCGGATGTTTCACTAGTGAACTGTATTCTTGAAGTTTGGTATGGGCCATCTCTTTTACCTCAAAAATTTAACAAACCAAAGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATTTTGCCTTCTAATTACCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAACGACGACAACTACTCTTCCCAAGCTGCCTCTCCCAGCCTTATTTTCTGGGCCTGCTCGAATTTGATCATTGCAATTTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGCCAACGTTAAAGGTCTTGAAGCCATAACTGAGAGGTTTCACAAACGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATATTACAACATATGTAGAACTAGAGGAGATACAAGTGCTGTTCTGCAAGAAGTAAACGAAAGGGGAATCCAACTCAACGAGCCTTCGTGATAGAGTTTCACCTTGGTTTGGTAGGATCAGTAAAATTACAGGAAAACTAGGATCAGTTTTTTAGTTATGGGAGAAGCCTCAGTGTACAGATAGTTCACCTCATACCGTTTTATCATATACATGCTAGGCACGGTAGTTTCCCCAATTTTGTTAAGTGGTAGAGAGTTGATTGCGGCTAGGAATTAGATATTAATATAAATACTTAGCAGAAAGAAAAACTTGGTGGAAAAATAGCCTTTTGTTTCCCGTCTGTAAAGTCATCGAGACCATATCATCATCAGATTTCTCAGCCAGTTTCTTGCTCTGACCGTTGTTCGGCTATTTCGTTAAGGTGTGTCCTCGAATCTCAGGTTTTCCTCTGTCTTTTTTGGTAGAATTAATTTCTTTAATTGACCCTACCCATTTAAATGTAAGTACTAACCATTTTTAGGTTTAAATATGATATGGTATAAAATGAAATGCTAATACTCAATGTTCATCGTCATTGTTTTGGTTTTCCTACTATCTATGTTATTTTGTTTGTCAAACAGTGGTTTATTCTAGAATTTTGCTACAAAGCTTGCTTGGGTTGCTGACTTAAGTATTGCCCCTTCATCACAATTGTTGCTGAACTACTTTGTTGATGCAATGAGTTAAATATTTGTACAAGTTTTAAATGGTGATGAACATATGATTTACCTAGCTAGGTTGGAATATTGTTTTGAACTTTAAGAAAAGATTTACAATTGTGTTGGCTTGTATAATTTGAATTATTTTCTTAAAAAAGGGTGTTTTTGGAGACTAATTGATAAAACAAACTATTTATGGC
Coding sequence (CDS)
ATGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGAGAAGGACTCCGATGAGCTCACCCTCAAATCTATGCCCTCCAATTCAAAACCTACCAAAATCAAAGCCTCCGATGGCAAAGAAGAAGGCGAAGTATCTTCTTCCGACAACGATACACAAACACACGACGTACATCCTGTTTGTTCTACTGTGCCTGCTTCGATTGCGTCACGCATATCATCCATTCTTCCTCCTAAGAATAAATGTAATCCAGGGATCAAGACTGCTTCTGCTGATGTTTGCACAAGAACATCCATATCAACTATGTCTCAGAAGATACGTGATAATGCTCAAATTGTAAACAAAGCTAGTACTCCTTGGGTTGCTTCCAGAAAGGCCAATTCTAATCTTGTGATTAGCTTCTCCGATGACAGTGGCAGTGAATTGGAGGAATGTAGTAAAGTGAGAACTTCAAAATCTCATAGTGATGCTGTTAGGCATTTTAAGCCTCCAACTTCAATACTTGATAGATCAAACAAGTTACGGAGTATGACAAGGAACAAAGTTGTGGTAAACAAACTGCCTTTGAGTCAAGCGTTTATCCCTTCAATGACCAAGAATCACAAAGCATATTCCAAAGGTGCTGCTGGGCCCTCATTTGCTGAACAAGGATCTAAAATCAGAGCTTTCAGTGGGAACCTACAGAGCCAAGGGCGTGGAAATGATCAAGGAATGAACGTAAACACTAGCAAGCTGCAGGATTTGCGGCAGCAGATAGCCATTCGTGAAAGCAAATTGAAGCTCAAGTCTGCCCAACAGAACAAGGAGAGGGTTTTAGTTACAAACCAGGATTATATTGTCACAAATTCAAAATCTGATTTGGGTAGGAAGGGGAATGCTACTATCTCGCAGTTTCCTCCACTAGGGCCTAAGGATCTAAATGCGAAGCGCATGAAAACCAGCGGGTCTTATTCTTCCAAGCTGAGTTTGAATGGGCAACAACTTCGTTCATTGATTGCTGCAAAATTTATTTGGCCTCAGGAGCCTGGAGAAGAGACACAGAACATTAAGGGCTCTTATAACCAGAAGGGGAAATCCTTGAGTAGAGAAGAGTCCAGTGTGTTGAAGCAGAGTAAGGAAGATATCAAGCATGTGGCTGCTTCACCTTCACTTGGCATTGACCTTGGCAAAGTACAGGATGATACTGACATTGTTGCTAATGGCAATCAATCAGACTTTATTGGCAATCAGGTGGATCCTCACCCTCTTGTTGTCTTAGATCAAGCTACAGCTCTACCAAATGTCGCATCCAATGTCCAATCTCAGTTTGATAATGTTGAGTTTCACCGTCAGAGTGATGGCCTTCAGCCATCTGCTTCAACTGCAAAATTTTTTGAAAGAACACCTCCTCAATCAGCATCCAATGTCAAGACACCAGAGCCATGCAGTAATTTCTTTAAGTCATTGATCAACAGTAAAACTTCTGGGACTGCTTTTGGTAATCCATCAAGTTGCTTGGATTTTGGCAATTTTGATCTCCAATCATTGTTTGAAATTGAGGAGTCACTAGACAAGGATCTAGAAGAAGCACAAGATTGCAGGCGCCAATGTGAAATTGAAGAAAGAAATGCTTTCAAAATTTATTCTAGAGCTCAAAGGGCTTTGATTGAGGCTAATTCTAGATGTGTTGAACTTTATCACAAGAGAGAATTATTTTCAGTTCATTTTCACTCTTTTTGCATGAATAATCCTGGTTCAGTTAGTTCTTCAAGACAGCAAGAAGACATGATAATTGACGTGGATCACTTAAATAGTATGTCTGGACATGCTAATATAGCTTCTCCTTTGTATCAGAAGCATTCTGAATATAACAGTTCTACTCGGCTACATAATGATTTAAATATGCAACTTGAAAATGCTGGTGCCATAAATACTTCCAATCTCCATGAGAATGGACAAAGTTTGGGGTCTGAACCTGGATCCTGCTCTGACCTAGGTGGTAATACATTGGATCCATTGCCTTTTAAGGGCAATAATATTGCAGATAGAATTTTCTCTCCTTCTGTTGATCCAAACGTTTCAATGGATGGAGATGAAGAGTCATTTCCATCTGACCATGAAATGATTGATTCCTATAATGAATGCTACATGAGAAAGAAACACTTTGAAAATGATCAAATGGAGGCATATAATACGTCGAAGAACAACCACTGCGACAATAATATTGAGGATTCTTTGCGCCTTGAAGCAAAATTAAGGTCTGAACTATTTGCACGTCTAGGAACAAGAAATTTGTCAAAGGCTTGCAATCCATGCAATAACCTTCAAACTTCAGTCGAACAGGGGACTGAGAATGATGCTAGGGATGACATAACTCAGCAAAATAATACAGAACTTACAGTAGATCTAGCAGTTGGAAGTGACGTCGACCTCATAAGTAAGAAGAATGAGAGTGCCTTACTATCAGGAAAGGGAGATCAACAGTTTGGTTTTGGAGGCACAGACAGATGCAAAACCCCAGATGAGATCCATGGTCGTTATCATTTTGAAAACTTGCCATCAGAGGCTCCGGATTTAACAGACTCTGATGACAATGAACCATTCAGTAGAGAAGGATCTTGCTCCAAAACTACTAATAGTTTTACACCATTGACTATGAACAGTGTTCTGCAACACATGAAGGTCATATCCTCAGTTAGTATAGAAGTCTTACTCACTAGAACTCATGGGAGTCTCTCTAATCTTGGTTTCCCTGAAGACGGTGATTCTTTGGAAGTGGATCAAATCCACTGGAGAAAATTAAAAGAGAACTCTGTCCATGAGATTGCCAGACCCATGTTACAGAGTGATGGCTCTTATACTGACGATCTAGCAATTGATCCATCGTGGCCACTTTGCATGTATGAACTCCGTGGAAAATGCAACAATGATGAATGTCCTTGGCAACACATGAAGGACTTCTCTTTTGCCAATAGAAGCCAGTGTCAGCATGGCCACATCAACTCTTCTGATGAAACAAAAGTCTTCAAGAATGAAGATCAAATGACTCCTCCAACTTACCTGGTTGGCATAGATATTCTAAAAGCCGATTCACGTTCATATGGCCATGTTTTAGCTCAGAGAAGTAGTCAATGCTGGCAAAGCTTTTTTAGTATTTCTTTGACGTTACCAAATTTGCTCCAAAAGGATGCTTCTGCTGATGGGTTATTTTTACATGATGCTCGTATAGAGGCCAAGGGAAGTTGGAATAGACCATCATCATACTTTCAGAGGGGAGGCTCTGTATTGAGTCAGCTGAAACAGGGTGATGAGAACCTAGCTCTGGAAACAGCTCTAATTATTATTAATCAGGAAATGAACAGTCGAGAGGGCATGAAAAAGGCTCTTCCTGTACTGTCACGTGCTGTAGAGAACAATCCAAAATCTATAGCTCTCTGGGCTGTTTACCTTCTAATATTCTATAGCTATACTACAACTGGAGGGAAGGATGACATGTTCTCTTATGCGGTCAAGCATAATGGGCAATCTTATGAACTTTGGCTCATGTACATTAACAGCCGCATGAATCTTGATGCTCGATTGGCTGCATATGATTCTGCAATTTCTGCACTTTGCCACAATATATTTACTCATAACTTGGATGGGAAATATGCTAGTGCCCATATCTTGGACCTGATTTTACAGATGACAAATTGTTTGTGTATGTCTGGGAATGTCGAGAAGGCTATTCAGAGGATTTTTGGACTTCTTCAAGTTGCTATGGATTCTGATGAGCCTTATTCTTTTACGCATTCTGATATGCTCACATGCTTAAATATATCTGACAAATGTATTTTCTGGGTTTCTGTTGTGTATTTAGTTCTTTACAGGAAACTGCCTCATGCTATAGTGCAGCAGCTTGAATGTGAGAAAGAACTGATCGAGATTGAATGGCCTGCTGTTCATTTGACAAATGGTGAGAAGCTAAGGGCTTCTAGGGTGGTCAAGAAAGCAGTAGATTTTGTCGATTCATGCTTGAACAATGAATCACTTGATAGTAAATGCTACCAAAAATCCATTCAAATGTTTGCTGTCAATCATATAAGGTGCTTGATGGCATTTGAGGATATAGAATTCAGTAGGAACTTGTTGGATAAGTACGTTAAACTTTATCCATCTTGCCCAGAACTTCTTTTGCTTGATATACGGGCAAGGAAACATGATTTTGGGGATGCAACTGTTATGGCTTTTGAAAAAGTAATCAGGTACTGGCCAAAAGAAGTACCTGGAGTTCAGTGCATCTGGAATCAATATGCTGAATATTTACTTCGGAATGGGAGGATCAAATGTACTGAAGAACTAATGGCCCGTCGGTTTGACTCTACTTCAAAGATGGATTGTTCTAAAACTAGAACACCAGTTAATAGTGACTGCGACTCCTTGCACCTGCTAGATCATGCTTCAGGATCAATTGTACGTGCATTAGATTGCAGTCCCAATGAGGTGGATGTGGTGTTTTGGTATCTTAATCATTCAGTTCACAAATTACTACTGAATGATCAATTAGAAGCACGTTTGGCCTTCGAAAATGCTCTGAGGGCTGCTAGTTCTGAAACTTTTAGATATTGTATGAGAGAATATGCTATGTTTTTGCTTACCGACGAGTCTTTGTTGAATGAGGCTGCCTCTGTTGGTGGAATAAGAAGCATCTTAGAGGGTTATCTCAACGATGCTCGGGCTTTCCCAGTTCCCGAACCCTTGTCCAGAAGATTCATCAAGGATATCAGGAAGCCAAGAGTTCGACTTCTTGTCAGTAACATGCTGTCTCCAATTTCTCCGGATGTTTCACTAGTGAACTGTATTCTTGAAGTTTGGTATGGGCCATCTCTTTTACCTCAAAAATTTAACAAACCAAAGGAATTGGTGGATTTCGTGGAAACTATCTTAGAGATTTTGCCTTCTAATTACCAGTTGGTACTTTCTGTCTGTAAGCAATTATGCAACGACGACAACTACTCTTCCCAAGCTGCCTCTCCCAGCCTTATTTTCTGGGCCTGCTCGAATTTGATCATTGCAATTTTTAGTTCTGTCCCAATACCACCAGAGTTCATTTGGGTAGAAGCTGCTAATATTCTGGCCAACGTTAAAGGTCTTGAAGCCATAACTGAGAGGTTTCACAAACGAGCTTTATCTGTTTACCCGTTCTCTGTTCAGCTGTGGAAATCATATTACAACATATGTAGAACTAGAGGAGATACAAGTGCTGTTCTGCAAGAAGTAAACGAAAGGGGAATCCAACTCAACGAGCCTTCGTGA
Protein sequence
MEKEKEKEKEKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEVNERGIQLNEPS*
Homology
BLAST of CsGy5G017110 vs. ExPASy Swiss-Prot
Match:
O60293 (Zinc finger C3H1 domain-containing protein OS=Homo sapiens OX=9606 GN=ZFC3H1 PE=1 SV=3)
HSP 1 Score: 69.7 bits (169), Expect = 3.7e-10
Identity = 109/513 (21.25%), Postives = 189/513 (36.84%), Query Frame = 0
Query: 954 IDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRSQCQH---------GHINSSDETKVF 1013
I+P C ++L G CN+D+C WQH++D++ + + Q G +S ++
Sbjct: 1179 IEPDQCFCRFDLTGTCNDDDCQWQHIQDYTLSRKQLFQDILSYNLSLIGCAETSTNEEIT 1238
Query: 1014 KNEDQMTPPTYLVGIDILKAD----------SRSYGHV---LAQRSSQCWQSFFSISLTL 1073
+ ++ + V D + D + S GH + + W+ F
Sbjct: 1239 ASAEKYVEKLFGVNKDRMSMDQMAVLLVSNINESKGHTPPFTTYKDKRKWKPKFWRKPIS 1298
Query: 1074 PNLLQKDASADGLFLHDARIEAKGSWNRPS----------SYFQRGGSVLSQLK------ 1133
N D + A + + N P+ YF ++ L+
Sbjct: 1299 DNSFSSDEEQSTGPIKYA-FQPENQINVPALDTVVTPDDVRYFTNETDDIANLEASVLEN 1358
Query: 1134 QGDENLALETALIIINQ-EMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFYSYTTT 1193
L L+ A +NQ E E + AL VL+RA+ENN + +W YL +F T
Sbjct: 1359 PSHVQLWLKLAYKYLNQNEGECSESLDSALNVLARALENNKDNPEIWCHYLRLFSKRGTK 1418
Query: 1194 GGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIF--THNLDGKY 1253
+M AV++ W ++L++ D + +
Sbjct: 1419 DEVQEMCETAVEYAPDYQSFWTF-----LHLESTFEEKDYVCERMLEFLMGAAKQETSNI 1478
Query: 1254 ASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNISDKC 1313
S +L+ +L +G + A+ + L+ A D + L SD+C
Sbjct: 1479 LSFQLLEALLFRVQLHIFTGRCQSALAILQNALKSAND---------GIVAEYLKTSDRC 1538
Query: 1314 IFWVSVVYLVLYRKLPHAIVQQLE------CEKELIEIEWPAVH--LTNGEKLRASRVVK 1373
+ W++ ++L+ + LP E + W AV TN + L A V +
Sbjct: 1539 LAWLAYIHLIEFNILPSKFYDPSNDNPSRIVNTESFVMPWQAVQDVKTNPDMLLA--VFE 1598
Query: 1374 KAVD-FVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFE-DIEFSRNLLDKYVKLYPSC 1408
AV D L E C M A++ + L +E +E ++LL+ SC
Sbjct: 1599 DAVKACTDESLAVEERIEACLPLYTNMIALHQL--LERYEAAMELCKSLLE-------SC 1658
BLAST of CsGy5G017110 vs. NCBI nr
Match:
XP_011655356.2 (uncharacterized protein LOC101211906 [Cucumis sativus] >KGN51732.2 hypothetical protein Csa_009223 [Cucumis sativus])
HSP 1 Score: 3464 bits (8981), Expect = 0.0
Identity = 1759/1759 (100.00%), Postives = 1759/1759 (100.00%), Query Frame = 0
Query: 1 MEKEKEKEKEKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVH 60
MEKEKEKEKEKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVH
Sbjct: 1 MEKEKEKEKEKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVH 60
Query: 61 PVCSTVPASIASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKAST 120
PVCSTVPASIASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKAST
Sbjct: 61 PVCSTVPASIASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKAST 120
Query: 121 PWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMT 180
PWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMT
Sbjct: 121 PWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMT 180
Query: 181 RNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGM 240
RNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGM
Sbjct: 181 RNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGM 240
Query: 241 NVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQF 300
NVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQF
Sbjct: 241 NVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQF 300
Query: 301 PPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGK 360
PPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGK
Sbjct: 301 PPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGK 360
Query: 361 SLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVV 420
SLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVV
Sbjct: 361 SLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVV 420
Query: 421 LDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNF 480
LDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNF
Sbjct: 421 LDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNF 480
Query: 481 FKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAF 540
FKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAF
Sbjct: 481 FKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAF 540
Query: 541 KIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNS 600
KIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNS
Sbjct: 541 KIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNS 600
Query: 601 MSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDL 660
MSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDL
Sbjct: 601 MSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDL 660
Query: 661 GGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFEND 720
GGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFEND
Sbjct: 661 GGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFEND 720
Query: 721 QMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTEN 780
QMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTEN
Sbjct: 721 QMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTEN 780
Query: 781 DARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHG 840
DARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHG
Sbjct: 781 DARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHG 840
Query: 841 RYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLL 900
RYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLL
Sbjct: 841 RYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLL 900
Query: 901 TRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPL 960
TRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPL
Sbjct: 901 TRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPL 960
Query: 961 CMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGID 1020
CMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGID
Sbjct: 961 CMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGID 1020
Query: 1021 ILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRP 1080
ILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRP
Sbjct: 1021 ILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRP 1080
Query: 1081 SSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIAL 1140
SSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIAL
Sbjct: 1081 SSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIAL 1140
Query: 1141 WAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALC 1200
WAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALC
Sbjct: 1141 WAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALC 1200
Query: 1201 HNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHS 1260
HNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHS
Sbjct: 1201 HNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHS 1260
Query: 1261 DMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRAS 1320
DMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRAS
Sbjct: 1261 DMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRAS 1320
Query: 1321 RVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYP 1380
RVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYP
Sbjct: 1321 RVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYP 1380
Query: 1381 SCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTE 1440
SCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTE
Sbjct: 1381 SCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTE 1440
Query: 1441 ELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHS 1500
ELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHS
Sbjct: 1441 ELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHS 1500
Query: 1501 VHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILE 1560
VHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILE
Sbjct: 1501 VHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILE 1560
Query: 1561 GYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLP 1620
GYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLP
Sbjct: 1561 GYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLP 1620
Query: 1621 QKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIA 1680
QKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIA
Sbjct: 1621 QKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIA 1680
Query: 1681 IFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGD 1740
IFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGD
Sbjct: 1681 IFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGD 1740
Query: 1741 TSAVLQEVNERGIQLNEPS 1759
TSAVLQEVNERGIQLNEPS
Sbjct: 1741 TSAVLQEVNERGIQLNEPS 1759
BLAST of CsGy5G017110 vs. NCBI nr
Match:
XP_008463476.1 (PREDICTED: uncharacterized protein LOC103501638 [Cucumis melo])
HSP 1 Score: 3190 bits (8272), Expect = 0.0
Identity = 1625/1751 (92.80%), Postives = 1673/1751 (95.55%), Query Frame = 0
Query: 10 EKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPAS 69
EKEKEKEK+SDELT+KS PSNSKP+KIKASD KEEGE+SSSDNDTQTHDV PVCSTVPAS
Sbjct: 2 EKEKEKEKNSDELTVKSTPSNSKPSKIKASDTKEEGELSSSDNDTQTHDVRPVCSTVPAS 61
Query: 70 IASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKAN 129
IAS ISS LPPK+KCNPGI+TASAD+C RTSISTMSQKIRDNAQIVNKASTPW ASRKAN
Sbjct: 62 IASPISSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKASTPWGASRKAN 121
Query: 130 SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKL 189
SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSNKLRSMTRNKV+ NKL
Sbjct: 122 SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRSMTRNKVMANKL 181
Query: 190 PLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQD 249
PLSQ FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMN+NTSKLQD
Sbjct: 182 PLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQD 241
Query: 250 LRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLN 309
LRQQIAIRESKLKLKSAQQNKE +LVTNQDYIVTNSK DLGRKGN TISQFPPLGPK+ N
Sbjct: 242 LRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTISQFPPLGPKEPN 301
Query: 310 AKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSV 369
KRMKTSGSYSSKLSLN QQL SLIAAKF+WPQEPGEE QNIKGSYNQKGKSLSREE+SV
Sbjct: 302 VKRMKTSGSYSSKLSLNEQQLHSLIAAKFVWPQEPGEEIQNIKGSYNQKGKSLSREEASV 361
Query: 370 LKQSKEDIKHVAASPSLGIDLGKVQDD-TDIVANGNQSDFIGNQVDPHPLVVLDQATALP 429
LKQSKEDIKHVAASPSLGIDLGKVQDD TDIVANGN SD IG QVDPHPLVVLDQATALP
Sbjct: 362 LKQSKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDPHPLVVLDQATALP 421
Query: 430 NVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSK 489
NVASNVQSQFDNVEF RQSDGLQPSASTAK FE TPPQSA NVK PEPCSNFFKSLIN K
Sbjct: 422 NVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPEPCSNFFKSLINCK 481
Query: 490 TSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQR 549
+SGTAFGN SSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQR
Sbjct: 482 SSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQR 541
Query: 550 ALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIA 609
ALIEANSRC++LY+KRELFS HFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSG+ANI
Sbjct: 542 ALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGNANIT 601
Query: 610 SPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPL 669
SPLYQKHSEYNSSTRL NDLNMQ ENAG INTSNLHENGQ+LGSEPGSCSDLGGNT+DPL
Sbjct: 602 SPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSDLGGNTVDPL 661
Query: 670 PFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTS 729
PFKGNNIADRI SPSVDPN+S+DGDEES PSDHEMIDSYNECY+RKKHFE+DQMEAYN
Sbjct: 662 PFKGNNIADRICSPSVDPNISLDGDEESLPSDHEMIDSYNECYVRKKHFEDDQMEAYNML 721
Query: 730 KNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQ 789
K NHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNN+QTSVEQGTENDAR+D TQ
Sbjct: 722 KKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVEQGTENDARNDRTQ 781
Query: 790 QNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLP 849
QNNTELTV LAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHG YHFENLP
Sbjct: 782 QNNTELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGPYHFENLP 841
Query: 850 SEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLS 909
SE PDLTDSDDNEPFSREGSCSKTT SFTPLTMNSVLQHMKVISSVSIEVLLTRT
Sbjct: 842 SETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVSIEVLLTRT----L 901
Query: 910 NLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGK 969
NLGFPEDGDSLEVD+IHWRK ENSV EI RPMLQSDGSYTDDLAIDPSWPLCMYELRGK
Sbjct: 902 NLGFPEDGDSLEVDRIHWRKFIENSVLEIVRPMLQSDGSYTDDLAIDPSWPLCMYELRGK 961
Query: 970 CNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRS 1029
CNNDECPWQHMKDFSFANR QCQHGHINSSDETKVFK ED+MTPPTYLVGIDILKADSRS
Sbjct: 962 CNNDECPWQHMKDFSFANRRQCQHGHINSSDETKVFKYEDRMTPPTYLVGIDILKADSRS 1021
Query: 1030 YGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGG 1089
YG VLAQRSSQCWQ+FFSISLTLPNLL+KDASADGLFLHDARIEAKGSWNRPSSYFQRGG
Sbjct: 1022 YGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLFLHDARIEAKGSWNRPSSYFQRGG 1081
Query: 1090 SVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIF 1149
SVLSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKSIALWAVYLLIF
Sbjct: 1082 SVLSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWAVYLLIF 1141
Query: 1150 YSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNL 1209
YSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALC NIF+HNL
Sbjct: 1142 YSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCDNIFSHNL 1201
Query: 1210 DGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNI 1269
DGK ASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF HSDMLTCLNI
Sbjct: 1202 DGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFMHSDMLTCLNI 1261
Query: 1270 SDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVD 1329
SDKCIFWV VVYLVLYRKLPHAIVQQLECEKELIEIEWPAV LTNGEKLRASRVVKK VD
Sbjct: 1262 SDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVQLTNGEKLRASRVVKKVVD 1321
Query: 1330 FVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLL 1389
F DSCLNNES +SKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPEL+LL
Sbjct: 1322 FADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELILL 1381
Query: 1390 DIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFD 1449
DIRARKHDFGDATV+AFE+ IRYWPKEVPG+QCIWNQYAEYLLRNGRIKCTEELMAR F+
Sbjct: 1382 DIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQYAEYLLRNGRIKCTEELMARWFN 1441
Query: 1450 STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLND 1509
STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSP+EVDVVFWYLNHSVHKLL+ND
Sbjct: 1442 STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPSEVDVVFWYLNHSVHKLLVND 1501
Query: 1510 QLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARA 1569
QLEARLAF+NALRAAS+ TFRYCMREYAMFLLTD SLLNEAASVGGIRSILEGYLNDARA
Sbjct: 1502 QLEARLAFDNALRAASAGTFRYCMREYAMFLLTDGSLLNEAASVGGIRSILEGYLNDARA 1561
Query: 1570 FPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKE 1629
FPV EPLSRRFI DI+KPRVRLLVSN LSPISPDVSLVNCILEVWYGPSLLPQKFNKPKE
Sbjct: 1562 FPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKE 1621
Query: 1630 LVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIP 1689
LVDFVETILE+LPSNYQLVLSVCKQL N DNYSSQAASPSLIFWACSNLI AIF+ VPIP
Sbjct: 1622 LVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAASPSLIFWACSNLITAIFNCVPIP 1681
Query: 1690 PEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEV 1749
PEFIWVEAANIL NVKGLEAITERFHKRALSVYPFSVQLWKSYY++C+TRGDTS VLQEV
Sbjct: 1682 PEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSVQLWKSYYSMCKTRGDTSTVLQEV 1741
Query: 1750 NERGIQLNEPS 1759
NERGI+LNEPS
Sbjct: 1742 NERGIELNEPS 1748
BLAST of CsGy5G017110 vs. NCBI nr
Match:
TYK05820.1 (zinc finger C3H1 domain-containing protein isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 3125 bits (8102), Expect = 0.0
Identity = 1583/1702 (93.01%), Postives = 1628/1702 (95.65%), Query Frame = 0
Query: 59 VHPVCSTVPASIASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKA 118
V PVCSTVPASIAS ISS LPPK+KCNPGI+TASAD+C RTSISTMSQKIRDNAQIVNKA
Sbjct: 167 VRPVCSTVPASIASPISSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKA 226
Query: 119 STPWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRS 178
STPW ASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSNKLRS
Sbjct: 227 STPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRS 286
Query: 179 MTRNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ 238
MTRNKV+ NKLPLSQ FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ
Sbjct: 287 MTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ 346
Query: 239 GMNVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATIS 298
GMN+NTSKLQDLRQQIAIRESKLKLKSAQQNKE +LVTNQDYIVTNSK DLGRKGN TIS
Sbjct: 347 GMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTIS 406
Query: 299 QFPPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQK 358
QFPPLGPK+ N KRMKTSGSYSSKLSLN QQL SLIAAKF+WPQEPGEE QNIKGSYNQK
Sbjct: 407 QFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQLHSLIAAKFVWPQEPGEEIQNIKGSYNQK 466
Query: 359 GKSLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDD-TDIVANGNQSDFIGNQVDPHP 418
GKSLSREE+SVLKQSKEDIKHVAASPSLGIDLGKVQDD TDIVANGN SD IG QVDPHP
Sbjct: 467 GKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDPHP 526
Query: 419 LVVLDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPC 478
LVVLDQATALPNVASNVQSQFDNVEF RQSDGLQPSASTAK FE TPPQSA NVK PEPC
Sbjct: 527 LVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPEPC 586
Query: 479 SNFFKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEER 538
SNFFKSLIN K+SGTAFGN SSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEER
Sbjct: 587 SNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEER 646
Query: 539 NAFKIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDH 598
NAFKIYSRAQRALIEANSRC++LY+KRELFS HFHSFCMNNPGSVSSSRQQEDMIIDVDH
Sbjct: 647 NAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDVDH 706
Query: 599 LNSMSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSC 658
LNSMSG+ANI SPLYQKHSEYNSSTRL NDLNMQ ENAG INTSNLHENGQ+LGSEPGSC
Sbjct: 707 LNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPGSC 766
Query: 659 SDLGGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHF 718
SDLGGNT+DPLPFKGNNIADRI SPSV+PN+S+DGDEES PSDHEMIDSYNECYMRKKHF
Sbjct: 767 SDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDHEMIDSYNECYMRKKHF 826
Query: 719 ENDQMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQG 778
E+DQMEAYN K NHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNN+QTSVEQG
Sbjct: 827 EDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVEQG 886
Query: 779 TENDARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDE 838
TENDAR+D TQQNNTELTV LAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDE
Sbjct: 887 TENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDE 946
Query: 839 IHGRYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIE 898
IHG YHFENLPSE PDLTDSDDNEPFSREGSCSKTT SFTPLTMNSVLQHMKVISSVSIE
Sbjct: 947 IHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVSIE 1006
Query: 899 VLLTRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPS 958
VLL+RT NLGFPEDGDSLEVD+IHWRK ENSVHEI RPMLQSDGSYTDDLAIDPS
Sbjct: 1007 VLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPMLQSDGSYTDDLAIDPS 1066
Query: 959 WPLCMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLV 1018
WPLCMYELRGKCNNDECPWQHMKDFSFANR QCQHGHINSSDETKVFK ED+MTPPTYLV
Sbjct: 1067 WPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHINSSDETKVFKYEDRMTPPTYLV 1126
Query: 1019 GIDILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSW 1078
GIDILKADSRSYG VLAQRSSQCWQ+FFSISLTLPNLL+KDASADGLFLHDARIEAKGSW
Sbjct: 1127 GIDILKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLFLHDARIEAKGSW 1186
Query: 1079 NRPSSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKS 1138
NRPSSYFQRGGSVLSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKS
Sbjct: 1187 NRPSSYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKS 1246
Query: 1139 IALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAIS 1198
IALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAIS
Sbjct: 1247 IALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAIS 1306
Query: 1199 ALCHNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF 1258
ALC NIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF
Sbjct: 1307 ALCDNIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF 1366
Query: 1259 THSDMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKL 1318
HSDMLTCLNISDKCIFWV VVYLVLYRKLPHAIVQQLECEKELIEIEWPAV LTNGEKL
Sbjct: 1367 MHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVQLTNGEKL 1426
Query: 1319 RASRVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVK 1378
RASRVVKK VDF DSCLNNES +SKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVK
Sbjct: 1427 RASRVVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVK 1486
Query: 1379 LYPSCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIK 1438
LYPSCPEL+LLDIRARKHDFGDATV+AFE+ IRYWPKEVPG+QCIWNQYAEYLLRNGRIK
Sbjct: 1487 LYPSCPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQYAEYLLRNGRIK 1546
Query: 1439 CTEELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYL 1498
CTEELMAR F+STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSP+EVDVVFWYL
Sbjct: 1547 CTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPSEVDVVFWYL 1606
Query: 1499 NHSVHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRS 1558
NHSVHKLL+NDQLEARLAF+NALRAAS+ TFRYCMREYAMFLLTDESLLNEAASVGGIRS
Sbjct: 1607 NHSVHKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTDESLLNEAASVGGIRS 1666
Query: 1559 ILEGYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPS 1618
ILEGYLNDARAFPV EPLSRRFI DI+KPRVRLLVSN LSPISPDVSLVNCILEVWYGPS
Sbjct: 1667 ILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSLVNCILEVWYGPS 1726
Query: 1619 LLPQKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNL 1678
LLPQKFNKPKELVDFVETILE+LPSNYQLVLSVCKQL N DNYSSQAASPSLIFWACSNL
Sbjct: 1727 LLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAASPSLIFWACSNL 1786
Query: 1679 IIAIFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRT 1738
I AIF+ VPIPPEFIWVEAANIL NVKGLEAITERFHKRALSVYPFSVQLWKSYY++C+T
Sbjct: 1787 ITAIFNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSVQLWKSYYSMCKT 1846
Query: 1739 RGDTSAVLQEVNERGIQLNEPS 1759
RGDTS VLQEVNERGI+LNEPS
Sbjct: 1847 RGDTSTVLQEVNERGIELNEPS 1864
BLAST of CsGy5G017110 vs. NCBI nr
Match:
KAA0066024.1 (zinc finger C3H1 domain-containing protein isoform X1 [Cucumis melo var. makuwa])
HSP 1 Score: 3053 bits (7916), Expect = 0.0
Identity = 1545/1657 (93.24%), Postives = 1587/1657 (95.78%), Query Frame = 0
Query: 104 MSQKIRDNAQIVNKASTPWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHF 163
MSQKIRDNAQIVNKASTPW ASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHF
Sbjct: 1 MSQKIRDNAQIVNKASTPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHF 60
Query: 164 KPPTSILDRSNKLRSMTRNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIR 223
KPPTS LDRSNKLRSMTRNKV+ NKLPLSQ FIPSMTKNHKAYSKGAAGPSFAEQGSKIR
Sbjct: 61 KPPTSTLDRSNKLRSMTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIR 120
Query: 224 AFSGNLQSQGRGNDQGMNVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVT 283
AFSGNLQSQGRGNDQGMN+NTSKLQDLRQQIAIRESKLKLKSAQQNKE +LVTNQDYIVT
Sbjct: 121 AFSGNLQSQGRGNDQGMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVT 180
Query: 284 NSKSDLGRKGNATISQFPPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQE 343
NSK DLGRKGN TISQFPPLGPK+ N KRMKTSGSYSSKLSLN QQL SLIAAKF+WPQE
Sbjct: 181 NSKPDLGRKGNNTISQFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQLHSLIAAKFVWPQE 240
Query: 344 PGEETQNIKGSYNQKGKSLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDD-TDIVAN 403
PGEE QNIKGSYNQKGKSLSREE+SVLKQSKEDIKHVAASPSLGIDLGKVQDD TDIVAN
Sbjct: 241 PGEEIQNIKGSYNQKGKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVAN 300
Query: 404 GNQSDFIGNQVDPHPLVVLDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFER 463
GN SD IG QVDPHPLVVLDQATALPNVASNVQSQFDNVEF RQSDGLQPSASTAK FE
Sbjct: 301 GNHSDLIGKQVDPHPLVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEG 360
Query: 464 TPPQSASNVKTPEPCSNFFKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDL 523
TPPQSA NVK PEPCSNFFKSLIN K+SGTAFGN SSCLDFGNFDLQSLFEIEESLDKDL
Sbjct: 361 TPPQSAYNVKIPEPCSNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDL 420
Query: 524 EEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSV 583
EEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRC++LY+KRELFS HFHSFCMNNPGSV
Sbjct: 421 EEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSV 480
Query: 584 SSSRQQEDMIIDVDHLNSMSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSN 643
SSSRQQEDMIIDVDHLNSMSG+ANI SPLYQKHSEYNSSTRL NDLNMQ ENAG INTSN
Sbjct: 481 SSSRQQEDMIIDVDHLNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSN 540
Query: 644 LHENGQSLGSEPGSCSDLGGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHE 703
LHENGQ+LGSEPGSCSDLGGNT+DPLPFKGNNIADRI SPSV+PN+S+DGDEES PSDHE
Sbjct: 541 LHENGQNLGSEPGSCSDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDHE 600
Query: 704 MIDSYNECYMRKKHFENDQMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSK 763
MIDSYNECYMRKKHFE+DQMEAYN K NHCDNNIEDSLRLEAKLRSELFARLGTRNLSK
Sbjct: 601 MIDSYNECYMRKKHFEDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSK 660
Query: 764 ACNPCNNLQTSVEQGTENDARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQ 823
ACNPCNN+QTSVEQGTENDAR+D TQQNNTELTV LAVGSDVDLISKKNESALLSGKGDQ
Sbjct: 661 ACNPCNNIQTSVEQGTENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGDQ 720
Query: 824 QFGFGGTDRCKTPDEIHGRYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMN 883
QFGFGGTDRCKTPDEIHG YHFENLPSE PDLTDSDDNEPFSREGSCSKTT SFTPLTMN
Sbjct: 721 QFGFGGTDRCKTPDEIHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMN 780
Query: 884 SVLQHMKVISSVSIEVLLTRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPML 943
SVLQHMKVISSVSIEVLL+RT NLGFPEDGDSLEVD+IHWRK ENSVHEI RPML
Sbjct: 781 SVLQHMKVISSVSIEVLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPML 840
Query: 944 QSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETK 1003
QSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANR QCQHGHINSSDETK
Sbjct: 841 QSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHINSSDETK 900
Query: 1004 VFKNEDQMTPPTYLVGIDILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASAD 1063
VFK ED+MTPPTYLVGIDILKADSRSYG VLAQRSSQCWQ+FFSISLTLPNLL+KDASAD
Sbjct: 901 VFKYEDRMTPPTYLVGIDILKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASAD 960
Query: 1064 GLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKK 1123
GLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQE NSREGMKK
Sbjct: 961 GLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKK 1020
Query: 1124 ALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSR 1183
ALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSR
Sbjct: 1021 ALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSR 1080
Query: 1184 MNLDARLAAYDSAISALCHNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIF 1243
MNLDARLAAYDSAISALC NIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEKAIQRIF
Sbjct: 1081 MNLDARLAAYDSAISALCDNIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIF 1140
Query: 1244 GLLQVAMDSDEPYSFTHSDMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELI 1303
GLLQVAMDSDEPYSF HSDMLTCLNISDKCIFWV VVYLVLYRKLPHAIVQQLECEKELI
Sbjct: 1141 GLLQVAMDSDEPYSFMHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELI 1200
Query: 1304 EIEWPAVHLTNGEKLRASRVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAF 1363
EIEWPAV LTNGEKLRASRVVKK VDF DSCLNNES +SKCYQKSIQMFAVNHIRCLMAF
Sbjct: 1201 EIEWPAVQLTNGEKLRASRVVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAF 1260
Query: 1364 EDIEFSRNLLDKYVKLYPSCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCI 1423
EDIEFSRNLLDKYVKLYPSCPEL+LLDIRARKHDFGDATV+AFE+ IRYWPKEVPG+QCI
Sbjct: 1261 EDIEFSRNLLDKYVKLYPSCPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCI 1320
Query: 1424 WNQYAEYLLRNGRIKCTEELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRA 1483
WNQYAEYLLRNGRIKCTEELMAR F+STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRA
Sbjct: 1321 WNQYAEYLLRNGRIKCTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRA 1380
Query: 1484 LDCSPNEVDVVFWYLNHSVHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTD 1543
LDCSP+EVDVVFWYLNHSVHKLL+NDQLEARLAF+NALRAAS+ TFRYCMREYAMFLLTD
Sbjct: 1381 LDCSPSEVDVVFWYLNHSVHKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTD 1440
Query: 1544 ESLLNEAASVGGIRSILEGYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPD 1603
ESLLNEAASVGGIRSILEGYLNDARAFPV EPLSRRFI DI+KPRVRLLVSN LSPISPD
Sbjct: 1441 ESLLNEAASVGGIRSILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPD 1500
Query: 1604 VSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSS 1663
VSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILE+LPSNYQLVLSVCKQL N DNYSS
Sbjct: 1501 VSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSS 1560
Query: 1664 QAASPSLIFWACSNLIIAIFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYP 1723
QAASPSLIFWACSNLI AIF+ VPIPPEFIWVEAANIL NVKGLEAITERFHKRALSVYP
Sbjct: 1561 QAASPSLIFWACSNLITAIFNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYP 1620
Query: 1724 FSVQLWKSYYNICRTRGDTSAVLQEVNERGIQLNEPS 1759
FSVQLWKSYY++C+TRGDTS VLQEVNERGI+LNEPS
Sbjct: 1621 FSVQLWKSYYSMCKTRGDTSTVLQEVNERGIELNEPS 1653
BLAST of CsGy5G017110 vs. NCBI nr
Match:
XP_038890115.1 (uncharacterized protein LOC120079791 isoform X3 [Benincasa hispida])
HSP 1 Score: 2955 bits (7660), Expect = 0.0
Identity = 1505/1755 (85.75%), Postives = 1598/1755 (91.05%), Query Frame = 0
Query: 17 KDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRISS 76
K+++ELT+KSM SNS+P+K KASD +EEGEVSSSDNDTQTHDVHPVCSTVPAS+ S ISS
Sbjct: 4 KNTEELTVKSMASNSQPSKSKASDSREEGEVSSSDNDTQTHDVHPVCSTVPASVTSPISS 63
Query: 77 ILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVISF 136
ILPPKNK NPGI+ SADVCTRTSI T+SQKI DNAQ+VNK STPW ASR+ANSNLVISF
Sbjct: 64 ILPPKNKYNPGIQAVSADVCTRTSIQTISQKICDNAQVVNKVSTPWGASREANSNLVISF 123
Query: 137 SDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQAFI 196
SDDSGSELEECSKVRTSKSHSDAVRH+KPPTSI+DRSNKLRSMTRNKVV NKL LSQ FI
Sbjct: 124 SDDSGSELEECSKVRTSKSHSDAVRHYKPPTSIIDRSNKLRSMTRNKVVANKLSLSQPFI 183
Query: 197 PSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQIAI 256
PSMTKNH+AYSKGAAGPS AEQGSKIRAFSGNLQSQGRGNDQG N+NTSKLQDLR+QIAI
Sbjct: 184 PSMTKNHRAYSKGAAGPSLAEQGSKIRAFSGNLQSQGRGNDQGKNLNTSKLQDLREQIAI 243
Query: 257 RESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMKTS 316
ESKLK KSAQQNKE + VTNQDYIVTNSKSDL RKG+ATI QFPPL PK+ + KR+KTS
Sbjct: 244 CESKLKFKSAQQNKESISVTNQDYIVTNSKSDLARKGSATIPQFPPLVPKEPDVKRLKTS 303
Query: 317 GSYSSKLSLNGQQ-LRSLIAAKFIW-PQEPGEETQNIKGSYNQKGKSLSREESSVLKQSK 376
GSYS+KLSL+GQQ LR++ A K ++ PQEPGEETQNIK +YNQKG SL REESSVLKQSK
Sbjct: 304 GSYSTKLSLSGQQHLRTMYAGKSVFRPQEPGEETQNIKVTYNQKGISLGREESSVLKQSK 363
Query: 377 EDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVASNV 436
EDIKHVAASPS GIDLGKVQDD DIVANGNQ D+I QVDPHPLVVLD AT LPN+ SNV
Sbjct: 364 EDIKHVAASPSPGIDLGKVQDDNDIVANGNQLDWISKQVDPHPLVVLDLATVLPNMTSNV 423
Query: 437 QSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGTAF 496
Q+QFDNVEFHRQSDGLQPSAS AK FE T PQSASNVK PEPCSNFFKSLINSK+SGTAF
Sbjct: 424 QTQFDNVEFHRQSDGLQPSASAAKHFEGTLPQSASNVKIPEPCSNFFKSLINSKSSGTAF 483
Query: 497 GNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEAN 556
GN SCL F NFDLQSLFE+EESLDKDLEEAQD RRQCEIEERNAFKIYSRAQRALIEAN
Sbjct: 484 GNSPSCLGFSNFDLQSLFEMEESLDKDLEEAQDIRRQCEIEERNAFKIYSRAQRALIEAN 543
Query: 557 SRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLYQK 616
SRC++LYHKRELFS HFHSFCMNNPG +SSSRQQEDM I DHLNSMSG+AN ASPLYQK
Sbjct: 544 SRCLDLYHKRELFSAHFHSFCMNNPGLISSSRQQEDMKIGADHLNSMSGNANGASPLYQK 603
Query: 617 HSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKGNN 676
HSEYNSST+LH DLNMQ ENAG IN+SNLHENGQ+LGSEP CSDLGGN LDPLP KGNN
Sbjct: 604 HSEYNSSTQLHTDLNMQHENAGPINSSNLHENGQNLGSEPELCSDLGGNKLDPLPSKGNN 663
Query: 677 IADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNHCD 736
IADRI SPSVDPNVS+DGDEES PSDHEMIDSY+ECYM KK FE+DQME YN SK N CD
Sbjct: 664 IADRICSPSVDPNVSVDGDEESLPSDHEMIDSYDECYMGKKQFEDDQMETYNISKKNQCD 723
Query: 737 NNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNTEL 796
NNIEDSLRLEAKLRSELFARLG RNLSK CNPC+N+QT VEQGT++DARDD TQQNNTE
Sbjct: 724 NNIEDSLRLEAKLRSELFARLGIRNLSKTCNPCHNIQTPVEQGTKSDARDDRTQQNNTEP 783
Query: 797 TVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAPDL 856
TV LAVGSD DL SKK ES LLSGKGDQQFGFGG +RC TPD+IHGRYHFENLPSE D
Sbjct: 784 TVGLAVGSDADLTSKKTESTLLSGKGDQQFGFGGPNRCNTPDDIHGRYHFENLPSETQDS 843
Query: 857 TDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGFPE 916
DSDDNEPF+REGSCSKTT SFTPLTMNSVLQH+K I SVSIEVLL RT GSLSNLGFPE
Sbjct: 844 ADSDDNEPFNREGSCSKTTFSFTPLTMNSVLQHIKAIPSVSIEVLLARTRGSLSNLGFPE 903
Query: 917 DGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNNDEC 976
DGDSLEVDQIHWRKLKENSVHE RPM QSDGSY DDLAIDP WPLCMYELRGKCNNDEC
Sbjct: 904 DGDSLEVDQIHWRKLKENSVHETVRPMFQSDGSYIDDLAIDPLWPLCMYELRGKCNNDEC 963
Query: 977 PWQHMKDFSFANRSQCQHGHIN----------SSDETKVFKNEDQMTPPTYLVGIDILKA 1036
PWQH+KD+S ANR QCQH HIN SSDETK+FK ED MTPPTYLVGIDILKA
Sbjct: 964 PWQHVKDYSLANRRQCQHDHINYSDSCNGLSFSSDETKIFKYEDCMTPPTYLVGIDILKA 1023
Query: 1037 DSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYF 1096
DS SY VLAQ+SSQCWQ+FFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYF
Sbjct: 1024 DSHSYDPVLAQKSSQCWQNFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYF 1083
Query: 1097 QRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVY 1156
QRG S+LSQLKQGDE+LALETALIIINQE NSREGMKKALPVLSRAVENNPKS+ALW +Y
Sbjct: 1084 QRGSSILSQLKQGDEDLALETALIIINQETNSREGMKKALPVLSRAVENNPKSVALWTIY 1143
Query: 1157 LLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIF 1216
LLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYD+A+SALC NI
Sbjct: 1144 LLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDAALSALCDNIV 1203
Query: 1217 THNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLT 1276
T NLDGKYAS HILDLILQMTNCLCMSGNVEKAIQRI GLL+VAMDSDEPYSFTHSDMLT
Sbjct: 1204 TPNLDGKYASTHILDLILQMTNCLCMSGNVEKAIQRILGLLRVAMDSDEPYSFTHSDMLT 1263
Query: 1277 CLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVK 1336
CLNISDKCIFWV VVYLV+YRKLPHA+VQQLECEKELIEIEWPA+ LT+GEKLRASRVVK
Sbjct: 1264 CLNISDKCIFWVCVVYLVIYRKLPHAVVQQLECEKELIEIEWPAIQLTDGEKLRASRVVK 1323
Query: 1337 KAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPE 1396
KAVDFVDSC NNES DSKCYQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVKLYPSC E
Sbjct: 1324 KAVDFVDSCPNNESPDSKCYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKLYPSCLE 1383
Query: 1397 LLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMA 1456
L+LL +RA+K DFGD TV+AFE+ I WPKEVPG+QCIWNQYAEYLL+NGRIKCTEELM
Sbjct: 1384 LILLKVRAKKRDFGDETVVAFEQAIGNWPKEVPGIQCIWNQYAEYLLQNGRIKCTEELMV 1443
Query: 1457 RRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKL 1516
R F+ST KMDCSKTRT N DCD L+LLD+ASGSIV A+DCSPNEVDVVFWYLN SVHKL
Sbjct: 1444 RWFESTPKMDCSKTRTLDNGDCDCLNLLDYASGSIVHAMDCSPNEVDVVFWYLNLSVHKL 1503
Query: 1517 LLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLN 1576
LLNDQLEARLAF+NALRAASS TFRYCMREYAMFLLTDESLLNEAASVGGIR+ILEGYLN
Sbjct: 1504 LLNDQLEARLAFDNALRAASSGTFRYCMREYAMFLLTDESLLNEAASVGGIRNILEGYLN 1563
Query: 1577 DARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFN 1636
DARAFP+PEPLSR+FI DI+KPRVRLL+SNMLSP+SPDVSLVNCILEVWYGPSLLPQKFN
Sbjct: 1564 DARAFPIPEPLSRKFINDIKKPRVRLLISNMLSPLSPDVSLVNCILEVWYGPSLLPQKFN 1623
Query: 1637 KPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSS 1696
KPKELVDFVETILE+LPSNYQLVLSVCKQLCN D+ SSQAAS SLIFWACSNLI AIFSS
Sbjct: 1624 KPKELVDFVETILEMLPSNYQLVLSVCKQLCNGDD-SSQAASASLIFWACSNLISAIFSS 1683
Query: 1697 VPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAV 1756
VPIPPE +WVEAANIL NVKG EAI ERFHKRALSVYPFSVQLW SYYN+C+TRGDTSAV
Sbjct: 1684 VPIPPESVWVEAANILINVKGFEAIIERFHKRALSVYPFSVQLWTSYYNMCKTRGDTSAV 1743
Query: 1757 LQEVNERGIQLNEPS 1759
L+EVNERGI+LNEPS
Sbjct: 1744 LREVNERGIELNEPS 1757
BLAST of CsGy5G017110 vs. ExPASy TrEMBL
Match:
A0A0A0KS73 (zf-C3H1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G512890 PE=4 SV=1)
HSP 1 Score: 3453 bits (8953), Expect = 0.0
Identity = 1750/1750 (100.00%), Postives = 1750/1750 (100.00%), Query Frame = 0
Query: 10 EKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPAS 69
EKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPAS
Sbjct: 2 EKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPAS 61
Query: 70 IASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKAN 129
IASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKAN
Sbjct: 62 IASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKAN 121
Query: 130 SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKL 189
SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKL
Sbjct: 122 SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKL 181
Query: 190 PLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQD 249
PLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQD
Sbjct: 182 PLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQD 241
Query: 250 LRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLN 309
LRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLN
Sbjct: 242 LRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLN 301
Query: 310 AKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSV 369
AKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSV
Sbjct: 302 AKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSV 361
Query: 370 LKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPN 429
LKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPN
Sbjct: 362 LKQSKEDIKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPN 421
Query: 430 VASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKT 489
VASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKT
Sbjct: 422 VASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKT 481
Query: 490 SGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRA 549
SGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRA
Sbjct: 482 SGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRA 541
Query: 550 LIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIAS 609
LIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIAS
Sbjct: 542 LIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIAS 601
Query: 610 PLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLP 669
PLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLP
Sbjct: 602 PLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLP 661
Query: 670 FKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSK 729
FKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSK
Sbjct: 662 FKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSK 721
Query: 730 NNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQ 789
NNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQ
Sbjct: 722 NNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQ 781
Query: 790 NNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPS 849
NNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPS
Sbjct: 782 NNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPS 841
Query: 850 EAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSN 909
EAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSN
Sbjct: 842 EAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSN 901
Query: 910 LGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKC 969
LGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKC
Sbjct: 902 LGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKC 961
Query: 970 NNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRSY 1029
NNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRSY
Sbjct: 962 NNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRSY 1021
Query: 1030 GHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGGS 1089
GHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGGS
Sbjct: 1022 GHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGGS 1081
Query: 1090 VLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFY 1149
VLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFY
Sbjct: 1082 VLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFY 1141
Query: 1150 SYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNLD 1209
SYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNLD
Sbjct: 1142 SYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNLD 1201
Query: 1210 GKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNIS 1269
GKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNIS
Sbjct: 1202 GKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNIS 1261
Query: 1270 DKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVDF 1329
DKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVDF
Sbjct: 1262 DKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVDF 1321
Query: 1330 VDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLLD 1389
VDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLLD
Sbjct: 1322 VDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLLD 1381
Query: 1390 IRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFDS 1449
IRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFDS
Sbjct: 1382 IRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFDS 1441
Query: 1450 TSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLNDQ 1509
TSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLNDQ
Sbjct: 1442 TSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLNDQ 1501
Query: 1510 LEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAF 1569
LEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAF
Sbjct: 1502 LEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAF 1561
Query: 1570 PVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKEL 1629
PVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKEL
Sbjct: 1562 PVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKEL 1621
Query: 1630 VDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIPP 1689
VDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIPP
Sbjct: 1622 VDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIPP 1681
Query: 1690 EFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEVN 1749
EFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEVN
Sbjct: 1682 EFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEVN 1741
Query: 1750 ERGIQLNEPS 1759
ERGIQLNEPS
Sbjct: 1742 ERGIQLNEPS 1751
BLAST of CsGy5G017110 vs. ExPASy TrEMBL
Match:
A0A1S3CJD3 (uncharacterized protein LOC103501638 OS=Cucumis melo OX=3656 GN=LOC103501638 PE=4 SV=1)
HSP 1 Score: 3190 bits (8272), Expect = 0.0
Identity = 1625/1751 (92.80%), Postives = 1673/1751 (95.55%), Query Frame = 0
Query: 10 EKEKEKEKDSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPAS 69
EKEKEKEK+SDELT+KS PSNSKP+KIKASD KEEGE+SSSDNDTQTHDV PVCSTVPAS
Sbjct: 2 EKEKEKEKNSDELTVKSTPSNSKPSKIKASDTKEEGELSSSDNDTQTHDVRPVCSTVPAS 61
Query: 70 IASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKAN 129
IAS ISS LPPK+KCNPGI+TASAD+C RTSISTMSQKIRDNAQIVNKASTPW ASRKAN
Sbjct: 62 IASPISSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKASTPWGASRKAN 121
Query: 130 SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKL 189
SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSNKLRSMTRNKV+ NKL
Sbjct: 122 SNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRSMTRNKVMANKL 181
Query: 190 PLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQD 249
PLSQ FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMN+NTSKLQD
Sbjct: 182 PLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNLNTSKLQD 241
Query: 250 LRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLN 309
LRQQIAIRESKLKLKSAQQNKE +LVTNQDYIVTNSK DLGRKGN TISQFPPLGPK+ N
Sbjct: 242 LRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTISQFPPLGPKEPN 301
Query: 310 AKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSV 369
KRMKTSGSYSSKLSLN QQL SLIAAKF+WPQEPGEE QNIKGSYNQKGKSLSREE+SV
Sbjct: 302 VKRMKTSGSYSSKLSLNEQQLHSLIAAKFVWPQEPGEEIQNIKGSYNQKGKSLSREEASV 361
Query: 370 LKQSKEDIKHVAASPSLGIDLGKVQDD-TDIVANGNQSDFIGNQVDPHPLVVLDQATALP 429
LKQSKEDIKHVAASPSLGIDLGKVQDD TDIVANGN SD IG QVDPHPLVVLDQATALP
Sbjct: 362 LKQSKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDPHPLVVLDQATALP 421
Query: 430 NVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSK 489
NVASNVQSQFDNVEF RQSDGLQPSASTAK FE TPPQSA NVK PEPCSNFFKSLIN K
Sbjct: 422 NVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPEPCSNFFKSLINCK 481
Query: 490 TSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQR 549
+SGTAFGN SSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQR
Sbjct: 482 SSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQR 541
Query: 550 ALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIA 609
ALIEANSRC++LY+KRELFS HFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSG+ANI
Sbjct: 542 ALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGNANIT 601
Query: 610 SPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPL 669
SPLYQKHSEYNSSTRL NDLNMQ ENAG INTSNLHENGQ+LGSEPGSCSDLGGNT+DPL
Sbjct: 602 SPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPGSCSDLGGNTVDPL 661
Query: 670 PFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTS 729
PFKGNNIADRI SPSVDPN+S+DGDEES PSDHEMIDSYNECY+RKKHFE+DQMEAYN
Sbjct: 662 PFKGNNIADRICSPSVDPNISLDGDEESLPSDHEMIDSYNECYVRKKHFEDDQMEAYNML 721
Query: 730 KNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQ 789
K NHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNN+QTSVEQGTENDAR+D TQ
Sbjct: 722 KKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVEQGTENDARNDRTQ 781
Query: 790 QNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLP 849
QNNTELTV LAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHG YHFENLP
Sbjct: 782 QNNTELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGPYHFENLP 841
Query: 850 SEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLS 909
SE PDLTDSDDNEPFSREGSCSKTT SFTPLTMNSVLQHMKVISSVSIEVLLTRT
Sbjct: 842 SETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVSIEVLLTRT----L 901
Query: 910 NLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGK 969
NLGFPEDGDSLEVD+IHWRK ENSV EI RPMLQSDGSYTDDLAIDPSWPLCMYELRGK
Sbjct: 902 NLGFPEDGDSLEVDRIHWRKFIENSVLEIVRPMLQSDGSYTDDLAIDPSWPLCMYELRGK 961
Query: 970 CNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRS 1029
CNNDECPWQHMKDFSFANR QCQHGHINSSDETKVFK ED+MTPPTYLVGIDILKADSRS
Sbjct: 962 CNNDECPWQHMKDFSFANRRQCQHGHINSSDETKVFKYEDRMTPPTYLVGIDILKADSRS 1021
Query: 1030 YGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGG 1089
YG VLAQRSSQCWQ+FFSISLTLPNLL+KDASADGLFLHDARIEAKGSWNRPSSYFQRGG
Sbjct: 1022 YGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLFLHDARIEAKGSWNRPSSYFQRGG 1081
Query: 1090 SVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIF 1149
SVLSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKSIALWAVYLLIF
Sbjct: 1082 SVLSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKSIALWAVYLLIF 1141
Query: 1150 YSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNL 1209
YSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALC NIF+HNL
Sbjct: 1142 YSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCDNIFSHNL 1201
Query: 1210 DGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNI 1269
DGK ASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF HSDMLTCLNI
Sbjct: 1202 DGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFMHSDMLTCLNI 1261
Query: 1270 SDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVD 1329
SDKCIFWV VVYLVLYRKLPHAIVQQLECEKELIEIEWPAV LTNGEKLRASRVVKK VD
Sbjct: 1262 SDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVQLTNGEKLRASRVVKKVVD 1321
Query: 1330 FVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLL 1389
F DSCLNNES +SKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPEL+LL
Sbjct: 1322 FADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELILL 1381
Query: 1390 DIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFD 1449
DIRARKHDFGDATV+AFE+ IRYWPKEVPG+QCIWNQYAEYLLRNGRIKCTEELMAR F+
Sbjct: 1382 DIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQYAEYLLRNGRIKCTEELMARWFN 1441
Query: 1450 STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLND 1509
STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSP+EVDVVFWYLNHSVHKLL+ND
Sbjct: 1442 STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPSEVDVVFWYLNHSVHKLLVND 1501
Query: 1510 QLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARA 1569
QLEARLAF+NALRAAS+ TFRYCMREYAMFLLTD SLLNEAASVGGIRSILEGYLNDARA
Sbjct: 1502 QLEARLAFDNALRAASAGTFRYCMREYAMFLLTDGSLLNEAASVGGIRSILEGYLNDARA 1561
Query: 1570 FPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKE 1629
FPV EPLSRRFI DI+KPRVRLLVSN LSPISPDVSLVNCILEVWYGPSLLPQKFNKPKE
Sbjct: 1562 FPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKE 1621
Query: 1630 LVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIP 1689
LVDFVETILE+LPSNYQLVLSVCKQL N DNYSSQAASPSLIFWACSNLI AIF+ VPIP
Sbjct: 1622 LVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAASPSLIFWACSNLITAIFNCVPIP 1681
Query: 1690 PEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEV 1749
PEFIWVEAANIL NVKGLEAITERFHKRALSVYPFSVQLWKSYY++C+TRGDTS VLQEV
Sbjct: 1682 PEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSVQLWKSYYSMCKTRGDTSTVLQEV 1741
Query: 1750 NERGIQLNEPS 1759
NERGI+LNEPS
Sbjct: 1742 NERGIELNEPS 1748
BLAST of CsGy5G017110 vs. ExPASy TrEMBL
Match:
A0A5D3C3A9 (Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold98G003300 PE=4 SV=1)
HSP 1 Score: 3125 bits (8102), Expect = 0.0
Identity = 1583/1702 (93.01%), Postives = 1628/1702 (95.65%), Query Frame = 0
Query: 59 VHPVCSTVPASIASRISSILPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKA 118
V PVCSTVPASIAS ISS LPPK+KCNPGI+TASAD+C RTSISTMSQKIRDNAQIVNKA
Sbjct: 167 VRPVCSTVPASIASPISSSLPPKDKCNPGIQTASADICPRTSISTMSQKIRDNAQIVNKA 226
Query: 119 STPWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRS 178
STPW ASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTS LDRSNKLRS
Sbjct: 227 STPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHFKPPTSTLDRSNKLRS 286
Query: 179 MTRNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ 238
MTRNKV+ NKLPLSQ FIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ
Sbjct: 287 MTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQ 346
Query: 239 GMNVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATIS 298
GMN+NTSKLQDLRQQIAIRESKLKLKSAQQNKE +LVTNQDYIVTNSK DLGRKGN TIS
Sbjct: 347 GMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVTNSKPDLGRKGNNTIS 406
Query: 299 QFPPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQEPGEETQNIKGSYNQK 358
QFPPLGPK+ N KRMKTSGSYSSKLSLN QQL SLIAAKF+WPQEPGEE QNIKGSYNQK
Sbjct: 407 QFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQLHSLIAAKFVWPQEPGEEIQNIKGSYNQK 466
Query: 359 GKSLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDD-TDIVANGNQSDFIGNQVDPHP 418
GKSLSREE+SVLKQSKEDIKHVAASPSLGIDLGKVQDD TDIVANGN SD IG QVDPHP
Sbjct: 467 GKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVANGNHSDLIGKQVDPHP 526
Query: 419 LVVLDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPC 478
LVVLDQATALPNVASNVQSQFDNVEF RQSDGLQPSASTAK FE TPPQSA NVK PEPC
Sbjct: 527 LVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEGTPPQSAYNVKIPEPC 586
Query: 479 SNFFKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEER 538
SNFFKSLIN K+SGTAFGN SSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEER
Sbjct: 587 SNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEER 646
Query: 539 NAFKIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDH 598
NAFKIYSRAQRALIEANSRC++LY+KRELFS HFHSFCMNNPGSVSSSRQQEDMIIDVDH
Sbjct: 647 NAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSVSSSRQQEDMIIDVDH 706
Query: 599 LNSMSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSC 658
LNSMSG+ANI SPLYQKHSEYNSSTRL NDLNMQ ENAG INTSNLHENGQ+LGSEPGSC
Sbjct: 707 LNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSNLHENGQNLGSEPGSC 766
Query: 659 SDLGGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHF 718
SDLGGNT+DPLPFKGNNIADRI SPSV+PN+S+DGDEES PSDHEMIDSYNECYMRKKHF
Sbjct: 767 SDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDHEMIDSYNECYMRKKHF 826
Query: 719 ENDQMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQG 778
E+DQMEAYN K NHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNN+QTSVEQG
Sbjct: 827 EDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSKACNPCNNIQTSVEQG 886
Query: 779 TENDARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDE 838
TENDAR+D TQQNNTELTV LAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDE
Sbjct: 887 TENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDE 946
Query: 839 IHGRYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIE 898
IHG YHFENLPSE PDLTDSDDNEPFSREGSCSKTT SFTPLTMNSVLQHMKVISSVSIE
Sbjct: 947 IHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMNSVLQHMKVISSVSIE 1006
Query: 899 VLLTRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPS 958
VLL+RT NLGFPEDGDSLEVD+IHWRK ENSVHEI RPMLQSDGSYTDDLAIDPS
Sbjct: 1007 VLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPMLQSDGSYTDDLAIDPS 1066
Query: 959 WPLCMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLV 1018
WPLCMYELRGKCNNDECPWQHMKDFSFANR QCQHGHINSSDETKVFK ED+MTPPTYLV
Sbjct: 1067 WPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHINSSDETKVFKYEDRMTPPTYLV 1126
Query: 1019 GIDILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSW 1078
GIDILKADSRSYG VLAQRSSQCWQ+FFSISLTLPNLL+KDASADGLFLHDARIEAKGSW
Sbjct: 1127 GIDILKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASADGLFLHDARIEAKGSW 1186
Query: 1079 NRPSSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKS 1138
NRPSSYFQRGGSVLSQLKQGDENLALETALIIINQE NSREGMKKALPVLSRAVENNPKS
Sbjct: 1187 NRPSSYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKKALPVLSRAVENNPKS 1246
Query: 1139 IALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAIS 1198
IALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAIS
Sbjct: 1247 IALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAIS 1306
Query: 1199 ALCHNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF 1258
ALC NIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF
Sbjct: 1307 ALCDNIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSF 1366
Query: 1259 THSDMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKL 1318
HSDMLTCLNISDKCIFWV VVYLVLYRKLPHAIVQQLECEKELIEIEWPAV LTNGEKL
Sbjct: 1367 MHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVQLTNGEKL 1426
Query: 1319 RASRVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVK 1378
RASRVVKK VDF DSCLNNES +SKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVK
Sbjct: 1427 RASRVVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVK 1486
Query: 1379 LYPSCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIK 1438
LYPSCPEL+LLDIRARKHDFGDATV+AFE+ IRYWPKEVPG+QCIWNQYAEYLLRNGRIK
Sbjct: 1487 LYPSCPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCIWNQYAEYLLRNGRIK 1546
Query: 1439 CTEELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYL 1498
CTEELMAR F+STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSP+EVDVVFWYL
Sbjct: 1547 CTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPSEVDVVFWYL 1606
Query: 1499 NHSVHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRS 1558
NHSVHKLL+NDQLEARLAF+NALRAAS+ TFRYCMREYAMFLLTDESLLNEAASVGGIRS
Sbjct: 1607 NHSVHKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTDESLLNEAASVGGIRS 1666
Query: 1559 ILEGYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPS 1618
ILEGYLNDARAFPV EPLSRRFI DI+KPRVRLLVSN LSPISPDVSLVNCILEVWYGPS
Sbjct: 1667 ILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPDVSLVNCILEVWYGPS 1726
Query: 1619 LLPQKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNL 1678
LLPQKFNKPKELVDFVETILE+LPSNYQLVLSVCKQL N DNYSSQAASPSLIFWACSNL
Sbjct: 1727 LLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSSQAASPSLIFWACSNL 1786
Query: 1679 IIAIFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRT 1738
I AIF+ VPIPPEFIWVEAANIL NVKGLEAITERFHKRALSVYPFSVQLWKSYY++C+T
Sbjct: 1787 ITAIFNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYPFSVQLWKSYYSMCKT 1846
Query: 1739 RGDTSAVLQEVNERGIQLNEPS 1759
RGDTS VLQEVNERGI+LNEPS
Sbjct: 1847 RGDTSTVLQEVNERGIELNEPS 1864
BLAST of CsGy5G017110 vs. ExPASy TrEMBL
Match:
A0A5A7VFE0 (Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold32G00140 PE=4 SV=1)
HSP 1 Score: 3053 bits (7916), Expect = 0.0
Identity = 1545/1657 (93.24%), Postives = 1587/1657 (95.78%), Query Frame = 0
Query: 104 MSQKIRDNAQIVNKASTPWVASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHF 163
MSQKIRDNAQIVNKASTPW ASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHF
Sbjct: 1 MSQKIRDNAQIVNKASTPWGASRKANSNLVISFSDDSGSELEECSKVRTSKSHSDAVRHF 60
Query: 164 KPPTSILDRSNKLRSMTRNKVVVNKLPLSQAFIPSMTKNHKAYSKGAAGPSFAEQGSKIR 223
KPPTS LDRSNKLRSMTRNKV+ NKLPLSQ FIPSMTKNHKAYSKGAAGPSFAEQGSKIR
Sbjct: 61 KPPTSTLDRSNKLRSMTRNKVMANKLPLSQVFIPSMTKNHKAYSKGAAGPSFAEQGSKIR 120
Query: 224 AFSGNLQSQGRGNDQGMNVNTSKLQDLRQQIAIRESKLKLKSAQQNKERVLVTNQDYIVT 283
AFSGNLQSQGRGNDQGMN+NTSKLQDLRQQIAIRESKLKLKSAQQNKE +LVTNQDYIVT
Sbjct: 121 AFSGNLQSQGRGNDQGMNLNTSKLQDLRQQIAIRESKLKLKSAQQNKESLLVTNQDYIVT 180
Query: 284 NSKSDLGRKGNATISQFPPLGPKDLNAKRMKTSGSYSSKLSLNGQQLRSLIAAKFIWPQE 343
NSK DLGRKGN TISQFPPLGPK+ N KRMKTSGSYSSKLSLN QQL SLIAAKF+WPQE
Sbjct: 181 NSKPDLGRKGNNTISQFPPLGPKEPNVKRMKTSGSYSSKLSLNEQQLHSLIAAKFVWPQE 240
Query: 344 PGEETQNIKGSYNQKGKSLSREESSVLKQSKEDIKHVAASPSLGIDLGKVQDD-TDIVAN 403
PGEE QNIKGSYNQKGKSLSREE+SVLKQSKEDIKHVAASPSLGIDLGKVQDD TDIVAN
Sbjct: 241 PGEEIQNIKGSYNQKGKSLSREEASVLKQSKEDIKHVAASPSLGIDLGKVQDDITDIVAN 300
Query: 404 GNQSDFIGNQVDPHPLVVLDQATALPNVASNVQSQFDNVEFHRQSDGLQPSASTAKFFER 463
GN SD IG QVDPHPLVVLDQATALPNVASNVQSQFDNVEF RQSDGLQPSASTAK FE
Sbjct: 301 GNHSDLIGKQVDPHPLVVLDQATALPNVASNVQSQFDNVEFRRQSDGLQPSASTAKSFEG 360
Query: 464 TPPQSASNVKTPEPCSNFFKSLINSKTSGTAFGNPSSCLDFGNFDLQSLFEIEESLDKDL 523
TPPQSA NVK PEPCSNFFKSLIN K+SGTAFGN SSCLDFGNFDLQSLFEIEESLDKDL
Sbjct: 361 TPPQSAYNVKIPEPCSNFFKSLINCKSSGTAFGNSSSCLDFGNFDLQSLFEIEESLDKDL 420
Query: 524 EEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCVELYHKRELFSVHFHSFCMNNPGSV 583
EEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRC++LY+KRELFS HFHSFCMNNPGSV
Sbjct: 421 EEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCLDLYNKRELFSAHFHSFCMNNPGSV 480
Query: 584 SSSRQQEDMIIDVDHLNSMSGHANIASPLYQKHSEYNSSTRLHNDLNMQLENAGAINTSN 643
SSSRQQEDMIIDVDHLNSMSG+ANI SPLYQKHSEYNSSTRL NDLNMQ ENAG INTSN
Sbjct: 481 SSSRQQEDMIIDVDHLNSMSGNANITSPLYQKHSEYNSSTRLRNDLNMQHENAGPINTSN 540
Query: 644 LHENGQSLGSEPGSCSDLGGNTLDPLPFKGNNIADRIFSPSVDPNVSMDGDEESFPSDHE 703
LHENGQ+LGSEPGSCSDLGGNT+DPLPFKGNNIADRI SPSV+PN+S+DGDEES PSDHE
Sbjct: 541 LHENGQNLGSEPGSCSDLGGNTVDPLPFKGNNIADRICSPSVNPNISLDGDEESLPSDHE 600
Query: 704 MIDSYNECYMRKKHFENDQMEAYNTSKNNHCDNNIEDSLRLEAKLRSELFARLGTRNLSK 763
MIDSYNECYMRKKHFE+DQMEAYN K NHCDNNIEDSLRLEAKLRSELFARLGTRNLSK
Sbjct: 601 MIDSYNECYMRKKHFEDDQMEAYNMLKKNHCDNNIEDSLRLEAKLRSELFARLGTRNLSK 660
Query: 764 ACNPCNNLQTSVEQGTENDARDDITQQNNTELTVDLAVGSDVDLISKKNESALLSGKGDQ 823
ACNPCNN+QTSVEQGTENDAR+D TQQNNTELTV LAVGSDVDLISKKNESALLSGKGDQ
Sbjct: 661 ACNPCNNIQTSVEQGTENDARNDRTQQNNTELTVGLAVGSDVDLISKKNESALLSGKGDQ 720
Query: 824 QFGFGGTDRCKTPDEIHGRYHFENLPSEAPDLTDSDDNEPFSREGSCSKTTNSFTPLTMN 883
QFGFGGTDRCKTPDEIHG YHFENLPSE PDLTDSDDNEPFSREGSCSKTT SFTPLTMN
Sbjct: 721 QFGFGGTDRCKTPDEIHGPYHFENLPSETPDLTDSDDNEPFSREGSCSKTTFSFTPLTMN 780
Query: 884 SVLQHMKVISSVSIEVLLTRTHGSLSNLGFPEDGDSLEVDQIHWRKLKENSVHEIARPML 943
SVLQHMKVISSVSIEVLL+RT NLGFPEDGDSLEVD+IHWRK ENSVHEI RPML
Sbjct: 781 SVLQHMKVISSVSIEVLLSRT----LNLGFPEDGDSLEVDRIHWRKFIENSVHEIVRPML 840
Query: 944 QSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRSQCQHGHINSSDETK 1003
QSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANR QCQHGHINSSDETK
Sbjct: 841 QSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPWQHMKDFSFANRRQCQHGHINSSDETK 900
Query: 1004 VFKNEDQMTPPTYLVGIDILKADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASAD 1063
VFK ED+MTPPTYLVGIDILKADSRSYG VLAQRSSQCWQ+FFSISLTLPNLL+KDASAD
Sbjct: 901 VFKYEDRMTPPTYLVGIDILKADSRSYGPVLAQRSSQCWQNFFSISLTLPNLLRKDASAD 960
Query: 1064 GLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKK 1123
GLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQE NSREGMKK
Sbjct: 961 GLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQGDENLALETALIIINQETNSREGMKK 1020
Query: 1124 ALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSR 1183
ALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSR
Sbjct: 1021 ALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSR 1080
Query: 1184 MNLDARLAAYDSAISALCHNIFTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIF 1243
MNLDARLAAYDSAISALC NIF+HNLDGK ASAHILDLILQMTNCLCMSGNVEKAIQRIF
Sbjct: 1081 MNLDARLAAYDSAISALCDNIFSHNLDGKDASAHILDLILQMTNCLCMSGNVEKAIQRIF 1140
Query: 1244 GLLQVAMDSDEPYSFTHSDMLTCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELI 1303
GLLQVAMDSDEPYSF HSDMLTCLNISDKCIFWV VVYLVLYRKLPHAIVQQLECEKELI
Sbjct: 1141 GLLQVAMDSDEPYSFMHSDMLTCLNISDKCIFWVCVVYLVLYRKLPHAIVQQLECEKELI 1200
Query: 1304 EIEWPAVHLTNGEKLRASRVVKKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAF 1363
EIEWPAV LTNGEKLRASRVVKK VDF DSCLNNES +SKCYQKSIQMFAVNHIRCLMAF
Sbjct: 1201 EIEWPAVQLTNGEKLRASRVVKKVVDFADSCLNNESPESKCYQKSIQMFAVNHIRCLMAF 1260
Query: 1364 EDIEFSRNLLDKYVKLYPSCPELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCI 1423
EDIEFSRNLLDKYVKLYPSCPEL+LLDIRARKHDFGDATV+AFE+ IRYWPKEVPG+QCI
Sbjct: 1261 EDIEFSRNLLDKYVKLYPSCPELILLDIRARKHDFGDATVVAFEQAIRYWPKEVPGIQCI 1320
Query: 1424 WNQYAEYLLRNGRIKCTEELMARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRA 1483
WNQYAEYLLRNGRIKCTEELMAR F+STSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRA
Sbjct: 1321 WNQYAEYLLRNGRIKCTEELMARWFNSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRA 1380
Query: 1484 LDCSPNEVDVVFWYLNHSVHKLLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTD 1543
LDCSP+EVDVVFWYLNHSVHKLL+NDQLEARLAF+NALRAAS+ TFRYCMREYAMFLLTD
Sbjct: 1381 LDCSPSEVDVVFWYLNHSVHKLLVNDQLEARLAFDNALRAASAGTFRYCMREYAMFLLTD 1440
Query: 1544 ESLLNEAASVGGIRSILEGYLNDARAFPVPEPLSRRFIKDIRKPRVRLLVSNMLSPISPD 1603
ESLLNEAASVGGIRSILEGYLNDARAFPV EPLSRRFI DI+KPRVRLLVSN LSPISPD
Sbjct: 1441 ESLLNEAASVGGIRSILEGYLNDARAFPVCEPLSRRFINDIKKPRVRLLVSNTLSPISPD 1500
Query: 1604 VSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSS 1663
VSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILE+LPSNYQLVLSVCKQL N DNYSS
Sbjct: 1501 VSLVNCILEVWYGPSLLPQKFNKPKELVDFVETILEMLPSNYQLVLSVCKQLSNGDNYSS 1560
Query: 1664 QAASPSLIFWACSNLIIAIFSSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYP 1723
QAASPSLIFWACSNLI AIF+ VPIPPEFIWVEAANIL NVKGLEAITERFHKRALSVYP
Sbjct: 1561 QAASPSLIFWACSNLITAIFNCVPIPPEFIWVEAANILVNVKGLEAITERFHKRALSVYP 1620
Query: 1724 FSVQLWKSYYNICRTRGDTSAVLQEVNERGIQLNEPS 1759
FSVQLWKSYY++C+TRGDTS VLQEVNERGI+LNEPS
Sbjct: 1621 FSVQLWKSYYSMCKTRGDTSTVLQEVNERGIELNEPS 1653
BLAST of CsGy5G017110 vs. ExPASy TrEMBL
Match:
A0A6J1JRE8 (uncharacterized protein LOC111487161 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111487161 PE=4 SV=1)
HSP 1 Score: 2610 bits (6765), Expect = 0.0
Identity = 1355/1743 (77.74%), Postives = 1496/1743 (85.83%), Query Frame = 0
Query: 18 DSDELTLKSMPSNSKPTKIKASDGKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRISSI 77
D++ELT+K+M SN KPT+ K S+ +EEGEVSSSDNDTQTH VH V S +PAS+ S ISSI
Sbjct: 5 DAEELTVKAMESNLKPTRSKTSNSREEGEVSSSDNDTQTHGVHHVRSAMPASVTSPISSI 64
Query: 78 LPPKNKCNPGIKTASADVCTRTSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVISFS 137
LPPKNKCN GI+ SADVC +TSI T QKI DN QIV+KA TPWVASR AN+NLVISFS
Sbjct: 65 LPPKNKCNAGIQAVSADVCPKTSIQTTIQKICDNDQIVHKAITPWVASRDANANLVISFS 124
Query: 138 DDSGSELEECSKVRTSKSHSDAVRHFKPPTSILDRSNKLRSMTRNKVVVNKLPLSQAFIP 197
DDSGS+++E SK +TSKS S+AV FKPPT +LD+SNKLRSMTRNKVV NK SQ+FI
Sbjct: 125 DDSGSDMDERSKEKTSKSRSNAVGDFKPPTLLLDKSNKLRSMTRNKVVANKFSSSQSFIT 184
Query: 198 SMTKNHKAYSKGAAGPSFAEQGSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQIAIR 257
S T +A SKGAAGPSF EQGS+IRAFSGNL SQG NDQGMN+ +SKLQDLR+QIAI
Sbjct: 185 SKTMTQRACSKGAAGPSFVEQGSRIRAFSGNLPSQGHRNDQGMNLKSSKLQDLREQIAIW 244
Query: 258 ESKLKLKSAQQNKERVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMKTSG 317
ESKLKLKSAQQNKE + TNQDYIVTNSKSDLGRKGNATISQFPP GP +AKRMKT G
Sbjct: 245 ESKLKLKSAQQNKEIISATNQDYIVTNSKSDLGRKGNATISQFPPSGPTQPDAKRMKTIG 304
Query: 318 SYSSKLSLNGQQLRSLIAAKFIW-PQEPGEETQNIKGSYNQKGKSLSREESSVLKQSKED 377
SYS+KLSL+GQ +R+ A K ++ PQEPGEETQNIK +YNQKG S++R+E LKQ KED
Sbjct: 305 SYSTKLSLSGQHIRATNAVKSVFRPQEPGEETQNIKVTYNQKGNSMNRDEFIALKQKKED 364
Query: 378 IKHVAASPSLGIDLGKVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVASNVQS 437
IKHVAASPS G DLGKV D TDIVANGNQSD+I QVDPHPLVVL QA+ LPN ASNVQ+
Sbjct: 365 IKHVAASPSPGSDLGKVHDGTDIVANGNQSDWISKQVDPHPLVVLGQASVLPNTASNVQT 424
Query: 438 QFDNVEFHRQSDGLQPSASTAKFFERTPPQSASNVKTPEPCSNFFKSLINSKTSGTAFGN 497
FDN EFH +DGLQ SASTA F E T PQSASNVK PE SNFFKSLINSK +GTAFGN
Sbjct: 425 LFDNGEFHSPNDGLQQSASTANFSEGTCPQSASNVKIPESFSNFFKSLINSKNTGTAFGN 484
Query: 498 PSSCLDFGNFDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSR 557
PSSCL F N DL+SLFE+EE LDKDLEEAQD R +CE+EERNAFKIYSRAQRALIEANSR
Sbjct: 485 PSSCLGFSNVDLESLFEMEELLDKDLEEAQDFRHRCEVEERNAFKIYSRAQRALIEANSR 544
Query: 558 CVELYHKRELFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPLYQKHS 617
C++LYHKRELFS HFHSFCMNNPG VSSSRQQE+M I VDH NSMSG+ N ASPLYQKHS
Sbjct: 545 CLDLYHKRELFSAHFHSFCMNNPGLVSSSRQQENMKIGVDHSNSMSGNENRASPLYQKHS 604
Query: 618 EYNSSTRLHNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKGNNIA 677
EYNS T+L NDLNMQ ENA INTS LHEN Q+LGSEP SCSDL G TL+P+P KG NIA
Sbjct: 605 EYNSFTQLRNDLNMQHENASLINTSILHENRQNLGSEPESCSDLCGITLNPVPSKGKNIA 664
Query: 678 DRIFSPSVDPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNHCDNN 737
DRI SPS++PNVS+DGDEESF SDHEMIDSY+ECY+ KK FE+DQMEA N SK NH D+N
Sbjct: 665 DRICSPSIEPNVSVDGDEESFHSDHEMIDSYDECYIGKKRFEDDQMEACNMSKKNHYDDN 724
Query: 738 IEDSLRLEAKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNTELTV 797
EDSLRLEAKLRSELFARLGTRN S+ CNPC+N+QTS E+G E DARDD TQQN TE TV
Sbjct: 725 TEDSLRLEAKLRSELFARLGTRNSSQTCNPCHNIQTSAEKGAEKDARDDKTQQNYTEPTV 784
Query: 798 DLAVGSDVDLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAPDLTD 857
AVG+D+D K +SALLSGK DQ+FGFGGTDRCKTPD+I +FEN P E D+ D
Sbjct: 785 RQAVGNDID----KTKSALLSGKRDQKFGFGGTDRCKTPDDIRSHCNFENFPLETHDVAD 844
Query: 858 SDDNEPFSREGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRTHGSLSNLGFPEDG 917
SD NEP +REG CS S+ PLT+NSVLQHMK +SSVS EVLL+RT SLSNLG PE+G
Sbjct: 845 SDVNEPSNREGPCSYF--SYAPLTLNSVLQHMKAVSSVSTEVLLSRTRESLSNLGLPEEG 904
Query: 918 DSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNNDECPW 977
D LE D+IHWRKL+EN V + M QSDGSYTDDL+IDPSWPLCMYELRGKCNNDECPW
Sbjct: 905 DLLEGDRIHWRKLEENHVPDTVSCMFQSDGSYTDDLSIDPSWPLCMYELRGKCNNDECPW 964
Query: 978 QHMKDFSFANRSQCQHGHINSSDETKVFKNEDQMTPPTYLVGIDILKADSRSYGHVLAQR 1037
QH+KD S +NR CQ N SDETKVFK ED MTPPTYLVG+DILKADS SY VL Q+
Sbjct: 965 QHVKDSSLSNRRPCQDSQSNYSDETKVFKYEDLMTPPTYLVGVDILKADSHSYNPVLVQK 1024
Query: 1038 SSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSYFQRGGSVLSQLKQ 1097
SS+CWQ+FFS+SLTLPNLLQKDASADGLFLHDARIEAKGSWNR SSYFQ G + LSQLKQ
Sbjct: 1025 SSKCWQNFFSVSLTLPNLLQKDASADGLFLHDARIEAKGSWNRQSSYFQSGSTTLSQLKQ 1084
Query: 1098 GDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAVYLLIFYSYTTTGG 1157
DEN ALETALIIINQEMNSREGMK+ALP+LSRA+E+NPKSIALW +YLLIFYSYTT GG
Sbjct: 1085 ADENQALETALIIINQEMNSREGMKRALPILSRAIESNPKSIALWTMYLLIFYSYTTNGG 1144
Query: 1158 KDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNIFTHNLDGKYASAH 1217
KDDMFSYAVKHN QSYELWL+YINS MNLDAR+AAYD+A+SAL +NI T +D K ASAH
Sbjct: 1145 KDDMFSYAVKHNEQSYELWLLYINSHMNLDARIAAYDAALSALFNNILTQ-MDEKRASAH 1204
Query: 1218 ILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDMLTCLNISDKCIFWV 1277
ILDLILQMTNCLCMSGNVEKA Q+IFGLL+VAMDSDEP SF HSDMLTCLNISDKCIFWV
Sbjct: 1205 ILDLILQMTNCLCMSGNVEKATQKIFGLLRVAMDSDEPGSFMHSDMLTCLNISDKCIFWV 1264
Query: 1278 SVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVVKKAVDFVDSCLNN 1337
VVYLV+YRKLPHA+VQQLECEKEL+EIEWP +HLT+GEK RAS V+KKAVDFVDSCLNN
Sbjct: 1265 CVVYLVIYRKLPHAVVQQLECEKELVEIEWPTIHLTDGEKQRASTVIKKAVDFVDSCLNN 1324
Query: 1338 ESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCPELLLLDIRARKHD 1397
ESL+S+ YQKSIQMFAVNHIRCLMAFEDI FSRNLLDKYVK YPSC ELLLL++ +KHD
Sbjct: 1325 ESLESQSYQKSIQMFAVNHIRCLMAFEDIGFSRNLLDKYVKRYPSCLELLLLNVWTKKHD 1384
Query: 1398 FGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELMARRFDSTSKMDCS 1457
FG+ TV AFE+VIR WPKEVPGVQCIWNQYAEYLL+NGRIK TEELMAR FDS+SK+ S
Sbjct: 1385 FGE-TVAAFEEVIRNWPKEVPGVQCIWNQYAEYLLQNGRIKYTEELMARWFDSSSKIG-S 1444
Query: 1458 KTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHKLLLNDQLEARLAF 1517
+TRT NSD +SLHLLD+ASGSIV ALDCSP+EVD+VFWYLN SVHKLLLND LEARLAF
Sbjct: 1445 RTRTLDNSDYNSLHLLDYASGSIVHALDCSPSEVDLVFWYLNLSVHKLLLNDLLEARLAF 1504
Query: 1518 ENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYLNDARAFPVPEPLS 1577
+NALRAASS TF+YCMREYAMFLLTDESLLNEA SVGGIRSILEGYL+D RAFPVPE LS
Sbjct: 1505 DNALRAASSGTFKYCMREYAMFLLTDESLLNEAGSVGGIRSILEGYLSDVRAFPVPETLS 1564
Query: 1578 RRFIKDIRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQKFNKPKELVDFVETI 1637
R+FI DI+KPRV++LVSNMLSP+SPDVSLVNCILE WYGPSLLP KF+KPKELVDFVETI
Sbjct: 1565 RKFINDIKKPRVQVLVSNMLSPLSPDVSLVNCILEAWYGPSLLPPKFSKPKELVDFVETI 1624
Query: 1638 LEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIFSSVPIPPEFIWVEA 1697
LE+LPSNYQLVL VCKQLCN +N SSQ S SLIFWACSNLI AIF +VPIPPEFIWVEA
Sbjct: 1625 LEMLPSNYQLVLCVCKQLCNGNN-SSQVTSTSLIFWACSNLISAIFCAVPIPPEFIWVEA 1684
Query: 1698 ANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTSAVLQEVNERGIQLN 1757
NIL NVKG AITERFHKRALSVYPFSVQLWKSYYN C+ +GDTSAVLQ VNERGI+L+
Sbjct: 1685 TNILVNVKGFGAITERFHKRALSVYPFSVQLWKSYYNKCKAKGDTSAVLQAVNERGIELS 1737
Query: 1758 EPS 1759
PS
Sbjct: 1745 LPS 1737
BLAST of CsGy5G017110 vs. TAIR 10
Match:
AT2G39580.1 (CONTAINS InterPro DOMAIN/s: Putative zinc-finger domain (InterPro:IPR019607); Has 249 Blast hits to 219 proteins in 85 species: Archae - 0; Bacteria - 144; Metazoa - 29; Fungi - 8; Plants - 50; Viruses - 0; Other Eukaryotes - 18 (source: NCBI BLink). )
HSP 1 Score: 778.5 bits (2009), Expect = 1.2e-224
Identity = 604/1755 (34.42%), Postives = 905/1755 (51.57%), Query Frame = 0
Query: 41 GKEEGEVSSSDNDTQTHDVHPVCSTVPASIASRIS--SILPPKNKCNPGIKTASADVCTR 100
GKEEGE+S+SD++ Q P+ ++ + + IS + + + N G +D T
Sbjct: 14 GKEEGELSTSDDEVQ-----PMQTSTRSPLTEHISANTNIQRRQAGNGGSFIKPSD-ATP 73
Query: 101 TSISTMSQKIRDNAQIVNKASTPWVASRKANSNLVISFS-DDSGSELEECSKVRTSKSHS 160
T ++ +I + Q + R NSNLVI+FS DDSGSE + + +TSK
Sbjct: 74 TKLTNPGGRIFETKQAIAAIHGKKFPVRGNNSNLVINFSDDDSGSESDCKGRTQTSK--- 133
Query: 161 DAVRHFKPPTSIL-DRSNKLRSMTRNK--VVVNKLPLSQAFIPSMTKNHKAYSKGAAGPS 220
+P +I +R+ S T+ K ++ +++ + + T +H A SK + S
Sbjct: 134 -----IQPKGTISGNRNPSTFSQTKLKGPRQIDIRAITKKALSTSTFSHAATSK-VSNLS 193
Query: 221 FAEQ---GSKIRAFSGNLQSQGRGNDQGMNVNTSKLQDLRQQIAIRESKLKLKSAQQNKE 280
FA++ I + + + +Q + N++KLQDL+QQIA+RES+LKLK+AQ K+
Sbjct: 194 FAKEMKSNKYIHSSERTVSKDAQRPEQIVESNSNKLQDLKQQIALRESELKLKAAQPKKD 253
Query: 281 RVLVTNQDYIVTNSKSDLGRKGNATISQFPPLGPKDLNAKRMKTSGSYSSKLSLNGQQLR 340
V N K R+ + L P + KR+K SG +S+ ++ +
Sbjct: 254 AV----------NPKITPARRVSIISDDTRHLEPNEPPKKRLKVSGIDTSQPVIDYR--- 313
Query: 341 SLIAAKFIWPQEPGEETQNIKGSYNQKGKSLSREESSVLKQSKEDIKHVAASPSLGIDLG 400
+AA P + +++ N + KH+
Sbjct: 314 --VAASAAAPMNAPDIRKSLLPGVN----------------ANSSCKHL----------- 373
Query: 401 KVQDDTDIVANGNQSDFIGNQVDPHPLVVLDQATALPNVASNVQSQFDNVEFHRQSDGLQ 460
G++SD I V P V + ++++ ++ + ++ +
Sbjct: 374 -----------GSKSDEIVPPVIPQHTVEGNTSSSVLQKSTGKVNHYEGGRELETMKNVD 433
Query: 461 PSASTAKFFE------RTPPQSASNVKTPEPCSNFFKSLINSKTSGTAFGNPSSCLDFGN 520
S S+ + + + +S++N PCSN L N S T G+
Sbjct: 434 RSVSSEQLLKIVNGNHQVFSRSSNNNWKRLPCSN-NSGLYNIPGSTTVPGH-------SQ 493
Query: 521 FDLQSLFEIEESLDKDLEEAQDCRRQCEIEERNAFKIYSRAQRALIEANSRCVELYHKRE 580
D+ SL +EESLDK+LEEAQ+ +R EIEERNA K+Y +AQR+LIEAN+RC ELY KRE
Sbjct: 494 LDMLSLTNLEESLDKELEEAQERKRLFEIEERNALKVYRKAQRSLIEANARCAELYSKRE 553
Query: 581 LFSVHFHSFCMNNPGSVSSSRQQEDMIIDVDHLNSMSGHANIASPL-YQKHSEYNSSTRL 640
+ S H+ S + + + S E+ LN+ +G ++A+ +HS+ S+ +
Sbjct: 554 ILSAHYGSLIVRDSRLLWPSIHGENPETGFHFLNNSTGSIDLATKTDIAQHSQLESNHKY 613
Query: 641 HNDLNMQLENAGAINTSNLHENGQSLGSEPGSCSDLGGNTLDPLPFKGNNIADRIFSPSV 700
++ E G+ + +GQ+LG SDLG +T D LP A R+ SPS
Sbjct: 614 NS------EYVGS-HPPPHSRSGQNLG-----YSDLGASTSDGLPCGNKQTASRLCSPSS 673
Query: 701 DPNVSMDGDEESFPSDHEMIDSYNECYMRKKHFENDQMEAYNTSKNNHCDNNIEDSLRLE 760
D N+ D+ESFP DHE + Q E + + N +++L LE
Sbjct: 674 DANIL--PDDESFPVDHESTEGN----------PGHQKENIDQTLGN------QNALLLE 733
Query: 761 AKLRSELFARLGTRNLSKACNPCNNLQTSVEQGTENDARDDITQQNNTELTVDLAVGSDV 820
A LRS+LF RLG R S+ C N +T +++G E D + TQ++N GS
Sbjct: 734 ASLRSKLFDRLGMRAESRG-GTCFNEETVIDRGDERDFGSEGTQRDN---------GSPF 793
Query: 821 DLISKKNESALLSGKGDQQFGFGGTDRCKTPDEIHGRYHFENLPSEAPDLTDSDDNEPFS 880
I N+S G + PSEAP S + +
Sbjct: 794 SEIYLHNDSL-----------------------EPGANKLQGSPSEAPVERRSIEENSLN 853
Query: 881 REGSCSKTTNSFTPLTMNSVLQHMKVISSVSIEVLLTRT--------HGSLSNLG----F 940
+ S ++ +P ++SSV++ L R+ S+++LG
Sbjct: 854 YQLSIDMESHRSSP--------ENALLSSVALSGPLFRSTIYHLKVPGSSITSLGPEYIL 913
Query: 941 PEDGDSLEVDQIHWRKLKENSVHEIARPMLQSDGSYTDDLAIDPSWPLCMYELRGKCNND 1000
SL D+ R L E V+E + G YT +L +DPSWPLCMYELRG+CNND
Sbjct: 914 QNKTYSLYSDKRQCRSLTETIVYE------KKIGFYTCNLKVDPSWPLCMYELRGRCNND 973
Query: 1001 ECPWQHMKDFSFANRSQCQH---GHINSSDETKVFKNE------DQMTPPTYLVGIDILK 1060
EC WQH KDFS + Q H G + SS K + D + PTYLV +D +K
Sbjct: 974 ECSWQHFKDFSDDSLHQSLHDPDGRVGSSSHQKTHNSSKGSQILDSVFSPTYLVSLDTMK 1033
Query: 1061 ADSRSYGHVLAQRSSQCWQSFFSISLTLPNLLQKDASADGLFLHDARIEAKGSWNRPSSY 1120
DS SY VLAQR Q W FS L N L ++ A ++ RI G+ SSY
Sbjct: 1034 VDSWSYESVLAQRHGQIWCKHFSACLASSNSLYRNVPAKE---NEGRIVVLGNSKTYSSY 1093
Query: 1121 FQRGGSVLSQLKQGDENLALETALIIINQEMNSREGMKKALPVLSRAVENNPKSIALWAV 1180
F+ S++ + Q AL +LS+ +E +P S LWAV
Sbjct: 1094 FRIKHSLMWHIFQ--------------------------ALSLLSQGLEGDPTSEILWAV 1153
Query: 1181 YLLIFYSYTTTGGKDDMFSYAVKHNGQSYELWLMYINSRMNLDARLAAYDSAISALCHNI 1240
YLLI+++Y + GK DMFSY VKH+ +SY +WLMYINSR L+ +L AYD+A+SALC N
Sbjct: 1154 YLLIYHAYEGSDGK-DMFSYGVKHSSRSYVIWLMYINSRGQLNDQLIAYDTALSALC-NH 1213
Query: 1241 FTHNLDGKYASAHILDLILQMTNCLCMSGNVEKAIQRIFGLLQVAMDSDEPYSFTHSDML 1300
+ ++D +ASA ILD++LQM N LC+SGNV KAIQRI L A SD+P S +L
Sbjct: 1214 ASGSIDRNHASACILDVLLQMFNLLCISGNVSKAIQRISKLQAPAAVSDDPDFSLMSHIL 1273
Query: 1301 TCLNISDKCIFWVSVVYLVLYRKLPHAIVQQLECEKELIEIEWPAVHLTNGEKLRASRVV 1360
TCL SDKC+FWV VYLV+YRKLP +I+++LE EKEL+EIEWP V+L K A R+
Sbjct: 1274 TCLTYSDKCVFWVCCVYLVIYRKLPDSIIRRLEMEKELLEIEWPTVNLDGDLKQMALRLF 1333
Query: 1361 KKAVDFVDSCLNNESLDSKCYQKSIQMFAVNHIRCLMAFEDIEFSRNLLDKYVKLYPSCP 1420
K + V+ NN ++ +FA+N+ ++A +++E R++L V+LYP+C
Sbjct: 1334 DKGMRSVEHGTNN-----GIQKRPAGLFALNYALFMIAVDELESRRDILKASVQLYPTCL 1393
Query: 1421 ELLLLDIRARKHDFGDATVMAFEKVIRYWPKEVPGVQCIWNQYAEYLLRNGRIKCTEELM 1480
EL LL +R + ++ D FE++++ KE +QCIWNQYAEY L G ELM
Sbjct: 1394 ELKLLAVRMQSNELKDMFSSGFEELLKQEAKEASCIQCIWNQYAEYALEGGSYDLARELM 1453
Query: 1481 ARRFDSTSKMDCSKTRTPVNSDCDSLHLLDHASGSIVRALDCSPNEVDVVFWYLNHSVHK 1540
+R + S + K +T ++ + D+ S + L+ + ++VDV+F YLN S+H
Sbjct: 1454 SRWYGSVWDVLSHKYKTVRGNEEEG---DDNMLESALSDLNVASDQVDVMFGYLNLSLHN 1513
Query: 1541 LLLNDQLEARLAFENALRAASSETFRYCMREYAMFLLTDESLLNEAASVGGIRSILEGYL 1600
LL ++ EARLA + AL+A + E F +C+RE+A+F L +E S+ +L YL
Sbjct: 1514 LLQSNWTEARLAIDQALKATAPEHFMHCLREHAVFQLINELQATGEFSINLQMRLLNSYL 1567
Query: 1601 NDARAFPVPEPLSRRFIKD-IRKPRVRLLVSNMLSPISPDVSLVNCILEVWYGPSLLPQK 1660
+ A + PV EPLS +FI + KPRVR LV+N+L+P+S ++ +VN +LE W+GPSL+P+K
Sbjct: 1574 DRASSLPVKEPLSWKFISNSAEKPRVRKLVTNLLAPVSSELFVVNVVLEAWHGPSLVPEK 1567
Query: 1661 FNKPKELVDFVETILEILPSNYQLVLSVCKQLCNDDNYSSQAASPSLIFWACSNLIIAIF 1720
+K KELVDFVETIL ++PSNY L LSV K L ++ S +S + FWA NL I
Sbjct: 1634 LSKQKELVDFVETILGLVPSNYPLALSVSKLLRKEEKQSDSGSSSGIHFWAGLNLASTIS 1567
Query: 1721 SSVPIPPEFIWVEAANILANVKGLEAITERFHKRALSVYPFSVQLWKSYYNICRTRGDTS 1758
++P+ PE+IWVEA I++++ G + ERF K+ALSVYP SV+LW+ Y+++C++
Sbjct: 1694 CAIPVAPEYIWVEAGEIVSDINGFKTRAERFLKKALSVYPMSVKLWRCYWSLCKS----- 1567
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
O60293 | 3.7e-10 | 21.25 | Zinc finger C3H1 domain-containing protein OS=Homo sapiens OX=9606 GN=ZFC3H1 PE=... | [more] |
Match Name | E-value | Identity | Description | |
XP_011655356.2 | 0.0 | 100.00 | uncharacterized protein LOC101211906 [Cucumis sativus] >KGN51732.2 hypothetical ... | [more] |
XP_008463476.1 | 0.0 | 92.80 | PREDICTED: uncharacterized protein LOC103501638 [Cucumis melo] | [more] |
TYK05820.1 | 0.0 | 93.01 | zinc finger C3H1 domain-containing protein isoform X1 [Cucumis melo var. makuwa] | [more] |
KAA0066024.1 | 0.0 | 93.24 | zinc finger C3H1 domain-containing protein isoform X1 [Cucumis melo var. makuwa] | [more] |
XP_038890115.1 | 0.0 | 85.75 | uncharacterized protein LOC120079791 isoform X3 [Benincasa hispida] | [more] |
Match Name | E-value | Identity | Description | |
A0A0A0KS73 | 0.0 | 100.00 | zf-C3H1 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G512890 PE=... | [more] |
A0A1S3CJD3 | 0.0 | 92.80 | uncharacterized protein LOC103501638 OS=Cucumis melo OX=3656 GN=LOC103501638 PE=... | [more] |
A0A5D3C3A9 | 0.0 | 93.01 | Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuw... | [more] |
A0A5A7VFE0 | 0.0 | 93.24 | Zinc finger C3H1 domain-containing protein isoform X1 OS=Cucumis melo var. makuw... | [more] |
A0A6J1JRE8 | 0.0 | 77.74 | uncharacterized protein LOC111487161 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
Match Name | E-value | Identity | Description | |
AT2G39580.1 | 1.2e-224 | 34.42 | CONTAINS InterPro DOMAIN/s: Putative zinc-finger domain (InterPro:IPR019607); Ha... | [more] |