Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exon, five_prime_UTR, CDS, polypeptide, three_prime_UTR
TAGGTTGATGAACAGAGACCAGAATTCCAGCACTCGAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATTTATCCAAACCCTAGAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTATTTTTGAATAATTTTTGTTTTGGATCCAGGTGTTTTGTTTTTGGTTCGTTTTGTTTGTTTTTTTTCATGTACCAAAACCCTAGTTTCAATCTGGAGCTCTCGGATTGAACAAAACCCTAGAATTTGTTCTGGTGCGACATCTAAGATCACGTGTTATAACGGTAATTTTGTTTGTGGTGGACGGATCGTGTGGGAATTGGATTTATGATTCGTTGAAAGCGCGATTCTGTGTAGTAATCTTGTTTTTTGGGGTTGAGATTTAATTGAAAGAAGTGGTAAACGGGATTTGGAGATTGATAAATTTTTGGCGAGGGAGCTTGGGTGGATGGAGTTCACTTTTTGAGGGATTTTGATGCGCATTCAGCTGGATCATAGATGGGATTGCCTCTGAGTCTGCATTTTTAATGCTTTCTGACTTGAATTAGCTTCGATCGAACGAACAAATTTGAAGACAAGGCGATGGAGAGTAGTTTTTAGCGTGTTGTCTGCATTGAATTAATTGATTTTCTTTAAAGAATGAGACGGTATAGTATCCAATAAATTTCGAATTAGCTTTGCGGTAATCTTGGGCGGCGTTGTTGTTCAGTAAATTGGTTCTGGTGCTGTTGAAGACTTTTTGCATATCGATCCATTGCACTGATTAATCTTGTCCCGAGTAAGCAAGCAGAATGAGCCTCAAGAAGGAGGATTTGAAGCCACACGATCAGACTGCTACAGTGAAGCATGATTTGCGAAAGTAATTACATATAATTTGCAGTTCCTAACCGATATCTGTGGTTTATTTTCTTGCTTCTTTCTTCTCCTCTATTATTTGTTTTCTACTGTTAATTCTGGCTAGTATTGACTGCATTTTCTTGATCCTGTTTAGGAAACCGAAGTTTTCTTACACGAGGGATTTCCTCTTGTCCCTGAGCGGATTGGATGTTTGCAAAAAGTTGCCGAGCGGTTTTGACCAATCAGTTATCGCGTAAGATGGTCTTTCGTGCATCGGTTCTATTTTTTTTTAACTACTAATAAATTGTTATTATTCTTTGTTTACGCTACTGAAGTCTGTATCTTACAAAGTAAAGAACTGACGATAAGCCTTGTGATAGTATTCTTGTTTTCCTTGTTGGATTGTTTTTCTGAGCATACGTCTCCCCTTGTGTAGTGAATTAGAAGAAGCTTCCTATGATAGGCAAAGAGTTTCTGGAGGGTTGTCTTTGAATAGCTTTAGGCGCAATGAGTATGGTTCATCACCACCCAATAGGGCAGAAACGACTAATTATGCTCGTCGAATACATGGAAAGAAGGACATTAATTCTTCTGGACGGAGTGATAAGGATAGCGATTCACAATCTGACCGGGATTCAGGTGAGGTCTCTTCTTAATCTATTGTTAAGTTTCGATAACTTCATTGTTCTATACAAACTAATCTGAATATCATGTCATATCCGGCTTTGTTAACTGTTGGGTTCCTGCTGTAGTTGATTTAACTGAAAAAATACTGCTGTCTTGGAAATAATTTCATTTACATGTATATTAATAATGATATTTCTGCCTACTCAAACGTGGTTGACTAATGGAGCTCTGACAGCATTTCTCTATTTTAAAGCAGAGCACCTTGAGAACACGTTATTTGTATTGGTTTGATATCCTTAAATAGATGTAGATGTTAAAACCAGTCAATTCAGGATTTATGATGTGAAATTACTACAATATTGACATTTGGAATATTCATTTCAACAGTGGATTCTGGGTGGCGATTAAGTGATCATTCTAGGAGGCCTTCGCAGGGTCCTGAACAGGATGGACTTCTTGGTAGTGGTTCTTTTCCAAGACCACCTGGATATGCAACAGCATTTTCGGCACCAAAAGTTCGAGCACATGATC
AGTACCAGCTAAATAGAAGCAATGAGCCATATCATCCACCTCGACCTTATAAGGTTTTCATTATATTTTCCAATCTTTCTACTTTGCAAACAAAATTTAAAATTAAATTTCCATCCTTGGATTATGGTTATACATTCAACCATATGTCTTTAAACATAAACAGAATCATTTTGTTGAATGGGTAAGTGAATTACACAACAAGAATTTGAAATTATTTTGAGTAACCTTTCCTTATGGATGTTGTTTTTTCATTAATCAGGCTGTAGCCCATCAACGAGGGAATACTCATGATTCGTACAATCATGAAACTTTTGGTTCTTCTGAGTTGACAAGTGAGGATAGAGTTGAAGAGGAAAGAAAGAGAAGAGGTAATCGCTCTTTTCTCCATGATGCTTGATGTACTAACTTGAACATTTACAGATCTTGTTCAAATACTTATTCCAACAATACTTTTAGCTGACAAACTTAATTGCAGCTTCATTTGAGTCGATGAGGAAAGAACAACATAGGGCATTTCAAGAAGGTCACAAGTCAAATCCTGTGAAGCAGAGAGATGGATTTGACATCCTAATGCAGTTGGACGAGGCTAAAGATGATAAGAAACTACTGAATACAAGCAGCGGTTTCGATGAACCTATCTCCTTACAATCTTCAAAAAATGATCGAGAAACTTTTTTTCCATCTCAGACAACTGTATCTAGGCCACTTGTGCCTCCTGGATTCACGAGCACTGTGTTGGAGAAAAACTTTGGAACAAGGTCTTCAGTTAATCCTCGTTTGCTTGAGGTAATTATTCGTTTTCATCATATTTTTTTTTGGTTTTAATAGCTTGGAGTATAAGGTTGTTTCCCATTTAAATGTTGTCAATGGGGTGGATATGTGCCTGGGCAGTGTGTTTTTGTCGTCAGCAAGTGTCATATTGTTATAATAAAGAGTTATTCTTCTGGCACTAGATAACACTCTTTCTCCTTATTAACTGCCTATATGTTTGTCACTTGTGTAGGGTAAGGATGACGTCGACAAGTCTCTGCAAACCAAGGACAAGCAATTGCATAATGGGTTCTCTGAAGATTTAGAGGGAAAAAGTTCATTAGAGCAAATGGGTCGCCCTGAACATTATGGAAAAACAAGCACCAATGCTTCTACTAACAACACTGGTGAAAATATTATTCATCTGCTTTCGGCTGTAGACATGTCTAATCAAACAACTGGAACAGACGTTCAATCTCGTGAAAATTCTTTGGAAGTTTTTGAAGCTATTGAAAACAGTGCAGTTGATAATTGTAAGACTGAAATGGTGCCAGCGAATACAGCTGTTGGTGAAGCAAGTCAAGGCCATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTACCATAAAGTTAGATGGCGGTGCTACTAATTTTATTGAGGTACTCCTGAATGTAGAAAAATCTTTTTCAGCCATGGTATTAATTTTCTGGTTTTGAATCAAAATCAACTTCCTATATGTGGTATTATTACTAGAAGCCAGCTGGTATATTATCCTATGCATTTTACCCAGATTAGCTTTCTAAAATTTTCTAAGTTAAGAAAAACAGGCCTAAATGCATAGAAGTTATGAAAATCAACCCCAGCATGTGGATATCTTTTTGCAATTTCAATCTTATCATAGGAAATCATTTTATTGACGAATCATTATGTCCCTGTTGAGTAAGGGATGGAATTGTTTCATAATTTGGAACTATAATTCTTTTGTTACATTTCTTTTCCTTCAATTTGTATGTAGTTATTTCTGCTCGAAGAAAAATAAGTACCTCCTTTTCTCCTCGTTTGATTTTCATTTTAAGAACAGAAGTAACAAAAGCAATTCTAGTTTGGATACTTCTATCGCTGATGTATGCTTTGCGGGGGTCTTATGCAGCAGCAGGACAGTGAGAAGGATGATGCATGTAGCCCTCAAAATGCTCAATCTTCTAGATTCGCTCATTGGTTCATGGACAACGGTATGTTGGTAAAAAT
TCTGTATTCAAACGATTCCTTATATCAGTGGTTTATTACCTGATTGGCTTCGAGTTTTAATGCTCATAGCTTCATTTTTTTTCACCTTGCAGATAGGAAACAGGGGGATGACCTTTCACCTAAAAGGTCAATTGACTTGCTTACTATGATTGGAGCTGGAGAAAAGGGTGGATATGATTTTGTATCTGATGTGAAGCATTCTGAGCAATCTCTGCCCACAGTTGTCTTTCAGGGTTATGAATCTGCAGAAAGTTATATCACATCAAGTGCAACATCATCCAATGTTGCAAAGACTGAGCCATTCTATGATAAGAGTAAGCCGGAGGCTGTTTCTGCGATTCTTACCTGTGAGGCCGTTGAACAGACACTGCTGTCAAAAGTTAAAGAAAATGACTCAGCTTTGCAACCGTCTGATCAAAGATGGAGTCATTCTGATGATGATGTGAAACATCCAACTGTTAAGAATGATGATCTTGCATCATTGCACCTTCTCTCATTGTTACAGAAGGGTTCGAGTCCAGTGATTGCAGGATATGGTGATGATGGTGTGAGTGTAGGCTCTGCAATTCACAATAAAAAGGAGGAAAGTACGCACAACGTTTCAAATCCAGGGAAGACATTAACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCTGTTTCTGCACAAAGGGGGGGTTCATCAGGATCTGTCAAAAGTGATGTTCCAGAACCATGCGATCCGATCACAGATGATGGTCTGTTGTCCAACAATGAAATTCGGCCCAGTATGATTAATCACGATCATGGTGTTCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGACAGTGGTTAAATCTGAATGGCCCTCCGCCTGGAATGGATTCTTCTCATCCCCATGCTAAGTTAGGACATAAGATGGGTGGCTATGATGGAGCAGCTGAAATGCCCTTTCCTCAAGAGGACAGTTTAATCATAAGTGATTCTATGAATCTTCAGAATCTCATGTCTATTGGGAATTCTGCTAGACCTCAACCTCTGTTCTCACACAACTCACAAGACAGTAATGCTGCAATCTTTAACCCTGCCTTCAAAGATGAAAGGCCTAGCATGGGAGGTCTGGAAGGGCTGCCTTTTTCAGCCAGCCTCTACGATCGGAGGGAGACTGAAATGCCACAATGGAAAGCTCCGGTTCATTCCAACTTCTCCCAGCTTCATCCCCAACAAACGAATAATGTCAAGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGATATAGCGTTGCCAGAAGGAATGGTTCATCACGGCTCACCATCTAATCATCAATTTGTATCAAATATGCTTCGTCCTCCTACCTCCGGATTATCTGGATTTGATCATTTGATTCATCACCCGATGATACAGCAGATGCAAACTTCAGGCAATCTTCCCCCACAGCATCTTCTTCAAGCGTTATCTAGAGGTGCACCTCTGCCTATGACAAACAGAAGCGTTCCTCTACATCCTCATTCCATCAGAGGTAGTGCAGCAACTCTCCAACCGAACAATCAGGTTCCTGGATTAATGCAGGAACAAAATTCAATCCAAGGTTTTCATACCGGTCAGCGTGTGCCCAATACTGGTGGTCCCAGAATTCCCTCGCCAGGTAACACTCATGTTATTTTGCTTCAACTTTAAGTCCTATTTTATATTTGTTATTAAGATATATATTTGACTGATAAATTAGTTTCCATATCCTGCTTATCTGGCCATTATGTGTTCATATTTTTCCCCAATCGATTTGCCTCTATTAAAGTCTGCAGCTGATTAATAGTATCTTTTGTCCTTTAGCAATATAAGGAAAATTATATGCTTCTCGACTCATTTGATCTGATTGTGCTCAAGACATTCTTTACTTTTCCCCAACTTGGATACATTTACACTTAAAATCACTCTCTATGCATGATGCCTGCTGCGGAGTGCTTGTTTTGCTGTGACT
ATAATTAAAATACTCGTATAATTTAATTTGCCTTAATCCATTAAGCTTTTGGGTTGATTGGTGATTTAACATGGTGTTGTGTTATCTCCTCCCCAATCAATATTGTTTCTCACTTGTTAGACCTTTTTTAAATCTTGAAATCCACGAGTGGGGGAGAGCATTTAAGTAGTGATGTTGTTAAATTTACAATAATTCATCCATTTAAGCTTTAAAGTTGATTGGGTGCTTTAACGAGCTATATAGATAAAATGCAATGCAGTTGAATATGTGTAGCTCATTGATCTCTATTTAGAGGGTGGCTCTCTTTTATTCTGATTTCTCGTTTACATTTCAGCTCCTGGTAACCAACCAGACGCAATTCAGAGGCTCATCCAAATGGGACATAGATCGAACTCGACCTCGAAGCAAATTCATCCGCTTTCTGCCAGTGGTGGCCATGGTCAGGGGATGTATGGTCACGAGTTGAACATGGGTTATGGGTACAGGTAATATTGTAACACTGTACTTCGAGCATACTTGCCCAAATCCTCAATTCCTTGGATAGAGAATGGACACAATTTGCTCCTCTTAGTAGGATTGAGTGAGAAATGTAAGTAATGAATCCTTCTTATGCTTGTTTTTGTTGGTTTATCTTTCGCTTATTTCCCGTGTGGTTTCTAACTCGAAACATAGTTCCGAAGTAACCCGACAGACAAAATGAAATCAGTAGGTGAAGGAAAGTGCTGTGTTTCATTTTTTTGGGAGGCATTTTTAAGAAAGGAACAGAATTGGCTGCTATAGGGATAATACCCT
mRNA sequence
TAGGTTGATGAACAGAGACCAGAATTCCAGCACTCGAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATTTATCCAAACCCTAGAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTATTTTTGAATAATTTTTGTTTTGGATCCAGGTGTTTTGTTTTTGGTTCGTTTTGTTTGTTTTTTTTCATGTACCAAAACCCTAGTTTCAATCTGGAGCTCTCGGATTGAACAAAACCCTAGAATTTGTTCTGGTGCGACATCTAAGATCACGTGTTATAACGGTAATTTTGTTTGTGGTGGACGGATCGTGTGGGAATTGGATTTATGATTCGTTGAAAGCGCGATTCTGTGTAGTAATCTTGTTTTTTGGGGTTGAGATTTAATTGAAAGAAGTGGTAAACGGGATTTGGAGATTGATAAATTTTTGGCGAGGGAGCTTGGGTGGATGGAGTTCACTTTTTGAGGGATTTTGATGCGCATTCAGCTGGATCATAGATGGGATTGCCTCTGAGTCTGCATTTTTAATGCTTTCTGACTTGAATTAGCTTCGATCGAACGAACAAATTTGAAGACAAGGCGATGGAGAGTAGTTTTTAGCGTGTTGTCTGCATTGAATTAATTGATTTTCTTTAAAGAATGAGACGGTATAGTATCCAATAAATTTCGAATTAGCTTTGCGGTAATCTTGGGCGGCGTTGTTGTTCAGTAAATTGGTTCTGGTGCTGTTGAAGACTTTTTGCATATCGATCCATTGCACTGATTAATCTTGTCCCGAGTAAGCAAGCAGAATGAGCCTCAAGAAGGAGGATTTGAAGCCACACGATCAGACTGCTACAGTGAAGCATGATTTGCGAAAGAAACCGAAGTTTTCTTACACGAGGGATTTCCTCTTGTCCCTGAGCGGATTGGATGTTTGCAAAAAGTTGCCGAGCGGTTTTGACCAATCAGTTATCGCTGAATTAGAAGAAGCTTCCTATGATAGGCAAAGAGTTTCTGGAGGGTTGTCTTTGAATAGCTTTAGGCGCAATGAGTATGGTTCATCACCACCCAATAGGGCAGAAACGACTAATTATGCTCGTCGAATACATGGAAAGAAGGACATTAATTCTTCTGGACGGAGTGATAAGGATAGCGATTCACAATCTGACCGGGATTCAGTGGATTCTGGGTGGCGATTAAGTGATCATTCTAGGAGGCCTTCGCAGGGTCCTGAACAGGATGGACTTCTTGGTAGTGGTTCTTTTCCAAGACCACCTGGATATGCAACAGCATTTTCGGCACCAAAAGTTCGAGCACATGATCAGTACCAGCTAAATAGAAGCAATGAGCCATATCATCCACCTCGACCTTATAAGGCTGTAGCCCATCAACGAGGGAATACTCATGATTCGTACAATCATGAAACTTTTGGTTCTTCTGAGTTGACAAGTGAGGATAGAGTTGAAGAGGAAAGAAAGAGAAGAGCTTCATTTGAGTCGATGAGGAAAGAACAACATAGGGCATTTCAAGAAGGTCACAAGTCAAATCCTGTGAAGCAGAGAGATGGATTTGACATCCTAATGCAGTTGGACGAGGCTAAAGATGATAAGAAACTACTGAATACAAGCAGCGGTTTCGATGAACCTATCTCCTTACAATCTTCAAAAAATGATCGAGAAACTTTTTTTCCATCTCAGACAACTGTATCTAGGCCACTTGTGCCTCCTGGATTCACGAGCACTGTGTTGGAGAAAAACTTTGGAACAAGGTCTTCAGTTAATCCTCGTTTGCTTGAGGGTAAGGATGACGTCGACAAGTCTCTGCAAACCAAGGACAAGCAATTGCATAATGGGTTCTCTGAAGATTTAGAGGGAAAAAGTTCATTAGAGCAAATGGGTCGCCCTGAACATTATGGAAAAACAAGCACCAATGCTTCTACTAACAACACTGGTGAAAATATTATTCATCTGCTTTCGGCTGTAGACATGTCTAATCAAACAACTGGAACAGACG
TTCAATCTCGTGAAAATTCTTTGGAAGTTTTTGAAGCTATTGAAAACAGTGCAGTTGATAATTGTAAGACTGAAATGGTGCCAGCGAATACAGCTGTTGGTGAAGCAAGTCAAGGCCATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTACCATAAAGTTAGATGGCGGTGCTACTAATTTTATTGAGCAGCAGGACAGTGAGAAGGATGATGCATGTAGCCCTCAAAATGCTCAATCTTCTAGATTCGCTCATTGGTTCATGGACAACGATAGGAAACAGGGGGATGACCTTTCACCTAAAAGGTCAATTGACTTGCTTACTATGATTGGAGCTGGAGAAAAGGGTGGATATGATTTTGTATCTGATGTGAAGCATTCTGAGCAATCTCTGCCCACAGTTGTCTTTCAGGGTTATGAATCTGCAGAAAGTTATATCACATCAAGTGCAACATCATCCAATGTTGCAAAGACTGAGCCATTCTATGATAAGAGTAAGCCGGAGGCTGTTTCTGCGATTCTTACCTGTGAGGCCGTTGAACAGACACTGCTGTCAAAAGTTAAAGAAAATGACTCAGCTTTGCAACCGTCTGATCAAAGATGGAGTCATTCTGATGATGATGTGAAACATCCAACTGTTAAGAATGATGATCTTGCATCATTGCACCTTCTCTCATTGTTACAGAAGGGTTCGAGTCCAGTGATTGCAGGATATGGTGATGATGGTGTGAGTGTAGGCTCTGCAATTCACAATAAAAAGGAGGAAAGTACGCACAACGTTTCAAATCCAGGGAAGACATTAACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCTGTTTCTGCACAAAGGGGGGGTTCATCAGGATCTGTCAAAAGTGATGTTCCAGAACCATGCGATCCGATCACAGATGATGGTCTGTTGTCCAACAATGAAATTCGGCCCAGTATGATTAATCACGATCATGGTGTTCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGACAGTGGTTAAATCTGAATGGCCCTCCGCCTGGAATGGATTCTTCTCATCCCCATGCTAAGTTAGGACATAAGATGGGTGGCTATGATGGAGCAGCTGAAATGCCCTTTCCTCAAGAGGACAGTTTAATCATAAGTGATTCTATGAATCTTCAGAATCTCATGTCTATTGGGAATTCTGCTAGACCTCAACCTCTGTTCTCACACAACTCACAAGACAGTAATGCTGCAATCTTTAACCCTGCCTTCAAAGATGAAAGGCCTAGCATGGGAGGTCTGGAAGGGCTGCCTTTTTCAGCCAGCCTCTACGATCGGAGGGAGACTGAAATGCCACAATGGAAAGCTCCGGTTCATTCCAACTTCTCCCAGCTTCATCCCCAACAAACGAATAATGTCAAGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGATATAGCGTTGCCAGAAGGAATGGTTCATCACGGCTCACCATCTAATCATCAATTTGTATCAAATATGCTTCGTCCTCCTACCTCCGGATTATCTGGATTTGATCATTTGATTCATCACCCGATGATACAGCAGATGCAAACTTCAGGCAATCTTCCCCCACAGCATCTTCTTCAAGCGTTATCTAGAGGTGCACCTCTGCCTATGACAAACAGAAGCGTTCCTCTACATCCTCATTCCATCAGAGGTAGTGCAGCAACTCTCCAACCGAACAATCAGGTTCCTGGATTAATGCAGGAACAAAATTCAATCCAAGGTTTTCATACCGGTCAGCGTGTGCCCAATACTGGTGGTCCCAGAATTCCCTCGCCAGCTCCTGGTAACCAACCAGACGCAATTCAGAGGCTCATCCAAATGGGACATAGATCGAACTCGACCTCGAAGCAAATTCATCCGCTTTCTGCCAGTGGTGGCCATGGTCAGGGGATGTATGGTCACGAGTTGAACATGGGTTATGGGTACAGG
TAATATTGTAACACTGTACTTCGAGCATACTTGCCCAAATCCTCAATTCCTTGGATAGAGAATGGACACAATTTGCTCCTCTTAGTAGGATTGAGTGAGAAATGTAAGTAATGAATCCTTCTTATGCTTGTTTTTGTTGGTTTATCTTTCGCTTATTTCCCGTGTGGTTTCTAACTCGAAACATAGTTCCGAAGTAACCCGACAGACAAAATGAAATCAGTAGGTGAAGGAAAGTGCTGTGTTTCATTTTTTTGGGAGGCATTTTTAAGAAAGGAACAGAATTGGCTGCTATAGGGATAATACCCT
Coding sequence (CDS)
ATGAGCCTCAAGAAGGAGGATTTGAAGCCACACGATCAGACTGCTACAGTGAAGCATGATTTGCGAAAGAAACCGAAGTTTTCTTACACGAGGGATTTCCTCTTGTCCCTGAGCGGATTGGATGTTTGCAAAAAGTTGCCGAGCGGTTTTGACCAATCAGTTATCGCTGAATTAGAAGAAGCTTCCTATGATAGGCAAAGAGTTTCTGGAGGGTTGTCTTTGAATAGCTTTAGGCGCAATGAGTATGGTTCATCACCACCCAATAGGGCAGAAACGACTAATTATGCTCGTCGAATACATGGAAAGAAGGACATTAATTCTTCTGGACGGAGTGATAAGGATAGCGATTCACAATCTGACCGGGATTCAGTGGATTCTGGGTGGCGATTAAGTGATCATTCTAGGAGGCCTTCGCAGGGTCCTGAACAGGATGGACTTCTTGGTAGTGGTTCTTTTCCAAGACCACCTGGATATGCAACAGCATTTTCGGCACCAAAAGTTCGAGCACATGATCAGTACCAGCTAAATAGAAGCAATGAGCCATATCATCCACCTCGACCTTATAAGGCTGTAGCCCATCAACGAGGGAATACTCATGATTCGTACAATCATGAAACTTTTGGTTCTTCTGAGTTGACAAGTGAGGATAGAGTTGAAGAGGAAAGAAAGAGAAGAGCTTCATTTGAGTCGATGAGGAAAGAACAACATAGGGCATTTCAAGAAGGTCACAAGTCAAATCCTGTGAAGCAGAGAGATGGATTTGACATCCTAATGCAGTTGGACGAGGCTAAAGATGATAAGAAACTACTGAATACAAGCAGCGGTTTCGATGAACCTATCTCCTTACAATCTTCAAAAAATGATCGAGAAACTTTTTTTCCATCTCAGACAACTGTATCTAGGCCACTTGTGCCTCCTGGATTCACGAGCACTGTGTTGGAGAAAAACTTTGGAACAAGGTCTTCAGTTAATCCTCGTTTGCTTGAGGGTAAGGATGACGTCGACAAGTCTCTGCAAACCAAGGACAAGCAATTGCATAATGGGTTCTCTGAAGATTTAGAGGGAAAAAGTTCATTAGAGCAAATGGGTCGCCCTGAACATTATGGAAAAACAAGCACCAATGCTTCTACTAACAACACTGGTGAAAATATTATTCATCTGCTTTCGGCTGTAGACATGTCTAATCAAACAACTGGAACAGACGTTCAATCTCGTGAAAATTCTTTGGAAGTTTTTGAAGCTATTGAAAACAGTGCAGTTGATAATTGTAAGACTGAAATGGTGCCAGCGAATACAGCTGTTGGTGAAGCAAGTCAAGGCCATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTACCATAAAGTTAGATGGCGGTGCTACTAATTTTATTGAGCAGCAGGACAGTGAGAAGGATGATGCATGTAGCCCTCAAAATGCTCAATCTTCTAGATTCGCTCATTGGTTCATGGACAACGATAGGAAACAGGGGGATGACCTTTCACCTAAAAGGTCAATTGACTTGCTTACTATGATTGGAGCTGGAGAAAAGGGTGGATATGATTTTGTATCTGATGTGAAGCATTCTGAGCAATCTCTGCCCACAGTTGTCTTTCAGGGTTATGAATCTGCAGAAAGTTATATCACATCAAGTGCAACATCATCCAATGTTGCAAAGACTGAGCCATTCTATGATAAGAGTAAGCCGGAGGCTGTTTCTGCGATTCTTACCTGTGAGGCCGTTGAACAGACACTGCTGTCAAAAGTTAAAGAAAATGACTCAGCTTTGCAACCGTCTGATCAAAGATGGAGTCATTCTGATGATGATGTGAAACATCCAACTGTTAAGAATGATGATCTTGCATCATTGCACCTTCTCTCATTGTTACAGAAGGGTTCGAGTCCAGTGATTGCAGGATATGGTGATGATGGTGTGAGTGTAGGCTCTGCAATTCACAATAAAAAGGAGGAAAGTACGCACAACGTTTCAAATCCAGGGAA
GACATTAACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCTGTTTCTGCACAAAGGGGGGGTTCATCAGGATCTGTCAAAAGTGATGTTCCAGAACCATGCGATCCGATCACAGATGATGGTCTGTTGTCCAACAATGAAATTCGGCCCAGTATGATTAATCACGATCATGGTGTTCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGACAGTGGTTAAATCTGAATGGCCCTCCGCCTGGAATGGATTCTTCTCATCCCCATGCTAAGTTAGGACATAAGATGGGTGGCTATGATGGAGCAGCTGAAATGCCCTTTCCTCAAGAGGACAGTTTAATCATAAGTGATTCTATGAATCTTCAGAATCTCATGTCTATTGGGAATTCTGCTAGACCTCAACCTCTGTTCTCACACAACTCACAAGACAGTAATGCTGCAATCTTTAACCCTGCCTTCAAAGATGAAAGGCCTAGCATGGGAGGTCTGGAAGGGCTGCCTTTTTCAGCCAGCCTCTACGATCGGAGGGAGACTGAAATGCCACAATGGAAAGCTCCGGTTCATTCCAACTTCTCCCAGCTTCATCCCCAACAAACGAATAATGTCAAGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGATATAGCGTTGCCAGAAGGAATGGTTCATCACGGCTCACCATCTAATCATCAATTTGTATCAAATATGCTTCGTCCTCCTACCTCCGGATTATCTGGATTTGATCATTTGATTCATCACCCGATGATACAGCAGATGCAAACTTCAGGCAATCTTCCCCCACAGCATCTTCTTCAAGCGTTATCTAGAGGTGCACCTCTGCCTATGACAAACAGAAGCGTTCCTCTACATCCTCATTCCATCAGAGGTAGTGCAGCAACTCTCCAACCGAACAATCAGGTTCCTGGATTAATGCAGGAACAAAATTCAATCCAAGGTTTTCATACCGGTCAGCGTGTGCCCAATACTGGTGGTCCCAGAATTCCCTCGCCAGCTCCTGGTAACCAACCAGACGCAATTCAGAGGCTCATCCAAATGGGACATAGATCGAACTCGACCTCGAAGCAAATTCATCCGCTTTCTGCCAGTGGTGGCCATGGTCAGGGGATGTATGGTCACGAGTTGAACATGGGTTATGGGTACAGGTAA
Protein sequence
MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEEASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSDRDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNEPYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQEGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVSRPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLEQMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAVDNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNAQSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQGYESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSDQRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHNVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSNNEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEMPFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGLPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEGMVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPLPMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPGNQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR
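The relationship between the coding sequence and the protein sequence above can be checked with a short sketch. It assumes the standard genetic code (NCBI translation table 1), which the record itself does not state; the function name `translate` and the table-building constants are illustrative and not part of the database. Only the first 13 codons and the last 6 codons of the CDS are used here, copied from the record.

```python
# Minimal sketch: translate a CDS into protein using the standard genetic code.
# Assumption: NCBI translation table 1 applies to this nuclear gene.

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the polypeptide
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 13 codons of the CDS listed above.
print(translate("ATGAGCCTCAAGAAGGAGGATTTGAAGCCACACGATCAG"))  # MSLKKEDLKPHDQ
# Last 6 codons of the CDS, ending in the TAA stop codon.
print(translate("GGTTATGGGTACAGGTAA"))  # GYGYR
```

Both fragments reproduce the start (MSLKKEDLKPHDQ) and the end (GYGYR) of the protein sequence listed above; the terminal TAA is a stop codon and contributes no residue.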
Homology
BLAST of CmoCh06G011060 vs. ExPASy TrEMBL
Match:
A0A6J1F449 (uncharacterized protein LOC111442216 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442216 PE=4 SV=1)
HSP 1 Score: 2117.4 bits (5485), Expect = 0.0e+00
Identity = 1068/1068 (100.00%), Positives = 1068/1068 (100.00%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
Query: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
Query: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG
Sbjct: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
Query: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH
Sbjct: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM
Sbjct: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
Query: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG 900
LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG
Sbjct: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG 900
Query: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL
Sbjct: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG
Sbjct: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
Query: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1069
NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1068
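The percentages on the HSP statistics line are the match counts divided by the alignment length. The sketch below recomputes them from such a line; the function name `percent_stats` and the regular expression are hypothetical helpers written for this record's line layout, not part of any BLAST tool. The pattern matches any label starting with "Pos" so that spelling variants of that field are tolerated.

```python
import re

# Hypothetical pattern for the statistics line layout used in this record, e.g.
# "Identity = 1039/1068 (97.28%), Positives = 1048/1068 (98.13%), Query Frame = 0"
STAT_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Pos\w+ = (\d+)/(\d+)"
)


def percent_stats(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = STAT_LINE.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: " + line)
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 1039/1068 (97.28%), Positives = 1048/1068 (98.13%), Query Frame = 0"
identity, positives = percent_stats(line)
print(f"{identity:.2f} {positives:.2f}")  # 97.28 98.13
```

Identity counts only exact residue matches, while positives also count conservative substitutions, so the second value is never smaller than the first.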
BLAST of CmoCh06G011060 vs. ExPASy TrEMBL
Match:
A0A6J1FA86 (uncharacterized protein LOC111442216 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111442216 PE=4 SV=1)
HSP 1 Score: 2110.9 bits (5468), Expect = 0.0e+00
Identity = 1067/1068 (99.91%), Positives = 1067/1068 (99.91%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
Query: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIE QDSEKDDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIE-QDSEKDDACSPQNA 480
Query: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG
Sbjct: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
Query: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH
Sbjct: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM
Sbjct: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
Query: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG 900
LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG
Sbjct: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG 900
Query: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL
Sbjct: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG
Sbjct: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
Query: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1069
NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1067
BLAST of CmoCh06G011060 vs. ExPASy TrEMBL
Match:
A0A6J1IE91 (uncharacterized protein LOC111473257 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473257 PE=4 SV=1)
HSP 1 Score: 2053.1 bits (5318), Expect = 0.0e+00
Identity = 1039/1068 (97.28%), Positives = 1048/1068 (98.13%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
MSLKKEDLK HDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE
Sbjct: 1 MSLKKEDLKSHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRI GKKD+NSSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIQGKKDVNSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
RDSVDSGWR SDHSRRPSQGPE DGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE
Sbjct: 121 RDSVDSGWRFSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEE+KRRASFESMRKEQHRAFQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
EGH SNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS
Sbjct: 241 EGHNSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
QMGRPEHYGKTSTNASTNNT E+IIHLLSAVDMSNQTTGTDVQSREN+LEVFEAIENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTSESIIHLLSAVDMSNQTTGTDVQSRENALEVFEAIENSAV 420
Query: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGA NFIEQ DSEKDDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGAANFIEQHDSEKDDACSPQNA 480
Query: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
QSSRFAHWFMDNDRKQG+DLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSE+SLP V FQG
Sbjct: 481 QSSRFAHWFMDNDRKQGNDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEESLPRVAFQG 540
Query: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
QRWSHSD DVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH
Sbjct: 601 QRWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEP DPITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
NEIR SMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDG AE+
Sbjct: 721 NEIRLSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEI 780
Query: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG 900
LPFSASLYDRRETEMPQ KAPVHSNFSQLHPQQTNNVKFHQFESHPPN+NSQGDIALPEG
Sbjct: 841 LPFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVKFHQFESHPPNINSQGDIALPEG 900
Query: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
MVHHGSPSNHQFVSN LRPPTSGLSGFDHLIHHPM+QQMQTSGNLPPQHLLQALSRGAPL
Sbjct: 901 MVHHGSPSNHQFVSNKLRPPTSGLSGFDHLIHHPMMQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
PMTNRSVPLHPHSIRGSAA LQPNNQVPGLMQEQNSIQGFHT QRVPNT GPRIPSPAPG
Sbjct: 961 PMTNRSVPLHPHSIRGSAANLQPNNQVPGLMQEQNSIQGFHTSQRVPNTVGPRIPSPAPG 1020
Query: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1069
NQPDAIQRLIQMGHRSNS SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1068
BLAST of CmoCh06G011060 vs. ExPASy TrEMBL
Match:
A0A6J1IBP7 (uncharacterized protein LOC111473257 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111473257 PE=4 SV=1)
HSP 1 Score: 2046.6 bits (5301), Expect = 0.0e+00
Identity = 1038/1068 (97.19%), Positives = 1047/1068 (98.03%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
MSLKKEDLK HDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE
Sbjct: 1 MSLKKEDLKSHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRI GKKD+NSSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIQGKKDVNSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
RDSVDSGWR SDHSRRPSQGPE DGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE
Sbjct: 121 RDSVDSGWRFSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEE+KRRASFESMRKEQHRAFQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
EGH SNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS
Sbjct: 241 EGHNSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
QMGRPEHYGKTSTNASTNNT E+IIHLLSAVDMSNQTTGTDVQSREN+LEVFEAIENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTSESIIHLLSAVDMSNQTTGTDVQSRENALEVFEAIENSAV 420
Query: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGA NFIE DSEKDDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGAANFIE-HDSEKDDACSPQNA 480
Query: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
QSSRFAHWFMDNDRKQG+DLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSE+SLP V FQG
Sbjct: 481 QSSRFAHWFMDNDRKQGNDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEESLPRVAFQG 540
Query: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
QRWSHSD DVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH
Sbjct: 601 QRWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEP DPITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
NEIR SMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDG AE+
Sbjct: 721 NEIRLSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEI 780
Query: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVKFHQFESHPPNMNSQGDIALPEG 900
LPFSASLYDRRETEMPQ KAPVHSNFSQLHPQQTNNVKFHQFESHPPN+NSQGDIALPEG
Sbjct: 841 LPFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVKFHQFESHPPNINSQGDIALPEG 900
Query: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
MVHHGSPSNHQFVSN LRPPTSGLSGFDHLIHHPM+QQMQTSGNLPPQHLLQALSRGAPL
Sbjct: 901 MVHHGSPSNHQFVSNKLRPPTSGLSGFDHLIHHPMMQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
PMTNRSVPLHPHSIRGSAA LQPNNQVPGLMQEQNSIQGFHT QRVPNT GPRIPSPAPG
Sbjct: 961 PMTNRSVPLHPHSIRGSAANLQPNNQVPGLMQEQNSIQGFHTSQRVPNTVGPRIPSPAPG 1020
Query: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1069
NQPDAIQRLIQMGHRSNS SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1067
BLAST of CmoCh06G011060 vs. ExPASy TrEMBL
Match:
A0A6J1D0Y7 (uncharacterized protein LOC111016288 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016288 PE=4 SV=1)
HSP 1 Score: 1679.1 bits (4347), Expect = 0.0e+00
Identity = 872/1074 (81.19%), Positives = 944/1074 (87.90%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
MSL K+D HDQTAT+KH+L+KK K SYTRDFLLSLS LD+CKKLPSGFDQS+I+E E+
Sbjct: 1 MSLMKDDSNSHDQTATIKHELQKKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFED 60
Query: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
ASYDRQR+SGGLSLNSFRRNEYGSSPP+RAE NY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRISGGLSLNSFRRNEYGSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
RDSVDSGWR DHSRR QGPE DGLLGSGSFPRP GYAT FSAPKVRA++QYQLNRSNE
Sbjct: 121 RDSVDSGWRYGDHSRRSLQGPEHDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
PYHPPRPYKAVAH RGN +DSYNHETFGSSE TSEDRVEEE+KRRA FESMRKEQHRAFQ
Sbjct: 181 PYHPPRPYKAVAHPRGNINDSYNHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQ 240
Query: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
E KSNPVKQRD F I+MQLDE+KDDKKLLNTSSGFDE I LQ+SKNDRE FPS TTVS
Sbjct: 241 ESQKSNPVKQRDEFGIMMQLDESKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDD-VDKSLQTKDKQLHNGFSEDLEGKSSL 360
RPLVPPGFTS VLEK+FGT+SSVNP LE KDD VDKSLQTKD+ LHNG SEDL K+S
Sbjct: 301 RPLVPPGFTSNVLEKSFGTKSSVNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSS 360
Query: 361 EQMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSA 420
EQMG PE YGKTS NAS NNT E II L SAVDMSN+TTG DV+S E+SL+ +A EN A
Sbjct: 361 EQMGCPEQYGKTSINASANNTSEKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRA 420
Query: 421 VDNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQN 480
V +CKTE V ANTA+GE SQ HSSSILEKLF S IKLDGGATNFIEQ ++E +DACSPQN
Sbjct: 421 VADCKTEKVLANTAIGETSQVHSSSILEKLFCSAIKLDGGATNFIEQHENEMEDACSPQN 480
Query: 481 AQSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQ 540
QSS+FAHWF+DND KQ D +SPKRS DLLT+I GEKGGYD +SDV SEQSLPTV F
Sbjct: 481 TQSSKFAHWFVDNDGKQEDGVSPKRSNDLLTLIVGGEKGGYD-ISDVA-SEQSLPTVAFH 540
Query: 541 GYESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPS 600
GYESAESYITSS TSSN KTEPFYDKSKPEAVS+ILTCEAVEQTLLSK+ ENDSALQPS
Sbjct: 541 GYESAESYITSSETSSNAQKTEPFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPS 600
Query: 601 DQRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYG-DDGVSVGSAIHNKKEES 660
DQRWSHSD + KHPT K+DD AS HLLSLLQKG+SP+I GYG DDG ++G+ IHNKKEES
Sbjct: 601 DQRWSHSDANNKHPTGKSDDHASQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEES 660
Query: 661 THNVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLL 720
+HN+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGS K DV E PI DDGLL
Sbjct: 661 SHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSGKVDVSESHGPIMDDGLL 720
Query: 721 SNNEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAA 780
SNNEIRPSMINHDHG QRQQNQPD+VRGQWLNLNGP P +DSSHP AKLGHK+GGYDG A
Sbjct: 721 SNNEIRPSMINHDHGDQRQQNQPDLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPA 780
Query: 781 EMPFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGL 840
EMPFP+EDSLIISDSMN QNL+SIGNS +PQPLFSH++QD+N+AIFN AFKDERPSMGGL
Sbjct: 781 EMPFPEEDSLIISDSMNFQNLISIGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGL 840
Query: 841 EGLPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDIAL 900
EGLPFSAS +DRRETEMP KAPVHS+F QLHP Q NNVK FHQFESHPPNMNSQG++ L
Sbjct: 841 EGLPFSASPFDRRETEMPHRKAPVHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLL 900
Query: 901 PEGMVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRG 960
PEGMVHH SPSNHQFV+NMLRPPTSGLSGFDH IHHPM+QQ+QTS NLPPQHLLQ LSRG
Sbjct: 901 PEGMVHHDSPSNHQFVANMLRPPTSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRG 960
Query: 961 APLPMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSP 1020
AP PMTNRSVPLHPHS+RGSAA QPNNQV GL+QE NSIQGFH GQRVPN GGPRIPSP
Sbjct: 961 APPPMTNRSVPLHPHSVRGSAAPPQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSP 1020
Query: 1021 AP---GNQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1069
AP GNQPDAIQRLIQMGHRSN KQIHPLSAS GHGQG+YGHELNMGYGYR
Sbjct: 1021 APGIGGNQPDAIQRLIQMGHRSN-PPKQIHPLSAS-GHGQGIYGHELNMGYGYR 1069
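The percentages in an HSP header are the identical (or positive-scoring) column counts divided by the alignment length, gap columns included. The following is a minimal Python sketch that recomputes them from a header line in the format used on this page. The function name `parse_hsp_stats` is hypothetical and not part of any BLAST tool, and the pattern also accepts the variant spelling "Postives" seen in some exports.

```python
import re

def parse_hsp_stats(line):
    """Recompute identity/positives percentages from an HSP header line.

    Hypothetical helper: expects text such as
    'Identity = 872/1074 (81.19%), Positives = 944/1074 (87.90%)'.
    """
    # 'Posi?tives' matches both 'Positives' and the variant 'Postives'.
    pattern = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")
    stats = {}
    for label, matched, length in pattern.findall(line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matching columns / alignment length (gaps included).
        stats[key] = round(100.0 * int(matched) / int(length), 2)
    return stats

header = "Identity = 872/1074 (81.19%), Positives = 944/1074 (87.90%), Query Frame = 0"
print(parse_hsp_stats(header))
```

Applied to the Momordica charantia hit, the recomputed values should agree with the percentages printed in the header to two decimal places.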
BLAST of CmoCh06G011060 vs. TAIR 10
Match:
AT4G01290.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 1744 Blast hits to 1308 proteins in 219 species: Archae - 0; Bacteria - 241; Metazoa - 793; Fungi - 253; Plants - 108; Viruses - 0; Other Eukaryotes - 349 (source: NCBI BLink). )
HSP 1 Score: 459.9 bits (1182), Expect = 5.5e-129
Identity = 394/1121 (35.15%), Positives = 549/1121 (48.97%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPS---GFDQSVIAE 60
MS+ E DQ D KKP+ +YTR FL+SLS DVCKKLP+ FD++++ +
Sbjct: 2 MSIANEQQFAMDQLVETNDDSEKKPRITYTRKFLISLSEKDVCKKLPNLPGEFDEALLLD 61
Query: 61 LEEASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDS 120
E+ S +R R+SG S + FRRN+Y SSPP R E +R HG+ + S G +DKDSDS
Sbjct: 62 FEDPSPERARISGDFSSHGFRRNDYSSSPPTRGELGTNSRGTHGRWEGRSGGWNDKDSDS 121
Query: 121 QSDRDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNR 180
QSDRDS + G R SRR Q PE DGLLG GSFP+P G+ SAP+ +++D +QL+R
Sbjct: 122 QSDRDSGEPGRRSGMPSRRSWQAPEHDGLLGKGSFPKPSGFGAGTSAPRPQSNDSHQLSR 181
Query: 181 SNEPYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHR 240
+NEPYHPPRPYKA R + DS+N ETFGSS+ TSEDR EEERKRRASFE +RKE +
Sbjct: 182 TNEPYHPPRPYKAPPFTRRDARDSFNDETFGSSDSTSEDRAEEERKRRASFELLRKEHQK 241
Query: 241 AFQEGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQT 300
AFQE KSNP +++ FD L E+KDDK + S + ++ S N T PSQ+
Sbjct: 242 AFQERQKSNPDLRKNDFDFTELLGESKDDKGRPSRSDEVNHAPTIPGSSN---TSLPSQS 301
Query: 301 TVSRPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKS 360
RPLVPPGF ST+LEK G + E L +K + NG S + GK
Sbjct: 302 NAPRPLVPPGFASTILEKKQGEKPQTETSQYE-----RSPLNSKGINVVNGTSVNNGGKP 361
Query: 361 SLEQMGRPEHYGK-TSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQ-SRENSLEVFEAI 420
++G E + S+ + E +++ S + +S T D + +S+ I
Sbjct: 362 LGIKIGSSEMLIEGEDVRVSSTDANERAVNISSLLGISTDTVNKDKSFEKLSSISTPTEI 421
Query: 421 ENSAVDNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEK-DDA 480
+ + + K M E S G SIL+K+F + I L+ G ++ + +++ EK ++
Sbjct: 422 QGYPIKSEKATMTLGKKKSLEHSDG--PSILDKIFNTAINLNSGDSSNMNKKNVEKVEEI 481
Query: 481 CSPQNA-QSSRFAHWFMDNDRKQGDDL-SPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQS 540
SPQ +SS+FAH F++ D K + L S + LL+++ +K D K +
Sbjct: 482 RSPQTINKSSKFAHLFLEEDNKPVEVLPSSEPPRGLLSLLQGADKLQ---TFDTKANPDL 541
Query: 541 LPTVVFQGYESAES-YITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKE 600
FQG+ + + ++S++T+ +V AV +LTCE +EQ++LS+V +
Sbjct: 542 STDFPFQGHATKRTDQLSSTSTTKSVT------------AVPPVLTCEDLEQSILSEVGD 601
Query: 601 NDSALQPSDQRWSHSDDDVKHPTVKN--------DDLASLHLLSLLQKGSSPVIAGYGDD 660
+ P D D P+VK DD AS HLLSLLQ+ S P
Sbjct: 602 SYHPPPPP------VDQDTSVPSVKMTKQRKTSVDDQASQHLLSLLQRSSDP-----KSQ 661
Query: 661 GVSVGSAIHNKK--------------EESTHNVSNPGKTLTLETLFGSAFMKELQSVGAP 720
+ SA + + +T ++PGK+LTLE LFGSAFM ELQS+G P
Sbjct: 662 DTQLLSATERRPPPPSMKTTTPPPSVKSTTAGEADPGKSLTLENLFGSAFMNELQSIGEP 721
Query: 721 VSAQRGGSSGSVKSDVP-EPCDPITDDGLLSNNEIRPSMINHDHGVQRQQNQPDIVRGQW 780
VS + ++ SD P P G LS QR Q +PD
Sbjct: 722 VSGR------AMVSDAPGVPLRSERSIGELS---------------QRNQIRPD------ 781
Query: 781 LNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEMPFPQEDSLI-ISDSMNLQNLMSIGNSAR 840
GPP G+ + P++ +L+ + N MS S
Sbjct: 782 ----GPPGGV---------------------LALPEDGNLLAVGGHANPSKYMSFPGSHN 841
Query: 841 PQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGLPFSASLYDRRETEMPQWKAPVHSNFS 900
+P + N D AA+ N ++ERP+MGG +GL F
Sbjct: 842 QEPEVAFNISDKLAAL-NSGPRNERPTMGGQDGL------------------------FL 901
Query: 901 QLHPQQ--TN--------NVKFHQFESHPPNMNSQGDIALPEGMV--HHGSPSNHQFVSN 960
HPQQ TN FH F+S ++ Q D P + HH P NH+F N
Sbjct: 902 HQHPQQYVTNPSSHLNGSGPVFHPFDSQHAHVKPQLDFMGPGSTMSQHHDPPPNHRFPPN 961
Query: 961 ML-RP-----PTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPLPMTNRSVPL 1020
M+ RP PTSG FD L H M+Q+M NL HL+Q P P +
Sbjct: 962 MIHRPPFHHTPTSGHPEFDRLPPH-MMQKMHMQDNLQHHHLMQGFPGSGPQPHHS----- 991
Query: 1021 HPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPA-PGNQPDAIQR 1069
PH NNQ+PGL+ E N QGF R PN G P S G P ++Q
Sbjct: 1022 -PH----------VNNQMPGLIPELNPSQGFPFAHRQPNYGMPPPGSQVNRGEHPASLQT 991
BLAST of CmoCh06G011060 vs. TAIR 10
Match:
AT4G01290.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 1797 Blast hits to 1352 proteins in 216 species: Archae - 0; Bacteria - 202; Metazoa - 850; Fungi - 267; Plants - 109; Viruses - 0; Other Eukaryotes - 369 (source: NCBI BLink). )
HSP 1 Score: 456.1 bits (1172), Expect = 8.0e-128
Identity = 394/1121 (35.15%), Positives = 549/1121 (48.97%), Query Frame = 0
Query: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPS---GFDQSVIAE 60
MS+ E DQ D KKP+ +YTR FL+SLS DVCKKLP+ FD++++ +
Sbjct: 2 MSIANEQQFAMDQLVETNDDSEKKPRITYTRKFLISLSEKDVCKKLPNLPGEFDEALLLD 61
Query: 61 LEEASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDS 120
E+ S +R R+SG S + FRRN+Y SSPP R E +R HG+ + S G +DKDSDS
Sbjct: 62 FEDPSPERARISGDFSSHGFRRNDYSSSPPTRGELGTNSRGTHGRWEGRSGGWNDKDSDS 121
Query: 121 QSDRDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNR 180
QSDRDS + G R SRR Q PE DGLLG GSFP+P G+ SAP+ +++D +QL+R
Sbjct: 122 QSDRDS-EPGRRSGMPSRRSWQAPEHDGLLGKGSFPKPSGFGAGTSAPRPQSNDSHQLSR 181
Query: 181 SNEPYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHR 240
+NEPYHPPRPYKA R + DS+N ETFGSS+ TSEDR EEERKRRASFE +RKE +
Sbjct: 182 TNEPYHPPRPYKAPPFTRRDARDSFNDETFGSSDSTSEDRAEEERKRRASFELLRKEHQK 241
Query: 241 AFQEGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQT 300
AFQE KSNP +++ FD L E+KDDK + S + ++ S N T PSQ+
Sbjct: 242 AFQERQKSNPDLRKNDFDFTELLGESKDDKGRPSRSDEVNHAPTIPGSSN---TSLPSQS 301
Query: 301 TVSRPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKS 360
RPLVPPGF ST+LEK G + E L +K + NG S + GK
Sbjct: 302 NAPRPLVPPGFASTILEKKQGEKPQTETSQYE-----RSPLNSKGINVVNGTSVNNGGKP 361
Query: 361 SLEQMGRPEHYGK-TSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQ-SRENSLEVFEAI 420
++G E + S+ + E +++ S + +S T D + +S+ I
Sbjct: 362 LGIKIGSSEMLIEGEDVRVSSTDANERAVNISSLLGISTDTVNKDKSFEKLSSISTPTEI 421
Query: 421 ENSAVDNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEK-DDA 480
+ + + K M E S G SIL+K+F + I L+ G ++ + +++ EK ++
Sbjct: 422 QGYPIKSEKATMTLGKKKSLEHSDG--PSILDKIFNTAINLNSGDSSNMNKKNVEKVEEI 481
Query: 481 CSPQNA-QSSRFAHWFMDNDRKQGDDL-SPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQS 540
SPQ +SS+FAH F++ D K + L S + LL+++ +K D K +
Sbjct: 482 RSPQTINKSSKFAHLFLEEDNKPVEVLPSSEPPRGLLSLLQGADKLQ---TFDTKANPDL 541
Query: 541 LPTVVFQGYESAES-YITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKE 600
FQG+ + + ++S++T+ +V AV +LTCE +EQ++LS+V +
Sbjct: 542 STDFPFQGHATKRTDQLSSTSTTKSVT------------AVPPVLTCEDLEQSILSEVGD 601
Query: 601 NDSALQPSDQRWSHSDDDVKHPTVKN--------DDLASLHLLSLLQKGSSPVIAGYGDD 660
+ P D D P+VK DD AS HLLSLLQ+ S P
Sbjct: 602 SYHPPPPP------VDQDTSVPSVKMTKQRKTSVDDQASQHLLSLLQRSSDP-----KSQ 661
Query: 661 GVSVGSAIHNKK--------------EESTHNVSNPGKTLTLETLFGSAFMKELQSVGAP 720
+ SA + + +T ++PGK+LTLE LFGSAFM ELQS+G P
Sbjct: 662 DTQLLSATERRPPPPSMKTTTPPPSVKSTTAGEADPGKSLTLENLFGSAFMNELQSIGEP 721
Query: 721 VSAQRGGSSGSVKSDVP-EPCDPITDDGLLSNNEIRPSMINHDHGVQRQQNQPDIVRGQW 780
VS + ++ SD P P G LS QR Q +PD
Sbjct: 722 VSGR------AMVSDAPGVPLRSERSIGELS---------------QRNQIRPD------ 781
Query: 781 LNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEMPFPQEDSLI-ISDSMNLQNLMSIGNSAR 840
GPP G+ + P++ +L+ + N MS S
Sbjct: 782 ----GPPGGV---------------------LALPEDGNLLAVGGHANPSKYMSFPGSHN 841
Query: 841 PQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGLPFSASLYDRRETEMPQWKAPVHSNFS 900
+P + N D AA+ N ++ERP+MGG +GL F
Sbjct: 842 QEPEVAFNISDKLAAL-NSGPRNERPTMGGQDGL------------------------FL 901
Query: 901 QLHPQQ--TN--------NVKFHQFESHPPNMNSQGDIALPEGMV--HHGSPSNHQFVSN 960
HPQQ TN FH F+S ++ Q D P + HH P NH+F N
Sbjct: 902 HQHPQQYVTNPSSHLNGSGPVFHPFDSQHAHVKPQLDFMGPGSTMSQHHDPPPNHRFPPN 961
Query: 961 ML-RP-----PTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPLPMTNRSVPL 1020
M+ RP PTSG FD L H M+Q+M NL HL+Q P P +
Sbjct: 962 MIHRPPFHHTPTSGHPEFDRLPPH-MMQKMHMQDNLQHHHLMQGFPGSGPQPHHS----- 990
Query: 1021 HPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPA-PGNQPDAIQR 1069
PH NNQ+PGL+ E N QGF R PN G P S G P ++Q
Sbjct: 1022 -PH----------VNNQMPGLIPELNPSQGFPFAHRQPNYGMPPPGSQVNRGEHPASLQT 990
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
A0A6J1F449 | 0.0e+00 | 100.00 | uncharacterized protein LOC111442216 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1FA86 | 0.0e+00 | 99.91 | uncharacterized protein LOC111442216 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1IE91 | 0.0e+00 | 97.28 | uncharacterized protein LOC111473257 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1IBP7 | 0.0e+00 | 97.19 | uncharacterized protein LOC111473257 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1D0Y7 | 0.0e+00 | 81.19 | uncharacterized protein LOC111016288 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT4G01290.1 | 5.5e-129 | 35.15 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
AT4G01290.2 | 8.0e-128 | 35.15 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |
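The summary rows above are pipe-delimited, so they can be loaded for filtering or sorting with a few lines of code. This is a rough Python sketch under the assumption that each data row has the form `name | e-value | identity | description | ...`. The function name `parse_summary_rows` and the dictionary keys are hypothetical, and the sample rows are copied from this page with their truncated descriptions left as they appear.

```python
def parse_summary_rows(text):
    """Parse pipe-delimited BLAST summary rows into dictionaries.

    Hypothetical helper: header rows (first cell 'Match Name') and
    lines with fewer than four cells are skipped.
    """
    rows = []
    for line in text.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        if len(cells) < 4 or cells[0] == "Match Name":
            continue
        rows.append({
            "match": cells[0],
            "evalue": float(cells[1]),    # e.g. '5.5e-129' or '0.0e+00'
            "identity": float(cells[2]),  # percent identity
            "description": cells[3],
        })
    return rows

sample = """Match Name | E-value | Identity | Description | |
A0A6J1D0Y7 | 0.0e+00 | 81.19 | uncharacterized protein LOC111016288 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
AT4G01290.1 | 5.5e-129 | 35.15 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... | [more] |"""

rows = parse_summary_rows(sample)
# Keep only hits with at least 80% identity to the query.
close_hits = [row["match"] for row in rows if row["identity"] >= 80.0]
print(close_hits)
```

The 80% cutoff is an arbitrary illustration, not a threshold recommended by the database.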