Sequences
The following sequences are available for this feature:
Gene sequence (with introns)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
GATGTGGTTTAGGTTGATGAACAGAGACCAGAAATCCAGCACTCAAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATAGATCAAAACCCTAAAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTATTTTTGAATAATTCGTGTTTTGGATCCAAGTGTTTTTGTTTTTGTTTTGTTTCTTGTTATGTACCAAAACCCTAGTTTCAATTTGGAGCTCTCGGATTGAACAAAACCCCAGAATCTGTTCTGGTGCAACATCTACGATCACCTGTTATAACGAGAATTTTGTTTGTGCTGGATGGATCATGCGGGAATTGGATTTATGATTCGTGGAAAGCGCGACTCTACGTAATAATCTGTTTTTTGGGGTTGAGATTTTATTGAAAAAAGTGGTAAACGGGAGTTGGAGATTGATAATTTTACTGCGAGGAAGCTTGGGTGGATGGAGTTCCCATTTTGATGGATTTTGATGCGCATTCAGCTGGATCAACACATGGGATTGCCTCTGAGTTTGCGTTTTTAATGCTTTCTGACTTGAATTAGCTTCGATTGAACGAACAAATTCGAGGACAAGGCGATGGAGAGAAGTTTTTAGCGTGTTGCCTACATTGAATTAATTGGTTTTCTATAAAGAATGAGTCGGTATAGCATCCAATAAGTGTAGAATTAGCTTTGCGGCAATCTCAGGCGGCGTTGTTGTTCAGTAAACTGTTTCTGGTGCTGTCGAAGACTTTTGCATATTGATCGATTGGACTGATAACCTTGTTTCGAAGCAAGCAGAATGAGCCTCAAGAAGGATGATTCGAACTCACACGATCAGACCGCTACAGTAAAGCATGATCTGCAAAAGTAATTACCTACGATTTGCAGTTCCAATTACGATACATGTGGTTCATTTTCTTATTTCTTTCTGCTCCTCTATTATTTTTCTTCCATTGTGGATGCTGGCTACTATTGACTATATTTTTCCGATCGTTTTTAGGAAACCAAAGTTTTCTTACACGAGGGATTTCCTCTTGTCCCTGAGCGAATTGGATGTTTGCAAAAAGTTGCCGAGCGGTTTTGACCAATCAATCATCGCGTAAGATGCTCTTTCGTGCATCTATTTTAATTTTCCTGACTAGTGTTAAATTGTTATTATTCTTTGTATGCGCAACTGAATTCCACAGTTAGCCTTGTGGTAGTATTATTATTTTCCCTGTTGGATTATTTTTCTAAGCATGCGTTCCCCCTTTGTGTAGTGAATTTGAAGAAGCTTCATATGATAGGCAAAGAGTTTCTGGAGGGTTGTCTTTGAATAGTTTTAAGCGCAACGAGTATGGTTCATCACCACCCAGTAGGGCAGAAACGACTAATTATTCTCGTCGAATACATGGAAAGAGGGAAGTTCATTCCTCTGGACGGAGTGATAAGGATAGTGATTCACAATCCGATAGGGATTCAGGTAAGCTCTCATCTTAATCTATTATTAAGTGTGAATAACTTCATATCCTATCCGATTTGGTTAACTGTTGGGTTCCTGCTGTAGTTGATTTAATTGAAAAAAATTACTGCTCTCTGGGAAACAATTTCATTCATGCATGTGTTTGTAGTACGTTTATATTTATAATGATATTTCTGCCTGCTCTAACGCTGTTGACAAATGGAGCTCTGTCAGCATGACTCTATTCCAAAGCAGAGCACCTTGAGAACATGTTAATTGTATTGGGTTGCTATCCTCAATTGTTGTAGATTTGAGTTTGTTAAAACCAGTCAGTTCAGGATTTTTAATGTGAAATTACTACAATATTGACCCTTTGGATTATTCATTTTAACAGTGGATTCTGGGTGGCGATATGGTGATCATTCTAGAAGGTCTTCGCAGGGTCCTGAACATGATGGACTTCTTGGTAGTGGTTCTTTTCCAAGACCATCTGGATATGCAACAGGATTTTCGGCACCAAAGGTTCGAGCACATGATCAATACCAGCTCAACAAAAGCAATGAACC
ATATCATCCACCTCGTCCTTATAAGGTCTGCATTATATTTCCCAATCTCTCTGCTTGCAAACAAAATTTAAAATTAAATTTCTATCCTTGGATTATGGTTATACACTCAATCATATGTCTTTAAACATAAGTAGAATCATTTTGTTTAAATGATAAGTGAATCACTCAACAAGAATTTGAAATTATTTTGATTAACCTCACCTTATGAATGTTGTTCTTTTATTAATCAGGCTGTAGCCCATCAGCGAGGGAATACTAATGATTCATACAATCATGAAACTTTTGGCTCTTCTGAGTACACTAGTGAGGATAGGGTTGAAGAGGAAAAAAAAAGAAGAGGTAATCACTCTTTTCTCCATGATGCTTGATGTACGAACTGGTACATCTACTGATCTTGTTCAAATAATTCTATCCAACAACAACTTTTTGCTGAAAAACTTATTTGCAGCTTCATTTGAGTCAATGAGGAAAGAACAACATAGGGTATTTCAAGAAAGTCACAAGTCAAATCCTGTGAAGCAGAGAGATGAGTTTGACATCCTAATGCAGTTGGACGAGGCTAAAGATGATAAGAAATTACCGAATACAAGCAGTGGTTTCGATGATTCTATCTCCTTACAATCTTCAAAGAATGATCGAGAAAAATCTTTTCCATCTCAGGCAACCGTATCTAGGCCACTTGTGCCTCCAGGGTTCACGAGCACTGTGTTGGAGAAAAACTTTGGAACAAAGTCTTCAGTTATGCTGGAGGTGATTATTTGTTTTCATCGTATATTTGTGATTTTAATAGGTCTGACTGTGATGTATTTTCCCATTTTAAAGTTTCAATGTTGAATGACTTCAAGCTTCTATAGTGGGTATGTGCCTGTACAGTGTGTTTTTGTCATCAGCACGTGTCTAGTAGTCATATATCATTATAATAAATAGTTATTCTTCTTCTTACTGATATCTTTTGGCACAAGTTAACTCCCTTTATCCTTCATCAACTGTACGTATGTTTGTCACTTTTGTAGGGTAAGGATGATGTTGACAAGTGTGTGCAAGCTAAAGATGAACAATTGCGTAATGGGATCTCTGAAGACTTGGAGGAAAAAAGTTCATCAGAGCAAATGGGTTGCACTGAACAATATGGAAAAGCAAGCATCAATGCTTCTACTAACAACACTAGTGAAATGATTATTGACCTGTTTTCAGCTGTAGACATGTCTAATAAAACAACTGGAATGGATGTTCAATCACGTGAAAATTCTTTGGAAGTTTTTGAAGCTTCTGAAAACAGTGCAGTTGTTGATTGTAAGACTGAAAAGGTGCCAGCGAATACAGACAATGGTGAACCAAGTCAAGGCCATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTGCCATAAAGTTAGATGGTGGTGCTACTAATTTTATTGAGGTACTCATGAATGTAAATAAATCCTTTTCAGCCATGGTATTAATTTTCAGGTTTTGAATCAAAATCAACATTCTATGTGTGATATTATTCTTAGAAGGTACATGATCCTATTTTAGATTTCTAAAATGTCATAAGTTATGAAAAACAGGCGTCAATGCTTGAAAGTTATGCAGAATCAACCTCCGACATTTGGATGTCTTTTCTGAATTGTAATGTTATCACAGGAAATCATTTTATTGACGAATCATTTGGTTCCTATTCAATAGGGGATGAAATTGTTATGTAATTTGGAACTAGTTCTCGTTGCATATCTTTTCCTTGAATTCGTATGTAGTTATTTTCTGCTTGGGGAAAAATAAGTACCTCCTTTGCTTGTTTGATTTTCATTTAAGATCAAAAGTAACAAAATCGATTCAAGTTTGGATGCATCTATAACTGATGTGTGTTTTTCCGGGTCTTATGCAGCAGCATGACAATGAGATGGACGATGCATGTAGCCCTCAAAATGCTCAATCTTCCAAATTTGCTCATTGGTTCGTGGATAATGGTATGTCGGTAAAGTTTCTGTATTCAAAAGAT
TCCTTATATCAGTGATTTGTTCACTGATTAGCTTTGAGTTTTTAATGCCCATAGCTCATTTTTTTCACCTTGCAGATAGGAAACAGGAGGATGACCTTTCACCTAAAAGGTCAATTGACTTGCTTACTATGATTGGAGGTGGAGAAAAGGGTGGATATGATGTTGTATCTGATGTGAAGCATTCTGTGCAATCTCTGCCTACAGTTGCCTTTCAGGGTTATGAATCTGCAGAAAGTTTTATCACATCAAGTGCAACACCATCCAATGTTGCGAAAATTGAGCCATTTTATGATAAGAGTAAGCCAGAGGCTGTTTCTGCGATCCTTACCTGTGAAGCCGTTGAACAGACTCTGCTGTCAAAAATTAGTGAAAATGACTCAGCTTTGCAGCCGTCTGATCAAAGATGGAGTCACTCTGATGCTGATGTGAAGCATACAACTGTAAAAAATGATGATCATGCATCACTGCACCTTCTCTCACTGTTACAGAAGGGGTCAGGTCCAGTGATTGCAGGATATGGTGATGATGGTGTAAATATAGGCTCTGCATTTCACAGTAAAAAGGAGGAAAATACCCACAACATTTCAAATCCAGGGAAGACATTGACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCAGTTTCTGCACAAAGGGGTTCATCAGGATCTGTCAAAAGTGATGTTTCGGAGTCTCATGGTTCGATCACAGATGATGGTCTCTTGTCGAACAATGAAATTCGCCCCAGTATGATTAATCACGATCATGGTGATCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGGCAGTGGTTAAATCTGAATGGCCCTAGATCTGAATTGGATTCTTCTCATCCCCATGCGAAGTTAGGACACAAGATGGGCGGGTATGACGGACCAGCTGAAATGCCCTTTCCTGAAGAGGACAGTTTAATCATAAGCGATTCTATGAATTTTCAAAATCTCATTTCTATGGGACATTCTGCTAAACCTCAACCACTGTTCTCACACAACGCACAAGACAGTAATGCTGCAATTTTTAATCCTGCCTTCAAAGATGAAAGGCCAAGCATGGGAGGTCTGGAAGGGCTGCCTTTTTCAGCCAGCCCCTACGATAGAAGGGAGACTGAAATGCCACATCGGAAAGCTCCTGTTCATTCCAACTTTTCCCAGCTTCATCCCCAACACACAAATAATGTCAAGTTGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGATTTAATGTTGCCCGAAGGAATGGTTCATCACGACTCACCATCGAATCATCAATTTATAGCAAATATGTTTCGCCCTCCTACCAGTGGATTATCTGGATTTGATCATTCGATTCATCACCCGATGATGCAGCAGATGCAAACTTCAGTCAATCTTCCGCCTCAGCATCTACTTCAAGGATTATCTAGAGGTGCTCCTCCGCCTATGACAAACAGAAGCGTTCCTGTACATCCTCATTCTATCAGAGGTAGTGCAGCACCTCCCCAACCCAACAATCAGGTTCCTGGGTTAGTGCAGGAACTCAATTCAATCCAAGGTTTTCATATTGGTCAGCGTGTGCCTAATATTGGTAGTCCCAGAATACCCTCGCCAGGTAACTCTCATGAAGTTGTTGAACTTTAAGTACTATTTTATTTTTTCTTTTTAATATATATATATATATATACTGATAAACTAGTATCCATATCCTGGTTATCTGGTCATTATTTGTTTATTTTTTGCCCAATTGACACCTTTAGTCCTTCAGTGATATAAGCAAAATTATGATTCTCGACCCATTTGATCTGATTCATAATATCTTTTGTCCTTTAGTGATATAAGCAAAATTATGATTCTCGACCCGTTTGATCTGATTGTGCTCGAGACATTCTTTGCTATTAGACTAGTATTGTTATAATTAAATTTACCATAAACTATCAACTTAAGCTTTTGGGTTGATTGGTG
ATTTAAGATGGTATCAGAGCAAGAGGTAGGAGGTCCTGTGTTTGAACCCCTGTAAAGTCGTTTCCTCCCAAATTAATATTGATTTCCACTTGTTTGGTTTTTCTTCAAATTTCCAAGCCCACAAGTGATGGAGAGTGTTAGACTAGTATTGAAATAATTAAATTTACCTTAACCTATTAGTTTAAGCTTTTGGATTGATTGGTGATTTAACATTTACTTTTCCCTAACTTCGATCTATTTGTGCTTGATGCCTGTTGCGGAGTGCTTGTATGGTTGTGAATATATTATTAAAGTATTGATATAATTTAATTTACCATAACTCATATGCTTAAGCTTTTGGGTTGATTGGTGGATTTAACATGATATCAAAGTAGGAGGTCTCGTGTTCGAACTCCTAAAATATTATTTCTTTCCCCATTAATATTGATTTTTACTCGTTGGGCCTTTAAATTTCAGCTCCTAGTAACCAATCAGATGCAATTCAGAGGCTCATCCAAATGGGACATAGATCGAACTCGAAGCAAATTCATCCGCTTTCTGCCAGTGGTGGACATGGTCAGGGGATGTATGGTCACGAGTTGAACATGGGATATGGGTACAGGTAACTGTAACACTCTAACTTCGAGCATACTTGCCCAAATCCTCAATTCCTTGGATAGAGAATGGACACAATTTGCTCCTCTTAGTTAGTTGGATTGAGTGAGAAATGTAAGTAATGAATCCTTCTTATGGTTGTTTTTTTGTCGATTTGTCTTTCGCTTATATTTCCCGTACGGTTTCTATCTCGAAACGAAGTCCCAAGTATAACCTGATAAAGTGAAATCAGTAGGCAAAGGAAAGTCCAGTGTTTCATTTTTTGAGGGCATTTTTAAGAAATAGATTTGTCTACTATAGGGTTTTATAAAAGGGGGATCCCTTTTTATGAGCGAATTTATTCATTTGTCTCGAGGCGATGCAATGAGCTTTGAGGCTATGGCTAAGACTACGAAATATGGTCCATCTTTGCCAAAATTTGGCTCAGACAAAAATCAGTTTTGTTTCTTCTTCATATGCTGGATTGGATCCCTTCCATTCAAGAGTTTTAAAGAAATGAGAATGTGAAGTTTACAAGTAGAATAATTGCGACTGAG
mRNA sequence
GATGTGGTTTAGGTTGATGAACAGAGACCAGAAATCCAGCACTCAAATTTCTCTCTCGCGTCGATTACTCGGGAAATCCATAGATCAAAACCCTAAAAGCTTCCAATCCGCAAGAAGTTCGTGTCCATGGGGATTTATTTTTGAATAATTCGTGTTTTGGATCCAAGTGTTTTTGTTTTTGTTTTGTTTCTTGTTATGTACCAAAACCCTAGTTTCAATTTGGAGCTCTCGGATTGAACAAAACCCCAGAATCTGTTCTGGTGCAACATCTACGATCACCTGTTATAACGAGAATTTTGTTTGTGCTGGATGGATCATGCGGGAATTGGATTTATGATTCGTGGAAAGCGCGACTCTACGTAATAATCTGTTTTTTGGGGTTGAGATTTTATTGAAAAAAGTGGTAAACGGGAGTTGGAGATTGATAATTTTACTGCGAGGAAGCTTGGGTGGATGGAGTTCCCATTTTGATGGATTTTGATGCGCATTCAGCTGGATCAACACATGGGATTGCCTCTGAGTTTGCGTTTTTAATGCTTTCTGACTTGAATTAGCTTCGATTGAACGAACAAATTCGAGGACAAGGCGATGGAGAGAAGTTTTTAGCGTGTTGCCTACATTGAATTAATTGGTTTTCTATAAAGAATGAGTCGGTATAGCATCCAATAAGTGTAGAATTAGCTTTGCGGCAATCTCAGGCGGCGTTGTTGTTCAGTAAACTGTTTCTGGTGCTGTCGAAGACTTTTGCATATTGATCGATTGGACTGATAACCTTGTTTCGAAGCAAGCAGAATGAGCCTCAAGAAGGATGATTCGAACTCACACGATCAGACCGCTACAGTAAAGCATGATCTGCAAAAGAAACCAAAGTTTTCTTACACGAGGGATTTCCTCTTGTCCCTGAGCGAATTGGATGTTTGCAAAAAGTTGCCGAGCGGTTTTGACCAATCAATCATCGCTGAATTTGAAGAAGCTTCATATGATAGGCAAAGAGTTTCTGGAGGGTTGTCTTTGAATAGTTTTAAGCGCAACGAGTATGGTTCATCACCACCCAGTAGGGCAGAAACGACTAATTATTCTCGTCGAATACATGGAAAGAGGGAAGTTCATTCCTCTGGACGGAGTGATAAGGATAGTGATTCACAATCCGATAGGGATTCAGTGGATTCTGGGTGGCGATATGGTGATCATTCTAGAAGGTCTTCGCAGGGTCCTGAACATGATGGACTTCTTGGTAGTGGTTCTTTTCCAAGACCATCTGGATATGCAACAGGATTTTCGGCACCAAAGGTTCGAGCACATGATCAATACCAGCTCAACAAAAGCAATGAACCATATCATCCACCTCGTCCTTATAAGGCTGTAGCCCATCAGCGAGGGAATACTAATGATTCATACAATCATGAAACTTTTGGCTCTTCTGAGTACACTAGTGAGGATAGGGTTGAAGAGGAAAAAAAAAGAAGAGCTTCATTTGAGTCAATGAGGAAAGAACAACATAGGGTATTTCAAGAAAGTCACAAGTCAAATCCTGTGAAGCAGAGAGATGAGTTTGACATCCTAATGCAGTTGGACGAGGCTAAAGATGATAAGAAATTACCGAATACAAGCAGTGGTTTCGATGATTCTATCTCCTTACAATCTTCAAAGAATGATCGAGAAAAATCTTTTCCATCTCAGGCAACCGTATCTAGGCCACTTGTGCCTCCAGGGTTCACGAGCACTGTGTTGGAGAAAAACTTTGGAACAAAGTCTTCAGTTATGCTGGAGGGTAAGGATGATGTTGACAAGTGTGTGCAAGCTAAAGATGAACAATTGCGTAATGGGATCTCTGAAGACTTGGAGGAAAAAAGTTCATCAGAGCAAATGGGTTGCACTGAACAATATGGAAAAGCAAGCATCAATGCTTCTACTAACAACACTAGTGAAATGATTATTGACCTGTTTTCAGCTGTAGACATGTCTAATAAAACAACTGGAATGGATGTTCAATCACGTGA
AAATTCTTTGGAAGTTTTTGAAGCTTCTGAAAACAGTGCAGTTGTTGATTGTAAGACTGAAAAGGTGCCAGCGAATACAGACAATGGTGAACCAAGTCAAGGCCATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTGCCATAAAGTTAGATGGTGGTGCTACTAATTTTATTGAGCATGACAATGAGATGGACGATGCATGTAGCCCTCAAAATGCTCAATCTTCCAAATTTGCTCATTGGTTCGTGGATAATGATAGGAAACAGGAGGATGACCTTTCACCTAAAAGGTCAATTGACTTGCTTACTATGATTGGAGGTGGAGAAAAGGGTGGATATGATGTTGTATCTGATGTGAAGCATTCTGTGCAATCTCTGCCTACAGTTGCCTTTCAGGGTTATGAATCTGCAGAAAGTTTTATCACATCAAGTGCAACACCATCCAATGTTGCGAAAATTGAGCCATTTTATGATAAGAGTAAGCCAGAGGCTGTTTCTGCGATCCTTACCTGTGAAGCCGTTGAACAGACTCTGCTGTCAAAAATTAGTGAAAATGACTCAGCTTTGCAGCCGTCTGATCAAAGATGGAGTCACTCTGATGCTGATGTGAAGCATACAACTGTAAAAAATGATGATCATGCATCACTGCACCTTCTCTCACTGTTACAGAAGGGGTCAGGTCCAGTGATTGCAGGATATGGTGATGATGGTGTAAATATAGGCTCTGCATTTCACAGTAAAAAGGAGGAAAATACCCACAACATTTCAAATCCAGGGAAGACATTGACTCTTGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCAGTTTCTGCACAAAGGGGTTCATCAGGATCTGTCAAAAGTGATGTTTCGGAGTCTCATGGTTCGATCACAGATGATGGTCTCTTGTCGAACAATGAAATTCGCCCCAGTATGATTAATCACGATCATGGTGATCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGGCAGTGGTTAAATCTGAATGGCCCTAGATCTGAATTGGATTCTTCTCATCCCCATGCGAAGTTAGGACACAAGATGGGCGGGTATGACGGACCAGCTGAAATGCCCTTTCCTGAAGAGGACAGTTTAATCATAAGCGATTCTATGAATTTTCAAAATCTCATTTCTATGGGACATTCTGCTAAACCTCAACCACTGTTCTCACACAACGCACAAGACAGTAATGCTGCAATTTTTAATCCTGCCTTCAAAGATGAAAGGCCAAGCATGGGAGGTCTGGAAGGGCTGCCTTTTTCAGCCAGCCCCTACGATAGAAGGGAGACTGAAATGCCACATCGGAAAGCTCCTGTTCATTCCAACTTTTCCCAGCTTCATCCCCAACACACAAATAATGTCAAGTTGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGATTTAATGTTGCCCGAAGGAATGGTTCATCACGACTCACCATCGAATCATCAATTTATAGCAAATATGTTTCGCCCTCCTACCAGTGGATTATCTGGATTTGATCATTCGATTCATCACCCGATGATGCAGCAGATGCAAACTTCAGTCAATCTTCCGCCTCAGCATCTACTTCAAGGATTATCTAGAGGTGCTCCTCCGCCTATGACAAACAGAAGCGTTCCTGTACATCCTCATTCTATCAGAGGTAGTGCAGCACCTCCCCAACCCAACAATCAGGTTCCTGGGTTAGTGCAGGAACTCAATTCAATCCAAGGTTTTCATATTGGTCAGCGTGTGCCTAATATTGGTAGTCCCAGAATACCCTCGCCAGCTCCTAGTAACCAATCAGATGCAATTCAGAGGCTCATCCAAATGGGACATAGATCGAACTCGAAGCAAATTCATCCGCTTTCTGCCAGTGGTGGACATGGTCAGGGGATGTATGGTCACGAGTTGAACATGGGATATGGGTACAGGTAACTGTAACACTCTAACTTCG
AGCATACTTGCCCAAATCCTCAATTCCTTGGATAGAGAATGGACACAATTTGCTCCTCTTAGTTAGTTGGATTGAGTGAGAAATGTAAGTAATGAATCCTTCTTATGGTTGTTTTTTTGTCGATTTGTCTTTCGCTTATATTTCCCGTACGGTTTCTATCTCGAAACGAAGTCCCAAGTATAACCTGATAAAGTGAAATCAGTAGGCAAAGGAAAGTCCAGTGTTTCATTTTTTGAGGGCATTTTTAAGAAATAGATTTGTCTACTATAGGGTTTTATAAAAGGGGGATCCCTTTTTATGAGCGAATTTATTCATTTGTCTCGAGGCGATGCAATGAGCTTTGAGGCTATGGCTAAGACTACGAAATATGGTCCATCTTTGCCAAAATTTGGCTCAGACAAAAATCAGTTTTGTTTCTTCTTCATATGCTGGATTGGATCCCTTCCATTCAAGAGTTTTAAAGAAATGAGAATGTGAAGTTTACAAGTAGAATAATTGCGACTGAG
Coding sequence (CDS)
ATGAGCCTCAAGAAGGATGATTCGAACTCACACGATCAGACCGCTACAGTAAAGCATGATCTGCAAAAGAAACCAAAGTTTTCTTACACGAGGGATTTCCTCTTGTCCCTGAGCGAATTGGATGTTTGCAAAAAGTTGCCGAGCGGTTTTGACCAATCAATCATCGCTGAATTTGAAGAAGCTTCATATGATAGGCAAAGAGTTTCTGGAGGGTTGTCTTTGAATAGTTTTAAGCGCAACGAGTATGGTTCATCACCACCCAGTAGGGCAGAAACGACTAATTATTCTCGTCGAATACATGGAAAGAGGGAAGTTCATTCCTCTGGACGGAGTGATAAGGATAGTGATTCACAATCCGATAGGGATTCAGTGGATTCTGGGTGGCGATATGGTGATCATTCTAGAAGGTCTTCGCAGGGTCCTGAACATGATGGACTTCTTGGTAGTGGTTCTTTTCCAAGACCATCTGGATATGCAACAGGATTTTCGGCACCAAAGGTTCGAGCACATGATCAATACCAGCTCAACAAAAGCAATGAACCATATCATCCACCTCGTCCTTATAAGGCTGTAGCCCATCAGCGAGGGAATACTAATGATTCATACAATCATGAAACTTTTGGCTCTTCTGAGTACACTAGTGAGGATAGGGTTGAAGAGGAAAAAAAAAGAAGAGCTTCATTTGAGTCAATGAGGAAAGAACAACATAGGGTATTTCAAGAAAGTCACAAGTCAAATCCTGTGAAGCAGAGAGATGAGTTTGACATCCTAATGCAGTTGGACGAGGCTAAAGATGATAAGAAATTACCGAATACAAGCAGTGGTTTCGATGATTCTATCTCCTTACAATCTTCAAAGAATGATCGAGAAAAATCTTTTCCATCTCAGGCAACCGTATCTAGGCCACTTGTGCCTCCAGGGTTCACGAGCACTGTGTTGGAGAAAAACTTTGGAACAAAGTCTTCAGTTATGCTGGAGGGTAAGGATGATGTTGACAAGTGTGTGCAAGCTAAAGATGAACAATTGCGTAATGGGATCTCTGAAGACTTGGAGGAAAAAAGTTCATCAGAGCAAATGGGTTGCACTGAACAATATGGAAAAGCAAGCATCAATGCTTCTACTAACAACACTAGTGAAATGATTATTGACCTGTTTTCAGCTGTAGACATGTCTAATAAAACAACTGGAATGGATGTTCAATCACGTGAAAATTCTTTGGAAGTTTTTGAAGCTTCTGAAAACAGTGCAGTTGTTGATTGTAAGACTGAAAAGGTGCCAGCGAATACAGACAATGGTGAACCAAGTCAAGGCCATTCATCTTCAATCTTAGAAAAACTTTTTGGCAGTGCCATAAAGTTAGATGGTGGTGCTACTAATTTTATTGAGCATGACAATGAGATGGACGATGCATGTAGCCCTCAAAATGCTCAATCTTCCAAATTTGCTCATTGGTTCGTGGATAATGATAGGAAACAGGAGGATGACCTTTCACCTAAAAGGTCAATTGACTTGCTTACTATGATTGGAGGTGGAGAAAAGGGTGGATATGATGTTGTATCTGATGTGAAGCATTCTGTGCAATCTCTGCCTACAGTTGCCTTTCAGGGTTATGAATCTGCAGAAAGTTTTATCACATCAAGTGCAACACCATCCAATGTTGCGAAAATTGAGCCATTTTATGATAAGAGTAAGCCAGAGGCTGTTTCTGCGATCCTTACCTGTGAAGCCGTTGAACAGACTCTGCTGTCAAAAATTAGTGAAAATGACTCAGCTTTGCAGCCGTCTGATCAAAGATGGAGTCACTCTGATGCTGATGTGAAGCATACAACTGTAAAAAATGATGATCATGCATCACTGCACCTTCTCTCACTGTTACAGAAGGGGTCAGGTCCAGTGATTGCAGGATATGGTGATGATGGTGTAAATATAGGCTCTGCATTTCACAGTAAAAAGGAGGAAAATACCCACAACATTTCAAATCCAGGGAAGACATTGACTCT
TGAAACACTTTTTGGGTCTGCTTTTATGAAGGAGCTTCAGTCAGTTGGAGCTCCAGTTTCTGCACAAAGGGGTTCATCAGGATCTGTCAAAAGTGATGTTTCGGAGTCTCATGGTTCGATCACAGATGATGGTCTCTTGTCGAACAATGAAATTCGCCCCAGTATGATTAATCACGATCATGGTGATCAAAGACAGCAAAACCAACCAGATATAGTTCGTGGGCAGTGGTTAAATCTGAATGGCCCTAGATCTGAATTGGATTCTTCTCATCCCCATGCGAAGTTAGGACACAAGATGGGCGGGTATGACGGACCAGCTGAAATGCCCTTTCCTGAAGAGGACAGTTTAATCATAAGCGATTCTATGAATTTTCAAAATCTCATTTCTATGGGACATTCTGCTAAACCTCAACCACTGTTCTCACACAACGCACAAGACAGTAATGCTGCAATTTTTAATCCTGCCTTCAAAGATGAAAGGCCAAGCATGGGAGGTCTGGAAGGGCTGCCTTTTTCAGCCAGCCCCTACGATAGAAGGGAGACTGAAATGCCACATCGGAAAGCTCCTGTTCATTCCAACTTTTCCCAGCTTCATCCCCAACACACAAATAATGTCAAGTTGTTTCATCAATTTGAATCTCATCCTCCTAACATGAATTCTCAGGGAGATTTAATGTTGCCCGAAGGAATGGTTCATCACGACTCACCATCGAATCATCAATTTATAGCAAATATGTTTCGCCCTCCTACCAGTGGATTATCTGGATTTGATCATTCGATTCATCACCCGATGATGCAGCAGATGCAAACTTCAGTCAATCTTCCGCCTCAGCATCTACTTCAAGGATTATCTAGAGGTGCTCCTCCGCCTATGACAAACAGAAGCGTTCCTGTACATCCTCATTCTATCAGAGGTAGTGCAGCACCTCCCCAACCCAACAATCAGGTTCCTGGGTTAGTGCAGGAACTCAATTCAATCCAAGGTTTTCATATTGGTCAGCGTGTGCCTAATATTGGTAGTCCCAGAATACCCTCGCCAGCTCCTAGTAACCAATCAGATGCAATTCAGAGGCTCATCCAAATGGGACATAGATCGAACTCGAAGCAAATTCATCCGCTTTCTGCCAGTGGTGGACATGGTCAGGGGATGTATGGTCACGAGTTGAACATGGGATATGGGTACAGGTAA
Protein sequence
MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEEASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSDRDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNEPYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVSRPLVPPGFTSTVLEKNFGTKSSVMLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSEQMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAVVDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNAQSSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQGYESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSDQRWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGSVKSDVSESHGSITDDGLLSNNEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEMPFPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEGLPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPEGMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAPSNQSDAIQRLIQMGHRSNSKQIHPLSASGGHGQGMYGHELNMGYGYR
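The protein sequence above appears to be the direct translation of the coding sequence (CDS) under the standard genetic code: the CDS opens with ATG AGC CTC AAG AAG, which reads M S L K K, and closes with the stop codon TAA after GGG TAC AGG (G Y R). The following is a minimal sketch of that translation, assuming the standard nuclear code (NCBI translation table 1); the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
# Assumption: the CDS on this page uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence that was
    wrapped across several lines can be pasted in unchanged. Codons that
    contain anything other than A, C, G, T are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 13 codons of the CDS listed above.
    print(translate("ATGAGCCTCAAGAAGGATGATTCGAACTCACACGATCAG"))
```

Pasting the full CDS into `translate` and comparing the result with the listed protein sequence is a quick consistency check when either sequence has been copied or re-wrapped by hand.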
Homology
BLAST of Tan0004805 vs. NCBI nr
Match:
XP_023539105.1 (uncharacterized protein LOC111799853 isoform X3 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1803.5 bits (4670), Expect = 0.0e+00
Identity = 931/1069 (87.09%), Positives = 974/1069 (91.11%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D HDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR DHSRR SQGPEHDGLLGSGSFPRP GYAT FSAPKVR HDQYQLN+SNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRVHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEEKKRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E HKSNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNT E II L SAV MSN+TTG DVQSRENSLEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVGMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNAQ 480
V+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGATNFIE D+E DDACSPQNAQ
Sbjct: 421 VNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQDSEKDDACSPQNAQ 480
Query: 481 SSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQGY 540
SS+FAHWF+DNDRKQ DDLSPKRSIDLLTMIG GEKGGYD VSDVKHS QSLPTVAFQGY
Sbjct: 481 SSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVAFQGY 540
Query: 541 ESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSDQ 600
ESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQ+LLSK+ ENDSALQPSDQ
Sbjct: 541 ESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQSLLSKVKENDSALQPSDQ 600
Query: 601 RWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTHN 660
RWSHSDADVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+THN
Sbjct: 601 RWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHN 660
Query: 661 ISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSNN 720
+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSNN
Sbjct: 661 VSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSNN 720
Query: 721 EIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEMP 780
EIRPSMINHDHG RQQNQPD VRGQWLNLNGP +DSSHPHAKLGHKMGGYDGPAEMP
Sbjct: 721 EIRPSMINHDHGVLRQQNQPDKVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEMP 780
Query: 781 FPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEGL 840
FP+EDSLIISDSMN QNL+S+G+SA+PQPL SHN+QDSNAAIFNPAFKDERPSMGGLEGL
Sbjct: 781 FPQEDSLIISDSMNLQNLMSIGNSARPQPLLSHNSQDSNAAIFNPAFKDERPSMGGLEGL 840
Query: 841 PFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGD-LMLPE 900
PFSAS YDRRETEMP RKAPVHSNFSQLHPQ TNNVK FHQFESHPPNMNSQGD + LPE
Sbjct: 841 PFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDNIALPE 900
Query: 901 GMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAP 960
GMVHH SPSNHQF++NM RPPTSGLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 GMVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAP 960
Query: 961 PPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAP 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH GQRVPN G PRIPSPAP
Sbjct: 961 LPMTNRSVPLHPHSIRGSAATLQPNNQVPGLIQEQNSIQGFHTGQRVPNTGGPRIPSPAP 1020
Query: 1021 SNQSDAIQRLIQMGHRSN--SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHRSN SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 GNQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1068
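The percentages in each HSP summary line are the match counts divided by the alignment length, which includes gap columns; for the hit above, 931/1069 gives 87.09% identity. The sketch below recomputes such a percentage from a summary line. The helper names `parse_fraction` and `percent` are hypothetical, and the sample line uses the conventional BLAST label spelling "Positives".

```python
import re


def parse_fraction(line: str, label: str) -> tuple[int, int]:
    """Extract the 'matches/length' pair that follows `label` in a summary line."""
    m = re.search(rf"{re.escape(label)}\s*=\s*(\d+)/(\d+)", line)
    if m is None:
        raise ValueError(f"label {label!r} not found in line")
    return int(m.group(1)), int(m.group(2))


def percent(matches: int, length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / length, 2)


if __name__ == "__main__":
    summary = ("Identity = 931/1069 (87.09%), "
               "Positives = 974/1069 (91.11%), Query Frame = 0")
    for label in ("Identity", "Positives"):
        matches, length = parse_fraction(summary, label)
        print(label, percent(matches, length))
```

The alignment length (1069 here) exceeds the query length of 1063 residues because the alignment introduces gaps, shown as "-" in the Query rows.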
BLAST of Tan0004805 vs. NCBI nr
Match:
XP_022935295.1 (uncharacterized protein LOC111442216 isoform X2 [Cucurbita moschata])
HSP 1 Score: 1802.7 bits (4668), Expect = 0.0e+00
Identity = 930/1068 (87.08%), Positives = 973/1068 (91.10%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D HDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR DHSRR SQGPE DGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEE+KRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E HKSNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNT E II L SAVDMSN+TTG DVQSRENSLEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNAQ 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGATNFIE D+E DDACSPQNAQ
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQDSEKDDACSPQNAQ 480
Query: 481 SSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQGY 540
SS+FAHWF+DNDRKQ DDLSPKRSIDLLTMIG GEKGGYD VSDVKHS QSLPTV FQGY
Sbjct: 481 SSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQGY 540
Query: 541 ESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSDQ 600
ESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSDQ
Sbjct: 541 ESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSDQ 600
Query: 601 RWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTHN 660
RWSHSD DVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+THN
Sbjct: 601 RWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHN 660
Query: 661 ISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSNN 720
+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSNN
Sbjct: 661 VSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSNN 720
Query: 721 EIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEMP 780
EIRPSMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDG AEMP
Sbjct: 721 EIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEMP 780
Query: 781 FPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEGL 840
FP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEGL
Sbjct: 781 FPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGL 840
Query: 841 PFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPEG 900
PFSAS YDRRETEMP KAPVHSNFSQLHPQ TNNVK FHQFESHPPNMNSQGD+ LPEG
Sbjct: 841 PFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDIALPEG 900
Query: 901 MVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPP 960
MVHH SPSNHQF++NM RPPTSGLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAPS 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH GQRVPN G PRIPSPAP
Sbjct: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
Query: 1021 NQSDAIQRLIQMGHRSN--SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHRSN SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1067
BLAST of Tan0004805 vs. NCBI nr
Match:
XP_022974561.1 (uncharacterized protein LOC111473257 isoform X2 [Cucurbita maxima])
HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 928/1068 (86.89%), Positives = 974/1068 (91.20%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D SHDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKSHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRI GK++V+SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIQGKKDVNSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR+ DHSRR SQGPEHDGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRFSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEEKKRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E H SNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHNSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNTSE II L SAVDMSN+TTG DVQSREN+LEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTSESIIHLLSAVDMSNQTTGTDVQSRENALEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNAQ 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGA NFIEHD+E DDACSPQNAQ
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGAANFIEHDSEKDDACSPQNAQ 480
Query: 481 SSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQGY 540
SS+FAHWF+DNDRKQ +DLSPKRSIDLLTMIG GEKGGYD VSDVKHS +SLP VAFQGY
Sbjct: 481 SSRFAHWFMDNDRKQGNDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEESLPRVAFQGY 540
Query: 541 ESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSDQ 600
ESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSDQ
Sbjct: 541 ESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSDQ 600
Query: 601 RWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTHN 660
RWSHSDADVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+THN
Sbjct: 601 RWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHN 660
Query: 661 ISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSNN 720
+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSNN
Sbjct: 661 VSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSNN 720
Query: 721 EIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEMP 780
EIR SMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDGPAE+P
Sbjct: 721 EIRLSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEIP 780
Query: 781 FPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEGL 840
FP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEGL
Sbjct: 781 FPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGL 840
Query: 841 PFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPEG 900
PFSAS YDRRETEMP RKAPVHSNFSQLHPQ TNNVK FHQFESHPPN+NSQGD+ LPEG
Sbjct: 841 PFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNINSQGDIALPEG 900
Query: 901 MVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPP 960
MVHH SPSNHQF++N RPPTSGLSGFDH IHHPMMQQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 MVHHGSPSNHQFVSNKLRPPTSGLSGFDHLIHHPMMQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAPS 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH QRVPN PRIPSPAP
Sbjct: 961 PMTNRSVPLHPHSIRGSAANLQPNNQVPGLMQEQNSIQGFHTSQRVPNTVGPRIPSPAPG 1020
Query: 1021 NQSDAIQRLIQMGHR--SNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHR SNSKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1067
BLAST of Tan0004805 vs. NCBI nr
Match:
KAG6597140.1 (hypothetical protein SDJN03_10320, partial [Cucurbita argyrosperma subsp. sororia] >KAG7028609.1 hypothetical protein SDJN02_09790 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 931/1069 (87.09%), Positives = 973/1069 (91.02%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D HDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR DHSRR SQGPE DGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEEKKRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E HKSNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LE KDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEVKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNT E II LFSAVDMSN+TTG DVQSRENSLEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLFSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIE-HDNEMDDACSPQNA 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGATNFIE D+E DDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
Query: 481 QSSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQG 540
QSS+FAHWF+DNDRKQ DDLSPKRSIDLLTMIG GEKGGYD VSDVKHS QSLPTV FQG
Sbjct: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
Query: 541 YESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSD 600
YESAES+ITSSAT SNVAK EP YDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPVYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTH 660
QRWSHSD DVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+TH
Sbjct: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSN 720
N+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEM 780
NEIRPSMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDG AEM
Sbjct: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
Query: 781 PFPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEG 840
PFP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPE 900
LPFSAS YDRRETEMP RKAPVHSNFSQLHPQ TNNVK FHQFESHPPNMNSQGD+ LPE
Sbjct: 841 LPFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDMALPE 900
Query: 901 GMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAP 960
GMVHH SPSNHQF++NM RPPTSGLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 GMVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAP 960
Query: 961 PPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAP 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH GQRVPN G PRIPSPAP
Sbjct: 961 LPMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAP 1020
Query: 1021 SNQSDAIQRLIQMGHRSN--SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHRSN SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 GNQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1068
BLAST of Tan0004805 vs. NCBI nr
Match:
XP_023539103.1 (uncharacterized protein LOC111799853 isoform X1 [Cucurbita pepo subsp. pepo] >XP_023539104.1 uncharacterized protein LOC111799853 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1798.9 bits (4658), Expect = 0.0e+00
Identity = 931/1070 (87.01%), Positives = 974/1070 (91.03%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D HDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR DHSRR SQGPEHDGLLGSGSFPRP GYAT FSAPKVR HDQYQLN+SNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRVHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEEKKRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E HKSNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNT E II L SAV MSN+TTG DVQSRENSLEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVGMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIE-HDNEMDDACSPQNA 480
V+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGATNFIE D+E DDACSPQNA
Sbjct: 421 VNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
Query: 481 QSSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQG 540
QSS+FAHWF+DNDRKQ DDLSPKRSIDLLTMIG GEKGGYD VSDVKHS QSLPTVAFQG
Sbjct: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVAFQG 540
Query: 541 YESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSD 600
YESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQ+LLSK+ ENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQSLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTH 660
QRWSHSDADVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+TH
Sbjct: 601 QRWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSN 720
N+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEM 780
NEIRPSMINHDHG RQQNQPD VRGQWLNLNGP +DSSHPHAKLGHKMGGYDGPAEM
Sbjct: 721 NEIRPSMINHDHGVLRQQNQPDKVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEM 780
Query: 781 PFPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEG 840
PFP+EDSLIISDSMN QNL+S+G+SA+PQPL SHN+QDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLLSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGD-LMLP 900
LPFSAS YDRRETEMP RKAPVHSNFSQLHPQ TNNVK FHQFESHPPNMNSQGD + LP
Sbjct: 841 LPFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDNIALP 900
Query: 901 EGMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGA 960
EGMVHH SPSNHQF++NM RPPTSGLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGA
Sbjct: 901 EGMVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGA 960
Query: 961 PPPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPA 1020
P PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH GQRVPN G PRIPSPA
Sbjct: 961 PLPMTNRSVPLHPHSIRGSAATLQPNNQVPGLIQEQNSIQGFHTGQRVPNTGGPRIPSPA 1020
Query: 1021 PSNQSDAIQRLIQMGHRSN--SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
P NQ DAIQRLIQMGHRSN SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 PGNQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1069
BLAST of Tan0004805 vs. ExPASy TrEMBL
Match:
A0A6J1FA86 (uncharacterized protein LOC111442216 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111442216 PE=4 SV=1)
HSP 1 Score: 1802.7 bits (4668), Expect = 0.0e+00
Identity = 930/1068 (87.08%), Positives = 973/1068 (91.10%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D HDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR DHSRR SQGPE DGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEE+KRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E HKSNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNT E II L SAVDMSN+TTG DVQSRENSLEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNAQ 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGATNFIE D+E DDACSPQNAQ
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQDSEKDDACSPQNAQ 480
Query: 481 SSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQGY 540
SS+FAHWF+DNDRKQ DDLSPKRSIDLLTMIG GEKGGYD VSDVKHS QSLPTV FQGY
Sbjct: 481 SSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQGY 540
Query: 541 ESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSDQ 600
ESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSDQ
Sbjct: 541 ESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSDQ 600
Query: 601 RWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTHN 660
RWSHSD DVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+THN
Sbjct: 601 RWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHN 660
Query: 661 ISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSNN 720
+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSNN
Sbjct: 661 VSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSNN 720
Query: 721 EIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEMP 780
EIRPSMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDG AEMP
Sbjct: 721 EIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEMP 780
Query: 781 FPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEGL 840
FP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEGL
Sbjct: 781 FPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGL 840
Query: 841 PFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPEG 900
PFSAS YDRRETEMP KAPVHSNFSQLHPQ TNNVK FHQFESHPPNMNSQGD+ LPEG
Sbjct: 841 PFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDIALPEG 900
Query: 901 MVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPP 960
MVHH SPSNHQF++NM RPPTSGLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 MVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAPS 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH GQRVPN G PRIPSPAP
Sbjct: 961 PMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAPG 1020
Query: 1021 NQSDAIQRLIQMGHRSN--SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHRSN SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1067
BLAST of Tan0004805 vs. ExPASy TrEMBL
Match:
A0A6J1IBP7 (uncharacterized protein LOC111473257 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111473257 PE=4 SV=1)
HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 928/1068 (86.89%), Positives = 974/1068 (91.20%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D SHDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKSHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRI GK++V+SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIQGKKDVNSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR+ DHSRR SQGPEHDGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRFSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEEKKRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E H SNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHNSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNTSE II L SAVDMSN+TTG DVQSREN+LEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTSESIIHLLSAVDMSNQTTGTDVQSRENALEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNAQ 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGA NFIEHD+E DDACSPQNAQ
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGAANFIEHDSEKDDACSPQNAQ 480
Query: 481 SSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQGY 540
SS+FAHWF+DNDRKQ +DLSPKRSIDLLTMIG GEKGGYD VSDVKHS +SLP VAFQGY
Sbjct: 481 SSRFAHWFMDNDRKQGNDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEESLPRVAFQGY 540
Query: 541 ESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSDQ 600
ESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSDQ
Sbjct: 541 ESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSDQ 600
Query: 601 RWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTHN 660
RWSHSDADVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+THN
Sbjct: 601 RWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTHN 660
Query: 661 ISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSNN 720
+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSNN
Sbjct: 661 VSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSNN 720
Query: 721 EIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEMP 780
EIR SMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDGPAE+P
Sbjct: 721 EIRLSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEIP 780
Query: 781 FPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEGL 840
FP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEGL
Sbjct: 781 FPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEGL 840
Query: 841 PFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPEG 900
PFSAS YDRRETEMP RKAPVHSNFSQLHPQ TNNVK FHQFESHPPN+NSQGD+ LPEG
Sbjct: 841 PFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNINSQGDIALPEG 900
Query: 901 MVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPP 960
MVHH SPSNHQF++N RPPTSGLSGFDH IHHPMMQQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 MVHHGSPSNHQFVSNKLRPPTSGLSGFDHLIHHPMMQQMQTSGNLPPQHLLQALSRGAPL 960
Query: 961 PMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAPS 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH QRVPN PRIPSPAP
Sbjct: 961 PMTNRSVPLHPHSIRGSAANLQPNNQVPGLMQEQNSIQGFHTSQRVPNTVGPRIPSPAPG 1020
Query: 1021 NQSDAIQRLIQMGHR--SNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHR SNSKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 NQPDAIQRLIQMGHRSNSNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1067
BLAST of Tan0004805 vs. ExPASy TrEMBL
Match:
A0A6J1F449 (uncharacterized protein LOC111442216 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111442216 PE=4 SV=1)
HSP 1 Score: 1798.1 bits (4656), Expect = 0.0e+00
Identity = 930/1069 (87.00%), Positives = 973/1069 (91.02%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D HDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKPHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRIHGK++++SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIHGKKDINSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR DHSRR SQGPE DGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRLSDHSRRPSQGPEQDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEE+KRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEERKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E HKSNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHKSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNT E II L SAVDMSN+TTG DVQSRENSLEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTGENIIHLLSAVDMSNQTTGTDVQSRENSLEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIE-HDNEMDDACSPQNA 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGATNFIE D+E DDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGATNFIEQQDSEKDDACSPQNA 480
Query: 481 QSSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQG 540
QSS+FAHWF+DNDRKQ DDLSPKRSIDLLTMIG GEKGGYD VSDVKHS QSLPTV FQG
Sbjct: 481 QSSRFAHWFMDNDRKQGDDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEQSLPTVVFQG 540
Query: 541 YESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSD 600
YESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTH 660
QRWSHSD DVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+TH
Sbjct: 601 QRWSHSDDDVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSN 720
N+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPCDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEM 780
NEIRPSMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDG AEM
Sbjct: 721 NEIRPSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGAAEM 780
Query: 781 PFPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEG 840
PFP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPE 900
LPFSAS YDRRETEMP KAPVHSNFSQLHPQ TNNVK FHQFESHPPNMNSQGD+ LPE
Sbjct: 841 LPFSASLYDRRETEMPQWKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNMNSQGDIALPE 900
Query: 901 GMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAP 960
GMVHH SPSNHQF++NM RPPTSGLSGFDH IHHPM+QQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 GMVHHGSPSNHQFVSNMLRPPTSGLSGFDHLIHHPMIQQMQTSGNLPPQHLLQALSRGAP 960
Query: 961 PPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAP 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH GQRVPN G PRIPSPAP
Sbjct: 961 LPMTNRSVPLHPHSIRGSAATLQPNNQVPGLMQEQNSIQGFHTGQRVPNTGGPRIPSPAP 1020
Query: 1021 SNQSDAIQRLIQMGHRSN--SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHRSN SKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 GNQPDAIQRLIQMGHRSNSTSKQIHPLSASGGHGQGMYGHELNMGYGYR 1068
BLAST of Tan0004805 vs. ExPASy TrEMBL
Match:
A0A6J1IE91 (uncharacterized protein LOC111473257 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111473257 PE=4 SV=1)
HSP 1 Score: 1795.0 bits (4648), Expect = 0.0e+00
Identity = 928/1069 (86.81%), Positives = 974/1069 (91.11%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSLKK+D SHDQTATVKHDL+KKPKFSYTRDFLLSLS LDVCKKLPSGFDQS+IAE EE
Sbjct: 1 MSLKKEDLKSHDQTATVKHDLRKKPKFSYTRDFLLSLSGLDVCKKLPSGFDQSVIAELEE 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQRVSGGLSLNSF+RNEYGSSPP+RAETTNY+RRI GK++V+SSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRVSGGLSLNSFRRNEYGSSPPNRAETTNYARRIQGKKDVNSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWR+ DHSRR SQGPEHDGLLGSGSFPRP GYAT FSAPKVRAHDQYQLN+SNE
Sbjct: 121 RDSVDSGWRFSDHSRRPSQGPEHDGLLGSGSFPRPPGYATAFSAPKVRAHDQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAHQRGNT+DSYNHETFGSSE TSEDRVEEEKKRRASFESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHQRGNTHDSYNHETFGSSELTSEDRVEEEKKRRASFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
E H SNPVKQRD FDILMQLDEAKDDKKL NTSSGFD+ ISLQSSKNDRE FPSQ TVS
Sbjct: 241 EGHNSNPVKQRDGFDILMQLDEAKDDKKLLNTSSGFDEPISLQSSKNDRETFFPSQTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGFTSTVLEKNFGT+SSV +LEGKDDVDK +Q KD+QL NG SEDLE KSS E
Sbjct: 301 RPLVPPGFTSTVLEKNFGTRSSVNPRLLEGKDDVDKSLQTKDKQLHNGFSEDLEGKSSLE 360
Query: 361 QMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSAV 420
QMG E YGK S NASTNNTSE II L SAVDMSN+TTG DVQSREN+LEVFEA ENSAV
Sbjct: 361 QMGRPEHYGKTSTNASTNNTSESIIHLLSAVDMSNQTTGTDVQSRENALEVFEAIENSAV 420
Query: 421 VDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIE-HDNEMDDACSPQNA 480
+CKTE VPANT GE SQGHSSSILEKLFGS IKLDGGA NFIE HD+E DDACSPQNA
Sbjct: 421 DNCKTEMVPANTAVGEASQGHSSSILEKLFGSTIKLDGGAANFIEQHDSEKDDACSPQNA 480
Query: 481 QSSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQG 540
QSS+FAHWF+DNDRKQ +DLSPKRSIDLLTMIG GEKGGYD VSDVKHS +SLP VAFQG
Sbjct: 481 QSSRFAHWFMDNDRKQGNDLSPKRSIDLLTMIGAGEKGGYDFVSDVKHSEESLPRVAFQG 540
Query: 541 YESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSD 600
YESAES+ITSSAT SNVAK EPFYDKSKPEAVSAILTCEAVEQTLLSK+ ENDSALQPSD
Sbjct: 541 YESAESYITSSATSSNVAKTEPFYDKSKPEAVSAILTCEAVEQTLLSKVKENDSALQPSD 600
Query: 601 QRWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAFHSKKEENTH 660
QRWSHSDADVKH TVKNDD ASLHLLSLLQKGS PVIAGYGDDGV++GSA H+KKEE+TH
Sbjct: 601 QRWSHSDADVKHPTVKNDDLASLHLLSLLQKGSSPVIAGYGDDGVSVGSAIHNKKEESTH 660
Query: 661 NISNPGKTLTLETLFGSAFMKELQSVGAPVSAQR-GSSGSVKSDVSESHGSITDDGLLSN 720
N+SNPGKTLTLETLFGSAFMKELQSVGAPVSAQR GSSGSVKSDV E ITDDGLLSN
Sbjct: 661 NVSNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGGSSGSVKSDVPEPRDPITDDGLLSN 720
Query: 721 NEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEM 780
NEIR SMINHDHG QRQQNQPDIVRGQWLNLNGP +DSSHPHAKLGHKMGGYDGPAE+
Sbjct: 721 NEIRLSMINHDHGVQRQQNQPDIVRGQWLNLNGPPPGMDSSHPHAKLGHKMGGYDGPAEI 780
Query: 781 PFPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEG 840
PFP+EDSLIISDSMN QNL+S+G+SA+PQPLFSHN+QDSNAAIFNPAFKDERPSMGGLEG
Sbjct: 781 PFPQEDSLIISDSMNLQNLMSIGNSARPQPLFSHNSQDSNAAIFNPAFKDERPSMGGLEG 840
Query: 841 LPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPE 900
LPFSAS YDRRETEMP RKAPVHSNFSQLHPQ TNNVK FHQFESHPPN+NSQGD+ LPE
Sbjct: 841 LPFSASLYDRRETEMPQRKAPVHSNFSQLHPQQTNNVK-FHQFESHPPNINSQGDIALPE 900
Query: 901 GMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAP 960
GMVHH SPSNHQF++N RPPTSGLSGFDH IHHPMMQQMQTS NLPPQHLLQ LSRGAP
Sbjct: 901 GMVHHGSPSNHQFVSNKLRPPTSGLSGFDHLIHHPMMQQMQTSGNLPPQHLLQALSRGAP 960
Query: 961 PPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAP 1020
PMTNRSVP+HPHSIRGSAA QPNNQVPGL+QE NSIQGFH QRVPN PRIPSPAP
Sbjct: 961 LPMTNRSVPLHPHSIRGSAANLQPNNQVPGLMQEQNSIQGFHTSQRVPNTVGPRIPSPAP 1020
Query: 1021 SNQSDAIQRLIQMGHR--SNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHR SNSKQIHPLSASGGHGQGMYGHELNMGYGYR
Sbjct: 1021 GNQPDAIQRLIQMGHRSNSNSKQIHPLSASGGHGQGMYGHELNMGYGYR 1068
BLAST of Tan0004805 vs. ExPASy TrEMBL
Match:
A0A6J1D0L9 (uncharacterized protein LOC111016288 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016288 PE=4 SV=1)
HSP 1 Score: 1783.1 bits (4617), Expect = 0.0e+00
Identity = 920/1071 (85.90%), Positives = 977/1071 (91.22%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPSGFDQSIIAEFEE 60
MSL KDDSNSHDQTAT+KH+LQKK K SYTRDFLLSLSELD+CKKLPSGFDQSII+EFE+
Sbjct: 1 MSLMKDDSNSHDQTATIKHELQKKSKISYTRDFLLSLSELDICKKLPSGFDQSIISEFED 60
Query: 61 ASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDSQSD 120
ASYDRQR+SGGLSLNSF+RNEYGSSPPSRAE NYSRRIHGKREVHSSGRSDKDSDSQSD
Sbjct: 61 ASYDRQRISGGLSLNSFRRNEYGSSPPSRAEANNYSRRIHGKREVHSSGRSDKDSDSQSD 120
Query: 121 RDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNKSNE 180
RDSVDSGWRYGDHSRRS QGPEHDGLLGSGSFPRPSGYATGFSAPKVRA++QYQLN+SNE
Sbjct: 121 RDSVDSGWRYGDHSRRSLQGPEHDGLLGSGSFPRPSGYATGFSAPKVRANEQYQLNRSNE 180
Query: 181 PYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHRVFQ 240
PYHPPRPYKAVAH RGN NDSYNHETFGSSE TSEDRVEEEKKRRA FESMRKEQHR FQ
Sbjct: 181 PYHPPRPYKAVAHPRGNINDSYNHETFGSSEDTSEDRVEEEKKRRALFESMRKEQHRAFQ 240
Query: 241 ESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQATVS 300
ES KSNPVKQRDEF I+MQLDE+KDDKKL NTSSGFD+SI LQ+SKNDREK FPS TVS
Sbjct: 241 ESQKSNPVKQRDEFGIMMQLDESKDDKKLLNTSSGFDESIILQASKNDREKPFPSHTTVS 300
Query: 301 RPLVPPGFTSTVLEKNFGTKSSV---MLEGKDD-VDKCVQAKDEQLRNGISEDLEEKSSS 360
RPLVPPGFTS VLEK+FGTKSSV LE KDD VDK +Q KDE L NGISEDL EK+SS
Sbjct: 301 RPLVPPGFTSNVLEKSFGTKSSVNPHFLEVKDDVVDKSLQTKDEHLHNGISEDLVEKNSS 360
Query: 361 EQMGCTEQYGKASINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSA 420
EQMGC EQYGK SINAS NNTSE IIDLFSAVDMSNKTTG+DV+S E+SL+ +ASEN A
Sbjct: 361 EQMGCPEQYGKTSINASANNTSEKIIDLFSAVDMSNKTTGIDVESLESSLQALQASENRA 420
Query: 421 VVDCKTEKVPANTDNGEPSQGHSSSILEKLFGSAIKLDGGATNFIEHDNEMDDACSPQNA 480
V DCKTEKV ANT GE SQ HSSSILEKLF SAIKLDGGATNFIEH+NEM+DACSPQN
Sbjct: 421 VADCKTEKVLANTAIGETSQVHSSSILEKLFCSAIKLDGGATNFIEHENEMEDACSPQNT 480
Query: 481 QSSKFAHWFVDNDRKQEDDLSPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLPTVAFQG 540
QSSKFAHWFVDND KQED +SPKRS DLLT+I GGEKGGYD +SDV S QSLPTVAF G
Sbjct: 481 QSSKFAHWFVDNDGKQEDGVSPKRSNDLLTLIVGGEKGGYD-ISDVA-SEQSLPTVAFHG 540
Query: 541 YESAESFITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISENDSALQPSD 600
YESAES+ITSS T SN K EPFYDKSKPEAVS+ILTCEAVEQTLLSK+SENDSALQPSD
Sbjct: 541 YESAESYITSSETSSNAQKTEPFYDKSKPEAVSSILTCEAVEQTLLSKMSENDSALQPSD 600
Query: 601 QRWSHSDADVKHTTVKNDDHASLHLLSLLQKGSGPVIAGYG-DDGVNIGSAFHSKKEENT 660
QRWSHSDA+ KH T K+DDHAS HLLSLLQKG+ P+I GYG DDG N+G+ H+KKEE++
Sbjct: 601 QRWSHSDANNKHPTGKSDDHASQHLLSLLQKGTSPMIVGYGSDDGWNMGTGIHNKKEESS 660
Query: 661 HNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGSVKSDVSESHGSITDDGLLSN 720
HNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGS K DVSESHG I DDGLLSN
Sbjct: 661 HNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSSGSGKVDVSESHGPIMDDGLLSN 720
Query: 721 NEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSELDSSHPHAKLGHKMGGYDGPAEM 780
NEIRPSMINHDHGDQRQQNQPD+VRGQWLNLNGPR ELDSSHP AKLGHK+GGYDGPAEM
Sbjct: 721 NEIRPSMINHDHGDQRQQNQPDLVRGQWLNLNGPRPELDSSHPQAKLGHKIGGYDGPAEM 780
Query: 781 PFPEEDSLIISDSMNFQNLISMGHSAKPQPLFSHNAQDSNAAIFNPAFKDERPSMGGLEG 840
PFPEEDSLIISDSMNFQNLIS+G+S KPQPLFSH+ QD+N+AIFN AFKDERPSMGGLEG
Sbjct: 781 PFPEEDSLIISDSMNFQNLISIGNSIKPQPLFSHHTQDNNSAIFNSAFKDERPSMGGLEG 840
Query: 841 LPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNVKLFHQFESHPPNMNSQGDLMLPE 900
LPFSASP+DRRETEMPHRKAPVHS+F QLHP NNVKLFHQFESHPPNMNSQG+L+LPE
Sbjct: 841 LPFSASPFDRRETEMPHRKAPVHSSFPQLHPSQANNVKLFHQFESHPPNMNSQGELLLPE 900
Query: 901 GMVHHDSPSNHQFIANMFRPPTSGLSGFDHSIHHPMMQQMQTSVNLPPQHLLQGLSRGAP 960
GMVHHDSPSNHQF+ANM RPPTSGLSGFDHSIHHPM+QQ+QTSVNLPPQHLLQGLSRGAP
Sbjct: 901 GMVHHDSPSNHQFVANMLRPPTSGLSGFDHSIHHPMLQQIQTSVNLPPQHLLQGLSRGAP 960
Query: 961 PPMTNRSVPVHPHSIRGSAAPPQPNNQVPGLVQELNSIQGFHIGQRVPNIGSPRIPSPAP 1020
PPMTNRSVP+HPHS+RGSAAPPQPNNQV GLVQELNSIQGFHIGQRVPN+G PRIPSPAP
Sbjct: 961 PPMTNRSVPLHPHSVRGSAAPPQPNNQVSGLVQELNSIQGFHIGQRVPNMGGPRIPSPAP 1020
Query: 1021 ---SNQSDAIQRLIQMGHRSN-SKQIHPLSASGGHGQGMYGHELNMGYGYR 1063
NQ DAIQRLIQMGHRSN KQIHPLSAS GHGQG+YGHELNMGYGYR
Sbjct: 1021 GIGGNQPDAIQRLIQMGHRSNPPKQIHPLSAS-GHGQGIYGHELNMGYGYR 1068
BLAST of Tan0004805 vs. TAIR 10
Match:
AT4G01290.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 1744 Blast hits to 1308 proteins in 219 species: Archae - 0; Bacteria - 241; Metazoa - 793; Fungi - 253; Plants - 108; Viruses - 0; Other Eukaryotes - 349 (source: NCBI BLink). )
HSP 1 Score: 476.9 bits (1226), Expect = 4.3e-134
Identity = 395/1103 (35.81%), Positives = 562/1103 (50.95%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPS---GFDQSIIAE 60
MS+ + + DQ D +KKP+ +YTR FL+SLSE DVCKKLP+ FD++++ +
Sbjct: 2 MSIANEQQFAMDQLVETNDDSEKKPRITYTRKFLISLSEKDVCKKLPNLPGEFDEALLLD 61
Query: 61 FEEASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDS 120
FE+ S +R R+SG S + F+RN+Y SSPP+R E SR HG+ E S G +DKDSDS
Sbjct: 62 FEDPSPERARISGDFSSHGFRRNDYSSSPPTRGELGTNSRGTHGRWEGRSGGWNDKDSDS 121
Query: 121 QSDRDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNK 180
QSDRDS + G R G SRRS Q PEHDGLLG GSFP+PSG+ G SAP+ +++D +QL++
Sbjct: 122 QSDRDSGEPGRRSGMPSRRSWQAPEHDGLLGKGSFPKPSGFGAGTSAPRPQSNDSHQLSR 181
Query: 181 SNEPYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHR 240
+NEPYHPPRPYKA R + DS+N ETFGSS+ TSEDR EEE+KRRASFE +RKE +
Sbjct: 182 TNEPYHPPRPYKAPPFTRRDARDSFNDETFGSSDSTSEDRAEEERKRRASFELLRKEHQK 241
Query: 241 VFQESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQA 300
FQE KSNP ++++FD L E+KDDK P+ S + + ++ S N S PSQ+
Sbjct: 242 AFQERQKSNPDLRKNDFDFTELLGESKDDKGRPSRSDEVNHAPTIPGSSN---TSLPSQS 301
Query: 301 TVSRPLVPPGFTSTVLEKNFGTKSSVMLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGF ST+LEK G K E + +K + NG S + K
Sbjct: 302 NAPRPLVPPGFASTILEKKQGEKPQT--ETSQYERSPLNSKGINVVNGTSVNNGGKPLGI 361
Query: 361 QMGCTEQYGKA-SINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSA 420
++G +E + + S+ + +E +++ S + +S T D +S E + +E
Sbjct: 362 KIGSSEMLIEGEDVRVSSTDANERAVNISSLLGISTDTVNKD-KSFEKLSSISTPTEIQG 421
Query: 421 VVDCKTEKVPANTDNGEPSQGHSS--SILEKLFGSAIKLDGGATNFIEHDN--EMDDACS 480
K+EK T + S HS SIL+K+F +AI L+ G ++ + N ++++ S
Sbjct: 422 -YPIKSEKA-TMTLGKKKSLEHSDGPSILDKIFNTAINLNSGDSSNMNKKNVEKVEEIRS 481
Query: 481 PQNA-QSSKFAHWFVDNDRKQEDDL-SPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLP 540
PQ +SSKFAH F++ D K + L S + LL+++ G +K D K +
Sbjct: 482 PQTINKSSKFAHLFLEEDNKPVEVLPSSEPPRGLLSLLQGADK---LQTFDTKANPDLST 541
Query: 541 TVAFQGYESAES-FITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISEN- 600
FQG+ + + ++S++T +V + P +LTCE +EQ++LS++ ++
Sbjct: 542 DFPFQGHATKRTDQLSSTSTTKSVTAVPP------------VLTCEDLEQSILSEVGDSY 601
Query: 601 DSALQPSDQRWSHSDADV-KHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAF 660
P DQ S + K DD AS HLLSLLQ+ S P + SA
Sbjct: 602 HPPPPPVDQDTSVPSVKMTKQRKTSVDDQASQHLLSLLQRSSDP-----KSQDTQLLSAT 661
Query: 661 HSKK--------------EENTHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSS 720
+ + T ++PGK+LTLE LFGSAFM ELQS+G PVS
Sbjct: 662 ERRPPPPSMKTTTPPPSVKSTTAGEADPGKSLTLENLFGSAFMNELQSIGEPVSG----- 721
Query: 721 GSVKSDVSESHGSITDDGLLSNNEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSEL 780
++ VS++ G P G+ Q+NQ + +GP
Sbjct: 722 ---RAMVSDAPGV-------------PLRSERSIGELSQRNQ--------IRPDGP---- 781
Query: 781 DSSHPHAKLGHKMGGYDGPAEMPFPEEDSLI-ISDSMNFQNLISMGHSAKPQPLFSHNAQ 840
GG + PE+ +L+ + N +S S +P + N
Sbjct: 782 ------------PGGV-----LALPEDGNLLAVGGHANPSKYMSFPGSHNQEPEVAFNIS 841
Query: 841 DSNAAIFNPAFKDERPSMGGLEGLPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNV 900
D AA+ N ++ERP+MGG +GL P H +
Sbjct: 842 DKLAAL-NSGPRNERPTMGGQDGLFLHQHPQQYVTNPSSHL---------------NGSG 901
Query: 901 KLFHQFESHPPNMNSQGDLMLPEGMV--HHDSPSNHQFIANMF-RP-----PTSGLSGFD 960
+FH F+S ++ Q D M P + HHD P NH+F NM RP PTSG FD
Sbjct: 902 PVFHPFDSQHAHVKPQLDFMGPGSTMSQHHDPPPNHRFPPNMIHRPPFHHTPTSGHPEFD 961
Query: 961 HSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTNRSVPVHPHSIRGSAAPPQPNNQVP 1020
H MMQ+M NL HL+QG P P HS P NNQ+P
Sbjct: 962 RLPPH-MMQKMHMQDNLQHHHLMQGFPGSGPQP---------HHS-------PHVNNQMP 991
Query: 1021 GLVQELNSIQGFHIGQRVPNIGSPRIPSPAPSNQSD---AIQRLIQMGHRSN-SKQIHPL 1063
GL+ ELN QGF R PN G P P + N+ + ++Q L+ + R + +KQI +
Sbjct: 1022 GLIPELNPSQGFPFAHRQPNYGMP--PPGSQVNRGEHPASLQTLLGIQQRMDPAKQIPAV 991
BLAST of Tan0004805 vs. TAIR 10
Match:
AT4G01290.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 24 plant structures; EXPRESSED DURING: 15 growth stages; Has 1797 Blast hits to 1352 proteins in 216 species: Archae - 0; Bacteria - 202; Metazoa - 850; Fungi - 267; Plants - 109; Viruses - 0; Other Eukaryotes - 369 (source: NCBI BLink). )
HSP 1 Score: 473.0 bits (1216), Expect = 6.3e-133
Identity = 395/1103 (35.81%), Positives = 562/1103 (50.95%), Query Frame = 0
Query: 1 MSLKKDDSNSHDQTATVKHDLQKKPKFSYTRDFLLSLSELDVCKKLPS---GFDQSIIAE 60
MS+ + + DQ D +KKP+ +YTR FL+SLSE DVCKKLP+ FD++++ +
Sbjct: 2 MSIANEQQFAMDQLVETNDDSEKKPRITYTRKFLISLSEKDVCKKLPNLPGEFDEALLLD 61
Query: 61 FEEASYDRQRVSGGLSLNSFKRNEYGSSPPSRAETTNYSRRIHGKREVHSSGRSDKDSDS 120
FE+ S +R R+SG S + F+RN+Y SSPP+R E SR HG+ E S G +DKDSDS
Sbjct: 62 FEDPSPERARISGDFSSHGFRRNDYSSSPPTRGELGTNSRGTHGRWEGRSGGWNDKDSDS 121
Query: 121 QSDRDSVDSGWRYGDHSRRSSQGPEHDGLLGSGSFPRPSGYATGFSAPKVRAHDQYQLNK 180
QSDRDS + G R G SRRS Q PEHDGLLG GSFP+PSG+ G SAP+ +++D +QL++
Sbjct: 122 QSDRDS-EPGRRSGMPSRRSWQAPEHDGLLGKGSFPKPSGFGAGTSAPRPQSNDSHQLSR 181
Query: 181 SNEPYHPPRPYKAVAHQRGNTNDSYNHETFGSSEYTSEDRVEEEKKRRASFESMRKEQHR 240
+NEPYHPPRPYKA R + DS+N ETFGSS+ TSEDR EEE+KRRASFE +RKE +
Sbjct: 182 TNEPYHPPRPYKAPPFTRRDARDSFNDETFGSSDSTSEDRAEEERKRRASFELLRKEHQK 241
Query: 241 VFQESHKSNPVKQRDEFDILMQLDEAKDDKKLPNTSSGFDDSISLQSSKNDREKSFPSQA 300
FQE KSNP ++++FD L E+KDDK P+ S + + ++ S N S PSQ+
Sbjct: 242 AFQERQKSNPDLRKNDFDFTELLGESKDDKGRPSRSDEVNHAPTIPGSSN---TSLPSQS 301
Query: 301 TVSRPLVPPGFTSTVLEKNFGTKSSVMLEGKDDVDKCVQAKDEQLRNGISEDLEEKSSSE 360
RPLVPPGF ST+LEK G K E + +K + NG S + K
Sbjct: 302 NAPRPLVPPGFASTILEKKQGEKPQT--ETSQYERSPLNSKGINVVNGTSVNNGGKPLGI 361
Query: 361 QMGCTEQYGKA-SINASTNNTSEMIIDLFSAVDMSNKTTGMDVQSRENSLEVFEASENSA 420
++G +E + + S+ + +E +++ S + +S T D +S E + +E
Sbjct: 362 KIGSSEMLIEGEDVRVSSTDANERAVNISSLLGISTDTVNKD-KSFEKLSSISTPTEIQG 421
Query: 421 VVDCKTEKVPANTDNGEPSQGHSS--SILEKLFGSAIKLDGGATNFIEHDN--EMDDACS 480
K+EK T + S HS SIL+K+F +AI L+ G ++ + N ++++ S
Sbjct: 422 -YPIKSEKA-TMTLGKKKSLEHSDGPSILDKIFNTAINLNSGDSSNMNKKNVEKVEEIRS 481
Query: 481 PQNA-QSSKFAHWFVDNDRKQEDDL-SPKRSIDLLTMIGGGEKGGYDVVSDVKHSVQSLP 540
PQ +SSKFAH F++ D K + L S + LL+++ G +K D K +
Sbjct: 482 PQTINKSSKFAHLFLEEDNKPVEVLPSSEPPRGLLSLLQGADK---LQTFDTKANPDLST 541
Query: 541 TVAFQGYESAES-FITSSATPSNVAKIEPFYDKSKPEAVSAILTCEAVEQTLLSKISEN- 600
FQG+ + + ++S++T +V + P +LTCE +EQ++LS++ ++
Sbjct: 542 DFPFQGHATKRTDQLSSTSTTKSVTAVPP------------VLTCEDLEQSILSEVGDSY 601
Query: 601 DSALQPSDQRWSHSDADV-KHTTVKNDDHASLHLLSLLQKGSGPVIAGYGDDGVNIGSAF 660
P DQ S + K DD AS HLLSLLQ+ S P + SA
Sbjct: 602 HPPPPPVDQDTSVPSVKMTKQRKTSVDDQASQHLLSLLQRSSDP-----KSQDTQLLSAT 661
Query: 661 HSKK--------------EENTHNISNPGKTLTLETLFGSAFMKELQSVGAPVSAQRGSS 720
+ + T ++PGK+LTLE LFGSAFM ELQS+G PVS
Sbjct: 662 ERRPPPPSMKTTTPPPSVKSTTAGEADPGKSLTLENLFGSAFMNELQSIGEPVSG----- 721
Query: 721 GSVKSDVSESHGSITDDGLLSNNEIRPSMINHDHGDQRQQNQPDIVRGQWLNLNGPRSEL 780
++ VS++ G P G+ Q+NQ + +GP
Sbjct: 722 ---RAMVSDAPGV-------------PLRSERSIGELSQRNQ--------IRPDGP---- 781
Query: 781 DSSHPHAKLGHKMGGYDGPAEMPFPEEDSLI-ISDSMNFQNLISMGHSAKPQPLFSHNAQ 840
GG + PE+ +L+ + N +S S +P + N
Sbjct: 782 ------------PGGV-----LALPEDGNLLAVGGHANPSKYMSFPGSHNQEPEVAFNIS 841
Query: 841 DSNAAIFNPAFKDERPSMGGLEGLPFSASPYDRRETEMPHRKAPVHSNFSQLHPQHTNNV 900
D AA+ N ++ERP+MGG +GL P H +
Sbjct: 842 DKLAAL-NSGPRNERPTMGGQDGLFLHQHPQQYVTNPSSHL---------------NGSG 901
Query: 901 KLFHQFESHPPNMNSQGDLMLPEGMV--HHDSPSNHQFIANMF-RP-----PTSGLSGFD 960
+FH F+S ++ Q D M P + HHD P NH+F NM RP PTSG FD
Sbjct: 902 PVFHPFDSQHAHVKPQLDFMGPGSTMSQHHDPPPNHRFPPNMIHRPPFHHTPTSGHPEFD 961
Query: 961 HSIHHPMMQQMQTSVNLPPQHLLQGLSRGAPPPMTNRSVPVHPHSIRGSAAPPQPNNQVP 1020
H MMQ+M NL HL+QG P P HS P NNQ+P
Sbjct: 962 RLPPH-MMQKMHMQDNLQHHHLMQGFPGSGPQP---------HHS-------PHVNNQMP 990
Query: 1021 GLVQELNSIQGFHIGQRVPNIGSPRIPSPAPSNQSD---AIQRLIQMGHRSN-SKQIHPL 1063
GL+ ELN QGF R PN G P P + N+ + ++Q L+ + R + +KQI +
Sbjct: 1022 GLIPELNPSQGFPFAHRQPNYGMP--PPGSQVNRGEHPASLQTLLGIQQRMDPAKQIPAV 990
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_023539105.1 | 0.0e+00 | 87.09 | uncharacterized protein LOC111799853 isoform X3 [Cucurbita pepo subsp. pepo]
XP_022935295.1 | 0.0e+00 | 87.08 | uncharacterized protein LOC111442216 isoform X2 [Cucurbita moschata]
XP_022974561.1 | 0.0e+00 | 86.89 | uncharacterized protein LOC111473257 isoform X2 [Cucurbita maxima]
KAG6597140.1 | 0.0e+00 | 87.09 | hypothetical protein SDJN03_10320, partial [Cucurbita argyrosperma subsp. sorori...
XP_023539103.1 | 0.0e+00 | 87.01 | uncharacterized protein LOC111799853 isoform X1 [Cucurbita pepo subsp. pepo] >XP...
Match Name | E-value | Identity (%) | Description
A0A6J1FA86 | 0.0e+00 | 87.08 | uncharacterized protein LOC111442216 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1IBP7 | 0.0e+00 | 86.89 | uncharacterized protein LOC111473257 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1F449 | 0.0e+00 | 87.00 | uncharacterized protein LOC111442216 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1IE91 | 0.0e+00 | 86.81 | uncharacterized protein LOC111473257 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1D0L9 | 0.0e+00 | 85.90 | uncharacterized protein LOC111016288 isoform X2 OS=Momordica charantia OX=3673 G...
Match Name | E-value | Identity (%) | Description
AT4G01290.1 | 4.3e-134 | 35.81 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT4G01290.2 | 6.3e-133 | 35.81 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...