Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGTCATGTCCAATCAACAGCTTCCGCTTCAATGGCTCCCTTTGCGCGTGCCCACCAGGCCATCTTCTTAATCGGGCCAGCAATACCTGCGTTCTCTTCAGCAGCCCTTCATCCATCGTCATAGGCCGAGTTGAAAGCTATGGTGTTAGCTTCCCTGCCACCATTTTCTCGTTTGATTCTATTAGGAAGTTCACGCAGTCTCAGGCTGTGTTTCTTCAGGCCACTCTTGTAATGCTGGGTTCTTGGCTCTTCTTCTGCTTGTTTCTGAGATTCATGAAGCTTGGGGATGGCAAAAACTCTTGGTTCAGGATCAGATGGTGGGTTAGCAGACTGGACCTCTGCTTTTCCACTAGACATTGGCTGGTTAGCTGCCCTTTTGAATTTCTTGCTCTCTTCATGTGGTTTTTTATGATTTTTTGCCCTAGATTGTTAATGATTTTTCTGTTCATCTGTAGAATTTTCCTGCTGCTTTTTGCTACTTCGTGGCTTTCGTCATGACGTTTTTGTTTCGATTTCTTTATTTGATCTCTTTCACTTTGTGTTGTGGGATATTCCTCGATATCAGCCGTTTCAATTCTTAGAGAAGAGATGGGAAGTTGAAAGTTTGACATTTATCATTATGGTTTGCGGTTGGGGAAGTATACGAACTGTAAATTTCTTGGGTTAACTGGAATTCATATAAAGTTGATTGTTGTTGATCGATCAACCATTTTTCTTACTTTTGGAATCTCAACAACCATTTTGCTACCAAAACTGTCTCTACTCTTCAAGTTTCCTAGCCCTAGAATACCCCCTTTCTACTGGCAGTGAAATAGTTGGCCAATTTACAAAATCTCTCGCTTTACCTAAAGATTGTTTCTTTGTTTCTTTTCCCTTTCACTTGTGATATTAACAGTTGGACAGAACTTATCTATTTCAGGTTTCTTTTTGGGTAGTTTAGTATGCATTTTCATGTATTCTTTTTATCTGTCTGGTACTGGATTTCAATCTCCTCTAGTCAATGAAAATTAGCATTTCTCTTTTTAGTCCAAACCATAGTGTTTCAATTTTTCCTATTATAATTTGCTTTCCTTATCTGTGTTGAAGGATGATAAAAAAGTAGTCATGAAACGAAAAACTGAACTTGGTGGAACATTCTCAATAGCAAGTTGGATTCTTTTCGTTGGCTTGTTTGCTGCGTAAGTACCAATTTTCTCTATACTGTCATCTGTCATCTACTGAAAACAATTCTCTCATGTTGGTTTTGTATTGACTTGGCTTTGCAGTGTTTTCTCAGGTTGCTTTACCAAGTCATATCAAAAAGAAGCATTGAAGTGCATAATATCAAAGCAGCACATGCACCAGACATGGTTTCCTTTGTGAATGACATGGAATTTAATATAACCACAGTCTCTACTATGAGTTGTGCAAATATTCGTGATCTTGGTACTGTTGTGTTTGGGAATCCTGGTTTTCTGAAACAGAAAGTAATGCCTCTGTCAAATTTTGCAAACTACTCCTGCCATAACAAGAGTGAAGGGCCAACTATAAGTGTTCAGTGTGGAAGATGTCGATTCATTCAGGACAATATTTATATCTCATGGCAGTTTGTTGATCTTCCAAGTAATCCTGCCAGTGCTGTTGGATTTCAGTTTAACCTCTCTGCAAGGAATCATGCCAAAAATGATCATGCAAGTTTTCTTAGTGGATTATTAAAAAATGGAAGCAATTTTGATGATACACCAGTTACGTTCAGAGGGAAGAATGCGAATATAGTGCAATTCAATCTATTTCCAAGAATATACGGTGACCAACAGGATTCCAAGCTTATGCAGCCTTTATTTCATGAGTTTGTCCCTGGTTCATCCTTTCAAAAGACAAGCGAGCTCCAAACATCCCTTGAAAATTCCAATGATGGACTACTCAATGTCACCTTGTACATCAATCTCCTCTCTTCTTACATTGTCGAGATAGAGAATCAAAATGTTTTTAGCCCTGGTAAGTATCGATAAACATTATGCTATATAATCGAATGTTCTTTCAGGGATTAGTGTATCATATATACCCCCATAAACATTACATGAACAACCTGACTGCCACCTCATTATTAATTAAATTTCCATTAAGCTCTTTTTTTAATTTTTTATATTTAAACAAATTTTCATTAAGCTCTTAAGCTGCTAGAAAAATTATACCATAGACTCCTCACTCTACTACCACAATAAAACTGTCCAGAATCGGACAATCTCCACCACTATATTAGCCATACTATTATACTATTTTCTCTGGTTATGTATATCACTAGACACGTTATTTTATCACTCAAATAATGGCATTTCGTAGTCTGGATAATCTGAGATATGTGGGATAAGTGGATTGGATTCTGGACTCTAGTAACTTAGATTGCGCTGCCCGGTAGATTAGCTTTTGACGTTAGCCACTCTTGGCAGGAGTTCTTGTTTCGTACTTAACATCTTCTTCAAGGTTAGTTTTGGTCTTAAACTGCTAACTAGCTGGTGATTGTGGGCAAGAAAACTACAAGAATTGTCCTTGTGACTTTCTATTGTAGAAGTTTTGCTGTTGATTATTGTTTTTTAAAATTGTTTATGTAATTTGATCTTTGCTACTTCTGTTATTTCCACAAGCCTTTTGTTTGATTCTGTCATTGAAAATTTCTAATTCCTCTGTTTTAAACTGTGCCAAGTAATAGCCTAACACATCTGAAAATAGCTTCCATCCTTCTATTTCTACTTGAATACTTTGAAGTCTTCCATAAGAAGCTGTTCAATGTAGGTGACCTAACTCAAGCCAAATTCCAATTGTTAAGGCTTCGTTTGATAACCATTTGGTTTTTGGTTTTAAAAAATTTAGCTTTTTTTCTCCCAATTTCTCTACAATGAGTTTCATCTTTCTTAAGTAAACATTTGAATTCTTAGCCAAATTCTAAAAACACAAAAACATATTTTTTAAAACTACTCTATATTTTTCAAAATTTGGCTTGGTTTTTGAAAACATACATAGAAAGTAGATAGCAAAGCAAGGAAACTCATAGGTGGAAATAGTATTTATAGGCTGAATTTTCAAAAACCAAAAACGAAAAACGAAAAACAAAAAACCAAATGGTTATCAAACAAGCCTATTTGCCTTACTACATGCCTTGACAAAGTGGTTGCTAGAGTCCTTTCTGAACGCCTTAAAAAGGTTCTACCTCAAACGATCTCAAAATTCCAATCTACTTTTGTTGCTGACAGACTGATTATAGATGCCTCCCTTATTGCCAATGAACTCATAGAGGAATATACAAGGAGCAAAAAACAGGGAGTGGTGATCGAACTTGATATAGAAAAGGCCTTTGACATGATGGATTGGGAATTTGTAGACAATATCCTTGCTGCAAAAGGATTTGGCTATAGGTGGAGAAGATGGATCCATGGATGCAGTTCATCGAAAAAAATTTTCACTTATCATAAATGGGAAGCCTCGTGGCAAGATCAAAGCCTCGAGAGAGCTTAGACAAGGAAACCCCTTATCTCCCTTCCTTTTCATTCTCATCATTGATTGTCTTAGCAGAATGTTGACCCTTGAAGCATCTAGGGGTCATATTTGTGGAATGGAAATAAGAAGAGACAAGCTTGCCATAAATCATCTACTCTTCGCCGATGACACCATTCTCTTTTCTTCACCAAACAAAGACAAGATCACTGCTCTTTTTGCCACTGTGAAAATGTTTGAGGGAGCATCGGGTTTACGCATCAATCACCACAAATCTGAATTCATAGGCATACACGTGGACGAGGCAATTGTTGATGACCTTGCTAAAGAATTTGGCTATAAAGTGGCATCTTGGCCTACTACCTTGGATTGCCCCTCAATGGTAAGCCACATAAAACAACCTTATGGAATCCCCTCATAGAGAAGGTTGAGGGGTGCTTAAAAGGGTGGAATTGTTCCTATATATCAAAAGGACGAAGGCTTACCCTTATAAAAGGCACCTTGTCCAACCTCCCCACAAATTACCTCTCCCTTTTTAAAATCTCGTCGGACGCTGCCAAAGCTATATACCGGAAATTTTTGTGGAATGGTTATAAAGAAAAGTCGGGTTTCCACTCCATCAATTGGCTAACCGTCTGTCGACCTATAGAAGAAGGAGACCTTGGTTTCCCTGATATCAAGGCCAAAAACAAAAGCCTCCTAGCTAAATGGATATGGAGATTTCGTCTAGAGAAAAATTCTCTTTCGAGGAGAGTTTTGGCTGCAAAATATGGTACCACACACTTTGATCTCCATTCAGACTCCAAGGCTATAAAGTACCATTGTGGACCATGGAAACATATTGTGGGTGAACAGAGGCTGATTTTCGACAACATTTCATGTGTCATTGGCAATAGCAAACAAACTTCCTTTTGGTGCGATCGATGGCTTGGTGATATACCTCTCAAGAATGCATACCCTTTCCTTTATCCTTTATCCCAGAAAAAGAATGAATCTGTACATGAGATGTTTAAAATTGCAGAATTATCCTGGGAGCTTGGTCTCTGTCGGAATCTAAAAGAATCAAAAATTGAGGAATGGGCTAGGCTCACGAACCACCTTCCAAGCCATTTAGGATCAACAACCGAGGATACTTGGAATTGGCCTTTAGACAAAAAGGGCATCTTCACAACAAAGTCCCTTACACACAAAATAACCACCAACAATGACATTACTCACAGATTGTTGTACAAAAATCTTTGGAAGGGCTCCATTCCTAAAAGAGTCAGATTTTTTCTTTGGGAAGTCAGTCATGAAAGCATAAATACGATGGATACAATTCTAAAGAAGGCTCCGTGGTTAGTATCAACCCCTAATTGGTGCGTCCTTTGCAAAAATGCGAGTGAAACTATGGGACACATCTTCACCACCTGTCAGTTTACAACCAGTAAATGGGTGAGGCTTCTAAATGCTTCCAACCAACAACTTGTGTTTCCTAACTCTATGATTTCGATCCTTGAGGTTCTTTTGTTGGGTCACCCTTTCACACAAAGCAAGGCTTACGCGTGGGACAACATTGTTAAAGCGCTTCTCTGGTCCATATGGCTTGAAAGAAACAACAGGATCTTCAATGATAAAGCTAGAGAGCCAAACCTACTTCGACCACCTTCATTATCTTGCTTTTTCTTGGTGTAAATTCTCTCATTTTTTTGGTGATTACTATCTCATTTCCCTTCTTAATCTTAGCAATTGAGAAAATCTTTTGTAATTGGGTTACCACACCCCTTTTGTACATTTCATACAATCAATGAAATGATCTAAACGTTTCTCATAAAAAAATAATCAAACAAGATTAAACAAACTATTTTGTATAAAGATGCAAGAATAAATGATCCTACTCCTTCCTCATTCATACCACCCTGAGGATACAATGGCACTAACAAAATGAAGGTAATTGGTAAGGGAACTAATAATGATTTATGGGTAGGAGTAGTATGGTATTAGTAAAGGAATAAGGGTAATTAGTTAGGGAGCTAGTTAGTGTTTTTGGTTATAAATAGAGGGAGGGAGATTGAGGGAAGACAGACAATTGTTGTGTGATTTAGGGCTTGGGTTAGCATACTCAAGAGAAGGGAGGTTCCAAGTGCCTTATACTTGGTTTATCTGATAGCTTTTTATCATTTTCATTTCAATATATTTCAGTTCTTAATTCTTGTTAGGTGGTATACTAACAATTGGTATCAAAGCATCAGTTCTGACCATGGAATAGTTGGACATGGATTGTTTGGATTGTCTGATAGAGTTGGTGTTGGCAACTCAGCAGAGATGCGAGGAAACACATCAAATCGTGAAAGAAATGTGAGAGCGTTGTGAGGAACTCAATTGCTAAAAGATGGAGAGAGAACAAAAATGTCAAAGGGAAAATGATGACTGTAGCAGGGTTGGAAAAATTGAAGAAGCAATGAAGAAAAAGGGAGCAAGGAAAAATTGAGTAGATCAAGATGAGGCAGCAAAATCAACTCGAAATTATCCATTTGCAGTTGAAGAGTGGTGGTAAGGCCTCGATCTTATGCGAGGGCATCTTCGAAAGCGCAAATTTTGAAAGTGTTTTTGTTCAAGGAAACCGAGGGCTCATACAAGAGTTATCAATATTCAAAAGAGTGGCGGTCTTGGATTTGGAGTTGCATTAGGACAATAAGGTATTCTATCGTGGTTCATGGCAAACCTAGGGGTAGGATTCTTGCTTCTAGAGGACTCAGACAAGATGACCCTCTATCTCCCTTTCTTTTCCTCTTGGTGGTGGATGTCTTGAGTAGGATTATTTCTAAAGGGGTGGAAGGAAATATTCTGGAAGGATTCAAGGTGGGTAAGAATAAGGTGTCTCGTCTCATCTTCAGTTTGCTGATGATACAATCTTTTTTTGTTTAGGTAGGGATGAGTCTTTCTGCAATTTGAATCAGATTCTTGTGTTTTTTTAGGCCATTTTGGGCCTCGAATCAATAGAAGTAAATGCCAGGTGTTAGGCCTCAATTGCGATCATGCTAAATTGGAAAGATGGGCGTCTTTAGTGCAATGTGCGATTGGTACATTCCCTACCTCCTACTTAGGGCTTCCTCTTGGCCATAATCCAAGAAGTATATCTTTTTGGGAATCGGTTTTGAACAAAACACGCAAAAGACTGGCGTCCTGGAGAAAGAGTTTCTTTTCTAAAGGCGAGAGACTTACTCTTACCCAGTCCGTCTTGAGCGGCATTCCAACGTACTTTATGTCTCTTTTTAGAATCCTGAACTCTGTCAGTAAGGGAATTGAGAAGCTCATGAGAGATTTCCTTTGGGAAGGGGTTGATGAAGGTAAAAGCTTGCATCTTGTAAATTGGGAGGTGGTCTCTAAGCCACTTAATCAGGGGGTTTAGGGATTGGTAATGTAAGGGCAAGAAACAAGGCCCTTTTAGCTAAATGGTTATGGTGATTCCATAATGAACCCAATACCCTTTGGCACAAGATTATTGTAAGCAAGGATGTCCTTCATCCCTTCGAGTGGACTTTTGGTGGAGTTCTTGGCACTTCTAGAAATCCGTGGAAAGAGATTTCGTCTCAACTCCCTGTTTTTTCGCATTTGTTCGTTGTGTGGTGGGAAATGGGAGGGACACTTATTTTTGGGAAGACCATTGGGTGGGGAATAGACCTCTTTGTTCCTTGTATCCTCATTTATATCATCTATCCACTTTAAAAAACCACTCGGTAGCTTTGATTTTTTCTAGCAATTCATCTACGCATGTTGTCGCTCCGTCTCCTTCCCTTGGATTCAGTCGTGCTTTGACCAATAGGGAAATGACCGAAGTTATTTCTTTATTGTCTTTGCTCAATCAGTGTCGACTCTACCCTCATTGAAGCGATGCCCGCCTTTGGACCCCCTCTCCTTCTAGAGGCTTTTCATGCAACTCTTTCTTTCATTGTCTGGTGAGCCCTGTCGCGTCGCGTGCTTCTGTATTCTCCTCTCTCTGGAAGGTGAAGGTTCTGAAGAAGGTTAAGTTTTTTGTCTGGCAAGTTTGGCACAGAAGGGTGAATACTTTGGATCGTTTGTTGGCTAGGGGGTCTCCCTTGGTTGGGCCATTCTGTTGTATTCTTTGCAGGAGGGCAGACGAGACCCTGGATCATATTCTCTGGAGCTGTGACTTTGCTCGAGTTATTTGGAGTTCTTTCTTTCAACAATTCAACTTTTGTTTCGTTGGTTACTTGGATTGTAGGGAGATGTTCATGAAGCTTCTTCTCAATTCGCTTTTTCGGGAGAAAGGCTTATTTTTGTGGCAGGCTGGAGTTTGTGCTATTTTTTGGATTCTGTGGCGGGAGAGAAACAATAGAATCTTTAGGGGAAAAGAGAGCCATCCTTTGGAAGTGTGGTCCCTCATTAGATTCTATGTTTCTCTTTGGGCCTCGGTGTTGGGGTTCTTCTGTAATTATTCTCTAAATATTATTTTGCTTGATTGGAGCCCCCTCGTTATAGTTGGCTCCCCTTTTTTTGTGGGCTTCTTTTTTGTACGCCCGTGTATTCATTCATTTTTATCTCAATAAAAGTTTGATTTTTCATCAAAAACAAAAAAAAATTCAAAAGAGTGGCGGAAGGGAAGAGAAAGAGGAGGAAGGTGGAATGGGAGACTCCAACGATGGAAAGTCCGCAAGGAATATGAAGAGAAGTGATGGAGGGGAGGCCGATTTGTTTCCTATCGGAGTTCGATGAACGTGCACAAGGGGGCTGGAATAATACTCGGCAGCAGAATCAGAGACTCAAGGGAGAAGTTGGCAACGAATACTACTGGAGCCGAAAAGTCGAGGAAGGCGTCAACGACAAGAAAGTCATTGTTTCCTTGAAAAAAGGAGGAAATCAGCCGACAGAGACCCTTGCTGGCGGCAAATACTGGTACAATTGGAAATAAAGACACGATGGGAGGCTAAACGACAGAGGAGGGAAAGTTTTCCGTAGTTATTGCTAGGGTTACATTGTAGAAAAAATCCATTAGTTGGGAATGGGGCTTAAACAAGCCCAATATGGTAAAATAGATCAGGGGGCTTGATAGGTTGATTTTCAATGGGCTTGTCTTGAATGGATTGGTTGAAGGGTTCATTTATTTCCAAAGACAGAAATTGGAGCTATTATACAAGATTGGGTCTTGGGTTGGGTTGCTCTATATTTTAAAATTAGGATCCAACCCGCTTTATCATCTTATTTACGTAAAAAGAACCCTAGCTGTATTTATGTTCTGCCTTACAATTATGTTGCCGACCATCTCCATTAGGCAAGCTCACGCTGTTGGCCATTAGCACCATTCATCACCGCCAATCATCGATACCGTCCATGGACATATGCTAAGTTCCTTGACCAGCTGGGGTTTGTTTTAAAAAAAAGAGTTTTTTTCTTTTGTCATTATCCAATTTTGGGATTGTTTTGTTAAACACATCTTGAGGACAAGGTGTCTTTGAAGGACGGAGTAATGTAAGGGAACTAATAGGGGTTTATGGATAGGATTAGTATGGTATTAGTAAGGGCATAAGGGTAATTAGTTAGGGAGTTGGTTAATATTTTTGGTTATAAATAGAGGGAGGGGATTGAGGGAAGATAGGCAATTGTTGAGTGATTTAGGGTTTGGGTTAGCACACCCAAGAGAGGGGAGGTTCCAAGAGCCTTATACTTGGTTTATCTGGTAGCTTTTTTATCCTTTTCATTTCAATATATTTCCATTCTTAATTCTTGTTAGGTGGTATCCTAACATAATCTTCATACTATAAATATTCTAATGAACTTGATGCTTTTAGGTTTGCATACCATAACGTGGCAAATCTATGAATGAAACATATATTTGCTTAGTACCTAAAAAGTTGGCTTCCAAATCTGTTAATGAGTTTCGACCCATAAGCCTCATATTCTGTGTTTATAAGATTGTTGCTCGGGTTTTATCTGACCGTTTGAAGCCTATTTTGGTCACTACTATTACAGATAACCAACTTGCTTTTGTTTCGCAAAGACAGATCCTTGATGCTTCTTTAATGGCGAATGAATTAATTGATGATTGCACAAATCGTAATCTAAAAGGTGTGGTCTTGAAGCTGGATCTTGAAAAGACATTTGATACAGTTGACTGGGATTTTCTTGATGTAGTGCTACAGGCAAAAGGCTTTGGGAGTCTTTGGAGGAAATGGATCCGTGGTTGTATTACTAGTGCAAACTATTCTATTATTATAAATGGTAGGGCTCGTGGGAAAATTATTCCTAGTTGTGGTATTCGACAAGGTGATCCCTTATCACCTTTTCTCTTTATTTTGGTCTCTGACTGTCTCAGTCAGTTGTTTACCCACAGTGCTAATAGGGGCCTTATTTCAACCCATCCAATTGGTGCATCATCCTTCTGTTTAAACCATTTCATTTTGCGGATGATACTTTACTCTTTTCTACGGCTGATCGTTTTGCGCGGAAATATTTGTTTGAAGTGGTTGAAATTTTTGAGAGGGCTTCAGGGTTAAAAATTAACCATGCTAAGAGTGAGATTTTGGGTGTTCATGTTGATTATTCCGAATTGGATTGGATGTTGTCGATCTTTGGTTGCAAACAAGGTATGTGGCTGACTACCTATCTTGGTTTGCCTTTGGGAGGGAACTCAAAAAGTATTTCTTTTTGGAATCTAATAATTGAACATCTGCAACAGAAACTCCATAATTGGAAGTATGCTTTTATTTCTAAGGATGGGAGACATACCCTTATTTAAGCTACTCTCTCTAGTATGCCTTTGTATTATATGTTCTTGTTCAAACTTCCGCCAAAAGTCATTGCGAATCTGGATAAGATTATTAGGGATTTCTTTTGGGAAGGTGCAAGAGGTGATGGTGGAGTACATAATGTGAAGTGGGCAACCACACAACTCCCTAAACTTTTGGGCGGTCTTGGGATTGACAACTTTAAGCAGCGTAATATAATTTGACCCTCCTAGCAAAATGGGTTTGGCGGTTTTCTCAGGAACATGATTCCCTTTGGACGAATTTTATTGTTGCAAAGTATTATGATCATGCTCAGGTAGTCTTTTGGCCTCCTTTTTCTCCTAATGATATTCGAGATTTGGTTTCTTCTCACTCCATTCGGCGTATTTGGAATGGTTGTGATACTCGATTTTGGATCGATTTTTGGCTCAATTGTGGTCCTCTTGAAACTGTCTGCCCACGACTCTTTCGACTTTGTTTGTTCCCTGATTCTAGAGTTGCAAGTCTTTGGAATTCCACGAATTCAGCATGGGACCTAAAACTTCGACGAAACTTAACAGATTTGGAGACTATTGAATGAGCCTATCTTTCTCAACTACTCTCCTCCGTAAACCTTTCGAATATCCCTGATTCTAGGGTTTGGTCTTTAGAGCCTTCTGGATGTTTCTCTGTAAAATCTCTTACAGGGAGTTTGCTGGTTTCTGCTGATTTGGTGACACGAGACATCTATTCTGTAATTTGGAAGGATAAATATCCAAAGAAGATTAAGATATTCCTTTGGGAACTTAGTCTGGGAGCCGTTAATACTTCTGATAGACTCCAAAGAAGAATGCCTTTTATTAGTCTCTCCCCCTCATGGTGTCCCATGTGTCATTTGGGATCAGAGTCTGCGGGCCATTTGTTCTAGCATTGTTTGTTTGCTCAACGTTTATGGTGGTGTGTATTGGATGCTTTCGGTTGAGTCCTGCCTTTCTCCAATAATATTTTTGATTTCCTTTCCTCCATTTTGGTGGGCCATCCGTTCAAAGGTTCGAAGAAGGTTCTTTGGTTGGCTTTTGTTCGTGCTTTCTTGTGGTATCTTTGGATCGAGAGGAATGGTCGTTTGTTCATGGATGTTTCCTCTCCTTTTGAACGTTTTATGGATCATGTTCTCTCAATTGTTTATTCATGGTGCAAGCATGTACAACCTTTTGGTCTTTATAACTTATCGTTTTTTATGTCAAATTGGCGATCATTCTTGTAACTCACCTTTTGGTGCTCGGAGTTATCTCTATTTCATTTATTAATGAAATAGTTTCTTATACCAAAAAAATCTTTTGGAAATAGTAATAATGAATCTTACTGATTACAACTAAACTTTTTGGCTCTGCTTTCCAACTAAAATTTTGTCACTTTGTGATAAGCATGACATGTCTATTTCCAATTTTGTTTTTCCATATCTCTCATCTCAGTGATCAAGGTTAGTTAAATGTTATGATAATGGTTAAGAAAGTTGTCCTCGAGATAGTCCATTTTCTAGTGATCCTTGCTAGTATCTTGGCTCAGATCTCTCAATGAACTCTTGAAAGAATGTTCTTTCTCCTTTCCATCTGATTATGAAATTTCTTTCATGAACGGATGTCTCTAGATTCTCCACTTGATCTATCTAATAATCTATGGTTACTTTTTTAAAAGATCGACTATCAAGTATTTTATAGTTTTATAGTTCTTATTTTGTAAATAATATTTATTATGGAATTTCACTAACATTATGATGGGTCTTATAATTTCTTTGTTTGCAGTTAGCTTTCTCGCAGATCTTGGTGGCCTATATTGCATTAGTGTTGGCATTTTTCTCTACCTCATGTTGCAGGTATGTCACTTTCCCAAAAAGTGAAATTGAAGCTTGTAATTAATAATTATTGCATGTTGTGTCTTCAAACTCTCTGTTATAATAATATCATATTCCTCGTTCGTAAGAATTTGTGTAACACAACACTGACATATATTATACATCTACACACAACAAGCATTATAAGTTATCATCCTAGAACAAGATATTATTTCATAAGTGAGCACGTTCTCATAAAAAAAAAAGTGAGCATGTTATTCATTAGACATAAATATCTTTCGTTCAAAAATGAAGAAATACAGAAAACAAATTGGCACCAATATTTAACAATGAATAATTGCAGTGTTACACACAACAAGCATTATAAGTTATCATCTAGAACAAGATATTATTTCATAAGTGAGCACGTTCTCATAAAAAAAAAAAGTGAGCATGTTATTCATTAGAAATAAATATCTTTCGTTCAAAAATGAAGAAATACAGAAAACAAATTGGCACCAATATTTAACAATGAATAATTGCAGTGTTACTGATATTGTATAATTGGCAGAAATAAATAAAGAAAATTAAATGAGAAATGATAGTGTTAGGAATACTATTAGGGTAGAAGTTACGAATATTATTAGGATAGTAAGGTCATACGAGTAATTAGCTTAAGCTTTGGTTATCAATAGAGGGAGTGAGAGGGATGAAGTTAGTCAATTGTTTAATTAGCTTATGTTCGAGTGATCTCAAGAGAGGGACAGTCCAAGTACCTTGAATTACTTGGGAGTAGTTGTACTATTTTATCGTAGTTCTAGTTGGTTTCTATTAGATGGACAACTGTGGGAGATGGAAAGATAGTTCCTCCTAGTCTCTCTCTCTCTCTGTTATTAGGTGTAGACTACGTGACTGAGATTATTGCTTCTTCCCTAAGATGGGTATTTTTATGGGGAATTTTGTTTATACTAACTTTTTCCCTCCCTTTTGGTAGTTGTTTCACTTCCCTAATGTGGTTTGTTTTGTAGTTCGAGTACAGAATTAAAAAGCTCCGCAACGAAGACAGTGTTATGCATAATATTAGAAGTCGAAAGAAAGCACAAGAGCGTTGGAATAAGGTATTTTTCTTCATCATTTCCTCACCTTATTTGAGATACCAATCAGTTGGCATTATTTAGTTTCCTGTCCTGTTTTGCTTTTAACGAGGGAAACCCCACTTGATTGCTAGTTGCTAAAGCAATAGAAAATGTCATTTTAGTTCACAAACTTTTAGGCCTATTCCAATTTGGTCTTTGAAATTTCAAAATATTTATTTCAATTCTTGAACTTTAAATAAAGGGACTATTTTAATTTTTGGAATTTCAAAATATCTATTTTAATCCTTCCAATTTCAAAAAATAATCATTTTAGTTCTTACTATTATTTTGCCACCTAAGATCTGCCGACTGATCTATTATGTGTCTCAGTTCATATCTATATTAACATGCTAATGCCGATATTTTGGGCCATGTTATCCATTTGATAAAAAGCAAAATAACAATAAAGACCAATATGATTTTTTTTTTTAAGTAGAAGGACAAATGAATATTGTGACAGTACTTTAGAGATAAAAATGGAATAAATCTAAAAGTCCATGGAATAAAATAGACCGTTTGAAAGTTCATGAACCAAAAGGAAATTATCTCGAAAGTTCATGGACCCAAATTGAATTTAAACCAAAACTAAATTCTATATATAAACCAAATTTCTATCTCTCTCTCTAAGGAGGCTTTTTGTTTGTGGGGCGAGGTGATAACAATGTGTGTAAGGTAATCTTGGATCCAACGGTACATAGCATGGCTAGCAATATTTAGAACTTAGATGGGCTAGGAGAGAGATAATTGGTGGATGAAATTAGAATTCTTGTACGATGGTGGTTATCAAACTAACTTATTTTATTTTCTAGTTGAGGAAATATGTAATGTTTACATGGGGCTGCAGTACACTGGATGATTATTACAATGATCTGTCAACAGCACCGAGTTGCACCAATTGCATGGTTCAATCAAATTTTCAGAGTGGATTGTTGCGCAAGCGAAGGCCGAAGAGTGGAGATACTACTTTCTGTTGTAACAGAAAAGTAAATGTTCTGTTTATTACATCTTCATGAGCTTATTTTTTTCTGTTTCCAAACTGATTAGACTTCTGATATGCAGACTGCTAATCAGGATACGAAATTGTCAAAGACAACTCCTACTGACTCGGAAATGCGAACGATAACAACTGAACAAGAACTAGTGAGTTTCCTTGTCCTTAGGCATGGGAATTTTGACTATGTAATTCTAGTCCCTCAGCAAGTTCTTCATATTCATTATGAATTGAGTTTGTGAGGTTACATATTAACCATATTAATGTGAAAAACTAGAGCTATTAGTCATTTAAATTTAATATTTTAATTCATGATTGACTTGGTAAAATCTGTCACTTTTTAACATGGGGCAACATGTGAATGGACGAAAATTTCTTTGACAAGCTAGTTCCTTTTCTGATTTGGGGGAAAAAAAATAAAGAAGCCATGATTAACAATCATTAGCATATCCTACATTCCCTTGACGAGAACAAAAATCTTGCTGATGTAAAAGTTGCCAAATGAATTGGAAGTACATGTTCAATCACTAATAATTTGACATACGTGTTCTTCAGTCTCCGAAACATCGCATACTTTGTTCTATCGACGAGGGGAAGCAAAGCTTAGACAGTCCATGCGAGGGAGATTCCTCACAACTTGGAGACTTTTTTCATTCTGAAGATCTTATTCCCCCACCACCCACAATAGGTAACTACCACATTTTTTAAATTCAATATTTAATATTTCCCCTCTTCATCTCCATTGTTGTCAAGAATCATAATGTTATTGTTCAATGAAATGAAGACGAGTGTGGGTAGGTTGGTCATAGGAGTCCTAGAACATGTTTTTCACTGCAACATCAGAATCATGACAAGAGTGCATTTACTTTGTGGATATGTGACACAGTTTTTGCGCTGATGAACCATGCTCAGAGTTACTTCCAATTTTTCTGATGTAGAGTTCAAGGATAGTTCTGATATCAACATGTCCGATATCTTGGAGAAGATGAAAAGTTTGTACGAGTATAATGTAATTTTTAGAGAAAAGTTATTGGCTACTGAATCTGAGATTCGTGCTTTAGCAACTAAATCCTCTTGA
mRNA sequence
ATGTCATGTCCAATCAACAGCTTCCGCTTCAATGGCTCCCTTTGCGCGTGCCCACCAGGCCATCTTCTTAATCGGGCCAGCAATACCTGCGTTCTCTTCAGCAGCCCTTCATCCATCGTCATAGGCCGAGTTGAAAGCTATGGTGTTAGCTTCCCTGCCACCATTTTCTCGTTTGATTCTATTAGGAAGTTCACGCAGTCTCAGGCTGTGTTTCTTCAGGCCACTCTTGTAATGCTGGGTTCTTGGCTCTTCTTCTGCTTGTTTCTGAGATTCATGAAGCTTGGGGATGGCAAAAACTCTTGGTTCAGGATCAGATGGTGGGTTAGCAGACTGGACCTCTGCTTTTCCACTAGACATTGGCTGGATGATAAAAAAGTAGTCATGAAACGAAAAACTGAACTTGGTGGAACATTCTCAATAGCAAGTTGGATTCTTTTCGTTGGCTTGTTTGCTGCGTTGCTTTACCAAGTCATATCAAAAAGAAGCATTGAAGTGCATAATATCAAAGCAGCACATGCACCAGACATGGTTTCCTTTGTGAATGACATGGAATTTAATATAACCACAGTCTCTACTATGAGTTGTGCAAATATTCGTGATCTTGGTACTGTTGTGTTTGGGAATCCTGGTTTTCTGAAACAGAAAGTAATGCCTCTGTCAAATTTTGCAAACTACTCCTGCCATAACAAGAGTGAAGGGCCAACTATAAGTGTTCAGTGTGGAAGATGTCGATTCATTCAGGACAATATTTATATCTCATGGCAGTTTGTTGATCTTCCAAGTAATCCTGCCAGTGCTGTTGGATTTCAGTTTAACCTCTCTGCAAGGAATCATGCCAAAAATGATCATGCAAGTTTTCTTAGTGGATTATTAAAAAATGGAAGCAATTTTGATGATACACCAGTTACGTTCAGAGGGAAGAATGCGAATATAGTGCAATTCAATCTATTTCCAAGAATATACGGTGACCAACAGGATTCCAAGCTTATGCAGCCTTTATTTCATGAGTTTGTCCCTGGTTCATCCTTTCAAAAGACAAGCGAGCTCCAAACATCCCTTGAAAATTCCAATGATGGACTACTCAATGTCACCTTGTACATCAATCTCCTCTCTTCTTACATTGTCGAGATAGAGAATCAAAATGTTTTTAGCCCTGTTAGCTTTCTCGCAGATCTTGGTGGCCTATATTGCATTAGTGTTGGCATTTTTCTCTACCTCATGTTGCAGTTCGAGTACAGAATTAAAAAGCTCCGCAACGAAGACAGTGTTATGCATAATATTAGAAGTCGAAAGAAAGCACAAGAGCGTTGGAATAAGTTGAGGAAATATGTAATGTTTACATGGGGCTGCAGTACACTGGATGATTATTACAATGATCTGTCAACAGCACCGAGTTGCACCAATTGCATGGTTCAATCAAATTTTCAGAGTGGATTGTTGCGCAAGCGAAGGCCGAAGAGTGGAGATACTACTTTCTGTTGTAACAGAAAAACTGCTAATCAGGATACGAAATTGTCAAAGACAACTCCTACTGACTCGGAAATGCGAACGATAACAACTGAACAAGAACTATCTCCGAAACATCGCATACTTTGTTCTATCGACGAGGGGAAGCAAAGCTTAGACAGTCCATGCGAGGGAGATTCCTCACAACTTGGAGACTTTTTTCATTCTGAAGATCTTATTCCCCCACCACCCACAATAGAGTTCAAGGATAGTTCTGATATCAACATGTCCGATATCTTGGAGAAGATGAAAAGTTTGTACGAGTATAATGTAATTTTTAGAGAAAAGTTATTGGCTACTGAATCTGAGATTCGTGCTTTAGCAACTAAATCCTCTTGA
Coding sequence (CDS)
ATGTCATGTCCAATCAACAGCTTCCGCTTCAATGGCTCCCTTTGCGCGTGCCCACCAGGCCATCTTCTTAATCGGGCCAGCAATACCTGCGTTCTCTTCAGCAGCCCTTCATCCATCGTCATAGGCCGAGTTGAAAGCTATGGTGTTAGCTTCCCTGCCACCATTTTCTCGTTTGATTCTATTAGGAAGTTCACGCAGTCTCAGGCTGTGTTTCTTCAGGCCACTCTTGTAATGCTGGGTTCTTGGCTCTTCTTCTGCTTGTTTCTGAGATTCATGAAGCTTGGGGATGGCAAAAACTCTTGGTTCAGGATCAGATGGTGGGTTAGCAGACTGGACCTCTGCTTTTCCACTAGACATTGGCTGGATGATAAAAAAGTAGTCATGAAACGAAAAACTGAACTTGGTGGAACATTCTCAATAGCAAGTTGGATTCTTTTCGTTGGCTTGTTTGCTGCGTTGCTTTACCAAGTCATATCAAAAAGAAGCATTGAAGTGCATAATATCAAAGCAGCACATGCACCAGACATGGTTTCCTTTGTGAATGACATGGAATTTAATATAACCACAGTCTCTACTATGAGTTGTGCAAATATTCGTGATCTTGGTACTGTTGTGTTTGGGAATCCTGGTTTTCTGAAACAGAAAGTAATGCCTCTGTCAAATTTTGCAAACTACTCCTGCCATAACAAGAGTGAAGGGCCAACTATAAGTGTTCAGTGTGGAAGATGTCGATTCATTCAGGACAATATTTATATCTCATGGCAGTTTGTTGATCTTCCAAGTAATCCTGCCAGTGCTGTTGGATTTCAGTTTAACCTCTCTGCAAGGAATCATGCCAAAAATGATCATGCAAGTTTTCTTAGTGGATTATTAAAAAATGGAAGCAATTTTGATGATACACCAGTTACGTTCAGAGGGAAGAATGCGAATATAGTGCAATTCAATCTATTTCCAAGAATATACGGTGACCAACAGGATTCCAAGCTTATGCAGCCTTTATTTCATGAGTTTGTCCCTGGTTCATCCTTTCAAAAGACAAGCGAGCTCCAAACATCCCTTGAAAATTCCAATGATGGACTACTCAATGTCACCTTGTACATCAATCTCCTCTCTTCTTACATTGTCGAGATAGAGAATCAAAATGTTTTTAGCCCTGTTAGCTTTCTCGCAGATCTTGGTGGCCTATATTGCATTAGTGTTGGCATTTTTCTCTACCTCATGTTGCAGTTCGAGTACAGAATTAAAAAGCTCCGCAACGAAGACAGTGTTATGCATAATATTAGAAGTCGAAAGAAAGCACAAGAGCGTTGGAATAAGTTGAGGAAATATGTAATGTTTACATGGGGCTGCAGTACACTGGATGATTATTACAATGATCTGTCAACAGCACCGAGTTGCACCAATTGCATGGTTCAATCAAATTTTCAGAGTGGATTGTTGCGCAAGCGAAGGCCGAAGAGTGGAGATACTACTTTCTGTTGTAACAGAAAAACTGCTAATCAGGATACGAAATTGTCAAAGACAACTCCTACTGACTCGGAAATGCGAACGATAACAACTGAACAAGAACTATCTCCGAAACATCGCATACTTTGTTCTATCGACGAGGGGAAGCAAAGCTTAGACAGTCCATGCGAGGGAGATTCCTCACAACTTGGAGACTTTTTTCATTCTGAAGATCTTATTCCCCCACCACCCACAATAGAGTTCAAGGATAGTTCTGATATCAACATGTCCGATATCTTGGAGAAGATGAAAAGTTTGTACGAGTATAATGTAATTTTTAGAGAAAAGTTATTGGCTACTGAATCTGAGATTCGTGCTTTAGCAACTAAATCCTCTTGA
Protein sequence
MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDSIRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHWLDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFVNDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQCGRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDTPVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGLLNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNEDSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLLRKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQSLDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLLATESEIRALATKSS
Homology
BLAST of Tan0006216 vs. NCBI nr
Match:
XP_022933152.1 (uncharacterized protein LOC111439953 [Cucurbita moschata])
HSP 1 Score: 1033.1 bits (2670), Expect = 9.7e-298
Identity = 509/614 (82.90%), Postives = 556/614 (90.55%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCPINSFRFN SLCACPPGHLL+R +NTCVLFSS S+IVIG+ ES VSFP TIFSFDS
Sbjct: 1 MSCPINSFRFNASLCACPPGHLLDRTTNTCVLFSSSSAIVIGQAESDAVSFPVTIFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+R F QSQAVFLQATLVML SWLFFCLFLRFMKLGDG+N WFR+RWWVSRLDLCF+T HW
Sbjct: 61 LRMFMQSQAVFLQATLVMLASWLFFCLFLRFMKLGDGRNMWFRMRWWVSRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDDKKV+ KRKTELGGTFSIASWILF+GLFAALLYQ ISKRSIEVHNIKAAHAPDM SFV
Sbjct: 121 LDDKKVITKRKTELGGTFSIASWILFIGLFAALLYQTISKRSIEVHNIKAAHAPDMASFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
NDME+NITTVSTMSCAN+RDLGT+VFGNPGFL+QKVMPLSNFANYSCHN S+GPTISV+C
Sbjct: 181 NDMEYNITTVSTMSCANVRDLGTIVFGNPGFLQQKVMPLSNFANYSCHNNSDGPTISVRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
GRCRFIQD IYISWQFVDLPS+PASAVGFQFNLSARNH K DHASFLSG+LKN S+FDDT
Sbjct: 241 GRCRFIQDKIYISWQFVDLPSSPASAVGFQFNLSARNHPKKDHASFLSGILKNVSSFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDSKL+QPLFHEFVPGS FQK S+LQTSLENSNDG+
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSKLLQPLFHEFVPGSFFQKASDLQTSLENSNDGV 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LNVTL+INLLSSYIVEIENQN+F PVSFLADLGGLYCISV IFLYL++QFEYRIKKLR+E
Sbjct: 361 LNVTLFINLLSSYIVEIENQNIFDPVSFLADLGGLYCISVAIFLYLVVQFEYRIKKLRDE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
DSVM NIR+RK AQ+ WNKLRKYVMFTWGCST+DDYYNDLS PSC +CMVQS+ + GLL
Sbjct: 421 DSVMRNIRNRKIAQDHWNKLRKYVMFTWGCSTMDDYYNDLSATPSCADCMVQSSLEGGLL 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
RKR+ KSG TTF NR+TA QDTK K T TDSEM+TITT+QE + C +GKQS
Sbjct: 481 RKRKAKSGYTTFRFNRQTAKQDTKSLKATATDSEMKTITTKQEPPLINAWFCG--QGKQS 540
Query: 541 LDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLL 600
L PCEGDSSQLG+ FHSE LIPPPPTIEFKDSSDI+M D+LEK+KSL+EYNVI R+KLL
Sbjct: 541 LTRPCEGDSSQLGELFHSEALIPPPPTIEFKDSSDIDMFDVLEKIKSLHEYNVILRDKLL 600
Query: 601 ATESEIRALATKSS 615
ATESEIR+LA KSS
Sbjct: 601 ATESEIRSLAIKSS 612
BLAST of Tan0006216 vs. NCBI nr
Match:
KAG6598964.1 (Protein PELOTA 1, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1032.3 bits (2668), Expect = 1.7e-297
Identity = 508/614 (82.74%), Postives = 556/614 (90.55%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCPINSFRFN SLCACPPGHLL+R +NTCVLFSS S+IVIG+ ES VSFP TIFSFDS
Sbjct: 390 MSCPINSFRFNASLCACPPGHLLDRTTNTCVLFSSSSAIVIGQAESDAVSFPVTIFSFDS 449
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+R F QSQAVFLQATLVML SWLFFCLFLRFMKLGDG+N WFR+RWWVSRLDLCF+T HW
Sbjct: 450 LRMFMQSQAVFLQATLVMLASWLFFCLFLRFMKLGDGRNMWFRMRWWVSRLDLCFATTHW 509
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KV+ KRKTELGGTFSIASWILF+GLFAALLYQ ISKRSIEVHNIKAAHAPDM SFV
Sbjct: 510 LDDQKVITKRKTELGGTFSIASWILFIGLFAALLYQTISKRSIEVHNIKAAHAPDMASFV 569
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
NDME+NITTVSTMSCAN+RDLGT+VFGNPGFL+QKVMPLSNFANYSCHN S+GPTISV+C
Sbjct: 570 NDMEYNITTVSTMSCANVRDLGTIVFGNPGFLQQKVMPLSNFANYSCHNNSDGPTISVRC 629
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
GRCRFIQD IYISWQFVDLPS+PASAVGFQFNLSARNH K DHASFLSG+LKN S+FDDT
Sbjct: 630 GRCRFIQDKIYISWQFVDLPSSPASAVGFQFNLSARNHPKKDHASFLSGILKNVSSFDDT 689
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDSKL+QPLFHEFVPGS FQK S+LQTSLENSNDG+
Sbjct: 690 PVTFRGKNANIVQFNLFPRIYGNQQDSKLLQPLFHEFVPGSFFQKASDLQTSLENSNDGV 749
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LNVTL+INLLSSYIVEIENQN+F PVSFLADLGGLYCISV IFLYL++QFEYRIKKL +E
Sbjct: 750 LNVTLFINLLSSYIVEIENQNIFDPVSFLADLGGLYCISVAIFLYLVVQFEYRIKKLHDE 809
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
DSVM NIR+RK AQ+ WNKLRKYVMFTWGCST+DDYYNDLS PSC +CMVQS+ + GLL
Sbjct: 810 DSVMRNIRNRKIAQDHWNKLRKYVMFTWGCSTMDDYYNDLSATPSCADCMVQSSLEGGLL 869
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
RKR+ KSG TTF NR+TA QDTKL K T TDSEM+TITT+QE + C +GKQS
Sbjct: 870 RKRKAKSGYTTFRFNRQTAKQDTKLLKATATDSEMKTITTKQERPLINAWFCG--QGKQS 929
Query: 541 LDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLL 600
L PCEGDSSQLG+ FHSE LIPPPPTIEFKDSSDI+M D+LEK+KSL+EYNVI R+KLL
Sbjct: 930 LTRPCEGDSSQLGELFHSEALIPPPPTIEFKDSSDIDMFDVLEKIKSLHEYNVILRDKLL 989
Query: 601 ATESEIRALATKSS 615
ATESEIR+LA KSS
Sbjct: 990 ATESEIRSLAIKSS 1001
BLAST of Tan0006216 vs. NCBI nr
Match:
XP_023547104.1 (uncharacterized protein LOC111806016 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1021.9 bits (2641), Expect = 2.2e-294
Identity = 503/614 (81.92%), Postives = 552/614 (89.90%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCPINSFRFN SLCACPPGHLL+R +NTCVLFSS S+IVIG+ ES VSFP TIFSFDS
Sbjct: 1 MSCPINSFRFNASLCACPPGHLLDRTTNTCVLFSSSSAIVIGQAESDAVSFPVTIFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+R F QSQAVFLQATLVML SWLFFCLFLRFMKLGDG+N WFR+RWWVSRLDLCF+T HW
Sbjct: 61 LRMFMQSQAVFLQATLVMLASWLFFCLFLRFMKLGDGRNMWFRMRWWVSRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KV+ KRKTELGGTFSIASWILF+GLFAALLYQ ISKRSIEVHNIKAAHAPDM SFV
Sbjct: 121 LDDQKVITKRKTELGGTFSIASWILFIGLFAALLYQTISKRSIEVHNIKAAHAPDMASFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
NDME+NITTVSTMSCAN+RDLG++VFGNPGFLKQKVMPLSNFANYSCHN S+GPTISV+C
Sbjct: 181 NDMEYNITTVSTMSCANVRDLGSIVFGNPGFLKQKVMPLSNFANYSCHNNSDGPTISVRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
GRCRFIQD IYISWQFVDLPS+PASAVGFQFNLSARNH K DHASFLSG+LKN ++FDDT
Sbjct: 241 GRCRFIQDKIYISWQFVDLPSSPASAVGFQFNLSARNHPKKDHASFLSGILKNVTSFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDSKL+QPLFHEFVPGS FQK S+LQTSLENSNDG+
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSKLLQPLFHEFVPGSFFQKASDLQTSLENSNDGV 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LNVTL+INLLSSY+VEIENQN+F PVSFLADLGGLYCISV IFLYL++QFEYRIKKLR+E
Sbjct: 361 LNVTLFINLLSSYVVEIENQNIFDPVSFLADLGGLYCISVAIFLYLVVQFEYRIKKLRDE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
DSVM NIR+RK AQ+ WNKLRKYVMFTWGCST+DDYYNDLS PSC +CMVQS+ + GLL
Sbjct: 421 DSVMRNIRNRKIAQDHWNKLRKYVMFTWGCSTMDDYYNDLSATPSCADCMVQSSLEGGLL 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
RKR+ KSG TTF NR+TA QDTK K T TDSEM+TI T+QE + C +GKQS
Sbjct: 481 RKRKAKSGYTTFRFNRQTAKQDTKSLKATATDSEMKTIITKQERPLINAWFCR--QGKQS 540
Query: 541 LDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLL 600
PCEGDSSQLG+ FHSE LIP PPTIEFKDSSDI+M D+LEK+KSLYEYNVI R+KLL
Sbjct: 541 STRPCEGDSSQLGELFHSEALIPLPPTIEFKDSSDIHMFDVLEKIKSLYEYNVILRDKLL 600
Query: 601 ATESEIRALATKSS 615
TESEIR+LA KSS
Sbjct: 601 VTESEIRSLAIKSS 612
BLAST of Tan0006216 vs. NCBI nr
Match:
XP_022973907.1 (uncharacterized protein LOC111472540 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1011.1 bits (2613), Expect = 4.0e-291
Identity = 498/614 (81.11%), Postives = 548/614 (89.25%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCPINS RFN SLCACPPGHLL+R +NTCVLF S +IVI + +S VSFP TIFSFDS
Sbjct: 1 MSCPINSLRFNASLCACPPGHLLDRTTNTCVLFGSSPAIVISQADSDAVSFPVTIFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+R F QSQAVFLQATLVML SWLFFCLFLRFMKLGDG+N WFR+RWWVSRLDLCF+T HW
Sbjct: 61 LRMFMQSQAVFLQATLVMLASWLFFCLFLRFMKLGDGRNIWFRMRWWVSRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KV+ KRKTELGGTFSIASWILF+GLFAALLYQ+ISKRSIEVHNIKAAHAPDM SFV
Sbjct: 121 LDDQKVITKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNIKAAHAPDMASFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
NDME+NITTVSTMSCAN+RDLGT+VFGNPGFLKQKVMPLSNFANYSCHN S+GPT+ VQC
Sbjct: 181 NDMEYNITTVSTMSCANVRDLGTIVFGNPGFLKQKVMPLSNFANYSCHNNSDGPTVRVQC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
RCRFIQD IYISWQFVDLPS+PASAVGFQFNLSARNH K DHASFLSG+LKN S+FDDT
Sbjct: 241 RRCRFIQDKIYISWQFVDLPSSPASAVGFQFNLSARNHPKKDHASFLSGILKNVSSFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIY +QQDSKL+QPLFHEFVPGS FQK S+LQTSLENSNDG+
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYVNQQDSKLLQPLFHEFVPGSFFQKASDLQTSLENSNDGV 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LNVTL+INLLSSYIVEIENQN+F PVSFLADLGGLYCISV IFLYL++QFEYRIKKL +E
Sbjct: 361 LNVTLFINLLSSYIVEIENQNIFGPVSFLADLGGLYCISVAIFLYLVVQFEYRIKKLHDE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
DSVM NIR+RK AQ+ WNKLRKYVMFTWGCST+DDYYNDLS PSC +CMVQS+ + GLL
Sbjct: 421 DSVMRNIRNRKIAQDHWNKLRKYVMFTWGCSTMDDYYNDLSATPSCADCMVQSSLEGGLL 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
RKR+ KSG TTF NR+TA QDT+ K T T SEM+TITT+QE + LC +GKQS
Sbjct: 481 RKRKAKSGYTTFRFNRQTAKQDTESLKATATGSEMKTITTKQERPLINAWLCG--QGKQS 540
Query: 541 LDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLL 600
L PCEGDSSQL + FHSE +IPPPPTIEFKDSSDI+M D+LEK+KSLYEYNVI R+KLL
Sbjct: 541 LIRPCEGDSSQLSELFHSEAIIPPPPTIEFKDSSDIDMFDVLEKIKSLYEYNVILRDKLL 600
Query: 601 ATESEIRALATKSS 615
ATESEIR+LA KSS
Sbjct: 601 ATESEIRSLAIKSS 612
BLAST of Tan0006216 vs. NCBI nr
Match:
XP_022144498.1 (uncharacterized protein LOC111014173 isoform X2 [Momordica charantia])
HSP 1 Score: 1002.3 bits (2590), Expect = 1.8e-288
Identity = 490/615 (79.67%), Postives = 552/615 (89.76%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCP NSFR+NG+LCACPPGHL + +N+C LFSSPS+IV+GRVES VS+P T+FSFDS
Sbjct: 1 MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSYPGTMFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+RK TQSQ VFLQATLVML SWL FCLFLRFMKLGDG++ WFR+RWWV+RLDLCF+T HW
Sbjct: 61 LRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSIWFRMRWWVTRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KVV+KRKTELGGTFSIASWILF+GLFAALLYQ+ISKRSIEVHNIKAA+A DMVSFV
Sbjct: 121 LDDQKVVIKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNIKAANAQDMVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
DMEFNITTVSTMSC NIRDLG++VFGNPGFLKQKVMPLSNFANYSCHNKS+GPTIS +C
Sbjct: 181 CDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
RCRF QDNIYISWQFVDLP++PASAVGFQFNLS+ NHAKN+HASF+SG LKNGSNF+DT
Sbjct: 241 ARCRFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDS+L+QPLFHEF+PGSSFQ+ SELQ+SLENS DGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISELQSSLENSEDGL 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LN+T+YINLLSSYIVEIE QN+ PVSFLA+LGGLYCISV IF YL++QFEYRIKKLRNE
Sbjct: 361 LNITMYINLLSSYIVEIEKQNILGPVSFLANLGGLYCISVAIFSYLLVQFEYRIKKLRNE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
D VM NIR+R+KAQE WNKLRKYVM+TWGC TLD YYNDLST PSC +CMVQS+ +
Sbjct: 421 DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYYNDLSTTPSCADCMVQSSRKRASS 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
K+RPK G TTF NR+TANQDTKLSK +D EMR I T+QELSPKH +L ID GKQS
Sbjct: 481 GKQRPKRGYTTFSFNRETANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQS 540
Query: 541 LDS-PCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKL 600
L + PC GDSS+LGDFFHSED+IPPPPTIEFKD SDI+MSDIL+ +KSLY+YN+I REKL
Sbjct: 541 LKTGPCVGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKL 600
Query: 601 LATESEIRALATKSS 615
L TESE+RALATKSS
Sbjct: 601 LTTESEVRALATKSS 615
BLAST of Tan0006216 vs. ExPASy TrEMBL
Match:
A0A6J1EYZ3 (uncharacterized protein LOC111439953 OS=Cucurbita moschata OX=3662 GN=LOC111439953 PE=4 SV=1)
HSP 1 Score: 1033.1 bits (2670), Expect = 4.7e-298
Identity = 509/614 (82.90%), Postives = 556/614 (90.55%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCPINSFRFN SLCACPPGHLL+R +NTCVLFSS S+IVIG+ ES VSFP TIFSFDS
Sbjct: 1 MSCPINSFRFNASLCACPPGHLLDRTTNTCVLFSSSSAIVIGQAESDAVSFPVTIFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+R F QSQAVFLQATLVML SWLFFCLFLRFMKLGDG+N WFR+RWWVSRLDLCF+T HW
Sbjct: 61 LRMFMQSQAVFLQATLVMLASWLFFCLFLRFMKLGDGRNMWFRMRWWVSRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDDKKV+ KRKTELGGTFSIASWILF+GLFAALLYQ ISKRSIEVHNIKAAHAPDM SFV
Sbjct: 121 LDDKKVITKRKTELGGTFSIASWILFIGLFAALLYQTISKRSIEVHNIKAAHAPDMASFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
NDME+NITTVSTMSCAN+RDLGT+VFGNPGFL+QKVMPLSNFANYSCHN S+GPTISV+C
Sbjct: 181 NDMEYNITTVSTMSCANVRDLGTIVFGNPGFLQQKVMPLSNFANYSCHNNSDGPTISVRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
GRCRFIQD IYISWQFVDLPS+PASAVGFQFNLSARNH K DHASFLSG+LKN S+FDDT
Sbjct: 241 GRCRFIQDKIYISWQFVDLPSSPASAVGFQFNLSARNHPKKDHASFLSGILKNVSSFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDSKL+QPLFHEFVPGS FQK S+LQTSLENSNDG+
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSKLLQPLFHEFVPGSFFQKASDLQTSLENSNDGV 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LNVTL+INLLSSYIVEIENQN+F PVSFLADLGGLYCISV IFLYL++QFEYRIKKLR+E
Sbjct: 361 LNVTLFINLLSSYIVEIENQNIFDPVSFLADLGGLYCISVAIFLYLVVQFEYRIKKLRDE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
DSVM NIR+RK AQ+ WNKLRKYVMFTWGCST+DDYYNDLS PSC +CMVQS+ + GLL
Sbjct: 421 DSVMRNIRNRKIAQDHWNKLRKYVMFTWGCSTMDDYYNDLSATPSCADCMVQSSLEGGLL 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
RKR+ KSG TTF NR+TA QDTK K T TDSEM+TITT+QE + C +GKQS
Sbjct: 481 RKRKAKSGYTTFRFNRQTAKQDTKSLKATATDSEMKTITTKQEPPLINAWFCG--QGKQS 540
Query: 541 LDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLL 600
L PCEGDSSQLG+ FHSE LIPPPPTIEFKDSSDI+M D+LEK+KSL+EYNVI R+KLL
Sbjct: 541 LTRPCEGDSSQLGELFHSEALIPPPPTIEFKDSSDIDMFDVLEKIKSLHEYNVILRDKLL 600
Query: 601 ATESEIRALATKSS 615
ATESEIR+LA KSS
Sbjct: 601 ATESEIRSLAIKSS 612
BLAST of Tan0006216 vs. ExPASy TrEMBL
Match:
A0A6J1I8T2 (uncharacterized protein LOC111472540 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111472540 PE=4 SV=1)
HSP 1 Score: 1011.1 bits (2613), Expect = 1.9e-291
Identity = 498/614 (81.11%), Postives = 548/614 (89.25%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCPINS RFN SLCACPPGHLL+R +NTCVLF S +IVI + +S VSFP TIFSFDS
Sbjct: 1 MSCPINSLRFNASLCACPPGHLLDRTTNTCVLFGSSPAIVISQADSDAVSFPVTIFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+R F QSQAVFLQATLVML SWLFFCLFLRFMKLGDG+N WFR+RWWVSRLDLCF+T HW
Sbjct: 61 LRMFMQSQAVFLQATLVMLASWLFFCLFLRFMKLGDGRNIWFRMRWWVSRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KV+ KRKTELGGTFSIASWILF+GLFAALLYQ+ISKRSIEVHNIKAAHAPDM SFV
Sbjct: 121 LDDQKVITKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNIKAAHAPDMASFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
NDME+NITTVSTMSCAN+RDLGT+VFGNPGFLKQKVMPLSNFANYSCHN S+GPT+ VQC
Sbjct: 181 NDMEYNITTVSTMSCANVRDLGTIVFGNPGFLKQKVMPLSNFANYSCHNNSDGPTVRVQC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
RCRFIQD IYISWQFVDLPS+PASAVGFQFNLSARNH K DHASFLSG+LKN S+FDDT
Sbjct: 241 RRCRFIQDKIYISWQFVDLPSSPASAVGFQFNLSARNHPKKDHASFLSGILKNVSSFDDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIY +QQDSKL+QPLFHEFVPGS FQK S+LQTSLENSNDG+
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYVNQQDSKLLQPLFHEFVPGSFFQKASDLQTSLENSNDGV 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LNVTL+INLLSSYIVEIENQN+F PVSFLADLGGLYCISV IFLYL++QFEYRIKKL +E
Sbjct: 361 LNVTLFINLLSSYIVEIENQNIFGPVSFLADLGGLYCISVAIFLYLVVQFEYRIKKLHDE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
DSVM NIR+RK AQ+ WNKLRKYVMFTWGCST+DDYYNDLS PSC +CMVQS+ + GLL
Sbjct: 421 DSVMRNIRNRKIAQDHWNKLRKYVMFTWGCSTMDDYYNDLSATPSCADCMVQSSLEGGLL 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
RKR+ KSG TTF NR+TA QDT+ K T T SEM+TITT+QE + LC +GKQS
Sbjct: 481 RKRKAKSGYTTFRFNRQTAKQDTESLKATATGSEMKTITTKQERPLINAWLCG--QGKQS 540
Query: 541 LDSPCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKLL 600
L PCEGDSSQL + FHSE +IPPPPTIEFKDSSDI+M D+LEK+KSLYEYNVI R+KLL
Sbjct: 541 LIRPCEGDSSQLSELFHSEAIIPPPPTIEFKDSSDIDMFDVLEKIKSLYEYNVILRDKLL 600
Query: 601 ATESEIRALATKSS 615
ATESEIR+LA KSS
Sbjct: 601 ATESEIRSLAIKSS 612
BLAST of Tan0006216 vs. ExPASy TrEMBL
Match:
A0A6J1CTF3 (uncharacterized protein LOC111014173 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111014173 PE=4 SV=1)
HSP 1 Score: 1002.3 bits (2590), Expect = 8.9e-289
Identity = 490/615 (79.67%), Postives = 552/615 (89.76%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCP NSFR+NG+LCACPPGHL + +N+C LFSSPS+IV+GRVES VS+P T+FSFDS
Sbjct: 1 MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSYPGTMFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+RK TQSQ VFLQATLVML SWL FCLFLRFMKLGDG++ WFR+RWWV+RLDLCF+T HW
Sbjct: 61 LRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSIWFRMRWWVTRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KVV+KRKTELGGTFSIASWILF+GLFAALLYQ+ISKRSIEVHNIKAA+A DMVSFV
Sbjct: 121 LDDQKVVIKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNIKAANAQDMVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
DMEFNITTVSTMSC NIRDLG++VFGNPGFLKQKVMPLSNFANYSCHNKS+GPTIS +C
Sbjct: 181 CDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
RCRF QDNIYISWQFVDLP++PASAVGFQFNLS+ NHAKN+HASF+SG LKNGSNF+DT
Sbjct: 241 ARCRFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDS+L+QPLFHEF+PGSSFQ+ SELQ+SLENS DGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISELQSSLENSEDGL 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LN+T+YINLLSSYIVEIE QN+ PVSFLA+LGGLYCISV IF YL++QFEYRIKKLRNE
Sbjct: 361 LNITMYINLLSSYIVEIEKQNILGPVSFLANLGGLYCISVAIFSYLLVQFEYRIKKLRNE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
D VM NIR+R+KAQE WNKLRKYVM+TWGC TLD YYNDLST PSC +CMVQS+ +
Sbjct: 421 DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYYNDLSTTPSCADCMVQSSRKRASS 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
K+RPK G TTF NR+TANQDTKLSK +D EMR I T+QELSPKH +L ID GKQS
Sbjct: 481 GKQRPKRGYTTFSFNRETANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQS 540
Query: 541 LDS-PCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKL 600
L + PC GDSS+LGDFFHSED+IPPPPTIEFKD SDI+MSDIL+ +KSLY+YN+I REKL
Sbjct: 541 LKTGPCVGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKL 600
Query: 601 LATESEIRALATKSS 615
L TESE+RALATKSS
Sbjct: 601 LTTESEVRALATKSS 615
BLAST of Tan0006216 vs. ExPASy TrEMBL
Match:
A0A6J1CTV4 (uncharacterized protein LOC111014173 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111014173 PE=4 SV=1)
HSP 1 Score: 996.1 bits (2574), Expect = 6.4e-287
Identity = 490/620 (79.03%), Postives = 552/620 (89.03%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCP NSFR+NG+LCACPPGHL + +N+C LFSSPS+IV+GRVES VS+P T+FSFDS
Sbjct: 1 MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSYPGTMFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+RK TQSQ VFLQATLVML SWL FCLFLRFMKLGDG++ WFR+RWWV+RLDLCF+T HW
Sbjct: 61 LRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSIWFRMRWWVTRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KVV+KRKTELGGTFSIASWILF+GLFAALLYQ+ISKRSIEVHNIKAA+A DMVSFV
Sbjct: 121 LDDQKVVIKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNIKAANAQDMVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
DMEFNITTVSTMSC NIRDLG++VFGNPGFLKQKVMPLSNFANYSCHNKS+GPTIS +C
Sbjct: 181 CDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
RCRF QDNIYISWQFVDLP++PASAVGFQFNLS+ NHAKN+HASF+SG LKNGSNF+DT
Sbjct: 241 ARCRFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDS+L+QPLFHEF+PGSSFQ+ SELQ+SLENS DGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISELQSSLENSEDGL 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LN+T+YINLLSSYIVEIE QN+ PVSFLA+LGGLYCISV IF YL++QFEYRIKKLRNE
Sbjct: 361 LNITMYINLLSSYIVEIEKQNILGPVSFLANLGGLYCISVAIFSYLLVQFEYRIKKLRNE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
D VM NIR+R+KAQE WNKLRKYVM+TWGC TLD YYNDLST PSC +CMVQS+ +
Sbjct: 421 DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYYNDLSTTPSCADCMVQSSRKRASS 480
Query: 481 RKRRPKSGDTTFCCNR-----KTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSID 540
K+RPK G TTF NR +TANQDTKLSK +D EMR I T+QELSPKH +L ID
Sbjct: 481 GKQRPKRGYTTFSFNRETNHLQTANQDTKLSKARASDQEMRAIATKQELSPKHHVLGFID 540
Query: 541 EGKQSLDS-PCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVI 600
GKQSL + PC GDSS+LGDFFHSED+IPPPPTIEFKD SDI+MSDIL+ +KSLY+YN+I
Sbjct: 541 GGKQSLKTGPCVGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLI 600
Query: 601 FREKLLATESEIRALATKSS 615
REKLL TESE+RALATKSS
Sbjct: 601 LREKLLTTESEVRALATKSS 620
BLAST of Tan0006216 vs. ExPASy TrEMBL
Match:
A0A6J1CSG8 (uncharacterized protein LOC111014173 isoform X3 OS=Momordica charantia OX=3673 GN=LOC111014173 PE=4 SV=1)
HSP 1 Score: 989.6 bits (2557), Expect = 6.0e-285
Identity = 486/615 (79.02%), Postives = 548/615 (89.11%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGVSFPATIFSFDS 60
MSCP NSFR+NG+LCACPPGHL + +N+C LFSSPS+IV+GRVES VS+P T+FSFDS
Sbjct: 1 MSCPSNSFRYNGTLCACPPGHLFDLTTNSCGLFSSPSAIVMGRVESSAVSYPGTMFSFDS 60
Query: 61 IRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRHW 120
+RK TQSQ VFLQATLVML SWL FCLFLRFMKLGDG++ WFR+RWWV+RLDLCF+T HW
Sbjct: 61 LRKLTQSQVVFLQATLVMLLSWLLFCLFLRFMKLGDGRSIWFRMRWWVTRLDLCFATTHW 120
Query: 121 LDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSFV 180
LDD+KVV+KRKTELGGTFSIASWILF+GLFAALLYQ+ISKRSIEVHNIKAA+A DMVSFV
Sbjct: 121 LDDQKVVIKRKTELGGTFSIASWILFIGLFAALLYQIISKRSIEVHNIKAANAQDMVSFV 180
Query: 181 NDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQC 240
DMEFNITTVSTMSC NIRDLG++VFGNPGFLKQKVMPLSNFANYSCHNKS+GPTIS +C
Sbjct: 181 CDMEFNITTVSTMSCENIRDLGSIVFGNPGFLKQKVMPLSNFANYSCHNKSQGPTISFRC 240
Query: 241 GRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDDT 300
RCRF QDNIYISWQFVDLP++PASAVGFQFNLS+ NHAKN+HASF+SG LKNGSNF+DT
Sbjct: 241 ARCRFSQDNIYISWQFVDLPNSPASAVGFQFNLSSMNHAKNNHASFISGTLKNGSNFNDT 300
Query: 301 PVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDGL 360
PVTFRGKNANIVQFNLFPRIYG+QQDS+L+QPLFHEF+PGSSFQ+ SELQ+SLENS DGL
Sbjct: 301 PVTFRGKNANIVQFNLFPRIYGNQQDSELIQPLFHEFLPGSSFQEISELQSSLENSEDGL 360
Query: 361 LNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRNE 420
LN+T+YINLLSSYIVEIE QN+ PVSFLA+LGGLYCISV IF YL++QFEYRIKKLRNE
Sbjct: 361 LNITMYINLLSSYIVEIEKQNILGPVSFLANLGGLYCISVAIFSYLLVQFEYRIKKLRNE 420
Query: 421 DSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGLL 480
D VM NIR+R+KAQE WNKLRKYVM+TWGC TLD YYNDLST PSC +CMVQS+ +
Sbjct: 421 DRVMRNIRNRRKAQEHWNKLRKYVMYTWGCITLDGYYNDLSTTPSCADCMVQSSRKRASS 480
Query: 481 RKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQS 540
K+RPK G TTF NR +DTKLSK +D EMR I T+QELSPKH +L ID GKQS
Sbjct: 481 GKQRPKRGYTTFSFNR----EDTKLSKARASDQEMRAIATKQELSPKHHVLGFIDGGKQS 540
Query: 541 LDS-PCEGDSSQLGDFFHSEDLIPPPPTIEFKDSSDINMSDILEKMKSLYEYNVIFREKL 600
L + PC GDSS+LGDFFHSED+IPPPPTIEFKD SDI+MSDIL+ +KSLY+YN+I REKL
Sbjct: 541 LKTGPCVGDSSRLGDFFHSEDVIPPPPTIEFKDGSDIDMSDILKNIKSLYKYNLILREKL 600
Query: 601 LATESEIRALATKSS 615
L TESE+RALATKSS
Sbjct: 601 LTTESEVRALATKSS 611
BLAST of Tan0006216 vs. TAIR 10
Match:
AT5G16520.1 (unknown protein; Has 25 Blast hits to 25 proteins in 9 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 25; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 638.6 bits (1646), Expect = 5.0e-183
Identity = 330/622 (53.05%), Postives = 429/622 (68.97%), Query Frame = 0
Query: 1 MSCPINSFRFNGSLCACPPGHLLNRASNTCVLFSSPSSIVIGRVESYGV-SFPATIFSFD 60
M+CP NS +N + CAC G LLNR+S +C +F PS+I + +Y V SF T+F+FD
Sbjct: 1 MACPRNSITYNATRCACGIGQLLNRSSGSCEIFGWPSTISTDKDVNYSVISFAETLFAFD 60
Query: 61 SIRKFTQSQAVFLQATLVMLGSWLFFCLFLRFMKLGDGKNSWFRIRWWVSRLDLCFSTRH 120
IRKFTQSQA+FL+ATLVML SWL FC FLRF KLGDG+N WF +RWW++RLD+ FSTRH
Sbjct: 61 RIRKFTQSQAIFLEATLVMLLSWLVFCFFLRFTKLGDGRNVWFNLRWWITRLDVFFSTRH 120
Query: 121 WLDDKKVVMKRKTELGGTFSIASWILFVGLFAALLYQVISKRSIEVHNIKAAHAPDMVSF 180
WLDD+++V KRKTELGGTFS+ASWI+F+GLFAALLYQ+I+KR+IEVHN++A +PD++SF
Sbjct: 121 WLDDQQIVKKRKTELGGTFSVASWIVFIGLFAALLYQIITKRTIEVHNVRATGSPDLISF 180
Query: 181 VNDMEFNITTVSTMSCANIRDLGTVVFGNPGFLKQKVMPLSNFANYSCHNKSEGPTISVQ 240
ND+EFNIT VS MSC+N+R +G VV GNPGF + KV LS+ +Y+C N + GPT++ +
Sbjct: 181 ENDLEFNITAVSDMSCSNLRGIGNVVMGNPGFSEFKVAALSSLGSYTCKNTTSGPTVNFK 240
Query: 241 CGRCRFIQDNIYISWQFVDLPSNPASAVGFQFNLSARNHAKNDHASFLSGLLKNGSNFDD 300
C +CR D IYISW FVDLP +PA+AVGFQFN +++N H SF+SG L+NGS D+
Sbjct: 241 CTKCRLTNDYIYISWHFVDLPDSPAAAVGFQFNFTSKNGPNEKHMSFVSGTLRNGSILDE 300
Query: 301 TPVTFRGKNANIVQFNLFPRIYGDQQDSKLMQPLFHEFVPGSSFQKTSELQTSLENSNDG 360
+PVTFRG NI++FNLFPRIY D KL+QPLFHEF+PGS ++ T++LQ S+ S DG
Sbjct: 301 SPVTFRGTEGNILKFNLFPRIYHHLHDLKLIQPLFHEFIPGSVYRDTTQLQASMGRSTDG 360
Query: 361 LLNVTLYINLLSSYIVEIENQNVFSPVSFLADLGGLYCISVGIFLYLMLQFEYRIKKLRN 420
+LN TL+IN LS+YIVEI+++N+ PVSFLADLGGLYCIS+GIF YL++Q EYRIKKLRN
Sbjct: 361 ILNTTLFINYLSAYIVEIDHENILGPVSFLADLGGLYCISIGIFFYLLVQCEYRIKKLRN 420
Query: 421 EDSVMHNIRSRKKAQERWNKLRKYVMFTWGCSTLDDYYNDLSTAPSCTNCMVQSNFQSGL 480
ED+V IR+R+KA + W+KLR+YV +TW CS L D +++ SG+
Sbjct: 421 EDTVFRKIRNRRKALDHWDKLRRYVAYTWDCSILVD-------------DAIKTTKVSGM 480
Query: 481 LRKRRPKSGDTTFCCNRKTANQDTKLSKTTPTDSEMRTITTEQELSPKHRILCSIDEGKQ 540
RP + N + +K E I+ L L S D
Sbjct: 481 CGLTRPPTSS-----NSSEHGESIMANKKPNLGIEKNVISQPASLE-----LSSFD---- 540
Query: 541 SLDSPCEGDS-----SQLGDFFHSEDL-IPPPPTIEFKD---SSDINMSDILEKMKSLYE 600
S S GD+ S HSED+ IPPPP +EF D S+++ DI K + LY+
Sbjct: 541 SASSLAHGDNFSNKKSITHPISHSEDVSIPPPPPMEFIDGSSGSEVDAMDIKNKFQLLYD 595
Query: 601 YNVIFREKLLATESEIRALATK 613
YNV+ REKLL T+S + LA K
Sbjct: 601 YNVLLREKLLETQSLLNTLAPK 595
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022933152.1 | 9.7e-298 | 82.90 | uncharacterized protein LOC111439953 [Cucurbita moschata] | [more] |
KAG6598964.1 | 1.7e-297 | 82.74 | Protein PELOTA 1, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
XP_023547104.1 | 2.2e-294 | 81.92 | uncharacterized protein LOC111806016 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022973907.1 | 4.0e-291 | 81.11 | uncharacterized protein LOC111472540 isoform X1 [Cucurbita maxima] | [more] |
XP_022144498.1 | 1.8e-288 | 79.67 | uncharacterized protein LOC111014173 isoform X2 [Momordica charantia] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1EYZ3 | 4.7e-298 | 82.90 | uncharacterized protein LOC111439953 OS=Cucurbita moschata OX=3662 GN=LOC1114399... | [more] |
A0A6J1I8T2 | 1.9e-291 | 81.11 | uncharacterized protein LOC111472540 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1CTF3 | 8.9e-289 | 79.67 | uncharacterized protein LOC111014173 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1CTV4 | 6.4e-287 | 79.03 | uncharacterized protein LOC111014173 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A6J1CSG8 | 6.0e-285 | 79.02 | uncharacterized protein LOC111014173 isoform X3 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT5G16520.1 | 5.0e-183 | 53.05 | unknown protein; Has 25 Blast hits to 25 proteins in 9 species: Archae - 0; Bact... | [more] |