Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
TGTGGTCAAGTGAGGCCCGCCCCGCTTCTTCCTATTATAAATTCTCGACGACTCACATTTCACTCCTTTTTCCCTTCTCCGTCTCTCTTTTCTTCCCTTCCTGTGTCATCTTCTTTCTCCATTTCTCCCCGTGTTTTTCTTTCCACGGAACGATTTTCGTCCATTGAGATCTGATTAATTCGGGAAGACGGAGAAAGGGATGCGCTTTTTTCTTCCTTTCTAGGGTTTCTTTTCTGCCTCGCCCTCCATTGCTTTGCAACTGCGTGTTGCTGCCGTCTGTCTGGGCTTTTTTTTTTTTTTTTTTTTTTTTTAAATGGACGGAGGGGGAGGAACTTTGTCTTCAATGGATGCTTTTGATTCCTTCCTCTTCTCTCTTAGCAATGCATTTTCTACTCCTCTTGCACTCTTCGTTCAGATCCAGGTTCGTTTCTCCTTTTTCAGATCCAGCTTTGTTTCCCTCTTCTCAAAATCAATATATATTTTTTTTTTTTTAAATCTGGGTTTAGCTATTTCTGTTTTTATGTTTTGCGCTCTCTGCTGGTTCTGGGTTACTAAGCCTTAAATGGGAAATTCGCGTTTCTTTCTAATACCTTTTTGAGCTGAGGTTTAGTTAGTTAGTTCTATTCTGGTTTTTGAATTTTCTAGTGGCTCAGTAGCAGTGTTTGTGAAAGGAAAAGTTTGAATGTATTTCCATTTCTTTCTTTCTTTCTTTTTTAGTGTTTGTTTAAGCTGGAAGTTGCTATGATGTTGAAATTGCGTCTGGATTTGGAAATGTGCTTTGTAATTTGAAGATTTTTTTTCTTTTGGCTGATTGGACTTTAAGCTCTCAGTTTAAATTTGATGATAGTCATTGAACTTCTTTGATGCCTGATTCTCTCATTCACTGTAATATTATAGATATTTCAACGCGCTGAAGATTTTTCTATCATTTTACAGAAGCCTCTTTTCTTTGAATTTATTGTTATAGGGATGCGTTATCTGCCTAGTTCTTGCTTTTGGGTGGGCTTGTGCTGCTTATGTCAGGTACTCCTATAATCTGAACTCTGTGTGATCTATACTCCATCCCATGGCATATGAATGTCTAGCACCTGTTTAACTTATAAAGTGTTGTGCTTCCATTCCCCCTGCAGAAATAGAGAAATTAAACGTATAAAGGGGAGAGTTCGAGCTGGCAATAGCTTTGCTTTCATTTGCAATGACATCAGTGAACTCGAGCACTCCAATCAAGTCAATCTACCGAGGGTGACAGTTATTATGCCTTTAAAAGGGTTTGGAGAACATAATTTACACAATTGGAGAAGCCAGGTGAGTTTCTTTCTTTTTCCTTTTCGATGGTGGGCGGGGGATTGATGTTTTTTATGCACCTGCAAATTAAGGAAAAAACAAATTTTCCTTTCATGTTCGGCCCTTGAATAATTGCCAATGTTTCTCATACCTGAAATGTTCGGTCCTTGAATGATTGCCTACTTCAATTATTATATTTTTATCATATTTATCCTAATAGATAAAAAAAGAGGAAATCAAGTCAAAATTGTATTTTTTGCCACATGTTAAAGAAGCAATGTAACGTAATTTTATGGTTTCGAATGGGAATAAGATCTTACAATTGTATATTTCAAATTATATTATATGTATTTCTGCGGATAATTATATTTTGTACCAAAACAAAAAATAGGAATATTCCAGGTGGGGTACCTGACCTCATATATTAGATGATCTGCCCATGACTCGCATATCAGTATATTGCTTGTTTGAGATTTCCTGAAAGTGTTCAAGTTTGGTACTCTATCCGGAAAGTACATTGTTTTAATTTAGTCCCTTTGCAAATTAAAACTGTTCCTAGGTGTTGTGTTGCCTTCTTCTCATTTATCAATGAATTTCCAGGTGACGTCCCTTTATGGAGGTCCTTTAGAATTTCTTTTTGTGGTGGAAAGTACGGAGGACCCTGCTTACAATGCTGTATTGCGGTTGCTATCTGATTATAGGGTTTGTATGAATGT
ATTCTCTTTCACCAATTTTTTCCATTCATAGTTCTTGTTTTCACTTATCATGGTTCACGTAAAGACTAGGGATCAACAGTTTGCAGAACTTTTTTTCATTATCCCCTCGCTTGGCATTTATGTGTGTTTCTTTTTTTTTCTTCTTCTTTTTCTTACAGGATGATGTGGATGCTAGAATTCTTGTGGCTGGGCTAGCAACAACTTGCAGCCAGAAAATTCACAATCAATTGGTATGTGATAAATTGCTGTTAATCTTGTTTTAGTACTTTCTGTCACAAATGCTGGGTTGGAGGATTTGGATATGATAATGTAAGTTCCATATGAAAGATGGAGGAATCGTAGGGAATTGGTTTGGCAGAAGCCTTTATTGGAGGGATGCAATGTATTAGTTACGAGAGAGTTCTAGATCTCTTGCCTGTCGAACATTCTACTAGTATTCATGGTATACTGAAGTTAATTGTTTCTAGTTGAACATATTTGGCAAACTAATTTGAAAACAGTTTTTCACTTTTGTTTTCAAAACTATTCTAGTGCCGCTGTACATTCAAGTTTGTATTCACAAGGTTGAAGTTTGAGTGGCAAGTGTTGATAAGAAGTTTAGGACACCGATCTACTTCCCACAAGTGGTTGAAGGAAGATATCAGGGAATGATAATGTTGAATTTTACAGATGAGAGAAGTAAGGAGGTGGGCTTGGTTAGGGCTTTTAGAAGGATGACATTTATGTGGAGAGACACACAGGCTCTTGGATGTAAACGGTTTGGTTCTTTTGGCCATAAACGGTTATCTGTGGCTTGTTGCCCAGTGGAGTTAATTTGGAATAGTAAAAGGGGGAAGGGAAAGCTATTTGAGGGGATGCAAGTTATTGGTAGAGCTGGAGGGAGGGTCTATGGTTCTTGAGAAAATTCTAATGTCAGGAAAGTTTTGTACTTTTCCTTCCTCTTAACTAGTAGGCTGTACGCCAAAAAAAAAAAAAAAAAAAAAAACACTCTAGTACGCTGTTTTGTTTTGTTTTATTACCAATTTGGCTATTTTTTTTTTCGTTCTTTGTATGAGAGGGTTTAATAATATATACTTATTAACATTGAGAAACTAATGCTTTTCTGCAGGTTGGGGTGGAGGAAATGCATAAAGATACCAAATATGTGTTATTTTTGGATGATGATGTCAGGTTACATCCTGGAACAATTGGAGCCCTCACAGCTGAAATGGAAAAAAATCCTGATGTATGTTACTTGCTATCCACAATTTCTTGCGTGCATGCTCATATTTGGACTACAGATTGCTTTTATAACTCAATTTTATGTATGCAGATATTTATCCAAACTGGATACCCTCTTGATTTACCTTCGGGGAGTTTAGGGAGTTATTGCATCTATGAGTATCATATGGTATGATATGATATATTTTTTTTACACTTTCTAGAAGAAGAGAAATTCTTTTATGTGGAATTCCTTGGTCTGAAGTCTGGAAAAAAAAAAAAAAATTCTTACTTTTATGTGTCAACGTCAACTTTACAGCCTTGTTCAATGGGCTTTGCTACTGGTGGAAAAACATTTTTTCTTTGGGGAGGTTGCATGATGGTAAGTTGGTTCTACATCTTTGTAACTACTGTATGTGTAAGATATCAAGCTATCTTAACTTCATTCGACTTGTTACAGATGCATGCTGATGATTTTAGATATGACCGTTATGGAGTGGTCTCTGGACTTCGAGATGGTGGATACTCAGACGATATGACTCTAGCGGCTATAGCAGGTAACATACAGGCCATTCATCCTTANGGGGGGGGGGGGGGGGTGCTTTTAGTGAAATAGAAGCACAATCTAATTAATCAATTCACATATACTTTTGAGTCCTCAAGTTTGGACTTCTATTTGTATTATGGTACATAATATGACATCTTGGTGGATTTCTTACATACAAGATTCTTTTGCAACCACAGCCTCTTAATGATTATTAAAAGATTGGAAGGCTCTCATGGTTTCTTCGTGGGTGTCT
CCTTGACCTCAACCCTTAGGCTGTCTTTGCTTCTATAGGAATATATTGACCTCGTCTCTCATAAATAATAAGAATAATAATAAATTCAATTAGCTTTCTATATTAGGTATCTCATGGCTGCTTTTCCGTTGCTACATGCTCTGTTTCTTTTCAGGTGCTCATAAGAGGCTTATTACATCACCTCCTGTTGCAATTTTTCCTCATCCTCTTGCTAGTGATCTTAACTTGGGAAGGTCTGATATCTATATTGACAAATACAGTCTGACAATCTTGTTTATTCACTTCCATGTTTTTGACCTCTTTGACATGGTAGGTATTGGAATTACTTGAGGAAACAAACATTTGTTTTGGAATCATACATATCACATGTTAACAAGATAATGAACCGAGCATTATTTACATCTCACTGCTATCTGTCGTGGGGGTTTGTGGCACCATACTTTATGTCTATGATCCACGTTGCCGCAGCACTACGGTTCTATACTAAAGGGTACTCACTCGAGGAAGCAGGTTTTAGTTCTGTGGGTGAGCACACTGCTTTCCATTTTAGTTAGACTATCAGCATTACTTCATACCAAAACTGTTACTTGAGATCCTTGTTTGGGTTACATTTCATGCTTACTTTGCTCAATGCTGAATGCCAATATGGTGAATTGATGGTAGGGCTTCCGAAGTGAAGTCGGAATGATAAGAGGATGTTTAGGGCATTAAGTATCCAATATATAATTACTGATTATTGGTGGTTATTGATGTAGTTATTTATGGTCTATATAAAATGGCAGCTTGAGTTGTTGAGGAAGTGTACTTGTAAATATAATACGCGCGTCAAATCCTTTTCTAAGAATGCTTGAGTTGGAGCTTTGTTTGTCTTGTGCAGTTTTCGGGTTGCCATACACAGCTAACTATTTTTTTTGTTCTTTCATAGGGATGTCAATGGTCTGCAGTCTTGCTGCATGCACCGTTATAGAACTCTTCTCAATGTGGAACTTGACACGGGTAGAAGTTCACTTATGCAACATTCTGTCCCCTGAGGCTCCCCAGCTCTCTCTTGCTTCCTACAACTGGGGATTGGTAAGCGTTTATAGCCTGTTATTTGGAGGATAGGAGGATATGCTTGGTAGACACAATTCACCTGGTTGCTTGATTTTACATCTTAAGGTTGAATTGATGATGTCATGCATGGGTTTTGATCTTAGTGTATAATATTTTGACGTCCTAATTCGTCTTGAAGTTGCAGTTGAATTGTTACATCATGACATCTTTTGATTAGATCATGAAGCTATTAATTTAACATCATGATTAGTAAATATGCCTTTTTTTTTTCGCAGGTATTCATTGCAATCTTGGTGGACAACTTTTTGTACACTATCTCTGCAATACGTTCTCATTTTTCTCAATCAATCAACTGGTCAGGCATTCGATATTACTTGAAAGATGGGAAGATAAACAAGGTAACTTGGCTAAGGCCTTATAGTTATATTTTATTTCAATGAACTGATGCTTAGTAAAATTGTCAAGAAATTTGCATGAACTCTAGAGGCTGTAAATGAACTGGTTTTTGGTAGGCAAGAAAATTCTGGCTGTAAATGATAGAGTAGTTTAATGGCCTGAGTAATAAAAGTATAGTCAGCCAGTTATGATTACTGGATGAATAGTTGTTTTCATTCTCTGCAGATTGAAAGAAGTATACCAAAAGTTGATATGGGTCCAATTTATACGGACTTGGGAGGGAAGCATTTGTATGGGAAGAAAGGAATGGCTCCAAAGGTATCATTCCTGGGCTCCTTGGCCAAAACTTTGGCACAGTGGCGTCAACCTAAGAAATTCGATAGCTAGGATAGAGTTGTTCACAAGAACTGCTAATTATTGATGATATGATGTGTTCCCATTTGGGGATCTCATGGGGAAGGGCCGATAGAGCAATATAGGTGAAAGAGTGGATTGAATAGCTTAGATTCTTTGGGACTACTCTGCCGCTCCATTTGTTGATTGTTCATTAT
TATATAGAGTTCTAGGCACTTCCTCTCATTCTTTTCTTTATTCCTGTTTCATCCATTTTATTCCCACCTCGCCAGGGTTTGATTAGGTAATTGGCAGTGATTTTTTTTCCCCCCATGTGATTCTTGTAATTTCAATTATACAACCTTGTGATTTCCCCTCTTGTTTGCTATTTGAAATGATGTAACTCTCTGGTAGTGACAGTCTAGTGATTATTGTGTACCTCCTTGTGCCCTCGGTATTTATGATATTGTATTAAGTACTTAAGTAATACGAAGTTGCGATATTGTTCTTGACTTTATACTATATGAATAATACTATAGCTACTATTGGAAGTATATAGGCAACAAAGTTTGGAGGTAATTTGATTTCTACTGTTATTCTGAGTTTGAATTTAATATATTTCTTTTATCAAAGTCTATTCCTATCTCAGAAGATTTTCTGTGTCTTAACGCCTGTGATTGAAGGCATCAGACGTGAAAATAGCCTGTAAAGCTCTCATTATTCTCCACTGAAAGTCACTTTCTCTTCCTGCAAAAGTTATTCATCAAGAATAGAATAAAGTTACCCTGTAAGGATCAATTACACAACTTCTTCCAGACCTAATCTTTGGATTTCATTTATCTATTAGATTCCTTTATTATGATATAATCACACGTAAGAACACAGTTCCATTATAAGAAAACTTAAGCAATTCCTTCTGCAATTAGCCTGAACTGTAGGCTAAAAGCTACCAAAATGGGCTGTTTCAGCTTTCGCCCTTCGAAATCTCCACCCCCCATGAAAGTCTCGCACCCCATCCAAGCTCCTCCTCAGGAAAACTCTGAACCCAAAAGGTTCAAACAAGAAGTCCTTTTACAGATACCAGGATGCAGAGCTCATCTGATGGATGGAGGAGAAGCTTTAGAACTCGCCAATGGAGAATTTAAAGTCGAAAGAATCATGGAGGACGACGTGTCTCTCGCTACGATCATAAAAGTCGGCGAGGATCTTCAGTGGCCTTTGACGAAAGATGAGCCTGTAGTGAAGCTTAACTCTTTAAACTATCTGTTTTCATTGCCAATGAAAGATGGAGATCCTCTCAGCTATGGGGTGACATTTCTGGAGCAGTATAGCAGCAGTTTGAGCTTGCTGGACTTGTTTTTGAAGGACAATTCTTGCTTCTGTTCCTCGTCGTGTAATGCAAACAACAAGAGCATCATTAATTGGAAGGAGTATGCGCCGAAGATTGATGATTACAATAACATGTTGGCCAAGGCAATTGCAGAAGGCACTGGCCAGATCGTGCAGGGGATTTTCAAATGCAGCAACAGTTATGCCAATCAGGTATTATCAACAATGAATAACCCAGATTCAGATTGTTTCCTGTGCTAAGAAAAATTGATCAAATTATCAGGTAAACAAGGGAGGCGAAATGATTCTAAACAGTCCTCAAGCTGCTGCAGGTGCAGAAAGATCAGGAAGTTCTAGTGCCACCCAAACCAACAAAACTGCAATCAACAGAAGCTTGAAACGGTAAGTGTCATCCTGAGAACAAATCAAATCTCAAATCTTATTTGAATGTTAGACTGTGAGATCCTATCGGTTGGAGAGGGAAACGAAGCATTCCTTATAAGGATGTAGAAACCTCTTCCTAGTAGACACATTTTAAAACTGTGAGGCTGTGGCGATACGTAACGGGTCAAAGCGGACAATATTTGTTAGTGGTGGGCTTGGGTTGTTACAAATGGTATCAGAGCCCAGGTAGTGTGCCAACGAGGACGTTGGACCCCCAAGGGGGATGGATTGTGAGATCTTACATCAGTTGGATAGGAGAACGAAGCATTCCTTTTAAGGGTGTGGAAACCTCTCCCTAATAGACGTGTTTTAAAATCATGAGGCTGACAACGATACATAACAGGCCAAAACGGATAAAATTTATTAGTGGTGGGCTTGGATCATTACATATGTAATATTATGGTTGTCTGATAAGATTTCTGTTTCAAGTTTTTGAAAAACATAG
CTTATTGGCATGTTCTTAACGAATGATATAGAGTAAGAAACCTGTCAAAGACGACAGAGAAGCTGAGCAAATCCATGCTAGACATGGTTGGGGTAGCCTCAGGATCAGTAATGGGTCCTGTAATGAAATCCCAAGCAGGAAGGGCGTTCTTTTCCATGGTTCCGGGGCAGGTCCTGTTGGCATCATTAGACGCAGTCAGTAAGTCCCAAAACCAAACATGTTATACATTTGAGTTTTCCAAAGATAAAAAGTTGCAGAAGAAGAAGATAATGAAAAATGTGGATGAATTGCAGACAAGATTCTGGACGCAGCAGAAGCAGCTGAAAAGCAAGCACTTTTGGCGACCACGACTGCCACAACAAGAATGGTGTCAAACAGATTTGGGGAAAGTGCAGGGGAGGCAACTGGGGATGTGCTTGCCACTGCAGGGCACTGTGCTAACACTGCATGGAATGTGTTCAAGATTCGGAAAGCCATTAATCCAGCTTCCTCTGTCTCTGCTGGAGCTCTCAGGAATGCTGCAAAAACTAGAAGCTTTTAAGTATGTCTTAGTTTCTTTTCGATGGATCTAGTGATTATAAAAGTCATGACTTATGAACTAACCACTTTAGATCGGTCTAGTAGCCTCTACGGGTCGATGTAAATAATAAACGATTTCAAAGAAATGGATTGAAGTTATGGTGATCACTTATCTAGGATATTAGCCCTCAAATTTGTTCAGAAGCATGCTTTGTGAGGATGATCTAGATGTGTGCAAATTATACAATTCCATACTAAATCAGCCACGTTTTAACCAAGTTCTGAAGATTCTAAATTTCATATTATTTTCATAATAATAATAATAAAATGATTTCTTTTTAAAATAATATAAATATATAAACAACAGTCCTAATGATCTAAAAAAAAACACTCAAATTTATTATTCATTTAGAAAGAAAAAAAATATTTATTTTCATATTTATATTTTATTATTTGAAAAGTGATTTTTTTAAAATATAATTTGAAATTCAGCGGTAAATGTTGAAAAATAAATAAATACATGCTTTACAAATTAATATTATACTTT
mRNA sequence
TGTGGTCAAGTGAGGCCCGCCCCGCTTCTTCCTATTATAAATTCTCGACGACTCACATTTCACTCCTTTTTCCCTTCTCCGTCTCTCTTTTCTTCCCTTCCTGTGTCATCTTCTTTCTCCATTTCTCCCCGTGTTTTTCTTTCCACGGAACGATTTTCGTCCATTGAGATCTGATTAATTCGGGAAGACGGAGAAAGGGATGCGCTTTTTTCTTCCTTTCTAGGGTTTCTTTTCTGCCTCGCCCTCCATTGCTTTGCAACTGCGTGTTGCTGCCGTCTGTCTGGGCTTTTTTTTTTTTTTTTTTTTTTTTTAAATGGACGGAGGGGGAGGAACTTTGTCTTCAATGGATGCTTTTGATTCCTTCCTCTTCTCTCTTAGCAATGCATTTTCTACTCCTCTTGCACTCTTCGTTCAGATCCAGGGATGCGTTATCTGCCTAGTTCTTGCTTTTGGGTGGGCTTGTGCTGCTTATGTCAGAAATAGAGAAATTAAACGTATAAAGGGGAGAGTTCGAGCTGGCAATAGCTTTGCTTTCATTTGCAATGACATCAGTGAACTCGAGCACTCCAATCAAGTCAATCTACCGAGGGTGACAGTTATTATGCCTTTAAAAGGGTTTGGAGAACATAATTTACACAATTGGAGAAGCCAGGTGACGTCCCTTTATGGAGGTCCTTTAGAATTTCTTTTTGTGGTGGAAAGTACGGAGGACCCTGCTTACAATGCTGTATTGCGGTTGCTATCTGATTATAGGGATGATGTGGATGCTAGAATTCTTGTGGCTGGGCTAGCAACAACTTGCAGCCAGAAAATTCACAATCAATTGGTTGGGGTGGAGGAAATGCATAAAGATACCAAATATGTGTTATTTTTGGATGATGATGTCAGGTTACATCCTGGAACAATTGGAGCCCTCACAGCTGAAATGGAAAAAAATCCTGATATATTTATCCAAACTGGATACCCTCTTGATTTACCTTCGGGGAGTTTAGGGAGTTATTGCATCTATGAGTATCATATGCCTTGTTCAATGGGCTTTGCTACTGGTGGAAAAACATTTTTTCTTTGGGGAGGTTGCATGATGATGCATGCTGATGATTTTAGATATGACCGTTATGGAGTGGTCTCTGGACTTCGAGATGGTGGATACTCAGACGATATGACTCTAGCGGCTATAGCAGGTGCTCATAAGAGGCTTATTACATCACCTCCTGTTGCAATTTTTCCTCATCCTCTTGCTAGTGATCTTAACTTGGGAAGGTATTGGAATTACTTGAGGAAACAAACATTTGTTTTGGAATCATACATATCACATGTTAACAAGATAATGAACCGAGCATTATTTACATCTCACTGCTATCTGTCGTGGGGGTTTGTGGCACCATACTTTATGTCTATGATCCACGTTGCCGCAGCACTACGGTTCTATACTAAAGGGTACTCACTCGAGGAAGCAGGTTTTAGTTCTGTGGGGATGTCAATGGTCTGCAGTCTTGCTGCATGCACCGTTATAGAACTCTTCTCAATGTGGAACTTGACACGGGTAGAAGTTCACTTATGCAACATTCTGTCCCCTGAGGCTCCCCAGCTCTCTCTTGCTTCCTACAACTGGGGATTGGTATTCATTGCAATCTTGGTGGACAACTTTTTGTACACTATCTCTGCAATACGTTCTCATTTTTCTCAATCAATCAACTGGTCAGGCATTCGATATTACTTGAAAGATGGGAAGATAAACAAGATTGAAAGAAGTATACCAAAAGTTGATATGGGTCCAATTTATACGGACTTGGGAGGGAAGCATTTGTATGGGAAGAAAGGAATGGCTCCAAAGGTATCATTCCTGGGCTCCTTGGCCAAAACTTTGGCACAGTGGCGTCAACCTAAGAAATTCGATAGCTAGGATAGAGTTGTTCACAAGAACTGCTAATTATTGATGATATGATGTGTTCCCATTTGGGGATCTCATGGGGAAGGGCCGATAGAGCAATATAGGTGAA
AGAGTGGATTGAATAGCTTAGATTCTTTGGGACTACTCTGCCGCTCCATTTGTTGATTGTTCATTATTATATAGAGTTCTAGGCACTTCCTCTCATTCTTTTCTTTATTCCTGTTTCATCCATTTTATTCCCACCTCGCCAGGGTTTGATTAGGTAATTGGCAGTGATTTTTTTTCCCCCCATGTGATTCTTGTAATTTCAATTATACAACCTTGTGATTTCCCCTCTTGTTTGCTATTTGAAATGATGTAACTCTCTGGTAGTGACAGTCTAGTGATTATTGTGTACCTCCTTGTGCCCTCGGTATTTATGATATTGTATTAAGTACTTAAGTAATACGAAGTTGCGATATTGTTCTTGACTTTATACTATATGAATAATACTATAGCTACTATTGGAAGTATATAGGCAACAAAGTTTGGAGGTAATTTGATTTCTACTGTTATTCTGAGTTTGAATTTAATATATTTCTTTTATCAAAGTCTATTCCTATCTCAGAAGATTTTCTGTGTCTTAACGCCTGTGATTGAAGGCATCAGACGTGAAAATAGCCTGTAAAGCTCTCATTATTCTCCACTGAAAGTCACTTTCTCTTCCTGCAAAAGTTATTCATCAAGAATAGAATAAAGTTACCCTGTAAGGATCAATTACACAACTTCTTCCAGACCTAATCTTTGGATTTCATTTATCTATTAGATTCCTTTATTATGATATAATCACACGTAAGAACACAGTTCCATTATAAGAAAACTTAAGCAATTCCTTCTGCAATTAGCCTGAACTGTAGGCTAAAAGCTACCAAAATGGGCTGTTTCAGCTTTCGCCCTTCGAAATCTCCACCCCCCATGAAAGTCTCGCACCCCATCCAAGCTCCTCCTCAGGAAAACTCTGAACCCAAAAGGTTCAAACAAGAAGTCCTTTTACAGATACCAGGATGCAGAGCTCATCTGATGGATGGAGGAGAAGCTTTAGAACTCGCCAATGGAGAATTTAAAGTCGAAAGAATCATGGAGGACGACGTGTCTCTCGCTACGATCATAAAAGTCGGCGAGGATCTTCAGTGGCCTTTGACGAAAGATGAGCCTGTAGTGAAGCTTAACTCTTTAAACTATCTGTTTTCATTGCCAATGAAAGATGGAGATCCTCTCAGCTATGGGGTGACATTTCTGGAGCAGTATAGCAGCAGTTTGAGCTTGCTGGACTTGTTTTTGAAGGACAATTCTTGCTTCTGTTCCTCGTCGTGTAATGCAAACAACAAGAGCATCATTAATTGGAAGGAGTATGCGCCGAAGATTGATGATTACAATAACATGTTGGCCAAGGCAATTGCAGAAGGCACTGGCCAGATCGTGCAGGGGATTTTCAAATGCAGCAACAGTTATGCCAATCAGGTATTATCAACAATGAATAACCCAGATTCAGATTGTTTCCTGTGCTAAGAAAAATTGATCAAATTATCAGGTAAACAAGGGAGGCGAAATGATTCTAAACAGTCCTCAAGCTGCTGCAGGTGCAGAAAGATCAGGAAGTTCTAGTGCCACCCAAACCAACAAAACTGCAATCAACAGAAGCTTGAAACGAGTAAGAAACCTGTCAAAGACGACAGAGAAGCTGAGCAAATCCATGCTAGACATGGTTGGGGTAGCCTCAGGATCAGTAATGGGTCCTGTAATGAAATCCCAAGCAGGAAGGGCGTTCTTTTCCATGGTTCCGGGGCAGGTCCTGTTGGCATCATTAGACGCAGTCAACAAGATTCTGGACGCAGCAGAAGCAGCTGAAAAGCAAGCACTTTTGGCGACCACGACTGCCACAACAAGAATGGTGTCAAACAGATTTGGGGAAAGTGCAGGGGAGGCAACTGGGGATGTGCTTGCCACTGCAGGGCACTGTGCTAACACTGCATGGAATGTGTTCAAGATTCGGAAAGCCATTAATCCAGCTTCCTCTGTCTCTGCTGGAGCTCTCAGGAATGCTGCAAAAACTAGAAGCTTTTAAGTATG
TCTTAGTTTCTTTTCGATGGATCTAGTGATTATAAAAGTCATGACTTATGAACTAACCACTTTAGATCGGTCTAGTAGCCTCTACGGGTCGATGTAAATAATAAACGATTTCAAAGAAATGGATTGAAGTTATGGTGATCACTTATCTAGGATATTAGCCCTCAAATTTGTTCAGAAGCATGCTTTGTGAGGATGATCTAGATGTGTGCAAATTATACAATTCCATACTAAATCAGCCACGTTTTAACCAAGTTCTGAAGATTCTAAATTTCATATTATTTTCATAATAATAATAATAAAATGATTTCTTTTTAAAATAATATAAATATATAAACAACAGTCCTAATGATCTAAAAAAAAACACTCAAATTTATTATTCATTTAGAAAGAAAAAAAATATTTATTTTCATATTTATATTTTATTATTTGAAAAGTGATTTTTTTAAAATATAATTTGAAATTCAGCGGTAAATGTTGAAAAATAAATAAATACATGCTTTACAAATTAATATTATACTTT
Coding sequence (CDS)
ATGGACGGAGGGGGAGGAACTTTGTCTTCAATGGATGCTTTTGATTCCTTCCTCTTCTCTCTTAGCAATGCATTTTCTACTCCTCTTGCACTCTTCGTTCAGATCCAGGGATGCGTTATCTGCCTAGTTCTTGCTTTTGGGTGGGCTTGTGCTGCTTATGTCAGAAATAGAGAAATTAAACGTATAAAGGGGAGAGTTCGAGCTGGCAATAGCTTTGCTTTCATTTGCAATGACATCAGTGAACTCGAGCACTCCAATCAAGTCAATCTACCGAGGGTGACAGTTATTATGCCTTTAAAAGGGTTTGGAGAACATAATTTACACAATTGGAGAAGCCAGGTGACGTCCCTTTATGGAGGTCCTTTAGAATTTCTTTTTGTGGTGGAAAGTACGGAGGACCCTGCTTACAATGCTGTATTGCGGTTGCTATCTGATTATAGGGATGATGTGGATGCTAGAATTCTTGTGGCTGGGCTAGCAACAACTTGCAGCCAGAAAATTCACAATCAATTGGTTGGGGTGGAGGAAATGCATAAAGATACCAAATATGTGTTATTTTTGGATGATGATGTCAGGTTACATCCTGGAACAATTGGAGCCCTCACAGCTGAAATGGAAAAAAATCCTGATATATTTATCCAAACTGGATACCCTCTTGATTTACCTTCGGGGAGTTTAGGGAGTTATTGCATCTATGAGTATCATATGCCTTGTTCAATGGGCTTTGCTACTGGTGGAAAAACATTTTTTCTTTGGGGAGGTTGCATGATGATGCATGCTGATGATTTTAGATATGACCGTTATGGAGTGGTCTCTGGACTTCGAGATGGTGGATACTCAGACGATATGACTCTAGCGGCTATAGCAGGTGCTCATAAGAGGCTTATTACATCACCTCCTGTTGCAATTTTTCCTCATCCTCTTGCTAGTGATCTTAACTTGGGAAGGTATTGGAATTACTTGAGGAAACAAACATTTGTTTTGGAATCATACATATCACATGTTAACAAGATAATGAACCGAGCATTATTTACATCTCACTGCTATCTGTCGTGGGGGTTTGTGGCACCATACTTTATGTCTATGATCCACGTTGCCGCAGCACTACGGTTCTATACTAAAGGGTACTCACTCGAGGAAGCAGGTTTTAGTTCTGTGGGGATGTCAATGGTCTGCAGTCTTGCTGCATGCACCGTTATAGAACTCTTCTCAATGTGGAACTTGACACGGGTAGAAGTTCACTTATGCAACATTCTGTCCCCTGAGGCTCCCCAGCTCTCTCTTGCTTCCTACAACTGGGGATTGGTATTCATTGCAATCTTGGTGGACAACTTTTTGTACACTATCTCTGCAATACGTTCTCATTTTTCTCAATCAATCAACTGGTCAGGCATTCGATATTACTTGAAAGATGGGAAGATAAACAAGATTGAAAGAAGTATACCAAAAGTTGATATGGGTCCAATTTATACGGACTTGGGAGGGAAGCATTTGTATGGGAAGAAAGGAATGGCTCCAAAGGTATCATTCCTGGGCTCCTTGGCCAAAACTTTGGCACAGTGGCGTCAACCTAAGAAATTCGATAGCTAG
Protein sequence
MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGGPLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKDTKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
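The protein sequence above appears to be the direct translation of the coding sequence (CDS) listed before it. The following is a minimal sketch of how that relationship can be checked, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative and are not part of the database or of any library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
# This table is an assumption; the page does not state which table was used.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T (for example N) become 'X'.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of the CDS shown above.
print(translate("ATGGACGGAGGGGGAGGAACTTTG"))  # MDGGGGTL
```

Passing the full CDS string to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two records are consistent.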
Homology
BLAST of Cp4.1LG18g03990.1 vs. NCBI nr
Match:
XP_023516306.1 (uncharacterized protein LOC111780203 isoform X2 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1080 bits (2792), Expect = 0.0
Identity = 529/529 (100.00%), Positives = 529/529 (100.00%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP
Sbjct: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
Query: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM
Sbjct: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
Query: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS
Sbjct: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
Query: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS
Sbjct: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
Query: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
BLAST of Cp4.1LG18g03990.1 vs. NCBI nr
Match:
XP_022960957.1 (uncharacterized protein LOC111461602 isoform X2 [Cucurbita moschata] >KAG7023614.1 hypothetical protein SDJN02_14640 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1078 bits (2787), Expect = 0.0
Identity = 528/529 (99.81%), Positives = 528/529 (99.81%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTL SMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLPSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP
Sbjct: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
Query: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM
Sbjct: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
Query: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS
Sbjct: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
Query: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS
Sbjct: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
Query: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
BLAST of Cp4.1LG18g03990.1 vs. NCBI nr
Match:
XP_022987677.1 (uncharacterized protein LOC111485161 [Cucurbita maxima])
HSP 1 Score: 1069 bits (2764), Expect = 0.0
Identity = 524/529 (99.05%), Positives = 526/529 (99.43%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTL SMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLPSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIK RVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKRRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSD+RDDVDA+ILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDHRDDVDAKILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP
Sbjct: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
Query: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM
Sbjct: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
Query: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
SMIHVAAALRFYTKGYSLEEAG SSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS
Sbjct: 361 SMIHVAAALRFYTKGYSLEEAGVSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
Query: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS
Sbjct: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
Query: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
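The percentages reported for each hit follow from the counts on the same line: identical (or positive-scoring) positions divided by the alignment length. The sketch below, written under the assumption that the statistics line keeps the layout shown on this page, extracts the counts and recomputes the percentages. The helper name `hsp_percentages` is hypothetical, and the pattern accepts both "Positives" and the "Postives" spelling that occurs in some renderings.

```python
import re

# Matches e.g. "Identity = 524/529 (99.05%), Positives = 526/529 (99.43%)".
# The layout is assumed from the hit reports on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("line does not look like an HSP statistics line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 524/529 (99.05%), Positives = 526/529 (99.43%), Query Frame = 0"
print(hsp_percentages(line))  # (99.05, 99.43)
```

Note that the denominator is the alignment length, not the query length. This is why the isoform X1 hits report 529/555: the 26-residue gap in the query lengthens the alignment to 555 columns.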
BLAST of Cp4.1LG18g03990.1 vs. NCBI nr
Match:
XP_023516305.1 (uncharacterized protein LOC111780203 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1065 bits (2755), Expect = 0.0
Identity = 529/555 (95.32%), Positives = 529/555 (95.32%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMM--------------------------MHADDFRYDRYGVVSGL 300
GFATGGKTFFLWGGCMM MHADDFRYDRYGVVSGL
Sbjct: 241 GFATGGKTFFLWGGCMMVSWFYIFVTTVCVRYQAILTSFDLLQMHADDFRYDRYGVVSGL 300
Query: 301 RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH 360
RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH
Sbjct: 301 RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH 360
Query: 361 VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL 420
VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL
Sbjct: 361 VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL 420
Query: 421 AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS 480
AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS
Sbjct: 421 AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS 480
Query: 481 HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL 529
HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL
Sbjct: 481 HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL 540
BLAST of Cp4.1LG18g03990.1 vs. NCBI nr
Match:
XP_022960956.1 (uncharacterized protein LOC111461602 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1063 bits (2750), Expect = 0.0
Identity = 528/555 (95.14%), Positives = 528/555 (95.14%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTL SMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLPSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMM--------------------------MHADDFRYDRYGVVSGL 300
GFATGGKTFFLWGGCMM MHADDFRYDRYGVVSGL
Sbjct: 241 GFATGGKTFFLWGGCMMVSWFYIYVTTVCVRYQAILTSFDLLQMHADDFRYDRYGVVSGL 300
Query: 301 RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH 360
RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH
Sbjct: 301 RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH 360
Query: 361 VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL 420
VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL
Sbjct: 361 VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL 420
Query: 421 AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS 480
AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS
Sbjct: 421 AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS 480
Query: 481 HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL 529
HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL
Sbjct: 481 HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL 540
BLAST of Cp4.1LG18g03990.1 vs. ExPASy TrEMBL
Match:
A0A6J1H8U5 (Ceramide glucosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111461602 PE=3 SV=1)
HSP 1 Score: 1078 bits (2787), Expect = 0.0
Identity = 528/529 (99.81%), Positives = 528/529 (99.81%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTL SMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLPSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP
Sbjct: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
Query: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM
Sbjct: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
Query: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS
Sbjct: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
Query: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS
Sbjct: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
Query: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
BLAST of Cp4.1LG18g03990.1 vs. ExPASy TrEMBL
Match:
A0A6J1JB05 (Ceramide glucosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111485161 PE=3 SV=1)
HSP 1 Score: 1069 bits (2764), Expect = 0.0
Identity = 524/529 (99.05%), Positives = 526/529 (99.43%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTL SMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLPSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIK RVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKRRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSD+RDDVDA+ILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDHRDDVDAKILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP
Sbjct: 241 GFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPP 300
Query: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM
Sbjct: 301 VAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFM 360
Query: 361 SMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
SMIHVAAALRFYTKGYSLEEAG SSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS
Sbjct: 361 SMIHVAAALRFYTKGYSLEEAGVSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILS 420
Query: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS
Sbjct: 421 PEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERS 480
Query: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 IPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
BLAST of Cp4.1LG18g03990.1 vs. ExPASy TrEMBL
Match:
A0A6J1H915 (Ceramide glucosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111461602 PE=3 SV=1)
HSP 1 Score: 1063 bits (2750), Expect = 0.0
Identity = 528/555 (95.14%), Positives = 528/555 (95.14%), Query Frame = 0
Query: 1 MDGGGGTLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
MDGGGGTL SMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK
Sbjct: 1 MDGGGGTLPSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIK 60
Query: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG
Sbjct: 61 RIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGG 120
Query: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD
Sbjct: 121 PLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKD 180
Query: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM
Sbjct: 181 TKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSM 240
Query: 241 GFATGGKTFFLWGGCMM--------------------------MHADDFRYDRYGVVSGL 300
GFATGGKTFFLWGGCMM MHADDFRYDRYGVVSGL
Sbjct: 241 GFATGGKTFFLWGGCMMVSWFYIYVTTVCVRYQAILTSFDLLQMHADDFRYDRYGVVSGL 300
Query: 301 RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH 360
RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH
Sbjct: 301 RDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISH 360
Query: 361 VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL 420
VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL
Sbjct: 361 VNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSL 420
Query: 421 AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS 480
AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS
Sbjct: 421 AACTVIELFSMWNLTRVEVHLCNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRS 480
Query: 481 HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL 529
HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL
Sbjct: 481 HFSQSINWSGIRYYLKDGKINKIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSL 540
BLAST of Cp4.1LG18g03990.1 vs. ExPASy TrEMBL
Match:
A0A1S3BPF0 (Ceramide glucosyltransferase OS=Cucumis melo OX=3656 GN=LOC103492261 PE=3 SV=1)
HSP 1 Score: 1053 bits (2724), Expect = 0.0
Identity = 514/534 (96.25%), Positives = 525/534 (98.31%), Query Frame = 0
Query: 1 MDGGGG-----TLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVR 60
MDGGGG LSSMDAFDSFLFSLSN+FSTPLALF+QIQGC+ICLVLAFGWACAAYVR
Sbjct: 1 MDGGGGGGGGALLSSMDAFDSFLFSLSNSFSTPLALFIQIQGCIICLVLAFGWACAAYVR 60
Query: 61 NREIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVT 120
NREIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVT
Sbjct: 61 NREIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVT 120
Query: 121 SLYGGPLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVE 180
SLYGGPLEFLFVVESTEDPAYNAVLRLLSDYRD+VDARILVAGLATTCSQKIHNQLVGVE
Sbjct: 121 SLYGGPLEFLFVVESTEDPAYNAVLRLLSDYRDEVDARILVAGLATTCSQKIHNQLVGVE 180
Query: 181 EMHKDTKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYH 240
+MHKD+KYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYH
Sbjct: 181 QMHKDSKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYH 240
Query: 241 MPCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRL 300
MPCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGL+DGGYSDDMTLAAIAGAHKRL
Sbjct: 241 MPCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLQDGGYSDDMTLAAIAGAHKRL 300
Query: 301 ITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFV 360
ITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESY SHVNKIMNRALFTSHCYLSWGFV
Sbjct: 301 ITSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYTSHVNKIMNRALFTSHCYLSWGFV 360
Query: 361 APYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHL 420
APYFMSMIHVAAALRFY KGYSLEE GFS+VGM+MVCSLAACT+IELFSMWNLTRVEVHL
Sbjct: 361 APYFMSMIHVAAALRFYAKGYSLEETGFSTVGMTMVCSLAACTIIELFSMWNLTRVEVHL 420
Query: 421 CNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKIN 480
CNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKI+
Sbjct: 421 CNILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKIH 480
Query: 481 KIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
KIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 KIERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 534
BLAST of Cp4.1LG18g03990.1 vs. ExPASy TrEMBL
Match:
A0A0A0LYQ2 (Ceramide glucosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_1G574910 PE=3 SV=1)
HSP 1 Score: 1050 bits (2716), Expect = 0.0
Identity = 510/533 (95.68%), Positives = 525/533 (98.50%), Query Frame = 0
Query: 1 MDGGGG----TLSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRN 60
MDGGGG LSSMDAFDSFLFSLSN+FSTPLALF+QIQGC+ICLVLAFGWACAAYVRN
Sbjct: 1 MDGGGGEGGGALSSMDAFDSFLFSLSNSFSTPLALFIQIQGCIICLVLAFGWACAAYVRN 60
Query: 61 REIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTS 120
REIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVT+IMPLKGFGEHNLHNWRSQVTS
Sbjct: 61 REIKRIKGRVRAGNSFAFICNDISELEHSNQVNLPRVTIIMPLKGFGEHNLHNWRSQVTS 120
Query: 121 LYGGPLEFLFVVESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEE 180
LYGGPLEFLFVVESTEDPAY+AVLRLLSDYRD+VDARILVAGLATTCSQKIHNQL+GVE+
Sbjct: 121 LYGGPLEFLFVVESTEDPAYSAVLRLLSDYRDEVDARILVAGLATTCSQKIHNQLIGVEQ 180
Query: 181 MHKDTKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHM 240
MHKD+KYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHM
Sbjct: 181 MHKDSKYVLFLDDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHM 240
Query: 241 PCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLI 300
PCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGL+DGGYSDDMTLAAIAGAHKRLI
Sbjct: 241 PCSMGFATGGKTFFLWGGCMMMHADDFRYDRYGVVSGLQDGGYSDDMTLAAIAGAHKRLI 300
Query: 301 TSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVA 360
TSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESY SHVNK+MNRALFTSHCYLSWGFVA
Sbjct: 301 TSPPVAIFPHPLASDLNLGRYWNYLRKQTFVLESYTSHVNKMMNRALFTSHCYLSWGFVA 360
Query: 361 PYFMSMIHVAAALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLC 420
PYFMSMIHVAAALRFY KGYSLEE GFS+VGM+MVCSLAACT+IELFSMWNLTRVEVHLC
Sbjct: 361 PYFMSMIHVAAALRFYAKGYSLEETGFSTVGMTMVCSLAACTIIELFSMWNLTRVEVHLC 420
Query: 421 NILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINK 480
NILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKI+K
Sbjct: 421 NILSPEAPQLSLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKIHK 480
Query: 481 IERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 529
IERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS
Sbjct: 481 IERSIPKVDMGPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFDS 533
BLAST of Cp4.1LG18g03990.1 vs. TAIR 10
Match:
AT2G19880.2 (Nucleotide-diphospho-sugar transferases superfamily protein )
HSP 1 Score: 866.3 bits (2237), Expect = 1.3e-251
Identity = 412/522 (78.93%), Positives = 462/522 (88.51%), Query Frame = 0
Query: 8 LSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIKRIKGRVR 67
+S++D+ D+ LFSLS AF++P A+FVQIQGC ICL+LA GW A YVRNRE+KRIK ++
Sbjct: 1 MSTLDSIDAILFSLSRAFTSPFAVFVQIQGCTICLLLALGWLLAEYVRNREVKRIKNSIK 60
Query: 68 AGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGGPLEFLFV 127
AGNS AF+ DI+ELEHS QV LPRV+V+MPLKGFGEHNLHNWRSQ+TSLYGGPLEFLFV
Sbjct: 61 AGNSLAFLYQDINELEHSRQVKLPRVSVVMPLKGFGEHNLHNWRSQITSLYGGPLEFLFV 120
Query: 128 VESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKDTKYVLFL 187
VESTEDPAY+AV RLLS Y+D V+A+++VAGL+TTCSQKIHNQL+GVE+MHKDTKYVLFL
Sbjct: 121 VESTEDPAYHAVSRLLSMYQDHVEAKVVVAGLSTTCSQKIHNQLIGVEKMHKDTKYVLFL 180
Query: 188 DDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSMGFATGGK 247
DDDVRLHPGTIGALT EMEKNP+IFIQTGYPLDLPSG+LGSYCIYEYHMPCSMGFATGG+
Sbjct: 181 DDDVRLHPGTIGALTTEMEKNPEIFIQTGYPLDLPSGTLGSYCIYEYHMPCSMGFATGGR 240
Query: 248 TFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHP 307
TFFLWGGCMMMHADDFR DRYGVVSGLRDGGYSDDMTLA++AGAHKRLITSPPVA+FPHP
Sbjct: 241 TFFLWGGCMMMHADDFRQDRYGVVSGLRDGGYSDDMTLASLAGAHKRLITSPPVAVFPHP 300
Query: 308 LASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAA 367
LASDL+ GRYWNYLRKQTFVLESYIS VN IMN+ALF HCYLSWGFVAPY M++IH+ +
Sbjct: 301 LASDLSFGRYWNYLRKQTFVLESYISKVNWIMNKALFAVHCYLSWGFVAPYVMAIIHITS 360
Query: 368 ALRFYTKGY-SLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILSPEAPQL 427
ALR Y KGY LE+ +S GM +V +LA CT IEL SMWNLTR EV LCN+LSPEAP+L
Sbjct: 361 ALRIYIKGYHQLEDTTSASGGMMLVITLAICTFIELLSMWNLTRREVQLCNMLSPEAPRL 420
Query: 428 SLASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERSIPKVDM 487
SLA+YNWGLVF+A+LVDNFLY ISA RSHFSQSINWSGIRY+LKDGKI KIER + DM
Sbjct: 421 SLATYNWGLVFVAMLVDNFLYPISAFRSHFSQSINWSGIRYHLKDGKIFKIER---RKDM 480
Query: 488 GPIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFD 529
GP TDLGGKHLYGKKG K SFL SL + LA WRQPKKFD
Sbjct: 481 GPTKTDLGGKHLYGKKGAPQKASFLSSLGRNLAHWRQPKKFD 519
BLAST of Cp4.1LG18g03990.1 vs. TAIR 10
Match:
AT2G19880.1 (Nucleotide-diphospho-sugar transferases superfamily protein )
HSP 1 Score: 865.9 bits (2236), Expect = 1.7e-251
Identity = 411/521 (78.89%), Positives = 460/521 (88.29%), Query Frame = 0
Query: 8 LSSMDAFDSFLFSLSNAFSTPLALFVQIQGCVICLVLAFGWACAAYVRNREIKRIKGRVR 67
+S++D+ D+ LFSLS AF++P A+FVQIQGC ICL+LA GW A YVRNRE+KRIK ++
Sbjct: 1 MSTLDSIDAILFSLSRAFTSPFAVFVQIQGCTICLLLALGWLLAEYVRNREVKRIKNSIK 60
Query: 68 AGNSFAFICNDISELEHSNQVNLPRVTVIMPLKGFGEHNLHNWRSQVTSLYGGPLEFLFV 127
AGNS AF+ DI+ELEHS QV LPRV+V+MPLKGFGEHNLHNWRSQ+TSLYGGPLEFLFV
Sbjct: 61 AGNSLAFLYQDINELEHSRQVKLPRVSVVMPLKGFGEHNLHNWRSQITSLYGGPLEFLFV 120
Query: 128 VESTEDPAYNAVLRLLSDYRDDVDARILVAGLATTCSQKIHNQLVGVEEMHKDTKYVLFL 187
VESTEDPAY+AV RLLS Y+D V+A+++VAGL+TTCSQKIHNQL+GVE+MHKDTKYVLFL
Sbjct: 121 VESTEDPAYHAVSRLLSMYQDHVEAKVVVAGLSTTCSQKIHNQLIGVEKMHKDTKYVLFL 180
Query: 188 DDDVRLHPGTIGALTAEMEKNPDIFIQTGYPLDLPSGSLGSYCIYEYHMPCSMGFATGGK 247
DDDVRLHPGTIGALT EMEKNP+IFIQTGYPLDLPSG+LGSYCIYEYHMPCSMGFATGG+
Sbjct: 181 DDDVRLHPGTIGALTTEMEKNPEIFIQTGYPLDLPSGTLGSYCIYEYHMPCSMGFATGGR 240
Query: 248 TFFLWGGCMMMHADDFRYDRYGVVSGLRDGGYSDDMTLAAIAGAHKRLITSPPVAIFPHP 307
TFFLWGGCMMMHADDFR DRYGVVSGLRDGGYSDDMTLA++AGAHKRLITSPPVA+FPHP
Sbjct: 241 TFFLWGGCMMMHADDFRQDRYGVVSGLRDGGYSDDMTLASLAGAHKRLITSPPVAVFPHP 300
Query: 308 LASDLNLGRYWNYLRKQTFVLESYISHVNKIMNRALFTSHCYLSWGFVAPYFMSMIHVAA 367
LASDL+ GRYWNYLRKQTFVLESYIS VN IMN+ALF HCYLSWGFVAPY M++IH+ +
Sbjct: 301 LASDLSFGRYWNYLRKQTFVLESYISKVNWIMNKALFAVHCYLSWGFVAPYVMAIIHITS 360
Query: 368 ALRFYTKGYSLEEAGFSSVGMSMVCSLAACTVIELFSMWNLTRVEVHLCNILSPEAPQLS 427
ALR Y KGY E S+ GM +V +LA CT IEL SMWNLTR EV LCN+LSPEAP+LS
Sbjct: 361 ALRIYIKGYHQLEDTTSASGMMLVITLAICTFIELLSMWNLTRREVQLCNMLSPEAPRLS 420
Query: 428 LASYNWGLVFIAILVDNFLYTISAIRSHFSQSINWSGIRYYLKDGKINKIERSIPKVDMG 487
LA+YNWGLVF+A+LVDNFLY ISA RSHFSQSINWSGIRY+LKDGKI KIER + DMG
Sbjct: 421 LATYNWGLVFVAMLVDNFLYPISAFRSHFSQSINWSGIRYHLKDGKIFKIER---RKDMG 480
Query: 488 PIYTDLGGKHLYGKKGMAPKVSFLGSLAKTLAQWRQPKKFD 529
P TDLGGKHLYGKKG K SFL SL + LA WRQPKKFD
Sbjct: 481 PTKTDLGGKHLYGKKGAPQKASFLSSLGRNLAHWRQPKKFD 518
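The percentages on each HSP statistics line are the listed ratio (matching positions over alignment length) expressed as a percentage. As a minimal sketch, assuming the line keeps the `name = a/b (p%)` layout shown above, the values can be recomputed and checked in Python. The helper names `parse_hsp_stats` and `percent` are hypothetical and are not part of any BLAST tool.

```python
import re

# Hypothetical pattern: matches "<name> = <matches>/<length> (<percent>%)".
# "Query Frame = 0" has no ratio, so it is skipped.
STAT_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def parse_hsp_stats(line):
    """Return {name: (matches, alignment_length, reported_percent)}."""
    return {
        name: (int(matches), int(length), float(reported))
        for name, matches, length, reported in STAT_RE.findall(line)
    }

def percent(matches, length):
    """Recompute the percentage from the raw ratio, rounded to two decimals."""
    return round(100.0 * matches / length, 2)

# Statistics line copied from the AT2G19880.1 hit above.
line = "Identity = 411/521 (78.89%), Positives = 460/521 (88.29%), Query Frame = 0"
stats = parse_hsp_stats(line)
for name, (matches, length, reported) in stats.items():
    print(name, percent(matches, length), reported)
```

Because the alignment length in the denominator includes gap columns, the percentage can differ between hits that cover the same query residues.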
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description |
XP_023516306.1 | 0.0 | 100.00 | uncharacterized protein LOC111780203 isoform X2 [Cucurbita pepo subsp. pepo] |
XP_022960957.1 | 0.0 | 99.81 | uncharacterized protein LOC111461602 isoform X2 [Cucurbita moschata] >KAG7023614... |
XP_022987677.1 | 0.0 | 99.05 | uncharacterized protein LOC111485161 [Cucurbita maxima] |
XP_023516305.1 | 0.0 | 95.32 | uncharacterized protein LOC111780203 isoform X1 [Cucurbita pepo subsp. pepo] |
XP_022960956.1 | 0.0 | 95.14 | uncharacterized protein LOC111461602 isoform X1 [Cucurbita moschata] |
Match Name | E-value | Identity (%) | Description |
A0A6J1H8U5 | 0.0 | 99.81 | Ceramide glucosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111461602 PE=3 ... |
A0A6J1JB05 | 0.0 | 99.05 | Ceramide glucosyltransferase OS=Cucurbita maxima OX=3661 GN=LOC111485161 PE=3 SV... |
A0A6J1H915 | 0.0 | 95.14 | Ceramide glucosyltransferase OS=Cucurbita moschata OX=3662 GN=LOC111461602 PE=3 ... |
A0A1S3BPF0 | 0.0 | 96.25 | Ceramide glucosyltransferase OS=Cucumis melo OX=3656 GN=LOC103492261 PE=3 SV=1 |
A0A0A0LYQ2 | 0.0 | 95.68 | Ceramide glucosyltransferase OS=Cucumis sativus OX=3659 GN=Csa_1G574910 PE=3 SV=... |
Match Name | E-value | Identity (%) | Description |
AT2G19880.2 | 1.3e-251 | 78.93 | Nucleotide-diphospho-sugar transferases superfamily protein |
AT2G19880.1 | 1.7e-251 | 78.89 | Nucleotide-diphospho-sugar transferases superfamily protein |
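The summary rows above are pipe-delimited, so they can be loaded into records for sorting or filtering. The following is only a sketch under the assumption that the first four fields are always match name, E-value, percent identity, and description. The `BlastHit` type and `parse_hit_row` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple

class BlastHit(NamedTuple):
    match_name: str
    evalue: float
    identity_pct: float
    description: str

def parse_hit_row(row: str) -> BlastHit:
    """Split one pipe-delimited row and keep its first four fields."""
    fields = [field.strip() for field in row.split("|")]
    name, evalue, identity, description = fields[:4]
    return BlastHit(name, float(evalue), float(identity), description)

# Rows copied from the TAIR 10 summary table above.
rows = [
    "AT2G19880.1 | 1.7e-251 | 78.89 | Nucleotide-diphospho-sugar transferases superfamily protein |",
    "AT2G19880.2 | 1.3e-251 | 78.93 | Nucleotide-diphospho-sugar transferases superfamily protein |",
]

# A lower E-value means a more significant hit, so sort ascending.
hits = sorted((parse_hit_row(row) for row in rows), key=lambda hit: hit.evalue)
best = hits[0]
print(best.match_name, best.evalue, best.identity_pct)
```

Any fields after the fourth are ignored, so a trailing empty or link column does not affect the result.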
Relationships
This mRNA is a part of the following gene feature(s):
The following five_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG18g03990.1:five_prime_utr:001 | Cp4.1LG18g03990.1:five_prime_utr:001 | five_prime_UTR |
The following exon feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG18g03990.1:exon:001 | Cp4.1LG18g03990.1:exon:001 | exon |
Cp4.1LG18g03990.1:exon:002 | Cp4.1LG18g03990.1:exon:002 | exon |
Cp4.1LG18g03990.1:exon:003 | Cp4.1LG18g03990.1:exon:003 | exon |
Cp4.1LG18g03990.1:exon:004 | Cp4.1LG18g03990.1:exon:004 | exon |
Cp4.1LG18g03990.1:exon:005 | Cp4.1LG18g03990.1:exon:005 | exon |
Cp4.1LG18g03990.1:exon:006 | Cp4.1LG18g03990.1:exon:006 | exon |
Cp4.1LG18g03990.1:exon:007 | Cp4.1LG18g03990.1:exon:007 | exon |
Cp4.1LG18g03990.1:exon:008 | Cp4.1LG18g03990.1:exon:008 | exon |
Cp4.1LG18g03990.1:exon:009 | Cp4.1LG18g03990.1:exon:009 | exon |
Cp4.1LG18g03990.1:exon:010 | Cp4.1LG18g03990.1:exon:010 | exon |
Cp4.1LG18g03990.1:exon:011 | Cp4.1LG18g03990.1:exon:011 | exon |
Cp4.1LG18g03990.1:exon:012 | Cp4.1LG18g03990.1:exon:012 | exon |
Cp4.1LG18g03990.1:exon:013 | Cp4.1LG18g03990.1:exon:013 | exon |
Cp4.1LG18g03990.1:exon:014 | Cp4.1LG18g03990.1:exon:014 | exon |
Cp4.1LG18g03990.1:exon:015 | Cp4.1LG18g03990.1:exon:015 | exon |
Cp4.1LG18g03990.1:exon:016 | Cp4.1LG18g03990.1:exon:016 | exon |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG18g03990.1:cds:001 | Cp4.1LG18g03990.1:cds:001 | CDS |
Cp4.1LG18g03990.1:cds:002 | Cp4.1LG18g03990.1:cds:002 | CDS |
Cp4.1LG18g03990.1:cds:003 | Cp4.1LG18g03990.1:cds:003 | CDS |
Cp4.1LG18g03990.1:cds:004 | Cp4.1LG18g03990.1:cds:004 | CDS |
Cp4.1LG18g03990.1:cds:005 | Cp4.1LG18g03990.1:cds:005 | CDS |
Cp4.1LG18g03990.1:cds:006 | Cp4.1LG18g03990.1:cds:006 | CDS |
Cp4.1LG18g03990.1:cds:007 | Cp4.1LG18g03990.1:cds:007 | CDS |
Cp4.1LG18g03990.1:cds:008 | Cp4.1LG18g03990.1:cds:008 | CDS |
Cp4.1LG18g03990.1:cds:009 | Cp4.1LG18g03990.1:cds:009 | CDS |
Cp4.1LG18g03990.1:cds:010 | Cp4.1LG18g03990.1:cds:010 | CDS |
Cp4.1LG18g03990.1:cds:011 | Cp4.1LG18g03990.1:cds:011 | CDS |
Cp4.1LG18g03990.1:cds:012 | Cp4.1LG18g03990.1:cds:012 | CDS |
Cp4.1LG18g03990.1:cds:013 | Cp4.1LG18g03990.1:cds:013 | CDS |
Cp4.1LG18g03990.1:cds:014 | Cp4.1LG18g03990.1:cds:014 | CDS |
The following three_prime_UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG18g03990.1:three_prime_utr:001 | Cp4.1LG18g03990.1:three_prime_utr:001 | three_prime_UTR |
Cp4.1LG18g03990.1:three_prime_utr:002 | Cp4.1LG18g03990.1:three_prime_utr:002 | three_prime_UTR |
Cp4.1LG18g03990.1:three_prime_utr:003 | Cp4.1LG18g03990.1:three_prime_utr:003 | three_prime_UTR |
The following polypeptide feature(s) derive from this mRNA:
Feature Name | Unique Name | Type |
Cp4.1LG18g03990.1 | Cp4.1LG18g03990.1-protein | polypeptide |