Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTRexonCDSpolypeptidethree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAACTCCCTTTCATAAAATACTTTTCTTAGGCGGTGGGCTTCGTCTTCCTCAAGGAAAAAGCCGACTGAAAATTCTCTGCACAGTCCCTTTCAACTATCGTTTCTTTCCATTTTTTTACTTCCACACGAAAACTCTGGGCATAGAAATGGAGGAATTTGCTGTGGATGATCCGACTCAGCTACTTGAAGCAGCTGCAGATTTCGCAAATTATCCCGGTTAACCATCTGATACATAATTTCAACTTCAAATTCTACATATTCTACTGAAAATGCATGAGAATCTGACATCGTTAATCATTTCCCATTTGTGAAGGTGTTCGGACTGATGCGTCGGTGAAGGAATTCTTCAGCCGCTTTCCCCTTCCCGTCGTAATCAAGTATTTATACACGATATGAAAATCGAAGCATCTCTTGTTTGATTGATTGTTTTTCAATTTTCTTTTATGTATTTTCATTGCTCTTTTGGTTCTCTTTGTTGACGTGGTGTTGTTTTCTTCTTCAAGTGCTTTACAAGCAAAAGCGGAAATTCCTGGTTTGGAAAACACTTTGGTTGCATGTCTCGACAGGATATTCAAAACCAAGTATGGTGCTTCACTTATACCACATTATATGGTATGAACACTTCCCTTGTATTTTTACTCATGTTGGGGATGTGCTGAGTTTTCTGATACTACAGTGAATAGGGTGCCATGGGGGAGGTTTTTTTGACATTTAATTCTTTACATGTTTATCGCTATTGCGGGCCTAGTTGATTGAGTTTAAATTCATTGAAATCCCCCATTATTTTGTTCAACTGCAATAGTAATTCCAATCATGTTATTAGAATCCCTGGATGTGCCTCATGGTCTGTCCGTTGCATTGATTGTTTCTATACCCCCTCCTCCTGCAGCCCTTTGTACAGGTTGGACTACAAGCAGATTCTCAAGCAGTTAGAGGCTTAGCTTGTAAAACGGTAGGTTTTTGTTGCTGCTATGTTCAACTTAAACTAGTTATCCCTCTTTCTTGTCAATTACTTTTCCCTTTCTTTTTGAACTTCATTGTTTGTTATCACTATGCTAATTATAGCCTTCTGTGTTCTTAAACTAAGAATTCTAAATATCTTCAGGTCACTCGCCTGCTGGAGGAGACCGATCCGACTACTCAGTTGGCCCCACAACTTATTGTTGACTATAACATCTATCCACTTTTGATTGAGTGCCTTCTCAATGGGTAAGAACACATACGGATGATTGTGAATTAAACTATGAATGTGATTGAGCCTTGGATGGTACCAGTATAGTGCCTGTGTGTTTTTAATTAAATACTTATTGTTCATACATTTTTTCTACTTAAAACCTTACTTTAAGTTTGAAACTATAAATAGCACAGCTTTCTCGAATCATTAAAGAACTGTCGAATTATTCATCCAATACACACTTCTTTTACAACTTAGGAACATTTTGTTAATGCTGCAAAACTGTTATATAATTGAATTCTATGATTCCTTTATGGGTCTAGGGGTTCTCAACCCCACGAACTCCCCACCTGTCCCTCAGCTTCTTTGTGTTAAGCATCATTGACCTGATAAAACTCCTAAACTATGCAAATCTATTTAAAGCCCTCAATATTGGAACAAATGACGGTGTTCCTTGCATTTTCAACGTTCATGTTGTTAACTCCAAAAAAGGAAGAAAAAAATATCAGCTTTCTTTAGATTATGAATCCGTTAGTTTCATCTGTTTTTATGTTGGTGTGGAACTAAGCGTGAGGTTGTCACTCTTTTTTGTAGTAACGAACAAGTTGCTAACTCATCAATGGATGCAATAAAGAAATTAGCTGCATTTCCAAAGGGGATGGTACATATAATATACTATAGCGGTTATTTATCTGCTCAATATGGTTAACTTGTCCTGCATCTTGTTACAATAAATATTTTTTTGCCAAAACTAACTGTAATTTATTGCTCTGTAGGAAATCATCTTCCCAACAAATAAAACGGAAGCAACACACCTAGGAACTGTAGCTTCAACATGCTCATCTCTGGTATTGTAGCGGGTACTTTGACTTTGAGATATTTATTTTATTGTGCAACACTTAATGCTTGCTGAATATCATTTTAAAGAAAAATTCCTCCAACCATGCTCTCTTTTAATAGAGCTTATGTTTATTTATTTATTTATTTTCCCTCAAGAAGTTTAACGTGGTCTCTTTTTAGCATCAGTGCCATTTTTTCAATTTGTACTTGTTTTCTTATCATTTCTCTACGACGGGGATCCATCTTCCTTAAGAAAATATTCGAATTCTTAGCCAAATTTTAAAAACAAAAGGAAGCTTTTGAAAACTGCATTTTGTAGTTTTTGAAAACACCTGTAAAGAGTAGATAATGAGGCTAATTTTTTTTTATCAAACAGGGCTTTAGTTACTAATAATTTGTGTTTCTGATTCTTATTGAAATTAAAACTTCCCAGGGAAGAGTCCGAGTTATGGCTTTGATAGTGAAACTGTTTTCAGTTTCTAGCTCTGTGGCATCTGCAGTATACAATTCAAATTTACTAAACCTACTGGAAAGTGAAATCAGCAACTCAAACGACACACTTGTAACTTTAAGCGTGTTGGAGCTCTTGTACGAGGTTTACTATCTCCTACTCCCTTTTTCTTGTTCACTAACATGTGTATCCCCCGTCTGTTTTTATTGTTGGTCTTGAGAGACATTAGTAAAAATGGTGTTAGAAATTATGCTATCCCTTCAGTTTTTTTCTTTCTTTTTGTAAATGTTATTAAACTCGTACAAGTTGGTTACTGTGATATCCAACAACATATTCTTAACATGAATTTGAAACAGACATGATTTGAATTGTCATGGCTGAATGAGCTGCCTCTCTGACTTCCTGTTGTTAGTTAAGAAGCAAGATGGTAGTTGCTTAGTGGTTTGATCATGCAAGAAAAATCATTATGGTCGATCATTGTAGATGAAGTGGAATGGGCAATTTGACACGGCAATGTAGCCTTGCTTGCTTTTGGCAAAACTAACGCAAAGAATTTGTTTAGAATGTTATTAAAATAATTGTTAGGAATAATATTACGATGTTAGGGTTATATTGATCATTAGATAAAGAGTTCGCTAGCAGGGTGGTTATAAATAATGGAGGTGGGAAGGGTTTGTAAGTCATGCATGTTTTTGAACTGGTCTTGTGAGCCAACTAAGCCTCTTTAATTCTTCAGTAACTTCATAATTTTTTCTCGACATTCCAATAAATTTGCAGTCCATATTTTCTTGATTTATTATCTCCAGATTGGGAATCTATTAGAATTATGGCCAGGATATATTTTTACTCAAGCATGCCTGATAGAGCTAGAGCAATCATCATGAGTTACACCATGCTCTCTTTTGTTCTTCCTTTGTTTTATCATCCCACCTTGAAAATTTTCATTCTGATATAATTGATTTTCTTTTTATGCAATTTATAACATAATTTTATTATATTTCCAATTGATGAAGTTAGTGGAGATTGAACATGGTACAAATTTTTTGCCAAGGACCAGCTTTCTCCAACAACTAAGCTGTATAATCAGGTTTTGAAGAGAGAATTATAAACTTCAGTTATACAAGCTAGTGGTCTGACTTTGGATATTGATGGTGTTCTTAATTTTTTTGTTGTATATTGCAGCAACCGGTCGGCAGAGTCTATATTAAGATCCAGAGCAATGGTCATTAGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTCGCTTGTAGATGAATCTTGTAAGCAATAAGTTTACTGCATTTCATTTGATTAGAAAAAGGTGAATTAGATTGAATTAGATTAACTTGTCAAATTTCAATTTTCACATTCCAAACCTCTTCCCATCAAGTTTTACTTGGGACACTCGTATCCCTTGGACCTATGATTACTCGTTTTGGCACCATGGATATACAAGCTACATATAGATCATGAAGATCACATCTATATATAACCTATGTGAAAGAACTAGGAAGCATTACACTGTTTTAGCTCTGTTTTAGGTAGAGCTTCTGTGTCAAAACACGTGTTAAAAGATCACCACTAGTCCAATAGGGTCAAGCCCAATAATTGTATGAGCCCATTGTCTAGATATTTTGGAAATATAGATGTCTAGATATTTTAAGAATAAAAATGTCCACATATTTTAGGAAAAAGAATATGTAGACATTTTAGGAATAAATCTAGAATATCAATATGGTCTATATTGTCTAGCTTCTACTAAAAATACCCTTTCACCCTACTAAGTTTTTCATCCCAAGAAATTTCAAGCAAAGCAAAGTGTAGTAGTTTCTACCGAGTGTCTTGTAGTCTCTTTTCGATTGGTCTTAATAAAGAGTGTGAATGTTTTCCATAAAAGAGTTGTGTTCAATATGTTGATCATGCACATGTTCGTCTGGACACACTCCGAACATGTCTGACATGCTAAAAACATGGGTAATATTTATTCTATTTTCTTTTTATCTAGTTTTGGACACAGCTAAAAATAAAGCCAGACCTAAAAAGTTTATAAAAGAAAAGGAGTGAACGACCTTACTTGACCTTGATCTGTTTTATAGGTTATACAATTGTGTCCTTTGAGTTCTAAAAGAAAAGGAGAGAGGATTCTATAACAATTTCTAATCTTTTAGGATGACATCTTCTTCATCACGCTATGGCCAATTAAATTCTTCTATATTTTGAAGCTCTTAAAATATAGTTAGTTGACTCGACTTTCTATGTTTTTATCTATGATGAAGTATAATAAAAAAACTGTCCAGAACATATCTCTGTCCTACATTTTTAGAAATCGATGTGCCTGTGTTGTATTGAGTCCATGTTTCTTAGAGAAGGAATCATGTGCCTTTTATTATATTTTCAATTCAAGTTTTGCATAAATATCATTTAGATAAAAAATTACAAAACAAATTGCAAGGCTAGGTAGAAAAATTGCACGAACAATGTCCTACCCAGGTTATGAGTTGGATTATTTACTTTTACGAGTAAGGGTTTTCTAGACTGAGAGCCTTTTATGATCGATTTTCTTGCTGTACAAATTCAAAAATATGAAAATCACTTGAGAGATTAGCACAATCTAATTGATTGTGTTGAAAGCAGTTGACAGTAATCTATAATCACCTAATACAGGTGACCCTATTGCCTAATTTAGTCTTCTGAATCTGTAAATTTTGTCATTATCAGTAGGACAAAGAAATCTTTTGATAGTTTCTTGTTTTGCTTTGATATTCTCAGGTGTACGAATTTTAATATCTGCTATAGATGAAATTCTTGGATCATCTGAAGGCCAGGATGTAAATGTATGTGAATCTGCATTCGAAGCACTGGGTCAAATTGGTTCGAGTAAGTATAGCTACTGCCACTTGTAATTTAAGATTTATTTATTTAATGCGAGCAACTGGATTGTGATTTTCTGTTACCATTTTTATAATAAGAGATAGAGTGAAGTTGGCATGTATTTCTAAATGCATGAGGACTCCAGAAAATAAACCTCGACCTCTTTAGAACATGTTAGATATTATCTTGTTTTTAATTTTTTCCCTTCCTAATGGAAAACTATGAACATGAAAGCTCGTATATACCCACTAAGATAAGTATTGGATGCTTAGAAACTATTGGCATTTTGGTTAATAATATCTATGATCTGATTTTGACTGTTACAATGAGAATGCAATGTACAGTAACTATTAAGGTTTCTACTTTTCAGAAGGGGTTTGTAGTAGATATCTTTTTCTAATAGGAAACCATAACATTTCATTGATATGATGAAATTACTTAAGGATAAGTTCCTATCCAATGAATTGCAAAAAACTTGCCCAATTGGTCGTGAGAGAAGATAAACTATAGGTACAAAAGGAGCAATCGATTTACACTAAGAGAGAGCCAAAAAAGTAATAGATGTCCGAAAGGAGTGTATGTTTAATTTCTTGTCTTTAAAGATATGGGAATTGCGCTTGTTCCAAGTAGACCAAAAGGAAGCCCAGATAAAATTCAGCCAAAGGATTTTCATTTTCTTTTTTGAATGGATGACCATTGAACACATTAGAAAGGAATGCTGATGATTTTGAGGGAATAGGAGAAAAGGATAGATGGTAGATGGGGAGATAAGCCAAAATAGCTTGTATGAGAGTCAGTCGACCTTACTTTGAAATATGAGCACATTTTCAACTATTGGCATCCAAAAGGTATTGAGACTTTGGCTTACCATTCAGAGGCAAACCCAAATAATCAGCAGGCCACTTTCCAATTTTGCAACCCCATTTCGCTGCCAATGCCTCTTGCAAGAGAAGTATCAAGATTAATCCCCAAAAAGTCAGTTTTCTGACAATTAATATTCATCCCAGATGCTTCCTTAAACTCATTATTGGTGTTGAACAAATTATTAATGTGATAGACATCAGGGGATGGAAAAAAAGGTGAGTAATCGATAAGCAAAGATCCACCCTGTCCACTTCAAACCCTTTTAATCAAACCTTTCATTTTCGCGAGGGAGAGAATTCTGCTAAGAGCGTCCATAACAATGGTAAAAAGGAAGTGTGAAGTACACCTTGTCTTAGGCGTCGAGAAGCAATGATCTTACCAAGGGGTTTCCACTGATAAGAGGGAGAAGTTAGTAGATGGCATATAACCCCTATTCCACCTCCTCAGATGCCCAAACCCTTTGCTTGCAGAATAAGGTCTAGGAAATTCCAATCAACCTTGTCAAAAGCCTTTTCAATATCTAGTTTGATCACCATAACCGATTCCATCTGTCAATGTATCTCATCTATTCGCTCATAAGCAATCAAAGATGCATCAATAATCTGTCTATTAGCCACAAAAGCTCACTGATTATCTGTTTGACCCCAGTGATTTGTTAGTGCTCAATTTATTAACAAAATGAATGTCTTCTTCGGGAAATGGTGCCTCCAAAAACATTTATTTTCCGTGGATTCTTCTAGAAACGAATTTTCTTATAATTTTTTTTTACCTTTTTTATTAAACATTTTAGAAAACAAGAACAACGGTTATCAAACCCGTTTCTATTTTTTTAATTATTTTTTATTATTCAATAAAAATGAAAGATCAAGAAACAAAATACAGTTAACAAGCGTAATTGATAATTCTATTTTTCTTTATTAAAAAAGTGGAAACGAGTAACAATAAATGGAAGTTACCAAACGTGCCTGAAGGTGTTTTACATCATTTTAACATTTTATTTAGACCTTTATAAACATTTCATCCCCAAAAGCCAACATGTATAGATCAAATTTGTCTCAGGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTATCCAACTTGTGTGAAGTATGTAATTAACGCAGCGTTTGATCGGCATGAACATGGTAAACAGCTGGTAAGTATGTTAAGCTCTTTGACTTATATTTGCGTTTATGCCATCGTTTTCTACACCCCAGTTTCTTTCGTTAAACCATTTGGGAAGGTAAGTATATGGTAAAAAAACTAGATGTTTCAGCTTGCTTTGTAAATGATTAAGTTTGGAAAATCCCTTGGAAGCTAGACATTTCTCGGATGACAAAATTCTTCGCTAAAGTTCTTGCAAGGCTTATTAGTGGGAGAGTAGAGTGATAATGTTTTCAACAATGATTAATGTGGAACAGTGGTATCTTCCCTGCTATGCCATGGTTTTAACAGAAAATATGGAATTATGAATATTTTTTGGATTGTAAGGTTGTAAGTGTTGCTATTCTCTTTTTGCACTGGTTTTCTATTTATCAAAAATCATTAACTCTTTATCTTTTGTGTAAAGCGTTATATTGACAACGGAAAGGATTTTGTTTTAAAAAGATGTTTGTGATAGAATGTGAACTGAAATATGCCATCGCCAATCATTTACTCAAACCTAGGCGAGTATCTAATGATATGTGCTTTATGTACCTATAATATTTCACTTATATTTCAACTGTCAATACAGGCAGCCATGCACGCTCTTGGTAACATCTTTGGTGAAACTCGATCTGAGAATGATGTTCTGCTGAATGATAATGCAGAAGAAAATTTACGGGACTTAATTTATCAAACTGCATCCAGAAGTCCAAAAATGACGCCATCAGTGAGTGAAGGGTTTAGGGTTTATTTCTTTTGATAATATTTTCCTGCTTCAAGTATGTTTCCTATCTAAAGCAATTTGGAATTGGATATCTAATCTGCTTTAGTTGCAGAATATATGTCATTCCAATTCAAGCCAATCCTTCTGCACTCACTTGACAAGATTAGACATTCCTCTGTTTCCTTACATGTTAGAATTAAAACAAAATGCTTAAGCTGATGGTAATGGAGTTAAGTTCATATATCAAACATGTTGGTCCACAGGCTTCGAAGTGTAACAGATTCTTGAAATAAGAATTGCTGTCTGTTCGTGAAATTAATTTGGTTGCTAAGCAATGTTTTAGAAAACATTCCTCAAGGCAACCTCGAGAAGCAAGACATAAGCCTTGAGGCAGTATGAGATGTAACATTACAATAGTACAAAACATACTCGAAAATTAATAACTATAAAGGAAAATATCTCAATAGTCCATCTATATTCATTACTACATCTAACTTGCTTGTGTTAATAAGAATCTAATCAAAATGAAAACAAAGGTAAAATAGGAAAAAGAATAAAACTCACCGTGAACAAATAAAAACAAAAAGAGAACAAGAGTGAAGGGGAAAAATAAAACAAAATTCACCTTCTTGTGTTGTTGTCAAAGATTTAGCTACCTTGCAGTTTTTTTTCTTTGGTTAATGCAGACACTCGCAGTTAGCTCTATTTTAAAATAACACAGCAAGGTGAAAACTTTTTTGTTTGTTAAGTAAACCAATTGAACTGCTTATGCCTTCTTAGTTTTCAGTGCATTTTTAAGTTCTCTCAGATATTTTCATGGCCCTATAGCTCACTACCAATGAACACAAGGCTTATGCCTTGTAGCCTCAGAGGCGTTCATGGACAGGCGTAGGTCTTTTTTTATGTGTTGAAGGCATAAGCCTTGAGACGCAATGTTTCCGCCTCACCCTTGAGGTGCAATATTTCCTCCTTACCTCAAGGCTTATGCCTTGAATTGAACGCATTATAAAGCATTGTTCCCAAGATATATGATGTGATGCTTCATACATTACTTGGTTATCACTGTTGTTAAGCCTTATGGAAAGATGTTTTTAGTGTTCTTTTAGAGTCTGGAGGGTATCATCCTTTTTCACTCAAGACTGCTTCAACTGAACAAATTGACCTATTTTCACTTAATTATCTTCAGCAAGCAACAATTTTGACATTGTTCTTCATTTCTTTCATGATCATAGTTTTTAAAAGTCTTAACAAGCTGCTATGGCAGGGCCTTATTCTAGCTGTCCTTCAACAGGACTCTGAGATTCGCTTGGCGGTGAGTGTTTCAGTCATTTTTTGTTACTTCCAACACTCTTTACCTGTCTTTTAACTGGCCTTAAAGAATCTTGATCTTAGTTTTGTAGAAAAATTCCCAAAAGATAAAAATACATGATATTGGTCAAGCCAATACTTAAATTAAAAATAGTCTGCAAGGTGCAAAGAATTCAAAATGTATTGTTCACGATGATTTATCATCCTTTATTATTGTGGATGGGGTAACTTGCAACCTTAGCGTATGGTCATTGGCATGGAGGCCTATTTTTCATAAAAAATGAGAACGCAGATGCTAAAAAACATGGATATGAACGCATAGAAATGGGTGCATGTATGAAGATTTGATATAGATATGTAGTACTTTGCGTTTTTTCATGTAAATTAGGCATAGCAAGACATTTTGATGTTTAGAGGATAAATATCTTATTTAGAAGCACATCCTATAAATTTGCATCAAATGGGTCTTCCTTGGTCTAGACAATTTTCTTAGTTTGTTTTTTGTTTAGGAAGTTTCAACTTCCAGTAAGAAGGTTGCTTGGTTATGGGTTTAGTATTATCCTGGGCAAGAAACTATGTGGGTTCTAGTTTCCTATGTATTTAAATTTGTATGACTTGCTAACACAACTTTTTCTTCAGAGTTATAGAATGATAACTGGGTTGGTCGCTCGACCGTGGTGCCTTGTGGAAATCTGCTCGAAACAAGACATAATAAATATAGTTTCTGATGCAAGTACCGAGACTACAAAAATAGGTATATGTGCTTGCTCTCCCTTGCATCCATCTTTTGAAGTAAATCAGAGATTTTTTTCATGATTGTCGTGTATGCATCATTAGAAATCCCACGGTCGCTACAAGCATAAATCTTACTTAATTAGTTATTTATCTCAGGAATTTAAACAGTATTATAATTAAGCGAGCATTCTCCTCCTTTTATGGTGTTCCAGTTAAGGGTTGTTGATTTCTTTAGTTGCGACATAGTTAAGCATTCTCAAGCTTTTGTATTAGCCGTTGGCATGCAAACCTGCAATGTTCTTCAGATGTATAGAACATTAGTTCTTCCAGTGTCACAGTTTCAAACGAAATGTTCTAAGCATGTTCAGAAAAGAATTATTGTTGATGGCCTATGCTCATGTTCAATTTTAACTTCCTTTCTTCAAATTAATATCCAAAATCTTGCAAATCATATTCACCCACTTGTTAATAACAAGTTTGCTGAATATATTGCTTTCTAGACCCTGTGAAAGCAACCCTACGAACTCCAAGCAATTCGAAATATGCTGTTGGGTATTAGAATTGCTCCCATTGATCAAATCCCTTTGCTGTTCTTGTCTGTTGTTGGATCAAAGACATTTATTAGCATTTATTTCTGAATCATATCTTGGTGTCGTTAAGTCTTTTTTGTAGTCTAATTATTTCAATACATTGACATGCATTATATCATTATGAAAAATGTAATGTAGGGTGCCGCTATTTTTTGGTGTGCATTTCAGGAGTGTCTTTCATGATGCAATTCTTCATGTATGAACGCATTTCTTATTGTATGAATCTGCTTTTGTGAGATTCATAAACTCTTTTGAAACAAATATTTCATGCACGGGTTGGCATGTAATTAGATAGATTTGAGCGATCCTCCCCCTATGTTTGATGCTAATCTCAATCACATGGTTTATATTTTATTATAATTTTATATATAATTTTTTTTTTCATTTTTTTTTCAGGAATGGAAGCTAGATATAACTGTTGTTTGGCTATCCATAAGACATTCATGTCTTCAACAAGGCTTACGGGCGATCCTGCCCTTGCTGGAATAGCTTCGAAGGTTAGATGGGATGCATAAATATTTCATCCTTTTCTTCAACATTTTCCTTTTCAAGGCAGTTCCTTCTAGTAGTGCTCACAAAACTGGCTCTGGTTGTATGGCTCATGTCTGATTAAGAAAGAAAAATCTAATTGATATAATATGTGTATGGCTATGCAATAATACCTTAAAATAGGAAAGGAAGGGAGAAAAACAACAAAAACAAAACTAAGCAGATAGCCCGGAAAGGCCATCTCCATGTTTTCAGTTTATCTTGTATACATCCTTTTATTGTTTTTTTTATAATTAATGTGGGGTGAAGATTTGAACATCTAACCCCTTGATCGACGTTATATGCTTTATGCTGTCTCAATGTCTGAATCAAGCCTCTTGTCGTCTTACTTGGGAAAAAAGTAGTGTGTAATTCTGCTGTTTGAGTTTGAAAATTTGAACTTCGTAGTGGTTATATGCCTGCTAATCGTTACACGGGTTTGTCTGCAGTTGCAGGAAGCTGTTCAAAATGGTCCATATCTTAGTAGAAGAAAACTGGAAACTCAACCAGCAATAATGACAGCTGAGAGATTTTAGGAGTCTAGTATGGATCAAAAGTAAGCAAGCCTTAGATACAATGGTTCGTAGGGTAGTTGCATCTTTGGTTTCTAGTTGATCCGAGGAGTTCACAGAAGAAGTTGAAACTAGATTGAGCATTTTCCATGGATTGTGTGCACTAATATTTCTCTACTTTGTTCAGTTTCCTGAATGGATTGCTTAGTCAAATTTGCTTGGGACAATGAATTTGTTTAAGTCAATTTTTTTTTTTAAATATTTCATATAAAACTTTTTTTTAAGAAAAAAATTAATCCAATTGTTTGAATCACTTAT
mRNA sequence
CAACTCCCTTTCATAAAATACTTTTCTTAGGCGGTGGGCTTCGTCTTCCTCAAGGAAAAAGCCGACTGAAAATTCTCTGCACAGTCCCTTTCAACTATCGTTTCTTTCCATTTTTTTACTTCCACACGAAAACTCTGGGCATAGAAATGGAGGAATTTGCTGTGGATGATCCGACTCAGCTACTTGAAGCAGCTGCAGATTTCGCAAATTATCCCGGTGTTCGGACTGATGCGTCGGTGAAGGAATTCTTCAGCCGCTTTCCCCTTCCCGTCGTAATCAATGCTTTACAAGCAAAAGCGGAAATTCCTGGTTTGGAAAACACTTTGGTTGCATGTCTCGACAGGATATTCAAAACCAAGTATGGTGCTTCACTTATACCACATTATATGCCCTTTGTACAGGTTGGACTACAAGCAGATTCTCAAGCAGTTAGAGGCTTAGCTTGTAAAACGGTCACTCGCCTGCTGGAGGAGACCGATCCGACTACTCAGTTGGCCCCACAACTTATTGTTGACTATAACATCTATCCACTTTTGATTGAGTGCCTTCTCAATGGTAACGAACAAGTTGCTAACTCATCAATGGATGCAATAAAGAAATTAGCTGCATTTCCAAAGGGGATGGAAATCATCTTCCCAACAAATAAAACGGAAGCAACACACCTAGGAACTGTAGCTTCAACATGCTCATCTCTGGGAAGAGTCCGAGTTATGGCTTTGATAGTGAAACTGTTTTCAGTTTCTAGCTCTGTGGCATCTGCAGTATACAATTCAAATTTACTAAACCTACTGGAAAGTGAAATCAGCAACTCAAACGACACACTTGTAACTTTAAGCGTGTTGGAGCTCTTGTACGAGTTAGTGGAGATTGAACATGGTACAAATTTTTTGCCAAGGACCAGCTTTCTCCAACAACTAAGCTGTATAATCAGCAACCGGTCGGCAGAGTCTATATTAAGATCCAGAGCAATGGTCATTAGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTCGCTTGTAGATGAATCTTGTGTACGAATTTTAATATCTGCTATAGATGAAATTCTTGGATCATCTGAAGGCCAGGATGTAAATGTATGTGAATCTGCATTCGAAGCACTGGGTCAAATTGGTTCGAGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTATCCAACTTGTGTGAAGTATGTAATTAACGCAGCGTTTGATCGGCATGAACATGGTAAACAGCTGGCAGCCATGCACGCTCTTGGTAACATCTTTGGTGAAACTCGATCTGAGAATGATGTTCTGCTGAATGATAATGCAGAAGAAAATTTACGGGACTTAATTTATCAAACTGCATCCAGAAGTCCAAAAATGACGCCATCAGGCCTTATTCTAGCTGTCCTTCAACAGGACTCTGAGATTCGCTTGGCGAGTTATAGAATGATAACTGGGTTGGTCGCTCGACCGTGGTGCCTTGTGGAAATCTGCTCGAAACAAGACATAATAAATATAGTTTCTGATGCAAGTACCGAGACTACAAAAATAGGAATGGAAGCTAGATATAACTGTTGTTTGGCTATCCATAAGACATTCATGTCTTCAACAAGGCTTACGGGCGATCCTGCCCTTGCTGGAATAGCTTCGAAGTTGCAGGAAGCTGTTCAAAATGGTCCATATCTTAGTAGAAGAAAACTGGAAACTCAACCAGCAATAATGACAGCTGAGAGATTTTAGGAGTCTAGTATGGATCAAAAGTAAGCAAGCCTTAGATACAATGGTTCGTAGGGTAGTTGCATCTTTGGTTTCTAGTTGATCCGAGGAGTTCACAGAAGAAGTTGAAACTAGATTGAGCATTTTCCATGGATTGTGTGCACTAATATTTCTCTACTTTGTTCAGTTTCCTGAATGGATTGCTTAGTCAAATTTGCTTGGGACAATGAATTTGTTTAAGTCAATTTTTTTTTTTAAATATTTCATATAAAACTTTTTTTTAAGAAAAAAATTAATCCAATTGTTTGAATCACTTAT
Coding sequence (CDS)
ATGGAGGAATTTGCTGTGGATGATCCGACTCAGCTACTTGAAGCAGCTGCAGATTTCGCAAATTATCCCGGTGTTCGGACTGATGCGTCGGTGAAGGAATTCTTCAGCCGCTTTCCCCTTCCCGTCGTAATCAATGCTTTACAAGCAAAAGCGGAAATTCCTGGTTTGGAAAACACTTTGGTTGCATGTCTCGACAGGATATTCAAAACCAAGTATGGTGCTTCACTTATACCACATTATATGCCCTTTGTACAGGTTGGACTACAAGCAGATTCTCAAGCAGTTAGAGGCTTAGCTTGTAAAACGGTCACTCGCCTGCTGGAGGAGACCGATCCGACTACTCAGTTGGCCCCACAACTTATTGTTGACTATAACATCTATCCACTTTTGATTGAGTGCCTTCTCAATGGTAACGAACAAGTTGCTAACTCATCAATGGATGCAATAAAGAAATTAGCTGCATTTCCAAAGGGGATGGAAATCATCTTCCCAACAAATAAAACGGAAGCAACACACCTAGGAACTGTAGCTTCAACATGCTCATCTCTGGGAAGAGTCCGAGTTATGGCTTTGATAGTGAAACTGTTTTCAGTTTCTAGCTCTGTGGCATCTGCAGTATACAATTCAAATTTACTAAACCTACTGGAAAGTGAAATCAGCAACTCAAACGACACACTTGTAACTTTAAGCGTGTTGGAGCTCTTGTACGAGTTAGTGGAGATTGAACATGGTACAAATTTTTTGCCAAGGACCAGCTTTCTCCAACAACTAAGCTGTATAATCAGCAACCGGTCGGCAGAGTCTATATTAAGATCCAGAGCAATGGTCATTAGTGGAAGACTTTTGTCTAAAGAGAATATTTTCTCGCTTGTAGATGAATCTTGTGTACGAATTTTAATATCTGCTATAGATGAAATTCTTGGATCATCTGAAGGCCAGGATGTAAATGTATGTGAATCTGCATTCGAAGCACTGGGTCAAATTGGTTCGAGCAAATGGGGAGCCACTTTACTGCTGTCAAGTTATCCAACTTGTGTGAAGTATGTAATTAACGCAGCGTTTGATCGGCATGAACATGGTAAACAGCTGGCAGCCATGCACGCTCTTGGTAACATCTTTGGTGAAACTCGATCTGAGAATGATGTTCTGCTGAATGATAATGCAGAAGAAAATTTACGGGACTTAATTTATCAAACTGCATCCAGAAGTCCAAAAATGACGCCATCAGGCCTTATTCTAGCTGTCCTTCAACAGGACTCTGAGATTCGCTTGGCGAGTTATAGAATGATAACTGGGTTGGTCGCTCGACCGTGGTGCCTTGTGGAAATCTGCTCGAAACAAGACATAATAAATATAGTTTCTGATGCAAGTACCGAGACTACAAAAATAGGAATGGAAGCTAGATATAACTGTTGTTTGGCTATCCATAAGACATTCATGTCTTCAACAAGGCTTACGGGCGATCCTGCCCTTGCTGGAATAGCTTCGAAGTTGCAGGAAGCTGTTCAAAATGGTCCATATCTTAGTAGAAGAAAACTGGAAACTCAACCAGCAATAATGACAGCTGAGAGATTTTAG
Protein sequence
MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTLVACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQLIVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTCSSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVEIEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Homology
BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match:
XP_023525525.1 (uncharacterized protein LOC111789113 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1008 bits (2605), Expect = 0.0
Identity = 525/525 (100.00%), Postives = 525/525 (100.00%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM
Sbjct: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match:
XP_022940916.1 (uncharacterized protein LOC111446360 [Cucurbita moschata])
HSP 1 Score: 994 bits (2571), Expect = 0.0
Identity = 518/525 (98.67%), Postives = 521/525 (99.24%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match:
KAG6607893.1 (26S proteasome non-ATPase regulatory subunit 5, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 993 bits (2566), Expect = 0.0
Identity = 517/525 (98.48%), Postives = 521/525 (99.24%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQA+SQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQANSQAVRSLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLV RPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM
Sbjct: 421 EIRLASYRMITGLVPRPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match:
XP_022981236.1 (uncharacterized protein LOC111480433 [Cucurbita maxima])
HSP 1 Score: 986 bits (2550), Expect = 0.0
Identity = 513/525 (97.71%), Postives = 519/525 (98.86%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTD SVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDESVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDA+KKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDALKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
S+IDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SSIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGETRSEND++LNDNAEENL DLIYQTASRSPK+TPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLGDLIYQTASRSPKITPSGLILAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. NCBI nr
Match:
KAG7037420.1 (26S proteasome non-ATPase regulatory subunit 5 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 979 bits (2532), Expect = 0.0
Identity = 517/548 (94.34%), Postives = 521/548 (95.07%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQA+SQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQANSQAVRSLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYE--- 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYEVYY 240
Query: 241 --------------------LVEIEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVI 300
LVEIEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVI
Sbjct: 241 LLLPDFLFTSMCIPRLFLLLLVEIEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVI 300
Query: 301 SGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATL 360
SGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATL
Sbjct: 301 SGRLLSKENIFSLVDESCVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATL 360
Query: 361 LLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIY 420
LLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIY
Sbjct: 361 LLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIY 420
Query: 421 QTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVARPWCLVEICSKQDIINIVSDAS 480
QTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLV RPWCL+EICSKQDIINIVSDAS
Sbjct: 421 QTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVPRPWCLMEICSKQDIINIVSDAS 480
Query: 481 TETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQP 525
TETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQP
Sbjct: 481 TETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQP 540
BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match:
A0A6J1FJQ7 (uncharacterized protein LOC111446360 OS=Cucurbita moschata OX=3662 GN=LOC111446360 PE=4 SV=1)
HSP 1 Score: 994 bits (2571), Expect = 0.0
Identity = 518/525 (98.67%), Postives = 521/525 (99.24%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGETRSEND++LNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match:
A0A6J1IVZ7 (uncharacterized protein LOC111480433 OS=Cucurbita maxima OX=3661 GN=LOC111480433 PE=4 SV=1)
HSP 1 Score: 986 bits (2550), Expect = 0.0
Identity = 513/525 (97.71%), Postives = 519/525 (98.86%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFANYPGVRTD SVKEFFSRFPLPVVINALQAKAEIPGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDESVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVR LACKTVTRLLEETDPTTQLAPQL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRSLACKTVTRLLEETDPTTQLAPQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
IVDYNIYPLLIECLLNGNEQVANSSMDA+KKLAAFPKGMEIIFPTNKTEATHLGTVASTC
Sbjct: 121 IVDYNIYPLLIECLLNGNEQVANSSMDALKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGTNFLPRTSFLQ LS IISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI
Sbjct: 241 IEHGTNFLPRTSFLQLLSSIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
S+IDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG
Sbjct: 301 SSIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGETRSEND++LNDNAEENL DLIYQTASRSPK+TPSGLILAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGETRSENDIMLNDNAEENLGDLIYQTASRSPKITPSGLILAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL+EICSKQDIINIVSDASTETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match:
A0A6J1CFN3 (uncharacterized protein LOC111010386 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111010386 PE=4 SV=1)
HSP 1 Score: 886 bits (2290), Expect = 0.0
Identity = 458/525 (87.24%), Postives = 489/525 (93.14%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEFAVDDPTQLLEAAADFA+YPGVRTDASVKEF RFPLPV+INALQ KAE PGLENTL
Sbjct: 1 MEEFAVDDPTQLLEAAADFASYPGVRTDASVKEFLDRFPLPVIINALQTKAETPGLENTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGAS IPH+MPF+QVGL+ADSQ VR LACKTVT LLEE+D LA QL
Sbjct: 61 VACLDRIFKTKYGASFIPHFMPFIQVGLRADSQTVRDLACKTVTFLLEESDNDAVLAIQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
I+DY IYPLL+ECLLNGNEQVANSSMDAIKKLAAFPKGME+IFPTN+TEATHLGT+ASTC
Sbjct: 121 IIDYGIYPLLLECLLNGNEQVANSSMDAIKKLAAFPKGMEVIFPTNETEATHLGTLASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMAL+VKLFSVS SVASA+YNSNLLNLLESEI+NSNDTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALVVKLFSVSRSVASAIYNSNLLNLLESEINNSNDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGT FLPRTS LQ LS IISN S ESILRSRAMVISGRLLSKEN++ LVDESCVRILI
Sbjct: 241 IEHGTKFLPRTSILQLLSSIISNSSTESILRSRAMVISGRLLSKENMYLLVDESCVRILI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SAIDE LGSSEGQDVNVCESAFEALGQIGS+ GATLLLSS+ TCVK +I+AAFDRHEHG
Sbjct: 301 SAIDEALGSSEGQDVNVCESAFEALGQIGSTNRGATLLLSSFSTCVKLLIHAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNI GETRSEND++LND AEENLRDL+YQ ASRS K+ PSGL LAVLQQDS
Sbjct: 361 KQLAAMHALGNICGETRSENDIMLNDMAEENLRDLMYQIASRSSKIMPSGLFLAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL+EICSKQ+IINIV+DASTETTKIGMEARYNCC+AIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLMEICSKQEIINIVTDASTETTKIGMEARYNCCMAIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SSTRLTGDPALAGIASKLQEAV+NGPYL+RR ETQPA+MTAERF
Sbjct: 481 SSTRLTGDPALAGIASKLQEAVRNGPYLTRRNFETQPAVMTAERF 525
BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match:
A0A0A0L246 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G629990 PE=4 SV=1)
HSP 1 Score: 881 bits (2276), Expect = 0.0
Identity = 455/525 (86.67%), Postives = 487/525 (92.76%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
MEEF+V+DPT+LL+AAA+FANYPGVRTDASVKEF RFPLP +INALQ KAE PGLE+TL
Sbjct: 1 MEEFSVNDPTRLLQAAANFANYPGVRTDASVKEFLDRFPLPAIINALQTKAEFPGLEDTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQ VR LACKTVTRLL+E+D T QL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQTVRTLACKTVTRLLQESDETALSPIQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
I+DY IYPLL++CLLNGNEQVANSSMD+IK LAAFP+GMEII P+NKTEATHLGTVASTC
Sbjct: 121 IIDYGIYPLLLDCLLNGNEQVANSSMDSIKTLAAFPQGMEIIIPSNKTEATHLGTVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMAL+VKLFSVSSSVASAVYN+NLL+LLESEI+NS DTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALVVKLFSVSSSVASAVYNANLLSLLESEINNSKDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGT FLPRTSFLQ L IISN SAESILRSRAMVI GRLLSKENIFSLVDESC+R LI
Sbjct: 241 IEHGTKFLPRTSFLQLLGSIISNSSAESILRSRAMVICGRLLSKENIFSLVDESCLRNLI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SA+D ILGSSEG+DVNV E+A EALGQIGSS WGATLLLSS+PTCVK+VI AAFDRHEHG
Sbjct: 301 SAVDGILGSSEGEDVNVSEAAIEALGQIGSSTWGATLLLSSFPTCVKHVIYAAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGE RSEND++LNDNAEENLRDLIYQ ASRS KMTPSGL LAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGEGRSENDIMLNDNAEENLRDLIYQIASRSSKMTPSGLFLAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL EICSKQDI+NIV DAS+ETTKIGMEARYNCCLAIHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLTEICSKQDIVNIVGDASSETTKIGMEARYNCCLAIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SS RLTGDPALAGIASKLQEAV+NGPYL+RR +ETQPAIMTAERF
Sbjct: 481 SSPRLTGDPALAGIASKLQEAVRNGPYLNRRNVETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. ExPASy TrEMBL
Match:
A0A1S3CKC0 (uncharacterized protein LOC103501781 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103501781 PE=4 SV=1)
HSP 1 Score: 877 bits (2267), Expect = 0.0
Identity = 452/525 (86.10%), Postives = 487/525 (92.76%), Query Frame = 0
Query: 1 MEEFAVDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTL 60
ME+F+V+DPTQLL+AAA+FANYPGVRTDASVKEF RFPLP +INALQ KAE PG+E+TL
Sbjct: 1 MEDFSVNDPTQLLQAAANFANYPGVRTDASVKEFLDRFPLPAIINALQTKAEFPGVEDTL 60
Query: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQL 120
VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQ VR LACKTVTRLL+E+D T A QL
Sbjct: 61 VACLDRIFKTKYGASLIPHYMPFVQVGLQADSQTVRTLACKTVTRLLQESDETVPSAIQL 120
Query: 121 IVDYNIYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTC 180
I+DY IYPLL++CLLNGNEQVANSSMD+IK LAAFP+GMEII P+NKTEATHLG VASTC
Sbjct: 121 IIDYGIYPLLLDCLLNGNEQVANSSMDSIKTLAAFPQGMEIIIPSNKTEATHLGIVASTC 180
Query: 181 SSLGRVRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVE 240
SSLGRVRVMAL+VKLFSVSSSVASAVYN+NLL+LLESEI+NS DTLVTLSVLELLYELVE
Sbjct: 181 SSLGRVRVMALVVKLFSVSSSVASAVYNANLLSLLESEINNSKDTLVTLSVLELLYELVE 240
Query: 241 IEHGTNFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILI 300
IEHGT FLPRTSFLQ LS IISN SAESILRSRAMVI GRLLSKENIFSLVDESCVR LI
Sbjct: 241 IEHGTKFLPRTSFLQLLSSIISNSSAESILRSRAMVICGRLLSKENIFSLVDESCVRNLI 300
Query: 301 SAIDEILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHG 360
SA+D ILGSSEG+DVNV E+A EALGQIGSS WGATLLLSS+PTCVK+ I AFDRHEHG
Sbjct: 301 SAVDGILGSSEGEDVNVSEAAIEALGQIGSSTWGATLLLSSFPTCVKHAIYTAFDRHEHG 360
Query: 361 KQLAAMHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDS 420
KQLAAMHALGNIFGE+RSEND++LNDNAEENLRDLIYQ ASRS KMTPSGL LAVLQQDS
Sbjct: 361 KQLAAMHALGNIFGESRSENDIVLNDNAEENLRDLIYQIASRSSKMTPSGLFLAVLQQDS 420
Query: 421 EIRLASYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFM 480
EIRLASYRMITGLVARPWCL EICSKQ+I+NIV DAS+ETTKIGMEARYNCCL+IHK FM
Sbjct: 421 EIRLASYRMITGLVARPWCLTEICSKQEIVNIVCDASSETTKIGMEARYNCCLSIHKAFM 480
Query: 481 SSTRLTGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 525
SS RLTGDPALAGIASKLQEAV+NGPYL+RR +ETQPAIMTAERF
Sbjct: 481 SSPRLTGDPALAGIASKLQEAVRNGPYLNRRNVETQPAIMTAERF 525
BLAST of Cp4.1LG02g01820 vs. TAIR 10
Match:
AT3G15180.1 (ARM repeat superfamily protein )
HSP 1 Score: 565.5 bits (1456), Expect = 4.6e-161
Identity = 292/520 (56.15%), Postives = 385/520 (74.04%), Query Frame = 0
Query: 6 VDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTLVACLD 65
++D QL +AA +FA+YPG + + SVKEF RFPLPV+ NALQ +IPG ENTLV CL+
Sbjct: 1 MEDVNQLFDAAFEFAHYPGAQNETSVKEFLDRFPLPVIFNALQTDPDIPGFENTLVTCLE 60
Query: 66 RIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQLIVDYN 125
R+FKTKYGASLIP YMP +QVGL+ADS V+ LACKTV LLE+ D + QL+V+
Sbjct: 61 RLFKTKYGASLIPQYMPVLQVGLKADSAVVKSLACKTVLCLLEDCDTNDVSSVQLVVNNG 120
Query: 126 IYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTCSSLGR 185
IYPLL++ ++N +++VAN++ + IK LA FP M +IFP+ + THL +A+ CSSL R
Sbjct: 121 IYPLLLDYIINSDDEVANAASETIKSLARFPDAMSVIFPSETNDPTHLRNLAARCSSLAR 180
Query: 186 VRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVEIEHGT 245
VRV++LIVKLFS+S VAS V S LL+LLE+E+ + DTLV L+VLEL YEL+E+EH +
Sbjct: 181 VRVLSLIVKLFSISRLVASEVKKSGLLDLLEAEMKGTKDTLVILNVLELYYELMEVEHSS 240
Query: 246 NFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDESCVRILISAIDE 305
F+P+TS +Q L IIS S + RAM+ISGRLLSKENI+ +V+E+ V+ LISAID
Sbjct: 241 EFVPQTSLIQLLCSIISGTSTGPYEKLRAMMISGRLLSKENIYKVVEEASVKALISAIDG 300
Query: 306 ILGSSEGQDVNVCESAFEALGQIGSSKWGATLLLSSYPTCVKYVINAAFDRHEHGKQLAA 365
L S E D + E+A +ALGQ+GS+ GA L+LS+ P ++V+ +AFDR+ HGKQLAA
Sbjct: 301 SLESVEMNDTDAQEAAIDALGQMGSTTKGADLVLSTSPPAARHVVASAFDRNAHGKQLAA 360
Query: 366 MHALGNIFGETRSENDVLLNDNAEENLRDLIYQTASRSPKMTPSGLILAVLQQDSEIRLA 425
+HAL NI GETR +++ +++ AEE+LR LIY A++S K+TPSGL L+VLQQ SEIRLA
Sbjct: 361 LHALANIAGETRPKSNRIVDGKAEESLRCLIYDAAAQSTKLTPSGLFLSVLQQSSEIRLA 420
Query: 426 SYRMITGLVARPWCLVEICSKQDIINIVSDASTETTKIGMEARYNCCLAIHKTFMSSTRL 485
YR +T LVARPWCLVEI +K++IINIV+DA+TET KI MEARYNCC AIH+ F+ S
Sbjct: 421 GYRTLTALVARPWCLVEILAKEEIINIVTDATTETAKIAMEARYNCCKAIHEAFLCS-NF 480
Query: 486 TGDPALAGIASKLQEAVQNGPYLSRRKLETQPAIMTAERF 526
DP KLQEAV++GPY+S++ +P +MT E F
Sbjct: 481 VDDPRRLKTGDKLQEAVRSGPYMSKKHRGARPEVMTGEGF 519
BLAST of Cp4.1LG02g01820 vs. TAIR 10
Match:
AT3G15180.2 (ARM repeat superfamily protein )
HSP 1 Score: 552.7 bits (1423), Expect = 3.1e-157
Identity = 293/552 (53.08%), Postives = 386/552 (69.93%), Query Frame = 0
Query: 6 VDDPTQLLEAAADFANYPGVRTDASVKEFFSRFPLPVVINALQAKAEIPGLENTLVACLD 65
++D QL +AA +FA+YPG + + SVKEF RFPLPV+ NALQ +IPG ENTLV CL+
Sbjct: 1 MEDVNQLFDAAFEFAHYPGAQNETSVKEFLDRFPLPVIFNALQTDPDIPGFENTLVTCLE 60
Query: 66 RIFKTKYGASLIPHYMPFVQVGLQADSQAVRGLACKTVTRLLEETDPTTQLAPQLIVDYN 125
R+FKTKYGASLIP YMP +QVGL+ADS V+ LACKTV LLE+ D + QL+V+
Sbjct: 61 RLFKTKYGASLIPQYMPVLQVGLKADSAVVKSLACKTVLCLLEDCDTNDVSSVQLVVNNG 120
Query: 126 IYPLLIECLLNGNEQVANSSMDAIKKLAAFPKGMEIIFPTNKTEATHLGTVASTCSSLGR 185
IYPLL++ ++N +++VAN++ + IK LA FP M +IFP+ + THL +A+ CSSL R
Sbjct: 121 IYPLLLDYIINSDDEVANAASETIKSLARFPDAMSVIFPSETNDPTHLRNLAARCSSLAR 180
Query: 186 VRVMALIVKLFSVSSSVASAVYNSNLLNLLESEISNSNDTLVTLSVLELLYELVEIEHGT 245
VRV++LIVKLFS+S VAS V S LL+LLE+E+ + DTLV L+VLEL YEL+E+EH +
Sbjct: 181 VRVLSLIVKLFSISRLVASEVKKSGLLDLLEAEMKGTKDTLVILNVLELYYELMEVEHSS 240
Query: 246 NFLPRTSFLQQLSCIISNRSAESILRSRAMVISGRLLSKENIFSLVDES----------- 305
F+P+TS +Q L IIS S + RAM+ISGRLLSKENI+ +V+E+
Sbjct: 241 EFVPQTSLIQLLCSIISGTSTGPYEKLRAMMISGRLLSKENIYKVVEEARPVVPCLKASV 300
Query: 306 ---------------------CVRILISAIDEILGSSEGQDVNVCESAFEALGQIGSSKW 365
CV+ LISAID L S E D + E+A +ALGQ+GS+
Sbjct: 301 CCAHKTSDEVEKLTTNCFVSECVKALISAIDGSLESVEMNDTDAQEAAIDALGQMGSTTK 360
Query: 366 GATLLLSSYPTCVKYVINAAFDRHEHGKQLAAMHALGNIFGETRSENDVLLNDNAEENLR 425
GA L+LS+ P ++V+ +AFDR+ HGKQLAA+HAL NI GETR +++ +++ AEE+LR
Sbjct: 361 GADLVLSTSPPAARHVVASAFDRNAHGKQLAALHALANIAGETRPKSNRIVDGKAEESLR 420
Query: 426 DLIYQTASRSPKMTPSGLILAVLQQDSEIRLASYRMITGLVARPWCLVEICSKQDIINIV 485
LIY A++S K+TPSGL L+VLQQ SEIRLA YR +T LVARPWCLVEI +K++IINIV
Sbjct: 421 CLIYDAAAQSTKLTPSGLFLSVLQQSSEIRLAGYRTLTALVARPWCLVEILAKEEIINIV 480
Query: 486 SDASTETTKIGMEARYNCCLAIHKTFMSSTRLTGDPALAGIASKLQEAVQNGPYLSRRKL 526
+DA+TET KI MEARYNCC AIH+ F+ S DP KLQEAV++GPY+S++
Sbjct: 481 TDATTETAKIAMEARYNCCKAIHEAFLCS-NFVDDPRRLKTGDKLQEAVRSGPYMSKKHR 540
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_023525525.1 | 0.0 | 100.00 | uncharacterized protein LOC111789113 [Cucurbita pepo subsp. pepo] | [more] |
XP_022940916.1 | 0.0 | 98.67 | uncharacterized protein LOC111446360 [Cucurbita moschata] | [more] |
KAG6607893.1 | 0.0 | 98.48 | 26S proteasome non-ATPase regulatory subunit 5, partial [Cucurbita argyrosperma ... | [more] |
XP_022981236.1 | 0.0 | 97.71 | uncharacterized protein LOC111480433 [Cucurbita maxima] | [more] |
KAG7037420.1 | 0.0 | 94.34 | 26S proteasome non-ATPase regulatory subunit 5 [Cucurbita argyrosperma subsp. ar... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1FJQ7 | 0.0 | 98.67 | uncharacterized protein LOC111446360 OS=Cucurbita moschata OX=3662 GN=LOC1114463... | [more] |
A0A6J1IVZ7 | 0.0 | 97.71 | uncharacterized protein LOC111480433 OS=Cucurbita maxima OX=3661 GN=LOC111480433... | [more] |
A0A6J1CFN3 | 0.0 | 87.24 | uncharacterized protein LOC111010386 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
A0A0A0L246 | 0.0 | 86.67 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G629990 PE=4 SV=1 | [more] |
A0A1S3CKC0 | 0.0 | 86.10 | uncharacterized protein LOC103501781 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |