Cp4.1LG01g20100 (gene) Cucurbita pepo (Zucchini)

Name: Cp4.1LG01g20100
Type: gene
Organism: Cucurbita pepo (Zucchini)
Description: Unknown protein
Location: Cp4.1LG01 : 17134976 .. 17144247 (+)
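The location above gives a start and an end coordinate on Cp4.1LG01. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the usual GFF-style convention, which this record does not state explicitly), the span of the gene can be computed as follows. The function name `span_length` is only illustrative.

```python
def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location field of this record.
gene_length = span_length(17134976, 17144247)
print(gene_length)  # 9272
```

Under that assumption the gene spans 9272 bp on the plus strand, introns included.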
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCTATGGTTTGTCACGTCTCTAGTGAAAGCAACAGGACACAACCAGCCACCATGGTTGGCGCTATTAGCTTACACACATGCATTTGCAAGTGCAGCTTGCTTTAGTTTGCTATTGTCATAGTGCTTCACTCACTCATCATTTCAAAACTTTTCACTTTCTCACGACTCTATTTGATATTGAAGTGAACCCTCCTTAACCCATCGTTAAGAAATATTGATCATTCGCTTAATCGTAAGTAGTTCATTAAGTCATCGATATTTTCTAAAGGGAGTCGTTGAAACACGAAACCGTCAACCAAGTTAAAGAAAATATCGATGATTCAGTGAACTTGTGAGATTCCACGTCAGTTGGAGACGGAAACATAATATTCTTCTCCCTAACAGACGCGTTTTAGAAACCGTGAGAAGAAACCCGAAAGAGCTTGAACTGTTACAGAACTGTTGACAAAACTAAAAGTGTATCGTCGGCCAAATCAAGGTAAATCTCGATGGTTTGTCGATCTATTCTTAGGAAACGTTAAAGCCAACTATTTGGGTCGGACATTCCCATAGAGGTTGACTAGACCTCATAAAGCATCGGCCAAAGTTAGCCCAAAAGGATGTGTTGACCAAATTGTCGATACGAGTTTTTGCGTCAAGAACAGGTTGATGTGACATTGAGCACCAAATGAGTATTATGCTTATTTTTTTATTCCTTCTTTTGTCGTTTTTCAATCTTATTCTTAAATTTTTGTAGCGTTCTTAATTTCTATTCTTTTATATGTTAGATTCTTTCGTCTAGATATCTTATTAACGTCGTGACTGAGCTCAAGATCAATCCTATCACCTTTAAAATTACCTCTCCATTTTATTCACACAAAACACACCATTAACTTTTATGCCATAAATGGATTAGATTAAACATGGGCAACCTAGAAAAGACAAGTTGTTAAGATATTGGTGAGGTGGAAAATGGCAATTGGTTAGGTTAGTTAGTTAAATCGTTACTAAAATTTCATAAAATTTGAATATGAACATTTATAAGTCACGAACACATCACTAACATTAATTATAAAGGTTAAATATTCTTAATCTTTTCAAAAATAGTTTGAATAATTACAGTCCTATTTTGCAGGTTGGTCCGGCTTATTTGACCAAAAAAATGTTAACCCACAAACTAAAGCTAATTTTTTATTTTTTAAAAATTAAACTCTGACCTATATTAAATTTTTTTAAATTCAATTAACTTTTATAATTTGAGTTGCATCGTACCAGATAGTCGAATTATCACATCATATGAACACCTCTAACGGTTCAATTTATCAAGTATTTTTATTAATATAAGTAAAAATTTAAAAACCTTAATTAATGTTTACCACTACGATGCTGACACGTGGTGATCTACAACCATCCAACGGTTCAGATTAGTTGTAAACTCAGCTTTTATTTGGAATATGGAATAATTATAGAAGCCACTCAATTGGGAATCTAGCCGCTAAAACATCCAAGTTCATCGTCGAAGCGTCGGCGGCCACATTACTCTATTCAGTGCCAAAAGGCTTCAATTTCCGGTCCAGTTCAACCATGGATGAATTTGTTTGACTTTTGGTTAAAAATGAACTCCGATTTGACTCAATTTGTTTGGTAATTGCAGAGTTTTCGCCAATAATCAAACAGAGCCCTAAGTTTGTGGCATCGCTTTCGATGGGGAGCGACGAATCGGGAATGTTGCAGAGAAATGAAGGCGGTAAGATTTCGACACATCAAAACACCTTTTCGATTTCCTTTTGTTAGCAGAAAGATTGCAATAGATTAGAGTAGCGGAATTTTTGGATATTTGATTTCTTTTTTAAAGAGGAAGTGTGTGGATTTGTTCGTTGCATTGAATGTGGACGGAATTTGAGCTATCTTCTCAAATCTTACAGTTGTTTGTGTACTGATTATAAATTAATTTAGGAATAGGAGGATCCTTCCTTCGAAAAGAAACCCTAGAAGCCATCGAGTGTGGAAAAGGGGGG
AAACTGGTTATGGACTTGAACGATAATGGGATATTTCTTTTACTTTTTTTAGTTCTATTCTCTTTTTGCATTTCTTTTCAATAAAGTAATGGAGAACTTATGAACCTTTTCCTACTTTCCTGTACTTTTTTCTTCATGGTTTCCTGACTGAGCAAAAGTGCCCCAAGTGCGATTTGTTGAGCTATTGAACTCCTCCACTGTGGATGGTTTGCAGGAGGAGCTGGGAAAATAAGACATGTAGTCTCTTTTTACCATCCATTATTGAGCTATAAACCCAACCATGGTGGCCATGTTTTTGGGTTTTGATTTGCATGGCTGGTTAGCCAGAGGATATCCTCTTTGAACTTACTCTTTTGGGTTTTCCCGTCACAGTTTTAAAACGCGTCTAGTAGTGAGAGGTTTCCACACCCTTAGACGTTTGCTTTTACAACTAATGTAGGATCTCAAAATCCACCCCCTTCAAGGCCTAGCATTCTCGCTAGTACACCACTCCCACCCCCTTCGAAGCTCAGCGTCTTCATTGACACATCGTCTGGTGTTTGGCTCTGATACCATTTGTAACAACCCAAGCTCTCTCCTAGTGGTGGGCTTGAACTGTTACAAATGGTATTAGAGCCGGTCATCGGACGGTGCATGGGACTAGTTAGAGTAGGGCTAGACCCTTTCCAAAGTAGACGCGTTTTAAAACCTTGAGGAGAAGTCTGGAAAGGAAAGCTCAAAAAGAACAATATTTACTAGAGTTTGAACGGTATGAGGTCCCATTATAGAGAAAAATTTATTAGAAAAGGGCGTTACAATGTAGCTTGTACTTTTAAACTATTTGATGTATGATAGTGTGAGAGAAAAAGATTTTCAAATCTAACCCGTTCTTATAATAGCCAGACAGTTTCTTATAAGAGGATTTTTTGTTTGTAGAAGTGAATTCTCCATCTGAGTTCCTGGTTTCCCACAGCTTGTTGAATCATTTGTCCTATTCGATTGTCTTTATATAATTTCGACCCTTGTATTGAATTGAAACATTTGATCACATTCTTAGACTTCTATTTGCTTGTTCTATTCCACATAAAACCCCTAAAGAAATGTCACACAAATGCAAACATTGACAGCTTGTGAAGAAAAGTATACATGGGCCTAACTGATACTGAAGCTTTTTTTGGATCTCTTTCTGTGTAAGGTCTTGTCCTGATTAGTGAGTGGCCCACGAGGTGAATTTTTGACATTACTTAGTCTTACAATCTTAGAACAGAAGCTTTCTCACATGGCCAGTTCTAGGCAAATTAGTAACGAAATATTAATAGTTGTGTACCCAAGTCGATGCCTTCTCAAACATTAATAGTTGCTATGGTTGAGGATATCGACACTTCACACCAAATAAATGGGTTTCTTCCATTGTTTTTTCTTCTGTGGTTTCCTCACCACGGGGGAACGTCAAGGGGCCCTTCCAATGGCGAACTTGAACACAATAGAGATTGACATTAGAAATCTTTCATCTTTATGCAACAATACTCAGTACATTCACAAGGTGGGGGGTTCTATGAAGAGACATTCTTAGTGCAGATGATTCTCTAAGAACACTCCAAGGATCAATCGAAGTCCTGGAAGTTGGTCTTGGGAATTGAATCTTCCCGATACTCTTCTAGATGCATGGACATTGAACACTATATACTTTCGAATTTACTATACATATATACTCATCTAGGATGTATATATCATGGACACTGGACACTATCATGGTCCGGAAAGCTGACTTCAGAATTCTTTTGAACCCACACTAGGGATTCTAAGTCTAAGGTAGCTGAATACTTTTTTCTGTTTCCATTAGAATCTGATTTTTTTCAATTTGTATGATTAGATTTAGGCTCTTCTATAACCCTTGAATCCTTGATAAGAATATGCATATCCTTGACTTTGGCTTTACAGGGTGGGTTTTCTGATTGACTTTCTTTCTCACTCTGCCATTGTTGGTTTTATGGGTGGGAGCTGCTATTGCCCTTTCACTT
TCACTCTCATCTTAACTGAACTCTTTAACTGAACTCTTTAAGTAAGGACACTATCTTCATTAGTATGAGACATTTTCGAGAAATCAAAAGCAAGGCCATAGAACTTATATTCAAAGTAGACAATATCATATCATTGTGGAGAGTCGTGGTTTCTAACAGGTATGTACTGATCCAACACAAATCTAATAGTTTCTCCTCTCGCACCAAAATTGAGTAGAGAATAACGCAAATGAAACAACACATAAAAAGATCTCAAAAATCTCATCGCATTCCTGTAATTCCTCTGCAAGAACTGATACCCAAATTGTTAAATAAGTGAAAGAACCATGGGCACACGAGACATTGATACCCCGATACTCACATCACTGTCAGACGTAGTGGCTCATTGATTGAACTTCCGCAGATTGATTGAATTCATCGTCAAAGACCCTTCAAACTTCAACACTTTTCACGTTTCAAGGATAGTGTCGGATGGCACAGGTGTGCCGTCATTGCACTCGAGCCCATGTGTCGGAGGTTGGATCATATACATCCTATCTGATACAGAGTCTGTAAATGGTATGACATGGATCAAGAGGGTCCACAATACCTCAATAGAATCTAGATGGATGAATGTTCAGAATGTCTCGATATTATGAACAACACATGAGCTCGTTCTCGATAGGCAAGACCAAGTGAATTATAATGGTCGATGACATCCCAAGACTTGAGATGACATCATGATATGAATGTCATGTCAATTGAGAACCAAGACGAGATTTGAGATGCAACATTGTACACGAGAGACCTTGTCCCCAAAGTAGAGGCAAGGTCAAGCCGACTAACGTGGCCAGTTAAAATTGTAAATGATTGAAAATACACATAAGTAGTGTTTGGTGGGTAAAAACAAATCTTGTCTAAGGTCCCACATAGCTACACGGACCAACCAACAAACTAAATAGGGGGCTACAAAGTCAAATTTGGAGGATAGTAGTGGATAATAATTATGAATATTATATTATATTATAAACTTTTCATATACAAAACATTATTAAAATATCCAACTATAATTCTAAAAAATTAAAGAAATGCATTAATGCATTTTTTTTTTTCTATCTTCAAAATTGTGGAAATAAAAATAGAGTGATTAGAAAATTGGAGACGCCATGGCATCACAATGGTCGGAATAAAGCAAGAAGGGTGAGAAGAAGCGGCGTGATTAGATTACAGAAAATGGTGAAAGATGGGGCATCTGACGGTGCCGGCACCGGCGAGTCCTCATCCGATGGTTCCACCGGTGCTGGTGGCCCTGTCGGAGCACTACTTGGCGGCACTTCTGGTGCGGGTGGATTAACTGCAACCATTAAAAAAACATAATTACAAAGATGGTCCCTATATTTGGCAATATTTATTTATTGTACTTTTGTCTTTGTGATTTTACAATTATATCCCCAGTTTGAAAAGAAAGTGGGGTTGTGTAAGCGTACCTGGAGCAGGCGTTTGAGCCATAGTTGGAGCGGGCACTGGCGCTCCTGTACCGGGAGCTCGTGATGGCACTCTGCTACCTGGAAATGGCGTAGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTTGGCGCTGAAGAACCGGGAGCTCGCGATGGCGCTCTATTACCTGGAAATGGCGTTGGTGCTGCATCACCTAGGGCTCGCGATGGCGCTTTACTACCTGGAAATGGAGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTCGTTGGTGCTGCGGCACCGGGGGCCCGCGATGGCACTTTACTACCTGCAAATGTCGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGCTTGCGATGGCGCTATACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGTGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTCGTG
ATGGCGCTGTACTACCTGCAAATGTAGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGCTTGCGATGGCGCTATACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGTGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGCAAATGTAGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGATTGCGATGGCGCTTTACTACCTGCAAATGGCATTGGCACTTCAGCACCGGGGGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGATTGCGATGGCGCTTTACTACCTGCAAATGGCATTGGCACTTCAGCACCGGGGGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTTGTGATGGCGCTATACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGGAGCTTGCAATGGTGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTTGTGATGGCGCTATACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGGAGCTTGCAATGGTGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCAATGGCGCTTTACTGTCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCCGCAAATGGCATTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCATGGGGAGCTTGCGATGGTGCTTTACTACCTGGAAATGGCATTAACGCTGCTGCACCAGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGCGATGGCGTTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCACTGCTGCATCAGGAGCTCGCGATGGCGCTTTACTACTTGGAGTTGTCGTTGACGCTGCAGCACCGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTTGGCGCTGCGGCAACTGGAGCT
CGCGATGAGGCTTTAGTACCTGGAATTGTCGTTGGCCCTGCTGCAACGGGAGCTCGCGATGGCGCTTTACTACCTGGAACTATCCTTGGCGCTGCAGCAACGGGAGCTTGTGAAGGCGCTGCAGCAACGGGAGCTTGTGAAGGCGCTGCAGCAACGGGAGCTTGTGAAGGCGCTTTACTACCTGGAATTATCGTTGGCGCTGCGGCAACCGGAGCTCGTGATGAGGCTCTACTACCTGGAATTGTCGTTGGTGCTGCAGCACGGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCTCGATGGCGTTTTACTGCCGGGAGTTGTCGTTGGCGCTGCGGCACCGGGAGCTCGTGAAGGTGCTTTACTACCTGTAATTGTCGTTGGTGCTGCGGCAACTGGAGCTCGCGATGGTGCTTTACTACCTGGAATTGTCGTTGGCGCTGCAGCACCAGGAGCTCGCAAAGGCGCTTTACTACCTGAAATGGTCATTGGCGCTGCGGCAACCGGAGCTCGCGGTGGCGCTTTAATACCTGGAGTTGTTGTCGCTCCAGCACCGGGAGCTCGCGAAGGTGCTTTACTACCTGGAATTGTCGTTGGTGCCGCAGCAACCGGAGCTCGCGATGGTGATTTACTACCTGGAACTGTCCTTGGCGCTTCAGCAACAGGAGCTCGCGATGGCGATTTACTACCTGGAACTGTCCTTGGCGCTTCAGCAACAGGAGCTCGCGATGGCACTTTACTACCTGGAACTGTCCTTGGCGCTGCAGCAACAGGAGCTCGCGATGGCGCTTTACCACCTAGAAACGTCGTTGGCGCTGGCGGTTTACTATACGAAACTGTAGCAGGGGCCATACCAGGGGGATGA

mRNA sequence

ATGGCTATGCCGCTAAAACATCCAAGTTCATCGTCGAAGCGTCGGCGGCCACATTACTCTATTCAGTGCCAAAAGGCTTCAATTTCCGAGTTTTCGCCAATAATCAAACAGAGCCCTAAGTTTGTGGCATCGCTTTCGATGGGGAGCGACGAATCGGGAATGTTGCAGAGAAATGAAGGCGAAGTGAATTCTCCATCTGAGTTCCTGACGCCATGGCATCACAATGGTCGGAATAAAGCAAGAAGGGTGAGAAGAAGCGGCGTGATTAGATTACAGAAAATGGTGAAAGATGGGGCATCTGACGGTGCCGGCACCGGCGAGTCCTCATCCGATGGTTCCACCGGTGCTGGTGGCCCTGTCGGAGCACTACTTGGCGGCACTTCTGGTGCGGTTGGAGCGGGCACTGGCGCTCCTGTACCGGGAGCTCGTGATGGCACTCTGCTACCTGGAAATGGCGTAGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTTGGCGCTGAAGAACCGGGAGCTCGCGATGGCGCTCTATTACCTGGAAATGGCGTTGGTGCTGCATCACCTAGGGCTCGCGATGGCGCTTTACTACCTGGAAATGGAGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTCGTTGGTGCTGCGGCACCGGGGGCCCGCGATGGCACTTTACTACCTGCAAATGTCGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGCTTGCGATGGCGCTATACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGTGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGCAAATGTAGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGCTTGCGATGGCGCTATACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGTGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGCAAATGTAGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGATTGCGATGGCGCTTTACTACCTGCAAATGGCATTGGCACTTCAGCACCGGGGGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGATTGCGATGGCGCTTTACTACCTGCAAATGGCATTGGCACTTCAGCACCGGGGGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTTGTGATGGCGCTATACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGGAGCTTGCAATGGTGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGC
TTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTTGTGATGGCGCTATACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGGAGCTTGCAATGGTGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCAATGGCGCTTTACTGTCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCCGCAAATGGCATTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCATGGGGAGCTTGCGATGGTGCTTTACTACCTGGAAATGGCATTAACGCTGCTGCACCAGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGCGATGGCGTTGTACTACCTGAAAATCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCACTGCTGCATCAGGAGCTCGCGATGGCGCTTTACTACTTGGAGTTGTCCACCGGGAGCTCGCGATGGGGCTTTACTACCTGGACACGGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCTCGATGGCGTTTTACTGCCGGGAGTTGTCGTTGGCGCTGCGGCACCGGGAGCTCGTGAAGGTGCTTTACTACCTGTAATTGTCGTTGGTGCTGCGGCAACTGGAGCTCGCGATGGTGCTTTACTACCTGGAATTGTCGTTGGCGCTGCAGCACCAGGAGCTCGCAAAGGCGCTTTACTACCTGAAATGGTCATTGGCGCTGCGGCAACCGGAGCTCGCGGTGGCGCTTTAATACCTGGAGTTGTTGTCGCTCCAGCACCGGGAGCTCGCGAAGGTGCTTTACTACCTGGAATTGTCGTTGGTGCCGCAGCAACCGGAGCTCGCGATGGTGATTTACTACCTGGAACTGTCCTTGGCGCTTCAGCAACAGGAGCTCGCGATGGCGATTTACTACCTGGAACTGTCCTTGGCGCTTCAGCAACAGGAGCTCGCGATGGCACTTTACTACCTGGAACTGTCCTTGGCGCTGCAGCAACAGGAGCTCGCGATGGCGCTTTACCACCTAGAAACGTCGTTGGCGCTGGCGGTTTACTATACGAAACTGTAGCAGGGGCCATACCAGGGGGATGA

Coding sequence (CDS)

ATGGCTATGCCGCTAAAACATCCAAGTTCATCGTCGAAGCGTCGGCGGCCACATTACTCTATTCAGTGCCAAAAGGCTTCAATTTCCGAGTTTTCGCCAATAATCAAACAGAGCCCTAAGTTTGTGGCATCGCTTTCGATGGGGAGCGACGAATCGGGAATGTTGCAGAGAAATGAAGGCGAAGTGAATTCTCCATCTGAGTTCCTGACGCCATGGCATCACAATGGTCGGAATAAAGCAAGAAGGGTGAGAAGAAGCGGCGTGATTAGATTACAGAAAATGGTGAAAGATGGGGCATCTGACGGTGCCGGCACCGGCGAGTCCTCATCCGATGGTTCCACCGGTGCTGGTGGCCCTGTCGGAGCACTACTTGGCGGCACTTCTGGTGCGGTTGGAGCGGGCACTGGCGCTCCTGTACCGGGAGCTCGTGATGGCACTCTGCTACCTGGAAATGGCGTAGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTTGGCGCTGAAGAACCGGGAGCTCGCGATGGCGCTCTATTACCTGGAAATGGCGTTGGTGCTGCATCACCTAGGGCTCGCGATGGCGCTTTACTACCTGGAAATGGAGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTCGTTGGTGCTGCGGCACCGGGGGCCCGCGATGGCACTTTACTACCTGCAAATGTCGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGCTTGCGATGGCGCTATACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGTGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGCAAATGTAGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGCTTGCGATGGCGCTATACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGTGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGCAAATGTAGTTGGTGCTGCAGCACCGGGAGCTCGCGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGATTGCGATGGCGCTTTACTACCTGCAAATGGCATTGGCACTTCAGCACCGGGGGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCTGGGAGATTGCGATGGCGCTTTACTACCTGCAAATGGCATTGGCACTTCAGCACCGGGGGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTTGTGATGGCGCTATACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGGAGCTTGCAATGGTGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCATGGGGAGC
TTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTTGTGATGGCGCTATACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGGAGCTTGCAATGGTGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCAATGGCGCTTTACTGTCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCCGCAAATGGCATTGGCGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCATGGGGAGCTTGCGATGGTGCTTTACTACCTGGAAATGGCATTAACGCTGCTGCACCAGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGCGATGGCGTTGTACTACCTGAAAATCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCACTGCTGCATCAGGAGCTCGCGATGGCGCTTTACTACTTGGAGTTGTCCACCGGGAGCTCGCGATGGGGCTTTACTACCTGGACACGGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGGGCTTTACTGCCAGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCTCGATGGCGTTTTACTGCCGGGAGTTGTCGTTGGCGCTGCGGCACCGGGAGCTCGTGAAGGTGCTTTACTACCTGTAATTGTCGTTGGTGCTGCGGCAACTGGAGCTCGCGATGGTGCTTTACTACCTGGAATTGTCGTTGGCGCTGCAGCACCAGGAGCTCGCAAAGGCGCTTTACTACCTGAAATGGTCATTGGCGCTGCGGCAACCGGAGCTCGCGGTGGCGCTTTAATACCTGGAGTTGTTGTCGCTCCAGCACCGGGAGCTCGCGAAGGTGCTTTACTACCTGGAATTGTCGTTGGTGCCGCAGCAACCGGAGCTCGCGATGGTGATTTACTACCTGGAACTGTCCTTGGCGCTTCAGCAACAGGAGCTCGCGATGGCGATTTACTACCTGGAACTGTCCTTGGCGCTTCAGCAACAGGAGCTCGCGATGGCACTTTACTACCTGGAACTGTCCTTGGCGCTGCAGCAACAGGAGCTCGCGATGGCGCTTTACCACCTAGAAACGTCGTTGGCGCTGGCGGTTTACTATACGAAACTGTAGCAGGGGCCATACCAGGGGGATGA

Protein sequence

MAMPLKHPSSSSKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMGSDESGMLQRNEGEVNSPSEFLTPWHHNGRNKARRVRRSGVIRLQKMVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVPGARDGTLLPGNGVGAPVPGARDGALLPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAAAPGARDGALLPANVVGAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIGASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIGASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGVVVGACDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGVVVGACDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGINAAAPGARDGALLSGVVVGATAPGARDGVVLPENHGELAMALYYLEMALALLHQELAMALYYLELSTGSSRWGFTTWTRGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGALDGVLLPGVVVGAAAPGAREGALLPVIVVGAAATGARDGALLPGIVVGAAAPGARKGALLPEMVIGAAATGARGGALIPGVVVAPAPGAREGALLPGIVVGAAATGARDGDLLPGTVLGASATGARDGDLLPGTVLGASATGARDGTLLPGTVLGAAATGARDGALPPRNVVGAGGLLYETVAGAIPGG
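The protein sequence above corresponds to a codon-by-codon translation of the coding sequence. The sketch below shows one way to reproduce it, assuming the standard genetic code (NCBI translation table 1); the record does not say which table the database used, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown or ambiguous codons are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the CDS in this record.
print(translate("ATGGCTATGCCGCTAAAACATCCA"))  # MAMPLKHP
print(translate("GCCATACCAGGGGGATGA"))        # AIPGG
```

The two fragments match the start (MAMPLKHP) and the end (AIPGG) of the protein sequence listed above.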
BLAST of Cp4.1LG01g20100 vs. TrEMBL
Match: A0A0H5P1W0_NOCFR (Uncharacterised protein OS=Nocardia farcinica GN=ERS450000_04455 PE=4 SV=1)

HSP 1 Score: 97.1 bits (240), Expect = 1.7e-16
Identity = 330/1008 (32.74%), Positives = 408/1008 (40.48%), Query Frame = 1

Query: 102  GAGTG-ESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVPGARDGTLLPGNGVGAPVPGA 161
            G GTG E + D     G  +GA LG   G VG G GA + GA D     G G+GA + GA
Sbjct: 138  GIGTGLEGAIDAGVDLGAGLGAGLGAGLGVVG-GVGAGLEGAIDAVAGVGAGLGAGIGGA 197

Query: 162  RDGALLPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAAAPGARDGALL 221
             D     G GVGA   GA D     G GVGA    A D     G G+GA   GA +G   
Sbjct: 198  VDAVAGVGAGVGAGLEGALDAVAGVGAGVGAGLEGAVDAVAGVGAGLGAGLEGALEGVTD 257

Query: 222  PANVVGAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIG 281
                +G A  G           +G    G+ DG V      GA   GA DG +    G+G
Sbjct: 258  VTAGLGGALGGVAG--------LGGGLAGSLDGVV----DAGAGLGGALDGVVEAGAGLG 317

Query: 282  ASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAI 341
            A+  GA D     G   GA DGAV     +G A  GA D          A   GA DGA+
Sbjct: 318  AAVGGAVDAVAGVG---GAVDGAVEAGAGLGGAVGGAVDAV--------AGVGGALDGAV 377

Query: 342  LPANGIGASAPGARD-----GALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVV 401
                G+G +  GA D     G  L GVV     GA+     +G A  G  D         
Sbjct: 378  EAGAGLGGAVGGAVDAVAGVGGGLDGVVDAGVGGALGAVTGIGGALDGVVD--------A 437

Query: 402  GAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAALG 461
            GA   G  DGAL    GIG    GA DGA+ +G  +G                      G
Sbjct: 438  GAGVGGAVDGALGAVAGIG----GAVDGAVDAGAGLG----------------------G 497

Query: 462  DCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGAL 521
              DGA+    GIG    GA DGA+ +GV +GA   GA DGAV       A   G  +GA+
Sbjct: 498  AVDGAVDAVAGIG----GALDGAVDAGVGLGAGIGGALDGAV----DAAAGVGGGLEGAV 557

Query: 522  LPGNGISAAAPGARDGALLSGVVVGA-CDGAILPENVVGAAARGACNGALLPGNGISAAA 581
              G G+  +  GA  GA  +G  + A  DGA      VG A+ G   GA+  G  ++A  
Sbjct: 558  GAGAGLGGSLEGALGGAAEAGAGLAAGLDGA---AGAVGGASAGLA-GAINAGTDLAAGL 617

Query: 582  PGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDG 641
             GA DGAL +G  VG    GA DG V     + +   GA DGAL  G G++    GA  G
Sbjct: 618  DGAVDGALGAGAGVG----GALDGVVAAGADLTSGLNGAVDGALGAGAGLTTGLEGALGG 677

Query: 642  ALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGV 701
            A+ +G  +     GA DGAV      GA   GA  GA+  G G +A   GA  GA+ +G 
Sbjct: 678  AVDAGAGLTTGLEGAVDGAV----GAGAGLEGALGGAVEAGAGAAAGVGGAVGGAVEAGT 737

Query: 702  VVGA-CDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVV-VGATAPGA 761
             + A  +GA+     V A   G   G L   +G   AA G   G  LSG V  G  A G 
Sbjct: 738  GLAAGLEGAVAAGGDVAAGLEGGLFGGL---DGAVDAASGVAGG--LSGAVNAGGQAVGG 797

Query: 762  RDGAVLPENVVGAAARGACD----GTLLPANGIGASAPGARNGALLSGVVVGATAP---- 821
             + ++         A G       G L     + A   G  +G L+SGV  G TA     
Sbjct: 798  LESSLSAGLGAALGAGGELSTELGGALDGGADLAAGLDGLVDGELVSGVDGGLTAAIGGV 857

Query: 822  -GARDGAVLPENVVG-----AAARGACDGTLLPANGIGATAPGARDGAVLPENVVGAAAW 881
             G   GA L   + G         GA DGT   A G+ A   GA D A        A   
Sbjct: 858  LGGDAGAGLETGLGGVVDASGGLTGALDGTAETAAGLEAGLGGAADAA--------AGLT 917

Query: 882  GACDGALLPGNGINAAAPGARDGALLSGV--VVGATAPGARDGV--VLPENHGELAMALY 941
                G L  G G    A     G L +G+  V GATA G   G+  V     G L   L 
Sbjct: 918  AGLTGGLESGLGAATDAGAGLTGGLAAGLSGVTGATA-GLETGLSGVTGVTAG-LETGLS 977

Query: 942  YLEMALALLHQELA-MALYYLELSTGSSRWGFTTWTRGARDGALLPGVVVGAAATGARDG 1001
             +  A A L   L+ +A     L TG +         GA DG L      G   +G   G
Sbjct: 978  GVTGATAGLETGLSGVADTTTGLETGLT---------GALDGTLSG---AGEFGSGLESG 1034

Query: 1002 ALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGV 1061
             L  G   G+  TGA DG+L      GAA T    GA L      A  T    G      
Sbjct: 1038 -LGGGFAAGSGLTGALDGSL-----TGAADTTGSVGAGLESTAGSALGTTGELGGSFGST 1034

Query: 1062 VVGAAATGAR-DGALLPGVVVGAAATGARDGALLPGVVVGAAATGARD 1081
                A T +  D +   G  + +  TG+ + +L      GA A G+ D
Sbjct: 1098 AASTANTWSTVDVSSAFGSGLDSGITGSGESSLFGTAESGAEAAGSTD 1034
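The percentages in each HSP summary line are the matched positions divided by the alignment length. The following sketch recomputes them from a summary line; the regular expression and the function name `hsp_percentages` are assumptions made for illustration, not part of any BLAST tool, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches e.g. "Identity = 330/1008 (32.74%), Positives = 408/1008 (40.48%)".
_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")


def hsp_percentages(line: str) -> tuple[float, float]:
    """Return (percent identity, percent positives) for an HSP summary line."""
    match = _STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)


line = "Identity = 330/1008 (32.74%), Positives = 408/1008 (40.48%), Query Frame = 1"
print(hsp_percentages(line))  # (32.74, 40.48)
```

For the Nocardia farcinica hit this gives 32.74% identity and 40.48% positives over 1008 aligned positions, in agreement with the reported values.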

BLAST of Cp4.1LG01g20100 vs. TrEMBL
Match: G9NZ61_HYPAI (Uncharacterized protein OS=Hypocrea atroviridis (strain ATCC 20476 / IMI 206040) GN=TRIATDRAFT_284537 PE=4 SV=1)

HSP 1 Score: 83.6 bits (205), Expect = 2.0e-12
Identity = 187/598 (31.27%), Positives = 209/598 (34.95%), Query Frame = 1

Query: 106 GESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVPGARDGTLLPGNGVGAPVPGARDGAL 165
           G   + G  G  GP G    G  G  G  TG+  P    G   P    G   P   DGA+
Sbjct: 347 GPQGNRGPQGFTGPKGDQ--GNQGPQGL-TGSQGPAGNRG---PQGLTGPAGPTGADGAV 406

Query: 166 LPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAAAPGARDGALLPANVV 225
            P   VG   P    GA      VG A P   DGA+ P   VG   P   DGA      V
Sbjct: 407 GPVGPVGPVGPVGPAGADGADGAVGPAGPAGADGAVGPVGPVGPVGPAGADGA---DGAV 466

Query: 226 GAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIGASAPG 285
           G A P   DG + P    GA       G   PE  +G A     DGA+ PA  +G +   
Sbjct: 467 GPAGPAGADGAVGPVGPAGADGAVGPTGDAGPEGPIGPAGPAGADGAVGPAGPVGPTGDA 526

Query: 286 ARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANG 345
             +G        GA DGAV P    G A      G   PE   G A     DGA+ P   
Sbjct: 527 GPEGP------AGA-DGAVGPVGPAGPAGDVGPTGDAGPEGPAGPAGPAGADGAVGPTGA 586

Query: 346 IGASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGDCDG 405
            GA      DGA          DGA  PA   G A P   DGAV PE  +G A     DG
Sbjct: 587 DGA------DGA----------DGAPGPAGPAGPAGPAGADGAVGPEGPIGPAGADGTDG 646

Query: 406 A------LLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVG-AAALGD-- 465
           A        PA  IG + P   DGA      VG   P   DGAV P    G     GD  
Sbjct: 647 ADGAVGPAGPAGPIGPAGPAGADGA------VGPAGPAGADGAVGPVGPAGPVGPTGDAG 706

Query: 466 ------CDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGA 525
                  DGA+ PA   G + P    G        G   P   DGAV P   VG A    
Sbjct: 707 PEGPAGADGAVGPA---GPAGPAGDVGPTGDPGPEGPAGPAGADGAVGPAGPVGPA---- 766

Query: 526 CDGALLPGNGISAAAPGARDGALLSGVVVGACDGAILPENVVGAAARGACNGALLPGNGI 585
             G + P        P    GA          DGA+ P    GA      +GA+ P    
Sbjct: 767 --GDVGPAGDAGPEGPAGPAGA----------DGAVGPAGPAGADGADGADGAVGPAGPA 826

Query: 586 SAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPG 645
             A P             GA      DGAV P    G A     DGA+ P      A   
Sbjct: 827 GPAGPAG---------PAGADGTDGTDGAVGPAGPAGPA---GADGAVGPAGPAGPAGAD 869

Query: 646 ARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDG 689
             DGA  +    G   P   DGAV P    GA      DGA+ P        P    G
Sbjct: 887 GTDGADGAVGPAGPAGPAGADGAVGPAGPAGA------DGAVGPAGPAGPTGPTGPQG 869

BLAST of Cp4.1LG01g20100 vs. TrEMBL
Match: I4AA49_DESDJ (Uncharacterized protein OS=Desulfitobacterium dehalogenans (strain ATCC 51507 / DSM 9161 / JW/IU-DC1) GN=Desde_2503 PE=4 SV=1)

HSP 1 Score: 81.6 bits (200), Expect = 7.4e-12
Identity = 100/290 (34.48%), Positives = 134/290 (46.21%), Query Frame = 1

Query: 134 GTGAPVPGARDGTLLPGNGVGAPVPGARDGALLPGNGVGAEEPGARDGALLPGNGVGAAS 193
           G GAPVP    G  +P +G GAPVP    GA +P +G GA  P    GA +P +G GA  
Sbjct: 128 GKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPV 187

Query: 194 PRARDGALLPGNGVGAAAPGARDGALLPANVVGAAAPGARDGTLLPANVVGAAAPGARDG 253
           P    GA +P +G GA  P    GA +P++  GA  P    G  +P++  GA  P    G
Sbjct: 188 PSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKG 247

Query: 254 AVLPENVVGAAALGACDGAILPANGIGASAPGARDGALLSGVVVGARDGAVLPANVVGAA 313
           A +P +  GA       GA +P++G GA  PG   GA +        DG   P    G  
Sbjct: 248 APVPSDGKGAPVPSDGKGAPVPSDGKGAPMPGDGKGAPVPS------DGKGAPVPNDGKG 307

Query: 314 APGARDGAVLPENVVGAAALGACDGAILPANGIGASAPGARDGALLSGVVVGARDGAVLP 373
            P + DG  +P +  G  A    DG   P    G S P + DG    GV V +  GA +P
Sbjct: 308 VPVSSDGKGVPVSSDGKGAPVPSDGKGAPVPSDGKSVPVSSDG---KGVPVASDKGAPVP 367

Query: 374 ANVVGAAAPGARDGAVLPENVVGAAALGDCDGALLPANGIGTSAPGARDG 424
           ++  G + P + DG        GA   GD  G  +P++G G   P + DG
Sbjct: 368 SD--GKSVPVSSDGK-------GAPVPGDGKGTSVPSDGKG--VPVSSDG 397

BLAST of Cp4.1LG01g20100 vs. TrEMBL
Match: A0A0L0V2X5_9BASI (Uncharacterized protein OS=Puccinia striiformis f. sp. tritici PST-78 GN=PSTG_12934 PE=4 SV=1)

HSP 1 Score: 74.7 bits (182), Expect = 9.1e-10
Identity = 79/230 (34.35%), Positives = 93/230 (40.43%), Query Frame = 1

Query: 534 LSGVVVGACDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVVVGATAP 593
           + G++ G   G +LP+  VG    G   G LLPG G     PG   G LL G  VG   P
Sbjct: 59  IGGILPGGGAGGLLPDGGVGGLLPGGGFGGLLPGGGSGGILPGGGVGGLLPGGGVGGLFP 118

Query: 594 GARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGA 653
               G +LP   VG    G   G LLPG GI    PG     LL G  +    PG     
Sbjct: 119 DIGTGGLLPGGGVGDLLPGGGVGDLLPGGGIDGLLPGGGSDGLLPGGGIDGLLPGGGSDG 178

Query: 654 VLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGVVVGACDGAILPENVVGAAA 713
           +LP   +     G     LLPG GI    PGA    LL G   G  DG +    + G   
Sbjct: 179 LLPGGGIDGLLPGGGSDGLLPGGGIDGLLPGAGINGLLPG---GGVDGLLPGGGIDGLLP 238

Query: 714 RGACNGALLPGNGISAAAPGARDGA-----------LLSGVVVGATAPGA 753
            G  +G LLPG GI    PG   G            LLS    G  +PG+
Sbjct: 239 GGGVDG-LLPGGGIDGLIPGGGAGGCSNQSGVLNLNLLSSTNCGHNSPGS 284

BLAST of Cp4.1LG01g20100 vs. TrEMBL
Match: A0A0H3LZ38_EHRRW (Uncharacterized protein OS=Ehrlichia ruminantium (strain Welgevonden) GN=ERWE_CDS_01080 PE=4 SV=1)

HSP 1 Score: 72.8 bits (177), Expect = 3.5e-09
Identity = 99/477 (20.75%), Positives = 249/477 (52.20%), Query Frame = 1

Query: 151 NGVGAPVPGARDGALLPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAA 210
           +GV   V  + +G+++  +  GA    + +G+++  +  GAA   + +G+ +  +  GAA
Sbjct: 136 SGVDTTVTSSPEGSVVTSSPEGAAVTSSPEGSVVTSSPEGAAVTSSPEGSAVTSSPEGAA 195

Query: 211 APGARDGALLPANVVGAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACD 270
              + +G+++ ++  GAA   + +G+++ ++  GAA   + +GAV+  +  GAA   + +
Sbjct: 196 VTSSPEGSVVTSSPEGAAVTSSPEGSVVTSSPEGAAVTSSPEGAVVTSSPEGAAVTSSPE 255

Query: 271 GAILPANGIGASAPGARDGALLS-----GVVVGARDGAVLPANVVGAAAPGARDGAVLPE 330
           G+++ ++  GA+   +  G++++       V  + +G+V+ ++  G+A   +  G+V+  
Sbjct: 256 GSVVTSSPKGAAVTSSPKGSVVTSSPKGAAVTSSPEGSVVTSSPKGSAVTSSPKGSVVTS 315

Query: 331 NVVGAAALGACDGAILPANGIGASAPGARDGALLS-----GVVVGARDGAVLPANVVGAA 390
           +  G+A   + +G+++ ++  G++   +  G++++      VV  + +G+V+ ++  GAA
Sbjct: 316 SPKGSAVTSSPEGSVVTSSPEGSAVTSSPKGSVVTSSPEGSVVTSSPEGSVVTSSPEGAA 375

Query: 391 APGARDGAVLPENVVGAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARD 450
              + +G+V+  +  GAA     +G+++      TS+P   +G++++    GA    + +
Sbjct: 376 VTSSPEGSVVTSSPEGAAVTSSPEGSVV------TSSP---EGSVVTSSPEGAAVTSSPE 435

Query: 451 GAVLPENVVGAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPE 510
           GA +  +  GAA     +G+++ ++  G +   + +G++++    GA    + +GA +  
Sbjct: 436 GAAVTSSPEGAAVTSSPEGSVVTSSPEGAAVTSSPEGSVVTSSPEGAAVTSSPEGAAVTS 495

Query: 511 NVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGVVVGACDGAILPENVVGAAARGAC 570
           +  G+    + +GA +  +   AA   + +G+    VV  + +G+++  +  GAA   + 
Sbjct: 496 SPEGSVVTSSPEGAAITSSPEGAAITSSPEGS----VVTSSPEGSVVTSSPEGAAVTSSP 555

Query: 571 NGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGA 618
            G+++  +   AA   + +G++++    G+    + +G+V+  +  GAA   + +GA
Sbjct: 556 EGSVVTSSPEGAAVTSSPEGSVVTSSPEGSVVTSSPEGSVVTSSPEGAAVTSSPEGA 599

BLAST of Cp4.1LG01g20100 vs. NCBI nr
Match: gi|602711645|ref|XP_007466061.1| (PREDICTED: mucin-5B [Lipotes vexillifer])

HSP 1 Score: 157.9 bits (398), Expect = 1.2e-34
Identity = 165/581 (28.40%), Positives = 247/581 (42.51%), Query Frame = 1

Query: 697  GACDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGA 756
            G  +G   P  V G ++ G   G   P      ++ G  +G      V G ++ G  +G 
Sbjct: 2405 GTIEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGV 2464

Query: 757  VLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATAPGARDGAVLPENV 816
              P  V G ++ G  +G   P    G S+ G   G      V G ++ G  +G   P  V
Sbjct: 2465 STPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRV 2524

Query: 817  VGAAARGACDGTLLPANGIGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGINAAAP 876
             G ++ G  +G   P    G ++ G  +G   P  V G ++ G  +G   P      ++ 
Sbjct: 2525 TGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSST 2584

Query: 877  GARDGALLSGVVVGATAPGARDGVVLPENHGELAMALYYLEMALALLHQELAMALYYLEL 936
            G  +G      V G ++ G  +GV  P      +                          
Sbjct: 2585 GTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPS-------------------------- 2644

Query: 937  STGSSRWGFTTWTR-------GARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGAR 996
            STG++  G +T TR       G  +G   P  V G ++TG  +G   P  V G ++TG  
Sbjct: 2645 STGTTE-GVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTT 2704

Query: 997  DGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLP 1056
            +G   P  V G ++TG  +G   P  V G ++TG  +G   P  V G ++TG  +G   P
Sbjct: 2705 EGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTP 2764

Query: 1057 GVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGALDGVLLPGVVVGA 1116
              V G ++TG  +G   P  V G ++TG  +G   P  V G ++TG  +GV  P  V G 
Sbjct: 2765 TRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGP 2824

Query: 1117 AAPGAREGALLPVIVVGAAATGARDGALLPGIVVGAAAPGARKGALLPEMVIGAAATGAR 1176
            ++ G  EG   P  V G ++TG  +G   P  V G ++ G  +G   P  V G ++TG  
Sbjct: 2825 SSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTT 2884

Query: 1177 GGALIPGVVVAP-APGAREGALLPGIVVGAAATGARDGDLLPGTVLGASATGARDGDLLP 1236
             G   P  V  P + G  EG   P  V G ++TG  +G   P  V G S+TG  +G   P
Sbjct: 2885 EGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTPTRVTGPSSTGTTEGVSTP 2944

Query: 1237 GTVLGASATGARDGTLLPGTVLGAAATGARDGALPPRNVVG 1270
              V G S+TG  +G     +V G ++TG  +G      V G
Sbjct: 2945 TRVTGPSSTGTTEGVYATTSVTGPSSTGTTEGVYATTRVTG 2958

BLAST of Cp4.1LG01g20100 vs. NCBI nr
Match: gi|594635871|ref|XP_007171349.1| (PREDICTED: LOW QUALITY PROTEIN: mucin-5B [Balaenoptera acutorostrata scammoni])

HSP 1 Score: 149.1 bits (375), Expect = 5.4e-32
Identity = 153/541 (28.28%), Positives = 243/541 (44.92%), Query Frame = 1

Query: 730  AAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGAR 789
            ++PG          + G ++ G  +G   P +V G ++ G  +G   P +  G S+ G  
Sbjct: 2663 SSPGTIKRVSTPTTLTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTT 2722

Query: 790  NGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGATAPGARDGAVLP 849
             G      V G ++ G  +G   P +V G ++ G  +G   P +  G ++ G  +G   P
Sbjct: 2723 EGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTP 2782

Query: 850  ENVVGAAAWGACDGALLPGNGINAAAPGARDGALLSGVVVGATAPGARDGVVLPENHGEL 909
             +V G ++ G  +G   P +    ++ G  +G      V G ++ G  +GV  P +    
Sbjct: 2783 TSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGP 2842

Query: 910  AMALYYLEMALALLHQELAMALYYLELSTGSSRWGFTTWTRGARDGALLPGVVVGAAATG 969
            +       ++        +       +ST +S  G ++   G  +G   P  V G ++TG
Sbjct: 2843 SSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSS--TGTTEGVSTPTSVTGPSSTG 2902

Query: 970  ARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGAL 1029
              +G   P  V G ++TG  +G   P  V G ++TG  +G   P  V G ++TG  +G  
Sbjct: 2903 TTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVS 2962

Query: 1030 LPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVV 1089
             P  V G ++TG  +G   P  V G ++TG  +G   P  V G ++TG  +G   P  V 
Sbjct: 2963 TPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVT 3022

Query: 1090 GAAATGALDGVLLPGVVVGAAAPGAREGALLPVIVVGAAATGARDGALLPGIVVGAAAPG 1149
            G ++TG  +GV  P  V G ++ G  EG   P  V G ++TG  +G   P  V G ++ G
Sbjct: 3023 GPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTG 3082

Query: 1150 ARKGALLPEMVIGAAATGARGGALIPGVVVAP-APGAREGALLPGIVVGAAATGARDGDL 1209
              +G   P  V G ++TG   G   P  V  P + G  EG   P  V G ++TG  +G  
Sbjct: 3083 TTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTEGVS 3142

Query: 1210 LPGTVLGASATGARDGDLLPGTVLGASATGARDGTLLPGTVLGAAATGARDGALPPRNVV 1269
             P +V G S+TG  +G   P +V G S+TG       P T+ G ++TG       P  V 
Sbjct: 3143 TPTSVTGPSSTGTTEGVSTPTSVTGPSSTGTTQRVSTPTTLTGPSSTGTTKRVSTPTRVT 3201

BLAST of Cp4.1LG01g20100 vs. NCBI nr
Match: gi|727427849|ref|XP_010466930.1| (PREDICTED: glycine-rich cell wall structural protein 1.8-like [Camelina sativa])

HSP 1 Score: 99.8 bits (247), Expect = 3.8e-17
Identity = 271/811 (33.42%), Positives = 319/811 (39.33%), Query Frame = 1

Query: 102 GAGTGESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVPGARDGTLLPGNGVGAPVPGAR 161
           G G G +      GAGG  G++ GG  GAVG G GA + G   G    G G    V GA 
Sbjct: 72  GGGVGGARGSVGGGAGGARGSVGGGAGGAVGGGAGAGIVGGGAG----GGGARGSVGGAV 131

Query: 162 DGALLPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAAAPGARDGALLP 221
            G +  G  VG    GAR GA+  G GVG        GA+  G G    A G+  GA   
Sbjct: 132 GGGVGAGGVVGG---GAR-GAV--GGGVGGVVGGGTGGAVGGGAGGSGGARGSVGGA--- 191

Query: 222 ANVVGAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIGA 281
              VG    GA  G +     VG    GA  G       VG  A G+  GA+    G+G 
Sbjct: 192 ---VGGGVGGAVGGGVGAGGTVGGGVGGAVGGGT--GGAVGGGARGSAGGAV--GGGVGG 251

Query: 282 SAPGARDGA---LLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGACDG 341
            A GA  G+    + G V GA  G V    VVG    GA  G V    VVG  A GA  G
Sbjct: 252 GAGGAAGGSAGGAVGGGVGGAVGGGVGAGGVVGGGTAGAVGGGV-GAGVVGGGAGGAVGG 311

Query: 342 AILPANGIGASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAA 401
                  +G +  G   G    G V G   GAV     VG A  G   GAV      G A
Sbjct: 312 ---DRGNVGGAVGG---GVGAGGTVGGGAGGAV--GGAVGGAVGGGGRGAVGGGAASGGA 371

Query: 402 ALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAALGDCD 461
             G   GA      +G  A G+  GA+  GV  G T  G   GA      VG  A G   
Sbjct: 372 VGGGASGA------VGGGARGSVGGAVGGGVGAGGTVGGGVGGA------VGGGAGGAVG 431

Query: 462 GALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPG 521
           G      G+G  A G   GA + G   G    GAR  A       GA   G   GA    
Sbjct: 432 GG--AGGGVGGGARGGAGGA-VGGGAGGGVGGGARGSA------GGAVGGGVGSGAGSAA 491

Query: 522 NGISAAAPGARDGALLSGVVVGACDGAILPENVVGAAARGACNGALLPGNGISAAAPGAR 581
            G +  A G   G  + G   GA  GA      VG  ARG+  GA+  G G+   A GA 
Sbjct: 492 GGSAGGAVGGGVGGGVGGAGGGAGGGA---GGAVGGGARGSAGGAV--GGGVGGGAGGAA 551

Query: 582 DGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGAL---LPGNGISAAAPGAR--- 641
            G+  +G  VG    GA  G V    VVG    GA  G +   + G G   A  G R   
Sbjct: 552 GGS--AGGAVGGGVGGAVGGGVGAGGVVGGGTAGAVGGGVGAGVVGGGAGGAVGGDRGNV 611

Query: 642 DGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLS 701
            GA+  GV  G T  G   GAV           G   G  + G G  A   GA  G    
Sbjct: 612 GGAVGGGVGAGGTVGGGAGGAV-----------GGAVGGAVGGGGRGAVGGGAASG---- 671

Query: 702 GVVVGACDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVVVGATAPGA 761
           G V G   GA      VG  ARG+  GA+  G G      G   GA + G   GA   GA
Sbjct: 672 GAVGGGASGA------VGGGARGSVGGAVGGGVGAGGTVGGGVGGA-VGGGAGGAVGGGA 731

Query: 762 RDGAVLPENVVGAAARGACDGTL--LPANGIGASAPGARNGALLSGV--VVGATAPGARD 821
             G       VG  ARG+  G +     +G G++A G+  GA+  GV   VG    GA  
Sbjct: 732 GGG-------VGGGARGSAGGAVGGGVGSGAGSAAGGSAGGAVGGGVGGGVGGAGGGAGG 789

Query: 822 GAVLPENVVGAAARGACDGTLLPANGIGATAPGARDGAVLPENVVGAAAWGACDGALLPG 881
           GA      VG  ARG+  G +    G GA   GA  G V     VG A  GA  G +  G
Sbjct: 792 GA---GGAVGGGARGSAGGAVGGGVGGGAGTGGAVGGGV--GGGVGGAVGGAIGGGVGLG 789

Query: 882 NGINAAAPGARDGALLSGVVVGATAPGARDG 900
            G+     G   G L  G  +G +  G   G
Sbjct: 852 GGVGGGGGGGLGGGLGGG--IGGSGGGGLGG 789

BLAST of Cp4.1LG01g20100 vs. NCBI nr
Match: gi|1016253396|ref|WP_062954266.1| (hypothetical protein, partial [Nocardia farcinica])

HSP 1 Score: 97.1 bits (240), Expect = 2.5e-16
Identity = 330/1008 (32.74%), Positives = 408/1008 (40.48%), Query Frame = 1

Query: 102  GAGTG-ESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVPGARDGTLLPGNGVGAPVPGA 161
            G GTG E + D     G  +GA LG   G VG G GA + GA D     G G+GA + GA
Sbjct: 53   GIGTGLEGAIDAGVDLGAGLGAGLGAGLGVVG-GVGAGLEGAIDAVAGVGAGLGAGIGGA 112

Query: 162  RDGALLPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAAAPGARDGALL 221
             D     G GVGA   GA D     G GVGA    A D     G G+GA   GA +G   
Sbjct: 113  VDAVAGVGAGVGAGLEGALDAVAGVGAGVGAGLEGAVDAVAGVGAGLGAGLEGALEGVTD 172

Query: 222  PANVVGAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIG 281
                +G A  G           +G    G+ DG V      GA   GA DG +    G+G
Sbjct: 173  VTAGLGGALGGVAG--------LGGGLAGSLDGVV----DAGAGLGGALDGVVEAGAGLG 232

Query: 282  ASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAI 341
            A+  GA D     G   GA DGAV     +G A  GA D          A   GA DGA+
Sbjct: 233  AAVGGAVDAVAGVG---GAVDGAVEAGAGLGGAVGGAVDAV--------AGVGGALDGAV 292

Query: 342  LPANGIGASAPGARD-----GALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVV 401
                G+G +  GA D     G  L GVV     GA+     +G A  G  D         
Sbjct: 293  EAGAGLGGAVGGAVDAVAGVGGGLDGVVDAGVGGALGAVTGIGGALDGVVD--------A 352

Query: 402  GAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAALG 461
            GA   G  DGAL    GIG    GA DGA+ +G  +G                      G
Sbjct: 353  GAGVGGAVDGALGAVAGIG----GAVDGAVDAGAGLG----------------------G 412

Query: 462  DCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGAL 521
              DGA+    GIG    GA DGA+ +GV +GA   GA DGAV       A   G  +GA+
Sbjct: 413  AVDGAVDAVAGIG----GALDGAVDAGVGLGAGIGGALDGAV----DAAAGVGGGLEGAV 472

Query: 522  LPGNGISAAAPGARDGALLSGVVVGA-CDGAILPENVVGAAARGACNGALLPGNGISAAA 581
              G G+  +  GA  GA  +G  + A  DGA      VG A+ G   GA+  G  ++A  
Sbjct: 473  GAGAGLGGSLEGALGGAAEAGAGLAAGLDGA---AGAVGGASAGLA-GAINAGTDLAAGL 532

Query: 582  PGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDG 641
             GA DGAL +G  VG    GA DG V     + +   GA DGAL  G G++    GA  G
Sbjct: 533  DGAVDGALGAGAGVG----GALDGVVAAGADLTSGLNGAVDGALGAGAGLTTGLEGALGG 592

Query: 642  ALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGV 701
            A+ +G  +     GA DGAV      GA   GA  GA+  G G +A   GA  GA+ +G 
Sbjct: 593  AVDAGAGLTTGLEGAVDGAV----GAGAGLEGALGGAVEAGAGAAAGVGGAVGGAVEAGT 652

Query: 702  VVGA-CDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVV-VGATAPGA 761
             + A  +GA+     V A   G   G L   +G   AA G   G  LSG V  G  A G 
Sbjct: 653  GLAAGLEGAVAAGGDVAAGLEGGLFGGL---DGAVDAASGVAGG--LSGAVNAGGQAVGG 712

Query: 762  RDGAVLPENVVGAAARGACD----GTLLPANGIGASAPGARNGALLSGVVVGATAP---- 821
             + ++         A G       G L     + A   G  +G L+SGV  G TA     
Sbjct: 713  LESSLSAGLGAALGAGGELSTELGGALDGGADLAAGLDGLVDGELVSGVDGGLTAAIGGV 772

Query: 822  -GARDGAVLPENVVG-----AAARGACDGTLLPANGIGATAPGARDGAVLPENVVGAAAW 881
             G   GA L   + G         GA DGT   A G+ A   GA D A        A   
Sbjct: 773  LGGDAGAGLETGLGGVVDASGGLTGALDGTAETAAGLEAGLGGAADAA--------AGLT 832

Query: 882  GACDGALLPGNGINAAAPGARDGALLSGV--VVGATAPGARDGV--VLPENHGELAMALY 941
                G L  G G    A     G L +G+  V GATA G   G+  V     G L   L 
Sbjct: 833  AGLTGGLESGLGAATDAGAGLTGGLAAGLSGVTGATA-GLETGLSGVTGVTAG-LETGLS 892

Query: 942  YLEMALALLHQELA-MALYYLELSTGSSRWGFTTWTRGARDGALLPGVVVGAAATGARDG 1001
             +  A A L   L+ +A     L TG +         GA DG L      G   +G   G
Sbjct: 893  GVTGATAGLETGLSGVADTTTGLETGLT---------GALDGTLSG---AGEFGSGLESG 949

Query: 1002 ALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGV 1061
             L  G   G+  TGA DG+L      GAA T    GA L      A  T    G      
Sbjct: 953  -LGGGFAAGSGLTGALDGSL-----TGAADTTGSVGAGLESTAGSALGTTGELGGSFGST 949

Query: 1062 VVGAAATGAR-DGALLPGVVVGAAATGARDGALLPGVVVGAAATGARD 1081
                A T +  D +   G  + +  TG+ + +L      GA A G+ D
Sbjct: 1013 AASTANTWSTVDVSSAFGSGLDSGITGSGESSLFGTAESGAEAAGSTD 949

BLAST of Cp4.1LG01g20100 vs. NCBI nr
Match: gi|983430423|ref|WP_060594053.1| (hypothetical protein [Nocardia farcinica])

HSP 1 Score: 97.1 bits (240), Expect = 2.5e-16
Identity = 330/1008 (32.74%), Positives = 408/1008 (40.48%), Query Frame = 1

Query: 102  GAGTG-ESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVPGARDGTLLPGNGVGAPVPGA 161
            G GTG E + D     G  +GA LG   G VG G GA + GA D     G G+GA + GA
Sbjct: 138  GIGTGLEGAIDAGVDLGAGLGAGLGAGLGVVG-GVGAGLEGAIDAVAGVGAGLGAGIGGA 197

Query: 162  RDGALLPGNGVGAEEPGARDGALLPGNGVGAASPRARDGALLPGNGVGAAAPGARDGALL 221
             D     G GVGA   GA D     G GVGA    A D     G G+GA   GA +G   
Sbjct: 198  VDAVAGVGAGVGAGLEGALDAVAGVGAGVGAGLEGAVDAVAGVGAGLGAGLEGALEGVTD 257

Query: 222  PANVVGAAAPGARDGTLLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAILPANGIG 281
                +G A  G           +G    G+ DG V      GA   GA DG +    G+G
Sbjct: 258  VTAGLGGALGGVAG--------LGGGLAGSLDGVV----DAGAGLGGALDGVVEAGAGLG 317

Query: 282  ASAPGARDGALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVVGAAALGACDGAI 341
            A+  GA D     G   GA DGAV     +G A  GA D          A   GA DGA+
Sbjct: 318  AAVGGAVDAVAGVG---GAVDGAVEAGAGLGGAVGGAVDAV--------AGVGGALDGAV 377

Query: 342  LPANGIGASAPGARD-----GALLSGVVVGARDGAVLPANVVGAAAPGARDGAVLPENVV 401
                G+G +  GA D     G  L GVV     GA+     +G A  G  D         
Sbjct: 378  EAGAGLGGAVGGAVDAVAGVGGGLDGVVDAGVGGALGAVTGIGGALDGVVD--------A 437

Query: 402  GAAALGDCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAALG 461
            GA   G  DGAL    GIG    GA DGA+ +G  +G                      G
Sbjct: 438  GAGVGGAVDGALGAVAGIG----GAVDGAVDAGAGLG----------------------G 497

Query: 462  DCDGALLPANGIGTSAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGAL 521
              DGA+    GIG    GA DGA+ +GV +GA   GA DGAV       A   G  +GA+
Sbjct: 498  AVDGAVDAVAGIG----GALDGAVDAGVGLGAGIGGALDGAV----DAAAGVGGGLEGAV 557

Query: 522  LPGNGISAAAPGARDGALLSGVVVGA-CDGAILPENVVGAAARGACNGALLPGNGISAAA 581
              G G+  +  GA  GA  +G  + A  DGA      VG A+ G   GA+  G  ++A  
Sbjct: 558  GAGAGLGGSLEGALGGAAEAGAGLAAGLDGA---AGAVGGASAGLA-GAINAGTDLAAGL 617

Query: 582  PGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDG 641
             GA DGAL +G  VG    GA DG V     + +   GA DGAL  G G++    GA  G
Sbjct: 618  DGAVDGALGAGAGVG----GALDGVVAAGADLTSGLNGAVDGALGAGAGLTTGLEGALGG 677

Query: 642  ALLSGVVVGATAPGARDGAVLPENVVGAAAWGACDGALLPGNGISAAAPGARDGALLSGV 701
            A+ +G  +     GA DGAV      GA   GA  GA+  G G +A   GA  GA+ +G 
Sbjct: 678  AVDAGAGLTTGLEGAVDGAV----GAGAGLEGALGGAVEAGAGAAAGVGGAVGGAVEAGT 737

Query: 702  VVGA-CDGAILPENVVGAAARGACNGALLPGNGISAAAPGARDGALLSGVV-VGATAPGA 761
             + A  +GA+     V A   G   G L   +G   AA G   G  LSG V  G  A G 
Sbjct: 738  GLAAGLEGAVAAGGDVAAGLEGGLFGGL---DGAVDAASGVAGG--LSGAVNAGGQAVGG 797

Query: 762  RDGAVLPENVVGAAARGACD----GTLLPANGIGASAPGARNGALLSGVVVGATAP---- 821
             + ++         A G       G L     + A   G  +G L+SGV  G TA     
Sbjct: 798  LESSLSAGLGAALGAGGELSTELGGALDGGADLAAGLDGLVDGELVSGVDGGLTAAIGGV 857

Query: 822  -GARDGAVLPENVVG-----AAARGACDGTLLPANGIGATAPGARDGAVLPENVVGAAAW 881
             G   GA L   + G         GA DGT   A G+ A   GA D A        A   
Sbjct: 858  LGGDAGAGLETGLGGVVDASGGLTGALDGTAETAAGLEAGLGGAADAA--------AGLT 917

Query: 882  GACDGALLPGNGINAAAPGARDGALLSGV--VVGATAPGARDGV--VLPENHGELAMALY 941
                G L  G G    A     G L +G+  V GATA G   G+  V     G L   L 
Sbjct: 918  AGLTGGLESGLGAATDAGAGLTGGLAAGLSGVTGATA-GLETGLSGVTGVTAG-LETGLS 977

Query: 942  YLEMALALLHQELA-MALYYLELSTGSSRWGFTTWTRGARDGALLPGVVVGAAATGARDG 1001
             +  A A L   L+ +A     L TG +         GA DG L      G   +G   G
Sbjct: 978  GVTGATAGLETGLSGVADTTTGLETGLT---------GALDGTLSG---AGEFGSGLESG 1034

Query: 1002 ALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGVVVGAAATGARDGALLPGV 1061
             L  G   G+  TGA DG+L      GAA T    GA L      A  T    G      
Sbjct: 1038 -LGGGFAAGSGLTGALDGSL-----TGAADTTGSVGAGLESTAGSALGTTGELGGSFGST 1034

Query: 1062 VVGAAATGAR-DGALLPGVVVGAAATGARDGALLPGVVVGAAATGARD 1081
                A T +  D +   G  + +  TG+ + +L      GA A G+ D
Sbjct: 1098 AASTANTWSTVDVSSAFGSGLDSGITGSGESSLFGTAESGAEAAGSTD 1034

The following BLAST results are available for this feature:
Match Name         E-value   Identity   Description
A0A0H5P1W0_NOCFR   1.7e-16   32.74      Uncharacterised protein OS=Nocardia farcinica GN=ERS450000_04455 PE=4 SV=1
G9NZ61_HYPAI       2.0e-12   31.27      Uncharacterized protein OS=Hypocrea atroviridis (strain ATCC 20476 / IMI 206040)...
I4AA49_DESDJ       7.4e-12   34.48      Uncharacterized protein OS=Desulfitobacterium dehalogenans (strain ATCC 51507 / ...
A0A0L0V2X5_9BASI   9.1e-10   34.35      Uncharacterized protein OS=Puccinia striiformis f. sp. tritici PST-78 GN=PSTG_12...
A0A0H3LZ38_EHRRW   3.5e-09   20.75      Uncharacterized protein OS=Ehrlichia ruminantium (strain Welgevonden) GN=ERWE_CD...

Match Name                           E-value   Identity   Description
gi|602711645|ref|XP_007466061.1|     1.2e-34   28.40      PREDICTED: mucin-5B [Lipotes vexillifer]
gi|594635871|ref|XP_007171349.1|     5.4e-32   28.28      PREDICTED: LOW QUALITY PROTEIN: mucin-5B [Balaenoptera acutorostrata scammoni]
gi|727427849|ref|XP_010466930.1|     3.8e-17   33.42      PREDICTED: glycine-rich cell wall structural protein 1.8-like [Camelina sativa]
gi|1016253396|ref|WP_062954266.1|    2.5e-16   32.74      hypothetical protein, partial [Nocardia farcinica]
gi|983430423|ref|WP_060594053.1|     2.5e-16   32.74      hypothetical protein [Nocardia farcinica]
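
The Identity values in the table are the percentages printed in each HSP summary line (for example 165/581 = 28.40%). As a minimal sketch, assuming the summary lines keep the layout shown in the alignments above, the percentages can be recomputed from the raw counts. The helper name `check_hsp_summary` and the regular expression are illustrative assumptions, not part of the database or of BLAST itself.

```python
import re

# Hypothetical helper, not part of the database page: parse one HSP summary
# line and recompute the percentages from the raw counts.
# "Posi?tives" also accepts the "Postives" spelling seen in some reports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def check_hsp_summary(line):
    """Return (identity_pct, positives_pct) recomputed from the counts.

    Raises ValueError if the line does not match the expected layout or if a
    recomputed value disagrees with the percentage printed in the line.
    """
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    id_n, id_len, id_pct, pos_n, pos_len, pos_pct = m.groups()
    identity = 100.0 * int(id_n) / int(id_len)
    positives = 100.0 * int(pos_n) / int(pos_len)
    # The printed values are rounded to two decimals, so allow that much slack.
    for computed, printed in ((identity, id_pct), (positives, pos_pct)):
        if abs(computed - float(printed)) > 0.01:
            raise ValueError(
                "printed %s%% does not match recomputed %.2f%%" % (printed, computed)
            )
    return round(identity, 2), round(positives, 2)
```

Passing each "Identity = ..." line from the alignments above to `check_hsp_summary` gives the recomputed pair of percentages, or an error if the counts and the printed values disagree.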
GO Assignments
This gene is annotated with the following GO terms.
Category             Term Accession   Term Name
biological_process   GO:0008150       biological_process
cellular_component   GO:0005575       cellular_component
molecular_function   GO:0003674       molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name        Unique Name         Type
Cp4.1LG01g20100.1   Cp4.1LG01g20100.1   mRNA


The following gene(s) are orthologous to this gene:
Gene              Orthologue       Organism                  Block
Cp4.1LG01g20100   CmaCh04G023160   Cucurbita maxima (Rimu)   cmacpeB721
Cp4.1LG01g20100   Carg18247        Silver-seed gourd         carcpeB0747
The following gene(s) are paralogous to this gene:

None