Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSinitialstart_codonpolypeptideintroninternalterminalstop_codon
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCTATGGTTTGTCACGTCTCTAGTGAAAGCAACAGGACACAACCAGCCACCATGGTTGGCGCTATTAGCTTACACACATGCATTTGCAAGTGCAGCTTGCTTTAGTTTGCTATTGTCATAGTGCTTCACTCACTCATCATTTCAAACCTTTTCACTTTCTCACGACTCTATTTGATATCAAACTGAACCCTCCTTAACCCATCGTTAAGAAACGTTGATCGTTCGGTTAATCGTAGGTAGTTCATTAAGTCGTCGATATTTCTTGAAGGGAGTCGTTGAAACACGAAACCGTCAACCAAGTTAAAAGAAAATATCGATGATTCAGTCAACTTGTGAGATTCCATGTCAGTTGGAAACGGAAACAGAATATTCTTCTCTCTAACAGATGAGTTTTAGAAACCGTGAGAAGAAACCCGAAAGAGCTTGGATTGTTACAGAACTGTTGACAAAACCAAAAGTGTATCGTCGGCCAAATCAGGGTAAATCTCGATGGTTTGTCGATCTATTCTTAGGAAACGTCAAATACAACTATTTGGGTCGAACATTCCCATAGAGGTTGACTAGACCTCATAAAGCATCGGCCAAAGTTGGCCCAGAAGGATGTGTTGACCAAATTGTCGATACGAGTTTTTGCGTCAAGAACAGGTTGATGTGACATTGAGCACCAAATGAGTATTATGCTTGTTTTTTTTTATTCCTTCTTTTGTCGTTTTTCAATCTTATTCTTAAATTTTTGTAGCGTTCTTAATTTCTCTTGTTTTATATGTTAGATTCTTTCGTCTAGATATCTTATTAACGTCGTGACTAAGCTCAACTAATCTGATCAATTCTATCACCTTTAAAATTACCTCTCCATTTTATTCATACAAAACACACCATTAACTTTTATGCCATAAATAGATTAGATTAAACATGGGCAACCTAGAAAAGACAAGTTGTTAAGATATTGGTGAGGTGGGAAATGGCAATTGGTTAGGTTAGTTAAATAAGACGTCACCAAAATTTCATAAAATTTGAATATGTAACATTTATAAGTCACAACACGACATTAATTACAAATGTTAAATATTCTTAATTTTTTCAAAAAATAGTTTGAATTACAGCCCTATTTTACGGTTGGTCAGGCTTACTTGACAAAAAAAAAAAAAAAAAAAAGTTAACCCACGAACTAAAGCTATTTTTTATTTTTTAAAAATTAAACTTGACCTATATTAAATTTTTAACTCAATTCAACTTTTATGATTTGGGTTGGAGAGGATAGTCGGATTCATATGATAGTTTTTAAACAATTTACCAAGTATTTTTATTAATTTAAGTAAAAAATTAAAACCTTCATTAATGTTTACCACTACGATGCCGACACGTGGTGACCTACAACCATCCAACGGTTCGGATTAGTTGTAAACTCAGCTTTTATTTGGAATATGGAATAATTATAGAAGCCACTCAATTGGAATCTGGCCGCTAAAACATCCAAGTTCATCGTCGAAGCGTCGGCGGCCACATTACTCTATTCAGTGCCAAAAGGCTTCAATTTCCGGTACAGTTCAACCGTGGATGAATTTGTTTGACTTTTGGTTAAAGATGAACTCCGATTTGACTCGATTTGTTTGTTAATTGCAGAGTTTTCGCCAATAATCAAACAGAGCCCTAAGTTTGTGGCATCGCTTTCGATGTGGAGCGACGAATCGGGAATGTTGCAGAGAAATGAAGGCGGTAAGATTTCGACACATCGAAACACCTTTTCGATTTCCTTTTGTTAGCAGAAAGATTGCAATAGATTAGAATAGCGGAATTTTTGGATATTTGATTTCTTTTTTAAAGAGGAAGTGTGTGGATTTGTTCGTTGCATTGAATGTGGACGGAATTTGAGCTATCTTCTCAAATCTTACAGTTGTTTGTGTACTGATTATAAATTAATTTAGGAATAGGAGGATCCTTCCTCCGAAAAGAAACCCTAGAAGCCATCGAGTGTGGAAAAGGGGGGAAACTGGTTATGGACTTGAACGATAATGGGATATTTCTTTTACTTTTTTTAGTTCTATTCTCTTTTTGCATTTCTTTTCAATAAAGTAATGGAGAACTTATGAACCTTTTCCTACTTTCCTGTACTTTTTTCTTCATGGTTTCCTGACTGAGCAAAAGTGCCCCAAGTGCGATTCGTTGAGCTATTGAACTCCTCCACTATGGATGGTTTGCAGGAGGAGCTGGGAAAATAAGACATGTAGTCTCTTTTTACCATTCATTATTGAGCTACAAACCCAACCATGGTGGCCATGGCCATGTTTTTGGGTTTTGATTTGCATGGCTAGTTAGATAGAGGATAACTCTAACCGTCCTGCCCATATTGTCCTCTTTGAACTTTCTCTTTCGGACTTTTTCTCAAGATTTTTAAAATGCGTCTCCTAGTGAGAGGTTTCTACACCTTTATAAAGAATGTTTCGTTGTGGGATCTCACAATCAACCCCCTTCAAGGCCTAGCGTCCTCGCTAGCACACTGTTCGGAGCCCACCCCCTTCAAAGCTCAGCGTCCTCGTTGGCACACTGTCCGGTGTTTGGCTCTGATACCATTTGTAACAACTCAAGCTCTCTCCTAGCGGTGGGCTTGGACTATTACAAATGGTATCAGAGCCGGTCATCGGACGGTGTAAGGGACTAGTTGGAGTAGGGCTAGACCTTCTCCATAGTAGACATGTTTTAAGACCCTGAGGAGAAGTCCGGAAGGGAAAGCCCAAAAAAGACAATATTTACTAGTCGTGAGTTTGGACGGTTACACTAACAAAATTACTCTTTTCTTAGCAAAGTATGAGGTCCATTATAGAGAAAAATTTATTAGAAAAGAGCGTTACAATGTAGCTTGTACTTTTAAACTATTTGATGTGTGATATATTTGATGTGTGATAGTGTGAGAGAAAAAGATTTTCAAATCTAACTCCTCTAATAATAGCCAGACAGTTTCTTATGTGAGGATTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTATTCTCCATCTGAGTTCCTGGTTTCCCACAGCTTGTTGAATCATTTGTCCTATTCGATTGTCTTTATATAATTTCGACCCTTGTATTGACTTGAAACATTTGATCACATTCTTAGACTTCTATTTGCTTGTTCTATTCCACATAAAACCCCTAAAGAAATGTCACTCAAATGCAAACATTGACAGCTTGTGAGGAAAAGTATACATGGGCCTAACTGATACTGAAGTTTTTCTTGGATCTCTTTCTGTGTATGGCCTTGTCCTGATTAGTGAGTGGCCCACGAGGTGAATTTTTGACATTACTTCGTCTTACAATCTTAGAACAGAAGCTTTCTCACATGGCCAGTTCTAGGCAAATTAGTAACGGAATATTAATAGTTGTGTACCCAAGTCGATGCCTTCTCAAACATTAATAGTTGTTATGGTTGAGGATATCGACACTTCCCACCAAATTAATGGGTTTCTTCCATTGTTTTTTCTTCTGTGGTTTCCTCACCACAGAGGAACCTCAAGGGGCCCTTCCAATGTCGAACTTGAACACAATAGAGATTGACATTAGAAATCTTTCATTCTTATGCAACAATACTCGGTACATTCACAAGGTGGGGGGTTCTACCAAAGCAAAATCTCTTCAAGGAATGAAGAGACATTCTTAGTGCAGATGATTCTCTAAGAACATTCCAAGGTCAATCGAAGTCCTGGAAGTTGGTCTTGGGAATTGAATCTTCCCGATACTCTTCTGATTGTATATATCATGGATATTGAACACTTTATACTTTCGAATTTCTATACATATATACTCTTCTAGAATGTATATATCATGGACAATGATTACCATATACATCGAGAAGAGTATCTACGGCATGCATTTACAGCTACTCTTCTAGATGTATATATTCTTTTTACTTTCGATCTATCCATCTATCGTGGTTCAGAAAGCGACTTTAGAATTATTTTGAAGCCACACTAGGGATTCTAAGGGTAAGGTATTCTAAGTGACCTTAAGGAACTCTTTATACTCTTTATACTCTTTATAGCTGAATACTTTTTTCTGTTCCCATTAGAATCTGATATATATATATATATATATATATATATATATATTTTTTTTTTTTTCCTTTCAATTTTCCTCTACTTTGGGATCATTTGTATGTTTAGTTTTAGATTCTTCTATAAACCCTTGAATCCTTGACTTTGGCTTACAGGTTGGGTTTTCTGGTTGACTTTCTATCTTACTCTGCCATTGTTGGTTTTATGGGTGGAGCTACCATCACGCCCTTTCAATGGCTTAAAACTATCTCCATTGTTACGAGACATTTTAGGAAAACCAAAAGCAAAGTCACGAGAGCTTATGTTCAAAATAGACAATATCATATCGTTGTGGAGGGTCATGGTTTCTAACAAGTAAGCCACCGATGGACTAATCTGACACAAACCTAATAGTTTCTCCTCTGGCACCAAAATTGAGTGCAAATGAAACAAGGCATATATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGCAAACAAAAGGAAGTCACAAACCTTGGTGAAGAGAGAGAACCCCAATATCTTGGACAAAAGAGTAAATCAAATTACTTCAGTTCGCGGAGTGATTTCTACAAAGCATTAGTTCATCATGAATCAGATTTGCTGAAATGCAAAAGAAAAAGAACACAAAAAACATCAAATGAGCATTAGTAATCGTGTGAATAAGTGAAGGAACCATGGGCGCACGAGACATTGATAGCCCAATATTCACATCACTGTCAGACGCAGTGGCTCATTGATTGAATTCATCGGAATTCCGCCATCACAGACCCTTCAAACTTCAACACTTTTCACGTTTCTTCCACATTTTGATCATGTTTGCTGATGCGTGAGAGATAGTGTGTGTATATAGAAATTCATGTTTAGGTGATAGGTTGGCAAAAACAGAAAGAGAACGGTGTATCTAACCCATCATATATAATGTATGCATGGAAGCTAAACTAAGACATAGGTAAGACGGATGCAAGGATGTCAACATGTTGAAGGGTTGGGTAGGTCTAGGCCTTAACCTCGAAAATTGGGGTTTTAAAGGGAATTTGTGCATGTGGTTTTGGAGTCAAGTTTGAACATATGGAATATGAGCGCTCGCGAAATTTGGTATCGATCAGACACCAACAGACGCTCCACGACACTTTATGTTCATTTGCCAAAATAATGTATACACCTGTCAAATTGGAAATACTTCAAAATGTACCATAGGAGGAGACATGGTATAAGCTATCCATCACCCTTGGACCCTGTGGCCTAAGCGAGCTACCATGCGGCATATGTGTGTATCACAGTGAGGGCAAGCATTCATTGCAAGATGAGTATCTCGAGCGATGACACAGGTGTGCCCTCATTGCTCCCGAGTCCATGTGTCGGAGGTGGGATCATATACATGCTATCTGATATAGAGTCTGTAAATGGTATGACATGGATTAGGAGGGTCTAATACCTTAATAAAACCAACCATATGGATGGATGTTTAGAATGTCCCGATGTTATGAACAACACATGAACTCGTTTTCGATTGGCAAGACCAAGTGAATTATGACGGTCGATGACATCCCAGGACTTGAGATGACATCATGATATGAAGGTCGTGTCAATTGGGAACCAAGATGAGATTTTAGATGCAACATTGTACACGAGAGACCTTGTCTCCAAGTAGAGGCAAGGTCAAGCCAAATAATGTGGCTAGTTTAAATTGCAGATGATTGAAAATGCACGTACAGTCAATGTGTGTGGTTGAACAAGCTTTGATTCCATACAAGTTGTATTCAATGAGTAAAAACAAATCTTGTCTAAGGTCTAACAGAACTGAAAGATAAATTAAAAATGAACATGGGGCTACAAAGTCAAATTTGGAGGATAGTAGTGGATAATAATTATGAATATTATATTATAAACTTTTCATATACAAAACATTATTAAAAATCGAACTATAATTCTAAAAAATTAAAGAAAAATTAAAGAAATGCATTAATGCATTCTTCTTCTTCTTCTTTTTTTTCTATCTTCAAAATTGTGGAAATAAAAATAGAGTGATTAGAAAATTGGAGACGCCATGGCATCACAATGGTCGGAATAAAGCAAGAAGGGTGAGAAGAAGCGGCGTGATTAGGTTACAGAAAATGGTGAAAGATGGGGCATCTGACGGTGCCGGCACCGGCGAGTCCTCATCCGATGGTTCCACCGGTGCTGGTGGCCCTGTCGGAGCACTACTTGGCGGCACTTCTGGTGCGGGTGGATTAACTGCAACCATTAAAAAAACATAATTACAAAGATGGTCCCTATATTTGGCAATATTTATTTATTGTACTTTTGTTTTTGTGATTTTACAATTATATCCCCAGTTTGAAAAGAAAGTGGGGTTGTGTAAGCGTACCTGGAGCAGGCGTCTGAGCCATAGTTGGAGCGGGCACTGGCGCTCCTGTTCCGGGAGCTCGTGATGGCACTCTGCCACCTGGAAATGGCGTAGGCGCTCCAGTACCTGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTAGGCGCTCCAGTACCGGGAGCTCGCGCTGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGTGCTCTACTACCAGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTTGGCGCTGAAGCATCGGGAGCTCGCGATGGCGCTCTATTACCTGGAAATGGCGTTGGTGCTGCAGCACCTAGAGCTCGTGATGGCGCTTTACTACCTGGAAATGGAGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTCGTTGGTGCTGCAGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTTGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACGTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGCGATGGCACTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGAAGCTTGCGATGGCCCTTTACTACCTTCGAATGGCATTGGCGCTTCAGCACCGGGGGCTTGCGATGGCGCTTTACCATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGATTGCGATGGTGCTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCGATGGCGCTTTCCTATCTGGAGTTGTCGTTGGTGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTACGACACCGGGAGCTTGCAATGGCGCTTTACTATCTGGAGTTGTTGGCGCTTGTGATGGTGCTATACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGTGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCAATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGTGCTGTACTACCTGAAAATGTAGTTGGGGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCATCGGGAGCTCGCGATGGCGCTTTACTACTTGGAGTTGTCGTTGACGCTGCAGCACCGGGAGCTCGCGATGGGTCTTTACTACCTGGAGTTGTTGTTGGCGCTGCGGCAACTGGAGCTCGCGATGAGGCTTTAGTACCTGGAATTGTCGTTGGCCCTGCTGCAACAGGAGCTCGCGATGGCGCTTTACTACCTGGAACTATCCTTGGCGCTGCAGCAACGGGAGCTCGTGAAGGCGCTTTACTACCTGGAATTGTCATTGGCGCTGCGGCAACCGGAGCTCGTGATGAGGCTCTACTACCTGGAATTGTCGTTGGTGCTGCAGCACGGGGAGCTCGCAATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGCATTTTACTACCTGTAGTTGGTGTTGGTGCTGCGGCAACTGGAGCTCGCGATGGTGCTTTACTACCTGGAATTGTCGTTGGCGCTGCAGCACCATGA
mRNA sequence
ATGGCTATGGTTTGTCACGTCTCTAGTGAAAGCAACAGGACACAACCAGCCACCATGGTTGGCGCTATTAGCTTACACACATGCATTTGCAAAACTGTTGACAAAACCAAAAGTGTATCGTCGGCCAAATCAGGAAGCCACTCAATTGGAATCTGGCCGCTAAAACATCCAAGTTCATCGTCGAAGCGTCGGCGGCCACATTACTCTATTCAGTGCCAAAAGGCTTCAATTTCCGAGTTTTCGCCAATAATCAAACAGAGCCCTAAGTTTGTGGCATCGCTTTCGATGTGGAGCGACGAATCGGGAATGTTGCAGAGAAATGAAGGCGGTTGGGTTTTCTGGGTGATTAGAAAATTGGAGACGCCATGGCATCACAATGGTCGGAATAAAGCAAGAAGGGTGAGAAGAAGCGGCGTGATTAGGTTACAGAAAATGGTGAAAGATGGGGCATCTGACGGTGCCGGCACCGGCGAGTCCTCATCCGATGGTTCCACCGGTGCTGGTGGCCCTGTCGGAGCACTACTTGGCGGCACTTCTGTTGGAGCGGGCACTGGCGCTCCTGTTCCGGGAGCTCGTGATGGCACTCTGCCACCTGGAAATGGCGTAGGCGCTCCAGTACCTGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTAGGCGCTCCAGTACCGGGAGCTCGCGCTGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGTGCTCTACTACCAGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTTGGCGCTGAAGCATCGGGAGCTCGCGATGGCGCTCTATTACCTGGAAATGGCGTTGGTGCTGCAGCACCTAGAGCTCGTGATGGCGCTTTACTACCTGGAAATGGAGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTCGTTGGTGCTGCAGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTTGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACGTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGCGATGGCACTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGAAGCTTGCGATGGCCCTTTACTACCTTCGAATGGCATTGGCGCTTCAGCACCGGGGGCTTGCGATGGCGCTTTACCATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGATTGCGATGGTGCTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCGATGGCGCTTTCCTATCTGGAGTTGTCGTTGGTGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTACGACACCGGGAGCTTGCAATGGCGCTTTACTATCTGGAGTTGTTGGCGCTTGTGATGGTGCTATACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGTGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCAATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGTGCTGTACTACCTGAAAATGTAGTTGGGGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCATCGGGAGCTCGCGATGGCGCTTTACTACTTGGAGTTGTCGTTGACGCTGCAGCACCGGGAGCTCGCGATGGGTCTTTACTACCTGGAGTTGTTGTTGGCGCTGCGGCAACTGGAGCTCGCGATGAGGCTTTAGTACCTGGAATTGTCGTTGGCCCTGCTGCAACAGGAGCTCGCGATGGCGCTTTACTACCTGGAACTATCCTTGGCGCTGCAGCAACGGGAGCTCGTGAAGGCGCTTTACTACCTGGAATTGTCATTGGCGCTGCGGCAACCGGAGCTCGTGATGAGGCTCTACTACCTGGAATTGTCGTTGGTGCTGCAGCACGGGGAGCTCGCAATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGCATTTTACTACCTGTAGTTGGTGTTGGTGCTGCGGCAACTGGAGCTCGCGATGGTGCTTTACTACCTGGAATTGTCGTTGGCGCTGCAGCACCATGA
Coding sequence (CDS)
ATGGCTATGGTTTGTCACGTCTCTAGTGAAAGCAACAGGACACAACCAGCCACCATGGTTGGCGCTATTAGCTTACACACATGCATTTGCAAAACTGTTGACAAAACCAAAAGTGTATCGTCGGCCAAATCAGGAAGCCACTCAATTGGAATCTGGCCGCTAAAACATCCAAGTTCATCGTCGAAGCGTCGGCGGCCACATTACTCTATTCAGTGCCAAAAGGCTTCAATTTCCGAGTTTTCGCCAATAATCAAACAGAGCCCTAAGTTTGTGGCATCGCTTTCGATGTGGAGCGACGAATCGGGAATGTTGCAGAGAAATGAAGGCGGTTGGGTTTTCTGGGTGATTAGAAAATTGGAGACGCCATGGCATCACAATGGTCGGAATAAAGCAAGAAGGGTGAGAAGAAGCGGCGTGATTAGGTTACAGAAAATGGTGAAAGATGGGGCATCTGACGGTGCCGGCACCGGCGAGTCCTCATCCGATGGTTCCACCGGTGCTGGTGGCCCTGTCGGAGCACTACTTGGCGGCACTTCTGTTGGAGCGGGCACTGGCGCTCCTGTTCCGGGAGCTCGTGATGGCACTCTGCCACCTGGAAATGGCGTAGGCGCTCCAGTACCTGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTAGGCGCTCCAGTACCGGGAGCTCGCGCTGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCTGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGTGCTCTACTACCAGGAAATGGCGTTGGCGCTCCAGTACCGGGAGCTCGCGATGGCGCTCTACTACCAGGAAATGGCGTTGGCGCTGAAGCATCGGGAGCTCGCGATGGCGCTCTATTACCTGGAAATGGCGTTGGTGCTGCAGCACCTAGAGCTCGTGATGGCGCTTTACTACCTGGAAATGGAGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTCGTTGGTGCTGCAGCACCGGGGGCTCGCGATGGCGCTTTACTACCTGCAAATGTTGTTGGTGCTGCGGCACCGGGGGCTCGCGATGGCGCTTTACTACGTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGCGATGGCACTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCTCGGGAAGCTTGCGATGGCCCTTTACTACCTTCGAATGGCATTGGCGCTTCAGCACCGGGGGCTTGCGATGGCGCTTTACCATCTGGAGTTGTCGTTGGCGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGATTGCGATGGTGCTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCGATGGCGCTTTCCTATCTGGAGTTGTCGTTGGTGCTACAGCACCGGGAGCTCGTGATGGCGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTACGACACCGGGAGCTTGCAATGGCGCTTTACTATCTGGAGTTGTTGGCGCTTGTGATGGTGCTATACTACCTGAAAATGTAGTTGGTGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCACCGGGAGCTCGCGATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGTGCTGTACTACCTGAAAATGTAGTTGGTGCTGCGGCACGGGGAGCTTGCGATGGCACTTTACTACCTGCAAATGGCATTGGCGCTTCAGCACCGGGGGCTCGCAATGGCGCTTTACTATCTGGAGTTGTCGTTGGCGCTACAGCACCGGGAGCTCGTGATGGTGCTGTACTACCTGAAAATGTAGTTGGGGCTGCAGCACGGGGAGCTTGCGATGGCGCTTTACTACCTGGAAATGGCATTAGCGCTGCTGCATCGGGAGCTCGCGATGGCGCTTTACTACTTGGAGTTGTCGTTGACGCTGCAGCACCGGGAGCTCGCGATGGGTCTTTACTACCTGGAGTTGTTGTTGGCGCTGCGGCAACTGGAGCTCGCGATGAGGCTTTAGTACCTGGAATTGTCGTTGGCCCTGCTGCAACAGGAGCTCGCGATGGCGCTTTACTACCTGGAACTATCCTTGGCGCTGCAGCAACGGGAGCTCGTGAAGGCGCTTTACTACCTGGAATTGTCATTGGCGCTGCGGCAACCGGAGCTCGTGATGAGGCTCTACTACCTGGAATTGTCGTTGGTGCTGCAGCACGGGGAGCTCGCAATGGGGCTTTACTACCTGGAGTTGTCGTTGGCGCTGCAGCAACGGGAGCTCGCGATGGCATTTTACTACCTGTAGTTGGTGTTGGTGCTGCGGCAACTGGAGCTCGCGATGGTGCTTTACTACCTGGAATTGTCGTTGGCGCTGCAGCACCATGA
Protein sequence
MAMVCHVSSESNRTQPATMVGAISLHTCICKTVDKTKSVSSAKSGSHSIGIWPLKHPSSSSKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQRNEGGWVFWVIRKLETPWHHNGRNKARRVRRSGVIRLQKMVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSVGAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAAAPGARDGALLPANVVGAAAPGARDGALLRGVVVGATAPGARDGTVLPENVVGAAAREACDGPLLPSNGIGASAPGACDGALPSGVVVGARDGAVLPENVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAAPGARDGATTPGACNGALLSGVVGACDGAILPENVVGAAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAASGARDGALLLGVVVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARDGALLPGTILGAAATGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPGVVVGAAATGARDGILLPVVGVGAAATGARDGALLPGIVVGAAAP
Homology
BLAST of Csor.00g289960 vs. NCBI nr
Match:
KAG6602053.1 (Aggrecan core protein, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 1568 bits (4060), Expect = 0.0
Identity = 871/871 (100.00%), Postives = 871/871 (100.00%), Query Frame = 0
Query: 1 MAMVCHVSSESNRTQPATMVGAISLHTCICKTVDKTKSVSSAKSGSHSIGIWPLKHPSSS 60
MAMVCHVSSESNRTQPATMVGAISLHTCICKTVDKTKSVSSAKSGSHSIGIWPLKHPSSS
Sbjct: 1 MAMVCHVSSESNRTQPATMVGAISLHTCICKTVDKTKSVSSAKSGSHSIGIWPLKHPSSS 60
Query: 61 SKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQRNEGGWVFWVIRKLE 120
SKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQRNEGGWVFWVIRKLE
Sbjct: 61 SKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQRNEGGWVFWVIRKLE 120
Query: 121 TPWHHNGRNKARRVRRSGVIRLQKMVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSV 180
TPWHHNGRNKARRVRRSGVIRLQKMVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSV
Sbjct: 121 TPWHHNGRNKARRVRRSGVIRLQKMVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSV 180
Query: 181 GAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGA 240
GAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGA
Sbjct: 181 GAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGA 240
Query: 241 PVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGAR 300
PVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGAR
Sbjct: 241 PVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGAR 300
Query: 301 DGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLP 360
DGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLP
Sbjct: 301 DGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLP 360
Query: 361 ANVVGAAAPGARDGALLPANVVGAAAPGARDGALLRGVVVGATAPGARDGTVLPENVVGA 420
ANVVGAAAPGARDGALLPANVVGAAAPGARDGALLRGVVVGATAPGARDGTVLPENVVGA
Sbjct: 361 ANVVGAAAPGARDGALLPANVVGAAAPGARDGALLRGVVVGATAPGARDGTVLPENVVGA 420
Query: 421 AAREACDGPLLPSNGIGASAPGACDGALPSGVVVGARDGAVLPENVVGAAARGDCDGALL 480
AAREACDGPLLPSNGIGASAPGACDGALPSGVVVGARDGAVLPENVVGAAARGDCDGALL
Sbjct: 421 AAREACDGPLLPSNGIGASAPGACDGALPSGVVVGARDGAVLPENVVGAAARGDCDGALL 480
Query: 481 PANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGIS 540
PANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGIS
Sbjct: 481 PANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGIS 540
Query: 541 AAAPGARDGATTPGACNGALLSGVVGACDGAILPENVVGAAARGACDGALLPGNGISAAA 600
AAAPGARDGATTPGACNGALLSGVVGACDGAILPENVVGAAARGACDGALLPGNGISAAA
Sbjct: 541 AAAPGARDGATTPGACNGALLSGVVGACDGAILPENVVGAAARGACDGALLPGNGISAAA 600
Query: 601 PGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNG 660
PGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNG
Sbjct: 601 PGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNG 660
Query: 661 ALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAASGARDGALLLGV 720
ALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAASGARDGALLLGV
Sbjct: 661 ALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAASGARDGALLLGV 720
Query: 721 VVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARDGALLPGTILGAAA 780
VVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARDGALLPGTILGAAA
Sbjct: 721 VVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARDGALLPGTILGAAA 780
Query: 781 TGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPGVVVGAAATGARDG 840
TGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPGVVVGAAATGARDG
Sbjct: 781 TGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPGVVVGAAATGARDG 840
Query: 841 ILLPVVGVGAAATGARDGALLPGIVVGAAAP 871
ILLPVVGVGAAATGARDGALLPGIVVGAAAP
Sbjct: 841 ILLPVVGVGAAATGARDGALLPGIVVGAAAP 871
BLAST of Csor.00g289960 vs. NCBI nr
Match:
XP_022959524.1 (elastin-like [Cucurbita moschata])
HSP 1 Score: 1091 bits (2822), Expect = 0.0
Identity = 673/847 (79.46%), Postives = 679/847 (80.17%), Query Frame = 0
Query: 145 MVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTS--VGAGTGAPVPGARDGTLPPGNGV 204
MVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTS VGAGTGAPV GARDGTL PGNGV
Sbjct: 1 MVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVSGARDGTLLPGNGV 60
Query: 205 GAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGAPVPGARDGALLPGNGVGAPVPG 264
GAPVPGARDGALLPGNGVGAPVPGAR DGALLPGNGVGAPVPG
Sbjct: 61 GAPVPGARDGALLPGNGVGAPVPGAR------------------DGALLPGNGVGAPVPG 120
Query: 265 ARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEASGARDGAL 324
ARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEA GARDGAL
Sbjct: 121 ARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEAPGARDGAL 180
Query: 325 LPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAAAPGARDGALLPANVV 384
LPGN VGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAA PGARDG LLPANVV
Sbjct: 181 LPGNSVGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAATPGARDGTLLPANVV 240
Query: 385 GAAAPGARDGALLRGVVVGATAPGARDGTVLPENVVGAAAREACDGPLLPSNGIGASAPG 444
GAAAPGARDGALL GVVVGATAPGARDG VLPENVVGAAAR ACDG LLP NGIGASAPG
Sbjct: 241 GAAAPGARDGALLPGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPVNGIGASAPG 300
Query: 445 ACDGALPSGVVVGARDGAVLPENVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVV 504
A DGAL SGVVVGARDGAVLPENV+GAAARGDCDGALLPANGIGASAPGA DGA LSGV+
Sbjct: 301 ARDGALLSGVVVGARDGAVLPENVIGAAARGDCDGALLPANGIGASAPGALDGALLSGVL 360
Query: 505 VGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAAPGARDGA---------TTP 564
VGATAPGARDG VLPENVVGAAARGACDGALLPGNGISAAAPGARDGA T
Sbjct: 361 VGATAPGARDGTVLPENVVGAAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAL 420
Query: 565 GACNGALLSGVV-GACDGAILPENVVGAAARGACDGALLPGNGISAAAPGARDGALLSGV 624
GA +GALLSGVV GACDGAILPENVVGAAARGACDGALLPGNGISAAA GARDGALLSGV
Sbjct: 421 GARDGALLSGVVVGACDGAILPENVVGAAARGACDGALLPGNGISAAALGARDGALLSGV 480
Query: 625 VVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATA 684
VVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGAT
Sbjct: 481 VVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATV 540
Query: 685 PGARDGAVLPENVVGAAARGACDG------------------------------------ 744
PGARDGAVLPENVVGAAARGACDG
Sbjct: 541 PGARDGAVLPENVVGAAARGACDGTLLPANGIGALALGARNGALLSGVVVGATALGARDG 600
Query: 745 ------------------------------------------------------------ 804
Sbjct: 601 AVLPENVVGAAARGACDGALLPGNGINAAAPGARDGALLSGVVVGATAPGARDGVVLPEN 660
Query: 805 ------------ALLPGNGISAAASGARDGALLLGVVVDAAAPGARDGSLLPGVVVGAAA 864
ALLPGNGISAAASGARDGALLLGVVVDAAAPGARDG+LLPGVVVGAAA
Sbjct: 661 VVGAAARGACDGALLPGNGISAAASGARDGALLLGVVVDAAAPGARDGALLPGVVVGAAA 720
Query: 865 TGARDEALVPGIVVGPAATGARDGALLPGTILGAAATGAREGALLPGIVIGAAATGARDE 871
TGARDEALVPGIVVGPAATGARDGALLPGTILGAAA GAREGALLPGIVIGAAATGARDE
Sbjct: 721 TGARDEALVPGIVVGPAATGARDGALLPGTILGAAAMGAREGALLPGIVIGAAATGARDE 780
BLAST of Csor.00g289960 vs. NCBI nr
Match:
XP_022990802.1 (elastin-like [Cucurbita maxima])
HSP 1 Score: 862 bits (2226), Expect = 5.26e-293
Identity = 558/764 (73.04%), Postives = 581/764 (76.05%), Query Frame = 0
Query: 172 GALLGGTSVGAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGA 231
GALL G VGA PGARDG + P N VG GA DGALLPGNG+ A PGAR GA
Sbjct: 412 GALLSGVVVGA----TAPGARDGAVLPENVVGVAARGACDGALLPGNGISAAAPGARDGA 471
Query: 232 LLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNG 291
LL G VGA PGA DG +LP N VGA GA DGALLP N VGA PGARDGALLPG
Sbjct: 472 LLSGVVVGATAPGAPDGVVLPENVVGAATRGACDGALLPANVVGAAAPGARDGALLPGVV 531
Query: 292 VGAPVPGARDGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGV--GAA 351
VGA PGARDGA+LP N VGA GA +GALLPGNG+ AAAP ARDGALL GV GA
Sbjct: 532 VGATAPGARDGAVLPENVVGAAVRGACNGALLPGNGISAAAPGARDGALLSRLGVVVGAT 591
Query: 352 APGARDGALLPANVVGAAAPGARDGALLPANVVGAAAPGARDGALLRGVVVGATAPGARD 411
APGA DG +LP NVVGAAA GA +GALLPAN +GA+A GARDGALL GVVVGATAPGARD
Sbjct: 592 APGAPDGVVLPENVVGAAARGACNGALLPANGIGASATGARDGALLTGVVVGATAPGARD 651
Query: 412 GTVLPENVVGAAAREACDGPLLPSNGIGASAPGACDGALPSGVVVGAR-----DGAVLPE 471
G VLPENVVG AAR ACDG LLP NGI A+APGA DGAL SGVVVGA DG VLPE
Sbjct: 652 GAVLPENVVGVAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGAPDGVVLPE 711
Query: 472 NVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLPENVVGAA 531
NVVGAAARG C+GALLPANGIGASA GARDGA L+GVVVGATAPGARDGAVLPENVVG A
Sbjct: 712 NVVGAAARGACNGALLPANGIGASATGARDGALLTGVVVGATAPGARDGAVLPENVVGVA 771
Query: 532 ARGACDGALLPGNGISAAAPGARDGATTPGACNGALLSGVVGACDGAILPENVVGAAARG 591
ARGACDGALLPGNGISAAAPGARDGA G GA G A DG +LPENVVGAA RG
Sbjct: 772 ARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPG---APDGVVLPENVVGAATRG 831
Query: 592 ACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTL 651
ACDGALLP N + AAAPGARDGALL GVVVG TAPGARDGAVLPENVVGAA RGACDG L
Sbjct: 832 ACDGALLPANVVGAAAPGARDGALLPGVVVGTTAPGARDGAVLPENVVGAAVRGACDGAL 891
Query: 652 LPANGIGASAPGARNGALLSGVVVGAT---------------------------APGARD 711
LPANGIGASAPGAR+G LLSGVVVGA APGARD
Sbjct: 892 LPANGIGASAPGARDGGLLSGVVVGAVLPENVVGAAAWGACDGALLPGNGISVAAPGARD 951
Query: 712 G------------AVLPENVVGAAARGACDGALLPGNGISAAASGARDGA---------- 771
G A+LPENVVGAAARGACDGALLPGNGISAAA GARDGA
Sbjct: 952 GGLLSGVVGALDGAILPENVVGAAARGACDGALLPGNGISAAAPGARDGAVLPENVVGAV 1011
Query: 772 --------LLLGVVVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARD 831
LLLGVVV AAAPGA DG+LLPGVVVG AA GARDEALVP VVGPAATGARD
Sbjct: 1012 ARGACDGALLLGVVVGAAAPGACDGALLPGVVVGTAAIGARDEALVPRFVVGPAATGARD 1071
Query: 832 GALLPGTILGAAATGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPG 871
G LLPG ++GAAATGAREG LLP IV+GAAATGARDEALLPGIVVGAAA GARNGALLPG
Sbjct: 1072 GTLLPGVVVGAAATGAREGDLLPKIVVGAAATGARDEALLPGIVVGAAAWGARNGALLPG 1131
BLAST of Csor.00g289960 vs. NCBI nr
Match:
KAG6602054.1 (hypothetical protein SDJN03_07287, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 238 bits (608), Expect = 6.26e-69
Identity = 153/219 (69.86%), Postives = 175/219 (79.91%), Query Frame = 0
Query: 650 IGASAPGARNGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAAS 709
IGA+A GAR+GALL G+VVGA A GARDGA+LP VVGAAA GA DG LLPG + AAA+
Sbjct: 3 IGAAATGARDGALLPGIVVGAAATGARDGALLPGIVVGAAATGARDGTLLPGTVLGAAAT 62
Query: 710 GARDGALLLGVVVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARDGA 769
GAR+GALL G V+ AAAPGAR+G+LLP +VVGAAATGARD AL+PGIVVG AATGARDGA
Sbjct: 63 GAREGALLPGTVLGAAAPGAREGALLPVIVVGAAATGARDGALLPGIVVGAAATGARDGA 122
Query: 770 LLPGTILGAAATGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPGVV 829
LLPG ++GAAATGAR G LLPG V+GA+ATGARD LLPG V+GA+A GA +G LLPG +
Sbjct: 123 LLPGIVVGAAATGARHGDLLPGTVLGASATGARDGDLLPGTVLGASAAGACDGPLLPGTI 182
Query: 830 VGAAATGARDGILLPVVGVGAAATGARDGALLPGIVVGA 868
+GAAATGA +G L P VGA GA GALL V GA
Sbjct: 183 LGAAATGALEGALPPTNVVGA---GAGAGALLYETVAGA 218
BLAST of Csor.00g289960 vs. NCBI nr
Match:
KAG7032747.1 (hypothetical protein SDJN02_06797, partial [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 132 bits (333), Expect = 8.98e-33
Identity = 64/64 (100.00%), Postives = 64/64 (100.00%), Query Frame = 0
Query: 47 HSIGIWPLKHPSSSSKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQR 106
HSIGIWPLKHPSSSSKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQR
Sbjct: 1 HSIGIWPLKHPSSSSKRRRPHYSIQCQKASISEFSPIIKQSPKFVASLSMWSDESGMLQR 60
Query: 107 NEGG 110
NEGG
Sbjct: 61 NEGG 64
BLAST of Csor.00g289960 vs. ExPASy TrEMBL
Match:
A0A6J1H8B0 (elastin-like OS=Cucurbita moschata OX=3662 GN=LOC111460536 PE=4 SV=1)
HSP 1 Score: 1091 bits (2822), Expect = 0.0
Identity = 673/847 (79.46%), Postives = 679/847 (80.17%), Query Frame = 0
Query: 145 MVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTS--VGAGTGAPVPGARDGTLPPGNGV 204
MVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTS VGAGTGAPV GARDGTL PGNGV
Sbjct: 1 MVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSGAVGAGTGAPVSGARDGTLLPGNGV 60
Query: 205 GAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGAPVPGARDGALLPGNGVGAPVPG 264
GAPVPGARDGALLPGNGVGAPVPGAR DGALLPGNGVGAPVPG
Sbjct: 61 GAPVPGARDGALLPGNGVGAPVPGAR------------------DGALLPGNGVGAPVPG 120
Query: 265 ARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEASGARDGAL 324
ARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEA GARDGAL
Sbjct: 121 ARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEAPGARDGAL 180
Query: 325 LPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAAAPGARDGALLPANVV 384
LPGN VGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAA PGARDG LLPANVV
Sbjct: 181 LPGNSVGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAATPGARDGTLLPANVV 240
Query: 385 GAAAPGARDGALLRGVVVGATAPGARDGTVLPENVVGAAAREACDGPLLPSNGIGASAPG 444
GAAAPGARDGALL GVVVGATAPGARDG VLPENVVGAAAR ACDG LLP NGIGASAPG
Sbjct: 241 GAAAPGARDGALLPGVVVGATAPGARDGAVLPENVVGAAARGACDGALLPVNGIGASAPG 300
Query: 445 ACDGALPSGVVVGARDGAVLPENVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVV 504
A DGAL SGVVVGARDGAVLPENV+GAAARGDCDGALLPANGIGASAPGA DGA LSGV+
Sbjct: 301 ARDGALLSGVVVGARDGAVLPENVIGAAARGDCDGALLPANGIGASAPGALDGALLSGVL 360
Query: 505 VGATAPGARDGAVLPENVVGAAARGACDGALLPGNGISAAAPGARDGA---------TTP 564
VGATAPGARDG VLPENVVGAAARGACDGALLPGNGISAAAPGARDGA T
Sbjct: 361 VGATAPGARDGTVLPENVVGAAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAL 420
Query: 565 GACNGALLSGVV-GACDGAILPENVVGAAARGACDGALLPGNGISAAAPGARDGALLSGV 624
GA +GALLSGVV GACDGAILPENVVGAAARGACDGALLPGNGISAAA GARDGALLSGV
Sbjct: 421 GARDGALLSGVVVGACDGAILPENVVGAAARGACDGALLPGNGISAAALGARDGALLSGV 480
Query: 625 VVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATA 684
VVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGAT
Sbjct: 481 VVGATAPGARDGAVLPENVVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATV 540
Query: 685 PGARDGAVLPENVVGAAARGACDG------------------------------------ 744
PGARDGAVLPENVVGAAARGACDG
Sbjct: 541 PGARDGAVLPENVVGAAARGACDGTLLPANGIGALALGARNGALLSGVVVGATALGARDG 600
Query: 745 ------------------------------------------------------------ 804
Sbjct: 601 AVLPENVVGAAARGACDGALLPGNGINAAAPGARDGALLSGVVVGATAPGARDGVVLPEN 660
Query: 805 ------------ALLPGNGISAAASGARDGALLLGVVVDAAAPGARDGSLLPGVVVGAAA 864
ALLPGNGISAAASGARDGALLLGVVVDAAAPGARDG+LLPGVVVGAAA
Sbjct: 661 VVGAAARGACDGALLPGNGISAAASGARDGALLLGVVVDAAAPGARDGALLPGVVVGAAA 720
Query: 865 TGARDEALVPGIVVGPAATGARDGALLPGTILGAAATGAREGALLPGIVIGAAATGARDE 871
TGARDEALVPGIVVGPAATGARDGALLPGTILGAAA GAREGALLPGIVIGAAATGARDE
Sbjct: 721 TGARDEALVPGIVVGPAATGARDGALLPGTILGAAAMGAREGALLPGIVIGAAATGARDE 780
BLAST of Csor.00g289960 vs. ExPASy TrEMBL
Match:
A0A6J1JJV8 (elastin-like OS=Cucurbita maxima OX=3661 GN=LOC111487584 PE=4 SV=1)
HSP 1 Score: 862 bits (2226), Expect = 2.55e-293
Identity = 558/764 (73.04%), Postives = 581/764 (76.05%), Query Frame = 0
Query: 172 GALLGGTSVGAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGA 231
GALL G VGA PGARDG + P N VG GA DGALLPGNG+ A PGAR GA
Sbjct: 412 GALLSGVVVGA----TAPGARDGAVLPENVVGVAARGACDGALLPGNGISAAAPGARDGA 471
Query: 232 LLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNG 291
LL G VGA PGA DG +LP N VGA GA DGALLP N VGA PGARDGALLPG
Sbjct: 472 LLSGVVVGATAPGAPDGVVLPENVVGAATRGACDGALLPANVVGAAAPGARDGALLPGVV 531
Query: 292 VGAPVPGARDGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGV--GAA 351
VGA PGARDGA+LP N VGA GA +GALLPGNG+ AAAP ARDGALL GV GA
Sbjct: 532 VGATAPGARDGAVLPENVVGAAVRGACNGALLPGNGISAAAPGARDGALLSRLGVVVGAT 591
Query: 352 APGARDGALLPANVVGAAAPGARDGALLPANVVGAAAPGARDGALLRGVVVGATAPGARD 411
APGA DG +LP NVVGAAA GA +GALLPAN +GA+A GARDGALL GVVVGATAPGARD
Sbjct: 592 APGAPDGVVLPENVVGAAARGACNGALLPANGIGASATGARDGALLTGVVVGATAPGARD 651
Query: 412 GTVLPENVVGAAAREACDGPLLPSNGIGASAPGACDGALPSGVVVGAR-----DGAVLPE 471
G VLPENVVG AAR ACDG LLP NGI A+APGA DGAL SGVVVGA DG VLPE
Sbjct: 652 GAVLPENVVGVAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGAPDGVVLPE 711
Query: 472 NVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLPENVVGAA 531
NVVGAAARG C+GALLPANGIGASA GARDGA L+GVVVGATAPGARDGAVLPENVVG A
Sbjct: 712 NVVGAAARGACNGALLPANGIGASATGARDGALLTGVVVGATAPGARDGAVLPENVVGVA 771
Query: 532 ARGACDGALLPGNGISAAAPGARDGATTPGACNGALLSGVVGACDGAILPENVVGAAARG 591
ARGACDGALLPGNGISAAAPGARDGA G GA G A DG +LPENVVGAA RG
Sbjct: 772 ARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPG---APDGVVLPENVVGAATRG 831
Query: 592 ACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGTL 651
ACDGALLP N + AAAPGARDGALL GVVVG TAPGARDGAVLPENVVGAA RGACDG L
Sbjct: 832 ACDGALLPANVVGAAAPGARDGALLPGVVVGTTAPGARDGAVLPENVVGAAVRGACDGAL 891
Query: 652 LPANGIGASAPGARNGALLSGVVVGAT---------------------------APGARD 711
LPANGIGASAPGAR+G LLSGVVVGA APGARD
Sbjct: 892 LPANGIGASAPGARDGGLLSGVVVGAVLPENVVGAAAWGACDGALLPGNGISVAAPGARD 951
Query: 712 G------------AVLPENVVGAAARGACDGALLPGNGISAAASGARDGA---------- 771
G A+LPENVVGAAARGACDGALLPGNGISAAA GARDGA
Sbjct: 952 GGLLSGVVGALDGAILPENVVGAAARGACDGALLPGNGISAAAPGARDGAVLPENVVGAV 1011
Query: 772 --------LLLGVVVDAAAPGARDGSLLPGVVVGAAATGARDEALVPGIVVGPAATGARD 831
LLLGVVV AAAPGA DG+LLPGVVVG AA GARDEALVP VVGPAATGARD
Sbjct: 1012 ARGACDGALLLGVVVGAAAPGACDGALLPGVVVGTAAIGARDEALVPRFVVGPAATGARD 1071
Query: 832 GALLPGTILGAAATGAREGALLPGIVIGAAATGARDEALLPGIVVGAAARGARNGALLPG 871
G LLPG ++GAAATGAREG LLP IV+GAAATGARDEALLPGIVVGAAA GARNGALLPG
Sbjct: 1072 GTLLPGVVVGAAATGAREGDLLPKIVVGAAATGARDEALLPGIVVGAAAWGARNGALLPG 1131
BLAST of Csor.00g289960 vs. ExPASy TrEMBL
Match:
A0A0H5P1W0 (Uncharacterized protein OS=Nocardia farcinica OX=37329 GN=ERS450000_04455 PE=4 SV=1)
HSP 1 Score: 113 bits (282), Expect = 1.21e-21
Identity = 214/572 (37.41%), Postives = 262/572 (45.80%), Query Frame = 0
Query: 172 GALLGGTSVGAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGA 231
GA+ G +GAG GA + GA G + GVGA + GA D G G+GA + GA
Sbjct: 145 GAIDAGVDLGAGLGAGL-GAGLGVV---GGVGAGLEGAIDAVAGVGAGLGAGIGGAVDAV 204
Query: 232 LLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNG 291
G GVGA + GA D G GVGA + GA D G G+GA + GA +G G
Sbjct: 205 AGVGAGVGAGLEGALDAVAGVGAGVGAGLEGAVDAVAGVGAGLGAGLEGALEGVTDVTAG 264
Query: 292 VGAPVPGARDGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALLPGNGVGAAAP 351
+G GAL G+G +G+ DG + G G+G A DG + G G+GAA
Sbjct: 265 LG--------GALGGVAGLGGGLAGSLDGVVDAGAGLGGAL----DGVVEAGAGLGAAVG 324
Query: 352 GARD------GALLPANVVGAAAPGARDGALLPANVVGAAAPGARD-GALLRGVVVGAT- 411
GA D GA+ A GA GA GA+ VG A GA + GA L G V GA
Sbjct: 325 GAVDAVAGVGGAVDGAVEAGAGLGGAVGGAVDAVAGVGGALDGAVEAGAGLGGAVGGAVD 384
Query: 412 ----APGARDGTVLPENVVGA-AAREACDGPLLPSNGIGASAPGACDGALPSGVVVG-AR 471
G DG V+ V GA A G L GA GA DGAL + +G A
Sbjct: 385 AVAGVGGGLDG-VVDAGVGGALGAVTGIGGALDGVVDAGAGVGGAVDGALGAVAGIGGAV 444
Query: 472 DGAVLPENVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVVVGATAPGARDGAVLP 531
DGAV GA G DGA+ GIG GA DGA +GV +GA GA DGAV
Sbjct: 445 DGAV----DAGAGLGGAVDGAVDAVAGIG----GALDGAVDAGVGLGAGIGGALDGAVDA 504
Query: 532 ENVVGAAARGACDGALLPGNGISAAAPGARDGATTPGACNGALLSGVVGACDGAILPENV 591
VG G +GA+ G G+ + GA GA GA A L G GA
Sbjct: 505 AAGVG----GGLEGAVGAGAGLGGSLEGALGGAAEAGAGLAAGLDGAAGA---------- 564
Query: 592 VGAAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPENVVGAAAR 651
VG A+ G GA+ G ++A GA DGAL +G VG GA DG V + +
Sbjct: 565 VGGASAGLA-GAINAGTDLAAGLDGAVDGALGAGAGVG----GALDGVVAAGADLTSGLN 624
Query: 652 GACDGTLLPANGIGASAPGARNGALLSGVVVGATAPGARDGAVLPENVVGAAARGACDGA 711
GA DG L G+ GA GA+ +G + GA DGAV GA GA GA
Sbjct: 625 GAVDGALGAGAGLTTGLEGALGGAVDAGAGLTTGLEGAVDGAV----GAGAGLEGALGGA 668
Query: 712 LLPGNGISAAASGARDGALLLGVVVDAAAPGA 729
+ G G +A GA GA+ G + A GA
Sbjct: 685 VEAGAGAAAGVGGAVGGAVEAGTGLAAGLEGA 668
BLAST of Csor.00g289960 vs. ExPASy TrEMBL
Match:
I4AA49 (Uncharacterized protein OS=Desulfitobacterium dehalogenans (strain ATCC 51507 / DSM 9161 / JW/IU-DC1) OX=756499 GN=Desde_2503 PE=4 SV=1)
HSP 1 Score: 110 bits (275), Expect = 4.49e-21
Identity = 120/302 (39.74%), Postives = 151/302 (50.00%), Query Frame = 0
Query: 142 LQKMVKDGASDGAGTGESSSDGSTGAGGPVGALLGGTSVGAGTGAPVPGARDGTLPPGNG 201
L +++ SDG G SSSDG G PV G GAPVP G P +G
Sbjct: 108 LSNLLRQFLSDGKGV-PSSSDGK---GAPV--------PSDGKGAPVPSDGKGAPVPSDG 167
Query: 202 VGAPVPGARDGALLPGNGVGAPVPGARAGALLPGNGVGAPVPGARDGALLPGNGVGAPVP 261
GAPVP GA +P +G GAPVP GA +P +G GAPVP GA +P +G GAPVP
Sbjct: 168 KGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVP 227
Query: 262 GARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAEASGARDGA 321
GA +P +G GAPVP GA +P +G GAPVP GA +P +G GA GA
Sbjct: 228 SDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGAPVPSDGKGA 287
Query: 322 LLPGNGVGAAAPRARDGALLPGNGVGAAAPGARDGALLPANVVGAAAPGARDGALLPANV 381
+PG+G GA P GA +P +G G P + DG +P + G AP DG P
Sbjct: 288 PMPGDGKGAPVPSDGKGAPVPNDGKGV--PVSSDGKGVPVSSDGKGAPVPSDGKGAPVPS 347
Query: 382 VGAAAPGARDGALLRGVVVGAT--APGARDGTVLPENVVGAAAREACDGPLLPSNGIGAS 441
G + P + DG +GV V + AP DG +P + G G +P +G G S
Sbjct: 348 DGKSVPVSSDG---KGVPVASDKGAPVPSDGKSVPVSSDG-------KGAPVPGDGKGTS 385
BLAST of Csor.00g289960 vs. ExPASy TrEMBL
Match:
A0A2A7UIQ9 (Uncharacterized protein OS=Nocardia sp. FDAARGOS_372 OX=2018066 GN=CRM89_20910 PE=4 SV=1)
HSP 1 Score: 110 bits (276), Expect = 6.09e-21
Identity = 247/712 (34.69%), Postives = 313/712 (43.96%), Query Frame = 0
Query: 172 GALLGGTSVGAGTGAPVPGARDGTLPPGNGVGAPVPGARDGALLPGNGVGAPVPGARAGA 231
GA+ G +GAG GA + GA G + GVGA + GA D G G+GA + GA
Sbjct: 145 GAIDAGVDLGAGLGAGL-GAGLGVV---GGVGAGLEGAIDAVAGVGAGLGAGIGGAVDAV 204
Query: 232 LLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDGALLPGNGVGAPVPGARDG------- 291
G GVGA + GA D G GVGA + GA D G G+GA + GA +G
Sbjct: 205 AGVGAGVGAGLEGALDAVAGVGAGVGAGLEGAVDAVAGVGAGLGAGLEGALEGVTDVTAG 264
Query: 292 ---ALLPGNGVGAPVPGARDGALLPGNGVGAEASGARDGALLPGNGVGAAAPRARDGALL 351
AL G+G + G+ DG + G G+G GA DG + G G+GA A GA+
Sbjct: 265 LGGALGGVAGLGGGLAGSLDGVVDAGAGLG----GALDGVVEAGAGLGAGLGAAVGGAVD 324
Query: 352 PGNGVGAAAPGARDGALLPANVVGAAAPGARDGA---------LLPANVVGAAAPGARDG 411
GVG GA DGA+ +G A GA D ++ A V GA G
Sbjct: 325 AVAGVG----GALDGAVEAGAGLGGAVGGAVDAVAGVGGGLNGVVDAGVGGALGAVTGIG 384
Query: 412 ALLRGVV-VGATAPGARDGTVLPENVVGAAAREACDGPLLPSNGIGASAPGACDGALPSG 471
L GVV GA GA DG + +G A D GA GA DGA+ +
Sbjct: 385 GALDGVVDAGAGVGGAVDGALGAVAGIGGAVGGGVDA--------GAGLGGAVDGAVDAV 444
Query: 472 VVVG-ARDGAVLPENVVGAAARGDCDGALLPANGIGASAPGARDGAFLSGVVVGATAPGA 531
+G A DGAV +GA G DGA+ A G+G G +GA +G +G + GA
Sbjct: 445 AGIGGALDGAVDAGAGLGAGIGGAIDGAVDAAAGVG----GGLEGAVGAGAGLGGSLEGA 504
Query: 532 RDGAVLPENVVGAAARGACDGALLPGNGISAAAPGARDGATTPGACNGALLSGVVGACDG 591
DGA GA DGA+ G +A GA D T L +G+ GA DG
Sbjct: 505 LDGAA----EAGAGLAAGLDGAVGAVGGATAGLAGAIDAGTD-------LAAGLDGAVDG 564
Query: 592 AILPENVVGAAARGACDGALLPGNGISAAAPGARDGALLSGVVVGATAPGARDGAVLPEN 651
A+ GA GA DGAL G +++ GA DGAL +G + + GA GAV
Sbjct: 565 AL----GAGAGVGGALDGALAAGADLTSGLNGAVDGALGAGAGLTSGLEGALGGAV---- 624
Query: 652 VVGAAARGACDGTLLPANGIGASAPGARNGALLSGVVVGATAPGARDGAVLPENVVGAAA 711
GA +G + A G GA GA GA+ +G A GA GAV GA
Sbjct: 625 DAGAGLTTGLEGAVNGAVGAGAGLEGALEGAVDAGAGAAAGVGGAVGGAV----EAGAGL 684
Query: 712 RGACDGALLPGNGISAAASGARDGALLLGVVVDAAAPGARDGSLLPGVVVGAAATGARDE 771
+GA+ G ++A G G L VDAA+ A G L V G A G +
Sbjct: 685 AAGLEGAVAAGGDVAAGLEGGLFGGL--DGAVDAASGVA--GGLSGAVNAGGQAVGGLES 744
Query: 772 ALVPGIVVGPAATGARDGALLPGTILGAAATGAREGALLPGIVIGAAATGARD--EALLP 831
+L G+ AA GA G L + GA GA A L G+V G +G A +
Sbjct: 745 SLSAGL---GAALGA--GGELSTELGGALDGGADLAAGLDGLVDGELVSGVDGGLTAAIG 798
Query: 832 GIVVGAAARGARNGALLPGVV-VGAAATGARDGILLPVVGVGAAATGARDGA 859
G++ G A G G L GVV TGA DG G+ A GA D A
Sbjct: 805 GVLGGDAGAGLETG--LGGVVDASGGLTGALDGTAETAAGLEAGLGGAADAA 798
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
KAG6602053.1 | 0.0 | 100.00 | Aggrecan core protein, partial [Cucurbita argyrosperma subsp. sororia] | [more] |
XP_022959524.1 | 0.0 | 79.46 | elastin-like [Cucurbita moschata] | [more] |
XP_022990802.1 | 5.26e-293 | 73.04 | elastin-like [Cucurbita maxima] | [more] |
KAG6602054.1 | 6.26e-69 | 69.86 | hypothetical protein SDJN03_07287, partial [Cucurbita argyrosperma subsp. sorori... | [more] |
KAG7032747.1 | 8.98e-33 | 100.00 | hypothetical protein SDJN02_06797, partial [Cucurbita argyrosperma subsp. argyro... | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1H8B0 | 0.0 | 79.46 | elastin-like OS=Cucurbita moschata OX=3662 GN=LOC111460536 PE=4 SV=1 | [more] |
A0A6J1JJV8 | 2.55e-293 | 73.04 | elastin-like OS=Cucurbita maxima OX=3661 GN=LOC111487584 PE=4 SV=1 | [more] |
A0A0H5P1W0 | 1.21e-21 | 37.41 | Uncharacterized protein OS=Nocardia farcinica OX=37329 GN=ERS450000_04455 PE=4 S... | [more] |
I4AA49 | 4.49e-21 | 39.74 | Uncharacterized protein OS=Desulfitobacterium dehalogenans (strain ATCC 51507 / ... | [more] |
A0A2A7UIQ9 | 6.09e-21 | 34.69 | Uncharacterized protein OS=Nocardia sp. FDAARGOS_372 OX=2018066 GN=CRM89_20910 P... | [more] |
Match Name | E-value | Identity | Description | |