Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCGACGTCGTCATCGTCCTCAGCATCTTCAGTAAGGTCGTGGAGGACGGCGTTTCTGACTCTGAGAGACGAATCGATCTCTTCTTCAACTTCAATTTCCCAACTTCTCTACGACACCATCTTCTCGCACTCCGATTCCTTAATCGCTGCCGCGCGCTACCTTCCTCCACCAGAAGTACGATTTTCTCATTTCTTCTATTTATCTTATTCGAGGCTCTTCAAAAAATGCATAATCATCGCTTTGTGGCTCCTTCTTTGTGTGCATAGGTTTCATCAGATCTGCTGTTTCTCTTAGAAGTGGCTACTTCTGCTTCCGACTCCGTGCAAGACATTGTGCTTATCTTCGCAGATATCATACATCTGGTGCAATTCACATTTTCGATTTCTATTATTCCTTTTTTCTTCTTTTGGACCAATTTCCCAGGGGTAGAGCAAGAACGATGAAATTGGGAAATTTGTTCTGTTAGTTCATTTCCACGCATCTGTTAAGAATATCGCCAACAATTTAATCATGATCTATTTTGCGCATGAAAATGGGGTTCTGTGTGATGTTTAGAAACTAGAAGAAATTCTTTCTGATTTAAGTGTGGGTAATGAAAATTACCATAGTTATCAGAGAGACCCTAACGATTCAGCCTTCGGGAAGAGTAATTGTATATCATAATATTGCAATATTTATTGATTACAAGTTCATTCCATTATATAGAAGCTTCAGTTGAAAGGTCATGAATTATCGATCAGTACTTTAGAGAGAGTGCAATAGGTTAAGCAATCTTTTTTGATATCCGTGAGTGTTCAGGCCAGCTTACGCGCACCTCGACTAATCTCACAGGACAACCTGCCTGACCCTACAACAATTGGGTGTCAAAGAAATCCATAGAATATTAAGCAAATATCATTGGGTCAAGAGATTTGAAACTTCAATTTTATTGGTGACCGTTCTCAAGAATTGATAGTCTCTATTTGATATATATATATATATATATATATATATATATATATTTATTTATTTATTTAAATTTTTTATTTATGATAAATAACTAACTTTCATTGAGAAAAATTGAAAGAATGCACGGGAATACAAAAAAGGGAAGCCCATGAAAGAGAGGAGGCTACCTAAAAGAAAGCCCTCCAATCAAGCGAAATATAGCTAAGAGAATAATTACAAAAAAGTTTTGTAATTGAAGCCCGCAAAAAAACATGGAATCTAGCAAGGGACCAAACTTGATGGTCTCTACTTGATGTGATGCAAGTTCATCATCAAATGCCAAAATGCTGTTTAAGACATGCAAATTCATACCGTCGGTTTAGTTTCATTTATCGTCGTAATAATTATGCATGTGAAAGTTGAATATTTAGTGGGAAGATATGACTTCAAACAGGTTGAATATAGTGGAAAACTTCATTGTATATGGAAGATCATTTGCATTTAATGTCATTGAGAAAATGTGTGTAACACACATTGAGCATATTTCATAAACATGCGGCCTGCTAGATATTAGTTCAGGTTTAAATTACTATAATCAAGTACAATATAGTATTTGTTAAAAGATATACGTTTCTATATGACAAGGAATTCAAGTAGTGCATATCTTTATACATAAATGTTTTCCAATGGAGATAACCTTGCTTTCTTAATGTGGTTCTGGCCCTGCTTATGCTCTTCTGAATTCTCAGTTTTCTCTAATTATTAATCTTTTTGCACCCAATGCAGATCCACGGTATTTCTTATGAAGTTGCTCTTGAATTTAGTTCTTCCTCTTGGAACCTGCTCCTTCGATATTTTGGAGATGTAATCCAAATCCTACTTGGAAAGCTTAATATTCCAGGAAATTATGCTCTAATCAGGCCTGTCTTGGAATCTTTGGAGATTGTAAGGTTAGCTTCCCTTTTCTCCTCCGTTAGCTTGGGGAGATACTACTCTTTCTTGCTGCTACTTCTGCACACTGGGTCAGTATTTTTGTCATTTGGACCTAAAGTGATGTTTGCTCATCTGAG
TGAATTTGTTGAATCTGATGAGATATGTATAGTAATAGTAGCTAGGTTGGTTACTCAAAGTATCCGCAGGTTTTTGTTATAGAGAGAGAGTTACTAAATCTGAATAAAGGGAAAAACTAAATTTCATCCTGGTTTTTTGATAATCTTTTTGGTGAATGTGGAAGGAGAAACAGTGTAATTTTTGAATGTAGATGGGAAGCCTTAGCTGCTATTTGGTTGAATCATGAATGTACACTTCATTAGCTTATCTTGGGATTTCTTGTACTAAAGAGGTTAAAATGCCCTAGGCAACATCTTACTTAGTTGGAAACCCTTTCTTTAGTGGGTTTTTGTGGGCTTGGTTTTTTGTATGCCCTTGTATTCTTTCATTTTTTCTTAATGAAAGCAATTGTTTGTATAAAAGAAAAAAGAGAGGTTAAAAGTTGTTGTTTGGTCATGATTTGATGTCTTGCATCTCGTACCTTGTTTTCCCTTTTATGGACATGATCAACTGTCATCTTTTTTTGCAGACACGTTGTCTGTTTACAACAGCGCAAATTTTTACCAGCAGAAGATATTCAGCTCTCAAAGTTCTTGCTTTCTGTGATTGCTGGCTCTCAATCGGCAATATTCCCCTCGTCAAATTCAATCATTAGACATGGTTGTACTGCTGAAGTTGTGAAAAGTGTGCCCAAATGTAATAGTTTATGGGATGTTCAGGCTGTAGCCTTTGATCTACTTAGTCAGACCGTCACAAGTCTGGGGTCATATTTTCCAGTCGATGTTTGGAAGTCAACAATTCAGGTAGAAACAATTCTGTTACATTTGTGGTTTTTCATATATGCATTTGTAGGAAAATCTGGATATATTCAATTTGAATGCTTTGCAAGCAATATATAGGCAGTTAGCCAATAAGGAAATAGTGTATTATTATTATTTGGATGAATGATCAAAACTTTCATTTGAGGAAAAAAAAAAAAAGGAAAAAGAATACAGGGCACACAAAAATGACAACCCACAAAAAAGGAGGTAACTAAAAAAAAGCCTCCAGTCAAGTAAAATAAGACCTAAAGGATAATTACAAAAGGACTTCGTTACTGCGGTCTACCAAATAACACAGAACTGAATGAGGGACCAAACATCATGCAAAATTTTCTCAAGCCTCTAAAAATTCTACCATTTCTTTGTCTCCAAAGACCCTCCAAAACAACACATCCTGATTCGTCATGTAAATCTCCTTTTCTTGCAAAAGGGCTGATGGGAACTTCTCAATCATTTTCTTGCATCCCCTATGCGTAGCAGGGCATAAACCAAACTCCTCGAAGAAACAACTCCAAACTACACATGTTAAATCACAACTCCAAAGAATATGATCAACGGCCTTACTTGAACAAGATCTGTTCTATAAGTTGTACAACTGTGTTCTTGAGTTCTAAGAAAACTCCAAAGAATATGATCAAGATCCTATGCGGCCTTCCTTTAAAGAATGCAACACCACAACCTAATCAAGTTGGACAACTTCCTCTATTTTGAATTCAAGATGTTAGTCTTATAATTGGAGATCTTGTATTTTGGAGCATTAGTCTCTTTCCATTATATCAATGAAATTATTATTTCCATTTTGAAAAAAAAAGAATTCAAGATGTTAACTCATTTGTGGATGACTTGTCGGCACAAAAATTTTTTAACATTAATTTTCACCTCCCGTGGTGGGACAAAAAAGCCACAATATCCATTATTTTCCTATTGGACAAAGGACAACGGAAACTAACAAAGGAGAGGGACACAGAATAAATTAGCACAATTGCCACTGAATAATTCTTCATAGAAGAGAGATGATATTAGTGAGGTACAGAGAGTAGAGGGATCTATCAACCTACCAATACCACGTATCCTTCCAGAAATAAGTATCCCTCTCATCACCCACCGAGCAACTAACAAACTAAGAAAAGAATGGATCCTCCACAAAGTTACCTTTCCAAGGGTTTCTGGAAGTGCCTTTGATCCCACTTGAGATCCAA
TCAAAAGGGTTAGTCCATGCTTGGCTCTAGGTGTCAAATTCACGGGGAAAATATTATATCCATTTAGCCAACAAGGCCTTGTTCCAGGCCCTTAAGTTACCTATATCTACCCTTCAAGATCCATCGGTTTTTTCACCATCTCCCACCTAACTAAATGTGAACCTTTCCCTTCCTCAACCCCTTCCCATAAGATGCCCTTTGTCAACCTCTCCAAATTTTTACTAACCAAACTACGGACCCTAAACAGGGACATAGAATAGGTCGAGATGCCACTCAAAATTGATTGAATGAGAAAATAGTGTAGTATCTATGCACTAGGAATAGTTAAGGAGCAAACTATGGAAATATTTACCTACTAATACATGAATGGGAAAATACGAAGGTATATGTAATTTGCAACAGATGTATATGTTGGGTAGTTAAGCACCTTGGAGAACCACTCCAGAAGCCAACTATTGAGGTGGGAGTGCCAAGCCACCTAAGTACCACATTGGTCATCCCATTCTAATTGATGTGAGACAAAGGTAGCCTATACTACCTTGGTTCCTAACAATACCCCATCCCCAGAAGCCAACGTCCCAGTGGCTACTCCGACGGAACTTCACTTGGCCACACCCGATCAGAACCCTCTGCCGGTTCCACTTTGGAACGTCAACAACCGTCTCTGATACTATTGTTAGGTACTTAAGCACCTTGGCCACACCCCAAAAGCCAACTATTGAGGTGGGAGAGCCAAGCCACTTAAGTACCACATTGGTCATCCCATTCTAACTGATGTGGGACAAAGGTAGCCCATACTACCTTGGTTCCTAATGGTATGATGTTTCCATTTGATGTCTCTAAGTCTAATAAGATGAGGAAGTGATCAAATGTAGCCTTTCTGATCTCTTTAGATTAGCTTTCCTATATCTGGCTAGGTAAGTGGGGGTATAGGGAATATATCAAGAAGAGTGAGTGAGAGTTATGAGCCTGTTGACAACAAAGAGTTGACATAATCATAAAGGCAATTGAGAAAATCACATCCTAAGGACCAATTAAAATTCATCATCACTTATAAAGTCTCCAATGCACTTATGCTCCATTGGATTTTTTCCTTTTGGGATAACAAATTGAACTTTCATTGTTAAAAGAGAGGAGTACGAAGGGTAATGATGAAATAAGGCATCAACTCAATGCAAGGACAACAAAAATGATAAACGAAGAATGCTTTTGATTAACATCATGAAAAACAATAATTACAATTTTTTAGATAGAAGGGAGGCCTTGAATTTTTTTTGAAACAAGAAACAACTTCTCATTGATGTAATGAAAAAGGATAAAAGTTTAGAAGATACAAACTCTCAGAAGGTTACGAACAACAAATACAACTAAATAGAGATAAAACTTTCAAACAACATTGACCAACAAGCCTCCTTCAGCTACTGAAAAACAAAGACTACTACAAAACTACCAAAGAAACTCAAGAAAAAGACCACCAACCAACAAAAGACCAACAAAGAGAAAACTAAAAACTCTTCAATGATGCACAAACTAAAAATTTTGATGAAGGAAGCTTTTGATGAACCAATGCCAAGAAACTTGAAGAGAAACTCTCTCCTCATGCTCTAAAACCTTCCAACAAGATAACGAAACAGAGCCCAACAAAGGAATGCCAATCCCAATCCTCAATCCCTTCCGAGCTAATCCTCATTGCCCATCCTACAATACTCTAAATCTTTTGTCTAAAAAGATGGCTGCAATTGTTCTGTTTTTTCTTGTTAGTGTGGTTGGTTTGAGCTTGTAAGTTCTTTTTCATGTTAGGGTTTTAGGGGATGTGTTGTTCTTTTCTTCAGGTAGATGTTAATTTTTGTTGCTTACATCTTTCTTCCATCTTCTTGTATTATTCTGAAAGGAGTATTCAAAAGCATGGACGTAGACATGAACGTGGATACATCACATGGACAAGGCCACACATTGATTTCTAGAAATGTAGCACATGAATACGTCGGAGATACATTTTT
TTTCTTTCAAAAACGTGGATGCATACTAACAAATGTGATATTCCAAAATAGATAAGACACATAAATTTAACATGAAAAGTTCAAATGTGCTTGTAAAAATTAGTAGCATGATCCTCCCCATATAATTTCTTTTTGTCTTTCAAATGAGTTTTTAAGGGCAGGTTAAATTTTAGGGCTGATCATAATAAGTAACTGGTTGGGCAGTGGACATGTGGTAATGAATGGGGTTAATTGTTTTTTCCTTTTAAATTCTCAATCAAAATTTTTAGGTTGATGTGTTCGTGTGTGTCCTTGATGTGTCCGAAACTGAGAAAAATTAATATAAAATAGGATCCTCTGCAGACATCTGTTTGACAGACACTTGGTCAAGAAGAGGATGTTTGTGCTTCCTAGACGAGTAATATACTCTGCAGCTTTTCACTACCTTAATGAAAAGTTCTGTTTTCTAAAAGAGAGGGCGCGATTAGATTAGCCAAGTGTTCAAAGTTGAATTCTGGAAATATCAAGAGATATTGACTGGTGGAATATGGGTTGTTGGTTATGTTTGTGCGAATGCTTGCAGTTGCTGTCATAGTTGTCAGAACACTGGACAGTTTTGCAATAAATGTAAAATATTTTCAGCTGGCATGATAACCTTCAACACTGCAGATAACTTGTTCAGCTTGATTCTTTTGTTCATAGGTTATTCGAAAATTGATGGATTTCTTGGCATCTACTAGCTTACTTGTTGAAGACAAGGTGATGTCCAGGTATACAGCTTCAACAATATTTTATTATCGTACAGTGTTCATTATTTTTAGAGAGATATGTTACTTTTGATTTACCAGTGAATTATTGTGCCAGCCATCCTATTGGGTTTAGTTTAGTTGCTGTATTTTGCCTACTGTTTTTTTTATTTGATTGTTACTAGTCTCGATGTTTATATAAGAAGTTTCAGATTATATCTTATTGTAAGATGGATTATTGAGGCTTAATGAAAATTTCCCTAAACAACGGGATAATTTTCATCATTCTAAATATTTCTATGCAACGGGATAATTTTTGTGTGTTTAGAGGATTTCTATTTGCAGAGATTTGATGTGAAGGCATAACTTCGTCATAGTAATTTTGAATATCTACTGAATTCTAGTGTTTGAACCATTATAATGGTTGTTTCCTTTACTAATGAAAATTTCCCTAAGCCATCTTGTGCTTCTGGCTAGAGGAAAGCTTCCTGGTGTAGATCAAGTGAATGAGTTACCTGATCTTAGGCTTGCTGTCCTAATTTATCTCAACAAGCTTCCTCCATTTTTGTGGAGTCTTAGCCAAGGAAGGGACTATCAATTTTCTTGTTGGTATGAAAAACTGGTAGCAATATTCAGCTCAATCTTGTAGACAGCTTTTGTACAAGTCATTAATGGCTGCTGCTTCTAATTAATTGAGAAAAAAAAGGGCGAAATCCAACGTGGGAGTCGTAACCTTTTTACACTATAGTCTTGTTAGCTATGGGAGTAGCCTTGGTTTATAAGAAATTACCATTCAAAACTAACTATCCTAACTGATCCTTACAAGTAACGAATCTATTTTTATTTGGCTTGGGTTTTGGGTTAAAGATCAAAGAAGGATATCTTTCAACTGCCATGTGGAGTAAAGTTGAAGAATAAAGCTAAAATTTTGTGGCTTGTTGCTGTATAAACTTCTTTTAGATATTTGTTTTGAAAGGAACTAAAGAATCTTTGAAGGAAGATCTTTAGATTGGTATGAAAGATGAGATTTTGCAAGGGATGATGCTAGGTGTCTCCTGTTCCTTTCTAATTGTTTCTATGTACCATTCTAGTTGGGTTGGTGTTTGTTCATCTGTTTGTTTTTTTCCTGGTAATTTTTTTTGATAAGAAACAAATTTCATTGATGTATGAAATTTACAAAAAGAGGCCGGAGTCTGAACCAAAGGAGTTGTATAAGAAACTTCCAATTTGGTCAAAAGAGAAGTATGATTGTATGAACTAAAGAAAGATGTACAT
TTGCACAAAGTCATAGCATGATAGATAATGAGTTCAAAATAGCAATCAAAGGAGCTTCCCTTATCATGGTAATCATGTGGGTAGTTTTGTTTGTAATTCTTGCTGTATTTTTGGAGAGGGGGAATGTTGATCTTTGGTGGGTGTTTCTTTTTGTACTCTTAAGGGATTCATATTTTGTTTTCTTATCATTTTATGATTTCTCATTTATCAATTAAAATAAGAAACTAATCACGAGTACATAATGAAGTCCCTAAGGCTTACATCATTTACCCAGGCGACAGCCATTTCATTGGCTTTATTAGAGATTATGTGTTTTGCATTCTGTTTTACTTTAACGTTATCCAAGAGTTTCAAATTGCATGAAATGAGCATTTGCTTACAGCTTACCCCATTATCCACCTTTCCTAGTGGGAAATTGTAAGTCCTTTTCTTGGCCCATTTTCAAATTAGTGAAACAAGGCTTCATTTCCCTAAACTCCTTTTTGTATTTGGGTGGGGGGTTGGTGTTTTTTTTTTTTAATCCTAAACTTAGTTTCTTAAAAAAAAAAAAAAAAAAAAGAGTAGTCATCCTTTTCATGCCTTGGTGTTTTCTGTTCTGCAGGTACTATCTGTCTCTTCTGCGATGTCTTCATTTGGTTATAGCAGAACCCAAATGCTCCCTTTCTGACCATGTAAGGAATAAAGCTAGACTTTGAGACCTAAATTGAGGATGCTTACTGACTGAAGCATCTTTTTTTACATTATTTTTCTTTCAGGTGTCAGCTTTTGTAGCAGCATTGCGCATGTTCTTTGCCTATGGATTTTCTAATAGACCCCTGCTTGCTTGTTCAATTGGTAATCAAGGGAAAGAACCTAGTTTGACTAGTACCAAGTCCAGTTTGGAGGAACCGAAAAAGGAAAATCATAGTGCATATAGGCCCCCACACATGCGTAGAAGAGAAAATTTAAATAAGAAGCAGGCCAGTGTTCAAAATTCTCAAAGTTCAATGGCTGCAGAGTCTCTTAACTGTGATTTGATATCTTCAGATTCCGATCATGACAGTGATGGGCCAGGCAGAGATGCTGACATCATTCAGAATGGCAAGGTTCGGGTTGCTGCTATTCTTTGTATACAGGTAACTCAGAAAGAGGTGACCCATTTGATCTAGTTCTGTGTTGTTTTCACTTATCTAGGAAAAATCCATTGGTTCAGTGATCCATGTCCATAACCTTTCTTCCTCCAGCTAATCAGATATTTTCCATGATTAGCAATTTTTCTCTTGCAGTGAGTTTGCAAAATGGTGTGAACTTGGTACAGATATGGTTCTATGAATGAACATCTTTCCTCAACTTAGACTGTAGGAAAATTTTCTAATCTCCTAGGCAGAGTAATCTCTCGATAGGCCGGTTTAGGATCCTTTTAGCAATATATATTTTATCTCTTTAGTTCAAAAACTCCATTGACTCTAGAATCTACATCATTTGTCAACAATGGTAAATCCTCTCATTTTTGGTTGGATTGGTGGATTGGTAATGGACCTGTTTTTCTTAGGCTTTATGATTCTTTTTCTATGTGAAACATGCCACCATTTTTAGACTATGGGGACCACTTTAATTTTCCATGTTTCTTGATTTATGAGAAGATTCTTGGATCTTGAAATATTGGGAAGGGTCTCTCCTCTTGGTCTTATCCCCATAATGACTCTCTCTAGGACCCTGGATGGGTGTGGATGGATTGCCGATCTAACCTTCTTCTTCTTCTTCCTTTATTTTATTTTTCCCGCCAACTTTCTTTCATTTAGAAAAAATTTAGGAAAAAAATAAGAGGCTGTACAATGACATACAAAAAATGAACAAAAAAAAAAAAAGAGAGTTCAAACACGAGCCAAAGAAACAGACTCCAATCCAACAAGATCATATCAAGGTCATAATTACAAAAAGGCCGAGTAATAAACATTCATAAGGACACGATAAACTTAACGACCTCCCAAACCTCTTTCCAAGATCTCTCGACCTCCCTT
AAAATTATGCATTTATGCTATTTTTCTCAAGCCAAAATTTCCACAACAACAAAGGAACAAGACCACCACTAAACTCTCCCTTTTATCTCTAAACGGTGGATCCAAGAGCACCTCCTCAACCATGGTGCTGCACTTTCTACCACACGCCAAACATAAGCCAAATGTACTCAAGCATTTTCTCCATAGGGAATGAGCAAATCAGGGGACTGAATTCGGAGATAGAGGAAATGTTTGAAAGAAAGACCAACCTTATGAGGAGTGGAGAGGTTAAGAACATGAGAAGACATATGATTTGTCAAGTATGTGACAGTGAGAGTTACCTCTCCCCAATAAAACTGAACATTAGTAGAAAATATTATTCGTTGAGTGACTTCAAGAAGATGGTGATTCTTGTGTTTAGCCACTCCATTTTGTTGTGAAGTGTCCACACAAGAGCTAAGATGAACAATTCTTTGGTACGATAGGTAAGCACCACTGGACGAACTAAAAACTTCTTTCACATTGTTTGTTTTGCCGAAAAGAGGCGACACTGGTGAGAAATGATGATTGGCAGCTAGGATTCCGGTAGTTAAGGTTGATGGCTAGAGTTTAGCTTTGATGCCAAGGAGTTTTAGGAAAAAGAAAACTCTTGCATTATTTTTTCTACTAAAGATTACACCTTCTATATCATAAATACGAGACTAAAAATAAGTGATAAACTACCCTAATTTACCCTAGCCTTTTGGAAACAAAGAATTTATTAAAAACTAAAAAGCATAACAAATCCTAAACTTTTGGGAACAAATAATGTATTGAAAAGCATAATAAATTTAACTTTTCTACTGTGTCATTTTCATTAATGCCCTATCTTTGACACTCAAATTCAACACTAGATGACATCAAATTTCTGCTGCCGCATTACTTATGTTGTCGCTTCTCAAATCCTGTTCTGCTTTCCCTTGCTTGTTATATGTAATTGAGTGAAACATTGACATATTTGGAGTTCAAATATATTTTTCCTGATTAAAAAAATCAGTTGCTGAAAAGTTCATAGTTCTAGAATGTTGGAAAATTATCAGCGGTCTACTCATGTTAACCATACATGGAATTATTAGTTGTTGAGGTTTTCCCTATTTCATTCATTCATTGAAATGTTTCTTTATCCAAAAAAACCATACATGGAATTATTCTAGTTTTTTTAAGAACTTAAAAGTTTTTATGTTGTCTTACTTTTGGTTGATAGGATCTTTGCCAAGCTGACCCCAAAGCATTCACCAGCCAATGGACACTTCTTTTGCCAACTCGGGATGTTCTGCTGCCAAGGTTGGTCTTATTTTATCTTCTCTGTTCATTTATTGTGATCTGCATACTTATTTCGTCTGTGTCCTTAATTATGTTCTTGTAGGAAATTTGATGCAACTTTGATGACATGTCTTCTATTTGATCCTTCTCTAAAGGTACATATACTGTTTCCTTCAATTGTTTCACCAACAATTTGATGTTTTGACTGCAAATAGGCTGAATGTGTTGTTCATTCTCAATTTTTTGTTTTGTATCTGTTTTTATTATTTTATTCATTCAATTTTTTGTTTTGGCTTGCCCAGGCCCAGATAGCATCTGCTGCAGCCCTGGTGGTTATGTTGGATAGGACTACTTCCATTTCCTTGCAGATTGCAGAATACAGAGATCCAGCTAAATGTGGATCCTTTATGCCTCTTTCGATTTCCCTTGGGCAGATACTAATGCAACTCCATATAGGTACTTGA
mRNA sequence
ATGGCGACGTCGTCATCGTCCTCAGCATCTTCAGTAAGGTCGTGGAGGACGGCGTTTCTGACTCTGAGAGACGAATCGATCTCTTCTTCAACTTCAATTTCCCAACTTCTCTACGACACCATCTTCTCGCACTCCGATTCCTTAATCGCTGCCGCGCGCTACCTTCCTCCACCAGAAGTTTCATCAGATCTGCTGTTTCTCTTAGAAGTGGCTACTTCTGCTTCCGACTCCGTGCAAGACATTGTGCTTATCTTCGCAGATATCATACATCTGATCCACGGTATTTCTTATGAAGTTGCTCTTGAATTTAGTTCTTCCTCTTGGAACCTGCTCCTTCGATATTTTGGAGATGTAATCCAAATCCTACTTGGAAAGCTTAATATTCCAGGAAATTATGCTCTAATCAGGCCTGTCTTGGAATCTTTGGAGATTGTAAGGTTAGCTTCCCTTTTCTCCTCCGTTAGCTTGGGGAGATACTACTCTTTCTTGCTGCTACTTCTGCACACTGGGTCAGTATTTTTGTCATTTGGACCTAAAGTGATGTTTGCTCATCTGAGTGAATTTGTTGAATCTGATGAGATATGTATAGTAATAGTAGCTAGACACGTTGTCTGTTTACAACAGCGCAAATTTTTACCAGCAGAAGATATTCAGCTCTCAAAGTTCTTGCTTTCTGTGATTGCTGGCTCTCAATCGGCAATATTCCCCTCGTCAAATTCAATCATTAGACATGGTTGTACTGCTGAAGTTGTGAAAAGTGTGCCCAAATGTAATAGTTTATGGGATGTTCAGGCTGTAGCCTTTGATCTACTTAGTCAGACCGTCACAAGTCTGGGGTCATATTTTCCAGTCGATGTTTGGAAGTCAACAATTCAGGTTATTCGAAAATTGATGGATTTCTTGGCATCTACTAGCTTACTTGTTGAAGACAAGGTGATGTCCAGGTACTATCTGTCTCTTCTGCGATGTCTTCATTTGGTTATAGCAGAACCCAAATGCTCCCTTTCTGACCATGTGTCAGCTTTTGTAGCAGCATTGCGCATGTTCTTTGCCTATGGATTTTCTAATAGACCCCTGCTTGCTTGTTCAATTGGTAATCAAGGGAAAGAACCTAGTTTGACTAGTACCAAGTCCAGTTTGGAGGAACCGAAAAAGGAAAATCATAGTGCATATAGGCCCCCACACATGCGTAGAAGAGAAAATTTAAATAAGAAGCAGGCCAGTGTTCAAAATTCTCAAAGTTCAATGGCTGCAGAGTCTCTTAACTGTGATTTGATATCTTCAGATTCCGATCATGACAGTGATGGGCCAGGCAGAGATGCTGACATCATTCAGAATGGCAAGGTTCGGGTTGCTGCTATTCTTTGTATACAGGATCTTTGCCAAGCTGACCCCAAAGCATTCACCAGCCAATGGACACTTCTTTTGCCAACTCGGGATGTTCTGCTGCCAAGGAAATTTGATGCAACTTTGATGACATGTCTTCTATTTGATCCTTCTCTAAAGGCCCAGATAGCATCTGCTGCAGCCCTGGTGGTTATGTTGGATAGGACTACTTCCATTTCCTTGCAGATTGCAGAATACAGAGATCCAGCTAAATGTGGATCCTTTATGCCTCTTTCGATTTCCCTTGGGCAGATACTAATGCAACTCCATATAGGTACTTGA
Coding sequence (CDS)
ATGGCGACGTCGTCATCGTCCTCAGCATCTTCAGTAAGGTCGTGGAGGACGGCGTTTCTGACTCTGAGAGACGAATCGATCTCTTCTTCAACTTCAATTTCCCAACTTCTCTACGACACCATCTTCTCGCACTCCGATTCCTTAATCGCTGCCGCGCGCTACCTTCCTCCACCAGAAGTTTCATCAGATCTGCTGTTTCTCTTAGAAGTGGCTACTTCTGCTTCCGACTCCGTGCAAGACATTGTGCTTATCTTCGCAGATATCATACATCTGATCCACGGTATTTCTTATGAAGTTGCTCTTGAATTTAGTTCTTCCTCTTGGAACCTGCTCCTTCGATATTTTGGAGATGTAATCCAAATCCTACTTGGAAAGCTTAATATTCCAGGAAATTATGCTCTAATCAGGCCTGTCTTGGAATCTTTGGAGATTGTAAGGTTAGCTTCCCTTTTCTCCTCCGTTAGCTTGGGGAGATACTACTCTTTCTTGCTGCTACTTCTGCACACTGGGTCAGTATTTTTGTCATTTGGACCTAAAGTGATGTTTGCTCATCTGAGTGAATTTGTTGAATCTGATGAGATATGTATAGTAATAGTAGCTAGACACGTTGTCTGTTTACAACAGCGCAAATTTTTACCAGCAGAAGATATTCAGCTCTCAAAGTTCTTGCTTTCTGTGATTGCTGGCTCTCAATCGGCAATATTCCCCTCGTCAAATTCAATCATTAGACATGGTTGTACTGCTGAAGTTGTGAAAAGTGTGCCCAAATGTAATAGTTTATGGGATGTTCAGGCTGTAGCCTTTGATCTACTTAGTCAGACCGTCACAAGTCTGGGGTCATATTTTCCAGTCGATGTTTGGAAGTCAACAATTCAGGTTATTCGAAAATTGATGGATTTCTTGGCATCTACTAGCTTACTTGTTGAAGACAAGGTGATGTCCAGGTACTATCTGTCTCTTCTGCGATGTCTTCATTTGGTTATAGCAGAACCCAAATGCTCCCTTTCTGACCATGTGTCAGCTTTTGTAGCAGCATTGCGCATGTTCTTTGCCTATGGATTTTCTAATAGACCCCTGCTTGCTTGTTCAATTGGTAATCAAGGGAAAGAACCTAGTTTGACTAGTACCAAGTCCAGTTTGGAGGAACCGAAAAAGGAAAATCATAGTGCATATAGGCCCCCACACATGCGTAGAAGAGAAAATTTAAATAAGAAGCAGGCCAGTGTTCAAAATTCTCAAAGTTCAATGGCTGCAGAGTCTCTTAACTGTGATTTGATATCTTCAGATTCCGATCATGACAGTGATGGGCCAGGCAGAGATGCTGACATCATTCAGAATGGCAAGGTTCGGGTTGCTGCTATTCTTTGTATACAGGATCTTTGCCAAGCTGACCCCAAAGCATTCACCAGCCAATGGACACTTCTTTTGCCAACTCGGGATGTTCTGCTGCCAAGGAAATTTGATGCAACTTTGATGACATGTCTTCTATTTGATCCTTCTCTAAAGGCCCAGATAGCATCTGCTGCAGCCCTGGTGGTTATGTTGGATAGGACTACTTCCATTTCCTTGCAGATTGCAGAATACAGAGATCCAGCTAAATGTGGATCCTTTATGCCTCTTTCGATTTCCCTTGGGCAGATACTAATGCAACTCCATATAGGTACTTGA
Protein sequence
MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEVSSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKVMFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNSIIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDFLASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLLACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAESLNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRDVLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPLSISLGQILMQLHIGT
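The protein sequence above corresponds to the CDS read codon by codon. As a minimal sketch, assuming the standard genetic code (the page does not state which translation table was used), the Python below translates the first and last few codons of the CDS. The names `translate`, `cds_start`, and `cds_end` are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 11 codons and last 7 codons of the CDS listed above.
cds_start = "ATGGCGACGTCGTCATCGTCCTCAGCATCTTCA"
cds_end = "CAACTCCATATAGGTACTTGA"

print(translate(cds_start))  # MATSSSSSASS
print(translate(cds_end))    # QLHIGT
```

Both fragments agree with the start (MATSSSSSASS...) and end (...QLHIGT) of the protein sequence, with the final TGA acting as the stop codon.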
Homology
BLAST of HG10008978 vs. NCBI nr
Match:
XP_038875589.1 (HEAT repeat-containing protein 6 isoform X2 [Benincasa hispida])
HSP 1 Score: 882.9 bits (2280), Expect = 1.5e-252
Identity = 477/554 (86.10%), Positives = 489/554 (88.27%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLEVATSASDSVQDIV +FADIIHLIHGIS++VALEFSSSSWNLL+RYFGDVIQ
Sbjct: 61 SSDLLFLLEVATSASDSVQDIVPVFADIIHLIHGISHQVALEFSSSSWNLLIRYFGDVIQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLNIPGNYALIRPVLESLEIV
Sbjct: 121 ILLGKLNIPGNYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHV+CLQQRKFLPAEDIQLSKFLLSVI GSQSA+FPSSNS
Sbjct: 181 --------------------RHVICLQQRKFLPAEDIQLSKFLLSVITGSQSAVFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LASTSLLVED VMSRYYLSLLRCLHLVIAEPK SLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTSLLVEDNVMSRYYLSLLRCLHLVIAEPKGSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKS LEEPKKENH+AYRPPHMRRRENLNKKQA+ QN QSSMAAES
Sbjct: 361 ACSVGNQGKEPSLTSTKSGLEEPKKENHNAYRPPHMRRRENLNKKQANAQNLQSSMAAES 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRK+DATLMTCLLFDPSLKAQIA+AAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKYDATLMTCLLFDPSLKAQIAAAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 499
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 499
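The Identity and Positives percentages in each HSP header are the matching-column counts divided by the alignment length. A small sketch of that arithmetic, using the counts from the hit above (the helper name `percent` is illustrative):

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage string with two decimals."""
    return f"{100 * count / aligned_length:.2f}"


# Counts taken from the HSP header of the first hit.
print(percent(477, 554))  # 86.10  (identities)
print(percent(489, 554))  # 88.27  (positives)
```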
BLAST of HG10008978 vs. NCBI nr
Match:
XP_038875588.1 (HEAT repeat-containing protein 6 isoform X1 [Benincasa hispida])
HSP 1 Score: 882.9 bits (2280), Expect = 1.5e-252
Identity = 477/554 (86.10%), Positives = 489/554 (88.27%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLEVATSASDSVQDIV +FADIIHLIHGIS++VALEFSSSSWNLL+RYFGDVIQ
Sbjct: 61 SSDLLFLLEVATSASDSVQDIVPVFADIIHLIHGISHQVALEFSSSSWNLLIRYFGDVIQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLNIPGNYALIRPVLESLEIV
Sbjct: 121 ILLGKLNIPGNYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHV+CLQQRKFLPAEDIQLSKFLLSVI GSQSA+FPSSNS
Sbjct: 181 --------------------RHVICLQQRKFLPAEDIQLSKFLLSVITGSQSAVFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LASTSLLVED VMSRYYLSLLRCLHLVIAEPK SLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTSLLVEDNVMSRYYLSLLRCLHLVIAEPKGSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKS LEEPKKENH+AYRPPHMRRRENLNKKQA+ QN QSSMAAES
Sbjct: 361 ACSVGNQGKEPSLTSTKSGLEEPKKENHNAYRPPHMRRRENLNKKQANAQNLQSSMAAES 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRK+DATLMTCLLFDPSLKAQIA+AAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKYDATLMTCLLFDPSLKAQIAAAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 499
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 499
BLAST of HG10008978 vs. NCBI nr
Match:
XP_004145966.1 (uncharacterized protein LOC101212003 isoform X1 [Cucumis sativus] >KGN49951.1 hypothetical protein Csa_000577 [Cucumis sativus])
HSP 1 Score: 862.8 bits (2228), Expect = 1.6e-246
Identity = 468/554 (84.48%), Positives = 482/554 (87.00%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DSVQDI LIFADIIHLIHGISY+V+LEFSSSSWNLLLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSVQDIALIFADIIHLIHGISYQVSLEFSSSSWNLLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIA SQSAI P SNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIADSQSAILPLSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLEEPKK+N+S YRPPHMRRRENL KKQASVQN+QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEEPKKDNYSPYRPPHMRRRENLTKKQASVQNAQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 499
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 499
BLAST of HG10008978 vs. NCBI nr
Match:
XP_031741422.1 (uncharacterized protein LOC101212003 isoform X2 [Cucumis sativus])
HSP 1 Score: 862.8 bits (2228), Expect = 1.6e-246
Identity = 468/554 (84.48%), Positives = 482/554 (87.00%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DSVQDI LIFADIIHLIHGISY+V+LEFSSSSWNLLLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSVQDIALIFADIIHLIHGISYQVSLEFSSSSWNLLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIA SQSAI P SNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIADSQSAILPLSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLEEPKK+N+S YRPPHMRRRENL KKQASVQN+QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEEPKKDNYSPYRPPHMRRRENLTKKQASVQNAQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 499
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 499
BLAST of HG10008978 vs. NCBI nr
Match:
XP_008437486.1 (PREDICTED: HEAT repeat-containing protein 6 isoform X3 [Cucumis melo])
HSP 1 Score: 851.7 bits (2199), Expect = 3.6e-243
Identity = 464/554 (83.75%), Positives = 478/554 (86.28%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDES SSSTSISQLLY+TIF HSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESTSSSTSISQLLYNTIFPHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DS QDI L FADIIHLIHGISY+V+LEFSSSSWN LLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSAQDIALTFADIIHLIHGISYQVSLEFSSSSWNPLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAE VKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAE-VKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLE+PKKEN+S YRPPHMRRRENL KKQASVQN QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEDPKKENYSPYRPPHMRRRENLTKKQASVQNPQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 498
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 498
BLAST of HG10008978 vs. ExPASy TrEMBL
Match:
A0A0A0KQH7 (DUF4042 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G146250 PE=4 SV=1)
HSP 1 Score: 862.8 bits (2228), Expect = 7.6e-247
Identity = 468/554 (84.48%), Positives = 482/554 (87.00%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DSVQDI LIFADIIHLIHGISY+V+LEFSSSSWNLLLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSVQDIALIFADIIHLIHGISYQVSLEFSSSSWNLLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIA SQSAI P SNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIADSQSAILPLSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLEEPKK+N+S YRPPHMRRRENL KKQASVQN+QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEEPKKDNYSPYRPPHMRRRENLTKKQASVQNAQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 499
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 499
BLAST of HG10008978 vs. ExPASy TrEMBL
Match:
A0A1S3AU95 (HEAT repeat-containing protein 6 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103482890 PE=4 SV=1)
HSP 1 Score: 851.7 bits (2199), Expect = 1.8e-243
Identity = 464/554 (83.75%), Positives = 478/554 (86.28%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDES SSSTSISQLLY+TIF HSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESTSSSTSISQLLYNTIFPHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DS QDI L FADIIHLIHGISY+V+LEFSSSSWN LLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSAQDIALTFADIIHLIHGISYQVSLEFSSSSWNPLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAE VKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAE-VKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLE+PKKEN+S YRPPHMRRRENL KKQASVQN QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEDPKKENYSPYRPPHMRRRENLTKKQASVQNPQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 498
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 498
BLAST of HG10008978 vs. ExPASy TrEMBL
Match:
A0A1S3AUP9 (HEAT repeat-containing protein 6 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103482890 PE=4 SV=1)
HSP 1 Score: 851.7 bits (2199), Expect = 1.8e-243
Identity = 464/554 (83.75%), Positives = 478/554 (86.28%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDES SSSTSISQLLY+TIF HSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESTSSSTSISQLLYNTIFPHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DS QDI L FADIIHLIHGISY+V+LEFSSSSWN LLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSAQDIALTFADIIHLIHGISYQVSLEFSSSSWNPLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAE VKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAE-VKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLE+PKKEN+S YRPPHMRRRENL KKQASVQN QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEDPKKENYSPYRPPHMRRRENLTKKQASVQNPQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 498
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 498
BLAST of HG10008978 vs. ExPASy TrEMBL
Match:
A0A1S3AUR0 (HEAT repeat-containing protein 6 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482890 PE=4 SV=1)
HSP 1 Score: 851.7 bits (2199), Expect = 1.8e-243
Identity = 464/554 (83.75%), Positives = 478/554 (86.28%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDES SSSTSISQLLY+TIF HSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESTSSSTSISQLLYNTIFPHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DS QDI L FADIIHLIHGISY+V+LEFSSSSWN LLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSAQDIALTFADIIHLIHGISYQVSLEFSSSSWNPLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
IIRHGCTAE VKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIRHGCTAE-VKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLE+PKKEN+S YRPPHMRRRENL KKQASVQN QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEDPKKENYSPYRPPHMRRRENLTKKQASVQNPQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 498
Query: 541 SISLGQILMQLHIG 555
SISLGQILMQLH G
Sbjct: 541 SISLGQILMQLHTG 498
BLAST of HG10008978 vs. ExPASy TrEMBL
Match:
A0A5A7TH89 (HEAT repeat-containing protein 6 isoform X1 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold44G001190 PE=4 SV=1)
HSP 1 Score: 811.6 bits (2095), Expect = 2.0e-231
Identity = 445/548 (81.20%), Positives = 464/548 (84.67%), Query Frame = 0
Query: 1 MATSSSSSASSVRSWRTAFLTLRDESISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEV 60
MAT SSSS+SSVRSWRTAFLTLRDES SSSTSISQLLY+TIF HSDSLIAAARYLPPPEV
Sbjct: 1 MATPSSSSSSSVRSWRTAFLTLRDESTSSSTSISQLLYNTIFPHSDSLIAAARYLPPPEV 60
Query: 61 SSDLLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQ 120
SSDLLFLLE+ATSA+DS QDI L FAD IHLIHGISY+V+LEFSSSSWN LLRYFGDV Q
Sbjct: 61 SSDLLFLLELATSAADSAQDIALTFADTIHLIHGISYQVSLEFSSSSWNPLLRYFGDVTQ 120
Query: 121 ILLGKLNIPGNYALIRPVLESLEIVRLASLFSSVSLGRYYSFLLLLLHTGSVFLSFGPKV 180
ILLGKLN P NYALIRPVLESLEIV
Sbjct: 121 ILLGKLNFPENYALIRPVLESLEIV----------------------------------- 180
Query: 181 MFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
RHVV +QQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS
Sbjct: 181 --------------------RHVVSIQQRKFLPAEDIQLSKFLLSVIAGSQSAIFPSSNS 240
Query: 241 IIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIRKLMDF 300
II HGCTAE VKSVPKCNSLWDVQAVAFDLLSQ +TSLGSYFPVDVWKSTIQVIRKLMDF
Sbjct: 241 IIGHGCTAE-VKSVPKCNSLWDVQAVAFDLLSQAITSLGSYFPVDVWKSTIQVIRKLMDF 300
Query: 301 LASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
LAST++LVEDK+MSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL
Sbjct: 301 LASTNVLVEDKMMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFSNRPLL 360
Query: 361 ACSIGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQSSMAAES 420
ACS+GNQGKEPSLTSTKSSLE+PKKEN+S YRPPHMRRRENL KKQASVQN QSSMA E
Sbjct: 361 ACSVGNQGKEPSLTSTKSSLEDPKKENYSPYRPPHMRRRENLTKKQASVQNPQSSMAVEY 420
Query: 421 LNCDLISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
LNCD ISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD
Sbjct: 421 LNCDSISSDSDHDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWTLLLPTRD 480
Query: 481 VLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAKCGSFMPL 540
VLLPRKFDATLMTCLLFDPSLK QIASAAALVVMLDRTTSISLQIAEYRDPAKC ++
Sbjct: 481 VLLPRKFDATLMTCLLFDPSLKVQIASAAALVVMLDRTTSISLQIAEYRDPAKCVLYLIQ 492
Query: 541 SISLGQIL 549
+ G++L
Sbjct: 541 RSTHGRLL 492
BLAST of HG10008978 vs. TAIR 10
Match:
AT4G38120.1 (ARM repeat superfamily protein )
HSP 1 Score: 293.5 bits (750), Expect = 3.6e-79
Identity = 220/561 (39.22%), Positives = 310/561 (55.26%), Query Frame = 0
Query: 5 SSSSASSVRSWRTAFLTLRDE-SISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEVSSD 64
+++++SSV WRTAFL+LRDE S + + LL D +FS S SLI+A +LP E++SD
Sbjct: 3 TAAASSSVGRWRTAFLSLRDEISTTPPPPVPLLLEDLLFSQSHSLISAVSHLPLHELTSD 62
Query: 65 LLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQILL 124
LFLL++ + A D + + LIH + + + +SSSW LLL F V++ LL
Sbjct: 63 CLFLLDLVSKADG--PDWIPVSRHTCQLIHDVCARLLFQLNSSSWPLLLHSFASVLEFLL 122
Query: 125 GKLNIPGN------YALIRPVLESLEIV-RLASLF-SSVSLGRYYSFLLLLLHTGSVFLS 184
+ +P + ++ I PV++ E + RLA + ++ L ++ ++ LLH V LS
Sbjct: 123 -RQPMPSSPYSAAYFSRIEPVIQCFETLRRLAPMHPENIHLVKFLVRVVPLLHQDLV-LS 182
Query: 185 FGPKVMFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIF 244
+G F D
Sbjct: 183 YG----------FSNQD------------------------------------------- 242
Query: 245 PSSNSIIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIR 304
PS T V K +P+ N LWD A+AFD+ + + S FP DV + T++V+R
Sbjct: 243 PSP--------TLLVEKKLPQQNRLWDSMALAFDMFGRAFSLSESLFPTDVSQCTLEVLR 302
Query: 305 KLMDFLASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFS 364
K+MD LAS LVED+ M RY +L L P S + A +A+LRMFF +G +
Sbjct: 303 KVMDVLASKGQLVEDRFMWRYMPLVLWRLQFT---PFFLGSIRLVALLASLRMFFCFGLT 362
Query: 365 NRPLLACS-IGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQS 424
P L+ S + + K ++ + K ++ YRPPH+R+R++LN +Q + +
Sbjct: 363 GPPQLSVSDVVHNDKHLNVKLSPLISGVSKNAKNTPYRPPHLRKRDDLNTRQPVSSSWRR 422
Query: 425 SMAAESLNCDLISSDSD-HDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWT 484
A +S + D+ISSDSD DSDG D+ Q+ KVR+AAI+CIQDLCQAD K+FT+QW
Sbjct: 423 LSAHDSGSSDVISSDSDFSDSDGSVPDSYFAQSSKVRIAAIVCIQDLCQADSKSFTTQWV 482
Query: 485 LLLPTRDVLLPRKFDATLMTCLLFDPSLKAQIASAAALVVMLDRTTSISLQIAEYRDPAK 544
L PT DVL PRKF+ATLMTCLLFDP LK +IASA+AL M+D +SI LQ+AEY++ K
Sbjct: 483 TLFPTSDVLKPRKFEATLMTCLLFDPHLKVRIASASALATMMDGPSSIFLQVAEYKESTK 495
Query: 545 CGSFMPLSISLGQILMQLHIG 555
GSFMPLS SLG ILMQLH G
Sbjct: 543 YGSFMPLSNSLGLILMQLHTG 495
BLAST of HG10008978 vs. TAIR 10
Match:
AT4G38120.2 (ARM repeat superfamily protein )
HSP 1 Score: 264.2 bits (674), Expect = 2.3e-70
Identity = 193/510 (37.84%), Positives = 280/510 (54.90%), Query Frame = 0
Query: 5 SSSSASSVRSWRTAFLTLRDE-SISSSTSISQLLYDTIFSHSDSLIAAARYLPPPEVSSD 64
+++++SSV WRTAFL+LRDE S + + LL D +FS S SLI+A +LP E++SD
Sbjct: 3 TAAASSSVGRWRTAFLSLRDEISTTPPPPVPLLLEDLLFSQSHSLISAVSHLPLHELTSD 62
Query: 65 LLFLLEVATSASDSVQDIVLIFADIIHLIHGISYEVALEFSSSSWNLLLRYFGDVIQILL 124
LFLL++ + A D + + LIH + + + +SSSW LLL F V++ LL
Sbjct: 63 CLFLLDLVSKADG--PDWIPVSRHTCQLIHDVCARLLFQLNSSSWPLLLHSFASVLEFLL 122
Query: 125 GKLNIPGN------YALIRPVLESLEIV-RLASLF-SSVSLGRYYSFLLLLLHTGSVFLS 184
+ +P + ++ I PV++ E + RLA + ++ L ++ ++ LLH V LS
Sbjct: 123 -RQPMPSSPYSAAYFSRIEPVIQCFETLRRLAPMHPENIHLVKFLVRVVPLLHQDLV-LS 182
Query: 185 FGPKVMFAHLSEFVESDEICIVIVARHVVCLQQRKFLPAEDIQLSKFLLSVIAGSQSAIF 244
+G F D
Sbjct: 183 YG----------FSNQD------------------------------------------- 242
Query: 245 PSSNSIIRHGCTAEVVKSVPKCNSLWDVQAVAFDLLSQTVTSLGSYFPVDVWKSTIQVIR 304
PS T V K +P+ N LWD A+AFD+ + + S FP DV + T++V+R
Sbjct: 243 PSP--------TLLVEKKLPQQNRLWDSMALAFDMFGRAFSLSESLFPTDVSQCTLEVLR 302
Query: 305 KLMDFLASTSLLVEDKVMSRYYLSLLRCLHLVIAEPKCSLSDHVSAFVAALRMFFAYGFS 364
K+MD LAS LVED+ M +Y LL C+H V+ KC +SDHV +F+A+LRMFF +G +
Sbjct: 303 KVMDVLASKGQLVEDRFMWSFYSCLLGCVHEVLTNIKCPVSDHVLSFIASLRMFFCFGLT 362
Query: 365 NRPLLACS-IGNQGKEPSLTSTKSSLEEPKKENHSAYRPPHMRRRENLNKKQASVQNSQS 424
P L+ S + + K ++ + K ++ YRPPH+R+R++LN +Q + +
Sbjct: 363 GPPQLSVSDVVHNDKHLNVKLSPLISGVSKNAKNTPYRPPHLRKRDDLNTRQPVSSSWRR 422
Query: 425 SMAAESLNCDLISSDSD-HDSDGPGRDADIIQNGKVRVAAILCIQDLCQADPKAFTSQWT 484
A +S + D+ISSDSD DSDG D+ Q+ KVR+AAI+CIQDLCQAD K+FT+QW
Sbjct: 423 LSAHDSGSSDVISSDSDFSDSDGSVPDSYFAQSSKVRIAAIVCIQDLCQADSKSFTTQWV 447
Query: 485 LLLPTRDVLLPRKFDATLMTCLLFDPSLKA 504
L PT DVL PRKF+ATLMTCLLFDP LK+
Sbjct: 483 TLFPTSDVLKPRKFEATLMTCLLFDPHLKS 447
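Each HSP above carries a statistics line of the form `Identity = a/b (p%), Positives = c/d (q%)`. As a minimal sketch of how the percentages relate to the counts, the snippet below parses such a line and recomputes them. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for this illustration; the regular expression assumes the line layout shown on this page and accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches the statistics line printed under each HSP on this page.
# "Posi?tives" tolerates the misspelling "Postives" seen in some exports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if no match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_len": int(ident_len),
        # Percentages recomputed from the counts, rounded to 2 decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the line, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 220/561 (39.22%), Positives = 310/561 (55.26%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])
```

Comparing the recomputed and reported values is a cheap sanity check that a line was not damaged during export.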
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038875589.1 | 1.5e-252 | 86.10 | HEAT repeat-containing protein 6 isoform X2 [Benincasa hispida]
XP_038875588.1 | 1.5e-252 | 86.10 | HEAT repeat-containing protein 6 isoform X1 [Benincasa hispida]
XP_004145966.1 | 1.6e-246 | 84.48 | uncharacterized protein LOC101212003 isoform X1 [Cucumis sativus] >KGN49951.1 hy...
XP_031741422.1 | 1.6e-246 | 84.48 | uncharacterized protein LOC101212003 isoform X2 [Cucumis sativus]
XP_008437486.1 | 3.6e-243 | 83.75 | PREDICTED: HEAT repeat-containing protein 6 isoform X3 [Cucumis melo]
Match Name | E-value | Identity (%) | Description
A0A0A0KQH7 | 7.6e-247 | 84.48 | DUF4042 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G146250 PE=...
A0A1S3AU95 | 1.8e-243 | 83.75 | HEAT repeat-containing protein 6 isoform X3 OS=Cucumis melo OX=3656 GN=LOC103482...
A0A1S3AUP9 | 1.8e-243 | 83.75 | HEAT repeat-containing protein 6 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103482...
A0A1S3AUR0 | 1.8e-243 | 83.75 | HEAT repeat-containing protein 6 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103482...
A0A5A7TH89 | 2.0e-231 | 81.20 | HEAT repeat-containing protein 6 isoform X1 OS=Cucumis melo var. makuwa OX=11946...
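The pipe-delimited summary rows above can be loaded for sorting or filtering. The following is a rough sketch under the assumption that each row holds at least four `|`-separated fields in the order match name, E-value, identity, description; `parse_summary_row` and its output keys are hypothetical names used only here. Truncated descriptions ending in `...` are kept exactly as they appear.

```python
def parse_summary_row(row):
    """Parse one 'Match Name | E-value | Identity | Description' row.

    Returns None for header rows and for lines with too few fields.
    Any trailing fields beyond the first four are ignored.
    """
    fields = [f.strip() for f in row.split("|")]
    if len(fields) < 4 or fields[0] == "Match Name":
        return None
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),          # e.g. "1.8e-243"
        "identity_pct": float(identity),  # percent identity as printed
        "description": description,
    }


rows = [
    "Match Name | E-value | Identity | Description",
    "A0A0A0KQH7 | 7.6e-247 | 84.48 | DUF4042 domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_5G146250 PE=...",
    "A0A5A7TH89 | 2.0e-231 | 81.20 | HEAT repeat-containing protein 6 isoform X1 OS=Cucumis melo var. makuwa OX=11946...",
]
hits = [h for h in map(parse_summary_row, rows) if h is not None]
# Smallest E-value first, i.e. the most significant hit first.
hits.sort(key=lambda h: h["evalue"])
print([h["match"] for h in hits])
```

Sorting on the parsed E-value rather than on the text matters because the values differ by exponent, which a string comparison would not order correctly.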