Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: utr5polypeptideCDSutr3
Hold the cursor over a type above to highlight its positions in the sequence below.
TTAATTATAAGAAAGGAAAAAGAAAAAAAAAAAATTGGAGATTTTGACAGGTAAAAACGAAGTGCGTATCAAAGTGTGTAATTTGGCGGGGGGCTGAAACCTCGGGGAGAAGAACTCTGTGACTGAATCTGCAAACTCTGTACCCGCGAAGAACAAATCCATCTCAATTCTCCACACTCTTCATCAAGCTTTGATCTTCCTTCTCGATCCATCTCTCTCTCCAATCCAATCGCCCAATTCGATTCCCCCAACTCCCACTCTCCGTTTCAAGAAATTTCCCCTTTTCTTTCCGATCCCGATCGAGCTCCAATTCCAAGAGGAAAAGTGGCATCACACGAGGCCAATGTTCGGATCTAGGTTTTCCATTTTCGGAACCGGCGGTGGTGCTGATAGCGTTGACAAGTCGGCCAAGAGCGAGGTTGTTCCTGGCCTCAAGCTTCGCTCCGACAAGGATGTGTATCGCCCTGGCGATCCCGTTGTTGTCACTCTCGAAATTTCCTCTTCTGTCCCTCAATTGGACTGCTCCCTTTTGATCGACCGGCTTAGTTTTGAGATTAAAGGCCTCCAGAAGTTGGACGCTCAGTGGTTTAGCACTCAGAAGCCGCTCCATGGATCCAAACAACGGAGAGGTTCGGCTTCCAACTCCTTTTATAATTCATTTCATCCGAAGGCGTTTGTTGGGTTTCAATTTCATGTTGTTTTGGTCTTGTCAGGCGAACACGTTTTCATGGACTGCTCTGTACAATCAGTAGTTTCAAATCAGATGGTTCCCGCGGGGTCTCCTAAGTCATGTAAGCCTATGTGTTCGATGCACTGCATTTCTGTACCTCAGTTGCTAACAATAATATGCCCATTGTGAACTTGAATATTATTTATTTAATGTTTTTTCCTTCATCATTCCCTTAGTTTAGTTCATAGTACATTACATTTCATTTTGAACTTTAGGCGTACTATCAAATGACATATGGGTACGCTTCAGTGAAAGATTGTGGGGATTTAGGACAAGAATTGCCTTCTTCCCGACGTCATAATGACCGATAAATTAAAATTCTAAATAGCCAGTACTAATTGGTGGGACGTTAAAATTAGTTGGAAGTTAGATACTAGTGATATATTTTCCACTGCTACTGTTTGTGGTCATTTCACCATGAATATATCTCTAATCCATAGAAGTTTACCGGAGGGGGGAAGGAAATCTAAATTACTTGTTTGTCCACTGCCAGTTTATCCAAGAATTGTAGTGACACATTTAATGTTTCTTGGTGTGATCCTGCCTTGGTGCAACATTGATTTGAACAATCTTTGGTTGGGAATTCTTTTAAGCTAAAAGGAAAGGTGCTTAGGAACTGTGCGATCAAATTTCTCTTTTGGGAAGTTTGGTGGGAAAGCTTTTAAACAATTCTCACGACAAATCTAGAAATGAGGAGGAGATTTGGGATATTATACCTTTTTGAGGTCTCCCTCTGGAGATTCTTGGTAAACTTTTCTATAACTTTGATTATGTTTCTTTATTAAACCAATTGGAGGAATTTTTTGTAATCCCTTTGGCTTCAGTTGGTTCCTCATCGCCCTTTTGTTTTGTCTTTCTTTCAATCGAATGGAAGTTTCTCTTGTTTCCTATAAAAAAAAAACTTGAGGCATCTCCAATCTTTGTTAAACGCTAACTGATGCTTGTAGGGATGGTTGTTTAGGTTGAAAAAAGCAACTTCCGTTTGGTCCATTTATAAGGCATGAATCATGATTCTGGTGCATAGTAGGTTCGAAGATTGGTAGGTGATATGGCACATTGTAGGCATAGCCATCGCTACATCTGTTTATTTTTAGATTAGGTTCTCTGTGAAGCTTTTCAGTTTAGTTTTGATGCTTCCGAAGCCACCCCCTCCTGTTTGGGATAAGACTTCTTTCATTCTATAAAGAAAAAATTTCTAACAGGGATGGAAAGAAAAAGTGACAAAACCTACCACCAAAGGAGCTCATAAAAAAGCTCTTAAAAAAAAAAAAGAAAAGCTCTTCCATTGGAATGAATTTAAATAAAGCACCCACCACAAGTTGTCTTATATAGGATTTCAATTGTATTTTACTCCTCCCTGCTTAATTTACCATTCTGATTGTAAGTACAGTAGTTCTGAAAATGTTCAATGTTATTGCAGATGTCGTCCGAACAACCCTTCCTAGCCGCATACCACCATCTTACAAGGGTGCTACTATTCGATACATGTATTATGTTAAAAGTACATTACAAGGACGATGGCTAATACAGGAAAATGGTCGATCTCATAAAGAGTCACTGAAGGATAAAATAGATATGGTTCGTTTTCATTAACTTAAAATTCATTGTGTGATTTGATGTGAATGGAAATTAGGTGCATTGCAGCTGGGAATTTGTTTTTGCCTTTTGTTTGTTTTTTTCATGAACAATGCTCTTCTCACTCTTACTCCGTGTTGTTACACATTTGTGGTGCTTTCTCTCTTACATTTTCATGTCTGAATCACTCTCTCTCATACAGTCAACAAGTTTTTCTTTTTAGAACGCGAATCATAGGCTTTTAGAATTTTTTAGATGCATTACTCTTTCCCTGGTATCATAATATTTTTAAAATTCATCCACAAAAGGATTATTTATTTATTTTGGATCAGAAACCTGTATATTCCAGCAGAAAAGGAAGGAAACAAACTAAAGGCTGGGTAGAGGGACCCCACCCAAAAACAAATAAACAAAGGCTACAAAAAGATTATTTATGTTCTTTTCTTATTGGAGAACAAAATTTAAAATCTAATGTGTTTAAGGATAGTTGTTAACATTCTCAAGCATAGACAAGAAACCTTTACATGCATTTTTTTGGAGTAGAAAATTGGAGGAGAATACTTGGCCTTTTCCTTTTCATGCATATTTCCTATATTTCTTGCTAAGATATTAGCTGGCTTCATTTATTATCTATCCAGGAAGCTCGGATTCCATTACAAGTGTGGGTCACTCAGAAAACCAGTGGCATGTTAATGGAATTGGAAGAAGGTCAAAATGGTCGAAATGATGGTGATCTTTCTTCTTTCTGTTTGTTGCAGTTATTGCATTTTTATTTATTTTTTTATTTTTTTATACAAGAAACAACTTTTCATTGAATATGAAAAGGATATTGCAGTTGTATTTTTAATTTCATCCATCTAAAAGTTCAAGTATTATGCAACGTGGTTTATGGAATTACATAGATATGCGCACAAGCCTATGTAGATATGGTTACTTGGATAGAGAGGGAACAGAGTTTATTGGGTCAATATCGTGTACAAACATGAATATAACTTGGAAGAATAGGAGCTTGCTGAAGTTCTTTCCAATTGTATAAAATGCCCTTTAAGTTCTACATAATCAGAATCCGTTATGGTACAAAGGAAAGAAACAAGGGATTCAAGTGACTTATAGTAGGCTTTTGAACATTTTTGAAGGAAAACTGAGGGATTCAAGTGACTTATAGGATTCTCATTTAACTATTTTTGAAAATCTATGACCCGTTAGATTCTAGAATTAATTGATCTCTCTTTTGAAGATCATTACTCAGTATGATCTGACCCTCGAGGAGTGGCCTTGGTGGTAAGGACTTGTAATGTTTGGAGATCCTCTCCAAAATCCTGCATTCAAGCCCCAGATGGAAAAACTTGGCTGATAGTAGTCATGAAGATCCTTTCCATTGGCTGGATGTACTGTAGTTTTTAGGTGTTTACTTTCATCTGGATCTTTCTTCATTTTCGATGCATTTGAGCTGAGGTTCTCCAGCATATTAGGTAGCATATTTTAGTGCTTCTGAGTTCTGACAGAAAAACTCTTTATTTTCTGGTTCACTTTTATGTTTGACATATATAGGATGATATTAACCAAATAGTGTTGGCATATAATTTGTTTTCAACCTTTGGAGAATACGGAATTGACAAATATCTATTAATTATTCATTGCCTGTAATTATGTTTCTTATTTAAGTAGTGCTAAATCTGTTTGGTTTGTACAGCATAATTATGATAGAATGAGAATTTTGCTTGCTTTTCAAATGTTTTCCATGGCTTGTTACTTCTGCAGCCTTTCAAATGGATGTATTTTGGAAAGAGATGGAAGGTGACACTGATTGGGTATGCTACTTACTAATTACTATTGCATAATATCATTTTGTTCTTGTCCTCTCTAGGTTCTTATTAGGTTGGAAAAAGGCTAAATATATATTATTCGGTCCTTGGAAGTATGCACCATTTTTTTAAACTTGTCTGAGTCTCTCAAATTTTTTTTTTTTTAAACTTGTCTCTTCACTAGTTCTTTTGGAAGGGGGACCTTTCTTCCATAGTTTGTGTTTCATTTTTCATCAATACGTCTTTTGTTTCATATGAAAAAAAAAAAATTCTACTTATGGACACCATTGAAATTGAATAATGGGATGATGGTGTGACATTAAAATTTTTAAATCAAATACCTTCTTCAGTTGTGACATTCTTAATATGATATGATGTTTTGAAAATAGTAGCTCCTATATTTTTAAATGGCACGTAATTTTTATGTATTTTATTTCAATAGCATGTTTTAGTGTAGGGATTTGATTGAAAGTTTATGAAATAATTGAATTTTGTGAAATTGAAAAATGATGTATACTTTTCAGGGACAAAATGGTATACATTAACAAAAAATTGTAGATCATTTGTTACTCATATCCTCCGTGCTCGTGAGTTTATATATCATGTGTAGTTAGTATATTCTTCAAAGTAATGCCTTCTAATTTGATATTAGAGTTATGAACCCCAATTCAATTTATTTGATATAAATATACGACCAATTCCTTACAGATTAGAGCAAATGATATATATGATGGCATTGATGAAGGATATGAAAGCTCAAGGGATGAGATCTCATCGGTTTCATCATACAATCCCATGAGAGAATCTTTTCGCAGAACCTTTGGAAGTTCACCATCGTTACAATCATCTCCTGTGAGATCTTCAATAAGGGATGCTTCTTTTATTGAAGGAGAGCGATTAAGTTTTTCTAATGCCGTGCGTCCTCGGGTCTCTGTTGCTGAGGTCCTATATGATTCTGCTGGTGGGTTTATAATGCATAGGCTATGTTTTGCTCAAGGCATTTGACTGATCATTTTTTTCCTCTGCTTGTAAAGTTGTCCCGATTAAATTTTTTGGTAATTGAACTCTCTCTCTAAAAGGAAGATTATAAGTATGTCTTTTTGGATATCCAGAAAGAATGACTGGGGTTATTTATTTTGACAGCTTCTATTTTATGCAATTAGGGGTTTTAATGTTATTTAGTCTAGAGACAACAATTCTCTGACTGACAGTTTTCTATGGAAACAGATGTCGCAACACCTCAGAAGTCATCTGCAGTTGTCTCTCCCAGCCAGGCGTTGAAGTTTGGGAAGCATCAGTCAACAGATGATGATCCTGGAGTACCATCTTCACCAATGGCTAGGACTGTTGAACCTGTAGCATGTAAATTATTAACTTTCTTGTCTCCCTCCCTTGCGGAAAAGAAAAAACTAACTCCCTTTGTAATTCTACTTTTTGCAAAAGGCCTTCTCATTATTATTTAGTCTCATTTATTACACGTCATGATATTATCAAACAAAATGGAAACTAATTATCTGCAAAAATTTCTTGCAGCAGAAGGCTTTCTTCGAGGAAGATCTTACAATATCAGGATAGACGACCAAGTTTTGCTTAGGTTTTGCCCCAAGAATTCTGATTCGACCTATTATTTTAGTGACATGGTAAACGTTTCTTCCAATCTGACTTGTTGCTTATTTTAATTACTTACCTCCTTCCAAGAAAGAAGGAAGAAGGAAAAATAAAAGAGAAAATATGATAAAAATAAGGACCAACATGCTCCAATTTTGAGCCGACTAAGAGAGAATCCCATGCTACGATGGTTGACTATTTTCTTCTTTTACACATTTGATGAATCAGATAGGTGGAACTCTTACCTTTTTCCATGAAGAAGGATCTAGGAGATGCCTTGAGGTAACTATGGTATTTTAATATGGAATAATTGGTCAGGGCAATTATCTAGTTAACTTGGGTAGTTGATGCATTATTAGGTCTATTCTTCCAGGTTTCAATAACTTTGGAGACTTCAGAGACTGTATCTCACCGTTTTGTCCATCCATCTCGGAGAAATTCTCCAACGATTGTGAAGGTATTTTCAATATTTTGAATGTGGAGAAGAGTGCCGTTATTTGCTTGTGTCTCTGTAGGATCTTCTGCGTTGATGTCATCTTGGTTTCTTGGGTGCTTTTGCTTGAGAAACATAGGAATTTTTGGCTCAGCCTTTGAGGATATAGTTGATTTCTTTTTCTTGGCTTCGAGATGGTCCTCTTAAGTCTTATCCAGTTTCTTTTGTAATTTATGTTCTTTTTTATTTTTTTTTTCTTTTTTGATTGGAATTCTCTATTTGATATTAAAAACAATGAGAGACTGTTTCTGTAATCCTTTTTGGTTCTACGCTACTCATACATTTTTTTGGAAGAATAAAACGTTCCTTAAAAAAGGAATGTATATTTGTCATGGTGTTTAATGTTGTGAAATGCATGGATGTGGCTACAACATTTTCTATTGTCTCACCAAGAGATAATGTTTATATATAATTCCCCTCCCCAGTTAATACTTGTGATATTGTGGTTGGTCGAAGTATCGGTTGAGGTGGCATGATGGGCGCATTATTTATTTTAGTTGAGTGATGGGACTTCCATTGTGTGGTATACGGATGGGAGATAAAGACAATACTTGGCTCTATAGTAAATAGCAACTTTGTCTTTAATACCATTAAATATTTGACAACTAAGAGAGTGAAATAAAAAACAAATTTTTTTTTCTAAAAAATAACGAGTTTTCAAATATTTAGTGGTGTCAAGGACAAAGTCAATCTTTACTACATAAGGAGTTCAGGGAGATAATGCGTTCAGGGATATCTGTTAGCTTTCAGTTTCTTGACATATTTGTTGGATTATATTTGTAGGTTCAGAGTGATCATTATGAGGTTGTTGCTGATTTAATTCAGACAAGCTTTCTTTTCTCCATTCCCATGGATGGACCCATGTCCTTTTCCACTCCGCATGTATCTTTGAAGTGGGCACTTCGCTTTGAATTTTTTACCACTCCCAAGAACCTGGATTGGACCAGGTATTGACTTTGCTCTAACTTCTAATATTTCGGTGAAAAAGCTTTTTACTACCGTAGTAACTTCTCTCTCTCTATTTTTATTTATTATTATTATTATAAGAAACATTATTGTATTAATCAAAAAAGGAGGAGGAACAGCCTAGGGAAAAGGGTGGAGGTCACCCCGCCCAAACAATAACAATAAACAATGAGAGGTTTCCAATTGTTTATAATCCTATAGTTACAAAAGAATTTCTTGTAGTTTATGCAAAACCACAGAGAAGTATGCTATACATTGTCAGGATTTGTTCTCTTCAAAAAGTATACTCTTTCTTTCTTTCTATTTCACAGTCACAACAAGGCATGAACTGCACAATTCCATAGGCTGTTAGCTTTGCACAATTTTATAGGATGTTAGCTTTACCTCTTAGCAGTGAGCCACCAAATTTTCCACAATCAAATCAACTTTATAGGGAAAACATTATAGGGAGGGCAGAGGGTTAAAGTCGAATTCCGAGAGAATGAAACACCAGGCCTTGTGGGCAAAATAACAATGCAAAAAAATGTGGTGGACTGATTCCTGTTTCAAGTTACACATGCTGCAAATACGATAGTACCTTCTCTCAAACAAACATAAATGAAACACTAGATTTCACATAGTAAAATAATATATTTATCGATTAGACTGAATATGAATGGATGTATACCTATAACGATACACTTGAATATTTCCACTAGGCCATATCTTCTTTCTTATCACTTATCTGAAATTGTTGAAGATTTTTCCAGATATTTTAATCCAATTTTTACCCTTTCAGATATGAGCATCCTCTTTTGATAGAAGGAAGAGAAAAAAGTGAGTGGATTCTTCCAATCACTGTGCATGCACCTCCATCTAGCACTCCGGCTGCTCGAAATGAGAGGCCTTTCTCCTTGGAGCCCTTGTGGATGCACAGCTGAAGGTAATAATAGATCATTCTTGAAATCACAGCATACGGATCAATTAGGTAGTCTAGTAGTTATCAATGTCGATCGGCTTCAAATTCACTCTGGGTTTCGGATGATTGCGAGCAGGGGTGGGGTCAGTTATTTTCATGGGGAGGGACAAAATTTTACCTTCTAAGGTTCAATTCTAAAAATTTTGAGGGTGAATGAGAGAGAGCTTTAAAATTTAATCTTTTATAGAACTTCTTTAATCTAAAACGGATCAAGATGGGGCTCACTCTGCCCCTAATTGGGAGATAAAGGATCAGTTTAACTCCTCCATATCGTGGCAAAAATCTTTCCATGATCATCTGCAGTTATGTGTAAAAGACTTAATTATGGTTGTTTATTGCTTTATGCATCACGCAGGGGTTCATGTTAAATGAAGAGCTTGTAGATTTGTCCAAGAATTTAGTATCACCTCGACGTCTCGCTGCCGTGAACTTTTCTGAAGAGACTAACATCGGGCAAATGTGACTGCCGAGCTGAAGCGCCCGAAGAAGAAGATGATTACGGAAGCATAGACCATTGACGTTGCAGCTTGACCGTGGTATGTCGCCATTGATTGCCAGAAGCATCAGATGGCATTCTTAGTTCCTCCTAATTCAAAGTGTGAATCTGATTTTGACTTTAAAAATGCAGTAGAAAAAATCACCTTTTTCTTTTCCTTTAATTTTGAATATATATCTTTGGCATTAAAGCTGCCTCCCAACATTCTTTTGAAACTGCCAAATGTGCTTAAATTCTTTATATTTATGTATTCTGATTCCTAGCTCCCAAGTCCCAACTATTCCCCCTTTGGGAAACGAATTTGGTGCTTTAGAATAATGCAAGAACCCATTTACCCTTTCGTTTTTCATTCTCTCTACTGATTACTTCCAATGAAAAGAAAAGTTGGTATGGAACTTAAAAATAAACCACCAATTCTACCAGAAACTGATTTCAATCACACAGATTGATCCATGGAGTTCATCTTCAACTGTTTCTCAGCCATTCTTACTGCTTTCGTCGTCTTCGCAGTCAGCTTCTGCACAAGGCTATTCTGGTGCTGCCCCAATGCGCCCCACGGTTCTTGCAGCGGAGATGACTACTACATCGGTTCTTGCAGCGGAGATGACTACTACATTGTGTGTCTGCAGGCGATCGTCATCGGCGAGTGATCTTGGAAGCTACCCAAGTTTGATCACTGCTTCCATGGTGAGTGTATTGATGTGTGGCTTCAGTCCCAACCCACATGCCTGCTGTGTGGAAACCATGTCGTGGTTCCATTTTCGTCTTCCTTTTGGCCATTGGCTTTGATCACATCTATTTTCGTTTCGTTTTTCCAGAGATTTTGTCACGAAACATTCTACTCTATCGATTGGATGTTGCCTTGGCTTTAAGTTATTGCTTATTTCATTCGTAATTCCAACTTACTGGAGATCAGTTCTACCGTTTTTTTTTCTTTTTGGTCTTTTATGTGTTACATATTAGAATTGTTGGATGATCTAGGTAAATATTTTAAACATTTATGTTCTTACAAGATATTTTCAATAAATC
mRNA sequence
TTAATTATAAGAAAGGAAAAAGAAAAAAAAAAAATTGGAGATTTTGACAGGTAAAAACGAAGTGCGTATCAAAGTGTGTAATTTGGCGGGGGGCTGAAACCTCGGGGAGAAGAACTCTGTGACTGAATCTGCAAACTCTGTACCCGCGAAGAACAAATCCATCTCAATTCTCCACACTCTTCATCAAGCTTTGATCTTCCTTCTCGATCCATCTCTCTCTCCAATCCAATCGCCCAATTCGATTCCCCCAACTCCCACTCTCCGTTTCAAGAAATTTCCCCTTTTCTTTCCGATCCCGATCGAGCTCCAATTCCAAGAGGAAAAGTGGCATCACACGAGGCCAATGTTCGGATCTAGGTTTTCCATTTTCGGAACCGGCGGTGGTGCTGATAGCGTTGACAAGTCGGCCAAGAGCGAGGTTGTTCCTGGCCTCAAGCTTCGCTCCGACAAGGATGTGTATCGCCCTGGCGATCCCGTTGTTGTCACTCTCGAAATTTCCTCTTCTGTCCCTCAATTGGACTGCTCCCTTTTGATCGACCGGCTTAGTTTTGAGATTAAAGGCCTCCAGAAGTTGGACGCTCAGTGGTTTAGCACTCAGAAGCCGCTCCATGGATCCAAACAACGGAGAGGCGAACACGTTTTCATGGACTGCTCTGTACAATCAGTAGTTTCAAATCAGATGGTTCCCGCGGGGTCTCCTAAGTCATATGTCGTCCGAACAACCCTTCCTAGCCGCATACCACCATCTTACAAGGGTGCTACTATTCGATACATGTATTATGTTAAAAGTACATTACAAGGACGATGGCTAATACAGGAAAATGGTCGATCTCATAAAGAGTCACTGAAGGATAAAATAGATATGGAAGCTCGGATTCCATTACAAGTGTGGGTCACTCAGAAAACCAGTGGCATGTTAATGGAATTGGAAGAAGGTCAAAATGGTCGAAATGATGCCTTTCAAATGGATGTATTTTGGAAAGAGATGGAAGGTGACACTGATTGGATTAGAGCAAATGATATATATGATGGCATTGATGAAGGATATGAAAGCTCAAGGGATGAGATCTCATCGGTTTCATCATACAATCCCATGAGAGAATCTTTTCGCAGAACCTTTGGAAGTTCACCATCGTTACAATCATCTCCTGTGAGATCTTCAATAAGGGATGCTTCTTTTATTGAAGGAGAGCGATTAAGTTTTTCTAATGCCGTGCGTCCTCGGGTCTCTGTTGCTGAGGTCCTATATGATTCTGCTGATGTCGCAACACCTCAGAAGTCATCTGCAGTTGTCTCTCCCAGCCAGGCGTTGAAGTTTGGGAAGCATCAGTCAACAGATGATGATCCTGGAGTACCATCTTCACCAATGGCTAGGACTGTTGAACCTGTAGCATCAGAAGGCTTTCTTCGAGGAAGATCTTACAATATCAGGATAGACGACCAAGTTTTGCTTAGGTTTTGCCCCAAGAATTCTGATTCGACCTATTATTTTAGTGACATGATAGGTGGAACTCTTACCTTTTTCCATGAAGAAGGATCTAGGAGATGCCTTGAGGTTTCAATAACTTTGGAGACTTCAGAGACTGTATCTCACCGTTTTGTCCATCCATCTCGGAGAAATTCTCCAACGATTGTGAAGGTTCAGAGTGATCATTATGAGGTTGTTGCTGATTTAATTCAGACAAGCTTTCTTTTCTCCATTCCCATGGATGGACCCATGTCCTTTTCCACTCCGCATGTATCTTTGAAGTGGGCACTTCGCTTTGAATTTTTTACCACTCCCAAGAACCTGGATTGGACCAGATATGAGCATCCTCTTTTGATAGAAGGAAGAGAAAAAAGTGAGTGGATTCTTCCAATCACTGTGCATGCACCTCCATCTAGCACTCCGGCTGCTCGAAATGAGAGGCCTTTCTCCTTGGAGCCCTTGTGGATGCACAGCTGAAGGGGTTCATGTTAAATGAAGAGCTTGTAGATTTGTCCAAGAATTTAGTATCACCTCGACGTCTCGCTGCCGTGAACTTTTCTGAAGAGACTAACATCGGGCAAATGTGACTGCCGAGCTGAAGCGCCCGAAGAAGAAGATGATTACGGAAGCATAGACCATTGACGTTGCAGCTTGACCGTGGTATGTCGCCATTGATTGCCAGAAGCATCAGATGGCATTCTTAGTTCCTCCTAATTCAAAGTGTGAATCTGATTTTGACTTTAAAAATGCAGTAGAAAAAATCACCTTTTTCTTTTCCTTTAATTTTGAATATATATCTTTGGCATTAAAGCTGCCTCCCAACATTCTTTTGAAACTGCCAAATGTGCTTAAATTCTTTATATTTATGTATTCTGATTCCTAGCTCCCAAGTCCCAACTATTCCCCCTTTGGGAAACGAATTTGGTGCTTTAGAATAATGCAAGAACCCATTTACCCTTTCGTTTTTCATTCTCTCTACTGATTACTTCCAATGAAAAGAAAAGTTGGTATGGAACTTAAAAATAAACCACCAATTCTACCAGAAACTGATTTCAATCACACAGATTGATCCATGGAGTTCATCTTCAACTGTTTCTCAGCCATTCTTACTGCTTTCGTCGTCTTCGCAGTCAGCTTCTGCACAAGGCTATTCTGGTGCTGCCCCAATGCGCCCCACGGTTCTTGCAGCGGAGATGACTACTACATCGGTTCTTGCAGCGGAGATGACTACTACATTGTGTGTCTGCAGGCGATCGTCATCGGCGAGTGATCTTGGAAGCTACCCAAGTTTGATCACTGCTTCCATGGTGAGTGTATTGATGTGTGGCTTCAGTCCCAACCCACATGCCTGCTGTGTGGAAACCATGTCGTGGTTCCATTTTCGTCTTCCTTTTGGCCATTGGCTTTGATCACATCTATTTTCGTTTCGTTTTTCCAGAGATTTTGTCACGAAACATTCTACTCTATCGATTGGATGTTGCCTTGGCTTTAAGTTATTGCTTATTTCATTCGTAATTCCAACTTACTGGAGATCAGTTCTACCGTTTTTTTTTCTTTTTGGTCTTTTATGTGTTACATATTAGAATTGTTGGATGATCTAGGTAAATATTTTAAACATTTATGTTCTTACAAGATATTTTCAATAAATC
Coding sequence (CDS)
ATGTTCGGATCTAGGTTTTCCATTTTCGGAACCGGCGGTGGTGCTGATAGCGTTGACAAGTCGGCCAAGAGCGAGGTTGTTCCTGGCCTCAAGCTTCGCTCCGACAAGGATGTGTATCGCCCTGGCGATCCCGTTGTTGTCACTCTCGAAATTTCCTCTTCTGTCCCTCAATTGGACTGCTCCCTTTTGATCGACCGGCTTAGTTTTGAGATTAAAGGCCTCCAGAAGTTGGACGCTCAGTGGTTTAGCACTCAGAAGCCGCTCCATGGATCCAAACAACGGAGAGGCGAACACGTTTTCATGGACTGCTCTGTACAATCAGTAGTTTCAAATCAGATGGTTCCCGCGGGGTCTCCTAAGTCATATGTCGTCCGAACAACCCTTCCTAGCCGCATACCACCATCTTACAAGGGTGCTACTATTCGATACATGTATTATGTTAAAAGTACATTACAAGGACGATGGCTAATACAGGAAAATGGTCGATCTCATAAAGAGTCACTGAAGGATAAAATAGATATGGAAGCTCGGATTCCATTACAAGTGTGGGTCACTCAGAAAACCAGTGGCATGTTAATGGAATTGGAAGAAGGTCAAAATGGTCGAAATGATGCCTTTCAAATGGATGTATTTTGGAAAGAGATGGAAGGTGACACTGATTGGATTAGAGCAAATGATATATATGATGGCATTGATGAAGGATATGAAAGCTCAAGGGATGAGATCTCATCGGTTTCATCATACAATCCCATGAGAGAATCTTTTCGCAGAACCTTTGGAAGTTCACCATCGTTACAATCATCTCCTGTGAGATCTTCAATAAGGGATGCTTCTTTTATTGAAGGAGAGCGATTAAGTTTTTCTAATGCCGTGCGTCCTCGGGTCTCTGTTGCTGAGGTCCTATATGATTCTGCTGATGTCGCAACACCTCAGAAGTCATCTGCAGTTGTCTCTCCCAGCCAGGCGTTGAAGTTTGGGAAGCATCAGTCAACAGATGATGATCCTGGAGTACCATCTTCACCAATGGCTAGGACTGTTGAACCTGTAGCATCAGAAGGCTTTCTTCGAGGAAGATCTTACAATATCAGGATAGACGACCAAGTTTTGCTTAGGTTTTGCCCCAAGAATTCTGATTCGACCTATTATTTTAGTGACATGATAGGTGGAACTCTTACCTTTTTCCATGAAGAAGGATCTAGGAGATGCCTTGAGGTTTCAATAACTTTGGAGACTTCAGAGACTGTATCTCACCGTTTTGTCCATCCATCTCGGAGAAATTCTCCAACGATTGTGAAGGTTCAGAGTGATCATTATGAGGTTGTTGCTGATTTAATTCAGACAAGCTTTCTTTTCTCCATTCCCATGGATGGACCCATGTCCTTTTCCACTCCGCATGTATCTTTGAAGTGGGCACTTCGCTTTGAATTTTTTACCACTCCCAAGAACCTGGATTGGACCAGATATGAGCATCCTCTTTTGATAGAAGGAAGAGAAAAAAGTGAGTGGATTCTTCCAATCACTGTGCATGCACCTCCATCTAGCACTCCGGCTGCTCGAAATGAGAGGCCTTTCTCCTTGGAGCCCTTGTGGATGCACAGCTGA
Protein sequence
MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDCSLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPLQVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRDEISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEVLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSYNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTPKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS
Homology
BLAST of MC07g0462 vs. NCBI nr
Match:
XP_022155720.1 (uncharacterized protein LOC111022779 [Momordica charantia])
HSP 1 Score: 1065 bits (2754), Expect = 0.0
Identity = 533/533 (100.00%), Postives = 533/533 (100.00%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC
Sbjct: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK
Sbjct: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL
Sbjct: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD
Sbjct: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEV 300
EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEV
Sbjct: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEV 300
Query: 301 LYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSY 360
LYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSY
Sbjct: 301 LYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSY 360
Query: 361 NIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFV 420
NIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFV
Sbjct: 361 NIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFV 420
Query: 421 HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTP 480
HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTP
Sbjct: 421 HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTP 480
Query: 481 KNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS 533
KNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS
Sbjct: 481 KNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS 533
BLAST of MC07g0462 vs. NCBI nr
Match:
XP_038904898.1 (uncharacterized protein LOC120091119 isoform X1 [Benincasa hispida])
HSP 1 Score: 895 bits (2312), Expect = 0.0
Identity = 451/539 (83.67%), Postives = 486/539 (90.17%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGSRFSIFGTG AD V+ SAKSEV PGLKLRSDKDVYRPGDPVVVT+EI SSV Q DC
Sbjct: 1 MFGSRFSIFGTGASADKVENSAKSEVFPGLKLRSDKDVYRPGDPVVVTIEICSSVAQFDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLI+RLSFEI GLQKLDAQWFSTQKP+ GSKQRRGEHVFMDCSVQS+VSNQ++ +G+ K
Sbjct: 61 SLLIERLSFEIIGLQKLDAQWFSTQKPIPGSKQRRGEHVFMDCSVQSIVSNQIISSGATK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SYVVR+ LP+ IPPSYKGATIRYMYYVKSTL GRWL QENGRSHKESL+D+I+ME R+PL
Sbjct: 121 SYVVRSMLPTCIPPSYKGATIRYMYYVKSTLLGRWLSQENGRSHKESLRDQIEMETRVPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKT+GMLME EGQNG+NDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGY+SSRD
Sbjct: 181 QVWVTQKTNGMLME--EGQNGQNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYDSSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRS---SIRDASFIEGERLSFS-NAVRPRVS 300
EISSVSSYNP RE F RTFGSS SLQSS RS SI+ A FIEGERLS S N RPRVS
Sbjct: 241 EISSVSSYNPTREPFHRTFGSSLSLQSSAGRSGRSSIKVAPFIEGERLSLSSNVARPRVS 300
Query: 301 VAEVLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLR 360
VAEVLY+S DVA+PQKS A VSPSQ L F K+QSTDDD GV SSP + ++PVASEGF+R
Sbjct: 301 VAEVLYESTDVASPQKSFAAVSPSQVLNFEKNQSTDDDAGVASSPRPKIIKPVASEGFIR 360
Query: 361 GRSYNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVS 420
GRSYNIR+DDQVLLRFCPKNSDS YYFSDMIGGTLTFFHEEG RRCLEVSITLETSETVS
Sbjct: 361 GRSYNIRVDDQVLLRFCPKNSDSNYYFSDMIGGTLTFFHEEGMRRCLEVSITLETSETVS 420
Query: 421 HRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEF 480
RF+HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSL+WALRFEF
Sbjct: 421 RRFIHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLQWALRFEF 480
Query: 481 FTTPKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPLWMHS 533
FTTPKN++WTRYEHPL+IEGREKSEWILPITVHAPPSS A RN+RPFSLEP+WMH+
Sbjct: 481 FTTPKNVNWTRYEHPLMIEGREKSEWILPITVHAPPSSAATAQNRNDRPFSLEPMWMHN 537
BLAST of MC07g0462 vs. NCBI nr
Match:
XP_011653362.1 (uncharacterized protein LOC101218523 [Cucumis sativus] >XP_031739927.1 uncharacterized protein LOC101218523 [Cucumis sativus] >KGN53681.2 hypothetical protein Csa_014549 [Cucumis sativus])
HSP 1 Score: 892 bits (2306), Expect = 0.0
Identity = 450/537 (83.80%), Postives = 485/537 (90.32%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGA-DSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLD 60
MFGSRFSIFGTG A D V+KSAKSE PGLKLRSDKDVYRPGDPVVVT+EI SSVPQLD
Sbjct: 1 MFGSRFSIFGTGAAAADKVEKSAKSEFFPGLKLRSDKDVYRPGDPVVVTIEICSSVPQLD 60
Query: 61 CSLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSP 120
CSLLI+RL FEI GL KLDAQWFSTQKP+ GSKQRRGEH+FMDCSVQS+VS+Q++ +G+
Sbjct: 61 CSLLIERLRFEIIGLHKLDAQWFSTQKPIPGSKQRRGEHIFMDCSVQSIVSSQIISSGAM 120
Query: 121 KSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIP 180
KSYVVR+TLP+ IPPSYKGATIRYMYYVKSTL GRWL QENGRSHKES KD+I+MEAR+P
Sbjct: 121 KSYVVRSTLPTCIPPSYKGATIRYMYYVKSTLLGRWLSQENGRSHKESPKDQIEMEARLP 180
Query: 181 LQVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSR 240
LQVWVTQKT+GMLME G+NDAFQMDVFWKEME DTDWIRANDIYDG DEGY+SSR
Sbjct: 181 LQVWVTQKTNGMLME-----EGQNDAFQMDVFWKEMESDTDWIRANDIYDGTDEGYDSSR 240
Query: 241 DEISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFS-NAVRPRVSVA 300
DEISSVSSYNPMRE F RTFGSS SLQSS RSSI+ A FIEGERLS S N RPRVSVA
Sbjct: 241 DEISSVSSYNPMREPFHRTFGSSLSLQSSAGRSSIKIAPFIEGERLSLSSNVARPRVSVA 300
Query: 301 EVLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGR 360
EVLY+SADVA+PQKS A VSPSQ L F K+QSTDDD G +SP +T+EPVASEGF+RGR
Sbjct: 301 EVLYESADVASPQKSFAAVSPSQVLNFEKNQSTDDDAGAATSPRPKTIEPVASEGFIRGR 360
Query: 361 SYNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHR 420
SYNIR+DDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEG+RRCLE+SITLETSETVS R
Sbjct: 361 SYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGTRRCLELSITLETSETVSRR 420
Query: 421 FVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFT 480
F+HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSL+WALRFEFFT
Sbjct: 421 FIHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLQWALRFEFFT 480
Query: 481 TPKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPLWMHS 533
TPKN+DWTRYEHPLLIEGREKSEW+LPITVHAPPSS A RN+RPFSLEPLWMHS
Sbjct: 481 TPKNVDWTRYEHPLLIEGREKSEWVLPITVHAPPSSAATAQNRNDRPFSLEPLWMHS 532
BLAST of MC07g0462 vs. NCBI nr
Match:
XP_008455604.1 (PREDICTED: uncharacterized protein LOC103495739 isoform X1 [Cucumis melo] >XP_016901787.1 PREDICTED: uncharacterized protein LOC103495739 isoform X1 [Cucumis melo])
HSP 1 Score: 887 bits (2291), Expect = 0.0
Identity = 447/536 (83.40%), Postives = 482/536 (89.93%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGS+FSIFGTG AD V+KSAKSE PGLKLRSDKDVYRPGDPVVVT+EI SSVPQLDC
Sbjct: 1 MFGSKFSIFGTGTAADKVEKSAKSEFFPGLKLRSDKDVYRPGDPVVVTIEICSSVPQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLI+RL FEI GLQKLDAQWFSTQKP+ GSKQRRGEH+FMDCSVQS+VSNQ++ +G+ K
Sbjct: 61 SLLIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGEHIFMDCSVQSIVSNQIISSGAMK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SYVVR+ LP+ IPPSYKGATIRYMY VKSTL GRWL QEN RSHKES D+I+MEAR+PL
Sbjct: 121 SYVVRSMLPTCIPPSYKGATIRYMYCVKSTLVGRWLSQENCRSHKESPMDQIEMEARVPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKT+GMLME G+NDAFQMDVFWKEME DTDWIRANDIY GIDEGY+SSRD
Sbjct: 181 QVWVTQKTNGMLME-----EGQNDAFQMDVFWKEMESDTDWIRANDIYAGIDEGYDSSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFS-NAVRPRVSVAE 300
EISSVSSYNPMRE F RTFGSS SLQSS RSSI+ A FIEGERLS S N RPRVSVAE
Sbjct: 241 EISSVSSYNPMREPFHRTFGSSLSLQSSAGRSSIKIAPFIEGERLSLSSNVARPRVSVAE 300
Query: 301 VLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRS 360
VLY+S DVA+PQKS A VSPSQ L F K+QSTDDD GV +SP +T+EPVASEGF+RGRS
Sbjct: 301 VLYESTDVASPQKSFAAVSPSQVLNFEKNQSTDDDAGVATSPRPKTIEPVASEGFIRGRS 360
Query: 361 YNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRF 420
YNIR+DDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEG+RRCLE+SITLETSETVS RF
Sbjct: 361 YNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGTRRCLELSITLETSETVSRRF 420
Query: 421 VHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTT 480
+HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSL+WALRFEFFTT
Sbjct: 421 IHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLQWALRFEFFTT 480
Query: 481 PKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPLWMHS 533
PKN+DWTRYEHPLLIEGREKSEW+LPITVHAPPSS A RN+RPFSLEPLWMHS
Sbjct: 481 PKNVDWTRYEHPLLIEGREKSEWVLPITVHAPPSSAATAQNRNDRPFSLEPLWMHS 531
BLAST of MC07g0462 vs. NCBI nr
Match:
KAG7013397.1 (hypothetical protein SDJN02_23563 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 880 bits (2275), Expect = 0.0
Identity = 444/532 (83.46%), Postives = 481/532 (90.41%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGSRFSIFG G A+ V++S KSEV+PG +LR DKDVYRPGDPVVVT+EI SSV QLDC
Sbjct: 1 MFGSRFSIFGVGAAAEKVEESVKSEVLPGFELRCDKDVYRPGDPVVVTIEICSSVAQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLI+RL FEI GL+KLDAQWFSTQKP+ GS+QRRGEHVFMDCSVQS+VSNQ++ +G+ K
Sbjct: 61 SLLIERLRFEIIGLRKLDAQWFSTQKPIPGSRQRRGEHVFMDCSVQSIVSNQIISSGATK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SY VRTTLPSRIPPSYKGATIRYMYYVKSTL GRWL QENGRSHKESLKD+I+MEAR+PL
Sbjct: 121 SYEVRTTLPSRIPPSYKGATIRYMYYVKSTLLGRWLTQENGRSHKESLKDQIEMEARVPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKT+GMLME G+NDAFQMDVFWKEM+GD DWIRANDIYDGIDEGY+SSRD
Sbjct: 181 QVWVTQKTNGMLME-----EGQNDAFQMDVFWKEMKGDADWIRANDIYDGIDEGYDSSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFS-NAVRPRVSVAE 300
EISSVSSYNPMRE F RTFGSS SLQSS RSSI+DA FIEGERLS S N RPRVSVAE
Sbjct: 241 EISSVSSYNPMREPFHRTFGSSLSLQSSAGRSSIKDAPFIEGERLSLSPNVARPRVSVAE 300
Query: 301 VLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRS 360
VLYDSADVA+ QKS A VSPSQAL F K+Q TDDD GV SSPM + +EPVASEGF+RGRS
Sbjct: 301 VLYDSADVASSQKSFAAVSPSQALSFEKNQLTDDDVGVASSPMPKIIEPVASEGFIRGRS 360
Query: 361 YNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRF 420
YNIR+DDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHE+G+RRCLEVSITLETSETVS RF
Sbjct: 361 YNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEDGARRCLEVSITLETSETVSRRF 420
Query: 421 VHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTT 480
VHPSRRNSPTIVKVQSDH+EVVADLIQTSFLFSIP++GPMSFSTPHVSL+WALRFEFFTT
Sbjct: 421 VHPSRRNSPTIVKVQSDHFEVVADLIQTSFLFSIPINGPMSFSTPHVSLQWALRFEFFTT 480
Query: 481 PKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPL 529
PKN+DWTRYEHPLLIE REKSEWILPITVHAPPSST A RN+RPF LEPL
Sbjct: 481 PKNVDWTRYEHPLLIEAREKSEWILPITVHAPPSSTATAQNRNDRPFPLEPL 527
BLAST of MC07g0462 vs. ExPASy TrEMBL
Match:
A0A6J1DN77 (uncharacterized protein LOC111022779 OS=Momordica charantia OX=3673 GN=LOC111022779 PE=4 SV=1)
HSP 1 Score: 1065 bits (2754), Expect = 0.0
Identity = 533/533 (100.00%), Postives = 533/533 (100.00%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC
Sbjct: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK
Sbjct: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL
Sbjct: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD
Sbjct: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEV 300
EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEV
Sbjct: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPRVSVAEV 300
Query: 301 LYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSY 360
LYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSY
Sbjct: 301 LYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRSY 360
Query: 361 NIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFV 420
NIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFV
Sbjct: 361 NIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRFV 420
Query: 421 HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTP 480
HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTP
Sbjct: 421 HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTTP 480
Query: 481 KNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS 533
KNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS
Sbjct: 481 KNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARNERPFSLEPLWMHS 533
BLAST of MC07g0462 vs. ExPASy TrEMBL
Match:
A0A1S4E0M0 (uncharacterized protein LOC103495739 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103495739 PE=4 SV=1)
HSP 1 Score: 887 bits (2291), Expect = 0.0
Identity = 447/536 (83.40%), Postives = 482/536 (89.93%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGS+FSIFGTG AD V+KSAKSE PGLKLRSDKDVYRPGDPVVVT+EI SSVPQLDC
Sbjct: 1 MFGSKFSIFGTGTAADKVEKSAKSEFFPGLKLRSDKDVYRPGDPVVVTIEICSSVPQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLI+RL FEI GLQKLDAQWFSTQKP+ GSKQRRGEH+FMDCSVQS+VSNQ++ +G+ K
Sbjct: 61 SLLIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGEHIFMDCSVQSIVSNQIISSGAMK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SYVVR+ LP+ IPPSYKGATIRYMY VKSTL GRWL QEN RSHKES D+I+MEAR+PL
Sbjct: 121 SYVVRSMLPTCIPPSYKGATIRYMYCVKSTLVGRWLSQENCRSHKESPMDQIEMEARVPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKT+GMLME G+NDAFQMDVFWKEME DTDWIRANDIY GIDEGY+SSRD
Sbjct: 181 QVWVTQKTNGMLME-----EGQNDAFQMDVFWKEMESDTDWIRANDIYAGIDEGYDSSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFS-NAVRPRVSVAE 300
EISSVSSYNPMRE F RTFGSS SLQSS RSSI+ A FIEGERLS S N RPRVSVAE
Sbjct: 241 EISSVSSYNPMREPFHRTFGSSLSLQSSAGRSSIKIAPFIEGERLSLSSNVARPRVSVAE 300
Query: 301 VLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRS 360
VLY+S DVA+PQKS A VSPSQ L F K+QSTDDD GV +SP +T+EPVASEGF+RGRS
Sbjct: 301 VLYESTDVASPQKSFAAVSPSQVLNFEKNQSTDDDAGVATSPRPKTIEPVASEGFIRGRS 360
Query: 361 YNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRF 420
YNIR+DDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEG+RRCLE+SITLETSETVS RF
Sbjct: 361 YNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGTRRCLELSITLETSETVSRRF 420
Query: 421 VHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTT 480
+HPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSL+WALRFEFFTT
Sbjct: 421 IHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLQWALRFEFFTT 480
Query: 481 PKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPLWMHS 533
PKN+DWTRYEHPLLIEGREKSEW+LPITVHAPPSS A RN+RPFSLEPLWMHS
Sbjct: 481 PKNVDWTRYEHPLLIEGREKSEWVLPITVHAPPSSAATAQNRNDRPFSLEPLWMHS 531
BLAST of MC07g0462 vs. ExPASy TrEMBL
Match:
A0A6J1H364 (uncharacterized protein LOC111460073 OS=Cucurbita moschata OX=3662 GN=LOC111460073 PE=4 SV=1)
HSP 1 Score: 872 bits (2254), Expect = 0.0
Identity = 440/532 (82.71%), Postives = 479/532 (90.04%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGSRFSIFG G A+ V++S KSEV+PG +LR DKDVYRPGDPVVVT+EI SSV QLDC
Sbjct: 1 MFGSRFSIFGVGAAAEKVEESVKSEVLPGFELRCDKDVYRPGDPVVVTVEICSSVAQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLI+RL FEI GL+KLDAQWFSTQKP+ GS+QRRGEHVFMDCSVQS+VSNQ++ +G+ K
Sbjct: 61 SLLIERLRFEIIGLRKLDAQWFSTQKPIPGSRQRRGEHVFMDCSVQSIVSNQIISSGATK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SY VRT LPSRIPPSYKGATIRYMYYVKSTL G+WL QENGRSHKESLKD+I+MEAR+PL
Sbjct: 121 SYEVRTMLPSRIPPSYKGATIRYMYYVKSTLLGQWLTQENGRSHKESLKDQIEMEARVPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKT+GMLME G+NDAFQMDVFWKEM+GD DWIRANDIYDGIDEGY+SSRD
Sbjct: 181 QVWVTQKTNGMLME-----EGQNDAFQMDVFWKEMKGDADWIRANDIYDGIDEGYDSSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFS-NAVRPRVSVAE 300
EISSVSSYNPMRE F RTFGSS SLQSS RSSI+DA FIEGERLS S N RPRVSVAE
Sbjct: 241 EISSVSSYNPMREPFLRTFGSSLSLQSSAGRSSIKDAPFIEGERLSLSPNVARPRVSVAE 300
Query: 301 VLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRS 360
VLYDSADVA+ QKS A VSPSQAL F K+Q TDDD GV SSPM + +EPVASEGF+RGRS
Sbjct: 301 VLYDSADVASSQKSFAAVSPSQALSFEKNQLTDDDVGVASSPMPKIIEPVASEGFIRGRS 360
Query: 361 YNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRF 420
YNIR+D+QVLLRFCPKNSDSTYYFSDMIGGTLTFFHE+G+RRCLE SITLETSETVS RF
Sbjct: 361 YNIRVDEQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEDGARRCLEASITLETSETVSRRF 420
Query: 421 VHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTT 480
VHPSRRNSPTIVKVQSDH+EVVADLIQTSFLFSIP++GPMSFSTPHVSL+WALRFEFFTT
Sbjct: 421 VHPSRRNSPTIVKVQSDHFEVVADLIQTSFLFSIPINGPMSFSTPHVSLQWALRFEFFTT 480
Query: 481 PKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPL 529
PKN+DWTRYEHPLLIE REKSEWILPITVHAPPSST A RN+RPF LEPL
Sbjct: 481 PKNVDWTRYEHPLLIEAREKSEWILPITVHAPPSSTATAQNRNDRPFPLEPL 527
BLAST of MC07g0462 vs. ExPASy TrEMBL
Match:
A0A6J1L132 (uncharacterized protein LOC111499409 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111499409 PE=4 SV=1)
HSP 1 Score: 872 bits (2252), Expect = 0.0
Identity = 441/532 (82.89%), Postives = 480/532 (90.23%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGADSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQLDC 60
MFGSRFSIFG G A+ V++S KSEV+PG +LRSDKDVYRPGDPVVVT+EI SSV QLDC
Sbjct: 1 MFGSRFSIFGIGAAAEKVEESVKSEVLPGFELRSDKDVYRPGDPVVVTIEICSSVAQLDC 60
Query: 61 SLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGSPK 120
SLLI+RL FEI GL+KLDAQWFSTQKP+ GS+QRRGEHVFMDCSVQS+VSNQ++ +G+ K
Sbjct: 61 SLLIERLRFEIIGLRKLDAQWFSTQKPIPGSRQRRGEHVFMDCSVQSIVSNQIISSGATK 120
Query: 121 SYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARIPL 180
SY VRTTLPS IPPSYKGATIRYMYYVKSTL GRWL QENGRSHKE LKD+I+MEAR+PL
Sbjct: 121 SYEVRTTLPSCIPPSYKGATIRYMYYVKSTLLGRWLTQENGRSHKELLKDQIEMEARVPL 180
Query: 181 QVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESSRD 240
QVWVTQKT+G+LME G+N+AFQMDVFWKEM+GDTDWIRANDIYD IDEGY+SSRD
Sbjct: 181 QVWVTQKTNGVLME-----EGQNEAFQMDVFWKEMKGDTDWIRANDIYDCIDEGYDSSRD 240
Query: 241 EISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLS-FSNAVRPRVSVAE 300
EISSVSSYNPMRE F RTFGSS SLQSS RSSI+DA FIEGERLS F N RPRVSVAE
Sbjct: 241 EISSVSSYNPMREPFHRTFGSSLSLQSSAGRSSIKDAPFIEGERLSLFPNVARPRVSVAE 300
Query: 301 VLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGFLRGRS 360
VLYDSADVA+ QKS A VSPSQAL F K+Q TDDD GV SSPM + +EPVASEGF+RGRS
Sbjct: 301 VLYDSADVASSQKSFAAVSPSQALSFEKNQLTDDDVGVASSPMPKIIEPVASEGFIRGRS 360
Query: 361 YNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSETVSHRF 420
YNIR+DDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHE+G+RRCLEVSITLETSETVS RF
Sbjct: 361 YNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEDGARRCLEVSITLETSETVSRRF 420
Query: 421 VHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRFEFFTT 480
VHPSRRNSPTIVKVQSDH+EVVADLIQTSFLFSIP++GPMSFSTPHVSL+WALRFEFFTT
Sbjct: 421 VHPSRRNSPTIVKVQSDHFEVVADLIQTSFLFSIPINGPMSFSTPHVSLQWALRFEFFTT 480
Query: 481 PKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPL 529
PKN+DWTRYEHPLLIE REKSEWILPITVHAPPSST A RN+RPF LEPL
Sbjct: 481 PKNVDWTRYEHPLLIETREKSEWILPITVHAPPSSTATAQNRNDRPFPLEPL 527
BLAST of MC07g0462 vs. ExPASy TrEMBL
Match:
A0A6J1KC17 (uncharacterized protein LOC111494236 OS=Cucurbita maxima OX=3661 GN=LOC111494236 PE=4 SV=1)
HSP 1 Score: 869 bits (2245), Expect = 3.48e-315
Identity = 445/541 (82.26%), Postives = 477/541 (88.17%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGA--DSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQL 60
MFGSRFSIFG+G A D V KSAKS+V PGLKLRSDKDVY PGDPVVVT+EISSSVPQ
Sbjct: 1 MFGSRFSIFGSGAAAAADKVQKSAKSDVFPGLKLRSDKDVYWPGDPVVVTIEISSSVPQF 60
Query: 61 DCSLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPAGS 120
DCSL I+RL FEI GLQKLDAQWFSTQKP+ GSKQRRGE +FMDCSVQS+VSNQ++ +G+
Sbjct: 61 DCSLFIERLRFEIIGLQKLDAQWFSTQKPIPGSKQRRGERIFMDCSVQSIVSNQIISSGA 120
Query: 121 PKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEARI 180
KSYVVRTTLPSRIPPSYK ATIRYMYYVKSTL GRWLIQENGRS KESLKD+I+MEAR+
Sbjct: 121 TKSYVVRTTLPSRIPPSYKSATIRYMYYVKSTLLGRWLIQENGRSRKESLKDQIEMEARV 180
Query: 181 PLQVWVTQKTSGMLMELEEGQNGRNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEGYESS 240
PLQVWVTQK SGMLME G+NDAFQMDVFWKEMEGDTDW+RANDIYDGIDEGY+SS
Sbjct: 181 PLQVWVTQKISGMLME-----EGQNDAFQMDVFWKEMEGDTDWVRANDIYDGIDEGYDSS 240
Query: 241 RDEISSVSSYNPMRESFRRTFGSSPSLQSSPVRS---SIRDASFIEGERLSFS-NAVRPR 300
RDEISSVSSYNPMRE F RTFGSS S QSS +S SI+D+ F EGERLS S N P
Sbjct: 241 RDEISSVSSYNPMREPFHRTFGSSLSFQSSAAKSGRFSIKDSPFTEGERLSLSSNVAHPG 300
Query: 301 VSVAEVLYDSADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEGF 360
+SVAEVLYDSADV +P KSSAV QAL F K+QS DDD GVPSSP +T EPVASEGF
Sbjct: 301 LSVAEVLYDSADVTSPPKSSAVGG--QALNFEKNQSKDDDAGVPSSPRLKTNEPVASEGF 360
Query: 361 LRGRSYNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSET 420
+RGRSYNIR+DDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEG+RRCLEVSI LETSET
Sbjct: 361 IRGRSYNIRVDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGARRCLEVSIALETSET 420
Query: 421 VSHRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALRF 480
VS RFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSL+WALRF
Sbjct: 421 VSRRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLQWALRF 480
Query: 481 EFFTTPKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAA--RNERPFSLEPLWMH 533
EFFTTPKN+DWTRYEHPL+IEGREKSEWILPI VHAPPSS+ AA RNER SL+PLWMH
Sbjct: 481 EFFTTPKNVDWTRYEHPLMIEGREKSEWILPINVHAPPSSSAAAQNRNERSLSLDPLWMH 534
BLAST of MC07g0462 vs. TAIR 10
Match:
AT1G50120.1 (FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Rgp1 (InterPro:IPR014848), Immunoglobulin E-set (InterPro:IPR014756); Has 144 Blast hits to 140 proteins in 61 species: Archae - 0; Bacteria - 0; Metazoa - 86; Fungi - 10; Plants - 39; Viruses - 0; Other Eukaryotes - 9 (source: NCBI BLink). )
HSP 1 Score: 610.9 bits (1574), Expect = 9.7e-175
Identity = 321/542 (59.23%), Postives = 403/542 (74.35%), Query Frame = 0
Query: 1 MFGSRFSIFGTGGGA---DSVDKSAKSEVVPGLKLRSDKDVYRPGDPVVVTLEISSSVPQ 60
M SRFS G G + DSV S S++ P L +++DKDVYRPGD + VT+E+++S
Sbjct: 1 MLSSRFSFLGIGSSSEVNDSVGVSG-SKIKPSLSVQTDKDVYRPGDSIFVTIEVANSHDN 60
Query: 61 L-DCSLLIDRLSFEIKGLQKLDAQWFSTQKPLHGSKQRRGEHVFMDCSVQSVVSNQMVPA 120
+ S+L+++LSFE+KGL+KLD QWFSTQKP GSK RRGEH+F+D S S++SNQ++
Sbjct: 61 ASNPSILVEKLSFEVKGLEKLDIQWFSTQKPSPGSKGRRGEHIFLDSSTPSLISNQILSP 120
Query: 121 GSPKSYVVRTTLPSRIPPSYKGATIRYMYYVKSTLQGRWLIQENGRSHKESLKDKIDMEA 180
G+ + +VR LP IPPSYKGAT+RY+YY+KSTL GRW+ EN + +K+S +D I++E
Sbjct: 121 GAKMTLMVRAILPQIIPPSYKGATLRYLYYIKSTLCGRWMALENSQFYKDSTQDFIEVET 180
Query: 181 RIPLQVWVTQKTSGMLMELEEGQNG--RNDAFQMDVFWKEMEGDTDWIRANDIYDGIDEG 240
RIPLQVWV QK +G+L+E E+ +G Q +++WKEM+GD++W RAND YD ++G
Sbjct: 181 RIPLQVWVIQKNNGLLLE-EDQIDGIVPTSTIQTEIYWKEMDGDSEWTRANDAYDSGEDG 240
Query: 241 YESSRDEISSVSSYNPMRESFRRTFGSSPSLQSSPVRSSIRDASFIEGERLSFSNAVRPR 300
Y+SSRDEISSVSSY P + + RTFGSS SL S P R S++D S++E S + +
Sbjct: 241 YDSSRDEISSVSSY-PNKSNLNRTFGSSLSLNSGP-RLSMKDTSYVEERVGSSPKMMLSQ 300
Query: 301 VSVAEVLYDS-ADVATPQKSSAVVSPSQALKFGKHQSTDDDPGVPSSPMARTVEPVASEG 360
+S A V YDS DV++P KSS V PSQ K + G SP A EPV SEG
Sbjct: 301 LSAAVVSYDSGTDVSSPHKSSNSVVPSQQPK------QTNGAGASMSPGAGAREPVPSEG 360
Query: 361 FLRGRSYNIRIDDQVLLRFCPKNSDSTYYFSDMIGGTLTFFHEEGSRRCLEVSITLETSE 420
F RGRSYNIR+DDQVLLRF PKN+DSTYYFSD IGGTLTFFHEEG+RRCLEVS+TLET E
Sbjct: 361 FTRGRSYNIRMDDQVLLRFSPKNADSTYYFSDTIGGTLTFFHEEGTRRCLEVSVTLETLE 420
Query: 421 TVSHRFVHPSRRNSPTIVKVQSDHYEVVADLIQTSFLFSIPMDGPMSFSTPHVSLKWALR 480
T++ RFVHPSRRNSPT+ KVQSDH+EVVADLIQTSFLFSIP DGPMSFSTP VS++W LR
Sbjct: 421 TINRRFVHPSRRNSPTLTKVQSDHHEVVADLIQTSFLFSIPTDGPMSFSTPRVSVQWILR 480
Query: 481 FEFFTTPKNLDWTRYEHPLLIEGREKSEWILPITVHAPPSSTPAARN--ERPFSLEPLWM 534
FEF TTPK++D +RYEHPLL+ REKSEW+LPITVHAPP T A+N ++ F LEP W+
Sbjct: 481 FEFLTTPKSVDLSRYEHPLLVPEREKSEWVLPITVHAPPPRTSGAQNRGDKLFGLEPSWI 532
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
XP_022155720.1 | 0.0 | 100.00 | uncharacterized protein LOC111022779 [Momordica charantia] | [more] |
XP_038904898.1 | 0.0 | 83.67 | uncharacterized protein LOC120091119 isoform X1 [Benincasa hispida] | [more] |
XP_011653362.1 | 0.0 | 83.80 | uncharacterized protein LOC101218523 [Cucumis sativus] >XP_031739927.1 uncharact... | [more] |
XP_008455604.1 | 0.0 | 83.40 | PREDICTED: uncharacterized protein LOC103495739 isoform X1 [Cucumis melo] >XP_01... | [more] |
KAG7013397.1 | 0.0 | 83.46 | hypothetical protein SDJN02_23563 [Cucurbita argyrosperma subsp. argyrosperma] | [more] |
Match Name | E-value | Identity | Description | |
A0A6J1DN77 | 0.0 | 100.00 | uncharacterized protein LOC111022779 OS=Momordica charantia OX=3673 GN=LOC111022... | [more] |
A0A1S4E0M0 | 0.0 | 83.40 | uncharacterized protein LOC103495739 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A6J1H364 | 0.0 | 82.71 | uncharacterized protein LOC111460073 OS=Cucurbita moschata OX=3662 GN=LOC1114600... | [more] |
A0A6J1L132 | 0.0 | 82.89 | uncharacterized protein LOC111499409 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1KC17 | 3.48e-315 | 82.26 | uncharacterized protein LOC111494236 OS=Cucurbita maxima OX=3661 GN=LOC111494236... | [more] |
Match Name | E-value | Identity | Description | |
AT1G50120.1 | 9.7e-175 | 59.23 | FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknow... | [more] |