Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, polypeptide, CDS, three_prime_UTR
CTTCACTCTCGGTCTCACTACACTCCTCTTCCATCTGCGATTTTACAGACCCGCCGCCGCCGCCTCCGTCTCTCCGCCGCTCGCCGCCCTCACTGGGCCTCCCCGTTCTCACTCGGTATACAATTCTCGTTCTCTCTCAAATCTCTCAGAGTCCTATTCAACATTTAAAAAAAAAACGATGGGCAGATTAGCCTGAACCAAACCGAACCTAAAACCGACCGGCCGAGTCAACTTTGGAGTATCGTCTTGAATGCTGTGGCTTCAGGAAAACTTGAGCTGACCTCTCTTTAGTTCCTTGGCTCTTGCAGTATAACTGAGGCGGGATTTCAAGAAGAATGGAACCAGAATCAATGGCGGATGAGCATGTAATGGAAGCTGAATATGGTTTTAACAGACCCACCAGTCGGAAAAGAAAGGCTGATACAGCTGCCGACTGTAATGATGATGGCCGGCGAGCGACGCTAATGAAAAGAATTACGTTGTCTTTGACAAAGCCATCGTTTGTTCTGGGGCTTGGACCGAAGATGGTAAGGGCTGAAAATCGAATTACACTGCGCAATGTCTTGCACAAACTTATGAGGCAGCAAAACTGGGTGGAAGCAAGTGGAGTATTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATTAAGAATCGTTTGAAGTATTCGGTAATATACACAATCCCCATTTGTTTATATTTCTACTCTTTTCGTTGATTAGTTTACAATCTCGTATGTTTCTAATGGAAAATGAGGTGAAGAAATTATAAACACCATATGTGGAGAATATACTTACTGTGGACGATTTAAATTTTGTGGTTCTGGCATCCTTTGTGATAGTAAGTACATTCCAAGCTTTCTTGTTGATGGAGATATGCCATGAGCTTTATGTATCTTTAATATTTGAACTTGTTTACTTAACTTTGTAGAATACTTTTGTGTATGTCTTAGATTACAAACAGAGATAACTGTAAAATACTGAATACTAGAGGAGGGGGGGGGGGGGGGTTAGCTAGGGGAGTTTCCTGAGCTCGAAGACTCTCAGGATGGCAAGCTGCAATGTGGTGGAGGAATATCTGGATTTTCTGTCTTTGGTGGTGGATCTGCTTCATACGGACCCTTAAAGTTGAGCATCCAGTCCTCTTTCAGTTCTATGTTCTTTCTTTCTTTACAGGATGGTAGTTTGCATTATTGAAGTAGGTATATTGTGCGTGTACTTTTTACATATAATTTGTGTGTTGTGTTGGGTCTTGGTGGTGTCACAATGACTCGCATTTGATTCAACTCATCTCAGTAAGAGACTCTAGCATTAATATACCAATTAGATTCCTATAGGATGAGATTTACTGAATATGACTCAGAACTCTTTTTAGGCTTCAATGGAGCTCCTTAAGCATATAGAAGGCGATCGTATGAGACCAAATAGGATCAAACATATTTATGACAACTGGATGAGGAAGAATGGATCAATGAAGCATTGCCCCGTTGAGGTATGTGTTGTAGAAATTCTCACTTGCATTATGCTGTCTTCAAGGGAACATTGCATATTCCACGATGATTTATCAGTTTTGCCTATTGTTGACAAGGTTAAATATATACTTACGAATCTACAATAACATTTAATCTCTGTTGTTGAAGTTAATAGTTTTATGCCATCCAATATGCCTGAAATTTGAATAAAATAGGAAGTATGACAAATATAAGTGGGAGCTCAATAAGGTATTAGGTATGTAATTAAGTGAATTTTCTCAATTTTCTTTGGCTTCACAACATGGATAGTTTTGTTGTTGACTAGTTAAAGAGTTATTTTAGCTGTAACCTCTTTGGTCATTTGCTTGGAGTTGGACCTGTTGTTCTTACATTTTTCTTTGGTTGTTGTATAAAATGGGCTTTCCCCTTTTCTTTTATACTTTTCACTATCAAGAAGACAACGATACAAGATGGAGGACATTGTAAACTCCCTGTATAAATTTTAGTTTTTCATAGAAGATT
CCTCAAAACCACTTGTATTAATCTCGAATCAGCATGGCTGTCAAGCAAGACTAAGGGCCTGTTTGGAATACATTTTAGTTTAAGTGTTTTTCAAGAATGCATTTTCTTTTAAATATTTTGTATACAAATTGTTTAGAATTTAAACACTCATATGTTTAGCTACATTTGCTCATGTGTTTTTATACACAAGTTATATATATTCAAAAACTCTTTTTAAAAAATTTGCTTTCAAGTGTTTTTCTCAAAATGGTTTATTTCTTAATTTCATTTTTTAATAAGTCATTTTTAGTGGTTGCCAAACAATAGAATTTTTTCAAAATATATTTTTTTTCAATAAACAGTTAAAAAGTTAAACCAAACATACCCTAAGACTTCCAATGTCAAAGTGTCAATCAGGTTTAAGCTCAAGCCTCGTTCATGGTTCTTGATTAGAATCTTGTTATCTTTCTTTATATGTTATAAAAGGAATTTCCTAGATGTGGTCTCATCTCATTCTATGGAGTCCCCACGTTTGATTTATTTATTTTATCTAACATCATCACCATCCTTATTATTTTTTGGAGAGGAGAGATTGATTTGTGTTTTACCATGTTACAAATTGACCTACAGGCATGGGTGGGAAACAAAAATCATGTTTATAGCAAACACATACCTTGAATCTACAATACAATAAAACTATACAACTTCCACACCACACCTTGGAAATTCACCTAAAGTTAATGACTTAAATTCATTCATTTGTCTTGCATATGGACTTATAGAAAGTTGTTATGATGGCAATCTGTTGTATTTTGTTTAATAATCTCCACTAAATTTTTCCTTTAGGTTTGTTCCTTGAACGTAGAATAGGATATAACCCCAAATTGGTTAGGATTTCTATCCTCTGAAGTATTTCACATTTATTTAGTACTTAAACAAGAAACAAGAAGCCCATGTTTTTGTTGAACTAAAAAAAAAAGATTCTAACTTCTCATTATTTTCTGCCTTGAGTTTTTATTTTTATTTTTAATAAGAAACCGAATTTTCATTGCTACCTTGAAAATGAATGCTACCTCAAATGTTAAAAAGTCATTGAGAAAGCAACTTGAAAATGGAATGCTCATTTAAAAAGCCTTACCACATAACGTAGACTACTTTCAATCTAATGAATTTTTGGCTGCAACTATATTAAAGCACCACAGAATACCAAAGTCTTCTCATTAAACAAGAAATAGAACAAACTGTGTTACTCCATTAGATCTCAAACTTTCAAATAGAGTGCACAAACTGTGTTACTCCATTAGATCTCAAGCTTGTTTCTTCAAACGGATAGCACCCAAGAGTCAACTTAGAGTAGCTTTCACATCATAGGAAAAGGCCCCTAGAAAGTGAAAAATCCCATAAAGATTTTTTTGGAGTTTCATAGTGAATGAGCTGTAAAAATGAAAAACAAGATAGTAAAGTGGCTTGCCAAGTTCTCCAGACACAGAACACACGAAGAGAAGGATAGGAAAATAATGGGGTTACGTTTTCCGGATAATGAACCTTGGCTCTGTCCTTGAGAGGATAGCCGATGAGAAATTGGTCTATCTTATGCATGTATCATTTAGAGAGGTTTTGTGTTTTCCCCGACTGTCTGCATGCTTATTCCAAAATTTCCATGGTTTTGCAGGACAGATTTATGGTCCATGTGGAATTTATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCACATCAGGCTGCTTTATGGTATGGTATTCAAACTGGGCATATAATTTGCGTACATGTGGCATGTTTGCAAGTTTTGTTTTAGTCATATCATTTGTCCACTTGGAGGGGGTCTCCAATTTTATGATTTTATGAATGGTTGAACCATGTATACGTTCATAATTAACAACCATTTCTATTTGAAATGCTAAAAGAACCAGCGAACACAATAAGGTATTGTTAGATGTTAAGGAATAACCAGAAATGAAAGGCTATAGTCAAACTAATTATGCTAATTTTCTTTTGTC
AGCCTCATGCAAGAGCATGAATCTGTGAATGATCCAATGTCAAATATGATTATAGGATTGACATTTCGGCAGCTGTGGTTCTCTACCATTCCAGAAGAGATTCAGTGGAGGGACTCTCTCCAGTTTCACTCCCCAATTCGATCAGATAGGTTGATTTCAAATTCAGATGGGTGTTCAGTCAGCAGCTCTCATGGAGATGGTGCCTCATATCGCAGTAATTCAGAGACTTCTGTCATGAATGATAAATTAGTTCATGTTGATAGCGAGGGGCACACAGAAGCTTCTTTTGAGGTCGATCATGACATAAAAGTGGAAAGTCAAAACTTTGAGGCACAAGATTTTTGTGTGAGTTCTGCAGAAAAAGATGAAAATGAAGCCTCTTTCTCAGATAATGGAGGTCATCAGTACTATGTTTCAATTTTTTCTGCTCTTGGTAAGTTTAAAATTATGTAACTATGGCATGTTGGTGTTTGCACCATTTTGTGTGGGGGGGGGGGACAATAGAATTTTTAGAGGGACCGAGAGACATCCTTGTGAGGTTTGGGCTAATATTACATTTTCTACTTCTTTATGGGCCTCAGTTTCTAAGGCTTTTTGTAATTACCCATTAGGTATGATTTTGATTAATTGGAAACCCTTTTCTGTAATTGGTTTCCCTTGCTTGGGGCTTGGTTTTTTTGGATGCCCTTGTATTCTTTCATTTATTCTCAATGAAAGCTCGGTTCTTTATAAAAAAAAAAGTGATTATTAAACATTCTTCTATTTATCATGCTACTCATAGTTGGATTCCTTACTCTGAAGGAAACTTTTGTGCTATCCTTTCATTTATTTATCTAGGTTACCTAAGCATGTGAAAATGTTTGAATATTTTTTTCCTTGTTGTTCTGTTTGGTAGTATTGCTAATAATTTGGTTTATATATATATATATATTCTATACTTCGTTTTGTTGTATAACTATGAGGGACTGACAAGCTTGTACACATATGACCTTCTGAACAAATAAGAATCAAAGAGCATGTACGTTGAGTATCTATGACCTAATTATTATTCTGATGTGATCATTTATATGGCTACAGAAGAAAACAATTTGCAAATGTGATCTTCTGTACATGATAATAATTTTAGCCCTAAGTAGACATCTATTCTCTAAAAAGTTTTCATGCATTTTTAATGATTTGAAAATGTCAGTTGATGGATTCACGATTACAAGTTAACTATATTTATCCTTTTTTTATGTTGCCACAGAGGGTTTGGATCCACTATTGTTGCCTCTACATTTGCCACCTTCCATTGAGAATTGGGAGAATGCCATTAGTTTATGCGGCCAGTTTCTGAATGACTATTATAAGGATGCAGTGAAGCACCTAAACCTTGCTCTTAACTCAAACCCCCCAATATTGGTTGCCTTACTTCCTCTTATACAGGTAAAACAAACCTAAGCCTTCAAATCTGTATTCAGTTTAAATGATTCTTTGATAATATTTTGTATCTTATAGGTTAGCACACTGGACTCTACAATTCTCAATATCTGATATGTATTATGGAAGCAGATATTAACTTTTCAAACAAGTCGAACATCAAACTAATTACATTCATATCCCAATATAGCAAGTTCTTTTAAATCCATCCATCCCTTGAAAGTTTTATGGGTAGATTTCTGACTAATTTCTCTGCATGATAATGTAGTTCTTATAACTAAATGTTGCCTTTGCCGGGAAATTGGAGGAACTCTTAATATGAAGGCCCATTTGATTGTCCGCTTCATTATTCCAATATCCATTTCGCTGTTGCCTGTTTCATTGTTCATGTCAATTGAAAAAAAATGAAAGTATATGTGTTCTTCTTTTTCTTGAATGTAGCCATTCTCATGTTTCTTATATATATAAAAAAAGAAAGTATATGTGGAGAATCCAAAATGCATCACTCACTCAAAATTCCAAGGCAAGTATGAATGGCTCTTTAATTCGCCATATAATCATAAAAAAAGGGAATGTTACCT
GTGTATTGTTGCCACCCTGAAACTATAGGGAAAGGGACATTCAGCGTGGTTGATCTAACCACCCAAAGGTTTTCTTATAAATTGTTTTTTCTTCCACCTAAGATTAGCTTTGAATTTGCTACAATCAAGTGTCTCTACTTTTGGATCTGAGAATCTTCTCCGCACTTGTTTTGCTAAGAGTGCCTCATTCCTCTTTCTTGGATGCTCAATTCCCGAACCTCTCTTTTCCATTTTCTTCTCTAACCAAACTTACATTAAGATGAATAGGACAGGCAAGACAAAAAGGAAGCTGGTTACAAGAGGGGTCAAAACAAGTAGACCAAAAACAAAAGAAAATTACTGTAAGATCTCTTGTCCTTGAAAACTACAAAACCCTCTAGTCATAAACATGTAAAGGCATTAAAGTGAGCCTAAATCCGAACTTCCACAAAGCCTTCTAACCCTTGAAATTTCTCTTGGACCCGCCAAAGAACCCTCCCCTTATCTCAGAAAATGTCTCTCAAGTTCTTCTACGGAAGGAAATTCTTCTTTTGATTCTCAACAGAAATAGACACTAAGGTATTTGAAAAAATGAGATAAAGTAGGTGACTTGAAAGTGCCGATTGTACAAAGCCACTTTTAAACATATAGAATATTCCACTCATGCATGACTCTTTTCCCAAATTTTGAGGATTTCAAATGTCCATCAGGTGAATCCCCAATTTTTTTGTGGGTAGTTTACTGCAAGCGCATCTAGTGATGCCTAGTATTGCTGTTCATTTTGCAATCATCAGCAATACTTGTTTTCTACTTGAAGTCAAGTTCAAAGATCTTCAAGGTTATAGAAAAAAATTGATTTTTGTCTATTATATTTGCAAATAAATAAGATATCAGTTGCAAACTAGAAGTATGGAAGCTGAGCATCATGTTTTCTCACCTTAAGATCCTCCAAGAGGCCTGATTTAACGGCCTGATCCAGAAATTTAATTGGGAAGTCACTGACTATTACAAGATTGGGGAAGCGTGGTGTCTTATGGACAACTACCTTTTTAGCTACTTTGTGGTACATTTGTGGCTTGAGAGAATTTTTAGGAGGTTGGAGATATCTTTGGAGGAAGTTGCTGGTCTTCAGATGTTCAACACCTCCCTTTGAGCGTCTATAACTAGATAATCTTGTAATTTTCCTTTTTTTTTTTTTTTCTTATTTTGAACAGTTGGAGCCCTTCCTGTAGAGGATTTTGCCTTCTATCTCGGCTCCCTTTGGCTTGGGCTAGTTTTATGTATTGCTCTCTTGTATTTCCTTTCATTTACTCTATGAAAGAACGGTTTTTAGCCTGCCCGGGGGGAGGACTTGTCATAAACCACCAAAAACTTGTCCTATGGGTTTTCCCATTATTAAATATTGAGAATGGGAAAACCCATAGGACAAATTTGCATTGTGAGTCGAACCAAATTTGTTGCTAGCCACACTGTAGAGTTGGGTGAGCTTGAGAGGCTGATTAGCAGCCTGACATTTTCCAAAAACAAATGAAAGTGTAGCTACCACAGGCCATAGCCAACCTTTATCATATCTTTTCCTTACAGCTGGAAAGGAAACTCGTAGTCATTTGTGTAGAGCCCATTCACTTAATTTTCTTGGGTACAAAATTTCCAAAGTAATGTTGAATGAAATGTAATTTTTTTTAAAACAAGAAACAAAAGTTTTTATTGAAAAAATGAAACGAGACTAATGCTCAAGAGAATTGAAAGATGTTGATTAACACAATTCAAGTAATATCCGAACCTCCTTGAGCCTTCAATACAAATAACGATTCATCACAGAATTGGTTGCTTTCCCTCACATTACCTTTCCTCAATTTTAATACTCCAAACCGCAGCTAACAAACTCCTCTATAATGACTTGAAAAATGAAAAATGAAACAATCAGGTGCTGTTTTGAAAATTTTCTTCTTACCTTACTAAATCTTCTGGGATGATCATTCACACAATCAATTTCTTTTATATGCTCATCTCTCTCTTTA
ATGTACAAATTTCAGTTGTTGTTGATTGGAGGTCGAGTTGACAAGGCACTCAATGAAATGGAAAATATATGTCGTGATTCAAATGCAGTACTTCCCTTCAGGTACTAAAATCTGCTCAAAAACCCTAAGTTCTCTATTTTGTATTTAGTGGTTGAAATTGAATATTTAAGAAATCATATGTGAAAGAGATTGGCAACTTTCTAGAAAGGAACAAAATAGCAAGAAGGTACAATCACGCATCATCTAATCTTTTCATGTAGATTTATAATTTCCTTTGTGTTTAATTATAAAAAAATTCTTTATTTTCTGTTGTTGACTCATTCGCAGTGAAAGTTGGTTTCTTTATGAAAATTACACCCGGAAAAGAAACATAAGAACTGAGTGTTATTCCGTCAATCGCTTATAACAGAGGTTTGAAGGCCTCTTACCTTTGAATGAATATTGATTTAGATATTCTGCAGATTGAGGGCTGCACTTGTGGAACATTTTGATCGTGGTAATGATGTCTTGCTTTCAACTTGTTATGAGCAAATACTGAAGAAGGATCCAACCTGTTGTCATTCACTGGGAAAACTTGTTCACATGCATAGAAACGGTATGTTAACTGCAGAACAATTTTCTATGATGAACTATTAGACCTCAAGGAGAGGTTCAAATTTGAGAATCTGAGATGATAATTTGTGGTTGAATTTCTCACATCATAGGTACGATTGATAATTCTTACCGTGAAAAGCTATCTATAATAAAATCATGTATTCTGGTTTCATTATCCAAACGTGAATTCTTCTTTTCCATTCTTTCTTGTTCATCCCAGCGAGTTATTTATAGAAACCTTGTCATATCATGAACGCAACAAGTTACCATTGTGGTTGCTTAACTTCTGGAAAACGGAACTCCCGGTTCATAGTTCTCAAATTTTTAGTCCTCAGCTTGACTGAATTTACACTTTTACATTATGGATAACATGAATTATTTCATGAAAAGTTGATAATATTTTCACATTAACATGGTTGTTGTATGTGGTGTGATGGAGATAATGTGCTTGCCTGTGATATAATTACTTGAAAACTTGTCTCGAAACTTTGTGTTCTTGTTGAATTTACTTTTTATTGCAATTGCAGGCAATTACAGTCTTGAATCTCTATTGGAAATGATAGCTTTGCATTTAGATGGCACATGTGCAGAATATGATACATGGAGAGAGTTGGCTATGTGTTTTCTTAAACTTTCTCAATTTGAAGAGGATAGAGTATCAACAGCATGTTCAATTGGGACTGAAGGGCATAAGCTGATGTCCTCATTGAATGTTAGCAGTAACCTTAAGTTGTTGACTGAAAAGAACTTGAGAAACACATGGAGATTGCGTTGTCGATGGTGGTTGACGCAGCATTTCGGTCATAAAATAACATCAGAAACTTTGGCTGGTATGTTCAAAGTTTGTGAACCTCTTGTTTTCCGTGCTGTGTTCGGTTCTTATTTTATTTTTATTTTTTTTTGACATGTGGGGTGGGGATTTGAACCTCTGACCTCTTGATCAATGGTTCAAACTTTATGCTAGTTGAGCTATGCTCTTGTTGGCGTTCGGTTCTTAGATTGCTGTTTGGTTTAAATGGTACAGTGATAATGTTAACTTAGAGCTAAACAAGTGTTACTTTAAAGTACTACTTATCCTATATATTTTCATGGTTTATTTTGGTCTTTAGAGCATTATGTGGATGCTTACTAAAAGAAAGACTACTTCGTTATTATATATATATATTTTGGCTAAAGAAAAATCGTTATTATTAAGTTCTGTATGTTGGGGGTGGGTACAATAAATATGAATGTTTGCCAACTTGCGCATAGCTTAACTAGTTAAGTTGTATACCTTCGACTAAAAGATTAGAGGTATACTCAATTGGTTAAGGCATATACCTTCGACCAAGAGGTTAGATGTATGCTTAACTGGTTCATGCATATATACCCTCGATCAAGAGATCAAATGTTCAAACCTCTTTACC
CACATGTTTAACCCTAAAAAATAAAATAGGACGAACGTTATTCAAACATGTATTCATATGAATTGAGATTCTCCATGAATGATGGTTGCTTTCTTTCCGTCTAATACAAATTAATAATTTTGGTCTTTGATGTATGCAACAGGTAATTTGGAGCTTTGGACTTACAAAGCAGCATGCGCAAGTCATATGTATGGTAGCAACTACAAATATGTGGGACAGGTTTACAACCTTTTAGAGAAGCAAAACGATAAGGATTTATTATTGTTTTTAAAGAGGCACATGAAGAATTCGTTTGGACTCCATTCTAAATTATAATCATGAGTTTCC
mRNA sequence
CTTCACTCTCGGTCTCACTACACTCCTCTTCCATCTGCGATTTTACAGACCCGCCGCCGCCGCCTCCGTCTCTCCGCCGCTCGCCGCCCTCACTGGGCCTCCCCGTTCTCACTCGTATAACTGAGGCGGGATTTCAAGAAGAATGGAACCAGAATCAATGGCGGATGAGCATGTAATGGAAGCTGAATATGGTTTTAACAGACCCACCAGTCGGAAAAGAAAGGCTGATACAGCTGCCGACTGTAATGATGATGGCCGGCGAGCGACGCTAATGAAAAGAATTACGTTGTCTTTGACAAAGCCATCGTTTGTTCTGGGGCTTGGACCGAAGATGGTAAGGGCTGAAAATCGAATTACACTGCGCAATGTCTTGCACAAACTTATGAGGCAGCAAAACTGGGTGGAAGCAAGTGGAGTATTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATTAAGAATCGTTTGAAGTATTCGGCTTCAATGGAGCTCCTTAAGCATATAGAAGGCGATCGTATGAGACCAAATAGGATCAAACATATTTATGACAACTGGATGAGGAAGAATGGATCAATGAAGCATTGCCCCGTTGAGGACAGATTTATGGTCCATGTGGAATTTATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCACATCAGGCTGCTTTATGCCTCATGCAAGAGCATGAATCTGTGAATGATCCAATGTCAAATATGATTATAGGATTGACATTTCGGCAGCTGTGGTTCTCTACCATTCCAGAAGAGATTCAGTGGAGGGACTCTCTCCAGTTTCACTCCCCAATTCGATCAGATAGGTTGATTTCAAATTCAGATGGGTGTTCAGTCAGCAGCTCTCATGGAGATGGTGCCTCATATCGCAGTAATTCAGAGACTTCTGTCATGAATGATAAATTAGTTCATGTTGATAGCGAGGGGCACACAGAAGCTTCTTTTGAGGTCGATCATGACATAAAAGTGGAAAGTCAAAACTTTGAGGCACAAGATTTTTGTGTGAGTTCTGCAGAAAAAGATGAAAATGAAGCCTCTTTCTCAGATAATGGAGGTCATCAGTACTATGTTTCAATTTTTTCTGCTCTTGAGGGTTTGGATCCACTATTGTTGCCTCTACATTTGCCACCTTCCATTGAGAATTGGGAGAATGCCATTAGTTTATGCGGCCAGTTTCTGAATGACTATTATAAGGATGCAGTGAAGCACCTAAACCTTGCTCTTAACTCAAACCCCCCAATATTGGTTGCCTTACTTCCTCTTATACAGTTGTTGTTGATTGGAGGTCGAGTTGACAAGGCACTCAATGAAATGGAAAATATATGTCGTGATTCAAATGCAGTACTTCCCTTCAGATTGAGGGCTGCACTTGTGGAACATTTTGATCGTGGTAATGATGTCTTGCTTTCAACTTGTTATGAGCAAATACTGAAGAAGGATCCAACCTGTTGTCATTCACTGGGAAAACTTGTTCACATGCATAGAAACGGCAATTACAGTCTTGAATCTCTATTGGAAATGATAGCTTTGCATTTAGATGGCACATGTGCAGAATATGATACATGGAGAGAGTTGGCTATGTGTTTTCTTAAACTTTCTCAATTTGAAGAGGATAGAGTATCAACAGCATGTTCAATTGGGACTGAAGGGCATAAGCTGATGTCCTCATTGAATGTTAGCAGTAACCTTAAGTTGTTGACTGAAAAGAACTTGAGAAACACATGGAGATTGCGTTGTCGATGGTGGTTGACGCAGCATTTCGGTCATAAAATAACATCAGAAACTTTGGCTGGTAATTTGGAGCTTTGGACTTACAAAGCAGCATGCGCAAGTCATATGTATGGTAGCAACTACAAATATGTGGGACAGGTTTACAACCTTTTAGAGAAGCAAAACGATAAGGATTTATTATTGTTTTTAAAGAGGCACATGAAGAATTCGTTTGGACTCCATTCTAAATTATAATCATGAGTTT
CC
Coding sequence (CDS)
ATGGAACCAGAATCAATGGCGGATGAGCATGTAATGGAAGCTGAATATGGTTTTAACAGACCCACCAGTCGGAAAAGAAAGGCTGATACAGCTGCCGACTGTAATGATGATGGCCGGCGAGCGACGCTAATGAAAAGAATTACGTTGTCTTTGACAAAGCCATCGTTTGTTCTGGGGCTTGGACCGAAGATGGTAAGGGCTGAAAATCGAATTACACTGCGCAATGTCTTGCACAAACTTATGAGGCAGCAAAACTGGGTGGAAGCAAGTGGAGTATTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATTAAGAATCGTTTGAAGTATTCGGCTTCAATGGAGCTCCTTAAGCATATAGAAGGCGATCGTATGAGACCAAATAGGATCAAACATATTTATGACAACTGGATGAGGAAGAATGGATCAATGAAGCATTGCCCCGTTGAGGACAGATTTATGGTCCATGTGGAATTTATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCACATCAGGCTGCTTTATGCCTCATGCAAGAGCATGAATCTGTGAATGATCCAATGTCAAATATGATTATAGGATTGACATTTCGGCAGCTGTGGTTCTCTACCATTCCAGAAGAGATTCAGTGGAGGGACTCTCTCCAGTTTCACTCCCCAATTCGATCAGATAGGTTGATTTCAAATTCAGATGGGTGTTCAGTCAGCAGCTCTCATGGAGATGGTGCCTCATATCGCAGTAATTCAGAGACTTCTGTCATGAATGATAAATTAGTTCATGTTGATAGCGAGGGGCACACAGAAGCTTCTTTTGAGGTCGATCATGACATAAAAGTGGAAAGTCAAAACTTTGAGGCACAAGATTTTTGTGTGAGTTCTGCAGAAAAAGATGAAAATGAAGCCTCTTTCTCAGATAATGGAGGTCATCAGTACTATGTTTCAATTTTTTCTGCTCTTGAGGGTTTGGATCCACTATTGTTGCCTCTACATTTGCCACCTTCCATTGAGAATTGGGAGAATGCCATTAGTTTATGCGGCCAGTTTCTGAATGACTATTATAAGGATGCAGTGAAGCACCTAAACCTTGCTCTTAACTCAAACCCCCCAATATTGGTTGCCTTACTTCCTCTTATACAGTTGTTGTTGATTGGAGGTCGAGTTGACAAGGCACTCAATGAAATGGAAAATATATGTCGTGATTCAAATGCAGTACTTCCCTTCAGATTGAGGGCTGCACTTGTGGAACATTTTGATCGTGGTAATGATGTCTTGCTTTCAACTTGTTATGAGCAAATACTGAAGAAGGATCCAACCTGTTGTCATTCACTGGGAAAACTTGTTCACATGCATAGAAACGGCAATTACAGTCTTGAATCTCTATTGGAAATGATAGCTTTGCATTTAGATGGCACATGTGCAGAATATGATACATGGAGAGAGTTGGCTATGTGTTTTCTTAAACTTTCTCAATTTGAAGAGGATAGAGTATCAACAGCATGTTCAATTGGGACTGAAGGGCATAAGCTGATGTCCTCATTGAATGTTAGCAGTAACCTTAAGTTGTTGACTGAAAAGAACTTGAGAAACACATGGAGATTGCGTTGTCGATGGTGGTTGACGCAGCATTTCGGTCATAAAATAACATCAGAAACTTTGGCTGGTAATTTGGAGCTTTGGACTTACAAAGCAGCATGCGCAAGTCATATGTATGGTAGCAACTACAAATATGTGGGACAGGTTTACAACCTTTTAGAGAAGCAAAACGATAAGGATTTATTATTGTTTTTAAAGAGGCACATGAAGAATTCGTTTGGACTCCATTCTAAATTATAA
Protein sequence
MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGLGPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELLKHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSVSSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVESQNFEAQDFCVSSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRLRCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLLLFLKRHMKNSFGLHSKL
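The relationship between the coding sequence and the protein sequence above can be checked with a short translation routine. The following is a minimal sketch, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and not part of the source page. Only the first and last few codons of the CDS are used as input here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS listed above.
print(translate("ATGGAACCAGAATCAATGGCG"))
print(translate("CATTCTAAATTATAA"))
```

Passing the full CDS string to `translate` should reproduce the complete protein sequence, provided the CDS is in frame and uses the standard code.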
Homology
BLAST of Lcy11g006020 vs. ExPASy TrEMBL
Match:
A0A6J1CPA4 (uncharacterized protein LOC111012919 OS=Momordica charantia OX=3673 GN=LOC111012919 PE=4 SV=1)
HSP 1 Score: 1060.1 bits (2740), Expect = 3.6e-306
Identity = 525/618 (84.95%), Positives = 563/618 (91.10%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE MAD+HV+EAEYGFN+P +RKRKAD AD DGRRATLMKR+TLSLTKPSFV+GL
Sbjct: 1 MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGL 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKMVR ENR+TLRNVL KL+RQQNWVEASGVLSMLLKGTLRDRSPIKNRLKY ASMELL
Sbjct: 61 GPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYLASMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKH+YDNWMRK GSMK+ P+EDRFMVHVEFILFCLEEGNTEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEH+SVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPI+ DR+ISNS GCSV
Sbjct: 181 CLMQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+SHGDGA Y+SNSETSVMNDKLVHVDSEGH E S EVD D+KVE+ QNFEA DF +SS
Sbjct: 241 SNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
AEK+ENEAS SDNGG+Q+YVSIFSALEGLDPLLLPLHLP SI+NWENAISLCG+FLN YY
Sbjct: 301 AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL+LALNSNPPILVALLPLIQLLLIGGRVDKAL E+E IC DSNA LPFRLRAAL
Sbjct: 361 KDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHL-DGTC 480
VEHFDR NDVLLS+CYEQILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHL DGTC
Sbjct: 421 VEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTC 480
Query: 481 AEYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWR 540
EYD WRELA+CFLKLSQ EEDRVSTACSIGT H LMSS N++SNLKLLTEK RNTWR
Sbjct: 481 VEYDKWRELALCFLKLSQSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWR 540
Query: 541 LRCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDL 600
LRCRWW T+HF HKI SETL GNLEL TYKAACA HMYGSN+KYV +VY+LLEKQ D+DL
Sbjct: 541 LRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL 600
Query: 601 LLFLKRHMKNSFGLHSKL 616
LFLK+H NSFGL +KL
Sbjct: 601 FLFLKKHKLNSFGLQAKL 618
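The percentages in an HSP summary line are the reported counts divided by the alignment length, which in BLAST output includes gap columns. The sketch below recomputes them from the summary line of the first hit; the pattern `FIELD` and the function `hsp_percentages` are hypothetical helpers written for this illustration, and the pattern accepts whatever label precedes the equals sign.

```python
import re

# Matches fields such as "Identity = 525/618 (84.95%)" in an HSP summary line.
FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")


def hsp_percentages(summary_line):
    """Return {label: (count, alignment_length, recomputed_percent)}."""
    result = {}
    for name, count, length, _reported in FIELD.findall(summary_line):
        count, length = int(count), int(length)
        result[name] = (count, length, round(100.0 * count / length, 2))
    return result


line = "Identity = 525/618 (84.95%), Positives = 563/618 (91.10%), Query Frame = 0"
stats = hsp_percentages(line)
for label, (count, length, percent) in stats.items():
    print(label, count, length, percent)
```

The "Query Frame" field carries no count and is skipped by the pattern.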
BLAST of Lcy11g006020 vs. ExPASy TrEMBL
Match:
A0A6J1FSH8 (uncharacterized protein LOC111446895 OS=Cucurbita moschata OX=3662 GN=LOC111446895 PE=4 SV=1)
HSP 1 Score: 1041.6 bits (2692), Expect = 1.3e-300
Identity = 519/617 (84.12%), Positives = 554/617 (89.79%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE + D VMEAEYG R T RKRK DTAAD ++DGRRA MK+ITL+LTKPSFVLG+
Sbjct: 1 MEPELVGDTLVMEAEYGSERSTGRKRKPDTAADGSNDGRRAAAMKKITLALTKPSFVLGI 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKM+RAENR TLRNVL KLM QQNWVEASGVLSMLLKGTLRDRSPI+NRLKYS SMELL
Sbjct: 61 GPKMLRAENRTTLRNVLRKLMMQQNWVEASGVLSMLLKGTLRDRSPIRNRLKYSVSMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKHIYDNWMRK GSMK PVEDRFMVHVEFILFCLEEG+TEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHIYDNWMRKIGSMKRWPVEDRFMVHVEFILFCLEEGSTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPIRSDR+I NSDGCSV
Sbjct: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWRDSLQYHSPIRSDRMILNSDGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+S GDGASY+S+SETSVM+ KL+HVDSEGHT ASFE DH IKVE+ Q FE DF SS
Sbjct: 241 SNSRGDGASYQSHSETSVMDHKLIHVDSEGHTGASFEDDHKIKVENDPQKFEPLDFYASS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
EKDENEASFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLCG+FLNDYY
Sbjct: 301 VEKDENEASFSDNGSYQHCVSIFSALEGLDPLLLPLHLPSSVENWENALSLCGEFLNDYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAAL
Sbjct: 361 KDAVKHLELALNSNPPILVALLPFIQLLLIGGRVDKALDEMENICRDSNATLPFRLRAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
VEHFD N +LLSTCYE+ILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA
Sbjct: 421 VEHFDHSNVLLLSTCYEKILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
Query: 481 EYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRL 540
EYDTWRELAMCFLKLSQ EEDRVS ACSIG+ GHKL SSLN++ NLKL TEKNLRN WRL
Sbjct: 481 EYDTWRELAMCFLKLSQIEEDRVSAACSIGSGGHKLRSSLNINCNLKLFTEKNLRNAWRL 540
Query: 541 RCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLL 600
RCRWWLT HF ITSET G LEL TYKAACA HMYGSN+KYV +VY+LL+KQNDK L+
Sbjct: 541 RCRWWLTHHFCCNITSETSDGTLELLTYKAACACHMYGSNHKYVVEVYSLLDKQNDKQLI 600
Query: 601 LFLKRHMKNSFGLHSKL 616
LFLK+H NSF LHSKL
Sbjct: 601 LFLKKHTNNSFQLHSKL 617
BLAST of Lcy11g006020 vs. ExPASy TrEMBL
Match:
A0A6J1IWZ8 (uncharacterized protein LOC111479331 OS=Cucurbita maxima OX=3661 GN=LOC111479331 PE=4 SV=1)
HSP 1 Score: 1039.6 bits (2687), Expect = 5.0e-300
Identity = 521/617 (84.44%), Positives = 555/617 (89.95%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE M D VMEAE+G R T RKRK DT AD ++DGRRA MK+ITL+LTKPSFVLG+
Sbjct: 1 MEPELMGDTLVMEAEHGSERSTGRKRKLDTEADGSNDGRRAAAMKKITLALTKPSFVLGI 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKM+RAENR TLRNVL KLM QQNWVEASGVLSMLLKGTLRDRSPI+NRLKYS SMELL
Sbjct: 61 GPKMLRAENRTTLRNVLRKLMMQQNWVEASGVLSMLLKGTLRDRSPIRNRLKYSVSMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKHIYDNWMRK GSMK PVEDRFMVHVEFILFCLEEG+TEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHIYDNWMRKIGSMKRWPVEDRFMVHVEFILFCLEEGSTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPIRSDR+I NSDGCSV
Sbjct: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWRDSLQYHSPIRSDRMILNSDGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+S GDGASY+S+SETSVM+ KL+HVDSEGHTEASFE DH IKVE+ Q FE DF VSS
Sbjct: 241 SNSRGDGASYQSHSETSVMDHKLIHVDSEGHTEASFEDDHKIKVENHPQKFEPLDFYVSS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
EKDENEASFSDNGG+Q+ VSIFSALEGLDPLLLPLHLPPS+ENWENA+SLCG+FLNDYY
Sbjct: 301 VEKDENEASFSDNGGYQHCVSIFSALEGLDPLLLPLHLPPSVENWENALSLCGEFLNDYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENIC DSNA LPFRL+AAL
Sbjct: 361 KDAVKHLELALNSNPPILVALLPFIQLLLIGGRVDKALDEMENICCDSNATLPFRLKAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
VEHFD N VLLSTCYE+ILKKDPTCCHSLGKLV MHRNGNYSLESLLEMIALHLDGT A
Sbjct: 421 VEHFDHSNVVLLSTCYEKILKKDPTCCHSLGKLVLMHRNGNYSLESLLEMIALHLDGTRA 480
Query: 481 EYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRL 540
EYDTWRELAMCFLKLSQ EEDRVS ACSIG+ GHKL SSLN++ NLKL TEKNLRN WRL
Sbjct: 481 EYDTWRELAMCFLKLSQIEEDRVSAACSIGSGGHKLRSSLNINCNLKLFTEKNLRNAWRL 540
Query: 541 RCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLL 600
RCRWWLT HF ITSET G LEL TYKAACA HMYGSN+KYV +VY+LL+KQNDK LL
Sbjct: 541 RCRWWLTHHFCCNITSETSDGTLELLTYKAACACHMYGSNHKYVVEVYSLLDKQNDKHLL 600
Query: 601 LFLKRHMKNSFGLHSKL 616
LFLK+HM NSF LHSKL
Sbjct: 601 LFLKKHMNNSFQLHSKL 617
BLAST of Lcy11g006020 vs. ExPASy TrEMBL
Match:
A0A1S3BS63 (uncharacterized protein LOC103492916 OS=Cucumis melo OX=3656 GN=LOC103492916 PE=4 SV=1)
HSP 1 Score: 1010.7 bits (2612), Expect = 2.5e-291
Identity = 507/621 (81.64%), Positives = 549/621 (88.41%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE MAD VME EYG P SRKRKAD AD N+D RRATLMKRI LSLTKPSFVLGL
Sbjct: 1 MEPEQMADRPVMEIEYGSIIPFSRKRKADPTADGNNDSRRATLMKRIKLSLTKPSFVLGL 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
PKMVRAENRITLRN LHKLMRQQNWVEASGVLSMLL+GTLRD SPI+NRLKYSASMELL
Sbjct: 61 APKMVRAENRITLRNALHKLMRQQNWVEASGVLSMLLQGTLRDNSPIRNRLKYSASMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRP+RI+HIYD WM+KNGS+KH P+EDRFMV +E+ILFCLEEG EDAHQ L
Sbjct: 121 KHIEGDRMRPDRIRHIYDIWMKKNGSLKHWPIEDRFMVQLEYILFCLEEGKMEDAHQETL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
LMQ ES NDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQ SPI SD +I NSDGCS+
Sbjct: 181 SLMQMPESANDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQLRSPIHSDGMILNSDGCSI 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVD---HDIKVESQ--NFEAQDFC 300
S+SHG GA SN+E+SVMNDK+VHVD EGHTEAS +VD H+IKVE+ NFEAQDFC
Sbjct: 241 SNSHGVGALSWSNTESSVMNDKVVHVDIEGHTEASLDVDHKIHNIKVENHPLNFEAQDFC 300
Query: 301 VSSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLN 360
VSSAEKDENEASFSDNGG+Q+YVSIFSALEGLDPLLLPL LPPSIENWENAISLCG+FLN
Sbjct: 301 VSSAEKDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLQLPPSIENWENAISLCGEFLN 360
Query: 361 DYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLR 420
DYYKDAVKHL LALNSNPPILVALLPLIQLLLIGGR+DKAL+EME C DSNA LPFRLR
Sbjct: 361 DYYKDAVKHLGLALNSNPPILVALLPLIQLLLIGGRIDKALDEMEKFCLDSNAALPFRLR 420
Query: 421 AALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDG 480
AALVEHFDR N+VLLSTCYEQ LKKDPTC HS+GKLV MHRNGNY+LESLLEMIALHLDG
Sbjct: 421 AALVEHFDRSNNVLLSTCYEQTLKKDPTCYHSMGKLVQMHRNGNYNLESLLEMIALHLDG 480
Query: 481 TCAEYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNT 540
T EYDTWRELA+CFLKL Q EEDRVSTACSIGT GHKL+SSL ++SN+KLLTEKN RNT
Sbjct: 481 TYPEYDTWRELAVCFLKLHQSEEDRVSTACSIGTGGHKLVSSLKINSNIKLLTEKNSRNT 540
Query: 541 WRLRCRWWLTQHFGHKITSE-TLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQND 600
WRLRCRWWLT+HFGH+IT E ++ GNLEL TYKAAC H+YG+N+KY VYNLL+KQND
Sbjct: 541 WRLRCRWWLTRHFGHEITPESSVVGNLELLTYKAACGCHLYGNNFKYAVDVYNLLDKQND 600
Query: 601 KDLLLFLKRHMKNSFGLHSKL 616
+DL LFLKRHMKN+FGL SKL
Sbjct: 601 RDLFLFLKRHMKNAFGLRSKL 621
BLAST of Lcy11g006020 vs. ExPASy TrEMBL
Match:
A0A0A0KXN5 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G089260 PE=4 SV=1)
HSP 1 Score: 736.1 bits (1899), Expect = 1.2e-208
Identity = 366/439 (83.37%), Positives = 397/439 (90.43%), Query Frame = 0
Query: 183 MQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSVSS 242
MQ ESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPI SD +I NSDGCS S+
Sbjct: 1 MQMPESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIHSDGMILNSDGCSTSN 60
Query: 243 SHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVD---HDIKVES--QNFEAQDFCVS 302
SHGDGASY S +ETSVMN KLV VDSEGHTEASF+VD H+IKVES QNFEAQDFCV
Sbjct: 61 SHGDGASYWSKTETSVMNGKLVQVDSEGHTEASFDVDHKIHNIKVESHPQNFEAQDFCVI 120
Query: 303 SAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDY 362
SAEKDENEASFSDNGG+Q+YVSIFSALEGLDPLLLPLHLPPSIENWENAISLCG+FLNDY
Sbjct: 121 SAEKDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGEFLNDY 180
Query: 363 YKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAA 422
YKDAVKHL+LALNSNPPILVALLPLIQLLLIGGR+DKAL+EME C DSNA LPFRLRAA
Sbjct: 181 YKDAVKHLDLALNSNPPILVALLPLIQLLLIGGRIDKALDEMEKFCLDSNAALPFRLRAA 240
Query: 423 LVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTC 482
LVEHFDR N+VLLSTCYEQ LKKDPTCCHS+GKLV MHRNGNY+LESLLEMIALHLDGT
Sbjct: 241 LVEHFDRSNNVLLSTCYEQTLKKDPTCCHSMGKLVQMHRNGNYNLESLLEMIALHLDGTY 300
Query: 483 AEYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWR 542
EYDTWRELA+CFL+L Q EEDRVS ACSIGT GHKL+SSLN++SN+KLLTEKN RNTWR
Sbjct: 301 PEYDTWRELAVCFLQLHQSEEDRVSRACSIGTGGHKLVSSLNINSNIKLLTEKNSRNTWR 360
Query: 543 LRCRWWLTQHFGHKITSET-LAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKD 602
LRCRWWLT+HFGHKIT ET + GNLEL TYKAAC H+YG+N+KY VY+LL++QN ++
Sbjct: 361 LRCRWWLTRHFGHKITPETSVVGNLELLTYKAACGFHLYGNNFKYAVDVYSLLDEQNYRN 420
Query: 603 LLLFLKRHMKNSFGLHSKL 616
L LFLKRHMKN+FGL SKL
Sbjct: 421 LFLFLKRHMKNAFGLRSKL 439
BLAST of Lcy11g006020 vs. NCBI nr
Match:
XP_022142927.1 (uncharacterized protein LOC111012919 [Momordica charantia])
HSP 1 Score: 1060.1 bits (2740), Expect = 7.4e-306
Identity = 525/618 (84.95%), Positives = 563/618 (91.10%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE MAD+HV+EAEYGFN+P +RKRKAD AD DGRRATLMKR+TLSLTKPSFV+GL
Sbjct: 1 MEPEKMADDHVVEAEYGFNKPRNRKRKADMVADGTSDGRRATLMKRMTLSLTKPSFVMGL 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKMVR ENR+TLRNVL KL+RQQNWVEASGVLSMLLKGTLRDRSPIKNRLKY ASMELL
Sbjct: 61 GPKMVRVENRVTLRNVLRKLLRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYLASMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKH+YDNWMRK GSMK+ P+EDRFMVHVEFILFCLEEGNTEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHVYDNWMRKIGSMKNWPIEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEH+SVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPI+ DR+ISNS GCSV
Sbjct: 181 CLMQEHDSVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQLDRMISNSFGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+SHGDGA Y+SNSETSVMNDKLVHVDSEGH E S EVD D+KVE+ QNFEA DF +SS
Sbjct: 241 SNSHGDGAPYQSNSETSVMNDKLVHVDSEGHRETSIEVDRDLKVENHPQNFEAHDFYMSS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
AEK+ENEAS SDNGG+Q+YVSIFSALEGLDPLLLPLHLP SI+NWENAISLCG+FLN YY
Sbjct: 301 AEKNENEASLSDNGGYQHYVSIFSALEGLDPLLLPLHLPHSIDNWENAISLCGEFLNGYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL+LALNSNPPILVALLPLIQLLLIGGRVDKAL E+E IC DSNA LPFRLRAAL
Sbjct: 361 KDAVKHLDLALNSNPPILVALLPLIQLLLIGGRVDKALKEVEKICHDSNAALPFRLRAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHL-DGTC 480
VEHFDR NDVLLS+CYEQILKKDPTCCHSLGKLV MHRNGNY+LESLLEMI LHL DGTC
Sbjct: 421 VEHFDRSNDVLLSSCYEQILKKDPTCCHSLGKLVDMHRNGNYTLESLLEMIVLHLDDGTC 480
Query: 481 AEYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWR 540
EYD WRELA+CFLKLSQ EEDRVSTACSIGT H LMSS N++SNLKLLTEK RNTWR
Sbjct: 481 VEYDKWRELALCFLKLSQSEEDRVSTACSIGTGEHNLMSSFNINSNLKLLTEKKSRNTWR 540
Query: 541 LRCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDL 600
LRCRWW T+HF HKI SETL GNLEL TYKAACA HMYGSN+KYV +VY+LLEKQ D+DL
Sbjct: 541 LRCRWWSTRHFSHKIASETLDGNLELLTYKAACACHMYGSNFKYVVEVYSLLEKQTDRDL 600
Query: 601 LLFLKRHMKNSFGLHSKL 616
LFLK+H NSFGL +KL
Sbjct: 601 FLFLKKHKLNSFGLQAKL 618
BLAST of Lcy11g006020 vs. NCBI nr
Match:
KAG7031054.1 (hypothetical protein SDJN02_05093 [Cucurbita argyrosperma subsp. argyrosperma])
HSP 1 Score: 1043.1 bits (2696), Expect = 9.4e-301
Identity = 520/617 (84.28%), Positives = 554/617 (89.79%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE M D VMEAEYG R T RKRK DTAAD ++DGRRA MK+ITL+LTKPSFVLG+
Sbjct: 1 MEPELMGDTLVMEAEYGSERSTGRKRKPDTAADGSNDGRRAAAMKKITLALTKPSFVLGI 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKM+RAENR TLRNVL KLM QQNWVEASGVLSMLLKGTLRDRSPI+NRLKYS SMELL
Sbjct: 61 GPKMLRAENRTTLRNVLRKLMMQQNWVEASGVLSMLLKGTLRDRSPIRNRLKYSVSMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKHIYDNWMRK GSMK PVEDRFMVHVEFILFCLEEG+TEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHIYDNWMRKIGSMKRWPVEDRFMVHVEFILFCLEEGSTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPIRSDR+I NSDGCSV
Sbjct: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWRDSLQYHSPIRSDRMILNSDGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+S GDGASY+S+SETSVM+ KL+HVDSEGHT ASFE DH IKVE+ Q FE DF SS
Sbjct: 241 SNSRGDGASYQSHSETSVMDHKLIHVDSEGHTGASFEDDHKIKVENDPQKFEPLDFYASS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
EKDENEASFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLCG+FLNDYY
Sbjct: 301 VEKDENEASFSDNGSYQHCVSIFSALEGLDPLLLPLHLPSSVENWENALSLCGEFLNDYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAAL
Sbjct: 361 KDAVKHLELALNSNPPILVALLPFIQLLLIGGRVDKALDEMENICRDSNATLPFRLRAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
VEHFD N +LLSTCYE+ILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA
Sbjct: 421 VEHFDHSNVLLLSTCYEKILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
Query: 481 EYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRL 540
EYDTWREL MCFLKLSQ EEDRVS ACSIG+ GHKL SSLN++ NLKL TEKNLRN WRL
Sbjct: 481 EYDTWRELDMCFLKLSQIEEDRVSAACSIGSGGHKLSSSLNINCNLKLFTEKNLRNAWRL 540
Query: 541 RCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLL 600
RCRWWLT HF ITSET G LEL TYKAACA HMYGSN+KYV +VY+LL+KQNDK L+
Sbjct: 541 RCRWWLTHHFCCNITSETSDGTLELLTYKAACACHMYGSNHKYVVEVYSLLDKQNDKQLI 600
Query: 601 LFLKRHMKNSFGLHSKL 616
LFLK+HM NSF LHSKL
Sbjct: 601 LFLKKHMNNSFQLHSKL 617
BLAST of Lcy11g006020 vs. NCBI nr
Match:
XP_022941583.1 (uncharacterized protein LOC111446895 [Cucurbita moschata])
HSP 1 Score: 1041.6 bits (2692), Expect = 2.7e-300
Identity = 519/617 (84.12%), Positives = 554/617 (89.79%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE + D VMEAEYG R T RKRK DTAAD ++DGRRA MK+ITL+LTKPSFVLG+
Sbjct: 1 MEPELVGDTLVMEAEYGSERSTGRKRKPDTAADGSNDGRRAAAMKKITLALTKPSFVLGI 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKM+RAENR TLRNVL KLM QQNWVEASGVLSMLLKGTLRDRSPI+NRLKYS SMELL
Sbjct: 61 GPKMLRAENRTTLRNVLRKLMMQQNWVEASGVLSMLLKGTLRDRSPIRNRLKYSVSMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKHIYDNWMRK GSMK PVEDRFMVHVEFILFCLEEG+TEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHIYDNWMRKIGSMKRWPVEDRFMVHVEFILFCLEEGSTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPIRSDR+I NSDGCSV
Sbjct: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWRDSLQYHSPIRSDRMILNSDGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+S GDGASY+S+SETSVM+ KL+HVDSEGHT ASFE DH IKVE+ Q FE DF SS
Sbjct: 241 SNSRGDGASYQSHSETSVMDHKLIHVDSEGHTGASFEDDHKIKVENDPQKFEPLDFYASS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
EKDENEASFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLCG+FLNDYY
Sbjct: 301 VEKDENEASFSDNGSYQHCVSIFSALEGLDPLLLPLHLPSSVENWENALSLCGEFLNDYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAAL
Sbjct: 361 KDAVKHLELALNSNPPILVALLPFIQLLLIGGRVDKALDEMENICRDSNATLPFRLRAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
VEHFD N +LLSTCYE+ILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA
Sbjct: 421 VEHFDHSNVLLLSTCYEKILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
Query: 481 EYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRL 540
EYDTWRELAMCFLKLSQ EEDRVS ACSIG+ GHKL SSLN++ NLKL TEKNLRN WRL
Sbjct: 481 EYDTWRELAMCFLKLSQIEEDRVSAACSIGSGGHKLRSSLNINCNLKLFTEKNLRNAWRL 540
Query: 541 RCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLL 600
RCRWWLT HF ITSET G LEL TYKAACA HMYGSN+KYV +VY+LL+KQNDK L+
Sbjct: 541 RCRWWLTHHFCCNITSETSDGTLELLTYKAACACHMYGSNHKYVVEVYSLLDKQNDKQLI 600
Query: 601 LFLKRHMKNSFGLHSKL 616
LFLK+H NSF LHSKL
Sbjct: 601 LFLKKHTNNSFQLHSKL 617
BLAST of Lcy11g006020 vs. NCBI nr
Match:
XP_023536647.1 (uncharacterized protein LOC111797853 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1040.8 bits (2690), Expect = 4.7e-300
Identity = 518/617 (83.95%), Positives = 555/617 (89.95%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE M D VMEAEYG R T RKRK DTA D ++DGRRA M++ITL+LTKPSFVLG+
Sbjct: 1 MEPELMGDTLVMEAEYGSERSTGRKRKPDTAVDGSNDGRRAAAMRKITLALTKPSFVLGI 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKM+RAENR TLRNVL KLM QQNWVEASGVLSMLLKGTLRDRSPI+NRLKYS SMELL
Sbjct: 61 GPKMLRAENRTTLRNVLRKLMMQQNWVEASGVLSMLLKGTLRDRSPIRNRLKYSVSMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKHIYDNWMRK GSMK PVEDRFMVHVEFILFCLEEG+TEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHIYDNWMRKIGSMKRWPVEDRFMVHVEFILFCLEEGSTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEH+SVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSP+RSDR+I NSDGCSV
Sbjct: 181 CLMQEHDSVNDPMSNMIIGLTFRQLWFSTLPEEIQWRDSLQYHSPVRSDRMILNSDGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+S GDGASY+S+SETSVM+ KL+ VDSEGHTEASFE DH IKVE+ Q FE DF VSS
Sbjct: 241 SNSRGDGASYQSHSETSVMDRKLIRVDSEGHTEASFEDDHKIKVENHPQKFEPLDFYVSS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
EKDENE SFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLCG+FLNDYY
Sbjct: 301 VEKDENEVSFSDNGDYQHCVSIFSALEGLDPLLLPLHLPSSVENWENALSLCGEFLNDYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAAL
Sbjct: 361 KDAVKHLELALNSNPPILVALLPFIQLLLIGGRVDKALDEMENICRDSNATLPFRLRAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
VEHFD N +LLSTCYE+ILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA
Sbjct: 421 VEHFDHSNVLLLSTCYEKILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
Query: 481 EYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRL 540
EYDTWRELAMCFLKLSQ EEDRVS ACSIG+ GHKL SSLN++ NLKL TEKNLRN WRL
Sbjct: 481 EYDTWRELAMCFLKLSQIEEDRVSAACSIGSGGHKLRSSLNINCNLKLFTEKNLRNAWRL 540
Query: 541 RCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLL 600
RCRWWLT HF ITSET G+LEL TYKAACA HMYGSN+KYV +VY+LL+KQNDK LL
Sbjct: 541 RCRWWLTHHFCCNITSETSDGSLELLTYKAACACHMYGSNHKYVVEVYSLLDKQNDKHLL 600
Query: 601 LFLKRHMKNSFGLHSKL 616
LFLK+HM NSF LHSKL
Sbjct: 601 LFLKKHMNNSFQLHSKL 617
BLAST of Lcy11g006020 vs. NCBI nr
Match:
XP_022979683.1 (uncharacterized protein LOC111479331 [Cucurbita maxima])
HSP 1 Score: 1039.6 bits (2687), Expect = 1.0e-299
Identity = 521/617 (84.44%), Positives = 555/617 (89.95%), Query Frame = 0
Query: 1 MEPESMADEHVMEAEYGFNRPTSRKRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGL 60
MEPE M D VMEAE+G R T RKRK DT AD ++DGRRA MK+ITL+LTKPSFVLG+
Sbjct: 1 MEPELMGDTLVMEAEHGSERSTGRKRKLDTEADGSNDGRRAAAMKKITLALTKPSFVLGI 60
Query: 61 GPKMVRAENRITLRNVLHKLMRQQNWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELL 120
GPKM+RAENR TLRNVL KLM QQNWVEASGVLSMLLKGTLRDRSPI+NRLKYS SMELL
Sbjct: 61 GPKMLRAENRTTLRNVLRKLMMQQNWVEASGVLSMLLKGTLRDRSPIRNRLKYSVSMELL 120
Query: 121 KHIEGDRMRPNRIKHIYDNWMRKNGSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAAL 180
KHIEGDRMRPNRIKHIYDNWMRK GSMK PVEDRFMVHVEFILFCLEEG+TEDAHQAAL
Sbjct: 121 KHIEGDRMRPNRIKHIYDNWMRKIGSMKRWPVEDRFMVHVEFILFCLEEGSTEDAHQAAL 180
Query: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIRSDRLISNSDGCSV 240
CLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPIRSDR+I NSDGCSV
Sbjct: 181 CLMQEHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWRDSLQYHSPIRSDRMILNSDGCSV 240
Query: 241 SSSHGDGASYRSNSETSVMNDKLVHVDSEGHTEASFEVDHDIKVES--QNFEAQDFCVSS 300
S+S GDGASY+S+SETSVM+ KL+HVDSEGHTEASFE DH IKVE+ Q FE DF VSS
Sbjct: 241 SNSRGDGASYQSHSETSVMDHKLIHVDSEGHTEASFEDDHKIKVENHPQKFEPLDFYVSS 300
Query: 301 AEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYY 360
EKDENEASFSDNGG+Q+ VSIFSALEGLDPLLLPLHLPPS+ENWENA+SLCG+FLNDYY
Sbjct: 301 VEKDENEASFSDNGGYQHCVSIFSALEGLDPLLLPLHLPPSVENWENALSLCGEFLNDYY 360
Query: 361 KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAAL 420
KDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENIC DSNA LPFRL+AAL
Sbjct: 361 KDAVKHLELALNSNPPILVALLPFIQLLLIGGRVDKALDEMENICCDSNATLPFRLKAAL 420
Query: 421 VEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCA 480
VEHFD N VLLSTCYE+ILKKDPTCCHSLGKLV MHRNGNYSLESLLEMIALHLDGT A
Sbjct: 421 VEHFDHSNVVLLSTCYEKILKKDPTCCHSLGKLVLMHRNGNYSLESLLEMIALHLDGTRA 480
Query: 481 EYDTWRELAMCFLKLSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNTWRL 540
EYDTWRELAMCFLKLSQ EEDRVS ACSIG+ GHKL SSLN++ NLKL TEKNLRN WRL
Sbjct: 481 EYDTWRELAMCFLKLSQIEEDRVSAACSIGSGGHKLRSSLNINCNLKLFTEKNLRNAWRL 540
Query: 541 RCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLL 600
RCRWWLT HF ITSET G LEL TYKAACA HMYGSN+KYV +VY+LL+KQNDK LL
Sbjct: 541 RCRWWLTHHFCCNITSETSDGTLELLTYKAACACHMYGSNHKYVVEVYSLLDKQNDKHLL 600
Query: 601 LFLKRHMKNSFGLHSKL 616
LFLK+HM NSF LHSKL
Sbjct: 601 LFLKKHMNNSFQLHSKL 617
BLAST of Lcy11g006020 vs. TAIR 10
Match:
AT1G53200.1 (unknown protein; Has 21 Blast hits to 21 proteins in 9 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 19; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 320.5 bits (820), Expect = 3.0e-87
Identity = 214/599 (35.73%), Positives = 328/599 (54.76%), Query Frame = 0
Query: 25 KRKADTAADCNDDGRRATLMKRITLSLTKPSFVLGLGPKMVRAENRITLRNVLHKLMRQQ 84
KR+ ++++ + D ++ KRI KPS++L +GPK R+E L +L +L+R +
Sbjct: 31 KRRRVSSSEIDSDTQK---YKRIQRCRAKPSYLLCIGPKSSRSEYLNRLPGLLRELLRNR 90
Query: 85 NWVEASGVLSMLLKGTLRDRSPIKNRLKYSASMELLKHIEGDRMRPNRIKHIYDNWMRKN 144
+W +AS VLS+L+KGT+ D P NRLKY A ++++ H E ++ + + I IYD W+ +
Sbjct: 91 HWNDASRVLSVLMKGTINDPCPKMNRLKYEAHIQIVSHSETNKNKADEIGRIYDTWIGQI 150
Query: 145 GSMKHCPVEDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQ 204
G E+R +V E I +E +A+ + LMQ + P +N+ IG++F +
Sbjct: 151 GKQHK---EERLLVWYEQICHFIEHEMNNEAYSTVISLMQNRDFAMLPRANLFIGISFYK 210
Query: 205 LWFSTIPEEIQWRD-----SLQFHSPIRSDRLISNSDGCSVSSSHGDGASYRSNSETSVM 264
+W + +E+Q D S+ S S L+ S S S R +SETSVM
Sbjct: 211 MWCNKFLKELQPEDADDNGSVSNISESGSGSLVECSGRDESVCSLASEVSARKDSETSVM 270
Query: 265 NDKLVHVDSEGHTEASFEVDHDIKVESQNF---EAQDFCVSSAEKDENEASFSDNGGHQY 324
+K V S +E +D +KV S + Q + +S +ENEAS D G ++
Sbjct: 271 KNKKVSHLSISDSET--RMDTKVKVMSTPYVTPPPQLYAIS----EENEASLGD-GIVEF 330
Query: 325 YVSIFSALEGLDPLLLPLHLPPSIENWENAISLCGQFLNDYYKDAVKHLNLALNSNPPI- 384
++ + L +DP LLP P + + ++ + YYK+AVK++ L S P +
Sbjct: 331 DPTVINILGDMDPWLLPFKPPEDPDCYRKIVN------DSYYKEAVKYMRQTLQSPPHVS 390
Query: 385 LVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYE 444
L AL PL+Q+LLIGG VD+A+ +E +C + V PFR++A ++E F R +D +L+ CYE
Sbjct: 391 LAALHPLVQILLIGGHVDEAMKVVEEMCNKIHDVKPFRIKALMMEKFHRNSD-MLAKCYE 450
Query: 445 QILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK-LS 504
ILK DP C +L KL+ M YS ESL EMIALH++ + E + W+ELA CF
Sbjct: 451 DILKIDPCCITTLKKLIGMCLEDEYSRESLTEMIALHVEASFPEPEIWKELASCFSNFFE 510
Query: 505 QFEEDRVSTACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNT-WRLRCRWWLTQHFG---- 564
+EDR+S C G+E + + +V N T K T W LR +WWL +HF
Sbjct: 511 NLDEDRLS-VCLDGSEDKRNPQTYSVRYN---PTPKTFTKTSWTLRAKWWLNRHFSPQML 570
Query: 565 -HKITSETLAGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLLLFLKRHMKN 608
+I + TL G+ E+ TYKAACAS++YG + YV +VY LL+ N ++ FL+ H N
Sbjct: 571 ETEIKNVTLTGDWEMMTYKAACASYIYGREFGYVTKVYELLKSSNKREFFKFLREHRVN 605
BLAST of Lcy11g006020 vs. TAIR 10
Match:
AT1G53200.2 (unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )
HSP 1 Score: 243.8 bits (621), Expect = 3.6e-64
Identity = 168/470 (35.74%), Positives = 251/470 (53.40%), Query Frame = 0
Query: 154 DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEE 213
+R +V E I +E +A+ + LMQ + P +N+ IG++F ++W + +E
Sbjct: 15 ERLLVWYEQICHFIEHEMNNEAYSTVISLMQNRDFAMLPRANLFIGISFYKMWCNKFLKE 74
Query: 214 IQWRD-----SLQFHSPIRSDRLISNSDGCSVSSSHGDGASYRSNSETSVMNDKLVHVDS 273
+Q D S+ S S L+ S S S R +SETSVM +K V S
Sbjct: 75 LQPEDADDNGSVSNISESGSGSLVECSGRDESVCSLASEVSARKDSETSVMKNKKVSHLS 134
Query: 274 EGHTEASFEVDHDIKVESQNF---EAQDFCVSSAEKDENEASFSDNGGHQYYVSIFSALE 333
+E +D +KV S + Q + +S +ENEAS D G ++ ++ + L
Sbjct: 135 ISDSET--RMDTKVKVMSTPYVTPPPQLYAIS----EENEASLGD-GIVEFDPTVINILG 194
Query: 334 GLDPLLLPLHLPPSIENWENAISLCGQFLNDYYKDAVKHLNLALNSNPPI-LVALLPLIQ 393
+DP LLP P + + ++ + YYK+AVK++ L S P + L AL PL+Q
Sbjct: 195 DMDPWLLPFKPPEDPDCYRKIVN------DSYYKEAVKYMRQTLQSPPHVSLAALHPLVQ 254
Query: 394 LLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTC 453
+LLIGG VD+A+ +E +C + V PFR++A ++E F R +D +L+ CYE ILK DP C
Sbjct: 255 ILLIGGHVDEAMKVVEEMCNKIHDVKPFRIKALMMEKFHRNSD-MLAKCYEDILKIDPCC 314
Query: 454 CHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK-LSQFEEDRVST 513
+L KL+ M YS ESL EMIALH++ + E + W+ELA CF +EDR+S
Sbjct: 315 ITTLKKLIGMCLEDEYSRESLTEMIALHVEASFPEPEIWKELASCFSNFFENLDEDRLS- 374
Query: 514 ACSIGTEGHKLMSSLNVSSNLKLLTEKNLRNT-WRLRCRWWLTQHFG-----HKITSETL 573
C G+E + + +V N T K T W LR +WWL +HF +I + TL
Sbjct: 375 VCLDGSEDKRNPQTYSVRYN---PTPKTFTKTSWTLRAKWWLNRHFSPQMLETEIKNVTL 434
Query: 574 AGNLELWTYKAACASHMYGSNYKYVGQVYNLLEKQNDKDLLLFLKRHMKN 608
G+ E+ TYKAACAS++YG + YV +VY LL+ N ++ FL+ H N
Sbjct: 435 TGDWEMMTYKAACASYIYGREFGYVTKVYELLKSSNKREFFKFLREHRVN 466
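Each hit above reports its identity and positives as a count over the aligned length, followed by a percentage. The percentages can be recomputed from the counts, for example 519/617 ≈ 84.12%. The following is a minimal Python sketch of that check, assuming the statistics line keeps the layout shown on this page. The function name `parse_hsp_stats` and the dictionary keys are illustrative and not part of any BLAST tool.

```python
import re

# Regular expression for the HSP statistics line in the layout used on this page.
# "Posi?tives" tolerates a missing "i" in the label (an assumption about input variants).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages, or None if the line does not match."""
    m = HSP_STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Percentages recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the report, kept for comparison.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 519/617 (84.12%), Positives = 554/617 (89.79%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed values with the reported ones is a quick way to catch a statistics line that was damaged during copy and paste.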
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description
A0A6J1CPA4 | 3.6e-306 | 84.95 | uncharacterized protein LOC111012919 OS=Momordica charantia OX=3673 GN=LOC111012...
A0A6J1FSH8 | 1.3e-300 | 84.12 | uncharacterized protein LOC111446895 OS=Cucurbita moschata OX=3662 GN=LOC1114468...
A0A6J1IWZ8 | 5.0e-300 | 84.44 | uncharacterized protein LOC111479331 OS=Cucurbita maxima OX=3661 GN=LOC111479331...
A0A1S3BS63 | 2.5e-291 | 81.64 | uncharacterized protein LOC103492916 OS=Cucumis melo OX=3656 GN=LOC103492916 PE=...
A0A0A0KXN5 | 1.2e-208 | 83.37 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G089260 PE=4 SV=1
Match Name | E-value | Identity | Description
XP_022142927.1 | 7.4e-306 | 84.95 | uncharacterized protein LOC111012919 [Momordica charantia]
KAG7031054.1 | 9.4e-301 | 84.28 | hypothetical protein SDJN02_05093 [Cucurbita argyrosperma subsp. argyrosperma]
XP_022941583.1 | 2.7e-300 | 84.12 | uncharacterized protein LOC111446895 [Cucurbita moschata]
XP_023536647.1 | 4.7e-300 | 83.95 | uncharacterized protein LOC111797853 [Cucurbita pepo subsp. pepo]
XP_022979683.1 | 1.0e-299 | 84.44 | uncharacterized protein LOC111479331 [Cucurbita maxima]
Match Name | E-value | Identity | Description
AT1G53200.1 | 3.0e-87 | 35.73 | unknown protein; Has 21 Blast hits to 21 proteins in 9 species: Archae - 0; Bact...
AT1G53200.2 | 3.6e-64 | 35.74 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...
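The summary rows above are pipe-delimited, with the match name, E-value, percent identity and description in the first four cells. As a rough sketch of how such rows could be loaded and ranked, the Python below splits each row, skips header lines, and picks the hit with the smallest E-value. The helper names `parse_summary_row` and `best_hit` are hypothetical, and the two sample rows are copied from the table with their truncated descriptions left as they appear.

```python
def parse_summary_row(row):
    """Parse one 'name | e-value | identity | description' row; return None for headers."""
    cells = [c.strip() for c in row.split("|")]
    if len(cells) < 4:
        return None
    name, evalue, identity, description = cells[:4]
    try:
        return {
            "name": name,
            "evalue": float(evalue),      # e.g. "3.0e-87"
            "identity": float(identity),  # percent identity
            "description": description,
        }
    except ValueError:
        # Header rows ("Match Name | E-value | ...") have non-numeric cells.
        return None


def best_hit(rows):
    """Return the parsed row with the smallest E-value, or None if nothing parses."""
    hits = [h for h in (parse_summary_row(r) for r in rows) if h is not None]
    return min(hits, key=lambda h: h["evalue"]) if hits else None


rows = [
    "Match Name | E-value | Identity | Description",
    "AT1G53200.1 | 3.0e-87 | 35.73 | unknown protein; Has 21 Blast hits to 21 proteins in 9 species: Archae - 0; Bact...",
    "AT1G53200.2 | 3.6e-64 | 35.74 | unknown protein; Has 35333 Blast hits to 34131 proteins in 2444 species: Archae ...",
]
top = best_hit(rows)
print(top["name"], top["evalue"])
```

Ranking by E-value rather than by percent identity matters here: AT1G53200.2 has a marginally higher identity, but over a shorter alignment, so its E-value is weaker.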