Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: exonCDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGCCGTACCAGAAAGTGAAGAGGTGCTTCATATTTTTCTTTGCTGTTTTCTGGATTTTGAAGATCGTGCAAGTTTCTTCTTATATGGTTATAATTATTGTCAGTAAATGTTCTATTGGCACTTGACAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTAGTGATTATGATGCAAGCCTTCCTATCAAGAAAAGGAGATTTCTGGTTGTGCAGTCTCCTCCATCTCCATCTAAAGAGATATCTTCATTTCATCCAGATGGAAATATAATGAAGACTGAACTGCCATCTATACTTAAAGACGTATCTTTGTTACATACCAATGAAAATTTTATAAAGAGTGAACAACCAGGTCTATCTATGATGACACTAGATTCAAGTTCAAGTGCAGTCACAAGTTCTGGGTTGTCAAACAGGAACCAGGATTGTGTTTCTGACAGGAACAAAGAAAAAACTGATACTGATTCATGCTATTTGGATATGGTCCAGAGCGATGTTGGAATGCCAGGAGTCAAGTTTCAGGAACCCAGTTTGGGAGGACGTGCTTCCTTTGATGGTTGTGTCGAATATGAAGATAAGTCCTTGGTAACTGAAAAACACGCCGTTCATGCATCACCAGAGATCCGTGGGCGGTTGGAGTTATCATCAACTAGCCTCGACTCTGACCCCCTTGCTGATAACAAAGAAGAAGAAATTGATGTAAAAAAGCCTAAAGAAAAGTGCAGCTCCCCAGTTCGTCAAGTTGAAGGAGGAGCTGAAGTATCCGTAGTTTTAATGGGACATGTTGATCCGAAATTGGTTCCTGAAGAGAGTGATTTGAATTTTTTGAAGCAGAATTCTTTGGAACCTGAGTTACTGTACCTGTCTTTGAACAAGCAAGGAAGTAGCACCCATTGTGTCAAAGGTAACGTAGGGTCTGATTATGATGGTTCTCCTTTGCGTTCAAACAGGGAAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGTCTGTGCAAGTGATGACCCTCCCGTGCATGTTCCAATTCAGACAAACACAATTGTCACTACACATAGATGCTCAACTGAAATGGTTAAAACTGATATTCTGCCAGGAAAACTAACCCTTTTAGATCACAATGATCATCTTCATCTAAGTCTTAGTTCATCTGATCTTAGACATGTAACGAGTCAGGAACAAAGTCCATTTGCTAAGTTAGATTTTAGGAAAACAAATCCTTCTATAAGTTTACCAGGCAGAGGTATGAAGTTTGATGATCTTAATGGTGCATTAAAAGTTGTAAAGCCAGAACCATTTGTTGAGGGCTCCAAACTTGAGTTTAGAAGTGATGAAACTAATGAGCTGGGATTACAAAATGGTACTGTAGTGAAGCGTGAATCAGATGATCAATGCAATCTTCAAATTCCCAATGCTTCAGATATTTGTAGTCCAATGGACATAGTTAAGGCTAAATCTACTAAATCCGAATCAGTTTATGAAAGCAAACAGGAAGCACTTAAAACATTAGGGAGTAGATTAGATCTGGTAGAAAAGCAAGTTGTTCCAGAAGTAGATAATTCTCGTCCTGTGGCAGAGATGTCAGGAACAACTGGAGATCTTGATTTGATTAGAGACACAGATTTGTCAAACCATCTAGAATTGCAAACTCCTTCTGAAGGACATCTTGATATGAAAGTGCACCAAGGTGAATATGGCTGTGGTGATGAACATGTCAATTCTGAAATGACTGATATAAGTGAGGATCCATGTTCCAAAGATTCTAGTAGTCCTTTTATAAAACCTGTAGCAACGCCTGTTGTGGCAGAGATGTTGGAAGCAGCTAGAAATCCTTCATGTTCAACTGATTTGATTATAGATGGAGACATGTCAAACCATTCAGAATTGTTAACTCCAACTGAAGGACCTCTTATTGGGAAAGTGCACCAAGGGGGATATGGTTGTGGTGGTGGGCTTATGAACTTAAAAATGATGGATGTAAGTAAGGATCCAGGTTCCAAAGATTCCACTAGTTGTGTTATAAAATCATCCATTATTGATGATAAAAATAAGAATAATCCCGAGTGGCGTCCTTTGAAACTCACAAATGAACAGTGCTCTAGTTTACAAGGAGGTGAGAAATGTTTGGTAAGTGATGAGGAAAAGATCAGTATATCAGCGGATTTATTAGAAGAAGACCCTTATAGTTCTGAATATGAATCAGATTGCAGGCATGATGTAAATGAGGCCATGGATGCAGTTGATATGGATATAGAAGAAGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGAAGATTCAAGTAGAAAGTAATGTATGTGAGCAAAAAGAAGTAAAAAGTTTTGATCATGGTGATTGTTGTAATGGGCTGCCTAGTATTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAAATAAATCAGACATTCATGATGTTAAACTAGAAGATAATCTTCATTCTGTAACTTCTAATCAATCTTCCGAGCAAGAAAAATTGAAGGAATTACTTGTGGAAGAGGATATAATTAGGGTGTGCTCGAACAAGGCCAACAAGGCTATAAAAGCCACAGATACAAGGCAAATGTCTCAGTGTGAGGAAATAGATGCCTTAGAGGACCAGGAAATTTCTTCTGAGAAAGCCACTAATGGAATTGAAGAACCGACTGTGAAAGTTTCTCAGAGTGATGTAGTGAATGATAAAACAGTAGATTTTGTGCAAAACGATGGCCTAGCTTTGCCAAATGTTAAAGAGCCTTTAACTAATGATGATGTGACTAATGATTTTACTTGTGGCAGTGTACATAGCCAGATTGTTAATGCATGTCAAGCTTCTACTTCTTCGTCTCCTAGTAAAACAAGATCTAATTTGGTGAGGTCAGTTTTAACACAAACTGATAGAGAACAAATACCTGACATGGGGCATGAAGGGGGGAAACTACAACCTCAAGGAAGGTGATTATTCATCACGACTTTTTTATTTTTTATTCTCAGAAATATAAATATATGTCAACCAAGCTTGTTTCTTAACTGCATTAACATTTTATTGATTACTTTCTGCATTGCTGATTTCTTTTTCATTTTCTAGAGATGAGCCATACAGTGACGTTTCCCAGAAATTTTATGTGAATAGACAGCAGAATCTATCACCCCAAACGAATTTTACTCGTAGGAGAGGTAGATTCACCATCCAGATAAACTCTCTCCAAGGTGAATGGGATTTTAATCCAACGATTTCCCCAGGAATTTACAATGATCAAATACCACCATATGATGCCCCAAAACGTAAATACATGTCTGTTGTTTCTGATGATGGCATTGATCAAAACTATTATAAAATTAAACCTGATGGTCCATTTCGTGGTTCTGGTCATCGAGGTAGACAAACATTAGATGATGAAGGGCCCTTTTTTTGTCATAGGAGGAAGTCTCCTGGTAGAAGAGATGGGCCTCCCGTGCGAGGCATTAAAATGGTACAAAGAATACCTAGAAATATCAGTCCAGGTAGATGCATTCGTGAGCCTGGGTCTGAATTGGTGGTTGGACCACGACACGGTGAAAAGTTTTTGAGGACTTTAGAAGATGAGACCATGGATCCAATTTATGCTCGCCCTCAACCTCCATTTGAAGGAGACAGACCTTTTATCCGAGATCGAAGGAACTTTTCTATCCAAAGAAAAAGTTTTCCTAGAGTTGATTCTAGATCTCCAGGACGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCGTCCAAAAGAAAGCATGATAGGTTTTTTGGACATCCAGAAATGGGACGTCGAAGTCCACCAGGTTACAGGATGAGATCTCCTGATCACTCTCCTCCAATCCATGGAGATATGCCTGTTCGAAGACATGGTTTTCCTTTTCCGTCACTGCCACCCAATGATTTGAGGGATATGGGCTCTGCTCGCGACCACGGACATATGAGATCAGGTATCCGAAGTAGGAATCGAACTGATAGAATACCCTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAGTGAATACTTTGATGGGCCTGTACATCCTGATCAATTGAATGAACTAGCTGGTGATGGTAACGATGGCGATCGAAGGAGGTTTCCTGATAGACATGAACATCTTCACCCATTTCAGCTGCCATGTAATGATTCTGATGGGGAAATTTATCATAATGATCTGAACGAAGGACACAGGCCTTTCAGATACTGTGCAGAGGATGAACCAGAATTTCATGAAAGAGGTAAGTTGAGGGAGAGGGAATTTGATAGACGTGTAAAAAACCAACCAGGAAATTTACCTAGACAAACCGGGGTTATTGAAGAACATGAAGTTGTTGAAGATTACCGGCATGGTCGGCAGTTCTGGAATGAACACCATGGCTTTGAAGAAGACATCTCACGGATGAAAAGAAAAAGATTTTGA
mRNA sequence
ATGGCCGTACCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTAGTGATTATGATGCAAGCCTTCCTATCAAGAAAAGGAGATTTCTGGTTGTGCAGTCTCCTCCATCTCCATCTAAAGAGATATCTTCATTTCATCCAGATGGAAATATAATGAAGACTGAACTGCCATCTATACTTAAAGACGTATCTTTGTTACATACCAATGAAAATTTTATAAAGAGTGAACAACCAGGTCTATCTATGATGACACTAGATTCAAGTTCAAGTGCAGTCACAAGTTCTGGGTTGTCAAACAGGAACCAGGATTGTGTTTCTGACAGGAACAAAGAAAAAACTGATACTGATTCATGCTATTTGGATATGGTCCAGAGCGATGTTGGAATGCCAGGAGTCAAGTTTCAGGAACCCAGTTTGGGAGGACGTGCTTCCTTTGATGGTTGTGTCGAATATGAAGATAAGTCCTTGGTAACTGAAAAACACGCCGTTCATGCATCACCAGAGATCCGTGGGCGGTTGGAGTTATCATCAACTAGCCTCGACTCTGACCCCCTTGCTGATAACAAAGAAGAAGAAATTGATGTAAAAAAGCCTAAAGAAAAGTGCAGCTCCCCAGTTCGTCAAGTTGAAGGAGGAGCTGAAGTATCCGTAGTTTTAATGGGACATGTTGATCCGAAATTGGTTCCTGAAGAGAGTGATTTGAATTTTTTGAAGCAGAATTCTTTGGAACCTGAGTTACTGTACCTGTCTTTGAACAAGCAAGGAAGTAGCACCCATTGTGTCAAAGGTAACGTAGGGTCTGATTATGATGGTTCTCCTTTGCGTTCAAACAGGGAAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGTCTGTGCAAGTGATGACCCTCCCGTGCATGTTCCAATTCAGACAAACACAATTGTCACTACACATAGATGCTCAACTGAAATGGTTAAAACTGATATTCTGCCAGGAAAACTAACCCTTTTAGATCACAATGATCATCTTCATCTAAGTCTTAGTTCATCTGATCTTAGACATGTAACGAGTCAGGAACAAAGTCCATTTGCTAAGTTAGATTTTAGGAAAACAAATCCTTCTATAAGTTTACCAGGCAGAGGTATGAAGTTTGATGATCTTAATGGTGCATTAAAAGTTGTAAAGCCAGAACCATTTGTTGAGGGCTCCAAACTTGAGTTTAGAAGTGATGAAACTAATGAGCTGGGATTACAAAATGGTACTGTAGTGAAGCGTGAATCAGATGATCAATGCAATCTTCAAATTCCCAATGCTTCAGATATTTGTAGTCCAATGGACATAGTTAAGGCTAAATCTACTAAATCCGAATCAGTTTATGAAAGCAAACAGGAAGCACTTAAAACATTAGGGAGTAGATTAGATCTGGTAGAAAAGCAAGTTGTTCCAGAAGTAGATAATTCTCGTCCTGTGGCAGAGATGTCAGGAACAACTGGAGATCTTGATTTGATTAGAGACACAGATTTGTCAAACCATCTAGAATTGCAAACTCCTTCTGAAGGACATCTTGATATGAAAGTGCACCAAGGTGAATATGGCTGTGGTGATGAACATGTCAATTCTGAAATGACTGATATAAGTGAGGATCCATGTTCCAAAGATTCTAGTAGTCCTTTTATAAAACCTGTAGCAACGCCTGTTGTGGCAGAGATGTTGGAAGCAGCTAGAAATCCTTCATGTTCAACTGATTTGATTATAGATGGAGACATGTCAAACCATTCAGAATTGTTAACTCCAACTGAAGGACCTCTTATTGGGAAAGTGCACCAAGGGGGATATGGTTGTGGTGGTGGGCTTATGAACTTAAAAATGATGGATGTAAGTAAGGATCCAGGTTCCAAAGATTCCACTAGTTGTGTTATAAAATCATCCATTATTGATGATAAAAATAAGAATAATCCCGAGTGGCGTCCTTTGAAACTCACAAATGAACAGTGCTCTAGTTTACAAGGAGGTGAGAAATGTTTGGTAAGTGATGAGGAAAAGATCAGTATATCAGCGGATTTATTAGAAGAAGACCCTTATAGTTCTGAATATGAATCAGATTGCAGGCATGATGTAAATGAGGCCATGGATGCAGTTGATATGGATATAGAAGAAGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGAAGATTCAAGTAGAAAGTAATGTATGTGAGCAAAAAGAAGTAAAAAGTTTTGATCATGGTGATTGTTGTAATGGGCTGCCTAGTATTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAAATAAATCAGACATTCATGATGTTAAACTAGAAGATAATCTTCATTCTGTAACTTCTAATCAATCTTCCGAGCAAGAAAAATTGAAGGAATTACTTGTGGAAGAGGATATAATTAGGGTGTGCTCGAACAAGGCCAACAAGGCTATAAAAGCCACAGATACAAGGCAAATGTCTCAGTGTGAGGAAATAGATGCCTTAGAGGACCAGGAAATTTCTTCTGAGAAAGCCACTAATGGAATTGAAGAACCGACTGTGAAAGTTTCTCAGAGTGATGTAGTGAATGATAAAACAGTAGATTTTGTGCAAAACGATGGCCTAGCTTTGCCAAATGTTAAAGAGCCTTTAACTAATGATGATGTGACTAATGATTTTACTTGTGGCAGTGTACATAGCCAGATTGTTAATGCATGTCAAGCTTCTACTTCTTCGTCTCCTAGTAAAACAAGATCTAATTTGGTGAGGTCAGTTTTAACACAAACTGATAGAGAACAAATACCTGACATGGGGCATGAAGGGGGGAAACTACAACCTCAAGGAAGAGATGAGCCATACAGTGACGTTTCCCAGAAATTTTATGTGAATAGACAGCAGAATCTATCACCCCAAACGAATTTTACTCGTAGGAGAGGTAGATTCACCATCCAGATAAACTCTCTCCAAGGTGAATGGGATTTTAATCCAACGATTTCCCCAGGAATTTACAATGATCAAATACCACCATATGATGCCCCAAAACGTAAATACATGTCTGTTGTTTCTGATGATGGCATTGATCAAAACTATTATAAAATTAAACCTGATGGTCCATTTCGTGGTTCTGGTCATCGAGGTAGACAAACATTAGATGATGAAGGGCCCTTTTTTTGTCATAGGAGGAAGTCTCCTGGTAGAAGAGATGGGCCTCCCGTGCGAGGCATTAAAATGGTACAAAGAATACCTAGAAATATCAGTCCAGGTAGATGCATTCGTGAGCCTGGGTCTGAATTGGTGGTTGGACCACGACACGGTGAAAAGTTTTTGAGGACTTTAGAAGATGAGACCATGGATCCAATTTATGCTCGCCCTCAACCTCCATTTGAAGGAGACAGACCTTTTATCCGAGATCGAAGGAACTTTTCTATCCAAAGAAAAAGTTTTCCTAGAGTTGATTCTAGATCTCCAGGACGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCGTCCAAAAGAAAGCATGATAGGTTTTTTGGACATCCAGAAATGGGACGTCGAAGTCCACCAGGTTACAGGATGAGATCTCCTGATCACTCTCCTCCAATCCATGGAGATATGCCTGTTCGAAGACATGGTTTTCCTTTTCCGTCACTGCCACCCAATGATTTGAGGGATATGGGCTCTGCTCGCGACCACGGACATATGAGATCAGGTATCCGAAGTAGGAATCGAACTGATAGAATACCCTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAGTGAATACTTTGATGGGCCTGTACATCCTGATCAATTGAATGAACTAGCTGGTGATGGTAACGATGGCGATCGAAGGAGGTTTCCTGATAGACATGAACATCTTCACCCATTTCAGCTGCCATGTAATGATTCTGATGGGGAAATTTATCATAATGATCTGAACGAAGGACACAGGCCTTTCAGATACTGTGCAGAGGATGAACCAGAATTTCATGAAAGAGGTAAGTTGAGGGAGAGGGAATTTGATAGACGTGTAAAAAACCAACCAGGAAATTTACCTAGACAAACCGGGGTTATTGAAGAACATGAAGTTGTTGAAGATTACCGGCATGGTCGGCAGTTCTGGAATGAACACCATGGCTTTGAAGAAGACATCTCACGGATGAAAAGAAAAAGATTTTGA
Coding sequence (CDS)
ATGGCCGTACCAGAAAGTGAAGAGGTTGGTTTTAAGCGCATTGGGTTGTCAGCTAGTGATTATGATGCAAGCCTTCCTATCAAGAAAAGGAGATTTCTGGTTGTGCAGTCTCCTCCATCTCCATCTAAAGAGATATCTTCATTTCATCCAGATGGAAATATAATGAAGACTGAACTGCCATCTATACTTAAAGACGTATCTTTGTTACATACCAATGAAAATTTTATAAAGAGTGAACAACCAGGTCTATCTATGATGACACTAGATTCAAGTTCAAGTGCAGTCACAAGTTCTGGGTTGTCAAACAGGAACCAGGATTGTGTTTCTGACAGGAACAAAGAAAAAACTGATACTGATTCATGCTATTTGGATATGGTCCAGAGCGATGTTGGAATGCCAGGAGTCAAGTTTCAGGAACCCAGTTTGGGAGGACGTGCTTCCTTTGATGGTTGTGTCGAATATGAAGATAAGTCCTTGGTAACTGAAAAACACGCCGTTCATGCATCACCAGAGATCCGTGGGCGGTTGGAGTTATCATCAACTAGCCTCGACTCTGACCCCCTTGCTGATAACAAAGAAGAAGAAATTGATGTAAAAAAGCCTAAAGAAAAGTGCAGCTCCCCAGTTCGTCAAGTTGAAGGAGGAGCTGAAGTATCCGTAGTTTTAATGGGACATGTTGATCCGAAATTGGTTCCTGAAGAGAGTGATTTGAATTTTTTGAAGCAGAATTCTTTGGAACCTGAGTTACTGTACCTGTCTTTGAACAAGCAAGGAAGTAGCACCCATTGTGTCAAAGGTAACGTAGGGTCTGATTATGATGGTTCTCCTTTGCGTTCAAACAGGGAAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGTCTGTGCAAGTGATGACCCTCCCGTGCATGTTCCAATTCAGACAAACACAATTGTCACTACACATAGATGCTCAACTGAAATGGTTAAAACTGATATTCTGCCAGGAAAACTAACCCTTTTAGATCACAATGATCATCTTCATCTAAGTCTTAGTTCATCTGATCTTAGACATGTAACGAGTCAGGAACAAAGTCCATTTGCTAAGTTAGATTTTAGGAAAACAAATCCTTCTATAAGTTTACCAGGCAGAGGTATGAAGTTTGATGATCTTAATGGTGCATTAAAAGTTGTAAAGCCAGAACCATTTGTTGAGGGCTCCAAACTTGAGTTTAGAAGTGATGAAACTAATGAGCTGGGATTACAAAATGGTACTGTAGTGAAGCGTGAATCAGATGATCAATGCAATCTTCAAATTCCCAATGCTTCAGATATTTGTAGTCCAATGGACATAGTTAAGGCTAAATCTACTAAATCCGAATCAGTTTATGAAAGCAAACAGGAAGCACTTAAAACATTAGGGAGTAGATTAGATCTGGTAGAAAAGCAAGTTGTTCCAGAAGTAGATAATTCTCGTCCTGTGGCAGAGATGTCAGGAACAACTGGAGATCTTGATTTGATTAGAGACACAGATTTGTCAAACCATCTAGAATTGCAAACTCCTTCTGAAGGACATCTTGATATGAAAGTGCACCAAGGTGAATATGGCTGTGGTGATGAACATGTCAATTCTGAAATGACTGATATAAGTGAGGATCCATGTTCCAAAGATTCTAGTAGTCCTTTTATAAAACCTGTAGCAACGCCTGTTGTGGCAGAGATGTTGGAAGCAGCTAGAAATCCTTCATGTTCAACTGATTTGATTATAGATGGAGACATGTCAAACCATTCAGAATTGTTAACTCCAACTGAAGGACCTCTTATTGGGAAAGTGCACCAAGGGGGATATGGTTGTGGTGGTGGGCTTATGAACTTAAAAATGATGGATGTAAGTAAGGATCCAGGTTCCAAAGATTCCACTAGTTGTGTTATAAAATCATCCATTATTGATGATAAAAATAAGAATAATCCCGAGTGGCGTCCTTTGAAACTCACAAATGAACAGTGCTCTAGTTTACAAGGAGGTGAGAAATGTTTGGTAAGTGATGAGGAAAAGATCAGTATATCAGCGGATTTATTAGAAGAAGACCCTTATAGTTCTGAATATGAATCAGATTGCAGGCATGATGTAAATGAGGCCATGGATGCAGTTGATATGGATATAGAAGAAGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGAAGATTCAAGTAGAAAGTAATGTATGTGAGCAAAAAGAAGTAAAAAGTTTTGATCATGGTGATTGTTGTAATGGGCTGCCTAGTATTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAAATAAATCAGACATTCATGATGTTAAACTAGAAGATAATCTTCATTCTGTAACTTCTAATCAATCTTCCGAGCAAGAAAAATTGAAGGAATTACTTGTGGAAGAGGATATAATTAGGGTGTGCTCGAACAAGGCCAACAAGGCTATAAAAGCCACAGATACAAGGCAAATGTCTCAGTGTGAGGAAATAGATGCCTTAGAGGACCAGGAAATTTCTTCTGAGAAAGCCACTAATGGAATTGAAGAACCGACTGTGAAAGTTTCTCAGAGTGATGTAGTGAATGATAAAACAGTAGATTTTGTGCAAAACGATGGCCTAGCTTTGCCAAATGTTAAAGAGCCTTTAACTAATGATGATGTGACTAATGATTTTACTTGTGGCAGTGTACATAGCCAGATTGTTAATGCATGTCAAGCTTCTACTTCTTCGTCTCCTAGTAAAACAAGATCTAATTTGGTGAGGTCAGTTTTAACACAAACTGATAGAGAACAAATACCTGACATGGGGCATGAAGGGGGGAAACTACAACCTCAAGGAAGAGATGAGCCATACAGTGACGTTTCCCAGAAATTTTATGTGAATAGACAGCAGAATCTATCACCCCAAACGAATTTTACTCGTAGGAGAGGTAGATTCACCATCCAGATAAACTCTCTCCAAGGTGAATGGGATTTTAATCCAACGATTTCCCCAGGAATTTACAATGATCAAATACCACCATATGATGCCCCAAAACGTAAATACATGTCTGTTGTTTCTGATGATGGCATTGATCAAAACTATTATAAAATTAAACCTGATGGTCCATTTCGTGGTTCTGGTCATCGAGGTAGACAAACATTAGATGATGAAGGGCCCTTTTTTTGTCATAGGAGGAAGTCTCCTGGTAGAAGAGATGGGCCTCCCGTGCGAGGCATTAAAATGGTACAAAGAATACCTAGAAATATCAGTCCAGGTAGATGCATTCGTGAGCCTGGGTCTGAATTGGTGGTTGGACCACGACACGGTGAAAAGTTTTTGAGGACTTTAGAAGATGAGACCATGGATCCAATTTATGCTCGCCCTCAACCTCCATTTGAAGGAGACAGACCTTTTATCCGAGATCGAAGGAACTTTTCTATCCAAAGAAAAAGTTTTCCTAGAGTTGATTCTAGATCTCCAGGACGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCGTCCAAAAGAAAGCATGATAGGTTTTTTGGACATCCAGAAATGGGACGTCGAAGTCCACCAGGTTACAGGATGAGATCTCCTGATCACTCTCCTCCAATCCATGGAGATATGCCTGTTCGAAGACATGGTTTTCCTTTTCCGTCACTGCCACCCAATGATTTGAGGGATATGGGCTCTGCTCGCGACCACGGACATATGAGATCAGGTATCCGAAGTAGGAATCGAACTGATAGAATACCCTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAGTGAATACTTTGATGGGCCTGTACATCCTGATCAATTGAATGAACTAGCTGGTGATGGTAACGATGGCGATCGAAGGAGGTTTCCTGATAGACATGAACATCTTCACCCATTTCAGCTGCCATGTAATGATTCTGATGGGGAAATTTATCATAATGATCTGAACGAAGGACACAGGCCTTTCAGATACTGTGCAGAGGATGAACCAGAATTTCATGAAAGAGGTAAGTTGAGGGAGAGGGAATTTGATAGACGTGTAAAAAACCAACCAGGAAATTTACCTAGACAAACCGGGGTTATTGAAGAACATGAAGTTGTTGAAGATTACCGGCATGGTCGGCAGTTCTGGAATGAACACCATGGCTTTGAAGAAGACATCTCACGGATGAAAAGAAAAAGATTTTGA
Protein sequence
MAVPESEEVGFKRIGLSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENFIKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGVKFQEPSLGGRASFDGCVEYEDKSLVTEKHAVHASPEIRGRLELSSTSLDSDPLADNKEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLYLSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVPIQTNTIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTNPSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCNLQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRPVAEMSGTTGDLDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTDISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLIGKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQCSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDGEVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSIDCSSLVFVKQENKSDIHDVKLEDNLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEISSEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQIVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFYVNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQIPPYDAPKRKYMSVVSDDGIDQNYYKIKPDGPFRGSGHRGRQTLDDEGPFFCHRRKSPGRRDGPPVRGIKMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDRPFIRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRSPPGYRMRSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNRRFEDMDPRDRIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSDGEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVVEDYRHGRQFWNEHHGFEEDISRMKRKRF
Homology
BLAST of Lcy09g005930 vs. ExPASy TrEMBL
Match:
A0A6J1JYG4 (uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)
HSP 1 Score: 1660.2 bits (4298), Expect = 0.0e+00
Identity = 933/1415 (65.94%), Postives = 1058/1415 (74.77%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF ++QSP P KEISS LP + ++N
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSP--PPKEISS-----------LPLV---------DDNI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQ +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQRIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE A GCVE ++ +V E HA +H PE + L + +S+P A+
Sbjct: 121 EFQE----NDACSAGCVENKETCMVNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +E+D +K ++ + AE+SV H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEVD-RKQLDRLEFSTSLAKKEAELSVGSKEHLVPDSVLEGSDLKSLKQINLEPVLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS C+ NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
TIVTTHR STEMV TD L GKLT LD +DHLHLSLSSSD RHV SQEQS FAKL FRKT+
Sbjct: 301 TIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFAKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKL +SDE N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLMSKSDEVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPNASD+ PM+ VKAKS SES YESKQEALKTLG RLDLV KQV+PEVD+S P
Sbjct: 421 LQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGGRLDLVAKQVLPEVDSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ G+ DLI D D+SNH ELQTP++ HL++KVH+G Y CG E V+SEMTD
Sbjct: 481 PFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLNLKVHEGAYCCGGELVDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
IS+DP SKDS+ P IKP+A P RNPS + D II+ +MS+ SEL TPT GPL
Sbjct: 541 ISKDPGSKDSNGPIIKPIAMP---------RNPSPTNDSIIEANMSSPSELHTPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 MKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPFTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMD VD DIEEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDTVDNDIEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK FDHGD NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA +I ALEDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KISALEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKATNGIEE VSQSD KTVD V+ND ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ STSS PSKTRS+L RSVLTQTDR++IPDM HEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQI-PPYDAPKRKYMSV 1035
VNR QNLSPQTNF+RRRGRFTI+INS+QGEWDFNPTISPG Y+DQ+ PPYDA +RKYM
Sbjct: 961 VNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDQVPPPYDARRRKYMPA 1020
Query: 1036 VSDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVRGI 1095
VSDD IDQN+YK+KPDGPFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVRG+
Sbjct: 1021 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1080
Query: 1096 KMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PF 1155
KM R+PRNISP RC RE GSEL VGPRHGEKF+RT EDETMDP+YA PQP FE DR PF
Sbjct: 1081 KMAHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPF 1140
Query: 1156 IRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGYRM 1215
IRDRRNF IQRKSF RVDS+SPG SRGRSP QWFPSKRK +RFFGHPEM RRS PPGYRM
Sbjct: 1141 IRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRM 1200
Query: 1216 RSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNR 1275
RSPD P IHGDMPVRRHGFPFPSLPPN+LRDMGSARDHGHMR +RSRNRTDR+ FRNR
Sbjct: 1201 RSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNR 1260
Query: 1276 RFEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSD 1335
RFEDMDPRD RIES+EYFDGPVHP QLNEL DGND DRRRF +RHEHLH F+ CNDSD
Sbjct: 1261 RFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSD 1320
Query: 1336 GEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVV 1395
E YHND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE V
Sbjct: 1321 SENYHNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE-V 1349
Query: 1396 EDYR--HGRQFWNE------HHGFEEDISRMKRKR 1400
E+YR HGRQ WNE HHGF EDISRMKRKR
Sbjct: 1381 EEYRHGHGRQMWNEHHHHHHHHGF-EDISRMKRKR 1349
BLAST of Lcy09g005930 vs. ExPASy TrEMBL
Match:
A0A6J1FEB1 (uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)
HSP 1 Score: 1628.6 bits (4216), Expect = 0.0e+00
Identity = 920/1417 (64.93%), Postives = 1048/1417 (73.96%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF ++QSP P KEISS LP + +++
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSP--PPKEISS-----------LPLV---------DDSI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQS +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQSIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE A GCVE ++ ++ E HA +H PE + L + +S+P A+
Sbjct: 121 EFQE----NDACSTGCVENKETCMMNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +EID +K ++ + AE+SV H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEID-RKQLDRLEFSTSVAKKEAELSVGSKEHLVPNSVLEGSDLKSLKQINLEPVLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS + NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQRLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
TIVTTHR STEMV TD LPGKLT LD +DHLHLSLSSSD RHV SQEQS F KL FRKT+
Sbjct: 301 TIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKLE +SD N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLESKSDGVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPN SDI PM+ VKA+S SE YESKQEALKTLG RLDLV KQV+PEV +S P
Sbjct: 421 LQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLGGRLDLVAKQVLPEVGSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ + DLI D D+SNH ELQTP++ HL++ VH+G Y E ++SEMTD
Sbjct: 481 PFVAEMTEAARNSCSTDLITDGDMSNHPELQTPTKEHLNLNVHEGAYRFAGELIDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
+S+DP SKD +SP IKP+A P RNPS + D II+ +MS+ SEL PT GPL
Sbjct: 541 VSKDPGSKDFNSPIIKPIAMP---------RNPSRTNDSIIEANMSSPSELHIPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 TKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMD VD D+EEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDTVDNDVEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK+FDH D NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA + A+EDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KTSAIEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKATNGIEE VSQSD KTVD V+N+ ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ STSS PSKTRS+L RSVLTQTDRE+IPDM HEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDMAHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQ---IPPYDAPKRKYM 1035
VNR QNLSPQTNF+RRRGRFTI+INS+QGEWDFNPTISPG Y+D PPYDA +RKYM
Sbjct: 961 VNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDHQVPPPPYDARRRKYM 1020
Query: 1036 SVVSDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVR 1095
VSDD IDQN+YK+KPD PFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVR
Sbjct: 1021 PAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVR 1080
Query: 1096 GIKMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR- 1155
G+KMV R+PRNISP RC RE GSEL VGPRHGEKF+RT EDE MDP+YA PQP FE DR
Sbjct: 1081 GVKMVHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDEAMDPLYAHPQPSFEVDRS 1140
Query: 1156 PFIRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGY 1215
PFIRDRRNF IQRKSF RVDS+SPGRSRGRSP QWFPSKRK +RFFGHPEM RRS PPGY
Sbjct: 1141 PFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGY 1200
Query: 1216 RMRSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFR 1275
RMRSPD P IHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMR GIRSRNRT+R+ FR
Sbjct: 1201 RMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRPGIRSRNRTERMSFR 1260
Query: 1276 NRRFEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCND 1335
NRRFEDMDPRD RIES+EYFDGPVHP QLNEL DGND DRRRF DRHEHLH F+ CND
Sbjct: 1261 NRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFSDRHEHLHQFRPQCND 1320
Query: 1336 SDGEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHE 1395
SDGE Y ND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE
Sbjct: 1321 SDGENYRNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE 1351
Query: 1396 VVEDYR--HGRQFWNE------HHGFEEDISRMKRKR 1400
VE+YR HGRQ WNE HHGF EDISRMKRKR
Sbjct: 1381 -VEEYRHGHGRQMWNEHHHHHHHHGF-EDISRMKRKR 1351
BLAST of Lcy09g005930 vs. ExPASy TrEMBL
Match:
A0A6J1JUI7 (uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)
HSP 1 Score: 1603.6 bits (4151), Expect = 0.0e+00
Identity = 913/1415 (64.52%), Postives = 1036/1415 (73.22%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF ++QSP P KEISS LP + ++N
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSP--PPKEISS-----------LPLV---------DDNI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQ +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQRIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE A GCVE ++ +V E HA +H PE + L + +S+P A+
Sbjct: 121 EFQE----NDACSAGCVENKETCMVNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +E+D +K ++ + AE+SV H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEVD-RKQLDRLEFSTSLAKKEAELSVGSKEHLVPDSVLEGSDLKSLKQINLEPVLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS C+ NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
TIVTTHR STEMV TD L GKLT LD +DHLHLSLSSSD RHV SQEQS FAKL FRKT+
Sbjct: 301 TIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFAKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKL +SDE N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLMSKSDEVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPNASD+ PM+ VKAKS SES YESKQEALKTLG RLDLV KQV+PEVD+S P
Sbjct: 421 LQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGGRLDLVAKQVLPEVDSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ G+ DLI D D+SNH ELQTP++ HL++KVH+G Y CG E V+SEMTD
Sbjct: 481 PFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLNLKVHEGAYCCGGELVDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
IS+DP SKDS+ P IKP+A P RNPS + D II+ +MS+ SEL TPT GPL
Sbjct: 541 ISKDPGSKDSNGPIIKPIAMP---------RNPSPTNDSIIEANMSSPSELHTPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 MKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPFTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMD VD DIEEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDTVDNDIEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK FDHGD NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA +I ALEDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KISALEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKATNGIEE VSQSD KTVD V+ND ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ STSS PSKTRS+L RSVLTQTDR++IPDM HEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQI-PPYDAPKRKYMSV 1035
VNR QNLSPQTNF+RRRG Y+DQ+ PPYDA +RKYM
Sbjct: 961 VNRHQNLSPQTNFSRRRGN----------------------YSDQVPPPYDARRRKYMPA 1020
Query: 1036 VSDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVRGI 1095
VSDD IDQN+YK+KPDGPFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVRG+
Sbjct: 1021 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1080
Query: 1096 KMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PF 1155
KM R+PRNISP RC RE GSEL VGPRHGEKF+RT EDETMDP+YA PQP FE DR PF
Sbjct: 1081 KMAHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPF 1140
Query: 1156 IRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGYRM 1215
IRDRRNF IQRKSF RVDS+SPG SRGRSP QWFPSKRK +RFFGHPEM RRS PPGYRM
Sbjct: 1141 IRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRM 1200
Query: 1216 RSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNR 1275
RSPD P IHGDMPVRRHGFPFPSLPPN+LRDMGSARDHGHMR +RSRNRTDR+ FRNR
Sbjct: 1201 RSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNR 1260
Query: 1276 RFEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSD 1335
RFEDMDPRD RIES+EYFDGPVHP QLNEL DGND DRRRF +RHEHLH F+ CNDSD
Sbjct: 1261 RFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSD 1320
Query: 1336 GEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVV 1395
E YHND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE V
Sbjct: 1321 SENYHNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE-V 1327
Query: 1396 EDYR--HGRQFWNE------HHGFEEDISRMKRKR 1400
E+YR HGRQ WNE HHGF EDISRMKRKR
Sbjct: 1381 EEYRHGHGRQMWNEHHHHHHHHGF-EDISRMKRKR 1327
BLAST of Lcy09g005930 vs. ExPASy TrEMBL
Match:
A0A6J1FDD8 (uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)
HSP 1 Score: 1574.7 bits (4076), Expect = 0.0e+00
Identity = 899/1414 (63.58%), Postives = 1028/1414 (72.70%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF ++QSP P KEISS LP + +++
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSP--PPKEISS-----------LPLV---------DDSI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQS +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQSIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE A GCVE ++ ++ E HA +H PE + L + +S+P A+
Sbjct: 121 EFQE----NDACSTGCVENKETCMMNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +EID +K ++ + AE+SV H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEID-RKQLDRLEFSTSVAKKEAELSVGSKEHLVPNSVLEGSDLKSLKQINLEPVLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS + NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQRLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
TIVTTHR STEMV TD LPGKLT LD +DHLHLSLSSSD RHV SQEQS F KL FRKT+
Sbjct: 301 TIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKLE +SD N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLESKSDGVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPN SDI PM+ VKA+S SE YESKQEALKTLG RLDLV KQV+PEV +S P
Sbjct: 421 LQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLGGRLDLVAKQVLPEVGSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ + DLI D D+SNH ELQTP++ HL++ VH+G Y E ++SEMTD
Sbjct: 481 PFVAEMTEAARNSCSTDLITDGDMSNHPELQTPTKEHLNLNVHEGAYRFAGELIDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
+S+DP SKD +SP IKP+A P RNPS + D II+ +MS+ SEL PT GPL
Sbjct: 541 VSKDPGSKDFNSPIIKPIAMP---------RNPSRTNDSIIEANMSSPSELHIPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 TKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMD VD D+EEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDTVDNDVEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK+FDH D NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA + A+EDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KTSAIEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKATNGIEE VSQSD KTVD V+N+ ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ STSS PSKTRS+L RSVLTQTDRE+IPDM HEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDMAHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQIPPYDAPKRKYMSVV 1035
VNR QNLSPQTNF+RRRG ++ + + P PPYDA +RKYM V
Sbjct: 961 VNRHQNLSPQTNFSRRRGNYS------------DHQVPP-------PPYDARRRKYMPAV 1020
Query: 1036 SDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVRGIK 1095
SDD IDQN+YK+KPD PFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVRG+K
Sbjct: 1021 SDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGVK 1080
Query: 1096 MVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PFI 1155
MV R+PRNISP RC RE GSEL VGPRHGEKF+RT EDE MDP+YA PQP FE DR PFI
Sbjct: 1081 MVHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDEAMDPLYAHPQPSFEVDRSPFI 1140
Query: 1156 RDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGYRMR 1215
RDRRNF IQRKSF RVDS+SPGRSRGRSP QWFPSKRK +RFFGHPEM RRS PPGYRMR
Sbjct: 1141 RDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRMR 1200
Query: 1216 SPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNRR 1275
SPD P IHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMR GIRSRNRT+R+ FRNRR
Sbjct: 1201 SPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRPGIRSRNRTERMSFRNRR 1260
Query: 1276 FEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSDG 1335
FEDMDPRD RIES+EYFDGPVHP QLNEL DGND DRRRF DRHEHLH F+ CNDSDG
Sbjct: 1261 FEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSDG 1320
Query: 1336 EIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVVE 1395
E Y ND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE VE
Sbjct: 1321 ENYRNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE-VE 1329
Query: 1396 DYR--HGRQFWNE------HHGFEEDISRMKRKR 1400
+YR HGRQ WNE HHGF EDISRMKRKR
Sbjct: 1381 EYRHGHGRQMWNEHHHHHHHHGF-EDISRMKRKR 1329
BLAST of Lcy09g005930 vs. ExPASy TrEMBL
Match:
A0A6J1BWB0 (uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006113 PE=4 SV=1)
HSP 1 Score: 1494.9 bits (3869), Expect = 0.0e+00
Identity = 854/1422 (60.06%), Postives = 995/1422 (69.97%), Query Frame = 0
Query: 1 MAVPESEEVGFKRIGLSASDYDASLPIKKRRFLVVQ-SPPSPSKEISSFHPDGNIMKTEL 60
M PE+EEVGFK +SASDY+ +PIKKRRF +VQ SP SP KE+SS
Sbjct: 1 MVAPETEEVGFKHTVMSASDYNVIVPIKKRRFTIVQSSPSSPHKELSS------------ 60
Query: 61 PSILKDVSLLHTNENFIKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTD 120
L ++N +K +PG+S SSS +T+S LS + + S+ ++ K D
Sbjct: 61 ---------LSLDDNLVKVAEPGISDGITVSSSVTITTSELSEKKEISFSEESERKVDL- 120
Query: 121 SCYLDMVQSDVGMPGVKFQEPSLGGRASFDGCVEYEDKSLVTEKHAVHASPEIRGRLELS 180
C + VQS++ GV+FQE A F+ VE + ++ EKHA+H + +L S
Sbjct: 121 -CNSNRVQSNIEPSGVRFQEDD----ACFNHQVENKAMNVENEKHALHLLEKPELKLPTS 180
Query: 181 STSLDSDPLADNKEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNF 240
+ A+ K ID +K EKC S V+ AE+SV L + P LV + SD +
Sbjct: 181 DPNSKLGLCANKKRVGID-RKELEKCKSLTSLVKTEAELSVGLNERLVPDLVVKGSDRKW 240
Query: 241 LKQNSLEPELLYLSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASD 300
KQN+LEP L LSL+KQGS T C+ NVGSDYDGS +SNR NWDLNTSME WE CASD
Sbjct: 241 QKQNNLEPVSLNLSLSKQGSYTQCLTSNVGSDYDGSLQQSNRGNWDLNTSMESWEGCASD 300
Query: 301 DPPVHVP-IQTNTIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQ 360
DP V VP +QTNTIVTTHRCSTEMV+ DI GK T LD +D+LHLSL+SSDLR VT QEQ
Sbjct: 301 DPSVQVPVVQTNTIVTTHRCSTEMVRADISSGKPTPLDQSDYLHLSLNSSDLRPVTKQEQ 360
Query: 361 SPFAKLDFRKTNPSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNG 420
KLDFR T+ S+S PG M+FDDLN ALKVVK EPFV+GS+LE +S+E LGL
Sbjct: 361 ISSVKLDFRSTDSSLSSPG-NMQFDDLNVALKVVKAEPFVKGSELESKSNEVKGLGLSGD 420
Query: 421 TVVKRESDDQCNLQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQV 480
++ E DDQCNL++P AS+ICSPM+IVKAKS KSE VYESK+EAL+ LG RL+L+ KQV
Sbjct: 421 ALMNGELDDQCNLELPKASNICSPMNIVKAKSFKSEPVYESKKEALEMLGGRLNLISKQV 480
Query: 481 VPEVDNSRPVAEMSGTTGDLDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNS 540
+P+VDNS
Sbjct: 481 LPDVDNS----------------------------------------------------- 540
Query: 541 EMTDISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTE 600
C P+A PVVAEM EAARNPSCST L DGDM NHSEL TPT+
Sbjct: 541 ---------C----------PIAVPVVAEMSEAARNPSCSTYLATDGDMLNHSELPTPTK 600
Query: 601 GPLIGKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKL 660
G L CGGGL+N + D++KDPG DS+ + K +D+N+NNP+W LKL
Sbjct: 601 GNL--------NECGGGLVNSEKTDITKDPGLGDSSISIAKPFNAEDENQNNPKWCLLKL 660
Query: 661 TNEQCSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEED 720
+NEQCS LQGGE+ VSDEEKIS+SAD+LEE PYSSEYESD + DV+ AM V DIEED
Sbjct: 661 SNEQCSGLQGGEESSVSDEEKISLSADILEEYPYSSEYESDGKQDVDGAMAEVHNDIEED 720
Query: 721 YEDGEVREPILKIQVESNVCEQKEVKSFDHGDCCN-------GLPSIDCSSLVFVKQENK 780
YEDGEVREP+LK QVES+VC ++EV++FDHGD GLP D S+L+ VKQENK
Sbjct: 721 YEDGEVREPLLKTQVESSVCVKREVENFDHGDFSKDKKINSVGLPGTDFSTLISVKQENK 780
Query: 781 SDIHDVKLEDNLHSVTSNQSSEQEK-----LKELLVEEDIIRVCSNKANKAIKATDTRQM 840
+ HDV+ ED HSVT+NQSSEQEK LKE+LVEE N +NK IKAT RQ+
Sbjct: 781 LESHDVRQEDKFHSVTTNQSSEQEKDEASYLKEILVEE-------NASNKVIKATGRRQL 840
Query: 841 SQCEEIDALEDQEISSEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTN 900
CEE DALEDQ SS+KAT+GIEEP V VSQ D N KTVDFV+N+ LPNVKEP+ N
Sbjct: 841 FHCEERDALEDQN-SSDKATDGIEEPIVTVSQGDAENVKTVDFVRNNDPVLPNVKEPVNN 900
Query: 901 DDVTNDFTCGSVHSQIVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQP 960
DD T+DF GS H +N C STSSSPSKTRSN +RSVLT+TDREQI D+ EGGKLQP
Sbjct: 901 DDATDDFIHGSRH---INPCHGSTSSSPSKTRSNSLRSVLTRTDREQILDVALEGGKLQP 960
Query: 961 QGRDEPYSDVSQKFYVNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQ 1020
QGRD+ YS VSQK YVNR QNLSPQTNF RR RFTI+ +SLQGEWDFNPT+SPGIY+DQ
Sbjct: 961 QGRDDRYSGVSQKIYVNRHQNLSPQTNF-HRRERFTIRTDSLQGEWDFNPTVSPGIYSDQ 1020
Query: 1021 IPPYDAPKRKYMSVVSDDGIDQNYYKIKPDGPFRGSGHRGRQTLDDEGPFFCH---RRKS 1080
I PYDAP+RKY+S VSDD IDQN+YKIKP+GPFR +G +GRQ LDDEGP +CH RRKS
Sbjct: 1021 I-PYDAPRRKYLSAVSDDDIDQNHYKIKPNGPFRSAGRQGRQILDDEGPPYCHIPSRRKS 1080
Query: 1081 PGRRDGPPVRGIKMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYAR 1140
PG RDGPPVRG+KMV R+PRNISP CIRE GSEL VGPRHGEKF+RT EDETMDPIYA
Sbjct: 1081 PGIRDGPPVRGVKMVHRMPRNISPSGCIREAGSEL-VGPRHGEKFMRTFEDETMDPIYAH 1140
Query: 1141 PQPPFEGDR-PFIRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPE 1200
PQPP+E DR PFIR+RRNF+IQRK+FPR+DS+SPGRSRGRSPGQW P KRK RF GH
Sbjct: 1141 PQPPYEVDRPPFIRERRNFTIQRKTFPRIDSKSPGRSRGRSPGQWVPGKRKSYRFCGHLG 1200
Query: 1201 MGRRSPPGY---RMRSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGI 1260
M RRS PGY RMRSPD PPIHGDMPVRRHGFPF LP +DLRDM SA D GHMRS I
Sbjct: 1201 MTRRSSPGYRGDRMRSPDQPPPIHGDMPVRRHGFPFSPLPSSDLRDMRSAPDQGHMRSDI 1260
Query: 1261 RSRNRTDRIPFRNRRFEDMDPRDRIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHE 1320
R RNR+DR+ FRNRRFE MDPRDRIESSEYFDG P QLNEL+GDGND DRRRF DRHE
Sbjct: 1261 RCRNRSDRLSFRNRRFEIMDPRDRIESSEYFDG---PSQLNELSGDGNDDDRRRFSDRHE 1294
Query: 1321 HLHPFQLPCNDSDGEIYHNDLNEGHRPFRYCAED-EPEFHERGKLREREFDRRVKNQPGN 1380
HLH F+ NDSDGE YHN+ + RPFR+CAED PEFHERG +REREF+RRVKNQPGN
Sbjct: 1321 HLHSFRPQYNDSDGENYHNNAEDSRRPFRFCAEDGPPEFHERGNMREREFNRRVKNQPGN 1294
Query: 1381 LPRQTGVIEEHEVVEDYRHGRQFWNEHHGFEEDISRMKRKRF 1401
L R+TGV+ E VEDYRHGRQ WN+ HGF EDISRMKRKRF
Sbjct: 1381 LTRRTGVVIEEHEVEDYRHGRQMWND-HGF-EDISRMKRKRF 1294
BLAST of Lcy09g005930 vs. NCBI nr
Match:
XP_038890337.1 (uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890338.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890339.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890340.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890341.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890342.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida])
HSP 1 Score: 1718.7 bits (4450), Expect = 0.0e+00
Identity = 943/1406 (67.07%), Postives = 1082/1406 (76.96%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S +DY +PIKKRRF +QS SP KEISS P ++N
Sbjct: 1 MSTNDYTTIVPIKKRRFPSIQS--SPPKEISSLPP--------------------VDDNM 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
+K E+P +S S+SS +T+S S + + S+ K+D C ++MVQS +G V
Sbjct: 61 VKVEEPCVSDSPTVSNSSTITTSEFSEKKKISFSEDGNWKSDL--CNVNMVQSSIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKH--AVHASPEIRGRLELSSTSLDSDP--LADN 195
+F++ F G V ++ LV E A+ PE L+L S+ DS+P A+
Sbjct: 121 EFKK----NDDCFTGSVGNKETCLVNENRMLALQEKPE----LKLPSSDPDSNPGVCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
K +EI +K +KC+S V+ E+S+ L +LVP ++ L+ N+L P +L
Sbjct: 181 KSDEIH-RKELDKCNSSTSVVKKEVELSLSL----KERLVP----VSVLEGNNLGPVVLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+KQGS T C+ GNVGSD DGS +SNRENWDLNTSMEFWE CASDDPPVHVP +QTN
Sbjct: 241 LSLSKQGSHTQCLTGNVGSDNDGSLQQSNRENWDLNTSMEFWEGCASDDPPVHVPVVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLT-LLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKT 375
T V T RCSTEMVKTD L GKLT LDH+DHLHLSL SSD RHV SQEQS F KLDFRK+
Sbjct: 301 TTVATDRCSTEMVKTDTLFGKLTHPLDHSDHLHLSLCSSDHRHVMSQEQSSFIKLDFRKS 360
Query: 376 NPSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQC 435
+PS+S PGR +FDDLNG LKVVK EPF EGSKLE +SDE N G+ + VVKR
Sbjct: 361 SPSLSSPGRSKQFDDLNGTLKVVKSEPFAEGSKLESKSDEVNVPGVSDNAVVKRGF---- 420
Query: 436 NLQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP-- 495
LQ+P+ASDI M+IVK+KS KSES+Y+SKQEALKTLG RLDLVEKQV+ +VDNS
Sbjct: 421 -LQLPSASDIYKSMNIVKSKSIKSESIYQSKQEALKTLGGRLDLVEKQVLSDVDNSCAVP 480
Query: 496 ---VAEMSGTTGD----LDLIRDTDLSNHLELQTPSEGHLDMKVHQ-GEYGCGDEHVNSE 555
VAEMS G+ DLI D D+SNH ELQTPS+ H+ +HQ G +GC E V SE
Sbjct: 481 MSFVAEMSEVAGNPSCTTDLIIDKDMSNHSELQTPSKEHISTIMHQGGSHGCCGELVKSE 540
Query: 556 MTDISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEG 615
+TDISED SKDSSSP KP+A P +AEM + A+NPSC+ D+I+D D+ NHSEL TPT G
Sbjct: 541 VTDISEDTGSKDSSSPITKPIAIP-LAEMSKTAKNPSCTNDMIVDKDVPNHSELQTPTRG 600
Query: 616 PLIGKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLT 675
PL KVHQG GC GGL+N +M D+SKD SKDS S VIK I++D+N+NNP+W PL+
Sbjct: 601 PLNRKVHQGD-GCDGGLVNSEMTDLSKDTCSKDSNSSVIKPFIVEDQNENNPQWHPLEHR 660
Query: 676 NEQCSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDY 735
N+QCSSL G E+C VSDEEKIS+SADLLEEDPYSSEYESD + DVNEAMDAVD IEEDY
Sbjct: 661 NKQCSSLHGCEECSVSDEEKISLSADLLEEDPYSSEYESDGKQDVNEAMDAVDNVIEEDY 720
Query: 736 EDGEVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSIDCSSLVFVKQENKSDIHDVKL 795
EDGEVREPIL QVES++CE +EVK FDHGDC NGLP DCSSLV VKQE+KS+I DVK
Sbjct: 721 EDGEVREPILMTQVESSICETREVKIFDHGDCSNGLPGSDCSSLVSVKQEDKSEILDVKR 780
Query: 796 EDNLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQE 855
ED LH VTSNQSSEQE LKELLVE++ +V NKANKAIKAT RQ+ CE+I ALEDQ+
Sbjct: 781 EDILHFVTSNQSSEQEHLKELLVEDNTSKVSLNKANKAIKATGPRQLFHCEKIFALEDQK 840
Query: 856 ISSEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVH 915
ISSE+AT GIEE VSQSD N KTVDFVQN+ LALPNVKEPL NDDVT+DFT G+ H
Sbjct: 841 ISSERATTGIEESIATVSQSDAENVKTVDFVQNEDLALPNVKEPLNNDDVTDDFTRGNRH 900
Query: 916 SQIVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQK 975
SQIVN CQAST SSP+KTR +LVRSVLTQTDRE IPDM H+G KLQPQGRD+ Y DV K
Sbjct: 901 SQIVNPCQAST-SSPTKTRPSLVRSVLTQTDRELIPDMAHDGEKLQPQGRDDSYRDVFPK 960
Query: 976 FYVNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQIPPYDAPKRKYMS 1035
FYVNR+QNLSP+TNFTRRRGRFTI+INS+QGEWDFNPTISPG+YNDQIPPYDA +RKYM
Sbjct: 961 FYVNRRQNLSPRTNFTRRRGRFTIRINSVQGEWDFNPTISPGVYNDQIPPYDARRRKYMP 1020
Query: 1036 VVSDDGIDQNYYKIKPDGPFRGSGHRGRQTLDDEGPFFCH---RRKSPGRRDGPPVR-GI 1095
VSD+ IDQN+YK+KP GPFR GHRGRQ LDDEGP FCH RRKSPGRRDGPP+R G+
Sbjct: 1021 AVSDEDIDQNHYKMKPGGPFRTGGHRGRQILDDEGPIFCHIPSRRKSPGRRDGPPLRGGV 1080
Query: 1096 KMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PF 1155
KMV + RN+SP RCIREPGSEL +GPRHGEKF+RTL+DETMDP+Y PQPPFE DR P+
Sbjct: 1081 KMVHGMHRNVSPSRCIREPGSEL-IGPRHGEKFMRTLDDETMDPMY-HPQPPFEVDRPPY 1140
Query: 1156 IRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRSPPGYRMR 1215
I DRRNF IQRKSFPRVDS+SPGRSRGRSPGQWFPSKRK +RFFGHPEM RRSPPGYRMR
Sbjct: 1141 IPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFGHPEMARRSPPGYRMR 1200
Query: 1216 SPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNRR 1275
SPD PPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRI FRNRR
Sbjct: 1201 SPD-QPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRISFRNRR 1260
Query: 1276 FEDMDPRDRIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSDGE 1335
FEDMDPRDRIES+EY+DGP+HP Q NEL DGND DRRRFPDRHEHLHPF+ CNDSDGE
Sbjct: 1261 FEDMDPRDRIESNEYYDGPIHPGQFNELVVDGNDDDRRRFPDRHEHLHPFRPQCNDSDGE 1320
Query: 1336 IYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVVED 1395
YHND +E RPFRYCAEDE EFHER K+REREFDRR+KNQ NL R+TGVIEEHE ++
Sbjct: 1321 NYHNDADERPRPFRYCAEDEAEFHERSKMREREFDRRLKNQSENLGRRTGVIEEHE--QE 1351
Query: 1396 YRHGRQFWNEHHGFEEDISRMKRKRF 1401
YRHGRQ WNEHHGFEE ISRMKRKRF
Sbjct: 1381 YRHGRQLWNEHHGFEE-ISRMKRKRF 1351
BLAST of Lcy09g005930 vs. NCBI nr
Match:
XP_038890343.1 (uncharacterized protein LOC120079942 isoform X2 [Benincasa hispida])
HSP 1 Score: 1661.4 bits (4301), Expect = 0.0e+00
Identity = 923/1406 (65.65%), Postives = 1060/1406 (75.39%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S +DY +PIKKRRF +QS SP KEISS P ++N
Sbjct: 1 MSTNDYTTIVPIKKRRFPSIQS--SPPKEISSLPP--------------------VDDNM 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
+K E+P +S S+SS +T+S S + + S+ K+D C ++MVQS +G V
Sbjct: 61 VKVEEPCVSDSPTVSNSSTITTSEFSEKKKISFSEDGNWKSDL--CNVNMVQSSIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKH--AVHASPEIRGRLELSSTSLDSDP--LADN 195
+F++ F G V ++ LV E A+ PE L+L S+ DS+P A+
Sbjct: 121 EFKK----NDDCFTGSVGNKETCLVNENRMLALQEKPE----LKLPSSDPDSNPGVCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
K +EI +K +KC+S V+ E+S+ L +LVP ++ L+ N+L P +L
Sbjct: 181 KSDEIH-RKELDKCNSSTSVVKKEVELSLSL----KERLVP----VSVLEGNNLGPVVLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+KQGS T C+ GNVGSD DGS +SNRENWDLNTSMEFWE CASDDPPVHVP +QTN
Sbjct: 241 LSLSKQGSHTQCLTGNVGSDNDGSLQQSNRENWDLNTSMEFWEGCASDDPPVHVPVVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLT-LLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKT 375
T V T RCSTEMVKTD L GKLT LDH+DHLHLSL SSD RHV SQEQS F KLDFRK+
Sbjct: 301 TTVATDRCSTEMVKTDTLFGKLTHPLDHSDHLHLSLCSSDHRHVMSQEQSSFIKLDFRKS 360
Query: 376 NPSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQC 435
+PS+S PGR +FDDLNG LKVVK EPF EGSKLE +SDE N G+ + VVKR
Sbjct: 361 SPSLSSPGRSKQFDDLNGTLKVVKSEPFAEGSKLESKSDEVNVPGVSDNAVVKRGF---- 420
Query: 436 NLQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP-- 495
LQ+P+ASDI M+IVK+KS KSES+Y+SKQEALKTLG RLDLVEKQV+ +VDNS
Sbjct: 421 -LQLPSASDIYKSMNIVKSKSIKSESIYQSKQEALKTLGGRLDLVEKQVLSDVDNSCAVP 480
Query: 496 ---VAEMSGTTGD----LDLIRDTDLSNHLELQTPSEGHLDMKVHQ-GEYGCGDEHVNSE 555
VAEMS G+ DLI D D+SNH ELQTPS+ H+ +HQ G +GC E V SE
Sbjct: 481 MSFVAEMSEVAGNPSCTTDLIIDKDMSNHSELQTPSKEHISTIMHQGGSHGCCGELVKSE 540
Query: 556 MTDISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEG 615
+TDISED SKDSSSP KP+A P +AEM + A+NPSC+ D+I+D D+ NHSEL TPT G
Sbjct: 541 VTDISEDTGSKDSSSPITKPIAIP-LAEMSKTAKNPSCTNDMIVDKDVPNHSELQTPTRG 600
Query: 616 PLIGKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLT 675
PL KVHQG GC GGL+N +M D+SKD SKDS S VIK I++D+N+NNP+W PL+
Sbjct: 601 PLNRKVHQGD-GCDGGLVNSEMTDLSKDTCSKDSNSSVIKPFIVEDQNENNPQWHPLEHR 660
Query: 676 NEQCSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDY 735
N+QCSSL G E+C VSDEEKIS+SADLLEEDPYSSEYESD + DVNEAMDAVD IEEDY
Sbjct: 661 NKQCSSLHGCEECSVSDEEKISLSADLLEEDPYSSEYESDGKQDVNEAMDAVDNVIEEDY 720
Query: 736 EDGEVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSIDCSSLVFVKQENKSDIHDVKL 795
EDGEVREPIL QVES++CE +EVK FDHGDC NGLP DCSSLV VKQE+KS+I DVK
Sbjct: 721 EDGEVREPILMTQVESSICETREVKIFDHGDCSNGLPGSDCSSLVSVKQEDKSEILDVKR 780
Query: 796 EDNLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQE 855
ED LH VTSNQSSEQE LKELLVE++ +V NKANKAIKAT RQ+ CE+I ALEDQ+
Sbjct: 781 EDILHFVTSNQSSEQEHLKELLVEDNTSKVSLNKANKAIKATGPRQLFHCEKIFALEDQK 840
Query: 856 ISSEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVH 915
ISSE+AT GIEE VSQSD N KTVDFVQN+ LALPNVKEPL NDDVT+DFT G+ H
Sbjct: 841 ISSERATTGIEESIATVSQSDAENVKTVDFVQNEDLALPNVKEPLNNDDVTDDFTRGNRH 900
Query: 916 SQIVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQK 975
SQIVN CQAST SSP+KTR +LVRSVLTQTDRE IPDM H+G KLQPQGRD+ Y DV K
Sbjct: 901 SQIVNPCQAST-SSPTKTRPSLVRSVLTQTDRELIPDMAHDGEKLQPQGRDDSYRDVFPK 960
Query: 976 FYVNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQIPPYDAPKRKYMS 1035
FYVNR+QNLSP+TNFTRRR G+YNDQIPPYDA +RKYM
Sbjct: 961 FYVNRRQNLSPRTNFTRRR----------------------GVYNDQIPPYDARRRKYMP 1020
Query: 1036 VVSDDGIDQNYYKIKPDGPFRGSGHRGRQTLDDEGPFFCH---RRKSPGRRDGPPVR-GI 1095
VSD+ IDQN+YK+KP GPFR GHRGRQ LDDEGP FCH RRKSPGRRDGPP+R G+
Sbjct: 1021 AVSDEDIDQNHYKMKPGGPFRTGGHRGRQILDDEGPIFCHIPSRRKSPGRRDGPPLRGGV 1080
Query: 1096 KMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PF 1155
KMV + RN+SP RCIREPGSEL +GPRHGEKF+RTL+DETMDP+Y PQPPFE DR P+
Sbjct: 1081 KMVHGMHRNVSPSRCIREPGSEL-IGPRHGEKFMRTLDDETMDPMY-HPQPPFEVDRPPY 1140
Query: 1156 IRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRSPPGYRMR 1215
I DRRNF IQRKSFPRVDS+SPGRSRGRSPGQWFPSKRK +RFFGHPEM RRSPPGYRMR
Sbjct: 1141 IPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFGHPEMARRSPPGYRMR 1200
Query: 1216 SPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNRR 1275
SPD PPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRI FRNRR
Sbjct: 1201 SPD-QPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRISFRNRR 1260
Query: 1276 FEDMDPRDRIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSDGE 1335
FEDMDPRDRIES+EY+DGP+HP Q NEL DGND DRRRFPDRHEHLHPF+ CNDSDGE
Sbjct: 1261 FEDMDPRDRIESNEYYDGPIHPGQFNELVVDGNDDDRRRFPDRHEHLHPFRPQCNDSDGE 1320
Query: 1336 IYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVVED 1395
YHND +E RPFRYCAEDE EFHER K+REREFDRR+KNQ NL R+TGVIEEHE ++
Sbjct: 1321 NYHNDADERPRPFRYCAEDEAEFHERSKMREREFDRRLKNQSENLGRRTGVIEEHE--QE 1329
Query: 1396 YRHGRQFWNEHHGFEEDISRMKRKRF 1401
YRHGRQ WNEHHGFEE ISRMKRKRF
Sbjct: 1381 YRHGRQLWNEHHGFEE-ISRMKRKRF 1329
BLAST of Lcy09g005930 vs. NCBI nr
Match:
XP_022992789.1 (uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790.1 uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima])
HSP 1 Score: 1660.2 bits (4298), Expect = 0.0e+00
Identity = 933/1415 (65.94%), Postives = 1058/1415 (74.77%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF ++QSP P KEISS LP + ++N
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSP--PPKEISS-----------LPLV---------DDNI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQ +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQRIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE A GCVE ++ +V E HA +H PE + L + +S+P A+
Sbjct: 121 EFQE----NDACSAGCVENKETCMVNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +E+D +K ++ + AE+SV H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEVD-RKQLDRLEFSTSLAKKEAELSVGSKEHLVPDSVLEGSDLKSLKQINLEPVLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS C+ NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQCLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
TIVTTHR STEMV TD L GKLT LD +DHLHLSLSSSD RHV SQEQS FAKL FRKT+
Sbjct: 301 TIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFAKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKL +SDE N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLMSKSDEVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPNASD+ PM+ VKAKS SES YESKQEALKTLG RLDLV KQV+PEVD+S P
Sbjct: 421 LQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLGGRLDLVAKQVLPEVDSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ G+ DLI D D+SNH ELQTP++ HL++KVH+G Y CG E V+SEMTD
Sbjct: 481 PFVAEMTEAAGNSCSTDLITDGDMSNHPELQTPTKEHLNLKVHEGAYCCGGELVDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
IS+DP SKDS+ P IKP+A P RNPS + D II+ +MS+ SEL TPT GPL
Sbjct: 541 ISKDPGSKDSNGPIIKPIAMP---------RNPSPTNDSIIEANMSSPSELHTPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 MKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPFTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMD VD DIEEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDTVDNDIEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK FDHGD NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA +I ALEDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KISALEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKATNGIEE VSQSD KTVD V+ND ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ STSS PSKTRS+L RSVLTQTDR++IPDM HEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDMAHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQI-PPYDAPKRKYMSV 1035
VNR QNLSPQTNF+RRRGRFTI+INS+QGEWDFNPTISPG Y+DQ+ PPYDA +RKYM
Sbjct: 961 VNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDQVPPPYDARRRKYMPA 1020
Query: 1036 VSDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVRGI 1095
VSDD IDQN+YK+KPDGPFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVRG+
Sbjct: 1021 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1080
Query: 1096 KMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PF 1155
KM R+PRNISP RC RE GSEL VGPRHGEKF+RT EDETMDP+YA PQP FE DR PF
Sbjct: 1081 KMAHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPF 1140
Query: 1156 IRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGYRM 1215
IRDRRNF IQRKSF RVDS+SPG SRGRSP QWFPSKRK +RFFGHPEM RRS PPGYRM
Sbjct: 1141 IRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRM 1200
Query: 1216 RSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNR 1275
RSPD P IHGDMPVRRHGFPFPSLPPN+LRDMGSARDHGHMR +RSRNRTDR+ FRNR
Sbjct: 1201 RSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSARDHGHMRPSLRSRNRTDRMSFRNR 1260
Query: 1276 RFEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSD 1335
RFEDMDPRD RIES+EYFDGPVHP QLNEL DGND DRRRF +RHEHLH F+ CNDSD
Sbjct: 1261 RFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFANRHEHLHQFRPQCNDSD 1320
Query: 1336 GEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVV 1395
E YHND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE V
Sbjct: 1321 SENYHNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE-V 1349
Query: 1396 EDYR--HGRQFWNE------HHGFEEDISRMKRKR 1400
E+YR HGRQ WNE HHGF EDISRMKRKR
Sbjct: 1381 EEYRHGHGRQMWNEHHHHHHHHGF-EDISRMKRKR 1349
BLAST of Lcy09g005930 vs. NCBI nr
Match:
XP_023550091.1 (uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 1656.3 bits (4288), Expect = 0.0e+00
Identity = 930/1414 (65.77%), Postives = 1056/1414 (74.68%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF +QSP P KEISS LP + ++N
Sbjct: 1 MSTSDYNAIVPIKKRRFPSMQSP--PPKEISS-----------LPLV---------DDNI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQS +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQSIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE + GCVE ++ +V E HA +H PE + L + +S+P A+
Sbjct: 121 EFQENDV----CSTGCVENKETCMVNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +EID +K ++ + AE+S+ H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEID-RKQLDRLEFSTSVAKKEAELSIGSKEHLVPDSVLEGSDLKSLKQINLEPGLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS + NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQPLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
T+VT HR STEMV TD L GKLT LD +DHLHLSLSSSD RHV SQEQS F KL FRKT+
Sbjct: 301 TVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKLE +SDE N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLESKSDEVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPNASDI PM+ VKAKS SES YESKQ AL+TLG RLDLV KQV+PEVD+S P
Sbjct: 421 LQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLGGRLDLVAKQVLPEVDSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ G+ DLI D +SNH ELQTP+E HL++KVH+G Y CG E V+SEMTD
Sbjct: 481 PFVAEMTEAAGNSCSTDLITDGGMSNHSELQTPTEEHLNLKVHEGAYRCGGELVDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
IS+DP SKD +SP IKP+A P RNPS + D II+ +MS+ SEL PT GPL
Sbjct: 541 ISKDPGSKDFNSPIIKPIAMP---------RNPSRTNDSIIEANMSSPSELHIPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 TKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMDAVD DIEEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDAVDNDIEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK+FDHGD NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA + ALEDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KTSALEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKA+NGIEE VSQSD KTVD V+ND ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ S+SS PSKT+S+L RSVLTQTDRE+IPDMGHEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDMGHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQI-PPYDAPKRKYMSV 1035
VNR QNLSPQTNF+RRRGRFTI+INS+QGEWDFNPTISPG YNDQ+ PPYDA +RKYM
Sbjct: 961 VNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYNDQVPPPYDARRRKYMPA 1020
Query: 1036 VSDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVRGI 1095
VSDD IDQN+YK+KPDGPFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVRG+
Sbjct: 1021 VSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVRGV 1080
Query: 1096 KMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR-PF 1155
KMV R+PRNISP RC RE GSEL VGPRHGEKF+RT EDETMDP+YA PQP FE DR PF
Sbjct: 1081 KMVHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDETMDPLYAHPQPSFEVDRPPF 1140
Query: 1156 IRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGYRM 1215
IRDRRNF IQRKSF RVDS+SPGRSRGRSP QWFPSKRK +RFFGHPEM RRS PPGYRM
Sbjct: 1141 IRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGYRM 1200
Query: 1216 RSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFRNR 1275
RSPD P IHGDMP RRHGFPFPSLPPNDLRDMGSARDHGHMR G+RSRNRTDR+ FRNR
Sbjct: 1201 RSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSARDHGHMRPGLRSRNRTDRMSFRNR 1260
Query: 1276 RFEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDSD 1335
RFEDMDPRD RIES+EYFDGPVHP Q+NEL DGND DRRRF DRHEHLH F+ CNDSD
Sbjct: 1261 RFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDDDRRRFSDRHEHLHQFRPQCNDSD 1320
Query: 1336 GEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHEVV 1395
GE YHND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE V
Sbjct: 1321 GENYHNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE-V 1349
Query: 1396 EDYR--HGRQFWNEHHGFE-----EDISRMKRKR 1400
E+YR HGRQ WNEHH EDISRMKRKR
Sbjct: 1381 EEYRHGHGRQMWNEHHHHHHHHSFEDISRMKRKR 1349
BLAST of Lcy09g005930 vs. NCBI nr
Match:
XP_022938519.1 (uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_022938520.1 uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata])
HSP 1 Score: 1628.6 bits (4216), Expect = 0.0e+00
Identity = 920/1417 (64.93%), Postives = 1048/1417 (73.96%), Query Frame = 0
Query: 16 LSASDYDASLPIKKRRFLVVQSPPSPSKEISSFHPDGNIMKTELPSILKDVSLLHTNENF 75
+S SDY+A +PIKKRRF ++QSP P KEISS LP + +++
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSP--PPKEISS-----------LPLV---------DDSI 60
Query: 76 IKSEQPGLSMMTLDSSSSAVTSSGLSNRNQDCVSDRNKEKTDTDSCYLDMVQSDVGMPGV 135
K ++P +S S+SS +T+S S + +S K +D C ++MVQS +G V
Sbjct: 61 AKVDEPCVSDGPTVSNSSTITTSEFSEKK---ISFSEDGKRKSDLCNMNMVQSIIGPSRV 120
Query: 136 KFQEPSLGGRASFDGCVEYEDKSLVTEKHA--VHASPEIRGRLELSSTSLDSDP--LADN 195
+FQE A GCVE ++ ++ E HA +H PE + L + +S+P A+
Sbjct: 121 EFQE----NDACSTGCVENKETCMMNENHALVLHEKPEFK----LPHSDANSNPGLCAEK 180
Query: 196 KEEEIDVKKPKEKCSSPVRQVEGGAEVSVVLMGHVDPKLVPEESDLNFLKQNSLEPELLY 255
+ +EID +K ++ + AE+SV H+ P V E SDL LKQ +LEP LL
Sbjct: 181 ESDEID-RKQLDRLEFSTSVAKKEAELSVGSKEHLVPNSVLEGSDLKSLKQINLEPVLLN 240
Query: 256 LSLNKQGSSTHCVKGNVGSDYDGSPLRSNRENWDLNTSMEFWEVCASDDPPVHVP-IQTN 315
LSL+K+GS + NVGS YDGS SNRENWDLNTSMEFWE C+S DPP HVP +QTN
Sbjct: 241 LSLSKEGSLDQRLTVNVGSSYDGSIQESNRENWDLNTSMEFWEGCSSGDPPEHVPAVQTN 300
Query: 316 TIVTTHRCSTEMVKTDILPGKLTLLDHNDHLHLSLSSSDLRHVTSQEQSPFAKLDFRKTN 375
TIVTTHR STEMV TD LPGKLT LD +DHLHLSLSSSD RHV SQEQS F KL FRKT+
Sbjct: 301 TIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHLSLSSSDHRHVISQEQSSFVKLGFRKTS 360
Query: 376 PSISLPGRGMKFDDLNGALKVVKPEPFVEGSKLEFRSDETNELGLQNGTVVKRESDDQCN 435
PS+S GRG++FDDLNGALKVVKPEPFVE SKLE +SD N LGL + +VKRE
Sbjct: 361 PSLSSTGRGLQFDDLNGALKVVKPEPFVEASKLESKSDGVNVLGLSDSAIVKREF----- 420
Query: 436 LQIPNASDICSPMDIVKAKSTKSESVYESKQEALKTLGSRLDLVEKQVVPEVDNSRP--- 495
LQIPN SDI PM+ VKA+S SE YESKQEALKTLG RLDLV KQV+PEV +S P
Sbjct: 421 LQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLGGRLDLVAKQVLPEVGSSCPAPM 480
Query: 496 --VAEMSGTTGD---LDLIRDTDLSNHLELQTPSEGHLDMKVHQGEYGCGDEHVNSEMTD 555
VAEM+ + DLI D D+SNH ELQTP++ HL++ VH+G Y E ++SEMTD
Sbjct: 481 PFVAEMTEAARNSCSTDLITDGDMSNHPELQTPTKEHLNLNVHEGAYRFAGELIDSEMTD 540
Query: 556 ISEDPCSKDSSSPFIKPVATPVVAEMLEAARNPSCSTDLIIDGDMSNHSELLTPTEGPLI 615
+S+DP SKD +SP IKP+A P RNPS + D II+ +MS+ SEL PT GPL
Sbjct: 541 VSKDPGSKDFNSPIIKPIAMP---------RNPSRTNDSIIEANMSSPSELHIPTTGPLN 600
Query: 616 GKVHQGGYGCGGGLMNLKMMDVSKDPGSKDSTSCVIKSSIIDDKNKNNPEWRPLKLTNEQ 675
KVHQ GYGC GGL+N M DVSKD SKDS+S VIK I++D+N+NNP WRP TNEQ
Sbjct: 601 TKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIKPVIVEDENQNNPLWRPSTHTNEQ 660
Query: 676 CSSLQGGEKCLVSDEEKISISADLLEEDPYSSEYESDCRHDVNEAMDAVDMDIEEDYEDG 735
CSSLQGGE+ V+DEEKIS+SADLLEEDPYSSEYESD + DVNEAMD VD D+EEDYEDG
Sbjct: 661 CSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESDGKLDVNEAMDTVDNDVEEDYEDG 720
Query: 736 EVREPILKIQVESNVCEQKEVKSFDHGDCCNGLPSID-CSSLVFVKQENKSDIHDVKLED 795
EVREP L QVES++CE K+VK+FDH D NGLP D CSSLV VKQENK +I DVK ED
Sbjct: 721 EVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSDCCSSLVSVKQENKLEILDVKRED 780
Query: 796 NLHSVTSNQSSEQEKLKELLVEEDIIRVCSNKANKAIKATDTRQMSQCEEIDALEDQEIS 855
NLHSVTSNQSSEQE+ KEL VEE RVC NKANKA + A+EDQE S
Sbjct: 781 NLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA-------------KTSAIEDQETS 840
Query: 856 SEKATNGIEEPTVKVSQSDVVNDKTVDFVQNDGLALPNVKEPLTNDDVTNDFTCGSVHSQ 915
EKATNGIEE VSQSD KTVD V+N+ ALPNV EPL +DDVT+D T GS HS+
Sbjct: 841 PEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALPNV-EPLNDDDVTDDITRGSKHSR 900
Query: 916 IVNACQASTSSSPSKTRSNLVRSVLTQTDREQIPDMGHEGGKLQPQGRDEPYSDVSQKFY 975
IV+ C+ STSS PSKTRS+L RSVLTQTDRE+IPDM HEG KL PQGRDEPY DV Q+FY
Sbjct: 901 IVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDMAHEGEKLHPQGRDEPYRDVFQRFY 960
Query: 976 VNRQQNLSPQTNFTRRRGRFTIQINSLQGEWDFNPTISPGIYNDQ---IPPYDAPKRKYM 1035
VNR QNLSPQTNF+RRRGRFTI+INS+QGEWDFNPTISPG Y+D PPYDA +RKYM
Sbjct: 961 VNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPTISPGNYSDHQVPPPPYDARRRKYM 1020
Query: 1036 SVVSDDGIDQNYYKIKPDGPFRGSG-HRGRQTLDDEGPFFCH---RRKSPGRRDG-PPVR 1095
VSDD IDQN+YK+KPD PFR +G HRGRQ LDDEGP FCH RRKSPGRRDG PPVR
Sbjct: 1021 PAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEGPLFCHMASRRKSPGRRDGPPPVR 1080
Query: 1096 GIKMVQRIPRNISPGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDR- 1155
G+KMV R+PRNISP RC RE GSEL VGPRHGEKF+RT EDE MDP+YA PQP FE DR
Sbjct: 1081 GVKMVHRMPRNISPSRCNRERGSEL-VGPRHGEKFMRTFEDEAMDPLYAHPQPSFEVDRS 1140
Query: 1156 PFIRDRRNFSIQRKSFPRVDSRSPGRSRGRSPGQWFPSKRKHDRFFGHPEMGRRS-PPGY 1215
PFIRDRRNF IQRKSF RVDS+SPGRSRGRSP QWFPSKRK +RFFGHPEM RRS PPGY
Sbjct: 1141 PFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSKRKSERFFGHPEMARRSPPPGY 1200
Query: 1216 RMRSPDHSPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRIPFR 1275
RMRSPD P IHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMR GIRSRNRT+R+ FR
Sbjct: 1201 RMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRPGIRSRNRTERMSFR 1260
Query: 1276 NRRFEDMDPRD-RIESSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCND 1335
NRRFEDMDPRD RIES+EYFDGPVHP QLNEL DGND DRRRF DRHEHLH F+ CND
Sbjct: 1261 NRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDDDRRRFSDRHEHLHQFRPQCND 1320
Query: 1336 SDGEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHE 1395
SDGE Y ND +E RP+RYC EDE EFHERGK+REREFDRRVKNQP NL R+T VIEEHE
Sbjct: 1321 SDGENYRNDADERARPYRYCTEDEEEFHERGKMREREFDRRVKNQPENLGRRT-VIEEHE 1351
Query: 1396 VVEDYR--HGRQFWNE------HHGFEEDISRMKRKR 1400
VE+YR HGRQ WNE HHGF EDISRMKRKR
Sbjct: 1381 -VEEYRHGHGRQMWNEHHHHHHHHGF-EDISRMKRKR 1351
BLAST of Lcy09g005930 vs. TAIR 10
Match:
AT5G13590.1 (unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; Bacteria - 8; Metazoa - 80; Fungi - 5; Plants - 17; Viruses - 0; Other Eukaryotes - 40 (source: NCBI BLink). )
HSP 1 Score: 105.9 bits (263), Expect = 2.6e-22
Identity = 113/359 (31.48%), Postives = 160/359 (44.57%), Query Frame = 0
Query: 1029 NYYKIKPDGPFRGSGHRGRQTLDDEGPFFCHR--RKSPGRRDGPPVRGIKMVQRIPRNIS 1088
N+ + G F + RGR+ +D + H R+SP R P N
Sbjct: 818 NHGNTRRGGAFMSNFQRGRRPANDGVTPYAHSFPRRSPS----------FSYNRGPTN-- 877
Query: 1089 PGRCIREPGSELVVGPRHGEKFLRTLEDETMDPIYARPQPPFEGDRPFIRDRRNF-SIQR 1148
+ + G R GEKF R L+ +P++ Q P+ G F R R F + +
Sbjct: 878 ------KEDTSAFHGFRDGEKFTRGLQCNNTEPLFMNHQRPYRGRSGFARGRTKFVNNPK 937
Query: 1149 KSFPRVDSRSPGRSRGRSPGQWFPSK-RKHDRFFGHPEMG-RRSPPGY---RMRSPDHSP 1208
+ FP SRSP RSR RS G + R + F GH + RRSP GY RM SPDHS
Sbjct: 938 RDFPGFRSRSPVRSRERSDGSSSSFRNRSQEEFSGHTDFSHRRSPSGYKVERMSSPDHS- 997
Query: 1209 PIHGDMPVRRHGF-PFPSLPPNDLRDMGSARDHGHM------RSGIRSRNRTDRIPFRNR 1268
+M VRRH PF P N R G AR G++ R G R +D + RN
Sbjct: 998 GYSREMVVRRHNSPPFSHRPSNAGRGRGYARGRGYVRGRGYGRDGNSFRKPSDHVVHRNH 1057
Query: 1269 -RFEDMDPRDRIE-SSEYFDGPVHPDQLNELAGDGNDGDRRRFPDRHEHLHPFQLPCNDS 1328
++DPR+R++ S ++F+G +H +E G + +RRRF RH+ P ++
Sbjct: 1058 GNMNNLDPRERVDYSDDFFEGQIH----SERFGVDVNAERRRFGYRHDGTSSSFRPSFNN 1117
Query: 1329 DGEIYHNDLNEGHRPFRYCAEDEPEFHERGKLREREFDRRVKNQPGNLPRQTGVIEEHE 1371
DG N N+ R+ + + E+G L E D KN N +T +EE E
Sbjct: 1118 DGCAPTNVENDPD-AVRFQQDPRIKIEEQGSL--MEIDGENKNSTENASGRTKNMEEEE 1150
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A6J1JYG4 | 0.0e+00 | 65.94 | uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FEB1 | 0.0e+00 | 64.93 | uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1JUI7 | 0.0e+00 | 64.52 | uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=L... | [more] |
A0A6J1FDD8 | 0.0e+00 | 63.58 | uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN... | [more] |
A0A6J1BWB0 | 0.0e+00 | 60.06 | uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
XP_038890337.1 | 0.0e+00 | 67.07 | uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_03889033... | [more] |
XP_038890343.1 | 0.0e+00 | 65.65 | uncharacterized protein LOC120079942 isoform X2 [Benincasa hispida] | [more] |
XP_022992789.1 | 0.0e+00 | 65.94 | uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790... | [more] |
XP_023550091.1 | 0.0e+00 | 65.77 | uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo] | [more] |
XP_022938519.1 | 0.0e+00 | 64.93 | uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_0229385... | [more] |
Match Name | E-value | Identity | Description | |
AT5G13590.1 | 2.6e-22 | 31.48 | unknown protein; Has 150 Blast hits to 121 proteins in 42 species: Archae - 0; B... | [more] |