MC06g0751 (gene) Bitter gourd (Dali-11) v1

Overview
Name: MC06g0751
Type: gene
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Regulator of Vps4 activity in the MVB pathway protein
Location: MC06: 6255280 .. 6260455 (-)
RNA-Seq Expression: MC06g0751
Synteny: MC06g0751
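
The location field above packs the chromosome, start, end, and strand into one string. A minimal sketch of how it could be parsed is shown here. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Parse a location string such as 'MC06: 6255280 .. 6260455 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "seqid": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),
        "length": end - start + 1,
    }

loc = parse_location("MC06: 6255280 .. 6260455 (-)")
print(loc)
```

The `(-)` strand means the gene is annotated on the reverse strand of MC06, so a sequence extracted from the chromosome at these coordinates would need to be reverse-complemented to read in the gene's own orientation.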
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GGAATATTTTGTCTTATAATAAAGGTGGATGGATGAAATTAGAAAGGAGTAATATGGTAAAAAAAAAATCCAAAGAATCTAGAAAATATATATTTTAATATAGGAATATAATTTACTACGTGGAATTTGCAATTGAATTCCTAAAGGAAATTGAGAAGGGTGATTTGGTCAATATAAAAGTCAGCGAGCCATGATGGGTGGGTTACCTTGAACCTTCCCGCACCTAGGTCTTGCCGATCTGCCACGCTTAAAGTTTGATCAGTACACGAAAAACAAAGTCCCTTTCAGAGGCTCTTTCTTCGGCCGGAGCTCCGTCGCGATTCCTTGTCGTCCTCCGTCCTTTCACTTTCTCTCCATTACCATCTCTTCTTCGATCGTATAACTTGATTTTTATTTCTCAACATAATTCAAATTCCTATTCTTCGTCTCTCGGAGGGAGGATTTTTCTTTTTGATGGTTTTTTTGGCGGATAATTGTCCCGCTAGTTGAAGATTTTCCCGCGTTGAGATCTGGTTTTGGATTCCTACAACGATGCTGCACAAGAGCTTCAAGCCTGCAAAATGGTATGGTTCCTTCTTCGAACGCGTTTTAAGTGTCTGATACTTTTTGGTTTCTGCAAAAAATCGACTCGTTGTCGATTTTTTGTTCTGCGGGTGATTGCGAACGAGATGGTCTAGGAATTCCGTGGATTTTTGTCTCTTTGGATTTAATTTCAGCGTCGTTATTGGATTGTTTGACTTAACTGTTCGATCAATCAATCGAATTTGAGGGAAATTCGTTTTTCATTTGAATTATGTTGAATTGGCTCGGTGCAGCAAGACATCGTTGAAGTTAGCGGTTTCGCGGATTAAGCTTTTGAGGAATAAGAAAGAAGTCCAGGTCAGGCAGCTGAGGGGAGAATTGGCGAAGTTGCTCGAGGCTGGACAGGACCAGACTGCTAGAATTCGGGTATTGTATTGTTATTCTTCATCTCTGAAATTGATTGTATGAATATGTTCTTGGATTTGCTTGTGTCTTTATCAAGCCATCACTGTCTTAGAGACCTAATGTCATATCGGAAGATGAACTCTGTCCTCTGTATGATTCACGCGTCAACTTGATGAATTCTGCTCGAACTCCCATTTTCTGAAGAATATATGTTTCCGTCTTTGTTTCAGGTTGAACACGTTGTTAGAGAAGAGAAATCAAAGGCAGCTTATGAACTCATTGAAATATTCTGTGAACTAATTGTGGCACGCATGCCAATGATCGAGTCCCAAAAGTATGTCAACTTTTTTTAGAAATAATTTTTTTCGATTTCTCAATAATTGTGTACTTTAGTTTCAGAATATGAATTTGTATGTCAATTCCATATGAATTCTGATTCCCCCACCCCACCCCCACCAGTGATGTTTATATTAACTTCTGATTTGCATTAAGAAACAGTAATTTCGAAGAATGACTATGACTTATCCTTATGTGTAAATTTGTTAATTCCATCTATGTTAACACGTCCAGAAATTGCCCGATAGACTTGAAGGAAGCAATCGCGAGTGTTATATTTGCATCGCCCAGATGTGCGGATATTCCAGAGCTTATGGATGTTCGAAAACACTTCAAAGCAAAATATGGAAAAGAATTCGTCTCAGCAGCAGTGGAGCTACGCCCAGAATGTGGAGCGAATCGCATGGTAAATGTGTTAGCTGTTTTGCCGTTCATGAGTTTGGGGATTGTGAATTCTATCTATGAAAACATTTCAGTCCCATTGACTTCCCTTTTCATTTTATTACAACTTTCCAGTTGGTTGAAAAAATGTCTGCTAAAGCACCAGATGGACAGACCAAACTTAAAATCTTGACTGCAATTGCCGAGGAACATAAAATCAAATGGGACCCCAAAACATTTTGTGATAGCAGCAACCCTCCAGCTGACTTGTTGGTAAAATTTATCGTTATCAATTTTTTTAATAAATGATTTCCTTTACTGTTCAAGGTGAATTTTTATCGTAGTGAGAATTTGCC
TGAGTTATTTTATTTGAATGTTGAATTTCCATTTTATTAATACGAGTTATGAACTTTACCGAGCAGAATGGACCAAATACTTTTGGACAAGCCAGTCAAATACAGAGGGGGGAGCCTATTGGTGGCCAACCCTCACTTGATCACAATAACAGGGGATCTGCTAGTGTTCAAGTTCCATCTAAATCAGATGAGGGGCATCGAATACCTGAAAAGTCCCCTGAGCACAATTTGAGACCAACGCTTCATCCCCAGAAATCCAACTTTACTAATGACTATACCAATCAAAGCAACATCACTGGCCATCGTATTCCAGAAACAAGATCTTCTGGTGCGTGAAAAATACTTTGGATTTGATAATCTTCATTTTTTCCCTCTCTCTATATAGAAGATCATTACTTTGGCTTGGATTTGTAACAATCAGATATATATTGATGCTCTGATATGCAGAAATGAATGCAGAGGGGACGCATAGGCACACAAATTCTGGTGATCAGAATAATTACTCTTCAGGCAGGCAGCATTGGAATATGGATTTCAAGGATGCTACATCTGCGGCTAAGGCAGCTGCTGAATCTGCAGAATTAGCTAGCCTTGCTGCCCGAGCTGCTGCAGAACTTTCTAGTCGTGGAAACATATCCCAATCGTCCTCTTCAGGGTTTCAAAAATCTTCATCTTATAATTTAAGGGCTGAAGGACCTCAAGAATATGCTAACTTAAATTTGCAAGATCAACAACTTCCTAAAGACCAAGTTGTCAGTGCCCCTCATAGAAGTTCTATCATGGATGATAATAGGAGAGAGAATGATGCAAGAAGATTTATGGGGGACGATGACAAGAAGTTACGTTATCCGTCTTCGGGTGCTAGTAATATTGATGCTAACGCATCTGGAACTAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCCGAATCTGGGTTTAACGACTCTCTTGGTAGTAGTGCAAGTGTGGAGAAACAACCCAGAAAATTTGATGCTAGCACATCTGTAAATAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCCGAACCTGGGTTTAGCGACTCTCTTCATAGTGCAACTGTGGAGAAACAACCCAAAGACTATGATTCTAACACATATGTAACTAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCTGAACCTGGGTTTGGTGACTCCCATAGTAGTGCCAGTATGGAGAAACAACCCAGAAACTTTGATGTAGAATATGTTAGTGATCGGCCATCTGGTACAGGTTCTGAAAGAACTAGTTACTATGAGGATGTGAGGATTGGAAATGATTCAAATAGAGTGCCTTCCTACGAGAAGCCAATGATGGATACTTATGACAACCCTTTTGCCATGGATAAACCTAATGATAGTGAGACTGTTGATACAAGTTTTAATGATCATGCCAGTGTGGTCTTTGATGATTATGGTCCAGATGATGATTATGTCCCAGACTATGATTACCAGATAAGAGAATCTATTCTGGAGCTTTCATCCCCTGAGGGAACGGTGCCGATAAACTCATCAGCGACAGATGATACTTGGATCTTCAAGCAAAACAAGAATGATCCTCCTGAGAAGTCTGTTTCACACTCGCAAATTTCTGATGAGCGTACTTCTTTGTTTGCTGGAAATTCAAGAAGTTTTGAAGATCCTTCCCATTCTGATGACTTGTTGCCGGCAACTTTTGACCATTCAGATGGACCTAGTTCTGAAAGTGAGGAAGAGGTGAAAGAATCTGGGATCATTGGTAAAGAAGACTCTATCGAATTTTCTAAGAAACAGAATTTGTACTCTGAAAAACCAGAATGGATTCAGGACATTAGCCATGTATCATTGGGATCTTCAGATGAGGAGAACAAAAGCATGCCATCTCATCGCCTCTCTTCCGACATTCCCCTTGTCCATGGATCCAAGGAGAAAGCCAGCCCACCAAGCTCTCCTGATATCATACAAGATACTAAA
ATACTAGAAGAATCAACTTCAGAAGTCTACAGTCCCTTGAGTTTTGGGAAACTGAAGGGTGGCCTAAGAAATCAAAAGAGTAATAGGCCTCCATATGCAATCAATTCATCAACAAGTGATTTACCATCCAAACAAACATGTGGAAACGATGACGCAAGAACTGAGCAGTCTACCTCAATTCCTTCTTCTACAGCTAGGACTTCGTTTAGATCAAATGCTTCCAGTGAAGGGACATATGGTAGAAGTGTAGAAGAAAAACCAGATGAAGAGAAGGGCTCACAGGCCAAGCTTAATAGCAATTATTCCAATCTTGAAGAGTCTAAGGACAGGTCCTCTGATTATACTCTTAGAAGCGATGAAGAGTCATACAGTGACAAAATGCGCAATGAAATATCTAAGAAGCCAATCCCAACAAGAGTTGCAGTTAAGTATCCTGGTTTCCACGATGATGATGATTCTGAGGAAGATTCACATACACAAAATGTGAAAAATAGTCCTCATCGACTTATTGGACTTTCTAGACGGACCACAGCATCTCCTAAAACTCCGAGTTCGTATGTGGAGGACTCATATGGGACACCCACGAGTCATGACGACGTAACCGAGCAGAAAGCTTCAAGGAGTTACTATTCATCCCCAGCTCCATTGAAGGCGAAGACTGGAACAAGAACCTCCAGTCGCTTAGAAAGTTCAGAGCAGCCCCAATCATCTAAACCTTTCAAGCAAACTCCTGAAACTAAGATGCCCTTGAATGAGGAAAGGTTGAAATCTTCTGCAAAGGAACAGCAATCCAATTATCCTCCAGAGTTAGATAGGCAAGGAAATTCTGAGAGTTCAAAGTTTTCTTCTGCAAGGGAGACGACTACAGCTTCAGTGAAGACTTGGGCTCAAACAAGAAACTCTCATTACTTGGCAAACTCTGAGCAGCCGACTCAATCGACGAAACCTTCCAAACCAATTCCTGAAAGTAAAAGGTCTTTCCATGAAGAAAGATTAACATCTTCTACAAAGGAACTACCATCCAATCCTTCCCCGGAAGTAGAAACACAAGGCGACTCCGAGAGTTCAAAAAGGGAGAAGATGAAAGCAGTTCAGAAAGCTAGTCATGTTCACCCAAAGCTTCCTGATTACGACGACTTTGCAGCACACTTCCGTTCACTCCGACAGAATTACAAG

mRNA sequence

GGAATATTTTGTCTTATAATAAAGGTGGATGGATGAAATTAGAAAGGAGTAATATGGTAAAAAAAAAATCCAAAGAATCTAGAAAATATATATTTTAATATAGGAATATAATTTACTACGTGGAATTTGCAATTGAATTCCTAAAGGAAATTGAGAAGGGTGATTTGGTCAATATAAAAGTCAGCGAGCCATGATGGGTGGGTTACCTTGAACCTTCCCGCACCTAGGTCTTGCCGATCTGCCACGCTTAAAGTTTGATCAGTACACGAAAAACAAAGTCCCTTTCAGAGGCTCTTTCTTCGGCCGGAGCTCCGTCGCGATTCCTTGTCGTCCTCCGTCCTTTCACTTTCTCTCCATTACCATCTCTTCTTCGATCGTATAACTTGATTTTTATTTCTCAACATAATTCAAATTCCTATTCTTCGTCTCTCGGAGGGAGGATTTTTCTTTTTGATGGTTTTTTTGGCGGATAATTGTCCCGCTAGTTGAAGATTTTCCCGCGTTGAGATCTGGTTTTGGATTCCTACAACGATGCTGCACAAGAGCTTCAAGCCTGCAAAATGCAAGACATCGTTGAAGTTAGCGGTTTCGCGGATTAAGCTTTTGAGGAATAAGAAAGAAGTCCAGGTCAGGCAGCTGAGGGGAGAATTGGCGAAGTTGCTCGAGGCTGGACAGGACCAGACTGCTAGAATTCGGGTTGAACACGTTGTTAGAGAAGAGAAATCAAAGGCAGCTTATGAACTCATTGAAATATTCTGTGAACTAATTGTGGCACGCATGCCAATGATCGAGTCCCAAAAAAATTGCCCGATAGACTTGAAGGAAGCAATCGCGAGTGTTATATTTGCATCGCCCAGATGTGCGGATATTCCAGAGCTTATGGATGTTCGAAAACACTTCAAAGCAAAATATGGAAAAGAATTCGTCTCAGCAGCAGTGGAGCTACGCCCAGAATGTGGAGCGAATCGCATGTTGGTTGAAAAAATGTCTGCTAAAGCACCAGATGGACAGACCAAACTTAAAATCTTGACTGCAATTGCCGAGGAACATAAAATCAAATGGGACCCCAAAACATTTTGTGATAGCAGCAACCCTCCAGCTGACTTGTTGAATGGACCAAATACTTTTGGACAAGCCAGTCAAATACAGAGGGGGGAGCCTATTGGTGGCCAACCCTCACTTGATCACAATAACAGGGGATCTGCTAGTGTTCAAGTTCCATCTAAATCAGATGAGGGGCATCGAATACCTGAAAAGTCCCCTGAGCACAATTTGAGACCAACGCTTCATCCCCAGAAATCCAACTTTACTAATGACTATACCAATCAAAGCAACATCACTGGCCATCGTATTCCAGAAACAAGATCTTCTGAGGGGACGCATAGGCACACAAATTCTGGTGATCAGAATAATTACTCTTCAGGCAGGCAGCATTGGAATATGGATTTCAAGGATGCTACATCTGCGGCTAAGGCAGCTGCTGAATCTGCAGAATTAGCTAGCCTTGCTGCCCGAGCTGCTGCAGAACTTTCTAGTCGTGGAAACATATCCCAATCGTCCTCTTCAGGGTTTCAAAAATCTTCATCTTATAATTTAAGGGCTGAAGGACCTCAAGAATATGCTAACTTAAATTTGCAAGATCAACAACTTCCTAAAGACCAAGTTGTCAGTGCCCCTCATAGAAGTTCTATCATGGATGATAATAGGAGAGAGAATGATGCAAGAAGATTTATGGGGGACGATGACAAGAAGTTACGTTATCCGTCTTCGGGTGCTAGTAATATTGATGCTAACGCATCTGGAACTAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCCGAATCTGGGTTTAACGACTCTCTTGGTAGTAGTGCAAGTGTGGAGAAACAACCCAGAAAATTTGATGCTAGCACATCTGTAAATAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCCGAACCTGGGTTTAGCGACTCTCTTCA
TAGTGCAACTGTGGAGAAACAACCCAAAGACTATGATTCTAACACATATGTAACTAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCTGAACCTGGGTTTGGTGACTCCCATAGTAGTGCCAGTATGGAGAAACAACCCAGAAACTTTGATGTAGAATATGTTAGTGATCGGCCATCTGGTACAGGTTCTGAAAGAACTAGTTACTATGAGGATGTGAGGATTGGAAATGATTCAAATAGAGTGCCTTCCTACGAGAAGCCAATGATGGATACTTATGACAACCCTTTTGCCATGGATAAACCTAATGATAGTGAGACTGTTGATACAAGTTTTAATGATCATGCCAGTGTGGTCTTTGATGATTATGGTCCAGATGATGATTATGTCCCAGACTATGATTACCAGATAAGAGAATCTATTCTGGAGCTTTCATCCCCTGAGGGAACGGTGCCGATAAACTCATCAGCGACAGATGATACTTGGATCTTCAAGCAAAACAAGAATGATCCTCCTGAGAAGTCTGTTTCACACTCGCAAATTTCTGATGAGCGTACTTCTTTGTTTGCTGGAAATTCAAGAAGTTTTGAAGATCCTTCCCATTCTGATGACTTGTTGCCGGCAACTTTTGACCATTCAGATGGACCTAGTTCTGAAAGTGAGGAAGAGGTGAAAGAATCTGGGATCATTGGTAAAGAAGACTCTATCGAATTTTCTAAGAAACAGAATTTGTACTCTGAAAAACCAGAATGGATTCAGGACATTAGCCATGTATCATTGGGATCTTCAGATGAGGAGAACAAAAGCATGCCATCTCATCGCCTCTCTTCCGACATTCCCCTTGTCCATGGATCCAAGGAGAAAGCCAGCCCACCAAGCTCTCCTGATATCATACAAGATACTAAAATACTAGAAGAATCAACTTCAGAAGTCTACAGTCCCTTGAGTTTTGGGAAACTGAAGGGTGGCCTAAGAAATCAAAAGAGTAATAGGCCTCCATATGCAATCAATTCATCAACAAGTGATTTACCATCCAAACAAACATGTGGAAACGATGACGCAAGAACTGAGCAGTCTACCTCAATTCCTTCTTCTACAGCTAGGACTTCGTTTAGATCAAATGCTTCCAGTGAAGGGACATATGGTAGAAGTGTAGAAGAAAAACCAGATGAAGAGAAGGGCTCACAGGCCAAGCTTAATAGCAATTATTCCAATCTTGAAGAGTCTAAGGACAGGTCCTCTGATTATACTCTTAGAAGCGATGAAGAGTCATACAGTGACAAAATGCGCAATGAAATATCTAAGAAGCCAATCCCAACAAGAGTTGCAGTTAAGTATCCTGGTTTCCACGATGATGATGATTCTGAGGAAGATTCACATACACAAAATGTGAAAAATAGTCCTCATCGACTTATTGGACTTTCTAGACGGACCACAGCATCTCCTAAAACTCCGAGTTCGTATGTGGAGGACTCATATGGGACACCCACGAGTCATGACGACGTAACCGAGCAGAAAGCTTCAAGGAGTTACTATTCATCCCCAGCTCCATTGAAGGCGAAGACTGGAACAAGAACCTCCAGTCGCTTAGAAAGTTCAGAGCAGCCCCAATCATCTAAACCTTTCAAGCAAACTCCTGAAACTAAGATGCCCTTGAATGAGGAAAGGTTGAAATCTTCTGCAAAGGAACAGCAATCCAATTATCCTCCAGAGTTAGATAGGCAAGGAAATTCTGAGAGTTCAAAGTTTTCTTCTGCAAGGGAGACGACTACAGCTTCAGTGAAGACTTGGGCTCAAACAAGAAACTCTCATTACTTGGCAAACTCTGAGCAGCCGACTCAATCGACGAAACCTTCCAAACCAATTCCTGAAAGTAAAAGGTCTTTCCATGAAGAAAGATTAACATCTTCTACAAAGGAACTACCATCCAATCCTTCCCCGGAAGTAGAAACACAAGGCGACTCCGAGAGTTCAAAAAGGGAGAAGATGAAAG
CAGTTCAGAAAGCTAGTCATGTTCACCCAAAGCTTCCTGATTACGACGACTTTGCAGCACACTTCCGTTCACTCCGACAGAATTACAAG

Coding sequence (CDS)

ATGCTGCACAAGAGCTTCAAGCCTGCAAAATGCAAGACATCGTTGAAGTTAGCGGTTTCGCGGATTAAGCTTTTGAGGAATAAGAAAGAAGTCCAGGTCAGGCAGCTGAGGGGAGAATTGGCGAAGTTGCTCGAGGCTGGACAGGACCAGACTGCTAGAATTCGGGTTGAACACGTTGTTAGAGAAGAGAAATCAAAGGCAGCTTATGAACTCATTGAAATATTCTGTGAACTAATTGTGGCACGCATGCCAATGATCGAGTCCCAAAAAAATTGCCCGATAGACTTGAAGGAAGCAATCGCGAGTGTTATATTTGCATCGCCCAGATGTGCGGATATTCCAGAGCTTATGGATGTTCGAAAACACTTCAAAGCAAAATATGGAAAAGAATTCGTCTCAGCAGCAGTGGAGCTACGCCCAGAATGTGGAGCGAATCGCATGTTGGTTGAAAAAATGTCTGCTAAAGCACCAGATGGACAGACCAAACTTAAAATCTTGACTGCAATTGCCGAGGAACATAAAATCAAATGGGACCCCAAAACATTTTGTGATAGCAGCAACCCTCCAGCTGACTTGTTGAATGGACCAAATACTTTTGGACAAGCCAGTCAAATACAGAGGGGGGAGCCTATTGGTGGCCAACCCTCACTTGATCACAATAACAGGGGATCTGCTAGTGTTCAAGTTCCATCTAAATCAGATGAGGGGCATCGAATACCTGAAAAGTCCCCTGAGCACAATTTGAGACCAACGCTTCATCCCCAGAAATCCAACTTTACTAATGACTATACCAATCAAAGCAACATCACTGGCCATCGTATTCCAGAAACAAGATCTTCTGAGGGGACGCATAGGCACACAAATTCTGGTGATCAGAATAATTACTCTTCAGGCAGGCAGCATTGGAATATGGATTTCAAGGATGCTACATCTGCGGCTAAGGCAGCTGCTGAATCTGCAGAATTAGCTAGCCTTGCTGCCCGAGCTGCTGCAGAACTTTCTAGTCGTGGAAACATATCCCAATCGTCCTCTTCAGGGTTTCAAAAATCTTCATCTTATAATTTAAGGGCTGAAGGACCTCAAGAATATGCTAACTTAAATTTGCAAGATCAACAACTTCCTAAAGACCAAGTTGTCAGTGCCCCTCATAGAAGTTCTATCATGGATGATAATAGGAGAGAGAATGATGCAAGAAGATTTATGGGGGACGATGACAAGAAGTTACGTTATCCGTCTTCGGGTGCTAGTAATATTGATGCTAACGCATCTGGAACTAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCCGAATCTGGGTTTAACGACTCTCTTGGTAGTAGTGCAAGTGTGGAGAAACAACCCAGAAAATTTGATGCTAGCACATCTGTAAATAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCCGAACCTGGGTTTAGCGACTCTCTTCATAGTGCAACTGTGGAGAAACAACCCAAAGACTATGATTCTAACACATATGTAACTAATTTCAATGCATCAGATAGATACTCTTTCAAGAATTCATCTGAACCTGGGTTTGGTGACTCCCATAGTAGTGCCAGTATGGAGAAACAACCCAGAAACTTTGATGTAGAATATGTTAGTGATCGGCCATCTGGTACAGGTTCTGAAAGAACTAGTTACTATGAGGATGTGAGGATTGGAAATGATTCAAATAGAGTGCCTTCCTACGAGAAGCCAATGATGGATACTTATGACAACCCTTTTGCCATGGATAAACCTAATGATAGTGAGACTGTTGATACAAGTTTTAATGATCATGCCAGTGTGGTCTTTGATGATTATGGTCCAGATGATGATTATGTCCCAGACTATGATTACCAGATAAGAGAATCTATTCTGGAGCTTTCATCCCCTGAGGGAACGGTGCCGATAAACTCATCAGCGACAGATGATACTTGGATCTTCAAGCAAAACAAGAATGATCCTCCTGAGAAGTC
TGTTTCACACTCGCAAATTTCTGATGAGCGTACTTCTTTGTTTGCTGGAAATTCAAGAAGTTTTGAAGATCCTTCCCATTCTGATGACTTGTTGCCGGCAACTTTTGACCATTCAGATGGACCTAGTTCTGAAAGTGAGGAAGAGGTGAAAGAATCTGGGATCATTGGTAAAGAAGACTCTATCGAATTTTCTAAGAAACAGAATTTGTACTCTGAAAAACCAGAATGGATTCAGGACATTAGCCATGTATCATTGGGATCTTCAGATGAGGAGAACAAAAGCATGCCATCTCATCGCCTCTCTTCCGACATTCCCCTTGTCCATGGATCCAAGGAGAAAGCCAGCCCACCAAGCTCTCCTGATATCATACAAGATACTAAAATACTAGAAGAATCAACTTCAGAAGTCTACAGTCCCTTGAGTTTTGGGAAACTGAAGGGTGGCCTAAGAAATCAAAAGAGTAATAGGCCTCCATATGCAATCAATTCATCAACAAGTGATTTACCATCCAAACAAACATGTGGAAACGATGACGCAAGAACTGAGCAGTCTACCTCAATTCCTTCTTCTACAGCTAGGACTTCGTTTAGATCAAATGCTTCCAGTGAAGGGACATATGGTAGAAGTGTAGAAGAAAAACCAGATGAAGAGAAGGGCTCACAGGCCAAGCTTAATAGCAATTATTCCAATCTTGAAGAGTCTAAGGACAGGTCCTCTGATTATACTCTTAGAAGCGATGAAGAGTCATACAGTGACAAAATGCGCAATGAAATATCTAAGAAGCCAATCCCAACAAGAGTTGCAGTTAAGTATCCTGGTTTCCACGATGATGATGATTCTGAGGAAGATTCACATACACAAAATGTGAAAAATAGTCCTCATCGACTTATTGGACTTTCTAGACGGACCACAGCATCTCCTAAAACTCCGAGTTCGTATGTGGAGGACTCATATGGGACACCCACGAGTCATGACGACGTAACCGAGCAGAAAGCTTCAAGGAGTTACTATTCATCCCCAGCTCCATTGAAGGCGAAGACTGGAACAAGAACCTCCAGTCGCTTAGAAAGTTCAGAGCAGCCCCAATCATCTAAACCTTTCAAGCAAACTCCTGAAACTAAGATGCCCTTGAATGAGGAAAGGTTGAAATCTTCTGCAAAGGAACAGCAATCCAATTATCCTCCAGAGTTAGATAGGCAAGGAAATTCTGAGAGTTCAAAGTTTTCTTCTGCAAGGGAGACGACTACAGCTTCAGTGAAGACTTGGGCTCAAACAAGAAACTCTCATTACTTGGCAAACTCTGAGCAGCCGACTCAATCGACGAAACCTTCCAAACCAATTCCTGAAAGTAAAAGGTCTTTCCATGAAGAAAGATTAACATCTTCTACAAAGGAACTACCATCCAATCCTTCCCCGGAAGTAGAAACACAAGGCGACTCCGAGAGTTCAAAAAGGGAGAAGATGAAAGCAGTTCAGAAAGCTAGTCATGTTCACCCAAAGCTTCCTGATTACGACGACTTTGCAGCACACTTCCGTTCACTCCGACAGAATTACAAG

Protein sequence

MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVVREEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVRKHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPKTFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIPEKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSEGTHRHTNSGDQNNYSSGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLRAEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGASNIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYSFKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASMEKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPNDSETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEVKESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLPSKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNVKNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSNPSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK
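
The protein sequence corresponds to the coding sequence read codon by codon. As an illustration only, the sketch below translates a nucleotide string with the standard genetic code. The helper name `translate` is hypothetical, and using the standard (nuclear) code for this gene is an assumption.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First and last codons of the CDS listed above.
print(translate("ATGCTGCACAAGAGC"))
print(translate("AATTACAAG"))
```

The two short inputs are the first five and last three codons of the CDS listed above. Passing the full CDS to `translate` would be the way to compare the result against the complete protein sequence.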
Homology
BLAST of MC06g0751 vs. ExPASy Swiss-Prot
Match: Q54I39 (IST1-like protein OS=Dictyostelium discoideum OX=44689 GN=DDB_G0289029 PE=3 SV=1)

HSP 1 Score: 112.5 bits (280), Expect = 3.4e-23
Identity = 88/289 (30.45%), Positives = 144/289 (49.83%), Query Frame = 0

Query: 5   SFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVVREEK 64
           S+   K K  LKLAVSRI++L+NKK   VR  +  +A+LL    +++ARIRVE ++R+E 
Sbjct: 7   SYDSYKLKVQLKLAVSRIQILKNKKANIVRDEKRNVAELLRKKNEESARIRVETIIRDEY 66

Query: 65  SKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVRKHFK 124
               +++IE+ CEL+ AR+ +I +    P+++KE+I +++++S R   IPEL  ++   K
Sbjct: 67  LIECFQIIEVLCELLHARINLINATTEMPLEMKESIFTLVYSSQR-IQIPELEQIKNQLK 126

Query: 125 AKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPKTFCD 184
           AKYGK   + A         N  +V K+S   PD     + L+ IAE+  + W     C 
Sbjct: 127 AKYGKGLENEA-NCHCSTHVNPKIVHKLSYATPDPSIIFQTLSEIAEKFNVDW-----CG 186

Query: 185 SSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIPEKSP 244
           S  PP   L  P       Q Q  +P    P + H+ +    +Q P +  +  + P+   
Sbjct: 187 SDYPPPPQLIMPQPIIVQQQPQILQP---PPQIIHHQQQPQILQPPPQIIQQQQQPQMPS 246

Query: 245 EHNLRPTLHPQKSNFTNDYTNQSNI-TGHRIPETRSSEGTHRHTNSGDQ 293
              + P   P  S   +    Q       + P+  S+  +  + NSG+Q
Sbjct: 247 FPIMSPPQQPTFSQIQHQQQIQQQYQQQQQSPQFPSAPPSFYNNNSGNQ 285
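
The percentages in each HSP summary line are the match count divided by the alignment length. A small sketch, assuming the `count/length (percent%)` layout used in the summary lines of this record, recomputes them from the raw fractions. The function name `hsp_percentages` is hypothetical.

```python
import re

def hsp_percentages(summary_line):
    """Return (matches, aligned_length, recomputed_percent) for each
    'count/length (xx.xx%)' fraction found in an HSP summary line."""
    results = []
    for matches, length in re.findall(r"(\d+)/(\d+)\s*\(", summary_line):
        matches, length = int(matches), int(length)
        results.append((matches, length, round(100.0 * matches / length, 2)))
    return results

line = "Identity = 88/289 (30.45%), Positives = 144/289 (49.83%), Query Frame = 0"
stats = hsp_percentages(line)
print(stats)
```

The first tuple is the identity fraction and the second is the positive-match fraction, which also counts conservative substitutions (the `+` positions in the alignment midline).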

BLAST of MC06g0751 vs. ExPASy Swiss-Prot
Match: P53990 (IST1 homolog OS=Homo sapiens OX=9606 GN=IST1 PE=1 SV=1)

HSP 1 Score: 111.3 bits (277), Expect = 7.5e-23
Identity = 69/214 (32.24%), Positives = 120/214 (56.07%), Query Frame = 0

Query: 1   MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
           ML   FK  + + +L+L ++R+KLL  KK    ++ R E+A  L AG+D+ ARIRVEH++
Sbjct: 1   MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 60

Query: 61  REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRC-ADIPELMDV 120
           RE+    A E++E++C+L++AR  +I+S K     L E+++++I+A+PR  +++ EL  V
Sbjct: 61  REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 120

Query: 121 RKHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDP 180
                AKY KE+             N  L+ K+S +AP      + L  IA+ + + ++P
Sbjct: 121 ADQLCAKYSKEY-GKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEP 180

Query: 181 KTFCDSSNPP---ADLLNGPNTFGQASQIQRGEP 211
            +   +  PP    DL++     G    +++G P
Sbjct: 181 DSVVMAEAPPGVETDLID----VGFTDDVKKGGP 209

BLAST of MC06g0751 vs. ExPASy Swiss-Prot
Match: Q3ZBV1 (IST1 homolog OS=Bos taurus OX=9913 GN=IST1 PE=2 SV=1)

HSP 1 Score: 109.0 bits (271), Expect = 3.7e-22
Identity = 68/214 (31.78%), Positives = 119/214 (55.61%), Query Frame = 0

Query: 1   MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
           ML    K  + + +L+L ++R+KLL  KK    ++ R E+A  L AG+D+ ARIRVEH++
Sbjct: 1   MLGSGIKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 60

Query: 61  REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRC-ADIPELMDV 120
           RE+    A E++E++C+L++AR  +I+S K     L E+++++I+A+PR  +++ EL  V
Sbjct: 61  REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 120

Query: 121 RKHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDP 180
                AKY KE+             N  L+ K+S +AP      + L  IA+ + + ++P
Sbjct: 121 ADQLCAKYSKEY-GKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEP 180

Query: 181 KTFCDSSNPP---ADLLNGPNTFGQASQIQRGEP 211
            +   +  PP    DL++     G    +++G P
Sbjct: 181 DSVVMAEAPPGVETDLID----VGFTDDVKKGGP 209

BLAST of MC06g0751 vs. ExPASy Swiss-Prot
Match: Q5R6G8 (IST1 homolog OS=Pongo abelii OX=9601 GN=IST1 PE=2 SV=1)

HSP 1 Score: 108.6 bits (270), Expect = 4.9e-22
Identity = 68/214 (31.78%), Positives = 119/214 (55.61%), Query Frame = 0

Query: 1   MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
           ML   FK  + + +L+L ++R+KLL  KK    ++ R E+A  L AG+D+ ARIRVEH++
Sbjct: 1   MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 60

Query: 61  REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRC-ADIPELMDV 120
           RE+    A E++E++C+L++AR  +I+S K     L E+++++I+A+PR  +++ EL  V
Sbjct: 61  REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 120

Query: 121 RKHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDP 180
                AKY K +             N  L+ K+S +AP      + L  IA+ + + ++P
Sbjct: 121 ADQLCAKYSKGY-GKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEP 180

Query: 181 KTFCDSSNPP---ADLLNGPNTFGQASQIQRGEP 211
            +   +  PP    DL++     G    +++G P
Sbjct: 181 DSVVMAEAPPGVETDLID----VGFTDDVKKGGP 209

BLAST of MC06g0751 vs. ExPASy Swiss-Prot
Match: Q9CX00 (IST1 homolog OS=Mus musculus OX=10090 GN=Ist1 PE=1 SV=1)

HSP 1 Score: 108.2 bits (269), Expect = 6.4e-22
Identity = 68/214 (31.78%), Positives = 119/214 (55.61%), Query Frame = 0

Query: 1   MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
           ML   FK  + + +L+L ++R+KLL  KK    ++ R E+A  L AG+D+ ARIRVEH++
Sbjct: 1   MLGSGFKAERLRVNLRLVINRLKLLEKKKTELAQKARKEIADYLAAGKDERARIRVEHII 60

Query: 61  REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRC-ADIPELMDV 120
           RE+    A E++E++C+L++AR  +I+S K     L E+++++I+A+PR  +++ EL  V
Sbjct: 61  REDYLVEAMEILELYCDLLLARFGLIQSMKELDSGLAESVSTLIWAAPRLQSEVAELKIV 120

Query: 121 RKHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDP 180
                AKY KE+             N  L+ K+S +AP      + L  IA+ + + ++P
Sbjct: 121 ADQLCAKYSKEY-GKLCRTNQIGTVNDRLMHKLSVEAPPKILVERYLIEIAKNYNVPYEP 180

Query: 181 KTFCDSSNP---PADLLNGPNTFGQASQIQRGEP 211
            +   +  P     DL++     G    +++G P
Sbjct: 181 DSVVMAEAPVGVETDLID----VGFTDDVKKGGP 209

BLAST of MC06g0751 vs. NCBI nr
Match: XP_022159239.1 (uncharacterized protein LOC111025657 isoform X1 [Momordica charantia])

HSP 1 Score: 2253 bits (5839), Expect = 0.0
Identity = 1185/1190 (99.58%), Positives = 1185/1190 (99.58%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR
Sbjct: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP
Sbjct: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            EKSPEHNLRPTLH QKSNFTNDYTNQSNITGHRIPETRSSE    GTHRHTNSGDQNNYS
Sbjct: 241  EKSPEHNLRPTLHSQKSNFTNDYTNQSNITGHRIPETRSSEMNAEGTHRHTNSGDQNNYS 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR
Sbjct: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS
Sbjct: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS
Sbjct: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540
            FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME
Sbjct: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540

Query: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600
            KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND
Sbjct: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600

Query: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660
            SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF
Sbjct: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660

Query: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720
            KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV
Sbjct: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720

Query: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780
            KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG
Sbjct: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780

Query: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840
            SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP
Sbjct: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840

Query: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900
            SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS
Sbjct: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900

Query: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960
            NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV
Sbjct: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960

Query: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020
            KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT
Sbjct: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020

Query: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080
            RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS
Sbjct: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080

Query: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140
            SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN
Sbjct: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140

Query: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
            PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK
Sbjct: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1190

BLAST of MC06g0751 vs. NCBI nr
Match: XP_022159240.1 (uncharacterized protein LOC111025657 isoform X2 [Momordica charantia])

HSP 1 Score: 2251 bits (5834), Expect = 0.0
Identity = 1184/1189 (99.58%), Positives = 1184/1189 (99.58%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR
Sbjct: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP
Sbjct: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            EKSPEHNLRPTLH QKSNFTNDYTNQSNITGHRIPETRSSE    GTHRHTNSGDQNNYS
Sbjct: 241  EKSPEHNLRPTLHSQKSNFTNDYTNQSNITGHRIPETRSSEMNAEGTHRHTNSGDQNNYS 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR
Sbjct: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS
Sbjct: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS
Sbjct: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540
            FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME
Sbjct: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540

Query: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600
            KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND
Sbjct: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600

Query: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660
            SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF
Sbjct: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660

Query: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720
            KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV
Sbjct: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720

Query: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780
            KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG
Sbjct: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780

Query: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840
            SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP
Sbjct: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840

Query: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900
            SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS
Sbjct: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900

Query: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960
            NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV
Sbjct: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960

Query: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020
            KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT
Sbjct: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020

Query: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080
            RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS
Sbjct: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080

Query: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140
            SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN
Sbjct: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140

Query: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNY 1185
            PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNY
Sbjct: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNY 1189

BLAST of MC06g0751 vs. NCBI nr
Match: XP_023531863.1 (filaggrin-like isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1585 bits (4103), Expect = 0.0
Identity = 888/1194 (74.37%), Positives = 992/1194 (83.08%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKK+V V+QL+GELAKLLEAGQDQTARIRVEH V
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKDVHVKQLKGELAKLLEAGQDQTARIRVEHFV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSK AYELIEIFCELIVARMPMIESQKNCPIDLKE+++SVIFASPRCADIPEL+DVR
Sbjct: 61   REEKSKEAYELIEIFCELIVARMPMIESQKNCPIDLKESVSSVIFASPRCADIPELLDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECG NRMLVEK+SAKAPDG +K+KILT IAEE+ +KWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGVNRMLVEKLSAKAPDGPSKIKILTKIAEEYNVKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +F D+ NPPADLLNGPNTFG+ASQIQ  E IGGQPSLDHNNRGS ++Q P +SDE  RIP
Sbjct: 181  SFGDNINPPADLLNGPNTFGRASQIQM-EAIGGQPSLDHNNRGSPNIQAPPESDERQRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            E     NLR   HPQ+SNF +   NQSN TGHR  E RSSE    G HR++NSGDQNNY+
Sbjct: 241  EDPVNRNLRSNHHPQQSNFADVNANQSNFTGHRNSEARSSETSAEGMHRYSNSGDQNNYA 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHW MDFKDATSAAKAAAESAELASLAARAAAELSSRGN+SQ SSS F +SSSYNLR
Sbjct: 301  SGRQHWGMDFKDATSAAKAAAESAELASLAARAAAELSSRGNVSQPSSSEFHQSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQ YA+ NL+DQQLPKDQ VSAPH+SS+ DDN R+ND RRFMG+D K   YPSS AS
Sbjct: 361  AEGPQGYASGNLRDQQLPKDQFVSAPHKSSMPDDNWRDNDTRRFMGNDAKNFSYPSSSAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            N D N S TNFNA+DRYS KNSSE GF DSLGSSASVEKQPRKFDA+ SV +FNA+DR S
Sbjct: 421  NNDVNISATNFNAADRYSLKNSSEPGFRDSLGSSASVEKQPRKFDANASVTSFNAADRSS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDS-HSSASM 540
            FKN S  GFSD L S  V+ QP+++ SNT VTNF+ SDRYS KN SEPGF D   SS SM
Sbjct: 481  FKNPSNHGFSDPLDS--VDMQPRNFGSNTSVTNFSESDRYSLKNPSEPGFRDPLGSSTSM 540

Query: 541  EKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPN 600
            EKQPRN DVEYV+D+P G G ERTS Y D RIGN SN+VPS+EK + DTY+NPFAMDKP+
Sbjct: 541  EKQPRNVDVEYVNDQPFGMGFERTSSYGDSRIGNSSNKVPSHEKLVNDTYENPFAMDKPS 600

Query: 601  DSE-TVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTW 660
            D E TVDT+FNDHAS VFDDYGPDDD VPDY+YQ R+SILE SSP+G VPINS ATDDTW
Sbjct: 601  DHESTVDTNFNDHASAVFDDYGPDDDCVPDYEYQRRQSILEPSSPKGKVPINS-ATDDTW 660

Query: 661  IFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEE 720
            +FKQN ND PEKSVSHSQISD R SLFAGN  SF+DPSHSDDLLPATFDHSDGPSSESE+
Sbjct: 661  VFKQNMNDSPEKSVSHSQISD-RASLFAGNVGSFDDPSHSDDLLPATFDHSDGPSSESEK 720

Query: 721  EVKESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLV 780
            E +E  +IGK+   +FSK+QNL SEKPEW Q+ISH S GSSDE+N++ PSHRLSS++PLV
Sbjct: 721  EPEEFEVIGKDHYSKFSKRQNLPSEKPEWSQNISHGSPGSSDEDNRNTPSHRLSSELPLV 780

Query: 781  HGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSN-RPPYAINSSTS 840
            H  K+K SPP S DI+ D+ ILEESTSE  S L+FGKLKGGLRNQKSN R  +A NSS S
Sbjct: 781  HELKKKDSPPRSLDILHDSVILEESTSESNSGLNFGKLKGGLRNQKSNPRRSHASNSSIS 840

Query: 841  DLPSKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNS 900
            DL SKQ C ND ++T Q T + SST RTSFRSNA SE  Y  SVEEKP EEKG +AK +S
Sbjct: 841  DLSSKQACENDASKTAQPTLVSSSTTRTSFRSNAPSE-LYDGSVEEKPGEEKGLRAKFDS 900

Query: 901  NYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHT 960
              SNL++SKD  SDYT+RSD+E + +K  +EISKKP PTR+ VKYPGFHDDDDSEEDS  
Sbjct: 901  FNSNLDDSKDNFSDYTVRSDQERHKNKEVDEISKKPAPTRIGVKYPGFHDDDDSEEDSPG 960

Query: 961  QNVKNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAK 1020
            QNVKNSPHR++GLSRRT ASPKTPSS +EDSYGTPTSH+DV+E+KASRSY +S +PLKAK
Sbjct: 961  QNVKNSPHRVMGLSRRTKASPKTPSSRMEDSYGTPTSHEDVSERKASRSYDASKSPLKAK 1020

Query: 1021 TGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPP-ELDRQGNSES 1080
            TGTR S   ESS QPQSSKPF QTPETK   NEERLKSSAKE+QS YPP ELDR GN   
Sbjct: 1021 TGTRYSDHYESSRQPQSSKPFNQTPETKRSYNEERLKSSAKERQSYYPPPELDRLGN--- 1080

Query: 1081 SKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKE 1140
              F S+R TT AS KT AQ+      +NSEQP QS KPSKP PE+KRSFHEER TSSTKE
Sbjct: 1081 --FESSRGTTAASAKTRAQS------SNSEQP-QSMKPSKPSPETKRSFHEERPTSSTKE 1140

Query: 1141 LPSNPSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
              SNPSP++ETQ ++ESS++EK KAV+KASHVHPKLPDYD+FAAHF SLRQN K
Sbjct: 1141 RLSNPSPKMETQDNTESSEKEKTKAVEKASHVHPKLPDYDNFAAHFLSLRQNNK 1176

BLAST of MC06g0751 vs. NCBI nr
Match: KAG7021918.1 (IST1-like protein [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 1582 bits (4097), Expect = 0.0
Identity = 888/1194 (74.37%), Positives = 992/1194 (83.08%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKK+V V+QL+GELAKLLEAGQDQTARIRVEH V
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKDVHVKQLKGELAKLLEAGQDQTARIRVEHFV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSK AYELIEIFCELIVARMPMIESQKNCPIDLKE+++SVIFASPRCADIPEL+DVR
Sbjct: 61   REEKSKEAYELIEIFCELIVARMPMIESQKNCPIDLKESVSSVIFASPRCADIPELLDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECG NRMLVEK+SAKAPDG +K+KILT IAEE+ +KWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGVNRMLVEKLSAKAPDGPSKIKILTKIAEEYNVKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +F D+ NPPADLLNGPNTFG+ASQIQ  E IGGQPSLDHNNRGS ++Q P +SDE  RIP
Sbjct: 181  SFGDNINPPADLLNGPNTFGRASQIQM-EAIGGQPSLDHNNRGSPNIQAPPESDERQRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            E     NLR   HPQ+ NF +   NQSN TGHR  E RSSE    G HRH+NSGDQN+Y+
Sbjct: 241  EDPVNRNLRSNHHPQQPNFADVNANQSNFTGHRNSEARSSETSAEGMHRHSNSGDQNSYA 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHW MDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQ SSS F +SSSYNLR
Sbjct: 301  SGRQHWGMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQPSSSEFHQSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQ YA+ NL+DQQLPKDQVVSAPH+SS+ DDN R+ND RRFMG+D K   YPSS AS
Sbjct: 361  AEGPQGYASGNLRDQQLPKDQVVSAPHKSSMPDDNWRDNDTRRFMGNDAKNFSYPSSSAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            N + N S TNFNA+DRYSFKNSSE GF DSLGSSASVEKQPRKFDA+ SV +FNA D+ S
Sbjct: 421  NNNVNISATNFNAADRYSFKNSSEPGFRDSLGSSASVEKQPRKFDANASVTSFNAVDKSS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFG-DSHSSASM 540
            FKN S+PGFSD L S  V+ QP+++ SNT VTNFN SDRYS KN SEPGF     SS SM
Sbjct: 481  FKNPSQPGFSDPLDS--VDMQPRNFGSNTSVTNFNESDRYSLKNPSEPGFRVPLGSSTSM 540

Query: 541  EKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPN 600
            EKQPRN DVEYV+D+P G G ERTS Y D RIGN SN+VPS+EK + DTY+NPFAMDKPN
Sbjct: 541  EKQPRNVDVEYVNDQPFGMGFERTSSYGDSRIGNSSNKVPSHEKLVNDTYENPFAMDKPN 600

Query: 601  DSE-TVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTW 660
            D E TVDTSFNDHAS VFDDYGP+DD VPDY YQ R+SILE SSP+G VPINS ATDDTW
Sbjct: 601  DHESTVDTSFNDHASAVFDDYGPEDDCVPDYGYQRRQSILEPSSPKGKVPINS-ATDDTW 660

Query: 661  IFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEE 720
            +FKQN ND PEKSVSHSQISD R SLFAGN  SF+DPSHSDDLLPATFDHSDGPSSESE+
Sbjct: 661  VFKQNMNDSPEKSVSHSQISD-RASLFAGNVGSFDDPSHSDDLLPATFDHSDGPSSESEK 720

Query: 721  EVKESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLV 780
            E +E  +IGK+   +FSK+QNL SEKPEW Q+ISH S GSSDE+N+S PSHRLSS++PL+
Sbjct: 721  EPEEFEVIGKDHYSKFSKRQNLPSEKPEWSQNISHGSPGSSDEDNRSTPSHRLSSELPLL 780

Query: 781  HGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSN-RPPYAINSSTS 840
            H  K+K SPP S DI+ D+ ILEESTSE  S L+FGKLKGGLRNQKSN R  +A NSS S
Sbjct: 781  HELKKKDSPPRSLDILHDSVILEESTSESNSGLNFGKLKGGLRNQKSNSRRSHASNSSIS 840

Query: 841  DLPSKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNS 900
            +L SKQ C ND ++T Q T + SST RTSFRSNA SE  Y  SVEEKP EEKG +AK +S
Sbjct: 841  NLSSKQACENDASKTAQPTLVSSSTTRTSFRSNAPSE-LYDGSVEEKPGEEKGLRAKFDS 900

Query: 901  NYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHT 960
              SNL++SKD  SDYT+RSD+E + +K  +EISKKP PTRV VKYPGFHDDDDSEEDS  
Sbjct: 901  FNSNLDDSKDNFSDYTVRSDQERHKNKEVDEISKKPAPTRVGVKYPGFHDDDDSEEDSPG 960

Query: 961  QNVKNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAK 1020
            QNVKNSPHR++GLSRRT ASPKTPSS +EDSYGTPTSH+DV+E+KASRSY +S +PLKAK
Sbjct: 961  QNVKNSPHRVMGLSRRTKASPKTPSSRMEDSYGTPTSHEDVSERKASRSYDASKSPLKAK 1020

Query: 1021 TGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPP-ELDRQGNSES 1080
            TGTR S   ESS QPQSSKPF QTPETK   NEERLKSSAKE+QS YPP ELDR GN   
Sbjct: 1021 TGTRYSDHYESSRQPQSSKPFNQTPETKRSYNEERLKSSAKERQSYYPPPELDRLGN--- 1080

Query: 1081 SKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKE 1140
              F S+R TT AS KT AQ+      +NSEQ +QS KPSKP PE++RSFHEER TSSTKE
Sbjct: 1081 --FESSRGTTAASAKTRAQS------SNSEQ-SQSMKPSKPSPETRRSFHEERPTSSTKE 1140

Query: 1141 LPSNPSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
              SNPSP++ETQ ++ESS++EK KAV+KASHVHPKLPDYD+FAAHF SLRQN K
Sbjct: 1141 RLSNPSPKMETQDNTESSEKEKTKAVEKASHVHPKLPDYDNFAAHFLSLRQNNK 1176
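
The percentages in an HSP summary line such as `Identity = 888/1194 (74.37%)` are the number of matching alignment columns divided by the alignment length (here 1194 columns, gaps included). The following is a minimal Python sketch for checking those figures when reading the records on this page; the helper name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool, and the pattern only targets the summary-line layout shown here (it also accepts the page's "Postives" spelling).

```python
import re

# Hypothetical pattern for the HSP summary line as it appears on this page.
# "Posi?tives" accepts both the standard spelling and the page's "Postives".
STAT_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Extract counts from an HSP summary line and recompute the percentages."""
    match = STAT_RE.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, ident_len, positives, pos_len = (int(g) for g in match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "alignment_length": ident_len,
        # Percentage = matching columns / alignment columns, to two decimals.
        "identity_pct": round(100 * identities / ident_len, 2),
        "positive_pct": round(100 * positives / pos_len, 2),
    }


if __name__ == "__main__":
    summary = "Identity = 888/1194 (74.37%), Postives = 992/1194 (83.08%), Query Frame = 0"
    print(parse_hsp_stats(summary))
```

Recomputing the percentages from the raw counts, rather than trusting the printed values, makes it easy to spot a summary line that was altered or truncated during copying.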

BLAST of MC06g0751 vs. NCBI nr
Match: XP_022927973.1 (filaggrin isoform X1 [Cucurbita moschata])

HSP 1 Score: 1582 bits (4095), Expect = 0.0
Identity = 887/1194 (74.29%), Positives = 993/1194 (83.17%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKK+V V+QL+GELAKLLEAGQDQTARIRVEH V
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKDVHVKQLKGELAKLLEAGQDQTARIRVEHFV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSK AYELIEIFCELIVARMPMIESQKNCPIDLKE+++SVIFASPRCADIPEL+DVR
Sbjct: 61   REEKSKEAYELIEIFCELIVARMPMIESQKNCPIDLKESVSSVIFASPRCADIPELLDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECG NRMLVEK+SAKAPDG +K+KILT IAEE+ +KWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGVNRMLVEKLSAKAPDGPSKIKILTKIAEEYNVKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +F D+ NPPADLLNGPNTFG+ASQIQ  E IGGQPSLDHNNRGS ++Q P +SDE  RIP
Sbjct: 181  SFGDNINPPADLLNGPNTFGRASQIQM-EAIGGQPSLDHNNRGSPNIQAPPESDERQRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            E     NLR   HPQ+ NF +   NQSN TGHR  E RSSE    G  RH+NSGDQN+Y+
Sbjct: 241  EDPVNRNLRSNHHPQQPNFADVNANQSNFTGHRNSEARSSETSAEGMRRHSNSGDQNSYA 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHW MDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQ SSS F +SSSYNLR
Sbjct: 301  SGRQHWGMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQPSSSEFHQSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQ YA+ NL+DQQLPKDQVVSAPH+SS+ DDN R+ND RRFMG+D K   YPSS AS
Sbjct: 361  AEGPQGYASGNLRDQQLPKDQVVSAPHKSSMPDDNWRDNDTRRFMGNDAKNFSYPSSSAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            N D N S TNFNA+DRYSFKNSSE GF DSLGSSASVEKQPRKFDA+ SVN+FNA D+ S
Sbjct: 421  NNDVNISATNFNAADRYSFKNSSEPGFRDSLGSSASVEKQPRKFDANASVNSFNAVDKSS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFG-DSHSSASM 540
            FKN S+PGFSD L S  V+ QP+++ SNT VTNFN SDRYS KN SEPGF     SS SM
Sbjct: 481  FKNPSQPGFSDPLDS--VDMQPRNFGSNTSVTNFNESDRYSLKNPSEPGFRVPLGSSTSM 540

Query: 541  EKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPN 600
            EKQPRN DVEYV+D+P G G ERTS Y D RIGN SN+VPS+EK + DTY+NPFAMDKPN
Sbjct: 541  EKQPRNVDVEYVNDQPFGMGFERTSSYGDSRIGNSSNKVPSHEKLVNDTYENPFAMDKPN 600

Query: 601  DSE-TVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTW 660
            D E TVDTSFNDHAS VFDDYGP+DD VPDY+YQ R+SILE SSP+G VPINS ATDDTW
Sbjct: 601  DHESTVDTSFNDHASAVFDDYGPEDDCVPDYEYQRRQSILEPSSPKGKVPINS-ATDDTW 660

Query: 661  IFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEE 720
            +FKQN ND PEKSVSHSQISD R SLFAGN  SF+DPSHSDDLLPATFDHSDGPSSESE+
Sbjct: 661  VFKQNMNDSPEKSVSHSQISD-RASLFAGNVGSFDDPSHSDDLLPATFDHSDGPSSESEK 720

Query: 721  EVKESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLV 780
            E +E  +IGK+   +FSK+QNL SEKPEW Q+ISH S GSSDE+N+S PSH LSS++PL+
Sbjct: 721  EPEEFEVIGKDHYSKFSKRQNLPSEKPEWSQNISHGSPGSSDEDNRSTPSHHLSSELPLL 780

Query: 781  HGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSN-RPPYAINSSTS 840
            H  K+K SPP S DI+ D+ ILEESTSE  S L+FGKLKGGLRNQKSN R  +A NSS S
Sbjct: 781  HELKKKDSPPRSLDILHDSVILEESTSESNSGLNFGKLKGGLRNQKSNSRRSHASNSSIS 840

Query: 841  DLPSKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNS 900
            +L SKQ C ND ++T Q T + SSTA+TSFRSNA SE  Y  SVEEKP EEKG +AK +S
Sbjct: 841  NLSSKQACENDASKTAQPTLVSSSTAKTSFRSNARSE-LYDGSVEEKPGEEKGLRAKFDS 900

Query: 901  NYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHT 960
              SNL++SKD  SDYT+RSD+E + +K  +EISKKP PTRV VKYPGFHDDDDSEEDS  
Sbjct: 901  FNSNLDDSKDNFSDYTVRSDQERHKNKEVDEISKKPAPTRVGVKYPGFHDDDDSEEDSPG 960

Query: 961  QNVKNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAK 1020
            QNV+NSPHR++GLSRRT ASPKTPSS +EDSYGTPTSH+DV+E+KASRSY +S +PLKAK
Sbjct: 961  QNVENSPHRVMGLSRRTKASPKTPSSRMEDSYGTPTSHEDVSERKASRSYDASKSPLKAK 1020

Query: 1021 TGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPP-ELDRQGNSES 1080
            TGTR S   ESS QPQSSKPF QTPETK   NEERLKSSAKE+QS YPP ELDR GN   
Sbjct: 1021 TGTRYSDHYESSRQPQSSKPFNQTPETKRSYNEERLKSSAKERQSYYPPPELDRLGN--- 1080

Query: 1081 SKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKE 1140
              F S+R TT AS KT AQ+      +NSEQ +QS KPSKP PE++RSFHEER TSSTKE
Sbjct: 1081 --FESSRGTTAASAKTRAQS------SNSEQ-SQSMKPSKPSPETRRSFHEERPTSSTKE 1140

Query: 1141 LPSNPSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
              SNPSP++ETQ ++ESS++EK KAV+KASHVHPKLPDYD+FAAHF SLRQN K
Sbjct: 1141 RLSNPSPKMETQDNTESSEKEKTKAVEKASHVHPKLPDYDNFAAHFLSLRQNNK 1176

BLAST of MC06g0751 vs. ExPASy TrEMBL
Match: A0A6J1E1U8 (uncharacterized protein LOC111025657 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111025657 PE=3 SV=1)

HSP 1 Score: 2253 bits (5839), Expect = 0.0
Identity = 1185/1190 (99.58%), Positives = 1185/1190 (99.58%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR
Sbjct: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP
Sbjct: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            EKSPEHNLRPTLH QKSNFTNDYTNQSNITGHRIPETRSSE    GTHRHTNSGDQNNYS
Sbjct: 241  EKSPEHNLRPTLHSQKSNFTNDYTNQSNITGHRIPETRSSEMNAEGTHRHTNSGDQNNYS 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR
Sbjct: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS
Sbjct: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS
Sbjct: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540
            FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME
Sbjct: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540

Query: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600
            KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND
Sbjct: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600

Query: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660
            SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF
Sbjct: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660

Query: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720
            KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV
Sbjct: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720

Query: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780
            KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG
Sbjct: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780

Query: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840
            SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP
Sbjct: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840

Query: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900
            SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS
Sbjct: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900

Query: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960
            NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV
Sbjct: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960

Query: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020
            KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT
Sbjct: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020

Query: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080
            RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS
Sbjct: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080

Query: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140
            SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN
Sbjct: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140

Query: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
            PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK
Sbjct: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1190

BLAST of MC06g0751 vs. ExPASy TrEMBL
Match: A0A6J1DY48 (uncharacterized protein LOC111025657 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111025657 PE=3 SV=1)

HSP 1 Score: 2251 bits (5834), Expect = 0.0
Identity = 1184/1189 (99.58%), Positives = 1184/1189 (99.58%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR
Sbjct: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP
Sbjct: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            EKSPEHNLRPTLH QKSNFTNDYTNQSNITGHRIPETRSSE    GTHRHTNSGDQNNYS
Sbjct: 241  EKSPEHNLRPTLHSQKSNFTNDYTNQSNITGHRIPETRSSEMNAEGTHRHTNSGDQNNYS 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR
Sbjct: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS
Sbjct: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS
Sbjct: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540
            FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME
Sbjct: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASME 540

Query: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600
            KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND
Sbjct: 541  KQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPND 600

Query: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660
            SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF
Sbjct: 601  SETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIF 660

Query: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720
            KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV
Sbjct: 661  KQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEV 720

Query: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780
            KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG
Sbjct: 721  KESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHG 780

Query: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840
            SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP
Sbjct: 781  SKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSNRPPYAINSSTSDLP 840

Query: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900
            SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS
Sbjct: 841  SKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYS 900

Query: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960
            NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV
Sbjct: 901  NLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNV 960

Query: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020
            KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT
Sbjct: 961  KNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGT 1020

Query: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080
            RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS
Sbjct: 1021 RTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFS 1080

Query: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140
            SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN
Sbjct: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140

Query: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNY 1185
            PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNY
Sbjct: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNY 1189

BLAST of MC06g0751 vs. ExPASy TrEMBL
Match: A0A6J1EMJ4 (filaggrin isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111434877 PE=3 SV=1)

HSP 1 Score: 1582 bits (4095), Expect = 0.0
Identity = 887/1194 (74.29%), Positives = 993/1194 (83.17%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKK+V V+QL+GELAKLLEAGQDQTARIRVEH V
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKDVHVKQLKGELAKLLEAGQDQTARIRVEHFV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSK AYELIEIFCELIVARMPMIESQKNCPIDLKE+++SVIFASPRCADIPEL+DVR
Sbjct: 61   REEKSKEAYELIEIFCELIVARMPMIESQKNCPIDLKESVSSVIFASPRCADIPELLDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECG NRMLVEK+SAKAPDG +K+KILT IAEE+ +KWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGVNRMLVEKLSAKAPDGPSKIKILTKIAEEYNVKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +F D+ NPPADLLNGPNTFG+ASQIQ  E IGGQPSLDHNNRGS ++Q P +SDE  RIP
Sbjct: 181  SFGDNINPPADLLNGPNTFGRASQIQM-EAIGGQPSLDHNNRGSPNIQAPPESDERQRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            E     NLR   HPQ+ NF +   NQSN TGHR  E RSSE    G  RH+NSGDQN+Y+
Sbjct: 241  EDPVNRNLRSNHHPQQPNFADVNANQSNFTGHRNSEARSSETSAEGMRRHSNSGDQNSYA 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHW MDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQ SSS F +SSSYNLR
Sbjct: 301  SGRQHWGMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQPSSSEFHQSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQ YA+ NL+DQQLPKDQVVSAPH+SS+ DDN R+ND RRFMG+D K   YPSS AS
Sbjct: 361  AEGPQGYASGNLRDQQLPKDQVVSAPHKSSMPDDNWRDNDTRRFMGNDAKNFSYPSSSAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            N D N S TNFNA+DRYSFKNSSE GF DSLGSSASVEKQPRKFDA+ SVN+FNA D+ S
Sbjct: 421  NNDVNISATNFNAADRYSFKNSSEPGFRDSLGSSASVEKQPRKFDANASVNSFNAVDKSS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFG-DSHSSASM 540
            FKN S+PGFSD L S  V+ QP+++ SNT VTNFN SDRYS KN SEPGF     SS SM
Sbjct: 481  FKNPSQPGFSDPLDS--VDMQPRNFGSNTSVTNFNESDRYSLKNPSEPGFRVPLGSSTSM 540

Query: 541  EKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPN 600
            EKQPRN DVEYV+D+P G G ERTS Y D RIGN SN+VPS+EK + DTY+NPFAMDKPN
Sbjct: 541  EKQPRNVDVEYVNDQPFGMGFERTSSYGDSRIGNSSNKVPSHEKLVNDTYENPFAMDKPN 600

Query: 601  DSE-TVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTW 660
            D E TVDTSFNDHAS VFDDYGP+DD VPDY+YQ R+SILE SSP+G VPINS ATDDTW
Sbjct: 601  DHESTVDTSFNDHASAVFDDYGPEDDCVPDYEYQRRQSILEPSSPKGKVPINS-ATDDTW 660

Query: 661  IFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEE 720
            +FKQN ND PEKSVSHSQISD R SLFAGN  SF+DPSHSDDLLPATFDHSDGPSSESE+
Sbjct: 661  VFKQNMNDSPEKSVSHSQISD-RASLFAGNVGSFDDPSHSDDLLPATFDHSDGPSSESEK 720

Query: 721  EVKESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLV 780
            E +E  +IGK+   +FSK+QNL SEKPEW Q+ISH S GSSDE+N+S PSH LSS++PL+
Sbjct: 721  EPEEFEVIGKDHYSKFSKRQNLPSEKPEWSQNISHGSPGSSDEDNRSTPSHHLSSELPLL 780

Query: 781  HGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSN-RPPYAINSSTS 840
            H  K+K SPP S DI+ D+ ILEESTSE  S L+FGKLKGGLRNQKSN R  +A NSS S
Sbjct: 781  HELKKKDSPPRSLDILHDSVILEESTSESNSGLNFGKLKGGLRNQKSNSRRSHASNSSIS 840

Query: 841  DLPSKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNS 900
            +L SKQ C ND ++T Q T + SSTA+TSFRSNA SE  Y  SVEEKP EEKG +AK +S
Sbjct: 841  NLSSKQACENDASKTAQPTLVSSSTAKTSFRSNARSE-LYDGSVEEKPGEEKGLRAKFDS 900

Query: 901  NYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHT 960
              SNL++SKD  SDYT+RSD+E + +K  +EISKKP PTRV VKYPGFHDDDDSEEDS  
Sbjct: 901  FNSNLDDSKDNFSDYTVRSDQERHKNKEVDEISKKPAPTRVGVKYPGFHDDDDSEEDSPG 960

Query: 961  QNVKNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAK 1020
            QNV+NSPHR++GLSRRT ASPKTPSS +EDSYGTPTSH+DV+E+KASRSY +S +PLKAK
Sbjct: 961  QNVENSPHRVMGLSRRTKASPKTPSSRMEDSYGTPTSHEDVSERKASRSYDASKSPLKAK 1020

Query: 1021 TGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPP-ELDRQGNSES 1080
            TGTR S   ESS QPQSSKPF QTPETK   NEERLKSSAKE+QS YPP ELDR GN   
Sbjct: 1021 TGTRYSDHYESSRQPQSSKPFNQTPETKRSYNEERLKSSAKERQSYYPPPELDRLGN--- 1080

Query: 1081 SKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKE 1140
              F S+R TT AS KT AQ+      +NSEQ +QS KPSKP PE++RSFHEER TSSTKE
Sbjct: 1081 --FESSRGTTAASAKTRAQS------SNSEQ-SQSMKPSKPSPETRRSFHEERPTSSTKE 1140

Query: 1141 LPSNPSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
              SNPSP++ETQ ++ESS++EK KAV+KASHVHPKLPDYD+FAAHF SLRQN K
Sbjct: 1141 RLSNPSPKMETQDNTESSEKEKTKAVEKASHVHPKLPDYDNFAAHFLSLRQNNK 1176

BLAST of MC06g0751 vs. ExPASy TrEMBL
Match: A0A6J1KP60 (uncharacterized protein LOC111496330 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111496330 PE=3 SV=1)

HSP 1 Score: 1579 bits (4088), Expect = 0.0
Identity = 885/1194 (74.12%), Positives = 988/1194 (82.75%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKK+V V+QL+GELAKLLEAGQDQTARIRVEH V
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKDVHVKQLKGELAKLLEAGQDQTARIRVEHFV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSK AYELIEIFCELIVARMPMIESQKNCPIDLKE+++SVIFASPRCADIPEL+DVR
Sbjct: 61   REEKSKEAYELIEIFCELIVARMPMIESQKNCPIDLKESVSSVIFASPRCADIPELLDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECG NRMLVEK+SAKAPDG +K+KILT IAEE+ +KWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGVNRMLVEKLSAKAPDGPSKIKILTKIAEEYNVKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +F D+ NPPADLLNGPNTFG+ASQIQ  E IGGQPS DHNNRGS ++Q P +SDE  RIP
Sbjct: 181  SFGDNINPPADLLNGPNTFGRASQIQM-EAIGGQPSFDHNNRGSPNIQAPPESDERQRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSE----GTHRHTNSGDQNNYS 300
            E     NLR   H Q+SNF +   NQSN TGHR  E RSSE    G HRH+NSGDQNNY+
Sbjct: 241  EDPVNRNLRSNHHTQQSNFADVNANQSNFTGHRNSEARSSETSAEGMHRHSNSGDQNNYA 300

Query: 301  SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLR 360
            SGRQHW+MDFKDATSAAKAAAESAELASLAARAAAELSSRGN+SQ SSS F KSSSYNLR
Sbjct: 301  SGRQHWSMDFKDATSAAKAAAESAELASLAARAAAELSSRGNLSQPSSSEFLKSSSYNLR 360

Query: 361  AEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGAS 420
            AEGPQ YA+ NL+DQQLPKDQVVSAPH SS+ DDN R+ND RRFMG+D K   YPSS AS
Sbjct: 361  AEGPQGYASGNLRDQQLPKDQVVSAPHNSSMPDDNWRDNDTRRFMGNDAKNFSYPSSSAS 420

Query: 421  NIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYS 480
            N D N S TNFNA+DRYSFKNSSE GF+DSLGSSASVEKQPRKFDA+ SV +FNA+DR S
Sbjct: 421  NNDVNISATNFNAADRYSFKNSSEHGFSDSLGSSASVEKQPRKFDANASVTSFNAADRSS 480

Query: 481  FKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDS-HSSASM 540
            FKN S+ GFSD L S  V+ QP+++ SNT VTNF+ SDRYS KN SEPGF D   SS SM
Sbjct: 481  FKNPSDHGFSDPLDS--VDMQPRNFGSNTSVTNFSESDRYSLKNPSEPGFRDPLGSSTSM 540

Query: 541  EKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPN 600
            EK P N DVEYV+D+P G G ERTS Y D RIGN SN+VPS+EK + DTY+NPFA+DKPN
Sbjct: 541  EKHPINVDVEYVNDQPFGMGFERTSSYGDSRIGNSSNKVPSHEKLVNDTYENPFAVDKPN 600

Query: 601  DSE-TVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTW 660
            D E TVDTSFNDHAS VFDDYGPDDD VPDY+YQ R+SILE SSP+G VPINS ATDDTW
Sbjct: 601  DHESTVDTSFNDHASAVFDDYGPDDDCVPDYEYQRRQSILEPSSPKGKVPINS-ATDDTW 660

Query: 661  IFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEE 720
            +FKQN ND PEKSVSH+QIS +R SLFAGN  SF+DPSHSDDLLPATFDHSDGPSSESE+
Sbjct: 661  VFKQNMNDSPEKSVSHTQISADRASLFAGNVGSFDDPSHSDDLLPATFDHSDGPSSESEK 720

Query: 721  EVKESGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLV 780
            E +E  +IGK+   +FSK+QNL SEKPEW Q+ISH S GSSDE+N++ PSHRLSS++PLV
Sbjct: 721  EPEEFEVIGKDHYSKFSKRQNLPSEKPEWSQNISHGSPGSSDEDNRNTPSHRLSSELPLV 780

Query: 781  HGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSN-RPPYAINSSTS 840
            H  K+K SPP S DI+ D+ ILEESTSE  S L+FGKLKGGLRNQKSN R  +A NSS S
Sbjct: 781  HELKKKDSPPRSLDILHDSVILEESTSESNSGLNFGKLKGGLRNQKSNPRRSHASNSSIS 840

Query: 841  DLPSKQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNS 900
            DL SKQ C ND ++T Q T + SST RTSFRSNA SE  Y  SVEEKP EEKG +AK NS
Sbjct: 841  DLSSKQACENDASKTAQPTLVSSSTTRTSFRSNAPSE-LYDGSVEEKPAEEKGPRAKFNS 900

Query: 901  NYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHT 960
              SN ++SKD  SDYT+RSD+E + +K  +EISKKP PTRV VKYPGFHDDDDSEEDS  
Sbjct: 901  FNSNFDDSKDNFSDYTVRSDQERHKNKEVDEISKKPAPTRVGVKYPGFHDDDDSEEDSPG 960

Query: 961  QNVKNSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAK 1020
            QNVKNSPHR++GLSRRT ASPKTPSS +EDSY TPTSH+DV+E+KASRSY +S +PLKAK
Sbjct: 961  QNVKNSPHRVMGLSRRTKASPKTPSSRMEDSYRTPTSHEDVSERKASRSYDASKSPLKAK 1020

Query: 1021 TGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPP-ELDRQGNSES 1080
            TGTR S   ESS QPQSSKPF QTPETK   NEERLKSSAKE+QS YPP ELDR GN   
Sbjct: 1021 TGTRYSDHYESSRQPQSSKPFNQTPETKRSYNEERLKSSAKERQSYYPPPELDRLGN--- 1080

Query: 1081 SKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKE 1140
              F S+R TT AS KT AQ+      +NSEQP QS KPSKP PE+KRSFHEER TSSTKE
Sbjct: 1081 --FESSRGTTAASAKTRAQS------SNSEQP-QSMKPSKPSPETKRSFHEERPTSSTKE 1140

Query: 1141 LPSNPSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
               NPSP++ETQ ++ESS++EK K V+KASHVHPKLPDYD+FAAHF SLRQN K
Sbjct: 1141 RLFNPSPKMETQDNTESSEKEKTKTVEKASHVHPKLPDYDNFAAHFLSLRQNNK 1177

BLAST of MC06g0751 vs. ExPASy TrEMBL
Match: A0A6J1EJ03 (uncharacterized protein LOC111434877 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111434877 PE=3 SV=1)

HSP 1 Score: 1554 bits (4023), Expect = 0.0
Identity = 875/1190 (73.53%), Positives = 978/1190 (82.18%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            MLHKSFKPAKCKTSLKLAVSRIKLLRNKK+V V+QL+GELAKLLEAGQDQTARIRVEH V
Sbjct: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKDVHVKQLKGELAKLLEAGQDQTARIRVEHFV 60

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEKSK AYELIEIFCELIVARMPMIESQKNCPIDLKE+++SVIFASPRCADIPEL+DVR
Sbjct: 61   REEKSKEAYELIEIFCELIVARMPMIESQKNCPIDLKESVSSVIFASPRCADIPELLDVR 120

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHFKAKYGKEFVSAAVELRPECG NRMLVEK+SAKAPDG +K+KILT IAEE+ +KWDPK
Sbjct: 121  KHFKAKYGKEFVSAAVELRPECGVNRMLVEKLSAKAPDGPSKIKILTKIAEEYNVKWDPK 180

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +F D+ NPPADLLNGPNTFG+ASQIQ  E IGGQPSLDHNNRGS ++Q P +SDE  RIP
Sbjct: 181  SFGDNINPPADLLNGPNTFGRASQIQM-EAIGGQPSLDHNNRGSPNIQAPPESDERQRIP 240

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSEGTHRHTNSGDQNNYSSGRQ 300
            E     NLR   HPQ+ NF +   NQSN TGHR  E RSS                 GRQ
Sbjct: 241  EDPVNRNLRSNHHPQQPNFADVNANQSNFTGHRNSEARSS-----------------GRQ 300

Query: 301  HWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQSSSSGFQKSSSYNLRAEGP 360
            HW MDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQ SSS F +SSSYNLRAEGP
Sbjct: 301  HWGMDFKDATSAAKAAAESAELASLAARAAAELSSRGNISQPSSSEFHQSSSYNLRAEGP 360

Query: 361  QEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGASNIDA 420
            Q YA+ NL+DQQLPKDQVVSAPH+SS+ DDN R+ND RRFMG+D K   YPSS ASN D 
Sbjct: 361  QGYASGNLRDQQLPKDQVVSAPHKSSMPDDNWRDNDTRRFMGNDAKNFSYPSSSASNNDV 420

Query: 421  NASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYSFKNS 480
            N S TNFNA+DRYSFKNSSE GF DSLGSSASVEKQPRKFDA+ SVN+FNA D+ SFKN 
Sbjct: 421  NISATNFNAADRYSFKNSSEPGFRDSLGSSASVEKQPRKFDANASVNSFNAVDKSSFKNP 480

Query: 481  SEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFG-DSHSSASMEKQP 540
            S+PGFSD L S  V+ QP+++ SNT VTNFN SDRYS KN SEPGF     SS SMEKQP
Sbjct: 481  SQPGFSDPLDS--VDMQPRNFGSNTSVTNFNESDRYSLKNPSEPGFRVPLGSSTSMEKQP 540

Query: 541  RNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPNDSE- 600
            RN DVEYV+D+P G G ERTS Y D RIGN SN+VPS+EK + DTY+NPFAMDKPND E 
Sbjct: 541  RNVDVEYVNDQPFGMGFERTSSYGDSRIGNSSNKVPSHEKLVNDTYENPFAMDKPNDHES 600

Query: 601  TVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGTVPINSSATDDTWIFKQ 660
            TVDTSFNDHAS VFDDYGP+DD VPDY+YQ R+SILE SSP+G VPINS ATDDTW+FKQ
Sbjct: 601  TVDTSFNDHASAVFDDYGPEDDCVPDYEYQRRQSILEPSSPKGKVPINS-ATDDTWVFKQ 660

Query: 661  NKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSSESEEEVKE 720
            N ND PEKSVSHSQISD R SLFAGN  SF+DPSHSDDLLPATFDHSDGPSSESE+E +E
Sbjct: 661  NMNDSPEKSVSHSQISD-RASLFAGNVGSFDDPSHSDDLLPATFDHSDGPSSESEKEPEE 720

Query: 721  SGIIGKEDSIEFSKKQNLYSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHGSK 780
              +IGK+   +FSK+QNL SEKPEW Q+ISH S GSSDE+N+S PSH LSS++PL+H  K
Sbjct: 721  FEVIGKDHYSKFSKRQNLPSEKPEWSQNISHGSPGSSDEDNRSTPSHHLSSELPLLHELK 780

Query: 781  EKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKSN-RPPYAINSSTSDLPS 840
            +K SPP S DI+ D+ ILEESTSE  S L+FGKLKGGLRNQKSN R  +A NSS S+L S
Sbjct: 781  KKDSPPRSLDILHDSVILEESTSESNSGLNFGKLKGGLRNQKSNSRRSHASNSSISNLSS 840

Query: 841  KQTCGNDDARTEQSTSIPSSTARTSFRSNASSEGTYGRSVEEKPDEEKGSQAKLNSNYSN 900
            KQ C ND ++T Q T + SSTA+TSFRSNA SE  Y  SVEEKP EEKG +AK +S  SN
Sbjct: 841  KQACENDASKTAQPTLVSSSTAKTSFRSNARSE-LYDGSVEEKPGEEKGLRAKFDSFNSN 900

Query: 901  LEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAVKYPGFHDDDDSEEDSHTQNVK 960
            L++SKD  SDYT+RSD+E + +K  +EISKKP PTRV VKYPGFHDDDDSEEDS  QNV+
Sbjct: 901  LDDSKDNFSDYTVRSDQERHKNKEVDEISKKPAPTRVGVKYPGFHDDDDSEEDSPGQNVE 960

Query: 961  NSPHRLIGLSRRTTASPKTPSSYVEDSYGTPTSHDDVTEQKASRSYYSSPAPLKAKTGTR 1020
            NSPHR++GLSRRT ASPKTPSS +EDSYGTPTSH+DV+E+KASRSY +S +PLKAKTGTR
Sbjct: 961  NSPHRVMGLSRRTKASPKTPSSRMEDSYGTPTSHEDVSERKASRSYDASKSPLKAKTGTR 1020

Query: 1021 TSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPP-ELDRQGNSESSKFS 1080
             S   ESS QPQSSKPF QTPETK   NEERLKSSAKE+QS YPP ELDR GN     F 
Sbjct: 1021 YSDHYESSRQPQSSKPFNQTPETKRSYNEERLKSSAKERQSYYPPPELDRLGN-----FE 1080

Query: 1081 SARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN 1140
            S+R TT AS KT AQ+      +NSEQ +QS KPSKP PE++RSFHEER TSSTKE  SN
Sbjct: 1081 SSRGTTAASAKTRAQS------SNSEQ-SQSMKPSKPSPETRRSFHEERPTSSTKERLSN 1140

Query: 1141 PSPEVETQGDSESSKREKMKAVQKASHVHPKLPDYDDFAAHFRSLRQNYK 1186
            PSP++ETQ ++ESS++EK KAV+KASHVHPKLPDYD+FAAHF SLRQN K
Sbjct: 1141 PSPKMETQDNTESSEKEKTKAVEKASHVHPKLPDYDNFAAHFLSLRQNNK 1155
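
The percentages in an HSP summary line are plain ratios of matched positions to alignment length. As a minimal sketch (the function name `parse_hsp_stats` and the regular expression are illustrative assumptions, not part of any BLAST tool), the summary line of the hit above can be parsed and its percentages recomputed like this:

```python
import re

# Hypothetical helper: matches the "Identity = a/b (x%), Positives = c/d (y%)"
# summary line. The optional "i" also accepts the "Postives" spelling that
# some report renderers emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identical, aligned, positives, aligned_pos = (int(g) for g in match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "aligned": aligned,
        # Percentages are recomputed from the counts instead of being trusted
        # from the text, so a corrupted report line is easier to spot.
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned_pos, 2),
    }


stats = parse_hsp_stats(
    "Identity = 875/1190 (73.53%), Positives = 978/1190 (82.18%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

The alignment length (1190) exceeds the query length because gap columns are counted, which is why identity is reported relative to the alignment and not to either sequence.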

BLAST of MC06g0751 vs. TAIR 10
Match: AT2G19710.1 (Regulator of Vps4 activity in the MVB pathway protein )

HSP 1 Score: 283.1 bits (723), Expect = 1.0e-75
Identity = 296/990 (29.90%), Positives = 457/990 (46.16%), Query Frame = 0

Query: 1   MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
           +L + FKPAKCKT+L++A SR+K+L+NKKE+Q++QLR ELA+LLE+GQ  TARIRVEHVV
Sbjct: 4   VLQRGFKPAKCKTALQMANSRLKILKNKKEIQIKQLRRELAQLLESGQTPTARIRVEHVV 63

Query: 61  REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
           REEK+ AAYELI I+CEL+V R+ +IESQKNCPIDLKEA+ SV+FAS R +D+PEL ++ 
Sbjct: 64  REEKTVAAYELIGIYCELLVVRLGVIESQKNCPIDLKEAVTSVLFASQRLSDVPELSEIF 123

Query: 121 KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
           K F  KYGK+F ++AVELRP+ G +R+LVEK+SAKAPDG TK+KIL AIAEEH + W+ +
Sbjct: 124 KQFTTKYGKDFSTSAVELRPDSGVSRLLVEKLSAKAPDGPTKVKILMAIAEEHNVVWEAQ 183

Query: 181 TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
           +F +S     +LLNG N+F  AS +     I        N    A+V     S E H  P
Sbjct: 184 SFVESDPKDTELLNGANSFQPASSMNMDSSINSNKEQPPNIHAPATVNAHHGSSERHHSP 243

Query: 241 EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSEGTHRHTNSGDQNNYSSGRQ 300
           E S  +  R +   + +N T+   +    +  R   +R  EG  R+ N G +N+ S  +Q
Sbjct: 244 ENSYANGGRSS--SRSNNVTSGKADDYYHSKARPSRSRPDEGECRNPNHGYENSSSRNKQ 303

Query: 301 HWNMDFKDATSAAKAAAESAELASLAARAAAELSSRGNIS-QSSSSGFQKSSSYNLRAEG 360
            W  +F D+T AA+AAAE+AE AS AARAAAELS++  ++ Q S+     S+S NLR E 
Sbjct: 304 KWEPEFVDSTDAARAAAEAAERASFAARAAAELSNKERMTRQDSTQSHISSASVNLRNEP 363

Query: 361 PQEYANLNLQDQQLPKDQVVSAPHRSSIM--DDNRRENDARRFMGDDDKKLRYPSSGASN 420
                  N Q +   +D V  +P R+  M  +D  R    R    +    +  P SG  +
Sbjct: 364 SHRRDRSNAQRESFSEDHV--SPRRNVRMQYEDMDRTRQDRYDRAEQIPPVDQP-SGRHS 423

Query: 421 IDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYSF 480
           +D + +  +F    R    +  E+  N        + KQ      S+ V++ + S  YS 
Sbjct: 424 VDNSRNNGSFG---REKQPSQDETDINVGYSEDVHLRKQ------SSRVSSHSHSSNYSD 483

Query: 481 KNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFG---------- 540
           +N     F          K P   + N + T ++   + SFK+      G          
Sbjct: 484 ENDLGSDFM---------KSPSIVEENIFATEYDHQSQSSFKDIDSHDHGHDDDAAATDN 543

Query: 541 -DSHSSASMEKQPRNFDVEYVSDRPSGTGSERTSYYEDVRI------GNDSNRVPSYEKP 600
            D +SS   + +    D  Y  +   G G                  G+ S     +   
Sbjct: 544 YDDYSSFFYQPKFHAEDNHYQDEIDHGVGFSLLGSKTSASAASWSFKGDHSKSHGKHSSS 603

Query: 601 MMDTY-DNPFAMDKPNDSETVDTSFND---HASVVFDDYGPD-----DDYVPDYDYQIRE 660
               + +NP +    + S +   S+++   HA   FD+YGP+     D  +      + E
Sbjct: 604 SSQVFQENPSSRLFDDVSTSPPASYHEPDPHAK--FDNYGPNSESDGDQPIDKVSGDVHE 663

Query: 661 SILELSSPEGTVPINSSATDDTWIFKQNKNDPPEKSVSHSQISDE-----RTSLFAGNSR 720
                S       ++ SA  + +     ++    ++   S    E     R    AG  R
Sbjct: 664 RGNLTSDRSHKFKVSDSAGHEVFPLDTEEHTDNSRTREESDSDSEPQLGLRLGALAGGFR 723

Query: 721 SFEDPSHSDDLLPATFDHSDGPSSESEEEVKESGIIGKEDSIEFSKKQNLYSEKPEWIQD 780
                 +   L P     +   + +   ++ + G   ++D   +SKK +    +P ++  
Sbjct: 724 ------NKKTLPPYRMSSASSKAEKEYIQIDDFGQSSRKDL--YSKKASNTETRPSFMP- 783

Query: 781 ISHVSLGSSDEENKSMPSHRLSSDIPLVHGSKEKASPPSSPDIIQDTKILEESTSEVYSP 840
             H S    D+ +   P    +    L   S+       + D  ++ K+   S+S +   
Sbjct: 784 -PHPSSSDEDDSDMQHPGRTETKSDSLYSHSR------VNHDDSEEEKLPTRSSSRIQE- 843

Query: 841 LSFGKLKGGLRNQKSNRPPYAINSSTSDLPSKQTCGNDDARTEQSTSIPSSTARTSFRSN 900
               K   G+R QK       +++S+ D                   +    AR + + N
Sbjct: 844 -RSHKPSTGIRVQKRTNFKMPVSASSED----------------EEEVEREAARINAKPN 903

Query: 901 ASSEGTYGRSVEEKPDEEKGSQAKLNSNYSNLEESKDRSSDYTLRSDEESYSDKMRNEIS 957
            ++   YG S+  K       Q+K N  +S    +K        ++D+ES+       ++
Sbjct: 904 KTT--GYGFSLRTK------GQSKANEKHSLPVTTK--------KTDKESHDQPSPRTVT 914

BLAST of MC06g0751 vs. TAIR 10
Match: AT4G29440.1 (Regulator of Vps4 activity in the MVB pathway protein )

HSP 1 Score: 268.1 bits (684), Expect = 3.4e-71
Identity = 372/1267 (29.36%), Positives = 549/1267 (43.33%), Query Frame = 0

Query: 1    MLHKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVV 60
            +LH+SFKPAKCK +L++A SR+K+L+NKK+ Q++QLR ELA LLE+GQ QTA+IRVEHVV
Sbjct: 4    VLHRSFKPAKCKIALQMAASRLKILKNKKDTQIKQLRRELAHLLESGQTQTAKIRVEHVV 63

Query: 61   REEKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVR 120
            REEK+ AAYEL+ I+CEL+VAR+ +I+SQK CP DLKEA+ASV++AS R  D+ EL D+ 
Sbjct: 64   REEKTVAAYELVGIYCELLVARLGVIDSQKTCPNDLKEAVASVLYASQRLTDVGELSDIV 123

Query: 121  KHFKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPK 180
            KHF AKYGK+FVSAA+ L+P+ G +R+LVEK+S KAPDG TK+KILT IA +H + W+ +
Sbjct: 124  KHFSAKYGKDFVSAAIGLQPDSGVSRLLVEKLSVKAPDGPTKIKILTEIATQHNVTWEAE 183

Query: 181  TFCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIP 240
            +  +S   P + ++        SQ   G  I  + S   NN                   
Sbjct: 184  SLVESD--PKETMSASGASSSVSQPATG--IKSESSRIQNN------------------- 243

Query: 241  EKSPEHNLRPTLHPQKSNFTNDYTNQSNITGHRIPETRSSEGTHRHTN-SGDQNNYSSGR 300
             + P      T++  ++++  D  + S +T       ++ +  H+    SGD+ +    R
Sbjct: 244  -QPPVFQAAATVNVSQNSYATDGRSSSRMTSTDFNVGKTPDHYHQDPKPSGDRVDGREHR 303

Query: 301  QH---------WNMDFKDATSAAKAAAESAELASLAARAAAELSS--RGNISQSSSSGFQ 360
             H         +   F DATSAA+AAAESAE AS AAR AAELSS  R  + Q+S+    
Sbjct: 304  DHNPGHGDTSPFETKFVDATSAARAAAESAERASFAARRAAELSSKERMMMMQNSTESRN 363

Query: 361  KSSSYNLRAEGP-QEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKK 420
             SS  NLR+  P    ++ N+Q     K++++ + +R                  D    
Sbjct: 364  SSSYENLRSNPPHSRTSSSNMQGGGFGKEELLKSNNRQV----------------DQSTT 423

Query: 421  LRYPSSGASNIDANASGTNFNASDRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVN 480
                 S    +D  +  T++     +S +NS E   NDS       ++QP   D    +N
Sbjct: 424  TTRAESSKKTVDELSENTSWRRG--HSRENSLEMRPNDSFAKIGREKQQPGMDD----IN 483

Query: 481  NFNASDRYSFKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFG 540
              +++D  + K SS      S HS +      ++  +  VT  +  D  S          
Sbjct: 484  LSSSADVLNKKQSSRA----SSHSPS-----SNFSDDNDVTALDHIDSPSI--------- 543

Query: 541  DSHSSASMEKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDN 600
                              +  ++   T  +R SY       NDS  V     P  D Y +
Sbjct: 544  ------------------FEENKFQSTVGDRESY-------NDSPVV--VVAPAFDDYSS 603

Query: 601  PFAMDKPNDSETVDTSFNDHASVVFDDYGPDDDYVPDYDYQIRESILELSSPEGT-VPIN 660
             F  DKP                    +  +D Y  + +  +  S+L  SS     +P  
Sbjct: 604  FF--DKP-------------------QFDTEDAYHDEPEQGLGFSLLGSSSKTSDHMPTE 663

Query: 661  SSATDDTWIFKQNKNDPPEKSVSHSQISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSD 720
             S    +W  + +K+     S S SQ+ ++            E PS      P TFD  D
Sbjct: 664  IS----SWSLEGHKDLGKLSSASTSQVLEK------------EKPSS-----PPTFD--D 723

Query: 721  GPSS--ESEEEVKESGIIGKEDSIEFSKKQNL------YSEKPEWIQDISHVSLGSSDEE 780
            GP+S   S  E + S      D    S++ NL         K +     SH+S G  D  
Sbjct: 724  GPTSPPASLHEPEPSAKFDDYDRDSESEEDNLGRLSGRAEGKSKLTAQKSHMSEGPDDLG 783

Query: 781  NKSMPSHRLSSDIPLVHGSKEKASPPSSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRN 840
                PS                     + D   D+K  EES +E  + L FG L  GL N
Sbjct: 784  RYFFPS--------------------DTEDQGDDSKTQEESDAETPTGLKFGPLASGLEN 843

Query: 841  QKS-----NRPPYAINSSTS---DLP---------SKQTCGNDDARTE-------QSTSI 900
            + +     + PP    SS S    LP         S QT  +   R E        S   
Sbjct: 844  ETTLPSYGSSPPRDKTSSKSIKEYLPTEVDPSRSSSLQTASSSSIRNELYTQKASNSDKR 903

Query: 901  PSSTARTSFRSNASSEGTYGRSV----EEKPDEEKGSQAKLNSNYSNLEESKDRSSDYTL 960
            PSS    S  S+  S+    + V    +EK  E +     L+S  S+    KD   +   
Sbjct: 904  PSSIPPDSSSSDDESDMELPKRVSFRYQEKRTESRTRPTHLHSGVSH----KDLEEEIPT 963

Query: 961  RSDEESYSDKMRNEISKKPIPTRVAVKYPGFH---DDDDSEEDSHTQNVKNSPHRLIGLS 1020
            R+     S + ++  + K  P   +  Y  FH    DD+ E++ H           I +S
Sbjct: 964  RA-----STRSQDRRTHKTTPASASASY--FHTMSSDDEDEKEVHRDTAHIQTRPYISIS 1023

Query: 1021 RRTTASPKTPS---------SYVEDS--------------YGTPTSHDDVTE-QKASRSY 1080
            RRT    + PS         S+ E+S               G+ +S   + + +K S   
Sbjct: 1024 RRTKGQERRPSLVTAKIDKVSFDEESPPKLSPEAKPLTKQQGSASSLSYLPKTEKVSHDQ 1083

Query: 1081 YSSP------APLKAKTGTRTSSRLESSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQ 1140
             S P       PL  + G  ++S L    +   + P + +P   +P  +   K       
Sbjct: 1084 ESHPKLGLGAKPLIKQQG--SASSLSFLPKTNKASPDQDSPPKLVPKEKPAAKQRGSASS 1090

Query: 1141 SNYPPELDRQGNSESSKFSSARETTTASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPES 1184
             ++ P+ D+    + S      +   A+ +  + T +S      +       PSK  PE+
Sbjct: 1144 LSFLPKTDKASPDQDSPPKLLPKEKPAAKQQGSATSSSSLPKTEKISHYRESPSKLTPEA 1090

BLAST of MC06g0751 vs. TAIR 10
Match: AT4G29440.2 (Regulator of Vps4 activity in the MVB pathway protein )

HSP 1 Score: 233.8 bits (595), Expect = 7.1e-61
Identity = 356/1241 (28.69%), Positives = 525/1241 (42.30%), Query Frame = 0

Query: 27   NKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVVREEKSKAAYELIEIFCELIVARMPMI 86
            NKK+ Q++QLR ELA LLE+GQ QTA+IRVEHVVREEK+ AAYEL+ I+CEL+VAR+ +I
Sbjct: 2    NKKDTQIKQLRRELAHLLESGQTQTAKIRVEHVVREEKTVAAYELVGIYCELLVARLGVI 61

Query: 87   ESQKNCPIDLKEAIASVIFASPRCADIPELMDVRKHFKAKYGKEFVSAAVELRPECGANR 146
            +SQK CP DLKEA+ASV++AS R  D+ EL D+ KHF AKYGK+FVSAA+ L+P+ G +R
Sbjct: 62   DSQKTCPNDLKEAVASVLYASQRLTDVGELSDIVKHFSAKYGKDFVSAAIGLQPDSGVSR 121

Query: 147  MLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDPKTFCDSSNPPADLLNGPNTFGQASQIQ 206
            +LVEK+S KAPDG TK+KILT IA +H + W+ ++  +S   P + ++        SQ  
Sbjct: 122  LLVEKLSVKAPDGPTKIKILTEIATQHNVTWEAESLVESD--PKETMSASGASSSVSQPA 181

Query: 207  RGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIPEKSPEHNLRPTLHPQKSNFTNDYTNQ 266
             G  I  + S   NN                    + P      T++  ++++  D  + 
Sbjct: 182  TG--IKSESSRIQNN--------------------QPPVFQAAATVNVSQNSYATDGRSS 241

Query: 267  SNITGHRIPETRSSEGTHRHTN-SGDQNNYSSGRQH---------WNMDFKDATSAAKAA 326
            S +T       ++ +  H+    SGD+ +    R H         +   F DATSAA+AA
Sbjct: 242  SRMTSTDFNVGKTPDHYHQDPKPSGDRVDGREHRDHNPGHGDTSPFETKFVDATSAARAA 301

Query: 327  AESAELASLAARAAAELSS--RGNISQSSSSGFQKSSSYNLRAEGP-QEYANLNLQDQQL 386
            AESAE AS AAR AAELSS  R  + Q+S+     SS  NLR+  P    ++ N+Q    
Sbjct: 302  AESAERASFAARRAAELSSKERMMMMQNSTESRNSSSYENLRSNPPHSRTSSSNMQGGGF 361

Query: 387  PKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGASNIDANASGTNFNASDRY 446
             K++++ + +R                  D         S    +D  +  T++     +
Sbjct: 362  GKEELLKSNNRQV----------------DQSTTTTRAESSKKTVDELSENTSWRRG--H 421

Query: 447  SFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYSFKNSSEPGFSDSLHSAT 506
            S +NS E   NDS       ++QP   D    +N  +++D  + K SS      S HS +
Sbjct: 422  SRENSLEMRPNDSFAKIGREKQQPGMDD----INLSSSADVLNKKQSSRA----SSHSPS 481

Query: 507  VEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASMEKQPRNFDVEYVSDRPSG 566
                  ++  +  VT  +  D  S                            +  ++   
Sbjct: 482  -----SNFSDDNDVTALDHIDSPSI---------------------------FEENKFQS 541

Query: 567  TGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPNDSETVDTSFNDHASVVFD 626
            T  +R SY       NDS  V     P  D Y + F  DKP                   
Sbjct: 542  TVGDRESY-------NDSPVV--VVAPAFDDYSSFF--DKP------------------- 601

Query: 627  DYGPDDDYVPDYDYQIRESILELSSPEGT-VPINSSATDDTWIFKQNKNDPPEKSVSHSQ 686
             +  +D Y  + +  +  S+L  SS     +P   S    +W  + +K+     S S SQ
Sbjct: 602  QFDTEDAYHDEPEQGLGFSLLGSSSKTSDHMPTEIS----SWSLEGHKDLGKLSSASTSQ 661

Query: 687  ISDERTSLFAGNSRSFEDPSHSDDLLPATFDHSDGPSS--ESEEEVKESGIIGKEDSIEF 746
            + ++            E PS      P TFD  DGP+S   S  E + S      D    
Sbjct: 662  VLEK------------EKPSS-----PPTFD--DGPTSPPASLHEPEPSAKFDDYDRDSE 721

Query: 747  SKKQNL------YSEKPEWIQDISHVSLGSSDEENKSMPSHRLSSDIPLVHGSKEKASPP 806
            S++ NL         K +     SH+S G  D      PS                    
Sbjct: 722  SEEDNLGRLSGRAEGKSKLTAQKSHMSEGPDDLGRYFFPS-------------------- 781

Query: 807  SSPDIIQDTKILEESTSEVYSPLSFGKLKGGLRNQKS-----NRPPYAINSSTS---DLP 866
             + D   D+K  EES +E  + L FG L  GL N+ +     + PP    SS S    LP
Sbjct: 782  DTEDQGDDSKTQEESDAETPTGLKFGPLASGLENETTLPSYGSSPPRDKTSSKSIKEYLP 841

Query: 867  ---------SKQTCGNDDARTE-------QSTSIPSSTARTSFRSNASSEGTYGRSV--- 926
                     S QT  +   R E        S   PSS    S  S+  S+    + V   
Sbjct: 842  TEVDPSRSSSLQTASSSSIRNELYTQKASNSDKRPSSIPPDSSSSDDESDMELPKRVSFR 901

Query: 927  -EEKPDEEKGSQAKLNSNYSNLEESKDRSSDYTLRSDEESYSDKMRNEISKKPIPTRVAV 986
             +EK  E +     L+S  S+    KD   +   R+     S + ++  + K  P   + 
Sbjct: 902  YQEKRTESRTRPTHLHSGVSH----KDLEEEIPTRA-----STRSQDRRTHKTTPASASA 961

Query: 987  KYPGFH---DDDDSEEDSHTQNVKNSPHRLIGLSRRTTASPKTPS---------SYVEDS 1046
             Y  FH    DD+ E++ H           I +SRRT    + PS         S+ E+S
Sbjct: 962  SY--FHTMSSDDEDEKEVHRDTAHIQTRPYISISRRTKGQERRPSLVTAKIDKVSFDEES 1021

Query: 1047 --------------YGTPTSHDDVTE-QKASRSYYSSP------APLKAKTGTRTSSRLE 1106
                           G+ +S   + + +K S    S P       PL  + G  ++S L 
Sbjct: 1022 PPKLSPEAKPLTKQQGSASSLSYLPKTEKVSHDQESHPKLGLGAKPLIKQQG--SASSLS 1062

Query: 1107 SSEQPQSSKPFKQTPETKMPLNEERLKSSAKEQQSNYPPELDRQGNSESSKFSSARETTT 1166
               +   + P + +P   +P  +   K        ++ P+ D+    + S      +   
Sbjct: 1082 FLPKTNKASPDQDSPPKLVPKEKPAAKQRGSASSLSFLPKTDKASPDQDSPPKLLPKEKP 1062

Query: 1167 ASVKTWAQTRNSHYLANSEQPTQSTKPSKPIPESKRSFHEERLTSSTKELPSN-PSPEVE 1184
            A+ +  + T +S      +       PSK  PE+K    +E L SS+  LP    SP+ E
Sbjct: 1142 AAKQQGSATSSSSLPKTEKISHYRESPSKLTPEAKSMAKQEGLASSSSSLPKTVTSPDPE 1062

BLAST of MC06g0751 vs. TAIR 10
Match: AT1G34220.2 (Regulator of Vps4 activity in the MVB pathway protein )

HSP 1 Score: 226.5 bits (576), Expect = 1.1e-58
Identity = 209/665 (31.43%), Positives = 324/665 (48.72%), Query Frame = 0

Query: 3   HKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVVRE 62
           +K FK AKCKT LKL + RIKL+RN++E Q++Q+R E+AKLLE GQ+ TARIRVEH++RE
Sbjct: 9   NKGFKAAKCKTLLKLTIPRIKLIRNRREAQIKQMRREIAKLLETGQEATARIRVEHIIRE 68

Query: 63  EKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVRKH 122
           EK  AA E++E+FCELI  R+P+IE+Q+ CP+DLKEAI+SV FA+PRC+D+ EL  V+  
Sbjct: 69  EKMMAAQEILELFCELIAVRLPIIEAQRECPLDLKEAISSVCFAAPRCSDLTELQQVQIL 128

Query: 123 FKAKYGKEFVSAAVELRPECGANRMLVEKMSAKAPDGQTKLKILTAIAEEHKIKWDP-KT 182
           F +KYGKEFV+AA EL+P+ G NR LVE +S +AP  +TKLK+L  IAEEH++ WDP  T
Sbjct: 129 FVSKYGKEFVAAASELKPDSGVNRKLVELLSVRAPSPETKLKLLKEIAEEHELDWDPAST 188

Query: 183 FCDSSNPPADLLNGPNTFGQASQIQRGEPIGGQPSLDHNNRGSASVQVPSKSDEGHRIPE 242
             D      DLL+GP  FG  S++    P+  + +   N    ++ +  S SD  + I +
Sbjct: 189 ETDLFKSHEDLLDGPKQFGGGSKL----PLPEEQNEKTNLTSLSAAKEKSDSDSEYDILD 248

Query: 243 --KSPEHNLRPTLHPQKSNFTNDYTNQS-NITGHRIPETRSSEGTHRHTNSGDQN----- 302
             + P   LRPT      N  +   + S   T H +P    + G  +  +  D++     
Sbjct: 249 FPEVPNVLLRPTPGATSVNAPDAAKSASYEHTSHDLPFDSENAGVEKTASKRDEHPAKAS 308

Query: 303 --------------------NYS--------------------SGRQHWNMDFKDATSAA 362
                               NYS                    + R+  + D +D   AA
Sbjct: 309 KTVVEGQQSSPILMESFEKKNYSPPSIDAVGPIPTKESGASRDTPRKISDGDLQDVLMAA 368

Query: 363 KAAAESAELASLAARAAAELSSR--GNISQSSSSGFQKSSSYNLRAEGPQEYANLNLQDQ 422
           +AAA+SAE A+ AAR+AA L+      +++ +S  + +S                     
Sbjct: 369 QAAADSAERAASAARSAASLAQLRINELTRKTSDQYPES--------------------- 428

Query: 423 QLPKDQVVSAPHRSSIMDDNRRENDARRFMGDDDKKLRYPSSGASNIDANASGTNFNAS- 482
             P +    AP   ++  D+  +N +    GD  +  R  +S   N + N      ++S 
Sbjct: 429 --PSENPFHAPSMGNLQFDH--QNSSASSSGDLTELQRAETSSLFNSEQNNQQPQTHSSM 488

Query: 483 --DRYSFKNSSESGFNDSLGSSASVEKQPRKFDASTSVNNFNASDRYSFKNSSEPGFSDS 542
              ++  +NSS S + D         ++P+    ++S +N++      F +  +P F   
Sbjct: 489 EKPQFDRQNSSFSSYGDLTPQRFHSMEKPQFDHQNSSGSNYDDLTPQRFSSLEKPQFD-- 548

Query: 543 LHSATVEKQPKDYDSNTYVTNFNASDRYSFKNSSEPGFGDSHSSASMEKQPRNFDVEYVS 602
                          N+  +N++    + F +  +P F   +SS S      ++      
Sbjct: 549 -------------HQNSSGSNYDDLTPHRFPSMEKPQFDHQNSSVS------SYGDLPEL 608

Query: 603 DRPSGTGSERTSYYEDVRIGNDSNRVPSYEKPMMDTYDNPFAMDKPNDSETVDTSFNDHA 614
            RP  +  +R S  +D    +   R+PS E     +Y N F   K +D  +   SF+D+ 
Sbjct: 609 QRPETSPLDRLSPDQD----HQQMRLPSMEDDPYYSYPNLFTSQK-HDPSSGSHSFSDNT 618

BLAST of MC06g0751 vs. TAIR 10
Match: AT1G34220.1 (Regulator of Vps4 activity in the MVB pathway protein )

HSP 1 Score: 211.1 bits (536), Expect = 5.0e-54
Identity = 209/695 (30.07%), Positives = 324/695 (46.62%), Query Frame = 0

Query: 3   HKSFKPAKCKTSLKLAVSRIKLLRNKKEVQVRQLRGELAKLLEAGQDQTARIRVEHVVRE 62
           +K FK AKCKT LKL + RIKL+RN++E Q++Q+R E+AKLLE GQ+ TARIRVEH++RE
Sbjct: 9   NKGFKAAKCKTLLKLTIPRIKLIRNRREAQIKQMRREIAKLLETGQEATARIRVEHIIRE 68

Query: 63  EKSKAAYELIEIFCELIVARMPMIESQKNCPIDLKEAIASVIFASPRCADIPELMDVRKH 122
           EK  AA E++E+FCELI  R+P+IE+Q+ CP+DLKEAI+SV FA+PRC+D+ EL  V+  
Sbjct: 69  EKMMAAQEILELFCELIAVRLPIIEAQRECPLDLKEAISSVCFAAPRCSDLTELQQVQIL 128

Query: 123 FKAKYGKEFVSAAVELRPECGANR------------------------------MLVEKM 182
           F +KYGKEFV+AA EL+P+ G NR                               LVE +
Sbjct: 129 FVSKYGKEFVAAASELKPDSGVNRKTESLIFIAWFSLVETRDLFMFLYFSNSILQLVELL 188

Query: 183 SAKAPDGQTKLKILTAIAEEHKIKWDP-KTFCDSSNPPADLLNGPNTFGQASQIQRGEPI 242
           S +AP  +TKLK+L  IAEEH++ WDP  T  D      DLL+GP  FG  S++    P+
Sbjct: 189 SVRAPSPETKLKLLKEIAEEHELDWDPASTETDLFKSHEDLLDGPKQFGGGSKL----PL 248

Query: 243 GGQPSLDHNNRGSASVQVPSKSDEGHRIPE--KSPEHNLRPTLHPQKSNFTNDYTNQS-N 302
             + +   N    ++ +  S SD  + I +  + P   LRPT      N  +   + S  
Sbjct: 249 PEEQNEKTNLTSLSAAKEKSDSDSEYDILDFPEVPNVLLRPTPGATSVNAPDAAKSASYE 308

Query: 303 ITGHRIPETRSSEGTHRHTNSGDQN-------------------------NYS------- 362
            T H +P    + G  +  +  D++                         NYS       
Sbjct: 309 HTSHDLPFDSENAGVEKTASKRDEHPAKASKTVVEGQQSSPILMESFEKKNYSPPSIDAV 368

Query: 363 -------------SGRQHWNMDFKDATSAAKAAAESAELASLAARAAAELSSR--GNISQ 422
                        + R+  + D +D   AA+AAA+SAE A+ AAR+AA L+      +++
Sbjct: 369 GPIPTKESGASRDTPRKISDGDLQDVLMAAQAAADSAERAASAARSAASLAQLRINELTR 428

Query: 423 SSSSGFQKSSSYNLRAEGPQEYANLNLQDQQLPKDQVVSAPHRSSIMDDNRRENDARRFM 482
            +S  + +S                       P +    AP   ++  D+  +N +    
Sbjct: 429 KTSDQYPES-----------------------PSENPFHAPSMGNLQFDH--QNSSASSS 488

Query: 483 GDDDKKLRYPSSGASNIDANASGTNFNAS---DRYSFKNSSESGFNDSLGSSASVEKQPR 542
           GD  +  R  +S   N + N      ++S    ++  +NSS S + D         ++P+
Sbjct: 489 GDLTELQRAETSSLFNSEQNNQQPQTHSSMEKPQFDRQNSSFSSYGDLTPQRFHSMEKPQ 548

Query: 543 KFDASTSVNNFNASDRYSFKNSSEPGFSDSLHSATVEKQPKDYDSNTYVTNFNASDRYSF 602
               ++S +N++      F +  +P F                  N+  +N++    + F
Sbjct: 549 FDHQNSSGSNYDDLTPQRFSSLEKPQFD---------------HQNSSGSNYDDLTPHRF 608

Query: 603 KNSSEPGFGDSHSSASMEKQPRNFDVEYVSDRPSGTGSERTSYYEDVRIGNDSNRVPSYE 614
            +  +P F   +SS S      ++       RP  +  +R S  +D    +   R+PS E
Sbjct: 609 PSMEKPQFDHQNSSVS------SYGDLPELQRPETSPLDRLSPDQD----HQQMRLPSME 648

The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
Q54I39 | 3.4e-23 | 30.45 | IST1-like protein OS=Dictyostelium discoideum OX=44689 GN=DDB_G0289029 PE=3 SV=1
P53990 | 7.5e-23 | 32.24 | IST1 homolog OS=Homo sapiens OX=9606 GN=IST1 PE=1 SV=1
Q3ZBV1 | 3.7e-22 | 31.78 | IST1 homolog OS=Bos taurus OX=9913 GN=IST1 PE=2 SV=1
Q5R6G8 | 4.9e-22 | 31.78 | IST1 homolog OS=Pongo abelii OX=9601 GN=IST1 PE=2 SV=1
Q9CX00 | 6.4e-22 | 31.78 | IST1 homolog OS=Mus musculus OX=10090 GN=Ist1 PE=1 SV=1
Match Name | E-value | Identity (%) | Description
XP_022159239.1 | 0.0 | 99.58 | uncharacterized protein LOC111025657 isoform X1 [Momordica charantia]
XP_022159240.1 | 0.0 | 99.58 | uncharacterized protein LOC111025657 isoform X2 [Momordica charantia]
XP_023531863.1 | 0.0 | 74.37 | filaggrin-like isoform X1 [Cucurbita pepo subsp. pepo]
KAG7021918.1 | 0.0 | 74.37 | IST1-like protein [Cucurbita argyrosperma subsp. argyrosperma]
XP_022927973.1 | 0.0 | 74.29 | filaggrin isoform X1 [Cucurbita moschata]
Match Name | E-value | Identity (%) | Description
A0A6J1E1U8 | 0.0 | 99.58 | uncharacterized protein LOC111025657 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1DY48 | 0.0 | 99.58 | uncharacterized protein LOC111025657 isoform X2 OS=Momordica charantia OX=3673 G...
A0A6J1EMJ4 | 0.0 | 74.29 | filaggrin isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111434877 PE=3 SV=1
A0A6J1KP60 | 0.0 | 74.12 | uncharacterized protein LOC111496330 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1EJ03 | 0.0 | 73.53 | uncharacterized protein LOC111434877 isoform X2 OS=Cucurbita moschata OX=3662 GN...
Match Name | E-value | Identity (%) | Description
AT2G19710.1 | 1.0e-75 | 29.90 | Regulator of Vps4 activity in the MVB pathway protein
AT4G29440.1 | 3.4e-71 | 29.36 | Regulator of Vps4 activity in the MVB pathway protein
AT4G29440.2 | 7.1e-61 | 28.69 | Regulator of Vps4 activity in the MVB pathway protein
AT1G34220.2 | 1.1e-58 | 31.43 | Regulator of Vps4 activity in the MVB pathway protein
AT1G34220.1 | 5.0e-54 | 30.07 | Regulator of Vps4 activity in the MVB pathway protein
InterPro
Analysis Name: InterPro Annotations of Bitter gourd (Dali-11) v1
Date Performed: 2021-10-25
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
None | No IPR available | COILS | Coil | Coil | coord: 26..46
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 286..301
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 657..729
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1130..1144
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 900..927
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1145..1171
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1115..1129
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 822..873
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 217..231
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 707..729
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 657..688
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 367..466
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 957..1038
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 784..806
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 938..956
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 414..466
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 179..301
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 386..410
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 250..276
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 753..1171
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 1054..1114
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 187..210
None | No IPR available | MOBIDB_LITE | mobidb-lite | disorder_prediction | coord: 234..249
None | No IPR available | PANTHER | PTHR12161:SF13 | REGULATOR OF VPS4 ACTIVITY IN THE MVB PATHWAY PROTEIN | coord: 3..1184
IPR042277 | Vacuolar protein sorting-associated protein IST1-like | GENE3D | 1.20.1260.60 |  | coord: 1..187; e-value: 5.5E-73; score: 246.5
IPR005061 | Vacuolar protein sorting-associated protein Ist1 | PFAM | PF03398 | Ist1 | coord: 12..176; e-value: 2.8E-56; score: 190.0
IPR005061 | Vacuolar protein sorting-associated protein Ist1 | PANTHER | PTHR12161 | IST1 FAMILY MEMBER | coord: 3..1184
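
Many of the MobiDB-lite coordinate ranges above are nested or overlapping, so they are easier to interpret once merged. The following is a minimal sketch of that merge, not part of the InterPro analysis itself: the helper names are hypothetical, and the protein length of 1186 residues is an assumption taken from the last query coordinate in the BLAST alignments.

```python
def merge_intervals(intervals):
    """Merge inclusive (start, end) residue ranges that overlap or touch."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous range when this one overlaps or abuts it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_fraction(intervals, length):
    """Fraction of a sequence of the given length covered by the ranges."""
    covered = sum(end - start + 1 for start, end in merge_intervals(intervals))
    return covered / length


# MobiDB-lite "coord" values copied from the annotation table above.
MOBIDB_LITE = [
    (286, 301), (657, 729), (1130, 1144), (900, 927), (1145, 1171),
    (1115, 1129), (822, 873), (217, 231), (707, 729), (657, 688),
    (367, 466), (957, 1038), (784, 806), (938, 956), (414, 466),
    (179, 301), (386, 410), (250, 276), (753, 1171), (1054, 1114),
    (187, 210), (234, 249),
]

# Assumption: length inferred from the query coordinates, not stated in the table.
PROTEIN_LENGTH = 1186

for start, end in merge_intervals(MOBIDB_LITE):
    print(f"{start}..{end}")
print(f"disordered fraction: {covered_fraction(MOBIDB_LITE, PROTEIN_LENGTH):.3f}")
```

Sorting before merging keeps the pass linear after the sort, and treating directly adjacent ranges as contiguous avoids reporting one disordered stretch as two.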

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Type
MC06g0751.1 | MC06g0751.1 | mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category | Term Accession | Term Name
biological_process | GO:0015031 | protein transport