HG10022259 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10022259
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: PCI domain-containing protein
Location: Chr05: 22403893 .. 22408076 (+)
RNA-Seq Expression: HG10022259
Synteny: HG10022259
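
The Location field can be turned into a genomic span length with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this kind of record, but not stated on this page); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

def parse_location(text):
    """Parse a location string such as 'Chr05: 22403893 .. 22408076 (+)'.

    Returns (chromosome, start, end, strand). The accepted format is inferred
    from this record only and is an assumption, not a specification.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end, strand = parse_location("Chr05: 22403893 .. 22408076 (+)")
print(chrom, strand, span_length(start, end))  # prints: Chr05 + 4184
```

If the coordinates were 0-based and half-open instead, the length would be `end - start`, so the convention is worth confirming before relying on the result.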
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGTCTACTAGTGATTGTAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTTAATCCAATCTTCTCCACCCAAAGAAATATCTTCTCTTCCATCACTAGATGATAACATAGTAAAGGTAGAAGAGCCTTGCGTATCTGATGGTACAACAGTTTCAAATTCTAGTACAATAACAACTTCTGAATTTTCAGAAAAGAAGAAGATTTTATTTTCCGAAGATGGTAATTGGAAATCTGATTTATGCAATGTCAATATGGTCCAAAGCAATATTGGACCTTCCAGAGTCGAGGTTCAGGATAATGATGCTCGTTCTATAGGCTGCGTGGAAAATAAGGGAACATGTATGGTGAATGAAAATCATGCGCTTGCTCTGCATGAGAAGCCTGAGTTAAAGTTACAACCTTCTGATGCGAACTCTAACCCTGGACTTTGTGCTGAAAAGAAAAGTGATGAGATTGACAGAAAAGAACTTGATAGATGTAATTCTTCAACTTCTTTAGTTAAAAATGAAGTTGAATTATCGGTTGGTTTGAAGGAACATCTTGTTCCTGACTCGGTTCTAGAAGGGAATAATTTAGAACCAGTGTTATTGAACTTAAGTTTAAGCAAGCAAGGAAGTCACACCCAGTGTCTCACTGGTAATGTTGGGTCTGTTTGTGATGGTTCTCTTCAGCAGTCAAATAGGGGAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGGCTGTGCAACTGATGATCCTCCAGTGCATGTTCCAGTTGTTCAGACAAACACAATTGTCACCACACATAGATGCTCAACGGAAATGGTTAAAACTGATACTCTGTCTGGAAAACTAACCCCTTTAGGTCACAGTGATCATCTTCATCTAAGTCTTTGTTCATCTGATCATAGGCATGTAATGACTCAGGAACAAAGTTCATTTGTTAAGTTAGATTTTAGGAAATCAAGTCCTTTAAGCTCAGAAGGAAGAAGTAAGCAATCTGATGATCTTGATGGCGCACTAAAAGATGTAAAGCCAGAACCATTTGTTGAGGGTTCCAGACTTGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAAACAGTGCTGTAGTGAAGCGTGAATTTCTTCAACTTCCCAATGCTTCAGATATTTACAGATCAATGAACATAGTTAAGGCTAAATCTGTTAAATCTGAATCAATTTATCAAAGTAAACAGGAAGCACTCAAAACATTAGGTGGTAGATTAGATCTGGTAGAAAAGCAAGTTCCTCCAGAGGTTGATAATTCTTGTCCTGTACCAATGCCTTTTGTGGCAGAGAGGTCAGACGCAATTGGAAATCCTTCTTGTTCAACTGATTTGGTTACAGACAAAGACATGTCAAACTATTCAGAATTGCAAACCCCTACTAAAGAACATCTTAGTACGATAGTGCAACAAGGAGGATGTGGTGGTGAACTTGTTAAGTCAGAGATGACCGATATAAGTAAGGATACAGGTTCCAAAGATTTCAGTAGTCCTATTATAAAACCTATAGTAATGCCTGTTGTGGCTGAGATGTTGAAAGCAGCTAAAAATCCTTCTTGTACAAATGATATGATTATAGACAGAGACGTGCCAAACCATTCAGAATTGGAAACTCCAACTATAGGACCTCTTAATAGGAAAGTGCACCAAGAGGGATATGGCTGTGATGGTGGACTTGTGAATTCAGAAATGACGGATTTAAGTAAGGATATATGTTCCAAAGATTCCAGTAGCTCTGTTATGAAACCATTCATTATTGGGGATCAAAATGAGAATAATCCTCCATGGCGTCGTTTGGAACACATGAATGAGCAGTGCTCTAGTTTGCATGGAGGTGAGGAATGTTCTGTTAGTGATGAGGAAAAGATCAGCATATCAGCCGATTTATTAGAAGAAGATCCTTATAGTTCTGAATATGAATCAGATGGTAAGCAGGATGTAAATGAGGCCATGGATGCAGTTGATAATGATATAGAAGA
AGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGACGACTCAAGTAGAAAGCAGTATATGTGAGACAAGAGAAGTAAAAAAATTTGATCATGGTGATTGTAGCAATGGACTTCCTGGTTCTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAGTTAAATCAGAAATTCTTGATGTTAAACGAGAAGACATTCTTAATTCTGTTATTTCTAATCAATCTTCTGAGCAAGAACATTTGAAAGAGCTACTTGTTGAAGATAATACCACTAAGGTGTGTTTGAACAAGGCCAACAAGGCTATAAAAGCTACAGGTCCTAGGAAATTGTTTCATTGCGAGAAAATATCTGCCTTAGAGGACCAGAAAATTTTTTCTGATAAAGCCACTACTAGAATTGAAGAATCGATTGTGACAGTTCCTCAGAGTGATGCAGAGAATGTTAAAACAGTAGATTTTGTGCAAAACGACGATCTAACTTTGCCAAATATTAGAGAGCCTTTAAATAATGATGATGATGTTACTGATGATTTTACTCATGGCAATCGACATGCCCAGATTGTTAATCCCTGTCAAGCTTCTACTTCATCTCCTATTAAAACAAGACCAAGTTTAGTGAGGTCGGTTTTAACACAAACCGATAGAGAACTAATACCTGACATGGCGCATGAAGGGGAAAAATTACAACCTCAAGGAAGGTGATTGTTTGTCATGACTTCTTTTCTGTTTTTCTTCTTATATATATATATATTTGTCAACCAAGCCTGTTTCTTTACAGCATTAATTCGATTACTTTCTGCATTGCTGATATCTCTCCATTTTCTAGAGATGACTCATACAGGGACGTTTTCCCAAAATTTTATGTGAATAGACATCAGAATCTTTCACCCAGAACGAATTTTACTCGTAGAAGAGGTAGATTCACTATCCGGATTAACAATGTCCAAGGTGAATGGGATTTTAATCGAACAATTTCTCTAGGAGTTTACAATGATCAAATACAACCCTATGATGCCCGTAGACGTAAATACATGCCTACCATTTCTGATGACGACATTGATCAAAATCATTATAAAATGAAACCTAGTGGTCCATTTCGTACCGCTGGTCACAGAGGTAGACAAATTTTAGACGATGAAGGCCCCATTTTTTGTCATATACCCTCTAGGAGGAAGTCACCTGGTAGAAGAGATGGGCCTCCAGTACGAGGTGGTGTTAAAATGGTACACAGAATGCATAGAAATATCAGTCCAGGTAGATGCATTCGTGAACCTGAGTCTGAATTGGTTGGACCGCGACACGGTGAAAAGTTTATGAGGACTTTAGAAGATGAGGCCATGGATCCAATATATGCACACCCTCAACCTCCATTTGAGGTAGATAGGCCTCCTTTTATCCCAGACCGAAGGAACTTTCCTATCCAAAGAAAAAGCTTTCCAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCATCCAAAAGAAAGTCAGAAAGGTTCTTTCCACATCCTGAAATGGCACGTCGAAGTCCACCAGGTTACAGGATGAGATCCCCTGATCAACCTCCTCCTGGAGATATGCCAGTTCGAAGACACGGTTTCCCTTTTCCGTCACTGCCACCCAACGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCACATGAGATCGGGTATACGAAGTAGGAACCGAACAGACAGAATATCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAACGAATACTTTGATGGGCCTGTACATCCTGGTCAATTGAATGAACTGGTTGGTGATGGTAATGATGACGACCGAAGAAGGTTTCCTGACAGACACGAACATCTTCACCAATTCCGGCCGCAATGTAATGATTCTGATGGTGAAAACTATCATATCGATGCAGACGAAAGGGCGAGACCTTTCAGATATTGTGCAGAGGATGAAGCAGAGTTTCATGA
AAGAGGTAAGATGAGGGAAAGGGAATTTGATAGACGTTTAAAGAACCAACCAGGAAATTTAGGTAGACGAACAGGAGGAGTTATTGAAGAACATGAAGTTGAAGAATACAGGCATGGTCGGCAGATGTGGAATGAACATCATGGCTTTGAAGATATCTCACGAATGAAAAGAAAAAGATTTTGA

mRNA sequence

ATGTCTACTAGTGATTGTAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTTAATCCAATCTTCTCCACCCAAAGAAATATCTTCTCTTCCATCACTAGATGATAACATAGTAAAGGTAGAAGAGCCTTGCGTATCTGATGGTACAACAGTTTCAAATTCTAGTACAATAACAACTTCTGAATTTTCAGAAAAGAAGAAGATTTTATTTTCCGAAGATGGTAATTGGAAATCTGATTTATGCAATGTCAATATGGTCCAAAGCAATATTGGACCTTCCAGAGTCGAGGTTCAGGATAATGATGCTCGTTCTATAGGCTGCGTGGAAAATAAGGGAACATGTATGGTGAATGAAAATCATGCGCTTGCTCTGCATGAGAAGCCTGAGTTAAAGTTACAACCTTCTGATGCGAACTCTAACCCTGGACTTTGTGCTGAAAAGAAAAGTGATGAGATTGACAGAAAAGAACTTGATAGATGTAATTCTTCAACTTCTTTAGTTAAAAATGAAGTTGAATTATCGGTTGGTTTGAAGGAACATCTTGTTCCTGACTCGGTTCTAGAAGGGAATAATTTAGAACCAGTGTTATTGAACTTAAGTTTAAGCAAGCAAGGAAGTCACACCCAGTGTCTCACTGGTAATGTTGGGTCTGTTTGTGATGGTTCTCTTCAGCAGTCAAATAGGGGAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGGCTGTGCAACTGATGATCCTCCAGTGCATGTTCCAGTTGTTCAGACAAACACAATTGTCACCACACATAGATGCTCAACGGAAATGGTTAAAACTGATACTCTGTCTGGAAAACTAACCCCTTTAGGTCACAGTGATCATCTTCATCTAAGTCTTTGTTCATCTGATCATAGGCATGTAATGACTCAGGAACAAAGTTCATTTGTTAAGTTAGATTTTAGGAAATCAAGTCCTTTAAGCTCAGAAGGAAGAAGTAAGCAATCTGATGATCTTGATGGCGCACTAAAAGATGTAAAGCCAGAACCATTTGTTGAGGGTTCCAGACTTGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAAACAGTGCTGTAGTGAAGCGTGAATTTCTTCAACTTCCCAATGCTTCAGATATTTACAGATCAATGAACATAGTTAAGGCTAAATCTGTTAAATCTGAATCAATTTATCAAAGTAAACAGGAAGCACTCAAAACATTAGGTGGTAGATTAGATCTGGTAGAAAAGCAAGTTCCTCCAGAGGTTGATAATTCTTGTCCTGTACCAATGCCTTTTGTGGCAGAGAGGTCAGACGCAATTGGAAATCCTTCTTGTTCAACTGATTTGGTTACAGACAAAGACATGTCAAACTATTCAGAATTGCAAACCCCTACTAAAGAACATCTTAGTACGATAGTGCAACAAGGAGGATGTGGTGGTGAACTTGTTAAGTCAGAGATGACCGATATAAGTAAGGATACAGGTTCCAAAGATTTCAGTAGTCCTATTATAAAACCTATAGTAATGCCTGTTGTGGCTGAGATGTTGAAAGCAGCTAAAAATCCTTCTTGTACAAATGATATGATTATAGACAGAGACGTGCCAAACCATTCAGAATTGGAAACTCCAACTATAGGACCTCTTAATAGGAAAGTGCACCAAGAGGGATATGGCTGTGATGGTGGACTTGTGAATTCAGAAATGACGGATTTAAGTAAGGATATATGTTCCAAAGATTCCAGTAGCTCTGTTATGAAACCATTCATTATTGGGGATCAAAATGAGAATAATCCTCCATGGCGTCGTTTGGAACACATGAATGAGCAGTGCTCTAGTTTGCATGGAGGTGAGGAATGTTCTGTTAGTGATGAGGAAAAGATCAGCATATCAGCCGATTTATTAGAAGAAGATCCTTATAGTTCTGAATATGAATCAGATGGTAAGCAGGATGTAAATGAGGCCATGGATGCAGTTGATAATGATATAGAAGA
AGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGACGACTCAAGTAGAAAGCAGTATATGTGAGACAAGAGAAGTAAAAAAATTTGATCATGGTGATTGTAGCAATGGACTTCCTGGTTCTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAGTTAAATCAGAAATTCTTGATGTTAAACGAGAAGACATTCTTAATTCTGTTATTTCTAATCAATCTTCTGAGCAAGAACATTTGAAAGAGCTACTTGTTGAAGATAATACCACTAAGGTGTGTTTGAACAAGGCCAACAAGGCTATAAAAGCTACAGGTCCTAGGAAATTGTTTCATTGCGAGAAAATATCTGCCTTAGAGGACCAGAAAATTTTTTCTGATAAAGCCACTACTAGAATTGAAGAATCGATTGTGACAGTTCCTCAGAGTGATGCAGAGAATGTTAAAACAGTAGATTTTGTGCAAAACGACGATCTAACTTTGCCAAATATTAGAGAGCCTTTAAATAATGATGATGATGTTACTGATGATTTTACTCATGGCAATCGACATGCCCAGATTGTTAATCCCTGTCAAGCTTCTACTTCATCTCCTATTAAAACAAGACCAAGTTTAGTGAGGTCGGTTTTAACACAAACCGATAGAGAACTAATACCTGACATGGCGCATGAAGGGGAAAAATTACAACCTCAAGGAAGAGATGACTCATACAGGGACGTTTTCCCAAAATTTTATGTGAATAGACATCAGAATCTTTCACCCAGAACGAATTTTACTCGTAGAAGAGGTAGATTCACTATCCGGATTAACAATGTCCAAGGTGAATGGGATTTTAATCGAACAATTTCTCTAGGAGTTTACAATGATCAAATACAACCCTATGATGCCCGTAGACGTAAATACATGCCTACCATTTCTGATGACGACATTGATCAAAATCATTATAAAATGAAACCTAGTGGTCCATTTCGTACCGCTGGTCACAGAGGTAGACAAATTTTAGACGATGAAGGCCCCATTTTTTGTCATATACCCTCTAGGAGGAAGTCACCTGGTAGAAGAGATGGGCCTCCAGTACGAGGTGGTGTTAAAATGGTACACAGAATGCATAGAAATATCAGTCCAGGTAGATGCATTCGTGAACCTGAGTCTGAATTGGTTGGACCGCGACACGGTGAAAAGTTTATGAGGACTTTAGAAGATGAGGCCATGGATCCAATATATGCACACCCTCAACCTCCATTTGAGGTAGATAGGCCTCCTTTTATCCCAGACCGAAGGAACTTTCCTATCCAAAGAAAAAGCTTTCCAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCATCCAAAAGAAAGTCAGAAAGGTTCTTTCCACATCCTGAAATGGCACGTCGAAGTCCACCAGGTTACAGGATGAGATCCCCTGATCAACCTCCTCCTGGAGATATGCCAGTTCGAAGACACGGTTTCCCTTTTCCGTCACTGCCACCCAACGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCACATGAGATCGGGTATACGAAGTAGGAACCGAACAGACAGAATATCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAACGAATACTTTGATGGGCCTGTACATCCTGGTCAATTGAATGAACTGGTTGGTGATGGTAATGATGACGACCGAAGAAGGTTTCCTGACAGACACGAACATCTTCACCAATTCCGGCCGCAATGTAATGATTCTGATGGTGAAAACTATCATATCGATGCAGACGAAAGGGCGAGACCTTTCAGATATTGTGCAGAGGATGAAGCAGAGTTTCATGAAAGAGGTAAGATGAGGGAAAGGGAATTTGATAGACGTTTAAAGAACCAACCAGGAAATTTAGGTAGACGAACAGGAGGAGTTATTGAAGAACATGAAGTTGAAGAATACAGGCATGGTCGGCAGATGT
GGAATGAACATCATGGCTTTGAAGATATCTCACGAATGAAAAGAAAAAGATTTTGA

Coding sequence (CDS)

ATGTCTACTAGTGATTGTAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTTAATCCAATCTTCTCCACCCAAAGAAATATCTTCTCTTCCATCACTAGATGATAACATAGTAAAGGTAGAAGAGCCTTGCGTATCTGATGGTACAACAGTTTCAAATTCTAGTACAATAACAACTTCTGAATTTTCAGAAAAGAAGAAGATTTTATTTTCCGAAGATGGTAATTGGAAATCTGATTTATGCAATGTCAATATGGTCCAAAGCAATATTGGACCTTCCAGAGTCGAGGTTCAGGATAATGATGCTCGTTCTATAGGCTGCGTGGAAAATAAGGGAACATGTATGGTGAATGAAAATCATGCGCTTGCTCTGCATGAGAAGCCTGAGTTAAAGTTACAACCTTCTGATGCGAACTCTAACCCTGGACTTTGTGCTGAAAAGAAAAGTGATGAGATTGACAGAAAAGAACTTGATAGATGTAATTCTTCAACTTCTTTAGTTAAAAATGAAGTTGAATTATCGGTTGGTTTGAAGGAACATCTTGTTCCTGACTCGGTTCTAGAAGGGAATAATTTAGAACCAGTGTTATTGAACTTAAGTTTAAGCAAGCAAGGAAGTCACACCCAGTGTCTCACTGGTAATGTTGGGTCTGTTTGTGATGGTTCTCTTCAGCAGTCAAATAGGGGAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGGCTGTGCAACTGATGATCCTCCAGTGCATGTTCCAGTTGTTCAGACAAACACAATTGTCACCACACATAGATGCTCAACGGAAATGGTTAAAACTGATACTCTGTCTGGAAAACTAACCCCTTTAGGTCACAGTGATCATCTTCATCTAAGTCTTTGTTCATCTGATCATAGGCATGTAATGACTCAGGAACAAAGTTCATTTGTTAAGTTAGATTTTAGGAAATCAAGTCCTTTAAGCTCAGAAGGAAGAAGTAAGCAATCTGATGATCTTGATGGCGCACTAAAAGATGTAAAGCCAGAACCATTTGTTGAGGGTTCCAGACTTGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAAACAGTGCTGTAGTGAAGCGTGAATTTCTTCAACTTCCCAATGCTTCAGATATTTACAGATCAATGAACATAGTTAAGGCTAAATCTGTTAAATCTGAATCAATTTATCAAAGTAAACAGGAAGCACTCAAAACATTAGGTGGTAGATTAGATCTGGTAGAAAAGCAAGTTCCTCCAGAGGTTGATAATTCTTGTCCTGTACCAATGCCTTTTGTGGCAGAGAGGTCAGACGCAATTGGAAATCCTTCTTGTTCAACTGATTTGGTTACAGACAAAGACATGTCAAACTATTCAGAATTGCAAACCCCTACTAAAGAACATCTTAGTACGATAGTGCAACAAGGAGGATGTGGTGGTGAACTTGTTAAGTCAGAGATGACCGATATAAGTAAGGATACAGGTTCCAAAGATTTCAGTAGTCCTATTATAAAACCTATAGTAATGCCTGTTGTGGCTGAGATGTTGAAAGCAGCTAAAAATCCTTCTTGTACAAATGATATGATTATAGACAGAGACGTGCCAAACCATTCAGAATTGGAAACTCCAACTATAGGACCTCTTAATAGGAAAGTGCACCAAGAGGGATATGGCTGTGATGGTGGACTTGTGAATTCAGAAATGACGGATTTAAGTAAGGATATATGTTCCAAAGATTCCAGTAGCTCTGTTATGAAACCATTCATTATTGGGGATCAAAATGAGAATAATCCTCCATGGCGTCGTTTGGAACACATGAATGAGCAGTGCTCTAGTTTGCATGGAGGTGAGGAATGTTCTGTTAGTGATGAGGAAAAGATCAGCATATCAGCCGATTTATTAGAAGAAGATCCTTATAGTTCTGAATATGAATCAGATGGTAAGCAGGATGTAAATGAGGCCATGGATGCAGTTGATAATGATATAGAAGA
AGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGACGACTCAAGTAGAAAGCAGTATATGTGAGACAAGAGAAGTAAAAAAATTTGATCATGGTGATTGTAGCAATGGACTTCCTGGTTCTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAGTTAAATCAGAAATTCTTGATGTTAAACGAGAAGACATTCTTAATTCTGTTATTTCTAATCAATCTTCTGAGCAAGAACATTTGAAAGAGCTACTTGTTGAAGATAATACCACTAAGGTGTGTTTGAACAAGGCCAACAAGGCTATAAAAGCTACAGGTCCTAGGAAATTGTTTCATTGCGAGAAAATATCTGCCTTAGAGGACCAGAAAATTTTTTCTGATAAAGCCACTACTAGAATTGAAGAATCGATTGTGACAGTTCCTCAGAGTGATGCAGAGAATGTTAAAACAGTAGATTTTGTGCAAAACGACGATCTAACTTTGCCAAATATTAGAGAGCCTTTAAATAATGATGATGATGTTACTGATGATTTTACTCATGGCAATCGACATGCCCAGATTGTTAATCCCTGTCAAGCTTCTACTTCATCTCCTATTAAAACAAGACCAAGTTTAGTGAGGTCGGTTTTAACACAAACCGATAGAGAACTAATACCTGACATGGCGCATGAAGGGGAAAAATTACAACCTCAAGGAAGAGATGACTCATACAGGGACGTTTTCCCAAAATTTTATGTGAATAGACATCAGAATCTTTCACCCAGAACGAATTTTACTCGTAGAAGAGGTAGATTCACTATCCGGATTAACAATGTCCAAGGTGAATGGGATTTTAATCGAACAATTTCTCTAGGAGTTTACAATGATCAAATACAACCCTATGATGCCCGTAGACGTAAATACATGCCTACCATTTCTGATGACGACATTGATCAAAATCATTATAAAATGAAACCTAGTGGTCCATTTCGTACCGCTGGTCACAGAGGTAGACAAATTTTAGACGATGAAGGCCCCATTTTTTGTCATATACCCTCTAGGAGGAAGTCACCTGGTAGAAGAGATGGGCCTCCAGTACGAGGTGGTGTTAAAATGGTACACAGAATGCATAGAAATATCAGTCCAGGTAGATGCATTCGTGAACCTGAGTCTGAATTGGTTGGACCGCGACACGGTGAAAAGTTTATGAGGACTTTAGAAGATGAGGCCATGGATCCAATATATGCACACCCTCAACCTCCATTTGAGGTAGATAGGCCTCCTTTTATCCCAGACCGAAGGAACTTTCCTATCCAAAGAAAAAGCTTTCCAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCATCCAAAAGAAAGTCAGAAAGGTTCTTTCCACATCCTGAAATGGCACGTCGAAGTCCACCAGGTTACAGGATGAGATCCCCTGATCAACCTCCTCCTGGAGATATGCCAGTTCGAAGACACGGTTTCCCTTTTCCGTCACTGCCACCCAACGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCACATGAGATCGGGTATACGAAGTAGGAACCGAACAGACAGAATATCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAACGAATACTTTGATGGGCCTGTACATCCTGGTCAATTGAATGAACTGGTTGGTGATGGTAATGATGACGACCGAAGAAGGTTTCCTGACAGACACGAACATCTTCACCAATTCCGGCCGCAATGTAATGATTCTGATGGTGAAAACTATCATATCGATGCAGACGAAAGGGCGAGACCTTTCAGATATTGTGCAGAGGATGAAGCAGAGTTTCATGAAAGAGGTAAGATGAGGGAAAGGGAATTTGATAGACGTTTAAAGAACCAACCAGGAAATTTAGGTAGACGAACAGGAGGAGTTATTGAAGAACATGAAGTTGAAGAATACAGGCATGGTCGGCAGATGT
GGAATGAACATCATGGCTTTGAAGATATCTCACGAATGAAAAGAAAAAGATTTTGA

Protein sequence

MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITTSEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNENHALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKEHLVPDSVLEGNNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNWDLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHLSLCSSDHRHVMTQEQSSFVKLDFRKSSPLSSEGRSKQSDDLDGALKDVKPEPFVEGSRLESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLGGRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEHLSTIVQQGGCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTNDMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMKPFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESDGKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSDCSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAIKATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLPNIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSSPIKTRPSLVRSVLTQTDRELIPDMAHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRTISLGVYNDQIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAGHRGRQILDDEGPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFPHPEMARRSPPGYRMRSPDQPPPGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRDRIESNEYFDGPVHPGQLNELVGDGNDDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFDRRLKNQPGNLGRRTGGVIEEHEVEEYRHGRQMWNEHHGFEDISRMKRKRF
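
The relationship between the coding sequence and the protein sequence above can be checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code and a CDS that starts in frame at its first base; `translate` is a hypothetical helper written for illustration, and only short prefixes and suffixes of the sequences above are used as inputs.

```python
# Build the standard genetic code table in TCAG order (assumed applicable here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def translate(cds, stop_at_stop=True):
    """Translate a DNA coding sequence into a one-letter protein string.

    Whitespace and line breaks are removed first, so a sequence wrapped over
    several lines can be passed in directly. Codons containing characters
    other than A, C, G, T are rendered as 'X'.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and stop_at_stop:
            break
        protein.append(aa)
    return "".join(protein)

# First eight codons of the CDS above.
print(translate("ATGTCTACTAGTGATTGTAATGCA"))  # prints: MSTSDCNA
# Last four codons of the CDS above, ending in the TGA stop codon.
print(translate("AAAAGATTTTGA"))  # prints: KRF
```

Passing the full CDS (with its line breaks) to `translate` and comparing the result with the listed protein sequence is a quick consistency check on a downloaded record.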
Homology
BLAST of HG10022259 vs. NCBI nr
Match: XP_038890337.1 (uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890338.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890339.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890340.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890341.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890342.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida])

HSP 1 Score: 2296.2 bits (5949), Expect = 0.0e+00
Identity = 1171/1357 (86.29%), Positives = 1239/1357 (91.30%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MST+D   IVPIKKRRFP IQSSPPKEISSLP +DDN+VKVEEPCVSD  TVSNSSTITT
Sbjct: 1    MSTNDYTTIVPIKKRRFPSIQSSPPKEISSLPPVDDNMVKVEEPCVSDSPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSEKKKI FSEDGNWKSDLCNVNMVQS+IGPSRVE + ND    G V NK TC+VNEN
Sbjct: 61   SEFSEKKKISFSEDGNWKSDLCNVNMVQSSIGPSRVEFKKNDDCFTGSVGNKETCLVNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
              LAL EKPELKL  SD +SNPG+CAEKKSDEI RKELD+CNSSTS+VK EVELS+ LKE
Sbjct: 121  RMLALQEKPELKLPSSDPDSNPGVCAEKKSDEIHRKELDKCNSSTSVVKKEVELSLSLKE 180

Query: 181  HLVPDSVLEGNNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNWDLNTSMEF 240
             LVP SVLEGNNL PV+LNLSLSKQGSHTQCLTGNVGS  DGSLQQSNR NWDLNTSMEF
Sbjct: 181  RLVPVSVLEGNNLGPVVLNLSLSKQGSHTQCLTGNVGSDNDGSLQQSNRENWDLNTSMEF 240

Query: 241  WEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLT-PLGHSDHLHLSLCSSDH 300
            WEGCA+DDPPVHVPVVQTNT V T RCSTEMVKTDTL GKLT PL HSDHLHLSLCSSDH
Sbjct: 241  WEGCASDDPPVHVPVVQTNTTVATDRCSTEMVKTDTLFGKLTHPLDHSDHLHLSLCSSDH 300

Query: 301  RHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRLESKSDEV 360
            RHVM+QEQSSF+KLDFRKSSP LSS GRSKQ DDL+G LK VK EPF EGS+LESKSDEV
Sbjct: 301  RHVMSQEQSSFIKLDFRKSSPSLSSPGRSKQFDDLNGTLKVVKSEPFAEGSKLESKSDEV 360

Query: 361  NVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLGGRLDLVE 420
            NV G+S++AVVKR FLQLP+ASDIY+SMNIVK+KS+KSESIYQSKQEALKTLGGRLDLVE
Sbjct: 361  NVPGVSDNAVVKRGFLQLPSASDIYKSMNIVKSKSIKSESIYQSKQEALKTLGGRLDLVE 420

Query: 421  KQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEHLSTIVQQ 480
            KQV  +VDNSC VPM FVAE S+  GNPSC+TDL+ DKDMSN+SELQTP+KEH+STI+ Q
Sbjct: 421  KQVLSDVDNSCAVPMSFVAEMSEVAGNPSCTTDLIIDKDMSNHSELQTPSKEHISTIMHQ 480

Query: 481  G---GCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTNDMIIDR 540
            G   GC GELVKSE+TDIS+DTGSKD SSPI KPI +P +AEM K AKNPSCTNDMI+D+
Sbjct: 481  GGSHGCCGELVKSEVTDISEDTGSKDSSSPITKPIAIP-LAEMSKTAKNPSCTNDMIVDK 540

Query: 541  DVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMKPFIIGD 600
            DVPNHSEL+TPT GPLNRKVHQ G GCDGGLVNSEMTDLSKD CSKDS+SSV+KPFI+ D
Sbjct: 541  DVPNHSELQTPTRGPLNRKVHQ-GDGCDGGLVNSEMTDLSKDTCSKDSNSSVIKPFIVED 600

Query: 601  QNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESDGKQDVN 660
            QNENNP W  LEH N+QCSSLHG EECSVSDEEKIS+SADLLEEDPYSSEYESDGKQDVN
Sbjct: 601  QNENNPQWHPLEHRNKQCSSLHGCEECSVSDEEKISLSADLLEEDPYSSEYESDGKQDVN 660

Query: 661  EAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSDCSSLVF 720
            EAMDAVDN IEEDYEDGEVREPIL TQVESSICETREVK FDHGDCSNGLPGSDCSSLV 
Sbjct: 661  EAMDAVDNVIEEDYEDGEVREPILMTQVESSICETREVKIFDHGDCSNGLPGSDCSSLVS 720

Query: 721  VKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAIKATGPRK 780
            VKQE KSEILDVKREDIL+ V SNQSSEQEHLKELLVEDNT+KV LNKANKAIKATGPR+
Sbjct: 721  VKQEDKSEILDVKREDILHFVTSNQSSEQEHLKELLVEDNTSKVSLNKANKAIKATGPRQ 780

Query: 781  LFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLPNIREPLN 840
            LFHCEKI ALEDQKI S++ATT IEESI TV QSDAENVKTVDFVQN+DL LPN++EPLN
Sbjct: 781  LFHCEKIFALEDQKISSERATTGIEESIATVSQSDAENVKTVDFVQNEDLALPNVKEPLN 840

Query: 841  NDDDVTDDFTHGNRHAQIVNPCQASTSSPIKTRPSLVRSVLTQTDRELIPDMAHEGEKLQ 900
            N DDVTDDFT GNRH+QIVNPCQASTSSP KTRPSLVRSVLTQTDRELIPDMAH+GEKLQ
Sbjct: 841  N-DDVTDDFTRGNRHSQIVNPCQASTSSPTKTRPSLVRSVLTQTDRELIPDMAHDGEKLQ 900

Query: 901  PQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRTISLGVYND 960
            PQGRDDSYRDVFPKFYVNR QNLSPRTNFTRRRGRFTIRIN+VQGEWDFN TIS GVYND
Sbjct: 901  PQGRDDSYRDVFPKFYVNRRQNLSPRTNFTRRRGRFTIRINSVQGEWDFNPTISPGVYND 960

Query: 961  QIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAGHRGRQILDDEGPIFCHIPSRRK 1020
            QI PYDARRRKYMP +SD+DIDQNHYKMKP GPFRT GHRGRQILDDEGPIFCHIPSRRK
Sbjct: 961  QIPPYDARRRKYMPAVSDEDIDQNHYKMKPGGPFRTGGHRGRQILDDEGPIFCHIPSRRK 1020

Query: 1021 SPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTLEDEAMDPIYA 1080
            SPGRRDGPP+RGGVKMVH MHRN+SP RCIREP SEL+GPRHGEKFMRTL+DE MDP+Y 
Sbjct: 1021 SPGRRDGPPLRGGVKMVHGMHRNVSPSRCIREPGSELIGPRHGEKFMRTLDDETMDPMY- 1080

Query: 1081 HPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFPHP 1140
            HPQPPFEVDRPP+IPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFF HP
Sbjct: 1081 HPQPPFEVDRPPYIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFGHP 1140

Query: 1141 EMARRSPPGYRMRSPDQPP-PGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200
            EMARRSPPGYRMRSPDQPP  GDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR
Sbjct: 1141 EMARRSPPGYRMRSPDQPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200

Query: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYFDGPVHPGQLNELVGDGNDDDRRRFPDRHEHLH 1260
            NRTDRISFRNRRFEDMDPRDRIESNEY+DGP+HPGQ NELV DGNDDDRRRFPDRHEHLH
Sbjct: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYYDGPIHPGQFNELVVDGNDDDRRRFPDRHEHLH 1260

Query: 1261 QFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFDRRLKNQPGNLGRR 1320
             FRPQCNDSDGENYH DADER RPFRYCAEDEAEFHER KMREREFDRRLKNQ  NLGRR
Sbjct: 1261 PFRPQCNDSDGENYHNDADERPRPFRYCAEDEAEFHERSKMREREFDRRLKNQSENLGRR 1320

Query: 1321 TGGVIEEHEVEEYRHGRQMWNEHHGFEDISRMKRKRF 1352
            T GVIEEHE +EYRHGRQ+WNEHHGFE+ISRMKRKRF
Sbjct: 1321 T-GVIEEHE-QEYRHGRQLWNEHHGFEEISRMKRKRF 1351
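
The percentages in the HSP header of this hit are simply the match counts divided by the alignment length (1357 columns, gaps included). A minimal sketch that reproduces them from the reported counts; `percent` is a hypothetical helper, and the counts are copied from the header above rather than recomputed from the alignment.

```python
def percent(count, aligned_length):
    """Return count / aligned_length as a percentage rounded to two decimals."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * count / aligned_length, 2)

identity = percent(1171, 1357)    # identical residues / alignment columns
positives = percent(1239, 1357)   # identical or conservatively substituted residues

print(f"Identity = {identity:.2f}%, Positives = {positives:.2f}%")
# prints: Identity = 86.29%, Positives = 91.30%
```

Note that the denominator is the alignment length, not the query length (1352 residues), which is why it can exceed the length of either sequence when gaps are present.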

BLAST of HG10022259 vs. NCBI nr
Match: XP_038890343.1 (uncharacterized protein LOC120079942 isoform X2 [Benincasa hispida])

HSP 1 Score: 2245.3 bits (5817), Expect = 0.0e+00
Identity = 1152/1357 (84.89%), Positives = 1219/1357 (89.83%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MST+D   IVPIKKRRFP IQSSPPKEISSLP +DDN+VKVEEPCVSD  TVSNSSTITT
Sbjct: 1    MSTNDYTTIVPIKKRRFPSIQSSPPKEISSLPPVDDNMVKVEEPCVSDSPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSEKKKI FSEDGNWKSDLCNVNMVQS+IGPSRVE + ND    G V NK TC+VNEN
Sbjct: 61   SEFSEKKKISFSEDGNWKSDLCNVNMVQSSIGPSRVEFKKNDDCFTGSVGNKETCLVNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
              LAL EKPELKL  SD +SNPG+CAEKKSDEI RKELD+CNSSTS+VK EVELS+ LKE
Sbjct: 121  RMLALQEKPELKLPSSDPDSNPGVCAEKKSDEIHRKELDKCNSSTSVVKKEVELSLSLKE 180

Query: 181  HLVPDSVLEGNNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNWDLNTSMEF 240
             LVP SVLEGNNL PV+LNLSLSKQGSHTQCLTGNVGS  DGSLQQSNR NWDLNTSMEF
Sbjct: 181  RLVPVSVLEGNNLGPVVLNLSLSKQGSHTQCLTGNVGSDNDGSLQQSNRENWDLNTSMEF 240

Query: 241  WEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLT-PLGHSDHLHLSLCSSDH 300
            WEGCA+DDPPVHVPVVQTNT V T RCSTEMVKTDTL GKLT PL HSDHLHLSLCSSDH
Sbjct: 241  WEGCASDDPPVHVPVVQTNTTVATDRCSTEMVKTDTLFGKLTHPLDHSDHLHLSLCSSDH 300

Query: 301  RHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRLESKSDEV 360
            RHVM+QEQSSF+KLDFRKSSP LSS GRSKQ DDL+G LK VK EPF EGS+LESKSDEV
Sbjct: 301  RHVMSQEQSSFIKLDFRKSSPSLSSPGRSKQFDDLNGTLKVVKSEPFAEGSKLESKSDEV 360

Query: 361  NVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLGGRLDLVE 420
            NV G+S++AVVKR FLQLP+ASDIY+SMNIVK+KS+KSESIYQSKQEALKTLGGRLDLVE
Sbjct: 361  NVPGVSDNAVVKRGFLQLPSASDIYKSMNIVKSKSIKSESIYQSKQEALKTLGGRLDLVE 420

Query: 421  KQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEHLSTIVQQ 480
            KQV  +VDNSC VPM FVAE S+  GNPSC+TDL+ DKDMSN+SELQTP+KEH+STI+ Q
Sbjct: 421  KQVLSDVDNSCAVPMSFVAEMSEVAGNPSCTTDLIIDKDMSNHSELQTPSKEHISTIMHQ 480

Query: 481  G---GCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTNDMIIDR 540
            G   GC GELVKSE+TDIS+DTGSKD SSPI KPI +P +AEM K AKNPSCTNDMI+D+
Sbjct: 481  GGSHGCCGELVKSEVTDISEDTGSKDSSSPITKPIAIP-LAEMSKTAKNPSCTNDMIVDK 540

Query: 541  DVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMKPFIIGD 600
            DVPNHSEL+TPT GPLNRKVHQ G GCDGGLVNSEMTDLSKD CSKDS+SSV+KPFI+ D
Sbjct: 541  DVPNHSELQTPTRGPLNRKVHQ-GDGCDGGLVNSEMTDLSKDTCSKDSNSSVIKPFIVED 600

Query: 601  QNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESDGKQDVN 660
            QNENNP W  LEH N+QCSSLHG EECSVSDEEKIS+SADLLEEDPYSSEYESDGKQDVN
Sbjct: 601  QNENNPQWHPLEHRNKQCSSLHGCEECSVSDEEKISLSADLLEEDPYSSEYESDGKQDVN 660

Query: 661  EAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSDCSSLVF 720
            EAMDAVDN IEEDYEDGEVREPIL TQVESSICETREVK FDHGDCSNGLPGSDCSSLV 
Sbjct: 661  EAMDAVDNVIEEDYEDGEVREPILMTQVESSICETREVKIFDHGDCSNGLPGSDCSSLVS 720

Query: 721  VKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAIKATGPRK 780
            VKQE KSEILDVKREDIL+ V SNQSSEQEHLKELLVEDNT+KV LNKANKAIKATGPR+
Sbjct: 721  VKQEDKSEILDVKREDILHFVTSNQSSEQEHLKELLVEDNTSKVSLNKANKAIKATGPRQ 780

Query: 781  LFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLPNIREPLN 840
            LFHCEKI ALEDQKI S++ATT IEESI TV QSDAENVKTVDFVQN+DL LPN++EPLN
Sbjct: 781  LFHCEKIFALEDQKISSERATTGIEESIATVSQSDAENVKTVDFVQNEDLALPNVKEPLN 840

Query: 841  NDDDVTDDFTHGNRHAQIVNPCQASTSSPIKTRPSLVRSVLTQTDRELIPDMAHEGEKLQ 900
            N DDVTDDFT GNRH+QIVNPCQASTSSP KTRPSLVRSVLTQTDRELIPDMAH+GEKLQ
Sbjct: 841  N-DDVTDDFTRGNRHSQIVNPCQASTSSPTKTRPSLVRSVLTQTDRELIPDMAHDGEKLQ 900

Query: 901  PQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRTISLGVYND 960
            PQGRDDSYRDVFPKFYVNR QNLSPRTNFTRRR                      GVYND
Sbjct: 901  PQGRDDSYRDVFPKFYVNRRQNLSPRTNFTRRR----------------------GVYND 960

Query: 961  QIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAGHRGRQILDDEGPIFCHIPSRRK 1020
            QI PYDARRRKYMP +SD+DIDQNHYKMKP GPFRT GHRGRQILDDEGPIFCHIPSRRK
Sbjct: 961  QIPPYDARRRKYMPAVSDEDIDQNHYKMKPGGPFRTGGHRGRQILDDEGPIFCHIPSRRK 1020

Query: 1021 SPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTLEDEAMDPIYA 1080
            SPGRRDGPP+RGGVKMVH MHRN+SP RCIREP SEL+GPRHGEKFMRTL+DE MDP+Y 
Sbjct: 1021 SPGRRDGPPLRGGVKMVHGMHRNVSPSRCIREPGSELIGPRHGEKFMRTLDDETMDPMY- 1080

Query: 1081 HPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFPHP 1140
            HPQPPFEVDRPP+IPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFF HP
Sbjct: 1081 HPQPPFEVDRPPYIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFGHP 1140

Query: 1141 EMARRSPPGYRMRSPDQPP-PGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200
            EMARRSPPGYRMRSPDQPP  GDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR
Sbjct: 1141 EMARRSPPGYRMRSPDQPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200

Query: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYFDGPVHPGQLNELVGDGNDDDRRRFPDRHEHLH 1260
            NRTDRISFRNRRFEDMDPRDRIESNEY+DGP+HPGQ NELV DGNDDDRRRFPDRHEHLH
Sbjct: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYYDGPIHPGQFNELVVDGNDDDRRRFPDRHEHLH 1260

Query: 1261 QFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFDRRLKNQPGNLGRR 1320
             FRPQCNDSDGENYH DADER RPFRYCAEDEAEFHER KMREREFDRRLKNQ  NLGRR
Sbjct: 1261 PFRPQCNDSDGENYHNDADERPRPFRYCAEDEAEFHERSKMREREFDRRLKNQSENLGRR 1320

Query: 1321 TGGVIEEHEVEEYRHGRQMWNEHHGFEDISRMKRKRF 1352
            T GVIEEHE +EYRHGRQ+WNEHHGFE+ISRMKRKRF
Sbjct: 1321 T-GVIEEHE-QEYRHGRQLWNEHHGFEEISRMKRKRF 1329

BLAST of HG10022259 vs. NCBI nr
Match: XP_022992789.1 (uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790.1 uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima])

HSP 1 Score: 2046.6 bits (5301), Expect = 0.0e+00
Identity = 1085/1377 (78.79%), Positives = 1165/1377 (84.60%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQ  IGPSRVE Q+NDA S GCVENK TCMVNEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDE+DRK+LDR   STSL K E ELSVG KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVPDSVLEG+        NLEPVLLNLSLSK+GS  QCLT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTLSGKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSF KL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
             SKSDEVNVLGLS+SA+VKREFLQ+PNASD+Y  MN VKAKSV SES Y+SKQEALKTLG
Sbjct: 361  MSKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421  GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGDMSNHPELQTPTKEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G   CGGELV SEMTDISKD GSKD + PIIKPI MP         +NPS TN
Sbjct: 481  LNLKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMP---------RNPSPTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL TPT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMD VDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661  GKLDVNEAMDTVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        KISALEDQ+   +KAT  IEESI TV QSDAE VKTVD V+ND+  LP
Sbjct: 781  ------------KISALEDQETSPEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDR+ IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
            AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901  AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960

Query: 961  ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
            IS G Y+DQ+  PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961  ISPGNYSDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020

Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
            +FCH+ SRRKSPGRRDGPP   GVKM HRM RNISP RC RE  SELVGPRHGEKFMRT 
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMAHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080

Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
            EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPG SRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSK 1140

Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
            RKSERFF HPEMARRS PPGYRMRSPDQPP   GDMPVRRHGFPFPSLPPN+LRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSAR 1200

Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
            DHGHMR  +RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGNDD
Sbjct: 1201 DHGHMRPSLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDD 1260

Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
            DRRRF +RHEHLHQFRPQCNDSD ENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFANRHEHLHQFRPQCNDSDSENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320

Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            RR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HHGFEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1349

BLAST of HG10022259 vs. NCBI nr
Match: XP_023550091.1 (uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 2037.7 bits (5278), Expect = 0.0e+00
Identity = 1080/1377 (78.43%), Positives = 1164/1377 (84.53%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFP +QS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQS IGPSRVE Q+ND  S GCVENK TCMVNEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDEIDRK+LDR   STS+ K E ELS+G KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVPDSVLEG+        NLEP LLNLSLSK+GS  Q LT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNT+VT HR STEMV TDTLSGKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
            ESKSDEVNVLGLS+SA+VKREFLQ+PNASDIY  MN VKAKSV SES Y+SKQ AL+TLG
Sbjct: 361  ESKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD  MSN+SELQTPT+EH
Sbjct: 421  GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGGMSNHSELQTPTEEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G   CGGELV SEMTDISKD GSKDF+SPIIKPI MP         +NPS TN
Sbjct: 481  LNLKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL  PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMDAVDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661  GKLDVNEAMDAVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        K SALEDQ+   +KA+  IEESI TV QSDAE VKTVD V+ND+  LP
Sbjct: 781  ------------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ S+SS P KT+ SL RSVLTQTDRE IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
             HEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901  GHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960

Query: 961  ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
            IS G YNDQ+  PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961  ISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020

Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
            +FCH+ SRRKSPGRRDGPP   GVKMVHRM RNISP RC RE  SELVGPRHGEKFMRT 
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080

Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
            EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSK 1140

Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
            RKSERFF HPEMARRS PPGYRMRSPDQPP   GDMP RRHGFPFPSLPPNDLRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSAR 1200

Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
            DHGHMR G+RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQ+NEL+ DGNDD
Sbjct: 1201 DHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDD 1260

Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
            DRRRF DRHEHLHQFRPQCNDSDGENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320

Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            RR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HH FEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHSFEDISRMKRKR 1349

BLAST of HG10022259 vs. NCBI nr
Match: XP_022938519.1 (uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_022938520.1 uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata])

HSP 1 Score: 2016.1 bits (5222), Expect = 0.0e+00
Identity = 1074/1379 (77.88%), Positives = 1158/1379 (83.97%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DD+I KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQS IGPSRVE Q+NDA S GCVENK TCM+NEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDEIDRK+LDR   STS+ K E ELSVG KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVP+SVLEG+        NLEPVLLNLSLSK+GS  Q LT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTL GKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
            ESKSD VNVLGLS+SA+VKREFLQ+PN SDIY  MN VKA+SV SE  Y+SKQEALKTLG
Sbjct: 361  ESKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEV +SCP PMPFVAE ++A  N SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421  GRLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARN-SCSTDLITDGDMSNHPELQTPTKEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G     GEL+ SEMTD+SKD GSKDF+SPIIKPI MP         +NPS TN
Sbjct: 481  LNLNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL  PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMD VDND+EEDYEDGEVREP LTTQVESSICET++VK FDH D SNGLPGSD
Sbjct: 661  GKLDVNEAMDTVDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        K SA+EDQ+   +KAT  IEESI TV QSDAE VKTVD V+N++  LP
Sbjct: 781  ------------KTSAIEDQETSPEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDRE IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
            AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901  AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960

Query: 961  ISLGVYNDQ---IQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDE 1020
            IS G Y+D      PYDARRRKYMP +SDDDIDQNHYKMKP  PFR+AG HRGRQILDDE
Sbjct: 961  ISPGNYSDHQVPPPPYDARRRKYMPAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDE 1020

Query: 1021 GPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMR 1080
            GP+FCH+ SRRKSPGRRDGPP   GVKMVHRM RNISP RC RE  SELVGPRHGEKFMR
Sbjct: 1021 GPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMR 1080

Query: 1081 TLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFP 1140
            T EDEAMDP+YAHPQP FEVDR PFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFP
Sbjct: 1081 TFEDEAMDPLYAHPQPSFEVDRSPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFP 1140

Query: 1141 SKRKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGS 1200
            SKRKSERFF HPEMARRS PPGYRMRSPDQPP   GDMPVRRHGFPFPSLPPNDLRDMGS
Sbjct: 1141 SKRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGS 1200

Query: 1201 ARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGN 1260
            ARDHGHMR GIRSRNRT+R+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGN
Sbjct: 1201 ARDHGHMRPGIRSRNRTERMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGN 1260

Query: 1261 DDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMRERE 1320
            DDDRRRF DRHEHLHQFRPQCNDSDGENY  DADERARP+RYC EDE EFHERGKMRERE
Sbjct: 1261 DDDRRRFSDRHEHLHQFRPQCNDSDGENYRNDADERARPYRYCTEDEEEFHERGKMRERE 1320

Query: 1321 FDRRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            FDRR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HHGFEDISRMKRKR
Sbjct: 1321 FDRRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1351

BLAST of HG10022259 vs. ExPASy TrEMBL
Match: A0A6J1JYG4 (uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)

HSP 1 Score: 2046.6 bits (5301), Expect = 0.0e+00
Identity = 1085/1377 (78.79%), Positives = 1165/1377 (84.60%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQ  IGPSRVE Q+NDA S GCVENK TCMVNEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDE+DRK+LDR   STSL K E ELSVG KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVPDSVLEG+        NLEPVLLNLSLSK+GS  QCLT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTLSGKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSF KL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
             SKSDEVNVLGLS+SA+VKREFLQ+PNASD+Y  MN VKAKSV SES Y+SKQEALKTLG
Sbjct: 361  MSKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421  GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGDMSNHPELQTPTKEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G   CGGELV SEMTDISKD GSKD + PIIKPI MP         +NPS TN
Sbjct: 481  LNLKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMP---------RNPSPTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL TPT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMD VDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661  GKLDVNEAMDTVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        KISALEDQ+   +KAT  IEESI TV QSDAE VKTVD V+ND+  LP
Sbjct: 781  ------------KISALEDQETSPEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDR+ IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
            AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901  AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960

Query: 961  ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
            IS G Y+DQ+  PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961  ISPGNYSDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020

Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
            +FCH+ SRRKSPGRRDGPP   GVKM HRM RNISP RC RE  SELVGPRHGEKFMRT 
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMAHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080

Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
            EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPG SRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSK 1140

Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
            RKSERFF HPEMARRS PPGYRMRSPDQPP   GDMPVRRHGFPFPSLPPN+LRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSAR 1200

Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
            DHGHMR  +RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGNDD
Sbjct: 1201 DHGHMRPSLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDD 1260

Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
            DRRRF +RHEHLHQFRPQCNDSD ENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFANRHEHLHQFRPQCNDSDSENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320

Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            RR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HHGFEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1349

BLAST of HG10022259 vs. ExPASy TrEMBL
Match: A0A6J1FEB1 (uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)

HSP 1 Score: 2016.1 bits (5222), Expect = 0.0e+00
Identity = 1074/1379 (77.88%), Positives = 1158/1379 (83.97%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DD+I KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQS IGPSRVE Q+NDA S GCVENK TCM+NEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDEIDRK+LDR   STS+ K E ELSVG KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVP+SVLEG+        NLEPVLLNLSLSK+GS  Q LT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTL GKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
            ESKSD VNVLGLS+SA+VKREFLQ+PN SDIY  MN VKA+SV SE  Y+SKQEALKTLG
Sbjct: 361  ESKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEV +SCP PMPFVAE ++A  N SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421  GRLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARN-SCSTDLITDGDMSNHPELQTPTKEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G     GEL+ SEMTD+SKD GSKDF+SPIIKPI MP         +NPS TN
Sbjct: 481  LNLNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL  PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMD VDND+EEDYEDGEVREP LTTQVESSICET++VK FDH D SNGLPGSD
Sbjct: 661  GKLDVNEAMDTVDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        K SA+EDQ+   +KAT  IEESI TV QSDAE VKTVD V+N++  LP
Sbjct: 781  ------------KTSAIEDQETSPEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDRE IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
            AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901  AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960

Query: 961  ISLGVYNDQ---IQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDE 1020
            IS G Y+D      PYDARRRKYMP +SDDDIDQNHYKMKP  PFR+AG HRGRQILDDE
Sbjct: 961  ISPGNYSDHQVPPPPYDARRRKYMPAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDE 1020

Query: 1021 GPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMR 1080
            GP+FCH+ SRRKSPGRRDGPP   GVKMVHRM RNISP RC RE  SELVGPRHGEKFMR
Sbjct: 1021 GPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMR 1080

Query: 1081 TLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFP 1140
            T EDEAMDP+YAHPQP FEVDR PFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFP
Sbjct: 1081 TFEDEAMDPLYAHPQPSFEVDRSPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFP 1140

Query: 1141 SKRKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGS 1200
            SKRKSERFF HPEMARRS PPGYRMRSPDQPP   GDMPVRRHGFPFPSLPPNDLRDMGS
Sbjct: 1141 SKRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGS 1200

Query: 1201 ARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGN 1260
            ARDHGHMR GIRSRNRT+R+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGN
Sbjct: 1201 ARDHGHMRPGIRSRNRTERMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGN 1260

Query: 1261 DDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMRERE 1320
            DDDRRRF DRHEHLHQFRPQCNDSDGENY  DADERARP+RYC EDE EFHERGKMRERE
Sbjct: 1261 DDDRRRFSDRHEHLHQFRPQCNDSDGENYRNDADERARPYRYCTEDEEEFHERGKMRERE 1320

Query: 1321 FDRRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            FDRR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HHGFEDISRMKRKR
Sbjct: 1321 FDRRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1351

BLAST of HG10022259 vs. ExPASy TrEMBL
Match: A0A6J1JUI7 (uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)

HSP 1 Score: 1996.5 bits (5171), Expect = 0.0e+00
Identity = 1066/1377 (77.41%), Positives = 1145/1377 (83.15%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQ  IGPSRVE Q+NDA S GCVENK TCMVNEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDE+DRK+LDR   STSL K E ELSVG KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVPDSVLEG+        NLEPVLLNLSLSK+GS  QCLT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTLSGKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSF KL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
             SKSDEVNVLGLS+SA+VKREFLQ+PNASD+Y  MN VKAKSV SES Y+SKQEALKTLG
Sbjct: 361  MSKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421  GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGDMSNHPELQTPTKEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G   CGGELV SEMTDISKD GSKD + PIIKPI MP         +NPS TN
Sbjct: 481  LNLKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMP---------RNPSPTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL TPT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMD VDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661  GKLDVNEAMDTVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        KISALEDQ+   +KAT  IEESI TV QSDAE VKTVD V+ND+  LP
Sbjct: 781  ------------KISALEDQETSPEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDR+ IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
            AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRG                  
Sbjct: 901  AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGN----------------- 960

Query: 961  ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
                 Y+DQ+  PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961  -----YSDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020

Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
            +FCH+ SRRKSPGRRDGPP   GVKM HRM RNISP RC RE  SELVGPRHGEKFMRT 
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMAHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080

Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
            EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPG SRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSK 1140

Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
            RKSERFF HPEMARRS PPGYRMRSPDQPP   GDMPVRRHGFPFPSLPPN+LRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSAR 1200

Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
            DHGHMR  +RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGNDD
Sbjct: 1201 DHGHMRPSLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDD 1260

Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
            DRRRF +RHEHLHQFRPQCNDSD ENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFANRHEHLHQFRPQCNDSDSENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320

Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            RR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HHGFEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1327

BLAST of HG10022259 vs. ExPASy TrEMBL
Match: A0A6J1FDD8 (uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)

HSP 1 Score: 1967.6 bits (5096), Expect = 0.0e+00
Identity = 1054/1378 (76.49%), Positives = 1140/1378 (82.73%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
            MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DD+I KV+EPCVSDG TVSNSSTITT
Sbjct: 1    MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60

Query: 61   SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
            SEFSE KKI FSEDG  KSDLCN+NMVQS IGPSRVE Q+NDA S GCVENK TCM+NEN
Sbjct: 61   SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNEN 120

Query: 121  HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
            HAL LHEKPE KL  SDANSNPGLCAEK+SDEIDRK+LDR   STS+ K E ELSVG KE
Sbjct: 121  HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKE 180

Query: 181  HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
            HLVP+SVLEG+        NLEPVLLNLSLSK+GS  Q LT NVGS  DGS+Q+SNR NW
Sbjct: 181  HLVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENW 240

Query: 241  DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
            DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTL GKLTPL  SDHLHL
Sbjct: 241  DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHL 300

Query: 301  SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
            SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR  Q DDL+GALK VKPEPFVE S+L
Sbjct: 301  SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360

Query: 361  ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
            ESKSD VNVLGLS+SA+VKREFLQ+PN SDIY  MN VKA+SV SE  Y+SKQEALKTLG
Sbjct: 361  ESKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLG 420

Query: 421  GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
            GRLDLV KQV PEV +SCP PMPFVAE ++A  N SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421  GRLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARN-SCSTDLITDGDMSNHPELQTPTKEH 480

Query: 481  LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
            L+  V +G     GEL+ SEMTD+SKD GSKDF+SPIIKPI MP         +NPS TN
Sbjct: 481  LNLNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540

Query: 541  DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
            D II+ ++ + SEL  PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541  DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600

Query: 601  PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
            P I+ D+N+NNP WR   H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601  PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660

Query: 661  GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
            GK DVNEAMD VDND+EEDYEDGEVREP LTTQVESSICET++VK FDH D SNGLPGSD
Sbjct: 661  GKLDVNEAMDTVDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSD 720

Query: 721  -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
             CSSLV VKQE K EILDVKRED L+SV SNQSSEQE  KEL VE++TT+VCLNKANKA 
Sbjct: 721  CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780

Query: 781  KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
                        K SA+EDQ+   +KAT  IEESI TV QSDAE VKTVD V+N++  LP
Sbjct: 781  ------------KTSAIEDQETSPEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALP 840

Query: 841  NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
            N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDRE IPDM
Sbjct: 841  NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDM 900

Query: 901  AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
            AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRG ++               
Sbjct: 901  AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGNYS--------------- 960

Query: 961  ISLGVYNDQI--QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEG 1020
                  + Q+   PYDARRRKYMP +SDDDIDQNHYKMKP  PFR+AG HRGRQILDDEG
Sbjct: 961  ------DHQVPPPPYDARRRKYMPAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEG 1020

Query: 1021 PIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRT 1080
            P+FCH+ SRRKSPGRRDGPP   GVKMVHRM RNISP RC RE  SELVGPRHGEKFMRT
Sbjct: 1021 PLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRT 1080

Query: 1081 LEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPS 1140
             EDEAMDP+YAHPQP FEVDR PFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFPS
Sbjct: 1081 FEDEAMDPLYAHPQPSFEVDRSPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPS 1140

Query: 1141 KRKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSA 1200
            KRKSERFF HPEMARRS PPGYRMRSPDQPP   GDMPVRRHGFPFPSLPPNDLRDMGSA
Sbjct: 1141 KRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSA 1200

Query: 1201 RDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGND 1260
            RDHGHMR GIRSRNRT+R+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGND
Sbjct: 1201 RDHGHMRPGIRSRNRTERMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGND 1260

Query: 1261 DDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREF 1320
            DDRRRF DRHEHLHQFRPQCNDSDGENY  DADERARP+RYC EDE EFHERGKMREREF
Sbjct: 1261 DDRRRFSDRHEHLHQFRPQCNDSDGENYRNDADERARPYRYCTEDEEEFHERGKMREREF 1320

Query: 1321 DRRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
            DRR+KNQP NLGRRT  VIEEHEVEEYR  HGRQMWNE      HHGFEDISRMKRKR
Sbjct: 1321 DRRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1329

BLAST of HG10022259 vs. ExPASy TrEMBL
Match: A0A6J1BWB0 (uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006113 PE=4 SV=1)

HSP 1 Score: 1602.0 bits (4147), Expect = 0.0e+00
Identity = 911/1387 (65.68%), Positives = 1016/1387 (73.25%), Query Frame = 0

Query: 1    MSTSDCNAIVPIKKRRFPLIQSSPP---KEISSLPSLDDNIVKVEEPCVSDGTTVSNSST 60
            MS SD N IVPIKKRRF ++QSSP    KE+SSL SLDDN+VKV EP +SDG TVS+S T
Sbjct: 16   MSASDYNVIVPIKKRRFTIVQSSPSSPHKELSSL-SLDDNLVKVAEPGISDGITVSSSVT 75

Query: 61   ITTSEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMV 120
            ITTSE SEKK+I FSE+   K DLCN N VQSNI PS V  Q++DA     VENK   + 
Sbjct: 76   ITTSELSEKKEISFSEESERKVDLCNSNRVQSNIEPSGVRFQEDDACFNHQVENKAMNVE 135

Query: 121  NENHALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVG 180
            NE HAL L EKPELKL  SD NS  GLCA KK   IDRKEL++C S TSLVK E ELSVG
Sbjct: 136  NEKHALHLLEKPELKLPTSDPNSKLGLCANKKRVGIDRKELEKCKSLTSLVKTEAELSVG 195

Query: 181  LKEHLVPDSVLEG--------NNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNR 240
            L E LVPD V++G        NNLEPV LNLSLSKQGS+TQCLT NVGS  DGSLQQSNR
Sbjct: 196  LNERLVPDLVVKGSDRKWQKQNNLEPVSLNLSLSKQGSYTQCLTSNVGSDYDGSLQQSNR 255

Query: 241  GNWDLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDH 300
            GNWDLNTSME WEGCA+DDP V VPVVQTNTIVTTHRCSTEMV+ D  SGK TPL  SD+
Sbjct: 256  GNWDLNTSMESWEGCASDDPSVQVPVVQTNTIVTTHRCSTEMVRADISSGKPTPLDQSDY 315

Query: 301  LHLSLCSSDHRHVMTQEQSSFVKLDFRKS-SPLSSEGRSKQSDDLDGALKDVKPEPFVEG 360
            LHLSL SSD R V  QEQ S VKLDFR + S LSS G + Q DDL+ ALK VK EPFV+G
Sbjct: 316  LHLSLNSSDLRPVTKQEQISSVKLDFRSTDSSLSSPG-NMQFDDLNVALKVVKAEPFVKG 375

Query: 361  SRLESKSDEVNVLGLSNSAVVKREF-----LQLPNASDIYRSMNIVKAKSVKSESIYQSK 420
            S LESKS+EV  LGLS  A++  E      L+LP AS+I   MNIVKAKS KSE +Y+SK
Sbjct: 376  SELESKSNEVKGLGLSGDALMNGELDDQCNLELPKASNICSPMNIVKAKSFKSEPVYESK 435

Query: 421  QEALKTLGGRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSE 480
            +EAL+ LGGRL+L+ KQV P+VDNSCP+ +P VAE S+A  NPSCST L TD DM N+SE
Sbjct: 436  KEALEMLGGRLNLISKQVLPDVDNSCPIAVPVVAEMSEAARNPSCSTYLATDGDMLNHSE 495

Query: 481  LQTPTKEHLSTIVQQGGCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAK 540
            L TPTK                                                      
Sbjct: 496  LPTPTK------------------------------------------------------ 555

Query: 541  NPSCTNDMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDS 600
                                     G LN         C GGLVNSE TD++KD    DS
Sbjct: 556  -------------------------GNLNE--------CGGGLVNSEKTDITKDPGLGDS 615

Query: 601  SSSVMKPFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYS 660
            S S+ KPF   D+N+NNP W  L+  NEQCS L GGEE SVSDEEKIS+SAD+LEE PYS
Sbjct: 616  SISIAKPFNAEDENQNNPKWCLLKLSNEQCSGLQGGEESSVSDEEKISLSADILEEYPYS 675

Query: 661  SEYESDGKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSN 720
            SEYESDGKQDV+ AM  V NDIEEDYEDGEVREP+L TQVESS+C  REV+ FDHGD S 
Sbjct: 676  SEYESDGKQDVDGAMAEVHNDIEEDYEDGEVREPLLKTQVESSVCVKREVENFDHGDFSK 735

Query: 721  -------GLPGSDCSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQE-----HLKELL 780
                   GLPG+D S+L+ VKQE K E  DV++ED  +SV +NQSSEQE     +LKE+L
Sbjct: 736  DKKINSVGLPGTDFSTLISVKQENKLESHDVRQEDKFHSVTTNQSSEQEKDEASYLKEIL 795

Query: 781  VEDNTTKVCLNKANKAIKATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDA 840
            VE+N        +NK IKATG R+LFHCE+  ALEDQ   SDKAT  IEE IVTV Q DA
Sbjct: 796  VEENA-------SNKVIKATGRRQLFHCEERDALEDQN-SSDKATDGIEEPIVTVSQGDA 855

Query: 841  ENVKTVDFVQNDDLTLPNIREPLNNDDDVTDDFTHGNRHAQIVNPCQAST-SSPIKTRPS 900
            ENVKTVDFV+N+D  LPN++EP+NN DD TDDF HG+RH   +NPC  ST SSP KTR +
Sbjct: 856  ENVKTVDFVRNNDPVLPNVKEPVNN-DDATDDFIHGSRH---INPCHGSTSSSPSKTRSN 915

Query: 901  LVRSVLTQTDRELIPDMAHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGR 960
             +RSVLT+TDRE I D+A EG KLQPQGRDD Y  V  K YVNRHQNLSP+TNF  RR R
Sbjct: 916  SLRSVLTRTDREQILDVALEGGKLQPQGRDDRYSGVSQKIYVNRHQNLSPQTNF-HRRER 975

Query: 961  FTIRINNVQGEWDFNRTISLGVYNDQIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFR 1020
            FTIR +++QGEWDFN T+S G+Y+DQI PYDA RRKY+  +SDDDIDQNHYK+KP+GPFR
Sbjct: 976  FTIRTDSLQGEWDFNPTVSPGIYSDQI-PYDAPRRKYLSAVSDDDIDQNHYKIKPNGPFR 1035

Query: 1021 TAGHRGRQILDDEGPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPES 1080
            +AG +GRQILDDEGP +CHIPSRRKSPG RDGPPVR GVKMVHRM RNISP  CIRE  S
Sbjct: 1036 SAGRQGRQILDDEGPPYCHIPSRRKSPGIRDGPPVR-GVKMVHRMPRNISPSGCIREAGS 1095

Query: 1081 ELVGPRHGEKFMRTLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSP 1140
            ELVGPRHGEKFMRT EDE MDPIYAHPQPP+EVDRPPFI +RRNF IQRK+FPR+DSKSP
Sbjct: 1096 ELVGPRHGEKFMRTFEDETMDPIYAHPQPPYEVDRPPFIRERRNFTIQRKTFPRIDSKSP 1155

Query: 1141 GRSRGRSPGQWFPSKRKSERFFPHPEMARRSPPGY---RMRSPDQPPP--GDMPVRRHGF 1200
            GRSRGRSPGQW P KRKS RF  H  M RRS PGY   RMRSPDQPPP  GDMPVRRHGF
Sbjct: 1156 GRSRGRSPGQWVPGKRKSYRFCGHLGMTRRSSPGYRGDRMRSPDQPPPIHGDMPVRRHGF 1215

Query: 1201 PFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRDRIESNEYFDGP 1260
            PF  LP +DLRDM SA D GHMRS IR RNR+DR+SFRNRRFE MDPRDRIES+EYFDG 
Sbjct: 1216 PFSPLPSSDLRDMRSAPDQGHMRSDIRCRNRSDRLSFRNRRFEIMDPRDRIESSEYFDG- 1275

Query: 1261 VHPGQLNELVGDGNDDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAED 1320
              P QLNEL GDGNDDDRRRF DRHEHLH FRPQ NDSDGENYH +A++  RPFR+CAED
Sbjct: 1276 --PSQLNELSGDGNDDDRRRFSDRHEHLHSFRPQYNDSDGENYHNNAEDSRRPFRFCAED 1294

Query: 1321 -EAEFHERGKMREREFDRRLKNQPGNLGRRTGGVIEEHEVEEYRHGRQMWNEHHGFEDIS 1352
               EFHERG MREREF+RR+KNQPGNL RRTG VIEEHEVE+YRHGRQMWN+ HGFEDIS
Sbjct: 1336 GPPEFHERGNMREREFNRRVKNQPGNLTRRTGVVIEEHEVEDYRHGRQMWND-HGFEDIS 1294
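
The percentages in an HSP header are the match counts divided by the alignment length. The following is a minimal sketch of that arithmetic, not part of the database page; the function name `hsp_percentages` and the regular expression are assumptions made for illustration.

```python
import re


def hsp_percentages(header: str) -> dict[str, float]:
    """Recompute percentages from the 'label = matches/length' pairs in an HSP header."""
    stats = {}
    # Each statistic looks like 'Identity = 911/1387'; fields without a slash are skipped.
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", header):
        stats[label] = round(100 * int(matches) / int(length), 2)
    return stats


# Header values copied from the Momordica charantia (A0A6J1BWB0) hit above.
header = "Identity = 911/1387 (65.68%), Positives = 1016/1387 (73.25%), Query Frame = 0"
print(hsp_percentages(header))
```

The recomputed values should agree with the percentages printed in parentheses in the header, which gives a quick consistency check when such records are parsed in bulk.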

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
XP_038890337.1	0.0e+00	86.29	uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_03889033...
XP_038890343.1	0.0e+00	84.89	uncharacterized protein LOC120079942 isoform X2 [Benincasa hispida]
XP_022992789.1	0.0e+00	78.79	uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790...
XP_023550091.1	0.0e+00	78.43	uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022938519.1	0.0e+00	77.88	uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_0229385...

Match Name	E-value	Identity	Description
A0A6J1JYG4	0.0e+00	78.79	uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FEB1	0.0e+00	77.88	uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JUI7	0.0e+00	77.41	uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FDD8	0.0e+00	76.49	uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1BWB0	0.0e+00	65.68	uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 G...
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1077..1163
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 319..341
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 327..341
None	No IPR available	MOBIDB_LITE	mobidb-lite	disorder_prediction	coord: 1147..1161
None	No IPR available	PANTHER	PTHR34536:SF4	BNAC09G43500D PROTEIN	coord: 2..1351
None	No IPR available	PANTHER	PTHR34536	DENTIN SIALOPHOSPHOPROTEIN-LIKE PROTEIN	coord: 2..1351

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
HG10022259.1	HG10022259.1	mRNA