Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTCTACTAGTGATTGTAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTTAATCCAATCTTCTCCACCCAAAGAAATATCTTCTCTTCCATCACTAGATGATAACATAGTAAAGGTAGAAGAGCCTTGCGTATCTGATGGTACAACAGTTTCAAATTCTAGTACAATAACAACTTCTGAATTTTCAGAAAAGAAGAAGATTTTATTTTCCGAAGATGGTAATTGGAAATCTGATTTATGCAATGTCAATATGGTCCAAAGCAATATTGGACCTTCCAGAGTCGAGGTTCAGGATAATGATGCTCGTTCTATAGGCTGCGTGGAAAATAAGGGAACATGTATGGTGAATGAAAATCATGCGCTTGCTCTGCATGAGAAGCCTGAGTTAAAGTTACAACCTTCTGATGCGAACTCTAACCCTGGACTTTGTGCTGAAAAGAAAAGTGATGAGATTGACAGAAAAGAACTTGATAGATGTAATTCTTCAACTTCTTTAGTTAAAAATGAAGTTGAATTATCGGTTGGTTTGAAGGAACATCTTGTTCCTGACTCGGTTCTAGAAGGGAATAATTTAGAACCAGTGTTATTGAACTTAAGTTTAAGCAAGCAAGGAAGTCACACCCAGTGTCTCACTGGTAATGTTGGGTCTGTTTGTGATGGTTCTCTTCAGCAGTCAAATAGGGGAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGGCTGTGCAACTGATGATCCTCCAGTGCATGTTCCAGTTGTTCAGACAAACACAATTGTCACCACACATAGATGCTCAACGGAAATGGTTAAAACTGATACTCTGTCTGGAAAACTAACCCCTTTAGGTCACAGTGATCATCTTCATCTAAGTCTTTGTTCATCTGATCATAGGCATGTAATGACTCAGGAACAAAGTTCATTTGTTAAGTTAGATTTTAGGAAATCAAGTCCTTTAAGCTCAGAAGGAAGAAGTAAGCAATCTGATGATCTTGATGGCGCACTAAAAGATGTAAAGCCAGAACCATTTGTTGAGGGTTCCAGACTTGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAAACAGTGCTGTAGTGAAGCGTGAATTTCTTCAACTTCCCAATGCTTCAGATATTTACAGATCAATGAACATAGTTAAGGCTAAATCTGTTAAATCTGAATCAATTTATCAAAGTAAACAGGAAGCACTCAAAACATTAGGTGGTAGATTAGATCTGGTAGAAAAGCAAGTTCCTCCAGAGGTTGATAATTCTTGTCCTGTACCAATGCCTTTTGTGGCAGAGAGGTCAGACGCAATTGGAAATCCTTCTTGTTCAACTGATTTGGTTACAGACAAAGACATGTCAAACTATTCAGAATTGCAAACCCCTACTAAAGAACATCTTAGTACGATAGTGCAACAAGGAGGATGTGGTGGTGAACTTGTTAAGTCAGAGATGACCGATATAAGTAAGGATACAGGTTCCAAAGATTTCAGTAGTCCTATTATAAAACCTATAGTAATGCCTGTTGTGGCTGAGATGTTGAAAGCAGCTAAAAATCCTTCTTGTACAAATGATATGATTATAGACAGAGACGTGCCAAACCATTCAGAATTGGAAACTCCAACTATAGGACCTCTTAATAGGAAAGTGCACCAAGAGGGATATGGCTGTGATGGTGGACTTGTGAATTCAGAAATGACGGATTTAAGTAAGGATATATGTTCCAAAGATTCCAGTAGCTCTGTTATGAAACCATTCATTATTGGGGATCAAAATGAGAATAATCCTCCATGGCGTCGTTTGGAACACATGAATGAGCAGTGCTCTAGTTTGCATGGAGGTGAGGAATGTTCTGTTAGTGATGAGGAAAAGATCAGCATATCAGCCGATTTATTAGAAGAAGATCCTTATAGTTCTGAATATGAATCAGATGGTAAGCAGGATGTAAATGAGGCCATGGATGCAGTTGATAATGATATAGAAGA
AGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGACGACTCAAGTAGAAAGCAGTATATGTGAGACAAGAGAAGTAAAAAAATTTGATCATGGTGATTGTAGCAATGGACTTCCTGGTTCTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAGTTAAATCAGAAATTCTTGATGTTAAACGAGAAGACATTCTTAATTCTGTTATTTCTAATCAATCTTCTGAGCAAGAACATTTGAAAGAGCTACTTGTTGAAGATAATACCACTAAGGTGTGTTTGAACAAGGCCAACAAGGCTATAAAAGCTACAGGTCCTAGGAAATTGTTTCATTGCGAGAAAATATCTGCCTTAGAGGACCAGAAAATTTTTTCTGATAAAGCCACTACTAGAATTGAAGAATCGATTGTGACAGTTCCTCAGAGTGATGCAGAGAATGTTAAAACAGTAGATTTTGTGCAAAACGACGATCTAACTTTGCCAAATATTAGAGAGCCTTTAAATAATGATGATGATGTTACTGATGATTTTACTCATGGCAATCGACATGCCCAGATTGTTAATCCCTGTCAAGCTTCTACTTCATCTCCTATTAAAACAAGACCAAGTTTAGTGAGGTCGGTTTTAACACAAACCGATAGAGAACTAATACCTGACATGGCGCATGAAGGGGAAAAATTACAACCTCAAGGAAGGTGATTGTTTGTCATGACTTCTTTTCTGTTTTTCTTCTTATATATATATATATTTGTCAACCAAGCCTGTTTCTTTACAGCATTAATTCGATTACTTTCTGCATTGCTGATATCTCTCCATTTTCTAGAGATGACTCATACAGGGACGTTTTCCCAAAATTTTATGTGAATAGACATCAGAATCTTTCACCCAGAACGAATTTTACTCGTAGAAGAGGTAGATTCACTATCCGGATTAACAATGTCCAAGGTGAATGGGATTTTAATCGAACAATTTCTCTAGGAGTTTACAATGATCAAATACAACCCTATGATGCCCGTAGACGTAAATACATGCCTACCATTTCTGATGACGACATTGATCAAAATCATTATAAAATGAAACCTAGTGGTCCATTTCGTACCGCTGGTCACAGAGGTAGACAAATTTTAGACGATGAAGGCCCCATTTTTTGTCATATACCCTCTAGGAGGAAGTCACCTGGTAGAAGAGATGGGCCTCCAGTACGAGGTGGTGTTAAAATGGTACACAGAATGCATAGAAATATCAGTCCAGGTAGATGCATTCGTGAACCTGAGTCTGAATTGGTTGGACCGCGACACGGTGAAAAGTTTATGAGGACTTTAGAAGATGAGGCCATGGATCCAATATATGCACACCCTCAACCTCCATTTGAGGTAGATAGGCCTCCTTTTATCCCAGACCGAAGGAACTTTCCTATCCAAAGAAAAAGCTTTCCAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCATCCAAAAGAAAGTCAGAAAGGTTCTTTCCACATCCTGAAATGGCACGTCGAAGTCCACCAGGTTACAGGATGAGATCCCCTGATCAACCTCCTCCTGGAGATATGCCAGTTCGAAGACACGGTTTCCCTTTTCCGTCACTGCCACCCAACGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCACATGAGATCGGGTATACGAAGTAGGAACCGAACAGACAGAATATCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAACGAATACTTTGATGGGCCTGTACATCCTGGTCAATTGAATGAACTGGTTGGTGATGGTAATGATGACGACCGAAGAAGGTTTCCTGACAGACACGAACATCTTCACCAATTCCGGCCGCAATGTAATGATTCTGATGGTGAAAACTATCATATCGATGCAGACGAAAGGGCGAGACCTTTCAGATATTGTGCAGAGGATGAAGCAGAGTTTCATGA
AAGAGGTAAGATGAGGGAAAGGGAATTTGATAGACGTTTAAAGAACCAACCAGGAAATTTAGGTAGACGAACAGGAGGAGTTATTGAAGAACATGAAGTTGAAGAATACAGGCATGGTCGGCAGATGTGGAATGAACATCATGGCTTTGAAGATATCTCACGAATGAAAAGAAAAAGATTTTGA
mRNA sequence
ATGTCTACTAGTGATTGTAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTTAATCCAATCTTCTCCACCCAAAGAAATATCTTCTCTTCCATCACTAGATGATAACATAGTAAAGGTAGAAGAGCCTTGCGTATCTGATGGTACAACAGTTTCAAATTCTAGTACAATAACAACTTCTGAATTTTCAGAAAAGAAGAAGATTTTATTTTCCGAAGATGGTAATTGGAAATCTGATTTATGCAATGTCAATATGGTCCAAAGCAATATTGGACCTTCCAGAGTCGAGGTTCAGGATAATGATGCTCGTTCTATAGGCTGCGTGGAAAATAAGGGAACATGTATGGTGAATGAAAATCATGCGCTTGCTCTGCATGAGAAGCCTGAGTTAAAGTTACAACCTTCTGATGCGAACTCTAACCCTGGACTTTGTGCTGAAAAGAAAAGTGATGAGATTGACAGAAAAGAACTTGATAGATGTAATTCTTCAACTTCTTTAGTTAAAAATGAAGTTGAATTATCGGTTGGTTTGAAGGAACATCTTGTTCCTGACTCGGTTCTAGAAGGGAATAATTTAGAACCAGTGTTATTGAACTTAAGTTTAAGCAAGCAAGGAAGTCACACCCAGTGTCTCACTGGTAATGTTGGGTCTGTTTGTGATGGTTCTCTTCAGCAGTCAAATAGGGGAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGGCTGTGCAACTGATGATCCTCCAGTGCATGTTCCAGTTGTTCAGACAAACACAATTGTCACCACACATAGATGCTCAACGGAAATGGTTAAAACTGATACTCTGTCTGGAAAACTAACCCCTTTAGGTCACAGTGATCATCTTCATCTAAGTCTTTGTTCATCTGATCATAGGCATGTAATGACTCAGGAACAAAGTTCATTTGTTAAGTTAGATTTTAGGAAATCAAGTCCTTTAAGCTCAGAAGGAAGAAGTAAGCAATCTGATGATCTTGATGGCGCACTAAAAGATGTAAAGCCAGAACCATTTGTTGAGGGTTCCAGACTTGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAAACAGTGCTGTAGTGAAGCGTGAATTTCTTCAACTTCCCAATGCTTCAGATATTTACAGATCAATGAACATAGTTAAGGCTAAATCTGTTAAATCTGAATCAATTTATCAAAGTAAACAGGAAGCACTCAAAACATTAGGTGGTAGATTAGATCTGGTAGAAAAGCAAGTTCCTCCAGAGGTTGATAATTCTTGTCCTGTACCAATGCCTTTTGTGGCAGAGAGGTCAGACGCAATTGGAAATCCTTCTTGTTCAACTGATTTGGTTACAGACAAAGACATGTCAAACTATTCAGAATTGCAAACCCCTACTAAAGAACATCTTAGTACGATAGTGCAACAAGGAGGATGTGGTGGTGAACTTGTTAAGTCAGAGATGACCGATATAAGTAAGGATACAGGTTCCAAAGATTTCAGTAGTCCTATTATAAAACCTATAGTAATGCCTGTTGTGGCTGAGATGTTGAAAGCAGCTAAAAATCCTTCTTGTACAAATGATATGATTATAGACAGAGACGTGCCAAACCATTCAGAATTGGAAACTCCAACTATAGGACCTCTTAATAGGAAAGTGCACCAAGAGGGATATGGCTGTGATGGTGGACTTGTGAATTCAGAAATGACGGATTTAAGTAAGGATATATGTTCCAAAGATTCCAGTAGCTCTGTTATGAAACCATTCATTATTGGGGATCAAAATGAGAATAATCCTCCATGGCGTCGTTTGGAACACATGAATGAGCAGTGCTCTAGTTTGCATGGAGGTGAGGAATGTTCTGTTAGTGATGAGGAAAAGATCAGCATATCAGCCGATTTATTAGAAGAAGATCCTTATAGTTCTGAATATGAATCAGATGGTAAGCAGGATGTAAATGAGGCCATGGATGCAGTTGATAATGATATAGAAGA
AGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGACGACTCAAGTAGAAAGCAGTATATGTGAGACAAGAGAAGTAAAAAAATTTGATCATGGTGATTGTAGCAATGGACTTCCTGGTTCTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAGTTAAATCAGAAATTCTTGATGTTAAACGAGAAGACATTCTTAATTCTGTTATTTCTAATCAATCTTCTGAGCAAGAACATTTGAAAGAGCTACTTGTTGAAGATAATACCACTAAGGTGTGTTTGAACAAGGCCAACAAGGCTATAAAAGCTACAGGTCCTAGGAAATTGTTTCATTGCGAGAAAATATCTGCCTTAGAGGACCAGAAAATTTTTTCTGATAAAGCCACTACTAGAATTGAAGAATCGATTGTGACAGTTCCTCAGAGTGATGCAGAGAATGTTAAAACAGTAGATTTTGTGCAAAACGACGATCTAACTTTGCCAAATATTAGAGAGCCTTTAAATAATGATGATGATGTTACTGATGATTTTACTCATGGCAATCGACATGCCCAGATTGTTAATCCCTGTCAAGCTTCTACTTCATCTCCTATTAAAACAAGACCAAGTTTAGTGAGGTCGGTTTTAACACAAACCGATAGAGAACTAATACCTGACATGGCGCATGAAGGGGAAAAATTACAACCTCAAGGAAGAGATGACTCATACAGGGACGTTTTCCCAAAATTTTATGTGAATAGACATCAGAATCTTTCACCCAGAACGAATTTTACTCGTAGAAGAGGTAGATTCACTATCCGGATTAACAATGTCCAAGGTGAATGGGATTTTAATCGAACAATTTCTCTAGGAGTTTACAATGATCAAATACAACCCTATGATGCCCGTAGACGTAAATACATGCCTACCATTTCTGATGACGACATTGATCAAAATCATTATAAAATGAAACCTAGTGGTCCATTTCGTACCGCTGGTCACAGAGGTAGACAAATTTTAGACGATGAAGGCCCCATTTTTTGTCATATACCCTCTAGGAGGAAGTCACCTGGTAGAAGAGATGGGCCTCCAGTACGAGGTGGTGTTAAAATGGTACACAGAATGCATAGAAATATCAGTCCAGGTAGATGCATTCGTGAACCTGAGTCTGAATTGGTTGGACCGCGACACGGTGAAAAGTTTATGAGGACTTTAGAAGATGAGGCCATGGATCCAATATATGCACACCCTCAACCTCCATTTGAGGTAGATAGGCCTCCTTTTATCCCAGACCGAAGGAACTTTCCTATCCAAAGAAAAAGCTTTCCAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCATCCAAAAGAAAGTCAGAAAGGTTCTTTCCACATCCTGAAATGGCACGTCGAAGTCCACCAGGTTACAGGATGAGATCCCCTGATCAACCTCCTCCTGGAGATATGCCAGTTCGAAGACACGGTTTCCCTTTTCCGTCACTGCCACCCAACGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCACATGAGATCGGGTATACGAAGTAGGAACCGAACAGACAGAATATCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAACGAATACTTTGATGGGCCTGTACATCCTGGTCAATTGAATGAACTGGTTGGTGATGGTAATGATGACGACCGAAGAAGGTTTCCTGACAGACACGAACATCTTCACCAATTCCGGCCGCAATGTAATGATTCTGATGGTGAAAACTATCATATCGATGCAGACGAAAGGGCGAGACCTTTCAGATATTGTGCAGAGGATGAAGCAGAGTTTCATGAAAGAGGTAAGATGAGGGAAAGGGAATTTGATAGACGTTTAAAGAACCAACCAGGAAATTTAGGTAGACGAACAGGAGGAGTTATTGAAGAACATGAAGTTGAAGAATACAGGCATGGTCGGCAGATGT
GGAATGAACATCATGGCTTTGAAGATATCTCACGAATGAAAAGAAAAAGATTTTGA
Coding sequence (CDS)
ATGTCTACTAGTGATTGTAATGCAATTGTTCCTATCAAGAAGAGGAGGTTTCCTTTAATCCAATCTTCTCCACCCAAAGAAATATCTTCTCTTCCATCACTAGATGATAACATAGTAAAGGTAGAAGAGCCTTGCGTATCTGATGGTACAACAGTTTCAAATTCTAGTACAATAACAACTTCTGAATTTTCAGAAAAGAAGAAGATTTTATTTTCCGAAGATGGTAATTGGAAATCTGATTTATGCAATGTCAATATGGTCCAAAGCAATATTGGACCTTCCAGAGTCGAGGTTCAGGATAATGATGCTCGTTCTATAGGCTGCGTGGAAAATAAGGGAACATGTATGGTGAATGAAAATCATGCGCTTGCTCTGCATGAGAAGCCTGAGTTAAAGTTACAACCTTCTGATGCGAACTCTAACCCTGGACTTTGTGCTGAAAAGAAAAGTGATGAGATTGACAGAAAAGAACTTGATAGATGTAATTCTTCAACTTCTTTAGTTAAAAATGAAGTTGAATTATCGGTTGGTTTGAAGGAACATCTTGTTCCTGACTCGGTTCTAGAAGGGAATAATTTAGAACCAGTGTTATTGAACTTAAGTTTAAGCAAGCAAGGAAGTCACACCCAGTGTCTCACTGGTAATGTTGGGTCTGTTTGTGATGGTTCTCTTCAGCAGTCAAATAGGGGAAATTGGGATCTAAATACCTCAATGGAGTTTTGGGAAGGCTGTGCAACTGATGATCCTCCAGTGCATGTTCCAGTTGTTCAGACAAACACAATTGTCACCACACATAGATGCTCAACGGAAATGGTTAAAACTGATACTCTGTCTGGAAAACTAACCCCTTTAGGTCACAGTGATCATCTTCATCTAAGTCTTTGTTCATCTGATCATAGGCATGTAATGACTCAGGAACAAAGTTCATTTGTTAAGTTAGATTTTAGGAAATCAAGTCCTTTAAGCTCAGAAGGAAGAAGTAAGCAATCTGATGATCTTGATGGCGCACTAAAAGATGTAAAGCCAGAACCATTTGTTGAGGGTTCCAGACTTGAGTCTAAAAGTGATGAAGTTAATGTGCTGGGATTATCAAACAGTGCTGTAGTGAAGCGTGAATTTCTTCAACTTCCCAATGCTTCAGATATTTACAGATCAATGAACATAGTTAAGGCTAAATCTGTTAAATCTGAATCAATTTATCAAAGTAAACAGGAAGCACTCAAAACATTAGGTGGTAGATTAGATCTGGTAGAAAAGCAAGTTCCTCCAGAGGTTGATAATTCTTGTCCTGTACCAATGCCTTTTGTGGCAGAGAGGTCAGACGCAATTGGAAATCCTTCTTGTTCAACTGATTTGGTTACAGACAAAGACATGTCAAACTATTCAGAATTGCAAACCCCTACTAAAGAACATCTTAGTACGATAGTGCAACAAGGAGGATGTGGTGGTGAACTTGTTAAGTCAGAGATGACCGATATAAGTAAGGATACAGGTTCCAAAGATTTCAGTAGTCCTATTATAAAACCTATAGTAATGCCTGTTGTGGCTGAGATGTTGAAAGCAGCTAAAAATCCTTCTTGTACAAATGATATGATTATAGACAGAGACGTGCCAAACCATTCAGAATTGGAAACTCCAACTATAGGACCTCTTAATAGGAAAGTGCACCAAGAGGGATATGGCTGTGATGGTGGACTTGTGAATTCAGAAATGACGGATTTAAGTAAGGATATATGTTCCAAAGATTCCAGTAGCTCTGTTATGAAACCATTCATTATTGGGGATCAAAATGAGAATAATCCTCCATGGCGTCGTTTGGAACACATGAATGAGCAGTGCTCTAGTTTGCATGGAGGTGAGGAATGTTCTGTTAGTGATGAGGAAAAGATCAGCATATCAGCCGATTTATTAGAAGAAGATCCTTATAGTTCTGAATATGAATCAGATGGTAAGCAGGATGTAAATGAGGCCATGGATGCAGTTGATAATGATATAGAAGA
AGATTATGAAGATGGAGAGGTTCGGGAACCAATATTGACGACTCAAGTAGAAAGCAGTATATGTGAGACAAGAGAAGTAAAAAAATTTGATCATGGTGATTGTAGCAATGGACTTCCTGGTTCTGATTGTTCCTCTTTGGTTTTTGTTAAGCAGGAAGTTAAATCAGAAATTCTTGATGTTAAACGAGAAGACATTCTTAATTCTGTTATTTCTAATCAATCTTCTGAGCAAGAACATTTGAAAGAGCTACTTGTTGAAGATAATACCACTAAGGTGTGTTTGAACAAGGCCAACAAGGCTATAAAAGCTACAGGTCCTAGGAAATTGTTTCATTGCGAGAAAATATCTGCCTTAGAGGACCAGAAAATTTTTTCTGATAAAGCCACTACTAGAATTGAAGAATCGATTGTGACAGTTCCTCAGAGTGATGCAGAGAATGTTAAAACAGTAGATTTTGTGCAAAACGACGATCTAACTTTGCCAAATATTAGAGAGCCTTTAAATAATGATGATGATGTTACTGATGATTTTACTCATGGCAATCGACATGCCCAGATTGTTAATCCCTGTCAAGCTTCTACTTCATCTCCTATTAAAACAAGACCAAGTTTAGTGAGGTCGGTTTTAACACAAACCGATAGAGAACTAATACCTGACATGGCGCATGAAGGGGAAAAATTACAACCTCAAGGAAGAGATGACTCATACAGGGACGTTTTCCCAAAATTTTATGTGAATAGACATCAGAATCTTTCACCCAGAACGAATTTTACTCGTAGAAGAGGTAGATTCACTATCCGGATTAACAATGTCCAAGGTGAATGGGATTTTAATCGAACAATTTCTCTAGGAGTTTACAATGATCAAATACAACCCTATGATGCCCGTAGACGTAAATACATGCCTACCATTTCTGATGACGACATTGATCAAAATCATTATAAAATGAAACCTAGTGGTCCATTTCGTACCGCTGGTCACAGAGGTAGACAAATTTTAGACGATGAAGGCCCCATTTTTTGTCATATACCCTCTAGGAGGAAGTCACCTGGTAGAAGAGATGGGCCTCCAGTACGAGGTGGTGTTAAAATGGTACACAGAATGCATAGAAATATCAGTCCAGGTAGATGCATTCGTGAACCTGAGTCTGAATTGGTTGGACCGCGACACGGTGAAAAGTTTATGAGGACTTTAGAAGATGAGGCCATGGATCCAATATATGCACACCCTCAACCTCCATTTGAGGTAGATAGGCCTCCTTTTATCCCAGACCGAAGGAACTTTCCTATCCAAAGAAAAAGCTTTCCAAGAGTTGATTCTAAATCTCCAGGAAGGTCCAGAGGACGCTCTCCTGGCCAATGGTTTCCATCCAAAAGAAAGTCAGAAAGGTTCTTTCCACATCCTGAAATGGCACGTCGAAGTCCACCAGGTTACAGGATGAGATCCCCTGATCAACCTCCTCCTGGAGATATGCCAGTTCGAAGACACGGTTTCCCTTTTCCGTCACTGCCACCCAACGATTTGAGGGATATGGGTTCTGCTCGTGACCATGGCCACATGAGATCGGGTATACGAAGTAGGAACCGAACAGACAGAATATCTTTTAGAAACAGGAGGTTTGAAGATATGGATCCTCGAGATAGGATAGAGAGTAACGAATACTTTGATGGGCCTGTACATCCTGGTCAATTGAATGAACTGGTTGGTGATGGTAATGATGACGACCGAAGAAGGTTTCCTGACAGACACGAACATCTTCACCAATTCCGGCCGCAATGTAATGATTCTGATGGTGAAAACTATCATATCGATGCAGACGAAAGGGCGAGACCTTTCAGATATTGTGCAGAGGATGAAGCAGAGTTTCATGAAAGAGGTAAGATGAGGGAAAGGGAATTTGATAGACGTTTAAAGAACCAACCAGGAAATTTAGGTAGACGAACAGGAGGAGTTATTGAAGAACATGAAGTTGAAGAATACAGGCATGGTCGGCAGATGT
GGAATGAACATCATGGCTTTGAAGATATCTCACGAATGAAAAGAAAAAGATTTTGA
Protein sequence
MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITTSEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNENHALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKEHLVPDSVLEGNNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNWDLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHLSLCSSDHRHVMTQEQSSFVKLDFRKSSPLSSEGRSKQSDDLDGALKDVKPEPFVEGSRLESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLGGRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEHLSTIVQQGGCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTNDMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMKPFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESDGKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSDCSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAIKATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLPNIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSSPIKTRPSLVRSVLTQTDRELIPDMAHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRTISLGVYNDQIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAGHRGRQILDDEGPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFPHPEMARRSPPGYRMRSPDQPPPGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRDRIESNEYFDGPVHPGQLNELVGDGNDDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFDRRLKNQPGNLGRRTGGVIEEHEVEEYRHGRQMWNEHHGFEDISRMKRKRF
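The protein sequence above should correspond to a translation of the CDS. A minimal Python sketch of that step follows, assuming the standard nuclear genetic code and a reading frame that starts at the first base. The `translate` helper and the compact table layout are illustrative and are not part of the database's own tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Codons containing characters outside T/C/A/G are reported as 'X'.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed above.
print(translate("ATGTCTACTAGTGATTGT"))  # MSTSDC
```

Applied to the full CDS with its line breaks removed, the result can be compared against the listed protein sequence as a consistency check.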
Homology
BLAST of HG10022259 vs. NCBI nr
Match:
XP_038890337.1 (uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890338.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890339.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890340.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890341.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_038890342.1 uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida])
HSP 1 Score: 2296.2 bits (5949), Expect = 0.0e+00
Identity = 1171/1357 (86.29%), Positives = 1239/1357 (91.30%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MST+D IVPIKKRRFP IQSSPPKEISSLP +DDN+VKVEEPCVSD TVSNSSTITT
Sbjct: 1 MSTNDYTTIVPIKKRRFPSIQSSPPKEISSLPPVDDNMVKVEEPCVSDSPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSEKKKI FSEDGNWKSDLCNVNMVQS+IGPSRVE + ND G V NK TC+VNEN
Sbjct: 61 SEFSEKKKISFSEDGNWKSDLCNVNMVQSSIGPSRVEFKKNDDCFTGSVGNKETCLVNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
LAL EKPELKL SD +SNPG+CAEKKSDEI RKELD+CNSSTS+VK EVELS+ LKE
Sbjct: 121 RMLALQEKPELKLPSSDPDSNPGVCAEKKSDEIHRKELDKCNSSTSVVKKEVELSLSLKE 180
Query: 181 HLVPDSVLEGNNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNWDLNTSMEF 240
LVP SVLEGNNL PV+LNLSLSKQGSHTQCLTGNVGS DGSLQQSNR NWDLNTSMEF
Sbjct: 181 RLVPVSVLEGNNLGPVVLNLSLSKQGSHTQCLTGNVGSDNDGSLQQSNRENWDLNTSMEF 240
Query: 241 WEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLT-PLGHSDHLHLSLCSSDH 300
WEGCA+DDPPVHVPVVQTNT V T RCSTEMVKTDTL GKLT PL HSDHLHLSLCSSDH
Sbjct: 241 WEGCASDDPPVHVPVVQTNTTVATDRCSTEMVKTDTLFGKLTHPLDHSDHLHLSLCSSDH 300
Query: 301 RHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRLESKSDEV 360
RHVM+QEQSSF+KLDFRKSSP LSS GRSKQ DDL+G LK VK EPF EGS+LESKSDEV
Sbjct: 301 RHVMSQEQSSFIKLDFRKSSPSLSSPGRSKQFDDLNGTLKVVKSEPFAEGSKLESKSDEV 360
Query: 361 NVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLGGRLDLVE 420
NV G+S++AVVKR FLQLP+ASDIY+SMNIVK+KS+KSESIYQSKQEALKTLGGRLDLVE
Sbjct: 361 NVPGVSDNAVVKRGFLQLPSASDIYKSMNIVKSKSIKSESIYQSKQEALKTLGGRLDLVE 420
Query: 421 KQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEHLSTIVQQ 480
KQV +VDNSC VPM FVAE S+ GNPSC+TDL+ DKDMSN+SELQTP+KEH+STI+ Q
Sbjct: 421 KQVLSDVDNSCAVPMSFVAEMSEVAGNPSCTTDLIIDKDMSNHSELQTPSKEHISTIMHQ 480
Query: 481 G---GCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTNDMIIDR 540
G GC GELVKSE+TDIS+DTGSKD SSPI KPI +P +AEM K AKNPSCTNDMI+D+
Sbjct: 481 GGSHGCCGELVKSEVTDISEDTGSKDSSSPITKPIAIP-LAEMSKTAKNPSCTNDMIVDK 540
Query: 541 DVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMKPFIIGD 600
DVPNHSEL+TPT GPLNRKVHQ G GCDGGLVNSEMTDLSKD CSKDS+SSV+KPFI+ D
Sbjct: 541 DVPNHSELQTPTRGPLNRKVHQ-GDGCDGGLVNSEMTDLSKDTCSKDSNSSVIKPFIVED 600
Query: 601 QNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESDGKQDVN 660
QNENNP W LEH N+QCSSLHG EECSVSDEEKIS+SADLLEEDPYSSEYESDGKQDVN
Sbjct: 601 QNENNPQWHPLEHRNKQCSSLHGCEECSVSDEEKISLSADLLEEDPYSSEYESDGKQDVN 660
Query: 661 EAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSDCSSLVF 720
EAMDAVDN IEEDYEDGEVREPIL TQVESSICETREVK FDHGDCSNGLPGSDCSSLV
Sbjct: 661 EAMDAVDNVIEEDYEDGEVREPILMTQVESSICETREVKIFDHGDCSNGLPGSDCSSLVS 720
Query: 721 VKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAIKATGPRK 780
VKQE KSEILDVKREDIL+ V SNQSSEQEHLKELLVEDNT+KV LNKANKAIKATGPR+
Sbjct: 721 VKQEDKSEILDVKREDILHFVTSNQSSEQEHLKELLVEDNTSKVSLNKANKAIKATGPRQ 780
Query: 781 LFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLPNIREPLN 840
LFHCEKI ALEDQKI S++ATT IEESI TV QSDAENVKTVDFVQN+DL LPN++EPLN
Sbjct: 781 LFHCEKIFALEDQKISSERATTGIEESIATVSQSDAENVKTVDFVQNEDLALPNVKEPLN 840
Query: 841 NDDDVTDDFTHGNRHAQIVNPCQASTSSPIKTRPSLVRSVLTQTDRELIPDMAHEGEKLQ 900
N DDVTDDFT GNRH+QIVNPCQASTSSP KTRPSLVRSVLTQTDRELIPDMAH+GEKLQ
Sbjct: 841 N-DDVTDDFTRGNRHSQIVNPCQASTSSPTKTRPSLVRSVLTQTDRELIPDMAHDGEKLQ 900
Query: 901 PQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRTISLGVYND 960
PQGRDDSYRDVFPKFYVNR QNLSPRTNFTRRRGRFTIRIN+VQGEWDFN TIS GVYND
Sbjct: 901 PQGRDDSYRDVFPKFYVNRRQNLSPRTNFTRRRGRFTIRINSVQGEWDFNPTISPGVYND 960
Query: 961 QIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAGHRGRQILDDEGPIFCHIPSRRK 1020
QI PYDARRRKYMP +SD+DIDQNHYKMKP GPFRT GHRGRQILDDEGPIFCHIPSRRK
Sbjct: 961 QIPPYDARRRKYMPAVSDEDIDQNHYKMKPGGPFRTGGHRGRQILDDEGPIFCHIPSRRK 1020
Query: 1021 SPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTLEDEAMDPIYA 1080
SPGRRDGPP+RGGVKMVH MHRN+SP RCIREP SEL+GPRHGEKFMRTL+DE MDP+Y
Sbjct: 1021 SPGRRDGPPLRGGVKMVHGMHRNVSPSRCIREPGSELIGPRHGEKFMRTLDDETMDPMY- 1080
Query: 1081 HPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFPHP 1140
HPQPPFEVDRPP+IPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFF HP
Sbjct: 1081 HPQPPFEVDRPPYIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFGHP 1140
Query: 1141 EMARRSPPGYRMRSPDQPP-PGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200
EMARRSPPGYRMRSPDQPP GDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR
Sbjct: 1141 EMARRSPPGYRMRSPDQPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200
Query: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYFDGPVHPGQLNELVGDGNDDDRRRFPDRHEHLH 1260
NRTDRISFRNRRFEDMDPRDRIESNEY+DGP+HPGQ NELV DGNDDDRRRFPDRHEHLH
Sbjct: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYYDGPIHPGQFNELVVDGNDDDRRRFPDRHEHLH 1260
Query: 1261 QFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFDRRLKNQPGNLGRR 1320
FRPQCNDSDGENYH DADER RPFRYCAEDEAEFHER KMREREFDRRLKNQ NLGRR
Sbjct: 1261 PFRPQCNDSDGENYHNDADERPRPFRYCAEDEAEFHERSKMREREFDRRLKNQSENLGRR 1320
Query: 1321 TGGVIEEHEVEEYRHGRQMWNEHHGFEDISRMKRKRF 1352
T GVIEEHE +EYRHGRQ+WNEHHGFE+ISRMKRKRF
Sbjct: 1321 T-GVIEEHE-QEYRHGRQLWNEHHGFEEISRMKRKRF 1351
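The identity and positives percentages in a hit summary are the match counts divided by the alignment length, which includes gap columns and can therefore exceed the query length. A small Python sketch of the arithmetic, using the figures reported for this hit; the `percent` helper is a hypothetical name introduced for illustration:

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Figures from the hit summary: 1171 identical and 1239 positive
# positions over an alignment of 1357 columns.
print(f"Identity:  {percent(1171, 1357):.2f}%")  # Identity:  86.29%
print(f"Positives: {percent(1239, 1357):.2f}%")  # Positives: 91.30%
```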
BLAST of HG10022259 vs. NCBI nr
Match:
XP_038890343.1 (uncharacterized protein LOC120079942 isoform X2 [Benincasa hispida])
HSP 1 Score: 2245.3 bits (5817), Expect = 0.0e+00
Identity = 1152/1357 (84.89%), Positives = 1219/1357 (89.83%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MST+D IVPIKKRRFP IQSSPPKEISSLP +DDN+VKVEEPCVSD TVSNSSTITT
Sbjct: 1 MSTNDYTTIVPIKKRRFPSIQSSPPKEISSLPPVDDNMVKVEEPCVSDSPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSEKKKI FSEDGNWKSDLCNVNMVQS+IGPSRVE + ND G V NK TC+VNEN
Sbjct: 61 SEFSEKKKISFSEDGNWKSDLCNVNMVQSSIGPSRVEFKKNDDCFTGSVGNKETCLVNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
LAL EKPELKL SD +SNPG+CAEKKSDEI RKELD+CNSSTS+VK EVELS+ LKE
Sbjct: 121 RMLALQEKPELKLPSSDPDSNPGVCAEKKSDEIHRKELDKCNSSTSVVKKEVELSLSLKE 180
Query: 181 HLVPDSVLEGNNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNWDLNTSMEF 240
LVP SVLEGNNL PV+LNLSLSKQGSHTQCLTGNVGS DGSLQQSNR NWDLNTSMEF
Sbjct: 181 RLVPVSVLEGNNLGPVVLNLSLSKQGSHTQCLTGNVGSDNDGSLQQSNRENWDLNTSMEF 240
Query: 241 WEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLT-PLGHSDHLHLSLCSSDH 300
WEGCA+DDPPVHVPVVQTNT V T RCSTEMVKTDTL GKLT PL HSDHLHLSLCSSDH
Sbjct: 241 WEGCASDDPPVHVPVVQTNTTVATDRCSTEMVKTDTLFGKLTHPLDHSDHLHLSLCSSDH 300
Query: 301 RHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRLESKSDEV 360
RHVM+QEQSSF+KLDFRKSSP LSS GRSKQ DDL+G LK VK EPF EGS+LESKSDEV
Sbjct: 301 RHVMSQEQSSFIKLDFRKSSPSLSSPGRSKQFDDLNGTLKVVKSEPFAEGSKLESKSDEV 360
Query: 361 NVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLGGRLDLVE 420
NV G+S++AVVKR FLQLP+ASDIY+SMNIVK+KS+KSESIYQSKQEALKTLGGRLDLVE
Sbjct: 361 NVPGVSDNAVVKRGFLQLPSASDIYKSMNIVKSKSIKSESIYQSKQEALKTLGGRLDLVE 420
Query: 421 KQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEHLSTIVQQ 480
KQV +VDNSC VPM FVAE S+ GNPSC+TDL+ DKDMSN+SELQTP+KEH+STI+ Q
Sbjct: 421 KQVLSDVDNSCAVPMSFVAEMSEVAGNPSCTTDLIIDKDMSNHSELQTPSKEHISTIMHQ 480
Query: 481 G---GCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTNDMIIDR 540
G GC GELVKSE+TDIS+DTGSKD SSPI KPI +P +AEM K AKNPSCTNDMI+D+
Sbjct: 481 GGSHGCCGELVKSEVTDISEDTGSKDSSSPITKPIAIP-LAEMSKTAKNPSCTNDMIVDK 540
Query: 541 DVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMKPFIIGD 600
DVPNHSEL+TPT GPLNRKVHQ G GCDGGLVNSEMTDLSKD CSKDS+SSV+KPFI+ D
Sbjct: 541 DVPNHSELQTPTRGPLNRKVHQ-GDGCDGGLVNSEMTDLSKDTCSKDSNSSVIKPFIVED 600
Query: 601 QNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESDGKQDVN 660
QNENNP W LEH N+QCSSLHG EECSVSDEEKIS+SADLLEEDPYSSEYESDGKQDVN
Sbjct: 601 QNENNPQWHPLEHRNKQCSSLHGCEECSVSDEEKISLSADLLEEDPYSSEYESDGKQDVN 660
Query: 661 EAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSDCSSLVF 720
EAMDAVDN IEEDYEDGEVREPIL TQVESSICETREVK FDHGDCSNGLPGSDCSSLV
Sbjct: 661 EAMDAVDNVIEEDYEDGEVREPILMTQVESSICETREVKIFDHGDCSNGLPGSDCSSLVS 720
Query: 721 VKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAIKATGPRK 780
VKQE KSEILDVKREDIL+ V SNQSSEQEHLKELLVEDNT+KV LNKANKAIKATGPR+
Sbjct: 721 VKQEDKSEILDVKREDILHFVTSNQSSEQEHLKELLVEDNTSKVSLNKANKAIKATGPRQ 780
Query: 781 LFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLPNIREPLN 840
LFHCEKI ALEDQKI S++ATT IEESI TV QSDAENVKTVDFVQN+DL LPN++EPLN
Sbjct: 781 LFHCEKIFALEDQKISSERATTGIEESIATVSQSDAENVKTVDFVQNEDLALPNVKEPLN 840
Query: 841 NDDDVTDDFTHGNRHAQIVNPCQASTSSPIKTRPSLVRSVLTQTDRELIPDMAHEGEKLQ 900
N DDVTDDFT GNRH+QIVNPCQASTSSP KTRPSLVRSVLTQTDRELIPDMAH+GEKLQ
Sbjct: 841 N-DDVTDDFTRGNRHSQIVNPCQASTSSPTKTRPSLVRSVLTQTDRELIPDMAHDGEKLQ 900
Query: 901 PQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRTISLGVYND 960
PQGRDDSYRDVFPKFYVNR QNLSPRTNFTRRR GVYND
Sbjct: 901 PQGRDDSYRDVFPKFYVNRRQNLSPRTNFTRRR----------------------GVYND 960
Query: 961 QIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAGHRGRQILDDEGPIFCHIPSRRK 1020
QI PYDARRRKYMP +SD+DIDQNHYKMKP GPFRT GHRGRQILDDEGPIFCHIPSRRK
Sbjct: 961 QIPPYDARRRKYMPAVSDEDIDQNHYKMKPGGPFRTGGHRGRQILDDEGPIFCHIPSRRK 1020
Query: 1021 SPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTLEDEAMDPIYA 1080
SPGRRDGPP+RGGVKMVH MHRN+SP RCIREP SEL+GPRHGEKFMRTL+DE MDP+Y
Sbjct: 1021 SPGRRDGPPLRGGVKMVHGMHRNVSPSRCIREPGSELIGPRHGEKFMRTLDDETMDPMY- 1080
Query: 1081 HPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFPHP 1140
HPQPPFEVDRPP+IPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFF HP
Sbjct: 1081 HPQPPFEVDRPPYIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSKRKSERFFGHP 1140
Query: 1141 EMARRSPPGYRMRSPDQPP-PGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200
EMARRSPPGYRMRSPDQPP GDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR
Sbjct: 1141 EMARRSPPGYRMRSPDQPPIHGDMPVRRHGFPFPSLPPNDLRDMGSARDHGHMRSGIRSR 1200
Query: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYFDGPVHPGQLNELVGDGNDDDRRRFPDRHEHLH 1260
NRTDRISFRNRRFEDMDPRDRIESNEY+DGP+HPGQ NELV DGNDDDRRRFPDRHEHLH
Sbjct: 1201 NRTDRISFRNRRFEDMDPRDRIESNEYYDGPIHPGQFNELVVDGNDDDRRRFPDRHEHLH 1260
Query: 1261 QFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFDRRLKNQPGNLGRR 1320
FRPQCNDSDGENYH DADER RPFRYCAEDEAEFHER KMREREFDRRLKNQ NLGRR
Sbjct: 1261 PFRPQCNDSDGENYHNDADERPRPFRYCAEDEAEFHERSKMREREFDRRLKNQSENLGRR 1320
Query: 1321 TGGVIEEHEVEEYRHGRQMWNEHHGFEDISRMKRKRF 1352
T GVIEEHE +EYRHGRQ+WNEHHGFE+ISRMKRKRF
Sbjct: 1321 T-GVIEEHE-QEYRHGRQLWNEHHGFEEISRMKRKRF 1329
BLAST of HG10022259 vs. NCBI nr
Match:
XP_022992789.1 (uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790.1 uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima])
HSP 1 Score: 2046.6 bits (5301), Expect = 0.0e+00
Identity = 1085/1377 (78.79%), Positives = 1165/1377 (84.60%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQ IGPSRVE Q+NDA S GCVENK TCMVNEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDE+DRK+LDR STSL K E ELSVG KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVPDSVLEG+ NLEPVLLNLSLSK+GS QCLT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTLSGKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSF KL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
SKSDEVNVLGLS+SA+VKREFLQ+PNASD+Y MN VKAKSV SES Y+SKQEALKTLG
Sbjct: 361 MSKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421 GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGDMSNHPELQTPTKEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G CGGELV SEMTDISKD GSKD + PIIKPI MP +NPS TN
Sbjct: 481 LNLKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMP---------RNPSPTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL TPT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMD VDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661 GKLDVNEAMDTVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
KISALEDQ+ +KAT IEESI TV QSDAE VKTVD V+ND+ LP
Sbjct: 781 ------------KISALEDQETSPEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDR+ IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901 AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960
Query: 961 ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
IS G Y+DQ+ PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961 ISPGNYSDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020
Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
+FCH+ SRRKSPGRRDGPP GVKM HRM RNISP RC RE SELVGPRHGEKFMRT
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMAHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080
Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPG SRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSK 1140
Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
RKSERFF HPEMARRS PPGYRMRSPDQPP GDMPVRRHGFPFPSLPPN+LRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSAR 1200
Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
DHGHMR +RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGNDD
Sbjct: 1201 DHGHMRPSLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDD 1260
Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
DRRRF +RHEHLHQFRPQCNDSD ENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFANRHEHLHQFRPQCNDSDSENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320
Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
RR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HHGFEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1349
BLAST of HG10022259 vs. NCBI nr
Match:
XP_023550091.1 (uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 2037.7 bits (5278), Expect = 0.0e+00
Identity = 1080/1377 (78.43%), Positives = 1164/1377 (84.53%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFP +QS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPSMQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQS IGPSRVE Q+ND S GCVENK TCMVNEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDVCSTGCVENKETCMVNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDEIDRK+LDR STS+ K E ELS+G KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSIGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVPDSVLEG+ NLEP LLNLSLSK+GS Q LT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPDSVLEGSDLKSLKQINLEPGLLNLSLSKEGSLDQPLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNT+VT HR STEMV TDTLSGKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTVVTMHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
ESKSDEVNVLGLS+SA+VKREFLQ+PNASDIY MN VKAKSV SES Y+SKQ AL+TLG
Sbjct: 361 ESKSDEVNVLGLSDSAIVKREFLQIPNASDIYIPMNTVKAKSVNSESNYESKQVALETLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD MSN+SELQTPT+EH
Sbjct: 421 GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGGMSNHSELQTPTEEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G CGGELV SEMTDISKD GSKDF+SPIIKPI MP +NPS TN
Sbjct: 481 LNLKVHEGAYRCGGELVDSEMTDISKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMDAVDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661 GKLDVNEAMDAVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKNFDHGDSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
K SALEDQ+ +KA+ IEESI TV QSDAE VKTVD V+ND+ LP
Sbjct: 781 ------------KTSALEDQETSPEKASNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ S+SS P KT+ SL RSVLTQTDRE IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSSSSLPSKTKSSLARSVLTQTDRERIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
HEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901 GHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960
Query: 961 ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
IS G YNDQ+ PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961 ISPGNYNDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020
Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
+FCH+ SRRKSPGRRDGPP GVKMVHRM RNISP RC RE SELVGPRHGEKFMRT
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080
Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPSK 1140
Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
RKSERFF HPEMARRS PPGYRMRSPDQPP GDMP RRHGFPFPSLPPNDLRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPDRRHGFPFPSLPPNDLRDMGSAR 1200
Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
DHGHMR G+RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQ+NEL+ DGNDD
Sbjct: 1201 DHGHMRPGLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQMNELIDDGNDD 1260
Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
DRRRF DRHEHLHQFRPQCNDSDGENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFSDRHEHLHQFRPQCNDSDGENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320
Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
RR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HH FEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHSFEDISRMKRKR 1349
BLAST of HG10022259 vs. NCBI nr
Match:
XP_022938519.1 (uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_022938520.1 uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata])
HSP 1 Score: 2016.1 bits (5222), Expect = 0.0e+00
Identity = 1074/1379 (77.88%), Positives = 1158/1379 (83.97%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DD+I KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQS IGPSRVE Q+NDA S GCVENK TCM+NEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDEIDRK+LDR STS+ K E ELSVG KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVP+SVLEG+ NLEPVLLNLSLSK+GS Q LT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTL GKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
ESKSD VNVLGLS+SA+VKREFLQ+PN SDIY MN VKA+SV SE Y+SKQEALKTLG
Sbjct: 361 ESKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEV +SCP PMPFVAE ++A N SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421 GRLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARN-SCSTDLITDGDMSNHPELQTPTKEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G GEL+ SEMTD+SKD GSKDF+SPIIKPI MP +NPS TN
Sbjct: 481 LNLNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMD VDND+EEDYEDGEVREP LTTQVESSICET++VK FDH D SNGLPGSD
Sbjct: 661 GKLDVNEAMDTVDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
K SA+EDQ+ +KAT IEESI TV QSDAE VKTVD V+N++ LP
Sbjct: 781 ------------KTSAIEDQETSPEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDRE IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901 AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960
Query: 961 ISLGVYNDQ---IQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDE 1020
IS G Y+D PYDARRRKYMP +SDDDIDQNHYKMKP PFR+AG HRGRQILDDE
Sbjct: 961 ISPGNYSDHQVPPPPYDARRRKYMPAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDE 1020
Query: 1021 GPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMR 1080
GP+FCH+ SRRKSPGRRDGPP GVKMVHRM RNISP RC RE SELVGPRHGEKFMR
Sbjct: 1021 GPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMR 1080
Query: 1081 TLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFP 1140
T EDEAMDP+YAHPQP FEVDR PFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFP
Sbjct: 1081 TFEDEAMDPLYAHPQPSFEVDRSPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFP 1140
Query: 1141 SKRKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGS 1200
SKRKSERFF HPEMARRS PPGYRMRSPDQPP GDMPVRRHGFPFPSLPPNDLRDMGS
Sbjct: 1141 SKRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGS 1200
Query: 1201 ARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGN 1260
ARDHGHMR GIRSRNRT+R+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGN
Sbjct: 1201 ARDHGHMRPGIRSRNRTERMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGN 1260
Query: 1261 DDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMRERE 1320
DDDRRRF DRHEHLHQFRPQCNDSDGENY DADERARP+RYC EDE EFHERGKMRERE
Sbjct: 1261 DDDRRRFSDRHEHLHQFRPQCNDSDGENYRNDADERARPYRYCTEDEEEFHERGKMRERE 1320
Query: 1321 FDRRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
FDRR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HHGFEDISRMKRKR
Sbjct: 1321 FDRRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1351
BLAST of HG10022259 vs. ExPASy TrEMBL
Match:
A0A6J1JYG4 (uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)
HSP 1 Score: 2046.6 bits (5301), Expect = 0.0e+00
Identity = 1085/1377 (78.79%), Positives = 1165/1377 (84.60%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQ IGPSRVE Q+NDA S GCVENK TCMVNEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDE+DRK+LDR STSL K E ELSVG KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVPDSVLEG+ NLEPVLLNLSLSK+GS QCLT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTLSGKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSF KL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
SKSDEVNVLGLS+SA+VKREFLQ+PNASD+Y MN VKAKSV SES Y+SKQEALKTLG
Sbjct: 361 MSKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421 GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGDMSNHPELQTPTKEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G CGGELV SEMTDISKD GSKD + PIIKPI MP +NPS TN
Sbjct: 481 LNLKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMP---------RNPSPTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL TPT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMD VDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661 GKLDVNEAMDTVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
KISALEDQ+ +KAT IEESI TV QSDAE VKTVD V+ND+ LP
Sbjct: 781 ------------KISALEDQETSPEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDR+ IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901 AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960
Query: 961 ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
IS G Y+DQ+ PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961 ISPGNYSDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020
Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
+FCH+ SRRKSPGRRDGPP GVKM HRM RNISP RC RE SELVGPRHGEKFMRT
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMAHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080
Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPG SRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSK 1140
Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
RKSERFF HPEMARRS PPGYRMRSPDQPP GDMPVRRHGFPFPSLPPN+LRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSAR 1200
Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
DHGHMR +RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGNDD
Sbjct: 1201 DHGHMRPSLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDD 1260
Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
DRRRF +RHEHLHQFRPQCNDSD ENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFANRHEHLHQFRPQCNDSDSENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320
Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
RR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HHGFEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1349
BLAST of HG10022259 vs. ExPASy TrEMBL
Match:
A0A6J1FEB1 (uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)
HSP 1 Score: 2016.1 bits (5222), Expect = 0.0e+00
Identity = 1074/1379 (77.88%), Positives = 1158/1379 (83.97%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DD+I KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQS IGPSRVE Q+NDA S GCVENK TCM+NEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDEIDRK+LDR STS+ K E ELSVG KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVP+SVLEG+ NLEPVLLNLSLSK+GS Q LT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTL GKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
ESKSD VNVLGLS+SA+VKREFLQ+PN SDIY MN VKA+SV SE Y+SKQEALKTLG
Sbjct: 361 ESKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEV +SCP PMPFVAE ++A N SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421 GRLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARN-SCSTDLITDGDMSNHPELQTPTKEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G GEL+ SEMTD+SKD GSKDF+SPIIKPI MP +NPS TN
Sbjct: 481 LNLNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMD VDND+EEDYEDGEVREP LTTQVESSICET++VK FDH D SNGLPGSD
Sbjct: 661 GKLDVNEAMDTVDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
K SA+EDQ+ +KAT IEESI TV QSDAE VKTVD V+N++ LP
Sbjct: 781 ------------KTSAIEDQETSPEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDRE IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRGRFTIRIN+VQGEWDFN T
Sbjct: 901 AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGRFTIRINSVQGEWDFNPT 960
Query: 961 ISLGVYNDQ---IQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDE 1020
IS G Y+D PYDARRRKYMP +SDDDIDQNHYKMKP PFR+AG HRGRQILDDE
Sbjct: 961 ISPGNYSDHQVPPPPYDARRRKYMPAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDE 1020
Query: 1021 GPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMR 1080
GP+FCH+ SRRKSPGRRDGPP GVKMVHRM RNISP RC RE SELVGPRHGEKFMR
Sbjct: 1021 GPLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMR 1080
Query: 1081 TLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFP 1140
T EDEAMDP+YAHPQP FEVDR PFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFP
Sbjct: 1081 TFEDEAMDPLYAHPQPSFEVDRSPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFP 1140
Query: 1141 SKRKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGS 1200
SKRKSERFF HPEMARRS PPGYRMRSPDQPP GDMPVRRHGFPFPSLPPNDLRDMGS
Sbjct: 1141 SKRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGS 1200
Query: 1201 ARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGN 1260
ARDHGHMR GIRSRNRT+R+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGN
Sbjct: 1201 ARDHGHMRPGIRSRNRTERMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGN 1260
Query: 1261 DDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMRERE 1320
DDDRRRF DRHEHLHQFRPQCNDSDGENY DADERARP+RYC EDE EFHERGKMRERE
Sbjct: 1261 DDDRRRFSDRHEHLHQFRPQCNDSDGENYRNDADERARPYRYCTEDEEEFHERGKMRERE 1320
Query: 1321 FDRRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
FDRR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HHGFEDISRMKRKR
Sbjct: 1321 FDRRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1351
BLAST of HG10022259 vs. ExPASy TrEMBL
Match:
A0A6J1JUI7 (uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489020 PE=4 SV=1)
HSP 1 Score: 1996.5 bits (5171), Expect = 0.0e+00
Identity = 1066/1377 (77.41%), Positives = 1145/1377 (83.15%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DDNI KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDNIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQ IGPSRVE Q+NDA S GCVENK TCMVNEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQRIIGPSRVEFQENDACSAGCVENKETCMVNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDE+DRK+LDR STSL K E ELSVG KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEVDRKQLDRLEFSTSLAKKEAELSVGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVPDSVLEG+ NLEPVLLNLSLSK+GS QCLT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPDSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQCLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTLSGKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLSGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSF KL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFAKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
SKSDEVNVLGLS+SA+VKREFLQ+PNASD+Y MN VKAKSV SES Y+SKQEALKTLG
Sbjct: 361 MSKSDEVNVLGLSDSAIVKREFLQIPNASDVYIPMNPVKAKSVNSESNYESKQEALKTLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEVD+SCP PMPFVAE ++A GN SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421 GRLDLVAKQVLPEVDSSCPAPMPFVAEMTEAAGN-SCSTDLITDGDMSNHPELQTPTKEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G CGGELV SEMTDISKD GSKD + PIIKPI MP +NPS TN
Sbjct: 481 LNLKVHEGAYCCGGELVDSEMTDISKDPGSKDSNGPIIKPIAMP---------RNPSPTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL TPT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHTPTTGPLNMKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPFTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMD VDNDIEEDYEDGEVREP LTTQVESSICET++VK FDHGD SNGLPGSD
Sbjct: 661 GKLDVNEAMDTVDNDIEEDYEDGEVREPTLTTQVESSICETKKVKIFDHGDSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
KISALEDQ+ +KAT IEESI TV QSDAE VKTVD V+ND+ LP
Sbjct: 781 ------------KISALEDQETSPEKATNGIEESITTVSQSDAEKVKTVDIVRNDNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDR+ IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRKRIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRG
Sbjct: 901 AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGN----------------- 960
Query: 961 ISLGVYNDQI-QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEGP 1020
Y+DQ+ PYDARRRKYMP +SDDDIDQNHYKMKP GPFR+AG HRGRQILDDEGP
Sbjct: 961 -----YSDQVPPPYDARRRKYMPAVSDDDIDQNHYKMKPDGPFRSAGDHRGRQILDDEGP 1020
Query: 1021 IFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRTL 1080
+FCH+ SRRKSPGRRDGPP GVKM HRM RNISP RC RE SELVGPRHGEKFMRT
Sbjct: 1021 LFCHMASRRKSPGRRDGPPPVRGVKMAHRMPRNISPSRCNRERGSELVGPRHGEKFMRTF 1080
Query: 1081 EDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPSK 1140
EDE MDP+YAHPQP FEVDRPPFI DRRNFPIQRKSF RVDSKSPG SRGRSP QWFPSK
Sbjct: 1081 EDETMDPLYAHPQPSFEVDRPPFIRDRRNFPIQRKSFQRVDSKSPGTSRGRSPSQWFPSK 1140
Query: 1141 RKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSAR 1200
RKSERFF HPEMARRS PPGYRMRSPDQPP GDMPVRRHGFPFPSLPPN+LRDMGSAR
Sbjct: 1141 RKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNNLRDMGSAR 1200
Query: 1201 DHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGNDD 1260
DHGHMR +RSRNRTDR+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGNDD
Sbjct: 1201 DHGHMRPSLRSRNRTDRMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGNDD 1260
Query: 1261 DRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREFD 1320
DRRRF +RHEHLHQFRPQCNDSD ENYH DADERARP+RYC EDE EFHERGKMREREFD
Sbjct: 1261 DRRRFANRHEHLHQFRPQCNDSDSENYHNDADERARPYRYCTEDEEEFHERGKMREREFD 1320
Query: 1321 RRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
RR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HHGFEDISRMKRKR
Sbjct: 1321 RRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1327
BLAST of HG10022259 vs. ExPASy TrEMBL
Match:
A0A6J1FDD8 (uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111444729 PE=4 SV=1)
HSP 1 Score: 1967.6 bits (5096), Expect = 0.0e+00
Identity = 1054/1378 (76.49%), Positives = 1140/1378 (82.73%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPPKEISSLPSLDDNIVKVEEPCVSDGTTVSNSSTITT 60
MSTSD NAIVPIKKRRFPLIQS PPKEISSLP +DD+I KV+EPCVSDG TVSNSSTITT
Sbjct: 1 MSTSDYNAIVPIKKRRFPLIQSPPPKEISSLPLVDDSIAKVDEPCVSDGPTVSNSSTITT 60
Query: 61 SEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMVNEN 120
SEFSE KKI FSEDG KSDLCN+NMVQS IGPSRVE Q+NDA S GCVENK TCM+NEN
Sbjct: 61 SEFSE-KKISFSEDGKRKSDLCNMNMVQSIIGPSRVEFQENDACSTGCVENKETCMMNEN 120
Query: 121 HALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVGLKE 180
HAL LHEKPE KL SDANSNPGLCAEK+SDEIDRK+LDR STS+ K E ELSVG KE
Sbjct: 121 HALVLHEKPEFKLPHSDANSNPGLCAEKESDEIDRKQLDRLEFSTSVAKKEAELSVGSKE 180
Query: 181 HLVPDSVLEGN--------NLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNRGNW 240
HLVP+SVLEG+ NLEPVLLNLSLSK+GS Q LT NVGS DGS+Q+SNR NW
Sbjct: 181 HLVPNSVLEGSDLKSLKQINLEPVLLNLSLSKEGSLDQRLTVNVGSSYDGSIQESNRENW 240
Query: 241 DLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDHLHL 300
DLNTSMEFWEGC++ DPP HVP VQTNTIVTTHR STEMV TDTL GKLTPL SDHLHL
Sbjct: 241 DLNTSMEFWEGCSSGDPPEHVPAVQTNTIVTTHRFSTEMVNTDTLPGKLTPLDDSDHLHL 300
Query: 301 SLCSSDHRHVMTQEQSSFVKLDFRKSSP-LSSEGRSKQSDDLDGALKDVKPEPFVEGSRL 360
SL SSDHRHV++QEQSSFVKL FRK+SP LSS GR Q DDL+GALK VKPEPFVE S+L
Sbjct: 301 SLSSSDHRHVISQEQSSFVKLGFRKTSPSLSSTGRGLQFDDLNGALKVVKPEPFVEASKL 360
Query: 361 ESKSDEVNVLGLSNSAVVKREFLQLPNASDIYRSMNIVKAKSVKSESIYQSKQEALKTLG 420
ESKSD VNVLGLS+SA+VKREFLQ+PN SDIY MN VKA+SV SE Y+SKQEALKTLG
Sbjct: 361 ESKSDGVNVLGLSDSAIVKREFLQIPNVSDIYIPMNTVKARSVNSELNYESKQEALKTLG 420
Query: 421 GRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSELQTPTKEH 480
GRLDLV KQV PEV +SCP PMPFVAE ++A N SCSTDL+TD DMSN+ ELQTPTKEH
Sbjct: 421 GRLDLVAKQVLPEVGSSCPAPMPFVAEMTEAARN-SCSTDLITDGDMSNHPELQTPTKEH 480
Query: 481 LSTIVQQGG--CGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAKNPSCTN 540
L+ V +G GEL+ SEMTD+SKD GSKDF+SPIIKPI MP +NPS TN
Sbjct: 481 LNLNVHEGAYRFAGELIDSEMTDVSKDPGSKDFNSPIIKPIAMP---------RNPSRTN 540
Query: 541 DMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDSSSSVMK 600
D II+ ++ + SEL PT GPLN KVHQ GYGCDGGLVNS MTD+SKD CSKDSSSSV+K
Sbjct: 541 DSIIEANMSSPSELHIPTTGPLNTKVHQAGYGCDGGLVNSVMTDVSKDTCSKDSSSSVIK 600
Query: 601 PFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYSSEYESD 660
P I+ D+N+NNP WR H NEQCSSL GGEE SV+DEEKIS+SADLLEEDPYSSEYESD
Sbjct: 601 PVIVEDENQNNPLWRPSTHTNEQCSSLQGGEESSVNDEEKISLSADLLEEDPYSSEYESD 660
Query: 661 GKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSNGLPGSD 720
GK DVNEAMD VDND+EEDYEDGEVREP LTTQVESSICET++VK FDH D SNGLPGSD
Sbjct: 661 GKLDVNEAMDTVDNDVEEDYEDGEVREPTLTTQVESSICETKKVKNFDHADSSNGLPGSD 720
Query: 721 -CSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQEHLKELLVEDNTTKVCLNKANKAI 780
CSSLV VKQE K EILDVKRED L+SV SNQSSEQE KEL VE++TT+VCLNKANKA
Sbjct: 721 CCSSLVSVKQENKLEILDVKREDNLHSVTSNQSSEQERSKELPVEEHTTRVCLNKANKA- 780
Query: 781 KATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDAENVKTVDFVQNDDLTLP 840
K SA+EDQ+ +KAT IEESI TV QSDAE VKTVD V+N++ LP
Sbjct: 781 ------------KTSAIEDQETSPEKATNGIEESITTVSQSDAEKVKTVDMVRNNNPALP 840
Query: 841 NIREPLNNDDDVTDDFTHGNRHAQIVNPCQASTSS-PIKTRPSLVRSVLTQTDRELIPDM 900
N+ EPL NDDDVTDD T G++H++IV+PC+ STSS P KTR SL RSVLTQTDRE IPDM
Sbjct: 841 NV-EPL-NDDDVTDDITRGSKHSRIVSPCKPSTSSLPSKTRSSLARSVLTQTDRERIPDM 900
Query: 901 AHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGRFTIRINNVQGEWDFNRT 960
AHEGEKL PQGRD+ YRDVF +FYVNRHQNLSP+TNF+RRRG ++
Sbjct: 901 AHEGEKLHPQGRDEPYRDVFQRFYVNRHQNLSPQTNFSRRRGNYS--------------- 960
Query: 961 ISLGVYNDQI--QPYDARRRKYMPTISDDDIDQNHYKMKPSGPFRTAG-HRGRQILDDEG 1020
+ Q+ PYDARRRKYMP +SDDDIDQNHYKMKP PFR+AG HRGRQILDDEG
Sbjct: 961 ------DHQVPPPPYDARRRKYMPAVSDDDIDQNHYKMKPDCPFRSAGDHRGRQILDDEG 1020
Query: 1021 PIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPESELVGPRHGEKFMRT 1080
P+FCH+ SRRKSPGRRDGPP GVKMVHRM RNISP RC RE SELVGPRHGEKFMRT
Sbjct: 1021 PLFCHMASRRKSPGRRDGPPPVRGVKMVHRMPRNISPSRCNRERGSELVGPRHGEKFMRT 1080
Query: 1081 LEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSPGRSRGRSPGQWFPS 1140
EDEAMDP+YAHPQP FEVDR PFI DRRNFPIQRKSF RVDSKSPGRSRGRSP QWFPS
Sbjct: 1081 FEDEAMDPLYAHPQPSFEVDRSPFIRDRRNFPIQRKSFQRVDSKSPGRSRGRSPSQWFPS 1140
Query: 1141 KRKSERFFPHPEMARRS-PPGYRMRSPDQPPP--GDMPVRRHGFPFPSLPPNDLRDMGSA 1200
KRKSERFF HPEMARRS PPGYRMRSPDQPP GDMPVRRHGFPFPSLPPNDLRDMGSA
Sbjct: 1141 KRKSERFFGHPEMARRSPPPGYRMRSPDQPPQIHGDMPVRRHGFPFPSLPPNDLRDMGSA 1200
Query: 1201 RDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRD-RIESNEYFDGPVHPGQLNELVGDGND 1260
RDHGHMR GIRSRNRT+R+SFRNRRFEDMDPRD RIESNEYFDGPVHPGQLNEL+ DGND
Sbjct: 1201 RDHGHMRPGIRSRNRTERMSFRNRRFEDMDPRDNRIESNEYFDGPVHPGQLNELIDDGND 1260
Query: 1261 DDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAEDEAEFHERGKMREREF 1320
DDRRRF DRHEHLHQFRPQCNDSDGENY DADERARP+RYC EDE EFHERGKMREREF
Sbjct: 1261 DDRRRFSDRHEHLHQFRPQCNDSDGENYRNDADERARPYRYCTEDEEEFHERGKMREREF 1320
Query: 1321 DRRLKNQPGNLGRRTGGVIEEHEVEEYR--HGRQMWNE------HHGFEDISRMKRKR 1351
DRR+KNQP NLGRRT VIEEHEVEEYR HGRQMWNE HHGFEDISRMKRKR
Sbjct: 1321 DRRVKNQPENLGRRT--VIEEHEVEEYRHGHGRQMWNEHHHHHHHHGFEDISRMKRKR 1329
BLAST of HG10022259 vs. ExPASy TrEMBL
Match:
A0A6J1BWB0 (uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111006113 PE=4 SV=1)
HSP 1 Score: 1602.0 bits (4147), Expect = 0.0e+00
Identity = 911/1387 (65.68%), Positives = 1016/1387 (73.25%), Query Frame = 0
Query: 1 MSTSDCNAIVPIKKRRFPLIQSSPP---KEISSLPSLDDNIVKVEEPCVSDGTTVSNSST 60
MS SD N IVPIKKRRF ++QSSP KE+SSL SLDDN+VKV EP +SDG TVS+S T
Sbjct: 16 MSASDYNVIVPIKKRRFTIVQSSPSSPHKELSSL-SLDDNLVKVAEPGISDGITVSSSVT 75
Query: 61 ITTSEFSEKKKILFSEDGNWKSDLCNVNMVQSNIGPSRVEVQDNDARSIGCVENKGTCMV 120
ITTSE SEKK+I FSE+ K DLCN N VQSNI PS V Q++DA VENK +
Sbjct: 76 ITTSELSEKKEISFSEESERKVDLCNSNRVQSNIEPSGVRFQEDDACFNHQVENKAMNVE 135
Query: 121 NENHALALHEKPELKLQPSDANSNPGLCAEKKSDEIDRKELDRCNSSTSLVKNEVELSVG 180
NE HAL L EKPELKL SD NS GLCA KK IDRKEL++C S TSLVK E ELSVG
Sbjct: 136 NEKHALHLLEKPELKLPTSDPNSKLGLCANKKRVGIDRKELEKCKSLTSLVKTEAELSVG 195
Query: 181 LKEHLVPDSVLEG--------NNLEPVLLNLSLSKQGSHTQCLTGNVGSVCDGSLQQSNR 240
L E LVPD V++G NNLEPV LNLSLSKQGS+TQCLT NVGS DGSLQQSNR
Sbjct: 196 LNERLVPDLVVKGSDRKWQKQNNLEPVSLNLSLSKQGSYTQCLTSNVGSDYDGSLQQSNR 255
Query: 241 GNWDLNTSMEFWEGCATDDPPVHVPVVQTNTIVTTHRCSTEMVKTDTLSGKLTPLGHSDH 300
GNWDLNTSME WEGCA+DDP V VPVVQTNTIVTTHRCSTEMV+ D SGK TPL SD+
Sbjct: 256 GNWDLNTSMESWEGCASDDPSVQVPVVQTNTIVTTHRCSTEMVRADISSGKPTPLDQSDY 315
Query: 301 LHLSLCSSDHRHVMTQEQSSFVKLDFRKS-SPLSSEGRSKQSDDLDGALKDVKPEPFVEG 360
LHLSL SSD R V QEQ S VKLDFR + S LSS G + Q DDL+ ALK VK EPFV+G
Sbjct: 316 LHLSLNSSDLRPVTKQEQISSVKLDFRSTDSSLSSPG-NMQFDDLNVALKVVKAEPFVKG 375
Query: 361 SRLESKSDEVNVLGLSNSAVVKREF-----LQLPNASDIYRSMNIVKAKSVKSESIYQSK 420
S LESKS+EV LGLS A++ E L+LP AS+I MNIVKAKS KSE +Y+SK
Sbjct: 376 SELESKSNEVKGLGLSGDALMNGELDDQCNLELPKASNICSPMNIVKAKSFKSEPVYESK 435
Query: 421 QEALKTLGGRLDLVEKQVPPEVDNSCPVPMPFVAERSDAIGNPSCSTDLVTDKDMSNYSE 480
+EAL+ LGGRL+L+ KQV P+VDNSCP+ +P VAE S+A NPSCST L TD DM N+SE
Sbjct: 436 KEALEMLGGRLNLISKQVLPDVDNSCPIAVPVVAEMSEAARNPSCSTYLATDGDMLNHSE 495
Query: 481 LQTPTKEHLSTIVQQGGCGGELVKSEMTDISKDTGSKDFSSPIIKPIVMPVVAEMLKAAK 540
L TPTK
Sbjct: 496 LPTPTK------------------------------------------------------ 555
Query: 541 NPSCTNDMIIDRDVPNHSELETPTIGPLNRKVHQEGYGCDGGLVNSEMTDLSKDICSKDS 600
G LN C GGLVNSE TD++KD DS
Sbjct: 556 -------------------------GNLNE--------CGGGLVNSEKTDITKDPGLGDS 615
Query: 601 SSSVMKPFIIGDQNENNPPWRRLEHMNEQCSSLHGGEECSVSDEEKISISADLLEEDPYS 660
S S+ KPF D+N+NNP W L+ NEQCS L GGEE SVSDEEKIS+SAD+LEE PYS
Sbjct: 616 SISIAKPFNAEDENQNNPKWCLLKLSNEQCSGLQGGEESSVSDEEKISLSADILEEYPYS 675
Query: 661 SEYESDGKQDVNEAMDAVDNDIEEDYEDGEVREPILTTQVESSICETREVKKFDHGDCSN 720
SEYESDGKQDV+ AM V NDIEEDYEDGEVREP+L TQVESS+C REV+ FDHGD S
Sbjct: 676 SEYESDGKQDVDGAMAEVHNDIEEDYEDGEVREPLLKTQVESSVCVKREVENFDHGDFSK 735
Query: 721 -------GLPGSDCSSLVFVKQEVKSEILDVKREDILNSVISNQSSEQE-----HLKELL 780
GLPG+D S+L+ VKQE K E DV++ED +SV +NQSSEQE +LKE+L
Sbjct: 736 DKKINSVGLPGTDFSTLISVKQENKLESHDVRQEDKFHSVTTNQSSEQEKDEASYLKEIL 795
Query: 781 VEDNTTKVCLNKANKAIKATGPRKLFHCEKISALEDQKIFSDKATTRIEESIVTVPQSDA 840
VE+N +NK IKATG R+LFHCE+ ALEDQ SDKAT IEE IVTV Q DA
Sbjct: 796 VEENA-------SNKVIKATGRRQLFHCEERDALEDQN-SSDKATDGIEEPIVTVSQGDA 855
Query: 841 ENVKTVDFVQNDDLTLPNIREPLNNDDDVTDDFTHGNRHAQIVNPCQAST-SSPIKTRPS 900
ENVKTVDFV+N+D LPN++EP+NN DD TDDF HG+RH +NPC ST SSP KTR +
Sbjct: 856 ENVKTVDFVRNNDPVLPNVKEPVNN-DDATDDFIHGSRH---INPCHGSTSSSPSKTRSN 915
Query: 901 LVRSVLTQTDRELIPDMAHEGEKLQPQGRDDSYRDVFPKFYVNRHQNLSPRTNFTRRRGR 960
+RSVLT+TDRE I D+A EG KLQPQGRDD Y V K YVNRHQNLSP+TNF RR R
Sbjct: 916 SLRSVLTRTDREQILDVALEGGKLQPQGRDDRYSGVSQKIYVNRHQNLSPQTNF-HRRER 975
Query: 961 FTIRINNVQGEWDFNRTISLGVYNDQIQPYDARRRKYMPTISDDDIDQNHYKMKPSGPFR 1020
FTIR +++QGEWDFN T+S G+Y+DQI PYDA RRKY+ +SDDDIDQNHYK+KP+GPFR
Sbjct: 976 FTIRTDSLQGEWDFNPTVSPGIYSDQI-PYDAPRRKYLSAVSDDDIDQNHYKIKPNGPFR 1035
Query: 1021 TAGHRGRQILDDEGPIFCHIPSRRKSPGRRDGPPVRGGVKMVHRMHRNISPGRCIREPES 1080
+AG +GRQILDDEGP +CHIPSRRKSPG RDGPPVR GVKMVHRM RNISP CIRE S
Sbjct: 1036 SAGRQGRQILDDEGPPYCHIPSRRKSPGIRDGPPVR-GVKMVHRMPRNISPSGCIREAGS 1095
Query: 1081 ELVGPRHGEKFMRTLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSP 1140
ELVGPRHGEKFMRT EDE MDPIYAHPQPP+EVDRPPFI +RRNF IQRK+FPR+DSKSP
Sbjct: 1096 ELVGPRHGEKFMRTFEDETMDPIYAHPQPPYEVDRPPFIRERRNFTIQRKTFPRIDSKSP 1155
Query: 1141 GRSRGRSPGQWFPSKRKSERFFPHPEMARRSPPGY---RMRSPDQPPP--GDMPVRRHGF 1200
GRSRGRSPGQW P KRKS RF H M RRS PGY RMRSPDQPPP GDMPVRRHGF
Sbjct: 1156 GRSRGRSPGQWVPGKRKSYRFCGHLGMTRRSSPGYRGDRMRSPDQPPPIHGDMPVRRHGF 1215
Query: 1201 PFPSLPPNDLRDMGSARDHGHMRSGIRSRNRTDRISFRNRRFEDMDPRDRIESNEYFDGP 1260
PF LP +DLRDM SA D GHMRS IR RNR+DR+SFRNRRFE MDPRDRIES+EYFDG
Sbjct: 1216 PFSPLPSSDLRDMRSAPDQGHMRSDIRCRNRSDRLSFRNRRFEIMDPRDRIESSEYFDG- 1275
Query: 1261 VHPGQLNELVGDGNDDDRRRFPDRHEHLHQFRPQCNDSDGENYHIDADERARPFRYCAED 1320
P QLNEL GDGNDDDRRRF DRHEHLH FRPQ NDSDGENYH +A++ RPFR+CAED
Sbjct: 1276 --PSQLNELSGDGNDDDRRRFSDRHEHLHSFRPQYNDSDGENYHNNAEDSRRPFRFCAED 1294
Query: 1321 -EAEFHERGKMREREFDRRLKNQPGNLGRRTGGVIEEHEVEEYRHGRQMWNEHHGFEDIS 1352
EFHERG MREREF+RR+KNQPGNL RRTG VIEEHEVE+YRHGRQMWN+ HGFEDIS
Sbjct: 1336 GPPEFHERGNMREREFNRRVKNQPGNLTRRTGVVIEEHEVEDYRHGRQMWND-HGFEDIS 1294
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038890337.1 | 0.0e+00 | 86.29 | uncharacterized protein LOC120079942 isoform X1 [Benincasa hispida] >XP_03889033...
XP_038890343.1 | 0.0e+00 | 84.89 | uncharacterized protein LOC120079942 isoform X2 [Benincasa hispida]
XP_022992789.1 | 0.0e+00 | 78.79 | uncharacterized protein LOC111489020 isoform X1 [Cucurbita maxima] >XP_022992790...
XP_023550091.1 | 0.0e+00 | 78.43 | uncharacterized protein LOC111808389 isoform X1 [Cucurbita pepo subsp. pepo]
XP_022938519.1 | 0.0e+00 | 77.88 | uncharacterized protein LOC111444729 isoform X1 [Cucurbita moschata] >XP_0229385...
Match Name | E-value | Identity (%) | Description
A0A6J1JYG4 | 0.0e+00 | 78.79 | uncharacterized protein LOC111489020 isoform X1 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FEB1 | 0.0e+00 | 77.88 | uncharacterized protein LOC111444729 isoform X1 OS=Cucurbita moschata OX=3662 GN...
A0A6J1JUI7 | 0.0e+00 | 77.41 | uncharacterized protein LOC111489020 isoform X2 OS=Cucurbita maxima OX=3661 GN=L...
A0A6J1FDD8 | 0.0e+00 | 76.49 | uncharacterized protein LOC111444729 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1BWB0 | 0.0e+00 | 65.68 | uncharacterized protein LOC111006113 isoform X1 OS=Momordica charantia OX=3673 G...
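The Identity values in the tables above summarize whole alignments. As a rough illustration of how such a figure can be derived from a Query/Sbjct pair, here is a minimal Python sketch. It assumes identity is the number of identical residue columns divided by the total number of aligned columns, gaps included; the page does not state which denominator its BLAST run used, so that convention and the function name `percent_identity` are assumptions for illustration only. The two sequences are one 60-column block copied from the alignment above (Query 1081 to 1140).

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of aligned columns in which both rows carry the same residue.

    Assumption: the denominator is the full alignment length, gap columns
    included. A column never counts as a match if it holds a gap ('-').
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("aligned rows must not be empty")
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)


# One block from the alignment above (Query 1081-1140 and its Sbjct row).
query = "ELVGPRHGEKFMRTLEDEAMDPIYAHPQPPFEVDRPPFIPDRRNFPIQRKSFPRVDSKSP"
subject = "ELVGPRHGEKFMRTFEDETMDPIYAHPQPPYEVDRPPFIRERRNFTIQRKTFPRIDSKSP"

print(f"{percent_identity(query, subject):.2f}")
```

A single block like this only describes 60 columns; reproducing a table value would require concatenating every Query and Sbjct row of the same hit before calling the function.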