HG10005218 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10005218
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Unknown protein
Location: Chr07: 576415 .. 582012 (+)
RNA-Seq Expression: HG10005218
Synteny: HG10005218
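
The location field can be read programmatically. The sketch below is illustrative only: the helper name `parse_location` and the regular expression are assumptions about this page's "Chr: start .. end (strand)" layout, and the length calculation assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re

# Location string copied from the Overview above.
LOCATION = "Chr07: 576415 .. 582012 (+)"


def parse_location(text):
    """Split a location string into chromosome, start, end, and strand.

    Hypothetical helper; the pattern is an assumption about the page layout.
    """
    pattern = r"(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)"
    match = re.fullmatch(pattern, text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


chrom, start, end, strand = parse_location(LOCATION)

# Assuming 1-based inclusive coordinates, the span covers end - start + 1 bases.
span_length = end - start + 1
print(chrom, start, end, strand, span_length)
```

Under the inclusive-coordinate assumption, the gene span on Chr07 is 5598 bases long on the plus strand.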
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGGTAAGTACATACATGATGTTTTATTTACTGAGCTATGTTTTTTTTTGAAAAAATTAAAATTCATTGTGGTATTTTTTATTTTTTTTGTTTCGGTAGAGAGTATGCTTTGAATAAGAGAAGCACGGATTTAACGAGAATTAGTTGGCCATTTAGCGAGAAAGTAAAGAAAGAAGTGGCAGAAGCCTTGCTTCCACCAATGGATGTGAAGAAATTTCGTTGGTGGTCGTCGTTTAAAGAGGAAAAGGTTAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGTAATTATAGAGAGAATTAAAATGCAAAAGATTTGTCCGGTTTGTGGGGTTTTTGTTGCAGCTACGGTGAACGCGGTGAATGCACATATTGATAGTTGTTTAAACGCTCAAACAGGCAAAGAAATTAGGAGAAAGAACAAAGGAGGAGGAGGAGGAGGAGGTAATTTGAATTTGAAGGGAAAATCAAGAACGCCAAAAAAGAGATCAATTGCCGAAATCTTTGCAGTGGCTCCGCCAGTAAAAGCAATGATTATTGTTAATGATTGTGAAGGAGAAGAAGAAAAAGCCGTTGGGAAACAAATTATTCACAACAACAACAACCTCAAAACGACGTCGTTGGCTACAAGTCTTGTCTCCACAATCAAGACAATCAACACCAAAATCACAACAACAACAACAACGGAACAACCCTCAATTGATCTTCTCAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAATAAGGTATGTATTGTATATATGCAATTTTTCATTTTCATTTTGATTAATTTTGGGATTTTTTTTTTTAATTATTTTCTTAGAGGTAGATTTTAGTAGACTGCAAATATTCTCTCAAATTGGGGGGTTTTTGTTTTTGGTAGCTCCTTTCTCTTTCTGCTCATAATTACTGCTGGAAAAAGCTGGTGAAAAGACTGCATTACGTTGGAGAGAAAAAGAGAGACAGTGAAAGAGAGAGAGAGAAAAAAAAAAAACAAGAAATAGTACTTGTTTTTTGTGTGATTAAGGGCGCGGAAATAGGCTGATTCACAATTGATTCACAATTGCAAAAAAATAAAAATAAAAATAAAAATAAAAAATAAAAAGTCACTACAAGACGGGTTTGTATCGTTTGTGAAGGCTTTCATTTGAGCCTGATTATATATATGTATGTATGTATATAATGCTCCAGCTGCAAGAGTACGGTCTTGCCAGTGAGTGACTTTATTTGGTGGAAGAGTTTTTTTTGTCCTTTGTTTAACAAATTTATCCTAGGTTAATGATGCTGTTTTTAACTAGTTTCGTAGTGATTAAATTATAATTTCTTACTTGAGAAATTATTTTAGATTTCTAGTAGTAGTATTAGAGATGAGTGTCCTCGATCGAACGCGTTTAGCGTTTAGGGTTTAGGGTTTTTGAAATAAAACATTTGAGTTTGTTGTAATGGGTTTTGTTTCACTATCATCTCCATTTTGAGGTACTTGGATTTTGACAACTTTTTGACCAAATTTGCTACGCACTAACATTTTTTTGCAACCCATTATCATATGGGGATTATTTTATAAGTTATGCTTGAGTTTTGCTTCAATGATCAAGTACTGAACTGCAAAACTTGAATCTAATTCTTTTTCTCCAACAATCTTTATGAAATGAGAGTTTATAAGTCATTTTTTTGAAGAATAGAACATGAATGTAAATAGTTTTTTAACTATTTTATGATGCTTTTTTGCTGATTAGTAGAGGGAATTGAAGAGACAATGGAATGTTGTGGATGAGAGAGAGAGAGTTGAGAAGGCAATCTCTTTTAGGATAGTGCATTTCATTATTTTGTTGTTAATTAATTCAAGGGCCCTCTTCCCTTATTTATGAGTAGTCTTTGTCTTTGTCCTTTAACAAATTGTCTTATTTGATAATTATGCTTTTTGTCTTATTGTTTTATAAGATGTAAGTGTTTGACCATTGAGAAT
GGAAGAATATATAATGAGGTTTTGTGGGTGAAGATTGCTAGATAGATTAGATGACATCATTATTATATATTAATAACCAAATTAGTGACAACATTTTCTCCTACTATGTAGCTACTAGACCTTAATTTATGATAGGATTGAAATTTTATGTGTTCAAGTTAGAGGGTTGAGGTGCTGCAATAACCCACTATAATTAGAATGAATGTTCTTTGCCAACCATTATGCTTCATATGCTTATTCTTTGAGATTAATTTGATCAACTTCTTGTTTCTTCTTACATTTCATGAAATAGATCTAGAAATATAACAAAATCATTTTCAAGTTTAGCATTGAAGCCATTAGCTTCCATGTCCACTGACTTGATGAGTTCCTTTGCCTCTCCTTTATAATGATGCTTTGTAAACTATTTTCACCCTTTATTTTTGTGTAAGAAAACACATAAAATATATTTTTTGACAGTTCACTTTTAACATCCATCAATGTTGAGTTGGGACTTGTTTGGATTGACTTTTAAAGTGCTTAAACATAATTTCTTAAGTACTTAAAACGTCAATTCAAACAAGTCCTTTGATAGAATATGAAAGGAAAGCTTGTACTCTTTGTATTTACTTTTAAGATTTTAATTATAATTAGTTTCTGACATTTTCTATTGGTTATAGGATTTTGGTCATGGGCAACTTTGCAAGAAGGGAGAGATCAGAAATCACAAGGATGTTTCTACTCTTTGTAAGAAACCATGTTTTAAACGCTTGTCTAGACAAAAAAGGCAAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTAATCAATTTAAGAGGCAGCAATAATCAAGTGTTCAACAATAGTGGTCAAAAGTCTGATAGGCACGTTAGTTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAGACTTTTCTGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGCCTCAGAAGTAAGCACTAATTCAGGTGAAAGTAATAAAGGAGTTGCTTCCATGGAGGCGAATTTAAGTGACCATGTTGTTTGCTTTAGCACCCGACACGAAGTTGATAGTCAACATGTGAAAGGAAAGATTCAGTTGCCTAATGTTCAGAGTCAGGTTAATGCTCAAAGTTGGGACAATGAGAAGCATTCGACCGAGAAGTTGATATCGACAAATCGGGATGTTCCTCATGATCAAAATGATTTGCATTTGTTTGACCATGTCTATGTAGATGCACCTCAGAAGCTGCCACCAGTACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCAATATGGCGATGTAAGAACTCGATGTTGTTTAAATTCAGTCCCACAAGTTCATTCTCTTAATGGAAAATCAGTTGATCATTTGATAAATCCTTTCAATGGAGCAGCTGCTTTAGGCTCAATTACAAGCAAAGTGCCTTCTTCTTTAAGTGAAAATCCTGTTAGCAGATTTCTTAATATAGCTGAATCTTCTGCTAAAGACAATAGATTTCCATTTCCGAATGGGGAGCAAAGTGCGGTCGCCTACAAAGAGAAGGGCGTAAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCAGGTTTGATTAATAGGTTTGATCAAATGAATGACACCGGTAACATTATAGCATGTTCTAACAGAATACCGGCATGCAGTCTCGTCCTGCCAAGGAGCAGGGATTATTTTGTAGACAATGAGAAGCTCCTTGTTGACACAGAACTTACTGGAAACCAGTTAACTTTATTTCCATTGCATAGTCATATGCAAGAATATCAAAATCGATATTTGCCAGCTGGATTCGACGTCGCTGAGCCTGGAACTTCGGAAACAGCTGATATTAGACTGATGAATTCAGAAAGGGGAAATGAAACTGGAAGGTTTTTT
CACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTATGGAAAGTTGCAGAACCAAAATGTAAGTACACAGTTTTATCCTGAAAATTCAAGTAGCATGTGTGCGAATCCCGGTCGGCAAACGATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGGCAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAAACTTCATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATCATCCATCAAAAGGAGAAACCTTGTGTTATCATCCTGCAGGCTTTTATGGCAATCAAATGGCACAAAGGCATTTATTGCAAATGCTTCACAAGTTATGTACCCCCATCCGCGCTTCAATCGAAAAAGCAGTATAATGTATCAAAGACCTGACTCTGTCATCAACTTAAATGAAAGATTCAACGACAACATCCATGGTTTTTCTCCCTTGTTGACCAACACCTTTAATATGGCACGAAACTTTCAAGCACCCTTTATTTCTGGTCTGGAAACACAAAGGTTTGGTTCACATCCATCAGCATTTTCTACTTCTCACCACATGTGTCCAAATAGTTATCAAAATTCTTTTGAACTTGGCTTCAACCAGAATCTACATCCAGCAAAATTAGGAACCTTCAACTTCCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAGAGCTGTCCCCATGGATGTTACACGATCACCAACGGGAAGAAGCGCCAATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTGTCCATGTATTCCTTCTGGCTCAGATGTTCTCATTAGCCCCTCTTCCATGCATCAAAGGCTTGAAACTGCCTATCCTTGCAGTACAATGCCATATTCTCACTTACAGATGAAGAATCATATCCCGGGTTCGACATCTTTTTTTCAACCAATTCCTGTTGGTCCAAGAGTACTTCAATCGCCAATTTCCAATGCAGGCCATCAAATTAGAATGAGCTCTGAGGACAGGTTGAAGTTCAACTCTTTGAGTGTCAAGGACTCTGATTTTTCAAGTAAAACACAACCGGCTCTAGAGCTGGTCAATTCGAGGAAGCGTCAAAGGGTATCGAGTTTAGAAATGAACAATTCAGGTGTTGTGCCAGAGTGGACAAGAGGAAAATTCAGTGATGATCACCTGCAATCTTACTCGGGGACGGTGAAAATCCATGCTAACTGGGACAAAGCTGTTAATTCAGTAGGAAATAATATCCCAAATATGACTCAAACTACTGATGAAGTAATGATTTCTACCAACAATAATGAAGCTCCTAAGGTTGAATGTATGGCAAGATCCGGCCCCATCAAGTTAACAGCAGGAGCAAAACACATCCTGAAACCAAGTCAGAGTATGGATCTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGTTAGCTTGGTAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTCTAA

mRNA sequence

ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTATGCTTTGAATAAGAGAAGCACGGATTTAACGAGAATTAGTTGGCCATTTAGCGAGAAAGTAAAGAAAGAAGTGGCAGAAGCCTTGCTTCCACCAATGGATGTGAAGAAATTTCGTTGGTGGTCGTCGTTTAAAGAGGAAAAGGTTAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGTAATTATAGAGAGAATTAAAATGCAAAAGATTTGTCCGGTTTGTGGGGTTTTTGTTGCAGCTACGGTGAACGCGGTGAATGCACATATTGATAGTTGTTTAAACGCTCAAACAGGCAAAGAAATTAGGAGAAAGAACAAAGGAGGAGGAGGAGGAGGAGGTAATTTGAATTTGAAGGGAAAATCAAGAACGCCAAAAAAGAGATCAATTGCCGAAATCTTTGCAGTGGCTCCGCCAGTAAAAGCAATGATTATTGTTAATGATTGTGAAGGAGAAGAAGAAAAAGCCGTTGGGAAACAAATTATTCACAACAACAACAACCTCAAAACGACGTCGTTGGCTACAAGTCTTGTCTCCACAATCAAGACAATCAACACCAAAATCACAACAACAACAACAACGGAACAACCCTCAATTGATCTTCTCAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAATAAGGATTTTGGTCATGGGCAACTTTGCAAGAAGGGAGAGATCAGAAATCACAAGGATGTTTCTACTCTTTGTAAGAAACCATGTTTTAAACGCTTGTCTAGACAAAAAAGGCAAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTAATCAATTTAAGAGGCAGCAATAATCAAGTGTTCAACAATAGTGGTCAAAAGTCTGATAGGCACGTTAGTTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAGACTTTTCTGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGCCTCAGAAGTAAGCACTAATTCAGGTGAAAGTAATAAAGGAGTTGCTTCCATGGAGGCGAATTTAAGTGACCATGTTGTTTGCTTTAGCACCCGACACGAAGTTGATAGTCAACATGTGAAAGGAAAGATTCAGTTGCCTAATGTTCAGAGTCAGGTTAATGCTCAAAGTTGGGACAATGAGAAGCATTCGACCGAGAAGTTGATATCGACAAATCGGGATGTTCCTCATGATCAAAATGATTTGCATTTGTTTGACCATGTCTATGTAGATGCACCTCAGAAGCTGCCACCAGTACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCAATATGGCGATGTAAGAACTCGATGTTGTTTAAATTCAGTCCCACAAGTTCATTCTCTTAATGGAAAATCAGTTGATCATTTGATAAATCCTTTCAATGGAGCAGCTGCTTTAGGCTCAATTACAAGCAAAGTGCCTTCTTCTTTAAGTGAAAATCCTGTTAGCAGATTTCTTAATATAGCTGAATCTTCTGCTAAAGACAATAGATTTCCATTTCCGAATGGGGAGCAAAGTGCGGTCGCCTACAAAGAGAAGGGCGTAAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCAGGTTTGATTAATAGGTTTGATCAAATGAATGACACCGGTAACATTATAGCATGTTCTAACAGAATACCGGCATGCAGTCTCGTCCTGCCAAGGAGCAGGGATTATTTTGTAGACAATGAGAAGCTCCTTGTTGACACAGAACTTACTGGAAACCAGTTAACTTTATTTCCATTGCATAGTCATATGCAAGAATATCAAAATCGATATTTGCCAGCTGGATTCGACGTCGCTGAGCCTGGAACTTCGGAAACAGCTGATATTAGACTGATGAATTCAGAAAG
GGGAAATGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTATGGAAAGTTGCAGAACCAAAATGTAAGTACACAGTTTTATCCTGAAAATTCAAGTAGCATGTGTGCGAATCCCGGTCGGCAAACGATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGGCAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAAACTTCATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATCATCCATCAAAAGGAGAAACCTTGTGTTATCATCCTGCAGGCTTTTATGGCAATCAAATGGCACAAAGGCATTTATTGCAAATGCTTCACAAATTCAACGACAACATCCATGGTTTTTCTCCCTTGTTGACCAACACCTTTAATATGGCACGAAACTTTCAAGCACCCTTTATTTCTGGTCTGGAAACACAAAGGTTTGGTTCACATCCATCAGCATTTTCTACTTCTCACCACATGTGTCCAAATAGTTATCAAAATTCTTTTGAACTTGGCTTCAACCAGAATCTACATCCAGCAAAATTAGGAACCTTCAACTTCCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAGAGCTGTCCCCATGGATGTTACACGATCACCAACGGGAAGAAGCGCCAATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTGTCCATGTATTCCTTCTGGCTCAGATGTTCTCATTAGCCCCTCTTCCATGCATCAAAGGCTTGAAACTGCCTATCCTTGCAGTACAATGCCATATTCTCACTTACAGATGAAGAATCATATCCCGGGTTCGACATCTTTTTTTCAACCAATTCCTGTTGGTCCAAGAGTACTTCAATCGCCAATTTCCAATGCAGGCCATCAAATTAGAATGAGCTCTGAGGACAGGTTGAAGTTCAACTCTTTGAGTGTCAAGGACTCTGATTTTTCAAGTAAAACACAACCGGCTCTAGAGCTGGTCAATTCGAGGAAGCGTCAAAGGGTATCGAGTTTAGAAATGAACAATTCAGGTGTTGTGCCAGAGTGGACAAGAGGAAAATTCAGTGATGATCACCTGCAATCTTACTCGGGGACGGTGAAAATCCATGCTAACTGGGACAAAGCTGTTAATTCAGTAGGAAATAATATCCCAAATATGACTCAAACTACTGATGAAGTAATGATTTCTACCAACAATAATGAAGCTCCTAAGGTTGAATGTATGGCAAGATCCGGCCCCATCAAGTTAACAGCAGGAGCAAAACACATCCTGAAACCAAGTCAGAGTATGGATCTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGTTAGCTTGGTAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTCTAA
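
The gene sequence and the mRNA sequence differ by the introns that are removed during splicing. The sketch below locates the first intron by comparing the opening bases of the two sequences listed above. The function name `first_intron` and the prefix-matching approach are assumptions made for illustration, not the method used to annotate this gene, and exact splice boundaries found this way can be ambiguous by a base or two in general.

```python
# Opening bases of the gene sequence (with intron) and of the mRNA sequence,
# copied from the listings above and truncated for readability.
GENOMIC_PREFIX = (
    "ATGGCCGTTCCCACCTCCGCTTTCTCCATCCG"
    "GTAAGTACATACATGATGTTTTATTTACTGAGCTATGTTTTTTTTTGAAAAAATTAAAATTCATTGTGG"
    "TATTTTTTATTTTTTTTGTTTCGGTAG"
    "AGAGTATGCTTTGAATAAGAGAAGC"
)
MRNA_PREFIX = "ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTATGCTTTGAATAAGAGAAGC"


def first_intron(genomic, mrna):
    """Return (first_exon, first_intron) by prefix matching.

    Hypothetical helper: the first exon is taken as the longest prefix shared
    by both sequences, and the intron as the genomic stretch skipped before
    the remaining mRNA bases reappear.
    """
    exon_len = 0
    limit = min(len(genomic), len(mrna))
    while exon_len < limit and genomic[exon_len] == mrna[exon_len]:
        exon_len += 1
    rest = mrna[exon_len:]
    resume = genomic.find(rest, exon_len)
    if resume == -1:
        raise ValueError("mRNA remainder not found in genomic sequence")
    return genomic[:exon_len], genomic[exon_len:resume]


exon, intron = first_intron(GENOMIC_PREFIX, MRNA_PREFIX)
spliced = exon + GENOMIC_PREFIX[len(exon) + len(intron):]
print(len(exon), intron[:2], intron[-2:], spliced == MRNA_PREFIX)
```

For these prefixes the recovered intron begins with GT and ends with AG, the canonical splice-site dinucleotides, and removing it reproduces the mRNA prefix.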

Coding sequence (CDS)

ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTATGCTTTGAATAAGAGAAGCACGGATTTAACGAGAATTAGTTGGCCATTTAGCGAGAAAGTAAAGAAAGAAGTGGCAGAAGCCTTGCTTCCACCAATGGATGTGAAGAAATTTCGTTGGTGGTCGTCGTTTAAAGAGGAAAAGGTTAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGTAATTATAGAGAGAATTAAAATGCAAAAGATTTGTCCGGTTTGTGGGGTTTTTGTTGCAGCTACGGTGAACGCGGTGAATGCACATATTGATAGTTGTTTAAACGCTCAAACAGGCAAAGAAATTAGGAGAAAGAACAAAGGAGGAGGAGGAGGAGGAGGTAATTTGAATTTGAAGGGAAAATCAAGAACGCCAAAAAAGAGATCAATTGCCGAAATCTTTGCAGTGGCTCCGCCAGTAAAAGCAATGATTATTGTTAATGATTGTGAAGGAGAAGAAGAAAAAGCCGTTGGGAAACAAATTATTCACAACAACAACAACCTCAAAACGACGTCGTTGGCTACAAGTCTTGTCTCCACAATCAAGACAATCAACACCAAAATCACAACAACAACAACAACGGAACAACCCTCAATTGATCTTCTCAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAATAAGGATTTTGGTCATGGGCAACTTTGCAAGAAGGGAGAGATCAGAAATCACAAGGATGTTTCTACTCTTTGTAAGAAACCATGTTTTAAACGCTTGTCTAGACAAAAAAGGCAAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTAATCAATTTAAGAGGCAGCAATAATCAAGTGTTCAACAATAGTGGTCAAAAGTCTGATAGGCACGTTAGTTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAGACTTTTCTGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGCCTCAGAAGTAAGCACTAATTCAGGTGAAAGTAATAAAGGAGTTGCTTCCATGGAGGCGAATTTAAGTGACCATGTTGTTTGCTTTAGCACCCGACACGAAGTTGATAGTCAACATGTGAAAGGAAAGATTCAGTTGCCTAATGTTCAGAGTCAGGTTAATGCTCAAAGTTGGGACAATGAGAAGCATTCGACCGAGAAGTTGATATCGACAAATCGGGATGTTCCTCATGATCAAAATGATTTGCATTTGTTTGACCATGTCTATGTAGATGCACCTCAGAAGCTGCCACCAGTACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCAATATGGCGATGTAAGAACTCGATGTTGTTTAAATTCAGTCCCACAAGTTCATTCTCTTAATGGAAAATCAGTTGATCATTTGATAAATCCTTTCAATGGAGCAGCTGCTTTAGGCTCAATTACAAGCAAAGTGCCTTCTTCTTTAAGTGAAAATCCTGTTAGCAGATTTCTTAATATAGCTGAATCTTCTGCTAAAGACAATAGATTTCCATTTCCGAATGGGGAGCAAAGTGCGGTCGCCTACAAAGAGAAGGGCGTAAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCAGGTTTGATTAATAGGTTTGATCAAATGAATGACACCGGTAACATTATAGCATGTTCTAACAGAATACCGGCATGCAGTCTCGTCCTGCCAAGGAGCAGGGATTATTTTGTAGACAATGAGAAGCTCCTTGTTGACACAGAACTTACTGGAAACCAGTTAACTTTATTTCCATTGCATAGTCATATGCAAGAATATCAAAATCGATATTTGCCAGCTGGATTCGACGTCGCTGAGCCTGGAACTTCGGAAACAGCTGATATTAGACTGATGAATTCAGAAAG
GGGAAATGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTATGGAAAGTTGCAGAACCAAAATGTAAGTACACAGTTTTATCCTGAAAATTCAAGTAGCATGTGTGCGAATCCCGGTCGGCAAACGATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGGCAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAAACTTCATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATCATCCATCAAAAGGAGAAACCTTGTGTTATCATCCTGCAGGCTTTTATGGCAATCAAATGGCACAAAGGCATTTATTGCAAATGCTTCACAAATTCAACGACAACATCCATGGTTTTTCTCCCTTGTTGACCAACACCTTTAATATGGCACGAAACTTTCAAGCACCCTTTATTTCTGGTCTGGAAACACAAAGGTTTGGTTCACATCCATCAGCATTTTCTACTTCTCACCACATGTGTCCAAATAGTTATCAAAATTCTTTTGAACTTGGCTTCAACCAGAATCTACATCCAGCAAAATTAGGAACCTTCAACTTCCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAGAGCTGTCCCCATGGATGTTACACGATCACCAACGGGAAGAAGCGCCAATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTGTCCATGTATTCCTTCTGGCTCAGATGTTCTCATTAGCCCCTCTTCCATGCATCAAAGGCTTGAAACTGCCTATCCTTGCAGTACAATGCCATATTCTCACTTACAGATGAAGAATCATATCCCGGGTTCGACATCTTTTTTTCAACCAATTCCTGTTGGTCCAAGAGTACTTCAATCGCCAATTTCCAATGCAGGCCATCAAATTAGAATGAGCTCTGAGGACAGGTTGAAGTTCAACTCTTTGAGTGTCAAGGACTCTGATTTTTCAAGTAAAACACAACCGGCTCTAGAGCTGGTCAATTCGAGGAAGCGTCAAAGGGTATCGAGTTTAGAAATGAACAATTCAGGTGTTGTGCCAGAGTGGACAAGAGGAAAATTCAGTGATGATCACCTGCAATCTTACTCGGGGACGGTGAAAATCCATGCTAACTGGGACAAAGCTGTTAATTCAGTAGGAAATAATATCCCAAATATGACTCAAACTACTGATGAAGTAATGATTTCTACCAACAATAATGAAGCTCCTAAGGTTGAATGTATGGCAAGATCCGGCCCCATCAAGTTAACAGCAGGAGCAAAACACATCCTGAAACCAAGTCAGAGTATGGATCTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGTTAGCTTGGTAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTCTAA

Protein sequence

MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSSFKEEKVSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQTGKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVGKQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPSSLSENPVSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQMLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAGLVHSVSLVGSQKKSTKVYSF
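
The protein sequence corresponds to a codon-by-codon translation of the coding sequence. As a minimal sketch, the code below translates the first 19 codons of the CDS listed above with the standard genetic code. The `translate` helper and the compact codon-table construction are illustrative assumptions, and the page does not state which translation table was used for the annotation.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# First 57 bases (19 codons) of the coding sequence shown above.
CDS_PREFIX = "ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTATGCTTTGAATAAGAGAAGC"


def translate(dna):
    """Translate a DNA string in frame 0; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


peptide = translate(CDS_PREFIX)
print(peptide)
```

The result, MAVPTSAFSIREYALNKRS, matches the first 19 residues of the protein sequence above.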
Homology
BLAST of HG10005218 vs. NCBI nr
Match: XP_038888639.1 (uncharacterized protein LOC120078436 [Benincasa hispida])

HSP 1 Score: 1832.4 bits (4745), Expect = 0.0e+00
Identity = 967/1218 (79.39%), Positives = 1023/1218 (83.99%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSSFKEEK 60
            MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS      
Sbjct: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS------ 60

Query: 61   VSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQ-TGKEIRRKNKG 120
             SE    EEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLN+Q T KEIR+K   
Sbjct: 61   -SERVISEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNSQITSKEIRKK--- 120

Query: 121  GGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDC--EGEEEKAVGKQII---H 180
                     LK KSRTPKKRSIA+IFAVAPPVK MII NDC  E EE+KAVGKQII   +
Sbjct: 121  ---------LKAKSRTPKKRSIADIFAVAPPVKTMIIANDCCDEEEEKKAVGKQIIRHNN 180

Query: 181  NNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQ 240
            NNNNLKTTSLATSLVSTIKTIN    TTT  EQPSI             KKK KDFGHGQ
Sbjct: 181  NNNNLKTTSLATSLVSTIKTIN----TTTEQEQPSI-----------LHKKKKKDFGHGQ 240

Query: 241  LCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKA 300
            LC+KGEIRNHKDVSTLCKKPCFKRL RQKR+KLVKKSNVVAKQQRPMP LRSILKHSVKA
Sbjct: 241  LCRKGEIRNHKDVSTLCKKPCFKRLCRQKRKKLVKKSNVVAKQQRPMPLLRSILKHSVKA 300

Query: 301  ISETNSSLINLRGSNNQVFNN-SGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQ 360
             SETN S INLRG+NNQVFNN  GQKSDR VSFLDKDDVLG ST  FSDTFEQNVGNPFQ
Sbjct: 301  TSETNFSSINLRGNNNQVFNNGGGQKSDRRVSFLDKDDVLGLSTEVFSDTFEQNVGNPFQ 360

Query: 361  ASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNAQSWDN 420
            ASEVSTNSGESNK VA +EANL+D  VCFST+HEVD QH KGKIQLPN  +QVNA+SWDN
Sbjct: 361  ASEVSTNSGESNKEVAPVEANLNDD-VCFSTQHEVDGQHAKGKIQLPNFHNQVNAESWDN 420

Query: 421  EKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRT 480
             KHSTE LIS N+D+PHDQNDL LFDHVYVD  QKL PVHSAIPALLAAQEERQYG VRT
Sbjct: 421  AKHSTENLISKNQDIPHDQNDLRLFDHVYVDGLQKLSPVHSAIPALLAAQEERQYGHVRT 480

Query: 481  RCCLNSVPQVHSLNGKSVDHLINPF-NGAAALGSITSKVP-SSLSENPVSRFLNIAESSA 540
            +C LNS+ Q HSL GKS DHLINPF NG AALGSITS+VP SSLSENPVSRFLN+AESS 
Sbjct: 481  QCGLNSIRQAHSLYGKSTDHLINPFNNGVAALGSITSRVPSSSLSENPVSRFLNLAESSI 540

Query: 541  KDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNIIAC 600
            KD  FPF NGE+S V+YKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMN+  N IAC
Sbjct: 541  KDTIFPFSNGEESMVSYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNTIAC 600

Query: 601  SNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDVAE 660
            S+RIP CSLVLPRSRDYF+DNEKLLVDTELTGNQLTLFPLHSH+ E QNRY PAGFD++E
Sbjct: 601  SSRIPVCSLVLPRSRDYFIDNEKLLVDTELTGNQLTLFPLHSHLPENQNRYFPAGFDISE 660

Query: 661  PG-TSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSSM 720
            PG TSETADIRLMNSERG E+GRFFHPNLMDSP+NRCRYYGK QNQNVSTQFYPENSSSM
Sbjct: 661  PGITSETADIRLMNSERGTESGRFFHPNLMDSPYNRCRYYGKFQNQNVSTQFYPENSSSM 720

Query: 721  CANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFLQ 780
            CANPG+QTMRLMGKDVAVGGN QEVQEPEVINFWKNS  IGNCLTNPIQETHMRKRNFLQ
Sbjct: 721  CANPGQQTMRLMGKDVAVGGNRQEVQEPEVINFWKNSTLIGNCLTNPIQETHMRKRNFLQ 780

Query: 781  DRELHHPSKGETLCYHPAGFYGNQMAQRH----------------------------LLQ 840
            DRELHHPSKGETL YHPAGF+GNQ+AQ +                            ++ 
Sbjct: 781  DRELHHPSKGETLFYHPAGFHGNQVAQSNFFANASQVRYPHPHLNRKSSIMYQRPDSVIN 840

Query: 841  MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQNS 900
            +   FN+NIH FSP  T+TFNMA+NFQ PFISG ET RFGS PSAFSTSHH CPN Y+NS
Sbjct: 841  LNESFNNNIHAFSPSSTDTFNMAQNFQGPFISGPETLRFGSQPSAFSTSHHTCPNRYENS 900

Query: 901  FELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSSLGLTLLELSPWMLHDH 960
            FELGFNQNLHPAKLGTFNFPFLQPDDE HV L  S   K            L PWMLHDH
Sbjct: 901  FELGFNQNLHPAKLGTFNFPFLQPDDETHVQLPWSHTSK-----------SLPPWMLHDH 960

Query: 961  QREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKNHI 1020
            QRE     NSKLAD+NGYYCPCIP G+DVLI+PSSMH RLETAYPCSTMPYSHLQ KNHI
Sbjct: 961  QREAPQTTNSKLADLNGYYCPCIPFGTDVLINPSSMHHRLETAYPCSTMPYSHLQTKNHI 1020

Query: 1021 PGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALELVN 1080
            PG TSFFQP+PV PR+LQSPI+NAGH+IR+SSEDRLKFN+LSVKD DFSSKT  A ELV+
Sbjct: 1021 PGPTSFFQPMPVAPRILQSPIANAGHEIRLSSEDRLKFNTLSVKDFDFSSKTLLAGELVD 1080

Query: 1081 SRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNMTQ 1140
            SRKRQ++SSLE NNSGVVP WTRGKFSDDHL+S  GTVKIHANWDKAVNS G NIPNMTQ
Sbjct: 1081 SRKRQKISSLETNNSGVVPGWTRGKFSDDHLESNPGTVKIHANWDKAVNSAG-NIPNMTQ 1140

Query: 1141 TTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAGLV 1180
            TTD V+IST NNE PK ECMARSGPIKLTAGAKHILKPSQS+D+DNTKPTYSTIPSAGLV
Sbjct: 1141 TTDGVVISTKNNETPKFECMARSGPIKLTAGAKHILKPSQSVDIDNTKPTYSTIPSAGLV 1171
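
The percentages in each HSP summary are the reported counts divided by the alignment length. The sketch below recomputes them for the Benincasa hispida hit above. The `percent` helper is hypothetical, and rounding to two decimals is an assumption chosen to match the way the figures are displayed.

```python
# Counts copied from the HSP summary of the XP_038888639.1 match above.
IDENTICAL = 967
POSITIVE = 1023
ALIGNMENT_LENGTH = 1218


def percent(count, total):
    """Return count / total as a percentage rounded to two decimals."""
    return round(100.0 * count / total, 2)


identity_pct = percent(IDENTICAL, ALIGNMENT_LENGTH)
positives_pct = percent(POSITIVE, ALIGNMENT_LENGTH)
print(identity_pct, positives_pct)
```

Both values agree with the reported 79.39% identity and 83.99% positives.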

BLAST of HG10005218 vs. NCBI nr
Match: XP_011657559.1 (uncharacterized protein LOC105435872 [Cucumis sativus] >KGN47991.1 hypothetical protein Csa_004444 [Cucumis sativus])

HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 953/1220 (78.11%), Positives = 1025/1220 (84.02%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSS--FKE 60
            MA PTS FSIREYALNKRS  LT ISWPFSEKVKKEVAE+LLPPMDVKKFRWWSS     
Sbjct: 1    MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60

Query: 61   EKVSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQTGKEIRRKNK 120
            ++  E EE EE+EVI ERIKMQKICPVCGVFVAATV AVNAHID+CL   T KEIRRKN 
Sbjct: 61   QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKN- 120

Query: 121  GGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDC--EGEEEKAVGKQIIHNN 180
                      LK KSRTPKKRSIAEIFAVAPPVK MI+VNDC  + EE+KAVGKQIIH+N
Sbjct: 121  ---------YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHN 180

Query: 181  NNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQLC 240
             NLKTTSLATSLVS IKTI  KI   TTTE+P+I  L K+KKKKKKKKKKNKDF HG+LC
Sbjct: 181  KNLKTTSLATSLVSAIKTIKNKI--ATTTEEPTI--LAKRKKKKKKKKKKNKDFCHGKLC 240

Query: 241  KKGEIRNHKDVSTLCK-KPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKAI 300
            KKG+IRNHKDVST CK +PCFKRLS+QK++KL KKS VVAKQQRPMPPLRSILKHSVKAI
Sbjct: 241  KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300

Query: 301  SETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQAS 360
            SETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR  SDTFEQNVGNPFQAS
Sbjct: 301  SETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQAS 360

Query: 361  EVSTNSGESNKGVASMEANLSDHVVCF-STRHEVDSQHVKGKIQLPNVQSQVNAQSWDNE 420
            EVSTNSGESNK V SMEANL+D V CF STRH+VDSQHVKGKIQLPN  +QVNAQSW+N 
Sbjct: 361  EVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENP 420

Query: 421  KHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRTR 480
            KHSTEKLI  +RD+PHD+NDLHLFDHVYVDA QKLPP HSAIPALLAAQEER YG VRT+
Sbjct: 421  KHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQ 480

Query: 481  CCLNSVPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSLSENPVSRFLNIAESS 540
            C LN VPQ HSL GKSVDHLI   N FNG AALGS+TS+VP SSL+ENPVSRFLN+AESS
Sbjct: 481  CGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESS 540

Query: 541  AKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNII 600
            A+D NRF   NGEQ  V YKEKGVNDGFFCLPLNS+GELIQLNSGL +RFDQMN+    I
Sbjct: 541  ARDSNRFQISNGEQGVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEANTTI 600

Query: 601  ACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDV 660
            A S+RIP C+ V+PRSRDYFVDNEKL +DT+LTGNQLTLFPLHSHMQE QNRYLPAGFDV
Sbjct: 601  AGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDV 660

Query: 661  AEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSS 720
             EPGTSETADIRLMNSERG ETGRFFHPNLMDSPFNRCRYY K QNQNVS QFYPENSSS
Sbjct: 661  PEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSS 720

Query: 721  MCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFL 780
            MCANPGRQTMRLMGKDVAVGGNG++VQEPEVINFWKNS+ IGNCLTNPIQETHMRKRNFL
Sbjct: 721  MCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFL 780

Query: 781  QDRELHHPSKGETLCYHPAGFYGNQMAQRHLL---------------------------- 840
            QDRELH+PS+GETL YHPAGF+GNQ+AQ +LL                            
Sbjct: 781  QDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESV 840

Query: 841  -QMLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQ 900
              +  +FN NIH F    T+T NMARNFQAPF+SGLETQRF S PSAFSTSHH+CPN Y+
Sbjct: 841  INLNERFN-NIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYE 900

Query: 901  NSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSSLGLTLLELSPWMLH 960
            NSFELGFNQ+LHPAKLGTFNFPFLQPDD NHV L  S   K            LSPW+LH
Sbjct: 901  NSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSK-----------SLSPWILH 960

Query: 961  DHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKN 1020
            DHQRE  P ANSKLAD+NGYYCPC P G+DVLISPSS+H +LETAYPCSTM YSHLQ KN
Sbjct: 961  DHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQTKN 1020

Query: 1021 HIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALEL 1080
            HIPGSTS FQPIP+ PRVL SPI+NAGH+IRM SEDRLKFNSLSVK+SDFSSK Q A E 
Sbjct: 1021 HIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEF 1080

Query: 1081 VNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNM 1140
            V+SRKRQ+  SLE NNSGVVPEWTRGK+SDDHL+S  GTVKIHANWDKAVNSVG NIPNM
Sbjct: 1081 VDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVG-NIPNM 1140

Query: 1141 TQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAG 1180
            TQTTD ++IS NNNEA +VECMARSGPIKLTAGAKHILKPSQSMD+DNTKPTYSTIPSAG
Sbjct: 1141 TQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAG 1191

BLAST of HG10005218 vs. NCBI nr
Match: XP_008449514.1 (PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo] >KAA0061673.1 putative Zinc finger, Rad18-type [Cucumis melo var. makuwa] >TYK21149.1 putative Zinc finger, Rad18-type [Cucumis melo var. makuwa])

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 724/934 (77.52%), Positives = 781/934 (83.62%), Query Frame = 0

Query: 281  MPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDF 340
            MPPLRSILK SVKAISETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR  
Sbjct: 1    MPPLRSILKRSVKAISETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRAI 60

Query: 341  SDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLP 400
            SDTFEQNVGNPFQASEV  NSGESNK V SMEANL+D V CF+TRH+VDSQHVKGKIQLP
Sbjct: 61   SDTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFNTRHKVDSQHVKGKIQLP 120

Query: 401  NVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 460
            N  +QVNA+ W+N KHSTEKLI  +RD+PHD+NDLH F HVYVDA QKLP  HSAIPALL
Sbjct: 121  NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180

Query: 461  AAQEERQYGDVRTRCCLNS-VPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSL 520
            A QEER YG VRT+C LN+ VPQ HSL GKSVD+LI   N FNG AALGS+TS+VP SSL
Sbjct: 181  AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240

Query: 521  SENPVSRFLNIAESSAKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSG 580
            +ENPVSR  N+AESSA+D NRF FPNGEQS V YKEKGVNDGFFCLPLNS+GELIQLNSG
Sbjct: 241  TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300

Query: 581  LINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSH 640
            L +RFDQMN+  N +A S+RIP C+LV+PRSRDYFVDNEKL VDT+LTGNQLTLFPLHSH
Sbjct: 301  LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360

Query: 641  MQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQ 700
            MQE QNRYLPAGFDV EPGTSETADIRLM+SERG ETGRFFHP LMDSPFNRCRYY K Q
Sbjct: 361  MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420

Query: 701  NQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCL 760
            NQNVSTQFYPENSSSMC NPGRQTMRLMGKDVAVGGNG++ QEPEVINF KNS+ +GNCL
Sbjct: 421  NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480

Query: 761  TNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ------------ 820
            TNPIQETHMRKRNFLQDRELH+PS+GETL YHPAGF+GNQ+AQ +LL             
Sbjct: 481  TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540

Query: 821  ----------------MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPS 880
                             L++   +IH F P  T+T NMARNFQAPF+SGLETQRF S PS
Sbjct: 541  TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600

Query: 881  AFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSS 940
            AFSTSHHMCPN Y+NSFELGFNQ+LHPAKLGTFNFPFLQ DD NHV L  S   K     
Sbjct: 601  AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSK----- 660

Query: 941  LGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAY 1000
                   LSPW+LHDHQRE  P ANSKLADINGYYCPC PSG+DVLISPSS+H RLETAY
Sbjct: 661  ------SLSPWILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAY 720

Query: 1001 PCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVK 1060
            PCSTM YSHLQ KNHI GSTSFFQPIP+ PRVLQSPI+NAGH+IRM SEDRLKFNSLSVK
Sbjct: 721  PCSTMAYSHLQTKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVK 780

Query: 1061 DSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANW 1120
            DSDFSSK Q A E V+SRKRQ+  SLE NNSG+VPEWTRGK+SDDHL+S  G  KIHAN 
Sbjct: 781  DSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANR 840

Query: 1121 DKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDL 1180
            DKAVNSVG NIPNMTQTTD ++IS NNNEA KVEC ARSGPIKLTAGAKHILKPSQSMD+
Sbjct: 841  DKAVNSVG-NIPNMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDV 900

BLAST of HG10005218 vs. NCBI nr
Match: XP_022148072.1 (uncharacterized protein LOC111016842 isoform X1 [Momordica charantia])

HSP 1 Score: 1364.4 bits (3530), Expect = 0.0e+00
Identity = 786/1229 (63.95%), Positives = 894/1229 (72.74%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
            MAV  S FSIREYALN R  DL R  WPF + VKKEVAEA+LPP+ V KFRWWS    + 
Sbjct: 1    MAVAPSGFSIREYALNMRGRDLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60

Query: 61   KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
            K   +SE        +++EEE+VII    M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61   KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120

Query: 121  GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
                +RKN   G       +K KSRTPKKRSIAEIFAVAPPV+   +V D         G
Sbjct: 121  ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180

Query: 181  KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
              II     LK TSLA +LV+ +KTI  K               + K+ K K    KNKD
Sbjct: 181  GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240

Query: 241  FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
            FGH  L KKGE RNHKDVS  CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241  FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300

Query: 301  HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
             SVK +SET+ S  NL+GS  QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301  QSVKVVSETBPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360

Query: 361  NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
            NPFQ SE +T SGESNKGVASME   L+D +V FSTRH VDSQ +KGKIQLPN+  QVNA
Sbjct: 361  NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420

Query: 421  Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
            Q         W N KH  E+ IS NR VPH+ N  HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421  QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480

Query: 481  AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
            AAQ+ERQYG VRT+   N  P  H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN 
Sbjct: 481  AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540

Query: 541  VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
            V R  N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541  VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600

Query: 601  DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
            DQMN+  N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE 
Sbjct: 601  DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660

Query: 661  QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
            +N+YL A FDV EPGTS   DIRL+NSERG ++G   H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661  RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720

Query: 721  STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
            ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721  STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780

Query: 781  QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
            QE  MRKRNFLQDR LH+PSKGETL ++PAGF+  Q+AQ +LL                 
Sbjct: 781  QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840

Query: 841  -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
             ++++ +D          NI+ F P  T  FNMA NFQAPFISG  T RFG  P AFSTS
Sbjct: 841  GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900

Query: 901  HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
             HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV                   
Sbjct: 901  QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960

Query: 961  ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
                W+    Q++EAP A SKLADING Y P I SG DVL SP SM  R E A+PCSTMP
Sbjct: 961  -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020

Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
             SH Q+KN IPGSTS FQPIPV PR     I  AGH+ R+S  EDRLKF +LSVKD+D  
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080

Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
            SK QP  EL++SRKRQ++ SLE NNSGVV EWT GKF+D+  +S  G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140

Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
                N+PN+T+ TD V++ +  NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1153

BLAST of HG10005218 vs. NCBI nr
Match: XP_022148073.1 (uncharacterized protein LOC111016842 isoform X2 [Momordica charantia])

HSP 1 Score: 1345.5 bits (3481), Expect = 0.0e+00
Identity = 780/1229 (63.47%), Positives = 888/1229 (72.25%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
            MAV  S FSIR         DL R  WPF + VKKEVAEA+LPP+ V KFRWWS    + 
Sbjct: 1    MAVAPSGFSIR---------DLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60

Query: 61   KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
            K   +SE        +++EEE+VII    M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61   KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120

Query: 121  GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
                +RKN   G       +K KSRTPKKRSIAEIFAVAPPV+   +V D         G
Sbjct: 121  ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180

Query: 181  KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
              II     LK TSLA +LV+ +KTI  K               + K+ K K    KNKD
Sbjct: 181  GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240

Query: 241  FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
            FGH  L KKGE RNHKDVS  CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241  FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300

Query: 301  HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
             SVK +SET+ S  NL+GS  QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301  QSVKVVSETBPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360

Query: 361  NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
            NPFQ SE +T SGESNKGVASME   L+D +V FSTRH VDSQ +KGKIQLPN+  QVNA
Sbjct: 361  NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420

Query: 421  Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
            Q         W N KH  E+ IS NR VPH+ N  HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421  QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480

Query: 481  AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
            AAQ+ERQYG VRT+   N  P  H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN 
Sbjct: 481  AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540

Query: 541  VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
            V R  N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541  VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600

Query: 601  DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
            DQMN+  N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE 
Sbjct: 601  DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660

Query: 661  QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
            +N+YL A FDV EPGTS   DIRL+NSERG ++G   H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661  RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720

Query: 721  STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
            ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721  STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780

Query: 781  QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
            QE  MRKRNFLQDR LH+PSKGETL ++PAGF+  Q+AQ +LL                 
Sbjct: 781  QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840

Query: 841  -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
             ++++ +D          NI+ F P  T  FNMA NFQAPFISG  T RFG  P AFSTS
Sbjct: 841  GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900

Query: 901  HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
             HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV                   
Sbjct: 901  QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960

Query: 961  ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
                W+    Q++EAP A SKLADING Y P I SG DVL SP SM  R E A+PCSTMP
Sbjct: 961  -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020

Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
             SH Q+KN IPGSTS FQPIPV PR     I  AGH+ R+S  EDRLKF +LSVKD+D  
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080

Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
            SK QP  EL++SRKRQ++ SLE NNSGVV EWT GKF+D+  +S  G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140

Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
                N+PN+T+ TD V++ +  NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1144

BLAST of HG10005218 vs. ExPASy TrEMBL
Match: A0A0A0KJS6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1)

HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 953/1220 (78.11%), Positives = 1025/1220 (84.02%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSS--FKE 60
            MA PTS FSIREYALNKRS  LT ISWPFSEKVKKEVAE+LLPPMDVKKFRWWSS     
Sbjct: 1    MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60

Query: 61   EKVSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQTGKEIRRKNK 120
            ++  E EE EE+EVI ERIKMQKICPVCGVFVAATV AVNAHID+CL   T KEIRRKN 
Sbjct: 61   QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKN- 120

Query: 121  GGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDC--EGEEEKAVGKQIIHNN 180
                      LK KSRTPKKRSIAEIFAVAPPVK MI+VNDC  + EE+KAVGKQIIH+N
Sbjct: 121  ---------YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHN 180

Query: 181  NNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQLC 240
             NLKTTSLATSLVS IKTI  KI   TTTE+P+I  L K+KKKKKKKKKKNKDF HG+LC
Sbjct: 181  KNLKTTSLATSLVSAIKTIKNKI--ATTTEEPTI--LAKRKKKKKKKKKKNKDFCHGKLC 240

Query: 241  KKGEIRNHKDVSTLCK-KPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKAI 300
            KKG+IRNHKDVST CK +PCFKRLS+QK++KL KKS VVAKQQRPMPPLRSILKHSVKAI
Sbjct: 241  KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300

Query: 301  SETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQAS 360
            SETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR  SDTFEQNVGNPFQAS
Sbjct: 301  SETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQAS 360

Query: 361  EVSTNSGESNKGVASMEANLSDHVVCF-STRHEVDSQHVKGKIQLPNVQSQVNAQSWDNE 420
            EVSTNSGESNK V SMEANL+D V CF STRH+VDSQHVKGKIQLPN  +QVNAQSW+N 
Sbjct: 361  EVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENP 420

Query: 421  KHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRTR 480
            KHSTEKLI  +RD+PHD+NDLHLFDHVYVDA QKLPP HSAIPALLAAQEER YG VRT+
Sbjct: 421  KHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQ 480

Query: 481  CCLNSVPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSLSENPVSRFLNIAESS 540
            C LN VPQ HSL GKSVDHLI   N FNG AALGS+TS+VP SSL+ENPVSRFLN+AESS
Sbjct: 481  CGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESS 540

Query: 541  AKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNII 600
            A+D NRF   NGEQ  V YKEKGVNDGFFCLPLNS+GELIQLNSGL +RFDQMN+    I
Sbjct: 541  ARDSNRFQISNGEQGVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEANTTI 600

Query: 601  ACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDV 660
            A S+RIP C+ V+PRSRDYFVDNEKL +DT+LTGNQLTLFPLHSHMQE QNRYLPAGFDV
Sbjct: 601  AGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDV 660

Query: 661  AEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSS 720
             EPGTSETADIRLMNSERG ETGRFFHPNLMDSPFNRCRYY K QNQNVS QFYPENSSS
Sbjct: 661  PEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSS 720

Query: 721  MCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFL 780
            MCANPGRQTMRLMGKDVAVGGNG++VQEPEVINFWKNS+ IGNCLTNPIQETHMRKRNFL
Sbjct: 721  MCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFL 780

Query: 781  QDRELHHPSKGETLCYHPAGFYGNQMAQRHLL---------------------------- 840
            QDRELH+PS+GETL YHPAGF+GNQ+AQ +LL                            
Sbjct: 781  QDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESV 840

Query: 841  -QMLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQ 900
              +  +FN NIH F    T+T NMARNFQAPF+SGLETQRF S PSAFSTSHH+CPN Y+
Sbjct: 841  INLNERFN-NIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYE 900

Query: 901  NSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSSLGLTLLELSPWMLH 960
            NSFELGFNQ+LHPAKLGTFNFPFLQPDD NHV L  S   K            LSPW+LH
Sbjct: 901  NSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSK-----------SLSPWILH 960

Query: 961  DHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKN 1020
            DHQRE  P ANSKLAD+NGYYCPC P G+DVLISPSS+H +LETAYPCSTM YSHLQ KN
Sbjct: 961  DHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQTKN 1020

Query: 1021 HIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALEL 1080
            HIPGSTS FQPIP+ PRVL SPI+NAGH+IRM SEDRLKFNSLSVK+SDFSSK Q A E 
Sbjct: 1021 HIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEF 1080

Query: 1081 VNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNM 1140
            V+SRKRQ+  SLE NNSGVVPEWTRGK+SDDHL+S  GTVKIHANWDKAVNSVG NIPNM
Sbjct: 1081 VDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVG-NIPNM 1140

Query: 1141 TQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAG 1180
            TQTTD ++IS NNNEA +VECMARSGPIKLTAGAKHILKPSQSMD+DNTKPTYSTIPSAG
Sbjct: 1141 TQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAG 1191

BLAST of HG10005218 vs. ExPASy TrEMBL
Match: A0A5D3DCZ7 (Putative Zinc finger, Rad18-type OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold75860G00170 PE=4 SV=1)

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 724/934 (77.52%), Positives = 781/934 (83.62%), Query Frame = 0

Query: 281  MPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDF 340
            MPPLRSILK SVKAISETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR  
Sbjct: 1    MPPLRSILKRSVKAISETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRAI 60

Query: 341  SDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLP 400
            SDTFEQNVGNPFQASEV  NSGESNK V SMEANL+D V CF+TRH+VDSQHVKGKIQLP
Sbjct: 61   SDTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFNTRHKVDSQHVKGKIQLP 120

Query: 401  NVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 460
            N  +QVNA+ W+N KHSTEKLI  +RD+PHD+NDLH F HVYVDA QKLP  HSAIPALL
Sbjct: 121  NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180

Query: 461  AAQEERQYGDVRTRCCLNS-VPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSL 520
            A QEER YG VRT+C LN+ VPQ HSL GKSVD+LI   N FNG AALGS+TS+VP SSL
Sbjct: 181  AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240

Query: 521  SENPVSRFLNIAESSAKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSG 580
            +ENPVSR  N+AESSA+D NRF FPNGEQS V YKEKGVNDGFFCLPLNS+GELIQLNSG
Sbjct: 241  TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300

Query: 581  LINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSH 640
            L +RFDQMN+  N +A S+RIP C+LV+PRSRDYFVDNEKL VDT+LTGNQLTLFPLHSH
Sbjct: 301  LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360

Query: 641  MQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQ 700
            MQE QNRYLPAGFDV EPGTSETADIRLM+SERG ETGRFFHP LMDSPFNRCRYY K Q
Sbjct: 361  MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420

Query: 701  NQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCL 760
            NQNVSTQFYPENSSSMC NPGRQTMRLMGKDVAVGGNG++ QEPEVINF KNS+ +GNCL
Sbjct: 421  NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480

Query: 761  TNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ------------ 820
            TNPIQETHMRKRNFLQDRELH+PS+GETL YHPAGF+GNQ+AQ +LL             
Sbjct: 481  TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540

Query: 821  ----------------MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPS 880
                             L++   +IH F P  T+T NMARNFQAPF+SGLETQRF S PS
Sbjct: 541  TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600

Query: 881  AFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSS 940
            AFSTSHHMCPN Y+NSFELGFNQ+LHPAKLGTFNFPFLQ DD NHV L  S   K     
Sbjct: 601  AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSK----- 660

Query: 941  LGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAY 1000
                   LSPW+LHDHQRE  P ANSKLADINGYYCPC PSG+DVLISPSS+H RLETAY
Sbjct: 661  ------SLSPWILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAY 720

Query: 1001 PCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVK 1060
            PCSTM YSHLQ KNHI GSTSFFQPIP+ PRVLQSPI+NAGH+IRM SEDRLKFNSLSVK
Sbjct: 721  PCSTMAYSHLQTKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVK 780

Query: 1061 DSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANW 1120
            DSDFSSK Q A E V+SRKRQ+  SLE NNSG+VPEWTRGK+SDDHL+S  G  KIHAN 
Sbjct: 781  DSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANR 840

Query: 1121 DKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDL 1180
            DKAVNSVG NIPNMTQTTD ++IS NNNEA KVEC ARSGPIKLTAGAKHILKPSQSMD+
Sbjct: 841  DKAVNSVG-NIPNMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDV 900

BLAST of HG10005218 vs. ExPASy TrEMBL
Match: A0A1S3BM77 (uncharacterized protein LOC103491377 OS=Cucumis melo OX=3656 GN=LOC103491377 PE=4 SV=1)

HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 724/934 (77.52%), Positives = 781/934 (83.62%), Query Frame = 0

Query: 281  MPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDF 340
            MPPLRSILK SVKAISETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR  
Sbjct: 1    MPPLRSILKRSVKAISETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRAI 60

Query: 341  SDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLP 400
            SDTFEQNVGNPFQASEV  NSGESNK V SMEANL+D V CF+TRH+VDSQHVKGKIQLP
Sbjct: 61   SDTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFNTRHKVDSQHVKGKIQLP 120

Query: 401  NVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 460
            N  +QVNA+ W+N KHSTEKLI  +RD+PHD+NDLH F HVYVDA QKLP  HSAIPALL
Sbjct: 121  NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180

Query: 461  AAQEERQYGDVRTRCCLNS-VPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSL 520
            A QEER YG VRT+C LN+ VPQ HSL GKSVD+LI   N FNG AALGS+TS+VP SSL
Sbjct: 181  AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240

Query: 521  SENPVSRFLNIAESSAKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSG 580
            +ENPVSR  N+AESSA+D NRF FPNGEQS V YKEKGVNDGFFCLPLNS+GELIQLNSG
Sbjct: 241  TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300

Query: 581  LINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSH 640
            L +RFDQMN+  N +A S+RIP C+LV+PRSRDYFVDNEKL VDT+LTGNQLTLFPLHSH
Sbjct: 301  LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360

Query: 641  MQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQ 700
            MQE QNRYLPAGFDV EPGTSETADIRLM+SERG ETGRFFHP LMDSPFNRCRYY K Q
Sbjct: 361  MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420

Query: 701  NQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCL 760
            NQNVSTQFYPENSSSMC NPGRQTMRLMGKDVAVGGNG++ QEPEVINF KNS+ +GNCL
Sbjct: 421  NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480

Query: 761  TNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ------------ 820
            TNPIQETHMRKRNFLQDRELH+PS+GETL YHPAGF+GNQ+AQ +LL             
Sbjct: 481  TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540

Query: 821  ----------------MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPS 880
                             L++   +IH F P  T+T NMARNFQAPF+SGLETQRF S PS
Sbjct: 541  TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600

Query: 881  AFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSS 940
            AFSTSHHMCPN Y+NSFELGFNQ+LHPAKLGTFNFPFLQ DD NHV L  S   K     
Sbjct: 601  AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSK----- 660

Query: 941  LGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAY 1000
                   LSPW+LHDHQRE  P ANSKLADINGYYCPC PSG+DVLISPSS+H RLETAY
Sbjct: 661  ------SLSPWILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAY 720

Query: 1001 PCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVK 1060
            PCSTM YSHLQ KNHI GSTSFFQPIP+ PRVLQSPI+NAGH+IRM SEDRLKFNSLSVK
Sbjct: 721  PCSTMAYSHLQTKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVK 780

Query: 1061 DSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANW 1120
            DSDFSSK Q A E V+SRKRQ+  SLE NNSG+VPEWTRGK+SDDHL+S  G  KIHAN 
Sbjct: 781  DSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANR 840

Query: 1121 DKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDL 1180
            DKAVNSVG NIPNMTQTTD ++IS NNNEA KVEC ARSGPIKLTAGAKHILKPSQSMD+
Sbjct: 841  DKAVNSVG-NIPNMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDV 900

BLAST of HG10005218 vs. ExPASy TrEMBL
Match: A0A6J1D428 (uncharacterized protein LOC111016842 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016842 PE=4 SV=1)

HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 786/1229 (63.95%), Positives = 894/1229 (72.74%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
            MAV  S FSIREYALN R  DL R  WPF + VKKEVAEA+LPP+ V KFRWWS    + 
Sbjct: 1    MAVAPSGFSIREYALNMRGRDLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60

Query: 61   KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
            K   +SE        +++EEE+VII    M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61   KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120

Query: 121  GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
                +RKN   G       +K KSRTPKKRSIAEIFAVAPPV+   +V D         G
Sbjct: 121  ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180

Query: 181  KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
              II     LK TSLA +LV+ +KTI  K               + K+ K K    KNKD
Sbjct: 181  GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240

Query: 241  FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
            FGH  L KKGE RNHKDVS  CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241  FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300

Query: 301  HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
             SVK +SET+ S  NL+GS  QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301  QSVKVVSETDPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360

Query: 361  NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
            NPFQ SE +T SGESNKGVASME   L+D +V FSTRH VDSQ +KGKIQLPN+  QVNA
Sbjct: 361  NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420

Query: 421  Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
            Q         W N KH  E+ IS NR VPH+ N  HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421  QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480

Query: 481  AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
            AAQ+ERQYG VRT+   N  P  H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN 
Sbjct: 481  AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540

Query: 541  VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
            V R  N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541  VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600

Query: 601  DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
            DQMN+  N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE 
Sbjct: 601  DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660

Query: 661  QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
            +N+YL A FDV EPGTS   DIRL+NSERG ++G   H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661  RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720

Query: 721  STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
            ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721  STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780

Query: 781  QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
            QE  MRKRNFLQDR LH+PSKGETL ++PAGF+  Q+AQ +LL                 
Sbjct: 781  QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840

Query: 841  -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
             ++++ +D          NI+ F P  T  FNMA NFQAPFISG  T RFG  P AFSTS
Sbjct: 841  GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900

Query: 901  HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
             HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV                   
Sbjct: 901  QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960

Query: 961  ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
                W+    Q++EAP A SKLADING Y P I SG DVL SP SM  R E A+PCSTMP
Sbjct: 961  -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020

Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
             SH Q+KN IPGSTS FQPIPV PR     I  AGH+ R+S  EDRLKF +LSVKD+D  
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080

Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
            SK QP  EL++SRKRQ++ SLE NNSGVV EWT GKF+D+  +S  G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140

Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
                N+PN+T+ TD V++ +  NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1153

BLAST of HG10005218 vs. ExPASy TrEMBL
Match: A0A6J1D325 (uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016842 PE=4 SV=1)

HSP 1 Score: 1344.3 bits (3478), Expect = 0.0e+00
Identity = 780/1229 (63.47%), Positives = 888/1229 (72.25%), Query Frame = 0

Query: 1    MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
            MAV  S FSIR         DL R  WPF + VKKEVAEA+LPP+ V KFRWWS    + 
Sbjct: 1    MAVAPSGFSIR---------DLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60

Query: 61   KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
            K   +SE        +++EEE+VII    M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61   KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120

Query: 121  GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
                +RKN   G       +K KSRTPKKRSIAEIFAVAPPV+   +V D         G
Sbjct: 121  ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180

Query: 181  KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
              II     LK TSLA +LV+ +KTI  K               + K+ K K    KNKD
Sbjct: 181  GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240

Query: 241  FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
            FGH  L KKGE RNHKDVS  CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241  FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300

Query: 301  HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
             SVK +SET+ S  NL+GS  QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301  QSVKVVSETDPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360

Query: 361  NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
            NPFQ SE +T SGESNKGVASME   L+D +V FSTRH VDSQ +KGKIQLPN+  QVNA
Sbjct: 361  NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420

Query: 421  Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
            Q         W N KH  E+ IS NR VPH+ N  HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421  QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480

Query: 481  AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
            AAQ+ERQYG VRT+   N  P  H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN 
Sbjct: 481  AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540

Query: 541  VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
            V R  N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541  VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600

Query: 601  DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
            DQMN+  N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE 
Sbjct: 601  DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660

Query: 661  QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
            +N+YL A FDV EPGTS   DIRL+NSERG ++G   H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661  RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720

Query: 721  STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
            ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721  STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780

Query: 781  QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
            QE  MRKRNFLQDR LH+PSKGETL ++PAGF+  Q+AQ +LL                 
Sbjct: 781  QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840

Query: 841  -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
             ++++ +D          NI+ F P  T  FNMA NFQAPFISG  T RFG  P AFSTS
Sbjct: 841  GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900

Query: 901  HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
             HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV                   
Sbjct: 901  QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960

Query: 961  ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
                W+    Q++EAP A SKLADING Y P I SG DVL SP SM  R E A+PCSTMP
Sbjct: 961  -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020

Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
             SH Q+KN IPGSTS FQPIPV PR     I  AGH+ R+S  EDRLKF +LSVKD+D  
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080

Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
            SK QP  EL++SRKRQ++ SLE NNSGVV EWT GKF+D+  +S  G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140

Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
                N+PN+T+ TD V++ +  NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1144

The following BLAST results are available for this feature:

NCBI nr
Match Name      E-value  Identity (%)  Description
XP_038888639.1  0.0e+00  79.39         uncharacterized protein LOC120078436 [Benincasa hispida]
XP_011657559.1  0.0e+00  78.11         uncharacterized protein LOC105435872 [Cucumis sativus] >KGN47991.1 hypothetical ...
XP_008449514.1  0.0e+00  77.52         PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo] >KAA0061673.1 put...
XP_022148072.1  0.0e+00  63.95         uncharacterized protein LOC111016842 isoform X1 [Momordica charantia]
XP_022148073.1  0.0e+00  63.47         uncharacterized protein LOC111016842 isoform X2 [Momordica charantia]

ExPASy TrEMBL
Match Name  E-value  Identity (%)  Description
A0A0A0KJS6  0.0e+00  78.11         Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1
A0A5D3DCZ7  0.0e+00  77.52         Putative Zinc finger, Rad18-type OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
A0A1S3BM77  0.0e+00  77.52         uncharacterized protein LOC103491377 OS=Cucumis melo OX=3656 GN=LOC103491377 PE=...
A0A6J1D428  0.0e+00  63.95         uncharacterized protein LOC111016842 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1D325  0.0e+00  63.47         uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 G...

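
The identity values reported for each match are the number of identical positions divided by the alignment length, as given on the HSP summary lines of the alignments above (for example 780/1229 = 63.47%). The Python sketch below parses one such summary line and recomputes both percentages as a sanity check. The function name `parse_hsp_summary` and the returned field names are illustrative assumptions, not part of any BLAST tool or of this database.

```python
import re

# Pattern for the HSP summary line format used on this page.
# "Posi?tives" accepts both the spelling "Positives" and the variant "Postives".
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_summary(line):
    """Parse an HSP summary line and recompute its percentages (hypothetical helper)."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    identities, length, _, positives, pos_length, _ = match.groups()
    identities, length = int(identities), int(length)
    positives, pos_length = int(positives), int(pos_length)
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        # Percentages are recomputed from the counts, not copied from the text.
        "identity_pct": round(100.0 * identities / length, 2),
        "positive_pct": round(100.0 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 780/1229 (63.47%), Positives = 888/1229 (72.25%), Query Frame = 0"
    stats = parse_hsp_summary(line)
    # Recomputed values agree with the reported 63.47 and 72.25.
    print(stats["identity_pct"], stats["positive_pct"])
```

Recomputing from the counts rather than trusting the printed percentage makes it easy to spot a summary line that was damaged during export.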
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description               Source       Source Term    Source Description    Alignment
IPR006642  Rad18, zinc finger UBZ4-type  SMART        SM00734        c2hc_5                coord: 80..105; e-value: 0.0018; score: 27.6
None       No IPR available              GENE3D       3.30.160.60    Classic Zinc Finger   coord: 82..109; e-value: 1.3E-5; score: 26.7
None       No IPR available              MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 115..135
None       No IPR available              MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 214..231
None       No IPR available              MOBIDB_LITE  mobidb-lite    disorder_prediction   coord: 203..236
None       No IPR available              PANTHER      PTHR36892:SF1  OS01G0201800 PROTEIN  coord: 1..1174
None       No IPR available              PANTHER      PTHR36892      OS01G0201800 PROTEIN  coord: 1..1174
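
The two zinc-finger annotations above (SMART at 80..105 and GENE3D at 82..109) cover nearly the same stretch of the protein. A minimal Python sketch of how such `coord: start..end` entries could be parsed and compared follows; the helper names `parse_coord` and `overlap` are hypothetical and chosen only for this illustration, and coordinates are assumed to be 1-based and inclusive.

```python
import re

# Matches the "coord: start..end" notation used in the Alignment column.
COORD = re.compile(r"coord: (\d+)\.\.(\d+)")


def parse_coord(text):
    """Return (start, end) from a 'coord: start..end' string (hypothetical helper)."""
    match = COORD.search(text)
    if match is None:
        raise ValueError("no coordinates found in %r" % text)
    return int(match.group(1)), int(match.group(2))


def overlap(a, b):
    """Return the shared inclusive interval of two (start, end) pairs, or None."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return (start, end) if start <= end else None


if __name__ == "__main__":
    smart = parse_coord("coord: 80..105")   # SMART SM00734
    gene3d = parse_coord("coord: 82..109")  # GENE3D 3.30.160.60
    shared = overlap(smart, gene3d)
    # Inclusive length of the shared region, assuming 1-based inclusive coordinates.
    print(shared, shared[1] - shared[0] + 1)
```

The same comparison shows that the MobiDB-lite region 214..231 lies entirely within the longer predicted disordered region 203..236.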

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10005218.1  HG10005218.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0006281      DNA repair
molecular_function  GO:0003677      DNA binding