Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGGTAAGTACATACATGATGTTTTATTTACTGAGCTATGTTTTTTTTTGAAAAAATTAAAATTCATTGTGGTATTTTTTATTTTTTTTGTTTCGGTAGAGAGTATGCTTTGAATAAGAGAAGCACGGATTTAACGAGAATTAGTTGGCCATTTAGCGAGAAAGTAAAGAAAGAAGTGGCAGAAGCCTTGCTTCCACCAATGGATGTGAAGAAATTTCGTTGGTGGTCGTCGTTTAAAGAGGAAAAGGTTAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGTAATTATAGAGAGAATTAAAATGCAAAAGATTTGTCCGGTTTGTGGGGTTTTTGTTGCAGCTACGGTGAACGCGGTGAATGCACATATTGATAGTTGTTTAAACGCTCAAACAGGCAAAGAAATTAGGAGAAAGAACAAAGGAGGAGGAGGAGGAGGAGGTAATTTGAATTTGAAGGGAAAATCAAGAACGCCAAAAAAGAGATCAATTGCCGAAATCTTTGCAGTGGCTCCGCCAGTAAAAGCAATGATTATTGTTAATGATTGTGAAGGAGAAGAAGAAAAAGCCGTTGGGAAACAAATTATTCACAACAACAACAACCTCAAAACGACGTCGTTGGCTACAAGTCTTGTCTCCACAATCAAGACAATCAACACCAAAATCACAACAACAACAACAACGGAACAACCCTCAATTGATCTTCTCAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAATAAGGTATGTATTGTATATATGCAATTTTTCATTTTCATTTTGATTAATTTTGGGATTTTTTTTTTTAATTATTTTCTTAGAGGTAGATTTTAGTAGACTGCAAATATTCTCTCAAATTGGGGGGTTTTTGTTTTTGGTAGCTCCTTTCTCTTTCTGCTCATAATTACTGCTGGAAAAAGCTGGTGAAAAGACTGCATTACGTTGGAGAGAAAAAGAGAGACAGTGAAAGAGAGAGAGAGAAAAAAAAAAAACAAGAAATAGTACTTGTTTTTTGTGTGATTAAGGGCGCGGAAATAGGCTGATTCACAATTGATTCACAATTGCAAAAAAATAAAAATAAAAATAAAAATAAAAAATAAAAAGTCACTACAAGACGGGTTTGTATCGTTTGTGAAGGCTTTCATTTGAGCCTGATTATATATATGTATGTATGTATATAATGCTCCAGCTGCAAGAGTACGGTCTTGCCAGTGAGTGACTTTATTTGGTGGAAGAGTTTTTTTTGTCCTTTGTTTAACAAATTTATCCTAGGTTAATGATGCTGTTTTTAACTAGTTTCGTAGTGATTAAATTATAATTTCTTACTTGAGAAATTATTTTAGATTTCTAGTAGTAGTATTAGAGATGAGTGTCCTCGATCGAACGCGTTTAGCGTTTAGGGTTTAGGGTTTTTGAAATAAAACATTTGAGTTTGTTGTAATGGGTTTTGTTTCACTATCATCTCCATTTTGAGGTACTTGGATTTTGACAACTTTTTGACCAAATTTGCTACGCACTAACATTTTTTTGCAACCCATTATCATATGGGGATTATTTTATAAGTTATGCTTGAGTTTTGCTTCAATGATCAAGTACTGAACTGCAAAACTTGAATCTAATTCTTTTTCTCCAACAATCTTTATGAAATGAGAGTTTATAAGTCATTTTTTTGAAGAATAGAACATGAATGTAAATAGTTTTTTAACTATTTTATGATGCTTTTTTGCTGATTAGTAGAGGGAATTGAAGAGACAATGGAATGTTGTGGATGAGAGAGAGAGAGTTGAGAAGGCAATCTCTTTTAGGATAGTGCATTTCATTATTTTGTTGTTAATTAATTCAAGGGCCCTCTTCCCTTATTTATGAGTAGTCTTTGTCTTTGTCCTTTAACAAATTGTCTTATTTGATAATTATGCTTTTTGTCTTATTGTTTTATAAGATGTAAGTGTTTGACCATTGAGAAT
GGAAGAATATATAATGAGGTTTTGTGGGTGAAGATTGCTAGATAGATTAGATGACATCATTATTATATATTAATAACCAAATTAGTGACAACATTTTCTCCTACTATGTAGCTACTAGACCTTAATTTATGATAGGATTGAAATTTTATGTGTTCAAGTTAGAGGGTTGAGGTGCTGCAATAACCCACTATAATTAGAATGAATGTTCTTTGCCAACCATTATGCTTCATATGCTTATTCTTTGAGATTAATTTGATCAACTTCTTGTTTCTTCTTACATTTCATGAAATAGATCTAGAAATATAACAAAATCATTTTCAAGTTTAGCATTGAAGCCATTAGCTTCCATGTCCACTGACTTGATGAGTTCCTTTGCCTCTCCTTTATAATGATGCTTTGTAAACTATTTTCACCCTTTATTTTTGTGTAAGAAAACACATAAAATATATTTTTTGACAGTTCACTTTTAACATCCATCAATGTTGAGTTGGGACTTGTTTGGATTGACTTTTAAAGTGCTTAAACATAATTTCTTAAGTACTTAAAACGTCAATTCAAACAAGTCCTTTGATAGAATATGAAAGGAAAGCTTGTACTCTTTGTATTTACTTTTAAGATTTTAATTATAATTAGTTTCTGACATTTTCTATTGGTTATAGGATTTTGGTCATGGGCAACTTTGCAAGAAGGGAGAGATCAGAAATCACAAGGATGTTTCTACTCTTTGTAAGAAACCATGTTTTAAACGCTTGTCTAGACAAAAAAGGCAAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTAATCAATTTAAGAGGCAGCAATAATCAAGTGTTCAACAATAGTGGTCAAAAGTCTGATAGGCACGTTAGTTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAGACTTTTCTGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGCCTCAGAAGTAAGCACTAATTCAGGTGAAAGTAATAAAGGAGTTGCTTCCATGGAGGCGAATTTAAGTGACCATGTTGTTTGCTTTAGCACCCGACACGAAGTTGATAGTCAACATGTGAAAGGAAAGATTCAGTTGCCTAATGTTCAGAGTCAGGTTAATGCTCAAAGTTGGGACAATGAGAAGCATTCGACCGAGAAGTTGATATCGACAAATCGGGATGTTCCTCATGATCAAAATGATTTGCATTTGTTTGACCATGTCTATGTAGATGCACCTCAGAAGCTGCCACCAGTACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCAATATGGCGATGTAAGAACTCGATGTTGTTTAAATTCAGTCCCACAAGTTCATTCTCTTAATGGAAAATCAGTTGATCATTTGATAAATCCTTTCAATGGAGCAGCTGCTTTAGGCTCAATTACAAGCAAAGTGCCTTCTTCTTTAAGTGAAAATCCTGTTAGCAGATTTCTTAATATAGCTGAATCTTCTGCTAAAGACAATAGATTTCCATTTCCGAATGGGGAGCAAAGTGCGGTCGCCTACAAAGAGAAGGGCGTAAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCAGGTTTGATTAATAGGTTTGATCAAATGAATGACACCGGTAACATTATAGCATGTTCTAACAGAATACCGGCATGCAGTCTCGTCCTGCCAAGGAGCAGGGATTATTTTGTAGACAATGAGAAGCTCCTTGTTGACACAGAACTTACTGGAAACCAGTTAACTTTATTTCCATTGCATAGTCATATGCAAGAATATCAAAATCGATATTTGCCAGCTGGATTCGACGTCGCTGAGCCTGGAACTTCGGAAACAGCTGATATTAGACTGATGAATTCAGAAAGGGGAAATGAAACTGGAAGGTTTTTT
CACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTATGGAAAGTTGCAGAACCAAAATGTAAGTACACAGTTTTATCCTGAAAATTCAAGTAGCATGTGTGCGAATCCCGGTCGGCAAACGATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGGCAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAAACTTCATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATCATCCATCAAAAGGAGAAACCTTGTGTTATCATCCTGCAGGCTTTTATGGCAATCAAATGGCACAAAGGCATTTATTGCAAATGCTTCACAAGTTATGTACCCCCATCCGCGCTTCAATCGAAAAAGCAGTATAATGTATCAAAGACCTGACTCTGTCATCAACTTAAATGAAAGATTCAACGACAACATCCATGGTTTTTCTCCCTTGTTGACCAACACCTTTAATATGGCACGAAACTTTCAAGCACCCTTTATTTCTGGTCTGGAAACACAAAGGTTTGGTTCACATCCATCAGCATTTTCTACTTCTCACCACATGTGTCCAAATAGTTATCAAAATTCTTTTGAACTTGGCTTCAACCAGAATCTACATCCAGCAAAATTAGGAACCTTCAACTTCCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAGAGCTGTCCCCATGGATGTTACACGATCACCAACGGGAAGAAGCGCCAATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTGTCCATGTATTCCTTCTGGCTCAGATGTTCTCATTAGCCCCTCTTCCATGCATCAAAGGCTTGAAACTGCCTATCCTTGCAGTACAATGCCATATTCTCACTTACAGATGAAGAATCATATCCCGGGTTCGACATCTTTTTTTCAACCAATTCCTGTTGGTCCAAGAGTACTTCAATCGCCAATTTCCAATGCAGGCCATCAAATTAGAATGAGCTCTGAGGACAGGTTGAAGTTCAACTCTTTGAGTGTCAAGGACTCTGATTTTTCAAGTAAAACACAACCGGCTCTAGAGCTGGTCAATTCGAGGAAGCGTCAAAGGGTATCGAGTTTAGAAATGAACAATTCAGGTGTTGTGCCAGAGTGGACAAGAGGAAAATTCAGTGATGATCACCTGCAATCTTACTCGGGGACGGTGAAAATCCATGCTAACTGGGACAAAGCTGTTAATTCAGTAGGAAATAATATCCCAAATATGACTCAAACTACTGATGAAGTAATGATTTCTACCAACAATAATGAAGCTCCTAAGGTTGAATGTATGGCAAGATCCGGCCCCATCAAGTTAACAGCAGGAGCAAAACACATCCTGAAACCAAGTCAGAGTATGGATCTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGTTAGCTTGGTAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTCTAA
mRNA sequence
ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTATGCTTTGAATAAGAGAAGCACGGATTTAACGAGAATTAGTTGGCCATTTAGCGAGAAAGTAAAGAAAGAAGTGGCAGAAGCCTTGCTTCCACCAATGGATGTGAAGAAATTTCGTTGGTGGTCGTCGTTTAAAGAGGAAAAGGTTAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGTAATTATAGAGAGAATTAAAATGCAAAAGATTTGTCCGGTTTGTGGGGTTTTTGTTGCAGCTACGGTGAACGCGGTGAATGCACATATTGATAGTTGTTTAAACGCTCAAACAGGCAAAGAAATTAGGAGAAAGAACAAAGGAGGAGGAGGAGGAGGAGGTAATTTGAATTTGAAGGGAAAATCAAGAACGCCAAAAAAGAGATCAATTGCCGAAATCTTTGCAGTGGCTCCGCCAGTAAAAGCAATGATTATTGTTAATGATTGTGAAGGAGAAGAAGAAAAAGCCGTTGGGAAACAAATTATTCACAACAACAACAACCTCAAAACGACGTCGTTGGCTACAAGTCTTGTCTCCACAATCAAGACAATCAACACCAAAATCACAACAACAACAACAACGGAACAACCCTCAATTGATCTTCTCAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAATAAGGATTTTGGTCATGGGCAACTTTGCAAGAAGGGAGAGATCAGAAATCACAAGGATGTTTCTACTCTTTGTAAGAAACCATGTTTTAAACGCTTGTCTAGACAAAAAAGGCAAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTAATCAATTTAAGAGGCAGCAATAATCAAGTGTTCAACAATAGTGGTCAAAAGTCTGATAGGCACGTTAGTTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAGACTTTTCTGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGCCTCAGAAGTAAGCACTAATTCAGGTGAAAGTAATAAAGGAGTTGCTTCCATGGAGGCGAATTTAAGTGACCATGTTGTTTGCTTTAGCACCCGACACGAAGTTGATAGTCAACATGTGAAAGGAAAGATTCAGTTGCCTAATGTTCAGAGTCAGGTTAATGCTCAAAGTTGGGACAATGAGAAGCATTCGACCGAGAAGTTGATATCGACAAATCGGGATGTTCCTCATGATCAAAATGATTTGCATTTGTTTGACCATGTCTATGTAGATGCACCTCAGAAGCTGCCACCAGTACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCAATATGGCGATGTAAGAACTCGATGTTGTTTAAATTCAGTCCCACAAGTTCATTCTCTTAATGGAAAATCAGTTGATCATTTGATAAATCCTTTCAATGGAGCAGCTGCTTTAGGCTCAATTACAAGCAAAGTGCCTTCTTCTTTAAGTGAAAATCCTGTTAGCAGATTTCTTAATATAGCTGAATCTTCTGCTAAAGACAATAGATTTCCATTTCCGAATGGGGAGCAAAGTGCGGTCGCCTACAAAGAGAAGGGCGTAAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCAGGTTTGATTAATAGGTTTGATCAAATGAATGACACCGGTAACATTATAGCATGTTCTAACAGAATACCGGCATGCAGTCTCGTCCTGCCAAGGAGCAGGGATTATTTTGTAGACAATGAGAAGCTCCTTGTTGACACAGAACTTACTGGAAACCAGTTAACTTTATTTCCATTGCATAGTCATATGCAAGAATATCAAAATCGATATTTGCCAGCTGGATTCGACGTCGCTGAGCCTGGAACTTCGGAAACAGCTGATATTAGACTGATGAATTCAGAAAG
GGGAAATGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTATGGAAAGTTGCAGAACCAAAATGTAAGTACACAGTTTTATCCTGAAAATTCAAGTAGCATGTGTGCGAATCCCGGTCGGCAAACGATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGGCAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAAACTTCATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATCATCCATCAAAAGGAGAAACCTTGTGTTATCATCCTGCAGGCTTTTATGGCAATCAAATGGCACAAAGGCATTTATTGCAAATGCTTCACAAATTCAACGACAACATCCATGGTTTTTCTCCCTTGTTGACCAACACCTTTAATATGGCACGAAACTTTCAAGCACCCTTTATTTCTGGTCTGGAAACACAAAGGTTTGGTTCACATCCATCAGCATTTTCTACTTCTCACCACATGTGTCCAAATAGTTATCAAAATTCTTTTGAACTTGGCTTCAACCAGAATCTACATCCAGCAAAATTAGGAACCTTCAACTTCCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAGAGCTGTCCCCATGGATGTTACACGATCACCAACGGGAAGAAGCGCCAATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTGTCCATGTATTCCTTCTGGCTCAGATGTTCTCATTAGCCCCTCTTCCATGCATCAAAGGCTTGAAACTGCCTATCCTTGCAGTACAATGCCATATTCTCACTTACAGATGAAGAATCATATCCCGGGTTCGACATCTTTTTTTCAACCAATTCCTGTTGGTCCAAGAGTACTTCAATCGCCAATTTCCAATGCAGGCCATCAAATTAGAATGAGCTCTGAGGACAGGTTGAAGTTCAACTCTTTGAGTGTCAAGGACTCTGATTTTTCAAGTAAAACACAACCGGCTCTAGAGCTGGTCAATTCGAGGAAGCGTCAAAGGGTATCGAGTTTAGAAATGAACAATTCAGGTGTTGTGCCAGAGTGGACAAGAGGAAAATTCAGTGATGATCACCTGCAATCTTACTCGGGGACGGTGAAAATCCATGCTAACTGGGACAAAGCTGTTAATTCAGTAGGAAATAATATCCCAAATATGACTCAAACTACTGATGAAGTAATGATTTCTACCAACAATAATGAAGCTCCTAAGGTTGAATGTATGGCAAGATCCGGCCCCATCAAGTTAACAGCAGGAGCAAAACACATCCTGAAACCAAGTCAGAGTATGGATCTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGTTAGCTTGGTAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTCTAA
Coding sequence (CDS)
ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTATGCTTTGAATAAGAGAAGCACGGATTTAACGAGAATTAGTTGGCCATTTAGCGAGAAAGTAAAGAAAGAAGTGGCAGAAGCCTTGCTTCCACCAATGGATGTGAAGAAATTTCGTTGGTGGTCGTCGTTTAAAGAGGAAAAGGTTAGTGAAGAAGAAGAAGAAGAAGAAGAAGAAGTAATTATAGAGAGAATTAAAATGCAAAAGATTTGTCCGGTTTGTGGGGTTTTTGTTGCAGCTACGGTGAACGCGGTGAATGCACATATTGATAGTTGTTTAAACGCTCAAACAGGCAAAGAAATTAGGAGAAAGAACAAAGGAGGAGGAGGAGGAGGAGGTAATTTGAATTTGAAGGGAAAATCAAGAACGCCAAAAAAGAGATCAATTGCCGAAATCTTTGCAGTGGCTCCGCCAGTAAAAGCAATGATTATTGTTAATGATTGTGAAGGAGAAGAAGAAAAAGCCGTTGGGAAACAAATTATTCACAACAACAACAACCTCAAAACGACGTCGTTGGCTACAAGTCTTGTCTCCACAATCAAGACAATCAACACCAAAATCACAACAACAACAACAACGGAACAACCCTCAATTGATCTTCTCAAGAAAAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAATAAGGATTTTGGTCATGGGCAACTTTGCAAGAAGGGAGAGATCAGAAATCACAAGGATGTTTCTACTCTTTGTAAGAAACCATGTTTTAAACGCTTGTCTAGACAAAAAAGGCAAAAACTAGTTAAAAAATCCAATGTAGTTGCCAAGCAACAGAGGCCAATGCCTCCACTTAGGAGCATTTTGAAGCATAGTGTAAAAGCAATTTCTGAGACAAACTCTTCATTAATCAATTTAAGAGGCAGCAATAATCAAGTGTTCAACAATAGTGGTCAAAAGTCTGATAGGCACGTTAGTTTCTTGGATAAGGATGATGTTCTTGGTCCAAGCACTAGAGACTTTTCTGATACCTTTGAACAAAATGTTGGCAATCCATTTCAAGCCTCAGAAGTAAGCACTAATTCAGGTGAAAGTAATAAAGGAGTTGCTTCCATGGAGGCGAATTTAAGTGACCATGTTGTTTGCTTTAGCACCCGACACGAAGTTGATAGTCAACATGTGAAAGGAAAGATTCAGTTGCCTAATGTTCAGAGTCAGGTTAATGCTCAAAGTTGGGACAATGAGAAGCATTCGACCGAGAAGTTGATATCGACAAATCGGGATGTTCCTCATGATCAAAATGATTTGCATTTGTTTGACCATGTCTATGTAGATGCACCTCAGAAGCTGCCACCAGTACATTCTGCTATTCCTGCTCTATTAGCTGCACAAGAAGAAAGGCAATATGGCGATGTAAGAACTCGATGTTGTTTAAATTCAGTCCCACAAGTTCATTCTCTTAATGGAAAATCAGTTGATCATTTGATAAATCCTTTCAATGGAGCAGCTGCTTTAGGCTCAATTACAAGCAAAGTGCCTTCTTCTTTAAGTGAAAATCCTGTTAGCAGATTTCTTAATATAGCTGAATCTTCTGCTAAAGACAATAGATTTCCATTTCCGAATGGGGAGCAAAGTGCGGTCGCCTACAAAGAGAAGGGCGTAAATGATGGATTTTTCTGCCTGCCATTGAACTCAAAGGGTGAACTGATACAGCTAAATTCAGGTTTGATTAATAGGTTTGATCAAATGAATGACACCGGTAACATTATAGCATGTTCTAACAGAATACCGGCATGCAGTCTCGTCCTGCCAAGGAGCAGGGATTATTTTGTAGACAATGAGAAGCTCCTTGTTGACACAGAACTTACTGGAAACCAGTTAACTTTATTTCCATTGCATAGTCATATGCAAGAATATCAAAATCGATATTTGCCAGCTGGATTCGACGTCGCTGAGCCTGGAACTTCGGAAACAGCTGATATTAGACTGATGAATTCAGAAAG
GGGAAATGAAACTGGAAGGTTTTTTCACCCAAACTTGATGGATTCTCCATTTAACAGATGCAGGTACTATGGAAAGTTGCAGAACCAAAATGTAAGTACACAGTTTTATCCTGAAAATTCAAGTAGCATGTGTGCGAATCCCGGTCGGCAAACGATGCGGTTGATGGGCAAAGATGTAGCTGTTGGTGGAAATGGGCAAGAAGTTCAAGAACCTGAAGTTATAAACTTTTGGAAGAACTCAAACTTCATTGGGAACTGCCTGACCAATCCTATCCAAGAGACTCACATGAGAAAAAGAAACTTTCTGCAAGATAGGGAGTTGCATCATCCATCAAAAGGAGAAACCTTGTGTTATCATCCTGCAGGCTTTTATGGCAATCAAATGGCACAAAGGCATTTATTGCAAATGCTTCACAAATTCAACGACAACATCCATGGTTTTTCTCCCTTGTTGACCAACACCTTTAATATGGCACGAAACTTTCAAGCACCCTTTATTTCTGGTCTGGAAACACAAAGGTTTGGTTCACATCCATCAGCATTTTCTACTTCTCACCACATGTGTCCAAATAGTTATCAAAATTCTTTTGAACTTGGCTTCAACCAGAATCTACATCCAGCAAAATTAGGAACCTTCAACTTCCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCTTTCTTGCAGCCAGATGATGAAAATCATGTCCAGCTCCCTTGGTCTCACACTTCTAGAGCTGTCCCCATGGATGTTACACGATCACCAACGGGAAGAAGCGCCAATCGCAAATTCTAAACTCGCTGACATAAATGGATACTATTGTCCATGTATTCCTTCTGGCTCAGATGTTCTCATTAGCCCCTCTTCCATGCATCAAAGGCTTGAAACTGCCTATCCTTGCAGTACAATGCCATATTCTCACTTACAGATGAAGAATCATATCCCGGGTTCGACATCTTTTTTTCAACCAATTCCTGTTGGTCCAAGAGTACTTCAATCGCCAATTTCCAATGCAGGCCATCAAATTAGAATGAGCTCTGAGGACAGGTTGAAGTTCAACTCTTTGAGTGTCAAGGACTCTGATTTTTCAAGTAAAACACAACCGGCTCTAGAGCTGGTCAATTCGAGGAAGCGTCAAAGGGTATCGAGTTTAGAAATGAACAATTCAGGTGTTGTGCCAGAGTGGACAAGAGGAAAATTCAGTGATGATCACCTGCAATCTTACTCGGGGACGGTGAAAATCCATGCTAACTGGGACAAAGCTGTTAATTCAGTAGGAAATAATATCCCAAATATGACTCAAACTACTGATGAAGTAATGATTTCTACCAACAATAATGAAGCTCCTAAGGTTGAATGTATGGCAAGATCCGGCCCCATCAAGTTAACAGCAGGAGCAAAACACATCCTGAAACCAAGTCAGAGTATGGATCTAGATAATACTAAGCCTACTTATTCAACAATTCCTTCTGCTGGATTAGTTCATAGTGTTAGCTTGGTAGGATCTCAAAAGAAGTCAACTAAAGTATACAGTTTCTAA
Protein sequence
MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSSFKEEKVSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQTGKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVGKQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPSSLSENPVSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQMLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAGLVHSVSLVGSQKKSTKVYSF
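The protein sequence above should follow from the coding sequence by codon-wise translation. A minimal Python sketch of that check is given here, assuming the standard nuclear genetic code (NCBI translation table 1); the page itself does not state which table was used. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration only.

```python
# Sketch: translate a CDS into a protein using the standard genetic code.
# Assumption: NCBI translation table 1 applies to this feature.

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    # Walk the sequence codon by codon; ignore a trailing partial codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' = unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 13 codons of the CDS listed above.
print(translate("ATGGCCGTTCCCACCTCCGCTTTCTCCATCCGAGAGTAT"))
```

Applied to the first 13 codons of the CDS, this yields MAVPTSAFSIREY, which matches the start of the protein sequence. The last four codons (TACAGTTTCTAA) give YSF followed by a stop, matching its end.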
Homology
BLAST of HG10005218 vs. NCBI nr
Match:
XP_038888639.1 (uncharacterized protein LOC120078436 [Benincasa hispida])
HSP 1 Score: 1832.4 bits (4745), Expect = 0.0e+00
Identity = 967/1218 (79.39%), Positives = 1023/1218 (83.99%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSSFKEEK 60
MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS
Sbjct: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS------ 60
Query: 61 VSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQ-TGKEIRRKNKG 120
SE EEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLN+Q T KEIR+K
Sbjct: 61 -SERVISEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNSQITSKEIRKK--- 120
Query: 121 GGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDC--EGEEEKAVGKQII---H 180
LK KSRTPKKRSIA+IFAVAPPVK MII NDC E EE+KAVGKQII +
Sbjct: 121 ---------LKAKSRTPKKRSIADIFAVAPPVKTMIIANDCCDEEEEKKAVGKQIIRHNN 180
Query: 181 NNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQ 240
NNNNLKTTSLATSLVSTIKTIN TTT EQPSI KKK KDFGHGQ
Sbjct: 181 NNNNLKTTSLATSLVSTIKTIN----TTTEQEQPSI-----------LHKKKKKDFGHGQ 240
Query: 241 LCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKA 300
LC+KGEIRNHKDVSTLCKKPCFKRL RQKR+KLVKKSNVVAKQQRPMP LRSILKHSVKA
Sbjct: 241 LCRKGEIRNHKDVSTLCKKPCFKRLCRQKRKKLVKKSNVVAKQQRPMPLLRSILKHSVKA 300
Query: 301 ISETNSSLINLRGSNNQVFNN-SGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQ 360
SETN S INLRG+NNQVFNN GQKSDR VSFLDKDDVLG ST FSDTFEQNVGNPFQ
Sbjct: 301 TSETNFSSINLRGNNNQVFNNGGGQKSDRRVSFLDKDDVLGLSTEVFSDTFEQNVGNPFQ 360
Query: 361 ASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNAQSWDN 420
ASEVSTNSGESNK VA +EANL+D VCFST+HEVD QH KGKIQLPN +QVNA+SWDN
Sbjct: 361 ASEVSTNSGESNKEVAPVEANLNDD-VCFSTQHEVDGQHAKGKIQLPNFHNQVNAESWDN 420
Query: 421 EKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRT 480
KHSTE LIS N+D+PHDQNDL LFDHVYVD QKL PVHSAIPALLAAQEERQYG VRT
Sbjct: 421 AKHSTENLISKNQDIPHDQNDLRLFDHVYVDGLQKLSPVHSAIPALLAAQEERQYGHVRT 480
Query: 481 RCCLNSVPQVHSLNGKSVDHLINPF-NGAAALGSITSKVP-SSLSENPVSRFLNIAESSA 540
+C LNS+ Q HSL GKS DHLINPF NG AALGSITS+VP SSLSENPVSRFLN+AESS
Sbjct: 481 QCGLNSIRQAHSLYGKSTDHLINPFNNGVAALGSITSRVPSSSLSENPVSRFLNLAESSI 540
Query: 541 KDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNIIAC 600
KD FPF NGE+S V+YKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMN+ N IAC
Sbjct: 541 KDTIFPFSNGEESMVSYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNEASNTIAC 600
Query: 601 SNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDVAE 660
S+RIP CSLVLPRSRDYF+DNEKLLVDTELTGNQLTLFPLHSH+ E QNRY PAGFD++E
Sbjct: 601 SSRIPVCSLVLPRSRDYFIDNEKLLVDTELTGNQLTLFPLHSHLPENQNRYFPAGFDISE 660
Query: 661 PG-TSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSSM 720
PG TSETADIRLMNSERG E+GRFFHPNLMDSP+NRCRYYGK QNQNVSTQFYPENSSSM
Sbjct: 661 PGITSETADIRLMNSERGTESGRFFHPNLMDSPYNRCRYYGKFQNQNVSTQFYPENSSSM 720
Query: 721 CANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFLQ 780
CANPG+QTMRLMGKDVAVGGN QEVQEPEVINFWKNS IGNCLTNPIQETHMRKRNFLQ
Sbjct: 721 CANPGQQTMRLMGKDVAVGGNRQEVQEPEVINFWKNSTLIGNCLTNPIQETHMRKRNFLQ 780
Query: 781 DRELHHPSKGETLCYHPAGFYGNQMAQRH----------------------------LLQ 840
DRELHHPSKGETL YHPAGF+GNQ+AQ + ++
Sbjct: 781 DRELHHPSKGETLFYHPAGFHGNQVAQSNFFANASQVRYPHPHLNRKSSIMYQRPDSVIN 840
Query: 841 MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQNS 900
+ FN+NIH FSP T+TFNMA+NFQ PFISG ET RFGS PSAFSTSHH CPN Y+NS
Sbjct: 841 LNESFNNNIHAFSPSSTDTFNMAQNFQGPFISGPETLRFGSQPSAFSTSHHTCPNRYENS 900
Query: 901 FELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSSLGLTLLELSPWMLHDH 960
FELGFNQNLHPAKLGTFNFPFLQPDDE HV L S K L PWMLHDH
Sbjct: 901 FELGFNQNLHPAKLGTFNFPFLQPDDETHVQLPWSHTSK-----------SLPPWMLHDH 960
Query: 961 QREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKNHI 1020
QRE NSKLAD+NGYYCPCIP G+DVLI+PSSMH RLETAYPCSTMPYSHLQ KNHI
Sbjct: 961 QREAPQTTNSKLADLNGYYCPCIPFGTDVLINPSSMHHRLETAYPCSTMPYSHLQTKNHI 1020
Query: 1021 PGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALELVN 1080
PG TSFFQP+PV PR+LQSPI+NAGH+IR+SSEDRLKFN+LSVKD DFSSKT A ELV+
Sbjct: 1021 PGPTSFFQPMPVAPRILQSPIANAGHEIRLSSEDRLKFNTLSVKDFDFSSKTLLAGELVD 1080
Query: 1081 SRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNMTQ 1140
SRKRQ++SSLE NNSGVVP WTRGKFSDDHL+S GTVKIHANWDKAVNS G NIPNMTQ
Sbjct: 1081 SRKRQKISSLETNNSGVVPGWTRGKFSDDHLESNPGTVKIHANWDKAVNSAG-NIPNMTQ 1140
Query: 1141 TTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAGLV 1180
TTD V+IST NNE PK ECMARSGPIKLTAGAKHILKPSQS+D+DNTKPTYSTIPSAGLV
Sbjct: 1141 TTDGVVISTKNNETPKFECMARSGPIKLTAGAKHILKPSQSVDIDNTKPTYSTIPSAGLV 1171
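The percentages in each HSP header are the match counts divided by the number of aligned columns. A minimal Python sketch for recomputing them from a header line is given here, assuming the "name = matches/total" layout used in this section. `parse_hsp_stats` is a hypothetical helper, not part of BLAST or of this database.

```python
import re


def parse_hsp_stats(line):
    """Return {field name: percentage} for every 'name = n/total' pair."""
    stats = {}
    # Fields without a slash (such as 'Query Frame = 0') are not matched.
    for name, matches, total in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[name] = round(100.0 * int(matches) / int(total), 2)
    return stats


header = "Identity = 967/1218 (79.39%), Positives = 1023/1218 (83.99%), Query Frame = 0"
for field, percent in parse_hsp_stats(header).items():
    print(field, percent)
```

For the first hit this reproduces the reported 79.39% identity and 83.99% positives. The same function can be applied to the header lines of the other hits.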
BLAST of HG10005218 vs. NCBI nr
Match:
XP_011657559.1 (uncharacterized protein LOC105435872 [Cucumis sativus] >KGN47991.1 hypothetical protein Csa_004444 [Cucumis sativus])
HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 953/1220 (78.11%), Positives = 1025/1220 (84.02%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSS--FKE 60
MA PTS FSIREYALNKRS LT ISWPFSEKVKKEVAE+LLPPMDVKKFRWWSS
Sbjct: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
Query: 61 EKVSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQTGKEIRRKNK 120
++ E EE EE+EVI ERIKMQKICPVCGVFVAATV AVNAHID+CL T KEIRRKN
Sbjct: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKN- 120
Query: 121 GGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDC--EGEEEKAVGKQIIHNN 180
LK KSRTPKKRSIAEIFAVAPPVK MI+VNDC + EE+KAVGKQIIH+N
Sbjct: 121 ---------YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHN 180
Query: 181 NNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQLC 240
NLKTTSLATSLVS IKTI KI TTTE+P+I L K+KKKKKKKKKKNKDF HG+LC
Sbjct: 181 KNLKTTSLATSLVSAIKTIKNKI--ATTTEEPTI--LAKRKKKKKKKKKKNKDFCHGKLC 240
Query: 241 KKGEIRNHKDVSTLCK-KPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKAI 300
KKG+IRNHKDVST CK +PCFKRLS+QK++KL KKS VVAKQQRPMPPLRSILKHSVKAI
Sbjct: 241 KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300
Query: 301 SETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQAS 360
SETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR SDTFEQNVGNPFQAS
Sbjct: 301 SETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQAS 360
Query: 361 EVSTNSGESNKGVASMEANLSDHVVCF-STRHEVDSQHVKGKIQLPNVQSQVNAQSWDNE 420
EVSTNSGESNK V SMEANL+D V CF STRH+VDSQHVKGKIQLPN +QVNAQSW+N
Sbjct: 361 EVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENP 420
Query: 421 KHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRTR 480
KHSTEKLI +RD+PHD+NDLHLFDHVYVDA QKLPP HSAIPALLAAQEER YG VRT+
Sbjct: 421 KHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQ 480
Query: 481 CCLNSVPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSLSENPVSRFLNIAESS 540
C LN VPQ HSL GKSVDHLI N FNG AALGS+TS+VP SSL+ENPVSRFLN+AESS
Sbjct: 481 CGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESS 540
Query: 541 AKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNII 600
A+D NRF NGEQ V YKEKGVNDGFFCLPLNS+GELIQLNSGL +RFDQMN+ I
Sbjct: 541 ARDSNRFQISNGEQGVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEANTTI 600
Query: 601 ACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDV 660
A S+RIP C+ V+PRSRDYFVDNEKL +DT+LTGNQLTLFPLHSHMQE QNRYLPAGFDV
Sbjct: 601 AGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDV 660
Query: 661 AEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSS 720
EPGTSETADIRLMNSERG ETGRFFHPNLMDSPFNRCRYY K QNQNVS QFYPENSSS
Sbjct: 661 PEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSS 720
Query: 721 MCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFL 780
MCANPGRQTMRLMGKDVAVGGNG++VQEPEVINFWKNS+ IGNCLTNPIQETHMRKRNFL
Sbjct: 721 MCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFL 780
Query: 781 QDRELHHPSKGETLCYHPAGFYGNQMAQRHLL---------------------------- 840
QDRELH+PS+GETL YHPAGF+GNQ+AQ +LL
Sbjct: 781 QDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESV 840
Query: 841 -QMLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQ 900
+ +FN NIH F T+T NMARNFQAPF+SGLETQRF S PSAFSTSHH+CPN Y+
Sbjct: 841 INLNERFN-NIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYE 900
Query: 901 NSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSSLGLTLLELSPWMLH 960
NSFELGFNQ+LHPAKLGTFNFPFLQPDD NHV L S K LSPW+LH
Sbjct: 901 NSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSK-----------SLSPWILH 960
Query: 961 DHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKN 1020
DHQRE P ANSKLAD+NGYYCPC P G+DVLISPSS+H +LETAYPCSTM YSHLQ KN
Sbjct: 961 DHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQTKN 1020
Query: 1021 HIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALEL 1080
HIPGSTS FQPIP+ PRVL SPI+NAGH+IRM SEDRLKFNSLSVK+SDFSSK Q A E
Sbjct: 1021 HIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEF 1080
Query: 1081 VNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNM 1140
V+SRKRQ+ SLE NNSGVVPEWTRGK+SDDHL+S GTVKIHANWDKAVNSVG NIPNM
Sbjct: 1081 VDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVG-NIPNM 1140
Query: 1141 TQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAG 1180
TQTTD ++IS NNNEA +VECMARSGPIKLTAGAKHILKPSQSMD+DNTKPTYSTIPSAG
Sbjct: 1141 TQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAG 1191
BLAST of HG10005218 vs. NCBI nr
Match:
XP_008449514.1 (PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo] >KAA0061673.1 putative Zinc finger, Rad18-type [Cucumis melo var. makuwa] >TYK21149.1 putative Zinc finger, Rad18-type [Cucumis melo var. makuwa])
HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 724/934 (77.52%), Positives = 781/934 (83.62%), Query Frame = 0
Query: 281 MPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDF 340
MPPLRSILK SVKAISETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR
Sbjct: 1 MPPLRSILKRSVKAISETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRAI 60
Query: 341 SDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLP 400
SDTFEQNVGNPFQASEV NSGESNK V SMEANL+D V CF+TRH+VDSQHVKGKIQLP
Sbjct: 61 SDTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFNTRHKVDSQHVKGKIQLP 120
Query: 401 NVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 460
N +QVNA+ W+N KHSTEKLI +RD+PHD+NDLH F HVYVDA QKLP HSAIPALL
Sbjct: 121 NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180
Query: 461 AAQEERQYGDVRTRCCLNS-VPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSL 520
A QEER YG VRT+C LN+ VPQ HSL GKSVD+LI N FNG AALGS+TS+VP SSL
Sbjct: 181 AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240
Query: 521 SENPVSRFLNIAESSAKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSG 580
+ENPVSR N+AESSA+D NRF FPNGEQS V YKEKGVNDGFFCLPLNS+GELIQLNSG
Sbjct: 241 TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300
Query: 581 LINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSH 640
L +RFDQMN+ N +A S+RIP C+LV+PRSRDYFVDNEKL VDT+LTGNQLTLFPLHSH
Sbjct: 301 LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360
Query: 641 MQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQ 700
MQE QNRYLPAGFDV EPGTSETADIRLM+SERG ETGRFFHP LMDSPFNRCRYY K Q
Sbjct: 361 MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420
Query: 701 NQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCL 760
NQNVSTQFYPENSSSMC NPGRQTMRLMGKDVAVGGNG++ QEPEVINF KNS+ +GNCL
Sbjct: 421 NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480
Query: 761 TNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ------------ 820
TNPIQETHMRKRNFLQDRELH+PS+GETL YHPAGF+GNQ+AQ +LL
Sbjct: 481 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540
Query: 821 ----------------MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPS 880
L++ +IH F P T+T NMARNFQAPF+SGLETQRF S PS
Sbjct: 541 TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600
Query: 881 AFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSS 940
AFSTSHHMCPN Y+NSFELGFNQ+LHPAKLGTFNFPFLQ DD NHV L S K
Sbjct: 601 AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSK----- 660
Query: 941 LGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAY 1000
LSPW+LHDHQRE P ANSKLADINGYYCPC PSG+DVLISPSS+H RLETAY
Sbjct: 661 ------SLSPWILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAY 720
Query: 1001 PCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVK 1060
PCSTM YSHLQ KNHI GSTSFFQPIP+ PRVLQSPI+NAGH+IRM SEDRLKFNSLSVK
Sbjct: 721 PCSTMAYSHLQTKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVK 780
Query: 1061 DSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANW 1120
DSDFSSK Q A E V+SRKRQ+ SLE NNSG+VPEWTRGK+SDDHL+S G KIHAN
Sbjct: 781 DSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANR 840
Query: 1121 DKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDL 1180
DKAVNSVG NIPNMTQTTD ++IS NNNEA KVEC ARSGPIKLTAGAKHILKPSQSMD+
Sbjct: 841 DKAVNSVG-NIPNMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDV 900
BLAST of HG10005218 vs. NCBI nr
Match:
XP_022148072.1 (uncharacterized protein LOC111016842 isoform X1 [Momordica charantia])
HSP 1 Score: 1364.4 bits (3530), Expect = 0.0e+00
Identity = 786/1229 (63.95%), Positives = 894/1229 (72.74%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
MAV S FSIREYALN R DL R WPF + VKKEVAEA+LPP+ V KFRWWS +
Sbjct: 1 MAVAPSGFSIREYALNMRGRDLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
K +SE +++EEE+VII M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120
Query: 121 GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
+RKN G +K KSRTPKKRSIAEIFAVAPPV+ +V D G
Sbjct: 121 ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180
Query: 181 KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
II LK TSLA +LV+ +KTI K + K+ K K KNKD
Sbjct: 181 GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240
Query: 241 FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
FGH L KKGE RNHKDVS CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241 FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300
Query: 301 HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
SVK +SET+ S NL+GS QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301 QSVKVVSETBPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360
Query: 361 NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
NPFQ SE +T SGESNKGVASME L+D +V FSTRH VDSQ +KGKIQLPN+ QVNA
Sbjct: 361 NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420
Query: 421 Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
Q W N KH E+ IS NR VPH+ N HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421 QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480
Query: 481 AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
AAQ+ERQYG VRT+ N P H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN
Sbjct: 481 AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540
Query: 541 VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
V R N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541 VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600
Query: 601 DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
DQMN+ N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE
Sbjct: 601 DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660
Query: 661 QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
+N+YL A FDV EPGTS DIRL+NSERG ++G H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661 RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720
Query: 721 STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721 STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780
Query: 781 QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
QE MRKRNFLQDR LH+PSKGETL ++PAGF+ Q+AQ +LL
Sbjct: 781 QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840
Query: 841 -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
++++ +D NI+ F P T FNMA NFQAPFISG T RFG P AFSTS
Sbjct: 841 GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900
Query: 901 HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV
Sbjct: 901 QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960
Query: 961 ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
W+ Q++EAP A SKLADING Y P I SG DVL SP SM R E A+PCSTMP
Sbjct: 961 -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020
Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
SH Q+KN IPGSTS FQPIPV PR I AGH+ R+S EDRLKF +LSVKD+D
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080
Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
SK QP EL++SRKRQ++ SLE NNSGVV EWT GKF+D+ +S G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140
Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
N+PN+T+ TD V++ + NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1153
BLAST of HG10005218 vs. NCBI nr
Match:
XP_022148073.1 (uncharacterized protein LOC111016842 isoform X2 [Momordica charantia])
HSP 1 Score: 1345.5 bits (3481), Expect = 0.0e+00
Identity = 780/1229 (63.47%), Positives = 888/1229 (72.25%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
MAV S FSIR DL R WPF + VKKEVAEA+LPP+ V KFRWWS +
Sbjct: 1 MAVAPSGFSIR---------DLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
K +SE +++EEE+VII M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120
Query: 121 GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
+RKN G +K KSRTPKKRSIAEIFAVAPPV+ +V D G
Sbjct: 121 ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180
Query: 181 KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
II LK TSLA +LV+ +KTI K + K+ K K KNKD
Sbjct: 181 GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240
Query: 241 FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
FGH L KKGE RNHKDVS CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241 FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300
Query: 301 HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
SVK +SET+ S NL+GS QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301 QSVKVVSETBPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360
Query: 361 NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
NPFQ SE +T SGESNKGVASME L+D +V FSTRH VDSQ +KGKIQLPN+ QVNA
Sbjct: 361 NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420
Query: 421 Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
Q W N KH E+ IS NR VPH+ N HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421 QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480
Query: 481 AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
AAQ+ERQYG VRT+ N P H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN
Sbjct: 481 AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540
Query: 541 VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
V R N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541 VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600
Query: 601 DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
DQMN+ N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE
Sbjct: 601 DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660
Query: 661 QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
+N+YL A FDV EPGTS DIRL+NSERG ++G H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661 RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720
Query: 721 STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721 STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780
Query: 781 QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
QE MRKRNFLQDR LH+PSKGETL ++PAGF+ Q+AQ +LL
Sbjct: 781 QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840
Query: 841 -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
++++ +D NI+ F P T FNMA NFQAPFISG T RFG P AFSTS
Sbjct: 841 GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900
Query: 901 HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV
Sbjct: 901 QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960
Query: 961 ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
W+ Q++EAP A SKLADING Y P I SG DVL SP SM R E A+PCSTMP
Sbjct: 961 -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020
Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
SH Q+KN IPGSTS FQPIPV PR I AGH+ R+S EDRLKF +LSVKD+D
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080
Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
SK QP EL++SRKRQ++ SLE NNSGVV EWT GKF+D+ +S G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140
Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
N+PN+T+ TD V++ + NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1144
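The percentages on each HSP summary line follow directly from the counts printed beside them: matching (or positively scoring) columns divided by the alignment length. The sketch below is a minimal illustration of that arithmetic, not part of the database's own tooling; the function name `hsp_percentages` and the regular expression are assumptions made for this example, and the pattern accepts either spelling of the positives label.

```python
import re

# Hypothetical helper for illustration; the pattern tolerates both
# "Postives" (as some reports print it) and "Positives".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_percentages(line):
    """Recompute percent identity and percent positives from the raw counts."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    identical, ident_len, _, positive, pos_len, _ = match.groups()
    pct_identity = round(100 * int(identical) / int(ident_len), 2)
    pct_positive = round(100 * int(positive) / int(pos_len), 2)
    return pct_identity, pct_positive


if __name__ == "__main__":
    example = "Identity = 780/1229 (63.47%), Postives = 888/1229 (72.25%), Query Frame = 0"
    print(hsp_percentages(example))
```

Applied to the statistics line of the match above, this gives 780/1229 and 888/1229 as percentages rounded to two decimals, which can be compared with the values printed in the report.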
BLAST of HG10005218 vs. ExPASy TrEMBL
Match:
A0A0A0KJS6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1)
HSP 1 Score: 1799.6 bits (4660), Expect = 0.0e+00
Identity = 953/1220 (78.11%), Positives = 1025/1220 (84.02%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWSS--FKE 60
MA PTS FSIREYALNKRS LT ISWPFSEKVKKEVAE+LLPPMDVKKFRWWSS
Sbjct: 1 MADPTSTFSIREYALNKRSMGLTTISWPFSEKVKKEVAESLLPPMDVKKFRWWSSLWLSS 60
Query: 61 EKVSEEEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQTGKEIRRKNK 120
++ E EE EE+EVI ERIKMQKICPVCGVFVAATV AVNAHID+CL T KEIRRKN
Sbjct: 61 QEEEEGEEGEEKEVITERIKMQKICPVCGVFVAATVAAVNAHIDTCLAQTTSKEIRRKN- 120
Query: 121 GGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDC--EGEEEKAVGKQIIHNN 180
LK KSRTPKKRSIAEIFAVAPPVK MI+VNDC + EE+KAVGKQIIH+N
Sbjct: 121 ---------YLKAKSRTPKKRSIAEIFAVAPPVKTMIVVNDCCEDEEEKKAVGKQIIHHN 180
Query: 181 NNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKDFGHGQLC 240
NLKTTSLATSLVS IKTI KI TTTE+P+I L K+KKKKKKKKKKNKDF HG+LC
Sbjct: 181 KNLKTTSLATSLVSAIKTIKNKI--ATTTEEPTI--LAKRKKKKKKKKKKNKDFCHGKLC 240
Query: 241 KKGEIRNHKDVSTLCK-KPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILKHSVKAI 300
KKG+IRNHKDVST CK +PCFKRLS+QK++KL KKS VVAKQQRPMPPLRSILKHSVKAI
Sbjct: 241 KKGDIRNHKDVSTFCKRRPCFKRLSKQKKKKLAKKSTVVAKQQRPMPPLRSILKHSVKAI 300
Query: 301 SETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVGNPFQAS 360
SETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR SDTFEQNVGNPFQAS
Sbjct: 301 SETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRTISDTFEQNVGNPFQAS 360
Query: 361 EVSTNSGESNKGVASMEANLSDHVVCF-STRHEVDSQHVKGKIQLPNVQSQVNAQSWDNE 420
EVSTNSGESNK V SMEANL+D V CF STRH+VDSQHVKGKIQLPN +QVNAQSW+N
Sbjct: 361 EVSTNSGESNKEVPSMEANLNDDVDCFNSTRHKVDSQHVKGKIQLPNFHNQVNAQSWENP 420
Query: 421 KHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALLAAQEERQYGDVRTR 480
KHSTEKLI +RD+PHD+NDLHLFDHVYVDA QKLPP HSAIPALLAAQEER YG VRT+
Sbjct: 421 KHSTEKLILESRDIPHDRNDLHLFDHVYVDAHQKLPPEHSAIPALLAAQEERPYGHVRTQ 480
Query: 481 CCLNSVPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSLSENPVSRFLNIAESS 540
C LN VPQ HSL GKSVDHLI N FNG AALGS+TS+VP SSL+ENPVSRFLN+AESS
Sbjct: 481 CGLNVVPQAHSLYGKSVDHLINNNNHFNGVAALGSVTSRVPSSSLTENPVSRFLNLAESS 540
Query: 541 AKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRFDQMNDTGNII 600
A+D NRF NGEQ V YKEKGVNDGFFCLPLNS+GELIQLNSGL +RFDQMN+ I
Sbjct: 541 ARDSNRFQISNGEQGVVTYKEKGVNDGFFCLPLNSRGELIQLNSGLTDRFDQMNEANTTI 600
Query: 601 ACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEYQNRYLPAGFDV 660
A S+RIP C+ V+PRSRDYFVDNEKL +DT+LTGNQLTLFPLHSHMQE QNRYLPAGFDV
Sbjct: 601 AGSSRIPVCNFVVPRSRDYFVDNEKLFLDTKLTGNQLTLFPLHSHMQENQNRYLPAGFDV 660
Query: 661 AEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNVSTQFYPENSSS 720
EPGTSETADIRLMNSERG ETGRFFHPNLMDSPFNRCRYY K QNQNVS QFYPENSSS
Sbjct: 661 PEPGTSETADIRLMNSERGTETGRFFHPNLMDSPFNRCRYYEKFQNQNVSAQFYPENSSS 720
Query: 721 MCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPIQETHMRKRNFL 780
MCANPGRQTMRLMGKDVAVGGNG++VQEPEVINFWKNS+ IGNCLTNPIQETHMRKRNFL
Sbjct: 721 MCANPGRQTMRLMGKDVAVGGNGKDVQEPEVINFWKNSHLIGNCLTNPIQETHMRKRNFL 780
Query: 781 QDRELHHPSKGETLCYHPAGFYGNQMAQRHLL---------------------------- 840
QDRELH+PS+GETL YHPAGF+GNQ+AQ +LL
Sbjct: 781 QDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPCTNRKSSLLYPRPESV 840
Query: 841 -QMLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTSHHMCPNSYQ 900
+ +FN NIH F T+T NMARNFQAPF+SGLETQRF S PSAFSTSHH+CPN Y+
Sbjct: 841 INLNERFN-NIHSFPTSSTDTLNMARNFQAPFVSGLETQRFCSQPSAFSTSHHVCPNRYE 900
Query: 901 NSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSSLGLTLLELSPWMLH 960
NSFELGFNQ+LHPAKLGTFNFPFLQPDD NHV L S K LSPW+LH
Sbjct: 901 NSFELGFNQSLHPAKLGTFNFPFLQPDDGNHVQLPWSHTSK-----------SLSPWILH 960
Query: 961 DHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMPYSHLQMKN 1020
DHQRE P ANSKLAD+NGYYCPC P G+DVLISPSS+H +LETAYPCSTM YSHLQ KN
Sbjct: 961 DHQREVPPTANSKLADVNGYYCPCTP-GTDVLISPSSIHHQLETAYPCSTMAYSHLQTKN 1020
Query: 1021 HIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVKDSDFSSKTQPALEL 1080
HIPGSTS FQPIP+ PRVL SPI+NAGH+IRM SEDRLKFNSLSVK+SDFSSK Q A E
Sbjct: 1021 HIPGSTSLFQPIPIAPRVLHSPIANAGHEIRMRSEDRLKFNSLSVKNSDFSSKKQLAEEF 1080
Query: 1081 VNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVNSVGNNIPNM 1140
V+SRKRQ+ SLE NNSGVVPEWTRGK+SDDHL+S GTVKIHANWDKAVNSVG NIPNM
Sbjct: 1081 VDSRKRQKTLSLETNNSGVVPEWTRGKYSDDHLKSNPGTVKIHANWDKAVNSVG-NIPNM 1140
Query: 1141 TQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKPTYSTIPSAG 1180
TQTTD ++IS NNNEA +VECMARSGPIKLTAGAKHILKPSQSMD+DNTKPTYSTIPSAG
Sbjct: 1141 TQTTDGIVISANNNEAHRVECMARSGPIKLTAGAKHILKPSQSMDVDNTKPTYSTIPSAG 1191
BLAST of HG10005218 vs. ExPASy TrEMBL
Match:
A0A5D3DCZ7 (Putative Zinc finger, Rad18-type OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold75860G00170 PE=4 SV=1)
HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 724/934 (77.52%), Positives = 781/934 (83.62%), Query Frame = 0
Query: 281 MPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDF 340
MPPLRSILK SVKAISETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR
Sbjct: 1 MPPLRSILKRSVKAISETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRAI 60
Query: 341 SDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLP 400
SDTFEQNVGNPFQASEV NSGESNK V SMEANL+D V CF+TRH+VDSQHVKGKIQLP
Sbjct: 61 SDTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFNTRHKVDSQHVKGKIQLP 120
Query: 401 NVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 460
N +QVNA+ W+N KHSTEKLI +RD+PHD+NDLH F HVYVDA QKLP HSAIPALL
Sbjct: 121 NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180
Query: 461 AAQEERQYGDVRTRCCLNS-VPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSL 520
A QEER YG VRT+C LN+ VPQ HSL GKSVD+LI N FNG AALGS+TS+VP SSL
Sbjct: 181 AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240
Query: 521 SENPVSRFLNIAESSAKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSG 580
+ENPVSR N+AESSA+D NRF FPNGEQS V YKEKGVNDGFFCLPLNS+GELIQLNSG
Sbjct: 241 TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300
Query: 581 LINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSH 640
L +RFDQMN+ N +A S+RIP C+LV+PRSRDYFVDNEKL VDT+LTGNQLTLFPLHSH
Sbjct: 301 LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360
Query: 641 MQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQ 700
MQE QNRYLPAGFDV EPGTSETADIRLM+SERG ETGRFFHP LMDSPFNRCRYY K Q
Sbjct: 361 MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420
Query: 701 NQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCL 760
NQNVSTQFYPENSSSMC NPGRQTMRLMGKDVAVGGNG++ QEPEVINF KNS+ +GNCL
Sbjct: 421 NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480
Query: 761 TNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ------------ 820
TNPIQETHMRKRNFLQDRELH+PS+GETL YHPAGF+GNQ+AQ +LL
Sbjct: 481 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540
Query: 821 ----------------MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPS 880
L++ +IH F P T+T NMARNFQAPF+SGLETQRF S PS
Sbjct: 541 TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600
Query: 881 AFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSS 940
AFSTSHHMCPN Y+NSFELGFNQ+LHPAKLGTFNFPFLQ DD NHV L S K
Sbjct: 601 AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSK----- 660
Query: 941 LGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAY 1000
LSPW+LHDHQRE P ANSKLADINGYYCPC PSG+DVLISPSS+H RLETAY
Sbjct: 661 ------SLSPWILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAY 720
Query: 1001 PCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVK 1060
PCSTM YSHLQ KNHI GSTSFFQPIP+ PRVLQSPI+NAGH+IRM SEDRLKFNSLSVK
Sbjct: 721 PCSTMAYSHLQTKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVK 780
Query: 1061 DSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANW 1120
DSDFSSK Q A E V+SRKRQ+ SLE NNSG+VPEWTRGK+SDDHL+S G KIHAN
Sbjct: 781 DSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANR 840
Query: 1121 DKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDL 1180
DKAVNSVG NIPNMTQTTD ++IS NNNEA KVEC ARSGPIKLTAGAKHILKPSQSMD+
Sbjct: 841 DKAVNSVG-NIPNMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDV 900
BLAST of HG10005218 vs. ExPASy TrEMBL
Match:
A0A1S3BM77 (uncharacterized protein LOC103491377 OS=Cucumis melo OX=3656 GN=LOC103491377 PE=4 SV=1)
HSP 1 Score: 1401.0 bits (3625), Expect = 0.0e+00
Identity = 724/934 (77.52%), Positives = 781/934 (83.62%), Query Frame = 0
Query: 281 MPPLRSILKHSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDF 340
MPPLRSILK SVKAISETNSS INL+GS NQ FNN GQKSDR VSFLDKDDVLGPSTR
Sbjct: 1 MPPLRSILKRSVKAISETNSSFINLKGS-NQAFNNGGQKSDRRVSFLDKDDVLGPSTRAI 60
Query: 341 SDTFEQNVGNPFQASEVSTNSGESNKGVASMEANLSDHVVCFSTRHEVDSQHVKGKIQLP 400
SDTFEQNVGNPFQASEV NSGESNK V SMEANL+D V CF+TRH+VDSQHVKGKIQLP
Sbjct: 61 SDTFEQNVGNPFQASEVGINSGESNK-VPSMEANLNDDVDCFNTRHKVDSQHVKGKIQLP 120
Query: 401 NVQSQVNAQSWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 460
N +QVNA+ W+N KHSTEKLI +RD+PHD+NDLH F HVYVDA QKLP HSAIPALL
Sbjct: 121 NFHNQVNAERWENAKHSTEKLILESRDIPHDRNDLHSFAHVYVDAHQKLPLKHSAIPALL 180
Query: 461 AAQEERQYGDVRTRCCLNS-VPQVHSLNGKSVDHLI---NPFNGAAALGSITSKVP-SSL 520
A QEER YG VRT+C LN+ VPQ HSL GKSVD+LI N FNG AALGS+TS+VP SSL
Sbjct: 181 AEQEERPYGHVRTQCGLNNVVPQAHSLYGKSVDNLINNNNHFNGVAALGSVTSRVPSSSL 240
Query: 521 SENPVSRFLNIAESSAKD-NRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSG 580
+ENPVSR N+AESSA+D NRF FPNGEQS V YKEKGVNDGFFCLPLNS+GELIQLNSG
Sbjct: 241 TENPVSRLFNLAESSARDSNRFQFPNGEQSVVTYKEKGVNDGFFCLPLNSRGELIQLNSG 300
Query: 581 LINRFDQMNDTGNIIACSNRIPACSLVLPRSRDYFVDNEKLLVDTELTGNQLTLFPLHSH 640
L +RFDQMN+ N +A S+RIP C+LV+PRSRDYFVDNEKL VDT+LTGNQLTLFPLHSH
Sbjct: 301 LTDRFDQMNEASNTMAGSSRIPVCNLVVPRSRDYFVDNEKLFVDTKLTGNQLTLFPLHSH 360
Query: 641 MQEYQNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQ 700
MQE QNRYLPAGFDV EPGTSETADIRLM+SERG ETGRFFHP LMDSPFNRCRYY K Q
Sbjct: 361 MQENQNRYLPAGFDVPEPGTSETADIRLMSSERGTETGRFFHPKLMDSPFNRCRYYEKFQ 420
Query: 701 NQNVSTQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCL 760
NQNVSTQFYPENSSSMC NPGRQTMRLMGKDVAVGGNG++ QEPEVINF KNS+ +GNCL
Sbjct: 421 NQNVSTQFYPENSSSMCVNPGRQTMRLMGKDVAVGGNGKDAQEPEVINFLKNSHLVGNCL 480
Query: 761 TNPIQETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ------------ 820
TNPIQETHMRKRNFLQDRELH+PS+GETL YHPAGF+GNQ+AQ +LL
Sbjct: 481 TNPIQETHMRKRNFLQDRELHYPSRGETLFYHPAGFHGNQVAQGNLLANAPQAVRYPHPC 540
Query: 821 ----------------MLHKFNDNIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPS 880
L++ +IH F P T+T NMARNFQAPF+SGLETQRF S PS
Sbjct: 541 TNRKSSILYPRPESVINLNERFSSIHSFPPSSTDTLNMARNFQAPFVSGLETQRFCSQPS 600
Query: 881 AFSTSHHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHV-LSCSQMMKIMSSS 940
AFSTSHHMCPN Y+NSFELGFNQ+LHPAKLGTFNFPFLQ DD NHV L S K
Sbjct: 601 AFSTSHHMCPNRYENSFELGFNQSLHPAKLGTFNFPFLQQDDGNHVQLPWSHTSK----- 660
Query: 941 LGLTLLELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAY 1000
LSPW+LHDHQRE P ANSKLADINGYYCPC PSG+DVLISPSS+H RLETAY
Sbjct: 661 ------SLSPWILHDHQRELPPTANSKLADINGYYCPCTPSGTDVLISPSSIHHRLETAY 720
Query: 1001 PCSTMPYSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSSEDRLKFNSLSVK 1060
PCSTM YSHLQ KNHI GSTSFFQPIP+ PRVLQSPI+NAGH+IRM SEDRLKFNSLSVK
Sbjct: 721 PCSTMAYSHLQTKNHISGSTSFFQPIPIAPRVLQSPIANAGHEIRMRSEDRLKFNSLSVK 780
Query: 1061 DSDFSSKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANW 1120
DSDFSSK Q A E V+SRKRQ+ SLE NNSG+VPEWTRGK+SDDHL+S G KIHAN
Sbjct: 781 DSDFSSKKQLAEEFVDSRKRQKTLSLETNNSGIVPEWTRGKYSDDHLKSNPGMKKIHANR 840
Query: 1121 DKAVNSVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDL 1180
DKAVNSVG NIPNMTQTTD ++IS NNNEA KVEC ARSGPIKLTAGAKHILKPSQSMD+
Sbjct: 841 DKAVNSVG-NIPNMTQTTDGIVISANNNEAHKVECTARSGPIKLTAGAKHILKPSQSMDV 900
BLAST of HG10005218 vs. ExPASy TrEMBL
Match:
A0A6J1D428 (uncharacterized protein LOC111016842 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111016842 PE=4 SV=1)
HSP 1 Score: 1363.2 bits (3527), Expect = 0.0e+00
Identity = 786/1229 (63.95%), Positives = 894/1229 (72.74%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
MAV S FSIREYALN R DL R WPF + VKKEVAEA+LPP+ V KFRWWS +
Sbjct: 1 MAVAPSGFSIREYALNMRGRDLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
K +SE +++EEE+VII M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120
Query: 121 GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
+RKN G +K KSRTPKKRSIAEIFAVAPPV+ +V D G
Sbjct: 121 ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180
Query: 181 KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
II LK TSLA +LV+ +KTI K + K+ K K KNKD
Sbjct: 181 GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240
Query: 241 FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
FGH L KKGE RNHKDVS CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241 FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300
Query: 301 HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
SVK +SET+ S NL+GS QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301 QSVKVVSETDPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360
Query: 361 NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
NPFQ SE +T SGESNKGVASME L+D +V FSTRH VDSQ +KGKIQLPN+ QVNA
Sbjct: 361 NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420
Query: 421 Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
Q W N KH E+ IS NR VPH+ N HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421 QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480
Query: 481 AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
AAQ+ERQYG VRT+ N P H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN
Sbjct: 481 AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540
Query: 541 VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
V R N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541 VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600
Query: 601 DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
DQMN+ N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE
Sbjct: 601 DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660
Query: 661 QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
+N+YL A FDV EPGTS DIRL+NSERG ++G H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661 RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720
Query: 721 STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721 STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780
Query: 781 QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
QE MRKRNFLQDR LH+PSKGETL ++PAGF+ Q+AQ +LL
Sbjct: 781 QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840
Query: 841 -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
++++ +D NI+ F P T FNMA NFQAPFISG T RFG P AFSTS
Sbjct: 841 GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900
Query: 901 HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV
Sbjct: 901 QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960
Query: 961 ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
W+ Q++EAP A SKLADING Y P I SG DVL SP SM R E A+PCSTMP
Sbjct: 961 -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020
Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
SH Q+KN IPGSTS FQPIPV PR I AGH+ R+S EDRLKF +LSVKD+D
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080
Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
SK QP EL++SRKRQ++ SLE NNSGVV EWT GKF+D+ +S G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140
Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
N+PN+T+ TD V++ + NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1153
BLAST of HG10005218 vs. ExPASy TrEMBL
Match:
A0A6J1D325 (uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 GN=LOC111016842 PE=4 SV=1)
HSP 1 Score: 1344.3 bits (3478), Expect = 0.0e+00
Identity = 780/1229 (63.47%), Positives = 888/1229 (72.25%), Query Frame = 0
Query: 1 MAVPTSAFSIREYALNKRSTDLTRISWPFSEKVKKEVAEALLPPMDVKKFRWWS----SF 60
MAV S FSIR DL R WPF + VKKEVAEA+LPP+ V KFRWWS +
Sbjct: 1 MAVAPSGFSIR---------DLGR-CWPFRDNVKKEVAEAILPPISVTKFRWWSHELEAL 60
Query: 61 KEEKVSE-------EEEEEEEEVIIERIKMQKICPVCGVFVAATVNAVNAHIDSCLNAQT 120
K +SE +++EEE+VII M+KICPVCGVFV ATVNA+NAHIDSCL AQT
Sbjct: 61 KSSNISETVTAAAAAQKQEEEKVII----MEKICPVCGVFVTATVNAMNAHIDSCL-AQT 120
Query: 121 GKEIRRKNKGGGGGGGNLNLKGKSRTPKKRSIAEIFAVAPPVKAMIIVNDCEGEEEKAVG 180
+RKN G +K KSRTPKKRSIAEIFAVAPPV+ +V D G
Sbjct: 121 ITNQKRKNNSNGA------VKPKSRTPKKRSIAEIFAVAPPVET--VVED---------G 180
Query: 181 KQIIHNNNNLKTTSLATSLVSTIKTINTKITTTTTTEQPSIDLLKKKKKKKKKKKKKNKD 240
II LK TSLA +LV+ +KTI K + K+ K K KNKD
Sbjct: 181 GGIIRQKQQLKATSLARTLVTAMKTIKAK---------------RNKQHKLKASVVKNKD 240
Query: 241 FGHGQLCKKGEIRNHKDVSTLCKKPCFKRLSRQKRQKLVKKSNVVAKQQRPMPPLRSILK 300
FGH L KKGE RNHKDVS CKKPCFKRLSRQK++KLVKKSNV AKQQRP+P +RSILK
Sbjct: 241 FGHELLRKKGE-RNHKDVSVRCKKPCFKRLSRQKKKKLVKKSNVPAKQQRPVPSIRSILK 300
Query: 301 HSVKAISETNSSLINLRGSNNQVFNNSGQKSDRHVSFLDKDDVLGPSTRDFSDTFEQNVG 360
SVK +SET+ S NL+GS QV NN G++SDR VSF DKDDVLGP TR FSDTFEQ+VG
Sbjct: 301 QSVKVVSETDPS-GNLKGS-KQVINNGGKQSDRRVSFFDKDDVLGPKTRAFSDTFEQSVG 360
Query: 361 NPFQASEVSTNSGESNKGVASME-ANLSDHVVCFSTRHEVDSQHVKGKIQLPNVQSQVNA 420
NPFQ SE +T SGESNKGVASME L+D +V FSTRH VDSQ +KGKIQLPN+ QVNA
Sbjct: 361 NPFQDSEGNTMSGESNKGVASMEDVGLNDDIVSFSTRHGVDSQRIKGKIQLPNIHDQVNA 420
Query: 421 Q--------SWDNEKHSTEKLISTNRDVPHDQNDLHLFDHVYVDAPQKLPPVHSAIPALL 480
Q W N KH E+ IS NR VPH+ N HLFDHVY+DAPQ+ PPVHSAIPALL
Sbjct: 421 QISSMRPHPCWGNMKHLVEEPISANRVVPHESNS-HLFDHVYIDAPQR-PPVHSAIPALL 480
Query: 481 AAQEERQYGDVRTRCCLNSVPQVHSLNGKSVDHLINPFNGAAALGSITSKVPS-SLSENP 540
AAQ+ERQYG VRT+ N P H+ NGKSVDHL+NP NG A LGS+TS VP+ +L+EN
Sbjct: 481 AAQDERQYGQVRTQXGSN-FPGAHTFNGKSVDHLVNPINGVANLGSMTSTVPTFTLTENG 540
Query: 541 VSRFLNIAESSAKDNRFPFPNGEQSAVAYKEKGVNDGFFCLPLNSKGELIQLNSGLINRF 600
V R N+AESSAKDNR PFPN EQ AVAYKEKG+NDGFFCLPLNSKGELIQLNSGL+NR+
Sbjct: 541 VGRLFNLAESSAKDNRGPFPNLEQRAVAYKEKGMNDGFFCLPLNSKGELIQLNSGLVNRY 600
Query: 601 DQMNDTGNIIACSNRIPACSLVLPRS-RDYFVDNEKLLVDTELTGNQLTLFPLHSHMQEY 660
DQMN+ N +ACS+RIP C LV PRS RDYF+DNEK+L+DTELT NQLTLFPLHS MQE
Sbjct: 601 DQMNEARNNMACSSRIPVCGLVQPRSTRDYFIDNEKVLIDTELTENQLTLFPLHS-MQEN 660
Query: 661 QNRYLPAGFDVAEPGTSETADIRLMNSERGNETGRFFHPNLMDSPFNRCRYYGKLQNQNV 720
+N+YL A FDV EPGTS DIRL+NSERG ++G H NLMD+PFNRCRYYGKL NQNV
Sbjct: 661 RNQYLSARFDVTEPGTSGETDIRLLNSERGTDSGSLLHSNLMDAPFNRCRYYGKLHNQNV 720
Query: 721 STQFYPENSSSMCANPGRQTMRLMGKDVAVGGNGQEVQEPEVINFWKNSNFIGNCLTNPI 780
ST+ YPENSS+M ANP RQTMRLMGKDVAVGGNG+EVQEPE INFWKNS+ I NCLTN I
Sbjct: 721 STEIYPENSSTMSANPARQTMRLMGKDVAVGGNGKEVQEPEGINFWKNSSLIENCLTNSI 780
Query: 781 QETHMRKRNFLQDRELHHPSKGETLCYHPAGFYGNQMAQRHLLQ---------------- 840
QE MRKRNFLQDR LH+PSKGETL ++PAGF+ Q+AQ +LL
Sbjct: 781 QENPMRKRNFLQDRVLHYPSKGETL-FYPAGFHSGQVAQSNLLPNAPQVRYPHPRLNRKN 840
Query: 841 -MLHKFND----------NIHGFSPLLTNTFNMARNFQAPFISGLETQRFGSHPSAFSTS 900
++++ +D NI+ F P T FNMA NFQAPFISG T RFG P AFSTS
Sbjct: 841 GVMYQRSDSVINLNERFSNIYAFFPSSTEAFNMAPNFQAPFISGPRTLRFGPQPPAFSTS 900
Query: 901 HHMCPNSYQNSFELGFNQNLHPAKLGTFNFPFLQPDDENHVLSCSQMMKIMSSSLGLTLL 960
HMC N Y++SFELG+NQN HPAKLGTFNFPFLQPDDENHV
Sbjct: 901 QHMCSNRYEHSFELGYNQNPHPAKLGTFNFPFLQPDDENHV------------------- 960
Query: 961 ELSPWMLHDHQREEAPIANSKLADINGYYCPCIPSGSDVLISPSSMHQRLETAYPCSTMP 1020
W+ Q++EAP A SKLADING Y P I SG DVL SP SM R E A+PCSTMP
Sbjct: 961 -PPSWL----QQDEAPTATSKLADINGCYYPFISSGPDVLTSP-SMRTRPEAAFPCSTMP 1020
Query: 1021 YSHLQMKNHIPGSTSFFQPIPVGPRVLQSPISNAGHQIRMSS-EDRLKFNSLSVKDSDFS 1080
SH Q+KN IPGSTS FQPIPV PR I AGH+ R+S EDRLKF +LSVKD+D
Sbjct: 1021 -SHRQVKN-IPGSTSIFQPIPVTPRFEVPYIVKAGHESRISCFEDRLKFKTLSVKDTDLL 1080
Query: 1081 SKTQPALELVNSRKRQRVSSLEMNNSGVVPEWTRGKFSDDHLQSYSGTVKIHANWDKAVN 1140
SK QP EL++SRKRQ++ SLE NNSGVV EWT GKF+D+ +S G+ KIH NWDKAVN
Sbjct: 1081 SKKQPVGELIDSRKRQKLLSLETNNSGVVAEWTPGKFNDEQ-RSNPGSAKIHGNWDKAVN 1140
Query: 1141 SVGNNIPNMTQTTDEVMISTNNNEAPKVECMARSGPIKLTAGAKHILKPSQSMDLDNTKP 1180
N+PN+T+ TD V++ + NE+PKVE MARSGP+KLTAGAKHILKPSQSMDLDNTKP
Sbjct: 1141 PT-XNLPNVTE-TDGVLLISPTNESPKVESMARSGPVKLTAGAKHILKPSQSMDLDNTKP 1144
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038888639.1 | 0.0e+00 | 79.39 | uncharacterized protein LOC120078436 [Benincasa hispida]
XP_011657559.1 | 0.0e+00 | 78.11 | uncharacterized protein LOC105435872 [Cucumis sativus] >KGN47991.1 hypothetical ...
XP_008449514.1 | 0.0e+00 | 77.52 | PREDICTED: uncharacterized protein LOC103491377 [Cucumis melo] >KAA0061673.1 put...
XP_022148072.1 | 0.0e+00 | 63.95 | uncharacterized protein LOC111016842 isoform X1 [Momordica charantia]
XP_022148073.1 | 0.0e+00 | 63.47 | uncharacterized protein LOC111016842 isoform X2 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A0A0KJS6 | 0.0e+00 | 78.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1
A0A5D3DCZ7 | 0.0e+00 | 77.52 | Putative Zinc finger, Rad18-type OS=Cucumis melo var. makuwa OX=1194695 GN=E5676...
A0A1S3BM77 | 0.0e+00 | 77.52 | uncharacterized protein LOC103491377 OS=Cucumis melo OX=3656 GN=LOC103491377 PE=...
A0A6J1D428 | 0.0e+00 | 63.95 | uncharacterized protein LOC111016842 isoform X1 OS=Momordica charantia OX=3673 G...
A0A6J1D325 | 0.0e+00 | 63.47 | uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 G...
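For readers who want to work with the summary tables programmatically, the following is a minimal sketch of reading the pipe-delimited rows into records. It assumes the column order shown in the tables (match name, E-value, identity, description); the `Hit` type and `parse_hits` function are hypothetical names introduced for this example, and truncated descriptions are kept exactly as they appear.

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of a summary table (hypothetical record type for this sketch)."""
    name: str
    evalue: float
    identity: float  # percent identity as printed in the table
    description: str


def parse_hits(text: str) -> List[Hit]:
    """Parse pipe-delimited summary rows, skipping headers and malformed lines."""
    hits = []
    for line in text.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        if len(cells) < 4 or not cells[0]:
            continue
        try:
            evalue = float(cells[1])
            identity = float(cells[2])
        except ValueError:
            # Header rows such as "Match Name | E-value | ..." end up here.
            continue
        hits.append(Hit(cells[0], evalue, identity, cells[3]))
    return hits


SAMPLE = """Match Name | E-value | Identity | Description | |
A0A0A0KJS6 | 0.0e+00 | 78.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G423330 PE=4 SV=1 | [more] |
A0A6J1D325 | 0.0e+00 | 63.47 | uncharacterized protein LOC111016842 isoform X2 OS=Momordica charantia OX=3673 G... | [more] |
"""

if __name__ == "__main__":
    for hit in parse_hits(SAMPLE):
        print(hit.name, hit.identity)
```

Extra trailing cells are ignored, so the same function reads rows with or without a trailing link column.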