Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGACTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTAATCATCGACTTCGCCAGATCAGAGAACTTGATTGAGGATTCCCTTTCATCCTCCCCACCTTCTTCTCCTTCCTCCCTTTCTTCCTCCTCTTACCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTTTGGAGGCCGGTGATATTGACTGCGCCATCGATCTCCTCCGCCTTCATGCACCCTTCATTCTTGACGACCACAGGCTTCTATTCCGGTTGCAGAAGCAGGTTACTCTAATTCTTCTCTCCTCCCCTCTCTCTCTTGCTGATGCTGCCTGCCTAATCCTGAATGTGCAACTTCCTTTTGTTTTTGCGTTTTCTTTTGTTTTTTGTTGTTTTCACTAATTTGGGGGGGGGGGGGGGGGGGGATTTTTCCATTTTTGATCATGCTGCACTAGGGATTGGAAATCAAGAAAAATAGAATCCGTTTGTTTGGTAGCTAAACATCCTAGTCAGCGGAACATCCTTTCATCTATTCTTTTTTCGATTACTGATATTGTGGGACTTTTGAGTTGATTCATTTCCTTTTCTGTGTTTAATTGTGTTCAGAAATTTATTGAGCTTTTGAGGAAAGGGACTCCAGAGTGTCGTGATTTGGCCATTCAATGTCTTCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGTAAACATTCTACACATTCACGGTTTGCTTTCATTTGATGCTTTTAAGAAAATATTCTCGGGTTATCGGATCGAGCAAAGAGTTGTTGTCTTTTTAAAAGAGAGAAAATTTCCGTTAAATTATGGAAAGTACAAGATGAGGTGCCGTAAGCCATCTTTTTTCTACTCTAGTTGGCGTAATAATGAAGTAGCTGTAACTACAAAAAGGTGACAAGAAAAGAGAGAGAGAGAGCACCATAAGAACCCAAAAGTCCAACCATCTCTCAGATAAGGGCACCATGGAATAACTTTTGACTGCATCATTCCAGATAACATCAGGTTTCCCTTCCAACGGATGGCCACATAATATTGTTCCATAATGTCTTGAATCCTTTGCGGAAGAAACCACTGTGCTATTATATCAGATACTGCAACTTTTCTGCTTATGCTTTTGGCTGACATCTCTTTTGTTTATGTTGAAGGAGGCGTATGAGGAGTTCAAGCATGTTCTTCTTGCCTTCATTTATGACAAGGATAATCAAACATCCCCAGTGACATATGAGGTGGATAACACTATTTTCTCTCATATCACATCGGTGAATATTACCTTTGGTATTCTGGATATTTTCAGTGGTTGTTGACTATGCCACCCTTCTGGAAACCCAGTAACATATGGCTGTTCTTGATAATTAGAATGCACATTTAGTATTTGGCTATCTTATATCTCAAACAGTTGTAAAGGCAACCACTTTCCTCTTTTCTTGTAACTACGGAATTATAAGAAGGAAAAGATGCTCCATTAATTTATTGGGATTTTTGGCCACTTTTAAGGTTCACTAAGCAATCATTGGGATGTCTTACGTTGAACATAATTTTTCACCCGACACATTTAAAATGGGTTCTGTTTTGTCTTAGGCCCTCAGGGCATTATATAAGTCCAGTACATATTTTTTTGCCAAGAAAATATGAATTCCCTGGTTCCGGGTGGGTAGGATATGGAATTGTCTTGCATATTGTTGTCAAGGTGCTTGAACAGAATGGGGATAAGAGGTACACACGAAATCCAAATCTTATAAAATAGTTTCTTAATTTGACAAGCATTGGGAAAATTTTGGAGTCGTTCAAATTCTTTTGTTCTTTGATCTTTTGACAGGGCAGAGATGTTATCCTTTGGAAATTCATAAAGAATAGTCTTTTTTTTTTTTTTTTTTTTTTTTTTTTCGGGGAAAAGAAATATATATATATATATTTATAAACATTGTTAATTTTTATTGGAGATGTGAATTCTCGTCTCTTTATAGTTGCTTTCCTTGTTATGTTTATGTTGTGTGCGTTGAAATATGTTGCAAAATTTATCCTTTGGAGTCTGAACTACATAATTCTGTAAGAACTTTCAAATTGGATACGTCTAGTTTTAGGTGCAATATGTTCTTTTTTGGTGGTCTGAAGACTATGAACAGGGTGGAATTGGGAACCGCATTTAATTTATCGTCTGAAAGTGCATTTTCTCATGTTTCAGTGGTGTGAAAGGAGGAGGTTTGATATTGCTGGATTAATGTCCTCAGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCTTCTCAATGACTCTGAGATACTTGATAAGGTTTTTTTCCCTTCATTTTCTTCTTTCTTCTTGCCTCCCTCCCATAGTGTATTGTGAACCTGATAGAAGTTTGCGTTGAATCTAGAATTTAAGTTTATTTATTCTTCTTTGTTCTTTCTTGCAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCTATATCAGATCTCACTGAGAGGTTGCTTCTAGATGAACGTGATCCACCTGCCACACCCAAGGAGAGTCTATATGAAGCACCTCCATTTGATGAGGCGAGATGACCAATCTTTTTAATAAGAATTTAATTATAAAAATTGAGCTTCTAGTTAGTTCCATCTAGTGGTGTGGTACAACGTGATAGTTGGCCCCTATTTGCATGTGTATGGTTACTAGCTTTCAGGAGTTGTTATTTTCTTTGCGTCTTAATTTTTATGTTAATATGAGTATAGGTGGACATTCAAGCTCTTGCACATGCTGTAGAGCTAACAAGGCAGGGGGCAATTGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGGTTAGCATCTCTATGAAGATTCATAATTTTATCCTTTCTGGTCTATGTTCTACTGTTATTAGATTCATTTTATGCAAATCCTTGGGTATAGGTGACTAAACTACTCTACGCTGCTTTGTGTTGCATTAGACAGTGTGAACCAGAACTTTATTGGATGCCGTGCTTGTTATTGATACTAGCATGGGCCACAGATGAAGTGACACTGACTTCATGGATTGGTGAATAGAAATGGTTCTTCATATTGGAATATAAGTTTGGTGGAAAGTGGGAAAATTGAGAAATATAAGGCAAGAGCAAGACCATAGTAAATTTAGTGATGTAAAAGTAATTCCGTTATATTTTTGGAGGCAGTTTGTTGTTTGTTCTGAAAATTGAGTTCATTTGAAAATTTTGTTGAAATTTACCGACAGTAGAATCTTTACTCTTCACCTCTTGATTTTCTCTAACATGCAGAACGGTTCTATCTATTCTGTTGTATTGATATTTACCTTTTGTATAATTATTTAAATATTATTCTGTACATCGTTCTCTTGATTTTTTGTGATTATGGCCTTCAATTCCAATCATCTCGCCAAGTAGATTAGTATTCTTCATTTTGTTTCAGTCTCATGGTCTCATTGATTATGAGTGTTTACATACATTTGGCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCATCTACAGAGGAATTGTGGATTCTGGTCGTGGAGCCCTCTCTGGTGAAACTCAAACTTTTATCCCTTATTTCTTTTTATCAATAAGCTTGTATGCATTTTAAACCTGTATGTCATTTCTTGATAATTAGGAGTGCAGAATATCTCTAGTTCATCAAAAGTTAATCAGTCAGAGCTGGAGTATTGTTCATCTAGGAATTGTTCTTTTGAAGTTGACTATGCAACCAGTAAACTTTCGGATGGTGAAATTTCTGTCAGCAATTCCCGTGTAGATAGTTCTCCTGAAAATATTGCTGATGTGACCAGTTCACAAGGTACTGATATTGAATTACGATATGCATTGGAGCCAACGTCCAATCGAGAAGATTGTAGCACTAGTGATTCAGTTCATGTGGGAAATTCAAGAACATTACAAGTGAACAAGAATCATGGGATTGTAGAAAGGAGCAAGCGAAAGAGATGGAGAGGAAGACACGATGATAGAGAACTTCATGATGTTTCTTACAGTGGGTGCAGTAAACAAGAACTTAGCTCTACAACAGTGGCCGGTACAACCATGCCTAAGGAACAACAGGTACATCCATAACTAATATTTTATTCCATCTAATGGAACAATCTGACTGAACTGTGTTCTGCAGAACCTTGAAAAACATTTACCATTTGAGTCTACTGGCAAGGAGGATAAATATGAGATTGTCTTGGGCATTAGAGAACTGGCAAGTAAAAGATTGGCTGCAGAGGTTGTGGAGGAAATTAACGCCGTGGATCCGAACTTTTTTGCACAAAATCCTATTCTCCTGTTCCAACTTAAGCAGGTACATTGATTGTTACAAGGTTAAGTGTGAGTTTTCTTTTTATACTCGTTAGTCTTACTACACTGTTAGCTAGTCATTATTGATGGATCATGGAAGTTTACATGAGTCTGCTGTTTGATAAGAATGATCGCATATTTCTTTGAAATTTATTTGTCATTCCTATGGCTTCAGGTTGAATTTTTGAAGCTGGTTAGTTCCGGTGATTATTCCAGTGCTTTGAGGGTCGCATGCACTCACTTAGGCCCAATAGCCGCTAATGATCCTTCCTTGTTGAAGCAATTGAAGGAGACTTTGTTGGCTTTGCTCCTGCCCAATGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAATTCTCTTCAGGTATGGAGTAACTTACTAATAATGAAACCCACCCTAGCCCCAATTCATGATTTAACATTTGGGGGGAGACATTGGCCCAAAAATTGTGGGGGCCACTATCTTGTATATGCCTGGTTAGGCCCAAAACTTGTATATGCCGGCATATATTCTATCCAAGCTTAAAAAGTTCTCTTCAATATAAATTATTATCTTAGGTACCAATATTCAACCATTTCTATTGCTGATACCTCTGGATTTGACCCTTACAAACAGGCTAGTAGTATGTTCTCCTATTGAAGATGTGTCATTCAAAGTCAGCGTCAGGATGTTTCCATGAACAGTATTTTAATCAAAATTATTTATGTTAGAAAGAAACTGATTTGAAAGAAATAAGAGGACATTCTAAGGAGAATAGACCAGCCAAGAGACTCTCTTGTTAGTCTCTGCTTTTAGCTGGTGTTCTATTTCAAATTTTCAAGCGAGTTCTGTAATGACTGTTTTGTTCATATGTTTGCCAGTTGGGATGCTTTCTAGTAATTCCCTTGGTTTAAATCCCCTCTCCTTTTTGTTATCCTCTGTTTGAACATGAAATTGAGTTGTAAAAAAAATTGAAATTATTTCAATTATAAGTTCAAGAAGTATTTCTTTGATTCAAATAAACTGGTCTTTGAAATGATTAGTTCACTAAAATGAAAAAGGTTAAGATTACTTAAGAGAGTGCATGCATGTATCCATTGGTAAAGAGAGATTACTAAGAAAAAGGGACTCTTGTAGGCTGTGGACCAAAAGTGAAAAAAAAAGCGTCAAACTTGGAAATTGGAATTTTGCTAAAAGGGATCTATCAAGGATGTTTCAGAACCTTGTCCAGTGTATCATAAACATTTGGATTGACCACATTTTTATTTGGTTGGTTCCCGAGAAAGACTGGGCAAAGATATCAAGGTCATCTAGGTAGCTTAGTCTGTCTTAACCGCTTTTAGAATCTCCTTTCCAATTTCCCATCTGAAGCTATCGTTTTTTATGAGTCCTCTTTTAATCTAAACTGATTGTCCTTCCACCATTCTGGTCTTGGTATTAATTTTGGCATAATTTACTGTTTTAAAAAATTTGATGTTTTTAGAGCTGTTATTGTGACACTCAAGGTTGAAGAGGGGGAGTAAGGAGCACAGGCTTTGAGGTGAAGGGACGCATGGATGAGTTGGCATCACACTAAGATTATAGTGCTTCTGAGATACGGAAAGTATAAGTTTCTTAGAAATCATGTGAGAGAGTACAGATCATGTTGGAATGATCCATTTTGGTTTGTGGAGATACTCTTCAATCTCAATAGCCGCTCAAGACTCATTTGGTTGAGCTCAAAACATTTCGTGGCAATTCCAGATTATAGGGAATGTTGCAGCCTCAAAATTAGGTGCCAAATTCAGATTCTAAATATTGGGCCTAGGGCATTATAGCTGTGCGTTGAAAGCAATCTTTGGATTTCAACATCCAAATACCTTTTTAGATTTTTAACAGTTTTTAGATACAGGTTCATTTTTCTTGTTTGTCAAATGCTTCCATTTGTAGACCACAATCCATATGCCCCCCTTTTCTATTTCATACTTCCTAAATATAATATAAGATAGTCATATATGGGAGGAGGTAAAAGAGAATGTGATCATGGTATTAGAATATTGACTTAATAGGGGGGGGGGGGGGATAAAGCTTGTAATATCCATAATTCTCTATCAGGAAGAGAACAAGATACACTGATCTGCAAATATGCAGCAAGATGACCAAAAAATTTTACATTTTCGGCAGGTTGCTTTTGGTAAGAGACTTGGTATTGAAGAGCCACAACTAATGAAGTTGATGAGAGCCACACTTCACTCTCATAGTGAATGGTTTAAACTTCAAATGTGCAAGGATCGGTTTGAAGGTCTTTTAAAGATTGATTCATTGAAGGAAGTCAATCCGCCTTTGCTTTCTACTACCGCTGGGCTATTGAAATCAAATTCAGATAGTTGCAGCCATGGTTCTTCCCAAGTCACAAAATCTTCGGGTGCAAGAACCTCAGAAGATGGTAGCAGTCCCACGCAAGCATCATCTAGAGATGCATGTGACGAAAATGCAATACTTAAAGTCATGGTAAGTAATTTCAACTTGAAAAATGAACGTCTATCATATCCTGAATGGTCTGGTCTCTGAACAAATTATGTTTGGTAGAAAATGCGTTTTTCTATAGTCCTCCTGTTCTAGGTTCTCTTCATATTTGTGATGCGAAATTGTTGTTCATGGATTCAGGAGTTTCTTGCCTTGCCCAGGGCTGATGCCATTCATCTTCTTGCACAGTATAATGGAAATGCAGAGATGGTCATACAACAAATATTTGCATGA
mRNA sequence
ATGGACTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTAATCATCGACTTCGCCAGATCAGAGAACTTGATTGAGGATTCCCTTTCATCCTCCCCACCTTCTTCTCCTTCCTCCCTTTCTTCCTCCTCTTACCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTTTGGAGGCCGGTGATATTGACTGCGCCATCGATCTCCTCCGCCTTCATGCACCCTTCATTCTTGACGACCACAGGCTTCTATTCCGGTTGCAGAAGCAGAAATTTATTGAGCTTTTGAGGAAAGGGACTCCAGAGTGTCGTGATTTGGCCATTCAATGTCTTCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGAGGCGTATGAGGAGTTCAAGCATGTTCTTCTTGCCTTCATTTATGACAAGGATAATCAAACATCCCCAGTGACATATGAGTGGTGTGAAAGGAGGAGGTTTGATATTGCTGGATTAATGTCCTCAGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCTTCTCAATGACTCTGAGATACTTGATAAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCTATATCAGATCTCACTGAGAGGTTGCTTCTAGATGAACGTGATCCACCTGCCACACCCAAGGAGAGTCTATATGAAGCACCTCCATTTGATGAGGTGGACATTCAAGCTCTTGCACATGCTGTAGAGCTAACAAGGCAGGGGGCAATTGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCATCTACAGAGGAATTGTGGATTCTGGTCGTGGAGCCCTCTCTGGAGTGCAGAATATCTCTAGTTCATCAAAAGTTAATCAGTCAGAGCTGGAGTATTGTTCATCTAGGAATTGTTCTTTTGAAGTTGACTATGCAACCAGTAAACTTTCGGATGGTGAAATTTCTGTCAGCAATTCCCGTGTAGATAGTTCTCCTGAAAATATTGCTGATGTGACCAGTTCACAAGGTACTGATATTGAATTACGATATGCATTGGAGCCAACGTCCAATCGAGAAGATTGTAGCACTAGTGATTCAGTTCATGTGGGAAATTCAAGAACATTACAAGTGAACAAGAATCATGGGATTGTAGAAAGGAGCAAGCGAAAGAGATGGAGAGGAAGACACGATGATAGAGAACTTCATGATGTTTCTTACAGTGGGTGCAGTAAACAAGAACTTAGCTCTACAACAGTGGCCGGTACAACCATGCCTAAGGAACAACAGAACCTTGAAAAACATTTACCATTTGAGTCTACTGGCAAGGAGGATAAATATGAGATTGTCTTGGGCATTAGAGAACTGGCAAGTAAAAGATTGGCTGCAGAGGTTGTGGAGGAAATTAACGCCGTGGATCCGAACTTTTTTGCACAAAATCCTATTCTCCTGTTCCAACTTAAGCAGGTTGAATTTTTGAAGCTGGTTAGTTCCGGTGATTATTCCAGTGCTTTGAGGGTCGCATGCACTCACTTAGGCCCAATAGCCGCTAATGATCCTTCCTTGTTGAAGCAATTGAAGGAGACTTTGTTGGCTTTGCTCCTGCCCAATGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAATTCTCTTCAGGTTGCTTTTGGTAAGAGACTTGGTATTGAAGAGCCACAACTAATGAAGTTGATGAGAGCCACACTTCACTCTCATAGTGAATGGTTTAAACTTCAAATGTGCAAGGATCGGTTTGAAGGTCTTTTAAAGATTGATTCATTGAAGGAAGTCAATCCGCCTTTGCTTTCTACTACCGCTGGGCTATTGAAATCAAATTCAGATAGTTGCAGCCATGGTTCTTCCCAAGTCACAAAATCTTCGGGTGCAAGAACCTCAGAAGATGGTAGCAGTCCCACGCAAGCATCATCTAGAGATGCATGTGACGAAAATGCAATACTTAAAGTCATGGAGTTTCTTGCCTTGCCCAGGGCTGATGCCATTCATCTTCTTGCACAGTATAATGGAAATGCAGAGATGGTCATACAACAAATATTTGCATGA
Coding sequence (CDS)
ATGGACTCCACCCCTTTGAACTGGGAAGCTCTCGATGCCCTAATCATCGACTTCGCCAGATCAGAGAACTTGATTGAGGATTCCCTTTCATCCTCCCCACCTTCTTCTCCTTCCTCCCTTTCTTCCTCCTCTTACCATTCCAGGTTGATCATCCGCCAGATCAGGCGCTCTTTGGAGGCCGGTGATATTGACTGCGCCATCGATCTCCTCCGCCTTCATGCACCCTTCATTCTTGACGACCACAGGCTTCTATTCCGGTTGCAGAAGCAGAAATTTATTGAGCTTTTGAGGAAAGGGACTCCAGAGTGTCGTGATTTGGCCATTCAATGTCTTCGGACGGCGCTCGCTCCTTGTGCTCTTGATGCATACCCGGAGGCGTATGAGGAGTTCAAGCATGTTCTTCTTGCCTTCATTTATGACAAGGATAATCAAACATCCCCAGTGACATATGAGTGGTGTGAAAGGAGGAGGTTTGATATTGCTGGATTAATGTCCTCAGTCCTACGAGCTCATATGCAGGCATATGATCCAGTCTTCTCAATGACTCTGAGATACTTGATAAGCATACATAAAGGTTTTTGCTTTCGTGAAGGTGTGTCATCTCCTATATCAGATCTCACTGAGAGGTTGCTTCTAGATGAACGTGATCCACCTGCCACACCCAAGGAGAGTCTATATGAAGCACCTCCATTTGATGAGGTGGACATTCAAGCTCTTGCACATGCTGTAGAGCTAACAAGGCAGGGGGCAATTGATAGCTTGAGATTCACCAAAGGTGATTTGTTTCATGCATTCCAGAATGAGTTGTGCCGGATGAAATTGGACCTTTCTGTACTTGATGAGCTTGTTCGTGAGTATTGCATCTACAGAGGAATTGTGGATTCTGGTCGTGGAGCCCTCTCTGGAGTGCAGAATATCTCTAGTTCATCAAAAGTTAATCAGTCAGAGCTGGAGTATTGTTCATCTAGGAATTGTTCTTTTGAAGTTGACTATGCAACCAGTAAACTTTCGGATGGTGAAATTTCTGTCAGCAATTCCCGTGTAGATAGTTCTCCTGAAAATATTGCTGATGTGACCAGTTCACAAGGTACTGATATTGAATTACGATATGCATTGGAGCCAACGTCCAATCGAGAAGATTGTAGCACTAGTGATTCAGTTCATGTGGGAAATTCAAGAACATTACAAGTGAACAAGAATCATGGGATTGTAGAAAGGAGCAAGCGAAAGAGATGGAGAGGAAGACACGATGATAGAGAACTTCATGATGTTTCTTACAGTGGGTGCAGTAAACAAGAACTTAGCTCTACAACAGTGGCCGGTACAACCATGCCTAAGGAACAACAGAACCTTGAAAAACATTTACCATTTGAGTCTACTGGCAAGGAGGATAAATATGAGATTGTCTTGGGCATTAGAGAACTGGCAAGTAAAAGATTGGCTGCAGAGGTTGTGGAGGAAATTAACGCCGTGGATCCGAACTTTTTTGCACAAAATCCTATTCTCCTGTTCCAACTTAAGCAGGTTGAATTTTTGAAGCTGGTTAGTTCCGGTGATTATTCCAGTGCTTTGAGGGTCGCATGCACTCACTTAGGCCCAATAGCCGCTAATGATCCTTCCTTGTTGAAGCAATTGAAGGAGACTTTGTTGGCTTTGCTCCTGCCCAATGAAGATATTCTTGGGAAAGGCTTCCCTATAAATGCTCTTGCTAATTCTCTTCAGGTTGCTTTTGGTAAGAGACTTGGTATTGAAGAGCCACAACTAATGAAGTTGATGAGAGCCACACTTCACTCTCATAGTGAATGGTTTAAACTTCAAATGTGCAAGGATCGGTTTGAAGGTCTTTTAAAGATTGATTCATTGAAGGAAGTCAATCCGCCTTTGCTTTCTACTACCGCTGGGCTATTGAAATCAAATTCAGATAGTTGCAGCCATGGTTCTTCCCAAGTCACAAAATCTTCGGGTGCAAGAACCTCAGAAGATGGTAGCAGTCCCACGCAAGCATCATCTAGAGATGCATGTGACGAAAATGCAATACTTAAAGTCATGGAGTTTCTTGCCTTGCCCAGGGCTGATGCCATTCATCTTCTTGCACAGTATAATGGAAATGCAGAGATGGTCATACAACAAATATTTGCATGA
Protein sequence
MDSTPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSGVQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQGTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELHDVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAAEVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEWFKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDGSSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Homology
BLAST of HG10016736 vs. NCBI nr
Match:
XP_038882577.1 (uncharacterized protein LOC120073801 [Benincasa hispida])
HSP 1 Score: 1326.2 bits (3431), Expect = 0.0e+00
Identity = 683/713 (95.79%), Postives = 693/713 (97.19%), Query Frame = 0
Query: 1 MDSTPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEA 60
MDSTPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEA
Sbjct: 1 MDSTPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEA 60
Query: 61 GDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCAL 120
GDID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTP RDLAIQCLRTALAPCAL
Sbjct: 61 GDIDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPGDRDLAIQCLRTALAPCAL 120
Query: 121 DAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFS 180
DAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFS
Sbjct: 121 DAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFS 180
Query: 181 MTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALA 240
MTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALA
Sbjct: 181 MTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALA 240
Query: 241 HAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGAL 300
HAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG GAL
Sbjct: 241 HAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGHGAL 300
Query: 301 SGVQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTS 360
SG+QN S SSKVNQ+ELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPEN+ADV S
Sbjct: 301 SGMQNFSGSSKVNQAELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENVADVAS 360
Query: 361 SQGTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRE 420
SQGTDIELRYALEP SNREDCSTSDS+HVGNSRTLQVNKN GIVERSKRKRWRGRHDD E
Sbjct: 361 SQGTDIELRYALEPISNREDCSTSDSIHVGNSRTLQVNKNRGIVERSKRKRWRGRHDDGE 420
Query: 421 LHDVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRL 480
LHD+SY G KQELSSTTV+GTT+ KEQQNLEKHLP ESTGKEDKYEIVLGIRELASKRL
Sbjct: 421 LHDISYCGFGKQELSSTTVSGTTISKEQQNLEKHLPLESTGKEDKYEIVLGIRELASKRL 480
Query: 481 AAEVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPS 540
AAEVVEEINAVDP FFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPS
Sbjct: 481 AAEVVEEINAVDPYFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPS 540
Query: 541 LLKQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHS 600
LLKQLKETLLALL+PNEDILGKGFPINALANSLQVAFG+RLGIEEPQLMKLMRATLHSHS
Sbjct: 541 LLKQLKETLLALLVPNEDILGKGFPINALANSLQVAFGRRLGIEEPQLMKLMRATLHSHS 600
Query: 601 EWFKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSE 660
EWFKLQMCKDRFE LLKID LKEVNPPLLSTT GLLKSNSDSCSHGSSQVTKSSGARTSE
Sbjct: 601 EWFKLQMCKDRFESLLKIDLLKEVNPPLLSTTTGLLKSNSDSCSHGSSQVTKSSGARTSE 660
Query: 661 DGSSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
DGSSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 661 DGSSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 713
BLAST of HG10016736 vs. NCBI nr
Match:
XP_008440269.1 (PREDICTED: uncharacterized protein LOC103484770 isoform X1 [Cucumis melo])
HSP 1 Score: 1302.0 bits (3368), Expect = 0.0e+00
Identity = 678/711 (95.36%), Postives = 685/711 (96.34%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSSSK NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSSKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDSVHVGNSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELSA-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPSLL
Sbjct: 485 EVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 709
BLAST of HG10016736 vs. NCBI nr
Match:
XP_011657859.1 (uncharacterized protein LOC101218546 isoform X1 [Cucumis sativus])
HSP 1 Score: 1302.0 bits (3368), Expect = 0.0e+00
Identity = 677/711 (95.22%), Postives = 686/711 (96.48%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG+LSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGSLSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSS K NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSLKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDS+HVGNSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSIHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELST-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSAL+VACTHLGP+AANDPSLL
Sbjct: 485 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALKVACTHLGPLAANDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 709
BLAST of HG10016736 vs. NCBI nr
Match:
TYK12895.1 (CLTH domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1299.3 bits (3361), Expect = 0.0e+00
Identity = 676/711 (95.08%), Postives = 684/711 (96.20%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSSSK NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSSKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDS+HV NSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSIHVANSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELSA-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPSLL
Sbjct: 485 EVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 709
BLAST of HG10016736 vs. NCBI nr
Match:
XP_008440270.1 (PREDICTED: uncharacterized protein LOC103484770 isoform X2 [Cucumis melo])
HSP 1 Score: 1295.4 bits (3351), Expect = 0.0e+00
Identity = 677/711 (95.22%), Postives = 684/711 (96.20%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YP AYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YP-AYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSSSK NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSSKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDSVHVGNSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELSA-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPSLL
Sbjct: 485 EVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 708
BLAST of HG10016736 vs. ExPASy TrEMBL
Match:
A0A0A0KGB9 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490820 PE=4 SV=1)
HSP 1 Score: 1302.0 bits (3368), Expect = 0.0e+00
Identity = 677/711 (95.22%), Postives = 686/711 (96.48%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRG+LSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGSLSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSS K NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSLKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDS+HVGNSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSIHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELST-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSAL+VACTHLGP+AANDPSLL
Sbjct: 485 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALKVACTHLGPLAANDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 709
BLAST of HG10016736 vs. ExPASy TrEMBL
Match:
A0A1S3B1B9 (uncharacterized protein LOC103484770 isoform X1 OS=Cucumis melo OX=3656 GN=LOC103484770 PE=4 SV=1)
HSP 1 Score: 1302.0 bits (3368), Expect = 0.0e+00
Identity = 678/711 (95.36%), Postives = 685/711 (96.34%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSSSK NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSSKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDSVHVGNSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELSA-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPSLL
Sbjct: 485 EVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 709
BLAST of HG10016736 vs. ExPASy TrEMBL
Match:
A0A5D3CMW8 (CLTH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold255G004890 PE=4 SV=1)
HSP 1 Score: 1299.3 bits (3361), Expect = 0.0e+00
Identity = 676/711 (95.08%), Postives = 684/711 (96.20%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSSSK NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSSKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDS+HV NSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSIHVANSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELSA-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPSLL
Sbjct: 485 EVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 709
BLAST of HG10016736 vs. ExPASy TrEMBL
Match:
A0A1S3B1G5 (uncharacterized protein LOC103484770 isoform X2 OS=Cucumis melo OX=3656 GN=LOC103484770 PE=4 SV=1)
HSP 1 Score: 1295.4 bits (3351), Expect = 0.0e+00
Identity = 677/711 (95.22%), Postives = 684/711 (96.20%), Query Frame = 0
Query: 3 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGD 62
STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAG
Sbjct: 5 STPLNWEALDALIIDFARSENLIEDSLSSSPPSSPSSLSSSSYHSRLIIRQIRRSLEAGH 64
Query: 63 IDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALAPCALDA 122
ID AIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPE RDLAIQCLRTALAPCALDA
Sbjct: 65 IDSAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPEDRDLAIQCLRTALAPCALDA 124
Query: 123 YPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 182
YP AYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT
Sbjct: 125 YP-AYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYDPVFSMT 184
Query: 183 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 242
LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA
Sbjct: 185 LRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDIQALAHA 244
Query: 243 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 302
VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG
Sbjct: 245 VELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSGRGALSG 304
Query: 303 VQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIADVTSSQ 362
+QN+SSSSK NQSE EYC SRNCSFEVDY TSKLSDGEISVSNSRVDSSPEN ADVTSSQ
Sbjct: 305 MQNLSSSSKANQSEQEYC-SRNCSFEVDYTTSKLSDGEISVSNSRVDSSPENTADVTSSQ 364
Query: 363 GTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRHDDRELH 422
GTDIELRYA EPTSNREDCSTSDSVHVGNSR LQVNKN GIVERSKRKRWRGR DD ELH
Sbjct: 365 GTDIELRYASEPTSNREDCSTSDSVHVGNSRMLQVNKNRGIVERSKRKRWRGRLDDTELH 424
Query: 423 DVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELASKRLAA 482
DVSYSGCSKQELS+ TTM KEQQNLEKH+P ESTGKEDKYEIVLGIRELASKR AA
Sbjct: 425 DVSYSGCSKQELSA-----TTMSKEQQNLEKHIPVESTGKEDKYEIVLGIRELASKRFAA 484
Query: 483 EVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAANDPSLL 542
EVVEEINAVDPNFF+QNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGP+A NDPSLL
Sbjct: 485 EVVEEINAVDPNFFSQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPLATNDPSLL 544
Query: 543 KQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATLHSHSEW 602
KQLKETLLALLLP EDILGKGFPINALANSLQVA G+RLGIEEPQLMKLMRATLHSHSEW
Sbjct: 545 KQLKETLLALLLPKEDILGKGFPINALANSLQVAVGRRLGIEEPQLMKLMRATLHSHSEW 604
Query: 603 FKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGARTSEDG 662
FKLQMCKDRFEGLLKID LKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKS GARTSEDG
Sbjct: 605 FKLQMCKDRFEGLLKIDLLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSLGARTSEDG 664
Query: 663 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 665 SSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 708
BLAST of HG10016736 vs. ExPASy TrEMBL
Match:
A0A6J1BXE9 (uncharacterized protein LOC111005585 isoform X1 OS=Momordica charantia OX=3673 GN=LOC111005585 PE=4 SV=1)
HSP 1 Score: 1287.3 bits (3330), Expect = 0.0e+00
Identity = 666/717 (92.89%), Postives = 684/717 (95.40%), Query Frame = 0
Query: 1 MDSTPLNWEALDALIIDFARSENLIEDSLSSSPPS----SPSSLSSSSYHSRLIIRQIRR 60
MDS PLNWEALDALIIDFARSENLIEDS SSSPPS SPSSLSSSSYHSRLIIR IRR
Sbjct: 1 MDSAPLNWEALDALIIDFARSENLIEDSFSSSPPSSPSPSPSSLSSSSYHSRLIIRHIRR 60
Query: 61 SLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQCLRTALA 120
SLEAG ID AI LLRLHAPFILDDHRLLFRL KQKFIELLRKGT E RDLAIQCLRTALA
Sbjct: 61 SLEAGHIDSAIHLLRLHAPFILDDHRLLFRLHKQKFIELLRKGTAEDRDLAIQCLRTALA 120
Query: 121 PCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRAHMQAYD 180
PCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEW ERRRFDIAGLMSSVLRAHMQAYD
Sbjct: 121 PCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWSERRRFDIAGLMSSVLRAHMQAYD 180
Query: 181 PVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDI 240
PVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDI
Sbjct: 181 PVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPPFDEVDI 240
Query: 241 QALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG 300
QALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG
Sbjct: 241 QALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYRGIVDSG 300
Query: 301 RGALSGVQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDSSPENIA 360
RG L G+QN+SSSSK+NQSELEYCSSRNCSFEVD+ATSKLSDGEISV NSRVDSSPENIA
Sbjct: 301 RGPLPGMQNLSSSSKINQSELEYCSSRNCSFEVDHATSKLSDGEISVGNSRVDSSPENIA 360
Query: 361 DVTSSQGTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRKRWRGRH 420
DVTSSQGTDIELRYA EPT+NREDCSTSDS+HVGNSRTLQVNKN GIVERSKRKRWRGRH
Sbjct: 361 DVTSSQGTDIELRYAFEPTTNREDCSTSDSIHVGNSRTLQVNKNRGIVERSKRKRWRGRH 420
Query: 421 DDRELHDVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVLGIRELA 480
DDR LHDVSYSGCSKQELS+ TVA T+ K+QQNLEK LP EST KEDKYEIVLGIRE+A
Sbjct: 421 DDRGLHDVSYSGCSKQELSTATVASITISKDQQNLEKQLPLESTCKEDKYEIVLGIREMA 480
Query: 481 SKRLAAEVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTHLGPIAA 540
SKRLAAEVVEEINA+DPNFF QNPILLFQLKQVEF KLVS+GDYSS LRVACTHLGP+AA
Sbjct: 481 SKRLAAEVVEEINALDPNFFMQNPILLFQLKQVEFFKLVSTGDYSSGLRVACTHLGPLAA 540
Query: 541 NDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMKLMRATL 600
NDPSLLKQLKETLLALLLPNED+LGKGFPINALANSLQVAFG+RLGIEEPQLMKLMRATL
Sbjct: 541 NDPSLLKQLKETLLALLLPNEDLLGKGFPINALANSLQVAFGRRLGIEEPQLMKLMRATL 600
Query: 601 HSHSEWFKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQVTKSSGA 660
HSHSEWFKLQMCKDRFE LLKIDSLKEVNPPLLST++GLLKSNSDSC+ GSSQVTKSSGA
Sbjct: 601 HSHSEWFKLQMCKDRFESLLKIDSLKEVNPPLLSTSSGLLKSNSDSCTLGSSQVTKSSGA 660
Query: 661 RTSEDGSSPTQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 714
RTSEDGSSP QASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA
Sbjct: 661 RTSEDGSSPIQASSRDACDENAILKVMEFLALPRADAIHLLAQYNGNAEMVIQQIFA 717
BLAST of HG10016736 vs. TAIR 10
Match:
AT5G66810.1 (CONTAINS InterPro DOMAIN/s: CTLH, C-terminal LisH motif (InterPro:IPR006595); BEST Arabidopsis thaliana protein match is: LisH and RanBPM domains containing protein (TAIR:AT1G61150.1); Has 333 Blast hits to 242 proteins in 88 species: Archae - 0; Bacteria - 0; Metazoa - 104; Fungi - 47; Plants - 152; Viruses - 0; Other Eukaryotes - 30 (source: NCBI BLink). )
HSP 1 Score: 802.7 bits (2072), Expect = 2.3e-232
Identity = 453/729 (62.14%), Postives = 541/729 (74.21%), Query Frame = 0
Query: 1 MDSTPLNWEALDALIIDFARSENLIEDSLSS-----SPPSSPS-----SLSSSSYHSRLI 60
MDSTP+NWEALDALIIDF SENL+ED+ ++ SP SSPS S+SSSSYHSRLI
Sbjct: 57 MDSTPVNWEALDALIIDFVSSENLVEDAAAAVNSPPSPLSSPSSSSSPSISSSSYHSRLI 116
Query: 61 IRQIRRSLEAGDIDCAIDLLRLHAPFILDDHRLLFRLQKQKFIELLRKGTPECRDLAIQC 120
IR+IR S+E+GDI+ AID+LR HAPF+LDDHR+LFRLQKQKFIELLRKGT E AI C
Sbjct: 117 IRRIRSSIESGDIETAIDILRSHAPFVLDDHRILFRLQKQKFIELLRKGTHEA---AIDC 176
Query: 121 LRTALAPCALDAYPEAYEEFKHVLLAFIYDKDNQTSPVTYEWCERRRFDIAGLMSSVLRA 180
LRT +APCALDAYPEAYEEFKHVLLA IYDKD+QTSPV EW E+RR+++AGLMSSVLRA
Sbjct: 177 LRTCVAPCALDAYPEAYEEFKHVLLALIYDKDDQTSPVANEWAEKRRYEMAGLMSSVLRA 236
Query: 181 HMQAYDPVFSMTLRYLISIHKGFCFREGVSSPISDLTERLLLDERDPPATPKESLYEAPP 240
+QAYDPVFSMTLRYLISIHKGFCF +G+SS +SDLT RLLL+ERD PATP ES+YE PP
Sbjct: 237 SLQAYDPVFSMTLRYLISIHKGFCFHQGISSAVSDLTHRLLLEERDAPATPIESMYEVPP 296
Query: 241 FDEVDIQALAHAVELTRQGAIDSLRFTKGDLFHAFQNELCRMKLDLSVLDELVREYCIYR 300
FDEVDIQALAHAVELTRQGA+DS++F KGDLF AFQNELCRM+LD+SVLDELV+EYCIYR
Sbjct: 297 FDEVDIQALAHAVELTRQGAVDSMKFAKGDLFQAFQNELCRMRLDVSVLDELVKEYCIYR 356
Query: 301 GIVDSGRGALSGVQNISSSSKVNQSELEYCSSRNCSFEVDYATSKLSDGEISVSNSRVDS 360
GIVD S +Q I+ +K NQSE+ SR+CS E+D TS+ SD E + S +D
Sbjct: 357 GIVD------SEMQMITIPAKRNQSEVGRSLSRDCSSEIDLNTSQHSDIENYSNKSMLDG 416
Query: 361 SPENIADVTSSQGTDIELRYALEPTSNREDCSTSDSVHVGNSRTLQVNKNHGIVERSKRK 420
S +++ +G D+ RY EPTS EDCSTS S N+R L ++H E +KRK
Sbjct: 417 SLTYDTEMSCEEGGDVGTRYGSEPTSVCEDCSTSWSNQCENTRALLRIRSHMNSEGNKRK 476
Query: 421 RWRGRHDDRELHDVSYSGCSKQELSSTTVAGTTMPKEQQNLEKHLPFESTGKEDKYEIVL 480
RW GR + + C + + + +GT P EDKYEI L
Sbjct: 477 RWCGRTAEMD--------CLPRISFANSESGTN------------PI-----EDKYEIAL 536
Query: 481 GIRELASKRLAAEVVEEINAVDPNFFAQNPILLFQLKQVEFLKLVSSGDYSSALRVACTH 540
++EL S+ +AAE EI+ +DP+FF QNP LLF LKQVEFLKLVS+GD++ AL+VAC H
Sbjct: 537 ALKELVSRGMAAEAFSEISTMDPDFFTQNPGLLFHLKQVEFLKLVSAGDHNGALKVACFH 596
Query: 541 LGPIAANDPSLLKQLKETLLALLLPNEDILGKGFPINALANSLQVAFGKRLGIEEPQLMK 600
LGP+AAND SLLK LKETLL LL P+ GK P+N LAN+LQV+ G RLGIEEP+LMK
Sbjct: 597 LGPLAANDQSLLKTLKETLLVLLQPDGTAPGKDLPLNDLANTLQVSVGNRLGIEEPKLMK 656
Query: 601 LMRATLHSHSEWFKLQMCKDRFEGLLKIDSLKEVNPPLLSTTAGLLKSNSDSCSHGSSQV 660
+++ATLH+H+EWFKLQMCKDRF LLKIDSLKEVN L+ KS DS ++ SSQV
Sbjct: 657 IIKATLHTHTEWFKLQMCKDRFNNLLKIDSLKEVNTDLIGAIKS--KSKKDSNTNLSSQV 716
Query: 661 -TKSSGARTSEDGSSP-----TQASSRDAC-DENAILKVMEFLALPRADAIHLLAQYNGN 713
T SS TSEDG S TQ R+A +E+AILKVMEFLA+PR+DAI LL+QYNG+
Sbjct: 717 TTTSSSTMTSEDGGSSSLMMMTQTLPREALWEESAILKVMEFLAMPRSDAIQLLSQYNGD 749
The following BLAST results are available for this feature:
Match Name | E-value | Identity | Description | |
XP_038882577.1 | 0.0e+00 | 95.79 | uncharacterized protein LOC120073801 [Benincasa hispida] | [more] |
XP_008440269.1 | 0.0e+00 | 95.36 | PREDICTED: uncharacterized protein LOC103484770 isoform X1 [Cucumis melo] | [more] |
XP_011657859.1 | 0.0e+00 | 95.22 | uncharacterized protein LOC101218546 isoform X1 [Cucumis sativus] | [more] |
TYK12895.1 | 0.0e+00 | 95.08 | CLTH domain-containing protein [Cucumis melo var. makuwa] | [more] |
XP_008440270.1 | 0.0e+00 | 95.22 | PREDICTED: uncharacterized protein LOC103484770 isoform X2 [Cucumis melo] | [more] |
Match Name | E-value | Identity | Description | |
Match Name | E-value | Identity | Description | |
A0A0A0KGB9 | 0.0e+00 | 95.22 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_6G490820 PE=4 SV=1 | [more] |
A0A1S3B1B9 | 0.0e+00 | 95.36 | uncharacterized protein LOC103484770 isoform X1 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A5D3CMW8 | 0.0e+00 | 95.08 | CLTH domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_s... | [more] |
A0A1S3B1G5 | 0.0e+00 | 95.22 | uncharacterized protein LOC103484770 isoform X2 OS=Cucumis melo OX=3656 GN=LOC10... | [more] |
A0A6J1BXE9 | 0.0e+00 | 92.89 | uncharacterized protein LOC111005585 isoform X1 OS=Momordica charantia OX=3673 G... | [more] |
Match Name | E-value | Identity | Description | |
AT5G66810.1 | 2.3e-232 | 62.14 | CONTAINS InterPro DOMAIN/s: CTLH, C-terminal LisH motif (InterPro:IPR006595); BE... | [more] |