Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGTGGGCGGTTCTGGTAGCGTTAATGGTATTCCTTCTCTGTCGTTGTAATTCTCTAAAGGAATGTACAAACACTCCTACTCAACTTGGATCACATACTTTTAGATATGAGCTCTTATCATCACATAATGGAACGTGGAAGGAAGAGATGTTTTCTCACTACCACTTGACACCAACTGATGACTTTGCTTGGTCTAATTTGCTACCAAGAAAGATGTTGAAGGAAGAAAACGAATTTAATTGGGAAATGATGTATAGACAAATGAAGAATAAAGATGGGCTGCAAGTTCCTGGTGGTTTGCTCAAGGAAATGTCTTTACACGACGTACGGTTGGATCCAAACTCGTTACACGGGACGGTTCAGATGACAAATTTGAAGTATCTATTGATGTTGGATGTGGACAGGTTGCTCTGGAGCTTTAGGAAGACAGCTGGTTTGCCTACACCTGGAGAACCATATCTCGGGTGGGAAAAATCAGACTGCGAGCTTCGTGGTCATTTTGTAGGTTAGTCAAGTCTTTTAATCGTGCTTTTACTTGTCCTTATGGTTTTTATTGTTACGTTCATCTTTTCTGGCATGCTTCTGTCTTCTGAGAATATACCATAAGACTAGTCATTGCACAGAGGGCCCATAATAGCTTGGGAAAAGACATTTACCTTTAAGTATTAAGTTTATGAGTTCATTTTGTTCATGTCAACCGTGTCCATTCACTCCATTCTGAACATTTTTTGCTCCACAGTCTCCCTCTCCCCAACCTGAATTCTTATTTCTTGAATGGTTGGTGGGCAGGAAGAGGCAATTAGTCAAAAAGCATGTAGGTTTAGTTTAATTTAAATATGAAGCGACATCTTTGTTACTTCTTTGTTATTTTTCAATGTTAACCAGAGTAGAAGTTCCAAGACCAATACATTGGTAGTGCTTGATGCTCTGCTTTAGCTATGAGAAAAAACTAACTGTTTGATTAGAACTTTAGAGAATTAACAGTTCCTGCAAATATTCACCGCTTTGTTCGTTCTTGTATATTGATTAGTTGAGACTATGGTGTAGACCAAATTTGGAGAGAATTTTATTAATGAGATATTATCTTAAATACAAGACAAAACCTCGTATGTATTGGGGAAATCAAAAGCTAGGCTTACTAACTAATTAATAGGAAAGACCAAACACATCACTATGTCTTTTTTGTTGTGATAGGATTTCAATTTGGGAACTTGTAGGTGTTAAGTTTAAGATGTCACTTGTCAGTTCTCATGCTAATTAAGCTTATACTTCCAATTTCAGAACTTAATCTCAATCTCTACATTTAATCTCTTTCATTTTTTTCTCAAGGAAAGTTGTTTCTATCAAAAAAATCTCTACATCTAAGTTTGGCCACTGCTGAATTTAGAAGTCTAAAATCTCTCTGTTTCTTTCTCCCAATCACACACGTGCACAATCATTGTTCTTGGTCGTTTTCTATTAGGTTAGATTATAAATTTAGTCCCTGGACTTTTAAGTTTGTGTCAAATAGGTCTTTGAACTTCTAAAAGTATCTAATAGGTCTTTAAACTTTCAGTTTTGTGTCTTATAAGACTTTAAACTTTAAAAGCTGTCTAATTAATTACCAAGCTTTCGATCTTGTGTCTAATAAGTCCTTAAGCTTTCAGTTTTATGTCTAATAGGTCCTTTACTTATTTGATATTTTTTAAAATTCATGAACCTATTGTATACAAAATTGAAATTTAATGTGTATGAAACACAAAATTCAATGTTATGATTTGTAGATTTACTAATGTTAAAAAATGTTGGAAATGTCAGGAATCTATTAGACGCAAAATTGAAAGTTCAAAGATCCATTAGATACTTTTTAAAGTTCAATGGTCAATTGGACACAAACATAAAAGTTGAAGGACTAAACTTGTAGTTTAAGCGTTCACTTATCTAGTTTGTGGGAAGGCAATTATGTTCTTATGACTTTGTCTGTTTGATTACTGCTGAATATGGAGGCCTTCTCTCTCTCT
CTCTCTCACACACACACACACACACTCTGACACACGTCCACATATCTATATGTGATCTTGATCTTTTTTGTAATCTGGTTTTTAGAAAGTAGATTATGAACTGCAGATTATGTTTTTGTAGCTTTAGCCTATGTGATCAATTATTGATAGCTTGCTCTTTCAAACTGAAGGACACTACTTAAGTGCGTCGGCCCAAATGTGGGCAAGCACTGGCAACCCTGTTCTTAAAGAGAAGATGTCTGCACTAGTTTCTGGTTTGGCAACTTGTCAGGACAAAATGGGTACAGGATATCTTTCTGCTTTTCCCTCTGAAGAGTTTGATCGTTTTGAAGCTATTCAACCTGTCTGGGCACCATATTACACCATCCACAAGGTAAAGCTTCCCTTAGACTTGGACACAGAATGTCCTAAGAAGTTACCACGTCATTTGATCTTGTCTATTGTGACAGATATTGGCAGGATTATTGGATCAATATACTTTTGCTGGTAATTCTCAAGCCTTAAAAATGGTTACATGGATGGTTGAGTACTTCTACAACCGCGTTCAAAATGTCGTATTGAAGTACACTGTAGAGAAACACTATCGGGCACTTAATGAAGAAACTGGTGGCATGAATGATGTGCTCTACCGGTTGTACAGTATAACAGTATGTGATTCCTCTATCCGTACTCTCATTGGTCTAATGTGATCTATAATGTGAGGGTAATATCCATCAAATTGCATTGTCATTTAATTTAAATTTTCATGCATGTGCTCCAGAACAAAAGGAGGAAACCATCCAAATACTATATGATTCCTTCTACGAGAAAGAAGTCTAACTCTATCCAAGAAGTATACTCTTGAGAAATCTGCGACATGGTCTTGTTGGCTTGATTCGATGAAATTGTTTGGATACCTCACCTATCATCAACATGACACTCATTGGATAATTAACTTGCCTGTGATAATTTCATCTAATCGCAAGACCATGTTGCAGAAATCTGCCACAAATGTCTTTTGTAATCACCTTAAGGTGTTTTGGGGTCTCCCCTAATTTCATTCATGCAATGAAATGTTTCTTATCTATAAAATAAAAAAAACTTGCCTGTGATAATTGATTTTAGCTTGTCATATGGTCTAGGTGTAGTTAGCTATTGATGAGTTTTTGAGTTTTGATGTACAAAGTGTTGGTATCTATGCATAAGAGACTTATGAAGTGTGCTTTGTAAATACATGCAGGGAAATACAAAGCATTTATTACTGGCACACCTTTTTGACAAACCGTGCTTTCTAGGCCTTCTTGCCGTACAGGTATCTATTGCTTGTTCAATCCATAGTTCTACAGTAGAATTAGTGTTTTATTGAACATTCACTGCCGATTATTAGGGACAGAACAAAGCGTTGTGCAATCAAGCTTTTACTATTTTATTAAAAGTGACAATTTTTTATGCTACTTAGATTTGAGAGCAAAAGTTCTAAAACAAGATGGAAACCGTTGGAACTAATTTCCTTGTTCATAAACTAAATCAAAATATTATGTAACAGTGTAAGATCCTGTTTGTTCTGAATGGAATACTTATATCTTGGGGATTCTCTCCTTTGATTATAGGCTGAAGACATTTCGGGTTTCCATACCAACACACATATACCAATTGTTGTTGGATCTCAAATGCGGTATGAAGTCACTGGTGATCCACTGTATAAGGTAAATTTTCTCACATCAATTTGTGTCTGAAATAGATTTTTTAAATACTTTTTCTTTATATGAATATTCTTTATGGTTATTCCAAGTATTATGAGAAACAGAACAGATAACACGTGCATTCCCGACATAGACACATAATTTATGAAGACTGTCTTGTAATACTTGTGAACAATTTTATTGAGCTATTTTAAATTCTTTATTCCAGGAAATATCCACATATTTTATGGACATTGTTAACTCCTCCCATAGCTACGCAACAGGAGGGACATCAGTTCATGAGTTCTGGTATGAACACTCGAGTCTTTGAGTCATTTA
AAATTATGTGCCATGTATTGTCGGACTTTAATGCACATTCATTTGTTTTGTAGTCTGTTCATGAGTTCTGGTACGAACACTTGAATCTTTGAGTACACTTGAGTCTTTAATGCACATTCATTTAAAATTATGCAGTTATATAATCCATGAATATTCAGCCTGAGTGCCAGTCTTTAAAAAATGCTACTGTCTTATGTTTCAAATCTTGCATCCTCCTTAATCATGGTGTTATACAATTAAAAAGTGAGCATTAGGCTCAAGTTAGTTATTTGCTTATTTTGTGCTTCATGTTTTATGAAAATTTTATGAGTTTGTGTATGACTGTTGTTTCTATTCTAACCTTTTCTTCTTTGAAGTTGGCTTCTGTGTCTTTTTTCCATGCACTTTCTACCATCATATTCTTTCCTCCTTTGACCTATGAGCAGGACAGATCCCAAGAGATTGGCAAATACTCTAGGAACGGAGAATGAAGAATCATGTACAACTTACAACATGCTTAAGGTATGACTCTTTCCTTTGGTTATATTGTGTGCAATTATTTGACTTCTCCGCTGTCCTTGTTTGGAAATAAAAGACCCTTATTATTTTGCTTTTCTTCTGGAAAGAAGGATAAGTTAATGAATTTGATAGAACCCAACTATACTGATATGAAAGATAAAAGGAATTACAATAGTGCTCTCCAAGTATTAGGGGTACTTGGACCCTCCCCTCTTGACATATCTCTCAACCCCTAATTCACTATCAAATAATGCCTATCCTCCTCATTTCTCACACCCCTGTTTATAACCAAATACCATAACACACTCCGTATTAGTTTTATGCTAAAATAACACAATTCCTAGCTAATTACTTATATACCCCTAATACTCTAATAATATTTCTAATAGCAATCCTACCAGTAATTGAGTAATATGCTGAAAATATTTCCCCAATCACTTCTTCCTTTTTTTTGTCGACAATTTGTATAGACTTTGATGCATCATCCTGATGGGATGAGGGTTATATTATACACTTCATGTTGTTGCTGTACACACATTTCTAGATGAATACAAACTTTTGTCGTATTTTTTTTTTTTACTCAAGTTGATATTATAGGTTTCTCGCAACCTGTTTAAATGGACCAAGGAAATAGCATACGCCGATTATTATGAGCGGGCTCTTACAAATGGGGTGCTGAGCATCCAAAGAGGAACTGATCCTGGGGTAATGATTTACATGCTTCCATTAGGTAGTGGAAGTTCTAAAGCCAGAAGCTATCATGGATGGGGGACCCCATTTGAATCTTTTTGGTGCTGCTATGGCACAGGTTTTGACTTCATTAAACTCACGCATTCCTTTTCCTTTATTCTCTGCAATTACTTGAAAATGATAAGTTTTATTCTCATAACCTTGTTTGTGTATGAATGCGGAGCTTCAGTTGTTTTGTTTCTTGCTTTTGATTTAGGAATTGAATCATTCTCCAAGTTGGGCGATTCCATATACTTTGAGGAGGAAGCACAAACTCCTACGCTTTATGTTATCCAGTACATACCAAGTTCTCTTGATTGGAAATCAGGAAATGTTTTGGTTAATCAGGAAGTTGATCCTATTCATTCAGAGGATCCAAACCTCAGGATGACAATGACATTTTCTCCCAAGGTGGTATACTTCTACATTTTTCACAAATGCTTTTTGATAATAAGATTTTGTTTATTACATCTCACGATTTGATTAGAATGCCTTATTAAGACTTCAAACTTTCAAAAAGACTAAACCCATTAATGGTGTGTTTGGGGTAAGGAGTGGAATAGTGAGTTGGGACGCAGTTATGAAGTCTAGGGAGTTGTGAGCTCCTGGGGCCCACTAAGGAGTTGATAAGATATAAAATTGATGGGTTTTTTTATGTTCTTTTTTGTTTTAATACCCGTGAGTGTCTGGGCCAGCTTACATGCACCTCGACTAATCTCACGGGACAACTCGCCTGACCCAAAACATTTGGGTGTCAAGGAAACTCGTAGAAA
ATTAATTCCTAGGTAGGTGGCTACCATGGATTGGACCCATGACCTCTTAGCCATTTATTGAGACTGTCTCCTTTTTTACCACTAGACCAACCCATGGTGGTTTAAAATTAATAATGTTTTTTTAGTAGTGGGACCGACCAACTCCTTGGGTCCAAACAAGGAGTGGAGTTCACAACTCCTACTTTCCAACTCCTTGGACCGAACACACCTTAAATAGTAGGTGGACTTATACATATATGTTTGTGAATGTCCGGGACAACTTTTGCACACATCGACTGCATTTTCTCACAGCATAGTTACCTAATCCTCTAATCCCTTAATTGTTTTCATGGTTCTTATTGACTACTAGGGCCAACCTACGGTGGTTTTACATAATATAGTGATATACATCATTGGGTTATAAAAGATGCAACAACATTTAAGTGAATATAACAAATAAGTCATAACCGGATGTGTTAAGTAACACTGAAGATCAATTAGCCATTTCCTCATAATGGTTGTATTGCACTGACGACGAGGCTCAAATCAACTGAGTTGAGGCTTAGCCAAGATAGACCACTAGACCAGTGGACATTGGATCGTATATGATTAGTCTAGCTAGGAGAGTTGTTCAACAGTAGAAAAGATTGGGTTGAAGGTTACATTACTGTTTGACATACTTGCACAGTTCAGAAGTAAGGAATACTTCACCCTGTGAAGATGAAACTGACTAAAAATCTGGGAAACTCTAAAACGAAGATTCAGTGAATTTGTGGCATGGGCAAAAGATTTAGATTTCCATCTTCGCTTGATCCCAAGCACAAGTGAGCAGTGAAAACATAATTGTTTTACAATCTTTGGTATCTAACTCTGCAAGTCATCTTGTTTGTATTATGTAACCGGCAGAGATTGCCTGAATCCTGGCGAGAAAACTTTCAAATCCAATATGTTTTCAATGTCTCTGCTTGTTGGGTAGTGATTTCCATGGGAGCTAACTAATTTCCGTGTGACCTGAATTATGCAGGGATCAGTACAGTCATCTACCATTAATTTGCGAATTCCAAGTTGGACAAGTGCGAGTGGCGCGAAAGTTTTACTAAATGGTCAGAGTTTGGGAAATAACCCGAATGGTATACACCTCCATATCCCTCTGCTGGTTTTTCTTTTGTATAGGGCGGGAAAAATCACAATTTGATTTGTATTTCAAATTCGTTTGAATGCGTCTGGCATTTTATTCTGGCTCTTATATTGGACTTTCACCTGACGCTTTATATGACATCTTTGCAGTGATTTTTTTTGTTTCATACTGCTATTTTCTTATGCTGCAGGCAACTTCAAATCGGTGACTAACAAATGGAGCTCAGGGGACAAGTTAAGCCTTGAGCTACCCATTAACATAAGAACTGAAGCTATTGAAGGTATATTGTCTATCCAATCTTGACATGGATGTTTCATTATAATGTTTAGATAATAAATCACTCTTGTGTGAATATAATCAGATGATCGATCTGAATATGCTTCCATCAAAGCGATCCTCTTTGGTCCCTATCTGCTGGCAGCCTATAGTAGTGGTGACTGGGAAATTAAAACCGGACTGTCAGATTCTTTTTCAGACTGGATAACTCCTGTTCCTTCCGTGTACAATACTTTTCTTGTTACTTTTTCCCAACCGTCTGGAAAGACATCTTTTGCCTTAACAAACTCGAACCAGTCAATAACAATGGAAAAGTATCCTGGACGAGGGACGGATTCTGCTGTCCATGCTACATTTAGGCTCATCTTAAATGACCCATCTGCCAAAGTCACAGAATTGCGAGATGTTATTGGCAAACGGGTCATGCTAGAACCATTTAATTTTCCAGGAATGGTTCTAGGAAATGAAGGAAAAGATGAGAAACTTGCAATTGCAGATTCAACCTCAGAGGGACATTCCTCTTATTTCTATCTAGTTGAGGGATTAGATGGAAATAATGGAACCGTATCCTTGGAGTCCGCAGACAATGAAGGCTGCTTTGTTTACA
GTGGAGTGAACTATGAATCTGGTGCACAGCTGAAACTAAGCTGCAAGTCAAAGTTGTCATTAGATGATGGATTCAATGAAGCCTCGAGCTTTGTGATGGAAAATGGAGCAAGTCAGTATCATCCAATAAGCTTTGTTGCAAAAGGATTGACAAGGAACTTTCTTCTGGCACCATTGCTGAGTTTCATAGATGAATCTTACACAGTTTACTTCAACGTGATTGGTTAG
mRNA sequence
ATGTGGGCGGTTCTGGTAGCGTTAATGGTATTCCTTCTCTGTCGTTGTAATTCTCTAAAGGAATGTACAAACACTCCTACTCAACTTGGATCACATACTTTTAGATATGAGCTCTTATCATCACATAATGGAACGTGGAAGGAAGAGATGTTTTCTCACTACCACTTGACACCAACTGATGACTTTGCTTGGTCTAATTTGCTACCAAGAAAGATGTTGAAGGAAGAAAACGAATTTAATTGGGAAATGATGTATAGACAAATGAAGAATAAAGATGGGCTGCAAGTTCCTGGTGGTTTGCTCAAGGAAATGTCTTTACACGACGTACGGTTGGATCCAAACTCGTTACACGGGACGGTTCAGATGACAAATTTGAAGTATCTATTGATGTTGGATGTGGACAGGTTGCTCTGGAGCTTTAGGAAGACAGCTGGTTTGCCTACACCTGGAGAACCATATCTCGGGTGGGAAAAATCAGACTGCGAGCTTCGTGGTCATTTTGTAGGACACTACTTAAGTGCGTCGGCCCAAATGTGGGCAAGCACTGGCAACCCTGTTCTTAAAGAGAAGATGTCTGCACTAGTTTCTGGTTTGGCAACTTGTCAGGACAAAATGGGTACAGGATATCTTTCTGCTTTTCCCTCTGAAGAGTTTGATCGTTTTGAAGCTATTCAACCTGTCTGGGCACCATATTACACCATCCACAAGATATTGGCAGGATTATTGGATCAATATACTTTTGCTGGTAATTCTCAAGCCTTAAAAATGGTTACATGGATGGTTGAGTACTTCTACAACCGCGTTCAAAATGTCGTATTGAAGTACACTGTAGAGAAACACTATCGGGCACTTAATGAAGAAACTGGTGGCATGAATGATGTGCTCTACCGGTTGTACAGTATAACAGGAAATACAAAGCATTTATTACTGGCACACCTTTTTGACAAACCGTGCTTTCTAGGCCTTCTTGCCGTACAGGCTGAAGACATTTCGGGTTTCCATACCAACACACATATACCAATTGTTGTTGGATCTCAAATGCGGTATGAAGTCACTGGTGATCCACTGTATAAGGAAATATCCACATATTTTATGGACATTGTTAACTCCTCCCATAGCTACGCAACAGGAGGGACATCAGTTCATGAGTTCTGGACAGATCCCAAGAGATTGGCAAATACTCTAGGAACGGAGAATGAAGAATCATGTACAACTTACAACATGCTTAAGGTTTCTCGCAACCTGTTTAAATGGACCAAGGAAATAGCATACGCCGATTATTATGAGCGGGCTCTTACAAATGGGGTGCTGAGCATCCAAAGAGGAACTGATCCTGGGGTAATGATTTACATGCTTCCATTAGGTAGTGGAAGTTCTAAAGCCAGAAGCTATCATGGATGGGGGACCCCATTTGAATCTTTTTGGTGCTGCTATGGCACAGGAATTGAATCATTCTCCAAGTTGGGCGATTCCATATACTTTGAGGAGGAAGCACAAACTCCTACGCTTTATGTTATCCAGTACATACCAAGTTCTCTTGATTGGAAATCAGGAAATGTTTTGGTTAATCAGGAAGTTGATCCTATTCATTCAGAGGATCCAAACCTCAGGATGACAATGACATTTTCTCCCAAGGTGGGATCAGTACAGTCATCTACCATTAATTTGCGAATTCCAAGTTGGACAAGTGCGAGTGGCGCGAAAGTTTTACTAAATGGTCAGAGTTTGGGAAATAACCCGAATGGCAACTTCAAATCGGTGACTAACAAATGGAGCTCAGGGGACAAGTTAAGCCTTGAGCTACCCATTAACATAAGAACTGAAGCTATTGAAGATGATCGATCTGAATATGCTTCCATCAAAGCGATCCTCTTTGGTCCCTATCTGCTGGCAGCCTATAGTAGTGGTGACTGGGAAATTAAAACCGGACTGTCAGATTCTTTTTCAGACTGGATAACTCCTGTTCCTTCCGTGTACAATACTTTTCTTGTTACTTTTTC
CCAACCGTCTGGAAAGACATCTTTTGCCTTAACAAACTCGAACCAGTCAATAACAATGGAAAAGTATCCTGGACGAGGGACGGATTCTGCTGTCCATGCTACATTTAGGCTCATCTTAAATGACCCATCTGCCAAAGTCACAGAATTGCGAGATGTTATTGGCAAACGGGTCATGCTAGAACCATTTAATTTTCCAGGAATGGTTCTAGGAAATGAAGGAAAAGATGAGAAACTTGCAATTGCAGATTCAACCTCAGAGGGACATTCCTCTTATTTCTATCTAGTTGAGGGATTAGATGGAAATAATGGAACCGTATCCTTGGAGTCCGCAGACAATGAAGGCTGCTTTGTTTACAGTGGAGTGAACTATGAATCTGGTGCACAGCTGAAACTAAGCTGCAAGTCAAAGTTGTCATTAGATGATGGATTCAATGAAGCCTCGAGCTTTGTGATGGAAAATGGAGCAAGTCAGTATCATCCAATAAGCTTTGTTGCAAAAGGATTGACAAGGAACTTTCTTCTGGCACCATTGCTGAGTTTCATAGATGAATCTTACACAGTTTACTTCAACGTGATTGGTTAG
Coding sequence (CDS)
ATGTGGGCGGTTCTGGTAGCGTTAATGGTATTCCTTCTCTGTCGTTGTAATTCTCTAAAGGAATGTACAAACACTCCTACTCAACTTGGATCACATACTTTTAGATATGAGCTCTTATCATCACATAATGGAACGTGGAAGGAAGAGATGTTTTCTCACTACCACTTGACACCAACTGATGACTTTGCTTGGTCTAATTTGCTACCAAGAAAGATGTTGAAGGAAGAAAACGAATTTAATTGGGAAATGATGTATAGACAAATGAAGAATAAAGATGGGCTGCAAGTTCCTGGTGGTTTGCTCAAGGAAATGTCTTTACACGACGTACGGTTGGATCCAAACTCGTTACACGGGACGGTTCAGATGACAAATTTGAAGTATCTATTGATGTTGGATGTGGACAGGTTGCTCTGGAGCTTTAGGAAGACAGCTGGTTTGCCTACACCTGGAGAACCATATCTCGGGTGGGAAAAATCAGACTGCGAGCTTCGTGGTCATTTTGTAGGACACTACTTAAGTGCGTCGGCCCAAATGTGGGCAAGCACTGGCAACCCTGTTCTTAAAGAGAAGATGTCTGCACTAGTTTCTGGTTTGGCAACTTGTCAGGACAAAATGGGTACAGGATATCTTTCTGCTTTTCCCTCTGAAGAGTTTGATCGTTTTGAAGCTATTCAACCTGTCTGGGCACCATATTACACCATCCACAAGATATTGGCAGGATTATTGGATCAATATACTTTTGCTGGTAATTCTCAAGCCTTAAAAATGGTTACATGGATGGTTGAGTACTTCTACAACCGCGTTCAAAATGTCGTATTGAAGTACACTGTAGAGAAACACTATCGGGCACTTAATGAAGAAACTGGTGGCATGAATGATGTGCTCTACCGGTTGTACAGTATAACAGGAAATACAAAGCATTTATTACTGGCACACCTTTTTGACAAACCGTGCTTTCTAGGCCTTCTTGCCGTACAGGCTGAAGACATTTCGGGTTTCCATACCAACACACATATACCAATTGTTGTTGGATCTCAAATGCGGTATGAAGTCACTGGTGATCCACTGTATAAGGAAATATCCACATATTTTATGGACATTGTTAACTCCTCCCATAGCTACGCAACAGGAGGGACATCAGTTCATGAGTTCTGGACAGATCCCAAGAGATTGGCAAATACTCTAGGAACGGAGAATGAAGAATCATGTACAACTTACAACATGCTTAAGGTTTCTCGCAACCTGTTTAAATGGACCAAGGAAATAGCATACGCCGATTATTATGAGCGGGCTCTTACAAATGGGGTGCTGAGCATCCAAAGAGGAACTGATCCTGGGGTAATGATTTACATGCTTCCATTAGGTAGTGGAAGTTCTAAAGCCAGAAGCTATCATGGATGGGGGACCCCATTTGAATCTTTTTGGTGCTGCTATGGCACAGGAATTGAATCATTCTCCAAGTTGGGCGATTCCATATACTTTGAGGAGGAAGCACAAACTCCTACGCTTTATGTTATCCAGTACATACCAAGTTCTCTTGATTGGAAATCAGGAAATGTTTTGGTTAATCAGGAAGTTGATCCTATTCATTCAGAGGATCCAAACCTCAGGATGACAATGACATTTTCTCCCAAGGTGGGATCAGTACAGTCATCTACCATTAATTTGCGAATTCCAAGTTGGACAAGTGCGAGTGGCGCGAAAGTTTTACTAAATGGTCAGAGTTTGGGAAATAACCCGAATGGCAACTTCAAATCGGTGACTAACAAATGGAGCTCAGGGGACAAGTTAAGCCTTGAGCTACCCATTAACATAAGAACTGAAGCTATTGAAGATGATCGATCTGAATATGCTTCCATCAAAGCGATCCTCTTTGGTCCCTATCTGCTGGCAGCCTATAGTAGTGGTGACTGGGAAATTAAAACCGGACTGTCAGATTCTTTTTCAGACTGGATAACTCCTGTTCCTTCCGTGTACAATACTTTTCTTGTTACTTTTTC
CCAACCGTCTGGAAAGACATCTTTTGCCTTAACAAACTCGAACCAGTCAATAACAATGGAAAAGTATCCTGGACGAGGGACGGATTCTGCTGTCCATGCTACATTTAGGCTCATCTTAAATGACCCATCTGCCAAAGTCACAGAATTGCGAGATGTTATTGGCAAACGGGTCATGCTAGAACCATTTAATTTTCCAGGAATGGTTCTAGGAAATGAAGGAAAAGATGAGAAACTTGCAATTGCAGATTCAACCTCAGAGGGACATTCCTCTTATTTCTATCTAGTTGAGGGATTAGATGGAAATAATGGAACCGTATCCTTGGAGTCCGCAGACAATGAAGGCTGCTTTGTTTACAGTGGAGTGAACTATGAATCTGGTGCACAGCTGAAACTAAGCTGCAAGTCAAAGTTGTCATTAGATGATGGATTCAATGAAGCCTCGAGCTTTGTGATGGAAAATGGAGCAAGTCAGTATCATCCAATAAGCTTTGTTGCAAAAGGATTGACAAGGAACTTTCTTCTGGCACCATTGCTGAGTTTCATAGATGAATCTTACACAGTTTACTTCAACGTGATTGGTTAG
Protein sequence
MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTDDFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTVQMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWASTGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYSITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTMTFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLELPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYNTFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVIGKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNEGCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFLLAPLLSFIDESYTVYFNVIG
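The protein sequence above appears to be the translation of the CDS under the standard genetic code (its first codons, ATG TGG GCG GTT CTG GTA, read as MWAVLV). The following is a minimal sketch of that translation, assuming the standard code and no ambiguous bases; the function name `translate` and the use of `X` for unrecognised codons are illustrative choices, not part of the source database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Codons containing anything other than A/C/G/T are reported as 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the CDS listed above.
print(translate("ATGTGGGCGGTTCTGGTA"))  # MWAVLV
```

Passing the full CDS as a single string, with the line break removed, should give a sequence that can be compared directly against the protein sequence listed here.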
Homology
BLAST of HG10001145 vs. NCBI nr
Match:
XP_038901175.1 (uncharacterized protein LOC120088146 [Benincasa hispida])
HSP 1 Score: 1681.4 bits (4353), Expect = 0.0e+00
Identity = 820/859 (95.46%), Positives = 836/859 (97.32%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW VL AL+ FLLC C+SLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD
Sbjct: 9 MWVVLAALIAFLLCHCDSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 68
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQ+PGGLLKE+SLHD+RLDPNSLHGT
Sbjct: 69 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQIPGGLLKEISLHDIRLDPNSLHGTA 128
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 129 QTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 188
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STGNPVLKEKMSALVSGLATCQDK+GTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG
Sbjct: 189 STGNPVLKEKMSALVSGLATCQDKLGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 248
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTFAGNSQALKMVTWMVEYFYNR+QNV+LKYTVEKHYRALNEETGGMNDVLYRLY
Sbjct: 249 LLDQYTFAGNSQALKMVTWMVEYFYNRIQNVILKYTVEKHYRALNEETGGMNDVLYRLYR 308
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNTKHLLLAHLFDKPCFLGLLA+QAEDISGFHTNTHIPIVVG+QMRYEVTGDPLYKEI
Sbjct: 309 ITGNTKHLLLAHLFDKPCFLGLLALQAEDISGFHTNTHIPIVVGAQMRYEVTGDPLYKEI 368
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
S YFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK
Sbjct: 369 SAYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 428
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT
Sbjct: 429 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 488
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSL+WKSGNVL+NQEVD IHSEDPNLRMTM
Sbjct: 489 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLEWKSGNVLLNQEVDTIHSEDPNLRMTM 548
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK GS QSSTINLRIPSWTSAS AKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE
Sbjct: 549 TFSPK-GSAQSSTINLRIPSWTSASDAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 608
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
LPIN+RTEAIEDD SEYASIKAILFGPYLLAAYSSGD EIKT L DSFSDWITPVP+VYN
Sbjct: 609 LPINLRTEAIEDDGSEYASIKAILFGPYLLAAYSSGDREIKTELVDSFSDWITPVPAVYN 668
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVTFSQ SGK SFALTNSNQSITMEKYPG GTDSAVHATFRLILNDPSAKVTELRDVI
Sbjct: 669 TFLVTFSQASGKISFALTNSNQSITMEKYPGWGTDSAVHATFRLILNDPSAKVTELRDVI 728
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRVMLEPFNFPGMVLGNEGKDEKLAIA S SEGHSS FYLVEGLDG NGTVSLESADNE
Sbjct: 729 GKRVMLEPFNFPGMVLGNEGKDEKLAIAHSNSEGHSSDFYLVEGLDGKNGTVSLESADNE 788
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
GCFVYSGVNYESGAQLKLSCKSKLSLDDGFN+ASSFVMENGASQYHPISFVAKGLTRNFL
Sbjct: 789 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNKASSFVMENGASQYHPISFVAKGLTRNFL 848
Query: 841 LAPLLSFIDESYTVYFNVI 860
LAPLLSFIDESYTVYFN+I
Sbjct: 849 LAPLLSFIDESYTVYFNMI 866
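The percentages in each HSP summary line are the matched counts divided by the alignment length (for the hit above, 820/859 identical positions). As a rough sketch, the counts can be pulled out of such a line and the percentages recomputed as below; the function name `parse_hsp_stats` and the returned dictionary layout are assumptions made for illustration, and the pattern accepts both the "Positives" and "Postives" spellings.

```python
import re

# Matches e.g. "Identity = 820/859" and "Positives = 836/859"
# (also tolerates the "Postives" spelling).
STAT_RE = re.compile(r"(Identity|Posi?tives)\s*=\s*(\d+)/(\d+)")


def parse_hsp_stats(line: str) -> dict:
    """Return {name: (matched, alignment_length, percent)} for one HSP summary line."""
    stats = {}
    for label, matched, length in STAT_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        matched, length = int(matched), int(length)
        stats[key] = (matched, length, round(100 * matched / length, 2))
    return stats


line = "Identity = 820/859 (95.46%), Positives = 836/859 (97.32%), Query Frame = 0"
print(parse_hsp_stats(line))
```

Recomputing the percentage from the raw counts is a quick consistency check when the summary lines are copied between tools.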
BLAST of HG10001145 vs. NCBI nr
Match:
XP_008449737.1 (PREDICTED: uncharacterized protein LOC103491528 [Cucumis melo])
HSP 1 Score: 1643.2 bits (4254), Expect = 0.0e+00
Identity = 793/857 (92.53%), Positives = 828/857 (96.62%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW VLV L+ FLLC C+SLKECTNTPTQLGSHTFRYELLSS N TWK+E+FSHYHLTPTD
Sbjct: 9 MWVVLVVLLAFLLCNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKEIFSHYHLTPTD 68
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWSNLLPRKMLKEENE+NWEMMYRQMKNKDGLQ+PGG+LKE+SLHDVRLDP+SLHGT
Sbjct: 69 DFAWSNLLPRKMLKEENEYNWEMMYRQMKNKDGLQIPGGMLKEISLHDVRLDPSSLHGTA 128
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 129 QTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVGHYLSASAQMWA 188
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEA+QPVWAPYYTIHKILAG
Sbjct: 189 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAG 248
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTFAGNSQALKMVTWMVEYFYNRVQNV+LKYTVE+HYR+LNEETGGMNDVLYRLY
Sbjct: 249 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYR 308
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFH NTHIPIV+GSQMRYEVTGDPLYKEI
Sbjct: 309 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHVNTHIPIVIGSQMRYEVTGDPLYKEI 368
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDIVNSSHSYATGGTSVHEFW DPKRLA+ LGTE EESCTTYNMLKVSRNLFKWTK
Sbjct: 369 STYFMDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTK 428
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGT+PGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT
Sbjct: 429 EIAYADYYERALTNGVLSIQRGTNPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 488
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQTP+LYVIQYI SSLDWKSGNVL+NQEVDPIHSEDP LRMT+
Sbjct: 489 GIESFSKLGDSIYFEEEAQTPSLYVIQYISSSLDWKSGNVLLNQEVDPIHSEDPKLRMTL 548
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK GSV+SSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTN WSSGDKLSLE
Sbjct: 549 TFSPK-GSVRSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNSWSSGDKLSLE 608
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
+PIN+RTEAI+DDRSEYAS+KAILFGPYLLAAYSSGDWEIKT +DSFSDWITPVPSVYN
Sbjct: 609 IPINLRTEAIDDDRSEYASVKAILFGPYLLAAYSSGDWEIKTRQADSFSDWITPVPSVYN 668
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYP +GTDSAVHATFRLI+NDPSAKVTELRDVI
Sbjct: 669 TFLVTFSQASGKTSFALTNSNQSITMEKYPEQGTDSAVHATFRLIVNDPSAKVTELRDVI 728
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRVMLEPF+FPGMVLGN+GKDEKL IAD+ SE HSS FYLVEGLDG NGTVSL S DNE
Sbjct: 729 GKRVMLEPFSFPGMVLGNKGKDEKLEIADANSEAHSSEFYLVEGLDGKNGTVSLASIDNE 788
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
GCFVYSGVNYESG+QLKLSCKSKLSLDDGF+EASSFVME+GASQYHPISFV KGLTRNFL
Sbjct: 789 GCFVYSGVNYESGSQLKLSCKSKLSLDDGFDEASSFVMESGASQYHPISFVTKGLTRNFL 848
Query: 841 LAPLLSFIDESYTVYFN 858
LAPLLSF+DESYTVYFN
Sbjct: 849 LAPLLSFVDESYTVYFN 864
BLAST of HG10001145 vs. NCBI nr
Match:
KAA0041392.1 (DUF1680 domain-containing protein [Cucumis melo var. makuwa])
HSP 1 Score: 1642.1 bits (4251), Expect = 0.0e+00
Identity = 792/857 (92.42%), Positives = 828/857 (96.62%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW VLV L+ FLLC C+SLKECTNTPTQLGSHTFRYELLSS N TWK+E+FSHYHLTPTD
Sbjct: 1 MWVVLVVLLAFLLCNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKEIFSHYHLTPTD 60
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWSNLLPRKMLKEENE+NWEMMYRQMKNKDGLQ+PGG+LKE+SLHDVRLDP+SLHGT
Sbjct: 61 DFAWSNLLPRKMLKEENEYNWEMMYRQMKNKDGLQIPGGMLKEISLHDVRLDPSSLHGTA 120
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 121 QTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVGHYLSASAQMWA 180
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEA+QPVWAPYYTIHKILAG
Sbjct: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAG 240
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTFAGNSQALKMVTWMVEYFYNRVQNV+LKYTVE+HYR+LNEETGGMNDVLYRLY
Sbjct: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYR 300
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFH NTHIPIV+GSQMRYEVTGDPLYKEI
Sbjct: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHVNTHIPIVIGSQMRYEVTGDPLYKEI 360
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDIVNSSHSYATGGTSVHEFW DPKRLA+ LGTE EESCTTYNMLKVSRNLFKWTK
Sbjct: 361 STYFMDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTK 420
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGT+PGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT
Sbjct: 421 EIAYADYYERALTNGVLSIQRGTNPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQTP+LYVIQYI SSLDWKSGNVL+NQEVDPIHSEDP LRMT+
Sbjct: 481 GIESFSKLGDSIYFEEEAQTPSLYVIQYISSSLDWKSGNVLLNQEVDPIHSEDPKLRMTL 540
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK GSV+SSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTN WSSGDKLSLE
Sbjct: 541 TFSPK-GSVRSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNSWSSGDKLSLE 600
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
+PIN+RTEAI+DDRSEYAS+KAILFGPYLLAAYSSGDWEIKT +DSFSDWITPVPSVYN
Sbjct: 601 IPINLRTEAIDDDRSEYASVKAILFGPYLLAAYSSGDWEIKTRQADSFSDWITPVPSVYN 660
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYP +GTDSAVHATFRLI+NDPSAKVTELRDVI
Sbjct: 661 TFLVTFSQASGKTSFALTNSNQSITMEKYPEQGTDSAVHATFRLIVNDPSAKVTELRDVI 720
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRVM+EPF+FPGMVLGN+GKDEKL IAD+ SE HSS FYLVEGLDG NGTVSL S DNE
Sbjct: 721 GKRVMVEPFSFPGMVLGNKGKDEKLEIADANSEAHSSEFYLVEGLDGKNGTVSLASIDNE 780
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
GCFVYSGVNYESG+QLKLSCKSKLSLDDGF+EASSFVME+GASQYHPISFV KGLTRNFL
Sbjct: 781 GCFVYSGVNYESGSQLKLSCKSKLSLDDGFDEASSFVMESGASQYHPISFVTKGLTRNFL 840
Query: 841 LAPLLSFIDESYTVYFN 858
LAPLLSF+DESYTVYFN
Sbjct: 841 LAPLLSFVDESYTVYFN 856
BLAST of HG10001145 vs. NCBI nr
Match:
XP_011653585.1 (uncharacterized protein LOC101207833 [Cucumis sativus] >KAE8649507.1 hypothetical protein Csa_017897 [Cucumis sativus])
HSP 1 Score: 1620.9 bits (4196), Expect = 0.0e+00
Identity = 783/857 (91.37%), Positives = 821/857 (95.80%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW VLV L+ FLLC C+SLKECTNTPTQLGSHTFRYELLSS N TWK+E+FSHYHLTPTD
Sbjct: 9 MWVVLVVLLAFLLCNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKELFSHYHLTPTD 68
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWSNLLPRKMLKEENE+NWEMMYRQMKNKDGL++PGG+LKE+SLHDVRLDPNSLHGT
Sbjct: 69 DFAWSNLLPRKMLKEENEYNWEMMYRQMKNKDGLRIPGGMLKEISLHDVRLDPNSLHGTA 128
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 129 QTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYVGWEKSDCELRGHFVGHYLSASAQMWA 188
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STGN VLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEA+QPVWAPYYTIHKILAG
Sbjct: 189 STGNSVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAG 248
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTFAGNSQALKMVTWMVEYFYNRVQNV+LKYTVE+HYR+LNEETGGMNDVLYRLY
Sbjct: 249 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYR 308
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFH NTHIPIVVGSQMRYEVTGDPLYKEI
Sbjct: 309 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHVNTHIPIVVGSQMRYEVTGDPLYKEI 368
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDIVNSSHSYATGGTSVHEFW DPKRLA+ LGTE EESCTTYNMLKVSRNLFKWTK
Sbjct: 369 STYFMDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTK 428
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKA SYHGWGTPFESFWCCYGT
Sbjct: 429 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKAISYHGWGTPFESFWCCYGT 488
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEE QTPTLYVIQYI SSLDWKSGNVL+NQ VDPIHSEDP LRMT+
Sbjct: 489 GIESFSKLGDSIYFEEELQTPTLYVIQYISSSLDWKSGNVLLNQTVDPIHSEDPKLRMTL 548
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK GSV SSTINLRIPSWTSASGAKV+LNGQSLGNN NGNFKSVTN WSSG+KLSLE
Sbjct: 549 TFSPK-GSVHSSTINLRIPSWTSASGAKVVLNGQSLGNNINGNFKSVTNSWSSGNKLSLE 608
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
LPIN+RTEAI+DDRSEYAS+KAILFGPYLLAAYS+GDWEIKT +DS SDWIT VPS YN
Sbjct: 609 LPINLRTEAIDDDRSEYASVKAILFGPYLLAAYSNGDWEIKTQQADSLSDWITHVPSAYN 668
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYPG+GTDSAVHATFRLI++DPSAKVTEL+DVI
Sbjct: 669 TFLVTFSQASGKTSFALTNSNQSITMEKYPGQGTDSAVHATFRLIIDDPSAKVTELQDVI 728
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRVMLEPF+FPGMVLGN+GKDE+L IAD+ SEGHSS FYLVEGLDG NGTVSL S DNE
Sbjct: 729 GKRVMLEPFSFPGMVLGNKGKDERLEIADANSEGHSSDFYLVEGLDGKNGTVSLASIDNE 788
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
GCFVYSGVNYESGAQLKLSCKSKLSLDDGF+EASSF++E+GASQYHPISFV KG+TRNFL
Sbjct: 789 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFDEASSFLLESGASQYHPISFVTKGMTRNFL 848
Query: 841 LAPLLSFIDESYTVYFN 858
LAPLLSF+DESYTVYFN
Sbjct: 849 LAPLLSFVDESYTVYFN 864
BLAST of HG10001145 vs. NCBI nr
Match:
XP_022148748.1 (uncharacterized protein LOC111017340 [Momordica charantia])
HSP 1 Score: 1587.4 bits (4109), Expect = 0.0e+00
Identity = 773/860 (89.88%), Positives = 809/860 (94.07%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MWAVLV LMVF+LCR +SLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD
Sbjct: 9 MWAVLVTLMVFMLCRGDSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 68
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWS+LLPRK+LKEENEFNW M+YRQMKNKDG QVPGGLLKE+SLHDVRLDPNS HG
Sbjct: 69 DFAWSSLLPRKVLKEENEFNWAMVYRQMKNKDGTQVPGGLLKEISLHDVRLDPNSFHGRA 128
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 129 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 188
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
ST NPVLKEKMSA+VSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG
Sbjct: 189 STDNPVLKEKMSAIVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 248
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLD YTFAGNSQALKMVTWMVEYFYNRVQNV+ KYTVE+HYRALNEETGGMNDVLYRLY
Sbjct: 249 LLDHYTFAGNSQALKMVTWMVEYFYNRVQNVITKYTVERHYRALNEETGGMNDVLYRLYR 308
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGN KHLLLAHLFDKPCFLG+LAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI
Sbjct: 309 ITGNAKHLLLAHLFDKPCFLGILAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 368
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDI+ SSHSYATGGTSVHEFWTDPKRLA+TLGTENEESCTTYNMLKVSRNLFKWTK
Sbjct: 369 STYFMDIIKSSHSYATGGTSVHEFWTDPKRLADTLGTENEESCTTYNMLKVSRNLFKWTK 428
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPL GSSKA+SYHGWGTPFESFWCCYGT
Sbjct: 429 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLARGSSKAKSYHGWGTPFESFWCCYGT 488
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQ PTLYVIQYI SSLDWKSGNVL+ QEV PIHSEDPNLRMTM
Sbjct: 489 GIESFSKLGDSIYFEEEAQAPTLYVIQYISSSLDWKSGNVLLKQEVAPIHSEDPNLRMTM 548
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
FSPK GSVQSSTINLRIPSWT+A+ AKV LNGQSL +PN NF+ V+ KW+SGDKL+LE
Sbjct: 549 MFSPK-GSVQSSTINLRIPSWTTANDAKVTLNGQSLAISPNVNFQPVSYKWNSGDKLTLE 608
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
LPIN+RTEAIEDDRSEYASIKAILFGPYLLAAYS GDW+IKTG +DS SDWITPVPS YN
Sbjct: 609 LPINLRTEAIEDDRSEYASIKAILFGPYLLAAYSDGDWDIKTGSTDSLSDWITPVPSAYN 668
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRL-ILNDPSAKVTELRDV 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYP +GT+SAV ATFRL ILNDPSAKV+ELRDV
Sbjct: 669 TFLVTFSQESGKTSFALTNSNQSITMEKYPEQGTNSAVRATFRLIILNDPSAKVSELRDV 728
Query: 721 IGKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADN 780
IGKRVMLEPF+FPGMVLG GKD LAIA+S SEGH S FYLVEGLDG NGT+SL+SADN
Sbjct: 729 IGKRVMLEPFDFPGMVLGTRGKDGDLAIAESNSEGHFSDFYLVEGLDGKNGTISLKSADN 788
Query: 781 EGCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNF 840
EGCFVYSGVNYESG QLKLSCKSKLS DDGF++ASSFV++NG QYHPISF+ KG TR F
Sbjct: 789 EGCFVYSGVNYESGPQLKLSCKSKLSSDDGFDQASSFVIQNGVIQYHPISFIVKGATRKF 848
Query: 841 LLAPLLSFIDESYTVYFNVI 860
LLAPLLSFIDESYTVYFNVI
Sbjct: 849 LLAPLLSFIDESYTVYFNVI 867
BLAST of HG10001145 vs. ExPASy TrEMBL
Match:
A0A1S3BM44 (uncharacterized protein LOC103491528 OS=Cucumis melo OX=3656 GN=LOC103491528 PE=4 SV=1)
HSP 1 Score: 1643.2 bits (4254), Expect = 0.0e+00
Identity = 793/857 (92.53%), Positives = 828/857 (96.62%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW VLV L+ FLLC C+SLKECTNTPTQLGSHTFRYELLSS N TWK+E+FSHYHLTPTD
Sbjct: 9 MWVVLVVLLAFLLCNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKEIFSHYHLTPTD 68
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWSNLLPRKMLKEENE+NWEMMYRQMKNKDGLQ+PGG+LKE+SLHDVRLDP+SLHGT
Sbjct: 69 DFAWSNLLPRKMLKEENEYNWEMMYRQMKNKDGLQIPGGMLKEISLHDVRLDPSSLHGTA 128
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 129 QTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVGHYLSASAQMWA 188
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEA+QPVWAPYYTIHKILAG
Sbjct: 189 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAG 248
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTFAGNSQALKMVTWMVEYFYNRVQNV+LKYTVE+HYR+LNEETGGMNDVLYRLY
Sbjct: 249 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYR 308
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFH NTHIPIV+GSQMRYEVTGDPLYKEI
Sbjct: 309 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHVNTHIPIVIGSQMRYEVTGDPLYKEI 368
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDIVNSSHSYATGGTSVHEFW DPKRLA+ LGTE EESCTTYNMLKVSRNLFKWTK
Sbjct: 369 STYFMDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTK 428
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGT+PGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT
Sbjct: 429 EIAYADYYERALTNGVLSIQRGTNPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 488
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQTP+LYVIQYI SSLDWKSGNVL+NQEVDPIHSEDP LRMT+
Sbjct: 489 GIESFSKLGDSIYFEEEAQTPSLYVIQYISSSLDWKSGNVLLNQEVDPIHSEDPKLRMTL 548
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK GSV+SSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTN WSSGDKLSLE
Sbjct: 549 TFSPK-GSVRSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNSWSSGDKLSLE 608
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
+PIN+RTEAI+DDRSEYAS+KAILFGPYLLAAYSSGDWEIKT +DSFSDWITPVPSVYN
Sbjct: 609 IPINLRTEAIDDDRSEYASVKAILFGPYLLAAYSSGDWEIKTRQADSFSDWITPVPSVYN 668
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYP +GTDSAVHATFRLI+NDPSAKVTELRDVI
Sbjct: 669 TFLVTFSQASGKTSFALTNSNQSITMEKYPEQGTDSAVHATFRLIVNDPSAKVTELRDVI 728
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRVMLEPF+FPGMVLGN+GKDEKL IAD+ SE HSS FYLVEGLDG NGTVSL S DNE
Sbjct: 729 GKRVMLEPFSFPGMVLGNKGKDEKLEIADANSEAHSSEFYLVEGLDGKNGTVSLASIDNE 788
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
GCFVYSGVNYESG+QLKLSCKSKLSLDDGF+EASSFVME+GASQYHPISFV KGLTRNFL
Sbjct: 789 GCFVYSGVNYESGSQLKLSCKSKLSLDDGFDEASSFVMESGASQYHPISFVTKGLTRNFL 848
Query: 841 LAPLLSFIDESYTVYFN 858
LAPLLSF+DESYTVYFN
Sbjct: 849 LAPLLSFVDESYTVYFN 864
BLAST of HG10001145 vs. ExPASy TrEMBL
Match:
A0A5A7TD86 (DUF1680 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold206G00380 PE=4 SV=1)
HSP 1 Score: 1642.1 bits (4251), Expect = 0.0e+00
Identity = 792/857 (92.42%), Positives = 828/857 (96.62%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW VLV L+ FLLC C+SLKECTNTPTQLGSHTFRYELLSS N TWK+E+FSHYHLTPTD
Sbjct: 1 MWVVLVVLLAFLLCNCDSLKECTNTPTQLGSHTFRYELLSSGNVTWKKEIFSHYHLTPTD 60
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWSNLLPRKMLKEENE+NWEMMYRQMKNKDGLQ+PGG+LKE+SLHDVRLDP+SLHGT
Sbjct: 61 DFAWSNLLPRKMLKEENEYNWEMMYRQMKNKDGLQIPGGMLKEISLHDVRLDPSSLHGTA 120
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPY+GWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 121 QTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYIGWEKSDCELRGHFVGHYLSASAQMWA 180
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEA+QPVWAPYYTIHKILAG
Sbjct: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAVQPVWAPYYTIHKILAG 240
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTFAGNSQALKMVTWMVEYFYNRVQNV+LKYTVE+HYR+LNEETGGMNDVLYRLY
Sbjct: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVILKYTVERHYRSLNEETGGMNDVLYRLYR 300
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFH NTHIPIV+GSQMRYEVTGDPLYKEI
Sbjct: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHVNTHIPIVIGSQMRYEVTGDPLYKEI 360
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDIVNSSHSYATGGTSVHEFW DPKRLA+ LGTE EESCTTYNMLKVSRNLFKWTK
Sbjct: 361 STYFMDIVNSSHSYATGGTSVHEFWRDPKRLADALGTETEESCTTYNMLKVSRNLFKWTK 420
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGT+PGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT
Sbjct: 421 EIAYADYYERALTNGVLSIQRGTNPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQTP+LYVIQYI SSLDWKSGNVL+NQEVDPIHSEDP LRMT+
Sbjct: 481 GIESFSKLGDSIYFEEEAQTPSLYVIQYISSSLDWKSGNVLLNQEVDPIHSEDPKLRMTL 540
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK GSV+SSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTN WSSGDKLSLE
Sbjct: 541 TFSPK-GSVRSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNSWSSGDKLSLE 600
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
+PIN+RTEAI+DDRSEYAS+KAILFGPYLLAAYSSGDWEIKT +DSFSDWITPVPSVYN
Sbjct: 601 IPINLRTEAIDDDRSEYASVKAILFGPYLLAAYSSGDWEIKTRQADSFSDWITPVPSVYN 660
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYP +GTDSAVHATFRLI+NDPSAKVTELRDVI
Sbjct: 661 TFLVTFSQASGKTSFALTNSNQSITMEKYPEQGTDSAVHATFRLIVNDPSAKVTELRDVI 720
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRVM+EPF+FPGMVLGN+GKDEKL IAD+ SE HSS FYLVEGLDG NGTVSL S DNE
Sbjct: 721 GKRVMVEPFSFPGMVLGNKGKDEKLEIADANSEAHSSEFYLVEGLDGKNGTVSLASIDNE 780
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
GCFVYSGVNYESG+QLKLSCKSKLSLDDGF+EASSFVME+GASQYHPISFV KGLTRNFL
Sbjct: 781 GCFVYSGVNYESGSQLKLSCKSKLSLDDGFDEASSFVMESGASQYHPISFVTKGLTRNFL 840
Query: 841 LAPLLSFIDESYTVYFN 858
LAPLLSF+DESYTVYFN
Sbjct: 841 LAPLLSFVDESYTVYFN 856
BLAST of HG10001145 vs. ExPASy TrEMBL
Match:
A0A6J1D4Z0 (uncharacterized protein LOC111017340 OS=Momordica charantia OX=3673 GN=LOC111017340 PE=4 SV=1)
HSP 1 Score: 1587.4 bits (4109), Expect = 0.0e+00
Identity = 773/860 (89.88%), Positives = 809/860 (94.07%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MWAVLV LMVF+LCR +SLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD
Sbjct: 9 MWAVLVTLMVFMLCRGDSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 68
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
DFAWS+LLPRK+LKEENEFNW M+YRQMKNKDG QVPGGLLKE+SLHDVRLDPNS HG
Sbjct: 69 DFAWSSLLPRKVLKEENEFNWAMVYRQMKNKDGTQVPGGLLKEISLHDVRLDPNSFHGRA 128
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA
Sbjct: 129 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 188
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
ST NPVLKEKMSA+VSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG
Sbjct: 189 STDNPVLKEKMSAIVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 248
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLD YTFAGNSQALKMVTWMVEYFYNRVQNV+ KYTVE+HYRALNEETGGMNDVLYRLY
Sbjct: 249 LLDHYTFAGNSQALKMVTWMVEYFYNRVQNVITKYTVERHYRALNEETGGMNDVLYRLYR 308
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGN KHLLLAHLFDKPCFLG+LAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI
Sbjct: 309 ITGNAKHLLLAHLFDKPCFLGILAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 368
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
STYFMDI+ SSHSYATGGTSVHEFWTDPKRLA+TLGTENEESCTTYNMLKVSRNLFKWTK
Sbjct: 369 STYFMDIIKSSHSYATGGTSVHEFWTDPKRLADTLGTENEESCTTYNMLKVSRNLFKWTK 428
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPL GSSKA+SYHGWGTPFESFWCCYGT
Sbjct: 429 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLARGSSKAKSYHGWGTPFESFWCCYGT 488
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEEEAQ PTLYVIQYI SSLDWKSGNVL+ QEV PIHSEDPNLRMTM
Sbjct: 489 GIESFSKLGDSIYFEEEAQAPTLYVIQYISSSLDWKSGNVLLKQEVAPIHSEDPNLRMTM 548
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
FSPK GSVQSSTINLRIPSWT+A+ AKV LNGQSL +PN NF+ V+ KW+SGDKL+LE
Sbjct: 549 MFSPK-GSVQSSTINLRIPSWTTANDAKVTLNGQSLAISPNVNFQPVSYKWNSGDKLTLE 608
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
LPIN+RTEAIEDDRSEYASIKAILFGPYLLAAYS GDW+IKTG +DS SDWITPVPS YN
Sbjct: 609 LPINLRTEAIEDDRSEYASIKAILFGPYLLAAYSDGDWDIKTGSTDSLSDWITPVPSAYN 668
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRL-ILNDPSAKVTELRDV 720
TFLVTFSQ SGKTSFALTNSNQSITMEKYP +GT+SAV ATFRL ILNDPSAKV+ELRDV
Sbjct: 669 TFLVTFSQESGKTSFALTNSNQSITMEKYPEQGTNSAVRATFRLIILNDPSAKVSELRDV 728
Query: 721 IGKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADN 780
IGKRVMLEPF+FPGMVLG GKD LAIA+S SEGH S FYLVEGLDG NGT+SL+SADN
Sbjct: 729 IGKRVMLEPFDFPGMVLGTRGKDGDLAIAESNSEGHFSDFYLVEGLDGKNGTISLKSADN 788
Query: 781 EGCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNF 840
EGCFVYSGVNYESG QLKLSCKSKLS DDGF++ASSFV++NG QYHPISF+ KG TR F
Sbjct: 789 EGCFVYSGVNYESGPQLKLSCKSKLSSDDGFDQASSFVIQNGVIQYHPISFIVKGATRKF 848
Query: 841 LLAPLLSFIDESYTVYFNVI 860
LLAPLLSFIDESYTVYFNVI
Sbjct: 849 LLAPLLSFIDESYTVYFNVI 867
BLAST of HG10001145 vs. ExPASy TrEMBL
Match:
A0A5D3BCH4 (DUF1680 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold182G001250 PE=4 SV=1)
HSP 1 Score: 1459.1 bits (3776), Expect = 0.0e+00
Identity = 708/758 (93.40%), Positives = 735/758 (96.97%), Query Frame = 0
Query: 100 LLKEMSLHDVRLDPNSLHGTVQMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKS 159
+LKE+SLHDVRLDPNSLHGT Q TNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKS
Sbjct: 1 MLKEISLHDVRLDPNSLHGTAQTTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKS 60
Query: 160 DCELRGHFVGHYLSASAQMWASTGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFD 219
DCELRGHFVGHYLSASAQMWASTGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFD
Sbjct: 61 DCELRGHFVGHYLSASAQMWASTGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFD 120
Query: 220 RFEAIQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEK 279
RFEA+QPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVTWMVEYFYNRVQNV+LKYTVE+
Sbjct: 121 RFEAVQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVTWMVEYFYNRVQNVILKYTVER 180
Query: 280 HYRALNEETGGMNDVLYRLYSITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHI 339
HYR+LNEETGGMNDVLYRLY ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFH NTHI
Sbjct: 181 HYRSLNEETGGMNDVLYRLYRITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHVNTHI 240
Query: 340 PIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTEN 399
PIV+GSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGGTSVHEFW DPKRLA+ LGTE
Sbjct: 241 PIVIGSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGGTSVHEFWRDPKRLADALGTET 300
Query: 400 EESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSS 459
EESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLSIQRGT+PGVMIYMLPLGSGSS
Sbjct: 301 EESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLSIQRGTNPGVMIYMLPLGSGSS 360
Query: 460 KARSYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGN 519
KARSYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEAQTP+LYVIQYI SSLDWKSGN
Sbjct: 361 KARSYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEAQTPSLYVIQYISSSLDWKSGN 420
Query: 520 VLVNQEVDPIHSEDPNLRMTMTFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNN 579
VL+NQEVDPIHSEDP LRMT+TFSPK GSV+SSTINLRIPSWTSASGAKVLLNGQSLGNN
Sbjct: 421 VLLNQEVDPIHSEDPKLRMTLTFSPK-GSVRSSTINLRIPSWTSASGAKVLLNGQSLGNN 480
Query: 580 PNGNFKSVTNKWSSGDKLSLELPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWE 639
PNGNFKSVTN WSSGDKLSLE+PIN+RTEAI+DDRSEYAS+KAILFGPYLLAAYSSGDWE
Sbjct: 481 PNGNFKSVTNSWSSGDKLSLEIPINLRTEAIDDDRSEYASVKAILFGPYLLAAYSSGDWE 540
Query: 640 IKTGLSDSFSDWITPVPSVYNTFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVH 699
IKT +DSFSDWITPVPSVYNTFLVTFSQ SGKTSFALTNSNQSITMEKYP +GTDSAVH
Sbjct: 541 IKTRQADSFSDWITPVPSVYNTFLVTFSQASGKTSFALTNSNQSITMEKYPEQGTDSAVH 600
Query: 700 ATFRLILNDPSAKVTELRDVIGKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYF 759
ATFRLI+NDPSAKVTELRDVIGKRVMLEPF+FPGMVLGN+GKDEKL IAD+ SE HSS F
Sbjct: 601 ATFRLIVNDPSAKVTELRDVIGKRVMLEPFSFPGMVLGNKGKDEKLEIADANSEAHSSEF 660
Query: 760 YLVEGLDGNNGTVSLESADNEGCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVME 819
YLVEGLDG NGTVSL S DNEGCFVYSGVNYESG+QLKLSCKSKLSLDDGF+EASSFVME
Sbjct: 661 YLVEGLDGKNGTVSLASIDNEGCFVYSGVNYESGSQLKLSCKSKLSLDDGFDEASSFVME 720
Query: 820 NGASQYHPISFVAKGLTRNFLLAPLLSFIDESYTVYFN 858
+GASQYHPISFV KGLTRNFLLAPLLSF+DESYTVYFN
Sbjct: 721 SGASQYHPISFVTKGLTRNFLLAPLLSFVDESYTVYFN 757
BLAST of HG10001145 vs. ExPASy TrEMBL
Match:
A0A6J1H2F6 (uncharacterized protein LOC111459415 OS=Cucurbita moschata OX=3662 GN=LOC111459415 PE=4 SV=1)
HSP 1 Score: 1447.2 bits (3745), Expect = 0.0e+00
Identity = 697/859 (81.14%), Positives = 770/859 (89.64%), Query Frame = 0
Query: 1 MWAVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTD 60
MW V VALM FLLC C++LKECTN PTQLGSHT RYEL SHN T K+EMFSHYHLTPTD
Sbjct: 10 MWVVWVALMAFLLCHCDALKECTNIPTQLGSHTLRYELSLSHNETLKKEMFSHYHLTPTD 69
Query: 61 DFAWSNLLPRKMLKEENEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTV 120
D AWSNLL R++LKEENEFNWEMMYRQMKNKDG+QVPGGLLKE+ L DVRL+PNS HG
Sbjct: 70 DAAWSNLLTRRLLKEENEFNWEMMYRQMKNKDGVQVPGGLLKEVPLGDVRLEPNSFHGRA 129
Query: 121 QMTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWA 180
Q TNLKYLLMLDVD LLWSFR+TAGLPTPG+PYLGWEKSDCELRGHFVGHYLSA+A+MWA
Sbjct: 130 QATNLKYLLMLDVDNLLWSFRQTAGLPTPGKPYLGWEKSDCELRGHFVGHYLSATAKMWA 189
Query: 181 STGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAG 240
STG+ +KEKM+ALVSGLA CQDKMGTGYLSAFPSE FDR+EAI+PVWAPYYTIHKILAG
Sbjct: 190 STGDAAIKEKMNALVSGLAACQDKMGTGYLSAFPSELFDRYEAIKPVWAPYYTIHKILAG 249
Query: 241 LLDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYS 300
LLDQYTF GN+QALKMVT MVEYFYNRVQNV+ +TVE+HY++LN ETGGMNDVLYRLY
Sbjct: 250 LLDQYTFGGNAQALKMVTRMVEYFYNRVQNVIKLHTVERHYQSLNTETGGMNDVLYRLYG 309
Query: 301 ITGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEI 360
ITGNT HLLLAHLFDKPCFLG+LAVQAE +SG H NTHIPIVVG Q+RYE+TGDPLYKE+
Sbjct: 310 ITGNTTHLLLAHLFDKPCFLGILAVQAESLSGLHANTHIPIVVGGQLRYELTGDPLYKEM 369
Query: 361 STYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTK 420
ST+FMD +NSSHSYATGGTS HEFWTDPKRLA+TLG ENEESCTTYNMLKVSRNLF+WTK
Sbjct: 370 STFFMDSINSSHSYATGGTSAHEFWTDPKRLADTLGAENEESCTTYNMLKVSRNLFRWTK 429
Query: 421 EIAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGT 480
+AYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFE+FWCCYGT
Sbjct: 430 GVAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFETFWCCYGT 489
Query: 481 GIESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTM 540
GIESFSKLGDSIYFEE AQTPTLYVIQYI SSL+WKSGNVL+NQ VDP+HS+DPNLRMTM
Sbjct: 490 GIESFSKLGDSIYFEEGAQTPTLYVIQYISSSLNWKSGNVLLNQVVDPVHSDDPNLRMTM 549
Query: 541 TFSPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 600
TFSPK SVQSSTINLRIPSWTSASGA+VLLNGQS+G NG F+ VTNKWSS DKLS+
Sbjct: 550 TFSPK-ESVQSSTINLRIPSWTSASGAQVLLNGQSVGKITNGIFQPVTNKWSSKDKLSIV 609
Query: 601 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 660
LPIN+RTEAI DDR+++AS KAILFGPYLLA +S GD +IKTG + SFSDWITPVPS YN
Sbjct: 610 LPINLRTEAIGDDRTQFASTKAILFGPYLLAGHSGGDKDIKTGTTGSFSDWITPVPSSYN 669
Query: 661 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 720
TFLVT SQ SG SFALTNSNQ+ITME YPG+GTDSAVHATFRL+LND +A V L+DVI
Sbjct: 670 TFLVTLSQMSGNASFALTNSNQTITMEMYPGQGTDSAVHATFRLVLNDTTANVKTLQDVI 729
Query: 721 GKRVMLEPFNFPGMVLGNEGKDEKLAIADSTSEGHSSYFYLVEGLDGNNGTVSLESADNE 780
GKRV LEPF+FPGMVL +G D+KL IA S S G SS F++++GLDG NGT+SL+SA+NE
Sbjct: 730 GKRVKLEPFDFPGMVLATQGPDQKLVIAGSNSVGLSSDFFVIQGLDGKNGTISLKSANNE 789
Query: 781 GCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFL 840
CFVYSGVNY+SG QLKLSCK K S D F++ASSF M+ G SQYHPISFVAKG TRNFL
Sbjct: 790 DCFVYSGVNYKSGVQLKLSCKPKSSSDVAFDQASSFAMQTGVSQYHPISFVAKGPTRNFL 849
Query: 841 LAPLLSFIDESYTVYFNVI 860
+APL+SF+DE+YTVYFN+I
Sbjct: 850 MAPLMSFMDETYTVYFNII 867
BLAST of HG10001145 vs. TAIR 10
Match:
AT5G12950.1 (Putative glycosyl hydrolase of unknown function (DUF1680))
HSP 1 Score: 1118.6 bits (2892), Expect = 0.0e+00
Identity = 549/842 (65.20%), Positives = 658/842 (78.15%), Query Frame = 0
Query: 20 KECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTDDFAWSNLLPRKMLKEE-NE 79
KECTNTPTQL SHTFR ELL S N T K E+FSHYHLTP DD AWS+LLPRKMLKEE +E
Sbjct: 25 KECTNTPTQLSSHTFRSELLQSKNETLKTELFSHYHLTPADDSAWSSLLPRKMLKEEADE 84
Query: 80 FNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTVQMTNLKYLLMLDVDRLLW 139
F W M+YR+ K+ + G LK++SLHDVRLDP+S H Q TNL+YLLMLDVD L W
Sbjct: 85 FAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPDSFHWRAQQTNLEYLLMLDVDGLAW 144
Query: 140 SFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWASTGNPVLKEKMSALVSGL 199
SFRK AGL PG+ Y GWE+ D ELRGHFVGHYLSA+A MWAST N LKEKMSALVS L
Sbjct: 145 SFRKEAGLDAPGDYYGGWERPDSELRGHFVGHYLSATAYMWASTHNDTLKEKMSALVSAL 204
Query: 200 ATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAGLLDQYTFAGNSQALKMVT 259
+ CQ K GTGYLSAFPS FDRFEAI PVWAPYYTIHKILAGL+DQY AGNSQALKM T
Sbjct: 205 SECQQKSGTGYLSAFPSSFFDRFEAITPVWAPYYTIHKILAGLVDQYKLAGNSQALKMAT 264
Query: 260 WMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYSITGNTKHLLLAHLFDKPC 319
M +YFY RV+NV+ KY+VE+H+++LNEETGGMNDVLY+LYSITG++K+LLLAHLFDKPC
Sbjct: 265 GMADYFYGRVRNVIRKYSVERHWQSLNEETGGMNDVLYQLYSITGDSKYLLLAHLFDKPC 324
Query: 320 FLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEISTYFMDIVNSSHSYATGG 379
FLG+LA+QA+DISGFH NTHIPIVVGSQ RYE+TGD L+KEIS +FMDI N+SHSYATGG
Sbjct: 325 FLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEISMFFMDIFNASHSYATGG 384
Query: 380 TSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTKEIAYADYYERALTNGVLS 439
TSV EFW DPKR+A L TENEESCTTYNMLKVSRNLF+WTKE++YADYYERALTNGVL
Sbjct: 385 TSVSEFWQDPKRMATALQTENEESCTTYNMLKVSRNLFRWTKEVSYADYYERALTNGVLG 444
Query: 440 IQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGTGIESFSKLGDSIYFEEEA 499
IQRGT PG+MIYMLPLG G SKA +YHGWGTP++SFWCCYGTGIESFSKLGDSIYF+E+
Sbjct: 445 IQRGTQPGLMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTGIESFSKLGDSIYFQEDG 504
Query: 500 QTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTMTF-SPKVGSVQSSTINLR 559
TP LYV QYI SSLDWKS + ++Q+V+P+ S DP +R+T T S KVG + ST+NLR
Sbjct: 505 ATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFTLSSSKVGVAKESTLNLR 564
Query: 560 IPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLELPINIRTEAIEDDRSEY 619
IP WT++ GAKV LNG+ L +GNF S+ KW SGD++++ELP++IRTEAI+DDR EY
Sbjct: 565 IPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTMELPMSIRTEAIKDDRPEY 624
Query: 620 ASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYNTFLVTFSQPSGKTSFAL 679
AS++AIL+GPYLLA ++S DW I T WITP+P N++LVT SQ SG S+
Sbjct: 625 ASLQAILYGPYLLAGHTSRDWSITTQAKP--GKWITPIPETQNSYLVTLSQQSGNVSYVF 684
Query: 680 TNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVIGKRVMLEPFNFPGMVLG 739
+NSNQ+ITM P GT AV ATFRL+ ++ +++ +IG+ VMLEPF+FPGM++
Sbjct: 685 SNSNQTITMRVSPEPGTQDAVAATFRLVTDNSKPRISGPEGLIGRLVMLEPFDFPGMIV- 744
Query: 740 NEGKDEKLAI-ADSTSEGHSSYFYLVEGLDGNNGTVSLESADNEGCFVYSGVNYESGAQL 799
+ D L + A S S+ +S F LV GLDG G+VSL +GCFVYS + G +L
Sbjct: 745 KQATDSSLTVQASSPSDKGASSFRLVSGLDGKLGSVSLRLESKKGCFVYSDQTLKQGTKL 804
Query: 800 KLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNFLLAPLLSFIDESYTVYF 859
+L C S + D+ F EA+SF ++ G QY+P+SFV G RNF+L+PL S DE+Y VYF
Sbjct: 805 RLECGSD-ATDEKFKEAASFSLKTGMHQYNPMSFVMSGTQRNFVLSPLFSLRDETYNVYF 859
BLAST of HG10001145 vs. TAIR 10
Match:
AT5G12960.1 (Putative glycosyl hydrolase of unknown function (DUF1680))
HSP 1 Score: 1097.4 bits (2837), Expect = 0.0e+00
Identity = 547/859 (63.68%), Positives = 657/859 (76.48%), Query Frame = 0
Query: 3 AVLVALMVFLLCRCNSLKECTNTPTQLGSHTFRYELLSSHNGTWKEEMFSHYHLTPTDDF 62
A+L+ L+C KECT+ PT+L SHT R ELL S N K E FSHYHLTPTDD
Sbjct: 15 ALLLYTSFLLVCLA---KECTDIPTKLSSHTLRSELLQSQNANLKSEEFSHYHLTPTDDS 74
Query: 63 AWSNLLPRKMLKEE-NEFNWEMMYRQMKNKDGLQVPGGLLKEMSLHDVRLDPNSLHGTVQ 122
AWS LLPRKMLKEE ++F W M+YR+ K+ + G LK++SLHDVRLDP+S H Q
Sbjct: 75 AWSTLLPRKMLKEETDDFAWTMLYRKFKDSNS---SGNFLKDVSLHDVRLDPSSFHWRAQ 134
Query: 123 MTNLKYLLMLDVDRLLWSFRKTAGLPTPGEPYLGWEKSDCELRGHFVGHYLSASAQMWAS 182
TNL+YLLMLDVD L ++FRK AGL PG PY GWEK D ELRGHFVGHYLSA+A MWAS
Sbjct: 135 QTNLEYLLMLDVDGLAYNFRKEAGLNAPGVPYGGWEKPDSELRGHFVGHYLSATAYMWAS 194
Query: 183 TGNPVLKEKMSALVSGLATCQDKMGTGYLSAFPSEEFDRFEAIQPVWAPYYTIHKILAGL 242
T N LK KM+ALVS LA CQ K GTGYLSAFPS FDRFEAI VWAPYYTIHKILAGL
Sbjct: 195 THNETLKAKMTALVSALAECQQKYGTGYLSAFPSSFFDRFEAITHVWAPYYTIHKILAGL 254
Query: 243 LDQYTFAGNSQALKMVTWMVEYFYNRVQNVVLKYTVEKHYRALNEETGGMNDVLYRLYSI 302
+DQY AGN+QALKM T M +YFY RVQNV+ KY+VE+H+ +LNEETGGMNDVLY+LYSI
Sbjct: 255 VDQYKLAGNTQALKMATGMADYFYGRVQNVIKKYSVERHWLSLNEETGGMNDVLYQLYSI 314
Query: 303 TGNTKHLLLAHLFDKPCFLGLLAVQAEDISGFHTNTHIPIVVGSQMRYEVTGDPLYKEIS 362
T ++K+L LAHLFDKPCFLG+LA+QA+DISGFH NTHIPIVVGSQ RYE+TGD L+KEI
Sbjct: 315 TRDSKYLFLAHLFDKPCFLGVLAIQADDISGFHANTHIPIVVGSQQRYEITGDLLHKEIP 374
Query: 363 TYFMDIVNSSHSYATGGTSVHEFWTDPKRLANTLGTENEESCTTYNMLKVSRNLFKWTKE 422
+FMDIVN+SHSYATGGTSV EFW DPKR+A TL TENEESCTTYNMLKVSRNLF+WTKE
Sbjct: 375 MFFMDIVNASHSYATGGTSVKEFWQDPKRMATTLQTENEESCTTYNMLKVSRNLFRWTKE 434
Query: 423 IAYADYYERALTNGVLSIQRGTDPGVMIYMLPLGSGSSKARSYHGWGTPFESFWCCYGTG 482
++YADYYERALTNGVL IQRGTDPG MIYMLPLG G SKA +YHGWGTP++SFWCCYGTG
Sbjct: 435 VSYADYYERALTNGVLGIQRGTDPGRMIYMLPLGKGVSKAVTYHGWGTPYDSFWCCYGTG 494
Query: 483 IESFSKLGDSIYFEEEAQTPTLYVIQYIPSSLDWKSGNVLVNQEVDPIHSEDPNLRMTMT 542
IESFSKLGDSIYF+E+ TP LYV QYI SSLDWKS + ++Q+V+P+ S DP +R+T T
Sbjct: 495 IESFSKLGDSIYFQEDGATPALYVTQYISSSLDWKSAGLSISQKVNPVVSWDPYMRVTFT 554
Query: 543 F-SPKVGSVQSSTINLRIPSWTSASGAKVLLNGQSLGNNPNGNFKSVTNKWSSGDKLSLE 602
S KVG + ST+NLRIP WT++ GAKV LNG+ L +GNF S+ KW SGD++++E
Sbjct: 555 LSSSKVGVAKESTLNLRIPVWTNSIGAKVSLNGRPLNVPTSGNFLSIKQKWKSGDQVTME 614
Query: 603 LPINIRTEAIEDDRSEYASIKAILFGPYLLAAYSSGDWEIKTGLSDSFSDWITPVPSVYN 662
LP++IRTEAI+DDR EYAS++AIL+GPYLLA ++S DW I T +WITP+P N
Sbjct: 615 LPMSIRTEAIKDDRPEYASLQAILYGPYLLAGHTSMDWSITT--QAKAGNWITPIPETLN 674
Query: 663 TFLVTFSQPSGKTSFALTNSNQSITMEKYPGRGTDSAVHATFRLILNDPSAKVTELRDVI 722
+ LVT SQ SG S+ L+NSNQ+I M+ P GT AV ATFRL+ +D ++ +I
Sbjct: 675 SHLVTLSQQSGNISYVLSNSNQTIIMKVSPEPGTQDAVSATFRLVTDDSKHPISSPEGLI 734
Query: 723 GKRVMLEPFNFPGMVLGNEGKDEKLAI-ADSTSEGHSSYFYLVEGLDGNNGTVSLESADN 782
G VMLEPF+FPGM++ + D L + A S S+ SS F LV GLDG G+VSL
Sbjct: 735 GSLVMLEPFDFPGMIV-KQATDSSLTVQASSPSDKGSSSFRLVSGLDGKPGSVSLSLESK 794
Query: 783 EGCFVYSGVNYESGAQLKLSCKSKLSLDDGFNEASSFVMENGASQYHPISFVAKGLTRNF 842
+GCFVYS + G +L+L C S + D+ F +A+SF ++ G +QY+P+SFV G RNF
Sbjct: 795 KGCFVYSDQTLKQGTKLRLECGS-AATDEKFKQAASFSLKTGMNQYNPMSFVMSGTQRNF 854
Query: 843 LLAPLLSFIDESYTVYFNV 859
+L+PL S DE+Y VYF+V
Sbjct: 855 VLSPLFSLRDETYNVYFSV 863
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038901175.1 | 0.0e+00 | 95.46 | uncharacterized protein LOC120088146 [Benincasa hispida]
XP_008449737.1 | 0.0e+00 | 92.53 | PREDICTED: uncharacterized protein LOC103491528 [Cucumis melo]
KAA0041392.1 | 0.0e+00 | 92.42 | DUF1680 domain-containing protein [Cucumis melo var. makuwa]
XP_011653585.1 | 0.0e+00 | 91.37 | uncharacterized protein LOC101207833 [Cucumis sativus] >KAE8649507.1 hypothetica...
XP_022148748.1 | 0.0e+00 | 89.88 | uncharacterized protein LOC111017340 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A1S3BM44 | 0.0e+00 | 92.53 | uncharacterized protein LOC103491528 OS=Cucumis melo OX=3656 GN=LOC103491528 PE=...
A0A5A7TD86 | 0.0e+00 | 92.42 | DUF1680 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E6C2...
A0A6J1D4Z0 | 0.0e+00 | 89.88 | uncharacterized protein LOC111017340 OS=Momordica charantia OX=3673 GN=LOC111017...
A0A5D3BCH4 | 0.0e+00 | 93.40 | DUF1680 domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E567...
A0A6J1H2F6 | 0.0e+00 | 81.14 | uncharacterized protein LOC111459415 OS=Cucurbita moschata OX=3662 GN=LOC1114594...
Match Name | E-value | Identity (%) | Description
AT5G12950.1 | 0.0e+00 | 65.20 | Putative glycosyl hydrolase of unknown function (DUF1680)
AT5G12960.1 | 0.0e+00 | 63.68 | Putative glycosyl hydrolase of unknown function (DUF1680)
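
The Identity column in these tables is the number of identical residues divided by the alignment length, as given on each HSP summary line (for example 793/857 for A0A1S3BM44). The following is a minimal Python sketch for recomputing those percentages from a summary line. It assumes the line layout shown in the alignments on this page; the function name `parse_hsp` and the regular expression are illustrative and are not part of the database's own tooling.

```python
import re

# Matches the HSP summary line layout seen on this page (an assumption).
# "Posi?tives" also tolerates the "Postives" spelling found in some exports.
HSP_LINE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp(line):
    """Return counts and recomputed percentages, or None if no match."""
    m = HSP_LINE.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "aligned": int(ident_len),
        "positives": int(pos),
        # Percentages recomputed from the raw counts.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the line, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 793/857 (92.53%), Positives = 828/857 (96.62%), Query Frame = 0"
    print(parse_hsp(line))
```

For the A0A1S3BM44 hit, the recomputed values (92.53 and 96.62) agree with the percentages printed in the summary line and with the Identity column of the table.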