HG10004139 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10004139
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: XS domain-containing protein
Location: Chr08: 14098631 .. 14104077 (-)
RNA-Seq Expression: HG10004139
Synteny: HG10004139
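
As a quick sanity check on the Location field, the span of the gene on Chr08 can be computed from its two coordinates. The sketch below assumes the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this), and the helper name `feature_length` is purely illustrative.

```python
def feature_length(start: int, end: int) -> int:
    """Return the length in bp of a feature.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record). Strand does not affect the length.
    """
    if end < start:
        raise ValueError("end must be >= start")
    return end - start + 1


# Coordinates taken from the Location field: Chr08: 14098631 .. 14104077 (-)
gene_length_bp = feature_length(14098631, 14104077)
print(gene_length_bp)
```

Under that assumption the genomic span, introns included, is 5447 bp.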
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGCAATGTCGAAGGCGTGAAGATTACTATGTTAGAGAGTCGGAAAATATGGAACTACATGCTCAGGATCGGCTTCATGCTCAGGATCGGCTTCATCTTGATCATGGTCGATATGGTAAGCCCCAACGTGAGACACTGGATCAATCTCCGCGTCTTAGGAGAAGTTTGAGCCCTCACAGAATTGGTGGCTCCCGATGCGAAGTGGGTTTGGTTCATAGGGTTGATACCACTGAGAGGAGGAACGAGGATTGGCATCTGAGAACTGGAAGGAACAATGACATCGGGTCAAGTTCACATTCTTATGGTCATGCGAGGAAACTGCCCAACTACGAGGAAGTGTTTCTGCACAATGATCACAGGCAGCTTTCGGATTTGCAACAGACGCATGTTTTACCTGAGCCAAGGAAATTTTCTGCTGACAACGAAGTTGTCGATTATAAGCAGCATGACGTTCGGTATAGGCATGATGATTTGAGAATTAGGAAAGAGAGAGAAATTATTGAAGGGAGATGGTCAGATGGCCGTGGACAAAGGATGACGGATCAGAAACTTTTGGCTACGGAAGAGGGCACTGCAATGGGATTGTATAATTCACATCTTGATATCGGTCCCACGTCAGTCTATAAAGACTTCTTACCATTTTCCCAGAGTTTGGATGTGCGAAGTCTTGACAATGAAAGACTCAAATTTCGAAACGATGTAGTTTCTGATAAACCTCGGGTCACAAATTCACGCGAAGTTGAAGAAAACCAGAGGTTTAATTCAAGGAATATTGGATATTCTGCAAGTTCAGGATTTCATTCCAGAGGAAATGAGAGCTCTTCGTCAGGACCATTGACAAGCAAGTGTTTGGAATCTTATCGTGATGGCCATTACTTTCCAATTTCAGATGAGTTTTCAACAAGGAGCCATGGAGACCTTGTGGACCCTATAGAATTTAATTCATATGGGAAAAGGACCCTTGTAGATTCAGCCATTGATCTTGTAGGTGGAAAAAGGGATCTTACTCCTCATCAACGAGGAACTAATAGTCCTAGGAGGGAGCAAAGGAGCTATTTTTATTCTAAACCTGAGAGAACAGTGAATAATTCAAATGAAGATCCATCTCGAGTGATGCAGAAAATTACTCAAACACATGACTATGTTGATTATGACAGTACAATTGTTTCTGATCATGGAGACTTTTCAAGACCTAAAGCTGCAAACACTAGTATGCTGAAATTACAAAATGCTGACGACCCATGTGCGAACTATAGAACTGGAATAGCACTCGACCATTATAGGCTTAGGAAACAGACAGTTTTAGGTTACCCTGATATAGGACCAACCACAGAAGCAATAAACAATGACAATGAATATGCAGGTGCTGGATCCATCCATGTTGATGTGGGTAGGAGAGTAACTCAAGATTATGAAAGGTCACACATTAACCTTTCTCAGTATTGTCAACCATCGTATGCAAGATCGGATTGTGGCTCAGAAAGAGAAGTAGGTTCATATTGTTTGAAAGAGAGGTTGAATAGGTTCTCCATGTCCAAGTGTGATGGAGAGGCTTACAGAAATAATGGAAGAGTGCAGAGAATGACCGAGGGTGTTCGCACTTATAACTTGAGGGAGGACCATATGCCAAAAAGAAAGTACTTTGATGCAGATATGAATTTACTCGATCATAGAATAGCTACATCATGTGAATATACACCCAGTAAGGTTGTGGATCTTTATGATAGTGGTGAACAGTGGATGGATGAGGAAAACAGTTGCAGATATATATCCAGGAAAGCAGGATTTGACCACAACAAATACAAGAAGCCAAATACGAATTTCAATCGTCAAATTTTGTATGCTTCTGCTGATTCACATGAAAGCTATTTAGATCATGCTAAAAAATATAAACCTGGTCCAAAATATATGAAGGGCAATAGAAGGCATGGTCCTTCAAGCTGGATCAAGTCACAAAATGTTGATCACAGAAATAGTTCTCATAGACCGGTTAAAAATTG
GAAAAAAACTGAAGAAGAGAATGATTATGCTCGTGTAAATGATGATGACTTGTCAGATGATTTGGTAATATCTACAGAATCTGAACCTCCTGAGGATTCTGAAGAGTTCAAACAATTGATTCATGAGGCCTTTTTGAAGTGCTCAAAAAAGTTGAATATGAAGACTAATGTCCGGAAAAAGTACAAGGAGCAAGGAAATGCTGGTAGTTTATATTGCATCGTATGTGGCAGAAGGTCTGTCTTCTTCCCTATTTCCTAATCGTGGAGACTCCATTTCATATTTTGACTCAAGAGTTCCCTAATTGCTATAGTGCACAACCATATTCGAGTAAGATTATAAGAACTAACCTTCAAGGTTTTGTTCTGTACGTTCCTTTGGGATGTAAGACTCCAATTATATGGCCGTGTGCTTCTATTATTTTGTGTGTACCGTCATCATTTTTTTTTTTTTGTGGGGGGGGGGGGGGGGGGGTGGGGGTGGTGTTAAAATCTTGGAGCTGCTTGTGAGGGTTCCCTTTCACTGCAGCAATGTGTGGATCATTAGAGCTCTACTTCAATTGATAGTGGCTTCTATGGTGGTGTTTGAGAATGATGGCTTACCCACATAATTAAACCTACTGCTTACAAACAGGTTCATCATTTGATTGAGTTTGGAAGCTTGGAGTTGTTCAGGATAAGCTTTGAGATTGTATTTGCATTGGTCACTTTTCCAATTGTTGCAATGTTTCCTGTGAAATGCGCAATAATGTCATGGACTCCATGTTCAGATTAACCTTTACGGATGTGTATGTATGTGTTTGCTTTTTAGTTTATCACTTGTTAATGGTGGTGATTATATTCTATAAAGCACACATCAATGCTTGGATCATAGTGAATTTCCTTGGACATTCAATAAAGGCAGTCAGATGAGATCATGGGGGGTCTGGAAGGCACGCAGATCAGCCCATTTTTATTGTTTTAGAACTTCAAAGAGCAGTATCAGCTTGTTGTAGAGAGATCCAAGCTAGAGGGTGTTTTTTGGATAAGAAACAATTTCATTAATGGCATGAAATTACAAAAGAAGGCCTTCATATGGGGGATTTACAAAAGACTTCTCCAATTAGAGAGGAGGGAAGAAAAGCTATAGTATAAAGGGGAAAGCAATTACACCATGCCACAACCAAACTAACTATAGCTTCAAAAAAAATCCACGCTAAAGGGTTTGATTTGGTTGTTATTTTCCTTGTGATATAACTGATCTCATTTGGTCTGTTTGCACTAGTAGATGTATGGCCAGTCCACCTTAGCAAACATTAGTTTTGTGTGAAGATATGGCCTCTTTTGATATTTATCGTGTCAGAACTTTGCTTTTGGAAATGTTATTTTTCCTAGCTCATTTTGCAGAATTTAGGAGAGTATAGCAATAGCTTGGACTTTCTTTCTGCCGCTGCGATGCTTATGGTCTCCATTGCTGCTTGTCAACTGCTTCTTTAGTGATAACAGTCATGCTTTGACATGCATATCAGTTTTAAATGGTAGTATATCGGTGAAGATTGTTGGAATGGCTGTAAAGTGGACCATCGCAAGTACTAGCATGACAGAATTACTTGATAGATATTTTTCCCATAGCGTTCTAGGGTTTAACTTGTCCTTTACTCAATGGGATGGTGTCAACCTAGTACAAAACATCATCTGATGGTAGCTTGCTCTGTAATATTATCTTCATTTTGTACTGTTACGCCCATTTATTTCTTGCTAGAGATATATGTTGCATCTGATATATTGGGAGGACAATTAGGAACTCATATTCGAACCTTTTGTTTTGGTTCATCTAATTTCTTTATTTCAGCCCTAGTGGAGTAGAGCCTTCTATTGCTTTTGTTTAGTCCCATTTTTTGTACAATTTATGTTTGCATTGAAAGTGGTTTCAGGTTTGTTGAAAAACCATGAGACTGTAAAGGTTCACGTGGTGTGGTTAAATGTTCATGCATGTAGCTTAATTTAAGATTGCTGACATCCTT
TGATATTAATTTTCTTCTGAAGTTTTGTTGACTGTCAAGTATTGCAAGAACAACTTCTGATCCCCAGTACTTTGACTCCATGAGTGAAGGTGGAAAAGGTCGAGGTGTTTTTTTAAAAAAGAAAAATCATGAATATGAAATCATGGGGTTTAACTTCTTTGAATTAGATTTCCCGAGGTAGTTCTGAATGTTAATTTGCTGATGGGCTTGTGAGTATGATCATCTTTGTGATTTTGATAGTTGTTTACTTTTTTGAATTCTATTCAGTTTGCTGTCGCCTGCTGTCTTGACCTAAATCTAGTACCTTTGTAGATGACGTGTCTTTTTATTGACTATGCATCACTAATTCTAGCTTGCCTGTTTGTGCCTTTGCAGCCACTCAAAGGAATTTATGAATACTCAACGCCTGGTAAAGCATGCCTATATGTCCCACAAGGTGGGGTTGAGGGCTCAGCATTTGGGTCTTGCCAAAGCCATATGCGTTTTGATGGGGTGGAATAGTGTCCTTCCCCGAGACACTGTAATATGGGTTCCTGAGGTTTTGTCAAAGGAAGAAACTGTGGTTCAGAAGGAGGATCTTATCATCTGGCCTCCTGTTATTATCGTCCGCAACGTTTCTCTGTCACACACCAGTCCTGATAAGTGGAAAGTTGTAACAATTGAAGCACTCGAGTCTTTCTTGAGAAGTAAGTGACGTTTTATACTTCAAACTTATCATGTTAAATAGTATTTGACCAGTACCTTTCACGATGAGGGAGGGCCATAAATCTGACGTTTACAGTTTACTTCTATCTCAGAGCATGTCTAACCTTTTTGGTTGCACCTCTAGACTTCAAATTAGCAGTATGGTATATGCATTGTTGTCTTTTGTCTTTCTAACTGACTTGAGCATAATTCAAACCAAAAATGTAAGAGTTCGAATTCCCCACAGACTTCGAACTAGCAGTATGGTATATATATTGTGGTCTTGTCTTTTTAGCCGACTTGACGATAACTCAATTACTTAAGACATGCTCTCGACTAAAAAGATGAGAGTTCAAATTATCTTCCTCTTGAGCTCAAAAAGATAATAATAATATTTTATTATTCTCGGATATGGTAGCATTAACGCAATTGAATGACAGGTAAAAATCTGCTGAAGGGAAGAGTGAAAATGAGTTTGGGGTGTCCTGCAGATCAAAGTGTAATGGTGTTGAAGTTCCTGCCTACCTTTTCTGGTTTAGCAGATGCAGAAAGACTCAACAAATTCTTCTCTGAAAACAGACGTGGAAGAGAGGATTTTGAGCTGGCAAAGTGCAAAGATGGGGGTGTGGAAATGGAGGGAGACAAAATAGAAGAGGAAGTGCTTTATGGATACTTAGGAGCTGCAGAGGATTTGGATGACGTTGAACTCAATGTAAGGAAGTTGAGTATGATAAAGAGCAAAAAGGAAATATTGGAGTTGTAA

mRNA sequence

ATGCAATGTCGAAGGCGTGAAGATTACTATGTTAGAGAGTCGGAAAATATGGAACTACATGCTCAGGATCGGCTTCATGCTCAGGATCGGCTTCATCTTGATCATGGTCGATATGGTAAGCCCCAACGTGAGACACTGGATCAATCTCCGCGTCTTAGGAGAAGTTTGAGCCCTCACAGAATTGGTGGCTCCCGATGCGAAGTGGGTTTGGTTCATAGGGTTGATACCACTGAGAGGAGGAACGAGGATTGGCATCTGAGAACTGGAAGGAACAATGACATCGGGTCAAGTTCACATTCTTATGGTCATGCGAGGAAACTGCCCAACTACGAGGAAGTGTTTCTGCACAATGATCACAGGCAGCTTTCGGATTTGCAACAGACGCATGTTTTACCTGAGCCAAGGAAATTTTCTGCTGACAACGAAGTTGTCGATTATAAGCAGCATGACGTTCGGTATAGGCATGATGATTTGAGAATTAGGAAAGAGAGAGAAATTATTGAAGGGAGATGGTCAGATGGCCGTGGACAAAGGATGACGGATCAGAAACTTTTGGCTACGGAAGAGGGCACTGCAATGGGATTGTATAATTCACATCTTGATATCGGTCCCACGTCAGTCTATAAAGACTTCTTACCATTTTCCCAGAGTTTGGATGTGCGAAGTCTTGACAATGAAAGACTCAAATTTCGAAACGATGTAGTTTCTGATAAACCTCGGGTCACAAATTCACGCGAAGTTGAAGAAAACCAGAGGTTTAATTCAAGGAATATTGGATATTCTGCAAGTTCAGGATTTCATTCCAGAGGAAATGAGAGCTCTTCGTCAGGACCATTGACAAGCAAGTGTTTGGAATCTTATCGTGATGGCCATTACTTTCCAATTTCAGATGAGTTTTCAACAAGGAGCCATGGAGACCTTGTGGACCCTATAGAATTTAATTCATATGGGAAAAGGACCCTTGTAGATTCAGCCATTGATCTTGTAGGTGGAAAAAGGGATCTTACTCCTCATCAACGAGGAACTAATAGTCCTAGGAGGGAGCAAAGGAGCTATTTTTATTCTAAACCTGAGAGAACAGTGAATAATTCAAATGAAGATCCATCTCGAGTGATGCAGAAAATTACTCAAACACATGACTATGTTGATTATGACAGTACAATTGTTTCTGATCATGGAGACTTTTCAAGACCTAAAGCTGCAAACACTAGTATGCTGAAATTACAAAATGCTGACGACCCATGTGCGAACTATAGAACTGGAATAGCACTCGACCATTATAGGCTTAGGAAACAGACAGTTTTAGGTTACCCTGATATAGGACCAACCACAGAAGCAATAAACAATGACAATGAATATGCAGGTGCTGGATCCATCCATGTTGATGTGGGTAGGAGAGTAACTCAAGATTATGAAAGGTCACACATTAACCTTTCTCAGTATTGTCAACCATCGTATGCAAGATCGGATTGTGGCTCAGAAAGAGAAGTAGGTTCATATTGTTTGAAAGAGAGGTTGAATAGGTTCTCCATGTCCAAGTGTGATGGAGAGGCTTACAGAAATAATGGAAGAGTGCAGAGAATGACCGAGGGTGTTCGCACTTATAACTTGAGGGAGGACCATATGCCAAAAAGAAAGTACTTTGATGCAGATATGAATTTACTCGATCATAGAATAGCTACATCATGTGAATATACACCCAGTAAGGTTGTGGATCTTTATGATAGTGGTGAACAGTGGATGGATGAGGAAAACAGTTGCAGATATATATCCAGGAAAGCAGGATTTGACCACAACAAATACAAGAAGCCAAATACGAATTTCAATCGTCAAATTTTGTATGCTTCTGCTGATTCACATGAAAGCTATTTAGATCATGCTAAAAAATATAAACCTGGTCCAAAATATATGAAGGGCAATAGAAGGCATGGTCCTTCAAGCTGGATCAAGTCACAAAATGTTGATCACAGAAATAGTTCTCATAGACCGGTTAAAAATTG
GAAAAAAACTGAAGAAGAGAATGATTATGCTCGTGTAAATGATGATGACTTGTCAGATGATTTGGTAATATCTACAGAATCTGAACCTCCTGAGGATTCTGAAGAGTTCAAACAATTGATTCATGAGGCCTTTTTGAAGTGCTCAAAAAAGTTGAATATGAAGACTAATGTCCGGAAAAAGTACAAGGAGCAAGGAAATGCTGGTAGTTTATATTGCATCGTATGTGGCAGAAGCCACTCAAAGGAATTTATGAATACTCAACGCCTGGTAAAGCATGCCTATATGTCCCACAAGGTGGGGTTGAGGGCTCAGCATTTGGGTCTTGCCAAAGCCATATGCGTTTTGATGGGGTGGAATAGTGTCCTTCCCCGAGACACTGTAATATGGGTTCCTGAGGTTTTGTCAAAGGAAGAAACTGTGGTTCAGAAGGAGGATCTTATCATCTGGCCTCCTGTTATTATCGTCCGCAACGTTTCTCTGTCACACACCAGTCCTGATAAGTGGAAAGTTGTAACAATTGAAGCACTCGAGTCTTTCTTGAGAAGTAAAAATCTGCTGAAGGGAAGAGTGAAAATGAGTTTGGGGTGTCCTGCAGATCAAAGTGTAATGGTGTTGAAGTTCCTGCCTACCTTTTCTGGTTTAGCAGATGCAGAAAGACTCAACAAATTCTTCTCTGAAAACAGACGTGGAAGAGAGGATTTTGAGCTGGCAAAGTGCAAAGATGGGGGTGTGGAAATGGAGGGAGACAAAATAGAAGAGGAAGTGCTTTATGGATACTTAGGAGCTGCAGAGGATTTGGATGACGTTGAACTCAATGTAAGGAAGTTGAGTATGATAAAGAGCAAAAAGGAAATATTGGAGTTGTAA

Coding sequence (CDS)

ATGCAATGTCGAAGGCGTGAAGATTACTATGTTAGAGAGTCGGAAAATATGGAACTACATGCTCAGGATCGGCTTCATGCTCAGGATCGGCTTCATCTTGATCATGGTCGATATGGTAAGCCCCAACGTGAGACACTGGATCAATCTCCGCGTCTTAGGAGAAGTTTGAGCCCTCACAGAATTGGTGGCTCCCGATGCGAAGTGGGTTTGGTTCATAGGGTTGATACCACTGAGAGGAGGAACGAGGATTGGCATCTGAGAACTGGAAGGAACAATGACATCGGGTCAAGTTCACATTCTTATGGTCATGCGAGGAAACTGCCCAACTACGAGGAAGTGTTTCTGCACAATGATCACAGGCAGCTTTCGGATTTGCAACAGACGCATGTTTTACCTGAGCCAAGGAAATTTTCTGCTGACAACGAAGTTGTCGATTATAAGCAGCATGACGTTCGGTATAGGCATGATGATTTGAGAATTAGGAAAGAGAGAGAAATTATTGAAGGGAGATGGTCAGATGGCCGTGGACAAAGGATGACGGATCAGAAACTTTTGGCTACGGAAGAGGGCACTGCAATGGGATTGTATAATTCACATCTTGATATCGGTCCCACGTCAGTCTATAAAGACTTCTTACCATTTTCCCAGAGTTTGGATGTGCGAAGTCTTGACAATGAAAGACTCAAATTTCGAAACGATGTAGTTTCTGATAAACCTCGGGTCACAAATTCACGCGAAGTTGAAGAAAACCAGAGGTTTAATTCAAGGAATATTGGATATTCTGCAAGTTCAGGATTTCATTCCAGAGGAAATGAGAGCTCTTCGTCAGGACCATTGACAAGCAAGTGTTTGGAATCTTATCGTGATGGCCATTACTTTCCAATTTCAGATGAGTTTTCAACAAGGAGCCATGGAGACCTTGTGGACCCTATAGAATTTAATTCATATGGGAAAAGGACCCTTGTAGATTCAGCCATTGATCTTGTAGGTGGAAAAAGGGATCTTACTCCTCATCAACGAGGAACTAATAGTCCTAGGAGGGAGCAAAGGAGCTATTTTTATTCTAAACCTGAGAGAACAGTGAATAATTCAAATGAAGATCCATCTCGAGTGATGCAGAAAATTACTCAAACACATGACTATGTTGATTATGACAGTACAATTGTTTCTGATCATGGAGACTTTTCAAGACCTAAAGCTGCAAACACTAGTATGCTGAAATTACAAAATGCTGACGACCCATGTGCGAACTATAGAACTGGAATAGCACTCGACCATTATAGGCTTAGGAAACAGACAGTTTTAGGTTACCCTGATATAGGACCAACCACAGAAGCAATAAACAATGACAATGAATATGCAGGTGCTGGATCCATCCATGTTGATGTGGGTAGGAGAGTAACTCAAGATTATGAAAGGTCACACATTAACCTTTCTCAGTATTGTCAACCATCGTATGCAAGATCGGATTGTGGCTCAGAAAGAGAAGTAGGTTCATATTGTTTGAAAGAGAGGTTGAATAGGTTCTCCATGTCCAAGTGTGATGGAGAGGCTTACAGAAATAATGGAAGAGTGCAGAGAATGACCGAGGGTGTTCGCACTTATAACTTGAGGGAGGACCATATGCCAAAAAGAAAGTACTTTGATGCAGATATGAATTTACTCGATCATAGAATAGCTACATCATGTGAATATACACCCAGTAAGGTTGTGGATCTTTATGATAGTGGTGAACAGTGGATGGATGAGGAAAACAGTTGCAGATATATATCCAGGAAAGCAGGATTTGACCACAACAAATACAAGAAGCCAAATACGAATTTCAATCGTCAAATTTTGTATGCTTCTGCTGATTCACATGAAAGCTATTTAGATCATGCTAAAAAATATAAACCTGGTCCAAAATATATGAAGGGCAATAGAAGGCATGGTCCTTCAAGCTGGATCAAGTCACAAAATGTTGATCACAGAAATAGTTCTCATAGACCGGTTAAAAATTG
GAAAAAAACTGAAGAAGAGAATGATTATGCTCGTGTAAATGATGATGACTTGTCAGATGATTTGGTAATATCTACAGAATCTGAACCTCCTGAGGATTCTGAAGAGTTCAAACAATTGATTCATGAGGCCTTTTTGAAGTGCTCAAAAAAGTTGAATATGAAGACTAATGTCCGGAAAAAGTACAAGGAGCAAGGAAATGCTGGTAGTTTATATTGCATCGTATGTGGCAGAAGCCACTCAAAGGAATTTATGAATACTCAACGCCTGGTAAAGCATGCCTATATGTCCCACAAGGTGGGGTTGAGGGCTCAGCATTTGGGTCTTGCCAAAGCCATATGCGTTTTGATGGGGTGGAATAGTGTCCTTCCCCGAGACACTGTAATATGGGTTCCTGAGGTTTTGTCAAAGGAAGAAACTGTGGTTCAGAAGGAGGATCTTATCATCTGGCCTCCTGTTATTATCGTCCGCAACGTTTCTCTGTCACACACCAGTCCTGATAAGTGGAAAGTTGTAACAATTGAAGCACTCGAGTCTTTCTTGAGAAGTAAAAATCTGCTGAAGGGAAGAGTGAAAATGAGTTTGGGGTGTCCTGCAGATCAAAGTGTAATGGTGTTGAAGTTCCTGCCTACCTTTTCTGGTTTAGCAGATGCAGAAAGACTCAACAAATTCTTCTCTGAAAACAGACGTGGAAGAGAGGATTTTGAGCTGGCAAAGTGCAAAGATGGGGGTGTGGAAATGGAGGGAGACAAAATAGAAGAGGAAGTGCTTTATGGATACTTAGGAGCTGCAGAGGATTTGGATGACGTTGAACTCAATGTAAGGAAGTTGAGTATGATAAAGAGCAAAAAGGAAATATTGGAGTTGTAA

Protein sequence

MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHRIGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHRQLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMTDQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDVRSLDNERLKFRNDVVSDKPRVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDEFSTRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPERTVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANYRTGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINLSQYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLREDHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDHNKYKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRNSSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKLNMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVVTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGREDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL
Homology
BLAST of HG10004139 vs. NCBI nr
Match: XP_038884675.1 (uncharacterized protein LOC120075393 [Benincasa hispida] >XP_038884676.1 uncharacterized protein LOC120075393 [Benincasa hispida])

HSP 1 Score: 1678.7 bits (4346), Expect = 0.0e+00
Identity = 847/955 (88.69%), Positives = 882/955 (92.36%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRR +D+YVRESENME      LHAQDRLHLDHGRYGKP+RE LD++PRLRRSLS HR
Sbjct: 1   MQCRRGDDFYVRESENME------LHAQDRLHLDHGRYGKPRREALDRAPRLRRSLSAHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
           IGGSR EVGL+HRVD +ERRN DWHLRTGRNN+IGSSSHSYG ARK+PNY+EVFLHNDH 
Sbjct: 61  IGGSRGEVGLLHRVDVSERRNGDWHLRTGRNNEIGSSSHSYGQARKMPNYKEVFLHNDHG 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           QLSDLQQTHVLPEPRKFSADN+VV+YK HDVRYRHDDLRIRKE EIIEGRWSDGRGQRM 
Sbjct: 121 QLSDLQQTHVLPEPRKFSADNKVVNYK-HDVRYRHDDLRIRKEMEIIEGRWSDGRGQRMM 180

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDVRSLDNERLKFRNDVVSDKPR 240
            QKLLA EEGTAMGLYNSHLDIGP SVYKDFLP SQSLDVRS DNERLKF+N VVSDKP+
Sbjct: 181 GQKLLAMEEGTAMGLYNSHLDIGPKSVYKDFLPSSQSLDVRSHDNERLKFQNHVVSDKPQ 240

Query: 241 VTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDEFS 300
           VT+SREVEE+QRFNSRNIGYSASSGF+SRGNESS SGPLTSKCLESYRDGHYF ISDEFS
Sbjct: 241 VTDSREVEESQRFNSRNIGYSASSGFYSRGNESSLSGPLTSKCLESYRDGHYFQISDEFS 300

Query: 301 TRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPERT 360
           TRSHGDLVDPIEFN YGKRTLVDSAIDLV GKR+LTPHQRGTNSPRRE  SYFYSKPERT
Sbjct: 301 TRSHGDLVDPIEFNPYGKRTLVDSAIDLVSGKRNLTPHQRGTNSPRREHESYFYSKPERT 360

Query: 361 VNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANYRT 420
           VN SNEDP RVMQK TQTHDYVDYD TIVSD GDFSRPK  NTSMLKLQNADD C NYRT
Sbjct: 361 VNISNEDPCRVMQKTTQTHDYVDYDGTIVSDPGDFSRPKVTNTSMLKLQNADDLCVNYRT 420

Query: 421 GIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINLSQ 480
           GIALDHY LRKQ VL YPDIGP+TEAINNDNEYAG GSIHVDVGRRVTQDYERSHINLSQ
Sbjct: 421 GIALDHYWLRKQAVLDYPDIGPSTEAINNDNEYAGEGSIHVDVGRRVTQDYERSHINLSQ 480

Query: 481 YCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLRED 540
           YCQ SYARSD GSEREVGSYCLKERL+R SMSKCDG AYRN  RVQRMTEGV TYNLRE 
Sbjct: 481 YCQTSYARSDYGSEREVGSYCLKERLHRSSMSKCDGVAYRNTERVQRMTEGVHTYNLREG 540

Query: 541 HMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDHNK 600
           H+PKRKYF+ DMNLLD RIATSCE TPSKVVDLYD+GEQWMD+ENSCRYISRK  FDHNK
Sbjct: 541 HVPKRKYFEEDMNLLDDRIATSCEDTPSKVVDLYDNGEQWMDDENSCRYISRKEEFDHNK 600

Query: 601 YKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRNSS 660
           YKKPNT FNRQ LYASADSHESYLDH KKYKPGPKYMKGNRRHGPSSWIKSQNV HRNSS
Sbjct: 601 YKKPNTKFNRQSLYASADSHESYLDHVKKYKPGPKYMKGNRRHGPSSWIKSQNVGHRNSS 660

Query: 661 HRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKLNM 720
           HRPVKNWKKT EENDYA VNDDDLSDDLVISTESEPPEDSEEFKQL HEAFLKCSK LNM
Sbjct: 661 HRPVKNWKKT-EENDYACVNDDDLSDDLVISTESEPPEDSEEFKQLTHEAFLKCSKMLNM 720

Query: 721 KTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKAIC 780
           K++VRKKY EQGNAGSLYCI+C RSHSKEFMNTQRLVKHAYMSHKVGLRA HLGLAKAIC
Sbjct: 721 KSSVRKKYTEQGNAGSLYCIICRRSHSKEFMNTQRLVKHAYMSHKVGLRALHLGLAKAIC 780

Query: 781 VLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVVTI 840
           VLMGWNSV P+DTV WVPEVLSKEE VVQKEDLIIWPPVIIVRN+SLS++SPDKW+VVTI
Sbjct: 781 VLMGWNSVRPQDTVTWVPEVLSKEEIVVQKEDLIIWPPVIIVRNLSLSYSSPDKWRVVTI 840

Query: 841 EALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGRED 900
           +ALESFLRSKNLLKGRVKM+LGCPADQSVMVLKFLPTFSGL DAERL+KFFSENRRGRED
Sbjct: 841 KALESFLRSKNLLKGRVKMNLGCPADQSVMVLKFLPTFSGLTDAERLDKFFSENRRGRED 900

Query: 901 FELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
           FELAKCK GGV MEGDKIEEEVLYGYLG AEDLDDVELNVRKLSMIKSKKEILEL
Sbjct: 901 FELAKCKKGGVGMEGDKIEEEVLYGYLGTAEDLDDVELNVRKLSMIKSKKEILEL 947
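
The Identity and Positives percentages in an HSP header are the reported count divided by the alignment length (gaps included), expressed as a percentage. The following minimal sketch reproduces the figures for the Benincasa hispida hit above; the function name `hsp_percent` is illustrative and not part of any BLAST tool.

```python
def hsp_percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, rounded to 2 places."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * count / alignment_length, 2)


# Counts taken from the HSP header: 847/955 identical, 882/955 positive.
identity_pct = hsp_percent(847, 955)
positives_pct = hsp_percent(882, 955)
print(identity_pct, positives_pct)
```

The same calculation applies to the other hits listed in this section, each with its own counts and alignment length.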

BLAST of HG10004139 vs. NCBI nr
Match: XP_008456586.1 (PREDICTED: uncharacterized protein LOC103496499 [Cucumis melo] >XP_008456587.1 PREDICTED: uncharacterized protein LOC103496499 [Cucumis melo])

HSP 1 Score: 1518.1 bits (3929), Expect = 0.0e+00
Identity = 779/957 (81.40%), Positives = 841/957 (87.88%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRREDYYVRE EN E      LH QDRLHLDHGRYG  +RETLD+SPRLRRSLSPHR
Sbjct: 1   MQCRRREDYYVREPENTE------LHVQDRLHLDHGRYGMARRETLDRSPRLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
            G SR EVGLV RVD TE R+ +WHLRTGRNNDIG SSHS+G +RK+PNYEEVFLHNDHR
Sbjct: 61  FGVSRREVGLVDRVDNTESRDGNWHLRTGRNNDIGLSSHSFGQSRKVPNYEEVFLHNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           Q SDLQQ  V P+PR+FS DNEV+DYK HDV YR  DLRIRKEREIIEGRWSDGRGQRMT
Sbjct: 121 QHSDLQQ--VSPDPRRFSDDNEVIDYK-HDVGYRLGDLRIRKEREIIEGRWSDGRGQRMT 180

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDV--RSLDNERLKFRNDVVSDK 240
           DQ+LLA EEG  +G YNSH  IGPT+VYKDF P S SLDV  R LDNERLKFRN VVSD+
Sbjct: 181 DQRLLAIEEGNGLGSYNSHPGIGPTAVYKDFFPSSLSLDVEMRGLDNERLKFRNHVVSDR 240

Query: 241 PRVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDE 300
           P++T+S+E +E Q+FNSRNIGYSASSGF+SRGNESS SGPL S+CLESYRDGHYF ISDE
Sbjct: 241 PQITDSQEAQEGQKFNSRNIGYSASSGFYSRGNESSLSGPLASQCLESYRDGHYFQISDE 300

Query: 301 FSTRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPE 360
           FSTR+HGD+VDPIEFNSYGKRTLVDSAIDL GGKR+LTPHQRGTNSPRRE  SYFYSKPE
Sbjct: 301 FSTRTHGDIVDPIEFNSYGKRTLVDSAIDLQGGKRNLTPHQRGTNSPRREHGSYFYSKPE 360

Query: 361 RTVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANY 420
           RTVNNSNEDPSRV+QKITQT  YVDY ST+VSDHGDFSR K ANTSMLK+Q ADD  ANY
Sbjct: 361 RTVNNSNEDPSRVVQKITQTRGYVDYASTVVSDHGDFSRTKVANTSMLKIQKADDSYANY 420

Query: 421 RTGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINL 480
           R GIALD YRLRKQT L YPDIGP+TE IN+DNEYAGAGSI+ DVG RVTQDYERS+IN 
Sbjct: 421 RAGIALDQYRLRKQTALDYPDIGPSTEEINDDNEYAGAGSIYSDVG-RVTQDYERSNINH 480

Query: 481 SQYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLR 540
           SQY Q SYA SD G EREVGSY LKERL R +MSKCD EAYR+  RVQRMTEGVRTYNLR
Sbjct: 481 SQYGQTSYAISDYGPEREVGSYYLKERLRRSNMSKCDREAYRSTERVQRMTEGVRTYNLR 540

Query: 541 EDHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDH 600
           EDHMPKR +F+ DMNLLDHRIATS E  P+K+VDLYDS EQW D+ NS RYISRKAGFD 
Sbjct: 541 EDHMPKRNFFEEDMNLLDHRIATSRENAPNKLVDLYDSDEQWRDDGNSRRYISRKAGFDR 600

Query: 601 NKYKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRN 660
           NKYKKPNT +N + +   ADSHESY DHA+KYK G KYMKGN+++GPSSWIKSQNVDHRN
Sbjct: 601 NKYKKPNTKYNCRNI---ADSHESYSDHAQKYKFGSKYMKGNKKYGPSSWIKSQNVDHRN 660

Query: 661 SSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKL 720
           S H+P KNWKKT EENDYARVNDDDLSDDL+I+TESEPPEDSEEFKQL+HEAFLKCSK L
Sbjct: 661 SLHKPFKNWKKT-EENDYARVNDDDLSDDLIITTESEPPEDSEEFKQLVHEAFLKCSKML 720

Query: 721 NMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKA 780
           NM  +VRKKYKEQGNAGSLYC+VCGRS SKEFMN+QRLVKHAYMSHKVGL+AQHLGL KA
Sbjct: 721 NMNPSVRKKYKEQGNAGSLYCVVCGRSDSKEFMNSQRLVKHAYMSHKVGLKAQHLGLGKA 780

Query: 781 ICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVV 840
           ICVLMGWNSV P+DTV WVPEVLSKEE V+QKEDLIIWPPVIIVRNVSLSH SPDKW+VV
Sbjct: 781 ICVLMGWNSVFPQDTVTWVPEVLSKEEAVLQKEDLIIWPPVIIVRNVSLSHNSPDKWRVV 840

Query: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGR 900
           TIEALESFLRSKNLLKGRVKMSLGCPADQSVM LKFLPTFSGL DAERLNKFFSENRRGR
Sbjct: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMALKFLPTFSGLTDAERLNKFFSENRRGR 900

Query: 901 EDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
           EDFE+AKC +G V+MEG+KIEEEVLYGYLG AEDL DVELNVRK  MIKSKKEILE+
Sbjct: 901 EDFEVAKCNNGEVKMEGNKIEEEVLYGYLGTAEDLVDVELNVRKF-MIKSKKEILEM 942

BLAST of HG10004139 vs. NCBI nr
Match: TYK11753.1 (uncharacterized protein E5676_scaffold304G00720 [Cucumis melo var. makuwa])

HSP 1 Score: 1518.1 bits (3929), Expect = 0.0e+00
Identity = 779/957 (81.40%), Positives = 841/957 (87.88%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRREDYYVRE EN E      LH QDRLHLDHGRYG  +RETLD+SPRLRRSLSPHR
Sbjct: 38  MQCRRREDYYVREPENTE------LHVQDRLHLDHGRYGMARRETLDRSPRLRRSLSPHR 97

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
            G SR EVGLV RVD TE R+ +WHLRTGRNNDIG SSHS+G +RK+PNYEEVFLHNDHR
Sbjct: 98  FGVSRREVGLVDRVDNTESRDGNWHLRTGRNNDIGLSSHSFGQSRKVPNYEEVFLHNDHR 157

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           Q SDLQQ  V P+PR+FS DNEV+DYK HDV YR  DLRIRKEREIIEGRWSDGRGQRMT
Sbjct: 158 QHSDLQQ--VSPDPRRFSDDNEVIDYK-HDVGYRLGDLRIRKEREIIEGRWSDGRGQRMT 217

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDV--RSLDNERLKFRNDVVSDK 240
           DQ+LLA EEG  +G YNSH  IGPT+VYKDF P S SLDV  R LDNERLKFRN VVSD+
Sbjct: 218 DQRLLAIEEGNGLGSYNSHPGIGPTAVYKDFFPSSLSLDVEMRGLDNERLKFRNHVVSDR 277

Query: 241 PRVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDE 300
           P++T+S+E +E Q+FNSRNIGYSASSGF+SRGNESS SGPL S+CLESYRDGHYF ISDE
Sbjct: 278 PQITDSQEAQEGQKFNSRNIGYSASSGFYSRGNESSLSGPLASQCLESYRDGHYFQISDE 337

Query: 301 FSTRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPE 360
           FSTR+HGD+VDPIEFNSYGKRTLVDSAIDL GGKR+LTPHQRGTNSPRRE  SYFYSKPE
Sbjct: 338 FSTRTHGDIVDPIEFNSYGKRTLVDSAIDLQGGKRNLTPHQRGTNSPRREHGSYFYSKPE 397

Query: 361 RTVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANY 420
           RTVNNSNEDPSRV+QKITQT  YVDY ST+VSDHGDFSR K ANTSMLK+Q ADD  ANY
Sbjct: 398 RTVNNSNEDPSRVVQKITQTRGYVDYASTVVSDHGDFSRTKVANTSMLKIQKADDSYANY 457

Query: 421 RTGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINL 480
           R GIALD YRLRKQT L YPDIGP+TE IN+DNEYAGAGSI+ DVG RVTQDYERS+IN 
Sbjct: 458 RAGIALDQYRLRKQTALDYPDIGPSTEEINDDNEYAGAGSIYSDVG-RVTQDYERSNINH 517

Query: 481 SQYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLR 540
           SQY Q SYA SD G EREVGSY LKERL R +MSKCD EAYR+  RVQRMTEGVRTYNLR
Sbjct: 518 SQYGQTSYAISDYGPEREVGSYYLKERLRRSNMSKCDREAYRSTERVQRMTEGVRTYNLR 577

Query: 541 EDHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDH 600
           EDHMPKR +F+ DMNLLDHRIATS E  P+K+VDLYDS EQW D+ NS RYISRKAGFD 
Sbjct: 578 EDHMPKRNFFEEDMNLLDHRIATSRENAPNKLVDLYDSDEQWRDDGNSRRYISRKAGFDR 637

Query: 601 NKYKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRN 660
           NKYKKPNT +N + +   ADSHESY DHA+KYK G KYMKGN+++GPSSWIKSQNVDHRN
Sbjct: 638 NKYKKPNTKYNCRNI---ADSHESYSDHAQKYKFGSKYMKGNKKYGPSSWIKSQNVDHRN 697

Query: 661 SSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKL 720
           S H+P KNWKKT EENDYARVNDDDLSDDL+I+TESEPPEDSEEFKQL+HEAFLKCSK L
Sbjct: 698 SLHKPFKNWKKT-EENDYARVNDDDLSDDLIITTESEPPEDSEEFKQLVHEAFLKCSKML 757

Query: 721 NMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKA 780
           NM  +VRKKYKEQGNAGSLYC+VCGRS SKEFMN+QRLVKHAYMSHKVGL+AQHLGL KA
Sbjct: 758 NMNPSVRKKYKEQGNAGSLYCVVCGRSDSKEFMNSQRLVKHAYMSHKVGLKAQHLGLGKA 817

Query: 781 ICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVV 840
           ICVLMGWNSV P+DTV WVPEVLSKEE V+QKEDLIIWPPVIIVRNVSLSH SPDKW+VV
Sbjct: 818 ICVLMGWNSVFPQDTVTWVPEVLSKEEAVLQKEDLIIWPPVIIVRNVSLSHNSPDKWRVV 877

Query: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGR 900
           TIEALESFLRSKNLLKGRVKMSLGCPADQSVM LKFLPTFSGL DAERLNKFFSENRRGR
Sbjct: 878 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMALKFLPTFSGLTDAERLNKFFSENRRGR 937

Query: 901 EDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
           EDFE+AKC +G V+MEG+KIEEEVLYGYLG AEDL DVELNVRK  MIKSKKEILE+
Sbjct: 938 EDFEVAKCNNGEVKMEGNKIEEEVLYGYLGTAEDLVDVELNVRKF-MIKSKKEILEM 979

BLAST of HG10004139 vs. NCBI nr
Match: XP_011656567.1 (uncharacterized protein LOC101208223 [Cucumis sativus] >XP_031743187.1 uncharacterized protein LOC101208223 [Cucumis sativus] >KGN46030.2 hypothetical protein Csa_004949 [Cucumis sativus])

HSP 1 Score: 1510.7 bits (3910), Expect = 0.0e+00
Identity = 780/958 (81.42%), Positives = 842/958 (87.89%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRR EDYYVRE ENME      LH QDRLHLDHGRYG P+RETLD+SPRLRRSLSPHR
Sbjct: 1   MQCRRHEDYYVREPENME------LHVQDRLHLDHGRYGMPRRETLDRSPRLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
            GGSR EVGLVHRVD TERR  DWHLRTGRNNDI  SSHSYG +RK+ NYEE FLHNDHR
Sbjct: 61  FGGSRREVGLVHRVDNTERRGGDWHLRTGRNNDIELSSHSYGQSRKVLNYEEGFLHNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADN-EVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRM 180
           Q SDLQQ  V PEPR+FSADN EVVDYK HDVRYRH DLRIRKEREIIEGRWSDGRGQR+
Sbjct: 121 QHSDLQQ--VSPEPRRFSADNDEVVDYK-HDVRYRHGDLRIRKEREIIEGRWSDGRGQRL 180

Query: 181 TDQKLLATEEGTAMGLYNSHLDIGPTSVYKDFL--PFSQSLDVRSLDNERLKFRNDVVSD 240
           TDQKLLA EEG  MG YNSH  IG T+V+KDF   P S ++D+RSLDNERL+FRN  VSD
Sbjct: 181 TDQKLLAIEEGNGMGSYNSHPGIGSTAVHKDFFPSPLSLAVDMRSLDNERLQFRNHGVSD 240

Query: 241 KPRVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISD 300
           KP+VT+S+E +E QRFNSRNIGY+ASSGF SRGNESSSSGPLTS+CLESYRDGHYF ISD
Sbjct: 241 KPQVTDSQEAQEGQRFNSRNIGYAASSGFCSRGNESSSSGPLTSQCLESYRDGHYFQISD 300

Query: 301 EFSTRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKP 360
           EFSTR+HGD+VDP+EFNSYGKRTLVD+AIDL GGKR+LT HQRG NSPR E  SYFYSKP
Sbjct: 301 EFSTRNHGDIVDPVEFNSYGKRTLVDTAIDLQGGKRNLT-HQRGKNSPRGEHGSYFYSKP 360

Query: 361 ERTVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCAN 420
           ERTVNNSNEDPSRV+QKITQT  YVDY ST+VSDHGDFSR K ANTSML+LQ ADD  AN
Sbjct: 361 ERTVNNSNEDPSRVVQKITQTRGYVDYASTVVSDHGDFSRTKVANTSMLRLQKADDSYAN 420

Query: 421 YRTGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHIN 480
           YRTGIALDHYRLRKQT L YPDIGP+TE IN+DNEYAGAGSI+ DVG RVTQDYERSHIN
Sbjct: 421 YRTGIALDHYRLRKQTALDYPDIGPSTEEINDDNEYAGAGSIYPDVG-RVTQDYERSHIN 480

Query: 481 LSQYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNL 540
            SQY Q SYA +D G EREVGSY LKERL+R +MSKCDGE YR+  RVQRMT+GVRTYNL
Sbjct: 481 HSQYGQTSYAITDHGPEREVGSYYLKERLHRSNMSKCDGEVYRSTERVQRMTKGVRTYNL 540

Query: 541 REDHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFD 600
           REDHM KRKYF+ DMNLLDHRIATS E  PS++VDLYDSGEQW D+ N  RYIS+KAGFD
Sbjct: 541 REDHMQKRKYFEEDMNLLDHRIATSRENAPSRLVDLYDSGEQWRDDGNDRRYISKKAGFD 600

Query: 601 HNKYKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHR 660
           HNKYKKPNT +NR   +  ADSHESY DHA+KYK G K MKGN+++GPSSWIKSQNVDHR
Sbjct: 601 HNKYKKPNTKYNR---HNFADSHESYSDHAQKYKSGSKNMKGNKKYGPSSWIKSQNVDHR 660

Query: 661 NSSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKK 720
           NS H+P K+WKKT E NDY RVNDD LSDDLVI+TESEPPEDSEEFKQL+HEAFLKCSK 
Sbjct: 661 NSLHKPFKSWKKT-EGNDYTRVNDDGLSDDLVITTESEPPEDSEEFKQLVHEAFLKCSKM 720

Query: 721 LNMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAK 780
           LNM  +VRKKYKEQGNAGSLYCI+CGRS SKEFMN+QRLVKHAYMSHKVGL+AQHLGLAK
Sbjct: 721 LNMNPSVRKKYKEQGNAGSLYCIICGRSDSKEFMNSQRLVKHAYMSHKVGLKAQHLGLAK 780

Query: 781 AICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKV 840
           AICVLMGWNSV P+DTV WVPEVLSKEE VVQKEDLIIWPPVII+RN+SLSH SPDKW+V
Sbjct: 781 AICVLMGWNSVHPQDTVTWVPEVLSKEEAVVQKEDLIIWPPVIIIRNISLSHNSPDKWRV 840

Query: 841 VTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRG 900
           VTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGL DAERL+KFFSENRRG
Sbjct: 841 VTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLTDAERLHKFFSENRRG 900

Query: 901 REDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
           REDFE+AKC  G V+MEG+KIEEEVLYGYLG AEDL DVELNVRK  MIKSKKEILE+
Sbjct: 901 REDFEVAKCNYGEVKMEGNKIEEEVLYGYLGTAEDLVDVELNVRKF-MIKSKKEILEM 942

BLAST of HG10004139 vs. NCBI nr
Match: XP_022133809.1 (uncharacterized protein LOC111006280 [Momordica charantia])

HSP 1 Score: 1415.6 bits (3663), Expect = 0.0e+00
Identity = 716/957 (74.82%), Positives = 812/957 (84.85%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRR+DYYVRESE+M      +LHAQDRLHLDH RYGK +RE LD+SPRLRRSLSPHR
Sbjct: 1   MQCRRRDDYYVRESESM------KLHAQDRLHLDHDRYGKTRREALDRSPRLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
           +G SR EVGL  RVDT ERR+EDWHLRTGRNN++ S SHSYG ARK PN+EE++  NDHR
Sbjct: 61  VGASRREVGLGQRVDTIERRDEDWHLRTGRNNNVDSRSHSYGQARKKPNFEELYHQNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           QLSDLQQT V+PEPRKF A +EV+DY +HD+RYRHDDLRIRK++E IEGRWS G GQRMT
Sbjct: 121 QLSDLQQTRVVPEPRKFHAGDEVLDY-EHDLRYRHDDLRIRKDKETIEGRWSVGSGQRMT 180

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDVRSLDNERLKFRNDVVSDKPR 240
           DQKLLA EE TAMG Y+S L++G TS+YKDFLP SQSLDVRSLD+ERLKFR+ VVSDK +
Sbjct: 181 DQKLLAMEESTAMGSYSSSLNMGSTSIYKDFLPSSQSLDVRSLDDERLKFRSHVVSDKSQ 240

Query: 241 VTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDEFS 300
           VT S EVEE++RF+SRNIGY ASSGF+S+  E SSSGP TSK LESY+DG YF +SD+F 
Sbjct: 241 VTESHEVEESRRFSSRNIGYLASSGFYSKEYERSSSGPFTSKSLESYQDGQYFEVSDDFP 300

Query: 301 TRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPERT 360
           TRSHGDL+D ++F SYGKRTLVDSAIDLVGG+R+ TPHQ+ TNSP RE  SYFYSKPE T
Sbjct: 301 TRSHGDLMDRLDFKSYGKRTLVDSAIDLVGGERNFTPHQQSTNSPMREHMSYFYSKPEGT 360

Query: 361 VNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANYRT 420
           VN+SNEDPSRVMQKI QTHDY+DY   IVSD GDFSRPK AN+S LKLQN ++  AN+ T
Sbjct: 361 VNDSNEDPSRVMQKINQTHDYIDYGRAIVSDLGDFSRPKVANSSSLKLQNPENLFANHST 420

Query: 421 GIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINLSQ 480
           GIAL+ Y LR+Q VL YPDIG T++ IN+D EYA  GSIHV+VGRRVTQDYE S IN S+
Sbjct: 421 GIALNRYSLREQRVLDYPDIGLTSKTINHDCEYASTGSIHVEVGRRVTQDYEVSDINPSE 480

Query: 481 YCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLRED 540
           Y +  + RSD GSEREVGS+ LKERL+R SMSKCDGE YRN+ RVQRMTEGV  Y LR D
Sbjct: 481 YSKKLHERSDYGSEREVGSHYLKERLHRSSMSKCDGETYRNSERVQRMTEGVSAYKLR-D 540

Query: 541 HMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDHNK 600
            MPKR YF+ DMNLLDHRI+  CEYTP KVVD+YDSGE WMD++ S RY SRKAGFDH K
Sbjct: 541 QMPKRNYFEEDMNLLDHRISMPCEYTPDKVVDMYDSGEAWMDDDTSHRYTSRKAGFDHGK 600

Query: 601 YKKPNTNFNRQILYASADSH--ESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRN 660
           Y+K N  ++R   +AS DS   E YLDHA+K+K GPKYMKGNRRHGPSSWIKSQNVD RN
Sbjct: 601 YRKSNKKYDRHNFHASDDSFSCERYLDHAQKFKNGPKYMKGNRRHGPSSWIKSQNVDLRN 660

Query: 661 SSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKL 720
           S HRP+K WK TEE+NDY  VNDD LSDD +  TESEPPEDSEEFKQ++HEAFLKCSKKL
Sbjct: 661 SLHRPLKIWKNTEEDNDYVHVNDDGLSDDFIKPTESEPPEDSEEFKQMVHEAFLKCSKKL 720

Query: 721 NMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKA 780
           NMK  VRKKYKEQGNAGSLYCIVCG S SKEF++T+RLVKHAYMSH+ GLRAQHLGLAKA
Sbjct: 721 NMKPTVRKKYKEQGNAGSLYCIVCGISSSKEFLDTKRLVKHAYMSHRTGLRAQHLGLAKA 780

Query: 781 ICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVV 840
           ICVLMGWNS +P+DTV WVPEVL KEE VVQKEDLIIWPPVII+RN+SLSH++PD+W+VV
Sbjct: 781 ICVLMGWNSAMPQDTVTWVPEVLPKEEAVVQKEDLIIWPPVIIIRNISLSHSNPDRWRVV 840

Query: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGR 900
           TIEALE+FLRSKNLLKGRVK++LG PADQSVMVLKFL  FSGL DAERL+KFFSE R GR
Sbjct: 841 TIEALETFLRSKNLLKGRVKITLGSPADQSVMVLKFLAMFSGLTDAERLHKFFSERRHGR 900

Query: 901 EDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
            +FE+AKC++GG EMEGDK EE +LYGYLG +EDLDDVE NVRKLS IKSKKEILEL
Sbjct: 901 VNFEVAKCRNGGAEMEGDKTEERMLYGYLGISEDLDDVEFNVRKLSTIKSKKEILEL 949

BLAST of HG10004139 vs. ExPASy Swiss-Prot
Match: A5YVF1 (Protein SUPPRESSOR OF GENE SILENCING 3 OS=Solanum lycopersicum OX=4081 GN=SGS3 PE=1 SV=1)

HSP 1 Score: 57.0 bits (136), Expect = 1.4e-06
Identity = 76/287 (26.48%), Positives = 122/287 (42.51%), Query Frame = 0

Query: 671 EEENDYARVNDDDLSDDLVISTESEPPEDSEE----FKQLIHEAFLKCSKKLNMKTNVRK 730
           + E D+   +DDDL  D   S   E   ++ +    F QL H         L+  T    
Sbjct: 173 DNELDFLDESDDDLHSDDFDSDVGEMSYETRKKNPWFNQLFH--------SLDSLTVTEI 232

Query: 731 KYKEQGNAGSLYCIVC--GRSHSKEFMNTQRLVKHAYMSHKVGLRAQ-HLGLAKAICVLM 790
              E+      +C  C  G    + F   Q L+ HA      GLR + H  LA+ +    
Sbjct: 233 NEPER----QWHCPACKGGPGAIEWFTGLQSLMTHAKTK---GLRVKIHRELAELL---- 292

Query: 791 GWNSVLPRDTVIWVP-EVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVVTIEA 850
               +  R T +  P EV  +   +  K+  I+WPP++I+ N  L     DKW  +  + 
Sbjct: 293 -EEDLRQRGTSVVPPGEVYGRWGGMEFKDKEIVWPPMVIIMNTRLDKDENDKWIGMGNQE 352

Query: 851 LESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGREDFE 910
           L  +  S   +K R   S G    + + +L F  +  G  +A+RL++ FSEN R R+ +E
Sbjct: 353 LLEYFSSYAAVKAR--HSYGPQGHRGMSLLIFEASAVGYIEADRLSEHFSENGRNRDAWE 412

Query: 911 --LAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIK 948
              A+   GG         + +LYGY+   +D+D+   +    S +K
Sbjct: 413 RRSARFYPGG---------KRLLYGYMADKKDIDNFNQHSAGKSKLK 428
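
The percentages in each HSP summary line are the match counts divided by the alignment length. The following is a minimal sketch, not part of the original BLAST report, of how such a line could be parsed and its percentages rechecked. The function names `parse_hsp_summary` and `percent` are hypothetical, and the pattern assumes the exact line layout shown above.

```python
import re

# Pattern for the HSP summary line as it is laid out on this page.
# "Pos\w*ives" tolerates both "Positives" and the "Postives" spelling
# that some report generators emit.
HSP_SUMMARY = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w*ives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def percent(count, length):
    """Return count/length as a percentage rounded to two decimals."""
    return round(100.0 * count / length, 2)


def parse_hsp_summary(line):
    """Extract identity and positive counts from one HSP summary line."""
    match = HSP_SUMMARY.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, length, ident_pct, pos, pos_length, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "positives_length": int(pos_length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 76/287 (26.48%), Positives = 122/287 (42.51%), Query Frame = 0"
    hsp = parse_hsp_summary(line)
    # Recompute the percentages from the raw counts and compare.
    print(percent(hsp["identities"], hsp["length"]), hsp["identity_pct"])
    print(percent(hsp["positives"], hsp["length"]), hsp["positives_pct"])
```

Recomputing the values this way is a quick consistency check when copying numbers from the alignment blocks into the summary tables.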

BLAST of HG10004139 vs. ExPASy Swiss-Prot
Match: A1Y2B7 (Protein SUPPRESSOR OF GENE SILENCING 3 homolog OS=Zea mays OX=4577 GN=SGS3 PE=1 SV=1)

HSP 1 Score: 53.1 bits (126), Expect = 2.0e-05
Identity = 82/303 (27.06%), Positives = 128/303 (42.24%), Query Frame = 0

Query: 652 QNVDHRNSSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAF 711
           +NVD  N+S          ++++D    +DDDLSDD+    +S+  E S E ++  ++ F
Sbjct: 128 ENVDGNNTS----------DDDDD----DDDDLSDDISDDYDSDASEKSFETRK-TNKWF 187

Query: 712 LKCSKKLNMKTNVRKKYKEQGNAGSLYCIVC--GRSHSKEFMNTQRLVKHAYM--SHKVG 771
            +  + LN  T   ++  EQ      +C  C  G      +   Q LV HA    S +V 
Sbjct: 188 KEFFEVLN--TLSLEQINEQ--TRQWHCPACKNGPGAIDWYKGLQPLVSHARTKGSTRVK 247

Query: 772 LRAQHLGLAKAICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDL-IIWPPVIIVRNVS 831
           L  +   L +      G  SVLP        E   K + + +  D  I+WPP++IV N  
Sbjct: 248 LHRELAALLEEELSRRG-TSVLP------AGEQFGKWKGLQESTDREIVWPPMVIVMNTF 307

Query: 832 LSHTSPDKWKVVTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAER 891
           L     DKWK +  + L  +       K R   + G    + + VL F  +  G  +AER
Sbjct: 308 LEKDEDDKWKGMGNQELLDYFGEYEASKAR--HAYGPSGHRGMSVLIFESSAVGYMEAER 367

Query: 892 LNKFFSENRRGREDFELAKCK--DGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLS 948
           L+K F      R  + L K +   GG         +  LYG+L   ED++    +    S
Sbjct: 368 LHKHFVNQGTDRNSWHLRKVRFVPGG---------KRQLYGFLANKEDMEAFNKHCHGKS 393

BLAST of HG10004139 vs. ExPASy Swiss-Prot
Match: Q2QWE9 (Protein SUPPRESSOR OF GENE SILENCING 3 homolog OS=Oryza sativa subsp. japonica OX=39947 GN=SGS3 PE=3 SV=1)

HSP 1 Score: 48.1 bits (113), Expect = 6.3e-04
Identity = 70/277 (25.27%), Positives = 114/277 (41.16%), Query Frame = 0

Query: 680 NDDDLSDDLVISTESEPPEDSEEFKQ--LIHEAFLKCSKKLNMKTNVRKKYKEQGN--AG 739
           NDDD+SDDL    +S+  E S E ++   + + F +  + L++         EQ N    
Sbjct: 153 NDDDMSDDLSDDYDSDASEKSFETRKNHKLFKGFFEVLEALSV---------EQLNEPTR 212

Query: 740 SLYCIVC--GRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKAICVLMGWNSVLPRDT 799
             +C  C  G      +   Q L+ HA     + ++ +H  LA  +         L R  
Sbjct: 213 QWHCPACKNGPGAIDWYKGLQPLMTHAKTKGSIKVK-RHRELASLL------EEELSRRG 272

Query: 800 VIWVP--EVLSKEETVVQKEDL-IIWPPVIIVRNVSLSHTSPDKWKVVTIEALESFLRSK 859
              VP  E   K + + +  D  I+WPP+++V N  L     DKWK +  + L  +    
Sbjct: 273 TSVVPSGEQFRKWKGLRESTDREIVWPPMVVVMNTVLEQDEDDKWKGMGNQELIDYFSEY 332

Query: 860 NLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGREDFELAKCKDGG 919
              K R   + G    + + VL F  +  G  +AERL+  F   R  R  +  A      
Sbjct: 333 AASKAR--HAYGPNGHRGMSVLIFDSSAVGYMEAERLHDHFVRQRTDRNTWNSA---HKV 392

Query: 920 VEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIK 948
             + G K +   LYG+L   +D++    +    S +K
Sbjct: 393 TFLPGGKRQ---LYGFLATKDDMETFNRHCHGKSRLK 405

BLAST of HG10004139 vs. ExPASy TrEMBL
Match: A0A1S3C369 (uncharacterized protein LOC103496499 OS=Cucumis melo OX=3656 GN=LOC103496499 PE=4 SV=1)

HSP 1 Score: 1518.1 bits (3929), Expect = 0.0e+00
Identity = 779/957 (81.40%), Positives = 841/957 (87.88%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRREDYYVRE EN E      LH QDRLHLDHGRYG  +RETLD+SPRLRRSLSPHR
Sbjct: 1   MQCRRREDYYVREPENTE------LHVQDRLHLDHGRYGMARRETLDRSPRLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
            G SR EVGLV RVD TE R+ +WHLRTGRNNDIG SSHS+G +RK+PNYEEVFLHNDHR
Sbjct: 61  FGVSRREVGLVDRVDNTESRDGNWHLRTGRNNDIGLSSHSFGQSRKVPNYEEVFLHNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           Q SDLQQ  V P+PR+FS DNEV+DYK HDV YR  DLRIRKEREIIEGRWSDGRGQRMT
Sbjct: 121 QHSDLQQ--VSPDPRRFSDDNEVIDYK-HDVGYRLGDLRIRKEREIIEGRWSDGRGQRMT 180

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDV--RSLDNERLKFRNDVVSDK 240
           DQ+LLA EEG  +G YNSH  IGPT+VYKDF P S SLDV  R LDNERLKFRN VVSD+
Sbjct: 181 DQRLLAIEEGNGLGSYNSHPGIGPTAVYKDFFPSSLSLDVEMRGLDNERLKFRNHVVSDR 240

Query: 241 PRVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDE 300
           P++T+S+E +E Q+FNSRNIGYSASSGF+SRGNESS SGPL S+CLESYRDGHYF ISDE
Sbjct: 241 PQITDSQEAQEGQKFNSRNIGYSASSGFYSRGNESSLSGPLASQCLESYRDGHYFQISDE 300

Query: 301 FSTRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPE 360
           FSTR+HGD+VDPIEFNSYGKRTLVDSAIDL GGKR+LTPHQRGTNSPRRE  SYFYSKPE
Sbjct: 301 FSTRTHGDIVDPIEFNSYGKRTLVDSAIDLQGGKRNLTPHQRGTNSPRREHGSYFYSKPE 360

Query: 361 RTVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANY 420
           RTVNNSNEDPSRV+QKITQT  YVDY ST+VSDHGDFSR K ANTSMLK+Q ADD  ANY
Sbjct: 361 RTVNNSNEDPSRVVQKITQTRGYVDYASTVVSDHGDFSRTKVANTSMLKIQKADDSYANY 420

Query: 421 RTGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINL 480
           R GIALD YRLRKQT L YPDIGP+TE IN+DNEYAGAGSI+ DVG RVTQDYERS+IN 
Sbjct: 421 RAGIALDQYRLRKQTALDYPDIGPSTEEINDDNEYAGAGSIYSDVG-RVTQDYERSNINH 480

Query: 481 SQYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLR 540
           SQY Q SYA SD G EREVGSY LKERL R +MSKCD EAYR+  RVQRMTEGVRTYNLR
Sbjct: 481 SQYGQTSYAISDYGPEREVGSYYLKERLRRSNMSKCDREAYRSTERVQRMTEGVRTYNLR 540

Query: 541 EDHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDH 600
           EDHMPKR +F+ DMNLLDHRIATS E  P+K+VDLYDS EQW D+ NS RYISRKAGFD 
Sbjct: 541 EDHMPKRNFFEEDMNLLDHRIATSRENAPNKLVDLYDSDEQWRDDGNSRRYISRKAGFDR 600

Query: 601 NKYKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRN 660
           NKYKKPNT +N + +   ADSHESY DHA+KYK G KYMKGN+++GPSSWIKSQNVDHRN
Sbjct: 601 NKYKKPNTKYNCRNI---ADSHESYSDHAQKYKFGSKYMKGNKKYGPSSWIKSQNVDHRN 660

Query: 661 SSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKL 720
           S H+P KNWKKT EENDYARVNDDDLSDDL+I+TESEPPEDSEEFKQL+HEAFLKCSK L
Sbjct: 661 SLHKPFKNWKKT-EENDYARVNDDDLSDDLIITTESEPPEDSEEFKQLVHEAFLKCSKML 720

Query: 721 NMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKA 780
           NM  +VRKKYKEQGNAGSLYC+VCGRS SKEFMN+QRLVKHAYMSHKVGL+AQHLGL KA
Sbjct: 721 NMNPSVRKKYKEQGNAGSLYCVVCGRSDSKEFMNSQRLVKHAYMSHKVGLKAQHLGLGKA 780

Query: 781 ICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVV 840
           ICVLMGWNSV P+DTV WVPEVLSKEE V+QKEDLIIWPPVIIVRNVSLSH SPDKW+VV
Sbjct: 781 ICVLMGWNSVFPQDTVTWVPEVLSKEEAVLQKEDLIIWPPVIIVRNVSLSHNSPDKWRVV 840

Query: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGR 900
           TIEALESFLRSKNLLKGRVKMSLGCPADQSVM LKFLPTFSGL DAERLNKFFSENRRGR
Sbjct: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMALKFLPTFSGLTDAERLNKFFSENRRGR 900

Query: 901 EDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
           EDFE+AKC +G V+MEG+KIEEEVLYGYLG AEDL DVELNVRK  MIKSKKEILE+
Sbjct: 901 EDFEVAKCNNGEVKMEGNKIEEEVLYGYLGTAEDLVDVELNVRKF-MIKSKKEILEM 942

BLAST of HG10004139 vs. ExPASy TrEMBL
Match: A0A5D3CIK8 (XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold304G00720 PE=4 SV=1)

HSP 1 Score: 1518.1 bits (3929), Expect = 0.0e+00
Identity = 779/957 (81.40%), Positives = 841/957 (87.88%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRREDYYVRE EN E      LH QDRLHLDHGRYG  +RETLD+SPRLRRSLSPHR
Sbjct: 38  MQCRRREDYYVREPENTE------LHVQDRLHLDHGRYGMARRETLDRSPRLRRSLSPHR 97

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
            G SR EVGLV RVD TE R+ +WHLRTGRNNDIG SSHS+G +RK+PNYEEVFLHNDHR
Sbjct: 98  FGVSRREVGLVDRVDNTESRDGNWHLRTGRNNDIGLSSHSFGQSRKVPNYEEVFLHNDHR 157

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           Q SDLQQ  V P+PR+FS DNEV+DYK HDV YR  DLRIRKEREIIEGRWSDGRGQRMT
Sbjct: 158 QHSDLQQ--VSPDPRRFSDDNEVIDYK-HDVGYRLGDLRIRKEREIIEGRWSDGRGQRMT 217

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDV--RSLDNERLKFRNDVVSDK 240
           DQ+LLA EEG  +G YNSH  IGPT+VYKDF P S SLDV  R LDNERLKFRN VVSD+
Sbjct: 218 DQRLLAIEEGNGLGSYNSHPGIGPTAVYKDFFPSSLSLDVEMRGLDNERLKFRNHVVSDR 277

Query: 241 PRVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDE 300
           P++T+S+E +E Q+FNSRNIGYSASSGF+SRGNESS SGPL S+CLESYRDGHYF ISDE
Sbjct: 278 PQITDSQEAQEGQKFNSRNIGYSASSGFYSRGNESSLSGPLASQCLESYRDGHYFQISDE 337

Query: 301 FSTRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPE 360
           FSTR+HGD+VDPIEFNSYGKRTLVDSAIDL GGKR+LTPHQRGTNSPRRE  SYFYSKPE
Sbjct: 338 FSTRTHGDIVDPIEFNSYGKRTLVDSAIDLQGGKRNLTPHQRGTNSPRREHGSYFYSKPE 397

Query: 361 RTVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANY 420
           RTVNNSNEDPSRV+QKITQT  YVDY ST+VSDHGDFSR K ANTSMLK+Q ADD  ANY
Sbjct: 398 RTVNNSNEDPSRVVQKITQTRGYVDYASTVVSDHGDFSRTKVANTSMLKIQKADDSYANY 457

Query: 421 RTGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINL 480
           R GIALD YRLRKQT L YPDIGP+TE IN+DNEYAGAGSI+ DVG RVTQDYERS+IN 
Sbjct: 458 RAGIALDQYRLRKQTALDYPDIGPSTEEINDDNEYAGAGSIYSDVG-RVTQDYERSNINH 517

Query: 481 SQYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLR 540
           SQY Q SYA SD G EREVGSY LKERL R +MSKCD EAYR+  RVQRMTEGVRTYNLR
Sbjct: 518 SQYGQTSYAISDYGPEREVGSYYLKERLRRSNMSKCDREAYRSTERVQRMTEGVRTYNLR 577

Query: 541 EDHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDH 600
           EDHMPKR +F+ DMNLLDHRIATS E  P+K+VDLYDS EQW D+ NS RYISRKAGFD 
Sbjct: 578 EDHMPKRNFFEEDMNLLDHRIATSRENAPNKLVDLYDSDEQWRDDGNSRRYISRKAGFDR 637

Query: 601 NKYKKPNTNFNRQILYASADSHESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRN 660
           NKYKKPNT +N + +   ADSHESY DHA+KYK G KYMKGN+++GPSSWIKSQNVDHRN
Sbjct: 638 NKYKKPNTKYNCRNI---ADSHESYSDHAQKYKFGSKYMKGNKKYGPSSWIKSQNVDHRN 697

Query: 661 SSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKL 720
           S H+P KNWKKT EENDYARVNDDDLSDDL+I+TESEPPEDSEEFKQL+HEAFLKCSK L
Sbjct: 698 SLHKPFKNWKKT-EENDYARVNDDDLSDDLIITTESEPPEDSEEFKQLVHEAFLKCSKML 757

Query: 721 NMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKA 780
           NM  +VRKKYKEQGNAGSLYC+VCGRS SKEFMN+QRLVKHAYMSHKVGL+AQHLGL KA
Sbjct: 758 NMNPSVRKKYKEQGNAGSLYCVVCGRSDSKEFMNSQRLVKHAYMSHKVGLKAQHLGLGKA 817

Query: 781 ICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVV 840
           ICVLMGWNSV P+DTV WVPEVLSKEE V+QKEDLIIWPPVIIVRNVSLSH SPDKW+VV
Sbjct: 818 ICVLMGWNSVFPQDTVTWVPEVLSKEEAVLQKEDLIIWPPVIIVRNVSLSHNSPDKWRVV 877

Query: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGR 900
           TIEALESFLRSKNLLKGRVKMSLGCPADQSVM LKFLPTFSGL DAERLNKFFSENRRGR
Sbjct: 878 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMALKFLPTFSGLTDAERLNKFFSENRRGR 937

Query: 901 EDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
           EDFE+AKC +G V+MEG+KIEEEVLYGYLG AEDL DVELNVRK  MIKSKKEILE+
Sbjct: 938 EDFEVAKCNNGEVKMEGNKIEEEVLYGYLGTAEDLVDVELNVRKF-MIKSKKEILEM 979

BLAST of HG10004139 vs. ExPASy TrEMBL
Match: A0A6J1BX13 (uncharacterized protein LOC111006280 OS=Momordica charantia OX=3673 GN=LOC111006280 PE=4 SV=1)

HSP 1 Score: 1415.6 bits (3663), Expect = 0.0e+00
Identity = 716/957 (74.82%), Positives = 812/957 (84.85%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRR+DYYVRESE+M      +LHAQDRLHLDH RYGK +RE LD+SPRLRRSLSPHR
Sbjct: 1   MQCRRRDDYYVRESESM------KLHAQDRLHLDHDRYGKTRREALDRSPRLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
           +G SR EVGL  RVDT ERR+EDWHLRTGRNN++ S SHSYG ARK PN+EE++  NDHR
Sbjct: 61  VGASRREVGLGQRVDTIERRDEDWHLRTGRNNNVDSRSHSYGQARKKPNFEELYHQNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           QLSDLQQT V+PEPRKF A +EV+DY +HD+RYRHDDLRIRK++E IEGRWS G GQRMT
Sbjct: 121 QLSDLQQTRVVPEPRKFHAGDEVLDY-EHDLRYRHDDLRIRKDKETIEGRWSVGSGQRMT 180

Query: 181 DQKLLATEEGTAMGLYNSHLDIGPTSVYKDFLPFSQSLDVRSLDNERLKFRNDVVSDKPR 240
           DQKLLA EE TAMG Y+S L++G TS+YKDFLP SQSLDVRSLD+ERLKFR+ VVSDK +
Sbjct: 181 DQKLLAMEESTAMGSYSSSLNMGSTSIYKDFLPSSQSLDVRSLDDERLKFRSHVVSDKSQ 240

Query: 241 VTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDEFS 300
           VT S EVEE++RF+SRNIGY ASSGF+S+  E SSSGP TSK LESY+DG YF +SD+F 
Sbjct: 241 VTESHEVEESRRFSSRNIGYLASSGFYSKEYERSSSGPFTSKSLESYQDGQYFEVSDDFP 300

Query: 301 TRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPERT 360
           TRSHGDL+D ++F SYGKRTLVDSAIDLVGG+R+ TPHQ+ TNSP RE  SYFYSKPE T
Sbjct: 301 TRSHGDLMDRLDFKSYGKRTLVDSAIDLVGGERNFTPHQQSTNSPMREHMSYFYSKPEGT 360

Query: 361 VNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANYRT 420
           VN+SNEDPSRVMQKI QTHDY+DY   IVSD GDFSRPK AN+S LKLQN ++  AN+ T
Sbjct: 361 VNDSNEDPSRVMQKINQTHDYIDYGRAIVSDLGDFSRPKVANSSSLKLQNPENLFANHST 420

Query: 421 GIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINLSQ 480
           GIAL+ Y LR+Q VL YPDIG T++ IN+D EYA  GSIHV+VGRRVTQDYE S IN S+
Sbjct: 421 GIALNRYSLREQRVLDYPDIGLTSKTINHDCEYASTGSIHVEVGRRVTQDYEVSDINPSE 480

Query: 481 YCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLRED 540
           Y +  + RSD GSEREVGS+ LKERL+R SMSKCDGE YRN+ RVQRMTEGV  Y LR D
Sbjct: 481 YSKKLHERSDYGSEREVGSHYLKERLHRSSMSKCDGETYRNSERVQRMTEGVSAYKLR-D 540

Query: 541 HMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDHNK 600
            MPKR YF+ DMNLLDHRI+  CEYTP KVVD+YDSGE WMD++ S RY SRKAGFDH K
Sbjct: 541 QMPKRNYFEEDMNLLDHRISMPCEYTPDKVVDMYDSGEAWMDDDTSHRYTSRKAGFDHGK 600

Query: 601 YKKPNTNFNRQILYASADSH--ESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHRN 660
           Y+K N  ++R   +AS DS   E YLDHA+K+K GPKYMKGNRRHGPSSWIKSQNVD RN
Sbjct: 601 YRKSNKKYDRHNFHASDDSFSCERYLDHAQKFKNGPKYMKGNRRHGPSSWIKSQNVDLRN 660

Query: 661 SSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKL 720
           S HRP+K WK TEE+NDY  VNDD LSDD +  TESEPPEDSEEFKQ++HEAFLKCSKKL
Sbjct: 661 SLHRPLKIWKNTEEDNDYVHVNDDGLSDDFIKPTESEPPEDSEEFKQMVHEAFLKCSKKL 720

Query: 721 NMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAKA 780
           NMK  VRKKYKEQGNAGSLYCIVCG S SKEF++T+RLVKHAYMSH+ GLRAQHLGLAKA
Sbjct: 721 NMKPTVRKKYKEQGNAGSLYCIVCGISSSKEFLDTKRLVKHAYMSHRTGLRAQHLGLAKA 780

Query: 781 ICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKVV 840
           ICVLMGWNS +P+DTV WVPEVL KEE VVQKEDLIIWPPVII+RN+SLSH++PD+W+VV
Sbjct: 781 ICVLMGWNSAMPQDTVTWVPEVLPKEEAVVQKEDLIIWPPVIIIRNISLSHSNPDRWRVV 840

Query: 841 TIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGR 900
           TIEALE+FLRSKNLLKGRVK++LG PADQSVMVLKFL  FSGL DAERL+KFFSE R GR
Sbjct: 841 TIEALETFLRSKNLLKGRVKITLGSPADQSVMVLKFLAMFSGLTDAERLHKFFSERRHGR 900

Query: 901 EDFELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKEILEL 956
            +FE+AKC++GG EMEGDK EE +LYGYLG +EDLDDVE NVRKLS IKSKKEILEL
Sbjct: 901 VNFEVAKCRNGGAEMEGDKTEERMLYGYLGISEDLDDVEFNVRKLSTIKSKKEILEL 949

BLAST of HG10004139 vs. ExPASy TrEMBL
Match: A0A6J1HC30 (uncharacterized protein LOC111461470 OS=Cucurbita moschata OX=3662 GN=LOC111461470 PE=4 SV=1)

HSP 1 Score: 1348.2 bits (3488), Expect = 0.0e+00
Identity = 702/964 (72.82%), Positives = 792/964 (82.16%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRREDYYVRESE+M      +LHAQDRLHLDHGRY KP+RE LD+SPRLRRSLSPHR
Sbjct: 1   MQCRRREDYYVRESESM------KLHAQDRLHLDHGRYVKPRREALDRSPRLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
           IG S  EVGL  RVDT ERR+EDW LRTGRNNDIGSS HSYG  R+ PNY+EVFLHNDHR
Sbjct: 61  IGDSWREVGLGQRVDTIERRDEDWRLRTGRNNDIGSSVHSYGQTRERPNYDEVFLHNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           QLS+LQ+THVL EPRK SA++E +DY Q D+RY HDDLRIR EREI  G WSDG  QR  
Sbjct: 121 QLSELQRTHVLSEPRKISAEDEFLDYNQ-DLRYMHDDLRIRIEREINRGNWSDGSEQRRM 180

Query: 181 DQKLLATEEG-TAMGLYNSHLDIGPTSVYKDFLPFSQSLDVRSLDNERLKFRNDVVSDKP 240
           +QKLLA EEG TAMG YNSHLD+ P S+Y+DFLP SQSLD+ SL+NER K+R+D VSDK 
Sbjct: 181 NQKLLAAEEGETAMGSYNSHLDMVPASIYRDFLPSSQSLDMGSLNNERFKYRDDAVSDKS 240

Query: 241 RVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDEF 300
           +  +  EVE N RF+SRNI YSASSGF+SR  ESS S PLT +CLESY+DG Y  ISDEF
Sbjct: 241 QGADYHEVEPNHRFHSRNIEYSASSGFYSRKYESSLSRPLTGRCLESYQDGQYLQISDEF 300

Query: 301 STRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPER 360
           S RSHGD VD  EFNSYGKRTLVDSA  +VGGKR+LTPHQ+GTNS RRE  SYFYSKPE 
Sbjct: 301 SERSHGDFVDTKEFNSYGKRTLVDSA--MVGGKRNLTPHQQGTNSSRREHGSYFYSKPEG 360

Query: 361 TVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANYR 420
           TVN+S E PSRVMQKITQT +Y+DYDS IVS  GDFSRPK +N S+LKL N DD  AN+R
Sbjct: 361 TVNDSYEGPSRVMQKITQTRNYIDYDSAIVSGRGDFSRPKVSNDSLLKLPNVDDSYANHR 420

Query: 421 TGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINLS 480
           TGIALD YRLRKQTVL YPDI   T+A+N+ +EY G GSIH++VGRRVTQ+YE S IN S
Sbjct: 421 TGIALDCYRLRKQTVLDYPDI-ELTKAVNHGSEYVGTGSIHLEVGRRVTQNYEESPINPS 480

Query: 481 QYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLRE 540
           QYCQ S+ARSD GSEREVG + LKERL+  SM KCDGEAYRN   ++RMTEGV TYNL+ 
Sbjct: 481 QYCQKSHARSDYGSEREVGPHLLKERLHESSMFKCDGEAYRNTESLERMTEGVCTYNLK- 540

Query: 541 DHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDHN 600
           D +PKRKYF+ D NLLD RI TSC+Y PSKVVDLY+SGE+WM++E + RY SRKA FDHN
Sbjct: 541 DRVPKRKYFEEDRNLLDRRIGTSCDYMPSKVVDLYNSGEEWMEDETNRRYTSRKAKFDHN 600

Query: 601 KYKKPNTNFNRQILYASADS--HESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHR 660
           KY+KPN  ++R  LYAS DS   ESYLD+ KKY+ GPKYMKGN++ G SSWIKSQNVD R
Sbjct: 601 KYRKPNKKYDRHNLYASDDSFLRESYLDNGKKYETGPKYMKGNKKQGTSSWIKSQNVDRR 660

Query: 661 NSSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKK 720
           NS H+  K W K E EN Y  +NDDDLSDDLVI TESEPPEDSE+F Q++HEAFLKC K 
Sbjct: 661 NSLHKQHKVWNKAEGENGYVYLNDDDLSDDLVIPTESEPPEDSEKFNQMVHEAFLKCLKM 720

Query: 721 LNMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAK 780
           LNMK +VRK+YK+QGN GSLYCIVCGRS+SKEF++TQRLVKHAYMSHK+GLRA+HLGLAK
Sbjct: 721 LNMKASVRKRYKDQGNGGSLYCIVCGRSYSKEFLDTQRLVKHAYMSHKIGLRARHLGLAK 780

Query: 781 AICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKV 840
           AICVLMGWNS LP+DTV WVPE L KEE VVQKEDLIIWPPV+IVRN+S+S ++P KWKV
Sbjct: 781 AICVLMGWNSALPQDTVTWVPEDLHKEEAVVQKEDLIIWPPVVIVRNISMSCSNPGKWKV 840

Query: 841 VTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRG 900
           +TIEALE+FLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGL DAERLNKFF E R G
Sbjct: 841 ITIEALEAFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLTDAERLNKFFLEKRHG 900

Query: 901 REDFELAK-----CKDGGVEMEGDKI-EEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKE 956
           R +FE +K       D G   +G+KI EEEVLYGYLG AEDLD VE N+RK S IKSKKE
Sbjct: 901 RVNFEQSKGDNGRANDVGEATKGNKIEEEEVLYGYLGIAEDLDSVEFNIRKSSSIKSKKE 953

BLAST of HG10004139 vs. ExPASy TrEMBL
Match: A0A6J1JSP3 (uncharacterized protein LOC111487181 OS=Cucurbita maxima OX=3661 GN=LOC111487181 PE=4 SV=1)

HSP 1 Score: 1327.0 bits (3433), Expect = 0.0e+00
Identity = 692/964 (71.78%), Positives = 783/964 (81.22%), Query Frame = 0

Query: 1   MQCRRREDYYVRESENMELHAQDRLHAQDRLHLDHGRYGKPQRETLDQSPRLRRSLSPHR 60
           MQCRRREDYYVRESENM      +LHAQDRLHLDHGRY KP+RE LD+SP LRRSLSPHR
Sbjct: 1   MQCRRREDYYVRESENM------KLHAQDRLHLDHGRYVKPRREALDRSPCLRRSLSPHR 60

Query: 61  IGGSRCEVGLVHRVDTTERRNEDWHLRTGRNNDIGSSSHSYGHARKLPNYEEVFLHNDHR 120
           IG S  EVGL  RVDT ERR+EDW LRTGRNNDIGS+ HSYG  R+ PNY EVFL NDHR
Sbjct: 61  IGDSWREVGLGQRVDTIERRDEDWRLRTGRNNDIGSNVHSYGQTRERPNYNEVFLRNDHR 120

Query: 121 QLSDLQQTHVLPEPRKFSADNEVVDYKQHDVRYRHDDLRIRKEREIIEGRWSDGRGQRMT 180
           QLS+LQ+TH L EPRK S ++E +DY Q D+RY HDDLRIR +REI +G+WSDG  QR  
Sbjct: 121 QLSELQRTHGLSEPRKISVEDEFLDYNQ-DLRYMHDDLRIRIDREINQGKWSDGSRQRRM 180

Query: 181 DQKLLATEEG-TAMGLYNSHLDIGPTSVYKDFLPFSQSLDVRSLDNERLKFRNDVVSDKP 240
           +QKLLA EEG  AMG YNSHLD+ P S+Y+DFLP SQS D+RSLDNER K+R+D VSDK 
Sbjct: 181 NQKLLAAEEGEMAMGSYNSHLDMNPASIYRDFLPSSQSSDMRSLDNERFKYRDDAVSDKS 240

Query: 241 RVTNSREVEENQRFNSRNIGYSASSGFHSRGNESSSSGPLTSKCLESYRDGHYFPISDEF 300
           +  +  EVE N+RF SRN  YSASSGF+SR  ESS S PLT +CLESY+DG Y  ISDEF
Sbjct: 241 QGADYHEVEPNRRFPSRNSEYSASSGFYSRKYESSLSRPLTGRCLESYQDGQYLQISDEF 300

Query: 301 STRSHGDLVDPIEFNSYGKRTLVDSAIDLVGGKRDLTPHQRGTNSPRREQRSYFYSKPER 360
           S RSHGD VD  EFNSYGKRTLVDS   +VGGKR+LTPHQ+GTNS RRE  SYFYSKPE 
Sbjct: 301 SERSHGDFVDTKEFNSYGKRTLVDS--PMVGGKRNLTPHQQGTNSFRREHGSYFYSKPEG 360

Query: 361 TVNNSNEDPSRVMQKITQTHDYVDYDSTIVSDHGDFSRPKAANTSMLKLQNADDPCANYR 420
           TVN+S   PSRVMQKITQT +Y+DYDS IVS  GDFSRPK  N S+LKL N DD  AN+R
Sbjct: 361 TVNDSYAGPSRVMQKITQTRNYIDYDSAIVSGRGDFSRPKVLNDSLLKLPNVDDSYANHR 420

Query: 421 TGIALDHYRLRKQTVLGYPDIGPTTEAINNDNEYAGAGSIHVDVGRRVTQDYERSHINLS 480
           TGIALD YRLRKQTVL YPDI   T+A+N+ +EY G GSIH++VGRRVTQ+Y+ S IN S
Sbjct: 421 TGIALDCYRLRKQTVLDYPDI-ELTKAVNHGSEYVGTGSIHLEVGRRVTQNYKESPINPS 480

Query: 481 QYCQPSYARSDCGSEREVGSYCLKERLNRFSMSKCDGEAYRNNGRVQRMTEGVRTYNLRE 540
           Q+CQ  +ARSD GSER+VG +  KERL+  SM KCDGEAYRN   ++RMTEG+ TYNL+ 
Sbjct: 481 QFCQNLHARSDYGSERDVGPHLSKERLHESSMFKCDGEAYRNTESLERMTEGLCTYNLK- 540

Query: 541 DHMPKRKYFDADMNLLDHRIATSCEYTPSKVVDLYDSGEQWMDEENSCRYISRKAGFDHN 600
           D +PKRKYF+ D NLL HRI TSC+Y PSKVVDLY+SGE+WMD+E + RYISRKA FDHN
Sbjct: 541 DRVPKRKYFEEDRNLLHHRIGTSCDYMPSKVVDLYNSGEEWMDDETNRRYISRKAKFDHN 600

Query: 601 KYKKPNTNFNRQILYASADS--HESYLDHAKKYKPGPKYMKGNRRHGPSSWIKSQNVDHR 660
           KY+K N  ++R  LYAS DS  HESYLD+AKKY+ GPKYMKGN++ G SSWIKSQNVD R
Sbjct: 601 KYRKRNKKYDRHNLYASDDSFLHESYLDNAKKYETGPKYMKGNKKQGTSSWIKSQNVDRR 660

Query: 661 NSSHRPVKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKK 720
           NS H+  K W KTE EN Y  +NDDDLSDDLVI TESEPPEDSE+F Q++HEAFLKC K 
Sbjct: 661 NSLHKQHKVWNKTEGENGYVYLNDDDLSDDLVIPTESEPPEDSEKFNQMVHEAFLKCLKM 720

Query: 721 LNMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKVGLRAQHLGLAK 780
           LNMK +VRK+YK+QGN GSLYCIVCGRS+SKEF++TQRLVKHAYMSHK+GLRAQHLGLAK
Sbjct: 721 LNMKASVRKRYKDQGNGGSLYCIVCGRSYSKEFLDTQRLVKHAYMSHKIGLRAQHLGLAK 780

Query: 781 AICVLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKWKV 840
           AICVLMGWNS LP+DTVIWVPE L KEE VVQKEDLIIWPPVIIVRN+S+S ++P KWKV
Sbjct: 781 AICVLMGWNSALPQDTVIWVPEDLHKEEAVVQKEDLIIWPPVIIVRNISMSRSNPGKWKV 840

Query: 841 VTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRG 900
           +TIEALE+FLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGL DAERL+KFF E R G
Sbjct: 841 ITIEALEAFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLTDAERLDKFFLEKRHG 900

Query: 901 REDFELAKCKDGGVEMEGDKI------EEEVLYGYLGAAEDLDDVELNVRKLSMIKSKKE 956
           R +FE +K  +G     GD        EEEVLYGYLG AEDLD VE N+RK S IKSKKE
Sbjct: 901 RVNFEQSKGNNGKGNDVGDATERNEIEEEEVLYGYLGIAEDLDSVEFNIRKSSSIKSKKE 953

BLAST of HG10004139 vs. TAIR 10
Match: AT3G22430.1 (CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); BEST Arabidopsis thaliana protein match is: XS domain-containing protein / XS zinc finger domain-containing protein-related (TAIR:AT5G23570.1); Has 565 Blast hits to 510 proteins in 121 species: Archae - 2; Bacteria - 90; Metazoa - 191; Fungi - 32; Plants - 51; Viruses - 4; Other Eukaryotes - 195 (source: NCBI BLink). )

HSP 1 Score: 131.7 bits (330), Expect = 3.1e-30
Identity = 88/257 (34.24%), Positives = 135/257 (52.53%), Query Frame = 0

Query: 707 IHEAFLKCSKKLNMKTNVRKKYKEQGNAGSLYCIVCGRSHSKEFMNTQRLVKHAYMSHKV 766
           + ++FL   K++      +K Y E G  G L C+VCGRS SK+  +T  LV H Y S   
Sbjct: 253 LKKSFLGFVKRVFEDPMEKKNYLENGRKGRLQCLVCGRS-SKDVQDTHSLVMHTYCSDDS 312

Query: 767 GLRAQHLGLAKAICVLMGWN-SVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNV 826
             R  HLGL KA+CVLMGWN S  P ++  +  + L  +E  + +  LIIWPP +IV+N 
Sbjct: 313 SSRVHHLGLHKALCVLMGWNFSKAPDNSKAY--QNLPADEAAINQAQLIIWPPHVIVQNT 372

Query: 827 SLSHTSPDKWKVVTIEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAE 886
           S       + +    + +++ +R   L  G+ K   G      + + KF    SGL DA 
Sbjct: 373 STGKGKEGRMEGFGNKTMDNRIRELGLTGGKSKSLYGREGHLGITLFKFAGDDSGLRDAM 432

Query: 887 RLNKFFSENRRGREDF----ELAKCKD-----GGVEMEGDKIEEE-VLYGYLGAAEDLDD 946
           R+ ++F +  RGR+ +     L   KD     G VE++G   E++ + YGYL    DLD 
Sbjct: 433 RMAEYFEKINRGRKSWGRVQPLTPSKDDEKNPGLVEVDGRTGEKKRIFYGYLATVTDLDK 492

Query: 947 VELNVRKLSMIKSKKEI 953
           V++  +K + I+S +E+
Sbjct: 493 VDVETKKKTTIESLREL 506

BLAST of HG10004139 vs. TAIR 10
Match: AT5G23570.1 (XS domain-containing protein / XS zinc finger domain-containing protein-related )

HSP 1 Score: 45.8 bits (107), Expect = 2.2e-04
Identity = 68/276 (24.64%), Positives = 115/276 (41.67%), Query Frame = 0

Query: 664 VKNWKKTEEENDYARVNDDDLSDDLVISTESEPPEDSEEFKQLIHEAFLKCSKKLNMKTN 723
           V N  + E ++D    +DDDL+ D   S  S+    S +  +   + F         + N
Sbjct: 161 VDNASEEENDSDALDDSDDDLASDDYDSDVSQKSHGSRKQNKWFKKFFGSLDSLSIEQIN 220

Query: 724 VRKKYKEQGNAGSLYCIVCGRSHSK-EFMNTQRLVKHAYMSHKVGLRAQHLGLAKAI--C 783
             ++          +C  C       ++ N   L+ HA       ++  H  LA+ +   
Sbjct: 221 EPQR--------QWHCPACQNGPGAIDWYNLHPLLAHARTKGARRVKL-HRELAEVLEKD 280

Query: 784 VLMGWNSVLPRDTVIWVPEVLSKEETVVQKEDLIIWPPVIIVRNVSLSHTSPDKW-KVVT 843
           + M   SV+P   +    + L ++E    K+  I+WPP++I+ N  L     DKW  +  
Sbjct: 281 LQMRGASVIPCGEIYGQWKGLGEDE----KDYEIVWPPMVIIMNTRLDKDDNDKWLGMGN 340

Query: 844 IEALESFLRSKNLLKGRVKMSLGCPADQSVMVLKFLPTFSGLADAERLNKFFSENRRGRE 903
            E LE F + + L   R + S G    + + VL F  + +G  +AERL++  +E    R 
Sbjct: 341 QELLEYFDKYEAL---RARHSYGPQGHRGMSVLMFESSATGYLEAERLHRELAEMGLDRI 400

Query: 904 DF-ELAKCKDGGVEMEGDKIEEEVLYGYLGAAEDLD 935
            + +      GGV           LYG+L   +DLD
Sbjct: 401 AWGQKRSMFSGGVRQ---------LYGFLATKQDLD 411

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038884675.10.0e+0088.69uncharacterized protein LOC120075393 [Benincasa hispida] >XP_038884676.1 unchara... [more]
XP_008456586.10.0e+0081.40PREDICTED: uncharacterized protein LOC103496499 [Cucumis melo] >XP_008456587.1 P... [more]
TYK11753.10.0e+0081.40uncharacterized protein E5676_scaffold304G00720 [Cucumis melo var. makuwa][more]
XP_011656567.10.0e+0081.42uncharacterized protein LOC101208223 [Cucumis sativus] >XP_031743187.1 uncharact... [more]
XP_022133809.10.0e+0074.82uncharacterized protein LOC111006280 [Momordica charantia][more]
Match NameE-valueIdentityDescription
A5YVF11.4e-0626.48Protein SUPPRESSOR OF GENE SILENCING 3 OS=Solanum lycopersicum OX=4081 GN=SGS3 P... [more]
A1Y2B72.0e-0527.06Protein SUPPRESSOR OF GENE SILENCING 3 homolog OS=Zea mays OX=4577 GN=SGS3 PE=1 ... [more]
Q2QWE96.3e-0425.27Protein SUPPRESSOR OF GENE SILENCING 3 homolog OS=Oryza sativa subsp. japonica O... [more]
Match NameE-valueIdentityDescription
A0A1S3C3690.0e+0081.40uncharacterized protein LOC103496499 OS=Cucumis melo OX=3656 GN=LOC103496499 PE=... [more]
A0A5D3CIK80.0e+0081.40XS domain-containing protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sca... [more]
A0A6J1BX130.0e+0074.82uncharacterized protein LOC111006280 OS=Momordica charantia OX=3673 GN=LOC111006... [more]
A0A6J1HC300.0e+0072.82uncharacterized protein LOC111461470 OS=Cucurbita moschata OX=3662 GN=LOC1114614... [more]
A0A6J1JSP30.0e+0071.78uncharacterized protein LOC111487181 OS=Cucurbita maxima OX=3661 GN=LOC111487181... [more]
Match NameE-valueIdentityDescription
AT3G22430.13.1e-3034.24CONTAINS InterPro DOMAIN/s: Domain of unknown function XS (InterPro:IPR005380); ... [more]
AT5G23570.12.2e-0424.64XS domain-containing protein / XS zinc finger domain-containing protein-related [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description         Source       Source Term    Source Description                              Alignment
None       No IPR available        COILS        Coil           Coil                                            coord: 943..955
None       No IPR available        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 332..352
None       No IPR available        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 253..279
None       No IPR available        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 629..664
None       No IPR available        MOBIDB_LITE  mobidb-lite    disorder_prediction                             coord: 241..279
None       No IPR available        PANTHER      PTHR46619:SF4  XS DOMAIN-CONTAINING PROTEIN-RELATED            coord: 1..955
None       No IPR available        PANTHER      PTHR46619      RNA RECOGNITION MOTIF XS DOMAIN PROTEIN-RELATED coord: 1..955
IPR038588  XS domain superfamily   GENE3D       3.30.70.2890   XS domain                                       coord: 807..954; e-value: 1.2E-30; score: 108.5
IPR005380  XS domain               PFAM         PF03468        XS                                              coord: 811..934; e-value: 8.3E-19; score: 68.0

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10004139.1  HG10004139.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0031047 gene silencing by RNA