HG10020798 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
Name: HG10020798
Type: gene
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: P-loop containing nucleoside triphosphate hydrolases superfamily protein
Location: Chr05: 2531079 .. 2538349 (+)
RNA-Seq Expression: HG10020798
Synteny: HG10020798
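
The Location field above can be read programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for genome browsers, but not stated on this page); the function name `parse_location` is hypothetical and not part of any database API.

```python
import re

def parse_location(text):
    """Split a 'Chr: start .. end (strand)' string into its parts.

    Assumes 1-based inclusive coordinates, so span = end - start + 1.
    """
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, strand = m.group(1), m.group(4)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, strand, end - start + 1

# Location string copied from the Overview of this record
print(parse_location("Chr05: 2531079 .. 2538349 (+)"))
```

Under that assumption the gene spans 7271 bp on the forward strand of Chr05.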
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

ATGGATCAACATCTTCATCATCCACAACAATGGCATCCCAGGCCAATCCAAGCTACTGTCTGTCCAATTTGCGCAATGCCTCACTTCCCCTTTTGCCCTCCCCATCCATCCTTCAACCAAAACCCTAGATATCCCTTCGGACCCGATCCCTCTTTTCAAACCCCCGGTTTCGACTCTCATCGTCCACCCGTGGGGATGCCGCCTCCGTATATGGGGAATCCCGACGATGGTTTCGGAGATCAGAGGCCGTGGATTAGAAATTCTGCCAATCCAATTGGGCATGTCCCGTTCCACCCTCACAGAGAAGGGGTTTTCCCGCCACCGTATGATTATGGCGGGAATGAATTTGTTAACGACGCTGAAAGAAGCTACAAGAGGCCGAGGGTTGATGATGTGGGTTCGGATGGTGTCGTTCATGAGCTTAATCAGAATCAGAAGAGCGGCAGGAGTTCATATGAGGATGAGCGTAGGTTGAAGTTGATTCGGGATCATGGAGTTGTATCGAGTGGACCGCCTGAAGGTGGTTCTAATTCCTTGCCGAGAATGAATTTGGGTTCTAATAGCGAAGCAAACAGACGCACTCTTGAAAATTCGGTGGGATCTGAAGACCCGGAAGAAGTCGGCAGTACGAGAATCTTGGAAACTAATAACTTTCAAGATCCGGGTAATGGCAATAACGATGGAAGAACTCAAAACTTTCAGGAGAATGGTAGAATTGACACGCGGCGGCTTTCCCAAAACGAAGAATTTTCACATGCTTGTTATGATCAGGTTGGAGGCCATTGGCGTATGCCGCACTCTGTTCCTCCTGAAGCCACCGAAGACAACTATCTTTCTCATAGAAACGAATTGCATTATTCTGATAACCGGCATGCATTTTCCTGGATGGATGATAGAAATAACAGCAAAATGAACATTCTTGATCGTGATTATCAGCCACCCCCTCGCTCTGAGATGAATTCCATCCATATGAGACCGTTTTCATCGCATGGAAATGCTCATCACAGTAGAAACTTGAATTTTGGTGCTGGATACGCTCCACGGCTTTCTGGGGGTGGCAGGTTCTTGGAAAATGGAAGTTCAATTGAAGATTCTCGCTTCTTTGGTGAACAACCCCCTCTTCCTGCTTCTCCACCACCGCCTATGCCTTGGGAAGCACACTTGCACGCTTCTGCAGAGTCTATGGCCTACTCTTCTCAAGCAAAACCTTCATCCCTGTTTCCTGTTCCTGTTAGTACCTCAACAATAACGTCGGCAGCATATTCTTCAGTTCCTGAACACCGTTCCTTTCACCACCATAAACCAATGCCTCACGTTTCTTCTAGCCCTATGATGGAGGTTCGAAGAATTTGATATGTATTTTTATCATGGTTGCATTTTTTTTGGTGCATTGGTAATTTTTATGGGGATAGTGTATGAAGCTCTTCGTTCAATTTATTTATTTGTAGGATTCTCTGGCATTGCACCCATATTCTAAGAAGTATGCTGCAGATGGAAAAGCTTTTGGATTGAATCAATTGCCTCCGCAAAAGCCCAAAGTTATTGATGCTTCGCATTTATTCAAGCTACCTCATCGGTCTACTCGTCCAGATCATATTGTGGTTATTCTTCGAGGGCTTCCAGGTATTATTTTCTTGTTTGCACCTGTCTACTCAGAATTGTACATCTTGCTGGAGTTGTGGAGAATAGAAAAATGATTGTTGTATTTCTCTTCATGCTCCACATCCTACACTTATTGCTTCACATTATCATTGGCTTCCGTTGCTTAATGAATACTCAAAAAGTTCATTCGAAATATTTTCTTTGGAGGAACTTGATTGTGACAAGATGACTTGTCTAAATCGAATTGCACATTTCATGAGTTGTCATATCTTTTGCTGATAATAATAAGAAAATATCTCTTTGAACGTGGGAATAGAAAATGGACCGTAAGAATTGCTGTTTTTTCCAGGAAGTGGAAAAAGCTATTTGGCAAAGATGTTGCGTGATGTTGAAGTTGA
AAATGGTGGTGATGCTCCTCGTATACATTCAATGGACGATTACTTCATGACAGAAGTGGAAAAGGTACGCTTGATTTATATCCTCTTTTAGTTTGTTTACAGTGCTTATTATAATGTTTGGTTTTAGACTTTTAGTTCTTTTAATTGATAATTGATTTAGGTTATTATTTAACTTTATATGCAACACCTTCTTTTTCTGACAGGTTGAGGAAGGTGATGCCAAATCATCAAATTCAATTAAAGGCAAGAAGCCAATCATGAAGAAGGTCATGGAATATTGTTACGAACCTGAAATGGAGGAGGTATATGTACAGATGATATAAGGAGTGCCATTTTATACATATGGTCCTGCATGTTTTATACCAACACACACACACACACTCACAATTAATTTCGATATCCAATTATTTGTTGTATTTTGCTTAAGACCGGATGTTCCTTGTGTTTCGTGCATCCTAATCGGCCTAGTGTTTTGGGTGGCTTGGTAGTTCCATTTGATTGTGGTTGCTTTGAAGATTAGGAGCGAATATAGTTTCATTATCCATTTTGCTGTGCAAGTTAGCTAACAGCACCAATGTCAATGCAATAACTATTGCAATAATGTAATAAATAAGAGCTGCAAAGCATTTTAGTTTTCCAAACATCTTTCTAAGACATTATCTGATGATGGTATATTTTTGTTAGGCTTATCGGTCAAGCATGTTGAAAGCATTCAGGAAGACCCTTGAGGAGGGGATATTCACCTTTGTAATTGGTATGAACTCTGCTCCTACGAGAATCTCAGAATGATTGTATGAGGTCCTACATTTCCCCTATCATTTTAACTATTTCTGAACTATTTTTAATAACATTTTCAATGTTGAGGTCTTTCAAAGTGAAGCCAATGGCCCTATATTGTGAATTTTGTACTTAATCTGGTGACTTGACCTTTTTTTTTTGTGAATTTTTGTTTCAAAATTGTTGGTGCATAGTTTAGCATTCAGCTTATCATTTCCCATGCCAATGGTAGCATAAAAAGAGGTGGCATGGTTGGTGATAGTCTATACCTTAGGTATTTAAAAAGGTTAAAAAACGTTAACAATTCTCATAGTCTTGGTTAAGACATGCATTAGTTAAGTGATCTAGTTTGATTTTAGATTAAGTGGTGATGCAGTGCCAATGCCACTGCATTCCTTTGCACTAACTACAATGCAGTTTGTTCGGCAATTGATCCTTTTGTTGGTGGTATGTGTGCCTCGTCTTAGTGGATGACCGCAATCTGCGGGTAGCTGATTTTGCTCAGTTTTGGGCAATTGCAAAGGTATATTCTCAATTCTCTCTTTCTTACAACCCTAGCCGTCGTGTACTGAAGTGGCAACTTAAATGTGGGTGGCAACATCATTTATTTAAGTTCTGAGTTATCCAATGGGATTGTGCATAGTGACGTTAGTAAGTAGTTTGAAATAGGTAGTGAATGGGTTGACAGCATAAAAAAAGGGACTGAGATGATTTCTTCAGAACCGATGTTGATATAGGCTATACTTTATTTCATCTACATTTGAGTCTTGGTCACAATATTTTCCATGCTGTCGAGGCTGCCTTGAAAAGATGTAGGGTTTTTATACTTTTTTTTGCTGCATAGGATTCACGCACCAACCAATTCTTATTTGGCTTTTGCTTTGGATCCAACCCAAGGGCTTATGACCTGAAAACATTCATTTTGAATTTTAAATGTCCATATGGTTAACAACACTTTATGTGCTGTTGGCTTCTATCAGTGTTATTTCTGATGCCCTCATTTGATTTCATGCATCTTGTTGGCGTACTTTTATCGTTTTGAATTAATCTCTATTTGTAATTGAATCCATTTAGGGAGCTTTTATGAAAATCCCGTGGGAGTGTATGCACATTTTTTTAATAACTTCTCACTAAGTTGATGACACTGTCTCCTTTTCCTATAAAATTGTGATTTAGTATTTTATTTTACTGTGTTCTTACACACACACACACACACATATATA
TTTCTATTTCTATAAGTTTTTTTACATAATTATTTCAATATACTTGGGATGCTTGTGGTTCTTGGTCTGCGGTTTTAATGGTACTTTCACTGCTTCCTGTTGATTAACACCAATCTTGTCAATTGTGTAATACTCTAATGCTAGAGCACTACCTTTTGTTTTCCTAGCAAGTCTTTGCTTTTGGGATTTGTTATGGTAGTTTCCAAATAGCGTCATGTTAGTCTGTTTATAAAAGGTCGCAGGAGACCAAATGAAGATCATAATAACCTTCTCTACTGACTTTGCTGCTTTTTACAAAGAGATTTTTCTAATTTCTGATAATAATATTCAATTATGTTTCTTTTTATTTACACGAAGGAAAATTGATCAGCAGAATTATGATGATCGACAATAACTAACGTGTGCAATTTCTGAGTAGTAATCTGACGACAATGAACATAGTTTTTATAGAACGGTTGTGTTACTCATTACATAACCTGTACTGGAACTTGGAAGCTTAGTTATAGTAAATAGTCGAGAGTAGTAATTTTAGGGTTTTGCTGTCTACTTTTTTGTTTTCTTATTATTTCTTTATTTATCTCTTGCTAGAGTTCGGGGTATGAAGTTTACATTTTGGAAGCTACCTATAAGGACCCTGCAGTAAGTTTTTTTCTCCTCCTATCTATTTTATTTGGCGGTGTTTGTCATTGACTTTAAGACAGTACAAAAGTATTAGAGAACCAAGGTCTGACAGAAATGTGATGATCTACCGAAACCATTATGAATAGCCTTTCTTTAGACACAATGAATTTACCATAATTGTATATCCTTTCCATTCTCTGTCTGCATTATGTGTAGTTATGCTCTAATTGGTGTCTTTGTTTAATATGCATGTTTGTGAGTGATTTTAAAATTGTAAAGTTCACTTTTGTCATTTTTAAAATCACTATAAAACACACTTTTGATATCTTAAAATTAATTTAATATTTCACTTTTAAATGCAATTTTCATATCATCAAAATGGGTTTTATTGTGATTTTGAAAATGACAAAAGTGATTTTAACTATTTTTTAATCACACCCAAACATGTCACCCAAACATGCCATTAGGGTCCATTTTGATAACATTTTCATTTCTCGTCTTTGTTTCTTGGATCCTCTTTTTTAATAAAGAAAAACGGAACTGTGTTTGATAATTGTTTCATTGTTTCTTGTTGTTTCTTTTTTATTGGGGAAAACAAAATAGAAACTTGTTTCGTAACATTTTTTTGTTCTCTGAACTTTTAAACGAAAATTCTTTAACATAAAAATTAGTTTTTAAAATTAAACATTGAAAGATTTAATGTGATATTTATTTTTAGATATTTATTATGCAATTTAGAAGTAACAGAAAATGATAAACTAATACTTCTATTTATGTTTCAGAAAAATCTTGAAACATAACCAAGAAACTTGATTTTGTTGTTTCCAAAACTTTTATGCAACTTTGAGAACTTTTCTGAGAAATGAGAAACAACAAATATGAATAGAGGATTGGTCACCAAACATGTTTCTCAAAACTGGAAATAAGAAACAAGAATGTTATGAAATGGGCCCTCCATGTCCAATTAATTTAACTGCTAATCCCGATCCTGGTAAAAGATTAAAATTATTTTTGACAACTCTATGTGTTCTGCTCTATTAGAATTGAGGCCTTATCAATTGAAAGCCAGGGAAAATTCTTGAAATGTTTTTATGGGACATTTAGGGCTGTGCAGCAAGGAATGTGCATGGGTTTAACCTTGATGATATACAAAAGATGGCTAGACAATGGGAAGAAGCTCTGCCTTTGTACTTACAATTGGACATCAAGGTTTGTTAGAATAAAGCATGAGGTTGATCTGTTTTGAACCTACTTTGTAAAAAAAGCATTCAAATTTCTTTCTGATGAGAATAGAATGCATAATTATTTGGGAAGGAATGGAAATTACCAATTTATAGAGGTGGTAAAAGGAGCCTTTTTACCTGTAATGAGAGTAATTC
TTTTTCTTGTGACGACTTGCAGTCCTTATGTCATGGGGATGACCTTAAAGAAAGTGGAATTCAGGAGGTAATGTCTTTTTTTTTTGGTAGTGAAGTTTTGAAATAATCTTATATATATATTTAGTGTGATTTGTGCATGTCATGTCTGTCTGCCAGGTTGACATGGATATGGAAGAGGAAGACGATGACAGCCCTAGTTTTCAAGAAACGAAGTCTGAGAAGACAGCATTACCTCCTCTAAGAGATTATGCTTCTGAAGGTATAGTAAAATGAATGAGAAAAATCATATATATGCATTTTTCAGGAGTTCATCTGGTATTTTAAAAGTTTTTCATGTAGTGGTAGGCTAACAACACATCATTACAGGTTTGACTCACAAGTAGTGTCTTCTGTCATCTCATTTAGTTTTCTTCGGCTCTTCACTATTCAGATTTTGGTTTCTCACATCACATTCTTCTGCAGATGATGAGAAGAGATGGGATGCAGAACCGGGCCATCTGAGAGACGAAGTAAAAGAGTTAGGTAGGAGTAAATGGTCAAATGATTTAGATGATGATGATACAGAAAGAACTGATGGCCGGAACGGTCATTCAAATGCTCTCTCTGGTCTGATTCAAGCATACGCCAAAGAAGGAAAGTCGGTGCGCTGGATGGACCAGGTTTGCATTGAGTTTATTTTCTCCCTACTGTTGGGTATCTGGAATTTTCTTGTGCCAAATGTACAAAGTCTCTCATATTCAATGTTCAAAAACTTTCTGTTTGTATCTTAATGTCTTCCATGTCCACAAGCTCTTGGTTTTTCCTTTAATATTACGCCCTTCTCCTTTTTCCAGGCTGTTAATACCGGATTCTCAATCGGTGCCGCAAAAAAGGCAAACAGATTATCTTTAGTAATTGGTCCTGGTGCTGGATATAACCTGGTTAGTCATTGACTGCATTCTGATGAAAATATTTGTGAAACTAAAATCTTGTGTCGTGGAACGGGACTCAGTAAGATAACTTAGATGGATTAATTGCCTATAGCAACTTTACTTAGAAGCAAGTAGTTCCAAGAATCAGCCATGTTGTAAAAGTGTTGATTCTCGTTTGGTATAAATTTGCAGAAATCCAACCCATTAGCAGAAGAACACCGTGGCTCAACCCAAAACAGCAATGAGTCAAAGAAACACAGCAGATTCGAGGAGCGATTGCGCGCAGAAAGTGAATCATTCAAAGTCGTTTTCGATAAAAGGCGACAAAGAATTGGAGGACTTGATTGGGAAGAGGAATAG

mRNA sequence

ATGGATCAACATCTTCATCATCCACAACAATGGCATCCCAGGCCAATCCAAGCTACTGTCTGTCCAATTTGCGCAATGCCTCACTTCCCCTTTTGCCCTCCCCATCCATCCTTCAACCAAAACCCTAGATATCCCTTCGGACCCGATCCCTCTTTTCAAACCCCCGGTTTCGACTCTCATCGTCCACCCGTGGGGATGCCGCCTCCGTATATGGGGAATCCCGACGATGGTTTCGGAGATCAGAGGCCGTGGATTAGAAATTCTGCCAATCCAATTGGGCATGTCCCGTTCCACCCTCACAGAGAAGGGGTTTTCCCGCCACCGTATGATTATGGCGGGAATGAATTTGTTAACGACGCTGAAAGAAGCTACAAGAGGCCGAGGGTTGATGATGTGGGTTCGGATGGTGTCGTTCATGAGCTTAATCAGAATCAGAAGAGCGGCAGGAGTTCATATGAGGATGAGCGTAGGTTGAAGTTGATTCGGGATCATGGAGTTGTATCGAGTGGACCGCCTGAAGGTGGTTCTAATTCCTTGCCGAGAATGAATTTGGGTTCTAATAGCGAAGCAAACAGACGCACTCTTGAAAATTCGGTGGGATCTGAAGACCCGGAAGAAGTCGGCAGTACGAGAATCTTGGAAACTAATAACTTTCAAGATCCGGGTAATGGCAATAACGATGGAAGAACTCAAAACTTTCAGGAGAATGGTAGAATTGACACGCGGCGGCTTTCCCAAAACGAAGAATTTTCACATGCTTGTTATGATCAGGTTGGAGGCCATTGGCGTATGCCGCACTCTGTTCCTCCTGAAGCCACCGAAGACAACTATCTTTCTCATAGAAACGAATTGCATTATTCTGATAACCGGCATGCATTTTCCTGGATGGATGATAGAAATAACAGCAAAATGAACATTCTTGATCGTGATTATCAGCCACCCCCTCGCTCTGAGATGAATTCCATCCATATGAGACCGTTTTCATCGCATGGAAATGCTCATCACAGTAGAAACTTGAATTTTGGTGCTGGATACGCTCCACGGCTTTCTGGGGGTGGCAGGTTCTTGGAAAATGGAAGTTCAATTGAAGATTCTCGCTTCTTTGGTGAACAACCCCCTCTTCCTGCTTCTCCACCACCGCCTATGCCTTGGGAAGCACACTTGCACGCTTCTGCAGAGTCTATGGCCTACTCTTCTCAAGCAAAACCTTCATCCCTGTTTCCTGTTCCTGTTAGTACCTCAACAATAACGTCGGCAGCATATTCTTCAGTTCCTGAACACCGTTCCTTTCACCACCATAAACCAATGCCTCACGTTTCTTCTAGCCCTATGATGGAGGATTCTCTGGCATTGCACCCATATTCTAAGAAGTATGCTGCAGATGGAAAAGCTTTTGGATTGAATCAATTGCCTCCGCAAAAGCCCAAAGTTATTGATGCTTCGCATTTATTCAAGCTACCTCATCGGTCTACTCGTCCAGATCATATTGTGGTTATTCTTCGAGGGCTTCCAGGAAGTGGAAAAAGCTATTTGGCAAAGATGTTGCGTGATGTTGAAGTTGAAAATGGTGGTGATGCTCCTCGTATACATTCAATGGACGATTACTTCATGACAGAAGTGGAAAAGGTTGAGGAAGGTGATGCCAAATCATCAAATTCAATTAAAGGCAAGAAGCCAATCATGAAGAAGGTCATGGAATATTGTTACGAACCTGAAATGGAGGAGGCTTATCGGTCAAGCATGTTGAAAGCATTCAGGAAGACCCTTGAGGAGGGGATATTCACCTTTGTAATTGTGGATGACCGCAATCTGCGGGTAGCTGATTTTGCTCAGTTTTGGGCAATTGCAAAGAGTTCGGGGTATGAAGTTTACATTTTGGAAGCTACCTATAAGGACCCTGCAGGCTGTGCAGCAAGGAATGTGCATGGGTTTAACCTTGATGATATACAAAAGATGGCTAGACAATGGGAAGAAGCTCTGCCTTTGTACTTACAATTGGA
CATCAAGTCCTTATGTCATGGGGATGACCTTAAAGAAAGTGGAATTCAGGAGGTTGACATGGATATGGAAGAGGAAGACGATGACAGCCCTAGTTTTCAAGAAACGAAGTCTGAGAAGACAGCATTACCTCCTCTAAGAGATTATGCTTCTGAAGATGATGAGAAGAGATGGGATGCAGAACCGGGCCATCTGAGAGACGAAGTAAAAGAGTTAGGTAGGAGTAAATGGTCAAATGATTTAGATGATGATGATACAGAAAGAACTGATGGCCGGAACGGTCATTCAAATGCTCTCTCTGGTCTGATTCAAGCATACGCCAAAGAAGGAAAGTCGGTGCGCTGGATGGACCAGGCTGTTAATACCGGATTCTCAATCGGTGCCGCAAAAAAGGCAAACAGATTATCTTTAGTAATTGGTCCTGGTGCTGGATATAACCTGAAATCCAACCCATTAGCAGAAGAACACCGTGGCTCAACCCAAAACAGCAATGAGTCAAAGAAACACAGCAGATTCGAGGAGCGATTGCGCGCAGAAAGTGAATCATTCAAAGTCGTTTTCGATAAAAGGCGACAAAGAATTGGAGGACTTGATTGGGAAGAGGAATAG

Coding sequence (CDS)

ATGGATCAACATCTTCATCATCCACAACAATGGCATCCCAGGCCAATCCAAGCTACTGTCTGTCCAATTTGCGCAATGCCTCACTTCCCCTTTTGCCCTCCCCATCCATCCTTCAACCAAAACCCTAGATATCCCTTCGGACCCGATCCCTCTTTTCAAACCCCCGGTTTCGACTCTCATCGTCCACCCGTGGGGATGCCGCCTCCGTATATGGGGAATCCCGACGATGGTTTCGGAGATCAGAGGCCGTGGATTAGAAATTCTGCCAATCCAATTGGGCATGTCCCGTTCCACCCTCACAGAGAAGGGGTTTTCCCGCCACCGTATGATTATGGCGGGAATGAATTTGTTAACGACGCTGAAAGAAGCTACAAGAGGCCGAGGGTTGATGATGTGGGTTCGGATGGTGTCGTTCATGAGCTTAATCAGAATCAGAAGAGCGGCAGGAGTTCATATGAGGATGAGCGTAGGTTGAAGTTGATTCGGGATCATGGAGTTGTATCGAGTGGACCGCCTGAAGGTGGTTCTAATTCCTTGCCGAGAATGAATTTGGGTTCTAATAGCGAAGCAAACAGACGCACTCTTGAAAATTCGGTGGGATCTGAAGACCCGGAAGAAGTCGGCAGTACGAGAATCTTGGAAACTAATAACTTTCAAGATCCGGGTAATGGCAATAACGATGGAAGAACTCAAAACTTTCAGGAGAATGGTAGAATTGACACGCGGCGGCTTTCCCAAAACGAAGAATTTTCACATGCTTGTTATGATCAGGTTGGAGGCCATTGGCGTATGCCGCACTCTGTTCCTCCTGAAGCCACCGAAGACAACTATCTTTCTCATAGAAACGAATTGCATTATTCTGATAACCGGCATGCATTTTCCTGGATGGATGATAGAAATAACAGCAAAATGAACATTCTTGATCGTGATTATCAGCCACCCCCTCGCTCTGAGATGAATTCCATCCATATGAGACCGTTTTCATCGCATGGAAATGCTCATCACAGTAGAAACTTGAATTTTGGTGCTGGATACGCTCCACGGCTTTCTGGGGGTGGCAGGTTCTTGGAAAATGGAAGTTCAATTGAAGATTCTCGCTTCTTTGGTGAACAACCCCCTCTTCCTGCTTCTCCACCACCGCCTATGCCTTGGGAAGCACACTTGCACGCTTCTGCAGAGTCTATGGCCTACTCTTCTCAAGCAAAACCTTCATCCCTGTTTCCTGTTCCTGTTAGTACCTCAACAATAACGTCGGCAGCATATTCTTCAGTTCCTGAACACCGTTCCTTTCACCACCATAAACCAATGCCTCACGTTTCTTCTAGCCCTATGATGGAGGATTCTCTGGCATTGCACCCATATTCTAAGAAGTATGCTGCAGATGGAAAAGCTTTTGGATTGAATCAATTGCCTCCGCAAAAGCCCAAAGTTATTGATGCTTCGCATTTATTCAAGCTACCTCATCGGTCTACTCGTCCAGATCATATTGTGGTTATTCTTCGAGGGCTTCCAGGAAGTGGAAAAAGCTATTTGGCAAAGATGTTGCGTGATGTTGAAGTTGAAAATGGTGGTGATGCTCCTCGTATACATTCAATGGACGATTACTTCATGACAGAAGTGGAAAAGGTTGAGGAAGGTGATGCCAAATCATCAAATTCAATTAAAGGCAAGAAGCCAATCATGAAGAAGGTCATGGAATATTGTTACGAACCTGAAATGGAGGAGGCTTATCGGTCAAGCATGTTGAAAGCATTCAGGAAGACCCTTGAGGAGGGGATATTCACCTTTGTAATTGTGGATGACCGCAATCTGCGGGTAGCTGATTTTGCTCAGTTTTGGGCAATTGCAAAGAGTTCGGGGTATGAAGTTTACATTTTGGAAGCTACCTATAAGGACCCTGCAGGCTGTGCAGCAAGGAATGTGCATGGGTTTAACCTTGATGATATACAAAAGATGGCTAGACAATGGGAAGAAGCTCTGCCTTTGTACTTACAATTGGA
CATCAAGTCCTTATGTCATGGGGATGACCTTAAAGAAAGTGGAATTCAGGAGGTTGACATGGATATGGAAGAGGAAGACGATGACAGCCCTAGTTTTCAAGAAACGAAGTCTGAGAAGACAGCATTACCTCCTCTAAGAGATTATGCTTCTGAAGATGATGAGAAGAGATGGGATGCAGAACCGGGCCATCTGAGAGACGAAGTAAAAGAGTTAGGTAGGAGTAAATGGTCAAATGATTTAGATGATGATGATACAGAAAGAACTGATGGCCGGAACGGTCATTCAAATGCTCTCTCTGGTCTGATTCAAGCATACGCCAAAGAAGGAAAGTCGGTGCGCTGGATGGACCAGGCTGTTAATACCGGATTCTCAATCGGTGCCGCAAAAAAGGCAAACAGATTATCTTTAGTAATTGGTCCTGGTGCTGGATATAACCTGAAATCCAACCCATTAGCAGAAGAACACCGTGGCTCAACCCAAAACAGCAATGAGTCAAAGAAACACAGCAGATTCGAGGAGCGATTGCGCGCAGAAAGTGAATCATTCAAAGTCGTTTTCGATAAAAGGCGACAAAGAATTGGAGGACTTGATTGGGAAGAGGAATAG

Protein sequence

MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSHRPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDAERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLPRMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRIDTRRLSQNEEFSHACYDQVGGHWRMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHHSRNLNFGAGYAPRLSGGGRFLENGSSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDYASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLAEEHRGSTQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE
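
As a quick consistency check between the CDS and protein records above, the CDS can be translated with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code (NCBI translation table 1) applies, and the helper `translate` is a hypothetical name, not a function provided by the database. Only the first 24 and last 33 nucleotides of the CDS are used, to keep the example short.

```python
# Standard genetic code (NCBI table 1), laid out in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First 24 nt and last 33 nt of the CDS shown in this record
print(translate("ATGGATCAACATCTTCATCATCCA"))
print(translate("AGAATTGGAGGACTTGATTGGGAAGAGGAATAG"))
```

The two fragments translate to `MDQHLHHP` and `RIGGLDWEEE*`, which agree with the start and end of the protein sequence listed above (the trailing `*` is the TAG stop codon).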
Homology
BLAST of HG10020798 vs. NCBI nr
Match: XP_038894607.1 (uncharacterized protein LOC120083122 [Benincasa hispida])

HSP 1 Score: 1609.3 bits (4166), Expect = 0.0e+00
Identity = 800/871 (91.85%), Positives = 826/871 (94.83%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLHHPQQWHPRPIQ TVCPICAM HFPFCPPHPSFNQNPRY FGPDPSFQTPGFDSH
Sbjct: 1   MDQHLHHPQQWHPRPIQPTVCPICAMSHFPFCPPHPSFNQNPRYSFGPDPSFQTPGFDSH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDA 120
           R  +GMPPPYMGNPDDGF DQRPW+RNSAN  GHVPFH HREGVFPPPYDYGGNEFV DA
Sbjct: 61  RSSMGMPPPYMGNPDDGFADQRPWMRNSANSYGHVPFHSHREGVFPPPYDYGGNEFVIDA 120

Query: 121 ERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLP 180
           ERSYKRPRVDDVGSDGVVHELN NQKSGRSS+EDERRLKLIRDHGVVSSG P GGSNSLP
Sbjct: 121 ERSYKRPRVDDVGSDGVVHELNHNQKSGRSSFEDERRLKLIRDHGVVSSGSPGGGSNSLP 180

Query: 181 RMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRID 240
           RMNLGSN+EANRRT ENSVGS D E+V STRILE+++FQDPGN  NDGRT++F ENGRID
Sbjct: 181 RMNLGSNTEANRRTPENSVGSGDSEDVRSTRILESSHFQDPGNAINDGRTRHFHENGRID 240

Query: 241 TRRLSQNEEFSHACYDQVGGHWRMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWMDDRN 300
            RR SQNEEFSHA YDQVGGHW MPHSVPPEATEDNYL+HRNE HYSD+R AFSWMDDRN
Sbjct: 241 ARRPSQNEEFSHARYDQVGGHWHMPHSVPPEATEDNYLTHRNEWHYSDDRQAFSWMDDRN 300

Query: 301 NSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH-SRNLNFGAGYAPRLSGGGRFLENG 360
           NSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH +RN+NFGAGYAPRLSGGGRFLENG
Sbjct: 301 NSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHHGTRNMNFGAGYAPRLSGGGRFLENG 360

Query: 361 SSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTSTITSA 420
           SS EDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPV+TSTITS+
Sbjct: 361 SSTEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVNTSTITSS 420

Query: 421 AYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKPKVID 480
           AYSS PEHRSFHHHKPM HVSSSPMMEDSLALHPYSKK+AADGK FGLNQ+PPQKP VID
Sbjct: 421 AYSSAPEHRSFHHHKPMSHVSSSPMMEDSLALHPYSKKFAADGKPFGLNQVPPQKPTVID 480

Query: 481 ASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTE 540
           ASHLFKLPHRS RPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTE
Sbjct: 481 ASHLFKLPHRSIRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTE 540

Query: 541 VEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIV 600
           VEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIV
Sbjct: 541 VEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIV 600

Query: 601 DDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQWEEA 660
           DDRNLRVADFAQFWAIAKSSGYEVYILEATYKDP GCAARNVHGFNLDDIQKMARQWEEA
Sbjct: 601 DDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPTGCAARNVHGFNLDDIQKMARQWEEA 660

Query: 661 LPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDS-PSFQETKSEKTALPPLRDYASE 720
            PLYLQLDIKSLCHGDDLKESGIQEVDMDME+EDDDS PSFQETKSEKT LPP+RD ASE
Sbjct: 661 PPLYLQLDIKSLCHGDDLKESGIQEVDMDMEDEDDDSLPSFQETKSEKTVLPPIRDDASE 720

Query: 721 DDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKEGKS 780
           DDEKRWDAEP HLR+EVKELGRSKWSNDLDDDDTER DGRNGH+NALSGLIQAYAKEGKS
Sbjct: 721 DDEKRWDAEPDHLREEVKELGRSKWSNDLDDDDTERIDGRNGHANALSGLIQAYAKEGKS 780

Query: 781 VRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLA-EEHRGSTQNSNESKKHSR 840
           VRWMDQ  NTGFSIGA KKANRLSLVIGPGAGYNL+SNPLA EE+RGSTQNSNE+KKHSR
Sbjct: 781 VRWMDQVGNTGFSIGATKKANRLSLVIGPGAGYNLESNPLAEEEYRGSTQNSNETKKHSR 840

Query: 841 FEERLRAESESFKVVFDKRRQRIGGLDWEEE 869
           FEERLRAES SFKVVFDKRRQRIGGLDWEE+
Sbjct: 841 FEERLRAESLSFKVVFDKRRQRIGGLDWEED 871

BLAST of HG10020798 vs. NCBI nr
Match: XP_008437571.1 (PREDICTED: uncharacterized protein LOC103482943 [Cucumis melo] >TYJ99101.1 uncharacterized protein E5676_scaffold248G003060 [Cucumis melo var. makuwa])

HSP 1 Score: 1587.0 bits (4108), Expect = 0.0e+00
Identity = 793/875 (90.63%), Positives = 823/875 (94.06%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLHHPQQWHPRPIQ T+CPIC MPHFPFCPPHPSFNQNPRYPFGPDPSFQ PGFDSH
Sbjct: 1   MDQHLHHPQQWHPRPIQPTLCPICTMPHFPFCPPHPSFNQNPRYPFGPDPSFQAPGFDSH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDA 120
           R P+ MPPPYM NPDDGF DQRPWIRNSAN  GHVPFHPHREG FPPPYDYGGNEFVND 
Sbjct: 61  RSPMRMPPPYMANPDDGFADQRPWIRNSANSYGHVPFHPHREGFFPPPYDYGGNEFVNDV 120

Query: 121 ERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLP 180
           ERSYKRPRVDDVGS+G VHELNQN  +GRSS+EDERRLKLIRDHG+VSSGPPEGGSNSLP
Sbjct: 121 ERSYKRPRVDDVGSEGGVHELNQN--TGRSSFEDERRLKLIRDHGIVSSGPPEGGSNSLP 180

Query: 181 RMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRID 240
           RMNLGSN EANRR+LENSVGS DPE+VGS+RILETNNFQDPGNG+N+GRTQ+F ENGR+D
Sbjct: 181 RMNLGSNGEANRRSLENSVGSGDPEDVGSSRILETNNFQDPGNGSNNGRTQHFHENGRVD 240

Query: 241 TRRLSQNEEFSHACYDQVGG-HW---RMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWM 300
            R  SQNEEFSHA YDQVGG HW    MPHSV PEATEDNYLSHR+ELHYSD+R AFSWM
Sbjct: 241 KRWPSQNEEFSHARYDQVGGSHWHAQHMPHSVHPEATEDNYLSHRHELHYSDDRQAFSWM 300

Query: 301 DDRNNSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH-SRNLNFGAGYAPRLSGGGRF 360
           D+RNNSKMN+LDRDY PPPRSEMN IHMRPFSSHGNAHH +RNLNFGAGYAPRLSGGGRF
Sbjct: 301 DERNNSKMNVLDRDYHPPPRSEMNPIHMRPFSSHGNAHHGTRNLNFGAGYAPRLSGGGRF 360

Query: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTST 420
           LENGSSIEDSRFFGEQPPLPASPPPPMPWE+HLHASAES+AYSSQAKP SLFPVPVSTST
Sbjct: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWESHLHASAESVAYSSQAKPPSLFPVPVSTST 420

Query: 421 ITSAAYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKP 480
           ITS+AYSS PEHRSFHHHKPMP VSSSPMMEDSLALHPYSKK+AADGK FG+NQLPPQK 
Sbjct: 421 ITSSAYSSAPEHRSFHHHKPMPRVSSSPMMEDSLALHPYSKKFAADGKPFGVNQLPPQKL 480

Query: 481 KVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540
           KVIDAS LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY
Sbjct: 481 KVIDASQLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540

Query: 541 FMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFT 600
           FMTEVEKV+EGDAKSSNS KGKKPI KKVMEYCYEP+MEEAYRSSMLKAFRKTLEEGIFT
Sbjct: 541 FMTEVEKVDEGDAKSSNSFKGKKPITKKVMEYCYEPQMEEAYRSSMLKAFRKTLEEGIFT 600

Query: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660
           FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ
Sbjct: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660

Query: 661 WEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDY 720
           WEEA PLYLQLDIKSLCHGDDLKESGIQEVDMDME+EDD SPSFQET SEKTALP LR  
Sbjct: 661 WEEAPPLYLQLDIKSLCHGDDLKESGIQEVDMDMEDEDDGSPSFQETISEKTALPSLRHD 720

Query: 721 ASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780
           ASEDDEKRWDAEP HLR+EVKELGRSKWSNDLDDDDTE+ DGRNGHSNALSGLIQAYAKE
Sbjct: 721 ASEDDEKRWDAEPDHLREEVKELGRSKWSNDLDDDDTEKIDGRNGHSNALSGLIQAYAKE 780

Query: 781 GKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLA-EEHRGSTQ-NSNESK 840
           GKSVRWMDQ  N+GFSIGAAKKANRLSLVIGPG GYNLKSNPLA EE+RGSTQ NSNESK
Sbjct: 781 GKSVRWMDQVRNSGFSIGAAKKANRLSLVIGPGPGYNLKSNPLAEEEYRGSTQNNSNESK 840

Query: 841 KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 869
           KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE
Sbjct: 841 KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 873

BLAST of HG10020798 vs. NCBI nr
Match: XP_011651180.1 (uncharacterized protein LOC101218580 [Cucumis sativus] >KGN64252.1 hypothetical protein Csa_013201 [Cucumis sativus])

HSP 1 Score: 1578.9 bits (4087), Expect = 0.0e+00
Identity = 790/874 (90.39%), Positives = 817/874 (93.48%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLHHPQQWHPRPIQ TVCPIC M HFPFCPPHPSFNQNPRYPFGPD SFQT GFDSH
Sbjct: 1   MDQHLHHPQQWHPRPIQPTVCPICTMSHFPFCPPHPSFNQNPRYPFGPDHSFQTSGFDSH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDA 120
           R P+ MPPPYM NPDDGF DQRPWIRNSAN  GHVPFHPHREG FPPPYDYGGNEFVNDA
Sbjct: 61  RSPMRMPPPYMANPDDGFADQRPWIRNSANSYGHVPFHPHREGFFPPPYDYGGNEFVNDA 120

Query: 121 ERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLP 180
           ERSYKRPRVDDVGS+G VHELNQNQ +GRSS+EDERRLKLIRDHG+V SGPPEGGSNSLP
Sbjct: 121 ERSYKRPRVDDVGSEGGVHELNQNQDTGRSSFEDERRLKLIRDHGIVPSGPPEGGSNSLP 180

Query: 181 RMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRID 240
           RMNLGSN EANRR+LENSVGS DPE+VGS+RILETNNF D GNG+N+GRTQ+F ENGRID
Sbjct: 181 RMNLGSNGEANRRSLENSVGSGDPEDVGSSRILETNNFHDSGNGSNNGRTQHFHENGRID 240

Query: 241 TRRLSQNEEFSHACYDQVGG-HW---RMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWM 300
            R  SQNEEFSHA YDQVGG HW     PHSV PEATEDNYL+HR+E+HYSD+R AFSW+
Sbjct: 241 KRWPSQNEEFSHARYDQVGGSHWHPQHKPHSVHPEATEDNYLAHRHEVHYSDDRQAFSWV 300

Query: 301 DDRNNSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH-SRNLNFGAGYAPRLSGGGRF 360
           D+RNNSKM + DRDYQPPPRSEMN IHMR FSSHGNAHH +RNLNFGAGYAPRLSGGGRF
Sbjct: 301 DERNNSKMAVFDRDYQPPPRSEMNPIHMRSFSSHGNAHHGTRNLNFGAGYAPRLSGGGRF 360

Query: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTST 420
           LENGSSIEDSRFF EQPPLPASPPPPMPWEAHLHASAES+AYSSQAKP SLFPVPVSTST
Sbjct: 361 LENGSSIEDSRFFCEQPPLPASPPPPMPWEAHLHASAESVAYSSQAKPPSLFPVPVSTST 420

Query: 421 ITSAAYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKP 480
           ITS+AYSS PEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKK+AADGK FGLNQLPPQKP
Sbjct: 421 ITSSAYSSAPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKFAADGKPFGLNQLPPQKP 480

Query: 481 KVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540
           KVIDAS LFK PHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY
Sbjct: 481 KVIDASQLFKPPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540

Query: 541 FMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFT 600
           FMTEVEKV+E DAKSSNSIKGKKPI KKVMEYCYEP+MEEAYRSSMLKAFRKTLEEGIFT
Sbjct: 541 FMTEVEKVDEVDAKSSNSIKGKKPITKKVMEYCYEPQMEEAYRSSMLKAFRKTLEEGIFT 600

Query: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660
           FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ
Sbjct: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660

Query: 661 WEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDY 720
           WEEA PLYLQLDIKSLCHGDDLKESGIQEVDMDME+EDD SPSFQET SEKTALP LR  
Sbjct: 661 WEEAPPLYLQLDIKSLCHGDDLKESGIQEVDMDMEDEDDGSPSFQETMSEKTALPSLRHD 720

Query: 721 ASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780
           ASEDDEKRWDAEP HLR+EVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE
Sbjct: 721 ASEDDEKRWDAEPDHLREEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780

Query: 781 GKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLA-EEHRGSTQNSNESKK 840
           GKSV WMDQ  NTGFSIGAAKKANRLSLVIGPG GYNLKSNPLA EE+RGSTQNSNESKK
Sbjct: 781 GKSVSWMDQVRNTGFSIGAAKKANRLSLVIGPGPGYNLKSNPLAEEEYRGSTQNSNESKK 840

Query: 841 HSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 869
           HSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE
Sbjct: 841 HSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 874

BLAST of HG10020798 vs. NCBI nr
Match: XP_023541377.1 (uncharacterized protein LOC111801581 [Cucurbita pepo subsp. pepo])

HSP 1 Score: 1395.6 bits (3611), Expect = 0.0e+00
Identity = 721/904 (79.76%), Positives = 756/904 (83.63%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLH+ QQW+ RPIQ TVCPICAMPHFPFCPPHPSFNQNPRYPFGPDP FQ PGFD H
Sbjct: 1   MDQHLHYQQQWNSRPIQGTVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPPFQRPGFDPH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVF-PPPYDYGGNEFVND 120
           R P+GMP P MGN DDGF DQRPWIRNSAN  GH+PF PHRE  F PPPYDYGGNEFVND
Sbjct: 61  RSPMGMPRPSMGNLDDGFADQRPWIRNSANSYGHLPFQPHREESFLPPPYDYGGNEFVND 120

Query: 121 AERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSL 180
           AERSYKRPRVDDVG DG VHE+NQNQKSGRSS+EDERRLKLIRDHGVVSSGP        
Sbjct: 121 AERSYKRPRVDDVGLDGGVHEVNQNQKSGRSSFEDERRLKLIRDHGVVSSGP-------- 180

Query: 181 PRMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENG-- 240
                         + ENSVGS DPEEVG+TR LE N+FQD GNG+NDGR+QNF + G  
Sbjct: 181 --------------SYENSVGSGDPEEVGTTRNLEINHFQDSGNGDNDGRSQNFHDEGNL 240

Query: 241 --------------------------RIDTRRLSQNEEFSHACYDQVGGHW---RMPHSV 300
                                     RID  R SQNEE SH+ YDQ GGHW    MP  V
Sbjct: 241 APAKQFQNGREGYWSDLKHAPAAPGNRIDPWRPSQNEELSHSRYDQGGGHWHAQHMPRPV 300

Query: 301 PPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNSKMNILDRDYQPPPRSEMNSIHMRPFS 360
           PPEA+ED+YLSHRNELHYSDN  AFSWMDDRNNSKMNILDRDY+PPPRSEMN  HMRPFS
Sbjct: 301 PPEASEDSYLSHRNELHYSDNPQAFSWMDDRNNSKMNILDRDYRPPPRSEMNPTHMRPFS 360

Query: 361 SHGNAHH-SRNLNFGAGYAPRLSGGGRFLENGSSIEDSRFFGEQPPLPASPPPPMPWEAH 420
           SHGNAHH +RN N+GAGYAPR SGG RF ENGSSIEDSRFF EQPPLP SPPPPMPWE  
Sbjct: 361 SHGNAHHGTRNFNYGAGYAPRHSGGVRFFENGSSIEDSRFFDEQPPLPTSPPPPMPWE-- 420

Query: 421 LHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYSSVPEHRSFHHHKPMPHVSSSPMMED 480
                        AKPSSLFPVPVS S ITS+ YSSVPEHRS HH KPM HVSSSPM ED
Sbjct: 421 -------------AKPSSLFPVPVSVSPITSSQYSSVPEHRSLHHLKPMFHVSSSPMTED 480

Query: 481 SLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSG 540
           SL +HPYSKK+AADGK +G+NQLP  KPKVIDASHLFK PHRSTRPDHIVVILRGLPGSG
Sbjct: 481 SLGVHPYSKKFAADGKPYGVNQLPLPKPKVIDASHLFKRPHRSTRPDHIVVILRGLPGSG 540

Query: 541 KSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEY 600
           KSYLAKMLRDVE+ENGGDAPRIHSMDDYFMTEVEKVEEGD  SSNS+KGKKPI+KKVMEY
Sbjct: 541 KSYLAKMLRDVEIENGGDAPRIHSMDDYFMTEVEKVEEGDTNSSNSVKGKKPIVKKVMEY 600

Query: 601 CYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660
           CYEPEMEEAYRSSMLKAFRKTLEEG+FTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE
Sbjct: 601 CYEPEMEEAYRSSMLKAFRKTLEEGVFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660

Query: 661 ATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDM 720
           ATY+DP GCAARNVHGFNLDDIQKMARQWEEA  LYLQLDIKSLCHGDDLKESGI+EVDM
Sbjct: 661 ATYRDPTGCAARNVHGFNLDDIQKMARQWEEAPALYLQLDIKSLCHGDDLKESGIKEVDM 720

Query: 721 DMEEEDDDSP-SFQETKSEKTALPPLRDYASEDDEKRWDAEPGHLRDEVKELGRSKWSND 780
           DME+EDDD+P SFQETKS KTAL P RD ASEDD KRWD E  H R+EVKELGRSKWSND
Sbjct: 721 DMEDEDDDTPSSFQETKSIKTALHPQRDDASEDDGKRWDEESDHRREEVKELGRSKWSND 780

Query: 781 LDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVRWMDQAVNTGFSIGAAKKANRLSLVIG 840
           LDDDDTERTDG NGH+NALSGLIQAYAKEGKSVRW+DQA  TGFSIGAAKKANRLSLVIG
Sbjct: 781 LDDDDTERTDGANGHANALSGLIQAYAKEGKSVRWIDQAGYTGFSIGAAKKANRLSLVIG 840

Query: 841 PGAGYNLKSNPLAEE--HRGSTQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 869
           PGAGYNLKSNPL EE  +RGS QNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD
Sbjct: 841 PGAGYNLKSNPLPEEYQYRGSNQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 867

BLAST of HG10020798 vs. NCBI nr
Match: XP_022994572.1 (uncharacterized protein LOC111490251 [Cucurbita maxima])

HSP 1 Score: 1388.6 bits (3593), Expect = 0.0e+00
Identity = 720/904 (79.65%), Positives = 754/904 (83.41%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLH+ QQW+ RPIQ TVCPICAMPHFPFCPPHPSFNQNPRYP GPDP FQ PGFD H
Sbjct: 1   MDQHLHYQQQWNYRPIQGTVCPICAMPHFPFCPPHPSFNQNPRYPLGPDPPFQRPGFDPH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVF-PPPYDYGGNEFVND 120
           R P+GMP P MGN DDGF DQRPWIRNSAN  GH+PF PHRE  F PPPYDYGGNEFVND
Sbjct: 61  RSPMGMPRPSMGNLDDGFADQRPWIRNSANSYGHLPFQPHREESFLPPPYDYGGNEFVND 120

Query: 121 AERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSL 180
           AERSYKRPRVDDVG DG VHELNQNQKSGRSS+EDERRLKLIRDHGVVSSGPP       
Sbjct: 121 AERSYKRPRVDDVGLDGGVHELNQNQKSGRSSFEDERRLKLIRDHGVVSSGPP------- 180

Query: 181 PRMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENG-- 240
                           ENSVGS DPEEVG+TR LE N+FQD GNG+NDGR QNF + G  
Sbjct: 181 ---------------YENSVGSGDPEEVGTTRNLEINHFQDSGNGDNDGRNQNFHDEGNL 240

Query: 241 --------------------------RIDTRRLSQNEEFSHACYDQVGGHW---RMPHSV 300
                                     RID  R SQNEE SH+ YDQ G HW    MP  V
Sbjct: 241 APAKQFQNGREGYWSDLKHAPVAPGNRIDPWRPSQNEELSHSRYDQGGSHWHAQHMPRPV 300

Query: 301 PPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNSKMNILDRDYQPPPRSEMNSIHMRPFS 360
           PPEA+ED+YLSHRNELHYSDN  AFSWMDDRNNSKMNILDRDY+PPPRSEMN  HMRPFS
Sbjct: 301 PPEASEDSYLSHRNELHYSDNPQAFSWMDDRNNSKMNILDRDYRPPPRSEMNPTHMRPFS 360

Query: 361 SHGNAHH-SRNLNFGAGYAPRLSGGGRFLENGSSIEDSRFFGEQPPLPASPPPPMPWEAH 420
           SHGNAHH +R+ N+ AGYAPR SGG RF ENGSSIEDSRFF EQPPLP SPPPPMPWE  
Sbjct: 361 SHGNAHHGTRSFNYSAGYAPRHSGGVRFFENGSSIEDSRFFDEQPPLPTSPPPPMPWE-- 420

Query: 421 LHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYSSVPEHRSFHHHKPMPHVSSSPMMED 480
                        AKPSSLFPVPVS S ITS+AYSSVPEHRSFHH KPM HVSSSPM ED
Sbjct: 421 -------------AKPSSLFPVPVSVSPITSSAYSSVPEHRSFHHLKPMFHVSSSPMTED 480

Query: 481 SLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSG 540
           SLA+HPYSKK+AADGK +GLN LP  KPK+IDASHLFK PHRSTRPDHIVVILRGLPGSG
Sbjct: 481 SLAVHPYSKKFAADGKPYGLN-LPSPKPKIIDASHLFKRPHRSTRPDHIVVILRGLPGSG 540

Query: 541 KSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEY 600
           KSYLAKMLRDVE++NGGDAPRIHSMDDYFMTEVEKVEEGD  SSNS+KGKKPI+KKVMEY
Sbjct: 541 KSYLAKMLRDVEIDNGGDAPRIHSMDDYFMTEVEKVEEGDTNSSNSVKGKKPIVKKVMEY 600

Query: 601 CYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660
           CYEPEMEEAYRSSMLKAFRKTLEEG+FTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE
Sbjct: 601 CYEPEMEEAYRSSMLKAFRKTLEEGVFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660

Query: 661 ATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDM 720
           ATY+DP GCAARNVHGFNLDDIQKMARQWEEA  LYLQLDIKSLCHGDDLKESGI+EVDM
Sbjct: 661 ATYRDPTGCAARNVHGFNLDDIQKMARQWEEAPALYLQLDIKSLCHGDDLKESGIKEVDM 720

Query: 721 DMEEEDDDSP-SFQETKSEKTALPPLRDYASEDDEKRWDAEPGHLRDEVKELGRSKWSND 780
           DME+EDDD+P SFQETKS KTAL P RD ASEDD KRWD E  H R+EVKELGRSKWSND
Sbjct: 721 DMEDEDDDTPSSFQETKSIKTALHPQRDDASEDDGKRWDEESDHRREEVKELGRSKWSND 780

Query: 781 LDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVRWMDQAVNTGFSIGAAKKANRLSLVIG 840
           LDDDDTERTDG NGH+NALSGLIQAYAKEGKSVRW+DQA  TGFSIGAAKKANRLSLVIG
Sbjct: 781 LDDDDTERTDGANGHANALSGLIQAYAKEGKSVRWIDQAGYTGFSIGAAKKANRLSLVIG 840

Query: 841 PGAGYNLKSNPLAEE--HRGSTQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 869
           PGAGYNLKSNPL EE  +RGS QNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD
Sbjct: 841 PGAGYNLKSNPLPEEYQYRGSNQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 866

BLAST of HG10020798 vs. ExPASy Swiss-Prot
Match: P49750 (YLP motif-containing protein 1 OS=Homo sapiens OX=9606 GN=YLPM1 PE=1 SV=4)

HSP 1 Score: 212.6 bits (540), Expect = 1.8e-53
Identity = 116/248 (46.77%), Positives = 162/248 (65.32%), Query Frame = 0

Query: 468  NQLPP-----QKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVEN 527
            +Q PP     +KP+  +   + K P R +RP+ IVVI+RGLPGSGK+++AK++RD EVE 
Sbjct: 1802 HQPPPAPRVEKKPESKNVDDILKPPGRESRPERIVVIMRGLPGSGKTHVAKLIRDKEVEF 1861

Query: 528  GGDAPRIHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSML 587
            GG APR+ S+DDYF+TEVEK EE D  S   +K      KKVMEY YE EMEE YR+SM 
Sbjct: 1862 GGPAPRVLSLDDYFITEVEK-EEKDPDSGKKVK------KKVMEYEYEAEMEETYRTSMF 1921

Query: 588  KAFRKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVH 647
            K F+KTL++G F F+I+D  N RV  F QFW+ AK+ G+EVY+ E +  D   C  RN+H
Sbjct: 1922 KTFKKTLDDGFFPFIILDAINDRVRHFDQFWSAAKTKGFEVYLAEMS-ADNQTCGKRNIH 1981

Query: 648  GFNLDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQET 707
            G  L +I KMA  WE A    ++LDI+SL     ++E  +++ D ++EE+ ++    +E 
Sbjct: 1982 GRKLKEINKMADHWETAPRHMMRLDIRSLLQDAAIEEVEMEDFDANIEEQKEEKKDAEEE 2041

Query: 708  KSEKTALP 711
            +SE   +P
Sbjct: 2042 ESELGYIP 2041

BLAST of HG10020798 vs. ExPASy Swiss-Prot
Match: Q9R0I7 (YLP motif-containing protein 1 OS=Mus musculus OX=10090 GN=Ylpm1 PE=2 SV=2)

HSP 1 Score: 207.6 bits (527), Expect = 5.6e-52
Identity = 120/287 (41.81%), Positives = 172/287 (59.93%), Query Frame = 0

Query: 429  SFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPP-----QKPKVIDASHL 488
            S+      P     PM       +P  ++      A G +Q PP     +KP+  +   +
Sbjct: 1005 SYDRKSDRPPYEGPPMFGGERRTYP-EERMPLPAPALG-HQPPPVPRVEKKPESKNVDDI 1064

Query: 489  FKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKV 548
             K P R +RP+ IVVI+RGLPGSGK+++AK++RD EVE GG APR+ S+DDYF+ EVEK 
Sbjct: 1065 LKPPGRESRPERIVVIMRGLPGSGKTHVAKLIRDKEVEFGGPAPRVLSLDDYFIAEVEK- 1124

Query: 549  EEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRN 608
            EE D  S   +K      KKVMEY YE +MEE YR+SM K F+KTL++G F F+I+D  N
Sbjct: 1125 EEKDPDSGKKVK------KKVMEYEYEADMEETYRTSMFKTFKKTLDDGFFPFIILDAIN 1184

Query: 609  LRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLY 668
             RV  F QFW+ AK+ G+EVY+ E +  D   C  RN+HG  L +I KMA  WE A    
Sbjct: 1185 DRVRHFDQFWSAAKTKGFEVYLAEMS-ADNQTCGKRNIHGRKLKEINKMAEHWEVAPRHM 1244

Query: 669  LQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALP 711
            ++LDI+SL     ++E  +++ D ++E++ ++    +E +SE   +P
Sbjct: 1245 MRLDIRSLLQDAAIEEVEMEDFDANIEDQKEEKKDAEEEESELGYIP 1281

BLAST of HG10020798 vs. ExPASy Swiss-Prot
Match: P0CB49 (YLP motif-containing protein 1 OS=Rattus norvegicus OX=10116 GN=Ylpm1 PE=1 SV=1)

HSP 1 Score: 206.8 bits (525), Expect = 9.6e-52
Identity = 119/287 (41.46%), Positives = 172/287 (59.93%), Query Frame = 0

Query: 429  SFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPP-----QKPKVIDASHL 488
            S+      P     PM       +P  ++      + G +Q PP     +KP+  +   +
Sbjct: 995  SYDRKSDRPPYEGPPMFGGERRTYP-EERMPLPAPSLG-HQPPPVPRVEKKPESKNVDDI 1054

Query: 489  FKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKV 548
             K P R +RP+ IVVI+RGLPGSGK+++AK++RD EVE GG APR+ S+DDYF+ EVEK 
Sbjct: 1055 LKPPGRESRPERIVVIMRGLPGSGKTHVAKLIRDKEVEFGGPAPRVLSLDDYFIAEVEK- 1114

Query: 549  EEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRN 608
            EE D  S   +K      KKVMEY YE +MEE YR+SM K F+KTL++G F F+I+D  N
Sbjct: 1115 EEKDPDSGKKVK------KKVMEYEYEADMEETYRTSMFKTFKKTLDDGFFPFIILDAIN 1174

Query: 609  LRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLY 668
             RV  F QFW+ AK+ G+EVY+ E +  D   C  RN+HG  L +I KMA  WE A    
Sbjct: 1175 DRVRHFDQFWSAAKTKGFEVYLAEMS-ADNQTCGKRNIHGRKLKEINKMAEHWEAAPRHM 1234

Query: 669  LQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALP 711
            ++LDI+SL     ++E  +++ D ++E++ ++    +E +SE   +P
Sbjct: 1235 MRLDIRSLLQDAAIEEVEMEDFDANIEDQKEEKKDAEEEESELGYIP 1271

BLAST of HG10020798 vs. ExPASy Swiss-Prot
Match: Q5TBK1 (NEDD4-binding protein 2-like 1 OS=Homo sapiens OX=9606 GN=N4BP2L1 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 4.2e-07
Identity = 52/177 (29.38%), Positives = 83/177 (46.89%), Query Frame = 0

Query: 487 PHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPR--IHSMDDYFMTEVEKVE 546
           P R +   H+  +LRGLPGSGK+ LA+ L+        D PR  I S DD+F       E
Sbjct: 35  PRRHSFRKHL-YLLRGLPGSGKTTLARQLQH-------DFPRALIFSTDDFFFR-----E 94

Query: 547 EGDAKSSNSIKGKKPIMKKVMEYCYEPE-MEEAYRSSMLKAFRKTLEEGIFTFVIVDDRN 606
           +G                    Y + P+ +EEA+  +  +A RK +  GI + +I+D+ N
Sbjct: 95  DG-------------------AYEFNPDFLEEAHEWNQKRA-RKAMRNGI-SPIIIDNTN 154

Query: 607 LRVADFAQFWAIAKSSGYEVYILEATYK---DPAGCAARNVHGFNLDDIQKMARQWE 658
           L   +   +  +A  + YEV   E   +   +    A RN+HG + + I +M  ++E
Sbjct: 155 LHAWEMKPYAVMALENNYEVIFREPDTRWKFNVQELARRNIHGVSREKIHRMKERYE 177

BLAST of HG10020798 vs. ExPASy Swiss-Prot
Match: Q3V2Q8 (NEDD4-binding protein 2-like 1 OS=Mus musculus OX=10090 GN=N4bp2l1 PE=2 SV=1)

HSP 1 Score: 58.5 bits (140), Expect = 4.2e-07
Identity = 57/195 (29.23%), Positives = 87/195 (44.62%), Query Frame = 0

Query: 469 QLPPQKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPR 528
           Q PP +P           P R +   H+  +LRGLPGSGK+ LA+ L+        D PR
Sbjct: 19  QQPPPRPPPARGPP----PRRHSFRKHL-YLLRGLPGSGKTTLARQLQH-------DYPR 78

Query: 529 --IHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPE-MEEAYRSSMLKAF 588
             I S DD+F       E+G                    Y + P  +EEA+  +  +A 
Sbjct: 79  ALIFSTDDFFFK-----EDG-------------------TYEFNPNLLEEAHEWNQRRA- 138

Query: 589 RKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYK---DPAGCAARNVH 648
           RK +  GI + +I+D+ NL   +   +  +A  + YEV   E   +   +    A RN+H
Sbjct: 139 RKAMRNGI-SPIIIDNTNLHAWEMKPYAVMALENNYEVIFREPDTRWKFNVQELARRNIH 175

Query: 649 GFNLDDIQKMARQWE 658
           G   + IQ+M  ++E
Sbjct: 199 GVPKEKIQRMKERYE 175
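
The percentages on each "Identity = ..." summary line can be rechecked from the raw match counts. The following is a minimal Python sketch, not part of the database output. The names `parse_hsp_summary` and `recomputed_percent` are hypothetical helpers introduced here for illustration, and the pattern assumes the summary line has exactly the layout shown in this report.

```python
import re

# Assumed layout of an HSP summary line in this report. The pattern also
# tolerates the spelling variant "Postives" that appears in some exports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def recomputed_percent(matches, length):
    """Return matches/length as a percentage rounded to two decimals."""
    return round(100.0 * matches / length, 2)


def parse_hsp_summary(line):
    """Extract identity and positive counts from one HSP summary line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "align_length": int(ident_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
        "identity_pct": recomputed_percent(int(ident), int(ident_len)),
        "positive_pct": recomputed_percent(int(pos), int(pos_len)),
    }


if __name__ == "__main__":
    # Summary line of the P49750 (human YLPM1) hit from this report.
    hit = parse_hsp_summary(
        "Identity = 116/248 (46.77%), Positives = 162/248 (65.32%), Query Frame = 0"
    )
    print(hit["identity_pct"], hit["positive_pct"])
```

Comparing `identity_pct` with `reported_identity_pct` is a quick way to check that a copied summary line has not been altered.
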

BLAST of HG10020798 vs. ExPASy TrEMBL
Match: A0A5D3BK41 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold248G003060 PE=4 SV=1)

HSP 1 Score: 1587.0 bits (4108), Expect = 0.0e+00
Identity = 793/875 (90.63%), Positives = 823/875 (94.06%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLHHPQQWHPRPIQ T+CPIC MPHFPFCPPHPSFNQNPRYPFGPDPSFQ PGFDSH
Sbjct: 1   MDQHLHHPQQWHPRPIQPTLCPICTMPHFPFCPPHPSFNQNPRYPFGPDPSFQAPGFDSH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDA 120
           R P+ MPPPYM NPDDGF DQRPWIRNSAN  GHVPFHPHREG FPPPYDYGGNEFVND 
Sbjct: 61  RSPMRMPPPYMANPDDGFADQRPWIRNSANSYGHVPFHPHREGFFPPPYDYGGNEFVNDV 120

Query: 121 ERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLP 180
           ERSYKRPRVDDVGS+G VHELNQN  +GRSS+EDERRLKLIRDHG+VSSGPPEGGSNSLP
Sbjct: 121 ERSYKRPRVDDVGSEGGVHELNQN--TGRSSFEDERRLKLIRDHGIVSSGPPEGGSNSLP 180

Query: 181 RMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRID 240
           RMNLGSN EANRR+LENSVGS DPE+VGS+RILETNNFQDPGNG+N+GRTQ+F ENGR+D
Sbjct: 181 RMNLGSNGEANRRSLENSVGSGDPEDVGSSRILETNNFQDPGNGSNNGRTQHFHENGRVD 240

Query: 241 TRRLSQNEEFSHACYDQVGG-HW---RMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWM 300
            R  SQNEEFSHA YDQVGG HW    MPHSV PEATEDNYLSHR+ELHYSD+R AFSWM
Sbjct: 241 KRWPSQNEEFSHARYDQVGGSHWHAQHMPHSVHPEATEDNYLSHRHELHYSDDRQAFSWM 300

Query: 301 DDRNNSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH-SRNLNFGAGYAPRLSGGGRF 360
           D+RNNSKMN+LDRDY PPPRSEMN IHMRPFSSHGNAHH +RNLNFGAGYAPRLSGGGRF
Sbjct: 301 DERNNSKMNVLDRDYHPPPRSEMNPIHMRPFSSHGNAHHGTRNLNFGAGYAPRLSGGGRF 360

Query: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTST 420
           LENGSSIEDSRFFGEQPPLPASPPPPMPWE+HLHASAES+AYSSQAKP SLFPVPVSTST
Sbjct: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWESHLHASAESVAYSSQAKPPSLFPVPVSTST 420

Query: 421 ITSAAYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKP 480
           ITS+AYSS PEHRSFHHHKPMP VSSSPMMEDSLALHPYSKK+AADGK FG+NQLPPQK 
Sbjct: 421 ITSSAYSSAPEHRSFHHHKPMPRVSSSPMMEDSLALHPYSKKFAADGKPFGVNQLPPQKL 480

Query: 481 KVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540
           KVIDAS LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY
Sbjct: 481 KVIDASQLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540

Query: 541 FMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFT 600
           FMTEVEKV+EGDAKSSNS KGKKPI KKVMEYCYEP+MEEAYRSSMLKAFRKTLEEGIFT
Sbjct: 541 FMTEVEKVDEGDAKSSNSFKGKKPITKKVMEYCYEPQMEEAYRSSMLKAFRKTLEEGIFT 600

Query: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660
           FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ
Sbjct: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660

Query: 661 WEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDY 720
           WEEA PLYLQLDIKSLCHGDDLKESGIQEVDMDME+EDD SPSFQET SEKTALP LR  
Sbjct: 661 WEEAPPLYLQLDIKSLCHGDDLKESGIQEVDMDMEDEDDGSPSFQETISEKTALPSLRHD 720

Query: 721 ASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780
           ASEDDEKRWDAEP HLR+EVKELGRSKWSNDLDDDDTE+ DGRNGHSNALSGLIQAYAKE
Sbjct: 721 ASEDDEKRWDAEPDHLREEVKELGRSKWSNDLDDDDTEKIDGRNGHSNALSGLIQAYAKE 780

Query: 781 GKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLA-EEHRGSTQ-NSNESK 840
           GKSVRWMDQ  N+GFSIGAAKKANRLSLVIGPG GYNLKSNPLA EE+RGSTQ NSNESK
Sbjct: 781 GKSVRWMDQVRNSGFSIGAAKKANRLSLVIGPGPGYNLKSNPLAEEEYRGSTQNNSNESK 840

Query: 841 KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 869
           KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE
Sbjct: 841 KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 873

BLAST of HG10020798 vs. ExPASy TrEMBL
Match: A0A1S3AUX6 (uncharacterized protein LOC103482943 OS=Cucumis melo OX=3656 GN=LOC103482943 PE=4 SV=1)

HSP 1 Score: 1587.0 bits (4108), Expect = 0.0e+00
Identity = 793/875 (90.63%), Positives = 823/875 (94.06%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLHHPQQWHPRPIQ T+CPIC MPHFPFCPPHPSFNQNPRYPFGPDPSFQ PGFDSH
Sbjct: 1   MDQHLHHPQQWHPRPIQPTLCPICTMPHFPFCPPHPSFNQNPRYPFGPDPSFQAPGFDSH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDA 120
           R P+ MPPPYM NPDDGF DQRPWIRNSAN  GHVPFHPHREG FPPPYDYGGNEFVND 
Sbjct: 61  RSPMRMPPPYMANPDDGFADQRPWIRNSANSYGHVPFHPHREGFFPPPYDYGGNEFVNDV 120

Query: 121 ERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLP 180
           ERSYKRPRVDDVGS+G VHELNQN  +GRSS+EDERRLKLIRDHG+VSSGPPEGGSNSLP
Sbjct: 121 ERSYKRPRVDDVGSEGGVHELNQN--TGRSSFEDERRLKLIRDHGIVSSGPPEGGSNSLP 180

Query: 181 RMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRID 240
           RMNLGSN EANRR+LENSVGS DPE+VGS+RILETNNFQDPGNG+N+GRTQ+F ENGR+D
Sbjct: 181 RMNLGSNGEANRRSLENSVGSGDPEDVGSSRILETNNFQDPGNGSNNGRTQHFHENGRVD 240

Query: 241 TRRLSQNEEFSHACYDQVGG-HW---RMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWM 300
            R  SQNEEFSHA YDQVGG HW    MPHSV PEATEDNYLSHR+ELHYSD+R AFSWM
Sbjct: 241 KRWPSQNEEFSHARYDQVGGSHWHAQHMPHSVHPEATEDNYLSHRHELHYSDDRQAFSWM 300

Query: 301 DDRNNSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH-SRNLNFGAGYAPRLSGGGRF 360
           D+RNNSKMN+LDRDY PPPRSEMN IHMRPFSSHGNAHH +RNLNFGAGYAPRLSGGGRF
Sbjct: 301 DERNNSKMNVLDRDYHPPPRSEMNPIHMRPFSSHGNAHHGTRNLNFGAGYAPRLSGGGRF 360

Query: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTST 420
           LENGSSIEDSRFFGEQPPLPASPPPPMPWE+HLHASAES+AYSSQAKP SLFPVPVSTST
Sbjct: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWESHLHASAESVAYSSQAKPPSLFPVPVSTST 420

Query: 421 ITSAAYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKP 480
           ITS+AYSS PEHRSFHHHKPMP VSSSPMMEDSLALHPYSKK+AADGK FG+NQLPPQK 
Sbjct: 421 ITSSAYSSAPEHRSFHHHKPMPRVSSSPMMEDSLALHPYSKKFAADGKPFGVNQLPPQKL 480

Query: 481 KVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540
           KVIDAS LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY
Sbjct: 481 KVIDASQLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540

Query: 541 FMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFT 600
           FMTEVEKV+EGDAKSSNS KGKKPI KKVMEYCYEP+MEEAYRSSMLKAFRKTLEEGIFT
Sbjct: 541 FMTEVEKVDEGDAKSSNSFKGKKPITKKVMEYCYEPQMEEAYRSSMLKAFRKTLEEGIFT 600

Query: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660
           FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ
Sbjct: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660

Query: 661 WEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDY 720
           WEEA PLYLQLDIKSLCHGDDLKESGIQEVDMDME+EDD SPSFQET SEKTALP LR  
Sbjct: 661 WEEAPPLYLQLDIKSLCHGDDLKESGIQEVDMDMEDEDDGSPSFQETISEKTALPSLRHD 720

Query: 721 ASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780
           ASEDDEKRWDAEP HLR+EVKELGRSKWSNDLDDDDTE+ DGRNGHSNALSGLIQAYAKE
Sbjct: 721 ASEDDEKRWDAEPDHLREEVKELGRSKWSNDLDDDDTEKIDGRNGHSNALSGLIQAYAKE 780

Query: 781 GKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLA-EEHRGSTQ-NSNESK 840
           GKSVRWMDQ  N+GFSIGAAKKANRLSLVIGPG GYNLKSNPLA EE+RGSTQ NSNESK
Sbjct: 781 GKSVRWMDQVRNSGFSIGAAKKANRLSLVIGPGPGYNLKSNPLAEEEYRGSTQNNSNESK 840

Query: 841 KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 869
           KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE
Sbjct: 841 KHSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 873

BLAST of HG10020798 vs. ExPASy TrEMBL
Match: A0A0A0LTZ2 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G045500 PE=4 SV=1)

HSP 1 Score: 1578.9 bits (4087), Expect = 0.0e+00
Identity = 790/874 (90.39%), Positives = 817/874 (93.48%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLHHPQQWHPRPIQ TVCPIC M HFPFCPPHPSFNQNPRYPFGPD SFQT GFDSH
Sbjct: 1   MDQHLHHPQQWHPRPIQPTVCPICTMSHFPFCPPHPSFNQNPRYPFGPDHSFQTSGFDSH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDA 120
           R P+ MPPPYM NPDDGF DQRPWIRNSAN  GHVPFHPHREG FPPPYDYGGNEFVNDA
Sbjct: 61  RSPMRMPPPYMANPDDGFADQRPWIRNSANSYGHVPFHPHREGFFPPPYDYGGNEFVNDA 120

Query: 121 ERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLP 180
           ERSYKRPRVDDVGS+G VHELNQNQ +GRSS+EDERRLKLIRDHG+V SGPPEGGSNSLP
Sbjct: 121 ERSYKRPRVDDVGSEGGVHELNQNQDTGRSSFEDERRLKLIRDHGIVPSGPPEGGSNSLP 180

Query: 181 RMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRID 240
           RMNLGSN EANRR+LENSVGS DPE+VGS+RILETNNF D GNG+N+GRTQ+F ENGRID
Sbjct: 181 RMNLGSNGEANRRSLENSVGSGDPEDVGSSRILETNNFHDSGNGSNNGRTQHFHENGRID 240

Query: 241 TRRLSQNEEFSHACYDQVGG-HW---RMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWM 300
            R  SQNEEFSHA YDQVGG HW     PHSV PEATEDNYL+HR+E+HYSD+R AFSW+
Sbjct: 241 KRWPSQNEEFSHARYDQVGGSHWHPQHKPHSVHPEATEDNYLAHRHEVHYSDDRQAFSWV 300

Query: 301 DDRNNSKMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHH-SRNLNFGAGYAPRLSGGGRF 360
           D+RNNSKM + DRDYQPPPRSEMN IHMR FSSHGNAHH +RNLNFGAGYAPRLSGGGRF
Sbjct: 301 DERNNSKMAVFDRDYQPPPRSEMNPIHMRSFSSHGNAHHGTRNLNFGAGYAPRLSGGGRF 360

Query: 361 LENGSSIEDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTST 420
           LENGSSIEDSRFF EQPPLPASPPPPMPWEAHLHASAES+AYSSQAKP SLFPVPVSTST
Sbjct: 361 LENGSSIEDSRFFCEQPPLPASPPPPMPWEAHLHASAESVAYSSQAKPPSLFPVPVSTST 420

Query: 421 ITSAAYSSVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKP 480
           ITS+AYSS PEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKK+AADGK FGLNQLPPQKP
Sbjct: 421 ITSSAYSSAPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKFAADGKPFGLNQLPPQKP 480

Query: 481 KVIDASHLFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540
           KVIDAS LFK PHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY
Sbjct: 481 KVIDASQLFKPPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDY 540

Query: 541 FMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFT 600
           FMTEVEKV+E DAKSSNSIKGKKPI KKVMEYCYEP+MEEAYRSSMLKAFRKTLEEGIFT
Sbjct: 541 FMTEVEKVDEVDAKSSNSIKGKKPITKKVMEYCYEPQMEEAYRSSMLKAFRKTLEEGIFT 600

Query: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660
           FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ
Sbjct: 601 FVIVDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQ 660

Query: 661 WEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDY 720
           WEEA PLYLQLDIKSLCHGDDLKESGIQEVDMDME+EDD SPSFQET SEKTALP LR  
Sbjct: 661 WEEAPPLYLQLDIKSLCHGDDLKESGIQEVDMDMEDEDDGSPSFQETMSEKTALPSLRHD 720

Query: 721 ASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780
           ASEDDEKRWDAEP HLR+EVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE
Sbjct: 721 ASEDDEKRWDAEPDHLREEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKE 780

Query: 781 GKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPLA-EEHRGSTQNSNESKK 840
           GKSV WMDQ  NTGFSIGAAKKANRLSLVIGPG GYNLKSNPLA EE+RGSTQNSNESKK
Sbjct: 781 GKSVSWMDQVRNTGFSIGAAKKANRLSLVIGPGPGYNLKSNPLAEEEYRGSTQNSNESKK 840

Query: 841 HSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 869
           HSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE
Sbjct: 841 HSRFEERLRAESESFKVVFDKRRQRIGGLDWEEE 874

BLAST of HG10020798 vs. ExPASy TrEMBL
Match: A0A6J1JW85 (uncharacterized protein LOC111490251 OS=Cucurbita maxima OX=3661 GN=LOC111490251 PE=4 SV=1)

HSP 1 Score: 1388.6 bits (3593), Expect = 0.0e+00
Identity = 720/904 (79.65%), Positives = 754/904 (83.41%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLH+ QQW+ RPIQ TVCPICAMPHFPFCPPHPSFNQNPRYP GPDP FQ PGFD H
Sbjct: 1   MDQHLHYQQQWNYRPIQGTVCPICAMPHFPFCPPHPSFNQNPRYPLGPDPPFQRPGFDPH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVF-PPPYDYGGNEFVND 120
           R P+GMP P MGN DDGF DQRPWIRNSAN  GH+PF PHRE  F PPPYDYGGNEFVND
Sbjct: 61  RSPMGMPRPSMGNLDDGFADQRPWIRNSANSYGHLPFQPHREESFLPPPYDYGGNEFVND 120

Query: 121 AERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSL 180
           AERSYKRPRVDDVG DG VHELNQNQKSGRSS+EDERRLKLIRDHGVVSSGPP       
Sbjct: 121 AERSYKRPRVDDVGLDGGVHELNQNQKSGRSSFEDERRLKLIRDHGVVSSGPP------- 180

Query: 181 PRMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENG-- 240
                           ENSVGS DPEEVG+TR LE N+FQD GNG+NDGR QNF + G  
Sbjct: 181 ---------------YENSVGSGDPEEVGTTRNLEINHFQDSGNGDNDGRNQNFHDEGNL 240

Query: 241 --------------------------RIDTRRLSQNEEFSHACYDQVGGHW---RMPHSV 300
                                     RID  R SQNEE SH+ YDQ G HW    MP  V
Sbjct: 241 APAKQFQNGREGYWSDLKHAPVAPGNRIDPWRPSQNEELSHSRYDQGGSHWHAQHMPRPV 300

Query: 301 PPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNSKMNILDRDYQPPPRSEMNSIHMRPFS 360
           PPEA+ED+YLSHRNELHYSDN  AFSWMDDRNNSKMNILDRDY+PPPRSEMN  HMRPFS
Sbjct: 301 PPEASEDSYLSHRNELHYSDNPQAFSWMDDRNNSKMNILDRDYRPPPRSEMNPTHMRPFS 360

Query: 361 SHGNAHH-SRNLNFGAGYAPRLSGGGRFLENGSSIEDSRFFGEQPPLPASPPPPMPWEAH 420
           SHGNAHH +R+ N+ AGYAPR SGG RF ENGSSIEDSRFF EQPPLP SPPPPMPWE  
Sbjct: 361 SHGNAHHGTRSFNYSAGYAPRHSGGVRFFENGSSIEDSRFFDEQPPLPTSPPPPMPWE-- 420

Query: 421 LHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYSSVPEHRSFHHHKPMPHVSSSPMMED 480
                        AKPSSLFPVPVS S ITS+AYSSVPEHRSFHH KPM HVSSSPM ED
Sbjct: 421 -------------AKPSSLFPVPVSVSPITSSAYSSVPEHRSFHHLKPMFHVSSSPMTED 480

Query: 481 SLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSG 540
           SLA+HPYSKK+AADGK +GLN LP  KPK+IDASHLFK PHRSTRPDHIVVILRGLPGSG
Sbjct: 481 SLAVHPYSKKFAADGKPYGLN-LPSPKPKIIDASHLFKRPHRSTRPDHIVVILRGLPGSG 540

Query: 541 KSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEY 600
           KSYLAKMLRDVE++NGGDAPRIHSMDDYFMTEVEKVEEGD  SSNS+KGKKPI+KKVMEY
Sbjct: 541 KSYLAKMLRDVEIDNGGDAPRIHSMDDYFMTEVEKVEEGDTNSSNSVKGKKPIVKKVMEY 600

Query: 601 CYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660
           CYEPEMEEAYRSSMLKAFRKTLEEG+FTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE
Sbjct: 601 CYEPEMEEAYRSSMLKAFRKTLEEGVFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660

Query: 661 ATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDM 720
           ATY+DP GCAARNVHGFNLDDIQKMARQWEEA  LYLQLDIKSLCHGDDLKESGI+EVDM
Sbjct: 661 ATYRDPTGCAARNVHGFNLDDIQKMARQWEEAPALYLQLDIKSLCHGDDLKESGIKEVDM 720

Query: 721 DMEEEDDDSP-SFQETKSEKTALPPLRDYASEDDEKRWDAEPGHLRDEVKELGRSKWSND 780
           DME+EDDD+P SFQETKS KTAL P RD ASEDD KRWD E  H R+EVKELGRSKWSND
Sbjct: 721 DMEDEDDDTPSSFQETKSIKTALHPQRDDASEDDGKRWDEESDHRREEVKELGRSKWSND 780

Query: 781 LDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVRWMDQAVNTGFSIGAAKKANRLSLVIG 840
           LDDDDTERTDG NGH+NALSGLIQAYAKEGKSVRW+DQA  TGFSIGAAKKANRLSLVIG
Sbjct: 781 LDDDDTERTDGANGHANALSGLIQAYAKEGKSVRWIDQAGYTGFSIGAAKKANRLSLVIG 840

Query: 841 PGAGYNLKSNPLAEE--HRGSTQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 869
           PGAGYNLKSNPL EE  +RGS QNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD
Sbjct: 841 PGAGYNLKSNPLPEEYQYRGSNQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 866

BLAST of HG10020798 vs. ExPASy TrEMBL
Match: A0A6J1GTW4 (uncharacterized protein LOC111457077 OS=Cucurbita moschata OX=3662 GN=LOC111457077 PE=4 SV=1)

HSP 1 Score: 1388.2 bits (3592), Expect = 0.0e+00
Identity = 720/904 (79.65%), Positives = 753/904 (83.30%), Query Frame = 0

Query: 1   MDQHLHHPQQWHPRPIQATVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPSFQTPGFDSH 60
           MDQHLH+ QQW+ RPIQ TVCPICAMPHFPFCPPHPSFNQNPRYPFGPDP FQ PGFD H
Sbjct: 1   MDQHLHYQQQWNSRPIQGTVCPICAMPHFPFCPPHPSFNQNPRYPFGPDPPFQRPGFDPH 60

Query: 61  RPPVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVF-PPPYDYGGNEFVND 120
           R P+GMP P MGN DDGF DQRPWIRNSA   GH+PF  HRE  F PP YDYGGNEFVND
Sbjct: 61  RSPMGMPRPSMGNLDDGFADQRPWIRNSAISYGHLPFQAHREESFLPPQYDYGGNEFVND 120

Query: 121 AERSYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSL 180
           AERSYKRPRVDDVG DG VHE+NQNQKSGRSS+EDERRLKLIRDHGVVSSGP        
Sbjct: 121 AERSYKRPRVDDVGLDGGVHEVNQNQKSGRSSFEDERRLKLIRDHGVVSSGP-------- 180

Query: 181 PRMNLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENG-- 240
                           ENSVGS DPEEVG+TR LE N+FQD GNG+NDGR+QNF + G  
Sbjct: 181 --------------AYENSVGSGDPEEVGTTRNLEINHFQDSGNGDNDGRSQNFHDEGNL 240

Query: 241 --------------------------RIDTRRLSQNEEFSHACYDQVGGHW---RMPHSV 300
                                     RID  R SQNEE SH+ YDQ GGHW    MP  V
Sbjct: 241 APAKQFQNGREGYWSDLKHAPAAPGNRIDPWRPSQNEELSHSRYDQGGGHWHAQHMPRPV 300

Query: 301 PPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNSKMNILDRDYQPPPRSEMNSIHMRPFS 360
           PPEA+ED+YLSHRNELHYSDN  AFSWMDDRNNSKMNILDRDY+PPPRSEMN  HMRPFS
Sbjct: 301 PPEASEDSYLSHRNELHYSDNPQAFSWMDDRNNSKMNILDRDYRPPPRSEMNPTHMRPFS 360

Query: 361 SHGNAHH-SRNLNFGAGYAPRLSGGGRFLENGSSIEDSRFFGEQPPLPASPPPPMPWEAH 420
           SHGNAHH +RN N+GAGYAPR SGG RF ENGSSIEDSRFF EQPPLP SPPPPMPWE  
Sbjct: 361 SHGNAHHGTRNFNYGAGYAPRHSGGVRFFENGSSIEDSRFFDEQPPLPTSPPPPMPWE-- 420

Query: 421 LHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYSSVPEHRSFHHHKPMPHVSSSPMMED 480
                        AKPSSLFPVPVS S ITS+ YSSVPEHRSFHH KPM HVSSSPM ED
Sbjct: 421 -------------AKPSSLFPVPVSVSPITSSQYSSVPEHRSFHHLKPMFHVSSSPMTED 480

Query: 481 SLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASHLFKLPHRSTRPDHIVVILRGLPGSG 540
           SLA+HPYSKK+AADGK +GLNQLP  KPKVIDASHLFK PHRSTRPDHIVVILRGLPGSG
Sbjct: 481 SLAVHPYSKKFAADGKPYGLNQLPSPKPKVIDASHLFKRPHRSTRPDHIVVILRGLPGSG 540

Query: 541 KSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEKVEEGDAKSSNSIKGKKPIMKKVMEY 600
           KSYLAKMLRDVE+ENGGDAPRIHSMDDYFMTEVEKVEEGD  SSNS+KGKKPI+KKVMEY
Sbjct: 541 KSYLAKMLRDVEIENGGDAPRIHSMDDYFMTEVEKVEEGDTNSSNSVKGKKPIVKKVMEY 600

Query: 601 CYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660
           CYEPEMEEAYRSSMLKAFRKTLEEG+FTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE
Sbjct: 601 CYEPEMEEAYRSSMLKAFRKTLEEGVFTFVIVDDRNLRVADFAQFWAIAKSSGYEVYILE 660

Query: 661 ATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDM 720
           ATY+DP GCAARNVHGFNLDDIQKMARQWEEA  LYLQLDIKSLCHGDDLKESGI+EVDM
Sbjct: 661 ATYRDPTGCAARNVHGFNLDDIQKMARQWEEAPALYLQLDIKSLCHGDDLKESGIKEVDM 720

Query: 721 DMEEEDDDSP-SFQETKSEKTALPPLRDYASEDDEKRWDAEPGHLRDEVKELGRSKWSND 780
           DME+EDDD+P SFQETKS KTAL P RD ASEDD KRWD E  H R+EVKEL RSKWSND
Sbjct: 721 DMEDEDDDTPSSFQETKSIKTALHPQRDDASEDDGKRWDEESDHRREEVKELRRSKWSND 780

Query: 781 LDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVRWMDQAVNTGFSIGAAKKANRLSLVIG 840
           LDDDDTERTDG NGH+NALSGLIQAYAKEGKSVRW+DQA  TGFSIGAAKKANRLSLVIG
Sbjct: 781 LDDDDTERTDGANGHANALSGLIQAYAKEGKSVRWIDQAGYTGFSIGAAKKANRLSLVIG 840

Query: 841 PGAGYNLKSNPLAEE--HRGSTQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 869
           PGAGYNLKSNPL EE  +RGS QNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD
Sbjct: 841 PGAGYNLKSNPLPEEYQYRGSNQNSNESKKHSRFEERLRAESESFKVVFDKRRQRIGGLD 867

BLAST of HG10020798 vs. TAIR 10
Match: AT5G62760.4 (P-loop containing nucleoside triphosphate hydrolases superfamily protein)

HSP 1 Score: 521.5 bits (1342), Expect = 1.3e-147
Identity = 360/859 (41.91%), Positives = 440/859 (51.22%), Query Frame = 0

Query: 6   HHPQQWHPRPIQATVCPICAMPHFPFCPPHP---SFNQNPRYPFGPDPSFQTPGFDSHRP 65
           +H QQW P P Q  +CPIC +PHFPFCPP+P   SF  NP +P  P  +   PGFDS   
Sbjct: 15  NHQQQWRPAPTQPNICPICTVPHFPFCPPYPPPSSFAYNPNFPPPPHLNSPRPGFDSFTG 74

Query: 66  PVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDAER 125
           P   PP                         + P+ PH    + P       +   +A+R
Sbjct: 75  PPVRPPQN----------------------HYPPWQPHHGNQWRPV----AVDVDREADR 134

Query: 126 SYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLPRM 185
           SYKR R+D +      + ++++  S R S+E+ERRLK++RDHG   + P           
Sbjct: 135 SYKRARIDTIAGGSPGYGVSES-PSPRISWENERRLKMVRDHGYGLAAP----------- 194

Query: 186 NLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRIDTR 245
              SN E N     +  GSE               F++                      
Sbjct: 195 ---SNIEMN-----HQYGSE---------------FRN---------------------- 254

Query: 246 RLSQNEEFSHACYDQVGGHWRMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNS 305
                           GG +     +PP                                
Sbjct: 255 ----------------GGQFNGVAPLPP-------------------------------- 314

Query: 306 KMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHHSRNLNFGAGYAPRLSGGGRFLENGSSI 365
                     PPP       H  P+                        GG F  +GS+ 
Sbjct: 315 ----------PPPH------HPPPY------------------------GGYF--SGSN- 374

Query: 366 EDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYS 425
                   QPPLP SPPPP+P                 + PSSLFPV  ++S  T    S
Sbjct: 375 -------GQPPLPVSPPPPLP----------------SSHPSSLFPVTTNSSP-TIPPSS 434

Query: 426 SVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASH 485
           S P+         MP+ S S                          QL P + KVID SH
Sbjct: 435 SYPQ---------MPNASPSSA------------------------QLAPTRSKVIDVSH 494

Query: 486 LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEK 545
           L K PHRSTRPDH V+ILRGLPGSGKSYLAK+LRDVEVENGG APRIHSMDDYFMTEVEK
Sbjct: 495 LLKPPHRSTRPDHFVIILRGLPGSGKSYLAKLLRDVEVENGGSAPRIHSMDDYFMTEVEK 554

Query: 546 VEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDR 605
           VEE D+ S +S + K+PI+K VMEYCYEPEMEEAYRSSMLKAF++TLE+G F+FVIVDDR
Sbjct: 555 VEESDSTSLSSGRSKRPIVKTVMEYCYEPEMEEAYRSSMLKAFKRTLEDGAFSFVIVDDR 614

Query: 606 NLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPL 665
           NLRVADF QFWA AK SGYE YILEATYKDP GCAARNVHG  +D +Q+MA QWEEA  L
Sbjct: 615 NLRVADFTQFWATAKRSGYEAYILEATYKDPTGCAARNVHGITVDQVQQMAEQWEEAPSL 635

Query: 666 YLQLDIKSLCHGDDLKESGIQEVDMDMEEEDDDSPSFQETKSEKTALPPLRDYASEDDEK 725
           Y+QLDIKS    DDLKE+ IQEVDMDME    D     E KS+ +      +  S   E 
Sbjct: 675 YMQLDIKSFTRWDDLKENEIQEVDMDME----DDFGLPERKSDNST--QSEEKGSYKSES 635

Query: 726 RWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVRWM 785
           +WDAE G   +EVKEL RSKWSN +++D+TE +     +S +L    Q   ++GKSV W 
Sbjct: 735 KWDAESGSRTEEVKELSRSKWSN-VEEDETENSQSMRRNSKSLPKSSQERQRKGKSVWWG 635

Query: 786 DQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPL-AEEHRGSTQNSNESKKHSRFEER 845
           D+  + GFSIGAA+  N  SL+IGPG+GYN+KSNPL AEE R       ++K    F+++
Sbjct: 795 DKGGDAGFSIGAARNMNMPSLIIGPGSGYNVKSNPLSAEESRALADAIGKAKVRGIFQDQ 635

Query: 846 LRAESESFKVVFDKRRQRI 861
           LRAE ESFK VFDKR  RI
Sbjct: 855 LRAERESFKAVFDKRHVRI 635

BLAST of HG10020798 vs. TAIR 10
Match: AT5G62760.3 (P-loop containing nucleoside triphosphate hydrolases superfamily protein)

HSP 1 Score: 519.2 bits (1336), Expect = 6.2e-147
Identity = 360/861 (41.81%), Positives = 442/861 (51.34%), Query Frame = 0

Query: 6   HHPQQWHPRPIQATVCPICAMPHFPFCPPHP---SFNQNPRYPFGPDPSFQTPGFDSHRP 65
           +H QQW P P Q  +CPIC +PHFPFCPP+P   SF  NP +P  P  +   PGFDS   
Sbjct: 15  NHQQQWRPAPTQPNICPICTVPHFPFCPPYPPPSSFAYNPNFPPPPHLNSPRPGFDSFTG 74

Query: 66  PVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDAER 125
           P   PP                         + P+ PH    + P       +   +A+R
Sbjct: 75  PPVRPPQN----------------------HYPPWQPHHGNQWRPV----AVDVDREADR 134

Query: 126 SYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLPRM 185
           SYKR R+D +      + ++++  S R S+E+ERRLK++RDHG   + P           
Sbjct: 135 SYKRARIDTIAGGSPGYGVSES-PSPRISWENERRLKMVRDHGYGLAAP----------- 194

Query: 186 NLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRIDTR 245
              SN E N     +  GSE               F++                      
Sbjct: 195 ---SNIEMN-----HQYGSE---------------FRN---------------------- 254

Query: 246 RLSQNEEFSHACYDQVGGHWRMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNS 305
                           GG +     +PP                                
Sbjct: 255 ----------------GGQFNGVAPLPP-------------------------------- 314

Query: 306 KMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHHSRNLNFGAGYAPRLSGGGRFLENGSSI 365
                     PPP       H  P+                        GG F  +GS+ 
Sbjct: 315 ----------PPPH------HPPPY------------------------GGYF--SGSN- 374

Query: 366 EDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYS 425
                   QPPLP SPPPP+P                 + PSSLFPV  ++S  T    S
Sbjct: 375 -------GQPPLPVSPPPPLP----------------SSHPSSLFPVTTNSSP-TIPPSS 434

Query: 426 SVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASH 485
           S P+         MP+ S S                          QL P + KVID SH
Sbjct: 435 SYPQ---------MPNASPSSA------------------------QLAPTRSKVIDVSH 494

Query: 486 LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEK 545
           L K PHRSTRPDH V+ILRGLPGSGKSYLAK+LRDVEVENGG APRIHSMDDYFMTEVEK
Sbjct: 495 LLKPPHRSTRPDHFVIILRGLPGSGKSYLAKLLRDVEVENGGSAPRIHSMDDYFMTEVEK 554

Query: 546 VEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVIVDDR 605
           VEE D+ S +S + K+PI+K VMEYCYEPEMEEAYRSSMLKAF++TLE+G F+FVIVDDR
Sbjct: 555 VEESDSTSLSSGRSKRPIVKTVMEYCYEPEMEEAYRSSMLKAFKRTLEDGAFSFVIVDDR 614

Query: 606 NLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFNLDDIQKMARQWEEALPL 665
           NLRVADF QFWA AK SGYE YILEATYKDP GCAARNVHG  +D +Q+MA QWEEA  L
Sbjct: 615 NLRVADFTQFWATAKRSGYEAYILEATYKDPTGCAARNVHGITVDQVQQMAEQWEEAPSL 639

Query: 666 YLQLDIKSLCHGDDLKESGIQEVDMDMEEED--DDSPSFQETKSEKTALPPLRDYASEDD 725
           Y+QLDIKS    DDLKE+ IQEVDMDME++    +  S   T+SE+          S   
Sbjct: 675 YMQLDIKSFTRWDDLKENEIQEVDMDMEDDFGLPERKSDNSTQSEEKGATE----GSYKS 639

Query: 726 EKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSNALSGLIQAYAKEGKSVR 785
           E +WDAE G   +EVKEL RSKWSN +++D+TE +     +S +L    Q   ++GKSV 
Sbjct: 735 ESKWDAESGSRTEEVKELSRSKWSN-VEEDETENSQSMRRNSKSLPKSSQERQRKGKSVW 639

Query: 786 WMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPL-AEEHRGSTQNSNESKKHSRFE 845
           W D+  + GFSIGAA+  N  SL+IGPG+GYN+KSNPL AEE R       ++K    F+
Sbjct: 795 WGDKGGDAGFSIGAARNMNMPSLIIGPGSGYNVKSNPLSAEESRALADAIGKAKVRGIFQ 639

Query: 846 ERLRAESESFKVVFDKRRQRI 861
           ++LRAE ESFK VFDKR  RI
Sbjct: 855 DQLRAERESFKAVFDKRHVRI 639

BLAST of HG10020798 vs. TAIR 10
Match: AT5G62760.1 (P-loop containing nucleoside triphosphate hydrolases superfamily protein)

HSP 1 Score: 508.4 bits (1308), Expect = 1.1e-143
Identity = 360/878 (41.00%), Positives = 442/878 (50.34%), Query Frame = 0

Query: 6   HHPQQWHPRPIQATVCPICAMPHFPFCPPHP---SFNQNPRYPFGPDPSFQTPGFDSHRP 65
           +H QQW P P Q  +CPIC +PHFPFCPP+P   SF  NP +P  P  +   PGFDS   
Sbjct: 15  NHQQQWRPAPTQPNICPICTVPHFPFCPPYPPPSSFAYNPNFPPPPHLNSPRPGFDSFTG 74

Query: 66  PVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDAER 125
           P   PP                         + P+ PH    + P       +   +A+R
Sbjct: 75  PPVRPPQN----------------------HYPPWQPHHGNQWRPV----AVDVDREADR 134

Query: 126 SYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLPRM 185
           SYKR R+D +      + ++++  S R S+E+ERRLK++RDHG   + P           
Sbjct: 135 SYKRARIDTIAGGSPGYGVSES-PSPRISWENERRLKMVRDHGYGLAAP----------- 194

Query: 186 NLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRIDTR 245
              SN E N     +  GSE               F++                      
Sbjct: 195 ---SNIEMN-----HQYGSE---------------FRN---------------------- 254

Query: 246 RLSQNEEFSHACYDQVGGHWRMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNS 305
                           GG +     +PP                                
Sbjct: 255 ----------------GGQFNGVAPLPP-------------------------------- 314

Query: 306 KMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHHSRNLNFGAGYAPRLSGGGRFLENGSSI 365
                     PPP       H  P+                        GG F  +GS+ 
Sbjct: 315 ----------PPPH------HPPPY------------------------GGYF--SGSN- 374

Query: 366 EDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYS 425
                   QPPLP SPPPP+P                 + PSSLFPV  ++S  T    S
Sbjct: 375 -------GQPPLPVSPPPPLP----------------SSHPSSLFPVTTNSSP-TIPPSS 434

Query: 426 SVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASH 485
           S P+         MP+ S S                          QL P + KVID SH
Sbjct: 435 SYPQ---------MPNASPSSA------------------------QLAPTRSKVIDVSH 494

Query: 486 LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEK 545
           L K PHRSTRPDH V+ILRGLPGSGKSYLAK+LRDVEVENGG APRIHSMDDYFMTEVEK
Sbjct: 495 LLKPPHRSTRPDHFVIILRGLPGSGKSYLAKLLRDVEVENGGSAPRIHSMDDYFMTEVEK 554

Query: 546 VEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVI---- 605
           VEE D+ S +S + K+PI+K VMEYCYEPEMEEAYRSSMLKAF++TLE+G F+FVI    
Sbjct: 555 VEESDSTSLSSGRSKRPIVKTVMEYCYEPEMEEAYRSSMLKAFKRTLEDGAFSFVIVCFL 614

Query: 606 -------------VDDRNLRVADFAQFWAIAKSSGYEVYILEATYKDPAGCAARNVHGFN 665
                        VDDRNLRVADF QFWA AK SGYE YILEATYKDP GCAARNVHG  
Sbjct: 615 ELTVSCWYMSLILVDDRNLRVADFTQFWATAKRSGYEAYILEATYKDPTGCAARNVHGIT 656

Query: 666 LDDIQKMARQWEEALPLYLQLDIKSLCHGDDLKESGIQEVDMDMEEED--DDSPSFQETK 725
           +D +Q+MA QWEEA  LY+QLDIKS    DDLKE+ IQEVDMDME++    +  S   T+
Sbjct: 675 VDQVQQMAEQWEEAPSLYMQLDIKSFTRWDDLKENEIQEVDMDMEDDFGLPERKSDNSTQ 656

Query: 726 SEKTALPPLRDYASEDDEKRWDAEPGHLRDEVKELGRSKWSNDLDDDDTERTDGRNGHSN 785
           SE+          S   E +WDAE G   +EVKEL RSKWSN +++D+TE +     +S 
Sbjct: 735 SEEKGATE----GSYKSESKWDAESGSRTEEVKELSRSKWSN-VEEDETENSQSMRRNSK 656

Query: 786 ALSGLIQAYAKEGKSVRWMDQAVNTGFSIGAAKKANRLSLVIGPGAGYNLKSNPL-AEEH 845
           +L    Q   ++GKSV W D+  + GFSIGAA+  N  SL+IGPG+GYN+KSNPL AEE 
Sbjct: 795 SLPKSSQERQRKGKSVWWGDKGGDAGFSIGAARNMNMPSLIIGPGSGYNVKSNPLSAEES 656

Query: 846 RGSTQNSNESKKHSRFEERLRAESESFKVVFDKRRQRI 861
           R       ++K    F+++LRAE ESFK VFDKR  RI
Sbjct: 855 RALADAIGKAKVRGIFQDQLRAERESFKAVFDKRHVRI 656

BLAST of HG10020798 vs. TAIR 10
Match: AT5G62760.2 (P-loop containing nucleoside triphosphate hydrolases superfamily protein )

HSP 1 Score: 263.5 bits (672), Expect = 6.2e-70
Identity = 212/596 (35.57%), Positives = 262/596 (43.96%), Query Frame = 0

Query: 6   HHPQQWHPRPIQATVCPICAMPHFPFCPPHP---SFNQNPRYPFGPDPSFQTPGFDSHRP 65
           +H QQW P P Q  +CPIC +PHFPFCPP+P   SF  NP +P  P  +   PGFDS   
Sbjct: 15  NHQQQWRPAPTQPNICPICTVPHFPFCPPYPPPSSFAYNPNFPPPPHLNSPRPGFDSFTG 74

Query: 66  PVGMPPPYMGNPDDGFGDQRPWIRNSANPIGHVPFHPHREGVFPPPYDYGGNEFVNDAER 125
           P   PP                         + P+ PH    + P       +   +A+R
Sbjct: 75  PPVRPPQN----------------------HYPPWQPHHGNQWRPV----AVDVDREADR 134

Query: 126 SYKRPRVDDVGSDGVVHELNQNQKSGRSSYEDERRLKLIRDHGVVSSGPPEGGSNSLPRM 185
           SYKR R+D +      + ++++  S R S+E+ERRLK++RDHG   + P           
Sbjct: 135 SYKRARIDTIAGGSPGYGVSES-PSPRISWENERRLKMVRDHGYGLAAP----------- 194

Query: 186 NLGSNSEANRRTLENSVGSEDPEEVGSTRILETNNFQDPGNGNNDGRTQNFQENGRIDTR 245
              SN E N     +  GSE               F++                      
Sbjct: 195 ---SNIEMN-----HQYGSE---------------FRN---------------------- 254

Query: 246 RLSQNEEFSHACYDQVGGHWRMPHSVPPEATEDNYLSHRNELHYSDNRHAFSWMDDRNNS 305
                           GG +     +PP                                
Sbjct: 255 ----------------GGQFNGVAPLPP-------------------------------- 314

Query: 306 KMNILDRDYQPPPRSEMNSIHMRPFSSHGNAHHSRNLNFGAGYAPRLSGGGRFLENGSSI 365
                     PPP       H  P+                        GG F  +GS+ 
Sbjct: 315 ----------PPPH------HPPPY------------------------GGYF--SGSN- 374

Query: 366 EDSRFFGEQPPLPASPPPPMPWEAHLHASAESMAYSSQAKPSSLFPVPVSTSTITSAAYS 425
                   QPPLP SPPPP+P                 + PSSLFPV  ++S  T    S
Sbjct: 375 -------GQPPLPVSPPPPLP----------------SSHPSSLFPVTTNSSP-TIPPSS 379

Query: 426 SVPEHRSFHHHKPMPHVSSSPMMEDSLALHPYSKKYAADGKAFGLNQLPPQKPKVIDASH 485
           S P+         MP+ S S                          QL P + KVID SH
Sbjct: 435 SYPQ---------MPNASPSSA------------------------QLAPTRSKVIDVSH 379

Query: 486 LFKLPHRSTRPDHIVVILRGLPGSGKSYLAKMLRDVEVENGGDAPRIHSMDDYFMTEVEK 545
           L K PHRSTRPDH V+ILRGLPGSGKSYLAK+LRDVEVENGG APRIHSMDDYFMTEVEK
Sbjct: 495 LLKPPHRSTRPDHFVIILRGLPGSGKSYLAKLLRDVEVENGGSAPRIHSMDDYFMTEVEK 379

Query: 546 VEEGDAKSSNSIKGKKPIMKKVMEYCYEPEMEEAYRSSMLKAFRKTLEEGIFTFVI 599
           VEE D+ S +S + K+PI+K VMEYCYEPEMEEAYRSSMLKAF++TLE+G F+FVI
Sbjct: 555 VEESDSTSLSSGRSKRPIVKTVMEYCYEPEMEEAYRSSMLKAFKRTLEDGAFSFVI 379
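
The percentages in the HSP headers above are the match counts divided by the alignment length (for example, 360/878 gives 41.00%). A minimal Python sketch for reading such a header line and rechecking the arithmetic follows. The function names `parse_hsp_stats` and `percent` are illustrative assumptions, not part of any BLAST tool, and the pattern deliberately accepts both the "Positives" and "Postives" spellings of the label.

```python
import re

# Pattern for the statistics line of an HSP header.
# "Posi?tives" tolerates the misspelled label "Postives" as well.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "positive": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positive_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage of aligned columns, rounded to two decimals."""
    return round(100.0 * count / length, 2)


if __name__ == "__main__":
    line = "Identity = 360/878 (41.00%), Positives = 442/878 (50.34%), Query Frame = 0"
    stats = parse_hsp_stats(line)
    # Recompute both percentages from the raw counts.
    print(percent(stats["identical"], stats["length"]))
    print(percent(stats["positive"], stats["length"]))
```

The same check applied to the AT5G62760.2 header (212/596 and 262/596) reproduces the reported 35.57% and 43.96%.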

The following BLAST results are available for this feature:
Match Name      E-value   Identity  Description
XP_038894607.1  0.0e+00   91.85     uncharacterized protein LOC120083122 [Benincasa hispida]
XP_008437571.1  0.0e+00   90.63     PREDICTED: uncharacterized protein LOC103482943 [Cucumis melo] >TYJ99101.1 uncha...
XP_011651180.1  0.0e+00   90.39     uncharacterized protein LOC101218580 [Cucumis sativus] >KGN64252.1 hypothetical ...
XP_023541377.1  0.0e+00   79.76     uncharacterized protein LOC111801581 [Cucurbita pepo subsp. pepo]
XP_022994572.1  0.0e+00   79.65     uncharacterized protein LOC111490251 [Cucurbita maxima]

Match Name      E-value   Identity  Description
P49750          1.8e-53   46.77     YLP motif-containing protein 1 OS=Homo sapiens OX=9606 GN=YLPM1 PE=1 SV=4
Q9R0I7          5.6e-52   41.81     YLP motif-containing protein 1 OS=Mus musculus OX=10090 GN=Ylpm1 PE=2 SV=2
P0CB49          9.6e-52   41.46     YLP motif-containing protein 1 OS=Rattus norvegicus OX=10116 GN=Ylpm1 PE=1 SV=1
Q5TBK1          4.2e-07   29.38     NEDD4-binding protein 2-like 1 OS=Homo sapiens OX=9606 GN=N4BP2L1 PE=2 SV=1
Q3V2Q8          4.2e-07   29.23     NEDD4-binding protein 2-like 1 OS=Mus musculus OX=10090 GN=N4bp2l1 PE=2 SV=1

Match Name      E-value   Identity  Description
A0A5D3BK41      0.0e+00   90.63     Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3AUX6      0.0e+00   90.63     uncharacterized protein LOC103482943 OS=Cucumis melo OX=3656 GN=LOC103482943 PE=...
A0A0A0LTZ2      0.0e+00   90.39     Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G045500 PE=4 SV=1
A0A6J1JW85      0.0e+00   79.65     uncharacterized protein LOC111490251 OS=Cucurbita maxima OX=3661 GN=LOC111490251...
A0A6J1GTW4      0.0e+00   79.65     uncharacterized protein LOC111457077 OS=Cucurbita moschata OX=3662 GN=LOC1114570...

Match Name      E-value   Identity  Description
AT5G62760.4     1.3e-147  41.91     P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT5G62760.3     6.2e-147  41.81     P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT5G62760.1     1.1e-143  41.00     P-loop containing nucleoside triphosphate hydrolases superfamily protein
AT5G62760.2     6.2e-70   35.57     P-loop containing nucleoside triphosphate hydrolases superfamily protein
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR Term   IPR Description                                      Source       Source Term  Source Description                                    Alignment
None       No IPR available                                     PFAM         PF13671      AAA_33                                                coord: 497..629; e-value: 3.6E-6; score: 27.2
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 712..758
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 814..840
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 149..163
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 171..197
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 826..840
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 127..208
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 214..233
None       No IPR available                                     MOBIDB_LITE  mobidb-lite  disorder_prediction                                   coord: 689..763
IPR027417  P-loop containing nucleoside triphosphate hydrolase  GENE3D       3.40.50.300                                                        coord: 496..661; e-value: 1.2E-26; score: 95.7
IPR027417  P-loop containing nucleoside triphosphate hydrolase  SUPERFAMILY  52540        P-loop containing nucleoside triphosphate hydrolases  coord: 497..661
IPR026314  YLP motif-containing protein 1                       PANTHER      PTHR13413    YLP MOTIF CONTAINING PROTEIN NUCLEAR PROTEIN ZAP      coord: 3..858
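
Several of the mobidb-lite disorder predictions listed above overlap or nest inside one another (for example, 149..163 and 171..197 both fall within 127..208). The Python sketch below collapses them into non-overlapping regions. It assumes the coordinates are 1-based and inclusive, as is usual for protein feature annotations, and the helper name `merge_intervals` is illustrative only.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs with inclusive coordinates."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this one reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# mobidb-lite coordinates copied from the InterPro table above
disorder = [
    (712, 758), (814, 840), (149, 163), (171, 197),
    (826, 840), (127, 208), (214, 233), (689, 763),
]

regions = merge_intervals(disorder)
# Inclusive coordinates, so each region spans end - start + 1 residues.
covered = sum(end - start + 1 for start, end in regions)

if __name__ == "__main__":
    print(regions)  # [(127, 208), (214, 233), (689, 763), (814, 840)]
    print(covered)  # 204
```

Under these assumptions the eight predictions reduce to four distinct disordered regions covering 204 residues, none of which overlaps the P-loop NTPase domain at 496..661.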

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Type
HG10020798.1  HG10020798.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0032204      regulation of telomere maintenance
cellular_component  GO:0005634      nucleus