HG10008107 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10008107
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionprotein PHOX1-like
LocationChr10: 19900026 .. 19902303 (-)
RNA-Seq ExpressionHG10008107
SyntenyHG10008107
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGGGGAAGCAAAGTGGGAAGAAGAAGCAGATTGGTGATAAATTCCGCAAGGCGATTTCGAAACATCGCCAAAGTGGAGATGGAAGTCCAAGTTATGACAAAGACCATGTTATTTTCATTACTATGTCTCAGGTTTTAAAGGAAGAGGGCAATAAATTGTTCCAGTCTAGAGATCTTGAAGGAGCAATGTTGAAATATGATAAGGCCCTTAAATTACTTCCAAGGAATCATATAGATGTATCGTATCTTCGGAGTAACATGGCAGCATGCTATATGCAGATGGGGCTTAGCGAGTATCCCCGAGCGATTCACGAGTGCAATTTGGCTCTTGAAGTTACACCCAAGTACAGTAAGGCATTGTTGAAGCGGGCTAGATGTTATGAGGGTTTGCGTAGGCTGGACTTGGCTTTAAGAGATGTTAAGGCAGTTTTAAGCATGGAGCCAAATAATATCATGGCTTTAGAAATATCAGAGAGATTAACAAAGGCTCTTGAGATGAAAGGATCAAAGGAAGATGATGCCGAGATCAAGCTACCTCTTGATTTTGTTGAATTGCCTTCTTCAGCATCGTCGCAAAAAGGACCGAAAGAAAAGAACCGAAAGAAGAAAAACAACCAAAAGACTAAGGAAACCATTGATGAGAAGAAGGCTGATGAAACAGTGGAGGAGGAAGAGAAGGCTGATGAAACAGTGGAGGAGGAAGAGAAGGCTGATGAAATAATAGTGGGGGAGGAGAAGGTCGATGAGACGGTGGAGGAGAAGAAGGCTGAGGATAAACTAGTTGTGGAGGAGAAAATCACAAGGACACAGGAGGAAACACCGATGAAAACTGTGAAATTGGTGTTTGGAGAAGATATTCGATGGGCTCAATTGCCAATTGATTGCACTCTATTGCAACTGAGGGAGGTTATACGTGATCGTTTCCCTACCTGCATGGCAGTCCTTATCAAGTACAGGGATGAAGAAGGTGATTTGGTAACAATTACTACCAATGAAGAATTACGATTGGCTGAAACATCTAAGTTGTCACAGGGTTCTGTTAGATTTTATATATTTGAAGTTAATCCAGAGCGAGATCCATTTTACAAAAGGTTTAAGAATGACAAGGCTGCCAAGTGTGAAGTTGAAGAAAATAGCATCTTTGAGAATGGTCATGTATTAAAAGCAAAAGAAATAAAGATGTCATCTTGCATTGATGACTGGATAATTCAATTTGCTCAGTTATTTATAAACCATGTTGGATTTGAATCTGGCCCATACTTGGATCTCCATGACCTTGGGATGAAGCTTTATTCTGAGGCTGTGGAAGAGACAGTAACAAGTGAAGAAGCTCAGGGTCTTTTTGAATTAGCAGCAGAAAAGTTTCATGAGATGGCAGCCTTAGCACTGTTCAACTGGGGAAATGTTATCATGTCAAGGGCGAGGAAGACGGTCTACTTCGCAGATGGTGGTTCAAAAGTTCGTGTGCTTGAACAGATCAAAGCTGCATTCGATTGGGTCGAAAAAGAATATGCTGAAGCAGAAAGGAAATATCAAATGGCGGTGGAAATCAAACCAGACTTTTATGAAGGCTATCTAGCTCTAGGACAACAACAGTTTGAGCAGGCAAAACTTTCTTGGCATTATGCAGTTAGCAGCGATGTTGATCCGAAAACATGGCCTTGCGCTGAAGTTATGCAACTTTACAATAGTGCTGAGGAAAACATGGAAACAGGCATGAAGATGTGGGAAGAATGGGAAGAGCAGCGTACTGGGGAACTCTCTAAATCTAGCAATGTTAAAACCCAGTTGCAGAAGATGGGGTTAGATGGGCTGATCAAGGATATATCAGTTGATGAGGCCGCAGAACAGGCTAAAAATATGAGGTCTCATATAAACCTCTTATGGGGTACCATGCTCTACGAGCGATCGATATTGGAATTTAAGATGGGGCTGCCGGCGTGGCATGAATGTCTGGAAGTTGCAGTTGAGAAATTCGAGCTTGCTGGAGCTTCTGCAACGGATATCGCAGTTATGATAAAGAATCACTGTTCAAGCAACAATTCACATGAAGGTACATTACAAATATTCTTTGTTTGCATCTCTTCATTCTGAGTTAAACATCTTTCTTATCTCCTCCAATTATACAGGTCTTGGGTTCAAAATTGATGAGATAGTACAAGCATGGAATGAGATGTATGATGCTAGAAAGTTGCTAACTGGAGTTCCATCATTCCGATTAGAGCCATTATTTCGGCGAAGGGTCTCGAAAATCTACCACGTGTTGGAGCAAGCTTGA

mRNA sequence

ATGGGGAAGCAAAGTGGGAAGAAGAAGCAGATTGGTGATAAATTCCGCAAGGCGATTTCGAAACATCGCCAAAGTGGAGATGGAAGTCCAAGTTATGACAAAGACCATGTTATTTTCATTACTATGTCTCAGGTTTTAAAGGAAGAGGGCAATAAATTGTTCCAGTCTAGAGATCTTGAAGGAGCAATGTTGAAATATGATAAGGCCCTTAAATTACTTCCAAGGAATCATATAGATGTATCGTATCTTCGGAGTAACATGGCAGCATGCTATATGCAGATGGGGCTTAGCGAGTATCCCCGAGCGATTCACGAGTGCAATTTGGCTCTTGAAGTTACACCCAAGTACAGTAAGGCATTGTTGAAGCGGGCTAGATGTTATGAGGGTTTGCGTAGGCTGGACTTGGCTTTAAGAGATGTTAAGGCAGTTTTAAGCATGGAGCCAAATAATATCATGGCTTTAGAAATATCAGAGAGATTAACAAAGGCTCTTGAGATGAAAGGATCAAAGGAAGATGATGCCGAGATCAAGCTACCTCTTGATTTTGTTGAATTGCCTTCTTCAGCATCGTCGCAAAAAGGACCGAAAGAAAAGAACCGAAAGAAGAAAAACAACCAAAAGACTAAGGAAACCATTGATGAGAAGAAGGCTGATGAAACAGTGGAGGAGGAAGAGAAGGCTGATGAAACAGTGGAGGAGGAAGAGAAGGCTGATGAAATAATAGTGGGGGAGGAGAAGGTCGATGAGACGGTGGAGGAGAAGAAGGCTGAGGATAAACTAGTTGTGGAGGAGAAAATCACAAGGACACAGGAGGAAACACCGATGAAAACTGTGAAATTGGTGTTTGGAGAAGATATTCGATGGGCTCAATTGCCAATTGATTGCACTCTATTGCAACTGAGGGAGGTTATACGTGATCGTTTCCCTACCTGCATGGCAGTCCTTATCAAGTACAGGGATGAAGAAGGTGATTTGGTAACAATTACTACCAATGAAGAATTACGATTGGCTGAAACATCTAAGTTGTCACAGGGTTCTGTTAGATTTTATATATTTGAAGTTAATCCAGAGCGAGATCCATTTTACAAAAGGTTTAAGAATGACAAGGCTGCCAAGTGTGAAGTTGAAGAAAATAGCATCTTTGAGAATGGTCATGTATTAAAAGCAAAAGAAATAAAGATGTCATCTTGCATTGATGACTGGATAATTCAATTTGCTCAGTTATTTATAAACCATGTTGGATTTGAATCTGGCCCATACTTGGATCTCCATGACCTTGGGATGAAGCTTTATTCTGAGGCTGTGGAAGAGACAGTAACAAGTGAAGAAGCTCAGGGTCTTTTTGAATTAGCAGCAGAAAAGTTTCATGAGATGGCAGCCTTAGCACTGTTCAACTGGGGAAATGTTATCATGTCAAGGGCGAGGAAGACGGTCTACTTCGCAGATGGTGGTTCAAAAGTTCGTGTGCTTGAACAGATCAAAGCTGCATTCGATTGGGTCGAAAAAGAATATGCTGAAGCAGAAAGGAAATATCAAATGGCGGTGGAAATCAAACCAGACTTTTATGAAGGCTATCTAGCTCTAGGACAACAACAGTTTGAGCAGGCAAAACTTTCTTGGCATTATGCAGTTAGCAGCGATGTTGATCCGAAAACATGGCCTTGCGCTGAAGTTATGCAACTTTACAATAGTGCTGAGGAAAACATGGAAACAGGCATGAAGATGTGGGAAGAATGGGAAGAGCAGCGTACTGGGGAACTCTCTAAATCTAGCAATGTTAAAACCCAGTTGCAGAAGATGGGGTTAGATGGGCTGATCAAGGATATATCAGTTGATGAGGCCGCAGAACAGGCTAAAAATATGAGGTCTCATATAAACCTCTTATGGGGTACCATGCTCTACGAGCGATCGATATTGGAATTTAAGATGGGGCTGCCGGCGTGGCATGAATGTCTGGAAGTTGCAGTTGAGAAATTCGAGCTTGCTGGAGCTTCTGCAACGGATATCGCAGTTATGATAAAGAATCACTGTTCAAGCAACAATTCACATGAAGGTCTTGGGTTCAAAATTGATGAGATAGTACAAGCATGGAATGAGATGTATGATGCTAGAAAGTTGCTAACTGGAGTTCCATCATTCCGATTAGAGCCATTATTTCGGCGAAGGGTCTCGAAAATCTACCACGTGTTGGAGCAAGCTTGA

Coding sequence (CDS)

ATGGGGAAGCAAAGTGGGAAGAAGAAGCAGATTGGTGATAAATTCCGCAAGGCGATTTCGAAACATCGCCAAAGTGGAGATGGAAGTCCAAGTTATGACAAAGACCATGTTATTTTCATTACTATGTCTCAGGTTTTAAAGGAAGAGGGCAATAAATTGTTCCAGTCTAGAGATCTTGAAGGAGCAATGTTGAAATATGATAAGGCCCTTAAATTACTTCCAAGGAATCATATAGATGTATCGTATCTTCGGAGTAACATGGCAGCATGCTATATGCAGATGGGGCTTAGCGAGTATCCCCGAGCGATTCACGAGTGCAATTTGGCTCTTGAAGTTACACCCAAGTACAGTAAGGCATTGTTGAAGCGGGCTAGATGTTATGAGGGTTTGCGTAGGCTGGACTTGGCTTTAAGAGATGTTAAGGCAGTTTTAAGCATGGAGCCAAATAATATCATGGCTTTAGAAATATCAGAGAGATTAACAAAGGCTCTTGAGATGAAAGGATCAAAGGAAGATGATGCCGAGATCAAGCTACCTCTTGATTTTGTTGAATTGCCTTCTTCAGCATCGTCGCAAAAAGGACCGAAAGAAAAGAACCGAAAGAAGAAAAACAACCAAAAGACTAAGGAAACCATTGATGAGAAGAAGGCTGATGAAACAGTGGAGGAGGAAGAGAAGGCTGATGAAACAGTGGAGGAGGAAGAGAAGGCTGATGAAATAATAGTGGGGGAGGAGAAGGTCGATGAGACGGTGGAGGAGAAGAAGGCTGAGGATAAACTAGTTGTGGAGGAGAAAATCACAAGGACACAGGAGGAAACACCGATGAAAACTGTGAAATTGGTGTTTGGAGAAGATATTCGATGGGCTCAATTGCCAATTGATTGCACTCTATTGCAACTGAGGGAGGTTATACGTGATCGTTTCCCTACCTGCATGGCAGTCCTTATCAAGTACAGGGATGAAGAAGGTGATTTGGTAACAATTACTACCAATGAAGAATTACGATTGGCTGAAACATCTAAGTTGTCACAGGGTTCTGTTAGATTTTATATATTTGAAGTTAATCCAGAGCGAGATCCATTTTACAAAAGGTTTAAGAATGACAAGGCTGCCAAGTGTGAAGTTGAAGAAAATAGCATCTTTGAGAATGGTCATGTATTAAAAGCAAAAGAAATAAAGATGTCATCTTGCATTGATGACTGGATAATTCAATTTGCTCAGTTATTTATAAACCATGTTGGATTTGAATCTGGCCCATACTTGGATCTCCATGACCTTGGGATGAAGCTTTATTCTGAGGCTGTGGAAGAGACAGTAACAAGTGAAGAAGCTCAGGGTCTTTTTGAATTAGCAGCAGAAAAGTTTCATGAGATGGCAGCCTTAGCACTGTTCAACTGGGGAAATGTTATCATGTCAAGGGCGAGGAAGACGGTCTACTTCGCAGATGGTGGTTCAAAAGTTCGTGTGCTTGAACAGATCAAAGCTGCATTCGATTGGGTCGAAAAAGAATATGCTGAAGCAGAAAGGAAATATCAAATGGCGGTGGAAATCAAACCAGACTTTTATGAAGGCTATCTAGCTCTAGGACAACAACAGTTTGAGCAGGCAAAACTTTCTTGGCATTATGCAGTTAGCAGCGATGTTGATCCGAAAACATGGCCTTGCGCTGAAGTTATGCAACTTTACAATAGTGCTGAGGAAAACATGGAAACAGGCATGAAGATGTGGGAAGAATGGGAAGAGCAGCGTACTGGGGAACTCTCTAAATCTAGCAATGTTAAAACCCAGTTGCAGAAGATGGGGTTAGATGGGCTGATCAAGGATATATCAGTTGATGAGGCCGCAGAACAGGCTAAAAATATGAGGTCTCATATAAACCTCTTATGGGGTACCATGCTCTACGAGCGATCGATATTGGAATTTAAGATGGGGCTGCCGGCGTGGCATGAATGTCTGGAAGTTGCAGTTGAGAAATTCGAGCTTGCTGGAGCTTCTGCAACGGATATCGCAGTTATGATAAAGAATCACTGTTCAAGCAACAATTCACATGAAGGTCTTGGGTTCAAAATTGATGAGATAGTACAAGCATGGAATGAGATGTATGATGCTAGAAAGTTGCTAACTGGAGTTCCATCATTCCGATTAGAGCCATTATTTCGGCGAAGGGTCTCGAAAATCTACCACGTGTTGGAGCAAGCTTGA

Protein sequence

MGKQSGKKKQIGDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKLPLDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPERDPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPLFRRRVSKIYHVLEQA
Homology
BLAST of HG10008107 vs. NCBI nr
Match: XP_038879044.1 (protein PHOX1-like isoform X1 [Benincasa hispida])

HSP 1 Score: 1323.5 bits (3424), Expect = 0.0e+00
Identity = 686/742 (92.45%), Postives = 712/742 (95.96%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEEGNKLFQSRDL 60
           MGKQSG KKKQIGDKFR+A+SKHRQ+GDG+PSYDKDHVIFITMSQ+LKEEGNKLFQSRDL
Sbjct: 1   MGKQSGKKKKQIGDKFREAVSKHRQNGDGNPSYDKDHVIFITMSQILKEEGNKLFQSRDL 60

Query: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSKA 120
           EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGL+EYPRAIHECNLALEVTPKYSKA
Sbjct: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLTEYPRAIHECNLALEVTPKYSKA 120

Query: 121 LLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKLP 180
           LLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTKALEMKGSKE+DA+IKLP
Sbjct: 121 LLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKALEMKGSKEEDADIKLP 180

Query: 181 LDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKADE 240
           LDFVELPS+ S QK  KEKNRKKKNNQKTKETIDEKKADETVE EEK DE +  EEK DE
Sbjct: 181 LDFVELPSTMSPQK-RKEKNRKKKNNQKTKETIDEKKADETVEGEEKPDEIMVGEEKVDE 240

Query: 241 II--------VGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQL 300
           ++        V EEKVDE VEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQL
Sbjct: 241 MVEEEKVDEMVEEEKVDEMVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQL 300

Query: 301 PIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYI 360
           PIDCTLLQLREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYI
Sbjct: 301 PIDCTLLQLREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYI 360

Query: 361 FEVNPERDPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFIN 420
           FEVNPE+DPFY+RFKN++ AKCEVEENSIFENGHVLK KEIKMSSCI+DWIIQFAQLFIN
Sbjct: 361 FEVNPEQDPFYERFKNNEVAKCEVEENSIFENGHVLKPKEIKMSSCINDWIIQFAQLFIN 420

Query: 421 HVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIM 480
           HVGF+SGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIM
Sbjct: 421 HVGFDSGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIM 480

Query: 481 SRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQ 540
           SRARK VYFADGGSKV VLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALG+Q
Sbjct: 481 SRARKKVYFADGGSKVHVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGKQ 540

Query: 541 QFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSS 600
           QFEQAK+SWHYAVSSDVDPKTWPC +VMQLYNSAEENMETGMK+WEEWEEQRTGELSKSS
Sbjct: 541 QFEQAKVSWHYAVSSDVDPKTWPCTKVMQLYNSAEENMETGMKLWEEWEEQRTGELSKSS 600

Query: 601 NVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHEC 660
           NV+TQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP WHEC
Sbjct: 601 NVRTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPTWHEC 660

Query: 661 LEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVP 720
           LEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLL+GVP
Sbjct: 661 LEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLSGVP 720

Query: 721 SFRLEPLFRRRVSKIYHVLEQA 734
           SFRLEPLFRRRVSKIYHVLEQA
Sbjct: 721 SFRLEPLFRRRVSKIYHVLEQA 741

BLAST of HG10008107 vs. NCBI nr
Match: XP_038879046.1 (protein PHOX1-like isoform X3 [Benincasa hispida])

HSP 1 Score: 1323.1 bits (3423), Expect = 0.0e+00
Identity = 683/734 (93.05%), Postives = 707/734 (96.32%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEEGNKLFQSRDL 60
           MGKQSG KKKQIGDKFR+A+SKHRQ+GDG+PSYDKDHVIFITMSQ+LKEEGNKLFQSRDL
Sbjct: 1   MGKQSGKKKKQIGDKFREAVSKHRQNGDGNPSYDKDHVIFITMSQILKEEGNKLFQSRDL 60

Query: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSKA 120
           EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGL+EYPRAIHECNLALEVTPKYSKA
Sbjct: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLTEYPRAIHECNLALEVTPKYSKA 120

Query: 121 LLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKLP 180
           LLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTKALEMKGSKE+DA+IKLP
Sbjct: 121 LLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKALEMKGSKEEDADIKLP 180

Query: 181 LDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKADE 240
           LDFVELPS+ S QK  KEKNRKKKNNQKTKETIDEK          KADETVE EEK DE
Sbjct: 181 LDFVELPSTMSPQK-RKEKNRKKKNNQKTKETIDEK----------KADETVEGEEKPDE 240

Query: 241 IIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQ 300
           I+VGEEKVDE VEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQ
Sbjct: 241 IMVGEEKVDEMVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQ 300

Query: 301 LREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPERD 360
           LREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+D
Sbjct: 301 LREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQD 360

Query: 361 PFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESGP 420
           PFY+RFKN++ AKCEVEENSIFENGHVLK KEIKMSSCI+DWIIQFAQLFINHVGF+SGP
Sbjct: 361 PFYERFKNNEVAKCEVEENSIFENGHVLKPKEIKMSSCINDWIIQFAQLFINHVGFDSGP 420

Query: 421 YLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTVY 480
           YLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARK VY
Sbjct: 421 YLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKKVY 480

Query: 481 FADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKLS 540
           FADGGSKV VLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALG+QQFEQAK+S
Sbjct: 481 FADGGSKVHVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGKQQFEQAKVS 540

Query: 541 WHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQK 600
           WHYAVSSDVDPKTWPC +VMQLYNSAEENMETGMK+WEEWEEQRTGELSKSSNV+TQLQK
Sbjct: 541 WHYAVSSDVDPKTWPCTKVMQLYNSAEENMETGMKLWEEWEEQRTGELSKSSNVRTQLQK 600

Query: 601 MGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEKF 660
           MGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP WHECLEVAVEKF
Sbjct: 601 MGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPTWHECLEVAVEKF 660

Query: 661 ELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPLF 720
           ELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLL+GVPSFRLEPLF
Sbjct: 661 ELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLSGVPSFRLEPLF 720

Query: 721 RRRVSKIYHVLEQA 734
           RRRVSKIYHVLEQA
Sbjct: 721 RRRVSKIYHVLEQA 723

BLAST of HG10008107 vs. NCBI nr
Match: XP_038879045.1 (protein PHOX1-like isoform X2 [Benincasa hispida])

HSP 1 Score: 1321.6 bits (3419), Expect = 0.0e+00
Identity = 682/734 (92.92%), Postives = 707/734 (96.32%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEEGNKLFQSRDL 60
           MGKQSG KKKQIGDKFR+A+SKHRQ+GDG+PSYDKDHVIFITMSQ+LKEEGNKLFQSRDL
Sbjct: 1   MGKQSGKKKKQIGDKFREAVSKHRQNGDGNPSYDKDHVIFITMSQILKEEGNKLFQSRDL 60

Query: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSKA 120
           EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGL+EYPRAIHECNLALEVTPKYSKA
Sbjct: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLTEYPRAIHECNLALEVTPKYSKA 120

Query: 121 LLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKLP 180
           LLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTKALEMKGSKE+DA+IKLP
Sbjct: 121 LLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKALEMKGSKEEDADIKLP 180

Query: 181 LDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKADE 240
           LDFVELPS+ S QK  KEKNRKKKNNQKTKETIDEK          KADETVE EEK DE
Sbjct: 181 LDFVELPSTMSPQK-RKEKNRKKKNNQKTKETIDEK----------KADETVEGEEKPDE 240

Query: 241 IIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQ 300
           I+VGEEKVDE VEE+KAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQ
Sbjct: 241 IMVGEEKVDEMVEEEKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQ 300

Query: 301 LREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPERD 360
           LREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+D
Sbjct: 301 LREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQD 360

Query: 361 PFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESGP 420
           PFY+RFKN++ AKCEVEENSIFENGHVLK KEIKMSSCI+DWIIQFAQLFINHVGF+SGP
Sbjct: 361 PFYERFKNNEVAKCEVEENSIFENGHVLKPKEIKMSSCINDWIIQFAQLFINHVGFDSGP 420

Query: 421 YLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTVY 480
           YLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARK VY
Sbjct: 421 YLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKKVY 480

Query: 481 FADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKLS 540
           FADGGSKV VLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALG+QQFEQAK+S
Sbjct: 481 FADGGSKVHVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGKQQFEQAKVS 540

Query: 541 WHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQK 600
           WHYAVSSDVDPKTWPC +VMQLYNSAEENMETGMK+WEEWEEQRTGELSKSSNV+TQLQK
Sbjct: 541 WHYAVSSDVDPKTWPCTKVMQLYNSAEENMETGMKLWEEWEEQRTGELSKSSNVRTQLQK 600

Query: 601 MGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEKF 660
           MGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP WHECLEVAVEKF
Sbjct: 601 MGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPTWHECLEVAVEKF 660

Query: 661 ELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPLF 720
           ELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLL+GVPSFRLEPLF
Sbjct: 661 ELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLSGVPSFRLEPLF 720

Query: 721 RRRVSKIYHVLEQA 734
           RRRVSKIYHVLEQA
Sbjct: 721 RRRVSKIYHVLEQA 723

BLAST of HG10008107 vs. NCBI nr
Match: KAA0050062.1 (Protein unc-45-A-like protein [Cucumis melo var. makuwa] >TYK10348.1 Protein unc-45-A-like protein [Cucumis melo var. makuwa])

HSP 1 Score: 1297.3 bits (3356), Expect = 0.0e+00
Identity = 670/735 (91.16%), Postives = 692/735 (94.15%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGS-PSYDKDHVIFITMSQVLKEEGNKLFQSRD 60
           MGKQ+G KKKQIGDKFR+AISKHRQ+GDGS PSYDKDHVIFITMSQVLK+EGNKLFQSRD
Sbjct: 1   MGKQTGKKKKQIGDKFREAISKHRQNGDGSCPSYDKDHVIFITMSQVLKDEGNKLFQSRD 60

Query: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120
           LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK
Sbjct: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120

Query: 121 ALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKL 180
           ALLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTKA+EMKGSKEDDAEIKL
Sbjct: 121 ALLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKAIEMKGSKEDDAEIKL 180

Query: 181 PLDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKAD 240
           PLDFVELPSS   QK PKEKNRKKKNNQKTKETIDEKK DETVEE               
Sbjct: 181 PLDFVELPSSVLPQKKPKEKNRKKKNNQKTKETIDEKKVDETVEE--------------- 240

Query: 241 EIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLL 300
                E+KVDE VEEKKAEDKLVVEEKIT +QEETP  TVKLVFGEDIRWAQLP+DCTLL
Sbjct: 241 -----EKKVDEMVEEKKAEDKLVVEEKITTSQEETPTNTVKLVFGEDIRWAQLPVDCTLL 300

Query: 301 QLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPER 360
           QLREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+
Sbjct: 301 QLREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQ 360

Query: 361 DPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESG 420
           DPFY+RFKND+ A+CEVEENSI ENGH+LK+KEIKMSSCIDDWIIQFAQLFINHVGFESG
Sbjct: 361 DPFYERFKNDEVARCEVEENSILENGHILKSKEIKMSSCIDDWIIQFAQLFINHVGFESG 420

Query: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTV 480
           PYLDLHDLGMKLYSEAVEETVTSEEAQ LF+LAAEKFHEMAALALFNWGNVIM+RARK V
Sbjct: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQSLFKLAAEKFHEMAALALFNWGNVIMARARKKV 480

Query: 481 YFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540
           YFADGGSKVRVLEQIKAAFDWVE EYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL
Sbjct: 481 YFADGGSKVRVLEQIKAAFDWVENEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540

Query: 541 SWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQ 600
           SWHYAVSSDVDPK WPC E+MQLYNSAEENMETGMKMWEEWEEQRT ELSKS+NVKTQLQ
Sbjct: 541 SWHYAVSSDVDPKMWPCTEIMQLYNSAEENMETGMKMWEEWEEQRTSELSKSNNVKTQLQ 600

Query: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660
           KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP WHECLEVAVEK
Sbjct: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPTWHECLEVAVEK 660

Query: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPL 720
           FELAGASATDIAVMIK+HCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLLTGVPSFRLEPL
Sbjct: 661 FELAGASATDIAVMIKSHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLTGVPSFRLEPL 715

Query: 721 FRRRVSKIYHVLEQA 734
           FRRRVSKIY+VLEQA
Sbjct: 721 FRRRVSKIYNVLEQA 715

BLAST of HG10008107 vs. NCBI nr
Match: XP_004146713.1 (protein PHOX1 [Cucumis sativus] >XP_011655705.1 protein PHOX1 [Cucumis sativus] >KGN65196.1 hypothetical protein Csa_020137 [Cucumis sativus])

HSP 1 Score: 1286.9 bits (3329), Expect = 0.0e+00
Identity = 666/733 (90.86%), Postives = 687/733 (93.72%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGS-PSYDKDHVIFITMSQVLKEEGNKLFQSRD 60
           MGKQSG KKKQIGDKFR+AI+KHRQ+GDGS P+YDKDHVIFITMSQVLK+EGNKLFQSRD
Sbjct: 1   MGKQSGKKKKQIGDKFREAIAKHRQNGDGSCPTYDKDHVIFITMSQVLKDEGNKLFQSRD 60

Query: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120
           LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK
Sbjct: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120

Query: 121 ALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKL 180
           ALLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTK +EMKGS EDD EIKL
Sbjct: 121 ALLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKEIEMKGSNEDDVEIKL 180

Query: 181 PLDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKAD 240
           PLDF ELPSS S QK PKEKNRKKKNNQKTKE IDEKK DETVE                
Sbjct: 181 PLDFGELPSSVSPQKKPKEKNRKKKNNQKTKEIIDEKKVDETVE---------------- 240

Query: 241 EIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLL 300
                E+KVDE VEEKKAEDKLVVEEKI+ TQEETP  TVKLVFGEDIRWAQLP+DCTLL
Sbjct: 241 -----EKKVDEMVEEKKAEDKLVVEEKIS-TQEETPTNTVKLVFGEDIRWAQLPVDCTLL 300

Query: 301 QLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPER 360
           QLREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+
Sbjct: 301 QLREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQ 360

Query: 361 DPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESG 420
           DPFY+RFKND+ AKCEVEENSIFENGH LK+KEIKMSSCIDDWIIQFAQLFINHVGFESG
Sbjct: 361 DPFYQRFKNDEVAKCEVEENSIFENGHALKSKEIKMSSCIDDWIIQFAQLFINHVGFESG 420

Query: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTV 480
           PYLDLHDLGMKLYSEAVEETVTSEEAQ LFELAAEKFHEMAALALFNWGNVIM++ARK V
Sbjct: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQSLFELAAEKFHEMAALALFNWGNVIMAKARKKV 480

Query: 481 YFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540
           YFADGGSKVRVLEQIKAAF+WVE EYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL
Sbjct: 481 YFADGGSKVRVLEQIKAAFEWVENEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540

Query: 541 SWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQ 600
           SWHYAVSSDVDPKTWPC EVM+LYNSAEENMETGMKMWEEWEEQRT ELSKS+N+KTQLQ
Sbjct: 541 SWHYAVSSDVDPKTWPCTEVMELYNSAEENMETGMKMWEEWEEQRTSELSKSNNIKTQLQ 600

Query: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660
           KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK
Sbjct: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660

Query: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPL 720
           FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLLTGVPSFRLEPL
Sbjct: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLTGVPSFRLEPL 711

Query: 721 FRRRVSKIYHVLE 732
           FRRRVSKIYHVLE
Sbjct: 721 FRRRVSKIYHVLE 711

BLAST of HG10008107 vs. ExPASy Swiss-Prot
Match: F4IRM4 (Protein PHOX1 OS=Arabidopsis thaliana OX=3702 GN=PHOX1 PE=1 SV=1)

HSP 1 Score: 731.5 bits (1887), Expect = 9.5e-210
Identity = 409/754 (54.24%), Postives = 526/754 (69.76%), Query Frame = 0

Query: 1   MGKQSGKKKQI--------------GDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVL 60
           MGK +GKKK                G K  K+  +       + S+D D  IFI  +  L
Sbjct: 1   MGKPTGKKKNNNYTEMPPTESSTTGGGKTGKSFDR-----SATKSFDDDMTIFINRALEL 60

Query: 61  KEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHEC 120
           KEEGNKLFQ RD EGAM +YDKA+KLLPR+H DV+YLR++MA+CYMQMGL EYP AI+EC
Sbjct: 61  KEEGNKLFQKRDYEGAMFRYDKAVKLLPRDHGDVAYLRTSMASCYMQMGLGEYPNAINEC 120

Query: 121 NLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEM 180
           NLALE +P++SKALLKRARCYE L +LD A RD + VL+MEP N+ A EI ER+ K L  
Sbjct: 121 NLALEASPRFSKALLKRARCYEALNKLDFAFRDSRVVLNMEPENVSANEIFERVKKVLVG 180

Query: 181 KGSKEDDAEIKLPLDFVELPSSASSQKGPKEKNRKKKNNQKTKE---TIDEKKADETVEE 240
           KG   D+ E  L    V+   +A  +K  KE+ RKKK    T        E+K+ E V E
Sbjct: 181 KGIDVDEMEKNLV--NVQPVGAARLRKIVKERLRKKKKKSMTMTNGGNDGERKSVEAVVE 240

Query: 241 EEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQE----ETPMKTVK 300
           + K D   E         V   +  + +EEKK EDK+ V +K     E     T  +TVK
Sbjct: 241 DAKVDNGEE---------VDSGRKGKAIEEKKLEDKVAVMDKEVIASEIKEDATVTRTVK 300

Query: 301 LVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAET 360
           LV G+DIRWAQLP+D +++ +R+VI+DRFP     LIKYRD EGDLVTITT +ELRLA +
Sbjct: 301 LVHGDDIRWAQLPLDSSVVLVRDVIKDRFPALKGFLIKYRDSEGDLVTITTTDELRLAAS 360

Query: 361 SKLSQGSVRFYIFEVNPERDPFYKRFKNDKAA-KCEVEENSIFENGHVLKAKEI-KMSSC 420
           ++   GS R YI EV+P ++P Y    ND++  K     +S+ +NG V    E  K S+ 
Sbjct: 361 TREKLGSFRLYIAEVSPNQEPTYDVIDNDESTDKFAKGSSSVADNGSVGDFVESEKASTS 420

Query: 421 IDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHE 480
           ++ WI QFAQLF NHVGF+S  YL+LH+LGMKLY+EA+E+ VT E+AQ LF++AA+KF E
Sbjct: 421 LEHWIFQFAQLFKNHVGFDSDSYLELHNLGMKLYTEAMEDIVTGEDAQELFDIAADKFQE 480

Query: 481 MAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEI 540
           MAALA+FNWGNV MS+AR+ +YF + GS+  +LE+++A F+W + EY +A  KY+ AV+I
Sbjct: 481 MAALAMFNWGNVHMSKARRQIYFPEDGSRETILEKVEAGFEWAKNEYNKAAEKYEGAVKI 540

Query: 541 KPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWE 600
           K DFYE  LALGQQQFEQAKL W++A+S +VD ++    +V++LYN AEE+ME GM++WE
Sbjct: 541 KSDFYEALLALGQQQFEQAKLCWYHALSGEVDIESDASQDVLKLYNKAEESMEKGMQIWE 600

Query: 601 EWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERS 660
           E EE+R   +S     K  LQK+GLDG+  + S +E+AEQ  NM S INLLWG++LYERS
Sbjct: 601 EMEERRLNGISNFDKHKELLQKLGLDGIFSEASDEESAEQTANMSSQINLLWGSLLYERS 660

Query: 661 ILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAW 720
           I+E+K+GLP W ECLEVAVEKFELAGASATDIAVM+KNHCSS+N+ EG+GFKIDEIVQAW
Sbjct: 661 IVEYKLGLPTWDECLEVAVEKFELAGASATDIAVMVKNHCSSDNALEGMGFKIDEIVQAW 720

Query: 721 NEMYDARKLLTGVPSFRLEPLFRRRVSKIYHVLE 732
           NEMYDA++   GVPSFRLEPLFRRR  K++ +LE
Sbjct: 721 NEMYDAKRWQIGVPSFRLEPLFRRRSPKLHDILE 738

BLAST of HG10008107 vs. ExPASy Swiss-Prot
Match: F4JTI1 (Protein PHOX4 OS=Arabidopsis thaliana OX=3702 GN=PHOX4 PE=2 SV=1)

HSP 1 Score: 699.5 bits (1804), Expect = 4.0e-200
Identity = 400/782 (51.15%), Postives = 531/782 (67.90%), Query Frame = 0

Query: 1   MGKQSGKKKQI-----------GDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEE 60
           MGK + KKK             G   +   + HR +   S  +D+D  IFI+ +  LKEE
Sbjct: 1   MGKPTAKKKNPETPKDASGGGGGGGGKSGKTYHRST---SRVFDEDMEIFISRALELKEE 60

Query: 61  GNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLA 120
           GNKLFQ RD EGAML +DKALKLLP++HIDV+YLR++MA+CYMQMGL EYP AI ECNLA
Sbjct: 61  GNKLFQKRDHEGAMLSFDKALKLLPKDHIDVAYLRTSMASCYMQMGLGEYPNAISECNLA 120

Query: 121 LEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGS 180
           LE +P+YSKAL++R+RCYE L +LD A RD + VL+MEP N+ A EI +R+ K L  KG 
Sbjct: 121 LEASPRYSKALVRRSRCYEALNKLDYAFRDARIVLNMEPGNVSANEIFDRVKKVLVDKGI 180

Query: 181 KEDDAEIKLPLDFVELP--SSASSQKGPKEKNRKKKNNQKTKETIDEKKADETV------ 240
             D+ E     DFV++    +A  +K  KE+ RK K  +K+    +E K+ + V      
Sbjct: 181 DVDEME----KDFVDVQPVCAARLKKIVKERLRKSKKKKKSGGKDEELKSPKVVVVDKGD 240

Query: 241 -------EEEEKADET---------VEEEEKADEIIVGEEKV---DETVEEKKAEDKLVV 300
                   +EEK+D++          EE++ + +   G++K    ++  EE+K EDK+VV
Sbjct: 241 EAEGRNKPKEEKSDKSDIDGKIGGKREEKKTSFKSDKGQKKKSGGNKAGEERKVEDKVVV 300

Query: 301 EEKI-----------TRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTC 360
            +K            ++ +  T  +T+KLV G+DIRWAQLP+D T+  +R+VIRDRFP  
Sbjct: 301 MDKEVIASEIVDGGGSKKEGATVTRTIKLVHGDDIRWAQLPLDSTVRLVRDVIRDRFPAL 360

Query: 361 MAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPERDPFYKRFKNDKAA 420
              LIKYRD EGDLVTITT +ELRLA ++    GS+R YI EVNP+++P Y    N ++ 
Sbjct: 361 RGFLIKYRDTEGDLVTITTTDELRLAASTHDKLGSLRLYIAEVNPDQEPTYDGMSNTEST 420

Query: 421 -KCEVEENSIFENGHVLK-AKEIKMSSCIDDWIIQFAQLFINHVGFESGPYLDLHDLGMK 480
            K     +S+ +NG V +     K S C ++WI QFAQLF NHVGF+S  Y+DLHDLGMK
Sbjct: 421 DKVSKRLSSLADNGSVGEYVGSDKASGCFENWIFQFAQLFKNHVGFDSDSYVDLHDLGMK 480

Query: 481 LYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTVYFADGGSKVRV 540
           LY+EA+E+ VT E+AQ LF++AA+KF EM ALAL NWGNV MS+ARK V   +  S+  +
Sbjct: 481 LYTEAMEDAVTGEDAQELFQIAADKFQEMGALALLNWGNVHMSKARKQVCIPEDASREAI 540

Query: 541 LEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKLSWHYAVSSDVD 600
           +E ++AAF W + EY +A  KY+ A+++KPDFYE  LALGQ+QFE AKL W++A+ S VD
Sbjct: 541 IEAVEAAFVWTQNEYNKAAEKYEEAIKVKPDFYEALLALGQEQFEHAKLCWYHALKSKVD 600

Query: 601 PKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQKMGLDGLIKDI 660
            ++    EV++LYN AE++ME GM++WEE EE R   +SK    K  L+K+ LD L  + 
Sbjct: 601 LESEASQEVLKLYNKAEDSMERGMQIWEEMEECRLNGISKLDKHKNMLRKLELDELFSEA 660

Query: 661 SVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEKFELAGASATDI 720
           S +E  EQ  NM S INLLWG++LYERSI+E+K+GLP W ECLEVAVEKFELAGASATDI
Sbjct: 661 SEEETVEQTANMSSQINLLWGSLLYERSIVEYKLGLPTWDECLEVAVEKFELAGASATDI 720

Query: 721 AVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPLFRRRVSKIYHV 732
           AVM+KNHCSS ++ EG+GFKIDEIVQAWNEMYDA++   GVPSFRLEP+FRRR  K++ +
Sbjct: 721 AVMVKNHCSSESALEGMGFKIDEIVQAWNEMYDAKRWQMGVPSFRLEPMFRRRAPKLHDI 775

BLAST of HG10008107 vs. ExPASy Swiss-Prot
Match: K7TQE3 (HSP-interacting protein OS=Zea mays OX=4577 GN=HIP PE=1 SV=1)

HSP 1 Score: 580.1 bits (1494), Expect = 3.5e-164
Identity = 335/736 (45.52%), Postives = 471/736 (63.99%), Query Frame = 0

Query: 33  DKDHVIFITMSQVLKEEGNKLFQSRDLEGAMLKYDKALKLLPR-NHIDVSYLRSNMAACY 92
           D D  +F+ +S+ LKEEG +LF  RD EGA  KYDKA++LLP    ++ ++LR+++A CY
Sbjct: 16  DGDDAVFLELSRELKEEGTRLFNRRDFEGAAFKYDKAVQLLPAGRRVEAAHLRASIAHCY 75

Query: 93  MQMGLSEYPRAIHECNLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNI 152
           M+M  +E+  AIHECNLALE  P+YS+ALL+RA C+E L R DLA  D++ VL  EP N 
Sbjct: 76  MRMSPAEFHHAIHECNLALEAVPRYSRALLRRAACFEALGRPDLAWGDIRTVLRWEPGNR 135

Query: 153 MALEISERLTKALEMKGSKEDDAEIKLPLDFVELPSSASSQKGPKEKNRKKKNNQK---- 212
            A +IS+R+  ALE KG       I + LD   LP   +     K + RKK  N++    
Sbjct: 136 AARQISDRVRTALEDKG-------ISVALDV--LPEDENEIASAKGEERKKSRNKRFDSV 195

Query: 213 ------------TKETIDEKKA------------DETVEEEEKADETVEEEEKADEIIVG 272
                        +    EK+A            D T + E    E +E+  +  E  +G
Sbjct: 196 AGGREGENGIALLESASTEKQAGPRQTNGTGNHQDHTEDSESNGLEKLEQSTETGEKDMG 255

Query: 273 EEKVDETVEEKK--AEDKLVVEEKITRTQE-----ETPMKTVKLVFGEDIRWAQLPIDCT 332
           +++      +K    E K      +   Q+     E  MK VKLVFGEDIR AQ+P +C+
Sbjct: 256 KKRGAHAAGKKPRCGESKQQKHSAVNHCQDNIGAKEEVMKDVKLVFGEDIRCAQMPANCS 315

Query: 333 LLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNP 392
           L QLRE+++++FP+  A LIKY+D+E DLVTIT +EEL  A    +SQ  +RFY+ EVN 
Sbjct: 316 LPQLREIVQNKFPSLKAFLIKYKDKEEDLVTITLSEELSWASNLAVSQVPIRFYVVEVNH 375

Query: 393 ERDPFYKRFKNDKA-AKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGF 452
            ++      +   + A  E   + + +NG +    +++     DDW++QFAQ+F NHVGF
Sbjct: 376 VQELGVDGVRRRPSFATLERNRDIMLDNGTI--GHDVEHKHYADDWMVQFAQIFKNHVGF 435

Query: 453 ESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRAR 512
            S  YLDLHDLG++L+ EA+E+T+  EEAQ +FE+A  KF EMAALALFN GNV MSRAR
Sbjct: 436 SSDAYLDLHDLGLRLHYEAMEDTIQREEAQEIFEVAESKFKEMAALALFNCGNVHMSRAR 495

Query: 513 KTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQ 572
           +    A+   +  +LE++  ++DW   EYA+A   ++ AV+ K DF+EG +ALGQQ+FEQ
Sbjct: 496 RRPCLAEDPLQEFILEKVNVSYDWACTEYAKAGAMFEEAVKTKSDFFEGLIALGQQKFEQ 555

Query: 573 AKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKT 632
           AKLSW+YA++  ++ +T    EV++L+N AE+NME GM MWE  E  R   LSK S  K 
Sbjct: 556 AKLSWYYALACKINMET----EVLELFNHAEDNMEKGMDMWERMETLRLKGLSKPSKEKV 615

Query: 633 QLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVA 692
            L+KM L+G +KDIS DEA EQA ++RSHIN+LWGT+LYERS++EF +GLP+W E L VA
Sbjct: 616 VLEKMVLEGFVKDISADEAFEQASSIRSHINILWGTILYERSVVEFNLGLPSWEESLTVA 675

Query: 693 VEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRL 732
           +EKF++ GAS  DI V++KNHC++  + EGL FK++EIVQAW+EM+DA+   +G   FRL
Sbjct: 676 MEKFKIGGASQADINVIVKNHCANETTQEGLSFKVEEIVQAWSEMHDAKNWRSGPLYFRL 735

BLAST of HG10008107 vs. ExPASy Swiss-Prot
Match: F4K487 (Protein PHOX3 OS=Arabidopsis thaliana OX=3702 GN=PHOX3 PE=1 SV=1)

HSP 1 Score: 533.5 bits (1373), Expect = 3.8e-150
Identity = 305/695 (43.88%), Postives = 457/695 (65.76%), Query Frame = 0

Query: 40  ITMSQVLKEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEY 99
           ++ +Q LKEEGNKLFQ RD +GAM KY +A+K+LP++H++VS++R+N+A+CYMQ+   E+
Sbjct: 123 VSKAQGLKEEGNKLFQKRDYDGAMFKYGEAIKILPKDHVEVSHVRANVASCYMQLEPGEF 182

Query: 100 PRAIHECNLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISER 159
            +AIHEC+LAL VTP ++KALLKRARCYE L +LDLALRDV  V  ++P N MA EI E+
Sbjct: 183 AKAIHECDLALSVTPDHNKALLKRARCYEALNKLDLALRDVCMVSKLDPKNPMASEIVEK 242

Query: 160 LTKALEMKGSKEDDAEIKLPLDFVE----LPSSASSQKGPKEKNRKKKNNQKTKETIDEK 219
           L + LE KG + +++ I+LP D+VE     P++  ++ G     + KK+NQ  +++  E 
Sbjct: 243 LKRTLESKGLRINNSVIELPPDYVEPVGASPAALWAKLGKVRVKKTKKSNQVEEKSEGE- 302

Query: 220 KADETVEEEEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPM 279
              E VE E+K +   E+ ++  ++ V  ++ D+  +  K ++K+++EE++     E   
Sbjct: 303 --GEDVEPEKKNNVLAEKGKEKIKMKVKGKQSDKRSDTSKEQEKVIIEEELLVIGVEDVN 362

Query: 280 KTVKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELR 339
           K VK V+ +DIR A+LPI+CTL +LREV+ +RFP+  AV IKYRD+EGDLVTITT+EELR
Sbjct: 363 KDVKFVYSDDIRLAELPINCTLFKLREVVHERFPSLRAVHIKYRDQEGDLVTITTDEELR 422

Query: 340 LAETSKLSQGSVRFYIFEVNPERDPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMS 399
           ++E S  SQG++RFY+ EV+PE+DPF+ R    K  K   +           KAK     
Sbjct: 423 MSEVSSRSQGTMRFYVVEVSPEQDPFFGRLVEMKKLKITADS---------FKAKVNGRG 482

Query: 400 SC-IDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEK 459
            C ++DW+I+FA LF      +S   L+L +LGMKL SEA+EE VTS+ AQG F+ AA++
Sbjct: 483 GCKVEDWMIEFAHLFKIQARIDSDRCLNLQELGMKLNSEAMEEVVTSDAAQGPFDRAAQQ 542

Query: 460 FHEMAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMA 519
           F E+AA +L N G V MS ARK +    G S   V EQ+K A++  +KE+A A+ KY+ A
Sbjct: 543 FQEVAARSLLNLGYVHMSGARKRLSLLQGVSGESVSEQVKTAYECAKKEHANAKEKYEEA 602

Query: 520 VEIKPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMK 579
           ++IKP+ +E +LALG QQFE+A+LSW+Y + S +D KTWP A+V+Q Y SAE N++  M+
Sbjct: 603 MKIKPECFEVFLALGLQQFEEARLSWYYVLVSHLDLKTWPYADVVQFYQSAESNIKKSME 662

Query: 580 MWEEWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLY 639
           + E  E  +  E S++        +  L    +    +  A++A  ++S I++L   +LY
Sbjct: 663 VLENLETGKESEPSQAGKTDCLTHEKDLGSSTQ----NNPAKEAGRLKSWIDILLCAVLY 722

Query: 640 ERSILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIV 699
           ERSI+E+K+  P W E LE A+EKFELAG    D+  +I     + N+   + F ++EI+
Sbjct: 723 ERSIMEYKLDQPFWRESLEAAMEKFELAGTCKDDVVEIISEDYVAGNTLRDIRFHMEEII 782

Query: 700 QAWNEMYDARKLLTGVPSFRLEPLFRRRVSKIYHV 730
           Q ++E+Y+A+    G+PS +LE + +RR   I+HV
Sbjct: 783 QIFDEIYEAKHWTNGIPSDQLEEILKRRAENIFHV 801

BLAST of HG10008107 vs. ExPASy Swiss-Prot
Match: O48802 (Protein CLMP1 OS=Arabidopsis thaliana OX=3702 GN=CLMP1 PE=1 SV=1)

HSP 1 Score: 425.6 bits (1093), Expect = 1.1e-117
Identity = 288/744 (38.71%), Postives = 417/744 (56.05%), Query Frame = 0

Query: 1   MGKQSGKKKQIG--DKFRKAISKHRQSGDGSPS------YDKDHVIFITMSQVLKEEGNK 60
           MGK  G+KK+ G  +     ++    SG   PS       D D  IF+  +  LKEEGNK
Sbjct: 1   MGKSGGRKKKSGGSNSNSSQVNSSETSGLSKPSTIVNGGVDFDASIFLKRAHELKEEGNK 60

Query: 61  LFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEV 120
            FQ+RD  GA+ +Y+  +KL+P++H D +   SN AAC MQM   +Y   I EC++AL+ 
Sbjct: 61  KFQARDYVGALEQYENGIKLIPKSHPDRAVFHSNRAACLMQMKPIDYESVISECSMALKS 120

Query: 121 TPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKED 180
            P +++ALL+RAR +E + + DLA++DV  +L  +PN+  A EIS+RL  AL   G  +D
Sbjct: 121 QPGFTRALLRRARAFEAVGKFDLAVQDVNVLLGSDPNHKDAGEISKRLKTAL---GPHQD 180

Query: 181 DAEIKLPLDFVELPSSAS-----SQKGP--KEKNRKKKNNQKTKETIDEKKADETVEEEE 240
                 P     L +SA+     +  GP    +N  KK       ++    A     E  
Sbjct: 181 LQSRPSP---AALGASAALGGPIAGLGPCLPSRNVHKKGVTSPVGSVSLPNASNGKVERP 240

Query: 241 KADETVEEE----EKADEIIVGEEKVDETVEEKKAED----KLVVEEKITRTQEETPMKT 300
           +    V E      K     V  + V  + +  K E+     + V  K+   ++    + 
Sbjct: 241 QVVNPVTENGGSVSKGQASRVVLKPVSHSPKGSKVEELGSSSVAVVGKV--QEKRIRWRP 300

Query: 301 VKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLA 360
           +K V+  DIR  Q+P++C   +LRE++  RFP+  AVLIKY+D +GDLVTIT+  EL+LA
Sbjct: 301 LKFVYDHDIRLGQMPVNCRFKELREIVSSRFPSSKAVLIKYKDNDGDLVTITSTAELKLA 360

Query: 361 ETS-------------KLSQGSVRFYIFEVNPERDPFYKRFKNDKAAKCEVEENSIFENG 420
           E++               S G +R ++ +V+PE++P     + ++  +  V E  I    
Sbjct: 361 ESAADCILTKEPDTDKSDSVGMLRLHVVDVSPEQEPMLLEEEEEEVEEKPVIEEVISSPT 420

Query: 421 HVLKAKEI------------KMSSC---------IDDWIIQFAQLFINHVGFESGPYLDL 480
             L   EI            K SS          +DDW+  FA LF  HVG +   ++DL
Sbjct: 421 ESLSETEINTEKTDKEVEKEKASSSEDPETKELEMDDWLFDFAHLFRTHVGIDPDAHIDL 480

Query: 481 HDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTVYFADG 540
           H+LGM+L SEA+EETVTSE+AQ LF+ A+ KF E+AALA FNWGNV M  ARK +   + 
Sbjct: 481 HELGMELCSEALEETVTSEKAQPLFDKASAKFQEVAALAFFNWGNVHMCAARKRIPLDES 540

Query: 541 GSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKLSWHYA 600
             K  V  Q++ A++WV++ Y  A+ KY+ A+ IKPDFYEG LALGQQQFE AKL W Y 
Sbjct: 541 AGKEVVAAQLQTAYEWVKERYTLAKEKYEQALSIKPDFYEGLLALGQQQFEMAKLHWSYL 600

Query: 601 VSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGEL-----SKSSNVKTQLQ 660
           ++  +D   W  +E + L++SAE  M+   +MWE+ EEQR  +L     +K   V  + +
Sbjct: 601 LAQKIDISGWDPSETLNLFDSAEAKMKDATEMWEKLEEQRMDDLKNPNSNKKEEVSKRRK 660

Query: 661 KMGLDG---LIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVA 680
           K G DG   + + I+ +EAAEQA  MRS I+L WG ML+ERS +E K+G   W++ L+ A
Sbjct: 661 KQGGDGNEEVSETITAEEAAEQATAMRSQIHLFWGNMLFERSQVECKIGKDGWNKNLDSA 720

BLAST of HG10008107 vs. ExPASy TrEMBL
Match: A0A5A7U2K2 (Protein unc-45-A-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold367G00250 PE=4 SV=1)

HSP 1 Score: 1297.3 bits (3356), Expect = 0.0e+00
Identity = 670/735 (91.16%), Postives = 692/735 (94.15%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGS-PSYDKDHVIFITMSQVLKEEGNKLFQSRD 60
           MGKQ+G KKKQIGDKFR+AISKHRQ+GDGS PSYDKDHVIFITMSQVLK+EGNKLFQSRD
Sbjct: 1   MGKQTGKKKKQIGDKFREAISKHRQNGDGSCPSYDKDHVIFITMSQVLKDEGNKLFQSRD 60

Query: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120
           LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK
Sbjct: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120

Query: 121 ALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKL 180
           ALLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTKA+EMKGSKEDDAEIKL
Sbjct: 121 ALLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKAIEMKGSKEDDAEIKL 180

Query: 181 PLDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKAD 240
           PLDFVELPSS   QK PKEKNRKKKNNQKTKETIDEKK DETVEE               
Sbjct: 181 PLDFVELPSSVLPQKKPKEKNRKKKNNQKTKETIDEKKVDETVEE--------------- 240

Query: 241 EIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLL 300
                E+KVDE VEEKKAEDKLVVEEKIT +QEETP  TVKLVFGEDIRWAQLP+DCTLL
Sbjct: 241 -----EKKVDEMVEEKKAEDKLVVEEKITTSQEETPTNTVKLVFGEDIRWAQLPVDCTLL 300

Query: 301 QLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPER 360
           QLREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+
Sbjct: 301 QLREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQ 360

Query: 361 DPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESG 420
           DPFY+RFKND+ A+CEVEENSI ENGH+LK+KEIKMSSCIDDWIIQFAQLFINHVGFESG
Sbjct: 361 DPFYERFKNDEVARCEVEENSILENGHILKSKEIKMSSCIDDWIIQFAQLFINHVGFESG 420

Query: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTV 480
           PYLDLHDLGMKLYSEAVEETVTSEEAQ LF+LAAEKFHEMAALALFNWGNVIM+RARK V
Sbjct: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQSLFKLAAEKFHEMAALALFNWGNVIMARARKKV 480

Query: 481 YFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540
           YFADGGSKVRVLEQIKAAFDWVE EYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL
Sbjct: 481 YFADGGSKVRVLEQIKAAFDWVENEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540

Query: 541 SWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQ 600
           SWHYAVSSDVDPK WPC E+MQLYNSAEENMETGMKMWEEWEEQRT ELSKS+NVKTQLQ
Sbjct: 541 SWHYAVSSDVDPKMWPCTEIMQLYNSAEENMETGMKMWEEWEEQRTSELSKSNNVKTQLQ 600

Query: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660
           KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP WHECLEVAVEK
Sbjct: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPTWHECLEVAVEK 660

Query: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPL 720
           FELAGASATDIAVMIK+HCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLLTGVPSFRLEPL
Sbjct: 661 FELAGASATDIAVMIKSHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLTGVPSFRLEPL 715

Query: 721 FRRRVSKIYHVLEQA 734
           FRRRVSKIY+VLEQA
Sbjct: 721 FRRRVSKIYNVLEQA 715

BLAST of HG10008107 vs. ExPASy TrEMBL
Match: A0A0A0LYY0 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G263990 PE=4 SV=1)

HSP 1 Score: 1286.9 bits (3329), Expect = 0.0e+00
Identity = 666/733 (90.86%), Postives = 687/733 (93.72%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGS-PSYDKDHVIFITMSQVLKEEGNKLFQSRD 60
           MGKQSG KKKQIGDKFR+AI+KHRQ+GDGS P+YDKDHVIFITMSQVLK+EGNKLFQSRD
Sbjct: 1   MGKQSGKKKKQIGDKFREAIAKHRQNGDGSCPTYDKDHVIFITMSQVLKDEGNKLFQSRD 60

Query: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120
           LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK
Sbjct: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120

Query: 121 ALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKL 180
           ALLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTK +EMKGS EDD EIKL
Sbjct: 121 ALLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKEIEMKGSNEDDVEIKL 180

Query: 181 PLDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKAD 240
           PLDF ELPSS S QK PKEKNRKKKNNQKTKE IDEKK DETVE                
Sbjct: 181 PLDFGELPSSVSPQKKPKEKNRKKKNNQKTKEIIDEKKVDETVE---------------- 240

Query: 241 EIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLL 300
                E+KVDE VEEKKAEDKLVVEEKI+ TQEETP  TVKLVFGEDIRWAQLP+DCTLL
Sbjct: 241 -----EKKVDEMVEEKKAEDKLVVEEKIS-TQEETPTNTVKLVFGEDIRWAQLPVDCTLL 300

Query: 301 QLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPER 360
           QLREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+
Sbjct: 301 QLREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQ 360

Query: 361 DPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESG 420
           DPFY+RFKND+ AKCEVEENSIFENGH LK+KEIKMSSCIDDWIIQFAQLFINHVGFESG
Sbjct: 361 DPFYQRFKNDEVAKCEVEENSIFENGHALKSKEIKMSSCIDDWIIQFAQLFINHVGFESG 420

Query: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTV 480
           PYLDLHDLGMKLYSEAVEETVTSEEAQ LFELAAEKFHEMAALALFNWGNVIM++ARK V
Sbjct: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQSLFELAAEKFHEMAALALFNWGNVIMAKARKKV 480

Query: 481 YFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540
           YFADGGSKVRVLEQIKAAF+WVE EYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL
Sbjct: 481 YFADGGSKVRVLEQIKAAFEWVENEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540

Query: 541 SWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQ 600
           SWHYAVSSDVDPKTWPC EVM+LYNSAEENMETGMKMWEEWEEQRT ELSKS+N+KTQLQ
Sbjct: 541 SWHYAVSSDVDPKTWPCTEVMELYNSAEENMETGMKMWEEWEEQRTSELSKSNNIKTQLQ 600

Query: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660
           KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK
Sbjct: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660

Query: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPL 720
           FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLLTGVPSFRLEPL
Sbjct: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLTGVPSFRLEPL 711

Query: 721 FRRRVSKIYHVLE 732
           FRRRVSKIYHVLE
Sbjct: 721 FRRRVSKIYHVLE 711

BLAST of HG10008107 vs. ExPASy TrEMBL
Match: A0A1S3B9Z6 (LOW QUALITY PROTEIN: uncharacterized protein LOC103487400 OS=Cucumis melo OX=3656 GN=LOC103487400 PE=4 SV=1)

HSP 1 Score: 1278.1 bits (3306), Expect = 0.0e+00
Identity = 662/735 (90.07%), Postives = 689/735 (93.74%), Query Frame = 0

Query: 1   MGKQSGKKKQ-IGDKFRKAISKHRQSGDGS-PSYDKDHVIFITMSQVLKEEGNKLFQSRD 60
           MGKQ+GKKK+ IGDKFR+AISKHRQ+GDGS PSYDKDHVIFITMSQVLK+EGNKLFQSRD
Sbjct: 1   MGKQTGKKKKLIGDKFREAISKHRQNGDGSCPSYDKDHVIFITMSQVLKDEGNKLFQSRD 60

Query: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120
           LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK
Sbjct: 61  LEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSK 120

Query: 121 ALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKL 180
           ALLKRARCYEGL RLDLALRDVKAVL+MEPNNIMALEISERLTKA+EMKGSKEDDAEIKL
Sbjct: 121 ALLKRARCYEGLHRLDLALRDVKAVLNMEPNNIMALEISERLTKAIEMKGSKEDDAEIKL 180

Query: 181 PLDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVEEEEKADETVEEEEKAD 240
           PLDFVELPSS +++K  K K  K+KNNQKTKETIDEKK DETVEE               
Sbjct: 181 PLDFVELPSSVAAKK-TKRKEPKEKNNQKTKETIDEKKVDETVEE--------------- 240

Query: 241 EIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQLPIDCTLL 300
                E+KVDE VEEKKAEDKLVVEEKIT +QEETP  TVKLVFGEDIRWAQLP+DCTLL
Sbjct: 241 -----EKKVDEMVEEKKAEDKLVVEEKITTSQEETPTNTVKLVFGEDIRWAQLPVDCTLL 300

Query: 301 QLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPER 360
           QLREVIRDRFPTC AVLIKYRDEEGDLVTITTNEELRLAETSK SQGSVRFYIFEVNPE+
Sbjct: 301 QLREVIRDRFPTCTAVLIKYRDEEGDLVTITTNEELRLAETSKESQGSVRFYIFEVNPEQ 360

Query: 361 DPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFINHVGFESG 420
           DPFY+RFKND+ A+CEVEENSI ENGH+LK+KEIKMSSCIDDWIIQFAQLFINHVGFESG
Sbjct: 361 DPFYERFKNDEVARCEVEENSILENGHILKSKEIKMSSCIDDWIIQFAQLFINHVGFESG 420

Query: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTV 480
           PYLDLHDLGMKLYSEAVEETVTSEEAQ LF+LAAEKFHEMAALALFNWGNVIM+RARK V
Sbjct: 421 PYLDLHDLGMKLYSEAVEETVTSEEAQSLFKLAAEKFHEMAALALFNWGNVIMARARKKV 480

Query: 481 YFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540
           YFADGGSKV VLEQIKAAFDWVE EYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL
Sbjct: 481 YFADGGSKVSVLEQIKAAFDWVENEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKL 540

Query: 541 SWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQ 600
           SWHYAVSSDVDPK WPC E+MQLYNSAEENMETGMKMWEEWEEQRT ELSKS+NVKTQLQ
Sbjct: 541 SWHYAVSSDVDPKMWPCTEIMQLYNSAEENMETGMKMWEEWEEQRTSELSKSNNVKTQLQ 600

Query: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEK 660
           KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP WHECLEVAVEK
Sbjct: 601 KMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPTWHECLEVAVEK 660

Query: 661 FELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVPSFRLEPL 720
           FELAGASATDIAVMIK+HCSSNNSHEGLGFKIDEIVQAWNEMY+ARKLLTGVPSFRLEPL
Sbjct: 661 FELAGASATDIAVMIKSHCSSNNSHEGLGFKIDEIVQAWNEMYEARKLLTGVPSFRLEPL 714

Query: 721 FRRRVSKIYHVLEQA 734
           FRRRVSKIY+VLEQA
Sbjct: 721 FRRRVSKIYNVLEQA 714

BLAST of HG10008107 vs. ExPASy TrEMBL
Match: A0A6J1JHE2 (protein PHOX1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111485121 PE=4 SV=1)

HSP 1 Score: 1229.5 bits (3180), Expect = 0.0e+00
Identity = 643/742 (86.66%), Postives = 684/742 (92.18%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEEGNKLFQSRDL 60
           MGKQ+G KKKQIGDK+R+AISK RQ G GS SYDKDHVIFITMSQ LKEEGNKLFQSRD+
Sbjct: 1   MGKQTGKKKKQIGDKYREAISKRRQGGVGSSSYDKDHVIFITMSQALKEEGNKLFQSRDV 60

Query: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSKA 120
           EGAMLKY+KALKLLPRNHIDVSYLRSNMAACYMQMGLSEYP+AIHECNLALEVTPKYSKA
Sbjct: 61  EGAMLKYEKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPQAIHECNLALEVTPKYSKA 120

Query: 121 LLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKLP 180
           LLKRARCYE L RLDLALRDV AVL++EP NIMALEISERLTKAL M+GSKEDDAEIKLP
Sbjct: 121 LLKRARCYEALHRLDLALRDVNAVLNLEPRNIMALEISERLTKALMMQGSKEDDAEIKLP 180

Query: 181 LDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVE--------EEEKADETV 240
           LDFVELPSS   Q+ PKEKNRK KNNQKTKETIDEKKA ETVE        EEEKADETV
Sbjct: 181 LDFVELPSSLLPQERPKEKNRKMKNNQKTKETIDEKKAHETVEEKADEIIVEEEKADETV 240

Query: 241 EEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPMKTVKLVFGEDIRWAQL 300
            EE+KADEIIV EEK DETVEEKK EDKLVVEEKI RTQEETPMK+VKLVFG+DIRWAQ+
Sbjct: 241 -EEKKADEIIVEEEKADETVEEKKPEDKLVVEEKINRTQEETPMKSVKLVFGDDIRWAQV 300

Query: 301 PIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYI 360
           PIDCTLLQLREVI+DRFPTC AVLIKYRDEEGDLVTIT +EELRLAETSK SQGSVRFYI
Sbjct: 301 PIDCTLLQLREVIQDRFPTCTAVLIKYRDEEGDLVTITADEELRLAETSKESQGSVRFYI 360

Query: 361 FEVNPERDPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMSSCIDDWIIQFAQLFIN 420
           FEVNPE+DPFY RFKND+ +KC+VEENS     +VL+AKE+K+SSCIDDW+I FAQLFIN
Sbjct: 361 FEVNPEQDPFYGRFKNDEDSKCKVEENS-----YVLRAKEMKISSCIDDWLIHFAQLFIN 420

Query: 421 HVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIM 480
           HVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFE AAEKFHEMAALALF WGNVIM
Sbjct: 421 HVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFESAAEKFHEMAALALFYWGNVIM 480

Query: 481 SRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQ 540
           SRARK VYF DG SKV + EQIKAA+DWVEK Y EA  KYQMAV+IKPDFYEGYLALGQQ
Sbjct: 481 SRARKKVYFTDGVSKVSLSEQIKAAYDWVEKAYVEAGNKYQMAVKIKPDFYEGYLALGQQ 540

Query: 541 QFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSS 600
           QFEQAKLSWHYAVSS+VDPKTWPC +V+ LYN+AE+NMETGM+MWEEWE+Q T ELSKSS
Sbjct: 541 QFEQAKLSWHYAVSSNVDPKTWPCTKVIHLYNNAEDNMETGMRMWEEWEKQHTAELSKSS 600

Query: 601 NVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHEC 660
           +V+T LQKMGLDGLIKDIS DEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLP+WHEC
Sbjct: 601 DVETPLQKMGLDGLIKDISADEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPSWHEC 660

Query: 661 LEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAWNEMYDARKLLTGVP 720
           LEVAVEKFELAGASATDIAVMIKNHCSSNN+HEGLGFKI+EIVQAWNEMY+ARKLLTGVP
Sbjct: 661 LEVAVEKFELAGASATDIAVMIKNHCSSNNAHEGLGFKIEEIVQAWNEMYEARKLLTGVP 720

Query: 721 SFRLEPLFRRRVSKIYHVLEQA 734
           SFRLEPLFRRRVSKIYHVLEQA
Sbjct: 721 SFRLEPLFRRRVSKIYHVLEQA 736

BLAST of HG10008107 vs. ExPASy TrEMBL
Match: A0A6J1JJD6 (protein PHOX1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485121 PE=4 SV=1)

HSP 1 Score: 1224.2 bits (3166), Expect = 0.0e+00
Identity = 644/760 (84.74%), Postives = 685/760 (90.13%), Query Frame = 0

Query: 1   MGKQSG-KKKQIGDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEEGNKLFQSRDL 60
           MGKQ+G KKKQIGDK+R+AISK RQ G GS SYDKDHVIFITMSQ LKEEGNKLFQSRD+
Sbjct: 1   MGKQTGKKKKQIGDKYREAISKRRQGGVGSSSYDKDHVIFITMSQALKEEGNKLFQSRDV 60

Query: 61  EGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLALEVTPKYSKA 120
           EGAMLKY+KALKLLPRNHIDVSYLRSNMAACYMQMGLSEYP+AIHECNLALEVTPKYSKA
Sbjct: 61  EGAMLKYEKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPQAIHECNLALEVTPKYSKA 120

Query: 121 LLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGSKEDDAEIKLP 180
           LLKRARCYE L RLDLALRDV AVL++EP NIMALEISERLTKAL M+GSKEDDAEIKLP
Sbjct: 121 LLKRARCYEALHRLDLALRDVNAVLNLEPRNIMALEISERLTKALMMQGSKEDDAEIKLP 180

Query: 181 LDFVELPSSASSQKGPKEKNRKKKNNQKTKETIDEKKADETVE--------EEEKADETV 240
           LDFVELPSS   Q+ PKEKNRK KNNQKTKETIDEKKA ETVE        EEEKADETV
Sbjct: 181 LDFVELPSSLLPQERPKEKNRKMKNNQKTKETIDEKKAHETVEEKADEIIVEEEKADETV 240

Query: 241 E------------------EEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEET 300
           E                  EE+KADEIIV EEK DETVEEKK EDKLVVEEKI RTQEET
Sbjct: 241 EEKKADEIIVEEEKADETVEEKKADEIIVEEEKADETVEEKKPEDKLVVEEKINRTQEET 300

Query: 301 PMKTVKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEE 360
           PMK+VKLVFG+DIRWAQ+PIDCTLLQLREVI+DRFPTC AVLIKYRDEEGDLVTIT +EE
Sbjct: 301 PMKSVKLVFGDDIRWAQVPIDCTLLQLREVIQDRFPTCTAVLIKYRDEEGDLVTITADEE 360

Query: 361 LRLAETSKLSQGSVRFYIFEVNPERDPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIK 420
           LRLAETSK SQGSVRFYIFEVNPE+DPFY RFKND+ +KC+VEENS     +VL+AKE+K
Sbjct: 361 LRLAETSKESQGSVRFYIFEVNPEQDPFYGRFKNDEDSKCKVEENS-----YVLRAKEMK 420

Query: 421 MSSCIDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAE 480
           +SSCIDDW+I FAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFE AAE
Sbjct: 421 ISSCIDDWLIHFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFESAAE 480

Query: 481 KFHEMAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQM 540
           KFHEMAALALF WGNVIMSRARK VYF DG SKV + EQIKAA+DWVEK Y EA  KYQM
Sbjct: 481 KFHEMAALALFYWGNVIMSRARKKVYFTDGVSKVSLSEQIKAAYDWVEKAYVEAGNKYQM 540

Query: 541 AVEIKPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGM 600
           AV+IKPDFYEGYLALGQQQFEQAKLSWHYAVSS+VDPKTWPC +V+ LYN+AE+NMETGM
Sbjct: 541 AVKIKPDFYEGYLALGQQQFEQAKLSWHYAVSSNVDPKTWPCTKVIHLYNNAEDNMETGM 600

Query: 601 KMWEEWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTML 660
           +MWEEWE+Q T ELSKSS+V+T LQKMGLDGLIKDIS DEAAEQAKNMRSHINLLWGTML
Sbjct: 601 RMWEEWEKQHTAELSKSSDVETPLQKMGLDGLIKDISADEAAEQAKNMRSHINLLWGTML 660

Query: 661 YERSILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEI 720
           YERSILEFKMGLP+WHECLEVAVEKFELAGASATDIAVMIKNHCSSNN+HEGLGFKI+EI
Sbjct: 661 YERSILEFKMGLPSWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNAHEGLGFKIEEI 720

Query: 721 VQAWNEMYDARKLLTGVPSFRLEPLFRRRVSKIYHVLEQA 734
           VQAWNEMY+ARKLLTGVPSFRLEPLFRRRVSKIYHVLEQA
Sbjct: 721 VQAWNEMYEARKLLTGVPSFRLEPLFRRRVSKIYHVLEQA 755

BLAST of HG10008107 vs. TAIR 10
Match: AT2G25290.1 (Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 731.5 bits (1887), Expect = 6.7e-211
Identity = 409/754 (54.24%), Postives = 526/754 (69.76%), Query Frame = 0

Query: 1   MGKQSGKKKQI--------------GDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVL 60
           MGK +GKKK                G K  K+  +       + S+D D  IFI  +  L
Sbjct: 1   MGKPTGKKKNNNYTEMPPTESSTTGGGKTGKSFDR-----SATKSFDDDMTIFINRALEL 60

Query: 61  KEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHEC 120
           KEEGNKLFQ RD EGAM +YDKA+KLLPR+H DV+YLR++MA+CYMQMGL EYP AI+EC
Sbjct: 61  KEEGNKLFQKRDYEGAMFRYDKAVKLLPRDHGDVAYLRTSMASCYMQMGLGEYPNAINEC 120

Query: 121 NLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEM 180
           NLALE +P++SKALLKRARCYE L +LD A RD + VL+MEP N+ A EI ER+ K L  
Sbjct: 121 NLALEASPRFSKALLKRARCYEALNKLDFAFRDSRVVLNMEPENVSANEIFERVKKVLVG 180

Query: 181 KGSKEDDAEIKLPLDFVELPSSASSQKGPKEKNRKKKNNQKTKE---TIDEKKADETVEE 240
           KG   D+ E  L    V+   +A  +K  KE+ RKKK    T        E+K+ E V E
Sbjct: 181 KGIDVDEMEKNLV--NVQPVGAARLRKIVKERLRKKKKKSMTMTNGGNDGERKSVEAVVE 240

Query: 241 EEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQE----ETPMKTVK 300
           + K D   E         V   +  + +EEKK EDK+ V +K     E     T  +TVK
Sbjct: 241 DAKVDNGEE---------VDSGRKGKAIEEKKLEDKVAVMDKEVIASEIKEDATVTRTVK 300

Query: 301 LVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAET 360
           LV G+DIRWAQLP+D +++ +R+VI+DRFP     LIKYRD EGDLVTITT +ELRLA +
Sbjct: 301 LVHGDDIRWAQLPLDSSVVLVRDVIKDRFPALKGFLIKYRDSEGDLVTITTTDELRLAAS 360

Query: 361 SKLSQGSVRFYIFEVNPERDPFYKRFKNDKAA-KCEVEENSIFENGHVLKAKEI-KMSSC 420
           ++   GS R YI EV+P ++P Y    ND++  K     +S+ +NG V    E  K S+ 
Sbjct: 361 TREKLGSFRLYIAEVSPNQEPTYDVIDNDESTDKFAKGSSSVADNGSVGDFVESEKASTS 420

Query: 421 IDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHE 480
           ++ WI QFAQLF NHVGF+S  YL+LH+LGMKLY+EA+E+ VT E+AQ LF++AA+KF E
Sbjct: 421 LEHWIFQFAQLFKNHVGFDSDSYLELHNLGMKLYTEAMEDIVTGEDAQELFDIAADKFQE 480

Query: 481 MAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEI 540
           MAALA+FNWGNV MS+AR+ +YF + GS+  +LE+++A F+W + EY +A  KY+ AV+I
Sbjct: 481 MAALAMFNWGNVHMSKARRQIYFPEDGSRETILEKVEAGFEWAKNEYNKAAEKYEGAVKI 540

Query: 541 KPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWE 600
           K DFYE  LALGQQQFEQAKL W++A+S +VD ++    +V++LYN AEE+ME GM++WE
Sbjct: 541 KSDFYEALLALGQQQFEQAKLCWYHALSGEVDIESDASQDVLKLYNKAEESMEKGMQIWE 600

Query: 601 EWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERS 660
           E EE+R   +S     K  LQK+GLDG+  + S +E+AEQ  NM S INLLWG++LYERS
Sbjct: 601 EMEERRLNGISNFDKHKELLQKLGLDGIFSEASDEESAEQTANMSSQINLLWGSLLYERS 660

Query: 661 ILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAW 720
           I+E+K+GLP W ECLEVAVEKFELAGASATDIAVM+KNHCSS+N+ EG+GFKIDEIVQAW
Sbjct: 661 IVEYKLGLPTWDECLEVAVEKFELAGASATDIAVMVKNHCSSDNALEGMGFKIDEIVQAW 720

Query: 721 NEMYDARKLLTGVPSFRLEPLFRRRVSKIYHVLE 732
           NEMYDA++   GVPSFRLEPLFRRR  K++ +LE
Sbjct: 721 NEMYDAKRWQIGVPSFRLEPLFRRRSPKLHDILE 738

BLAST of HG10008107 vs. TAIR 10
Match: AT2G25290.2 (Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 731.5 bits (1887), Expect = 6.7e-211
Identity = 409/754 (54.24%), Postives = 526/754 (69.76%), Query Frame = 0

Query: 1   MGKQSGKKKQI--------------GDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVL 60
           MGK +GKKK                G K  K+  +       + S+D D  IFI  +  L
Sbjct: 1   MGKPTGKKKNNNYTEMPPTESSTTGGGKTGKSFDR-----SATKSFDDDMTIFINRALEL 60

Query: 61  KEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHEC 120
           KEEGNKLFQ RD EGAM +YDKA+KLLPR+H DV+YLR++MA+CYMQMGL EYP AI+EC
Sbjct: 61  KEEGNKLFQKRDYEGAMFRYDKAVKLLPRDHGDVAYLRTSMASCYMQMGLGEYPNAINEC 120

Query: 121 NLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEM 180
           NLALE +P++SKALLKRARCYE L +LD A RD + VL+MEP N+ A EI ER+ K L  
Sbjct: 121 NLALEASPRFSKALLKRARCYEALNKLDFAFRDSRVVLNMEPENVSANEIFERVKKVLVG 180

Query: 181 KGSKEDDAEIKLPLDFVELPSSASSQKGPKEKNRKKKNNQKTKE---TIDEKKADETVEE 240
           KG   D+ E  L    V+   +A  +K  KE+ RKKK    T        E+K+ E V E
Sbjct: 181 KGIDVDEMEKNLV--NVQPVGAARLRKIVKERLRKKKKKSMTMTNGGNDGERKSVEAVVE 240

Query: 241 EEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQE----ETPMKTVK 300
           + K D   E         V   +  + +EEKK EDK+ V +K     E     T  +TVK
Sbjct: 241 DAKVDNGEE---------VDSGRKGKAIEEKKLEDKVAVMDKEVIASEIKEDATVTRTVK 300

Query: 301 LVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAET 360
           LV G+DIRWAQLP+D +++ +R+VI+DRFP     LIKYRD EGDLVTITT +ELRLA +
Sbjct: 301 LVHGDDIRWAQLPLDSSVVLVRDVIKDRFPALKGFLIKYRDSEGDLVTITTTDELRLAAS 360

Query: 361 SKLSQGSVRFYIFEVNPERDPFYKRFKNDKAA-KCEVEENSIFENGHVLKAKEI-KMSSC 420
           ++   GS R YI EV+P ++P Y    ND++  K     +S+ +NG V    E  K S+ 
Sbjct: 361 TREKLGSFRLYIAEVSPNQEPTYDVIDNDESTDKFAKGSSSVADNGSVGDFVESEKASTS 420

Query: 421 IDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHE 480
           ++ WI QFAQLF NHVGF+S  YL+LH+LGMKLY+EA+E+ VT E+AQ LF++AA+KF E
Sbjct: 421 LEHWIFQFAQLFKNHVGFDSDSYLELHNLGMKLYTEAMEDIVTGEDAQELFDIAADKFQE 480

Query: 481 MAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEI 540
           MAALA+FNWGNV MS+AR+ +YF + GS+  +LE+++A F+W + EY +A  KY+ AV+I
Sbjct: 481 MAALAMFNWGNVHMSKARRQIYFPEDGSRETILEKVEAGFEWAKNEYNKAAEKYEGAVKI 540

Query: 541 KPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWE 600
           K DFYE  LALGQQQFEQAKL W++A+S +VD ++    +V++LYN AEE+ME GM++WE
Sbjct: 541 KSDFYEALLALGQQQFEQAKLCWYHALSGEVDIESDASQDVLKLYNKAEESMEKGMQIWE 600

Query: 601 EWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERS 660
           E EE+R   +S     K  LQK+GLDG+  + S +E+AEQ  NM S INLLWG++LYERS
Sbjct: 601 EMEERRLNGISNFDKHKELLQKLGLDGIFSEASDEESAEQTANMSSQINLLWGSLLYERS 660

Query: 661 ILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAW 720
           I+E+K+GLP W ECLEVAVEKFELAGASATDIAVM+KNHCSS+N+ EG+GFKIDEIVQAW
Sbjct: 661 IVEYKLGLPTWDECLEVAVEKFELAGASATDIAVMVKNHCSSDNALEGMGFKIDEIVQAW 720

Query: 721 NEMYDARKLLTGVPSFRLEPLFRRRVSKIYHVLE 732
           NEMYDA++   GVPSFRLEPLFRRR  K++ +LE
Sbjct: 721 NEMYDAKRWQIGVPSFRLEPLFRRRSPKLHDILE 738

BLAST of HG10008107 vs. TAIR 10
Match: AT2G25290.3 (Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 731.5 bits (1887), Expect = 6.7e-211
Identity = 409/754 (54.24%), Postives = 526/754 (69.76%), Query Frame = 0

Query: 1   MGKQSGKKKQI--------------GDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVL 60
           MGK +GKKK                G K  K+  +       + S+D D  IFI  +  L
Sbjct: 1   MGKPTGKKKNNNYTEMPPTESSTTGGGKTGKSFDR-----SATKSFDDDMTIFINRALEL 60

Query: 61  KEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHEC 120
           KEEGNKLFQ RD EGAM +YDKA+KLLPR+H DV+YLR++MA+CYMQMGL EYP AI+EC
Sbjct: 61  KEEGNKLFQKRDYEGAMFRYDKAVKLLPRDHGDVAYLRTSMASCYMQMGLGEYPNAINEC 120

Query: 121 NLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEM 180
           NLALE +P++SKALLKRARCYE L +LD A RD + VL+MEP N+ A EI ER+ K L  
Sbjct: 121 NLALEASPRFSKALLKRARCYEALNKLDFAFRDSRVVLNMEPENVSANEIFERVKKVLVG 180

Query: 181 KGSKEDDAEIKLPLDFVELPSSASSQKGPKEKNRKKKNNQKTKE---TIDEKKADETVEE 240
           KG   D+ E  L    V+   +A  +K  KE+ RKKK    T        E+K+ E V E
Sbjct: 181 KGIDVDEMEKNLV--NVQPVGAARLRKIVKERLRKKKKKSMTMTNGGNDGERKSVEAVVE 240

Query: 241 EEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQE----ETPMKTVK 300
           + K D   E         V   +  + +EEKK EDK+ V +K     E     T  +TVK
Sbjct: 241 DAKVDNGEE---------VDSGRKGKAIEEKKLEDKVAVMDKEVIASEIKEDATVTRTVK 300

Query: 301 LVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELRLAET 360
           LV G+DIRWAQLP+D +++ +R+VI+DRFP     LIKYRD EGDLVTITT +ELRLA +
Sbjct: 301 LVHGDDIRWAQLPLDSSVVLVRDVIKDRFPALKGFLIKYRDSEGDLVTITTTDELRLAAS 360

Query: 361 SKLSQGSVRFYIFEVNPERDPFYKRFKNDKAA-KCEVEENSIFENGHVLKAKEI-KMSSC 420
           ++   GS R YI EV+P ++P Y    ND++  K     +S+ +NG V    E  K S+ 
Sbjct: 361 TREKLGSFRLYIAEVSPNQEPTYDVIDNDESTDKFAKGSSSVADNGSVGDFVESEKASTS 420

Query: 421 IDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEKFHE 480
           ++ WI QFAQLF NHVGF+S  YL+LH+LGMKLY+EA+E+ VT E+AQ LF++AA+KF E
Sbjct: 421 LEHWIFQFAQLFKNHVGFDSDSYLELHNLGMKLYTEAMEDIVTGEDAQELFDIAADKFQE 480

Query: 481 MAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMAVEI 540
           MAALA+FNWGNV MS+AR+ +YF + GS+  +LE+++A F+W + EY +A  KY+ AV+I
Sbjct: 481 MAALAMFNWGNVHMSKARRQIYFPEDGSRETILEKVEAGFEWAKNEYNKAAEKYEGAVKI 540

Query: 541 KPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMKMWE 600
           K DFYE  LALGQQQFEQAKL W++A+S +VD ++    +V++LYN AEE+ME GM++WE
Sbjct: 541 KSDFYEALLALGQQQFEQAKLCWYHALSGEVDIESDASQDVLKLYNKAEESMEKGMQIWE 600

Query: 601 EWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLYERS 660
           E EE+R   +S     K  LQK+GLDG+  + S +E+AEQ  NM S INLLWG++LYERS
Sbjct: 601 EMEERRLNGISNFDKHKELLQKLGLDGIFSEASDEESAEQTANMSSQINLLWGSLLYERS 660

Query: 661 ILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIVQAW 720
           I+E+K+GLP W ECLEVAVEKFELAGASATDIAVM+KNHCSS+N+ EG+GFKIDEIVQAW
Sbjct: 661 IVEYKLGLPTWDECLEVAVEKFELAGASATDIAVMVKNHCSSDNALEGMGFKIDEIVQAW 720

Query: 721 NEMYDARKLLTGVPSFRLEPLFRRRVSKIYHVLE 732
           NEMYDA++   GVPSFRLEPLFRRR  K++ +LE
Sbjct: 721 NEMYDAKRWQIGVPSFRLEPLFRRRSPKLHDILE 738

BLAST of HG10008107 vs. TAIR 10
Match: AT4G32070.1 (Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 684.1 bits (1764), Expect = 1.2e-196
Identity = 400/811 (49.32%), Postives = 531/811 (65.47%), Query Frame = 0

Query: 1   MGKQSGKKKQI-----------GDKFRKAISKHRQSGDGSPSYDKDHVIFITMSQVLKEE 60
           MGK + KKK             G   +   + HR +   S  +D+D  IFI+ +  LKEE
Sbjct: 1   MGKPTAKKKNPETPKDASGGGGGGGGKSGKTYHRST---SRVFDEDMEIFISRALELKEE 60

Query: 61  GNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEYPRAIHECNLA 120
           GNKLFQ RD EGAML +DKALKLLP++HIDV+YLR++MA+CYMQMGL EYP AI ECNLA
Sbjct: 61  GNKLFQKRDHEGAMLSFDKALKLLPKDHIDVAYLRTSMASCYMQMGLGEYPNAISECNLA 120

Query: 121 LEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISERLTKALEMKGS 180
           LE +P+YSKAL++R+RCYE L +LD A RD + VL+MEP N+ A EI +R+ K L  KG 
Sbjct: 121 LEASPRYSKALVRRSRCYEALNKLDYAFRDARIVLNMEPGNVSANEIFDRVKKVLVDKGI 180

Query: 181 KEDDAEIKLPLDFVELP--SSASSQKGPKEKNRKKKNNQKTKETIDEKKADETV------ 240
             D+ E     DFV++    +A  +K  KE+ RK K  +K+    +E K+ + V      
Sbjct: 181 DVDEME----KDFVDVQPVCAARLKKIVKERLRKSKKKKKSGGKDEELKSPKVVVVDKGD 240

Query: 241 -------EEEEKADET---------VEEEEKADEIIVGEEKV---DETVEEKKAEDKLVV 300
                   +EEK+D++          EE++ + +   G++K    ++  EE+K EDK+VV
Sbjct: 241 EAEGRNKPKEEKSDKSDIDGKIGGKREEKKTSFKSDKGQKKKSGGNKAGEERKVEDKVVV 300

Query: 301 EEKI-----------TRTQEETPMKTVKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTC 360
            +K            ++ +  T  +T+KLV G+DIRWAQLP+D T+  +R+VIRDRFP  
Sbjct: 301 MDKEVIASEIVDGGGSKKEGATVTRTIKLVHGDDIRWAQLPLDSTVRLVRDVIRDRFPAL 360

Query: 361 MAVLIKYRDEEGDLVTITTNEELRLAETSKLSQGSVRFYIFEVNPERDPFYKRFKNDKAA 420
              LIKYRD EGDLVTITT +ELRLA ++    GS+R YI EVNP+++P Y    N ++ 
Sbjct: 361 RGFLIKYRDTEGDLVTITTTDELRLAASTHDKLGSLRLYIAEVNPDQEPTYDGMSNTEST 420

Query: 421 -KCEVEENSIFENGHVLK-AKEIKMSSCIDDWIIQFAQLFINHVGFESGPYLDLHDLGMK 480
            K     +S+ +NG V +     K S C ++WI QFAQLF NHVGF+S  Y+DLHDLGMK
Sbjct: 421 DKVSKRLSSLADNGSVGEYVGSDKASGCFENWIFQFAQLFKNHVGFDSDSYVDLHDLGMK 480

Query: 481 LYSEAVEETVTSEEAQGLFELAAEKFHEMAALALFNWGNVIMSRARKTVYFADGGSKVRV 540
           LY+EA+E+ VT E+AQ LF++AA+KF EM ALAL NWGNV MS+ARK V   +  S+  +
Sbjct: 481 LYTEAMEDAVTGEDAQELFQIAADKFQEMGALALLNWGNVHMSKARKQVCIPEDASREAI 540

Query: 541 LEQIKAAFDWVEKEYAEAERKYQMAVEIKPDFYEGYLALGQQQFEQAKLSWHYAVSSDVD 600
           +E ++AAF W + EY +A  KY+ A+++KPDFYE  LALGQ+QFE AKL W++A+ S VD
Sbjct: 541 IEAVEAAFVWTQNEYNKAAEKYEEAIKVKPDFYEALLALGQEQFEHAKLCWYHALKSKVD 600

Query: 601 PKTWPCAEVMQLYNSAEENMETGMKMWEEWEEQRTGELSKSSNVKTQLQKMGLDGLIKDI 660
            ++    EV++LYN AE++ME GM++WEE EE R   +SK    K  L+K+ LD L  + 
Sbjct: 601 LESEASQEVLKLYNKAEDSMERGMQIWEEMEECRLNGISKLDKHKNMLRKLELDELFSEA 660

Query: 661 SVDEAAEQAKNMRSHINLLWGTMLYERSILEFKMGLPAWHECLEVAVEKFELAGASATDI 720
           S +E  EQ  NM S INLLWG++LYERSI+E+K+GLP W ECLEVAVEKFELAGASATDI
Sbjct: 661 SEEETVEQTANMSSQINLLWGSLLYERSIVEYKLGLPTWDECLEVAVEKFELAGASATDI 720

Query: 721 AVMIKNHCSSNNSHE-----------------------------GLGFKIDEIVQAWNEM 732
           AVM+KNHCSS ++ E                             G+GFKIDEIVQAWNEM
Sbjct: 721 AVMVKNHCSSESALEGNQFLARIPNSGQVTTQWFSVYNNLRTNAGMGFKIDEIVQAWNEM 780

BLAST of HG10008107 vs. TAIR 10
Match: AT5G20360.1 (Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide repeat (TPR)-containing protein )

HSP 1 Score: 533.5 bits (1373), Expect = 2.7e-151
Identity = 305/695 (43.88%), Postives = 457/695 (65.76%), Query Frame = 0

Query: 40  ITMSQVLKEEGNKLFQSRDLEGAMLKYDKALKLLPRNHIDVSYLRSNMAACYMQMGLSEY 99
           ++ +Q LKEEGNKLFQ RD +GAM KY +A+K+LP++H++VS++R+N+A+CYMQ+   E+
Sbjct: 123 VSKAQGLKEEGNKLFQKRDYDGAMFKYGEAIKILPKDHVEVSHVRANVASCYMQLEPGEF 182

Query: 100 PRAIHECNLALEVTPKYSKALLKRARCYEGLRRLDLALRDVKAVLSMEPNNIMALEISER 159
            +AIHEC+LAL VTP ++KALLKRARCYE L +LDLALRDV  V  ++P N MA EI E+
Sbjct: 183 AKAIHECDLALSVTPDHNKALLKRARCYEALNKLDLALRDVCMVSKLDPKNPMASEIVEK 242

Query: 160 LTKALEMKGSKEDDAEIKLPLDFVE----LPSSASSQKGPKEKNRKKKNNQKTKETIDEK 219
           L + LE KG + +++ I+LP D+VE     P++  ++ G     + KK+NQ  +++  E 
Sbjct: 243 LKRTLESKGLRINNSVIELPPDYVEPVGASPAALWAKLGKVRVKKTKKSNQVEEKSEGE- 302

Query: 220 KADETVEEEEKADETVEEEEKADEIIVGEEKVDETVEEKKAEDKLVVEEKITRTQEETPM 279
              E VE E+K +   E+ ++  ++ V  ++ D+  +  K ++K+++EE++     E   
Sbjct: 303 --GEDVEPEKKNNVLAEKGKEKIKMKVKGKQSDKRSDTSKEQEKVIIEEELLVIGVEDVN 362

Query: 280 KTVKLVFGEDIRWAQLPIDCTLLQLREVIRDRFPTCMAVLIKYRDEEGDLVTITTNEELR 339
           K VK V+ +DIR A+LPI+CTL +LREV+ +RFP+  AV IKYRD+EGDLVTITT+EELR
Sbjct: 363 KDVKFVYSDDIRLAELPINCTLFKLREVVHERFPSLRAVHIKYRDQEGDLVTITTDEELR 422

Query: 340 LAETSKLSQGSVRFYIFEVNPERDPFYKRFKNDKAAKCEVEENSIFENGHVLKAKEIKMS 399
           ++E S  SQG++RFY+ EV+PE+DPF+ R    K  K   +           KAK     
Sbjct: 423 MSEVSSRSQGTMRFYVVEVSPEQDPFFGRLVEMKKLKITADS---------FKAKVNGRG 482

Query: 400 SC-IDDWIIQFAQLFINHVGFESGPYLDLHDLGMKLYSEAVEETVTSEEAQGLFELAAEK 459
            C ++DW+I+FA LF      +S   L+L +LGMKL SEA+EE VTS+ AQG F+ AA++
Sbjct: 483 GCKVEDWMIEFAHLFKIQARIDSDRCLNLQELGMKLNSEAMEEVVTSDAAQGPFDRAAQQ 542

Query: 460 FHEMAALALFNWGNVIMSRARKTVYFADGGSKVRVLEQIKAAFDWVEKEYAEAERKYQMA 519
           F E+AA +L N G V MS ARK +    G S   V EQ+K A++  +KE+A A+ KY+ A
Sbjct: 543 FQEVAARSLLNLGYVHMSGARKRLSLLQGVSGESVSEQVKTAYECAKKEHANAKEKYEEA 602

Query: 520 VEIKPDFYEGYLALGQQQFEQAKLSWHYAVSSDVDPKTWPCAEVMQLYNSAEENMETGMK 579
           ++IKP+ +E +LALG QQFE+A+LSW+Y + S +D KTWP A+V+Q Y SAE N++  M+
Sbjct: 603 MKIKPECFEVFLALGLQQFEEARLSWYYVLVSHLDLKTWPYADVVQFYQSAESNIKKSME 662

Query: 580 MWEEWEEQRTGELSKSSNVKTQLQKMGLDGLIKDISVDEAAEQAKNMRSHINLLWGTMLY 639
           + E  E  +  E S++        +  L    +    +  A++A  ++S I++L   +LY
Sbjct: 663 VLENLETGKESEPSQAGKTDCLTHEKDLGSSTQ----NNPAKEAGRLKSWIDILLCAVLY 722

Query: 640 ERSILEFKMGLPAWHECLEVAVEKFELAGASATDIAVMIKNHCSSNNSHEGLGFKIDEIV 699
           ERSI+E+K+  P W E LE A+EKFELAG    D+  +I     + N+   + F ++EI+
Sbjct: 723 ERSIMEYKLDQPFWRESLEAAMEKFELAGTCKDDVVEIISEDYVAGNTLRDIRFHMEEII 782

Query: 700 QAWNEMYDARKLLTGVPSFRLEPLFRRRVSKIYHV 730
           Q ++E+Y+A+    G+PS +LE + +RR   I+HV
Sbjct: 783 QIFDEIYEAKHWTNGIPSDQLEEILKRRAENIFHV 801

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038879044.10.0e+0092.45protein PHOX1-like isoform X1 [Benincasa hispida][more]
XP_038879046.10.0e+0093.05protein PHOX1-like isoform X3 [Benincasa hispida][more]
XP_038879045.10.0e+0092.92protein PHOX1-like isoform X2 [Benincasa hispida][more]
KAA0050062.10.0e+0091.16Protein unc-45-A-like protein [Cucumis melo var. makuwa] >TYK10348.1 Protein unc... [more]
XP_004146713.10.0e+0090.86protein PHOX1 [Cucumis sativus] >XP_011655705.1 protein PHOX1 [Cucumis sativus] ... [more]
Match NameE-valueIdentityDescription
F4IRM49.5e-21054.24Protein PHOX1 OS=Arabidopsis thaliana OX=3702 GN=PHOX1 PE=1 SV=1[more]
F4JTI14.0e-20051.15Protein PHOX4 OS=Arabidopsis thaliana OX=3702 GN=PHOX4 PE=2 SV=1[more]
K7TQE33.5e-16445.52HSP-interacting protein OS=Zea mays OX=4577 GN=HIP PE=1 SV=1[more]
F4K4873.8e-15043.88Protein PHOX3 OS=Arabidopsis thaliana OX=3702 GN=PHOX3 PE=1 SV=1[more]
O488021.1e-11738.71Protein CLMP1 OS=Arabidopsis thaliana OX=3702 GN=CLMP1 PE=1 SV=1[more]
Match NameE-valueIdentityDescription
A0A5A7U2K20.0e+0091.16Protein unc-45-A-like protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_sc... [more]
A0A0A0LYY00.0e+0090.86Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_1G263990 PE=4 SV=1[more]
A0A1S3B9Z60.0e+0090.07LOW QUALITY PROTEIN: uncharacterized protein LOC103487400 OS=Cucumis melo OX=365... [more]
A0A6J1JHE20.0e+0086.66protein PHOX1-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111485121 PE=4 S... [more]
A0A6J1JJD60.0e+0084.74protein PHOX1-like isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111485121 PE=4 S... [more]
Match NameE-valueIdentityDescription
AT2G25290.16.7e-21154.24Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide r... [more]
AT2G25290.26.7e-21154.24Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide r... [more]
AT2G25290.36.7e-21154.24Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide r... [more]
AT4G32070.11.2e-19649.32Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide r... [more]
AT5G20360.12.7e-15143.88Octicosapeptide/Phox/Bem1p (PB1) domain-containing protein / tetratricopeptide r... [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 198..237
NoneNo IPR availableCOILSCoilCoilcoord: 490..510
NoneNo IPR availableGENE3D3.10.20.90coord: 275..357
e-value: 3.5E-9
score: 38.1
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 223..237
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 185..237
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..31
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 204..222
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..30
NoneNo IPR availablePANTHERPTHR46183:SF16PROTEIN PHOX3coord: 1..732
NoneNo IPR availableCDDcd05992PB1coord: 277..352
e-value: 6.74457E-11
score: 56.9025
NoneNo IPR availableSUPERFAMILY54277CAD & PB1 domainscoord: 271..364
IPR000270PB1 domainSMARTSM00666PB1_newcoord: 275..354
e-value: 6.2E-15
score: 65.6
IPR000270PB1 domainPFAMPF00564PB1coord: 277..352
e-value: 1.3E-14
score: 53.9
IPR000270PB1 domainPROSITEPS51745PB1coord: 275..354
score: 13.904573
IPR019734Tetratricopeptide repeatSMARTSM00028tpr_5coord: 117..150
e-value: 0.011
score: 24.9
coord: 488..521
e-value: 420.0
score: 0.7
coord: 81..116
e-value: 32.0
score: 10.5
coord: 43..76
e-value: 1.1
score: 18.3
IPR019734Tetratricopeptide repeatPROSITEPS50005TPRcoord: 43..76
score: 9.1159
IPR019734Tetratricopeptide repeatPROSITEPS50005TPRcoord: 117..150
score: 8.1129
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 423..584
e-value: 3.8E-5
score: 25.4
IPR011990Tetratricopeptide-like helical domain superfamilyGENE3D1.25.40.10Tetratricopeptide repeat domaincoord: 41..171
e-value: 5.8E-29
score: 102.4
IPR011990Tetratricopeptide-like helical domain superfamilySUPERFAMILY48452TPR-likecoord: 44..538
IPR044517Protein PHOX1-4PANTHERPTHR46183PROTEIN CLMP1coord: 1..732

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10008107.1HG10008107.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
molecular_function GO:0005515 protein binding
molecular_function GO:0016740 transferase activity