Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGAGAATTCTGATAAACTCATTTCATCATCAACTACAAGCAATCCCTCCCTGTCCTCCAAATCCGGTGAGTCTTTTTCCCTAATTTCTACGAAAACCACTGAAAAATCAATAAATCTTTCTTTCCATTTTCCTTTCGCTTTCTTTTCTATGCTTAGATTTAAACCAGTTTCCATTTCTAATTTAAGGAATTTGTTTATATTTCTTTCTTGTTAGATCAAAACGATCAAAATTATCCTTCATCCATCGCCAATTTGAATCTGAGAAAACTAGGGGAAACGGAAAAGTTGGTTAAGAAGAAGGTTTTAACTGAACGAAATGAGGCTATCGATCCGAAATTTACTGAGAATTCGTTGTCGGAAATCCCAAACGTCGATCCGAAATCCTCTTCTTGTCAAGTTGATTCTGTGTTTGAGACCACATTGCCCTATGATCCATTGACAAATTATCTCTCTCCTCGGCCTAGATTTTTACGATACAAACCAAATAAACGACGGGAGATCTTTTTGAGGACCGTTGGCGAGGATTCTCTTTCGGTTTCTCACACTTCATCCTCTGAAGAAGAAGAAACTAAAATGAAAGGAGAGGAGGAGGAGGAGCTTGAGGTGGAAAGTGAGGGGAAATCTAATGAAATTGATGATGAAGGTGAGGGAGATAAGGAGGAGATCAGAGGTTGGACTGTGAAAGAGTTGTTAAAGTTTCTGCTTCTAGTTGCGAGTTTGATTTCGTCTACCTTGTATATCACTTCCATGAACACGCCTTCACCTTCATTTGAAGTTTCAAGAGCCTTCAATTCTGGTTCTTTCCCAATTTTGAATCACACAAGTGAGTTTGAGTCGAGCCCAGTGGTGGAATCCATTTTTGCAAATCGAAGTAATCTCTGGGATGAAGAAGTAAGTGAGGCTGCTTCAATGAGGAATTTGGAAGGTGTAAGCCAATTGAACAATCAAGAAGATGCAGAAGATAGAGGTTTTATGGAAGAAAACGAGATATTGAAAGGTGAAAATGAAGGTGGCAAGGGTGGATATGGAGATTTGGTAAGAGTAGAATATACTGAACTGGTTGAAGATGCAAGAGAAAAACTACTAGCTGGGGAAAGTATCATTGAGGAGATGGATGATGGTGAAAAGAATGGAGTTGAATTGCTGAACTTTGGAGATACTGGTGATCAGGAAAAAAGAGAAGAATCCGAAATCTCTAAGACAACGTCTGTCCCTTGTGAAACATCAGAAGAAGATGAAATTACTGAAGTTCCTAATGTCAACGGATTTGATGAGGACAAGTTATTATCTAACATTTTAACTGCGGCTGAAAATGAGTACACTCCTCAAATGGAGGTTGTTGAAAACGAAGAAGGGGGAGATTTGCAAATGATTGAAAGCAACACATGGAAATCTGAGAGTTTTGTGCTTGAGGCGGATAAAATTACTGAATCTTCAAACTTCAATGGATTTGATGAGGACAAGTTGTTATCTAACATTTTAACTGCTGCTGAAAATGAGTACACTCCTCAAATGGAGGTAGTTGAAAAGGAAGAAGGGGGAGATTTGGAAATGATTGAAAGCAACACAGGGGAATCTGAGAGTTTTGTACTCGAGGCGGATAAAATCACCATTTTGGAGGGGATAACCAACAGCTTATCCAGTTTTGTTGAAGATTTGGAGAAACTGAAGTCTGAGCTTGTTGGGCTTATGCACGCTGAATCTGAGTCTGTGCTTAAGGCTGTACTTGGACTTTCAGTATCATCTGCAATGCTTACTTGTTTGATCTTGTCTTTCCAACAAAAGAAAAAGAAAGATGATACAAAAGTACCAGCCATTTCTGTGAGTGTAGAACCGTTGCTGCAGGATCCAGTTGCAAAAACTGAGAAAGTTATTACGAGGGAATCCCCTGTAATTAAGGCTACTTGTGATGTTAATGGATCAAATAATAGGCTTATCAGGAATGTTGATGCTTTCAAAACGCTCTCAGCTTCTATCCATTCAAGAGATGAAGGGAAA
AAATTCAAAGAAATGTACCACAATGAAGCTCCATCAGTTCAATTTCTTGGTGAGTTCGTAGTTGGAGAGATCAGCAACTCTCTTAAGACCAAAAGTGGAATGAAGAACTGGACGGTTGAGGTAGAAGATAGCAATTTTCCTGGTTCAGTTGAAGAGAAACCAGTGAGCAAGAATATGAATTCTGGACCCGAGCAAGCTTTGTCAGAGTTCTCTGCCACGACTTCTTCCCCATCCTACGGTAGCTTTACCACTAAGAAGAAGATTGTTAAGAAAGAGGTGAGGATAATTTAAACTGTTATATTTTTTTTCCCCTTCTTTGACGACGTGATGAACAGTTTGTCTAAGGTTGCTGATTTTCATCTGTAGGTGGGAGGACATGGTGAGGTAAAGTCGATCCCAACTCCAGTGAGAAGATCAACCAGAATTCGAAACCGTATGATGTCGCCATGA
mRNA sequence
ATGGAGAATTCTGATAAACTCATTTCATCATCAACTACAAGCAATCCCTCCCTGTCCTCCAAATCCGATCAAAACGATCAAAATTATCCTTCATCCATCGCCAATTTGAATCTGAGAAAACTAGGGGAAACGGAAAAGTTGGTTAAGAAGAAGGTTTTAACTGAACGAAATGAGGCTATCGATCCGAAATTTACTGAGAATTCGTTGTCGGAAATCCCAAACGTCGATCCGAAATCCTCTTCTTGTCAAGTTGATTCTGTGTTTGAGACCACATTGCCCTATGATCCATTGACAAATTATCTCTCTCCTCGGCCTAGATTTTTACGATACAAACCAAATAAACGACGGGAGATCTTTTTGAGGACCGTTGGCGAGGATTCTCTTTCGGTTTCTCACACTTCATCCTCTGAAGAAGAAGAAACTAAAATGAAAGGAGAGGAGGAGGAGGAGCTTGAGGTGGAAAGTGAGGGGAAATCTAATGAAATTGATGATGAAGGTGAGGGAGATAAGGAGGAGATCAGAGGTTGGACTGTGAAAGAGTTGTTAAAGTTTCTGCTTCTAGTTGCGAGTTTGATTTCGTCTACCTTGTATATCACTTCCATGAACACGCCTTCACCTTCATTTGAAGTTTCAAGAGCCTTCAATTCTGGTTCTTTCCCAATTTTGAATCACACAAGTGAGTTTGAGTCGAGCCCAGTGGTGGAATCCATTTTTGCAAATCGAAGTAATCTCTGGGATGAAGAAGTAAGTGAGGCTGCTTCAATGAGGAATTTGGAAGGTGTAAGCCAATTGAACAATCAAGAAGATGCAGAAGATAGAGGTTTTATGGAAGAAAACGAGATATTGAAAGGTGAAAATGAAGGTGGCAAGGGTGGATATGGAGATTTGGTAAGAGTAGAATATACTGAACTGGTTGAAGATGCAAGAGAAAAACTACTAGCTGGGGAAAGTATCATTGAGGAGATGGATGATGGTGAAAAGAATGGAGTTGAATTGCTGAACTTTGGAGATACTGGTGATCAGGAAAAAAGAGAAGAATCCGAAATCTCTAAGACAACGTCTGTCCCTTGTGAAACATCAGAAGAAGATGAAATTACTGAAGTTCCTAATGTCAACGGATTTGATGAGGACAAGTTATTATCTAACATTTTAACTGCGGCTGAAAATGAGTACACTCCTCAAATGGAGGTTGTTGAAAACGAAGAAGGGGGAGATTTGCAAATGATTGAAAGCAACACATGGAAATCTGAGAGTTTTGTGCTTGAGGCGGATAAAATTACTGAATCTTCAAACTTCAATGGATTTGATGAGGACAAGTTGTTATCTAACATTTTAACTGCTGCTGAAAATGAGTACACTCCTCAAATGGAGGTAGTTGAAAAGGAAGAAGGGGGAGATTTGGAAATGATTGAAAGCAACACAGGGGAATCTGAGAGTTTTGTACTCGAGGCGGATAAAATCACCATTTTGGAGGGGATAACCAACAGCTTATCCAGTTTTGTTGAAGATTTGGAGAAACTGAAGTCTGAGCTTGTTGGGCTTATGCACGCTGAATCTGAGTCTGTGCTTAAGGCTGTACTTGGACTTTCAGTATCATCTGCAATGCTTACTTGTTTGATCTTGTCTTTCCAACAAAAGAAAAAGAAAGATGATACAAAAGTACCAGCCATTTCTGTGAGTGTAGAACCGTTGCTGCAGGATCCAGTTGCAAAAACTGAGAAAGTTATTACGAGGGAATCCCCTGTAATTAAGGCTACTTGTGATGTTAATGGATCAAATAATAGGCTTATCAGGAATGTTGATGCTTTCAAAACGCTCTCAGCTTCTATCCATTCAAGAGATGAAGGGAAAAAATTCAAAGAAATGTACCACAATGAAGCTCCATCAGTTCAATTTCTTGGTGAGTTCGTAGTTGGAGAGATCAGCAACTCTCTTAAGACCAAAAGTGGAATGAAGAACTGGACGGTTGAGGTAGAAGATAGCAATTTTCCTGGTTCAGT
TGAAGAGAAACCAGTGAGCAAGAATATGAATTCTGGACCCGAGCAAGCTTTGTCAGAGTTCTCTGCCACGACTTCTTCCCCATCCTACGGTAGCTTTACCACTAAGAAGAAGATTGTTAAGAAAGAGGTGGGAGGACATGGTGAGGTAAAGTCGATCCCAACTCCAGTGAGAAGATCAACCAGAATTCGAAACCGTATGATGTCGCCATGA
Coding sequence (CDS)
ATGGAGAATTCTGATAAACTCATTTCATCATCAACTACAAGCAATCCCTCCCTGTCCTCCAAATCCGATCAAAACGATCAAAATTATCCTTCATCCATCGCCAATTTGAATCTGAGAAAACTAGGGGAAACGGAAAAGTTGGTTAAGAAGAAGGTTTTAACTGAACGAAATGAGGCTATCGATCCGAAATTTACTGAGAATTCGTTGTCGGAAATCCCAAACGTCGATCCGAAATCCTCTTCTTGTCAAGTTGATTCTGTGTTTGAGACCACATTGCCCTATGATCCATTGACAAATTATCTCTCTCCTCGGCCTAGATTTTTACGATACAAACCAAATAAACGACGGGAGATCTTTTTGAGGACCGTTGGCGAGGATTCTCTTTCGGTTTCTCACACTTCATCCTCTGAAGAAGAAGAAACTAAAATGAAAGGAGAGGAGGAGGAGGAGCTTGAGGTGGAAAGTGAGGGGAAATCTAATGAAATTGATGATGAAGGTGAGGGAGATAAGGAGGAGATCAGAGGTTGGACTGTGAAAGAGTTGTTAAAGTTTCTGCTTCTAGTTGCGAGTTTGATTTCGTCTACCTTGTATATCACTTCCATGAACACGCCTTCACCTTCATTTGAAGTTTCAAGAGCCTTCAATTCTGGTTCTTTCCCAATTTTGAATCACACAAGTGAGTTTGAGTCGAGCCCAGTGGTGGAATCCATTTTTGCAAATCGAAGTAATCTCTGGGATGAAGAAGTAAGTGAGGCTGCTTCAATGAGGAATTTGGAAGGTGTAAGCCAATTGAACAATCAAGAAGATGCAGAAGATAGAGGTTTTATGGAAGAAAACGAGATATTGAAAGGTGAAAATGAAGGTGGCAAGGGTGGATATGGAGATTTGGTAAGAGTAGAATATACTGAACTGGTTGAAGATGCAAGAGAAAAACTACTAGCTGGGGAAAGTATCATTGAGGAGATGGATGATGGTGAAAAGAATGGAGTTGAATTGCTGAACTTTGGAGATACTGGTGATCAGGAAAAAAGAGAAGAATCCGAAATCTCTAAGACAACGTCTGTCCCTTGTGAAACATCAGAAGAAGATGAAATTACTGAAGTTCCTAATGTCAACGGATTTGATGAGGACAAGTTATTATCTAACATTTTAACTGCGGCTGAAAATGAGTACACTCCTCAAATGGAGGTTGTTGAAAACGAAGAAGGGGGAGATTTGCAAATGATTGAAAGCAACACATGGAAATCTGAGAGTTTTGTGCTTGAGGCGGATAAAATTACTGAATCTTCAAACTTCAATGGATTTGATGAGGACAAGTTGTTATCTAACATTTTAACTGCTGCTGAAAATGAGTACACTCCTCAAATGGAGGTAGTTGAAAAGGAAGAAGGGGGAGATTTGGAAATGATTGAAAGCAACACAGGGGAATCTGAGAGTTTTGTACTCGAGGCGGATAAAATCACCATTTTGGAGGGGATAACCAACAGCTTATCCAGTTTTGTTGAAGATTTGGAGAAACTGAAGTCTGAGCTTGTTGGGCTTATGCACGCTGAATCTGAGTCTGTGCTTAAGGCTGTACTTGGACTTTCAGTATCATCTGCAATGCTTACTTGTTTGATCTTGTCTTTCCAACAAAAGAAAAAGAAAGATGATACAAAAGTACCAGCCATTTCTGTGAGTGTAGAACCGTTGCTGCAGGATCCAGTTGCAAAAACTGAGAAAGTTATTACGAGGGAATCCCCTGTAATTAAGGCTACTTGTGATGTTAATGGATCAAATAATAGGCTTATCAGGAATGTTGATGCTTTCAAAACGCTCTCAGCTTCTATCCATTCAAGAGATGAAGGGAAAAAATTCAAAGAAATGTACCACAATGAAGCTCCATCAGTTCAATTTCTTGGTGAGTTCGTAGTTGGAGAGATCAGCAACTCTCTTAAGACCAAAAGTGGAATGAAGAACTGGACGGTTGAGGTAGAAGATAGCAATTTTCCTGGTTCAGT
TGAAGAGAAACCAGTGAGCAAGAATATGAATTCTGGACCCGAGCAAGCTTTGTCAGAGTTCTCTGCCACGACTTCTTCCCCATCCTACGGTAGCTTTACCACTAAGAAGAAGATTGTTAAGAAAGAGGTGGGAGGACATGGTGAGGTAAAGTCGATCCCAACTCCAGTGAGAAGATCAACCAGAATTCGAAACCGTATGATGTCGCCATGA
Protein sequence
MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAIDPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFLRTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDKEEIRGWTVKELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFANRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRVEYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCETSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSESFVLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESESFVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTCLILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIRNVDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEVEDSNFPGSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGHGEVKSIPTPVRRSTRIRNRMMSP
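The protein sequence above should follow from the coding sequence by ordinary codon-by-codon translation. The sketch below is one way to check that, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate_cds` are illustrative and not part of this database. Whitespace is stripped first because the CDS is wrapped across two lines in this listing.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into a protein string, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) are reported as X.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First six codons of the CDS listed above.
print(translate_cds("ATGGAGAATTCTGATAAA"))  # MENSDK
```

Passing the full CDS to `translate_cds` and comparing the result with the listed protein sequence is a quick consistency check between the two records.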
Homology
BLAST of HG10002297 vs. NCBI nr
Match:
XP_038877902.1 (uncharacterized protein LOC120070117 isoform X1 [Benincasa hispida])
HSP 1 Score: 959.5 bits (2479), Expect = 1.6e-275
Identity = 556/737 (75.44%), Positives = 591/737 (80.19%), Query Frame = 0
Query: 1 MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAI 60
MENSD++ISSSTTSNPSL SKSD+NDQNYP S+ANL+ KLGE EK+VKKKVLTERNEA+
Sbjct: 1 MENSDQVISSSTTSNPSLFSKSDENDQNYPPSVANLDRGKLGEMEKVVKKKVLTERNEAM 60
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFL 120
DPKFTENS SEIP VDPK SSCQVD VF+TTLPYDPLTNYLSPRPRFLRYKPNKRREIF
Sbjct: 61 DPKFTENSSSEIPKVDPKPSSCQVD-VFQTTLPYDPLTNYLSPRPRFLRYKPNKRREIFW 120
Query: 121 RTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDK-EEIRGWTVK 180
R VGEDS SVSHTSSSEEEE KMK EEELEVESEGKSNEIDDEGEGDK EE RGWTVK
Sbjct: 121 RIVGEDSFSVSHTSSSEEEERKMK---EEELEVESEGKSNEIDDEGEGDKEEENRGWTVK 180
Query: 181 ELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFA 240
ELLKFLL+VASLI STLYITSMN+PSPS+EVS AF SGSFPILN TSEFES+ V+ESIFA
Sbjct: 181 ELLKFLLVVASLILSTLYITSMNSPSPSYEVSGAFRSGSFPILNLTSEFESNAVMESIFA 240
Query: 241 NRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRV 300
SN DEEV+EAASMRN E VSQLN+QEDAEDRGF+EE EIL GE G K VRV
Sbjct: 241 EGSNFCDEEVTEAASMRNFEYVSQLNSQEDAEDRGFIEETEILNGEIGGDK-----TVRV 300
Query: 301 EYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCET 360
EYTELVE+AREK LAG SI EEM +GEKNGVELLNF DTGD+EK +ESEIS TTS CET
Sbjct: 301 EYTELVEEAREKPLAGGSITEEMAEGEKNGVELLNFEDTGDREKGKESEISNTTS--CET 360
Query: 361 SEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSESF 420
SEEDE E PNVNGFDEDKLLSNILT
Sbjct: 361 SEEDETAEAPNVNGFDEDKLLSNILT---------------------------------- 420
Query: 421 VLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESES 480
EV EKEEGGDLEMIESNTGESES
Sbjct: 421 -------------------------------------EVDEKEEGGDLEMIESNTGESES 480
Query: 481 FVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTCL 540
FVLEADKITILEGI N+L SF EDLEKLKSELV LMH E+ESVLKAVLGL+VSS MLTCL
Sbjct: 481 FVLEADKITILEGIINNLFSFGEDLEKLKSELVELMHTETESVLKAVLGLTVSSVMLTCL 540
Query: 541 ILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIRN 600
+LSFQ KKKKDDTKVPAISVSVE LLQDPVAK EKV+T+ESP IKAT DV+GS N LIRN
Sbjct: 541 VLSFQHKKKKDDTKVPAISVSVEALLQDPVAKAEKVVTKESPSIKATFDVHGSKNELIRN 600
Query: 601 VDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEVE 660
VD+FKTLS SIHS DE + FKEMYH EAP+VQFLGEFV GEI+NSLK +SG+KNW +EVE
Sbjct: 601 VDSFKTLSPSIHSSDERENFKEMYHVEAPTVQFLGEFVFGEINNSLKNESGLKNWMIEVE 655
Query: 661 DSNFPGSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGHGEVKSI 720
DSNFPGS+EEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGG GEVKSI
Sbjct: 661 DSNFPGSIEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGDGEVKSI 655
Query: 721 PTPVRRSTRIRNRMMSP 737
PTPVRRSTRIRNRMMSP
Sbjct: 721 PTPVRRSTRIRNRMMSP 655
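The identity and positives percentages in each HSP header are the match counts divided by the alignment length (gaps included), for example 556/737 for the hit above. The sketch below recomputes them from a header line. The names `STATS_RE` and `alignment_percentages` are hypothetical helpers, not part of BLAST or of this database, and the pattern is written to accept the label spelled either "Positives" or "Postives".

```python
import re

# Matches e.g. "Identity = 556/737 (75.44%), Positives = 591/737 (80.19%)".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def alignment_percentages(stats_line: str) -> tuple[float, float]:
    """Return (identity %, positives %) recomputed from an HSP statistics line."""
    match = STATS_RE.search(stats_line)
    if match is None:
        raise ValueError("line does not look like an HSP statistics line")
    identical, ident_len, positive, pos_len = map(int, match.groups())
    # Both ratios use the alignment length, which counts gap columns too.
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )


line = "Identity = 556/737 (75.44%), Positives = 591/737 (80.19%), Query Frame = 0"
print(alignment_percentages(line))  # (75.44, 80.19)
```

Because the denominator is the alignment length rather than the query length, hits with long gapped regions can show a lower identity than their ungapped blocks suggest.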
BLAST of HG10002297 vs. NCBI nr
Match:
XP_011651480.1 (uncharacterized protein LOC105434901 [Cucumis sativus] >KGN58086.2 hypothetical protein Csa_009792 [Cucumis sativus])
HSP 1 Score: 950.3 bits (2455), Expect = 9.9e-273
Identity = 552/791 (69.79%), Positives = 622/791 (78.63%), Query Frame = 0
Query: 1 MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAI 60
MEN D+LISS TT+NPS+SSKSD++D+NYP S+ANLN RK GET+KL K +LT+RN AI
Sbjct: 1 MENPDQLISSPTTTNPSMSSKSDESDKNYPPSVANLNWRKQGETKKLDAKNILTDRNGAI 60
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFL 120
DPKFT+NSLSEIPNVD +DS F+ +LPYDPLTNYLSPRPRFLRY+PNKRREIFL
Sbjct: 61 DPKFTDNSLSEIPNVD-------LDSAFQASLPYDPLTNYLSPRPRFLRYEPNKRREIFL 120
Query: 121 RTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDKEEI-RGWTVK 180
+T GE SLSVSHTSSSEEEET +K E EE+LEVESEGKSNEIDDEGEG +EE+ RGW
Sbjct: 121 KTFGEGSLSVSHTSSSEEEETNIK-EVEEQLEVESEGKSNEIDDEGEGYEEEVNRGW--- 180
Query: 181 ELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFA 240
+LLKFL+LV SLIS T YI+SMN+ SPSFE+S AF SGS PILNH+ EF SSPVVES++
Sbjct: 181 KLLKFLVLVVSLISFTFYISSMNSSSPSFEISGAFGSGSIPILNHSIEFLSSPVVESVYG 240
Query: 241 NRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRV 300
N N W EEV+E+ SMRN EGV QLNNQEDA+DRGF+EE EIL GEN GGK GDLVRV
Sbjct: 241 NGRNFWGEEVTESESMRNSEGVRQLNNQEDAKDRGFIEETEILNGENGGGKA--GDLVRV 300
Query: 301 EYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCET 360
E E EK LAGE + EEM +GE + VELLNFGDTGD +K + SE+S T SVPCET
Sbjct: 301 ELVE----KGEKPLAGECVTEEMAEGETSSVELLNFGDTGDWKKIKGSEMSNTISVPCET 360
Query: 361 SEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSESF 420
SEEDEITE NV+G DE KLLSNI TA+ENEYT QM+VVE E+ DL++IE+NT +SESF
Sbjct: 361 SEEDEITEASNVHGLDEVKLLSNISTASENEYTLQMKVVEKEKEEDLEIIENNTGESESF 420
Query: 421 -------------------------------------------------------VLEAD 480
VLEAD
Sbjct: 421 VLEVDKITQASNVNGFDEDRLLSNILTVAENEYSSQMEVVEKEMVESNRGESESSVLEAD 480
Query: 481 KITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESESFVLEA 540
KITE+SN NGFDEDKLL NILT AENEYTPQMEVVEKEE GDLEM+ESNTG+SE FV+EA
Sbjct: 481 KITEASNVNGFDEDKLLYNILTVAENEYTPQMEVVEKEEVGDLEMVESNTGKSEGFVIEA 540
Query: 541 DKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTCLILSFQ 600
DKITILEGI N +SSFVEDLEKLKS+LV LMH E++SVLKAVLGLSVSSA+LTCL+LSFQ
Sbjct: 541 DKITILEGIINRVSSFVEDLEKLKSKLVELMHTETKSVLKAVLGLSVSSAVLTCLVLSFQ 600
Query: 601 QKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIRNVDAFK 660
KKKKDD KVPAISVSVEPLLQ PVA+ EKVI R+SP IK T DVN +NN +IRNVD+FK
Sbjct: 601 LKKKKDDIKVPAISVSVEPLLQGPVAEAEKVIVRKSPSIKVTRDVNRTNNEIIRNVDSFK 660
Query: 661 TLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEVEDSNFP 720
LS+SIHSRDEG FK M+HNEAP+VQF GEFVVGEISNSLK K + NWT+EVEDSNFP
Sbjct: 661 KLSSSIHSRDEGGNFKVMHHNEAPTVQF-GEFVVGEISNSLKGK--LNNWTIEVEDSNFP 720
Query: 721 GSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGHGEVKSIPTPVR 736
GSVEE+PV +NM SGPEQALSEFSATTSSPSYGSFTT K+IVK+EVGG GEVK IPTPVR
Sbjct: 721 GSVEEEPV-RNMTSGPEQALSEFSATTSSPSYGSFTTMKRIVKREVGGDGEVKLIPTPVR 770
BLAST of HG10002297 vs. NCBI nr
Match:
XP_038877910.1 (uncharacterized protein LOC120070117 isoform X2 [Benincasa hispida])
HSP 1 Score: 912.1 bits (2356), Expect = 3.0e-261
Identity = 531/711 (74.68%), Positives = 566/711 (79.61%), Query Frame = 0
Query: 1 MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAI 60
MENSD++ISSSTTSNPSL SKSD+NDQNYP S+ANL+ KLGE EK+VKKKVLTERNEA+
Sbjct: 1 MENSDQVISSSTTSNPSLFSKSDENDQNYPPSVANLDRGKLGEMEKVVKKKVLTERNEAM 60
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFL 120
DPKFTENS SEIP VDPK SSCQVD VF+TTLPYDPLTNYLSPRPRFLRYKPNKRREIF
Sbjct: 61 DPKFTENSSSEIPKVDPKPSSCQVD-VFQTTLPYDPLTNYLSPRPRFLRYKPNKRREIFW 120
Query: 121 RTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDK-EEIRGWTVK 180
R VGEDS SVSHTSSSEEEE KMK EEELEVESEGKSNEIDDEGEGDK EE RGWTVK
Sbjct: 121 RIVGEDSFSVSHTSSSEEEERKMK---EEELEVESEGKSNEIDDEGEGDKEEENRGWTVK 180
Query: 181 ELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFA 240
ELLKFLL+VASLI STLYITSMN+PSPS+EVS AF SGSFPILN TSEFES+ V+ESIFA
Sbjct: 181 ELLKFLLVVASLILSTLYITSMNSPSPSYEVSGAFRSGSFPILNLTSEFESNAVMESIFA 240
Query: 241 NRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRV 300
SN DEEV+EAASMRN E VSQLN+QEDAEDRGF+EE EIL GE G K VRV
Sbjct: 241 EGSNFCDEEVTEAASMRNFEYVSQLNSQEDAEDRGFIEETEILNGEIGGDK-----TVRV 300
Query: 301 EYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCET 360
EYTELVE+AREK LAG SI EEM +GEKNGVELLNF DTGD+EK +ESEIS TTS CET
Sbjct: 301 EYTELVEEAREKPLAGGSITEEMAEGEKNGVELLNFEDTGDREKGKESEISNTTS--CET 360
Query: 361 SEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSESF 420
SEEDE E PNVNGFDEDKLLSNILT
Sbjct: 361 SEEDETAEAPNVNGFDEDKLLSNILT---------------------------------- 420
Query: 421 VLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESES 480
EV EKEEGGDLEMIESNTGESES
Sbjct: 421 -------------------------------------EVDEKEEGGDLEMIESNTGESES 480
Query: 481 FVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTCL 540
FVLEADKITILEGI N+L SF EDLEKLKSELV LMH E+ESVLKAVLGL+VSS MLTCL
Sbjct: 481 FVLEADKITILEGIINNLFSFGEDLEKLKSELVELMHTETESVLKAVLGLTVSSVMLTCL 540
Query: 541 ILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIRN 600
+LSFQ KKKKDDTKVPAISVSVE LLQDPVAK EKV+T+ESP IKAT DV+GS N LIRN
Sbjct: 541 VLSFQHKKKKDDTKVPAISVSVEALLQDPVAKAEKVVTKESPSIKATFDVHGSKNELIRN 600
Query: 601 VDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEVE 660
VD+FKTLS SIHS DE + FKEMYH EAP+VQFLGEFV GEI+NSLK +SG+KNW +EVE
Sbjct: 601 VDSFKTLSPSIHSSDERENFKEMYHVEAPTVQFLGEFVFGEINNSLKNESGLKNWMIEVE 629
Query: 661 DSNFPGSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEV 711
DSNFPGS+EEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEV
Sbjct: 661 DSNFPGSIEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEV 629
BLAST of HG10002297 vs. NCBI nr
Match:
XP_008447562.1 (PREDICTED: uncharacterized protein LOC103489978 [Cucumis melo] >KAA0050853.1 uncharacterized protein E6C27_scaffold404G001220 [Cucumis melo var. makuwa] >TYK08499.1 uncharacterized protein E5676_scaffold323G00270 [Cucumis melo var. makuwa])
HSP 1 Score: 899.4 bits (2323), Expect = 2.0e-257
Identity = 517/736 (70.24%), Positives = 580/736 (78.80%), Query Frame = 0
Query: 1 MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAI 60
MEN D++ SS TT+NPS+SS SD+NDQNYPS ++NLN I
Sbjct: 1 MENPDQVNSSPTTTNPSMSSTSDENDQNYPSYVSNLNC--------------------PI 60
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFL 120
DPKFTENSLSEIPNVD +DSVF+ +LPYDPLTNYLSPRPRFLRYKP+KRREIFL
Sbjct: 61 DPKFTENSLSEIPNVD-------LDSVFQASLPYDPLTNYLSPRPRFLRYKPSKRREIFL 120
Query: 121 RTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDKEEI-RGWTVK 180
RT GEDSLSVSHTSSSEE+ T +K EEEE+LEVESEGKSN IDDEGEGD+EE RGW
Sbjct: 121 RTFGEDSLSVSHTSSSEEKGTNIK-EEEEQLEVESEGKSNAIDDEGEGDEEEANRGW--- 180
Query: 181 ELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFA 240
+LLKFL++V SLISSTLYI+SMN+ SPSFEVS AF SGSFPILNHT EF SSPVVES++
Sbjct: 181 KLLKFLVVVVSLISSTLYISSMNSASPSFEVSGAFRSGSFPILNHTIEFWSSPVVESVYG 240
Query: 241 NRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRV 300
N N WDEEV+E+ SMRN EGV QL
Sbjct: 241 NGRNFWDEEVTESESMRNCEGVGQL----------------------------------- 300
Query: 301 EYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISK-TTSVPCE 360
+ REK LAG+ I EEM +GE + VELLNFGDTGD+++ +E E+S TTSVPCE
Sbjct: 301 ------VEKREKPLAGQCITEEMAEGETSSVELLNFGDTGDRKRIKEPEMSNATTSVPCE 360
Query: 361 TSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSES 420
TSE++EITE NVNG DE KLLSNI TAAENEY QM+VVE E+ DL+MIE+NT +SES
Sbjct: 361 TSEKNEITEASNVNGLDEVKLLSNISTAAENEYASQMKVVEKEKEEDLEMIENNTGQSES 420
Query: 421 FVLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESE 480
FVLE DKIT++SN NGFDEDKLLSNILT A+NEYTPQMEVVEKEE GDLEM+ESNTG+SE
Sbjct: 421 FVLEVDKITQASNVNGFDEDKLLSNILTVAKNEYTPQMEVVEKEEVGDLEMVESNTGKSE 480
Query: 481 SFVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTC 540
SFV+E DK+TIL+GI N LSSFVEDLEKLKS+LV LMH E+ESVLKAVLGLSVSSA+LTC
Sbjct: 481 SFVIEEDKVTILDGIKNRLSSFVEDLEKLKSKLVELMHTETESVLKAVLGLSVSSAVLTC 540
Query: 541 LILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIR 600
L+ SFQ KK DDTKVPAISVSVEPLLQ PVAK EKV R+S IKAT DVN +NN +IR
Sbjct: 541 LVSSFQLKKNLDDTKVPAISVSVEPLLQGPVAKAEKVTVRKSSSIKATRDVNRTNNEIIR 600
Query: 601 NVDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEV 660
NVD+FK LS+SIHSRDEG+ FKEM+HNEA +VQFLGEFVVGEISNSLK K +KNW +EV
Sbjct: 601 NVDSFKKLSSSIHSRDEGENFKEMHHNEASTVQFLGEFVVGEISNSLKNKGKLKNWMMEV 660
Query: 661 EDSNFPGSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGHGEVKS 720
EDSNF GSVEE+PVSKN SGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGG GEVKS
Sbjct: 661 EDSNFAGSVEEEPVSKNKTSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGDGEVKS 664
Query: 721 IPTPVRRSTRIRNRMM 735
IPTPVRRS RIRNRMM
Sbjct: 721 IPTPVRRSNRIRNRMM 664
BLAST of HG10002297 vs. NCBI nr
Match:
XP_022153660.1 (uncharacterized protein LOC111021113 [Momordica charantia])
HSP 1 Score: 611.3 bits (1575), Expect = 1.1e-170
Identity = 415/764 (54.32%), Positives = 488/764 (63.87%), Query Frame = 0
Query: 11 STTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAIDPKFTENSLS 70
S S ++ SKSD+N+QNYP SI NL+ RK GETEK KKVLTERN+A+D K +N LS
Sbjct: 3 SKLSASTMFSKSDENNQNYPPSIVNLDPRKSGETEKSA-KKVLTERNKAMDLKSGDNPLS 62
Query: 71 EIPNVDPKSSSCQVD------SVFETTL-----------------PYDPLTNYLSPRPRF 130
EI DP S CQVD S+ +T L YDPLTNYLSPRP+F
Sbjct: 63 EIAKFDP--SFCQVDSGSRGNSMSQTRLLSSRVSDFDGDEKNSVAAYDPLTNYLSPRPKF 122
Query: 131 LRYKPNKRREIFLR--TVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDE 190
LRYKP++RREIF R G + VS T SSEEE K K E++E E EI+DE
Sbjct: 123 LRYKPSRRREIFFRQQNDGAAEILVSPTPSSEEETGKGK---MEDIEGECCEIDEEIEDE 182
Query: 191 GEGDKEEIRGWTVKELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHT 250
GEGD TVK LLKFLL +A L+ STLYITSMNTP+PSFEVSR F SG PILNHT
Sbjct: 183 GEGD------GTVKGLLKFLLTIAGLVLSTLYITSMNTPTPSFEVSRIFRSGFCPILNHT 242
Query: 251 SEF-ESSPVVESIFANRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKG 310
EF S+ V+E++ A SNLWDEEV+EA S N EGV Q +QEDA++ GF+EE E+L G
Sbjct: 243 DEFGGSNLVIETLSAKGSNLWDEEVTEATSNMNPEGVGQFIHQEDAKNVGFLEETEMLNG 302
Query: 311 ENEGGKGGYGDLVRVEYTELVEDAREKLLAGE--SIIEEMDDGEKNGVEL--LNFGDTGD 370
ENE YG+L +VE E VE+ EK AG ++ +EM +GE+N VE L D G+
Sbjct: 303 ENE-----YGNLEKVEDPEQVEEVVEKSQAGPGGTMADEMTEGEENEVEFSELIVEDDGN 362
Query: 371 QEKREESEISKTTSVPCETSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVEN 430
QEKR+E++ S S P +NGFD+D LLS+IL A NEYTP+ E
Sbjct: 363 QEKRKENDESIQASKP------------SILNGFDQDNLLSDILVAVGNEYTPKQE---- 422
Query: 431 EEGGDLQMIESNTWKSESFVLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVE 490
EV E
Sbjct: 423 --------------------------------------------------------EVFE 482
Query: 491 KEEGGDLEMIESNTGESESFVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESE 550
EE GD EM+ESN GE+ES V EA K TI E N +SSFVEDLEKLKSELV LMH E+E
Sbjct: 483 MEEVGDWEMVESNKGEAESSVREASKSTIWERTANVISSFVEDLEKLKSELVELMHTETE 542
Query: 551 SVLKAVLGLSVSSAMLTCLILSFQQKKKKDDTKVPAISVSVEP-LLQDPVAKTEKVITRE 610
SVLK +LGLSVSSA+LTCL+LSFQ KKKK D KVP IS SV P LLQ PV + EK+ITRE
Sbjct: 543 SVLKVILGLSVSSAILTCLVLSFQFKKKKVDKKVPTISASVTPSLLQSPVVEAEKIITRE 602
Query: 611 SP-------VIKATCDVNGSNNRLIRNVDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQF 670
P IK TC V+ SN+ I NVD+FK LS+SIHSRDE + KE+YH+EAP+VQF
Sbjct: 603 PPSPSRSPSTIKPTCVVDKSNHEHIGNVDSFKMLSSSIHSRDEVESSKELYHHEAPTVQF 662
Query: 671 LGEFVVGEISNSLKTKSGMKNWTVEVEDSNFPGSVEEKPVSKNMNSGPEQALSEFSATTS 730
LGE VVG +SNSLK +SG+KN +E EDS+F SVE+KPVSKNMNSGPE+ALSEFS TTS
Sbjct: 663 LGEIVVGGMSNSLKNRSGLKNRMIEAEDSSFHASVEQKPVSKNMNSGPEEALSEFS-TTS 676
Query: 731 SPSYGSFTTKKKIVKKEVGGHGEVKSIPTPVRRSTRIRNRMMSP 737
SPSYGS TKKK VKKEV G EVKSIPTPVRRS+RIRNR++SP
Sbjct: 723 SPSYGSTITKKKAVKKEVRGDEEVKSIPTPVRRSSRIRNRIVSP 676
BLAST of HG10002297 vs. ExPASy TrEMBL
Match:
A0A5A7U4S8 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold323G00270 PE=4 SV=1)
HSP 1 Score: 899.4 bits (2323), Expect = 9.8e-258
Identity = 517/736 (70.24%), Positives = 580/736 (78.80%), Query Frame = 0
Query: 1 MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAI 60
MEN D++ SS TT+NPS+SS SD+NDQNYPS ++NLN I
Sbjct: 1 MENPDQVNSSPTTTNPSMSSTSDENDQNYPSYVSNLNC--------------------PI 60
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFL 120
DPKFTENSLSEIPNVD +DSVF+ +LPYDPLTNYLSPRPRFLRYKP+KRREIFL
Sbjct: 61 DPKFTENSLSEIPNVD-------LDSVFQASLPYDPLTNYLSPRPRFLRYKPSKRREIFL 120
Query: 121 RTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDKEEI-RGWTVK 180
RT GEDSLSVSHTSSSEE+ T +K EEEE+LEVESEGKSN IDDEGEGD+EE RGW
Sbjct: 121 RTFGEDSLSVSHTSSSEEKGTNIK-EEEEQLEVESEGKSNAIDDEGEGDEEEANRGW--- 180
Query: 181 ELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFA 240
+LLKFL++V SLISSTLYI+SMN+ SPSFEVS AF SGSFPILNHT EF SSPVVES++
Sbjct: 181 KLLKFLVVVVSLISSTLYISSMNSASPSFEVSGAFRSGSFPILNHTIEFWSSPVVESVYG 240
Query: 241 NRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRV 300
N N WDEEV+E+ SMRN EGV QL
Sbjct: 241 NGRNFWDEEVTESESMRNCEGVGQL----------------------------------- 300
Query: 301 EYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISK-TTSVPCE 360
+ REK LAG+ I EEM +GE + VELLNFGDTGD+++ +E E+S TTSVPCE
Sbjct: 301 ------VEKREKPLAGQCITEEMAEGETSSVELLNFGDTGDRKRIKEPEMSNATTSVPCE 360
Query: 361 TSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSES 420
TSE++EITE NVNG DE KLLSNI TAAENEY QM+VVE E+ DL+MIE+NT +SES
Sbjct: 361 TSEKNEITEASNVNGLDEVKLLSNISTAAENEYASQMKVVEKEKEEDLEMIENNTGQSES 420
Query: 421 FVLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESE 480
FVLE DKIT++SN NGFDEDKLLSNILT A+NEYTPQMEVVEKEE GDLEM+ESNTG+SE
Sbjct: 421 FVLEVDKITQASNVNGFDEDKLLSNILTVAKNEYTPQMEVVEKEEVGDLEMVESNTGKSE 480
Query: 481 SFVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTC 540
SFV+E DK+TIL+GI N LSSFVEDLEKLKS+LV LMH E+ESVLKAVLGLSVSSA+LTC
Sbjct: 481 SFVIEEDKVTILDGIKNRLSSFVEDLEKLKSKLVELMHTETESVLKAVLGLSVSSAVLTC 540
Query: 541 LILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIR 600
L+ SFQ KK DDTKVPAISVSVEPLLQ PVAK EKV R+S IKAT DVN +NN +IR
Sbjct: 541 LVSSFQLKKNLDDTKVPAISVSVEPLLQGPVAKAEKVTVRKSSSIKATRDVNRTNNEIIR 600
Query: 601 NVDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEV 660
NVD+FK LS+SIHSRDEG+ FKEM+HNEA +VQFLGEFVVGEISNSLK K +KNW +EV
Sbjct: 601 NVDSFKKLSSSIHSRDEGENFKEMHHNEASTVQFLGEFVVGEISNSLKNKGKLKNWMMEV 660
Query: 661 EDSNFPGSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGHGEVKS 720
EDSNF GSVEE+PVSKN SGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGG GEVKS
Sbjct: 661 EDSNFAGSVEEEPVSKNKTSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGDGEVKS 664
Query: 721 IPTPVRRSTRIRNRMM 735
IPTPVRRS RIRNRMM
Sbjct: 721 IPTPVRRSNRIRNRMM 664
BLAST of HG10002297 vs. ExPASy TrEMBL
Match:
A0A1S3BHQ6 (uncharacterized protein LOC103489978 OS=Cucumis melo OX=3656 GN=LOC103489978 PE=4 SV=1)
HSP 1 Score: 899.4 bits (2323), Expect = 9.8e-258
Identity = 517/736 (70.24%), Positives = 580/736 (78.80%), Query Frame = 0
Query: 1 MENSDKLISSSTTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAI 60
MEN D++ SS TT+NPS+SS SD+NDQNYPS ++NLN I
Sbjct: 1 MENPDQVNSSPTTTNPSMSSTSDENDQNYPSYVSNLNC--------------------PI 60
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKRREIFL 120
DPKFTENSLSEIPNVD +DSVF+ +LPYDPLTNYLSPRPRFLRYKP+KRREIFL
Sbjct: 61 DPKFTENSLSEIPNVD-------LDSVFQASLPYDPLTNYLSPRPRFLRYKPSKRREIFL 120
Query: 121 RTVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDEGEGDKEEI-RGWTVK 180
RT GEDSLSVSHTSSSEE+ T +K EEEE+LEVESEGKSN IDDEGEGD+EE RGW
Sbjct: 121 RTFGEDSLSVSHTSSSEEKGTNIK-EEEEQLEVESEGKSNAIDDEGEGDEEEANRGW--- 180
Query: 181 ELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFA 240
+LLKFL++V SLISSTLYI+SMN+ SPSFEVS AF SGSFPILNHT EF SSPVVES++
Sbjct: 181 KLLKFLVVVVSLISSTLYISSMNSASPSFEVSGAFRSGSFPILNHTIEFWSSPVVESVYG 240
Query: 241 NRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRV 300
N N WDEEV+E+ SMRN EGV QL
Sbjct: 241 NGRNFWDEEVTESESMRNCEGVGQL----------------------------------- 300
Query: 301 EYTELVEDAREKLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISK-TTSVPCE 360
+ REK LAG+ I EEM +GE + VELLNFGDTGD+++ +E E+S TTSVPCE
Sbjct: 301 ------VEKREKPLAGQCITEEMAEGETSSVELLNFGDTGDRKRIKEPEMSNATTSVPCE 360
Query: 361 TSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSES 420
TSE++EITE NVNG DE KLLSNI TAAENEY QM+VVE E+ DL+MIE+NT +SES
Sbjct: 361 TSEKNEITEASNVNGLDEVKLLSNISTAAENEYASQMKVVEKEKEEDLEMIENNTGQSES 420
Query: 421 FVLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESE 480
FVLE DKIT++SN NGFDEDKLLSNILT A+NEYTPQMEVVEKEE GDLEM+ESNTG+SE
Sbjct: 421 FVLEVDKITQASNVNGFDEDKLLSNILTVAKNEYTPQMEVVEKEEVGDLEMVESNTGKSE 480
Query: 481 SFVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAMLTC 540
SFV+E DK+TIL+GI N LSSFVEDLEKLKS+LV LMH E+ESVLKAVLGLSVSSA+LTC
Sbjct: 481 SFVIEEDKVTILDGIKNRLSSFVEDLEKLKSKLVELMHTETESVLKAVLGLSVSSAVLTC 540
Query: 541 LILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIR 600
L+ SFQ KK DDTKVPAISVSVEPLLQ PVAK EKV R+S IKAT DVN +NN +IR
Sbjct: 541 LVSSFQLKKNLDDTKVPAISVSVEPLLQGPVAKAEKVTVRKSSSIKATRDVNRTNNEIIR 600
Query: 601 NVDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEV 660
NVD+FK LS+SIHSRDEG+ FKEM+HNEA +VQFLGEFVVGEISNSLK K +KNW +EV
Sbjct: 601 NVDSFKKLSSSIHSRDEGENFKEMHHNEASTVQFLGEFVVGEISNSLKNKGKLKNWMMEV 660
Query: 661 EDSNFPGSVEEKPVSKNMNSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGHGEVKS 720
EDSNF GSVEE+PVSKN SGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGG GEVKS
Sbjct: 661 EDSNFAGSVEEEPVSKNKTSGPEQALSEFSATTSSPSYGSFTTKKKIVKKEVGGDGEVKS 664
Query: 721 IPTPVRRSTRIRNRMM 735
IPTPVRRS RIRNRMM
Sbjct: 721 IPTPVRRSNRIRNRMM 664
BLAST of HG10002297 vs. ExPASy TrEMBL
Match:
A0A0A0LAS7 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G435540 PE=4 SV=1)
HSP 1 Score: 702.2 bits (1811), Expect = 2.3e-198
Identity = 405/590 (68.64%), Positives = 452/590 (76.61%), Query Frame = 0
Query: 201 MNTPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFANRSNLWDEEVSEAASMRNLEG 260
MN+ SPSFE+S AF SGS PILNH+ EF SSPVVES++ N N W EEV+E+ SMRN EG
Sbjct: 1 MNSSSPSFEISGAFGSGSIPILNHSIEFLSSPVVESVYGNGRNFWGEEVTESESMRNSEG 60
Query: 261 VSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRVEYTELVEDAREKLLAGESIIE 320
V QLNNQEDA+DRGF+EE EIL GEN GGK GDLVRVE E EK LAGE + E
Sbjct: 61 VRQLNNQEDAKDRGFIEETEILNGENGGGKA--GDLVRVELVE----KGEKPLAGECVTE 120
Query: 321 EMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCETSEEDEITEVPNVNGFDEDKLL 380
EM +GE + VELLNFGDTGD +K + SE+S T SVPCETSEEDEITE NV+G DE KLL
Sbjct: 121 EMAEGETSSVELLNFGDTGDWKKIKGSEMSNTISVPCETSEEDEITEASNVHGLDEVKLL 180
Query: 381 SNILTAAENEYTPQMEVVENEEGGDLQMIESNTWKSESF--------------------- 440
SNI TA+ENEYT QM+VVE E+ DL++IE+NT +SESF
Sbjct: 181 SNISTASENEYTLQMKVVEKEKEEDLEIIENNTGESESFVLEVDKITQASNVNGFDEDRL 240
Query: 441 ----------------------------------VLEADKITESSNFNGFDEDKLLSNIL 500
VLEADKITE+SN NGFDEDKLL NIL
Sbjct: 241 LSNILTVAENEYSSQMEVVEKEMVESNRGESESSVLEADKITEASNVNGFDEDKLLYNIL 300
Query: 501 TAAENEYTPQMEVVEKEEGGDLEMIESNTGESESFVLEADKITILEGITNSLSSFVEDLE 560
T AENEYTPQMEVVEKEE GDLEM+ESNTG+SE FV+EADKITILEGI N +SSFVEDLE
Sbjct: 301 TVAENEYTPQMEVVEKEEVGDLEMVESNTGKSEGFVIEADKITILEGIINRVSSFVEDLE 360
Query: 561 KLKSELVGLMHAESESVLKAVLGLSVSSAMLTCLILSFQQKKKKDDTKVPAISVSVEPLL 620
KLKS+LV LMH E++SVLKAVLGLSVSSA+LTCL+LSFQ KKKKDD KVPAISVSVEPLL
Sbjct: 361 KLKSKLVELMHTETKSVLKAVLGLSVSSAVLTCLVLSFQLKKKKDDIKVPAISVSVEPLL 420
Query: 621 QDPVAKTEKVITRESPVIKATCDVNGSNNRLIRNVDAFKTLSASIHSRDEGKKFKEMYHN 680
Q PVA+ EKVI R+SP IK T DVN +NN +IRNVD+FK LS+SIHSRDEG FK M+HN
Sbjct: 421 QGPVAEAEKVIVRKSPSIKVTRDVNRTNNEIIRNVDSFKKLSSSIHSRDEGGNFKVMHHN 480
Query: 681 EAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEVEDSNFPGSVEEKPVSKNMNSGPEQALS 736
EAP+VQF GEFVVGEISNSLK K + NWT+EVEDSNFPGSVEE+PV +NM SGPEQALS
Sbjct: 481 EAPTVQF-GEFVVGEISNSLKGK--LNNWTIEVEDSNFPGSVEEEPV-RNMTSGPEQALS 540
BLAST of HG10002297 vs. ExPASy TrEMBL
Match:
A0A6J1DHF6 (uncharacterized protein LOC111021113 OS=Momordica charantia OX=3673 GN=LOC111021113 PE=4 SV=1)
HSP 1 Score: 611.3 bits (1575), Expect = 5.3e-171
Identity = 415/764 (54.32%), Positives = 488/764 (63.87%), Query Frame = 0
Query: 11 STTSNPSLSSKSDQNDQNYPSSIANLNLRKLGETEKLVKKKVLTERNEAIDPKFTENSLS 70
S S ++ SKSD+N+QNYP SI NL+ RK GETEK KKVLTERN+A+D K +N LS
Sbjct: 3 SKLSASTMFSKSDENNQNYPPSIVNLDPRKSGETEKSA-KKVLTERNKAMDLKSGDNPLS 62
Query: 71 EIPNVDPKSSSCQVD------SVFETTL-----------------PYDPLTNYLSPRPRF 130
EI DP S CQVD S+ +T L YDPLTNYLSPRP+F
Sbjct: 63 EIAKFDP--SFCQVDSGSRGNSMSQTRLLSSRVSDFDGDEKNSVAAYDPLTNYLSPRPKF 122
Query: 131 LRYKPNKRREIFLR--TVGEDSLSVSHTSSSEEEETKMKGEEEEELEVESEGKSNEIDDE 190
LRYKP++RREIF R G + VS T SSEEE K K E++E E EI+DE
Sbjct: 123 LRYKPSRRREIFFRQQNDGAAEILVSPTPSSEEETGKGK---MEDIEGECCEIDEEIEDE 182
Query: 191 GEGDKEEIRGWTVKELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAFNSGSFPILNHT 250
GEGD TVK LLKFLL +A L+ STLYITSMNTP+PSFEVSR F SG PILNHT
Sbjct: 183 GEGD------GTVKGLLKFLLTIAGLVLSTLYITSMNTPTPSFEVSRIFRSGFCPILNHT 242
Query: 251 SEF-ESSPVVESIFANRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKG 310
EF S+ V+E++ A SNLWDEEV+EA S N EGV Q +QEDA++ GF+EE E+L G
Sbjct: 243 DEFGGSNLVIETLSAKGSNLWDEEVTEATSNMNPEGVGQFIHQEDAKNVGFLEETEMLNG 302
Query: 311 ENEGGKGGYGDLVRVEYTELVEDAREKLLAGE--SIIEEMDDGEKNGVEL--LNFGDTGD 370
ENE YG+L +VE E VE+ EK AG ++ +EM +GE+N VE L D G+
Sbjct: 303 ENE-----YGNLEKVEDPEQVEEVVEKSQAGPGGTMADEMTEGEENEVEFSELIVEDDGN 362
Query: 371 QEKREESEISKTTSVPCETSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVEN 430
QEKR+E++ S S P +NGFD+D LLS+IL A NEYTP+ E
Sbjct: 363 QEKRKENDESIQASKP------------SILNGFDQDNLLSDILVAVGNEYTPKQE---- 422
Query: 431 EEGGDLQMIESNTWKSESFVLEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVE 490
EV E
Sbjct: 423 --------------------------------------------------------EVFE 482
Query: 491 KEEGGDLEMIESNTGESESFVLEADKITILEGITNSLSSFVEDLEKLKSELVGLMHAESE 550
EE GD EM+ESN GE+ES V EA K TI E N +SSFVEDLEKLKSELV LMH E+E
Sbjct: 483 MEEVGDWEMVESNKGEAESSVREASKSTIWERTANVISSFVEDLEKLKSELVELMHTETE 542
Query: 551 SVLKAVLGLSVSSAMLTCLILSFQQKKKKDDTKVPAISVSVEP-LLQDPVAKTEKVITRE 610
SVLK +LGLSVSSA+LTCL+LSFQ KKKK D KVP IS SV P LLQ PV + EK+ITRE
Sbjct: 543 SVLKVILGLSVSSAILTCLVLSFQFKKKKVDKKVPTISASVTPSLLQSPVVEAEKIITRE 602
Query: 611 SP-------VIKATCDVNGSNNRLIRNVDAFKTLSASIHSRDEGKKFKEMYHNEAPSVQF 670
P IK TC V+ SN+ I NVD+FK LS+SIHSRDE + KE+YH+EAP+VQF
Sbjct: 603 PPSPSRSPSTIKPTCVVDKSNHEHIGNVDSFKMLSSSIHSRDEVESSKELYHHEAPTVQF 662
Query: 671 LGEFVVGEISNSLKTKSGMKNWTVEVEDSNFPGSVEEKPVSKNMNSGPEQALSEFSATTS 730
LGE VVG +SNSLK +SG+KN +E EDS+F SVE+KPVSKNMNSGPE+ALSEFS TTS
Sbjct: 663 LGEIVVGGMSNSLKNRSGLKNRMIEAEDSSFHASVEQKPVSKNMNSGPEEALSEFS-TTS 676
Query: 731 SPSYGSFTTKKKIVKKEVGGHGEVKSIPTPVRRSTRIRNRMMSP 737
SPSYGS TKKK VKKEV G EVKSIPTPVRRS+RIRNR++SP
Sbjct: 723 SPSYGSTITKKKAVKKEVRGDEEVKSIPTPVRRSSRIRNRIVSP 676
BLAST of HG10002297 vs. ExPASy TrEMBL
Match:
A0A2N9GRA5 (Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS32859 PE=4 SV=1)
HSP 1 Score: 206.5 bits (524), Expect = 3.9e-49
Identity = 262/826 (31.72%), Positives = 400/826 (48.43%), Query Frame = 0
Query: 1 MENSDKLISSSTT----SNPSLSSKSDQNDQNYPSSIAN-LNLRKLGETE---------- 60
M+ DK + SS+T S + SD+NDQ+ N N +KL +
Sbjct: 5 MDGPDKGVLSSSTPIKASEFPRVTVSDENDQSNQQLGPNPPNPKKLTQKHYMSPTISAAS 64
Query: 61 --KLVKKKVLTERNEAIDPKFTENSLSEIPNVDPK--------------------SSSCQ 120
+ +KK+L ERNEA + F+E + + N++ K S+ +
Sbjct: 65 KATVPRKKILGERNEASESIFSEPHVQKASNIESKLTFANPGTDVSDIPSPQGFESNGNE 124
Query: 121 VDSVFE--TTLPYDPLTNYLSPRPRFLRYKPNKRREIFLR-----TVGEDSLSVSHTSSS 180
++V + PYDPLTNYLSPRP+FLRYKPN+RREIF R V +D LS+S + S
Sbjct: 125 QNAVIAEGSLKPYDPLTNYLSPRPKFLRYKPNRRREIFFRLEDEIRVEKDGLSISTSGSF 184
Query: 181 EEEETKMKGEEEE------ELEVESEGKSNEIDDEG--EGDK--EEIRGWTVKELLKFLL 240
E + K+ EE + L S+ S + +DEG E D+ EE RGW+VK +L+ LL
Sbjct: 185 ESQ--KVSDEESDSGCNHGSLVSSSQEGSGQQEDEGIEESDEEIEEERGWSVKRVLESLL 244
Query: 241 LVASLISSTLYITSMN--TPSPSFEVSRAFNSGSFPILNHTSEFESSPVVESIFANRSNL 300
L+ STLYI+SMN PSPS + G I NHT E + F + S++
Sbjct: 245 WFVLLVFSTLYISSMNFPAPSPSLQSFEGPRYGCCNIQNHTVE-----ALVKTFDSGSHI 304
Query: 301 WD--EEVSEAASMRNLEGVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRVEYT 360
WD EE+ R+ E + + +++E E+++ GG
Sbjct: 305 WDQREEIQMGLDQRSQEAIDE-----------WLKEEEMVQDVKMGG------------V 364
Query: 361 ELVEDARE-KLLAGESIIEEMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCETSE 420
E+ E+ E KL GES E+D+ EK V
Sbjct: 365 EIAEELNEVKLEGGESEAIEVDEDEKKEV------------------------------- 424
Query: 421 EDEITEVPNVNGFDEDKLLSNILTAAENEYTPQMEVVEN-EEGGDLQMIESNTWKSESFV 480
DE+ EV V+ ED I +N+ E+ + +E ++Q + K E+F
Sbjct: 425 VDELGEVVLVDPQAED-----IKANCDNQRVFTREISDQMDENSEVQNAGTKE-KFEAFK 484
Query: 481 -LEADKITESSNFNGFDEDKLLSNILTAAENEYTPQMEVVEKEEGGDLEMIESNTGESES 540
EA +++ + + D +SN+L + + E V E+ GD EMIE N E E+
Sbjct: 485 DYEAPSMSDGVGHHIIESD--ISNVLEEENDVWLEATEEVNNEKAGDEEMIERNMEEMEN 544
Query: 541 F--VLEADKITILEGITNSLS-SFVEDLEKLKSELVGLMHAESESVLKAVLGLSVSSAML 600
V+ ++++ ++ T LS EDLE+ + + E+ES+LK V+G+ V S ++
Sbjct: 545 VMQVISSERMDNVDANTEILSLELEEDLEEGPKKKL-----ETESLLKPVIGVLVFSMIV 604
Query: 601 TCLILSFQQKKKKDDTKVPAISVSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRL 660
L+L F K KK K S+ +P + + +K I +ES + + ++ +
Sbjct: 605 AFLVLDFCFKSKKTIGK--DSSLIAKPCSGSLIVEKKKTIEKESLISIEQNETTRKDSLI 664
Query: 661 IR----NVDAFKTLSASIHSRDEG-----------------KKFKEMYH-NEAPSVQFLG 720
++ +V A K SA + +R+E K+ + YH + APSV+ LG
Sbjct: 665 VKPCIESVMAEK-CSAVLPNREEAHIARADSFRSHSSFHPIKEVSKDYHESRAPSVELLG 724
Query: 721 EFVVGEISNSLKTKSGMKNWTVEVEDSNFPGSVEEKPVSKNMNSGPEQ---ALSEFSATT 737
EFVVGE+S+SL++ G+KN +E E+S++ S ++K SK+ +S P Q A SEFS+
Sbjct: 725 EFVVGEVSSSLRS-CGVKNRRMESEESSYSVSTDKKSWSKS-HSVPVQSQSAFSEFSSRD 750
BLAST of HG10002297 vs. TAIR 10
Match:
AT1G16630.1 (unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT2G16270.1); Has 10587 Blast hits to 5736 proteins in 617 species: Archae - 88; Bacteria - 963; Metazoa - 3686; Fungi - 820; Plants - 541; Viruses - 438; Other Eukaryotes - 4051 (source: NCBI BLink). )
HSP 1 Score: 53.1 bits (126), Expect = 1.1e-06
Identity = 197/777 (25.35%), Positives = 310/777 (39.90%), Query Frame = 0
Query: 41 LGETEKLVKKK----VLTERNEAIDPKFTENSLSEI-------PNVDPKSSSCQVDSVFE 100
+ ET++L +++ +++ +E ++ K +NS +I P P S +VD V
Sbjct: 160 IDETKQLREEESHDITVSDFDEILERKSNDNSSFKISPLPPYVPCTFPVFESHEVDPV-- 219
Query: 101 TTLPYDPLTNYLSPRPRFLRYKPNKR--------REIFLRTVGEDSLSVSHTSSSEEEET 160
PYDP NYLSPRP+FL YKPN + +++ + E S S + S+ EEE
Sbjct: 220 -VAPYDPKKNYLSPRPQFLHYKPNPKIEHRSDECKQLEELFISESSSSDTDLSAEREEEG 279
Query: 161 KMK------------------GEE-----EEELEVESEGK--SNEIDDE------GEGDK 220
+ + GEE EE L+V+ E + + E DDE GE +
Sbjct: 280 QQEEEVASQEGVVAVEEQEDDGEERLEAAEEILDVDGEERLEAVESDDEEEEVVVGESIE 339
Query: 221 EE----------------IRGWTVKELLKFLLLVASLISSTLYITSMNTPSPSFEVSRAF 280
EE + GW + + +LLLV SST + T SP ++ F
Sbjct: 340 EEETHQISKQSRFSKTSMLLGWILALGVAYLLLV----SSTTFSQQTITDSPFYQ----F 399
Query: 281 NSGSFPILNHTSEFESSPVVESIFANRSNLWDEEVSEAASMRNLEGVSQLNNQEDAEDRG 340
N I++ + FE ++A S ++ +++ VS L +E +
Sbjct: 400 NISPEIIMSASENFEQLGAKLRMWAESSFVYLDKL-----------VSSLREEEGSVPFQ 459
Query: 341 FMEENEILKGENEGGKGGYGDLVRVEYTELVEDAREKLLAGESIIEE-MDDGEKNGVELL 400
F +L+ + + DA + + E I++ + D + +E +
Sbjct: 460 FHNLTVLLEDKR------------------LSDAVFQSTSVEIIVDGFIVDSLEVDIEEV 519
Query: 401 NFGDTGDQEKREESEISKTTSVPCETSEEDEITEVPNVNGFDEDKLLSNILTAAENEYTP 460
N G QE EESE S S+ E+D E N G +++ AE +
Sbjct: 520 N---VGHQEPEEESENSGEISLEAVYEEDDNEVEQENEEGKVNLEIVDECDEQAEIKIAT 579
Query: 461 QMEVVENEEGGDLQMIESNTWKSESFVLEADKITESSNFNGFDE----DKLLSNILTAA- 520
EV E + + E E+ V+E + E ++ N +E +LL ++ +AA
Sbjct: 580 DTEVNGGERYSE-SLSEEGHGGQETDVVEGQEEYEENDQNNMEEAESDAQLLDDVQSAAI 639
Query: 521 -----ENEYTPQMEVVEKEEGGDLEMIESNTGESESFVLEADKITILEGITNSLSSFVED 580
E +E V++EEG + G S S EA T +E N VE+
Sbjct: 640 SSNQQEQTGVANVETVQEEEG-----VGEIAGGSLSVSEEA---TDVEHDGNE----VEE 699
Query: 581 LEKLKSELVGLMHAESESVL-----KAVLGLSVSSAMLTCLILSFQQKKKKDDTKVPAIS 640
E E+V A SE +L K ++ S +L + F KKK TK
Sbjct: 700 EESGFGEVVN--DAGSEDILLSGQKKVLVLFSTMMVILAAVAAGFLLAKKK--TK----P 759
Query: 641 VSVEPLLQDPVAKTEKVITRESPVIKATCDVNGSNNRLIRNVDAFKTLSASIHSRDEGKK 700
V ++ +P A + + PV LIR +S++ ++E ++
Sbjct: 760 VMLQHEDGEPTAISATKVVEHVPV-----------ENLIRE------RLSSLNFKEEEEE 819
Query: 701 FKEMYHNEAPSVQFLGEFVVGEISNSLKTKSGMKNWTVEVEDSNFPGSVEEKPVSKNMNS 736
+ E S F E N K ++ S G K+ +S
Sbjct: 820 VGDDRKREVSS--FPSEMSFSFSKNKPLHSCSNKKDDLKEHQSGGGG-------KKSNDS 843
BLAST of HG10002297 vs. TAIR 10
Match:
AT2G16270.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 16 plant structures; EXPRESSED DURING: 9 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G16630.1); Has 1844 Blast hits to 1256 proteins in 271 species: Archae - 6; Bacteria - 283; Metazoa - 434; Fungi - 153; Plants - 91; Viruses - 52; Other Eukaryotes - 825 (source: NCBI BLink). )
HSP 1 Score: 52.4 bits (124), Expect = 1.8e-06
Identity = 111/423 (26.24%), Positives = 173/423 (40.90%), Query Frame = 0
Query: 61 DPKFTENSLSEIPNVDPKSSSCQVDSVFETTLPYDPLTNYLSPRPRFLRYKPNKR----- 120
DP+F + +P P+ ++C+VD++ PYDP N+LSPRP+FL YKPN R
Sbjct: 176 DPRFRISPRPSVPYTSPEFAACEVDTLLP---PYDPKKNFLSPRPQFLHYKPNPRIEKRF 235
Query: 121 ------REIFLRTVGEDSLSVSHTSSSE-----------EEET----KMKGEEEEELEVE 180
E+F+ D +S S E EEET + + E +EE+ E
Sbjct: 236 DECKQLEELFISESSSDDTELSVEESEEQEKDGAEEVVVEEETEDVEQSEAESDEEMVCE 295
Query: 181 S-EGKSNEIDDEGEGDKEEIRGWTVKELLKFLLLVASLISSTLYITSMN---TPSPSFEV 240
S E ++++ + K + GW + L +LL+ A+ S L +S N P E
Sbjct: 296 SVEETTSQVPKQSGSRKFKFLGWFLALALGYLLVSATF--SPLMKSSFNEFHIPKEITEF 355
Query: 241 SRAFNSGSFPILNHTSEFESSPVVESIFANRSNLWDEEVSEAASMRNLE----------- 300
++A N T ESS V +R +EE S+ NL
Sbjct: 356 AKANNLDQLSDKLWTLT-ESSLVYMDKLISRLGRGNEEYSQ-LQFHNLTYTLEDSTVFKP 415
Query: 301 ---GVSQLNNQEDAEDRGFMEENEILKGENEGGKGGYGDLVRVEYTELVEDAREKLLAGE 360
+ Q QE++ +E+ + E E G ++V ++ EL E
Sbjct: 416 TCVEIIQEPLQENSRSENSLEDGSV--NEEESGAEENSEVV-CQFDELAE-------VKP 475
Query: 361 SIIEEMDDGEKNGVELLNFGDTGDQEKREESEISKTTSVPCETSEEDEITEVPNVNGFDE 420
S E +DGE+N L G + E+ ESE+S + E E+ +E +N D
Sbjct: 476 STDIESNDGERNLKALFEDGLELNIEELRESEMSPEEKLETEKKLEETESEAIYINQPDV 535
Query: 421 DKLLSNILTAAENEYTPQMEVVENEEG--GDLQMIESNTWKSESFVLEADKITESSNFNG 438
+ N+ E+E E G GDL +E ++ + + D ES + G
Sbjct: 536 EFAAINVHQHIESEILVAESGSEESFGEIGDLLHLEVGSYND---LAKGD--AESGSEEG 576
The following BLAST results are available for this feature:
Match Name | E-value | Identity (%) | Description
XP_038877902.1 | 1.6e-275 | 75.44 | uncharacterized protein LOC120070117 isoform X1 [Benincasa hispida]
XP_011651480.1 | 9.9e-273 | 69.79 | uncharacterized protein LOC105434901 [Cucumis sativus] >KGN58086.2 hypothetical ...
XP_038877910.1 | 3.0e-261 | 74.68 | uncharacterized protein LOC120070117 isoform X2 [Benincasa hispida]
XP_008447562.1 | 2.0e-257 | 70.24 | PREDICTED: uncharacterized protein LOC103489978 [Cucumis melo] >KAA0050853.1 unc...
XP_022153660.1 | 1.1e-170 | 54.32 | uncharacterized protein LOC111021113 [Momordica charantia]
Match Name | E-value | Identity (%) | Description
A0A5A7U4S8 | 9.8e-258 | 70.24 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3BHQ6 | 9.8e-258 | 70.24 | uncharacterized protein LOC103489978 OS=Cucumis melo OX=3656 GN=LOC103489978 PE=...
A0A0A0LAS7 | 2.3e-198 | 68.64 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_3G435540 PE=4 SV=1
A0A6J1DHF6 | 5.3e-171 | 54.32 | uncharacterized protein LOC111021113 OS=Momordica charantia OX=3673 GN=LOC111021...
A0A2N9GRA5 | 3.9e-49 | 31.72 | Uncharacterized protein OS=Fagus sylvatica OX=28930 GN=FSB_LOCUS32859 PE=4 SV=1
Match Name | E-value | Identity (%) | Description
AT1G16630.1 | 1.1e-06 | 25.35 | unknown protein; LOCATED IN: chloroplast; BEST Arabidopsis thaliana protein matc...
AT2G16270.1 | 1.8e-06 | 26.24 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
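
The identity values in these tables appear to be the matched-residue fraction reported on each HSP statistics line (for example, 415/764 gives 54.32%). The following is a minimal Python sketch of how such a line could be parsed and the percentages recomputed. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen for illustration only; the pattern accepts either spelling of the positives label and is not part of any BLAST tool.

```python
import re

# Hypothetical pattern for the HSP statistics line shown on this page.
# "Posi?tives" tolerates both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts plus recomputed and reported percentages, or None."""
    match = HSP_STATS.search(line)
    if match is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positive_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed on the page, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positive_pct": float(pos_pct),
    }


if __name__ == "__main__":
    line = "Identity = 415/764 (54.32%), Positives = 488/764 (63.87%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

Comparing the recomputed and reported percentages is a quick consistency check when HSP lines are scraped from a page rather than read from the original BLAST output.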