Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGGCCACTCCCTTGCAATTCCCTTTCTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCACTCGACCTCCTTCTCACCATTTTCCAATCCTCTTCTTCAAACCATAACCCTAACCCTAAAATCCCATCAAACTCATAAACCCCTTTCCATTGTTTCCGGTCACCCAAATCCTTCCCTTCTTTCGATCTCCCGCCAAATTTCGCATTTCTCATTCGCAAACGCCCGTCGGGACATTCGTACACACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGTAGGATAGAAGGCAACGCCGAGTTCCGACGGAAATTGAGGCATAATGCCCGCCGGAAAAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATACAGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAACTTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGATTTCAGCGAAGGATTGCAGCGGATGGACAAGAGCAAAGGGCTTCGGTATTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCCTGGCAGCAGTGGACTTTGATTGCAGAGGTTGTAGTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTAATGGGTCGACTTGGAAATAAGTCAAGAAAAAATATAACTCAATGTGCAGCTTGGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCCTCGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAGCAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGATATTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATAAATCCAAAGGCATTTGTGGATGATGTGGTGAATGCTTATGAGAAGCTAAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCACCCTGTGCCATTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAAGAGGAGTTGGGTTGTGATGCCCCGGACGACATCGAGAATCGAGGTGGTGAGGAAAATGTGATCACGGAGTGGATTGAGACTGATGATGACAATGAAGAAGAGTATGAGGATCAGCCTGAGGAGGATGTCGTAATGGAGACCGAGAACGAGGACGAGGATGAGGATGATAAACGAGAGGATGGAAATGAGGAAGAAGATGAGAATTATTGGGATGAAAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAACGCAGTGCAGAAGTGACTGATGAATTTTATGAGAAGGAGAAGGAGAAGGAGAAAGTGGGAAGTAGAAGGGCCACAGCCATGGAAGATGGGAGTGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAAAGTCAGATTCCTCCAGAGCTGTTTTTGAGATCTACAGTAAGGCCATTCACTTACAGAAACCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGTAATTGGGGTATGA
mRNA sequence
ATGGCCACTCCCTTGCAATTCCCTTTCTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCACTCGACCTCCTTCTCACCATTTTCCAATCCTCTTCTTCAAACCATAACCCTAACCCTAAAATCCCATCAAACTCATAAACCCCTTTCCATTGTTTCCGGTCACCCAAATCCTTCCCTTCTTTCGATCTCCCGCCAAATTTCGCATTTCTCATTCGCAAACGCCCGTCGGGACATTCGTACACACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGTAGGATAGAAGGCAACGCCGAGTTCCGACGGAAATTGAGGCATAATGCCCGCCGGAAAAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATACAGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAACTTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGATTTCAGCGAAGGATTGCAGCGGATGGACAAGAGCAAAGGGCTTCGGTATTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCCTGGCAGCAGTGGACTTTGATTGCAGAGGTTGTAGTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTAATGGGTCGACTTGGAAATAAGTCAAGAAAAAATATAACTCAATGTGCAGCTTGGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCCTCGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAGCAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGATATTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATAAATCCAAAGGCATTTGTGGATGATGTGGTGAATGCTTATGAGAAGCTAAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCACCCTGTGCCATTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAAGAGGAGTTGGGTTGTGATGCCCCGGACGACATCGAGAATCGAGGTGGTGAGGAAAATGTGATCACGGAGTGGATTGAGACTGATGATGACAATGAAGAAGAGTATGAGGATCAGCCTGAGGAGGATGTCGTAATGGAGACCGAGAACGAGGACGAGGATGAGGATGATAAACGAGAGGATGGAAATGAGGAAGAAGATGAGAATTATTGGGATGAAAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAACGCAGTGCAGAAGTGACTGATGAATTTTATGAGAAGGAGAAGGAGAAGGAGAAAGTGGGAAGTAGAAGGGCCACAGCCATGGAAGATGGGAGTGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAAAGTCAGATTCCTCCAGAGCTGTTTTTGAGATCTACAGTAAGGCCATTCACTTACAGAAACCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGTAATTGGGGTATGA
Coding sequence (CDS)
ATGGCCACTCCCTTGCAATTCCCTTTCTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCACTCGACCTCCTTCTCACCATTTTCCAATCCTCTTCTTCAAACCATAACCCTAACCCTAAAATCCCATCAAACTCATAAACCCCTTTCCATTGTTTCCGGTCACCCAAATCCTTCCCTTCTTTCGATCTCCCGCCAAATTTCGCATTTCTCATTCGCAAACGCCCGTCGGGACATTCGTACACACGCCGGCCGGAGCAAGAAGAAGGGTGGAGGGCCCTCTCCCGGTAGGATAGAAGGCAACGCCGAGTTCCGACGGAAATTGAGGCATAATGCCCGCCGGAAAAGCCAGAAGCTCGCCGAGTCCCATTTCTACCGCCGCAAGAAGTCGAACAGCAATACAGCGGATAACTTCAGTGAGGATGAGCTTCAGCAGATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAACTTACGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGAGTCGTCGTCGGCGAGCCTATTCGTGGGCGGTTCACGGATGAGCGAGTTACGATTATCAGCGAGGTTAAGGATCATGAGGAGTGGGAGAAGATCGAGCAATCAGAAATGGCTTCTGATTTCAGCGAAGGATTGCAGCGGATGGACAAGAGCAAAGGGCTTCGGTATTTTTGGGTGTTCGTGAGACACCCGCGGTGGAGGATTTCGGAGCTTCCCTGGCAGCAGTGGACTTTGATTGCAGAGGTTGTAGTTGAAGCTGGTAAAGAAAGGTTAGATAAATGGAGCTTAATGGGTCGACTTGGAAATAAGTCAAGAAAAAATATAACTCAATGTGCAGCTTGGATGAGACCTGATATCATATATGTGAAAAAGCCTGTTTACCAATGCAGATTTGAGCCCTCGGATGAGTTTTTCCAGGCAATAATGCCATTTCTTGATCCCAAAACAGAGCAAGATTTTCTGTTTGAGTTGCAGGATGATGAAGGAGATATTGAATGGGTGACTTATTTTGGTGGGTTGTGTAAGATTGTGAGGATAAATCCAAAGGCATTTGTGGATGATGTGGTGAATGCTTATGAGAAGCTAAGTGATGAGAAGAAATCCAAGTGTTTGGAGTTTCTTTTGACTAACCACCCTGTGCCATTGCTGCATCCATATACAAAAGAGTGGAAGGCTAAGTTGGAGGAAGAGGAGTTGGGTTGTGATGCCCCGGACGACATCGAGAATCGAGGTGGTGAGGAAAATGTGATCACGGAGTGGATTGAGACTGATGATGACAATGAAGAAGAGTATGAGGATCAGCCTGAGGAGGATGTCGTAATGGAGACCGAGAACGAGGACGAGGATGAGGATGATAAACGAGAGGATGGAAATGAGGAAGAAGATGAGAATTATTGGGATGAAAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAACTGGAGAAGCTGTTTAAACGCAGTGCAGAAGTGACTGATGAATTTTATGAGAAGGAGAAGGAGAAGGAGAAAGTGGGAAGTAGAAGGGCCACAGCCATGGAAGATGGGAGTGAAACAGAAATGAGAGGGAAGAGAGCAAAAGTGAGACCAGAAGAATGGGAGTATATTGGGTATGGGCCATGGAGGAAGAAGATAAAGAAAAGTCAGATTCCTCCAGAGCTGTTTTTGAGATCTACAGTAAGGCCATTCACTTACAGAAACCTTGTGAAGGAAATTGTATTGACAAGGCATGCTATTTTGGATGGTGTAATTGGGGTATGA
Protein sequence
MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVIGV
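The protein sequence above is the conceptual translation of the coding sequence. As a minimal sketch of how that relationship can be checked, the snippet below translates DNA with the standard genetic code. The function name `translate` and the two short test fragments (the first 21 and last 18 bases of the CDS listed above) are chosen here for illustration; the use of the standard code for this nuclear gene is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        # Unknown or ambiguous codons are reported as "X".
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Illustrative fragments copied from the start and end of the CDS above.
cds_start = "ATGGCCACTCCCTTGCAATTC"
cds_end = "GGTGTAATTGGGGTATGA"
print(translate(cds_start))  # MATPLQF
print(translate(cds_end))    # GVIGV
```

Both fragments agree with the first and last residues of the listed protein sequence. The same function can be applied to the full CDS string.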
Homology
BLAST of HG10013707 vs. NCBI nr
Match:
XP_038898752.1 (uncharacterized protein LOC120086270 [Benincasa hispida])
HSP 1 Score: 1065.1 bits (2753), Expect = 2.3e-307
Identity = 551/605 (91.07%), Positives = 565/605 (93.39%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
MAT QFP SKTLN SS FLHSTS SPF +PLLQ TLTLKSHQTHKPLSI SG PNPS
Sbjct: 1 MATS-QFPLSKTLNLSSSFLHSTSLSPFFHPLLQ--TLTLKSHQTHKPLSIRSGPPNPSF 60
Query: 61 LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
L ISRQISH FAN+ R+IRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA
Sbjct: 61 LPISRQISHLQFANSHRNIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
ESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS
Sbjct: 121 ESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFS+GL RMDKSKG RYFW
Sbjct: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSQGLLRMDKSKGFRYFW 240
Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPD 300
VFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPD
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPD 300
Query: 301 IIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDE-GDIEWVTYFGGLCKIV 360
IIYVKKPVYQCRFEP DEFFQAIMPFLDPKTEQDFLFELQDDE GD+EWVTYF GLCKIV
Sbjct: 301 IIYVKKPVYQCRFEPQDEFFQAIMPFLDPKTEQDFLFELQDDEGGDVEWVTYFAGLCKIV 360
Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
R+NPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
Query: 421 DDIENRGGEENVITEWIETDDDNEEEY-EDQPEEDVVMET--ENEDEDEDDKREDGN--E 480
DDIE R G+ENVITEWIETDDDN E+Y EDQPEE+VVMET E+EDEDEDDKREDGN E
Sbjct: 421 DDIEKRCGDENVITEWIETDDDNGEDYEEDQPEENVVMETEDEDEDEDEDDKREDGNQEE 480
Query: 481 EEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETE 540
EEDE YWDERFRKAISSPEELEKLFK SAEV DEFY EKEKE VGSRRATAMEDG ETE
Sbjct: 481 EEDEGYWDERFRKAISSPEELEKLFKHSAEVADEFY--EKEKESVGSRRATAMEDGDETE 540
Query: 541 MRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL 600
+RGKRAKV+ EEWEYIGYGPWRKKIKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAIL
Sbjct: 541 LRGKRAKVKAEEWEYIGYGPWRKKIKKSKIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL 600
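The identity and positives percentages reported for each HSP are simple ratios over the alignment length (for the hit above, 551/605 and 565/605). The sketch below parses one such statistics line and recomputes the percentages. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, and the regular expression assumes the line layout used on this page.

```python
import re

# Matches the per-HSP statistics line. "Posi?tives" also tolerates the
# "Postives" spelling that appears in some exports of this report.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, length, _, pos, pos_length, _ = match.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "alignment_length": int(length),
        "identity_pct": round(100 * int(ident) / int(length), 2),
        "positive_pct": round(100 * int(pos) / int(pos_length), 2),
    }


stats = parse_hsp_stats(
    "Identity = 551/605 (91.07%), Positives = 565/605 (93.39%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])  # 91.07 93.39
```

Note that the denominator is the alignment length including gaps (605 here), not the length of the query protein.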
BLAST of HG10013707 vs. NCBI nr
Match:
XP_008463741.1 (PREDICTED: uncharacterized protein LOC103501814 [Cucumis melo] >KAA0066766.1 uncharacterized protein E6C27_scaffold271G001050 [Cucumis melo var. makuwa] >TYK27913.1 uncharacterized protein E5676_scaffold384G00980 [Cucumis melo var. makuwa])
HSP 1 Score: 1005.4 bits (2598), Expect = 2.1e-289
Identity = 522/607 (86.00%), Positives = 551/607 (90.77%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNP 60
MAT QFP KTLNPSSPFL+STS +PFSNPLLQ TLTLKSHQTH KPLSI+SG NP
Sbjct: 1 MATS-QFPSPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKSHQTHYYKPLSILSGPSNP 60
Query: 61 SLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
QIS ++R DIRTHAGRSKK GGPSPGRIEGNAEFRRKLRHNARRKSQK
Sbjct: 61 ------YQISLLPSPHSRPDIRTHAGRSKKNPGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
Query: 121 LAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGP 180
LAESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGP
Sbjct: 121 LAESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFIEKDDPNLRHPYDWYKYGEFGP 180
Query: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRY 240
YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RY
Sbjct: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRY 240
Query: 241 FWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
FWVFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMR
Sbjct: 241 FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
Query: 301 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 360
PDIIYVKKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKI
Sbjct: 301 PDIIYVKKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKI 360
Query: 361 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 420
VRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDA
Sbjct: 361 VRISPKAFVDDVVNAYEKLSDEKKSICLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDA 420
Query: 421 PDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVME--TENEDEDEDDKREDGN--- 480
PD++ENR ++NVITEWIET DNEEEYEDQPEED+VME E++D+++DD+RE+GN
Sbjct: 421 PDEMENRRRDDNVITEWIET--DNEEEYEDQPEEDIVMEDMDEDKDDEDDDEREEGNQEE 480
Query: 481 EEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSET 540
EEEDE+YWDERFRKAISSPEELEKLFKRS E+ DE Y EKE VG RRATAM+DG E
Sbjct: 481 EEEDESYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEM 540
Query: 541 EMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 600
EMRGKR KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI
Sbjct: 541 EMRGKRPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 592
BLAST of HG10013707 vs. NCBI nr
Match:
XP_022976454.1 (uncharacterized protein LOC111476853 [Cucurbita maxima])
HSP 1 Score: 1001.1 bits (2587), Expect = 4.0e-288
Identity = 514/604 (85.10%), Positives = 549/604 (90.89%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
MAT QFP KTLNPSSPFLHSTS +PFSNPLLQT+TLTLKSH+T KPLSI+SG PN S+
Sbjct: 1 MATS-QFPLCKTLNPSSPFLHSTSLTPFSNPLLQTLTLTLKSHKTRKPLSIISGLPNASV 60
Query: 61 LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
L I RQIS F FAN+R DIRT AGRSKKKGGG SPGRIEGNAEFRRKLR+N RRKSQK A
Sbjct: 61 LPIFRQISQFPFANSRPDIRTFAGRSKKKGGGTSPGRIEGNAEFRRKLRNNVRRKSQKPA 120
Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
ESHFYRRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYS
Sbjct: 121 ESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYS 180
Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
WRGVV+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD++KG R+FW
Sbjct: 181 WRGVVIGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRNKGFRHFW 240
Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
VFVRHPRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRP
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
Query: 301 DIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIV 360
DIIYVKKPVYQCRFEP EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+
Sbjct: 301 DIIYVKKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKIL 360
Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
Query: 421 DDIE---NRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEE 480
DD + NR +ENVI EWIETDDDN+ +YED+ EDVVMET E EDE+D E NEEE
Sbjct: 421 DDDDDNKNRPSDENVIMEWIETDDDNDHDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEE 480
Query: 481 DENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMR 540
DE+YWDERFRKAISSPEELEKL KRS E +DEFYEK+K + +GSR+A +DG ETE+R
Sbjct: 481 DEDYWDERFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NMGSRKAME-DDGDETELR 540
Query: 541 GKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG 600
GKRAKV+PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Sbjct: 541 GKRAKVKPEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEG 600
BLAST of HG10013707 vs. NCBI nr
Match:
XP_004146025.1 (uncharacterized protein LOC101207599 [Cucumis sativus] >KGN55067.1 hypothetical protein Csa_012426 [Cucumis sativus])
HSP 1 Score: 993.8 bits (2568), Expect = 6.4e-286
Identity = 514/602 (85.38%), Positives = 545/602 (90.53%), Query Frame = 0
Query: 7 FPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNPSLLSIS 66
FP KTLNPSSPFL+STS +PFSNPLLQ TLTLK H TH KPLSI+SG +S
Sbjct: 6 FPPPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKPHHTHYYKPLSIISG------ISYP 65
Query: 67 RQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHF 126
QIS FS R DIRTHAGRSKKK GGPSPGRIEGNA+FRRKLR NARRK+QKLAESHF
Sbjct: 66 YQISLFS----RPDIRTHAGRSKKKPGGPSPGRIEGNADFRRKLRDNARRKTQKLAESHF 125
Query: 127 YRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 186
YRRKKSN N ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV
Sbjct: 126 YRRKKSNRNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 185
Query: 187 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVR 246
VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RYFWVFVR
Sbjct: 186 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVR 245
Query: 247 HPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 306
HPRWRISELPWQQWTLIAEVV+E+GKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Sbjct: 246 HPRWRISELPWQQWTLIAEVVLESGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 305
Query: 307 KKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPK 366
KKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKIVRINPK
Sbjct: 306 KKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPK 365
Query: 367 AFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIEN 426
AF+DDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDAPD++EN
Sbjct: 366 AFIDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDAPDEMEN 425
Query: 427 RGGEENVITEWIETDDDNEEEYEDQPEEDVVM----ETENEDEDEDDKREDGN--EEEDE 486
R ++NVITEWIET DNEEEYE+QP+ED+VM E E+EDE++DD++E+GN EEEDE
Sbjct: 426 RRRDDNVITEWIET--DNEEEYEEQPKEDIVMEDMDEDEDEDEEDDDEQEEGNQEEEEDE 485
Query: 487 NYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGK 546
YWDERFRKAISSPEELEKLFKRS E+ DE Y EKE VG RRATAM+DG E EMRGK
Sbjct: 486 GYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEVEMRGK 545
Query: 547 RAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVI 601
+ KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG I
Sbjct: 546 KPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI 589
BLAST of HG10013707 vs. NCBI nr
Match:
XP_023535253.1 (uncharacterized protein LOC111796741 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 993.4 bits (2567), Expect = 8.4e-286
Identity = 514/602 (85.38%), Positives = 546/602 (90.70%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
MAT QFP KTLNPSSPFL STS +PFSNPLLQ TLTLKSH+T KPL+I+SG PN S+
Sbjct: 1 MATS-QFPLCKTLNPSSPFLPSTSLTPFSNPLLQ--TLTLKSHKTRKPLTIISGLPNASV 60
Query: 61 LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
L I RQIS F FAN+R DIRT AGRSKKKGGGPSPGRIEGNAEFRRKLR+N RRKSQK A
Sbjct: 61 LPIFRQISQFPFANSRPDIRTCAGRSKKKGGGPSPGRIEGNAEFRRKLRNNVRRKSQKPA 120
Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
ESHFYRRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYS
Sbjct: 121 ESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYS 180
Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
WRGVV+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD+SKG ++FW
Sbjct: 181 WRGVVIGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFQHFW 240
Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
VFVRHPRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRP
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
Query: 301 DIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIV 360
DIIYVKKPVYQCRFEP EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+
Sbjct: 301 DIIYVKKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKIL 360
Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
Query: 421 -DDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEEDE 480
DD ENR +ENVI EWIETDDDN+ +YED+ EDVVMET E EDE+D E NEEEDE
Sbjct: 421 DDDSENRPSDENVIMEWIETDDDNDHDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEEDE 480
Query: 481 NYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGK 540
+YWDERFRKAISSPEELEKL KRS E +DEFYEK+K + GSR+A EDG ETE+RGK
Sbjct: 481 DYWDERFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NAGSRKAME-EDGDETELRGK 540
Query: 541 RAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVI 600
RAKV+PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G I
Sbjct: 541 RAKVKPEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEGEI 596
BLAST of HG10013707 vs. ExPASy TrEMBL
Match:
A0A5A7VK56 (Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold384G00980 PE=4 SV=1)
HSP 1 Score: 1005.4 bits (2598), Expect = 1.0e-289
Identity = 522/607 (86.00%), Positives = 551/607 (90.77%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNP 60
MAT QFP KTLNPSSPFL+STS +PFSNPLLQ TLTLKSHQTH KPLSI+SG NP
Sbjct: 1 MATS-QFPSPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKSHQTHYYKPLSILSGPSNP 60
Query: 61 SLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
QIS ++R DIRTHAGRSKK GGPSPGRIEGNAEFRRKLRHNARRKSQK
Sbjct: 61 ------YQISLLPSPHSRPDIRTHAGRSKKNPGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
Query: 121 LAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGP 180
LAESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGP
Sbjct: 121 LAESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFIEKDDPNLRHPYDWYKYGEFGP 180
Query: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRY 240
YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RY
Sbjct: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRY 240
Query: 241 FWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
FWVFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMR
Sbjct: 241 FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
Query: 301 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 360
PDIIYVKKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKI
Sbjct: 301 PDIIYVKKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKI 360
Query: 361 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 420
VRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDA
Sbjct: 361 VRISPKAFVDDVVNAYEKLSDEKKSICLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDA 420
Query: 421 PDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVME--TENEDEDEDDKREDGN--- 480
PD++ENR ++NVITEWIET DNEEEYEDQPEED+VME E++D+++DD+RE+GN
Sbjct: 421 PDEMENRRRDDNVITEWIET--DNEEEYEDQPEEDIVMEDMDEDKDDEDDDEREEGNQEE 480
Query: 481 EEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSET 540
EEEDE+YWDERFRKAISSPEELEKLFKRS E+ DE Y EKE VG RRATAM+DG E
Sbjct: 481 EEEDESYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEM 540
Query: 541 EMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 600
EMRGKR KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI
Sbjct: 541 EMRGKRPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 592
BLAST of HG10013707 vs. ExPASy TrEMBL
Match:
A0A1S3CKF2 (uncharacterized protein LOC103501814 OS=Cucumis melo OX=3656 GN=LOC103501814 PE=4 SV=1)
HSP 1 Score: 1005.4 bits (2598), Expect = 1.0e-289
Identity = 522/607 (86.00%), Positives = 551/607 (90.77%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNP 60
MAT QFP KTLNPSSPFL+STS +PFSNPLLQ TLTLKSHQTH KPLSI+SG NP
Sbjct: 1 MATS-QFPSPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKSHQTHYYKPLSILSGPSNP 60
Query: 61 SLLSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
QIS ++R DIRTHAGRSKK GGPSPGRIEGNAEFRRKLRHNARRKSQK
Sbjct: 61 ------YQISLLPSPHSRPDIRTHAGRSKKNPGGPSPGRIEGNAEFRRKLRHNARRKSQK 120
Query: 121 LAESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGP 180
LAESHFYRRKK NSN ADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGP
Sbjct: 121 LAESHFYRRKKPNSNYADNFSEDELQQIGLGYDRMVRFIEKDDPNLRHPYDWYKYGEFGP 180
Query: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRY 240
YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RY
Sbjct: 181 YSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRY 240
Query: 241 FWVFVRHPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
FWVFVRHPRWRISELPWQQWTLIAEVV+EAGKERLDKWSLMGRLGNKSRKNITQCAAWMR
Sbjct: 241 FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKERLDKWSLMGRLGNKSRKNITQCAAWMR 300
Query: 301 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 360
PDIIYVKKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKI
Sbjct: 301 PDIIYVKKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKI 360
Query: 361 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 420
VRI+PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDA
Sbjct: 361 VRISPKAFVDDVVNAYEKLSDEKKSICLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDA 420
Query: 421 PDDIENRGGEENVITEWIETDDDNEEEYEDQPEEDVVME--TENEDEDEDDKREDGN--- 480
PD++ENR ++NVITEWIET DNEEEYEDQPEED+VME E++D+++DD+RE+GN
Sbjct: 421 PDEMENRRRDDNVITEWIET--DNEEEYEDQPEEDIVMEDMDEDKDDEDDDEREEGNQEE 480
Query: 481 EEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSET 540
EEEDE+YWDERFRKAISSPEELEKLFKRS E+ DE Y EKE VG RRATAM+DG E
Sbjct: 481 EEEDESYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEM 540
Query: 541 EMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 600
EMRGKR KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI
Sbjct: 541 EMRGKRPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAI 592
BLAST of HG10013707 vs. ExPASy TrEMBL
Match:
A0A6J1INI9 (uncharacterized protein LOC111476853 OS=Cucurbita maxima OX=3661 GN=LOC111476853 PE=4 SV=1)
HSP 1 Score: 1001.1 bits (2587), Expect = 1.9e-288
Identity = 514/604 (85.10%), Positives = 549/604 (90.89%), Query Frame = 0
Query: 1 MATPLQFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSL 60
MAT QFP KTLNPSSPFLHSTS +PFSNPLLQT+TLTLKSH+T KPLSI+SG PN S+
Sbjct: 1 MATS-QFPLCKTLNPSSPFLHSTSLTPFSNPLLQTLTLTLKSHKTRKPLSIISGLPNASV 60
Query: 61 LSISRQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLA 120
L I RQIS F FAN+R DIRT AGRSKKKGGG SPGRIEGNAEFRRKLR+N RRKSQK A
Sbjct: 61 LPIFRQISQFPFANSRPDIRTFAGRSKKKGGGTSPGRIEGNAEFRRKLRNNVRRKSQKPA 120
Query: 121 ESHFYRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYS 180
ESHFYRRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYS
Sbjct: 121 ESHFYRRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYS 180
Query: 181 WRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFW 240
WRGVV+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD++KG R+FW
Sbjct: 181 WRGVVIGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRNKGFRHFW 240
Query: 241 VFVRHPRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
VFVRHPRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRP
Sbjct: 241 VFVRHPRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRP 300
Query: 301 DIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIV 360
DIIYVKKPVYQCRFEP EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+
Sbjct: 301 DIIYVKKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKIL 360
Query: 361 RINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
R+NPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP
Sbjct: 361 RVNPKAFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP 420
Query: 421 DDIE---NRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEE 480
DD + NR +ENVI EWIETDDDN+ +YED+ EDVVMET E EDE+D E NEEE
Sbjct: 421 DDDDDNKNRPSDENVIMEWIETDDDNDHDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEE 480
Query: 481 DENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMR 540
DE+YWDERFRKAISSPEELEKL KRS E +DEFYEK+K + +GSR+A +DG ETE+R
Sbjct: 481 DEDYWDERFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NMGSRKAME-DDGDETELR 540
Query: 541 GKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG 600
GKRAKV+PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Sbjct: 541 GKRAKVKPEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEG 600
BLAST of HG10013707 vs. ExPASy TrEMBL
Match:
A0A0A0L3A4 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G627170 PE=4 SV=1)
HSP 1 Score: 993.8 bits (2568), Expect = 3.1e-286
Identity = 514/602 (85.38%), Positives = 545/602 (90.53%), Query Frame = 0
Query: 7 FPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTH--KPLSIVSGHPNPSLLSIS 66
FP KTLNPSSPFL+STS +PFSNPLLQ TLTLK H TH KPLSI+SG +S
Sbjct: 6 FPPPKTLNPSSPFLNSTSLTPFSNPLLQ--TLTLKPHHTHYYKPLSIISG------ISYP 65
Query: 67 RQISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHF 126
QIS FS R DIRTHAGRSKKK GGPSPGRIEGNA+FRRKLR NARRK+QKLAESHF
Sbjct: 66 YQISLFS----RPDIRTHAGRSKKKPGGPSPGRIEGNADFRRKLRDNARRKTQKLAESHF 125
Query: 127 YRRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 186
YRRKKSN N ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV
Sbjct: 126 YRRKKSNRNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGV 185
Query: 187 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVR 246
VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMA+DFS GLQRMDKSKG RYFWVFVR
Sbjct: 186 VVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMAADFSTGLQRMDKSKGFRYFWVFVR 245
Query: 247 HPRWRISELPWQQWTLIAEVVVEAGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 306
HPRWRISELPWQQWTLIAEVV+E+GKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Sbjct: 246 HPRWRISELPWQQWTLIAEVVLESGKERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 305
Query: 307 KKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPK 366
KKPVYQCRFEP DEFFQA+MPFLDPKTEQDFLFELQDDEG++EWVTYFGGLCKIVRINPK
Sbjct: 306 KKPVYQCRFEPQDEFFQAMMPFLDPKTEQDFLFELQDDEGNVEWVTYFGGLCKIVRINPK 365
Query: 367 AFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIEN 426
AF+DDVVNAYEKLSDEKKSKCLEFLL+NHPVPLLHPYTKEWKAKLEEEELGCDAPD++EN
Sbjct: 366 AFIDDVVNAYEKLSDEKKSKCLEFLLSNHPVPLLHPYTKEWKAKLEEEELGCDAPDEMEN 425
Query: 427 RGGEENVITEWIETDDDNEEEYEDQPEEDVVM----ETENEDEDEDDKREDGN--EEEDE 486
R ++NVITEWIET DNEEEYE+QP+ED+VM E E+EDE++DD++E+GN EEEDE
Sbjct: 426 RRRDDNVITEWIET--DNEEEYEEQPKEDIVMEDMDEDEDEDEEDDDEQEEGNQEEEEDE 485
Query: 487 NYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGK 546
YWDERFRKAISSPEELEKLFKRS E+ DE Y EKE VG RRATAM+DG E EMRGK
Sbjct: 486 GYWDERFRKAISSPEELEKLFKRSGEMADELY----EKENVGRRRATAMKDGDEVEMRGK 545
Query: 547 RAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVI 601
+ KV+ EEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG I
Sbjct: 546 KPKVKAEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI 589
BLAST of HG10013707 vs. ExPASy TrEMBL
Match:
A0A6J1FAH0 (uncharacterized protein LOC111443567 OS=Cucurbita moschata OX=3662 GN=LOC111443567 PE=4 SV=1)
HSP 1 Score: 991.1 bits (2561), Expect = 2.0e-285
Identity = 511/597 (85.59%), Positives = 543/597 (90.95%), Query Frame = 0
Query: 6 QFPFSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSLLSISR 65
QFP KTLNPSSPFL STS +PFSNPLLQ TLTLKSHQT KPLSI+SG PN S+L I R
Sbjct: 5 QFPLCKTLNPSSPFLPSTSLTPFSNPLLQ--TLTLKSHQTRKPLSIISGLPNASVLPIFR 64
Query: 66 QISHFSFANARRDIRTHAGRSKKKGGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHFY 125
QIS F FAN+R DIRT AGRSKKKGGGPSPGRIEGNAEFRRKLR+N RRKSQK AESHFY
Sbjct: 65 QISQFPFANSRPDIRTFAGRSKKKGGGPSPGRIEGNAEFRRKLRNNVRRKSQKPAESHFY 124
Query: 126 RRKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVV 185
RRK SNSN ADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV
Sbjct: 125 RRKNSNSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPHDWYKYGEFGPYSWRGVV 184
Query: 186 VGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYFWVFVRH 245
+GEPIRGRFTDERVT+I EVKDHEEWEKIEQSEMASDFSEGLQRMD+SKG R+FWVFVRH
Sbjct: 185 IGEPIRGRFTDERVTMIREVKDHEEWEKIEQSEMASDFSEGLQRMDRSKGFRHFWVFVRH 244
Query: 246 PRWRISELPWQQWTLIAEVVVEAGK-ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 305
PRWRISELPWQQWTLIAEVV+EAGK ERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV
Sbjct: 245 PRWRISELPWQQWTLIAEVVLEAGKEERLDKWSLMGRLGNKSRKNITQCAAWMRPDIIYV 304
Query: 306 KKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKIVRINPK 365
KKPVYQCRFEP EFFQA+MPFLDPKTEQD LFELQDDEG++EWVTYFGGLCKI+R+NPK
Sbjct: 305 KKPVYQCRFEPQAEFFQALMPFLDPKTEQDVLFELQDDEGNVEWVTYFGGLCKILRVNPK 364
Query: 366 AFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP-DDIE 425
AFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAP DD E
Sbjct: 365 AFVDDVANAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDDNE 424
Query: 426 NRGGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDEDEDDKREDGNEEEDENYWDE 485
NR +ENV+ EWIET DDN+++YED+ EDVVMET E EDE+D E NEEEDE+YWDE
Sbjct: 425 NRHSDENVVMEWIET-DDNDDDYEDE-AEDVVMETNEEAEDEEDGGEHQNEEEDEDYWDE 484
Query: 486 RFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVGSRRATAMEDGSETEMRGKRAKVR 545
RFRKAISSPEELEKL KRS E +DEFYEK+K + GSR+A +DG ETE+RGKRAKV+
Sbjct: 485 RFRKAISSPEELEKLLKRSEEASDEFYEKQKGR-NAGSRKAME-DDGDETELRGKRAKVK 544
Query: 546 PEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGVIGV 601
PEEWE IGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G IGV
Sbjct: 545 PEEWERIGYGPWRKKIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILEGEIGV 595
BLAST of HG10013707 vs. TAIR 10
Match:
AT3G14900.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: embryo development; LOCATED IN: chloroplast; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 13 growth stages; Has 17135 Blast hits to 10204 proteins in 644 species: Archae - 47; Bacteria - 1684; Metazoa - 5536; Fungi - 2506; Plants - 1043; Viruses - 361; Other Eukaryotes - 5958 (source: NCBI BLink). )
HSP 1 Score: 640.6 bits (1651), Expect = 1.3e-183
Identity = 347/619 (56.06%), Positives = 450/619 (72.70%), Query Frame = 0
Query: 9 FSKTLNPSSPFLHSTSFSPFSNPLLQTITLTLKSHQTHKPLSIVSGHPNPSLLSISRQIS 68
FSKTLNPS F SP ++ + + +++ + + S+ + R+
Sbjct: 8 FSKTLNPSFSFRK----SPLNSGVRRIVSVLPAITERNYAFSVKRSE------LLLREDG 67
Query: 69 HFSFANARRDIRTHAGRSKKK-GGGPSPGRIEGNAEFRRKLRHNARRKSQKLAESHFYR- 128
F RRD+R AGRSKKK GGG S GRIEG+++ R++++ NAR KS+KLAES FYR
Sbjct: 68 GF-----RRDVRALAGRSKKKLGGGSSGGRIEGDSDMRKQVKRNAREKSKKLAESLFYRL 127
Query: 129 -------RKKSNSNTADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPY 188
R + S+ D F+E+EL+ IGLGYDRMVRFM+KDDP LRHPYDW+KYGEFGPY
Sbjct: 128 YNNPDKSRSQILSSHPDKFTEEELEMIGLGYDRMVRFMDKDDPRLRHPYDWFKYGEFGPY 187
Query: 189 SWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEKIEQSEMASDFSEGLQRMDKSKGLRYF 248
SWRGVVVG+P+RG +DE VT+I EV++HEE+EKIEQ EM F + ++ +D + GLRYF
Sbjct: 188 SWRGVVVGDPVRGTISDECVTMIGEVENHEEFEKIEQHEMNIAFQKRVKELDSNVGLRYF 247
Query: 249 WVFVRHPRWRISELPWQQWTLIAEVVVEAG-KERLDKWSLMGRLGNKSRKNITQCAAWMR 308
WVFVRHP+WR+SELPW+QWTL++EVVVEA K+RLDKW+LMGRLGNKSR I QCAAW R
Sbjct: 248 WVFVRHPKWRLSELPWEQWTLVSEVVVEADKKQRLDKWNLMGRLGNKSRSLICQCAAWFR 307
Query: 309 PDIIYVKKPVYQCRFEPSDEFFQAIMPFLDPKTEQDFLFELQDDEGDIEWVTYFGGLCKI 368
PDI+YVKKPV+QCRFEP ++FF +++P+L+P TE F+ E++DDEG +E TY+GGLCK+
Sbjct: 308 PDIVYVKKPVFQCRFEPQEDFFNSLIPYLNPVTESGFVCEVEDDEGRVELSTYYGGLCKM 367
Query: 369 VRINPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDA 428
+++ AFVDDVVNAYEKLSDEKKS+ L+FLL NHP LLHPYTKEWKAKLEE ELGCDA
Sbjct: 368 LKVRQTAFVDDVVNAYEKLSDEKKSRVLKFLLGNHPNELLHPYTKEWKAKLEEMELGCDA 427
Query: 429 PDDIENR-----GGEENVITEWIETDDDNEEEYEDQPEEDVVMETENEDE---------- 488
PD+ E+ E+ +EWIE + DN+++ +D ++D +E ++D+
Sbjct: 428 PDEDEDEISISGSSEKAEFSEWIEDEADNDDDDDDDDDDDGEVEEVDDDDNMVVDVEGNV 487
Query: 489 DED---DKREDGNEEEDENYWDERFRKAISSPEELEKLFKRSAEVTDEFYEKEKEKEKVG 548
+ED D+ E+ + EEDE YW+E+F KA ++ E +EKL + S V+D+FYEK+ K
Sbjct: 488 EEDSLEDEIEESDPEEDERYWEEQFNKATNNAERMEKLAEMSMVVSDKFYEKQL---KAL 547
Query: 549 SRRATAMEDGSETEMRGKRAKVRPEEWEYIGYGPWRKKIKKSQIPPELFLRSTVRPFTYR 600
R +G E EMRGK+AKV+PEEW+ +GYG W KKIKKS+IPPELFLR+ VRPF YR
Sbjct: 548 EEREKGEIEGDELEMRGKKAKVKPEEWKTVGYGRWMKKIKKSRIPPELFLRAAVRPFVYR 607
The following BLAST results are available for this feature:
BLAST of HG10013707 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_038898752.1 | 2.3e-307 | 91.07 | uncharacterized protein LOC120086270 [Benincasa hispida]
XP_008463741.1 | 2.1e-289 | 86.00 | PREDICTED: uncharacterized protein LOC103501814 [Cucumis melo] >KAA0066766.1 unc...
XP_022976454.1 | 4.0e-288 | 85.10 | uncharacterized protein LOC111476853 [Cucurbita maxima]
XP_004146025.1 | 6.4e-286 | 85.38 | uncharacterized protein LOC101207599 [Cucumis sativus] >KGN55067.1 hypothetical ...
XP_023535253.1 | 8.4e-286 | 85.38 | uncharacterized protein LOC111796741 [Cucurbita pepo subsp. pepo]
BLAST of HG10013707 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7VK56 | 1.0e-289 | 86.00 | Uncharacterized protein OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold...
A0A1S3CKF2 | 1.0e-289 | 86.00 | uncharacterized protein LOC103501814 OS=Cucumis melo OX=3656 GN=LOC103501814 PE=...
A0A6J1INI9 | 1.9e-288 | 85.10 | uncharacterized protein LOC111476853 OS=Cucurbita maxima OX=3661 GN=LOC111476853...
A0A0A0L3A4 | 3.1e-286 | 85.38 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G627170 PE=4 SV=1
A0A6J1FAH0 | 2.0e-285 | 85.59 | uncharacterized protein LOC111443567 OS=Cucurbita moschata OX=3662 GN=LOC1114435...
BLAST of HG10013707 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT3G14900.1 | 1.3e-183 | 56.06 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: embryo d...
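The pipe-delimited summary rows can be loaded for sorting or filtering. The following is a minimal sketch, assuming each row carries the match name, E-value, percent identity and description in its first four fields; the function name `parse_summary_row` and the dictionary keys are hypothetical. Trailing fields, if any, are ignored.

```python
def parse_summary_row(row: str) -> dict:
    """Split one pipe-delimited summary row into typed fields."""
    fields = [field.strip() for field in row.split("|")]
    if len(fields) < 4:
        raise ValueError(f"expected at least 4 fields, got {len(fields)}")
    name, evalue, identity, description = fields[:4]
    return {
        "match": name,
        "evalue": float(evalue),
        "identity_pct": float(identity),
        "description": description,
    }


# Rows copied from the summary tables above.
rows = [
    "XP_038898752.1 | 2.3e-307 | 91.07 | uncharacterized protein LOC120086270 [Benincasa hispida]",
    "A0A0A0L3A4 | 3.1e-286 | 85.38 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G627170 PE=4 SV=1",
    "AT3G14900.1 | 1.3e-183 | 56.06 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: embryo d...",
]
hits = [parse_summary_row(row) for row in rows]
best = min(hits, key=lambda hit: hit["evalue"])
print(best["match"])  # XP_038898752.1
```

E-values are only comparable within a single database search, so ranking hits from different databases this way is for illustration only.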