Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGTTGCAAAAATCTCACAGCCAAAGCCAGATGGAGAACCAAAGAAACAGGTTAGAAGGAGGCGTCAAAGCCGGCGGCTTTACAAGGAAACGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACTAAAGAAGCAAGAGAACAGCAACAAAAACAAGACCAAGAAATTAAACAATCAGTTCCTCTGTTTCCTCAATTATGCCCATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGGAGAAATCCCAGGATATACCCAGGTTACTCCTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGTTGCACAGAATCTCAATGCAGAAATCCCTATACAAATCTTTGATGATGATTTCAAAACTTCATTCTGGCCCCCATCTTCATATATTTGTCTCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATTTCTTGTTTTGGTCCAATAATGATCCAACTAGAGAGAGTGAAAAAGATATGCAGCAAGGGGCAGTGGAGGAAGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGTTTGATGTTCACCATAGTTCTGATAATGCTATGGAATTTCCAGAGTGGTTGAGCATCAATAATGATATTTTGCAGCAGCATTCGAATTATAAATGCGTAGAGGAGGATTATCTTCAATATCCTGACCTATCCTGGTATGGAATTAACTTTTCTAAATATTATTGAAATTCAACTCCAAAAATAAAGAAAATAGACATTTAGGAGCTAGCTTTTTTTTTTTTTTTTTTAAGTTCAAAAATACATATTTTGTATTTTGCAGCTTCGACTTTGGGAAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA
mRNA sequence
ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGTTGCAAAAATCTCACAGCCAAAGCCAGATGGAGAACCAAAGAAACAGGTTAGAAGGAGGCGTCAAAGCCGGCGGCTTTACAAGGAAACGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACTAAAGAAGCAAGAGAACAGCAACAAAAACAAGACCAAGAAATTAAACAATCAGTTCCTCTGTTTCCTCAATTATGCCCATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGGAGAAATCCCAGGATATACCCAGGTTACTCCTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGTTGCACAGAATCTCAATGCAGAAATCCCTATACAAATCTTTGATGATGATTTCAAAACTTCATTCTGGCCCCCATCTTCATATATTTGTCTCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATTTCTTGTTTTGGTCCAATAATGATCCAACTAGAGAGAGTGAAAAAGATATGCAGCAAGGGGCAGTGGAGGAAGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGTTTGATGTTCACCATAGTTCTGATAATGCTATGGAATTTCCAGAGTGGTTGAGCATCAATAATGATATTTTGCAGCAGCATTCGAATTATAAATGCGTAGAGGAGGATTATCTTCAATATCCTGACCTATCCTGCTTCGACTTTGGGAAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA
Coding sequence (CDS)
ATGAACTCTACAGACCAACTCTGCAACTTTGAAGCTGTTGCAAAAATCTCACAGCCAAAGCCAGATGGAGAACCAAAGAAACAGGTTAGAAGGAGGCGTCAAAGCCGGCGGCTTTACAAGGAAACGCCTCTGGATATGGCTGAGGCTAGAAGAGAGATTGTAACTGCACTTAAACTCCACAGAGCATCAACTAAAGAAGCAAGAGAACAGCAACAAAAACAAGACCAAGAAATTAAACAATCAGTTCCTCTGTTTCCTCAATTATGCCCATGTTTTGAAGCTGAAGGAAGAAGGAAATCCAGGAGAAATCCCAGGATATACCCAGGTTACTCCTATGATTGCTCATTTTATTTGGAAAATGGGTCTGGTTTTGTTGCTCCTCCACCTGTTGCACAGAATCTCAATGCAGAAATCCCTATACAAATCTTTGATGATGATTTCAAAACTTCATTCTGGCCCCCATCTTCATATATTTGTCTCACTGTTTCTTGTCCTGATACTCATCAGGAAGTTCCCAAATCAATTTCATTATCTGAGGAAGAAGGGAAGTTAATGGCTTCTGATTTCTTGTTTTGGTCCAATAATGATCCAACTAGAGAGAGTGAAAAAGATATGCAGCAAGGGGCAGTGGAGGAAGCTATGGCTATGGCTGAGATCAGGTCCATGTCCATGGATGTGAAAGCTTTGGAGTTTGATGTTCACCATAGTTCTGATAATGCTATGGAATTTCCAGAGTGGTTGAGCATCAATAATGATATTTTGCAGCAGCATTCGAATTATAAATGCGTAGAGGAGGATTATCTTCAATATCCTGACCTATCCTGCTTCGACTTTGGGAAGATTGAAGATGTGGATGGAGATTGGTTAGCATGA
Protein sequence
MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALKLHRASTKEAREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFYLENGSGFVAPPPVAQNLNAEIPIQIFDDDFKTSFWPPSSYICLTVSCPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAVEEAMAMAEIRSMSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKIEDVDGDWLA
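As a quick consistency check between the coding sequence and the protein sequence listed above, the CDS can be translated with the standard genetic code. The following is a minimal sketch, not the database's own pipeline: the names `CODON_TABLE` and `translate` are illustrative, and it assumes the standard nuclear code, with unrecognized codons rendered as `X`. Only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code in the conventional compact layout:
# codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X = unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 15 nt of the CDS shown above.
cds_start = "ATGAACTCTACAGACCAACTCTGCAACTTT"
cds_end = "GATTGGTTAGCATGA"

print(translate(cds_start))  # MNSTDQLCNF
print(translate(cds_end))    # DWLA
```

The two outputs match the first ten and last four residues of the protein sequence; passing the full CDS string to `translate` should reproduce the whole protein in the same way.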
Homology
BLAST of HG10014307 vs. NCBI nr
Match:
KAE8649926.1 (hypothetical protein Csa_011922 [Cucumis sativus])
HSP 1 Score: 426.0 bits (1094), Expect = 2.6e-115
Identity = 239/307 (77.85%), Positives = 252/307 (82.08%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
MNSTDQL NFEA A+IS KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1 MNSTDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60
Query: 61 HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
HRA STKE AREQQQKQDQE KQS PLFPQ CFEAEGRRKSRRNPRIYP SYDCSFY
Sbjct: 61 HRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRNPRIYPDCSYDCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
LENGSG VAPPP +NLN EIPIQ FDDDFKT SFW PPSSYIC T+SCPD
Sbjct: 121 LENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTLSCPD 180
Query: 181 THQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAM-AMAEIRSMS 240
THQE+PKS+SL EEEG LMASD +FW NNDPT SEKDMQQ V EEAM AMA+I+SMS
Sbjct: 181 THQELPKSVSLREEEGNLMASD-VFWFNNDPTGVSEKDMQQEGVLEEEAMHAMADIKSMS 240
Query: 241 MDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKIED 291
MDVKALE D HSSDNAMEFP+WLSIN+D L Q+SNY CVEEDYLQ PDLSCFD KIED
Sbjct: 241 MDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQDPDLSCFDTWKIED 300
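The percentages on the statistics line of each HSP are the matched-column counts divided by the alignment length (for this hit, 239 identical and 252 positive columns out of 307). The sketch below parses such a line and recomputes the percentages. The regular expression and the function name `parse_hsp_stats` are assumptions made for illustration, not part of any BLAST tool, and the pattern accepts any spelling of the second label that begins with "Pos".

```python
import re

# Matches e.g. "Identity = 239/307 (77.85%), Positives = 252/307 (82.08%)".
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Pos\w+ = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, _, positive, _, _ = match.groups()
    identical, length, positive = int(identical), int(length), int(positive)
    return {
        "identical": identical,
        "positive": positive,
        "alignment_length": length,
        "identity_pct": round(100 * identical / length, 2),
        "positive_pct": round(100 * positive / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 239/307 (77.85%), Positives = 252/307 (82.08%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positive_pct"])  # 77.85 82.08
```

Note that the alignment length (307) exceeds the query length (291 residues) because gap columns are counted.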
BLAST of HG10014307 vs. NCBI nr
Match:
XP_016901295.1 (PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo] >KAA0064553.1 putative WRKY transcription factor protein 1 isoform X2 [Cucumis melo var. makuwa] >TYK20037.1 putative WRKY transcription factor protein 1 isoform X2 [Cucumis melo var. makuwa])
HSP 1 Score: 421.4 bits (1082), Expect = 6.3e-114
Identity = 237/309 (76.70%), Positives = 253/309 (81.88%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
MNS DQL NFEA A+IS KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1 MNSPDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60
Query: 61 HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
HRA STKE AREQQQKQDQE KQS PLFP+L CFEAEGRRKS+RNPRIYP SYDCSFY
Sbjct: 61 HRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
LENGSGFVAPPP +NLN EIPIQ FDDDFKT SFW PPSSYIC TVSCPD
Sbjct: 121 LENGSGFVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPD 180
Query: 181 T-HQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAMAMA--EIRS 240
T HQE PKS+SL EEEG LMASD +FW NNDPT +EKDMQQ AV EEAMAMA +++S
Sbjct: 181 THHQEFPKSVSLREEEGNLMASD-VFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKS 240
Query: 241 MSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKI 291
MSMDVKALE D HHSSDNAM FP+W+SIN+D LQQ+SNY CVEED LQ PDLSCFD GKI
Sbjct: 241 MSMDVKALEIDCHHSSDNAMAFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKI 300
BLAST of HG10014307 vs. NCBI nr
Match:
XP_038897806.1 (uncharacterized protein LOC120085720 [Benincasa hispida])
HSP 1 Score: 409.1 bits (1050), Expect = 3.3e-110
Identity = 220/287 (76.66%), Positives = 234/287 (81.53%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
MNS DQLCNFEA A+ISQPKPDGE KKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1 MNSADQLCNFEAAAQISQPKPDGESKKQVRRRRHSRRRLYKEMPLDMAEARREIVTALKL 60
Query: 61 HRASTKEAREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFYLE 120
HRASTKEAREQQQKQDQ+I QS+P+FPQL PCFE +GRRKSRRN R YP DCSFYLE
Sbjct: 61 HRASTKEAREQQQKQDQQINQSIPIFPQLGPCFEGDGRRKSRRNTRTYP----DCSFYLE 120
Query: 121 NGSGFVAPPPVAQNLNAEIPIQIFDDDFKT------SFWPPSSYICLTVSCPDTHQEVPK 180
NGSGFVAPP VAQNL EIP Q FDDDFKT SFWPPSSYI TVSC THQEVPK
Sbjct: 121 NGSGFVAPPSVAQNLITEIPTQSFDDDFKTSSYCPLSFWPPSSYIYPTVSCSATHQEVPK 180
Query: 181 SISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAVEE--AMAMAEIRSMSMDVKALEF 240
SISLSEEEG LMASD +FW NND +KDMQ+GAVEE A AMAE+R M+MDVKALE
Sbjct: 181 SISLSEEEGNLMASD-VFWFNND-----QKDMQEGAVEEARARAMAEVRPMTMDVKALES 240
Query: 241 DVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDF 279
D HHS +N MEF +W SIN+D LQQHSNY CVEEDYLQ PDLS + F
Sbjct: 241 DGHHSCENPMEFSDWPSINDDFLQQHSNYHCVEEDYLQDPDLSWYQF 277
BLAST of HG10014307 vs. NCBI nr
Match:
XP_022940715.1 (uncharacterized protein LOC111446225 [Cucurbita moschata])
HSP 1 Score: 352.1 bits (902), Expect = 4.7e-93
Identity = 210/329 (63.83%), Positives = 234/329 (71.12%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPD--GEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALK 60
MNSTDQLCNFEA KI QP+P GE KKQVRRRRQSRRLYK+ PL+MAEARREIVTALK
Sbjct: 1 MNSTDQLCNFEA-TKIPQPQPQPHGERKKQVRRRRQSRRLYKQMPLNMAEARREIVTALK 60
Query: 61 LHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
LHRASTKEA+EQQQKQDQ+IK S+P++P Q PCFE E R KSRRNPRIYP DCSFY
Sbjct: 61 LHRASTKEAKEQQQKQDQQIKHSLPMYPHQFTPCFEPERRMKSRRNPRIYP----DCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQI------FDDD--------------FKTSFWPPSSY 180
ENGS F+APPPVAQ+L+ +IPIQ F+D + SF PPSSY
Sbjct: 121 FENGSDFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCSNNNNNHSFYSLSFLPPSSY 180
Query: 181 ICLTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-----E 240
IC T THQEVPKSISLSEEEG+LMASD LFWSNN PT ESEK++ GAV E
Sbjct: 181 ICPTFDYAATTHQEVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEEEEE 240
Query: 241 EAMAMAEIRSMSMDVKALEFD--VH--------HSSDNAMEFPEWLSINNDILQQHSNYK 291
E +AEIR S+D K LE D H S+ AMEFP+WLSIN+D LQ SNY+
Sbjct: 241 EEAMVAEIR--SVDEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRSNYQ 300
BLAST of HG10014307 vs. NCBI nr
Match:
KAG6608324.1 (hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sororia])
HSP 1 Score: 351.7 bits (901), Expect = 6.2e-93
Identity = 206/325 (63.38%), Positives = 230/325 (70.77%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPD--GEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALK 60
MNSTDQLCNFEA KI QP+P GE KKQVRRRRQSRRLYK+ PL+MAEARREIVTALK
Sbjct: 1 MNSTDQLCNFEA-TKIPQPQPQPHGERKKQVRRRRQSRRLYKQMPLNMAEARREIVTALK 60
Query: 61 LHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
LHRASTKEA+EQQQKQDQ+IK S+P++P Q PCFE E R KSRRNPRIYP DCSFY
Sbjct: 61 LHRASTKEAKEQQQKQDQQIKHSLPMYPHQFTPCFEPERRMKSRRNPRIYP----DCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQI------FDDD------------FKTSFWPPSSYIC 180
ENGS F+APPPVAQ+L+ +IPIQ F+D + SF PPSSYIC
Sbjct: 121 FENGSHFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCNNNNHSFYSLSFLPPSSYIC 180
Query: 181 LTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-----EEA 240
T THQEVPKSISLSEEEG+LMASD LFWSNN PT ESEK++ GAV EE
Sbjct: 181 PTFDYAATTHQEVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEEEEEEE 240
Query: 241 MAMAEIRSMSMDVKALEFDVH--------HSSDNAMEFPEWLSINNDILQQHSNYKCVEE 291
+AEIRSM ++ H S+ AMEFP+WLSIN+D LQ SNY E
Sbjct: 241 AMVAEIRSMEEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRSNYPFSNE 300
BLAST of HG10014307 vs. ExPASy TrEMBL
Match:
A0A5A7V8V7 (Putative WRKY transcription factor protein 1 isoform X2 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold134G001570 PE=4 SV=1)
HSP 1 Score: 421.4 bits (1082), Expect = 3.1e-114
Identity = 237/309 (76.70%), Positives = 253/309 (81.88%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
MNS DQL NFEA A+IS KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1 MNSPDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60
Query: 61 HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
HRA STKE AREQQQKQDQE KQS PLFP+L CFEAEGRRKS+RNPRIYP SYDCSFY
Sbjct: 61 HRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
LENGSGFVAPPP +NLN EIPIQ FDDDFKT SFW PPSSYIC TVSCPD
Sbjct: 121 LENGSGFVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPD 180
Query: 181 T-HQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAMAMA--EIRS 240
T HQE PKS+SL EEEG LMASD +FW NNDPT +EKDMQQ AV EEAMAMA +++S
Sbjct: 181 THHQEFPKSVSLREEEGNLMASD-VFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKS 240
Query: 241 MSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKI 291
MSMDVKALE D HHSSDNAM FP+W+SIN+D LQQ+SNY CVEED LQ PDLSCFD GKI
Sbjct: 241 MSMDVKALEIDCHHSSDNAMAFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKI 300
BLAST of HG10014307 vs. ExPASy TrEMBL
Match:
A0A1S4DZY0 (uncharacterized protein LOC103493717 OS=Cucumis melo OX=3656 GN=LOC103493717 PE=4 SV=1)
HSP 1 Score: 421.4 bits (1082), Expect = 3.1e-114
Identity = 237/309 (76.70%), Positives = 253/309 (81.88%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPDGEPKKQVRRRRQS-RRLYKETPLDMAEARREIVTALKL 60
MNS DQL NFEA A+IS KPD EPKKQVRRRR S RRLYKE PLDMAEARREIVTALKL
Sbjct: 1 MNSPDQL-NFEAAAQIS--KPDEEPKKQVRRRRHSRRRLYKEVPLDMAEARREIVTALKL 60
Query: 61 HRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
HRA STKE AREQQQKQDQE KQS PLFP+L CFEAEGRRKS+RNPRIYP SYDCSFY
Sbjct: 61 HRASSTKEAAREQQQKQDQESKQSFPLFPELGQCFEAEGRRKSKRNPRIYPSCSYDCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW-PPSSYICLTVSCPD 180
LENGSGFVAPPP +NLN EIPIQ FDDDFKT SFW PPSSYIC TVSCPD
Sbjct: 121 LENGSGFVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWPPPSSYICPTVSCPD 180
Query: 181 T-HQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--EEAMAMA--EIRS 240
T HQE PKS+SL EEEG LMASD +FW NNDPT +EKDMQQ AV EEAMAMA +++S
Sbjct: 181 THHQEFPKSVSLREEEGNLMASD-VFWFNNDPTGVNEKDMQQEAVLEEEAMAMAMDDLKS 240
Query: 241 MSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQYPDLSCFDFGKI 291
MSMDVKALE D HHSSDNAM FP+W+SIN+D LQQ+SNY CVEED LQ PDLSCFD GKI
Sbjct: 241 MSMDVKALEIDCHHSSDNAMAFPDWMSINDDSLQQYSNYHCVEEDCLQEPDLSCFDIGKI 300
BLAST of HG10014307 vs. ExPASy TrEMBL
Match:
A0A6J1FRD8 (uncharacterized protein LOC111446225 OS=Cucurbita moschata OX=3662 GN=LOC111446225 PE=4 SV=1)
HSP 1 Score: 352.1 bits (902), Expect = 2.3e-93
Identity = 210/329 (63.83%), Positives = 234/329 (71.12%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPD--GEPKKQVRRRRQSRRLYKETPLDMAEARREIVTALK 60
MNSTDQLCNFEA KI QP+P GE KKQVRRRRQSRRLYK+ PL+MAEARREIVTALK
Sbjct: 1 MNSTDQLCNFEA-TKIPQPQPQPHGERKKQVRRRRQSRRLYKQMPLNMAEARREIVTALK 60
Query: 61 LHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYDCSFY 120
LHRASTKEA+EQQQKQDQ+IK S+P++P Q PCFE E R KSRRNPRIYP DCSFY
Sbjct: 61 LHRASTKEAKEQQQKQDQQIKHSLPMYPHQFTPCFEPERRMKSRRNPRIYP----DCSFY 120
Query: 121 LENGSGFVAPPPVAQNLNAEIPIQI------FDDD--------------FKTSFWPPSSY 180
ENGS F+APPPVAQ+L+ +IPIQ F+D + SF PPSSY
Sbjct: 121 FENGSDFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCSNNNNNHSFYSLSFLPPSSY 180
Query: 181 ICLTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-----E 240
IC T THQEVPKSISLSEEEG+LMASD LFWSNN PT ESEK++ GAV E
Sbjct: 181 ICPTFDYAATTHQEVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEEEEE 240
Query: 241 EAMAMAEIRSMSMDVKALEFD--VH--------HSSDNAMEFPEWLSINNDILQQHSNYK 291
E +AEIR S+D K LE D H S+ AMEFP+WLSIN+D LQ SNY+
Sbjct: 241 EEAMVAEIR--SVDEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRSNYQ 300
BLAST of HG10014307 vs. ExPASy TrEMBL
Match:
A0A0A0L091 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G649650 PE=4 SV=1)
HSP 1 Score: 349.0 bits (894), Expect = 1.9e-92
Identity = 192/249 (77.11%), Positives = 203/249 (81.53%), Query Frame = 0
Query: 46 MAEARREIVTALKLHRA-STKE-AREQQQKQDQEIKQSVPLFPQLCPCFEAEGRRKSRRN 105
MAEARREIVTALKLHRA STKE AREQQQKQDQE KQS PLFPQ CFEAEGRRKSRRN
Sbjct: 1 MAEARREIVTALKLHRASSTKEAAREQQQKQDQESKQSFPLFPQFGQCFEAEGRRKSRRN 60
Query: 106 PRIYPGYSYDCSFYLENGSGFVAPPPVAQNLNAEIPIQIFDDDFKT----------SFW- 165
PRIYP SYDCSFYLENGSG VAPPP +NLN EIPIQ FDDDFKT SFW
Sbjct: 61 PRIYPDCSYDCSFYLENGSGLVAPPP--ENLNTEIPIQTFDDDFKTLDTCSSFCSLSFWP 120
Query: 166 PPSSYICLTVSCPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV-- 225
PPSSYIC T+SCPDTHQE+PKS+SL EEEG LMASD +FW NNDPT SEKDMQQ V
Sbjct: 121 PPSSYICPTLSCPDTHQELPKSVSLREEEGNLMASD-VFWFNNDPTGVSEKDMQQEGVLE 180
Query: 226 EEAM-AMAEIRSMSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSNYKCVEEDYLQ 279
EEAM AMA+I+SMSMDVKALE D HSSDNAMEFP+WLSIN+D L Q+SNY CVEEDYLQ
Sbjct: 181 EEAMHAMADIKSMSMDVKALEIDGRHSSDNAMEFPDWLSINDDFLLQYSNYHCVEEDYLQ 240
BLAST of HG10014307 vs. ExPASy TrEMBL
Match:
A0A6J1IXC1 (uncharacterized protein LOC111480786 OS=Cucurbita maxima OX=3661 GN=LOC111480786 PE=4 SV=1)
HSP 1 Score: 343.6 bits (880), Expect = 8.1e-91
Identity = 206/332 (62.05%), Positives = 233/332 (70.18%), Query Frame = 0
Query: 1 MNSTDQLCNFEAVAKISQPKPD------GEPKKQVRRRRQSRRLYKETPLDMAEARREIV 60
MNSTDQLCNFEA KI QP+P GE KKQVRRRR++RRLYK+ PL+MAEARREIV
Sbjct: 1 MNSTDQLCNFEA-TKIPQPQPQPQPQPHGERKKQVRRRRETRRLYKQMPLNMAEARREIV 60
Query: 61 TALKLHRASTKEAREQQQKQDQEIKQSVPLFP-QLCPCFEAEGRRKSRRNPRIYPGYSYD 120
TALKLHRASTKEA+EQQQKQDQ+IK S+P++P Q PCFE E R KSRRNPRIYP D
Sbjct: 61 TALKLHRASTKEAKEQQQKQDQQIKHSLPVYPHQFTPCFEPERRMKSRRNPRIYP----D 120
Query: 121 CSFYLENGSGFVAPPPVAQNLNAEIPIQI------FDDD-------------FKTSFWPP 180
CSFY +NGS F+APPPVAQ+L+ +IPIQ F+D + SF P
Sbjct: 121 CSFYFQNGSDFIAPPPVAQSLHLDIPIQTLGLNPNFEDTSSVVCNNNNNHSFYSLSFLHP 180
Query: 181 SSYICLTVS-CPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTRESEKDMQQGAV--- 240
SSYIC T TH+EVPKSISLSEEEG+LMASD LFWSNN PT ESEK++ GAV
Sbjct: 181 SSYICPTFDYAATTHREVPKSISLSEEEGRLMASD-LFWSNNFPTGESEKEI-HGAVEEE 240
Query: 241 --EEAMAMAEIRSMSMDVKALEFD--VH--------HSSDNAMEFPEWLSINNDILQQHS 291
EE +AEIR SMD K LE D H S+ AMEFP+WLSIN+D LQ S
Sbjct: 241 EEEEEAMVAEIR--SMDEKPLEIDGQTHCTFENVPTGQSEEAMEFPDWLSINDDFLQPRS 300
BLAST of HG10014307 vs. TAIR 10
Match:
AT5G21280.1 (hydroxyproline-rich glycoprotein family protein)
HSP 1 Score: 77.0 bits (188), Expect = 2.7e-14
Identity = 81/272 (29.78%), Positives = 117/272 (43.01%), Query Frame = 0
Query: 26 KKQVRRRRQSRRLYKETPLDMAEARREIVTALKLHRASTKEAREQQQKQDQEIKQSVPLF 85
KKQVRRR + R Y+E L+MAEARREIVTALK HRAS ++A Q Q + LF
Sbjct: 53 KKQVRRRLHTSRPYQERLLNMAEARREIVTALKQHRASMRQATRIPPPQPPPPPQPLNLF 112
Query: 86 PQLCPCFEAEGRRKSRRNPR---IYPGYSYDCSFYLENGSGFVAPPPVAQNLNAEIPIQI 145
P S NP + P + ++ + F+ + ++
Sbjct: 113 SPPPP--PPPPDPFSWTNPSLNFLLPNQPLGLNLNFQDFNDFIQTSSTTSSSSSSSTSSS 172
Query: 146 FDDDFKTS---FWPPSSYICLTVSCPDTHQEVPKSISLSEEEGKLMASDFLFWSNNDPTR 205
F T+ + PS T + D+ ++P S S E ++ S +WS
Sbjct: 173 SSSIFPTNPHIYSSPSPPPTFTTATSDSAPQLPSS---SNGENNVVTS--AWWS------ 232
Query: 206 ESEKDMQQGAVEEAMAMAEIRSMSMDVKALEFDVHHSSDNAMEFPEWLSINNDILQQHSN 265
++ VE EI+ + +V +E DV + MEFP WL+ + L N
Sbjct: 233 ----ELMLKTVE-----PEIKPETEEVIVVEDDVFPKFSDVMEFPSWLNQTEEELFHPYN 292
Query: 266 YKCVEEDYLQYPDLSCFDFGKIEDVDG-DWLA 291
P LSC + G+IE +DG DWLA
Sbjct: 293 LTDHYSSSPHNPPLSCMEIGEIEGMDGDDWLA 302
The following BLAST results are available for this feature:
HG10014307 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
KAE8649926.1 | 2.6e-115 | 77.85 | hypothetical protein Csa_011922 [Cucumis sativus]
XP_016901295.1 | 6.3e-114 | 76.70 | PREDICTED: uncharacterized protein LOC103493717 [Cucumis melo] >KAA0064553.1 put...
XP_038897806.1 | 3.3e-110 | 76.66 | uncharacterized protein LOC120085720 [Benincasa hispida]
XP_022940715.1 | 4.7e-93 | 63.83 | uncharacterized protein LOC111446225 [Cucurbita moschata]
KAG6608324.1 | 6.2e-93 | 63.38 | hypothetical protein SDJN03_01666, partial [Cucurbita argyrosperma subsp. sorori...
HG10014307 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A5A7V8V7 | 3.1e-114 | 76.70 | Putative WRKY transcription factor protein 1 isoform X2 OS=Cucumis melo var. mak...
A0A1S4DZY0 | 3.1e-114 | 76.70 | uncharacterized protein LOC103493717 OS=Cucumis melo OX=3656 GN=LOC103493717 PE=...
A0A6J1FRD8 | 2.3e-93 | 63.83 | uncharacterized protein LOC111446225 OS=Cucurbita moschata OX=3662 GN=LOC1114462...
A0A0A0L091 | 1.9e-92 | 77.11 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G649650 PE=4 SV=1
A0A6J1IXC1 | 8.1e-91 | 62.05 | uncharacterized protein LOC111480786 OS=Cucurbita maxima OX=3661 GN=LOC111480786...
HG10014307 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT5G21280.1 | 2.7e-14 | 29.78 | hydroxyproline-rich glycoprotein family protein