Sequences
The following sequences are available for this feature:
Gene sequence (with intron)
Legend: five_prime_UTR, exon, CDS, polypeptide, three_prime_UTR
TAAATAAAAGCTGTAGATTCTCCTGAGTAAATCAGTATAAATGGACATTAACCATAAATACAAGCAATGGCAGGAATGTTAGAGATTCCCCATTTCACATATTCCCATCTCATTTCTCCTTCTTCTTCTCTTCACAATTTGATCATATTCATGGCTTTCCATCTCTGAACCAAACAACTCTCCTCTTTCATTCTTTCCCATGTCTCGCAGACCCCTAGATTCCCGCCATTCAATTGACTCTTGTACTCTCAAATTCCATGGTTGGACCCCTTTCCACCTCCCCAAAACCCTAGATTCCGACCCCCATAATATTAATACCTCTGCTCCCACTAACCCTAAACCCTACTACTCTTCCACTCCCCTCCACACCAAACGCCCTTGTCTCTCCGATCGCACTACCTCTTTCAATGTCGACGCCATTGACATGTCCGCCCTTAGTTTGATCGACGACGACAAGCCTTCTATTCCTCCTGCCCGTAGCTTCCGATTGATTGCTAGGAAGCGACGTCGGCGTGGTTCTAGGTCTGTTTCTGGCCGGAGTAGTGATCGGAGTGGGACTAGACGGTGTTGCTCTGTTGGGGCTTCTGCGGCTCATGGGACTTGCTCGGATTTCCCTCTAGCGGTTGGGACTGATTCGAGTGGGGAGTTGTTTGTCAATGGGGATGCCAATTGGTCTTCGGATGTGAGTGAAGCCAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGAACATTTGGGTTCTGGGTTTGGTTCTTCTAATGGTGGTTTTGATGTTCAGGGGAATGAGTCTGGATATGGTAGTGAGCCTGGTTATCGTGGTGATGGTGAATTTGGATATGGTGATGAGATCGATGAGGAGGATGAGGATGCTAGATTGCTTTTGTGGGGTGAACGACTTGGAGGTTCGTTTTCTTTCTCATTTTCTTCTCATCTTAAGAGATTACAATGGTTGATCACATTGATGGCTTGGACTAGACTACAATTTTAGTTTTAGACGTTGACATTGGCTGATAAGTTAGGACACACTGCTGTTATGACACTTGTTGGATACCTCGTTAGACACTTGTTTTTAGTACAACAATTGTTGGGGACAAAGATTTGGACCTTCAATTTAGGAATAGTATGTGCCAACTGCCAACTACTGATGAACATGTTAAGCCTAATAAATGGACATATGAGTTTCTAAAGCATATATTTATGGTTAAATTATACCAACACCTTAAACTTTAGGCTACGTTTCTGTATGTCTATTCATTCATAAGAAATCGTTAAAGAATATATTTTTTTTCCTAAAATAAACCATGGAAACTAATTTACTTGAATGTAAATGATGTCACGTATGTCTTTTATTTTTAAAATACATATATGTTTCTTATTGAATTTTCATAGATGTGTAAAAAAGCTAATATATATATATATTTTTTATATCTGACATCTTGCGTCCCAGATTTTTAAAAGGTGTTGTCTCGACGATGCATCTGTGTCTATCTGGCTTTTTAGATTTTGAGATACATTTAGCTAATCCTTAAAAATTGAAATGTTATCTTTGAAATTGAATTCCACGGTTCCTATCATCACATTAACTTAAAGATAGGTCCTTTTATCAGTTTGTTTTGAAATATTAAACATAATAGAGTGAAGATGTGTACTAGGAGTTTGATACACAAGTAAAAGAGGAATATTGGACAATCATTTTATTGTCCGACTGTAGCCGATGTTTGTCTAACATGTCACTTCAACGGTTCAATCTTTAGCTAAGGATGTTTACAATAAAAGTTATTCAAACATAGGGGAGGCAATTGAAAATTTTGAACTTTAGAAACTAATGTTGTAATTTAACGGAGCAGACTTTTGAGGGTTTATTTTAATCATTTTATTTTGTGGGGAAAGCTAATGGTTGAGGAATTATAGCACCTTGATGGAGAAGGTTCATGGCAGCTTCTAGATTGTTACCAGCGTGTGAAACTTGTCCATTTATTTTGTGGAGCGTGATCTACATCTGA
ATTTAGTAGCTTTATGAACCTTTGATCTCATACTTCGCATTGTCTGGAAACTATATTCCTCTTAGAACATGGTTTCACAGACTCCACAACAGGCGACTCTGTTTCACCTTTGGATGTCAACGAAAGCTTTATCTCTAATGACATTAAATTGGCTTGTTGTCTTATTCTAGTTCTGTTGGTGGATTTGTTTAAAAAGATGGCACAATATGGTAAAAAATCGTGTGGTGCCTTTGTACTTAAGATTGAATAAACTTGAGAACAACTATTTTGGGACTGGCTCCACGGTAGAGTTAAGGTGTTAAAGATGGACAATTGTGGATGTAGTATTTTTGCTACGCTTGGTTTCGAGTATGAATTTTATCTAATCTAGGCATTTATATCATTGTTAACTCTTCATCTATGCTTTTCGTTATGGCAATTTTAGGCAATTTATGTCCTAGAGTTGATTTCGTTGAAAGATGAGAACATAACTTATCCTAGAAAATGGAGCAGTCTTCTCAATAAATGTCTTTTGCTGTTGGTAGAACTTAACAAGTTCATTTATCTTTCTTCTACCTTAAATGACTGCCTCCTGGTTAATATGTATGTCCTGCCTTCCGTTTGCCCAAAAACATCGATTGGCATAACAATGACCAATGGGTAGATTAATGCTTGATTCCAAATTTCTGATATGCACATACAGTCCGTTACCTTTTTCTGGTTCAAAATCCAAGTTAAAAGAGATAGATTGATATGTTCACATTCTATGCAGATTCTAGAATGGAAATTGTAGGAGAGAACACATTTGCAGATCAGAAATCGCACCATAGATGTCGCCGTAAGAAGCACGAATGTAGAATGGTTGATACCCTGCGGTGAAGCAAGACATTGAAACCGAGCAATAAACACTGTGAATACTATGGGCCCTACCAGCTGGCATGCTTCTGAGACCTAAAATTTTGAACTGCAAAGTAAAGGTTCCACAGTTTTGAGAGTCTGATTAGATTGGAGTTTACAGCATCTAGATTTTTCTTCATATTTTTTGGTGGATAGGATAGATCTATCGCTGCTATGAAATTTAGAACACCGTCATTGATGCATCTATAGCAAGGAAAATATAATCCCCATTGCCAAATTATCTTTGATATTTTCTCAGCTTAAGGTGTAAACTCTTTTGTATCTGTCTTGAAAAAAAGGCTTGAGTTATTCTTGAGTCAACTTAGTAAATACCACTTTTGACCCCTTATTCTATTTCAATCCTTATGATTTCATTTTTTTTAACTTTGATTTGACACCTATATTTTTTATAAATTTTATAACGGCCCCAACTGTTGGCTTTTAATTACATTTTGATAAAAAAGCA
mRNA sequence
TAAATAAAAGCTGTAGATTCTCCTGAGTAAATCAGTATAAATGGACATTAACCATAAATACAAGCAATGGCAGGAATGTTAGAGATTCCCCATTTCACATATTCCCATCTCATTTCTCCTTCTTCTTCTCTTCACAATTTGATCATATTCATGGCTTTCCATCTCTGAACCAAACAACTCTCCTCTTTCATTCTTTCCCATGTCTCGCAGACCCCTAGATTCCCGCCATTCAATTGACTCTTGTACTCTCAAATTCCATGGTTGGACCCCTTTCCACCTCCCCAAAACCCTAGATTCCGACCCCCATAATATTAATACCTCTGCTCCCACTAACCCTAAACCCTACTACTCTTCCACTCCCCTCCACACCAAACGCCCTTGTCTCTCCGATCGCACTACCTCTTTCAATGTCGACGCCATTGACATGTCCGCCCTTAGTTTGATCGACGACGACAAGCCTTCTATTCCTCCTGCCCGTAGCTTCCGATTGATTGCTAGGAAGCGACGTCGGCGTGGTTCTAGGTCTGTTTCTGGCCGGAGTAGTGATCGGAGTGGGACTAGACGGTGTTGCTCTGTTGGGGCTTCTGCGGCTCATGGGACTTGCTCGGATTTCCCTCTAGCGGTTGGGACTGATTCGAGTGGGGAGTTGTTTGTCAATGGGGATGCCAATTGGTCTTCGGATGTGAGTGAAGCCAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGAACATTTGGGTTCTGGGTTTGGTTCTTCTAATGGTGGTTTTGATGTTCAGGGGAATGAGTCTGGATATGGTAGTGAGCCTGGTTATCGTGGTGATGGTGAATTTGGATATGGTGATGAGATCGATGAGGAGGATGAGGATGCTAGATTGCTTTTGTGGGGTGAACGACTTGGAGATTCTAGAATGGAAATTGTAGGAGAGAACACATTTGCAGATCAGAAATCGCACCATAGATGTCGCCGTAAGAAGCACGAATGTAGAATGGTTGATACCCTGCGGTGAAGCAAGACATTGAAACCGAGCAATAAACACTGTGAATACTATGGGCCCTACCAGCTGGCATGCTTCTGAGACCTAAAATTTTGAACTGCAAAGTAAAGGTTCCACAGTTTTGAGAGTCTGATTAGATTGGAGTTTACAGCATCTAGATTTTTCTTCATATTTTTTGGTGGATAGGATAGATCTATCGCTGCTATGAAATTTAGAACACCGTCATTGATGCATCTATAGCAAGGAAAATATAATCCCCATTGCCAAATTATCTTTGATATTTTCTCAGCTTAAGGTGTAAACTCTTTTGTATCTGTCTTGAAAAAAAGGCTTGAGTTATTCTTGAGTCAACTTAGTAAATACCACTTTTGACCCCTTATTCTATTTCAATCCTTATGATTTCATTTTTTTTAACTTTGATTTGACACCTATATTTTTTATAAATTTTATAACGGCCCCAACTGTTGGCTTTTAATTACATTTTGATAAAAAAGCA
Coding sequence (CDS)
ATGTCTCGCAGACCCCTAGATTCCCGCCATTCAATTGACTCTTGTACTCTCAAATTCCATGGTTGGACCCCTTTCCACCTCCCCAAAACCCTAGATTCCGACCCCCATAATATTAATACCTCTGCTCCCACTAACCCTAAACCCTACTACTCTTCCACTCCCCTCCACACCAAACGCCCTTGTCTCTCCGATCGCACTACCTCTTTCAATGTCGACGCCATTGACATGTCCGCCCTTAGTTTGATCGACGACGACAAGCCTTCTATTCCTCCTGCCCGTAGCTTCCGATTGATTGCTAGGAAGCGACGTCGGCGTGGTTCTAGGTCTGTTTCTGGCCGGAGTAGTGATCGGAGTGGGACTAGACGGTGTTGCTCTGTTGGGGCTTCTGCGGCTCATGGGACTTGCTCGGATTTCCCTCTAGCGGTTGGGACTGATTCGAGTGGGGAGTTGTTTGTCAATGGGGATGCCAATTGGTCTTCGGATGTGAGTGAAGCCAAGAATTCGAGGAGGGAGAGAGAGGAGAAGGAACATTTGGGTTCTGGGTTTGGTTCTTCTAATGGTGGTTTTGATGTTCAGGGGAATGAGTCTGGATATGGTAGTGAGCCTGGTTATCGTGGTGATGGTGAATTTGGATATGGTGATGAGATCGATGAGGAGGATGAGGATGCTAGATTGCTTTTGTGGGGTGAACGACTTGGAGATTCTAGAATGGAAATTGTAGGAGAGAACACATTTGCAGATCAGAAATCGCACCATAGATGTCGCCGTAAGAAGCACGAATGTAGAATGGTTGATACCCTGCGGTGA
Protein sequence
MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKEHLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDTLR
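The protein sequence above corresponds to a frame-0 translation of the CDS. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the relationship can be checked as follows. The names `CODON_TABLE` and `translate` are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the standard code applies; the page itself does not name a table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGTCTCGCAGACCC"))  # first five codons of the CDS
    print(translate("ACCCTGCGGTGA"))     # last four codons, ending in the stop
```

Passing the full CDS string to `translate` should reproduce the listed protein sequence if the assumption about the genetic code holds.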
Homology
BLAST of PI0020009 vs. ExPASy TrEMBL
Match:
A0A0A0KT54 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G001770 PE=4 SV=1)
HSP 1 Score: 515.0 bits (1325), Expect = 1.9e-142
Identity = 260/268 (97.01%), Positives = 262/268 (97.76%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH NTSAPTN KPYYSSTPLHTKRP
Sbjct: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH--NTSAPTNSKPYYSSTPLHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT
Sbjct: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
Query: 121 RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKEHLGS 180
RRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEK+HLGS
Sbjct: 121 RRCCSVGASAAHGTCSDFPIAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGS 180
Query: 181 GFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
GF SSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV
Sbjct: 181 GFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
Query: 241 GENTFADQKSHHRCRRKKHECRMVDTLR 269
GENTFADQKSHHRCRRKKHECRMVD LR
Sbjct: 241 GENTFADQKSHHRCRRKKHECRMVDALR 266
BLAST of PI0020009 vs. ExPASy TrEMBL
Match:
A0A1S3BYD4 (uncharacterized protein LOC103494772 OS=Cucumis melo OX=3656 GN=LOC103494772 PE=4 SV=1)
HSP 1 Score: 515.0 bits (1325), Expect = 1.9e-142
Identity = 260/268 (97.01%), Positives = 262/268 (97.76%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH NTSAPTN KPYYSSTP+HTKRP
Sbjct: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH--NTSAPTNSKPYYSSTPIHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT
Sbjct: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
Query: 121 RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKEHLGS 180
RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEK+HLGS
Sbjct: 121 RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGS 180
Query: 181 GFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
GF SSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV
Sbjct: 181 GFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
Query: 241 GENTFADQKSHHRCRRKKHECRMVDTLR 269
GENTFADQKSHHRCRRKKHECRMVD LR
Sbjct: 241 GENTFADQKSHHRCRRKKHECRMVDALR 266
BLAST of PI0020009 vs. ExPASy TrEMBL
Match:
A0A6J1GCW8 (uncharacterized protein LOC111452804 isoform X2 OS=Cucurbita moschata OX=3662 GN=LOC111452804 PE=4 SV=1)
HSP 1 Score: 452.6 bits (1163), Expect = 1.1e-123
Identity = 242/277 (87.36%), Positives = 245/277 (88.45%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFH----LPKTLDSDPHNINTSAPTNPKPYYSSTPLH 60
MSRRPLDSR SIDSCTLK H W PFH PKTLDSD H S PT KPYYSST LH
Sbjct: 1 MSRRPLDSRQSIDSCTLKLHTWRPFHHLHSAPKTLDSDTH---ISPPTTSKPYYSSTALH 60
Query: 61 TKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIPPA----RSFRLIARKRRRRGSRSVSG 120
TKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSI RSF LIARKRRRRGSRSVSG
Sbjct: 61 TKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGRYTRRSFGLIARKRRRRGSRSVSG 120
Query: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER 180
RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER
Sbjct: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER 180
Query: 181 EEKE-HLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 240
+EK+ HLG GFG SNGG D QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER
Sbjct: 181 DEKDHHLGGGFG-SNGGLDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 240
Query: 241 LGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDTLR 269
LGDSRMEIVGENTF+DQKSHHRCRRKKHECRMVDTLR
Sbjct: 241 LGDSRMEIVGENTFSDQKSHHRCRRKKHECRMVDTLR 273
BLAST of PI0020009 vs. ExPASy TrEMBL
Match:
A0A6J1J7N4 (uncharacterized protein LOC111482487 OS=Cucurbita maxima OX=3661 GN=LOC111482487 PE=4 SV=1)
HSP 1 Score: 450.3 bits (1157), Expect = 5.7e-123
Identity = 239/275 (86.91%), Positives = 247/275 (89.82%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRR LDSR SI SCTLK HGW PF LPK LDSD H TSAPT+ KPYYSS+ LHTKRP
Sbjct: 1 MSRRALDSRESIHSCTLKLHGWRPFQLPKALDSDAH---TSAPTSAKPYYSSSGLHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPA-----RSFRLIARK-RRRRGSRSVSGRS 120
CLSDRTTSFNVDAIDMS LSLIDDDKPSI SF+LIARK RRRRGSRSVSGRS
Sbjct: 61 CLSDRTTSFNVDAIDMSGLSLIDDDKPSITAGGSYSRPSFQLIARKRRRRRGSRSVSGRS 120
Query: 121 SDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNS-RRERE 180
+DRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNS RRERE
Sbjct: 121 TDRSGTRRCCSVGASAAHGTCSDFPMAVGTDSSGELFVNGDANWSSDVSEAKNSRRRERE 180
Query: 181 EKEHLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG 240
EK+ LGSGFGSSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG
Sbjct: 181 EKDQLGSGFGSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLG 240
Query: 241 DSRMEIVGENTFADQKSHHRCRRKKHECRMVDTLR 269
DSR+EIVGENTFADQKSHHRCRRKKHEC MVD+LR
Sbjct: 241 DSRVEIVGENTFADQKSHHRCRRKKHECGMVDSLR 272
BLAST of PI0020009 vs. ExPASy TrEMBL
Match:
A0A6J1F5R5 (uncharacterized protein LOC111442395 OS=Cucurbita moschata OX=3662 GN=LOC111442395 PE=4 SV=1)
HSP 1 Score: 448.4 bits (1152), Expect = 2.2e-122
Identity = 238/276 (86.23%), Positives = 246/276 (89.13%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRR LDSR SI SCTLK HGW PF LPK LDSD H TSAPT+ KPYYSS+ LHTKRP
Sbjct: 1 MSRRALDSRESIHSCTLKLHGWRPFQLPKALDSDAH---TSAPTSAKPYYSSSGLHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPA-----RSFRLIARK--RRRRGSRSVSGR 120
CLSDRTTSFNVDAIDMS LSLIDDDKPSI SF+LIARK RRRRGSRSVSGR
Sbjct: 61 CLSDRTTSFNVDAIDMSGLSLIDDDKPSITAGGSYSRPSFQLIARKRRRRRRGSRSVSGR 120
Query: 121 SSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNS-RRER 180
S+DRSGTRRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNS RRER
Sbjct: 121 STDRSGTRRCCSVGASAAHGTCSDFPMAVGTDSSGELFVNGDANWSSDVSEAKNSRRRER 180
Query: 181 EEKEHLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL 240
EEK+ LGSGFGSSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL
Sbjct: 181 EEKDQLGSGFGSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERL 240
Query: 241 GDSRMEIVGENTFADQKSHHRCRRKKHECRMVDTLR 269
GDSR+EIVGENTF DQKSHHRCRRKKHEC MVD+LR
Sbjct: 241 GDSRVEIVGENTFTDQKSHHRCRRKKHECGMVDSLR 273
BLAST of PI0020009 vs. NCBI nr
Match:
XP_004152251.1 (uncharacterized protein LOC101206482 [Cucumis sativus] >KGN52810.1 hypothetical protein Csa_015327 [Cucumis sativus])
HSP 1 Score: 515.0 bits (1325), Expect = 3.9e-142
Identity = 260/268 (97.01%), Positives = 262/268 (97.76%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH NTSAPTN KPYYSSTPLHTKRP
Sbjct: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH--NTSAPTNSKPYYSSTPLHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT
Sbjct: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
Query: 121 RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKEHLGS 180
RRCCSVGASAAHGTCSDFP+AVGTDSSGELFVNGDANWSSDVSEAKNSRREREEK+HLGS
Sbjct: 121 RRCCSVGASAAHGTCSDFPIAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGS 180
Query: 181 GFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
GF SSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV
Sbjct: 181 GFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
Query: 241 GENTFADQKSHHRCRRKKHECRMVDTLR 269
GENTFADQKSHHRCRRKKHECRMVD LR
Sbjct: 241 GENTFADQKSHHRCRRKKHECRMVDALR 266
BLAST of PI0020009 vs. NCBI nr
Match:
XP_008454343.1 (PREDICTED: uncharacterized protein LOC103494772 [Cucumis melo])
HSP 1 Score: 515.0 bits (1325), Expect = 3.9e-142
Identity = 260/268 (97.01%), Positives = 262/268 (97.76%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH NTSAPTN KPYYSSTP+HTKRP
Sbjct: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPH--NTSAPTNSKPYYSSTPIHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT
Sbjct: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPARSFRLIARKRRRRGSRSVSGRSSDRSGT 120
Query: 121 RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKEHLGS 180
RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEK+HLGS
Sbjct: 121 RRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKDHLGS 180
Query: 181 GFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
GF SSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV
Sbjct: 181 GFVSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSRMEIV 240
Query: 241 GENTFADQKSHHRCRRKKHECRMVDTLR 269
GENTFADQKSHHRCRRKKHECRMVD LR
Sbjct: 241 GENTFADQKSHHRCRRKKHECRMVDALR 266
BLAST of PI0020009 vs. NCBI nr
Match:
XP_038906083.1 (uncharacterized protein LOC120091971 [Benincasa hispida])
HSP 1 Score: 485.0 bits (1247), Expect = 4.3e-133
Identity = 250/272 (91.91%), Positives = 254/272 (93.38%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKRP 60
MSRR LDSR SIDSCTLK HGW+PFHLPKTLDSD H +SAPTN KPYYSSTPLHTKRP
Sbjct: 1 MSRRALDSRQSIDSCTLKLHGWSPFHLPKTLDSDTH---SSAPTNSKPYYSSTPLHTKRP 60
Query: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIPPA----RSFRLIARKRRRRGSRSVSGRSSD 120
CLSDRTTSFNVDAIDMSALSLIDDDKPSI RS RLIARKRRRRGSRSVSGRSSD
Sbjct: 61 CLSDRTTSFNVDAIDMSALSLIDDDKPSIAAGCYTRRSLRLIARKRRRRGSRSVSGRSSD 120
Query: 121 RSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKE 180
RSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEK+
Sbjct: 121 RSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRREREEKD 180
Query: 181 HLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSR 240
HLGSGFGSSNGGFD QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDS+
Sbjct: 181 HLGSGFGSSNGGFDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGERLGDSK 240
Query: 241 MEIVGENTFADQKSHHRCRRKKHECRMVDTLR 269
MEIVGENTFADQKSHHRCRRKKHECRMVD LR
Sbjct: 241 MEIVGENTFADQKSHHRCRRKKHECRMVDALR 269
BLAST of PI0020009 vs. NCBI nr
Match:
XP_022949469.1 (uncharacterized protein LOC111452804 isoform X2 [Cucurbita moschata])
HSP 1 Score: 452.6 bits (1163), Expect = 2.4e-123
Identity = 242/277 (87.36%), Positives = 245/277 (88.45%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFH----LPKTLDSDPHNINTSAPTNPKPYYSSTPLH 60
MSRRPLDSR SIDSCTLK H W PFH PKTLDSD H S PT KPYYSST LH
Sbjct: 1 MSRRPLDSRQSIDSCTLKLHTWRPFHHLHSAPKTLDSDTH---ISPPTTSKPYYSSTALH 60
Query: 61 TKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIPPA----RSFRLIARKRRRRGSRSVSG 120
TKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSI RSF LIARKRRRRGSRSVSG
Sbjct: 61 TKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGRYTRRSFGLIARKRRRRGSRSVSG 120
Query: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER 180
RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER
Sbjct: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER 180
Query: 181 EEKE-HLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 240
+EK+ HLG GFG SNGG D QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER
Sbjct: 181 DEKDHHLGGGFG-SNGGLDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 240
Query: 241 LGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDTLR 269
LGDSRMEIVGENTF+DQKSHHRCRRKKHECRMVDTLR
Sbjct: 241 LGDSRMEIVGENTFSDQKSHHRCRRKKHECRMVDTLR 273
BLAST of PI0020009 vs. NCBI nr
Match:
XP_023524455.1 (uncharacterized protein LOC111788369 [Cucurbita pepo subsp. pepo])
HSP 1 Score: 452.2 bits (1162), Expect = 3.1e-123
Identity = 242/277 (87.36%), Positives = 245/277 (88.45%), Query Frame = 0
Query: 1 MSRRPLDSRHSIDSCTLKFHGWTPFH----LPKTLDSDPHNINTSAPTNPKPYYSSTPLH 60
MSRRPLDSR SIDSCTLK H W PFH PKTLDSD H S PT KPYYSST LH
Sbjct: 1 MSRRPLDSRQSIDSCTLKLHTWRPFHHLHSAPKTLDSDTH---ISPPTTSKPYYSSTALH 60
Query: 61 TKRPCLSDRTTSFNVDAIDMSALSLIDDDKPSIPPA----RSFRLIARKRRRRGSRSVSG 120
TKRPCLSDRTTSFNVDAIDMS LSLIDDDKPSI RSF LIARKRRRRGSRSVSG
Sbjct: 61 TKRPCLSDRTTSFNVDAIDMSRLSLIDDDKPSIAAGPYTRRSFGLIARKRRRRGSRSVSG 120
Query: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER 180
RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER
Sbjct: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSEAKNSRRER 180
Query: 181 EEKE-HLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 240
+EK+ HLG GF SSNGG D QGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER
Sbjct: 181 DEKDHHLGGGF-SSNGGLDAQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLLWGER 240
Query: 241 LGDSRMEIVGENTFADQKSHHRCRRKKHECRMVDTLR 269
LGDSRMEIVGENTF+DQKSHHRCRRKKHECRMVDTLR
Sbjct: 241 LGDSRMEIVGENTFSDQKSHHRCRRKKHECRMVDTLR 273
BLAST of PI0020009 vs. TAIR 10
Match:
AT4G02425.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; Has 29 Blast hits to 28 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 6; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )
HSP 1 Score: 255.8 bits (652), Expect = 4.0e-68
Identity = 163/282 (57.80%), Positives = 192/282 (68.09%), Query Frame = 0
Query: 1 MSRRPLD-SRHSIDSCTLKFHGWTPFHLPKTLDSDPHNINTSAPTNPKPYYSSTPLHTKR 60
MS + L+ SR SI+SCT + W PFH KTLDS + P ++S TP KR
Sbjct: 1 MSPKHLESSRSSIESCTSQLLSWRPFHRSKTLDS------SDQPPQTNGFHSFTP---KR 60
Query: 61 PCLSDRTTSFNVDAIDMSALSLIDDDK-------PSIPPARSFRLIARKRRRRGSRSVSG 120
PC SDR+TSF ++A MS LSL DDD + SFRL+ARKRRRR SRSVSG
Sbjct: 61 PCFSDRSTSFTIEA--MSRLSLADDDNGGKTLSASNYSNRGSFRLVARKRRRRNSRSVSG 120
Query: 121 RSSDRSGTRRCCSVGASAAHGTCSDFPLAVGTDSSGELFVNGDANWSSDVSE-AKNSRRE 180
RSSDRSGTRRCCS+G AHGTCSD P AVGTDSSGELF G+ANW+SDVSE A+NSRRE
Sbjct: 121 RSSDRSGTRRCCSIG---AHGTCSDLPFAVGTDSSGELF--GEANWASDVSEAARNSRRE 180
Query: 181 RE----EKEHLGSGFGSSNGGFDVQGNESGYGSEPGYRGDGEFGYGDEIDEEDEDARLLL 240
R EKE G GFG +N G D GNESGYGSEPGYRGD EFGYGDE D+E+ED + L
Sbjct: 181 RRDSGGEKEASG-GFGFAN-GVDPMGNESGYGSEPGYRGDAEFGYGDEFDDEEEDVKPLF 240
Query: 241 WGERLGDSRMEIVGENTFADQKSHHRCRRKK-HECRMVDTLR 269
WG+ DS M + GE F+D K RCRR++ H+ + VD++R
Sbjct: 241 WGDT--DSTMGMSGETKFSDSKPQFRCRRRRQHDYKTVDSMR 262
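Each HSP block above reports identity and positives as a count over the alignment length, followed by a percentage. The sketch below shows how those percentages follow from the counts. It assumes the statistics line has exactly the layout shown on this page; `parse_hsp_stats` is a hypothetical helper, not part of any BLAST tool, and the pattern tolerates the "Postives" spelling that appears in the raw output.

```python
import re

# Matches e.g. "Identity = 260/268 (97.01%), Positives = 262/268 (97.76%)".
# "Posi?tives" also accepts the misspelling "Postives" seen in raw output.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP statistics line and recompute percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, positives, pos_length = map(int, match.groups())
    return {
        "identical": identical,
        "positives": positives,
        "alignment_length": length,
        # Percentages are count / alignment length, rounded to two decimals.
        "percent_identity": round(100 * identical / length, 2),
        "percent_positives": round(100 * positives / pos_length, 2),
    }


if __name__ == "__main__":
    line = "Identity = 260/268 (97.01%), Positives = 262/268 (97.76%), Query Frame = 0"
    print(parse_hsp_stats(line))
```

The alignment length in the denominator includes gap columns, which is why it can differ between hits to the same query (268 for the Cucumis hits, 282 for the TAIR hit).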
The following BLAST results are available for this feature:
BLAST of PI0020009 vs. ExPASy TrEMBL
Match Name | E-value | Identity (%) | Description
A0A0A0KT54 | 1.9e-142 | 97.01 | Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G001770 PE=4 SV=1
A0A1S3BYD4 | 1.9e-142 | 97.01 | uncharacterized protein LOC103494772 OS=Cucumis melo OX=3656 GN=LOC103494772 PE=...
A0A6J1GCW8 | 1.1e-123 | 87.36 | uncharacterized protein LOC111452804 isoform X2 OS=Cucurbita moschata OX=3662 GN...
A0A6J1J7N4 | 5.7e-123 | 86.91 | uncharacterized protein LOC111482487 OS=Cucurbita maxima OX=3661 GN=LOC111482487...
A0A6J1F5R5 | 2.2e-122 | 86.23 | uncharacterized protein LOC111442395 OS=Cucurbita moschata OX=3662 GN=LOC1114423...
BLAST of PI0020009 vs. NCBI nr
Match Name | E-value | Identity (%) | Description
XP_004152251.1 | 3.9e-142 | 97.01 | uncharacterized protein LOC101206482 [Cucumis sativus] >KGN52810.1 hypothetical ...
XP_008454343.1 | 3.9e-142 | 97.01 | PREDICTED: uncharacterized protein LOC103494772 [Cucumis melo]
XP_038906083.1 | 4.3e-133 | 91.91 | uncharacterized protein LOC120091971 [Benincasa hispida]
XP_022949469.1 | 2.4e-123 | 87.36 | uncharacterized protein LOC111452804 isoform X2 [Cucurbita moschata]
XP_023524455.1 | 3.1e-123 | 87.36 | uncharacterized protein LOC111788369 [Cucurbita pepo subsp. pepo]
BLAST of PI0020009 vs. TAIR 10
Match Name | E-value | Identity (%) | Description
AT4G02425.1 | 4.0e-68 | 57.80 | unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...