HG10003664 (gene) Bottle gourd (Hangzhou Gourd) v1

Overview
NameHG10003664
Typegene
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description40S ribosomal protein SA
LocationChr08: 5159910 .. 5163084 (-)
RNA-Seq ExpressionHG10003664
SyntenyHG10003664
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: polypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
ATGCCATATTGCGCCCGCAATGTAAGCACAATGTGGACGCAATTTCTGCACTTCCTAGCTTCACTTCATTCTGTCATCCGATGGAATTGCAACCATAATGTAGTTGCAACTTTGACATGAATTTCTACAAACAACACTAAAACATAAGAATAACTCATAATTTGATTCTTTTTACACTTATTTTGCTTTTTATTTCACTAATCAAGTTTTGACTTGTTTTTACATAAAATCACTTAATATACAATTGAATATGACTTTAAACACGTTTAGGGTAATTTCTTGTTACTATTTTGTTATTTAACTATCAATTCCCTTTTTTCTTCTTCTTTTTTTCTAGTAAAGTTAACTTTCATGTTTTATGACAACAAGGGCTTAATAAAAAAATCAATTGAATAAAAGATGAAATTGAAAAAAAAAATACTGTCATGTTACACAGGGAGTAATTGGATGTTTTTAGGGAAATTGTTGTGTATAGAAAAAAACAAAACTAATTATACATATAAAAAAAGTGCAACCATGCTTTTGTATTTCCCACTTTTTTTCTATATGTGTAATTAGTTTTGTTTTTTTTTTCTATTTGTGAAAAACTCTTTTTATCATTTTTTATATTTATTTTAAAAATCTATTTATTAATTTAATTTATCATCGAGGGTATTTTAGTAACTTTGTCAACCTAGGGTTTTCATTGTTGGATAGAATAAATATTGAAGTTGCTGTTCTTCTAACCCTCTTCAGTTCGTTGAGTATTGCAGTCGCCGAAGAGCTCCAATCCCTCCTTTCCTTCACTTCGTCGATTCCCAGATTCCGCCATGGCGACTGGAGCTGCACCGCCGCGTCAACTCTCACAGAAGGAGGCCGACATTCAGATGATGTTGGCCGCCGAGGTTCATCTTGGCACCAAAAACTGCGATTTCCAAATGGAACGATACGTTTTTAAGCGCCGAAATGATGGTAAGTTCTTAATCTTCTCGCCTTTAAATCTTCCATGGTTTGTTACTCGAAGTTCTGTTTTCTATTTTTCCTTTTTCTCTGCGATTTTCGCTTTGACTGATATTTACAGTGCTAATGGGCGTGTTGTTATTGTTCTTATTTTTTCCCCTAGAATTTCTTAGTAACATATTGTGATTGAAATTTTATTTATTTTTAGGTGTGAAGTTTCAGTTTCTGGCTATTATTCTGTTTATCTCGTTAAATTTTACTTCATTTTCCGAGGAATTTTCTCCAAATTCTCCCTCGATGGTTTTTACTTGTTTGGTGTGACAAGCACTTCTGTTGCCGTTTTCAATTTTTTCGCTGAATGTTGATTAATACGGTTTCAGAAATGAACAACAGCCATACGTGTTGAGCTGGAGAACTTAGTCTAATTTACATGGTTGTACATGGTTCTGAGAATCGGTTTTCCTTAGTCTTTTGCCTAAAAGGAACTTTGAGTTTATTGAAGAGGCTCAATGAGGATTATCTATACAATATATATATGTTAGGTTATTTTCATTTTTAGGTTGCGGTCGTGTATGATTTTTTTAAAAAAATTCTCTTGTTTTTAGGAATCTACATCATTAACCTTGGTAAAACATGGGAGAAACTTCAGCTGGCTGCAAGAGTTATTGTGGGTATAGAGAATCCTCAAGATATCATTGTTCAGTCTGCTAGACCCTATGGTCAGAGGGCTGTTTTGAAGTTTGCTCAGCACACTGGTGCTCATGCTATTGCTGGGAGGCACACTCCAGGAACATTCACCAACCAGCTTCAGACGTCATTCAATGAACCACGTCTTCTTATACTTACCGATCCTAGAACTGATCATCAGGTGTTTAATTTAGAAATCAAGAACAGTACATGGTTTTAAATTTAACTCCAAGAAGAATAGTGCGTTATATTGTCAATTATGATTCCATTCATGCTTGTTCCCTAAATTTATTTGTGTTTGGATTCACTTGATGCTGTTCTTTATTGATGTCGCCTTGTTGATTCTGACTTGCAGCCCATTAAAGAAGCTGCCCTTGGTAACATTCCCACAATAGCCTTTTGTGACACTGATTCTCCAATGCGGTATGTTGATATTGGAATCCCGGCTAACAATAAGGGGAAACATAGCATAGGATGTCTGTTCTGGCTTTTGGCAAGGATGGTTCTACAAATGCGTGGGACAATTCGTGCTGGGCACAAATGGGATGTGATGGTAAAAAATCTTAATACAATTTTCTATTAGGACTTTAGAAACAAAATTTGGACTAAGTTTTTTATTGACAATTATTTGTGGGAACTTTTTCCTTTTGTAGGTGGATTTGTTTTTCTATCGCGAGCCTGAGGAGGCTAAGGAGCCAGAAGAGGAAGAGGCTCTTCCACCCCCAGATTTTGGCATTGCTGACTATGGTGCTGCTCCACTTGTTGGCTCAGATCAATGGACTGCACAAATTTCTGATGCGCAGTGGGGTGCTGCAGATGCAGTTCCTGCTGCTACTGTTCCACCTGCTTCCAGTGTCGAGTGGGCTCAGGAGCCAGGTCGAATATAAGTTTTTGTTTTTACTTGTTTGTTTGAATAGCTATCTGGCCTTTGGTGTGTCTATTCATGTAAACTAGTTTTGGTCTCTTTGAAGAACTGTTTCCTCTCAGGCTAGATGATTCATTTTGTCTGCTTTCCATATGAAGTTTTTGAGTAGTATCTATGTCATACTGCCAATTAACCCTGTTCTTTGCCATTACATTGTTTTTGCATCTTAGACATCATTGTCCTGGAGTGTAAGATGCAACTAACTAGTTGGTCTCAGGTTGATTGTCTGTTTTGAGTTGAACAGAAGAATATGACAGTTTTTTAGATATTAGATTTTTATGCTTGGACTATATTTGAGTCATTAGTCATGGAAAGAGTGCTTTTAAATGGTAAAAAGTACTCTTCTTAACCGGAGTAAATATTGATACTTCTACAATTTAGTACTTCCATAAAAGCAATTGATATCTTGCGTAGCTTTTTAACACTTATATTTAATCTCTTGTAACATTAATTTTTTTTCCTACTTACTTTCTACAAGCATTTGTACTTGCTTTCTCACCCTTGTTCCCGTTTGTTGCAGTTGCTTTGGCAGGCGATGGATGGGATGCTGCAGCAGCACCACCACCACCAGTGGCGGCACCTGCTGCTGATGCTGCGGCACCTTCCTCCGCCAGTTGGTTTTGA

mRNA sequence

ATGCCATATTGCGCCCGCAATTCGCCGAAGAGCTCCAATCCCTCCTTTCCTTCACTTCGTCGATTCCCAGATTCCGCCATGGCGACTGGAGCTGCACCGCCGCGTCAACTCTCACAGAAGGAGGCCGACATTCAGATGATGTTGGCCGCCGAGGTTCATCTTGGCACCAAAAACTGCGATTTCCAAATGGAACGATACGTTTTTAAGCGCCGAAATGATGGAATCTACATCATTAACCTTGGTAAAACATGGGAGAAACTTCAGCTGGCTGCAAGAGTTATTGTGGGTATAGAGAATCCTCAAGATATCATTGTTCAGTCTGCTAGACCCTATGGTCAGAGGGCTGTTTTGAAGTTTGCTCAGCACACTGGTGCTCATGCTATTGCTGGGAGGCACACTCCAGGAACATTCACCAACCAGCTTCAGACGTCATTCAATGAACCACGTCTTCTTATACTTACCGATCCTAGAACTGATCATCAGCCCATTAAAGAAGCTGCCCTTGGTAACATTCCCACAATAGCCTTTTGTGACACTGATTCTCCAATGCGGTATGTTGATATTGGAATCCCGGCTAACAATAAGGGGAAACATAGCATAGGATGTCTGTTCTGGCTTTTGGCAAGGATGGTTCTACAAATGCGTGGGACAATTCGTGCTGGGCACAAATGGGATGTGATGGTGGATTTGTTTTTCTATCGCGAGCCTGAGGAGGCTAAGGAGCCAGAAGAGGAAGAGGCTCTTCCACCCCCAGATTTTGGCATTGCTGACTATGGTGCTGCTCCACTTGTTGGCTCAGATCAATGGACTGCACAAATTTCTGATGCGCAGTGGGGTGCTGCAGATGCAGTTCCTGCTGCTACTGTTCCACCTGCTTCCAGTGTCGAGTGGGCTCAGGAGCCAGTTGCTTTGGCAGGCGATGGATGGGATGCTGCAGCAGCACCACCACCACCAGTGGCGGCACCTGCTGCTGATGCTGCGGCACCTTCCTCCGCCAGTTGGTTTTGA

Coding sequence (CDS)

ATGCCATATTGCGCCCGCAATTCGCCGAAGAGCTCCAATCCCTCCTTTCCTTCACTTCGTCGATTCCCAGATTCCGCCATGGCGACTGGAGCTGCACCGCCGCGTCAACTCTCACAGAAGGAGGCCGACATTCAGATGATGTTGGCCGCCGAGGTTCATCTTGGCACCAAAAACTGCGATTTCCAAATGGAACGATACGTTTTTAAGCGCCGAAATGATGGAATCTACATCATTAACCTTGGTAAAACATGGGAGAAACTTCAGCTGGCTGCAAGAGTTATTGTGGGTATAGAGAATCCTCAAGATATCATTGTTCAGTCTGCTAGACCCTATGGTCAGAGGGCTGTTTTGAAGTTTGCTCAGCACACTGGTGCTCATGCTATTGCTGGGAGGCACACTCCAGGAACATTCACCAACCAGCTTCAGACGTCATTCAATGAACCACGTCTTCTTATACTTACCGATCCTAGAACTGATCATCAGCCCATTAAAGAAGCTGCCCTTGGTAACATTCCCACAATAGCCTTTTGTGACACTGATTCTCCAATGCGGTATGTTGATATTGGAATCCCGGCTAACAATAAGGGGAAACATAGCATAGGATGTCTGTTCTGGCTTTTGGCAAGGATGGTTCTACAAATGCGTGGGACAATTCGTGCTGGGCACAAATGGGATGTGATGGTGGATTTGTTTTTCTATCGCGAGCCTGAGGAGGCTAAGGAGCCAGAAGAGGAAGAGGCTCTTCCACCCCCAGATTTTGGCATTGCTGACTATGGTGCTGCTCCACTTGTTGGCTCAGATCAATGGACTGCACAAATTTCTGATGCGCAGTGGGGTGCTGCAGATGCAGTTCCTGCTGCTACTGTTCCACCTGCTTCCAGTGTCGAGTGGGCTCAGGAGCCAGTTGCTTTGGCAGGCGATGGATGGGATGCTGCAGCAGCACCACCACCACCAGTGGCGGCACCTGCTGCTGATGCTGCGGCACCTTCCTCCGCCAGTTGGTTTTGA

Protein sequence

MPYCARNSPKSSNPSFPSLRRFPDSAMATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEKLQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFNEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWLLARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGSDQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADAAAPSSASWF
Homology
BLAST of HG10003664 vs. NCBI nr
Match: XP_038886558.1 (40S ribosomal protein SA-like [Benincasa hispida])

HSP 1 Score: 583.6 bits (1503), Expect = 1.1e-162
Identity = 294/312 (94.23%), Postives = 297/312 (95.19%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQL+QKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLAQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVG+
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGA 240

Query: 267 DQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADA 326
           DQWTAQI DAQWGA DAVPAA VPPAS+VEWAQEPVALA DGWDAA  P P  AA  A A
Sbjct: 241 DQWTAQIPDAQWGAGDAVPAAAVPPASNVEWAQEPVALAADGWDAAVPPVP--AADGAVA 300

Query: 327 AA---PSSASWF 336
           AA   PSSASWF
Sbjct: 301 AAPPPPSSASWF 310

BLAST of HG10003664 vs. NCBI nr
Match: XP_004138372.1 (40S ribosomal protein SA [Cucumis sativus] >KGN45883.1 hypothetical protein Csa_005754 [Cucumis sativus])

HSP 1 Score: 571.2 bits (1471), Expect = 5.7e-159
Identity = 288/310 (92.90%), Postives = 293/310 (94.52%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPP DFGIADY AAPL  S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPADFGIADYSAAPLT-S 240

Query: 267 DQWTAQISDAQWGAADAVPA-ATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAAD 326
           DQWT+QI DAQWGAADA+P+ A V PAS  EWA EPVALA DGWDAAAAPPPP A  +A+
Sbjct: 241 DQWTSQIPDAQWGAADAIPSPAPVVPASGAEWAPEPVALAADGWDAAAAPPPPPAV-SAE 300

Query: 327 AAAPSSASWF 336
             APSSASWF
Sbjct: 301 GTAPSSASWF 308

BLAST of HG10003664 vs. NCBI nr
Match: XP_008463065.1 (PREDICTED: 40S ribosomal protein SA-like [Cucumis melo] >KAA0039133.1 40S ribosomal protein SA-like [Cucumis melo var. makuwa] >TYK26980.1 40S ribosomal protein SA-like [Cucumis melo var. makuwa])

HSP 1 Score: 567.4 bits (1461), Expect = 8.3e-158
Identity = 286/310 (92.26%), Postives = 290/310 (93.55%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPP DFGIADY AAPL  S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPADFGIADYSAAPL-AS 240

Query: 267 DQWTAQISDAQWGAADAV-PAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAAD 326
           DQWT+QI DAQWGAADA+ P A V PAS  EWA EPVAL  DGWDAAA PPPP A  +AD
Sbjct: 241 DQWTSQIPDAQWGAADALPPPAAVVPASGAEWAPEPVALTADGWDAAAPPPPPAA--SAD 300

Query: 327 AAAPSSASWF 336
             AP+SASWF
Sbjct: 301 GTAPASASWF 307

BLAST of HG10003664 vs. NCBI nr
Match: XP_022925394.1 (40S ribosomal protein SA-like [Cucurbita moschata] >XP_022973502.1 40S ribosomal protein SA-like [Cucurbita maxima] >XP_023535006.1 40S ribosomal protein SA-like [Cucurbita pepo subsp. pepo] >KAG6592327.1 40S ribosomal protein SA, partial [Cucurbita argyrosperma subsp. sororia] >KAG7025148.1 40S ribosomal protein SA [Cucurbita argyrosperma subsp. argyrosperma])

HSP 1 Score: 566.6 bits (1459), Expect = 1.4e-157
Identity = 283/309 (91.59%), Postives = 288/309 (93.20%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAP RQLSQKEADIQMMLAAEVHLGTKNC+FQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPARQLSQKEADIQMMLAAEVHLGTKNCNFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           L +AARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TG HAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LYMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGTHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKE ALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEGALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKE EEEEALPPPDFGIADYGAAPLV S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEKEEEEALPPPDFGIADYGAAPLVAS 240

Query: 267 DQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADA 326
           DQW AQISDAQWGAAD VPA   P ASSVEWA EPV LA + W+AA APPPP A PAAD 
Sbjct: 241 DQWAAQISDAQWGAADVVPAPAAPAASSVEWAPEPVPLAANEWEAAVAPPPP-APPAADG 300

Query: 327 AAPSSASWF 336
           AAPSSASWF
Sbjct: 301 AAPSSASWF 308

BLAST of HG10003664 vs. NCBI nr
Match: XP_022972886.1 (40S ribosomal protein SA-like [Cucurbita maxima])

HSP 1 Score: 561.2 bits (1445), Expect = 5.9e-156
Identity = 284/315 (90.16%), Postives = 293/315 (93.02%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNC+FQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCNFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQ+TGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKE ALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEGALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKE EEEEALPPPDFGIADYGAAPLV S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEKEEEEALPPPDFGIADYGAAPLVAS 240

Query: 267 DQWTAQISDAQWGAAD-AVPAAT---VPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAP 326
           DQW+AQI DAQWGAAD  VPAAT   +PP SSVEW  EP ALA DGWDAA +  PP++AP
Sbjct: 241 DQWSAQIPDAQWGAADVVVPAATAAAIPPPSSVEWTSEPAALAADGWDAAVS--PPLSAP 300

Query: 327 AADAAAPSSAS--WF 336
           A D AAP SA+  W+
Sbjct: 301 AVDGAAPPSANGGWY 313

BLAST of HG10003664 vs. ExPASy Swiss-Prot
Match: O80377 (40S ribosomal protein SA OS=Daucus carota OX=4039 GN=179B PE=2 SV=1)

HSP 1 Score: 458.4 bits (1178), Expect = 7.1e-128
Identity = 242/304 (79.61%), Postives = 258/304 (84.87%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MA+GA   R+LS  EADIQMM AAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MASGA---RELSTMEADIQMMCAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           L LAARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TGAHAIAGRHTPGTFTNQLQTSF+
Sbjct: 61  LMLAARVIVSIENPQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSFS 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRG I  GHKW+VMVDLFFYREPEE K+ EEE+ LP  D+ +ADY AAP+ G+
Sbjct: 181 LARMVLQMRGVISQGHKWEVMVDLFFYREPEETKDQEEED-LPVGDY-VADYAAAPIGGA 240

Query: 267 DQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADA 326
           DQW A I DAQWG     PA +  PA +  W  +   +AGDGWDAAAAPP P  A   D 
Sbjct: 241 DQWNA-IPDAQWGGDVVQPANSAVPAGT--WT-DAGPIAGDGWDAAAAPPVP-GAVGLDV 294

Query: 327 AAPS 331
            AP+
Sbjct: 301 PAPT 294

BLAST of HG10003664 vs. ExPASy Swiss-Prot
Match: A5BUU4 (40S ribosomal protein SA OS=Vitis vinifera OX=29760 GN=GSVIVT00034021001 PE=3 SV=1)

HSP 1 Score: 457.2 bits (1175), Expect = 1.6e-127
Identity = 239/312 (76.60%), Postives = 251/312 (80.45%), Query Frame = 0

Query: 31  AAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEKLQLA 90
           A P R LSQKE DIQMMLAAEVHLGTKNC+FQMERYVFKRRNDGIYIINLGKTWEKLQLA
Sbjct: 2   ATPTRALSQKEQDIQMMLAAEVHLGTKNCNFQMERYVFKRRNDGIYIINLGKTWEKLQLA 61

Query: 91  ARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFNEPRL 150
           ARVIV IENP+DIIVQSARPYGQRAVLKFAQ+TGAHAIAGRHTPGTFTNQLQTSF+EPRL
Sbjct: 62  ARVIVAIENPKDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSFSEPRL 121

Query: 151 LILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWLLARM 210
           LILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWLLARM
Sbjct: 122 LILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWLLARM 181

Query: 211 VLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPL--VGSDQ 270
           VLQMR TI  GHKWDVMVDLFFYREPEE KE E +E +  PD+GI DY A  L  +G   
Sbjct: 182 VLQMRRTIAPGHKWDVMVDLFFYREPEEPKEQEGDEVVAAPDYGITDYQATALGGLGGGD 241

Query: 271 WTAQISDA-QWGAADAVPAATVPPASSVEWAQ-----EPVALAGDGWDAAAAPPPPVAAP 330
           W A I+DA QWG AD    A +P      W         V +A DGWDAAAAPP  V  P
Sbjct: 242 WGAPITDAPQWG-ADVPAVAPIPAVPGSNWGDAAPMPSAVPIATDGWDAAAAPPVAVPPP 301

Query: 331 AADAAAPSSASW 335
             +   P  A W
Sbjct: 302 VVE-GVPPPAGW 311

BLAST of HG10003664 vs. ExPASy Swiss-Prot
Match: O22518 (40S ribosomal protein SA OS=Glycine max OX=3847 PE=2 SV=1)

HSP 1 Score: 454.5 bits (1168), Expect = 1.0e-126
Identity = 237/313 (75.72%), Postives = 252/313 (80.51%), Query Frame = 0

Query: 25  SAMATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTW 84
           +A    AAPPRQLSQKEADIQMMLAA+VHLGTKNCDFQMERY+FKRRNDGIYIINLGKTW
Sbjct: 3   TATNAAAAPPRQLSQKEADIQMMLAADVHLGTKNCDFQMERYIFKRRNDGIYIINLGKTW 62

Query: 85  EKLQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTS 144
           EKLQLAARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TGAHAIAGRHTPGTFTNQLQTS
Sbjct: 63  EKLQLAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTS 122

Query: 145 FNEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLF 204
           F+EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLF
Sbjct: 123 FSEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLF 182

Query: 205 WLLARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLV 264
           WLLARMVLQMRGTIR G KWDVMVDLFFYREPEEAK+ EEEEA P  D+ I D+ A  + 
Sbjct: 183 WLLARMVLQMRGTIRPGLKWDVMVDLFFYREPEEAKQQEEEEA-PAVDYAITDFNAGAIA 242

Query: 265 GSDQWTAQISDAQWGAADAVPAATVPPASSVEW---AQEPVALAGDGWDAAAAPPPPVAA 324
              QW   I D  W  +DAVP   +P    V W   A+ P A  GD W  A  PP  +  
Sbjct: 243 ADGQWPGTI-DQSW--SDAVP-QPIPAVPGVNWGAPAEAPAAAGGD-WGEAVPPPQQIPV 302

Query: 325 PAADAAAPSSASW 335
           P +       + W
Sbjct: 303 PPSGIDTVQPSGW 309

BLAST of HG10003664 vs. ExPASy Swiss-Prot
Match: O65751 (40S ribosomal protein SA OS=Cicer arietinum OX=3827 GN=RAP40 PE=2 SV=1)

HSP 1 Score: 445.7 bits (1145), Expect = 4.8e-124
Identity = 233/308 (75.65%), Postives = 253/308 (82.14%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MAT  AP RQL+QKEADIQMMLAA+VHLGTKNC+FQMERY+FKRRNDGIYIINLGKTW+K
Sbjct: 1   MATTTAPSRQLTQKEADIQMMLAADVHLGTKNCNFQMERYIFKRRNDGIYIINLGKTWDK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           L LAAR+IV IEN QDIIVQSARPYGQRAVLKFAQ+TGAHAIAGRHTPGTFTNQLQTSF+
Sbjct: 61  LNLAARIIVAIENSQDIIVQSARPYGQRAVLKFAQYTGAHAIAGRHTPGTFTNQLQTSFS 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPM YVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMNYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIR G KWDVMVDLFFYREPEEAK+PEE+E +  PD+ IAD+  + +   
Sbjct: 181 LARMVLQMRGTIRPGLKWDVMVDLFFYREPEEAKQPEEDE-VAAPDYAIADFNVSAIPSD 240

Query: 267 DQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADA 326
            QW A I D  W   DAVP   +P   +V WA  P A+AGD W  A   PPP   P A  
Sbjct: 241 GQWPAAI-DQPWN--DAVP-QPIPAVPAVNWA-APEAVAGD-WGEAV--PPPQQIPTAGI 299

Query: 327 AAPSSASW 335
            +  +  W
Sbjct: 301 ESVPATGW 299

BLAST of HG10003664 vs. ExPASy Swiss-Prot
Match: Q08682 (40S ribosomal protein Sa-1 OS=Arabidopsis thaliana OX=3702 GN=RPSaA PE=1 SV=3)

HSP 1 Score: 441.0 bits (1133), Expect = 1.2e-122
Identity = 227/300 (75.67%), Postives = 247/300 (82.33%), Query Frame = 0

Query: 27  MAT-GAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWE 86
           MAT G+A   QLSQKEAD++MM AAEVHLGTKNC++QMERYVFKRRNDGIYI NLGKTWE
Sbjct: 1   MATNGSASSAQLSQKEADVRMMCAAEVHLGTKNCNYQMERYVFKRRNDGIYIFNLGKTWE 60

Query: 87  KLQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSF 146
           KLQ+AARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TGA+AIAGRHTPGTFTNQ+QTSF
Sbjct: 61  KLQMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGANAIAGRHTPGTFTNQMQTSF 120

Query: 147 NEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFW 206
           +EPRLLILTDPRTDHQPIKE ALGNIP IAFCDTDSPMR+VDIGIPANNKGKHSIGCLFW
Sbjct: 121 SEPRLLILTDPRTDHQPIKEGALGNIPIIAFCDTDSPMRFVDIGIPANNKGKHSIGCLFW 180

Query: 207 LLARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVG 266
           LLARMVLQMRGTI AG KWDVMVDLFFYREPEE K  +E+EA P  ++G        +VG
Sbjct: 181 LLARMVLQMRGTIAAGQKWDVMVDLFFYREPEETKPEDEDEAGPQAEYGALPAPEYGMVG 240

Query: 267 SDQW-TAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAA 325
            DQW TAQI DA W      P +  P A+S  W+    A A  GW+AAA   PP  APAA
Sbjct: 241 GDQWTTAQIPDAAWPGEGQAPISAAPAAAS--WSDSAAAPADGGWEAAA---PPSGAPAA 295

BLAST of HG10003664 vs. ExPASy TrEMBL
Match: A0A0A0K8X1 (40S ribosomal protein SA OS=Cucumis sativus OX=3659 GN=Csa_6G016990 PE=3 SV=1)

HSP 1 Score: 571.2 bits (1471), Expect = 2.8e-159
Identity = 288/310 (92.90%), Postives = 293/310 (94.52%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPP DFGIADY AAPL  S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPADFGIADYSAAPLT-S 240

Query: 267 DQWTAQISDAQWGAADAVPA-ATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAAD 326
           DQWT+QI DAQWGAADA+P+ A V PAS  EWA EPVALA DGWDAAAAPPPP A  +A+
Sbjct: 241 DQWTSQIPDAQWGAADAIPSPAPVVPASGAEWAPEPVALAADGWDAAAAPPPPPAV-SAE 300

Query: 327 AAAPSSASWF 336
             APSSASWF
Sbjct: 301 GTAPSSASWF 308

BLAST of HG10003664 vs. ExPASy TrEMBL
Match: A0A5D3DUK0 (40S ribosomal protein SA OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold322G00510 PE=3 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 4.0e-158
Identity = 286/310 (92.26%), Postives = 290/310 (93.55%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPP DFGIADY AAPL  S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPADFGIADYSAAPL-AS 240

Query: 267 DQWTAQISDAQWGAADAV-PAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAAD 326
           DQWT+QI DAQWGAADA+ P A V PAS  EWA EPVAL  DGWDAAA PPPP A  +AD
Sbjct: 241 DQWTSQIPDAQWGAADALPPPAAVVPASGAEWAPEPVALTADGWDAAAPPPPPAA--SAD 300

Query: 327 AAAPSSASWF 336
             AP+SASWF
Sbjct: 301 GTAPASASWF 307

BLAST of HG10003664 vs. ExPASy TrEMBL
Match: A0A1S3CJX7 (40S ribosomal protein SA OS=Cucumis melo OX=3656 GN=LOC103501303 PE=3 SV=1)

HSP 1 Score: 567.4 bits (1461), Expect = 4.0e-158
Identity = 286/310 (92.26%), Postives = 290/310 (93.55%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPP DFGIADY AAPL  S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPADFGIADYSAAPL-AS 240

Query: 267 DQWTAQISDAQWGAADAV-PAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAAD 326
           DQWT+QI DAQWGAADA+ P A V PAS  EWA EPVAL  DGWDAAA PPPP A  +AD
Sbjct: 241 DQWTSQIPDAQWGAADALPPPAAVVPASGAEWAPEPVALTADGWDAAAPPPPPAA--SAD 300

Query: 327 AAAPSSASWF 336
             AP+SASWF
Sbjct: 301 GTAPASASWF 307

BLAST of HG10003664 vs. ExPASy TrEMBL
Match: A0A6J1I7P9 (40S ribosomal protein SA OS=Cucurbita maxima OX=3661 GN=LOC111472042 PE=3 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 6.8e-158
Identity = 283/309 (91.59%), Postives = 288/309 (93.20%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAP RQLSQKEADIQMMLAAEVHLGTKNC+FQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPARQLSQKEADIQMMLAAEVHLGTKNCNFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           L +AARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TG HAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LYMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGTHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKE ALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEGALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKE EEEEALPPPDFGIADYGAAPLV S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEKEEEEALPPPDFGIADYGAAPLVAS 240

Query: 267 DQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADA 326
           DQW AQISDAQWGAAD VPA   P ASSVEWA EPV LA + W+AA APPPP A PAAD 
Sbjct: 241 DQWAAQISDAQWGAADVVPAPAAPAASSVEWAPEPVPLAANEWEAAVAPPPP-APPAADG 300

Query: 327 AAPSSASWF 336
           AAPSSASWF
Sbjct: 301 AAPSSASWF 308

BLAST of HG10003664 vs. ExPASy TrEMBL
Match: A0A6J1EC15 (40S ribosomal protein SA OS=Cucurbita moschata OX=3662 GN=LOC111432696 PE=3 SV=1)

HSP 1 Score: 566.6 bits (1459), Expect = 6.8e-158
Identity = 283/309 (91.59%), Postives = 288/309 (93.20%), Query Frame = 0

Query: 27  MATGAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEK 86
           MATGAAP RQLSQKEADIQMMLAAEVHLGTKNC+FQMERYVFKRRNDGIYIINLGKTWEK
Sbjct: 1   MATGAAPARQLSQKEADIQMMLAAEVHLGTKNCNFQMERYVFKRRNDGIYIINLGKTWEK 60

Query: 87  LQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFN 146
           L +AARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TG HAIAGRHTPGTFTNQLQTSFN
Sbjct: 61  LYMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGTHAIAGRHTPGTFTNQLQTSFN 120

Query: 147 EPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 206
           EPRLLILTDPRTDHQPIKE ALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL
Sbjct: 121 EPRLLILTDPRTDHQPIKEGALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWL 180

Query: 207 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGS 266
           LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKE EEEEALPPPDFGIADYGAAPLV S
Sbjct: 181 LARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEKEEEEALPPPDFGIADYGAAPLVAS 240

Query: 267 DQWTAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAADA 326
           DQW AQISDAQWGAAD VPA   P ASSVEWA EPV LA + W+AA APPPP A PAAD 
Sbjct: 241 DQWAAQISDAQWGAADVVPAPAAPAASSVEWAPEPVPLAANEWEAAVAPPPP-APPAADG 300

Query: 327 AAPSSASWF 336
           AAPSSASWF
Sbjct: 301 AAPSSASWF 308

BLAST of HG10003664 vs. TAIR 10
Match: AT1G72370.1 (40s ribosomal protein SA )

HSP 1 Score: 441.0 bits (1133), Expect = 8.3e-124
Identity = 227/300 (75.67%), Postives = 247/300 (82.33%), Query Frame = 0

Query: 27  MAT-GAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWE 86
           MAT G+A   QLSQKEAD++MM AAEVHLGTKNC++QMERYVFKRRNDGIYI NLGKTWE
Sbjct: 1   MATNGSASSAQLSQKEADVRMMCAAEVHLGTKNCNYQMERYVFKRRNDGIYIFNLGKTWE 60

Query: 87  KLQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSF 146
           KLQ+AARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TGA+AIAGRHTPGTFTNQ+QTSF
Sbjct: 61  KLQMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGANAIAGRHTPGTFTNQMQTSF 120

Query: 147 NEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFW 206
           +EPRLLILTDPRTDHQPIKE ALGNIP IAFCDTDSPMR+VDIGIPANNKGKHSIGCLFW
Sbjct: 121 SEPRLLILTDPRTDHQPIKEGALGNIPIIAFCDTDSPMRFVDIGIPANNKGKHSIGCLFW 180

Query: 207 LLARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVG 266
           LLARMVLQMRGTI AG KWDVMVDLFFYREPEE K  +E+EA P  ++G        +VG
Sbjct: 181 LLARMVLQMRGTIAAGQKWDVMVDLFFYREPEETKPEDEDEAGPQAEYGALPAPEYGMVG 240

Query: 267 SDQW-TAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAA 325
            DQW TAQI DA W      P +  P A+S  W+    A A  GW+AAA   PP  APAA
Sbjct: 241 GDQWTTAQIPDAAWPGEGQAPISAAPAAAS--WSDSAAAPADGGWEAAA---PPSGAPAA 295

BLAST of HG10003664 vs. TAIR 10
Match: AT1G72370.2 (40s ribosomal protein SA )

HSP 1 Score: 436.0 bits (1120), Expect = 2.7e-122
Identity = 226/300 (75.33%), Postives = 246/300 (82.00%), Query Frame = 0

Query: 27  MAT-GAAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWE 86
           MAT G+A   QLSQKEAD++MM AAEVHLGTKNC++QMERYVFKRRNDGIYI NLGKTWE
Sbjct: 1   MATNGSASSAQLSQKEADVRMMCAAEVHLGTKNCNYQMERYVFKRRNDGIYIFNLGKTWE 60

Query: 87  KLQLAARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSF 146
           KLQ+AARVIV IENPQDIIVQSARPYGQRAVLKFAQ+TGA+AIAGRHTPGTFTNQ+QTSF
Sbjct: 61  KLQMAARVIVAIENPQDIIVQSARPYGQRAVLKFAQYTGANAIAGRHTPGTFTNQMQTSF 120

Query: 147 NEPRLLILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFW 206
           +EPRLLILTDPRTDHQPIKE ALGNIP IAFCDTDSPMR+VDIGIPANNKGKHSIGCLFW
Sbjct: 121 SEPRLLILTDPRTDHQPIKEGALGNIPIIAFCDTDSPMRFVDIGIPANNKGKHSIGCLFW 180

Query: 207 LLARMVLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVG 266
           LLARMVLQMRGTI AG KWDVMVDLFFYREPEE K  +E+EA P  ++G        +VG
Sbjct: 181 LLARMVLQMRGTIAAGQKWDVMVDLFFYREPEETKPEDEDEAGPQAEYGALPAPEYGMVG 240

Query: 267 SDQW-TAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAPPPPVAAPAA 325
            DQW TAQI DA W      P +  P A+S  W+      A  GW+AAA   PP  APAA
Sbjct: 241 GDQWTTAQIPDAAWPGEGQAPISAAPAAAS--WSDS----ADGGWEAAA---PPSGAPAA 291

BLAST of HG10003664 vs. TAIR 10
Match: AT3G04770.2 (40s ribosomal protein SA B )

HSP 1 Score: 415.6 bits (1067), Expect = 3.7e-116
Identity = 212/286 (74.13%), Postives = 235/286 (82.17%), Query Frame = 0

Query: 31  AAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEKLQLA 90
           A   RQ+S+KEADIQMML+A+VHLGTKNC++QMERYVFKRR+DGIYIINLGKTW+KLQ+A
Sbjct: 7   ATAGRQVSEKEADIQMMLSADVHLGTKNCNYQMERYVFKRRDDGIYIINLGKTWDKLQMA 66

Query: 91  ARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFNEPRL 150
           ARVIV IENP+DIIVQSARPYGQRAVLKFAQ+TG +AIAGRHTPGTFTNQ+QTSF+EPRL
Sbjct: 67  ARVIVAIENPKDIIVQSARPYGQRAVLKFAQYTGVNAIAGRHTPGTFTNQMQTSFSEPRL 126

Query: 151 LILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWLLARM 210
           LILTDPRTDHQPIKE ALGNIPTIAFCDTDSPM +VDIGIPANNKGKHSIGCLFWLLARM
Sbjct: 127 LILTDPRTDHQPIKEGALGNIPTIAFCDTDSPMGFVDIGIPANNKGKHSIGCLFWLLARM 186

Query: 211 VLQMRGTIRAGHKWDVMVDLFFYREPEEAKEPEEEEALPPPDFGIADYGAAPLVGSDQW- 270
           VLQMRGTI A  KWDVMVDLFFYREPEEAK+  +EEA    D+G        +VG DQW 
Sbjct: 187 VLQMRGTILAAQKWDVMVDLFFYREPEEAKQEGDEEAEVQADYG--------MVGGDQWT 246

Query: 271 TAQISDAQWGAADAVPAATVPPASSVEWAQEPVALAGDGWDAAAAP 316
           TAQISDA W      P +  P           V +A  GW+AA+ P
Sbjct: 247 TAQISDAAWSGEVEQPISAAPAVG--------VTVAA-GWEAASVP 275

BLAST of HG10003664 vs. TAIR 10
Match: AT3G04770.1 (40s ribosomal protein SA B )

HSP 1 Score: 365.2 bits (936), Expect = 5.8e-101
Identity = 173/199 (86.93%), Postives = 189/199 (94.97%), Query Frame = 0

Query: 31  AAPPRQLSQKEADIQMMLAAEVHLGTKNCDFQMERYVFKRRNDGIYIINLGKTWEKLQLA 90
           A   RQ+S+KEADIQMML+A+VHLGTKNC++QMERYVFKRR+DGIYIINLGKTW+KLQ+A
Sbjct: 7   ATAGRQVSEKEADIQMMLSADVHLGTKNCNYQMERYVFKRRDDGIYIINLGKTWDKLQMA 66

Query: 91  ARVIVGIENPQDIIVQSARPYGQRAVLKFAQHTGAHAIAGRHTPGTFTNQLQTSFNEPRL 150
           ARVIV IENP+DIIVQSARPYGQRAVLKFAQ+TG +AIAGRHTPGTFTNQ+QTSF+EPRL
Sbjct: 67  ARVIVAIENPKDIIVQSARPYGQRAVLKFAQYTGVNAIAGRHTPGTFTNQMQTSFSEPRL 126

Query: 151 LILTDPRTDHQPIKEAALGNIPTIAFCDTDSPMRYVDIGIPANNKGKHSIGCLFWLLARM 210
           LILTDPRTDHQPIKE ALGNIPTIAFCDTDSPM +VDIGIPANNKGKHSIGCLFWLLARM
Sbjct: 127 LILTDPRTDHQPIKEGALGNIPTIAFCDTDSPMGFVDIGIPANNKGKHSIGCLFWLLARM 186

Query: 211 VLQMRGTIRAGHKWDVMVD 230
           VLQMRGTI A  KWDVMV+
Sbjct: 187 VLQMRGTILAAQKWDVMVN 205

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
XP_038886558.11.1e-16294.2340S ribosomal protein SA-like [Benincasa hispida][more]
XP_004138372.15.7e-15992.9040S ribosomal protein SA [Cucumis sativus] >KGN45883.1 hypothetical protein Csa_... [more]
XP_008463065.18.3e-15892.26PREDICTED: 40S ribosomal protein SA-like [Cucumis melo] >KAA0039133.1 40S riboso... [more]
XP_022925394.11.4e-15791.5940S ribosomal protein SA-like [Cucurbita moschata] >XP_022973502.1 40S ribosomal... [more]
XP_022972886.15.9e-15690.1640S ribosomal protein SA-like [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
O803777.1e-12879.6140S ribosomal protein SA OS=Daucus carota OX=4039 GN=179B PE=2 SV=1[more]
A5BUU41.6e-12776.6040S ribosomal protein SA OS=Vitis vinifera OX=29760 GN=GSVIVT00034021001 PE=3 SV... [more]
O225181.0e-12675.7240S ribosomal protein SA OS=Glycine max OX=3847 PE=2 SV=1[more]
O657514.8e-12475.6540S ribosomal protein SA OS=Cicer arietinum OX=3827 GN=RAP40 PE=2 SV=1[more]
Q086821.2e-12275.6740S ribosomal protein Sa-1 OS=Arabidopsis thaliana OX=3702 GN=RPSaA PE=1 SV=3[more]
Match NameE-valueIdentityDescription
A0A0A0K8X12.8e-15992.9040S ribosomal protein SA OS=Cucumis sativus OX=3659 GN=Csa_6G016990 PE=3 SV=1[more]
A0A5D3DUK04.0e-15892.2640S ribosomal protein SA OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffol... [more]
A0A1S3CJX74.0e-15892.2640S ribosomal protein SA OS=Cucumis melo OX=3656 GN=LOC103501303 PE=3 SV=1[more]
A0A6J1I7P96.8e-15891.5940S ribosomal protein SA OS=Cucurbita maxima OX=3661 GN=LOC111472042 PE=3 SV=1[more]
A0A6J1EC156.8e-15891.5940S ribosomal protein SA OS=Cucurbita moschata OX=3662 GN=LOC111432696 PE=3 SV=1[more]
Match NameE-valueIdentityDescription
AT1G72370.18.3e-12475.6740s ribosomal protein SA [more]
AT1G72370.22.7e-12275.3340s ribosomal protein SA [more]
AT3G04770.23.7e-11674.1340s ribosomal protein SA B [more]
AT3G04770.15.8e-10186.9340s ribosomal protein SA B [more]
InterPro
Analysis Name: InterPro Annotations of Bottle gourd (Hangzhou Gourd) v1
Date Performed: 2022-08-01
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR001865Ribosomal protein S2PRINTSPR00395RIBOSOMALS2coord: 123..140
score: 47.01
coord: 44..62
score: 36.11
coord: 148..165
score: 44.62
coord: 165..176
score: 51.15
coord: 74..83
score: 64.92
coord: 186..200
score: 53.64
IPR001865Ribosomal protein S2PFAMPF00318Ribosomal_S2coord: 146..211
e-value: 5.8E-12
score: 45.3
coord: 47..140
e-value: 3.7E-13
score: 49.2
IPR001865Ribosomal protein S2CDDcd01425RPS2coord: 47..213
e-value: 4.34496E-73
score: 222.457
NoneNo IPR availableGENE3D3.40.50.10490coord: 43..248
e-value: 5.1E-104
score: 348.0
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..19
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 1..31
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 308..335
NoneNo IPR availablePANTHERPTHR11489:SF2540S RIBOSOMAL PROTEIN SAcoord: 25..324
IPR005707Ribosomal protein S2, eukaryotic/archaealTIGRFAMTIGR01012TIGR01012coord: 39..235
e-value: 5.7E-96
score: 317.6
IPR005707Ribosomal protein S2, eukaryotic/archaealPANTHERPTHR1148940S RIBOSOMAL PROTEIN SAcoord: 25..324
IPR018130Ribosomal protein S2, conserved sitePROSITEPS00962RIBOSOMAL_S2_1coord: 44..55
IPR018130Ribosomal protein S2, conserved sitePROSITEPS00963RIBOSOMAL_S2_2coord: 148..172
IPR027498Ribosomal protein S2, eukaryoticHAMAPMF_03015Ribosomal_S2_eukcoord: 31..300
score: 50.604214
IPR023591Ribosomal protein S2, flavodoxin-like domain superfamilySUPERFAMILY52313Ribosomal protein S2coord: 39..229

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
HG10003664.1HG10003664.1mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0002181 cytoplasmic translation
biological_process GO:0000028 ribosomal small subunit assembly
biological_process GO:0006412 translation
cellular_component GO:0022627 cytosolic small ribosomal subunit
cellular_component GO:0005840 ribosome
cellular_component GO:0015935 small ribosomal subunit
molecular_function GO:0003735 structural constituent of ribosome