CSPI02G25090 (gene) Cucumber (PI 183967) v1

Overview
NameCSPI02G25090
Typegene
OrganismCucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
LocationChr2: 21445441 .. 21450079 (-)
RNA-Seq ExpressionCSPI02G25090
SyntenyCSPI02G25090
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CAAATAGACGCATTTGTGAAGATGCAGATTACCTAAACTCGAACAAAAAAGAGGTACAAAGGCCCCTTTCGGGTCTTTTTTTTTTTTTTATCCTTAACAATTGTTCGAGTTCTTTTTTATATATATTTATAAGAATTCTCTAGATTTTTCCCTTAAAAATAAATAAGCAGAAGTATTTGTATGGATTTTATTAATTTTATTTTTTCCAACTTTTATTTGTGGCTTTTAGTTCATTCCAAACCGCCTAAGTTATGGAATTTAGCCCTTAAACCCTAACAAAATCCCATTCCGTTGAGAGAGGAAGTGAAGGTTTTGTTGTTTCGGTTCAATTCTACATTCTTCTTCCATATTCCACAATGGAGCTTCATTCCGCAACTCTCCACACCTCATTCTCCTTCTCTATCAGAAGCACACCTCTTGCGCACGGAGACGCCTCTGCCGCTTGCTCTCCCTCCCTGCCATCGCTTTCAAGAATCACAGTTCGAAACTTCTCTTTGGGTTCGAAAAGTAGAGGTAATTTTGCTAACTTTGAACTTTGTTAAGTTTCCGATGAGTTGCTAAGACTGAACCACGAGTTTCATACCTCTCTATACTTCTGTACTATTGCTCGATGCTTTCAGTTTCATCCGGGTGAGACCTCTGTATAATTATGCATACTTCGAGGAAAAGGATTGAGATTTTGATTCTGTGTTCTCTTTGGTGTTCTGTTCACTATACGAAGAATAGTAGTCTGAGAGGCATATTATCCATGGCGGTTATAGAGAACGAAGGTTGAAGACGAATTGGGAGGGAAAAAAAAGGAAAGAATGGAATTAGGTGGCGATAGAAATACAACTGATACCATATTAGAAGAAAAGACTTTCTTTAATAAAATATCTCTCATTTTTTACTAATGGCTAGAAACATTGATATATACACATGTGAAAGATTAAAAGGAGAATACAACGTTGTTATAGAAATAACTATTTTTAATAACAAATAATAACAAACTAGCTTCTAATAACTACAATATTCTCGACGTGGAAAAATGGTCCTTAAGAATTATTTAAACCGAAGTGTCGAAATTCGGCCACAATTTCTATATATACGCATGTGTGTATTGATGTGTATAGCTATGCATTGGTCTAGATGCTGAATATATGTTGTTTAAGTTATTGATTCCTTATAGTTTGTGTTTCAAGCTTCATATTGACTTAGCTAAGCAGTCACAAAGTGAAGGTAGTATGAACAGTCTAAAATTGCAGAAATTTAAAAGAATTGTGACAATGTTCTTTATAAAGTAGAGTGAAGTATTAGTAATTTTGTTGAATAGCAGGGTTCCCTTCACTGGTATGTCATGATAGACCAACGAAGTCATCCTTTTCTGCTTTTGTCAGAGGGGTGAAAGCAGTACCCAGTGATTGCAATTCAGAAACTCTTGATTTATTGAATCCCTCTCCTGATGAGCCAGTAAGAGATGTTCAAAATGCAAAAGACAGTGTTGAAAGCTTGGACCAGCATAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAAAAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCTCATTTCTTTAACAGGTGCCAAGATCGAGCTGCCTCTGAAGATGATCCTGGTATGAAGCATAAGCTGCTTCGTTTTGGAAGAAAGTTGAAAGAGGTATATCTTGGGATTTAAAAAAATTCATAGTAAAAGATTTGACATAAGAGTTGTGGACCTTGTTTCTGTAATTTTGCAAATAAGAAGAGTAGAAACTAACACTTTGTATCTAAATGAAGTTAGGTCTATTGTTGTAAAACCCATAGAGGATAAGTTGCTGAGGGAAGAATTGAGATGAAATGTTGACAATGGTTATTTTATTGATCAGATAGTTGTAGTATGTGATTTGTCAATCAACCTTATTGTTTCCTTCTATGAAATTTCCACAGATCGATGAAGATGTGCAGAGACATAATGAACTTCTTGAAGTTGTCAGAGCAACATCACCTTCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACGAAAGAATTCTTTGTGCATCTTCACACGGTGGCTCAATCTTATTATGATGATCCGGCTAAGCAAAATGGTAATACTGCATACTTTACAGTTGGGAAAATAGTAAAGAATAAATAGCTTCCAAGAGGATGAGGAAGTGTGGTTTAGAAAGCAATTATATTGGTTGAAAATACCCTTTAAAGTTCTAATTATCGTTTCAATCTGGTCCTTAGATTTTGAAATATTACATTGATTTTTGAATTTAATTTCCATTTAATCCTTAGGATTTAAAAGTTTTAGCCGTGAAGTCACGGCTAATATCTAATTAAAAGTTTCAATCTGGTCCTTAGGATTTAAAAGTGCGTTAACTCACTGCTAATGTTTACTTAAAACATTATCATTTGAAATGACCAAATGGAAACTACAATCAAAACTAAGGGCCTAAAGGATACCTTCCCTATATTGTTTTTTATGCTATTCAAAGAATTAGTGGCCTCTTAGAAATTGTTCCTTTATCTTAAGAAGTCCTGAAGAGTAGCTAGATTGGTTAAGACATTATGTATCTGTCAACTGTTTGAACTTATAACTCCACATTTTTTTTTTGTTAAGGATGTCTCTGGTGAACGTTTTTTTTCAATTTGCATATATGGTATTGATAGATAAACTGATATCCACCCCCACCATATTTCAATTGATAAGGTTTGGCAAAACTTGGGAATTCCTGCCTTGCTGCTGTACAAACATATGATGCTGCTACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAAGATATCATTAATTCTCCAACTATAGATGCCGCTTGCAGAAAGATAGACAATTTGGCAGAGAAAAATCAACTTGATTCTGCATTAGTGTTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCAAACATGATGAAAGAGGAGGTAATGCGTTTGCTCTTCCTGATCGCTCCTGCTGGAAACAGGTGGTTTTATGGTCTTATTTCTATGATCTTGTAGCTTGGCAAGGCATCATCCAACTATTCCCGAAATTTCAAATGATTGTTTAAGTTAGTAATCATCGAAACTTCCAATAACGCTTCAAATACAATTATCTATTTGAATGAAATTACTACAGCTAAGGGCACCAGGGTAAAAAAAATACCAATTATTTCTCTTGATTTTACGTGTTTCTATGTTGCCTCATGCAAGCAAATGGAGAATGTACGTCTTCAGTTGTTGCTTAATGACTTTCTGATGGTTTTATTGTTGATTTTGTTCTCTTTGTTAGGCGAAAGACATACTATACCACTTATATGTCACCGCAAGGGGAAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATCCTGAAGTATCTTCTCACAATTAATGATCCTGAGGAAAAACTAAGTGCTTTGAAGGATGCATTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGTATGCTATCAGACCTTATTGTTAAGAATATACTGAAATACTAAATATGCCATAGTTTATTAGCTCGACCAAAATTTTAGCGATTTAGCATGATATCAGATCTTCTATGTTCAGACCCTTAGGGTTTTATTTTCTTATCTACAATGTTGTTTTTCTACTTGTTCGATACTTGACATTGCACCATTTCTTTTAGCCTGCAAATGGGATGTTAAGAATACATCCAAACAATTAGATATTAGCGGTGAGTTAATTAACACCCTCATTGTCGTCATACCTTCCCATCAAACAAGAAAAAAAAAATAAGAAAGTGGGTGATGTGGTAGGATTAGAGTAACTTAGGTATCTTGGTAAATCCTTAATTGATTAGGATTATGATTAGTTTATTTGATTAATTAAGATTAAGATTAGTTTACTTTACAATTCTCTATTTCTCTTCTTGGGTCATAACTTTTGACACATAATCAAGACTTTCATTTTAGCATTAGTATTGGAGATTTCTCTTCTTTTATCCATTTGGAATTTAGGCTGCATCAAAAAGCCTTAGTCTCGTTCGTCCTCGTAGATCTTTCGAGCCTTTGATGTCCATCTAGATAAGAGTACGAATACTACCTTGACTTTACATTTGCACAACACCAACCACGATCACTTGAATGCTGTATGAAAAACGTCCTTCAGAAAGGCAATCTTATTAGGTTTATTCTTGTTTTTGGAAACTTTTAGGACCCCAGAGGAGCTTCACACGTGGGTAAAGACAGTGGTAGATGCTTATCATTTCAGCAGGGAAGGCACCCTCGTCAGGGAAGCCAGAGACCTTATGAATCCACAGCTCATCGTTAAACTTGAAGAATTGAAGCGTCTCATTGAGAAGAAATTCATGTGAGGTACTAATGGATTTGGTAGTTAGGTCTTAATTAGTGCATCAAATGTTGTATAGTGAACTTTTTTCTTTCTTTTTTTGCCTTTTTGTGTAACAAAATGAATATGAACAATGTATTCAAAATAATTTATTTTTTACCCGGAAATAGAGTGTAGTTCAAGACTTCAAGTGGTGAGATATTTGTACCTGTTTTGCAGTTTGTATTCTAATCTGCAATCGTAACCAAG

mRNA sequence

CAAATAGACGCATTTGTGAAGATGCAGATTACCTAAACTCGAACAAAAAAGAGTTCATTCCAAACCGCCTAAGTTATGGAATTTAGCCCTTAAACCCTAACAAAATCCCATTCCGTTGAGAGAGGAAGTGAAGGTTTTGTTGTTTCGGTTCAATTCTACATTCTTCTTCCATATTCCACAATGGAGCTTCATTCCGCAACTCTCCACACCTCATTCTCCTTCTCTATCAGAAGCACACCTCTTGCGCACGGAGACGCCTCTGCCGCTTGCTCTCCCTCCCTGCCATCGCTTTCAAGAATCACAGTTCGAAACTTCTCTTTGGGTTCGAAAAGTAGAGCAGGGTTCCCTTCACTGGTATGTCATGATAGACCAACGAAGTCATCCTTTTCTGCTTTTGTCAGAGGGGTGAAAGCAGTACCCAGTGATTGCAATTCAGAAACTCTTGATTTATTGAATCCCTCTCCTGATGAGCCAGTAAGAGATGTTCAAAATGCAAAAGACAGTGTTGAAAGCTTGGACCAGCATAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAAAAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCTCATTTCTTTAACAGGTGCCAAGATCGAGCTGCCTCTGAAGATGATCCTGGTATGAAGCATAAGCTGCTTCGTTTTGGAAGAAAGTTGAAAGAGATCGATGAAGATGTGCAGAGACATAATGAACTTCTTGAAGTTGTCAGAGCAACATCACCTTCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACGAAAGAATTCTTTGTGCATCTTCACACGGTGGCTCAATCTTATTATGATGATCCGGCTAAGCAAAATGGTTTGGCAAAACTTGGGAATTCCTGCCTTGCTGCTGTACAAACATATGATGCTGCTACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAAGATATCATTAATTCTCCAACTATAGATGCCGCTTGCAGAAAGATAGACAATTTGGCAGAGAAAAATCAACTTGATTCTGCATTAGTGTTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCAAACATGATGAAAGAGGAGGCGAAAGACATACTATACCACTTATATGTCACCGCAAGGGGAAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATCCTGAAGTATCTTCTCACAATTAATGATCCTGAGGAAAAACTAAGTGCTTTGAAGGATGCATTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACCCCAGAGGAGCTTCACACGTGGGTAAAGACAGTGGTAGATGCTTATCATTTCAGCAGGGAAGGCACCCTCGTCAGGGAAGCCAGAGACCTTATGAATCCACAGCTCATCGTTAAACTTGAAGAATTGAAGCGTCTCATTGAGAAGAAATTCATGTGAGGTACTAATGGATTTGGTAGTTAGGTCTTAATTAGTGCATCAAATGTTGTATAGTGAACTTTTTTCTTTCTTTTTTTGCCTTTTTGTGTAACAAAATGAATATGAACAATGTATTCAAAATAATTTATTTTTTACCCGGAAATAGAGTGTAGTTCAAGACTTCAAGTGGTGAGATATTTGTACCTGTTTTGCAGTTTGTATTCTAATCTGCAATCGTAACCAAG

Coding sequence (CDS)

ATGGAGCTTCATTCCGCAACTCTCCACACCTCATTCTCCTTCTCTATCAGAAGCACACCTCTTGCGCACGGAGACGCCTCTGCCGCTTGCTCTCCCTCCCTGCCATCGCTTTCAAGAATCACAGTTCGAAACTTCTCTTTGGGTTCGAAAAGTAGAGCAGGGTTCCCTTCACTGGTATGTCATGATAGACCAACGAAGTCATCCTTTTCTGCTTTTGTCAGAGGGGTGAAAGCAGTACCCAGTGATTGCAATTCAGAAACTCTTGATTTATTGAATCCCTCTCCTGATGAGCCAGTAAGAGATGTTCAAAATGCAAAAGACAGTGTTGAAAGCTTGGACCAGCATAAAATGACCAAAGTGTGTGATAAGCTCATTGAAGTCTTCATGATCGACAAGCCAACTCCAAAAGATTGGAGACGGTTAATTGCTTTCAGCAAGGAATGGGACAATATCCGTCCTCATTTCTTTAACAGGTGCCAAGATCGAGCTGCCTCTGAAGATGATCCTGGTATGAAGCATAAGCTGCTTCGTTTTGGAAGAAAGTTGAAAGAGATCGATGAAGATGTGCAGAGACATAATGAACTTCTTGAAGTTGTCAGAGCAACATCACCTTCAGAACTTGGTGAAATTATTTCTAGGCGTCGCAAAGATTTTACGAAAGAATTCTTTGTGCATCTTCACACGGTGGCTCAATCTTATTATGATGATCCGGCTAAGCAAAATGGTTTGGCAAAACTTGGGAATTCCTGCCTTGCTGCTGTACAAACATATGATGCTGCTACTGAAAACATTGAAGCACTGAATGCCGCAGAGTTGAAATTCCAAGATATCATTAATTCTCCAACTATAGATGCCGCTTGCAGAAAGATAGACAATTTGGCAGAGAAAAATCAACTTGATTCTGCATTAGTGTTGATGATCACAAAAGCTTGGTCAGCTGCAAAGGAGTCAAACATGATGAAAGAGGAGGCGAAAGACATACTATACCACTTATATGTCACCGCAAGGGGAAACCTTCAAAGATTAATGCCAAAAGAAATCAGGATCCTGAAGTATCTTCTCACAATTAATGATCCTGAGGAAAAACTAAGTGCTTTGAAGGATGCATTTACCCCCGGAGAAGAACTTGAAGGACAAGATGTGGACTGCTTATACACGACCCCAGAGGAGCTTCACACGTGGGTAAAGACAGTGGTAGATGCTTATCATTTCAGCAGGGAAGGCACCCTCGTCAGGGAAGCCAGAGACCTTATGAATCCACAGCTCATCGTTAAACTTGAAGAATTGAAGCGTCTCATTGAGAAGAAATTCATGTGA

Protein sequence

MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRAGFPSLVCHDRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQNGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQLIVKLEELKRLIEKKFM*
Homology
BLAST of CSPI02G25090 vs. ExPASy Swiss-Prot
Match: Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)

HSP 1 Score: 302.8 bits (774), Expect = 6.5e-81
Identity = 156/399 (39.10%), Postives = 255/399 (63.91%), Query Frame = 0

Query: 45  FSLGSKSRGFP--SLVCHDRPTKSSFSAFVRGVKAVPSDCN----SETLDLLNPSPDEPV 104
           FS   K   FP  +   H  P    FSAF+ G + +         ++T+     +  E  
Sbjct: 11  FSSADKLLSFPPKNSQTHHLP----FSAFINGGRKIRKSSTITFATDTVTYNGTTSAEVK 70

Query: 105 RDVQNAKDSVESLDQHKMTKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRC 164
             V++  + VE  + + M + CDK+I++F+ +KP  K W+  +    EW+    +F+ RC
Sbjct: 71  SSVEDPME-VEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRC 130

Query: 165 QDRAASEDDPGMKHKLLRFGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFT 224
           + RA +E DP +K KL+    K+K+ID+++++HN+LL+ ++  +P+++  I ++RR+DFT
Sbjct: 131 RIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQ-ENPTDINAIAAKRRRDFT 190

Query: 225 KEFFVHLHTVAQSYYDDPAKQNGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIIN 284
            EFF ++ T+     D    ++ +A+L   CL+AV  YD   E++E L+ A+ KF+DI+N
Sbjct: 191 GEFFRYV-TLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILN 250

Query: 285 SPTIDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNL 344
           SP++D+AC KI +LA+  +LDS+L+L+I  A++AAKES  +  EAKDI+YHLY   + +L
Sbjct: 251 SPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKSSL 310

Query: 345 QRLMPKEIRILKYLLTINDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVV 404
           + + PKEI++LKYLL I DPEE+ SAL  AF+PG++ E +D   LYTTP+ELH W+K ++
Sbjct: 311 RSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIML 370

Query: 405 DAYHFSREGTLVREARDLMNPQLIVKLEELKRLIEKKFM 438
           DAYH ++E T ++EA+ +  P +I +L  LK  IE +++
Sbjct: 371 DAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYL 402

BLAST of CSPI02G25090 vs. ExPASy TrEMBL
Match: A0A0A0LMH6 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403700 PE=4 SV=1)

HSP 1 Score: 856.3 bits (2211), Expect = 5.6e-245
Identity = 431/437 (98.63%), Postives = 433/437 (99.08%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRIT+RNFSLGSKSRGFPSLVCH
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRGFPSLVCH 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVC 120
           DRP KSSFSAFVRGVKAVPSDCNSETLDLLNPS DEPVRDVQNAKDSVE+LDQHKMTKVC
Sbjct: 61  DRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
           DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK
Sbjct: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
           LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240

Query: 241 GLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
            LAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
           ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
           KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420

Query: 421 LIVKLEELKRLIEKKFM 438
           LIVKLEELK LIEKKFM
Sbjct: 421 LIVKLEELKGLIEKKFM 437

BLAST of CSPI02G25090 vs. ExPASy TrEMBL
Match: A0A1S3B306 (uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103485431 PE=4 SV=1)

HSP 1 Score: 800.4 bits (2066), Expect = 3.7e-228
Identity = 401/437 (91.76%), Postives = 416/437 (95.19%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELHSATL TSFSFSIR   LA GDASAACSPS PSLSRITVRNFSLGSKSRGFPSL+C 
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRGFPSLLCR 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVC 120
           DRP KSSFS FVRGV AVPSDCNSETLD LNPSP E VRDVQNAKDSVESLDQHKMTKVC
Sbjct: 61  DRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
           DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR GRK
Sbjct: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
           LKEIDEDVQRHNELLEVVRAT+PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240

Query: 241 GLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
            LAKLGNSCLAAVQTYDAATENIEAL+AAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
           ALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
           KLSALKDAFTPGEE+EGQDVDCLYTTP++LH W+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNPQ 420

Query: 421 LIVKLEELKRLIEKKFM 438
           +IVKLEELK L+EKKFM
Sbjct: 421 VIVKLEELKHLLEKKFM 437

BLAST of CSPI02G25090 vs. ExPASy TrEMBL
Match: A0A1S3B2I9 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103485431 PE=4 SV=1)

HSP 1 Score: 795.8 bits (2054), Expect = 9.0e-227
Identity = 401/438 (91.55%), Postives = 416/438 (94.98%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSR-GFPSLVC 60
           MELHSATL TSFSFSIR   LA GDASAACSPS PSLSRITVRNFSLGSKSR GFPSL+C
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRAGFPSLLC 60

Query: 61  HDRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKV 120
            DRP KSSFS FVRGV AVPSDCNSETLD LNPSP E VRDVQNAKDSVESLDQHKMTKV
Sbjct: 61  RDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGR 180
           CDKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR GR
Sbjct: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGR 180

Query: 181 KLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQ 240
           KLKEIDEDVQRHNELLEVVRAT+PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+Q
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240

Query: 241 NGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLD 300
           N LAKLGNSCLAAVQTYDAATENIEAL+AAELKFQDIINSPT+DAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPE 360
           SALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNP 420
           EKLSALKDAFTPGEE+EGQDVDCLYTTP++LH W+KTVVDAYHFSREGTL++EARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNP 420

Query: 421 QLIVKLEELKRLIEKKFM 438
           Q+IVKLEELK L+EKKFM
Sbjct: 421 QVIVKLEELKHLLEKKFM 438

BLAST of CSPI02G25090 vs. ExPASy TrEMBL
Match: A0A6J1JSR8 (uncharacterized protein At4g37920-like isoform X2 OS=Cucurbita maxima OX=3661 GN=LOC111489439 PE=4 SV=1)

HSP 1 Score: 732.3 bits (1889), Expect = 1.2e-207
Identity = 369/437 (84.44%), Postives = 394/437 (90.16%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELH ATL  SFSF IR   L HGDASA CS S  S+SRIT R+FSLGSKSRGFPSL   
Sbjct: 1   MELHCATLQASFSFYIRGKTLPHGDASATCSSSSSSVSRITARSFSLGSKSRGFPSLTWR 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVC 120
            R  KSS SA VRG  A PS C+++TLD  N +PD+ VRDVQNAK+ VE LDQHKMTKVC
Sbjct: 61  VRLKKSSSSAVVRGGSAEPSHCSTDTLDSSNTTPDDSVRDVQNAKNDVECLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
           DKLIEVF+IDKPTP DWRRLIAFSK WDNIRPHFF RCQ+RAASEDDPGM+HKLLR GRK
Sbjct: 121 DKLIEVFLIDKPTPTDWRRLIAFSKTWDNIRPHFFRRCQERAASEDDPGMRHKLLRLGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
           LKEIDEDVQRHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYY DPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYADPAEQN 240

Query: 241 GLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
            LAKLGNSCL AVQ YDAATENIEALNAAELKFQDIINSPT+DAACRKID+LAEKNQLDS
Sbjct: 241 ALAKLGNSCLVAVQAYDAATENIEALNAAELKFQDIINSPTLDAACRKIDSLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
           ALVLMI+KAWSAAKESNMMK+E KDILYHLYVTARGNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMISKAWSAAKESNMMKDEVKDILYHLYVTARGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
           KLSALKDAFTPGEE+EGQDVDCLYTTPE+LHTW+KTV+DAYHFSREGTLV+EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPEKLHTWIKTVLDAYHFSREGTLVKEARDLMNPQ 420

Query: 421 LIVKLEELKRLIEKKFM 438
           +IVKLEELK L+EKKFM
Sbjct: 421 VIVKLEELKLLVEKKFM 437

BLAST of CSPI02G25090 vs. ExPASy TrEMBL
Match: A0A6J1DF51 (uncharacterized protein At4g37920, chloroplastic OS=Momordica charantia OX=3673 GN=LOC111020242 PE=4 SV=1)

HSP 1 Score: 730.7 bits (1885), Expect = 3.6e-207
Identity = 368/439 (83.83%), Postives = 397/439 (90.43%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELHSATL TS SF +R   LAH DASAACSPS  SLSRI  RN S+GSKSRGF SLVC 
Sbjct: 1   MELHSATLQTSLSFPVRRRTLAHADASAACSPSSSSLSRIPARNSSMGSKSRGFLSLVCQ 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPS--PDEPVRDVQNAKDSVESLDQHKMTK 120
            RP KSSFSA VRG  AVP+DC+SE L+  N +   + PV +VQNA+D VE LDQHKMT+
Sbjct: 61  VRPKKSSFSAVVRGDSAVPNDCSSENLNSSNRTLIHNGPVGNVQNAEDGVECLDQHKMTE 120

Query: 121 VCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFG 180
           VCDKLI VF+IDKPTP DWRRLIAFSKEWDNIRPHFF+RCQDRAA+EDDPGMKHKLLR G
Sbjct: 121 VCDKLIGVFLIDKPTPTDWRRLIAFSKEWDNIRPHFFSRCQDRAATEDDPGMKHKLLRLG 180

Query: 181 RKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAK 240
           RKLKEIDEDVQRHNELLEVVRA +PSELGEI+SRRRKDFTKEFFVHLHTVA+SYYDDP +
Sbjct: 181 RKLKEIDEDVQRHNELLEVVRAAAPSELGEIVSRRRKDFTKEFFVHLHTVAESYYDDPTE 240

Query: 241 QNGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQL 300
           QN LAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPT+DAACRKID+LAEKNQL
Sbjct: 241 QNALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTVDAACRKIDSLAEKNQL 300

Query: 301 DSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDP 360
           DSALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DP
Sbjct: 301 DSALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIEDP 360

Query: 361 EEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMN 420
           EE+LS LKDAFTPGEELEGQDVDCLYTTPE+L TW+KTV+DAYHFSREGTL++EARDLMN
Sbjct: 361 EERLSGLKDAFTPGEELEGQDVDCLYTTPEKLRTWIKTVLDAYHFSREGTLIKEARDLMN 420

Query: 421 PQLIVKLEELKRLIEKKFM 438
           P++IVKLEELK L+EKKFM
Sbjct: 421 PKVIVKLEELKHLVEKKFM 439

BLAST of CSPI02G25090 vs. NCBI nr
Match: XP_004138642.1 (uncharacterized protein At4g37920 isoform X2 [Cucumis sativus] >KGN63115.1 hypothetical protein Csa_022408 [Cucumis sativus])

HSP 1 Score: 856.3 bits (2211), Expect = 1.2e-244
Identity = 431/437 (98.63%), Postives = 433/437 (99.08%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRIT+RNFSLGSKSRGFPSLVCH
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRGFPSLVCH 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVC 120
           DRP KSSFSAFVRGVKAVPSDCNSETLDLLNPS DEPVRDVQNAKDSVE+LDQHKMTKVC
Sbjct: 61  DRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
           DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK
Sbjct: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
           LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240

Query: 241 GLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
            LAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
           ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
           KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420

Query: 421 LIVKLEELKRLIEKKFM 438
           LIVKLEELK LIEKKFM
Sbjct: 421 LIVKLEELKGLIEKKFM 437

BLAST of CSPI02G25090 vs. NCBI nr
Match: XP_011649916.1 (uncharacterized protein At4g37920 isoform X1 [Cucumis sativus])

HSP 1 Score: 851.7 bits (2199), Expect = 2.9e-243
Identity = 431/438 (98.40%), Postives = 433/438 (98.86%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSR-GFPSLVC 60
           MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRIT+RNFSLGSKSR GFPSLVC
Sbjct: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITIRNFSLGSKSRAGFPSLVC 60

Query: 61  HDRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKV 120
           HDRP KSSFSAFVRGVKAVPSDCNSETLDLLNPS DEPVRDVQNAKDSVE+LDQHKMTKV
Sbjct: 61  HDRPKKSSFSAFVRGVKAVPSDCNSETLDLLNPSSDEPVRDVQNAKDSVENLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGR 180
           CDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGR
Sbjct: 121 CDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGR 180

Query: 181 KLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQ 240
           KLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQ
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQ 240

Query: 241 NGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLD 300
           N LAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPE 360
           SALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNP 420
           EKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNP
Sbjct: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNP 420

Query: 421 QLIVKLEELKRLIEKKFM 438
           QLIVKLEELK LIEKKFM
Sbjct: 421 QLIVKLEELKGLIEKKFM 438

BLAST of CSPI02G25090 vs. NCBI nr
Match: XP_008441243.1 (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 800.4 bits (2066), Expect = 7.6e-228
Identity = 401/437 (91.76%), Postives = 416/437 (95.19%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELHSATL TSFSFSIR   LA GDASAACSPS PSLSRITVRNFSLGSKSRGFPSL+C 
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRGFPSLLCR 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVC 120
           DRP KSSFS FVRGV AVPSDCNSETLD LNPSP E VRDVQNAKDSVESLDQHKMTKVC
Sbjct: 61  DRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKVC 120

Query: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
           DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR GRK
Sbjct: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
           LKEIDEDVQRHNELLEVVRAT+PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240

Query: 241 GLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
            LAKLGNSCLAAVQTYDAATENIEAL+AAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
           ALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
           KLSALKDAFTPGEE+EGQDVDCLYTTP++LH W+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNPQ 420

Query: 421 LIVKLEELKRLIEKKFM 438
           +IVKLEELK L+EKKFM
Sbjct: 421 VIVKLEELKHLLEKKFM 437

BLAST of CSPI02G25090 vs. NCBI nr
Match: XP_008441242.1 (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 795.8 bits (2054), Expect = 1.9e-226
Identity = 401/438 (91.55%), Postives = 416/438 (94.98%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSR-GFPSLVC 60
           MELHSATL TSFSFSIR   LA GDASAACSPS PSLSRITVRNFSLGSKSR GFPSL+C
Sbjct: 1   MELHSATLQTSFSFSIRGKSLALGDASAACSPSSPSLSRITVRNFSLGSKSRAGFPSLLC 60

Query: 61  HDRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKV 120
            DRP KSSFS FVRGV AVPSDCNSETLD LNPSP E VRDVQNAKDSVESLDQHKMTKV
Sbjct: 61  RDRPKKSSFSTFVRGVSAVPSDCNSETLDSLNPSPVESVRDVQNAKDSVESLDQHKMTKV 120

Query: 121 CDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGR 180
           CDKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLR GR
Sbjct: 121 CDKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRLGR 180

Query: 181 KLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQ 240
           KLKEIDEDVQRHNELLEVVRAT+PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+Q
Sbjct: 181 KLKEIDEDVQRHNELLEVVRATAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQ 240

Query: 241 NGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLD 300
           N LAKLGNSCLAAVQTYDAATENIEAL+AAELKFQDIINSPT+DAACRKIDNLAEKNQLD
Sbjct: 241 NALAKLGNSCLAAVQTYDAATENIEALDAAELKFQDIINSPTLDAACRKIDNLAEKNQLD 300

Query: 301 SALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPE 360
           SALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPE
Sbjct: 301 SALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPE 360

Query: 361 EKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNP 420
           EKLSALKDAFTPGEE+EGQDVDCLYTTP++LH W+KTVVDAYHFSREGTL++EARDLMNP
Sbjct: 361 EKLSALKDAFTPGEEVEGQDVDCLYTTPDKLHAWIKTVVDAYHFSREGTLIKEARDLMNP 420

Query: 421 QLIVKLEELKRLIEKKFM 438
           Q+IVKLEELK L+EKKFM
Sbjct: 421 QVIVKLEELKHLLEKKFM 438

BLAST of CSPI02G25090 vs. NCBI nr
Match: XP_038884061.1 (uncharacterized protein At4g37920 isoform X2 [Benincasa hispida])

HSP 1 Score: 772.3 bits (1993), Expect = 2.2e-219
Identity = 390/437 (89.24%), Postives = 406/437 (92.91%), Query Frame = 0

Query: 1   MELHSATLHTSFSFSIRSTPLAHGDASAACSPSLPSLSRITVRNFSLGSKSRGFPSLVCH 60
           MELH ATL TSFSFSIRS  LAHGDASA CSPS PS SRIT RNFS+GSKSRGFPSLVC 
Sbjct: 1   MELHCATLQTSFSFSIRSKALAHGDASAVCSPSSPSFSRITARNFSMGSKSRGFPSLVCR 60

Query: 61  DRPTKSSFSAFVRGVKAVPSDCNSETLDLLNPSPDEPVRDVQNAKDSVESLDQHKMTKVC 120
            R  KSSFSA VRGV AVPSDCNSETLD  NP+  EP RDVQNAKD VESLDQ KMTKVC
Sbjct: 61  VRLKKSSFSAVVRGVSAVPSDCNSETLDWSNPTQYEP-RDVQNAKDGVESLDQPKMTKVC 120

Query: 121 DKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRCQDRAASEDDPGMKHKLLRFGRK 180
           DKLIEVFMIDKPTP DWRRLIAFSKEWDNIRPHFF RCQDRAASEDDPGM+HKLLR GRK
Sbjct: 121 DKLIEVFMIDKPTPTDWRRLIAFSKEWDNIRPHFFRRCQDRAASEDDPGMRHKLLRLGRK 180

Query: 181 LKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFTKEFFVHLHTVAQSYYDDPAKQN 240
           LKEIDEDV+RHNELLEVVRA +PSELGEIISRRRKDFTKEFFVHLHTVA+SYYDDPA+QN
Sbjct: 181 LKEIDEDVKRHNELLEVVRAAAPSELGEIISRRRKDFTKEFFVHLHTVAESYYDDPAEQN 240

Query: 241 GLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIINSPTIDAACRKIDNLAEKNQLDS 300
            LAKLGNSCL+AVQTYDAATENIEALNAAELKFQDIINSPT+DAACRKIDNLAEKNQLDS
Sbjct: 241 ALAKLGNSCLSAVQTYDAATENIEALNAAELKFQDIINSPTLDAACRKIDNLAEKNQLDS 300

Query: 301 ALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNLQRLMPKEIRILKYLLTINDPEE 360
           ALVLMITKAWSAAKESNMMK+E KDILYHLYVTA GNLQRLMPKEIRILKYLLTI DPEE
Sbjct: 301 ALVLMITKAWSAAKESNMMKDEVKDILYHLYVTASGNLQRLMPKEIRILKYLLTIKDPEE 360

Query: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVVDAYHFSREGTLVREARDLMNPQ 420
           KLSALKDAFTPGEELEGQDVDCLYTTPE+LHTW+KTVVDAYHFSREGTL++EARDLMNPQ
Sbjct: 361 KLSALKDAFTPGEELEGQDVDCLYTTPEKLHTWIKTVVDAYHFSREGTLIKEARDLMNPQ 420

Query: 421 LIVKLEELKRLIEKKFM 438
           +IVKLEELK L+EK FM
Sbjct: 421 VIVKLEELKHLLEKNFM 436

BLAST of CSPI02G25090 vs. TAIR 10
Match: AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 521.9 bits (1343), Expect = 4.8e-148
Identity = 250/345 (72.46%), Postives = 302/345 (87.54%), Query Frame = 0

Query: 95  DEPVRDVQNAKDSVES--LDQHKMTKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRP 154
           D+ V   +  KD  E   +D  +M KVCDKLIEVFM+DKPTP DWRRL+AFSKEWD+IRP
Sbjct: 70  DKSVVAKEEKKDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLAFSKEWDSIRP 129

Query: 155 HFFNRCQDRAASEDDPGMKHKLLRFGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISR 214
           HF+ RCQ+RA SED+P MKHK+ R  RKLKE+DED+QRHNELL V++ T P+E+GE+++R
Sbjct: 130 HFYKRCQERADSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTPPAEIGELVAR 189

Query: 215 RRKDFTKEFFVHLHTVAQSYYDDPAKQNGLAKLGNSCLAAVQTYDAATENIEALNAAELK 274
           RRKDFT EFF HLHTVA+SYYD+P +QN LA LG   +AAVQ YD +TE+I+ALNAAE+K
Sbjct: 190 RRKDFTNEFFEHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTESIDALNAAEMK 249

Query: 275 FQDIINSPTIDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYV 334
            QDIINSP++DAACRKID+LAEKNQLDSALVLMITKAWSAAKESNMMKEE KDILYHLYV
Sbjct: 250 LQDIINSPSLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEEVKDILYHLYV 309

Query: 335 TARGNLQRLMPKEIRILKYLLTINDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHT 394
           TARGNLQRLMPKE+RILKYLL+I DP+E++SAL+DAFTPG+ELEG DVD LYTTPE L +
Sbjct: 310 TARGNLQRLMPKEVRILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDYLYTTPEHLQS 369

Query: 395 WVKTVVDAYHFSREGTLVREARDLMNPQLIVKLEELKRLIEKKFM 438
            +KTV++AYHFSREG+LV+EA+DLM+P+LI K+E+LK+L+EKK+M
Sbjct: 370 LMKTVLEAYHFSREGSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414

BLAST of CSPI02G25090 vs. TAIR 10
Match: AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 302.8 bits (774), Expect = 4.6e-82
Identity = 156/399 (39.10%), Postives = 255/399 (63.91%), Query Frame = 0

Query: 45  FSLGSKSRGFP--SLVCHDRPTKSSFSAFVRGVKAVPSDCN----SETLDLLNPSPDEPV 104
           FS   K   FP  +   H  P    FSAF+ G + +         ++T+     +  E  
Sbjct: 11  FSSADKLLSFPPKNSQTHHLP----FSAFINGGRKIRKSSTITFATDTVTYNGTTSAEVK 70

Query: 105 RDVQNAKDSVESLDQHKMTKVCDKLIEVFMIDKPTPKDWRRLIAFSKEWDNIRPHFFNRC 164
             V++  + VE  + + M + CDK+I++F+ +KP  K W+  +    EW+    +F+ RC
Sbjct: 71  SSVEDPME-VEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRC 130

Query: 165 QDRAASEDDPGMKHKLLRFGRKLKEIDEDVQRHNELLEVVRATSPSELGEIISRRRKDFT 224
           + RA +E DP +K KL+    K+K+ID+++++HN+LL+ ++  +P+++  I ++RR+DFT
Sbjct: 131 RIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQ-ENPTDINAIAAKRRRDFT 190

Query: 225 KEFFVHLHTVAQSYYDDPAKQNGLAKLGNSCLAAVQTYDAATENIEALNAAELKFQDIIN 284
            EFF ++ T+     D    ++ +A+L   CL+AV  YD   E++E L+ A+ KF+DI+N
Sbjct: 191 GEFFRYV-TLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILN 250

Query: 285 SPTIDAACRKIDNLAEKNQLDSALVLMITKAWSAAKESNMMKEEAKDILYHLYVTARGNL 344
           SP++D+AC KI +LA+  +LDS+L+L+I  A++AAKES  +  EAKDI+YHLY   + +L
Sbjct: 251 SPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKSSL 310

Query: 345 QRLMPKEIRILKYLLTINDPEEKLSALKDAFTPGEELEGQDVDCLYTTPEELHTWVKTVV 404
           + + PKEI++LKYLL I DPEE+ SAL  AF+PG++ E +D   LYTTP+ELH W+K ++
Sbjct: 311 RSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIML 370

Query: 405 DAYHFSREGTLVREARDLMNPQLIVKLEELKRLIEKKFM 438
           DAYH ++E T ++EA+ +  P +I +L  LK  IE +++
Sbjct: 371 DAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYL 402

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84WN06.5e-8139.10Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P... [more]
Match NameE-valueIdentityDescription
A0A0A0LMH65.6e-24598.63Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G403700 PE=4 SV=1[more]
A0A1S3B3063.7e-22891.76uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3... [more]
A0A1S3B2I99.0e-22791.55uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3... [more]
A0A6J1JSR81.2e-20784.44uncharacterized protein At4g37920-like isoform X2 OS=Cucurbita maxima OX=3661 GN... [more]
A0A6J1DF513.6e-20783.83uncharacterized protein At4g37920, chloroplastic OS=Momordica charantia OX=3673 ... [more]
Match NameE-valueIdentityDescription
XP_004138642.11.2e-24498.63uncharacterized protein At4g37920 isoform X2 [Cucumis sativus] >KGN63115.1 hypot... [more]
XP_011649916.12.9e-24398.40uncharacterized protein At4g37920 isoform X1 [Cucumis sativus][more]
XP_008441243.17.6e-22891.76PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis ... [more]
XP_008441242.11.9e-22691.55PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ... [more]
XP_038884061.12.2e-21989.24uncharacterized protein At4g37920 isoform X2 [Benincasa hispida][more]
Match NameE-valueIdentityDescription
AT1G36320.14.8e-14872.46unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
AT4G37920.14.6e-8239.10unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 181..201
NoneNo IPR availablePANTHERPTHR31755:SF3FOLATE RECEPTOR-LIKEcoord: 9..437
IPR040320Uncharacterized protein At4g37920-likePANTHERPTHR31755FOLATE RECEPTOR-LIKEcoord: 9..437

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CSPI02G25090.2CSPI02G25090.2mRNA
CSPI02G25090.1CSPI02G25090.1mRNA
CSPI02G25090.3CSPI02G25090.3mRNA