CSPI02G18990 (gene) Cucumber (PI 183967) v1

Overview
Name: CSPI02G18990
Type: gene
Organism: Cucumis sativus var. hardwickii cv. PI 183967 (Cucumber (PI 183967) v1)
Description: LEA_2 domain-containing protein
Location: Chr2: 17372138 .. 17374017 (+)
RNA-Seq Expression: CSPI02G18990
Synteny: CSPI02G18990
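
The Location field can be turned into a feature length with a little arithmetic. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re


def parse_location(text):
    """Split a location such as 'Chr2: 17372138 .. 17374017 (+)' into its parts."""
    pattern = r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    match = re.fullmatch(pattern, text)
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
    return end - start + 1


chrom, start, end, strand = parse_location("Chr2: 17372138 .. 17374017 (+)")
print(chrom, strand, span_length(start, end))  # Chr2 + 1880
```

Under that assumption the gene spans 1880 bp on the plus strand of Chr2.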
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GTGTTCAATAATGAAGATTCGTGTTGGCGGTGCCAACTGTAATCTAGCTGTCAGTAACTAAAATTCCATTGCCCAAAAATCTCTCCTTCTTCAGCCTTACGCCAAAATTTCATCTCCCATTCCCCTTTTTACCCTTTTCTTCCTTTTCTGTTTTTAATACTCACTTCACAAAACCGCGTCTCCCCTTCTTCAATGGCGTTACTCTTTACACTTTCACTCTCTTGAGTCTTGATCTCCATTCCAAACTCAAATCAACGCTCTACAATCTCTCATTCTACCTTAACCCAACTCCCATTTCTTCAACACCCCAATAACTCCCACAATGCACGCTAAAACCGACTCCGAAGTCACCAGTATCGCCCCTTCTTCTCCGACCAGATCTCCTCGCCGTCCTGTCTACTTCGTTCAGAGCCCTTCCAGAGACTCACACGATGGGGAGAAGACTGCCACCTCCTTTCACTCTACTCCCGTTCTCACTAGTCCCATGGACTCCCCTCCCCATTCTCGCTCCTCCGTCGGCCGTCACTCTAGAGAATCTTCCTCCAGTAGGTTTTCTGGATCTCTTAAACCTGGATCCAGGAAGATCACTCCTAATGACGTCTCTCGCGGCGCACATCGGAAGGGTCAGAAGCCATGGAAGGAATGCGATGTGATCGAAGAGGAAGGTCTTCTTGAAGACGAAGATCGGGGAAAATCTCTTCCTCGTCGCTGTTATGTTCTCGCTTTCATTTTGGGATTTGTTGTTCTTTTCTCTATGTTTGCTTTGATTCTTTGGGGTGCTAGTAGGCCGATGAAGCCCAAGATCACTATGAAGGTAAAACCACAATAGCAATGCAATACATTTTCTATATTAGGGATTTTTTTTTCTTGTTTCTTGTTCGTTGAATTTCCGAATTTTTTACAGAGCATTACATTCGAGCAATTCAAAATCCAAGCCGGTTCCGATTTTACGGGCGTCGCCACTGATATGGCTTCCGTAAATTCTACTGTGAAACTCATTTTTCGAAACACCGGATCATTCTTCGGCGTCCACGTCTCTCCTACTCCCGTCGATTTATCATATTCCGAAATCACAGTCGCATCAGGAACCGTAAGTTCCCAAAACCCTAACCCCCAAATTTCTTCTCATTCAATCTCTAACAAATTGTCGGAATTTTCAGGTTAAAAAGTTCTATCAATCACGGAAGAGTCATAGATCTATGACAATCAATGTAATCGGTACCAGAGTCCCACTGTACGGAAGTGGAGCAAGTCTGAGCGGTTCCACTGGAACCCCCGAAACACCATTGCCGTTGAAACTGAGATTCGTGATCAGATCCAGAGCCTACGTGCTGGGCCAATTAGTGAAGCCAAAATTCTACAGACACATCGATTGCCCCATAATTTTCGATTCCAAGAAACTCAATGTCCCCATGTCGCTCAAGAATTGCACAGTCGTTTGAAAACGAAAACGAAAACAAAATCGAGAATCACTTGAACATTTATTAAAATCAGTACTTGTTTTTTTGTTCTTGAGGTGTCTAGGAAATGGCGGACTGCGACAACTGGAACAGAAACGGGGGACGCCAGAACGGGAAGGGTAAAAGTGGAATAAGCGACAGAAACGGGGTTTCGATTGGCTGCCCGTTTGCCCGACTAGTGCAATACAGGTTGGTTTCCGGGCAGCGACATGTGAATGAATGTTGGCTTTTATTTTAATTTTTTTCGATTTTTAATTATCTCATTCTTTTAATTACCATATGTGTGAGTGATGATAATCATAATATATAAAATCAAATTTTAGTTTACATATTTTTGCTCCATCATTGTACTATTATTATTCTTAATAATGCTTTACTTATACTTTTAGAGTTTGATTTGTTCATAATGGACGGTGGCTG

mRNA sequence

GTGTTCAATAATGAAGATTCGTGTTGGCGGTGCCAACTGTAATCTAGCTGTCAGTAACTAAAATTCCATTGCCCAAAAATCTCTCCTTCTTCAGCCTTACGCCAAAATTTCATCTCCCATTCCCCTTTTTACCCTTTTCTTCCTTTTCTGTTTTTAATACTCACTTCACAAAACCGCGTCTCCCCTTCTTCAATGGCGTTACTCTTTACACTTTCACTCTCTTGAGTCTTGATCTCCATTCCAAACTCAAATCAACGCTCTACAATCTCTCATTCTACCTTAACCCAACTCCCATTTCTTCAACACCCCAATAACTCCCACAATGCACGCTAAAACCGACTCCGAAGTCACCAGTATCGCCCCTTCTTCTCCGACCAGATCTCCTCGCCGTCCTGTCTACTTCGTTCAGAGCCCTTCCAGAGACTCACACGATGGGGAGAAGACTGCCACCTCCTTTCACTCTACTCCCGTTCTCACTAGTCCCATGGACTCCCCTCCCCATTCTCGCTCCTCCGTCGGCCGTCACTCTAGAGAATCTTCCTCCAGTAGGTTTTCTGGATCTCTTAAACCTGGATCCAGGAAGATCACTCCTAATGACGTCTCTCGCGGCGCACATCGGAAGGGTCAGAAGCCATGGAAGGAATGCGATGTGATCGAAGAGGAAGGTCTTCTTGAAGACGAAGATCGGGGAAAATCTCTTCCTCGTCGCTGTTATGTTCTCGCTTTCATTTTGGGATTTGTTGTTCTTTTCTCTATGTTTGCTTTGATTCTTTGGGGTGCTAGTAGGCCGATGAAGCCCAAGATCACTATGAAGAGCATTACATTCGAGCAATTCAAAATCCAAGCCGGTTCCGATTTTACGGGCGTCGCCACTGATATGGCTTCCGTAAATTCTACTGTGAAACTCATTTTTCGAAACACCGGATCATTCTTCGGCGTCCACGTCTCTCCTACTCCCGTCGATTTATCATATTCCGAAATCACAGTCGCATCAGGAACCGTTAAAAAGTTCTATCAATCACGGAAGAGTCATAGATCTATGACAATCAATGTAATCGGTACCAGAGTCCCACTGTACGGAAGTGGAGCAAGTCTGAGCGGTTCCACTGGAACCCCCGAAACACCATTGCCGTTGAAACTGAGATTCGTGATCAGATCCAGAGCCTACGTGCTGGGCCAATTAGTGAAGCCAAAATTCTACAGACACATCGATTGCCCCATAATTTTCGATTCCAAGAAACTCAATGTCCCCATGTCGCTCAAGAATTGCACAGTCGTTTGAAAACGAAAACGAAAACAAAATCGAGAATCACTTGAACATTTATTAAAATCAGTACTTGTTTTTTTGTTCTTGAGGTGTCTAGGAAATGGCGGACTGCGACAACTGGAACAGAAACGGGGGACGCCAGAACGGGAAGGGTAAAAGTGGAATAAGCGACAGAAACGGGGTTTCGATTGGCTGCCCGTTTGCCCGACTAGTGCAATACAGGTTGGTTTCCGGGCAGCGACATGTGAATGAATGTTGGCTTTTATTTTAATTTTTTTCGATTTTTAATTATCTCATTCTTTTAATTACCATATGTGTGAGTGATGATAATCATAATATATAAAATCAAATTTTAGTTTACATATTTTTGCTCCATCATTGTACTATTATTATTCTTAATAATGCTTTACTTATACTTTTAGAGTTTGATTTGTTCATAATGGACGGTGGCTG

Coding sequence (CDS)

ATGCACGCTAAAACCGACTCCGAAGTCACCAGTATCGCCCCTTCTTCTCCGACCAGATCTCCTCGCCGTCCTGTCTACTTCGTTCAGAGCCCTTCCAGAGACTCACACGATGGGGAGAAGACTGCCACCTCCTTTCACTCTACTCCCGTTCTCACTAGTCCCATGGACTCCCCTCCCCATTCTCGCTCCTCCGTCGGCCGTCACTCTAGAGAATCTTCCTCCAGTAGGTTTTCTGGATCTCTTAAACCTGGATCCAGGAAGATCACTCCTAATGACGTCTCTCGCGGCGCACATCGGAAGGGTCAGAAGCCATGGAAGGAATGCGATGTGATCGAAGAGGAAGGTCTTCTTGAAGACGAAGATCGGGGAAAATCTCTTCCTCGTCGCTGTTATGTTCTCGCTTTCATTTTGGGATTTGTTGTTCTTTTCTCTATGTTTGCTTTGATTCTTTGGGGTGCTAGTAGGCCGATGAAGCCCAAGATCACTATGAAGAGCATTACATTCGAGCAATTCAAAATCCAAGCCGGTTCCGATTTTACGGGCGTCGCCACTGATATGGCTTCCGTAAATTCTACTGTGAAACTCATTTTTCGAAACACCGGATCATTCTTCGGCGTCCACGTCTCTCCTACTCCCGTCGATTTATCATATTCCGAAATCACAGTCGCATCAGGAACCGTTAAAAAGTTCTATCAATCACGGAAGAGTCATAGATCTATGACAATCAATGTAATCGGTACCAGAGTCCCACTGTACGGAAGTGGAGCAAGTCTGAGCGGTTCCACTGGAACCCCCGAAACACCATTGCCGTTGAAACTGAGATTCGTGATCAGATCCAGAGCCTACGTGCTGGGCCAATTAGTGAAGCCAAAATTCTACAGACACATCGATTGCCCCATAATTTTCGATTCCAAGAAACTCAATGTCCCCATGTCGCTCAAGAATTGCACAGTCGTTTGA

Protein sequence

MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDEDRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFTGVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSMTINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPIIFDSKKLNVPMSLKNCTVV*
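
The relationship between the coding sequence and the protein sequence can be checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the function name `translate` is hypothetical, and only the first 27 and last 21 nucleotides of the CDS above are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


head = "ATGCACGCTAAAACCGACTCCGAAGTC"  # first 9 codons of the CDS above
tail = "AAGAATTGCACAGTCGTTTGA"        # last 7 codons, including the stop

print(translate(head))  # MHAKTDSEV
print(translate(tail))  # KNCTVV*
```

Both fragments agree with the start and end of the protein sequence listed above; passing the full CDS to the same function would be the natural next check.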
Homology
BLAST of CSPI02G18990 vs. ExPASy TrEMBL
Match: A0A0A0LNF3 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G356680 PE=4 SV=1)

HSP 1 Score: 629.4 bits (1622), Expect = 8.2e-177
Identity = 319/319 (100.00%), Positives = 319/319 (100.00%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTVV 320
           IFDSKKLNVPMSLKNCTVV
Sbjct: 301 IFDSKKLNVPMSLKNCTVV 319

BLAST of CSPI02G18990 vs. ExPASy TrEMBL
Match: A0A1S3BBJ5 (uncharacterized protein LOC103487879 OS=Cucumis melo OX=3656 GN=LOC103487879 PE=4 SV=1)

HSP 1 Score: 613.2 bits (1580), Expect = 6.1e-172
Identity = 309/318 (97.17%), Positives = 314/318 (98.74%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGF VLFSMFALILWGASRPMKP++TMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFFVLFSMFALILWGASRPMKPRVTMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFGVHVS TPVDL+YSEITVASGTVKKFYQSRKSHRS+
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSSTPVDLTYSEITVASGTVKKFYQSRKSHRSL 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGASLS STGTPETPLPLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASLSSSTGTPETPLPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTV 319
           IFDSKKLNVPMSLKNCTV
Sbjct: 301 IFDSKKLNVPMSLKNCTV 318

BLAST of CSPI02G18990 vs. ExPASy TrEMBL
Match: A0A5A7UYD8 (Late embryogenesis abundant protein, LEA-14 OS=Cucumis melo var. makuwa OX=1194695 GN=E6C27_scaffold501G00450 PE=4 SV=1)

HSP 1 Score: 611.3 bits (1575), Expect = 2.3e-171
Identity = 308/318 (96.86%), Positives = 313/318 (98.43%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGF VLFSMFALILWGASRPMKP++TMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFFVLFSMFALILWGASRPMKPRVTMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFGVHVS TPVDL+YSEITVASGTVKKFYQSRKSHRS+
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSSTPVDLTYSEITVASGTVKKFYQSRKSHRSL 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGASLS STGTPETPLPLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASLSSSTGTPETPLPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTV 319
           IFD KKLNVPMSLKNCTV
Sbjct: 301 IFDPKKLNVPMSLKNCTV 318

BLAST of CSPI02G18990 vs. ExPASy TrEMBL
Match: A0A6J1BRX1 (uncharacterized protein LOC111005194 OS=Momordica charantia OX=3673 GN=LOC111005194 PE=4 SV=1)

HSP 1 Score: 581.3 bits (1497), Expect = 2.6e-162
Identity = 296/321 (92.21%), Positives = 309/321 (96.26%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSP-RRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPP 60
           MHAKTDSEVTS+APSSPTRSP RRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPP
Sbjct: 1   MHAKTDSEVTSLAPSSPTRSPGRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPP 60

Query: 61  HSRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGA-HRKGQKPWKECDVIEEEGLLE 120
           HSRSSVGRHSRESSS+RFSGSLKPGSRKI+PNDVSRGA +RKGQKPWKECDVIEEEGLLE
Sbjct: 61  HSRSSVGRHSRESSSTRFSGSLKPGSRKISPNDVSRGAGNRKGQKPWKECDVIEEEGLLE 120

Query: 121 DEDRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSD 180
           DEDR  SLPRRCYVLAFILGF VLFSMFALILWGAS+PMKPKITMKSITFEQFKIQAGSD
Sbjct: 121 DEDRANSLPRRCYVLAFILGFFVLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSD 180

Query: 181 FTGVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHR 240
           FTGVATDMASVNSTVKL FRNTGSFFGVHV+ TPVDL+YSEI+VASG+VKKFYQSRKS R
Sbjct: 181 FTGVATDMASVNSTVKLTFRNTGSFFGVHVTSTPVDLTYSEISVASGSVKKFYQSRKSQR 240

Query: 241 SMTINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDC 300
           S+TINVIGTR+PLYGSGASLS STGTP TP+PLKL FV+RSRAYVLGQLVKPKFYRHIDC
Sbjct: 241 SLTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVVRSRAYVLGQLVKPKFYRHIDC 300

Query: 301 PIIFDSKKLNVPMSLKNCTVV 320
           PIIFD KKLNVPMSLKNCTVV
Sbjct: 301 PIIFDPKKLNVPMSLKNCTVV 321

BLAST of CSPI02G18990 vs. ExPASy TrEMBL
Match: A0A6J1KAA3 (uncharacterized protein LOC111492078 OS=Cucurbita maxima OX=3661 GN=LOC111492078 PE=4 SV=1)

HSP 1 Score: 568.9 bits (1465), Expect = 1.3e-158
Identity = 283/318 (88.99%), Positives = 303/318 (95.28%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTS A SSPTRSPRRP Y+VQSPSRDSHDG+KTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSNATSSPTRSPRRPAYYVQSPSRDSHDGDKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSS+GRHSRESSS+RFSGSLKPGSRKI+PNDVSR  HRKGQKPW +CD I+EEGLLEDE
Sbjct: 61  SRSSLGRHSRESSSTRFSGSLKPGSRKISPNDVSRAPHRKGQKPWNDCDAIQEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           D+GKSLPRRCY+LAFILGF++LFS FAL+LWGASRPMKPKITMKSITFEQF+IQAGSDFT
Sbjct: 121 DQGKSLPRRCYLLAFILGFLLLFSFFALVLWGASRPMKPKITMKSITFEQFRIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFG+HVS +PVDL+YSEI+VASGTVKKFYQSRKS RS+
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGIHVSSSPVDLTYSEISVASGTVKKFYQSRKSQRSL 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TI+VIGTRVPLYGSGASLS STGT  TP+PLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TIHVIGTRVPLYGSGASLSSSTGTSATPVPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTV 319
           IFD KKLNVPMSLKNCTV
Sbjct: 301 IFDPKKLNVPMSLKNCTV 318

BLAST of CSPI02G18990 vs. NCBI nr
Match: XP_004142871.1 (uncharacterized protein LOC101203977 [Cucumis sativus] >KGN62494.1 hypothetical protein Csa_018716 [Cucumis sativus])

HSP 1 Score: 629.4 bits (1622), Expect = 1.7e-176
Identity = 319/319 (100.00%), Positives = 319/319 (100.00%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTVV 320
           IFDSKKLNVPMSLKNCTVV
Sbjct: 301 IFDSKKLNVPMSLKNCTVV 319

BLAST of CSPI02G18990 vs. NCBI nr
Match: XP_008444608.1 (PREDICTED: uncharacterized protein LOC103487879 [Cucumis melo])

HSP 1 Score: 613.2 bits (1580), Expect = 1.3e-171
Identity = 309/318 (97.17%), Positives = 314/318 (98.74%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGF VLFSMFALILWGASRPMKP++TMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFFVLFSMFALILWGASRPMKPRVTMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFGVHVS TPVDL+YSEITVASGTVKKFYQSRKSHRS+
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSSTPVDLTYSEITVASGTVKKFYQSRKSHRSL 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGASLS STGTPETPLPLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASLSSSTGTPETPLPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTV 319
           IFDSKKLNVPMSLKNCTV
Sbjct: 301 IFDSKKLNVPMSLKNCTV 318

BLAST of CSPI02G18990 vs. NCBI nr
Match: KAA0060912.1 (Late embryogenesis abundant protein, LEA-14 [Cucumis melo var. makuwa])

HSP 1 Score: 611.3 bits (1575), Expect = 4.8e-171
Identity = 308/318 (96.86%), Positives = 313/318 (98.43%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGF VLFSMFALILWGASRPMKP++TMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFFVLFSMFALILWGASRPMKPRVTMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIFRNTGSFFGVHVS TPVDL+YSEITVASGTVKKFYQSRKSHRS+
Sbjct: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSSTPVDLTYSEITVASGTVKKFYQSRKSHRSL 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGASLS STGTPETPLPLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASLSSSTGTPETPLPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTV 319
           IFD KKLNVPMSLKNCTV
Sbjct: 301 IFDPKKLNVPMSLKNCTV 318

BLAST of CSPI02G18990 vs. NCBI nr
Match: XP_038884165.1 (uncharacterized protein LOC120075075 [Benincasa hispida])

HSP 1 Score: 610.5 bits (1573), Expect = 8.1e-171
Identity = 307/318 (96.54%), Positives = 314/318 (98.74%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTSIAPSSPTRSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH
Sbjct: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           SRSSVGRHSRESSSSRFSGSLKPGSRKI+PNDVSRGAHRKGQKPWKECDVIEEEGLLEDE
Sbjct: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKISPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT
Sbjct: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GVATDMASVNSTVKLIF+NTGSFFGVHVSPTPV+L+YSEITVASGTVKKFYQSRKSHRS+
Sbjct: 181 GVATDMASVNSTVKLIFKNTGSFFGVHVSPTPVELTYSEITVASGTVKKFYQSRKSHRSL 240

Query: 241 TINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPI 300
           TINVIGTRVPLYGSGAS S STGTPETPLPLKL FVIRSRAYVLGQLVKPKFYRHIDCPI
Sbjct: 241 TINVIGTRVPLYGSGASFSSSTGTPETPLPLKLSFVIRSRAYVLGQLVKPKFYRHIDCPI 300

Query: 301 IFDSKKLNVPMSLKNCTV 319
           IFD KKLNVP+SLKNCTV
Sbjct: 301 IFDPKKLNVPISLKNCTV 318

BLAST of CSPI02G18990 vs. NCBI nr
Match: XP_022132311.1 (uncharacterized protein LOC111005194 [Momordica charantia])

HSP 1 Score: 581.3 bits (1497), Expect = 5.3e-162
Identity = 296/321 (92.21%), Positives = 309/321 (96.26%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSP-RRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPP 60
           MHAKTDSEVTS+APSSPTRSP RRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPP
Sbjct: 1   MHAKTDSEVTSLAPSSPTRSPGRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPP 60

Query: 61  HSRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGA-HRKGQKPWKECDVIEEEGLLE 120
           HSRSSVGRHSRESSS+RFSGSLKPGSRKI+PNDVSRGA +RKGQKPWKECDVIEEEGLLE
Sbjct: 61  HSRSSVGRHSRESSSTRFSGSLKPGSRKISPNDVSRGAGNRKGQKPWKECDVIEEEGLLE 120

Query: 121 DEDRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSD 180
           DEDR  SLPRRCYVLAFILGF VLFSMFALILWGAS+PMKPKITMKSITFEQFKIQAGSD
Sbjct: 121 DEDRANSLPRRCYVLAFILGFFVLFSMFALILWGASKPMKPKITMKSITFEQFKIQAGSD 180

Query: 181 FTGVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHR 240
           FTGVATDMASVNSTVKL FRNTGSFFGVHV+ TPVDL+YSEI+VASG+VKKFYQSRKS R
Sbjct: 181 FTGVATDMASVNSTVKLTFRNTGSFFGVHVTSTPVDLTYSEISVASGSVKKFYQSRKSQR 240

Query: 241 SMTINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDC 300
           S+TINVIGTR+PLYGSGASLS STGTP TP+PLKL FV+RSRAYVLGQLVKPKFYRHIDC
Sbjct: 241 SLTINVIGTRIPLYGSGASLSSSTGTPATPVPLKLSFVVRSRAYVLGQLVKPKFYRHIDC 300

Query: 301 PIIFDSKKLNVPMSLKNCTVV 320
           PIIFD KKLNVPMSLKNCTVV
Sbjct: 301 PIIFDPKKLNVPMSLKNCTVV 321

BLAST of CSPI02G18990 vs. TAIR 10
Match: AT1G45688.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 258 Blast hits to 242 proteins in 39 species: Archae - 0; Bacteria - 11; Metazoa - 10; Fungi - 14; Plants - 198; Viruses - 17; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 424.9 bits (1091), Expect = 5.9e-119
Identity = 217/340 (63.82%), Positives = 265/340 (77.94%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTS+A SSP RSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVL SPM SPPH
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVL-SPMGSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           S SS+GRHSRESSSSRFSGSLKPGSRK+ PND S+     G+K WKEC VIEEEGLL+D 
Sbjct: 61  SHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDDG 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DR   +PRRCYVLAFI+GF +LF  F+LIL+GA++PMKPKIT+KSITFE  KIQAG D  
Sbjct: 121 DRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAG 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSM 240
           GV TDM ++N+T+++++RNTG+FFGVHV+ TP+DLS+S+I + SG+VKKFYQ RKS R++
Sbjct: 181 GVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVKKFYQGRKSERTV 240

Query: 241 TINVIGTRVPLYGSGASL---------------------SGSTGTPETPLPLKLRFVIRS 300
            ++VIG ++PLYGSG++L                           P  P+P+ L FV+RS
Sbjct: 241 LVHVIGEKIPLYGSGSTLLPPAPPAPLPKPKKKKGAPVPIPDPPAPPAPVPMTLSFVVRS 300

Query: 301 RAYVLGQLVKPKFYRHIDCPIIFDSKKLNVPMSL-KNCTV 319
           RAYVLG+LV+PKFY+ I+C I F+ K LN  + + KNCTV
Sbjct: 301 RAYVLGKLVQPKFYKKIECDINFEHKNLNKHIVITKNCTV 339

BLAST of CSPI02G18990 vs. TAIR 10
Match: AT5G42860.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 19 plant structures; EXPRESSED DURING: 11 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G45688.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink). )

HSP 1 Score: 359.0 bits (920), Expect = 4.0e-99
Identity = 193/341 (56.60%), Positives = 249/341 (73.02%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTS++ SSPTRSPRRP YFVQSPSRDSHDGEKTATSFHSTPVLTSPM SPPH
Sbjct: 1   MHAKTDSEVTSLSASSPTRSPRRPAYFVQSPSRDSHDGEKTATSFHSTPVLTSPMGSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           S          SSSSRFS        KI       G+ RKG    K+  +IEEEGLL+D 
Sbjct: 61  S---------HSSSSRFS--------KI------NGSKRKGHAGEKQFAMIEEEGLLDDG 120

Query: 121 DR-GKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDF 180
           DR  ++LPRRCYVLAFI+GF +LF+ F+LIL+ A++P KPKI++KSITFEQ K+QAG D 
Sbjct: 121 DREQEALPRRCYVLAFIVGFSLLFAFFSLILYAAAKPQKPKISVKSITFEQLKVQAGQDA 180

Query: 181 TGVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRS 240
            G+ TDM ++N+T+++++RNTG+FFGVHV+ +P+DLS+S+IT+ SG++KKFYQSRKS R+
Sbjct: 181 GGIGTDMITMNATLRMLYRNTGTFFGVHVTSSPIDLSFSQITIGSGSIKKFYQSRKSQRT 240

Query: 241 MTINVIGTRVPLYGSGASLSG--------------------STGTPETPLPLKLRFVIRS 300
           + +NV+G ++PLYGSG++L                          P  P+P++L F +RS
Sbjct: 241 VVVNVLGDKIPLYGSGSTLVPPPPPAPIPKPKKKKGPIVIVEPPAPPAPVPMRLNFTVRS 300

Query: 301 RAYVLGQLVKPKFYRHIDCPIIFDSKKL--NVPMSLKNCTV 319
           RAYVLG+LV+PKFY+ I C I F+ KKL  ++P++  NCTV
Sbjct: 301 RAYVLGKLVQPKFYKRIVCLINFEHKKLSKHIPIT-NNCTV 317

BLAST of CSPI02G18990 vs. TAIR 10
Match: AT1G45688.2 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: plasma membrane; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT5G42860.1); Has 35333 Blast hits to 34131 proteins in 2444 species: Archae - 798; Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants - 531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI BLink). )

HSP 1 Score: 335.1 bits (858), Expect = 6.1e-92
Identity = 169/239 (70.71%), Positives = 202/239 (84.52%), Query Frame = 0

Query: 1   MHAKTDSEVTSIAPSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPH 60
           MHAKTDSEVTS+A SSP RSPRRPVY+VQSPSRDSHDGEKTATSFHSTPVL SPM SPPH
Sbjct: 1   MHAKTDSEVTSLAASSPARSPRRPVYYVQSPSRDSHDGEKTATSFHSTPVL-SPMGSPPH 60

Query: 61  SRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDE 120
           S SS+GRHSRESSSSRFSGSLKPGSRK+ PND S+     G+K WKEC VIEEEGLL+D 
Sbjct: 61  SHSSMGRHSRESSSSRFSGSLKPGSRKVNPNDGSKRKGHGGEKQWKECAVIEEEGLLDDG 120

Query: 121 DRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFT 180
           DR   +PRRCYVLAFI+GF +LF  F+LIL+GA++PMKPKIT+KSITFE  KIQAG D  
Sbjct: 121 DRDGGVPRRCYVLAFIVGFFILFGFFSLILYGAAKPMKPKITVKSITFETLKIQAGQDAG 180

Query: 181 GVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTV----KKFYQSRK 236
           GV TDM ++N+T+++++RNTG+FFGVHV+ TP+DLS+S+I + SG+V    +K Y+ R+
Sbjct: 181 GVGTDMITMNATLRMLYRNTGTFFGVHVTSTPIDLSFSQIKIGSGSVSLPIQKLYRMRE 238

BLAST of CSPI02G18990 vs. TAIR 10
Match: AT2G41990.1 (CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterPro:IPR004864); BEST Arabidopsis thaliana protein match is: Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family (TAIR:AT4G35170.1); Has 172 Blast hits to 168 proteins in 15 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 172; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). )

HSP 1 Score: 180.6 bits (457), Expect = 1.9e-45
Identity = 128/315 (40.63%), Positives = 178/315 (56.51%), Query Frame = 0

Query: 1   MHAKTDSEVTSI--APSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSP 60
           MHAKTDSE TSI  A  SP RS  RP+Y+VQSPS  +HD EK   SF S   L      P
Sbjct: 1   MHAKTDSEATSIDAAALSPPRSAIRPLYYVQSPS--NHDVEK--MSFGSGCSLMGSPTHP 60

Query: 61  PHSRSSVGRHSRESSSSRFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLE 120
            +   S   HSRESS+SRFS       + I          R+ ++   + D   + G  +
Sbjct: 61  HYYHCSPIHHSRESSTSRFSDRALLSYKSI----------RERRRYINDGDDKTDGG--D 120

Query: 121 DEDRGKSLPRRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSD 180
           D+D  +++  R YV   +L  + LF++F+LILWGAS+   PK+T+K +      +QAG+D
Sbjct: 121 DDDPFRNV--RLYVW-LLLSVIFLFTVFSLILWGASKSYPPKVTVKGMLVRDLNLQAGND 180

Query: 181 FTGVATDMASVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHR 240
            +GV TDM S+NSTV++ +RN  +FF VHV+ +P+ L YS + ++SG + KF   R    
Sbjct: 181 LSGVPTDMLSLNSTVRIYYRNPSTFFAVHVTASPLLLHYSNLLLSSGEMNKFTVGRNGET 240

Query: 241 SMTINVIGTRVPLYGSGASLSGSTGTPETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDC 300
           ++   V G ++PLYG G S    T      LPL L  V+ S+AY+LG+LV  KFY  I C
Sbjct: 241 NVVTVVQGHQIPLYG-GVSFHLDT----LSLPLNLTIVLHSKAYILGRLVTSKFYTRIIC 291

Query: 301 PIIFDSKKLNVPMSL 314
               D+  L   +SL
Sbjct: 301 SFTLDANHLPKSISL 291

BLAST of CSPI02G18990 vs. TAIR 10
Match: AT4G35170.1 (Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family )

HSP 1 Score: 174.5 bits (441), Expect = 1.4e-43
Identity = 118/312 (37.82%), Positives = 173/312 (55.45%), Query Frame = 0

Query: 13  APSSPTRSPRRPVYFVQSPSRDSHDGEKTATSFHSTPVLTSPMDSPPHSRSSVG---RHS 72
           A SSP ++ R+PVY V SP     D   T + F       SP  SP + +  V     HS
Sbjct: 6   ARSSP-QNTRKPVYVVHSPPNTDVDKISTGSGF-------SPFGSPLNDQGQVSNFQHHS 65

Query: 73  RESSSS--RFSGSLKPGSRKITPNDVSRGAHRKGQKPWKECDVIEEEGLLEDEDRGKSLP 132
              SSS  R SG L+     +  +D+ R  H       ++ D  E +G    +++ + + 
Sbjct: 66  VAESSSYPRSSGPLRNEYSSVQVHDLDRRTH-------EDEDYDEMDG---PDEKRRRIT 125

Query: 133 RRCYVLAFILGFVVLFSMFALILWGASRPMKPKITMKSITFEQFKIQAGSDFTGVATDMA 192
           R    L F L  V+ F++F LILWG S+   P  T+K +  E   +Q+G+D +GV TDM 
Sbjct: 126 RFYSCLLFTL--VLAFTLFCLILWGVSKSFAPIATLKEMVLENLNVQSGNDQSGVLTDML 185

Query: 193 SVNSTVKLIFRNTGSFFGVHVSPTPVDLSYSEITVASGTVKKFYQSRKSHRSMTINVIGT 252
           ++NSTV++++RN  +FF VHV+  P+ LSYS++ +ASG + +F Q RKS R +   V G 
Sbjct: 186 TLNSTVRILYRNPATFFTVHVTSAPLQLSYSQLILASGQMGEFSQRRKSERIIETKVFGD 245

Query: 253 RVPLYGSGASLSGSTGTP-ETPLPLKLRFVIRSRAYVLGQLVKPKFYRHIDCPIIFDSKK 312
           ++PLYG   +L G    P +  LPL L F +R+RAYVLG+LVK  F+ +I C I F   K
Sbjct: 246 QIPLYGGVPALFGQRAEPDQVVLPLNLTFTLRARAYVLGRLVKTTFHSNIKCSITFYGDK 297

Query: 313 LNVPMSL-KNCT 318
           L   + L K+C+
Sbjct: 306 LGKTLDLSKSCS 297

The following BLAST results are available for this feature:

ExPASy TrEMBL
Match Name      E-value   Identity  Description
A0A0A0LNF3      8.2e-177  100.00    Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_2G356680 PE=4 SV=1
A0A1S3BBJ5      6.1e-172  97.17     uncharacterized protein LOC103487879 OS=Cucumis melo OX=3656 GN=LOC103487879 PE=...
A0A5A7UYD8      2.3e-171  96.86     Late embryogenesis abundant protein, LEA-14 OS=Cucumis melo var. makuwa OX=11946...
A0A6J1BRX1      2.6e-162  92.21     uncharacterized protein LOC111005194 OS=Momordica charantia OX=3673 GN=LOC111005...
A0A6J1KAA3      1.3e-158  88.99     uncharacterized protein LOC111492078 OS=Cucurbita maxima OX=3661 GN=LOC111492078...

NCBI nr
Match Name      E-value   Identity  Description
XP_004142871.1  1.7e-176  100.00    uncharacterized protein LOC101203977 [Cucumis sativus] >KGN62494.1 hypothetical ...
XP_008444608.1  1.3e-171  97.17     PREDICTED: uncharacterized protein LOC103487879 [Cucumis melo]
KAA0060912.1    4.8e-171  96.86     Late embryogenesis abundant protein, LEA-14 [Cucumis melo var. makuwa]
XP_038884165.1  8.1e-171  96.54     uncharacterized protein LOC120075075 [Benincasa hispida]
XP_022132311.1  5.3e-162  92.21     uncharacterized protein LOC111005194 [Momordica charantia]

TAIR 10
Match Name      E-value   Identity  Description
AT1G45688.1     5.9e-119  63.82     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT5G42860.1     4.0e-99   56.60     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT1G45688.2     6.1e-92   70.71     unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic...
AT2G41990.1     1.9e-45   40.63     CONTAINS InterPro DOMAIN/s: Late embryogenesis abundant protein, group 2 (InterP...
AT4G35170.1     1.4e-43   37.82     Late embryogenesis abundant (LEA) hydroxyproline-rich glycoprotein family
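
Each HSP header in the Homology section reports a bit score, a raw score, an E-value, and an identity count over the aligned length; the Identity column in the summary is that count expressed as a percentage. As an illustrative sketch (the names `HSP_RE`, `IDENT_RE`, and `parse_hsp` are hypothetical, and the patterns are only assumed to fit the header layout shown on this page), the values can be extracted and the percentage recomputed like this:

```python
import re

# Patterns written for the two-line HSP header layout used in this record.
HSP_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)"
)
IDENT_RE = re.compile(r"Identity\s*=\s*(?P<num>\d+)/(?P<den>\d+)")


def parse_hsp(score_line, identity_line):
    """Return the scores of one HSP plus a recomputed percent identity."""
    score = HSP_RE.search(score_line)
    ident = IDENT_RE.search(identity_line)
    if score is None or ident is None:
        raise ValueError("lines do not look like an HSP header")
    num, den = int(ident["num"]), int(ident["den"])
    return {
        "bits": float(score["bits"]),
        "raw": int(score["raw"]),
        "evalue": float(score["evalue"]),
        "identical": num,
        "aligned": den,
        "pct_identity": round(100 * num / den, 2),
    }


# Header of the Cucumis melo hit A0A1S3BBJ5, copied from the report above.
hit = parse_hsp(
    "HSP 1 Score: 613.2 bits (1580), Expect = 6.1e-172",
    "Identity = 309/318 (97.17%)",
)
print(hit["pct_identity"])  # 97.17
```

The recomputed value matches the 97.17 listed for A0A1S3BBJ5, which suggests the summary percentages are identical residues divided by alignment length rather than by query length.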
InterPro
Analysis Name: InterPro Annotations of Cucumber (PI 183967) v1
Date Performed: 2021-10-25
IPR Term  IPR Description   Source       Source Term      Source Description         Alignment
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction        coord: 1..105
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction        coord: 42..62
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction        coord: 70..87
None      No IPR available  MOBIDB_LITE  mobidb-lite      disorder_prediction        coord: 8..25
None      No IPR available  PANTHER      PTHR31852:SF180  PROTEIN, PUTATIVE-RELATED  coord: 60..317
None      No IPR available  PANTHER      PTHR31852        LATE EMBRYOGENESIS ABUNDANT (LEA) HYDROXYPROLINE-RICH GLYCOPROTEIN FAMILY  coord: 60..317
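
The four mobidb-lite disorder predictions above overlap, so summing their lengths would overstate how much of the protein is predicted to be disordered. A minimal sketch of merging the ranges first is given below; the function name `merge_intervals` is hypothetical, the coordinates are assumed to be 1-based and inclusive, and the protein length of 319 residues is taken from the 319/319 self-match in the Homology section.

```python
def merge_intervals(intervals):
    """Merge overlapping or adjacent inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Overlaps or touches the previous range: extend it.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(pair) for pair in merged]


# mobidb-lite coordinates from the InterPro annotations above.
disorder = [(1, 105), (42, 62), (70, 87), (8, 25)]
protein_length = 319  # assumption: residues excluding the stop symbol

merged = merge_intervals(disorder)
covered = sum(end - start + 1 for start, end in merged)
print(merged, covered, round(100 * covered / protein_length, 1))
# [(1, 105)] 105 32.9
```

Under these assumptions the three shorter predictions all fall inside 1..105, so roughly the N-terminal third of the protein is predicted to be disordered.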

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G18990.1  CSPI02G18990.1  mRNA


GO Annotation
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
cellular_component  GO:0016021      integral component of membrane