CSPI02G08830 (gene) Wild cucumber (PI 183967)

Name: CSPI02G08830
Type: gene
Organism: Cucumis sativus (Wild cucumber (PI 183967))
Description: Loricrin-like protein
Location: Chr2 : 8381419 .. 8386650 (+)
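
The location string above can be turned into a span length. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GFF-style convention; the record itself does not state this). The helper name `parse_location` is hypothetical and is not part of any database API.

```python
import re

def parse_location(text):
    """Parse 'Chr2 : 8381419 .. 8386650 (+)' into (chrom, start, end, strand)."""
    m = re.fullmatch(r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

chrom, start, end, strand = parse_location("Chr2 : 8381419 .. 8386650 (+)")
# Assumption: 1-based inclusive coordinates, so both end points count.
span = end - start + 1
print(chrom, strand, span)
```

Under that assumption the feature spans 5232 bp on the plus strand of Chr2.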
The following sequences are available for this feature:

Gene sequence (with intron)

CTTTCTCTCTTTTTTCCCCCATTGCTCTCCCCCCCTCTCTCTCAAATCTCTCTAACAAGGTTTCCACATGCCTTAGGTTGCTACTCTACTTTCTACTCTTCCCGGATTCTCTTCTACTAATTGCTTCCTCACGGTACATCATTTCTGTTTCCTTTTCCTTTCTCCTATTCTTCCCTGATTCTGCCTTCTCTTTTCTCTTAGATTTTTTTTTTTTCTTTTTGTGTTTTCGGATTTCATTTCATTTCTGTCTTTTCTTTTACTTCAATCTATTACTATTAGGACTAGAATCTTCCACTATTACCTGATTATCTATGTGTCTTGTGCATTTTGTTGTTAGAAATCCTAACTTTTGGTTTCTGGTCGCCTCGTCTTTTTTTTTCTCTCGATATGATGGTTGTGATTGGGCGTACGTATCTTCTTTTTGAATCTCTTTTGCTCTCTGTATATGGATTTGTTTGTTTGTTTGTTTGTTTTTTTTTTAAAATTCCTTAGTTGTTGAATTGGGATGTATTTGAAGATCTAACAAGGGTATCCTTTAATTATTTTTTATTTGCATGATTAAGCCCTGTTTATACCCAAAAGTTTATTTTCTTGATTTTCTGAGGGAGTGAAGCTAGTGATTCGTGTCGCCTATTCTGATTCATGGAGGTTGACTTCATTTGTATTATTTGCTTTTTCTTCGATTTGCGTGAAGTTCTGTTTTCTTTATGGTAAGAAATCCTATTTTTTGTTTTTTTGTTTTTTTTATATAAAGTATATTTACTACTACTTGGCCGGTTGTTACTTTATGGTTTCAGAACATTTTGAGGCGATATGCTATGATGCTAATCAGTGGTAGCTGTAATTTTAAAAGGAGATTAGGAATTGGAACATATTTACTGCTGAAGAATCCTTATTGCACTTCAAAACTATTGAGTCTTGTGTTCTATATCAGGAAACTGGGGGCGATATAGAAAACTAGTTTTTGTCAAAAACATGAATGAACAAAATCTAAGGAGAGAAACTCACAAAAAAGATTGAAAGAATAGGGTCTTCTTGAAACAAGGGGGACACCGTCACAAAACAATTATAGAAGTGTTTCCCAAACTTCTGGTAATAAAAAACAAAGGATAGTTATATGATAGAGTCCATGGATAAGAGTTGGATTGGATATTACCACATTTCAAATCTTCACATTCTCCCAATAGTTGTTTTTTCAGAAGATCACTCCGTTTCGTTTCTGCTTTTAGGGAAGAAAAACCTAGGAACTTGCCTGCCAAGGAACCTGAACCCTTCCCTGGAAATGAAAATCTGCTACTTCCATTACCGGTTCTTGGATGTTCGATATTTGATGAGTGAAGGCTAAACTGAACTCTTAACAAAACTGATTTCAAATTCTAGCTGATTAGTAGTATAACCACAAAAGATTATTTACAATTTCTGCATAAAATCAGAAGTAACAGTTCTTCCCAAGGAGAAACACAATTCAAGATATGTTATTATTATTTTTGTTGTGGACATGGTCATGATTTTATTCACAAGGAAGGATAAAAAAAAGAGTTGTTTCCACTCAAAGAGCTACGAGAGCTTTTGAGAAAGGATTTCACTCATGTTAGTTTGGGTTTCTGAATATCCTTCTTGCATTTGTAAATCCTATAAAACTTATAATTGTTATTGTTGCTAATTATCAAGGACACGCCATATAGTGAAATGTCTCAATGAAAAATGTGATAAGAAAAGGTATTTCTAAGTGAAGTGTACAAGACTAAACCTGTTACTGAAGTTGGATGCTTAAAAGAAGTTGACAAACAAAAGGGTATCATTTCAACAATCATATAAACTTTAGCTCATTATCTACATGTATGCGTTCTGCAGCATCAGCAAAATTACATCTGTTTCAACTTAGTTTTTATGTTGTTCTATTTCTTTCTATTTTACCCTCTTTACTGCTTGCTTTCTCATTAGTTTTTCCTTACTTTCTGTTGCATCCACGTGACTTAAAGAGACTCTGACCTCATCTA
CCATGCGTCCAAAATTCCAAATGGATTTACAACGAATATGCATTGAAGTTTGTATATAATTGTTTGTCCATTTATGAAGTTGTAAAGAAGCAACGACAATAAATAAAAAGGTCCAAAGTAATGTTTCTGCTATTGTTGGAGTTGTGAACTGATATGTTATTACATGCTCCTCTAGTGCCATTTTTCAAGCATGTAACCATTTCATTTATTGTTTGCATTTAACTTAAATTTTGGATTAGCATCCTGCCACTGCACTCATTTCAGATTGGTGGTTATTGTATCTGTCCATAACTTTATTTATGTTTCAATTAAGAAGTCATGTTATTTCAGAAAGTTGGGATGTAGAACTTGTAACTTTTTTCTATAATGTTCTCAAAATTGTATTGACAGTAGTCATCTGACTAGAGATTTATTTTAAAACATTCTTCCTGACATCAACTTCAAATTCCTCTTTGCTTCGACAGTACCTACTATTCTGCTATTGTTGTGTCCAACAATGAGTTTGAATTGATGGAGAGGCATCATTTCATTGACATTGGTCGTGTGAACTGTCTTATAAGATGTACTACTTTCAGGGTTGATGAAGCTGAATGAGTTCTCATCAAGGAGCATAAGGAGCTCCTCATATTTCTGCATATCAACATTACTCGTTGTCATCAAAGGGTTCTATCCAAGAAACGTCATTAGATATACTAGGAATTGATTAAAAAAAATTGTTGCATTATTGGTGGGAAGATCTTTGGTTAGCATATAAAGCTGATGGATAGATCAGATACGTCGGGCTTTGTTAGCGGTGGAGGTAGTAGTAATCTGTTAGAATCTGTTTTGGGTGCTGTTATGTACATCTGCTTTTAGTATTCCTTACATCTTGTTGGCCAGATTATCCTGTTAGTTTGAACTTCAGTGTTGAAAGAATGGATTTGAACAAGACTGTAGCACATTATTCTCAAAATGCTGATCTCACAAAAGATGACAACTTTGGTGACACTACTTTGAGCTTGAATTGTTTTGGCTTTGGAGGAAGAAAATCTTCTGGATGTGAGGTTGCTTTAAATGACCTCAACTTCAACTTCAGTTACTCTCCAGATGATGGATGTAGACTGGTACTTGGACTTGGTCCAACTCCAAGTGCTAACTGTGATGATTATTACAATGTTGGATATAATAAGACTAAGGCACAAGTTGCATCTGTACCAGAGGAAATATCACCGAGTGACTCAGTATTGCAGCTTGGTCTTTCTGGAGGGACTAATGAAGTTTCAAGTGTGGTCGAATGTTCAGTTTCAGCAGAAACTGATGTCAGTACAACTTATCTGATAAGCCAATGGGCTGCTGAAGCTAATCAACTGTCTATCCCACTAGTTGACGAGGGTTCTACCTCAGCAAAAAAATCAGGTGGGTATATGCCATCACTTCTTTTTGCTCCAAGAATGGGCATTTCAAACATTCTGATTCAGCAGGAGATTCTTGAAACTGATAGCAGAAATCAGCTGAGCCAAGGACTATCACCTACTGTAGAATACTCTTTAGGAACTGTCATTGACCAGACAACCAAAAGCGTGTGTTCAGATCATCAGGCAAATAATCCTAAAAGGTGCAAATACTTTGGCTGCGAGAAGGGTGCACGAGGAGCATCTGGTCTTTGTATTGGTCATGGAGGTGGACATAGATGCCAGAAACCTGGGTGCAATAAGGGTGCTGAGAGTCGTACTGCTTACTGTAAAGCTCATGGTGGAGGAAGGAGGTGCCAACATTTAGGGTGCACCAAAAGTGCCGAGGGGAAGACAGAATTTTGCATCGCTCATGGTGGTGGTAGGCGTTGTGGATATTCAGGTGGATGTGCGAAGGCTGCACGTGGAAAGTCAGGCCTTTGTATTAGACATGGTGGTGGAAAACGATGTAAGATGGATGGCTGCACTCGTAGTGCTGAAGGACACGCTGGTCTATGCATTTCTCATGGTGGTGGACGCCGTTGCCAGTATGAACGCTGCACAAAAGGT
GCACAAGGAAGTACCATGTATTGTAAGGCCCATGGGGGAGGGAAACGATGTATATTTGCCGGATGCACAAAAGGTGCTGAAGGAAGTACTCCTCTTTGCAAGGGACATGGTGGGGGAAAGCGTTGCCTCTTTGATGGGGGTGGGATTTGCCCAAAAAGTGTACATGGGGGCACAAACTTTTGTGTTGCTCATGGTGGTGGAAAGAGGTGTGTTGTGTCAGGATGCACAAAAAGTGCTCGTGGGCGCACTGATTGCTGTGTCAGACATGGTGGAGGCAAGCGATGCAAATTCGAGAACTGTGGAAAGAGTGCCCAAGGGAGCACAGACTTCTGCAAAGCCCATGGAGGTGGAAAACGATGCACATGGGGAGAAGGCAAGTGCGAAAAATTTGCAAGGGGTAAAAGTGGTTTGTGTGCTGCTCATAGTAGCATGATCCAAGATCGAGAAACGAACAAGGGAAGCCTGATTGGACCGGGACTTTTCCATGGTCTAGTATCTGCTTCTGCTGCGTCTACAGTTGGAGATAGCTTTGACCACTATAAGTCATCTTCTGCAATCAGTTTCATATGTGATTCAATTGATTCTGCAGAGAAGCCTATGAAGCGACATCAGCTTATACCACCACAGGTATTGGTTCCATCCTCAATGAAATCATCTGCTTCATATTCTAGTTTCTTGAGTACAGAGAAGGGAGAGGAAGATGGGAATGGATATTGTATTGGCACAAAATTCCTTGAATATTCAATTCCTGAGGGAAGAGTTCATGGAGGTGGGCTCATGTCATTGCTTGGCGGGCATTTGAAGATGAAGAATATGAGTGATGGCATTTAAGGATATCTGATGATCCGAAAGCCAGTGGAAATGGAGAAATTCAGGTAATCTAGGAGTTATTTAATAACGCATTGGAATCATCATGTACATAAAACTTCAGTTGGGGATTTGTGAACCGTGCACTGTAATAATGATGATTAGATTTGTAGTTTTACATTGTGGAATTTGTCAGGGCTATCTGATTTAAAACATTATATCAGTACTTTTGTGTCAATAGGCTCTTTTATTTGTATATCGTTATAAAAGCCAGGCCTATGCTTTGGTGATTTGTAGTGGTGGTGTTACTTTCTGAAAAACTTTGTCCACAGCTACTCCAAGTTTTGAGTGTTTGTTGATGCTTTCTGATCCGTACAGAATTGAGCTATAAATACACACATTTTATTTGGAAAGTAGGAAAAGCA

mRNA sequence

ATGGATTTGAACAAGACTGTAGCACATTATTCTCAAAATGCTGATCTCACAAAAGATGACAACTTTGGTGACACTACTTTGAGCTTGAATTGTTTTGGCTTTGGAGGAAGAAAATCTTCTGGATGTGAGGTTGCTTTAAATGACCTCAACTTCAACTTCAGTTACTCTCCAGATGATGGATGTAGACTGGTACTTGGACTTGGTCCAACTCCAAGTGCTAACTGTGATGATTATTACAATGTTGGATATAATAAGACTAAGGCACAAGTTGCATCTGTACCAGAGGAAATATCACCGAGTGACTCAGTATTGCAGCTTGGTCTTTCTGGAGGGACTAATGAAGTTTCAAGTGTGGTCGAATGTTCAGTTTCAGCAGAAACTGATGTCAGTACAACTTATCTGATAAGCCAATGGGCTGCTGAAGCTAATCAACTGTCTATCCCACTAGTTGACGAGGGTTCTACCTCAGCAAAAAAATCAGGTGGGTATATGCCATCACTTCTTTTTGCTCCAAGAATGGGCATTTCAAACATTCTGATTCAGCAGGAGATTCTTGAAACTGATAGCAGAAATCAGCTGAGCCAAGGACTATCACCTACTGTAGAATACTCTTTAGGAACTGTCATTGACCAGACAACCAAAAGCGTGTGTTCAGATCATCAGGCAAATAATCCTAAAAGGTGCAAATACTTTGGCTGCGAGAAGGGTGCACGAGGAGCATCTGGTCTTTGTATTGGTCATGGAGGTGGACATAGATGCCAGAAACCTGGGTGCAATAAGGGTGCTGAGAGTCGTACTGCTTACTGTAAAGCTCATGGTGGAGGAAGGAGGTGCCAACATTTAGGGTGCACCAAAAGTGCCGAGGGGAAGACAGAATTTTGCATCGCTCATGGTGGTGGTAGGCGTTGTGGATATTCAGGTGGATGTGCGAAGGCTGCACGTGGAAAGTCAGGCCTTTGTATTAGACATGGTGGTGGAAAACGATGTAAGATGGATGGCTGCACTCGTAGTGCTGAAGGACACGCTGGTCTATGCATTTCTCATGGTGGTGGACGCCGTTGCCAGTATGAACGCTGCACAAAAGGTGCACAAGGAAGTACCATGTATTGTAAGGCCCATGGGGGAGGGAAACGATGTATATTTGCCGGATGCACAAAAGGTGCTGAAGGAAGTACTCCTCTTTGCAAGGGACATGGTGGGGGAAAGCGTTGCCTCTTTGATGGGGGTGGGATTTGCCCAAAAAGTGTACATGGGGGCACAAACTTTTGTGTTGCTCATGGTGGTGGAAAGAGGTGTGTTGTGTCAGGATGCACAAAAAGTGCTCGTGGGCGCACTGATTGCTGTGTCAGACATGGTGGAGGCAAGCGATGCAAATTCGAGAACTGTGGAAAGAGTGCCCAAGGGAGCACAGACTTCTGCAAAGCCCATGGAGGTGGAAAACGATGCACATGGGGAGAAGGCAAGTGCGAAAAATTTGCAAGGGGTAAAAGTGGTTTGTGTGCTGCTCATAGTAGCATGATCCAAGATCGAGAAACGAACAAGGGAAGCCTGATTGGACCGGGACTTTTCCATGGTCTAGTATCTGCTTCTGCTGCGTCTACAGTTGGAGATAGCTTTGACCACTATAAGTCATCTTCTGCAATCAGTTTCATATGTGATTCAATTGATTCTGCAGAGAAGCCTATGAAGCGACATCAGCTTATACCACCACAGGTATTGGTTCCATCCTCAATGAAATCATCTGCTTCATATTCTAGTTTCTTGAGTACAGAGAAGGGAGAGGAAGATGGGAATGGATATTGTATTGGCACAAAATTCCTTGAATATTCAATTCCTGAGGGAAGAGTTCATGGAGGTGGGCTCATGTCATTGCTTGGCGGGCATTTGAAGATGAAGAATATGAGTGATGGCATTTAA

Coding sequence (CDS)

ATGGATTTGAACAAGACTGTAGCACATTATTCTCAAAATGCTGATCTCACAAAAGATGACAACTTTGGTGACACTACTTTGAGCTTGAATTGTTTTGGCTTTGGAGGAAGAAAATCTTCTGGATGTGAGGTTGCTTTAAATGACCTCAACTTCAACTTCAGTTACTCTCCAGATGATGGATGTAGACTGGTACTTGGACTTGGTCCAACTCCAAGTGCTAACTGTGATGATTATTACAATGTTGGATATAATAAGACTAAGGCACAAGTTGCATCTGTACCAGAGGAAATATCACCGAGTGACTCAGTATTGCAGCTTGGTCTTTCTGGAGGGACTAATGAAGTTTCAAGTGTGGTCGAATGTTCAGTTTCAGCAGAAACTGATGTCAGTACAACTTATCTGATAAGCCAATGGGCTGCTGAAGCTAATCAACTGTCTATCCCACTAGTTGACGAGGGTTCTACCTCAGCAAAAAAATCAGGTGGGTATATGCCATCACTTCTTTTTGCTCCAAGAATGGGCATTTCAAACATTCTGATTCAGCAGGAGATTCTTGAAACTGATAGCAGAAATCAGCTGAGCCAAGGACTATCACCTACTGTAGAATACTCTTTAGGAACTGTCATTGACCAGACAACCAAAAGCGTGTGTTCAGATCATCAGGCAAATAATCCTAAAAGGTGCAAATACTTTGGCTGCGAGAAGGGTGCACGAGGAGCATCTGGTCTTTGTATTGGTCATGGAGGTGGACATAGATGCCAGAAACCTGGGTGCAATAAGGGTGCTGAGAGTCGTACTGCTTACTGTAAAGCTCATGGTGGAGGAAGGAGGTGCCAACATTTAGGGTGCACCAAAAGTGCCGAGGGGAAGACAGAATTTTGCATCGCTCATGGTGGTGGTAGGCGTTGTGGATATTCAGGTGGATGTGCGAAGGCTGCACGTGGAAAGTCAGGCCTTTGTATTAGACATGGTGGTGGAAAACGATGTAAGATGGATGGCTGCACTCGTAGTGCTGAAGGACACGCTGGTCTATGCATTTCTCATGGTGGTGGACGCCGTTGCCAGTATGAACGCTGCACAAAAGGTGCACAAGGAAGTACCATGTATTGTAAGGCCCATGGGGGAGGGAAACGATGTATATTTGCCGGATGCACAAAAGGTGCTGAAGGAAGTACTCCTCTTTGCAAGGGACATGGTGGGGGAAAGCGTTGCCTCTTTGATGGGGGTGGGATTTGCCCAAAAAGTGTACATGGGGGCACAAACTTTTGTGTTGCTCATGGTGGTGGAAAGAGGTGTGTTGTGTCAGGATGCACAAAAAGTGCTCGTGGGCGCACTGATTGCTGTGTCAGACATGGTGGAGGCAAGCGATGCAAATTCGAGAACTGTGGAAAGAGTGCCCAAGGGAGCACAGACTTCTGCAAAGCCCATGGAGGTGGAAAACGATGCACATGGGGAGAAGGCAAGTGCGAAAAATTTGCAAGGGGTAAAAGTGGTTTGTGTGCTGCTCATAGTAGCATGATCCAAGATCGAGAAACGAACAAGGGAAGCCTGATTGGACCGGGACTTTTCCATGGTCTAGTATCTGCTTCTGCTGCGTCTACAGTTGGAGATAGCTTTGACCACTATAAGTCATCTTCTGCAATCAGTTTCATATGTGATTCAATTGATTCTGCAGAGAAGCCTATGAAGCGACATCAGCTTATACCACCACAGGTATTGGTTCCATCCTCAATGAAATCATCTGCTTCATATTCTAGTTTCTTGAGTACAGAGAAGGGAGAGGAAGATGGGAATGGATATTGTATTGGCACAAAATTCCTTGAATATTCAATTCCTGAGGGAAGAGTTCATGGAGGTGGGCTCATGTCATTGCTTGGCGGGCATTTGAAGATGAAGAATATGAGTGATGGCATTTAA
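
As a quick consistency check, the start of the CDS can be translated and compared with the start of the protein query used in the BLAST alignments (MDLNKTVAHYSQNADL...). This is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and reading frame 1; the function name `translate` is illustrative, and only the first 48 nucleotides of the CDS are used here.

```python
# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First 48 nt of the CDS listed above.
cds_prefix = "ATGGATTTGAACAAGACTGTAGCACATTATTCTCAAAATGCTGATCTC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

The translated prefix agrees with the first 16 residues of the query in the alignments.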

BLAST of CSPI02G08830 vs. Swiss-Prot
Match: WRK19_ARATH (Probable WRKY transcription factor 19 OS=Arabidopsis thaliana GN=WRKY19 PE=2 SV=1)

HSP 1 Score: 243.4 bits (620), Expect = 6.6e-63
Identity = 122/242 (50.41%), Positives = 155/242 (64.05%), Query Frame = 1

Query: 184 ILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGL 243
           I ++ S   +  G++ T   S G+ + Q   +      +++ K C+  GC+KGAR ASG 
Sbjct: 55  ISQSSSMCTVPPGMAATPPISSGSGLSQQLNN------SSSSKLCQVEGCQKGARDASGR 114

Query: 244 CIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRC 303
           CI HGGG RCQKP C KGAE +T YCKAHGGGRRC++LGCTK AEG T+FCIAHGGGRRC
Sbjct: 115 CISHGGGRRCQKPDCQKGAEGKTVYCKAHGGGRRCEYLGCTKGAEGSTDFCIAHGGGRRC 174

Query: 304 GYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERCTKGA 363
            +   C ++A G++  C++HGGG RCK  GC +SA G    C +HGGG++C +E CT  A
Sbjct: 175 NHE-DCTRSAWGRTEFCVKHGGGARCKTYGCGKSASGPLPFCRAHGGGKKCSHEDCTGFA 234

Query: 364 QGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTNFC 423
           +G +  C  HGGGKRC    CTK AEG + LC  HGGG+RC   G   C K   G   FC
Sbjct: 235 RGRSGLCLMHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQSIG---CTKGAKGSKMFC 286

Query: 424 VA 426
            A
Sbjct: 295 KA 286
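
The percentages on an HSP summary line follow directly from the match counts and the alignment length. Here is a minimal sketch of that arithmetic, assuming the summary line keeps the `Label = matches/length` layout shown in this record; `hsp_fractions` is a hypothetical helper, not part of any BLAST tool.

```python
import re

def hsp_fractions(line):
    """Return {label: (matches, length, percent)} for each 'Label = a/b' pair."""
    out = {}
    for label, a, b in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        out[label] = (int(a), int(b), round(100 * int(a) / int(b), 2))
    return out

# Summary line of the Swiss-Prot hit above.
stats = hsp_fractions(
    "Identity = 122/242 (50.41%), Positives = 155/242 (64.05%), Query Frame = 1"
)
for label, (matches, length, percent) in stats.items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

For the WRK19_ARATH hit this reproduces the reported 50.41% identity and 64.05% positives over 242 aligned columns.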

BLAST of CSPI02G08830 vs. TrEMBL
Match: A0A0A0LI30_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_2G122020 PE=4 SV=1)

HSP 1 Score: 1304.7 bits (3375), Expect = 0.0e+00
Identity = 634/638 (99.37%), Positives = 635/638 (99.53%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG
Sbjct: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPSDSVLQLGLSGGTNEVSSVVE 120
           CRLVLGLGPTPSANCDDYYNVGYNKTKAQVAS+PEEISPSDSVLQLGLSGGTNEVSSVVE
Sbjct: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASLPEEISPSDSVLQLGLSGGTNEVSSVVE 120

Query: 121 CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNILI 180
           CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMG SNILI
Sbjct: 121 CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNILI 180

Query: 181 QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA 240
           QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA
Sbjct: 181 QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA 240

Query: 241 SGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 300
           SGLCIGHGGGHRCQKPGC KGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Sbjct: 241 SGLCIGHGGGHRCQKPGCTKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 300

Query: 301 RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERCT 360
           RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYE CT
Sbjct: 301 RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYECCT 360

Query: 361 KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT 420
           KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT
Sbjct: 361 KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT 420

Query: 421 NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 480
           NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK
Sbjct: 421 NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 480

Query: 481 RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD 540
           RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD
Sbjct: 481 RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD 540

Query: 541 HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY 600
           HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY
Sbjct: 541 HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY 600

Query: 601 CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
           CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI
Sbjct: 601 CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 638

BLAST of CSPI02G08830 vs. TrEMBL
Match: B9RIZ1_RICCO (Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1752370 PE=4 SV=1)

HSP 1 Score: 906.0 bits (2340), Expect = 2.6e-260
Identity = 455/653 (69.68%), Positives = 517/653 (79.17%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLN     +   ++L K DNFGDTTL LNC  +GG   +G E   ++L  +F+  PDDG
Sbjct: 1   MDLNDKCKQFLHKSELPKSDNFGDTTLRLNCLSYGGTNMNGFECTQSNLKVDFTNGPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEE---ISPSDSVLQLGLSGGTNEVSS 120
           C+LVLGLGPTP+A CDDYY++ +NKTK   A+        S  DS+LQLGLSGGT E  S
Sbjct: 61  CKLVLGLGPTPTAYCDDYYSMRFNKTKGSTAAAVLHRGLSSDGDSILQLGLSGGTKEALS 120

Query: 121 VVECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISN 180
            +ECS   ETD+ST  +++Q++   ++  IP+VDEGSTSAKKSGGYMPSLL APRM  + 
Sbjct: 121 ELECSF-LETDISTP-ILNQFSGHEDRFLIPVVDEGSTSAKKSGGYMPSLLLAPRMDGAK 180

Query: 181 ILIQ-QEILE----TDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFG 240
           + ++ +E L+        +QL  G S + + S+GT+ +Q T +   D + +NPK+CK+FG
Sbjct: 181 VSLEGEEFLQFGAAKSQSHQLIHGTSASTDISMGTISEQATTATSVDRKISNPKKCKFFG 240

Query: 241 CEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTE 300
           C KGARGA GLCIGHGGG RCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT+
Sbjct: 241 CSKGARGALGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTD 300

Query: 301 FCIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGR 360
           FCIAHGGGRRCG+ GGC KAARGKSGLCI+HGGGKRCK+DGC+RSAEG AGLCISHGGGR
Sbjct: 301 FCIAHGGGRRCGFGGGCTKAARGKSGLCIKHGGGKRCKVDGCSRSAEGQAGLCISHGGGR 360

Query: 361 RCQYERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGIC 420
           RCQYE CTKGAQGSTM+CKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCL+DGGGIC
Sbjct: 361 RCQYEGCTKGAQGSTMHCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGIC 420

Query: 421 PKSVHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDF 480
           PKSVHGGTNFCVAHGGGKRCVV GCTKSARGRTDCCV+HGGGKRCKFENCGKSAQGSTDF
Sbjct: 421 PKSVHGGTNFCVAHGGGKRCVVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDF 480

Query: 481 CKAHGGGKRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAA 540
           CKAHGGGKRCTWGEGKCEKFARG+SGLCAAHSSM+ ++ +NKGSLIGPGLF GLVSA  A
Sbjct: 481 CKAHGGGKRCTWGEGKCEKFARGRSGLCAAHSSMVLEQGSNKGSLIGPGLFQGLVSA--A 540

Query: 541 STVGDSFDHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEK 600
           S  G S D+  SSS IS + D  DS  KP KR  LIP QVLVP SMKSS+SYSSFL+ EK
Sbjct: 541 SNAGSSIDNNYSSSGISAVSDCTDSLGKPTKRQHLIPAQVLVPPSMKSSSSYSSFLNAEK 600

Query: 601 GEEDGNGYCIG-------TKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
            EE  N Y  G       T F +Y  PEGRVHGGGLMSL GG+L  KN  DGI
Sbjct: 601 QEEGRNEYSAGAGSTSRVTSF-DYMAPEGRVHGGGLMSLFGGNL--KNAIDGI 646

BLAST of CSPI02G08830 vs. TrEMBL
Match: A0A061E1L7_THECC (Emb:CAB89363.1 OS=Theobroma cacao GN=TCM_007538 PE=4 SV=1)

HSP 1 Score: 902.5 bits (2331), Expect = 2.9e-259
Identity = 456/651 (70.05%), Positives = 525/651 (80.65%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLNK V  +S  ++L+K++NFGDTTL LN  G+GG   +      ++L+ + S +PDDG
Sbjct: 1   MDLNKNV-QFSHVSELSKNENFGDTTLCLNFLGYGGSNKARFGSTQSNLHADLSNAPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPS-DSVLQLGLSGGTNEVSSVV 120
           CRLVLGLGPTPS  C++YYNVG NK K+  A   + +SP  DS+L+LGLSGGT E  S++
Sbjct: 61  CRLVLGLGPTPSVYCNNYYNVGLNKNKSTGAFFTQGLSPEDDSILKLGLSGGTKESMSLL 120

Query: 121 ECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNIL 180
           ECS+S ETD  T+  +S   +  ++LSIP+VDEGSTSAKKSGGYMPSLL APRM     L
Sbjct: 121 ECSLSTETD--TSMPLSNQVSADSRLSIPVVDEGSTSAKKSGGYMPSLLLAPRMDSGKGL 180

Query: 181 IQ-QEILETDSR---NQLSQGLSPT--VEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGC 240
           +Q +E+ +  ++   +QL +   P+   ++S  T+ +QTT     D++ +N K+CK+ GC
Sbjct: 181 VQTRELFQFGAKSHCHQLHRSCEPSAQTDFSGDTLSEQTTTMTSLDNRTSNSKKCKFAGC 240

Query: 241 EKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF 300
            KGARGASGLCIGHGGG RCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF
Sbjct: 241 TKGARGASGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF 300

Query: 301 CIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRR 360
           CIAHGGGRRCG+ GGC KAARGKSGLCIRHGGGKRCK++GCTRSAEG AGLCISHGGGRR
Sbjct: 301 CIAHGGGRRCGFPGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRR 360

Query: 361 CQYERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICP 420
           CQ++ CTKG+QGSTMYCKAHGGGKRCIFAGCT+GAEGSTPLCKGHGGGKRCL++GGGICP
Sbjct: 361 CQFQECTKGSQGSTMYCKAHGGGKRCIFAGCTRGAEGSTPLCKGHGGGKRCLYNGGGICP 420

Query: 421 KSVHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFC 480
           KSVHGGTNFCVAHGGGKRCVV GCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFC
Sbjct: 421 KSVHGGTNFCVAHGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFC 480

Query: 481 KAHGGGKRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAAS 540
           KAHGGGKRC+WGEGKCEKFARG+SGLCAAHSSM+Q+RE +KG LI PG+FHGLV  SA S
Sbjct: 481 KAHGGGKRCSWGEGKCEKFARGRSGLCAAHSSMVQEREASKGGLIAPGVFHGLV--SAGS 540

Query: 541 TVGDSFDHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKG 600
           T G S D+  SSS  S I D IDS EKP +R  LIPPQVLVP SMKSS+SYSS LS EK 
Sbjct: 541 TTGSSVDYNHSSSGTSVISDCIDSLEKPARRQHLIPPQVLVPLSMKSSSSYSSLLSAEKQ 600

Query: 601 EEDGNGY------CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
            E  NGY       +G +   + IPEGRVHGGGLMSLLGG+L  KN  DGI
Sbjct: 601 VEGRNGYGMGIGGGVGNESFNFMIPEGRVHGGGLMSLLGGNL--KNPIDGI 644

BLAST of CSPI02G08830 vs. TrEMBL
Match: A0A0B0PEI2_GOSAR (Putative WRKY transcription factor 19-like protein OS=Gossypium arboreum GN=F383_04964 PE=4 SV=1)

HSP 1 Score: 892.9 bits (2306), Expect = 2.3e-256
Identity = 451/651 (69.28%), Positives = 526/651 (80.80%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLN  V  +S+ ++L+K++NFGDTTL LN  G GG   +G     +DL+ + S + DDG
Sbjct: 1   MDLNTNV-RFSRVSELSKNENFGDTTLRLNFLGHGGSNKAGFGSTQSDLHIDLSSASDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPSD-SVLQLGLSGGTNEVSSVV 120
           CRLVLGLGPTPS  C+DY+NVG NK K+  A     +SP D S+L+LGLSGGT    +++
Sbjct: 61  CRLVLGLGPTPSVYCNDYHNVGLNKNKSTAALFTPGLSPEDNSILKLGLSGGTKGSMNLL 120

Query: 121 ECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNIL 180
           E S+S +TD+S  +  +Q++AE +QLSIP VDEGSTSAKKSGGYMPSLL APRM     L
Sbjct: 121 ERSLSTDTDLSV-HFSNQFSAEGSQLSIPFVDEGSTSAKKSGGYMPSLLLAPRMDSGKAL 180

Query: 181 IQ-QEILETDSRN---QLSQGLSPTVE--YSLGTVIDQTTKSVCSDHQANNPKRCKYFGC 240
           +Q  E+ +  +++   Q  Q   P+ +  +S+ T+ +QTT    SD++ +N K+CK+ GC
Sbjct: 181 VQTHELFQFGAKSRSHQFYQSCEPSTQTDFSVDTISEQTTTITSSDNRTSNSKKCKFAGC 240

Query: 241 EKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF 300
            KGARGA+GLCIGHGGG RCQK GCNKGAESRT +CKAHGGGRRCQHLGCTKSAEGKT+F
Sbjct: 241 FKGARGATGLCIGHGGGQRCQKAGCNKGAESRTVFCKAHGGGRRCQHLGCTKSAEGKTDF 300

Query: 301 CIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRR 360
           CIAHGGGRRCG+SGGC KAARGKSGLCIRHGGGKRCK++GCTRSAEG AGLCISHGGGRR
Sbjct: 301 CIAHGGGRRCGFSGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRR 360

Query: 361 CQYERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICP 420
           CQ+  CTKGAQGSTM+CKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCL++GGGICP
Sbjct: 361 CQFPACTKGAQGSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYNGGGICP 420

Query: 421 KSVHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFC 480
           KSVHGGTNFCVAHGGGKRCVV GCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTD C
Sbjct: 421 KSVHGGTNFCVAHGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDLC 480

Query: 481 KAHGGGKRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAAS 540
           KAHGGGKRC+WGEGKCEKFARG+SGLCAAHSSM+Q+R+ +KG LI PG+FHGLVSAS  S
Sbjct: 481 KAHGGGKRCSWGEGKCEKFARGRSGLCAAHSSMLQERQASKGGLIAPGVFHGLVSAS--S 540

Query: 541 TVGDSFDHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKG 600
           T G S ++  SSS  S I D IDS +KP+KR QLIPPQVLVP SMKSSASYSSFLS+E+ 
Sbjct: 541 TTGSSSNNNHSSSGNSVISDCIDSPDKPVKRQQLIPPQVLVPPSMKSSASYSSFLSSEQQ 600

Query: 601 EE------DGNGYCIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
           +E      + N   +G    ++ IPEGRVHGGGLMSLLGG+L  KN  DGI
Sbjct: 601 DEGINRHGNHNAGGVGNTSFDFLIPEGRVHGGGLMSLLGGNL--KNPIDGI 645

BLAST of CSPI02G08830 vs. TrEMBL
Match: A0A0D2W3D3_GOSRA (Uncharacterized protein OS=Gossypium raimondii GN=B456_013G063300 PE=4 SV=1)

HSP 1 Score: 892.5 bits (2305), Expect = 3.0e-256
Identity = 448/643 (69.67%), Positives = 521/643 (81.03%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLN  V  +S  ++L+K++NFGDTTL LN  G GG   +G     +DL+ + S +PDDG
Sbjct: 1   MDLNTNV-RFSHVSELSKNENFGDTTLRLNFLGHGGSNKAGFGSTQSDLHVDLSSAPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPSD-SVLQLGLSGGTNEVSSVV 120
           CRLVLGLGPTPS  C+DY+NVG NK K+  A     +SP D S+L+LGLSGGT    +++
Sbjct: 61  CRLVLGLGPTPSVYCNDYHNVGLNKNKSTAALFTPGLSPEDNSILKLGLSGGTKGSMNLL 120

Query: 121 ECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNIL 180
           E S+S ETDVS  +  +Q++AE +QLSIP VDEGSTSAKKSGGYMPSLL APRM      
Sbjct: 121 ERSLSTETDVSV-HFSNQFSAEGSQLSIPFVDEGSTSAKKSGGYMPSLLLAPRMDSGKAS 180

Query: 181 IQ-QEILETDSRNQ-----LSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGC 240
           +Q  E+ +  +++      LS   S   ++S+ T+ +QTT    SD++ +N K+CK+ GC
Sbjct: 181 VQTHELFQFGAKSHSHQFHLSCEHSTQTDFSVDTISEQTTTITSSDYRTSNSKKCKFAGC 240

Query: 241 EKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEF 300
            KGARGA+GLCIGHGGG RCQK GCNKGAESRT +CKAHGGGRRCQHLGCTKSAEGKT+F
Sbjct: 241 FKGARGATGLCIGHGGGQRCQKAGCNKGAESRTVFCKAHGGGRRCQHLGCTKSAEGKTDF 300

Query: 301 CIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRR 360
           CIAHGGGRRCG+SGGC KAARGKSGLCIRHGGGKRCK++GCTRSAEG AGLCISHGGGRR
Sbjct: 301 CIAHGGGRRCGFSGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRR 360

Query: 361 CQYERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICP 420
           CQ+  CTKGAQGSTM+CKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCL++GGGICP
Sbjct: 361 CQFPACTKGAQGSTMFCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYNGGGICP 420

Query: 421 KSVHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFC 480
           KSVHGGTNFCVAHGGGKRCVV GCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTD C
Sbjct: 421 KSVHGGTNFCVAHGGGKRCVVPGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDLC 480

Query: 481 KAHGGGKRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAAS 540
           KAHGGGKRC+WGEGKCEKFARG+SGLCAAHSSM+Q+R+ +KG LI PG+FHGLVSA+  S
Sbjct: 481 KAHGGGKRCSWGEGKCEKFARGRSGLCAAHSSMLQERQASKGGLIAPGVFHGLVSAT--S 540

Query: 541 TVGDSFDHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKG 600
           T G S ++Y SSS  S I D IDS +KP++R QLIPPQVLVP SMKSSASYSSFLS+E+ 
Sbjct: 541 TTGSSSNNYHSSSGNSVISDCIDSPDKPVERQQLIPPQVLVPLSMKSSASYSSFLSSEQQ 600

Query: 601 EEDGNGY------CIGTKFLEYSIPEGRVHGGGLMSLLGGHLK 631
           +E  N +       +G    ++ IPEGRVHGGGLMSLLGG+LK
Sbjct: 601 DEGINRHGNHIAGGVGNTSFDFLIPEGRVHGGGLMSLLGGNLK 639

BLAST of CSPI02G08830 vs. TAIR10
Match: AT5G64550.1 (AT5G64550.1 loricrin-related)

HSP 1 Score: 733.0 bits (1891), Expect = 1.5e-211
Identity = 395/653 (60.49%), Positives = 463/653 (70.90%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLN++V H+S+   + K DNFGDT LSL C G    +  G     + L  + S  PD G
Sbjct: 1   MDLNESVVHFSRGNGIAKLDNFGDTALSLKCLGSSAGRLIGSSHHNHKLCSDVSNCPDGG 60

Query: 61  CRLVLGLGPTPSANCDDYYNV----GYNKTKAQVASVPEEISPSDSVLQLGLSGGTNEVS 120
           CRLVLGLGPTP +    YYNV      NK  A   SV E  S  +S+LQLG    T +  
Sbjct: 61  CRLVLGLGPTPPSY---YYNVRVNDNNNKGSASSGSVQELSSGGNSILQLGPPAVTMDTF 120

Query: 121 SVVECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRM-GI 180
           S +E S+    D +    +SQ A          VDEGSTSA++SGGYMPSLLFAPR   +
Sbjct: 121 SGLEGSLLTYADTN----VSQAA----------VDEGSTSARRSGGYMPSLLFAPRTENV 180

Query: 181 SNILIQQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEK 240
                 QE       +  +  LS   E+S+    D++  +  S  + +NPK+CK+ GC K
Sbjct: 181 RKPSRMQECSTNCGTDAYNSQLSHESEFSVSAFSDRSASATSSQQRMSNPKKCKFMGCVK 240

Query: 241 GARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCI 300
           GARGASGLCIGHGGG RCQK GCNKGAES+T +CKAHGGG+RCQHLGCTKSAEGKT+ CI
Sbjct: 241 GARGASGLCIGHGGGQRCQKLGCNKGAESKTTFCKAHGGGKRCQHLGCTKSAEGKTDLCI 300

Query: 301 AHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQ 360
           +HGGGRRCG+  GCAKAARGKSGLCI+HGGGKRC+++ CTRSAEG AGLCISHGGGRRCQ
Sbjct: 301 SHGGGRRCGFPEGCAKAARGKSGLCIKHGGGKRCRIESCTRSAEGQAGLCISHGGGRRCQ 360

Query: 361 YERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKS 420
              CTKGAQGST YCKAHGGGKRCIFAGCTKGAEGSTPLCK HGGGKRC+FDGGGICPKS
Sbjct: 361 SSGCTKGAQGSTNYCKAHGGGKRCIFAGCTKGAEGSTPLCKAHGGGKRCMFDGGGICPKS 420

Query: 421 VHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKA 480
           VHGGT+FCVAHGGGKRCVV+GCTKSARGRTDCCV+HGGGKRCK + C KSAQGSTDFCKA
Sbjct: 421 VHGGTSFCVAHGGGKRCVVAGCTKSARGRTDCCVKHGGGKRCKSDGCEKSAQGSTDFCKA 480

Query: 481 HGGGKRCTW-GEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSAS---- 540
           HGGGKRC+W G+ KCEKFARGKSGLCAAH+SM QD+  +K  LIGPGLF GLVS S    
Sbjct: 481 HGGGKRCSWGGDWKCEKFARGKSGLCAAHNSMSQDKAGSKVGLIGPGLFRGLVSTSLQTT 540

Query: 541 -AASTVGDSFDHYKSSSAISFICDSIDSAEKPM--------KRHQL-IPPQVLVPSSMKS 600
             A+T   + DH  S S +S + D +DS ++P+        KR +L IP QVLVP SMKS
Sbjct: 541 TTATTTTTTTDH--SQSGVSAVSDCMDSIDRPLPPLHHQPEKRQKLMIPMQVLVPPSMKS 600

Query: 601 SASYSSFLSTEKGEEDGNGYCIGT---KFLEYSIPEGRVHGGGLMSLLGGHLK 631
                SF +TE+ + + N    G+      ++ IPE RVHGGGLMSLL G++K
Sbjct: 601 ----LSFSNTERPDIETNNNSSGSNGRNIFDFMIPEERVHGGGLMSLLNGNMK 630

BLAST of CSPI02G08830 vs. TAIR10
Match: AT5G09670.2 (AT5G09670.2 loricrin-related)

HSP 1 Score: 649.4 bits (1674), Expect = 2.3e-186
Identity = 353/615 (57.40%), Positives = 428/615 (69.59%), Query Frame = 1

Query: 23  GDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDGCRLVLGLGPTPSANCDDYYNVG 82
           GDT LSL C G  G ++  C         + S   DDGCRLVLGLGPT ++ C   ++VG
Sbjct: 2   GDTALSLKCLG--GSENQFCS--------DVSSFTDDGCRLVLGLGPTSTSYC---FSVG 61

Query: 83  YNKTKAQVASVPEEISPSDSVLQLGLSGGTNEVSSVVECSVSAETDVSTTYLISQWAAEA 142
            N +    AS     + +DSVLQLG              +VS +T          ++   
Sbjct: 62  VNNSNKDSAS---GFTQADSVLQLGRP------------AVSIDT----------FSGLD 121

Query: 143 NQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNILIQQEILETDSRNQLSQGLSPTVE 202
           N   IP+VDEGS+SAK+SGGYMPSLL  P +   + + Q +   T   +Q+SQ  SP  E
Sbjct: 122 NGGVIPVVDEGSSSAKRSGGYMPSLLLDPNVKNPSQIQQLKDFGTGIHSQVSQEPSPYTE 181

Query: 203 YSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCNKGA 262
           + +               + +NP++CK+ GC KGARGASGLCI HGGG RCQKPGCNKGA
Sbjct: 182 FYV-------------QQRTSNPRKCKFMGCVKGARGASGLCISHGGGQRCQKPGCNKGA 241

Query: 263 ESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGLCIR 322
           ES+T +CK HGGG+RC+HLGCTKSAEGKT+FCI+HGGGRRC +  GC KAARG+SGLCI+
Sbjct: 242 ESKTTFCKTHGGGKRCEHLGCTKSAEGKTDFCISHGGGRRCEFLEGCDKAARGRSGLCIK 301

Query: 323 HGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQY-ERCTKGAQGSTMYCKAHGGGKRCIF 382
           HGGGKRC ++ CTRSAEG AGLCISHGGG+RCQY   C KGAQGST YCKAHGGGKRCIF
Sbjct: 302 HGGGKRCNIEDCTRSAEGQAGLCISHGGGKRCQYFSGCEKGAQGSTNYCKAHGGGKRCIF 361

Query: 383 AGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTNFCVAHGGGKRCVVSGCTKSA 442
           +GC+KGAEGSTPLCK HGGGKRCL DGGGIC KSVHGGTNFCVAHGGGKRCVV GCTKSA
Sbjct: 362 SGCSKGAEGSTPLCKAHGGGKRCLADGGGICSKSVHGGTNFCVAHGGGKRCVVVGCTKSA 421

Query: 443 RGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCTWGEGKCEKFARGKSGLCA 502
           RGRTD CV+HGGGKRCK  +C KSAQGSTDFCKAHGGGKRC+WG+GKCEKFARGKSGLCA
Sbjct: 422 RGRTDSCVKHGGGKRCKIIDCEKSAQGSTDFCKAHGGGKRCSWGDGKCEKFARGKSGLCA 481

Query: 503 AHSSMI--QDRETNKGSLIGPGLFHGLVSASAASTVGDSFDHYKS-SSAISFICDSIDSA 562
           AH++++  ++++ +K  LIGPGLF GLV        G + DH +S +SA+S   DS++  
Sbjct: 482 AHNTIMSRENKDGSKSGLIGPGLFSGLV-------FGSTSDHSQSGASAVSDCTDSVERI 541

Query: 563 E---KPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGYCIGTKFLEYSIPEGR 622
           +   +   +  +IP QVLVPSSMKS +      ++ +GE         T   ++ +PE R
Sbjct: 542 QFENRQKNKKMMIPMQVLVPSSMKSPS------NSHEGE---------TNIYDFMVPEER 543

Query: 623 VHGGGL-MSLLGGHL 630
           VHGGGL MSLLGG +
Sbjct: 602 VHGGGLVMSLLGGSI 543

BLAST of CSPI02G08830 vs. TAIR10
Match: AT1G64140.1 (AT1G64140.1 BEST Arabidopsis thaliana protein match is: loricrin-related (TAIR:AT5G64550.1))

HSP 1 Score: 477.2 bits (1227), Expect = 1.5e-134
Identity = 252/450 (56.00%), Positives = 298/450 (66.22%), Query Frame = 1

Query: 84  NKTKAQVASVPEEISPSDSVLQLGLSGGTNEVSSVVECSVSAETDVSTTYLISQWAAEAN 143
           NK  A +     ++      L+L LSGG +  S +      A    S   ++      AN
Sbjct: 122 NKKPANLKMKGLQVPSPKFDLELSLSGGGSCQSEITAVQQHANRFQSLADML-----RAN 181

Query: 144 QLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNIL---IQQEILETDSRNQLSQGLSPT 203
                  +E +T   + G  +P+L  +     S+ L    +  I+      +LS   + T
Sbjct: 182 N------EESATCGWRQGFGLPTLQASSSKETSSFLGHIPKNVIIPAAHVLELSSNTAAT 241

Query: 204 VEYSLGTVIDQTTKSVCSD-HQANNPKRCKYFGCEKGARGASGLCIGHGGGHRCQKPGCN 263
              S GT     ++ +      +++ K C+  GC KGARGASG CI HGGG RCQK GC+
Sbjct: 242 TPISSGTCTSGLSQQLKPQLKNSSSSKLCQVEGCHKGARGASGRCISHGGGRRCQKHGCH 301

Query: 264 KGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRCGYSGGCAKAARGKSGL 323
           KGAE RT YCKAHGGGRRC+ LGCTKSAEG+T+FCIAHGGGRRC +   C +AARG+SGL
Sbjct: 302 KGAEGRTVYCKAHGGGRRCEFLGCTKSAEGRTDFCIAHGGGRRCSHED-CTRAARGRSGL 361

Query: 324 CIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERCTKGAQGSTMYCKAHGGGKRC 383
           CIRHGGGKRC+ + CT+SAEG +GLCISHGGGRRCQ   CTKGAQGSTM+CKAHGGGKRC
Sbjct: 362 CIRHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQSNGCTKGAQGSTMFCKAHGGGKRC 421

Query: 384 IFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTNFCVAHGGGKRCVVSGCTK 443
             +GCTKGAEGSTP CKGHGGGKRC F G   C KSVHGGTNFCVAHGGGKRC V  CTK
Sbjct: 422 THSGCTKGAEGSTPFCKGHGGGKRCAFQGDDPCSKSVHGGTNFCVAHGGGKRCAVPECTK 481

Query: 444 SARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGKRCTWGE-----------GK 503
           SARGRTD CVRHGGGKRC+ E CGKSAQGSTDFCKAHGGGKRC WG+           G 
Sbjct: 482 SARGRTDFCVRHGGGKRCQSEGCGKSAQGSTDFCKAHGGGKRCAWGQPETEYAGQSSSGP 541

Query: 504 CEKFARGKSGLCAAHSSMIQDRETNKGSLI 519
           C  FARGK+GLCA H+S++QD   + G  I
Sbjct: 542 CTSFARGKTGLCALHNSLVQDNRVHGGMTI 559

BLAST of CSPI02G08830 vs. TAIR10
Match: AT4G12020.2 (AT4G12020.2 protein kinase family protein)

HSP 1 Score: 243.4 bits (620), Expect = 3.7e-64
Identity = 122/242 (50.41%), Positives = 155/242 (64.05%), Query Frame = 1

Query: 184 ILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGASGL 243
           I ++ S   +  G++ T   S G+ + Q   +      +++ K C+  GC+KGAR ASG 
Sbjct: 55  ISQSSSMCTVPPGMAATPPISSGSGLSQQLNN------SSSSKLCQVEGCQKGARDASGR 114

Query: 244 CIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGGRRC 303
           CI HGGG RCQKP C KGAE +T YCKAHGGGRRC++LGCTK AEG T+FCIAHGGGRRC
Sbjct: 115 CISHGGGRRCQKPDCQKGAEGKTVYCKAHGGGRRCEYLGCTKGAEGSTDFCIAHGGGRRC 174

Query: 304 GYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERCTKGA 363
            +   C ++A G++  C++HGGG RCK  GC +SA G    C +HGGG++C +E CT  A
Sbjct: 175 NHE-DCTRSAWGRTEFCVKHGGGARCKTYGCGKSASGPLPFCRAHGGGKKCSHEDCTGFA 234

Query: 364 QGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGTNFC 423
           +G +  C  HGGGKRC    CTK AEG + LC  HGGG+RC   G   C K   G   FC
Sbjct: 235 RGRSGLCLMHGGGKRCQRENCTKSAEGLSGLCISHGGGRRCQSIG---CTKGAKGSKMFC 286

Query: 424 VA 426
            A
Sbjct: 295 KA 286
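The percentages in an HSP header are the match counts divided by the alignment length (for example, 122 identities over 242 columns gives 50.41%). A minimal sketch, assuming the header line layout used in this record, parses the line and recomputes both percentages; the names HSP_STATS, parse_hsp_stats, and percent are hypothetical helpers, not part of any BLAST toolkit.

```python
import re

# Pattern for the HSP summary line as printed in this record. The optional
# "i" tolerates the "Postives" spelling that appears in some reports.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Extract counts and reported percentages from one HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line: %r" % line)
    ident, length, ident_pct, pos, _pos_length, pos_pct = m.groups()
    return {
        "identities": int(ident),
        "positives": int(pos),
        "length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, length):
    """Percentage rounded to two decimals, as in the report."""
    return round(100.0 * count / length, 2)


line = "Identity = 122/242 (50.41%), Positives = 155/242 (64.05%), Query Frame = 1"
stats = parse_hsp_stats(line)
print(percent(stats["identities"], stats["length"]), stats["identity_pct"])
print(percent(stats["positives"], stats["length"]), stats["positives_pct"])
```

Comparing the recomputed value with the reported one is a quick consistency check when alignment reports are copied between tools.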

BLAST of CSPI02G08830 vs. NCBI nr
Match: gi|449442343|ref|XP_004138941.1| (PREDICTED: delta-like protein C [Cucumis sativus])

HSP 1 Score: 1304.7 bits (3375), Expect = 0.0e+00
Identity = 634/638 (99.37%), Positives = 635/638 (99.53%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG
Sbjct: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPSDSVLQLGLSGGTNEVSSVVE 120
           CRLVLGLGPTPSANCDDYYNVGYNKTKAQVAS+PEEISPSDSVLQLGLSGGTNEVSSVVE
Sbjct: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASLPEEISPSDSVLQLGLSGGTNEVSSVVE 120

Query: 121 CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNILI 180
           CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMG SNILI
Sbjct: 121 CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNILI 180

Query: 181 QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA 240
           QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA
Sbjct: 181 QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA 240

Query: 241 SGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 300
           SGLCIGHGGGHRCQKPGC KGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Sbjct: 241 SGLCIGHGGGHRCQKPGCTKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 300

Query: 301 RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERCT 360
           RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYE CT
Sbjct: 301 RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYECCT 360

Query: 361 KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT 420
           KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT
Sbjct: 361 KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT 420

Query: 421 NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 480
           NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK
Sbjct: 421 NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 480

Query: 481 RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD 540
           RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD
Sbjct: 481 RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD 540

Query: 541 HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY 600
           HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY
Sbjct: 541 HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY 600

Query: 601 CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
           CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI
Sbjct: 601 CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 638

BLAST of CSPI02G08830 vs. NCBI nr
Match: gi|659114614|ref|XP_008457144.1| (PREDICTED: uncharacterized protein LOC103496890 [Cucumis melo])

HSP 1 Score: 1276.2 bits (3301), Expect = 0.0e+00
Identity = 616/638 (96.55%), Positives = 626/638 (98.12%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLNKTVAHYSQN DLTKDDNFGDTTLSLNCFGFG RKSSGCEVALNDLNFNFSY+PDDG
Sbjct: 1   MDLNKTVAHYSQNGDLTKDDNFGDTTLSLNCFGFGVRKSSGCEVALNDLNFNFSYAPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPSDSVLQLGLSGGTNEVSSVVE 120
           CRLVLGLGPTPSANCDDYYNVGYNKTK QVAS+PEEISPSDS+LQLGLSGGTNEVSSVVE
Sbjct: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKTQVASLPEEISPSDSILQLGLSGGTNEVSSVVE 120

Query: 121 CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNILI 180
           CSVSAETDVS TYLI+QW AEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMG SN+LI
Sbjct: 121 CSVSAETDVSATYLINQWTAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGTSNLLI 180

Query: 181 QQEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA 240
           QQE+LE DSRNQLSQ LSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA
Sbjct: 181 QQEVLEIDSRNQLSQELSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARGA 240

Query: 241 SGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 300
           SGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG
Sbjct: 241 SGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGGG 300

Query: 301 RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERCT 360
           RRCGYSGGCAKAARGKSGLCIRHGGGKRCKM+GCTRSAEGHAGLCISHGGGRRCQYERCT
Sbjct: 301 RRCGYSGGCAKAARGKSGLCIRHGGGKRCKMEGCTRSAEGHAGLCISHGGGRRCQYERCT 360

Query: 361 KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT 420
           KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT
Sbjct: 361 KGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGGT 420

Query: 421 NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 480
           NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK
Sbjct: 421 NFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGGK 480

Query: 481 RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSFD 540
           RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDS D
Sbjct: 481 RCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSLD 540

Query: 541 HYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY 600
           HY +SSAISFICDSIDSAEKP KRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY
Sbjct: 541 HYNTSSAISFICDSIDSAEKPTKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNGY 600

Query: 601 CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
           CIG KFLEYSIPEGRVHGGGLMSLLGGHLKMKNM++GI
Sbjct: 601 CIGKKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMNNGI 638

BLAST of CSPI02G08830 vs. NCBI nr
Match: gi|470129587|ref|XP_004300691.1| (PREDICTED: uncharacterized protein LOC101302269 [Fragaria vesca subsp. vesca])

HSP 1 Score: 914.1 bits (2361), Expect = 1.4e-262
Identity = 461/645 (71.47%), Positives = 517/645 (80.16%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLN+    +S + ++TK+ N+GDT L LN  G  G  ++    + ++   + S +PDD 
Sbjct: 1   MDLNRKSILFSHDGEMTKNGNYGDTVLCLNSPGLSGSNTTRYRCSQSNFRIDSSSAPDDS 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEISPSDSVLQLGLSGGTNEVSSVVE 120
           CRLVLGLGPTPS  CDDYYN    K K        E    DS+LQLGLSGGT E S V++
Sbjct: 61  CRLVLGLGPTPSEYCDDYYNFQVTKNKGLSQGFASE---GDSILQLGLSGGTVEASGVLD 120

Query: 121 CSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNILI 180
           C++S ETDV+T++L +      NQLSIPLVDEGSTSAKKSGGYMPSLLFAPR   + + +
Sbjct: 121 CAISGETDVNTSFLRNH----DNQLSIPLVDEGSTSAKKSGGYMPSLLFAPRRNSTEVSL 180

Query: 181 Q-QEILETDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEKGARG 240
           Q +E+LE  +++QL    S T EYS GTV +QTT    SDH+ +NPK+CK+ GC KGARG
Sbjct: 181 QTRELLELGAKSQLRYEPSSTEEYSAGTVSEQTTTGTSSDHRTSNPKKCKFLGCRKGARG 240

Query: 241 ASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCIAHGG 300
           ASGLCIGHGGG RCQKPGCNKGAESRTAYCKAHGGG+RCQHLGCTKSAEGKT+ CIAHGG
Sbjct: 241 ASGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGKRCQHLGCTKSAEGKTDNCIAHGG 300

Query: 301 GRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQYERC 360
           GRRCGYSGGC KAARGKSGLCIRHGGGKRCK++GCTRSAEG AGLCISHGGGRRCQYE C
Sbjct: 301 GRRCGYSGGCTKAARGKSGLCIRHGGGKRCKVEGCTRSAEGQAGLCISHGGGRRCQYECC 360

Query: 361 TKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGG 420
            KGAQGSTMYCKAHGGGKRCIF GCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGG
Sbjct: 361 AKGAQGSTMYCKAHGGGKRCIFQGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKSVHGG 420

Query: 421 TNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKAHGGG 480
           TNFCVAHGGGKRC VSGCTKSARGRTDCCVRHGGGKRC+ +NCGKSAQGSTDFCKAHGGG
Sbjct: 421 TNFCVAHGGGKRCSVSGCTKSARGRTDCCVRHGGGKRCRSDNCGKSAQGSTDFCKAHGGG 480

Query: 481 KRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTVGDSF 540
           KRCTWGEGKCEKFARGKSGLCAAHSS++ +RET+KG LIGPGLFHGLVSA+  ST G SF
Sbjct: 481 KRCTWGEGKCEKFARGKSGLCAAHSSLVLERETSKGGLIGPGLFHGLVSAT--STAGSSF 540

Query: 541 DHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEEDGNG 600
           D+  SSS +S I DSIDS E P  RH LIP QVLVP SMKSS+SYS+ LS+EK EE+ NG
Sbjct: 541 DYTHSSSGVSVISDSIDSLENPGTRH-LIPAQVLVPLSMKSSSSYSNLLSSEKPEEERNG 600

Query: 601 YCIGT------KFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
              G       K  ++ IPEGRVHGG LMSL GG+L  KN  DGI
Sbjct: 601 CGTGVGSSDGRKGFDFKIPEGRVHGGPLMSLFGGNL--KNAIDGI 633

BLAST of CSPI02G08830 vs. NCBI nr
Match: gi|694326313|ref|XP_009354077.1| (PREDICTED: uncharacterized protein LOC103945257 [Pyrus x bretschneideri])

HSP 1 Score: 909.4 bits (2349), Expect = 3.4e-261
Identity = 462/649 (71.19%), Positives = 516/649 (79.51%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLNK    +S +   TK+D+FGDT L LN  G GG  ++      ++   N S +PDD 
Sbjct: 1   MDLNKKSMLFSHDGQFTKNDHFGDTALCLNSPGSGGSNAARSRCTQSNFRVNCSSAPDDS 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEEI-SPSDSVLQLGLSGGTNEVSSVV 120
           C+LVLGLGPTPSA C+DYYN G  K +    ++ +   S  DS+LQLGLSGGT E S+V+
Sbjct: 61  CKLVLGLGPTPSAYCNDYYNFGSTKNRGLTTALSQGFASEGDSILQLGLSGGTFEASTVL 120

Query: 121 ECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISNI- 180
           + S+S ETDV+  Y  +Q ++  NQ+SIP VDEGSTSA+KSGGYMPSLLFAPR    N+ 
Sbjct: 121 DYSISRETDVNINYGQNQVSSGDNQVSIPPVDEGSTSARKSGGYMPSLLFAPRRDSVNLS 180

Query: 181 LIQQEILETDSRNQLSQGL---SPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFGCEK 240
           L  +EIL+   + QLS      S T +YS  ++ +QT     SDH+  N K+CK+ GC K
Sbjct: 181 LYGKEILDLRVKCQLSDPRCEPSATTDYSTESISEQTATGESSDHRTGNLKKCKFLGCRK 240

Query: 241 GARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTEFCI 300
           GARGASGLCIGHGGG RCQKPGCNKGAESRTAYCKAHGGG+RCQHLGCTKSAEGKT++CI
Sbjct: 241 GARGASGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGKRCQHLGCTKSAEGKTDYCI 300

Query: 301 AHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGRRCQ 360
           AHGGG+RCGY GGC KAARGKSGLCIRHGGGKRC++DGCTRSAEG AGLCISHGGGRRCQ
Sbjct: 301 AHGGGKRCGYPGGCTKAARGKSGLCIRHGGGKRCQVDGCTRSAEGQAGLCISHGGGRRCQ 360

Query: 361 YERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKS 420
           YE CTKGAQGSTMYCKAHGGG+RCIF GCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKS
Sbjct: 361 YEACTKGAQGSTMYCKAHGGGRRCIFQGCTKGAEGSTPLCKGHGGGKRCLFDGGGICPKS 420

Query: 421 VHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDFCKA 480
           VHGGTNFCVAHGGGKRC V GCTKSARGRTDCCVRHGGGKRCKF+NCGKSAQGSTDFCKA
Sbjct: 421 VHGGTNFCVAHGGGKRCSVPGCTKSARGRTDCCVRHGGGKRCKFDNCGKSAQGSTDFCKA 480

Query: 481 HGGGKRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAASTV 540
           HGGGKRCTWGEGKCEKFARGKSGLCAAHSSM+QDR  NKG LIGPGLFHGLVSAS  ST 
Sbjct: 481 HGGGKRCTWGEGKCEKFARGKSGLCAAHSSMVQDRGINKGGLIGPGLFHGLVSAS--STA 540

Query: 541 GDSFDHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEKGEE 600
           G SFD+  SSS IS I DS+DS EKP K H LIP QVLVP SMKSS+SYS FLS+EK EE
Sbjct: 541 GSSFDNNHSSSGISAISDSMDSLEKPAKTH-LIPSQVLVPLSMKSSSSYSHFLSSEKPEE 600

Query: 601 DGNGY------CIGTKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
             +GY      C G K L++ IPEGRVHGG LMSL GG L  KN  DG+
Sbjct: 601 GRDGYGIGVGSCSGIKSLDFKIPEGRVHGGPLMSLFGGDL--KNAIDGM 644

BLAST of CSPI02G08830 vs. NCBI nr
Match: gi|255545299|ref|XP_002513710.1| (PREDICTED: uncharacterized protein LOC8281022 [Ricinus communis])

HSP 1 Score: 906.0 bits (2340), Expect = 3.8e-260
Identity = 455/653 (69.68%), Positives = 517/653 (79.17%), Query Frame = 1

Query: 1   MDLNKTVAHYSQNADLTKDDNFGDTTLSLNCFGFGGRKSSGCEVALNDLNFNFSYSPDDG 60
           MDLN     +   ++L K DNFGDTTL LNC  +GG   +G E   ++L  +F+  PDDG
Sbjct: 1   MDLNDKCKQFLHKSELPKSDNFGDTTLRLNCLSYGGTNMNGFECTQSNLKVDFTNGPDDG 60

Query: 61  CRLVLGLGPTPSANCDDYYNVGYNKTKAQVASVPEE---ISPSDSVLQLGLSGGTNEVSS 120
           C+LVLGLGPTP+A CDDYY++ +NKTK   A+        S  DS+LQLGLSGGT E  S
Sbjct: 61  CKLVLGLGPTPTAYCDDYYSMRFNKTKGSTAAAVLHRGLSSDGDSILQLGLSGGTKEALS 120

Query: 121 VVECSVSAETDVSTTYLISQWAAEANQLSIPLVDEGSTSAKKSGGYMPSLLFAPRMGISN 180
            +ECS   ETD+ST  +++Q++   ++  IP+VDEGSTSAKKSGGYMPSLL APRM  + 
Sbjct: 121 ELECSF-LETDISTP-ILNQFSGHEDRFLIPVVDEGSTSAKKSGGYMPSLLLAPRMDGAK 180

Query: 181 ILIQ-QEILE----TDSRNQLSQGLSPTVEYSLGTVIDQTTKSVCSDHQANNPKRCKYFG 240
           + ++ +E L+        +QL  G S + + S+GT+ +Q T +   D + +NPK+CK+FG
Sbjct: 181 VSLEGEEFLQFGAAKSQSHQLIHGTSASTDISMGTISEQATTATSVDRKISNPKKCKFFG 240

Query: 241 CEKGARGASGLCIGHGGGHRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTE 300
           C KGARGA GLCIGHGGG RCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKT+
Sbjct: 241 CSKGARGALGLCIGHGGGQRCQKPGCNKGAESRTAYCKAHGGGRRCQHLGCTKSAEGKTD 300

Query: 301 FCIAHGGGRRCGYSGGCAKAARGKSGLCIRHGGGKRCKMDGCTRSAEGHAGLCISHGGGR 360
           FCIAHGGGRRCG+ GGC KAARGKSGLCI+HGGGKRCK+DGC+RSAEG AGLCISHGGGR
Sbjct: 301 FCIAHGGGRRCGFGGGCTKAARGKSGLCIKHGGGKRCKVDGCSRSAEGQAGLCISHGGGR 360

Query: 361 RCQYERCTKGAQGSTMYCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLFDGGGIC 420
           RCQYE CTKGAQGSTM+CKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCL+DGGGIC
Sbjct: 361 RCQYEGCTKGAQGSTMHCKAHGGGKRCIFAGCTKGAEGSTPLCKGHGGGKRCLYDGGGIC 420

Query: 421 PKSVHGGTNFCVAHGGGKRCVVSGCTKSARGRTDCCVRHGGGKRCKFENCGKSAQGSTDF 480
           PKSVHGGTNFCVAHGGGKRCVV GCTKSARGRTDCCV+HGGGKRCKFENCGKSAQGSTDF
Sbjct: 421 PKSVHGGTNFCVAHGGGKRCVVPGCTKSARGRTDCCVKHGGGKRCKFENCGKSAQGSTDF 480

Query: 481 CKAHGGGKRCTWGEGKCEKFARGKSGLCAAHSSMIQDRETNKGSLIGPGLFHGLVSASAA 540
           CKAHGGGKRCTWGEGKCEKFARG+SGLCAAHSSM+ ++ +NKGSLIGPGLF GLVSA  A
Sbjct: 481 CKAHGGGKRCTWGEGKCEKFARGRSGLCAAHSSMVLEQGSNKGSLIGPGLFQGLVSA--A 540

Query: 541 STVGDSFDHYKSSSAISFICDSIDSAEKPMKRHQLIPPQVLVPSSMKSSASYSSFLSTEK 600
           S  G S D+  SSS IS + D  DS  KP KR  LIP QVLVP SMKSS+SYSSFL+ EK
Sbjct: 541 SNAGSSIDNNYSSSGISAVSDCTDSLGKPTKRQHLIPAQVLVPPSMKSSSSYSSFLNAEK 600

Query: 601 GEEDGNGYCIG-------TKFLEYSIPEGRVHGGGLMSLLGGHLKMKNMSDGI 639
            EE  N Y  G       T F +Y  PEGRVHGGGLMSL GG+L  KN  DGI
Sbjct: 601 QEEGRNEYSAGAGSTSRVTSF-DYMAPEGRVHGGGLMSLFGGNL--KNAIDGI 646

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
WRK19_ARATH       6.6e-63   50.41     Probable WRKY transcription factor 19 OS=Arabidopsis thaliana GN=WRKY19 PE=2 SV=...

Match Name        E-value   Identity  Description
A0A0A0LI30_CUCSA  0.0e+00   99.37     Uncharacterized protein OS=Cucumis sativus GN=Csa_2G122020 PE=4 SV=1
B9RIZ1_RICCO      2.6e-260  69.68     Putative uncharacterized protein OS=Ricinus communis GN=RCOM_1752370 PE=4 SV=1
A0A061E1L7_THECC  2.9e-259  70.05     Emb:CAB89363.1 OS=Theobroma cacao GN=TCM_007538 PE=4 SV=1
A0A0B0PEI2_GOSAR  2.3e-256  69.28     Putative WRKY transcription factor 19-like protein OS=Gossypium arboreum GN=F383...
A0A0D2W3D3_GOSRA  3.0e-256  69.67     Uncharacterized protein OS=Gossypium raimondii GN=B456_013G063300 PE=4 SV=1

Match Name        E-value   Identity  Description
AT5G64550.1       1.5e-211  60.49     loricrin-related
AT5G09670.2       2.3e-186  57.40     loricrin-related
AT1G64140.1       1.5e-134  56.00     BEST Arabidopsis thaliana protein match is: loricrin-related (TAIR:A...
AT4G12020.2       3.7e-64   50.41     protein kinase family protein

Match Name                        E-value   Identity  Description
gi|449442343|ref|XP_004138941.1|  0.0e+00   99.37     PREDICTED: delta-like protein C [Cucumis sativus]
gi|659114614|ref|XP_008457144.1|  0.0e+00   96.55     PREDICTED: uncharacterized protein LOC103496890 [Cucumis melo]
gi|470129587|ref|XP_004300691.1|  1.4e-262  71.47     PREDICTED: uncharacterized protein LOC101302269 [Fragaria vesca subsp. vesca]
gi|694326313|ref|XP_009354077.1|  3.4e-261  71.19     PREDICTED: uncharacterized protein LOC103945257 [Pyrus x bretschneideri]
gi|255545299|ref|XP_002513710.1|  3.8e-260  69.68     PREDICTED: uncharacterized protein LOC8281022 [Ricinus communis]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
CSPI02G08830.1  CSPI02G08830.1  mRNA


Analysis Name: InterPro Annotations of cucumber (PI183967)
Date Performed: 2017-01-17
IPR Term  IPR Description   Source   Source Term  Source Description  Alignment
None      No IPR available  PANTHER  PTHR31827    FAMILY NOT NAMED    coord: 376..563, score: 2.2E-227
                                                                      coord: 7..336, score: 2.2E

The following gene(s) are paralogous to this gene:

None