Cucsa.252140 (gene) Cucumber (Gy14) v1

Name: Cucsa.252140
Type: gene
Organism: Cucumis sativus (Cucumber (Gy14) v1)
Description: Hydroxyproline-rich glycoprotein family protein
Location: scaffold02229 : 1118270 .. 1125766 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

TGTTCATGCTTCGCTGTTCTTCCATTTTCATTACATATATATATATATTTTGGCACTCTCTCTCTTGGGTTGAGGTTTGGCCGGTGCATGGCGTTGCCCGCTAGGTAATTTGCCGGTTTATAACTGAGGCAGTGCGGTGGCGGCGGCACCACCATTGTGTGAATTGTTGGCTTTCTTCTAGTAAGTTTTAATGGAGCAGAAGGGGAAATCAAATGCTGTGAAGAACTCTACTACGATGTCATCTCGGGGTGGGCGGGTTTCTTTGAAGGCTATGGAGTCACCGAAGCGGGTGGTTTCTGTATCGGCGGTTGAATCGACGCCTCAGTCTGGTGTGAAGAAGCAAAGTTCGAGAGTTAGTAGATCTCTGACGCCGAATGGTCCTAAGAAGGGGAGGGATGGTGAGAATGTTGGAGTTTCGGCTCGAACGGTCAACCGTGGTGGTCTCAAGCAAGTTTTGCACCGGCGTTCTTTGTCTGGTGCTGGTTCTTGTGTGAATGTTGAGGATTGTAATGGAGTTAAGAGTGGATTGCAGGAGAAGCTTTGTTTTGCGGAGGATTTGATTAAAGATTTGCAGTCTCAATTGGTGGAGCTGAAGGAGGAGTTGCATAAGTCTCAGAGCTTGAACTTTGAACTTCAATCGCAGAATGATTTGCTCGTTCGTGACCTAGCCGCTGCTGAAGCGAAGTTCGCTAGTGTTAGTAATAATGACAAGGTGAGGAAGAAGATGCCATTGTTGTTTCTTTGAAAATTACTTTCTATTTGCCGTTGAAATTTTTTAAGGAAGTTCAGAATCGTTTTCAATTTTCTGTTCGTGGTTATAGAGGAAGTCAGTTTCAGAGGAATCGCAACGAAGCGCCGAGGACAATCAGAAACTTGAAAATGGAAAGTTGGAGACTCAACCATCAAGTTCGTGTCGGAATGTTAGAGATTTGGACTGCAAGACTCCACCACCACGGGCACCGCCGCCTCCGCCTCCGCCTCCGCCGCTTCCTGTCCAGTCCATGCCCCGAGCAGCGGCTACACAGAAATCTCCGGACCTCGTACGCCTCTTTCACTCATTAAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCAGCTGCGATCAATGCGCATAATAGCATTGTTGGGGAAATTCAGAATCGTTCTGCGCATCTTTTAGCCGTAAGGCCATGATTTTGGTTTATTTGATGAATTAAATCTAACCCACATAGAAAAATTAAGTCCCATCTTCAATTTTCAGATAAAAGCAGATATTGAAACCAAAGGAGAGTTCATCAATGGTCTCATTGACAAGGTGCTTGTTGCAGCTCATACGGACATTGAAGATATCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTTTCATCATTGGTAAATCTTTACTTGTCTCTTTTTtCTCTATCAGTTTTATTATTATTTTTTATTGTGTATACATCTTTGGAAAGAGCTAGTTTTAAGACAAAGTATTCTGGGTGTAGTGGGCCATATCATGTCTTACTATGCAGAAGATTTATGGTCATTATTCGTTAATATGGTCTGGTTTATTAGGAGAAAAGCAACGATCTAATCAACGTTTCTAACATGTGATGATTTTGTCAACTCGAACATAGTTTAGTAGATTAAGGCATCAATTACGAATTCAAAAGTAAAATGGTTTAATCTTCCACCCGAATGATAAAAGAAAAGCATGTGATAGACTTCGTGATCCTCTAGTGCCTTCCTATATTGTATAAATCTAATGCTGGTCCATTGTCTTTTAGACTGTCCTAAGATGGCCCCCTCTGACTTCTAAATTTACTATGGACTGGATTGTTTCTTTTTCTTTttTtCTTCTCCCCTCGACAAAGGAAATTGAGATTGAGGGACCCCGAGTCTTTTATAATGGATTTTGTAAACTAAATAGTTTTtGCAAGTCGGGTTGTCCCAGGAGGACCTATTTCGGGAACTCTTGCTGGGTACTTGGCTTCTTATCGGAAGGCTTTAGCATT
TATAAAATGGCTGAATCATCCTACAAAAGCAAAACTTTGTTACCTTGGGACCATATTCCATTGCTCAATCTATCCAGAAAATCGTTGGCATTTTCCTTTGTGAACTTTATCCTCTTTAGGCAGTGCGGCATGTGAGGTGGTGAATTGGGTTTTATATTTCTTTTGAACTAAAAAAATCTATTCTCTGTCCTCAAATTCTAGAAGCTGTTGGTTTGAGAATGACGTGGTCTTTACTTTGTTCATTTTGGGGTGAAGTCCTTGTTGATGTTGACTGGCCAACTGTACAGCTTTGTACATTTTGCTTGAAACCAGAAACTTTAGATGATTATCTAAATTTGTCATTTGTGGAACTTCACTGTGCCGAAATTGAAATCCTACGTACATTAGTCATATGTTATCTTATCTCAACATTTCTCAGCCATGTGCCACTGATAATGGTTGTTGATCTTTTGGAATATGCCATTTGTTTCTTGGTGGACCTAAGTTTTAGAAACAGAAGGAAACTTTTTTATGGACCCAAAACTGAGATTTAATGTTAGTTCCACTAATTCCTTTCTAACTAAAACGCCAAATGACAATCATTCAGATAAAATAACTACTGAAACTGCTGTTTGCTAATTAATGCTGTTCCGTGATAAATATATTCCTGCGTTTGAGTTAGTTTAGTCTATGTTCTTGTTGTGCCTATAAATGGCACTTTTGGGGTAGTTTTGTATTTAGGCCCACTCTGTAATGGCTAATAGGATACATGAATTAAGGGCATGCTTGATAGTGATTTAGAGATAGTtAATTTTTTTTTTTTttCATATTCAAAATCACTCTCGAACATGTCATTAATCATGCATTCAAAATCAATTTAGTGTCTAGTTCCTACACTTTTAGTGCATCTTCATACCATCAAAAAaGATTTTGAATATATTTTGAAAGTTAACCAAGTGACTGAAAATTACTCATAAGACATAAACAATGTGAAAATAGTGATGTTGAGAAAGAGTAGAAATGTTTTGGCCTACACATGACCGTATGCGTGCTCCTCCCCTGTTATATTATTGGATGTTAAATGTTTATCTGCAGGCTGACGAGCGAGCTGTGTTAAAGCATTTCAAGTGGCCTGAGAAAAAAGCTGATGCCATGCGAGAAGCGGCCATAGAATACCGTGCGCTCAAACTGTTGGAAAATGAAATCTCCTTTTACAAGGATGACACTAATTCTCCATGTGAGGCAGCCTTGAAGAAGATGGCGAGCTTGTTAGACAAGTTAGAACTCCAACACATCTGATAAACTTTTTATTAACCAATTCATGTTTGTCCCTGATTTTTCCTTCTATCGTCATATAATAAAAGGTCGGAGCGAGGCATACAACGTTTAATCACACTTCGGAGTACTGTTATGCATTCTTATCAAAACCTGAAACTCCCAACAAATTGGATGCTAGACTCCGGTATCATGAGTAAGGTACATGACTTGAAAAaCATAAAAAaTCTCTCATCGTACTGGAACAATCATTTTCATACAACATATAATTTCGATTTTGAACCCTTTACAGATAAAGCAAGCTTCTATGAATCTTGCCAAGATGTACATGAAAAGGGTGAAAACGGAGCTGGATTCAGTTCGTAGTTCAGATAAAGAATCCAACCATGAATCTCTATTACTTCAGGGAATTCATTTTGCATACAGAACTCACCAGGTAACATTTTGAGAAGAATACGTCCCTTTGAACCACAAATTCTCTAATCCTTGAGGCGCCACTATTCTGTATTCAATAACATTCTCTCTCTGCCACGTGACATAGCTAACACACTTGTAGTGAACCATGTAAATGTGTGTGTCGGTTCTTTCTGTTTTAGTATTCTCTATTTGGTCAATTGATTTTGTAACATGATCTAAATACACAGTGCTTTTGGTTGGGCAGTTTGCTGGAGGGCTTGATTCGGAAACATTGTGTGCTTTTGAGGAAATAAAACAATGGGTTCCAAGACGAATGGTTGGAAGATCCC
ATGCTCAAGGATTGATAGTTGGCATACAATCATCATAACCAAGTAAACAGTTATTCTCTCTCCTTATGTAAAATATTTACTCTTTTGTAGTATATTTATTGAAAATGAGAAGGGGATAGCAGATTAAGATAGATGTTCATTATTAATTTGGTTAATGTTAGTGAAGTGTATTGATATATGATATTTGGTTGTTTTGTAGTAATGGCTTAAAGAAATGGTGTTTCAAATAGTTTTATTTTTAAAATAAATAAAATTAGCTTCATCTACTTGTCATAATTTTTtACTTTTTAATGTATATCTGTTCTCTGTGTATATGGTTGAACATTATCATTATCATAAAATTATCATTATCATAAATGGTGTATATCCGCATTTTCA

mRNA sequence

TGTTCATGCTTCGCTGTTCTTCCATTTTCATTACATATATATATATATTTTGGCACTCTCTCTCTTGGGTTGAGGTTTGGCCGGTGCATGGCGTTGCCCGCTAGGTAATTTGCCGGTTTATAACTGAGGCAGTGCGGTGGCGGCGGCACCACCATTGTGTGAATTGTTGGCTTTCTTCTAGTAAGTTTTAATGGAGCAGAAGGGGAAATCAAATGCTGTGAAGAACTCTACTACGATGTCATCTCGGGGTGGGCGGGTTTCTTTGAAGGCTATGGAGTCACCGAAGCGGGTGGTTTCTGTATCGGCGGTTGAATCGACGCCTCAGTCTGGTGTGAAGAAGCAAAGTTCGAGAGTTAGTAGATCTCTGACGCCGAATGGTCCTAAGAAGGGGAGGGATGGTGAGAATGTTGGAGTTTCGGCTCGAACGGTCAACCGTGGTGGTCTCAAGCAAGTTTTGCACCGGCGTTCTTTGTCTGGTGCTGGTTCTTGTGTGAATGTTGAGGATTGTAATGGAGTTAAGAGTGGATTGCAGGAGAAGCTTTGTTTTGCGGAGGATTTGATTAAAGATTTGCAGTCTCAATTGGTGGAGCTGAAGGAGGAGTTGCATAAGTCTCAGAGCTTGAACTTTGAACTTCAATCGCAGAATGATTTGCTCGTTCGTGACCTAGCCGCTGCTGAAGCGAAGTTCGCTAGTGTTAGTAATAATGACAAGAGGAAGTCAGTTTCAGAGGAATCGCAACGAAGCGCCGAGGACAATCAGAAACTTGAAAATGGAAAGTTGGAGACTCAACCATCAAGTTCGTGTCGGAATGTTAGAGATTTGGACTGCAAGACTccaccaccacgggcaccgccgcctccgcctccgcctccgccgcTTCCTGTCCAGTCCATGCCCCGAGCAGCGGCTACACAGAAATCTCCGGACCTCGTACGCCTCTTTCACTCATTAAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCAGCTGCGATCAATGCGCATAATAGCATTGTTGGGGAAATTCAGAATCGTTCTGCGCATCTTTTAGCCATAAAAGCAGATATTGAAACCAAAGGAGAGTTCATCAATGGTCTCATTGACAAGGTGCTTGTTGCAGCTCATACGGACATTGAAGATATCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTTTCATCATTGGCTGACGAGCGAGCTGTGTTAAAGCATTTCAAGTGGCCTGAGAAAAAAGCTGATGCCATGCGAGAAGCGGCCATAGAATACCGTGCGCTCAAACTGTTGGAAAATGAAATCTCCTTTTACAAGGATGACACTAATTCTCCATGTGAGGCAGCCTTGAAGAAGATGGCGAGCTTGTTAGACAAGTCGGAGCGAGGCATACAACGTTTAATCACACTTCGGAGTACTGTTATGCATTCTTATCAAAACCTGAAACTCCCAACAAATTGGATGCTAGACTCCGGTATCATGAGTAAGATAAAGCAAGCTTCTATGAATCTTGCCAAGATGTACATGAAAAGGGTGAAAACGGAGCTGGATTCAGTTCGTAGTTCAGATAAAGAATCCAACCATGAATCTCTATTACTTCAGGGAATTCATTTTGCATACAGAACTCACCAGTTTGCTGGAGGGCTTGATTCGGAAACATTGTGTGCTTTTGAGGAAATAAAACAATGGGTTCCAAGACGAATGGTTGGAAGATCCCATGCTCAAGGATTGATAGTTGGCATACAATCATCATAACCAAGTAAACAGTTATTCTCTCTCCTTATGTAAAATATTTACTCTTTTGTAGTATATTTATTGAAAATGAGAAGGGGATAGCAGATTAAGATAGATGTTCATTATTAATTTGGTTAATGTTAGTGAAGTGTATTGATATATGATATTTGGTTGTTTTGTAGTAATGGCTTAAAGAAATGGTGTTTCAAATAGttttatttttaaaataaataaaattaGCTTCATCTACTTGTCATAATTTTTT
ACTTTTTAATGTATATCTGTTCTCTGTGTATATGGTTGAACATTATCATTATCATAAAATTATCATTATCATAAATGGTGTATATCCGCATTTTCA

Coding sequence (CDS)

ATGGAGCAGAAGGGGAAATCAAATGCTGTGAAGAACTCTACTACGATGTCATCTCGGGGTGGGCGGGTTTCTTTGAAGGCTATGGAGTCACCGAAGCGGGTGGTTTCTGTATCGGCGGTTGAATCGACGCCTCAGTCTGGTGTGAAGAAGCAAAGTTCGAGAGTTAGTAGATCTCTGACGCCGAATGGTCCTAAGAAGGGGAGGGATGGTGAGAATGTTGGAGTTTCGGCTCGAACGGTCAACCGTGGTGGTCTCAAGCAAGTTTTGCACCGGCGTTCTTTGTCTGGTGCTGGTTCTTGTGTGAATGTTGAGGATTGTAATGGAGTTAAGAGTGGATTGCAGGAGAAGCTTTGTTTTGCGGAGGATTTGATTAAAGATTTGCAGTCTCAATTGGTGGAGCTGAAGGAGGAGTTGCATAAGTCTCAGAGCTTGAACTTTGAACTTCAATCGCAGAATGATTTGCTCGTTCGTGACCTAGCCGCTGCTGAAGCGAAGTTCGCTAGTGTTAGTAATAATGACAAGAGGAAGTCAGTTTCAGAGGAATCGCAACGAAGCGCCGAGGACAATCAGAAACTTGAAAATGGAAAGTTGGAGACTCAACCATCAAGTTCGTGTCGGAATGTTAGAGATTTGGACTGCAAGACTCCACCACCACGGGCACCGCCGCCTCCGCCTCCGCCTCCGCCGCTTCCTGTCCAGTCCATGCCCCGAGCAGCGGCTACACAGAAATCTCCGGACCTCGTACGCCTCTTTCACTCATTAAGAAAGAAAGAGGGGAAGAGAGATCCTCCATTGTTGGGGAAACCAGCTGCGATCAATGCGCATAATAGCATTGTTGGGGAAATTCAGAATCGTTCTGCGCATCTTTTAGCCATAAAAGCAGATATTGAAACCAAAGGAGAGTTCATCAATGGTCTCATTGACAAGGTGCTTGTTGCAGCTCATACGGACATTGAAGATATCCTCAAGTTTGTCGATTGGCTTGATTCCCAACTTTCATCATTGGCTGACGAGCGAGCTGTGTTAAAGCATTTCAAGTGGCCTGAGAAAAAAGCTGATGCCATGCGAGAAGCGGCCATAGAATACCGTGCGCTCAAACTGTTGGAAAATGAAATCTCCTTTTACAAGGATGACACTAATTCTCCATGTGAGGCAGCCTTGAAGAAGATGGCGAGCTTGTTAGACAAGTCGGAGCGAGGCATACAACGTTTAATCACACTTCGGAGTACTGTTATGCATTCTTATCAAAACCTGAAACTCCCAACAAATTGGATGCTAGACTCCGGTATCATGAGTAAGATAAAGCAAGCTTCTATGAATCTTGCCAAGATGTACATGAAAAGGGTGAAAACGGAGCTGGATTCAGTTCGTAGTTCAGATAAAGAATCCAACCATGAATCTCTATTACTTCAGGGAATTCATTTTGCATACAGAACTCACCAGTTTGCTGGAGGGCTTGATTCGGAAACATTGTGTGCTTTTGAGGAAATAAAACAATGGGTTCCAAGACGAATGGTTGGAAGATCCCATGCTCAAGGATTGATAGTTGGCATACAATCATCATAA

Protein sequence

MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLTPNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFAEDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSEESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS*
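
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that translation in plain Python, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First and last codons of the CDS listed above.
print(translate("ATGGAGCAGAAGGGGAAATCAAATGCTGTGAAGAACTCT"))
print(translate("ATACAATCATCATAA"))
```

Applied to the full CDS, the same function should reproduce the protein sequence, including the trailing `*` for the TAA stop codon.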
BLAST of Cucsa.252140 vs. Swiss-Prot
Match: CHUP1_ARATH (Protein CHUP1, chloroplastic OS=Arabidopsis thaliana GN=CHUP1 PE=1 SV=1)

HSP 1 Score: 290.0 bits (741), Expect = 5.0e-77
Identity = 147/295 (49.83%), Positives = 200/295 (67.80%), Query Frame = 1

Query: 216 PPPRAPPPPP----PPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPL 275
           PP   PPPPP    PPPP P  ++ R A       ++P+LV  + SL K+E K++  P L
Sbjct: 685 PPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSL 744

Query: 276 L--GKPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILK 335
           +  G   +  A N+++GEI+NRS  LLA+KAD+ET+G+F+  L  +V  ++ TDIED+L 
Sbjct: 745 ISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLA 804

Query: 336 FVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPC 395
           FV WLD +LS L DERAVLKHF WPE KADA+REAA EY+ L  LE +++ + DD N  C
Sbjct: 805 FVSWLDEELSFLVDERAVLKHFDWPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSC 864

Query: 396 EAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAK 455
           E ALKKM  LL+K E+ +  L+  R   +  Y+   +P +W+ D+G++ KIK +S+ LAK
Sbjct: 865 EPALKKMYKLLEKVEQSVYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAK 924

Query: 456 MYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK 499
            YMKRV  ELDSV  SDK+ N E LLLQG+ FA+R HQFAGG D+E++ AFEE++
Sbjct: 925 KYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELR 979
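
The Identity and Positives figures in an HSP header are counts over the alignment length, with the percentage derived from them. The sketch below parses such a line and recomputes the percentages; the function name `parse_hsp_stats` and the returned keys are hypothetical, and the pattern is an assumption based only on the header lines shown in this report (it also tolerates the "Postives" spelling seen in some exports).

```python
import re

# Pattern modelled on the HSP summary lines in this report (an assumption,
# not an official BLAST grammar).
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), "
    r"Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line")
    identities, length, positives, pos_length = map(int, match.groups())
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / pos_length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 147/295 (49.83%), Positives = 200/295 (67.80%), Query Frame = 1"
)
print(stats)
```

For the CHUP1_ARATH hit this gives 147 identical and 200 positive positions over 295 aligned columns, matching the percentages printed in the header.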

BLAST of Cucsa.252140 vs. TrEMBL
Match: A0A0A0L5G9_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_3G171180 PE=4 SV=1)

HSP 1 Score: 1021.1 bits (2639), Expect = 4.6e-295
Identity = 520/521 (99.81%), Positives = 521/521 (100.00%), Query Frame = 1

Query: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLT 60
           MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSS+VSRSLT
Sbjct: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSKVSRSLT 60

Query: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120
           PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA
Sbjct: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120

Query: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180
           EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE
Sbjct: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180

Query: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240
           ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA
Sbjct: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240

Query: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300
           TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG
Sbjct: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300

Query: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360
           EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI
Sbjct: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360

Query: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420
           EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL
Sbjct: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420

Query: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480
           PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH
Sbjct: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480

Query: 481 QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS 522
           QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS
Sbjct: 481 QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS 521

BLAST of Cucsa.252140 vs. TrEMBL
Match: E5GC44_CUCME (Hydroxyproline-rich glycoprotein family protein OS=Cucumis melo subsp. melo PE=4 SV=1)

HSP 1 Score: 901.7 bits (2329), Expect = 4.1e-259
Identity = 466/481 (96.88%), Positives = 467/481 (97.09%), Query Frame = 1

Query: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLT 60
           MEQKGKS AVKNSTTMSSRGGRVSLKAMESPKRVVSVS VESTPQSGVKKQSSRVSRSLT
Sbjct: 1   MEQKGKSTAVKNSTTMSSRGGRVSLKAMESPKRVVSVSVVESTPQSGVKKQSSRVSRSLT 60

Query: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120
           PN PKKGRDGENVGVSARTVNRGGLKQV HRRSLS AGSCVNVEDCNGVKSGLQEKL FA
Sbjct: 61  PNAPKKGRDGENVGVSARTVNRGGLKQVSHRRSLSVAGSCVNVEDCNGVKSGLQEKLYFA 120

Query: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180
           EDLIKDLQSQLVELKEEL KSQSLN ELQSQNDLLVRDLAAAEAKFAS SNNDKRKSVSE
Sbjct: 121 EDLIKDLQSQLVELKEELRKSQSLNLELQSQNDLLVRDLAAAEAKFASASNNDKRKSVSE 180

Query: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240
           ESQR  EDNQKLENGKLETQPSSSCRNVRDLDCK PPPRA PP PPPPPLPVQSMPRAAA
Sbjct: 181 ESQRRTEDNQKLENGKLETQPSSSCRNVRDLDCKAPPPRAAPP-PPPPPLPVQSMPRAAA 240

Query: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300
           TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG
Sbjct: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300

Query: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360
           EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI
Sbjct: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360

Query: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420
           EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQ+LKL
Sbjct: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQDLKL 420

Query: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480
           PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH
Sbjct: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480

Query: 481 Q 482
           Q
Sbjct: 481 Q 480

BLAST of Cucsa.252140 vs. TrEMBL
Match: M5XEB8_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003741mg PE=4 SV=1)

HSP 1 Score: 498.8 bits (1283), Expect = 7.9e-138
Identity = 297/556 (53.42%), Positives = 358/556 (64.39%), Query Frame = 1

Query: 6   KSNAVKNSTTMSSR-GGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLTPNGP 65
           K     +ST   S+  G +S     S  R  + S  + +P     +  S + RSL  N P
Sbjct: 2   KQGTPPSSTKSESKVSGNMSQPTPPSYLRASASSKAKESPSPRPSRAKS-IRRSLLLNKP 61

Query: 66  KKG---------RDGENVGVSARTVNRGGLKQVLHRRSLSGA--GSCVNVEDCNGVKSGL 125
           K G         ++ E      R  NR   +Q    R    A   S  N ED +     L
Sbjct: 62  KSGELVLGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNREL 121

Query: 126 QEKLCFAEDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNND 185
           QE+L  +E L  + Q++++ LK EL K+Q LN ELQSQN  L   LAAAEAK A+ +  +
Sbjct: 122 QERLDMSESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTRE 181

Query: 186 KRKSVSEESQRSAEDNQKLENGKLETQ-------------------PSSSCRNVRDLDCK 245
           +R++  E      +D QKL   KLE                     P+ +   V      
Sbjct: 182 QRETNGEYQSPKFKDLQKLIANKLERPVVKKEAVKEKSANKTPAPAPTGAIPRVAATQSG 241

Query: 246 TPPPRAPPP------PPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLG- 305
            PPP  PPP      PPPPPP P       +ATQK+P LV  FHSLRK+E KRD P    
Sbjct: 242 PPPPPPPPPSVRSPTPPPPPPQP-SVRTTTSATQKAPSLVEFFHSLRKQEVKRDSPESRN 301

Query: 306 --KPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILKFV 365
             KP+AI+AHNSIVGEIQNRSAHLLAIKAD++TKGEFIN LI KVLVAA+TDIED+LKFV
Sbjct: 302 HHKPSAISAHNSIVGEIQNRSAHLLAIKADVQTKGEFINDLIQKVLVAAYTDIEDVLKFV 361

Query: 366 DWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCEA 425
           DWLD +LSSLADERAVLKHFKWPE+KADAMREAAIEYR LKLL++EIS YKDDT+ PC A
Sbjct: 362 DWLDGELSSLADERAVLKHFKWPERKADAMREAAIEYRDLKLLQSEISSYKDDTDIPCAA 421

Query: 426 ALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKMY 485
           ALKKMA LLDKSER IQRLI LR++VM SYQ LK+P +WMLDSGI+SKIK+ASMNLA +Y
Sbjct: 422 ALKKMAGLLDKSERSIQRLIKLRNSVMRSYQELKIPIDWMLDSGIVSKIKKASMNLANVY 481

Query: 486 MKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRRM 522
           MKRV  EL+S+R+SD+E++ ESLLLQG+HF YR HQFAGGLDSETLCAFEEI+Q VP  +
Sbjct: 482 MKRVTMELESIRNSDRETSQESLLLQGVHFVYRAHQFAGGLDSETLCAFEEIRQRVPGHL 541

BLAST of Cucsa.252140 vs. TrEMBL
Match: A0A067FXY0_CITSI (Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008574mg PE=4 SV=1)

HSP 1 Score: 495.7 bits (1275), Expect = 6.7e-137
Identity = 295/564 (52.30%), Positives = 371/564 (65.78%), Query Frame = 1

Query: 2   EQKGKSNAVKNSTTMSSRGG-RVSLKAMESPKRVVSVSAVESTPQSGVKKQS-------- 61
           ++  K+N + +ST  ++    R + K  ESPK+   ++ V  +P+   + +S        
Sbjct: 5   QELSKTNNMSHSTAATTTFRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPADVKTN 64

Query: 62  --SRVSRSLTPNGPKKGRDG------ENVGVSARTVNRGGLKQVLH-RRSLSGAGSCVNV 121
             S+  R+L  N PK           + V V  R++NR  ++Q    RR      +   +
Sbjct: 65  NISKSRRALILNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKI 124

Query: 122 ED--CNGVKSGLQEKLCFAEDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAA 181
           ED   +  K   +EKL  +E+L+KDLQS++  LK E  K+QSLN EL+ QN  LV DL A
Sbjct: 125 EDGLMDKKKKEFEEKLMLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVA 184

Query: 182 AEAKFASVSNNDKRKSVSEESQRSAEDNQKLENGKLE------------------TQPSS 241
           AEAK AS+S+ ++R++V E      +D QKL   KLE                  ++P  
Sbjct: 185 AEAKIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSINTPPSEPKI 244

Query: 242 SCRNVRDLDCKT---PPPRAPPPPPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGK 301
             RN   ++ K    P   AP PPPPPP  P     RAAATQK+P   +L+HSL K+  K
Sbjct: 245 PIRNAAGVERKPQAYPSMPAPLPPPPPPRPPA----RAAATQKTPSFAQLYHSLTKQVEK 304

Query: 302 RDPPL---LGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTD 361
           +D P      +PA   AH+SIVGEIQNRSAHLLAIKADIETKG FIN LI KVL AA+T+
Sbjct: 305 KDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIETKGGFINSLIQKVLAAAYTN 364

Query: 362 IEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKD 421
           IED+L+FVDWLD +LSSLADERAVLKHFKWPEKKADAMREAA+EYR LK LENEIS Y+D
Sbjct: 365 IEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMREAAVEYRDLKQLENEISSYRD 424

Query: 422 DTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQA 481
           DTN P  AALKKMASLLDKSER IQRL+ LR++VMHSY++ K+P +WMLDSGI+SKIKQA
Sbjct: 425 DTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDCKIPVDWMLDSGIISKIKQA 484

Query: 482 SMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEI 522
           SM LA+MYMKRV  EL+ V +SD+ES  E+LLLQG+HFAYR HQF GGLDSETLCAFEEI
Sbjct: 485 SMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYRAHQFVGGLDSETLCAFEEI 544

BLAST of Cucsa.252140 vs. TrEMBL
Match: V4TZL4_9ROSI (Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004653mg PE=4 SV=1)

HSP 1 Score: 495.0 bits (1273), Expect = 1.1e-136
Identity = 294/564 (52.13%), Positives = 371/564 (65.78%), Query Frame = 1

Query: 2   EQKGKSNAVKNSTTMSSRGG-RVSLKAMESPKRVVSVSAVESTPQSGVKKQS-------- 61
           ++  K+N + +ST  ++    R + K  ESPK+   ++ V  +P+   + +S        
Sbjct: 5   QELSKTNNMSHSTAATTTSRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKTN 64

Query: 62  --SRVSRSLTPNGPKKGRDG------ENVGVSARTVNRGGLKQVLH-RRSLSGAGSCVNV 121
             S+  R+L  N PK           + V V  R++NR  ++Q    RR      +   +
Sbjct: 65  NISKSRRALVLNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKI 124

Query: 122 ED--CNGVKSGLQEKLCFAEDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAA 181
           ED   +  K   +EKL  +E+L+KDLQS++  LK E  K+QSLN EL+ QN  LV DL A
Sbjct: 125 EDGLMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVA 184

Query: 182 AEAKFASVSNNDKRKSVSEESQRSAEDNQKLENGKLE------------------TQPSS 241
           AEAK AS+S+ ++R++V E      +D QKL   KLE                  ++P  
Sbjct: 185 AEAKIASLSSREQREAVGEYQSPKFKDVQKLIANKLEHSIVMTDAISETSINTPPSEPKI 244

Query: 242 SCRNVRDLDCKT---PPPRAPPPPPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGK 301
             RN   ++ K    P   AP PPPPPP  P     RAAATQK+P   +L+HSL K+  K
Sbjct: 245 PIRNAAGVERKPQAYPSMPAPLPPPPPPRPPA----RAAATQKTPSFAQLYHSLTKQVEK 304

Query: 302 RDPPL---LGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTD 361
           +D P      +PA   AH+SIVGEIQNRSAHLLAIKADIETKG FIN LI KVL AA+T+
Sbjct: 305 KDLPSPVNQKRPAVSIAHSSIVGEIQNRSAHLLAIKADIETKGGFINSLIQKVLAAAYTN 364

Query: 362 IEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKD 421
           IED+L+FVDWLD +LSSLADERAVLKHFKWPEKKADAM+EAA+EYR LK LENEIS Y+D
Sbjct: 365 IEDLLEFVDWLDKELSSLADERAVLKHFKWPEKKADAMQEAAVEYRDLKQLENEISSYRD 424

Query: 422 DTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQA 481
           DTN P  AALKKMASLLDKSER IQRL+ LR++VMHSY++ K+P +WMLDSGI+SKIKQA
Sbjct: 425 DTNVPFGAALKKMASLLDKSERSIQRLVKLRNSVMHSYKDCKIPVDWMLDSGIISKIKQA 484

Query: 482 SMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEI 522
           SM LA+MYMKRV  EL+ V +SD+ES  E+LLLQG+HFAYR HQF GGLDSETLCAFEEI
Sbjct: 485 SMKLAQMYMKRVTRELELVHNSDRESTQEALLLQGLHFAYRAHQFVGGLDSETLCAFEEI 544

BLAST of Cucsa.252140 vs. TAIR10
Match: AT1G48280.1 (AT1G48280.1 hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 366.3 bits (939), Expect = 3.1e-101
Identity = 189/313 (60.38%), Positives = 229/313 (73.16%), Query Frame = 1

Query: 196 KLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAATQKSPDLVRLFHSLR 255
           K    P+SS     +      PP  PPPPPPPPP P   + +AA  QKSP + +LF  L 
Sbjct: 237 KFLVSPASSLGKRDENSSPFAPPTPPPPPPPPPPRP---LAKAARAQKSPPVSQLFQLLN 296

Query: 256 KKEGKRD--PPLLGKPAAIN-AHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLV 315
           K++  R+    + G  + +N AHNSIVGEIQNRSAHL+AIKADIETKGEFIN LI KVL 
Sbjct: 297 KQDNSRNLSQSVNGNKSQVNSAHNSIVGEIQNRSAHLIAIKADIETKGEFINDLIQKVLT 356

Query: 316 AAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEI 375
              +D+ED++KFVDWLD +L++LADERAVLKHFKWPEKKAD ++EAA+EYR LK LE E+
Sbjct: 357 TCFSDMEDVMKFVDWLDKELATLADERAVLKHFKWPEKKADTLQEAAVEYRELKKLEKEL 416

Query: 376 SFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMS 435
           S Y DD N     ALKKMA+LLDKSE+ I+RL+ LR + M SYQ+ K+P  WMLDSG++ 
Sbjct: 417 SSYSDDPNIHYGVALKKMANLLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEWMLDSGMIC 476

Query: 436 KIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLC 495
           KIK+AS+ LAK YM RV  EL S R+ D+ES  E+LLLQG+ FAYRTHQFAGGLD ETLC
Sbjct: 477 KIKRASIKLAKTYMNRVANELQSARNLDRESTKEALLLQGVRFAYRTHQFAGGLDPETLC 536

Query: 496 AFEEIKQWVPRRM 506
           A EEIKQ VP  +
Sbjct: 537 ALEEIKQRVPSHL 546


HSP 2 Score: 345.9 bits (886), Expect = 4.3e-95
Identity = 217/473 (45.88%), Positives = 276/473 (58.35%), Query Frame = 1

Query: 77  ARTVNR-------GGLKQVLHRRSLSGAGSCVNVEDCNGVK-SGLQEKLCFAEDLIKDLQ 136
           AR+VNR       G  ++ + R+S     +    ED    +   L+EKL   E LIKDLQ
Sbjct: 77  ARSVNRPAVVEQFGCPRRPISRKSEETVMATAAAEDEKRKRMEELEEKLVVNESLIKDLQ 136

Query: 137 SQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSEESQRSAED 196
            Q++ LK EL ++++ N EL+  N  L +DL +AEAK +S+S+ND  K   E      +D
Sbjct: 137 LQVLNLKTELEEARNSNVELELNNRKLSQDLVSAEAKISSLSSND--KPAKEHQNSRFKD 196

Query: 197 NQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLP--------------VQS 256
            Q+L   KLE QP        +    +PP  +P   PP PPLP                S
Sbjct: 197 IQRLIASKLE-QPKVKKEVAVESSRLSPPSPSPSRLPPTPPLPKFLVSPASSLGKRDENS 256

Query: 257 MPRAAATQKSPDLVRLFHSLRK-KEGKRDPP------LLGK-------PAAINAHNSIVG 316
            P A  T   P        L K    ++ PP      LL K         ++N + S V 
Sbjct: 257 SPFAPPTPPPPPPPPPPRPLAKAARAQKSPPVSQLFQLLNKQDNSRNLSQSVNGNKSQVN 316

Query: 317 EIQNRSAHLLAIK--------ADIETKGEFINGLIDKVLVAAHTDIEDILKFVDWLDSQL 376
              N     +  +        ADIETKGEFIN LI KVL    +D+ED++KFVDWLD +L
Sbjct: 317 SAHNSIVGEIQNRSAHLIAIKADIETKGEFINDLIQKVLTTCFSDMEDVMKFVDWLDKEL 376

Query: 377 SSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCEAALKKMAS 436
           ++LADERAVLKHFKWPEKKAD ++EAA+EYR LK LE E+S Y DD N     ALKKMA+
Sbjct: 377 ATLADERAVLKHFKWPEKKADTLQEAAVEYRELKKLEKELSSYSDDPNIHYGVALKKMAN 436

Query: 437 LLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKMYMKRVKTE 496
           LLDKSE+ I+RL+ LR + M SYQ+ K+P  WMLDSG++ KIK+AS+ LAK YM RV  E
Sbjct: 437 LLDKSEQRIRRLVRLRGSSMRSYQDFKIPVEWMLDSGMICKIKRASIKLAKTYMNRVANE 496

Query: 497 LDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRRM 506
           L S R+ D+ES  E+LLLQG+ FAYRTHQFAGGLD ETLCA EEIKQ VP  +
Sbjct: 497 LQSARNLDRESTKEALLLQGVRFAYRTHQFAGGLDPETLCALEEIKQRVPSHL 546

BLAST of Cucsa.252140 vs. TAIR10
Match: AT3G25690.1 (AT3G25690.1 Hydroxyproline-rich glycoprotein family protein)

HSP 1 Score: 290.0 bits (741), Expect = 2.8e-78
Identity = 147/295 (49.83%), Positives = 200/295 (67.80%), Query Frame = 1

Query: 216 PPPRAPPPPP----PPPPLPVQSMPRAAA----TQKSPDLVRLFHSLRKKEGKRD--PPL 275
           PP   PPPPP    PPPP P  ++ R A       ++P+LV  + SL K+E K++  P L
Sbjct: 685 PPGGGPPPPPGGGPPPPPPPPGALGRGAGGGNKVHRAPELVEFYQSLMKRESKKEGAPSL 744

Query: 276 L--GKPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILK 335
           +  G   +  A N+++GEI+NRS  LLA+KAD+ET+G+F+  L  +V  ++ TDIED+L 
Sbjct: 745 ISSGTGNSSAARNNMIGEIENRSTFLLAVKADVETQGDFVQSLATEVRASSFTDIEDLLA 804

Query: 336 FVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPC 395
           FV WLD +LS L DERAVLKHF WPE KADA+REAA EY+ L  LE +++ + DD N  C
Sbjct: 805 FVSWLDEELSFLVDERAVLKHFDWPEGKADALREAAFEYQDLMKLEKQVTSFVDDPNLSC 864

Query: 396 EAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAK 455
           E ALKKM  LL+K E+ +  L+  R   +  Y+   +P +W+ D+G++ KIK +S+ LAK
Sbjct: 865 EPALKKMYKLLEKVEQSVYALLRTRDMAISRYKEFGIPVDWLSDTGVVGKIKLSSVQLAK 924

Query: 456 MYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK 499
            YMKRV  ELDSV  SDK+ N E LLLQG+ FA+R HQFAGG D+E++ AFEE++
Sbjct: 925 KYMKRVAYELDSVSGSDKDPNREFLLLQGVRFAFRVHQFAGGFDAESMKAFEELR 979

BLAST of Cucsa.252140 vs. TAIR10
Match: AT4G18570.1 (AT4G18570.1 Tetratricopeptide repeat (TPR)-like superfamily protein)

HSP 1 Score: 262.7 bits (670), Expect = 4.8e-70
Identity = 140/294 (47.62%), Positives = 200/294 (68.03%), Query Frame = 1

Query: 216 PPP---RAPPPPPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEG---KRDPPLLGKP 275
           PPP   +APPPPPPPPP P      +A  ++ P++V  +HSL +++    +RD    G  
Sbjct: 326 PPPSVSKAPPPPPPPPP-PKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGGGNA 385

Query: 276 AA--INAHNS---IVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILKF 335
           AA  I A+++   ++GEI+NRS +LLAIK D+ET+G+FI  LI +V  AA +DIED++ F
Sbjct: 386 AAEAILANSNARDMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPF 445

Query: 336 VDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCE 395
           V WLD +LS L DERAVLKHF+WPE+KADA+REAA  Y  LK L +E S +++D      
Sbjct: 446 VKWLDDELSYLVDERAVLKHFEWPEQKADALREAAFCYFDLKKLISEASRFREDPRQSSS 505

Query: 396 AALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKM 455
           +ALKKM +L +K E G+  L  +R +    +++ ++P +WML++GI S+IK AS+ LA  
Sbjct: 506 SALKKMQALFEKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMK 565

Query: 456 YMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK 499
           YMKRV  EL+++     E   E L++QG+ FA+R HQFAGG D+ET+ AFEE++
Sbjct: 566 YMKRVSAELEAIEGGGPE--EEELIVQGVRFAFRVHQFAGGFDAETMKAFEELR 616


HSP 2 Score: 214.5 bits (545), Expect = 1.5e-55
Identity = 141/412 (34.22%), Positives = 216/412 (52.43%), Query Frame = 1

Query: 122 DLIKDLQ--SQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVS 181
           +LI+ L+    L  L E +   ++ N  + S  D    D+   + +  S S +   + ++
Sbjct: 210 NLIRSLKRVGSLRNLPEPITNQENTNKSISSSGDA-DGDIYRKD-EIESYSRSSNSEELT 269

Query: 182 EESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMP--- 241
           E S  S   ++     K   + S S  +  +     PP ++ PPPPPPPP P+   P   
Sbjct: 270 ESSSLSTVRSRVPRVPKPPPKRSISLGDSTENRADPPPQKSIPPPPPPPPPPLLQQPPPP 329

Query: 242 ---RAAATQKSPDLVRLFHSLRKKEGKRDPPL------LGKPAAINAHNSIVGEIQNRSA 301
                A     P       S+   + +R P +      L +  + N+     G   N +A
Sbjct: 330 PSVSKAPPPPPPPPPPKSLSIASAKVRRVPEVVEFYHSLMRRDSTNSRRDSTGG-GNAAA 389

Query: 302 HLLAIKAD---------------------IETKGEFINGLIDKVLVAAHTDIEDILKFVD 361
             +   ++                     +ET+G+FI  LI +V  AA +DIED++ FV 
Sbjct: 390 EAILANSNARDMIGEIENRSVYLLAIKTDVETQGDFIRFLIKEVGNAAFSDIEDVVPFVK 449

Query: 362 WLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCEAA 421
           WLD +LS L DERAVLKHF+WPE+KADA+REAA  Y  LK L +E S +++D      +A
Sbjct: 450 WLDDELSYLVDERAVLKHFEWPEQKADALREAAFCYFDLKKLISEASRFREDPRQSSSSA 509

Query: 422 LKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKMYM 481
           LKKM +L +K E G+  L  +R +    +++ ++P +WML++GI S+IK AS+ LA  YM
Sbjct: 510 LKKMQALFEKLEHGVYSLSRMRESAATKFKSFQIPVDWMLETGITSQIKLASVKLAMKYM 569

Query: 482 KRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIK 499
           KRV  EL+++     E   E L++QG+ FA+R HQFAGG D+ET+ AFEE++
Sbjct: 570 KRVSAELEAIEGGGPE--EEELIVQGVRFAFRVHQFAGGFDAETMKAFEELR 616

BLAST of Cucsa.252140 vs. TAIR10
Match: AT1G07120.1 (AT1G07120.1 FUNCTIONS IN: molecular_function unknown)

HSP 1 Score: 243.0 bits (619), Expect = 4.0e-64
Identity = 141/381 (37.01%), Positives = 221/381 (58.01%), Query Frame = 1

Query: 129 SQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASV-SNNDKRKSVSEESQRSAE 188
           S L+ L +EL      N +L+ +N  L +++A   A+ +++ S+ ++RKS+  +  +S+ 
Sbjct: 9   SDLLRLVKELQAYLVRNDKLEKENHELRQEVARLRAQVSNLKSHENERKSMLWKKLQSSY 68

Query: 189 DNQKLENGKLETQPSSSCRNVRDLDCKTPPPR-------APPPPPPPPPLPVQSMPRAAA 248
           D    +   L+  P S   N +  + + P P+           PPPPPPLP +      +
Sbjct: 69  DGSNTDGSNLKA-PESVKSNTKGQEVRNPNPKPTIQGQSTATKPPPPPPLPSKRTLGKRS 128

Query: 249 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHN-SIVGEIQNRSAHLLAIKADIETK 308
            +++P++V  + +L K+E      +        A N +++GEI+NRS +L  IK+D +  
Sbjct: 129 VRRAPEVVEFYRALTKRESHMGNKINQNGVLSPAFNRNMIGEIENRSKYLSDIKSDTDRH 188

Query: 309 GEFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHF-KWPEKKADAMREA 368
            + I+ LI KV  A  TDI ++  FV W+D +LSSL DERAVLKHF KWPE+K D++REA
Sbjct: 189 RDHIHILISKVEAATFTDISEVETFVKWIDEELSSLVDERAVLKHFPKWPERKVDSLREA 248

Query: 369 AIEYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNL 428
           A  Y+  K L NEI  +KD+       AL+++ SL D+ E  +     +R +    Y++ 
Sbjct: 249 ACNYKRPKNLGNEILSFKDNPKDSLTQALQRIQSLQDRLEESVNNTEKMRDSTGKRYKDF 308

Query: 429 KLPTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYR 488
           ++P  WMLD+G++ ++K +S+ LA+ YMKR+  EL+S   S KE N   L+LQG+ FAY 
Sbjct: 309 QIPWEWMLDTGLIGQLKYSSLRLAQEYMKRIAKELES-NGSGKEGN---LMLQGVRFAYT 368

Query: 489 THQFAGGLDSETLCAFEEIKQ 500
            HQFAGG D ETL  F E+K+
Sbjct: 369 IHQFAGGFDGETLSIFHELKK 384

BLAST of Cucsa.252140 vs. NCBI nr
Match: gi|449459796|ref|XP_004147632.1| (PREDICTED: protein CHUP1, chloroplastic [Cucumis sativus])

HSP 1 Score: 1021.1 bits (2639), Expect = 6.6e-295
Identity = 520/521 (99.81%), Positives = 521/521 (100.00%), Query Frame = 1

Query: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLT 60
           MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSS+VSRSLT
Sbjct: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSKVSRSLT 60

Query: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120
           PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA
Sbjct: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120

Query: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180
           EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE
Sbjct: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180

Query: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240
           ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA
Sbjct: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240

Query: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300
           TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG
Sbjct: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300

Query: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360
           EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI
Sbjct: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360

Query: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420
           EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL
Sbjct: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420

Query: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480
           PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH
Sbjct: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480

Query: 481 QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS 522
           QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS
Sbjct: 481 QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS 521

BLAST of Cucsa.252140 vs. NCBI nr
Match: gi|659077044|ref|XP_008439002.1| (PREDICTED: protein CHUP1, chloroplastic [Cucumis melo])

HSP 1 Score: 978.8 bits (2529), Expect = 3.8e-282
Identity = 504/521 (96.74%), Positives = 507/521 (97.31%), Query Frame = 1

Query: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLT 60
           MEQKGKS AVKNSTTMSSRGGRVSLKAMESPKRVVSVS VESTPQSGVKKQSSRVSRSLT
Sbjct: 1   MEQKGKSTAVKNSTTMSSRGGRVSLKAMESPKRVVSVSVVESTPQSGVKKQSSRVSRSLT 60

Query: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120
           PN PKKGRDGENVGVSARTVNRGGLKQV HRRSLS AGSCVNVEDCNGVKSGLQEKL FA
Sbjct: 61  PNAPKKGRDGENVGVSARTVNRGGLKQVSHRRSLSVAGSCVNVEDCNGVKSGLQEKLYFA 120

Query: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180
           EDLIKDLQSQLVELKEEL KSQSLN ELQSQNDLLVRDLAAAEAKFAS SNNDKRKSVSE
Sbjct: 121 EDLIKDLQSQLVELKEELRKSQSLNLELQSQNDLLVRDLAAAEAKFASASNNDKRKSVSE 180

Query: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240
           ESQR  EDNQKLENGKLETQPSSSCRNVRDLDCK PPPRA PP PPPPPLPVQSMPRAAA
Sbjct: 181 ESQRRTEDNQKLENGKLETQPSSSCRNVRDLDCKAPPPRAAPP-PPPPPLPVQSMPRAAA 240

Query: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300
           TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG
Sbjct: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300

Query: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360
           EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI
Sbjct: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360

Query: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420
           EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQ+LKL
Sbjct: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQDLKL 420

Query: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480
           PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH
Sbjct: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480

Query: 481 QFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGLIVGIQSS 522
           QFAGGLDSETLCAFEEIKQWVPR+M+GRSHAQGLIVGIQSS
Sbjct: 481 QFAGGLDSETLCAFEEIKQWVPRQMLGRSHAQGLIVGIQSS 520

BLAST of Cucsa.252140 vs. NCBI nr
Match: gi|307136204|gb|ADN34042.1| (hydroxyproline-rich glycoprotein family protein [Cucumis melo subsp. melo])

HSP 1 Score: 901.7 bits (2329), Expect = 5.8e-259
Identity = 466/481 (96.88%), Positives = 467/481 (97.09%), Query Frame = 1

Query: 1   MEQKGKSNAVKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLT 60
           MEQKGKS AVKNSTTMSSRGGRVSLKAMESPKRVVSVS VESTPQSGVKKQSSRVSRSLT
Sbjct: 1   MEQKGKSTAVKNSTTMSSRGGRVSLKAMESPKRVVSVSVVESTPQSGVKKQSSRVSRSLT 60

Query: 61  PNGPKKGRDGENVGVSARTVNRGGLKQVLHRRSLSGAGSCVNVEDCNGVKSGLQEKLCFA 120
           PN PKKGRDGENVGVSARTVNRGGLKQV HRRSLS AGSCVNVEDCNGVKSGLQEKL FA
Sbjct: 61  PNAPKKGRDGENVGVSARTVNRGGLKQVSHRRSLSVAGSCVNVEDCNGVKSGLQEKLYFA 120

Query: 121 EDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNNDKRKSVSE 180
           EDLIKDLQSQLVELKEEL KSQSLN ELQSQNDLLVRDLAAAEAKFAS SNNDKRKSVSE
Sbjct: 121 EDLIKDLQSQLVELKEELRKSQSLNLELQSQNDLLVRDLAAAEAKFASASNNDKRKSVSE 180

Query: 181 ESQRSAEDNQKLENGKLETQPSSSCRNVRDLDCKTPPPRAPPPPPPPPPLPVQSMPRAAA 240
           ESQR  EDNQKLENGKLETQPSSSCRNVRDLDCK PPPRA PP PPPPPLPVQSMPRAAA
Sbjct: 181 ESQRRTEDNQKLENGKLETQPSSSCRNVRDLDCKAPPPRAAPP-PPPPPLPVQSMPRAAA 240

Query: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300
           TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG
Sbjct: 241 TQKSPDLVRLFHSLRKKEGKRDPPLLGKPAAINAHNSIVGEIQNRSAHLLAIKADIETKG 300

Query: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360
           EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI
Sbjct: 301 EFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSSLADERAVLKHFKWPEKKADAMREAAI 360

Query: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKL 420
           EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQ+LKL
Sbjct: 361 EYRALKLLENEISFYKDDTNSPCEAALKKMASLLDKSERGIQRLITLRSTVMHSYQDLKL 420

Query: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480
           PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH
Sbjct: 421 PTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTH 480

Query: 481 Q 482
           Q
Sbjct: 481 Q 480

BLAST of Cucsa.252140 vs. NCBI nr
Match: gi|694359364|ref|XP_009359807.1| (PREDICTED: protein CHUP1, chloroplastic-like [Pyrus x bretschneideri])

HSP 1 Score: 518.5 bits (1334), Expect = 1.4e-143
Identity = 305/547 (55.76%), Positives = 365/547 (66.73%), Query Frame = 1

Query: 10  VKNSTTMSSRGGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRS--LTPNGPKKG 69
           + N TT S      S +A  SP       A   +P   V  +SSR +R   L  N PK G
Sbjct: 1   MSNQTTKSYLRASASSRAKGSPTAARPARAKSVSPD--VNSESSRSTRRSLLLSNKPKSG 60

Query: 70  ---------RDGENVGVSARTVNRGGLKQVLHRRSLSGAGSC-----VNVEDCNGVKSGL 129
                    ++ E + V  RT NR   +Q    R L    S      +N +D +G    L
Sbjct: 61  ELVLGSQKSKELEEIKVVGRTRNRQAAEQFSRPRRLRAVESNSKRNGLNEDDPHGKNREL 120

Query: 130 QEKLCFAEDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNND 189
           QEKL  +E LI  L++++  LK EL K+Q  N ELQSQN+ L R+LAAAEAK ++ ++ +
Sbjct: 121 QEKLEMSESLIAGLRAEVAALKAELDKTQGFNMELQSQNESLSRNLAAAEAKISASASPE 180

Query: 190 KRKSVSEESQRSAEDNQKLENGKLETQP-SSSCRNVRDLDCKTPPPRAP----------- 249
           +R++  E      +D QKL   KLE         N      K PPP  P           
Sbjct: 181 QRETNGEYQSPKFKDLQKLIANKLERSVVQKDAVNETSPPVKAPPPPPPKSAIPRVSATQ 240

Query: 250 --PPPPPPPPLPVQSMPRAAA--TQKSPDLVRLFHSLRKKEGKRDPPLLG---KPAAINA 309
             PPPPPPPP P+QS  RA    TQK+P LV  +HSLRK+E KRD P      KPAA +A
Sbjct: 241 CGPPPPPPPPPPLQSSVRATGVTTQKAPALVEFYHSLRKQEVKRDSPESRNHLKPAATSA 300

Query: 310 HNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILKFVDWLDSQLSS 369
           HNSIVGEIQNRSAHLLAIKAD++TKGEFIN LI KVL AA+TDIED+LKFVDWLD +LSS
Sbjct: 301 HNSIVGEIQNRSAHLLAIKADVQTKGEFINDLIQKVLAAAYTDIEDVLKFVDWLDVELSS 360

Query: 370 LADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCEAALKKMASLL 429
           LADERAVLKHFKWPE+KADAMREAAIEYR LK LE+EIS YKDDT+ PC AALKKMA LL
Sbjct: 361 LADERAVLKHFKWPERKADAMREAAIEYRDLKRLESEISCYKDDTDIPCAAALKKMAGLL 420

Query: 430 DKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKMYMKRVKTELD 489
           DKSER IQRLI LR++VM SYQ LK+PT+WMLDSGI+SKIK+ASMNLA +YMKRV  EL+
Sbjct: 421 DKSERSIQRLIKLRNSVMRSYQELKIPTDWMLDSGIVSKIKRASMNLANIYMKRVTLELE 480

Query: 490 SVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRRMVGRSHAQGL 522
           S+R+SD+ES+ ESLLLQG+HFAYR HQFAGGLDSETLCAFEEI+Q VP  + G   ++ L
Sbjct: 481 SIRNSDRESSQESLLLQGVHFAYRAHQFAGGLDSETLCAFEEIRQRVPGHLGG---SREL 540

BLAST of Cucsa.252140 vs. NCBI nr
Match: gi|596170181|ref|XP_007223070.1| (hypothetical protein PRUPE_ppa003741mg [Prunus persica])

HSP 1 Score: 498.8 bits (1283), Expect = 1.1e-137
Identity = 297/556 (53.42%), Positives = 358/556 (64.39%), Query Frame = 1

Query: 6   KSNAVKNSTTMSSR-GGRVSLKAMESPKRVVSVSAVESTPQSGVKKQSSRVSRSLTPNGP 65
           K     +ST   S+  G +S     S  R  + S  + +P     +  S + RSL  N P
Sbjct: 2   KQGTPPSSTKSESKVSGNMSQPTPPSYLRASASSKAKESPSPRPSRAKS-IRRSLLLNKP 61

Query: 66  KKG---------RDGENVGVSARTVNRGGLKQVLHRRSLSGA--GSCVNVEDCNGVKSGL 125
           K G         ++ E      R  NR   +Q    R    A   S  N ED +     L
Sbjct: 62  KSGELVLGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNREL 121

Query: 126 QEKLCFAEDLIKDLQSQLVELKEELHKSQSLNFELQSQNDLLVRDLAAAEAKFASVSNND 185
           QE+L  +E L  + Q++++ LK EL K+Q LN ELQSQN  L   LAAAEAK A+ +  +
Sbjct: 122 QERLDMSESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTRE 181

Query: 186 KRKSVSEESQRSAEDNQKLENGKLETQ-------------------PSSSCRNVRDLDCK 245
           +R++  E      +D QKL   KLE                     P+ +   V      
Sbjct: 182 QRETNGEYQSPKFKDLQKLIANKLERPVVKKEAVKEKSANKTPAPAPTGAIPRVAATQSG 241

Query: 246 TPPPRAPPP------PPPPPPLPVQSMPRAAATQKSPDLVRLFHSLRKKEGKRDPPLLG- 305
            PPP  PPP      PPPPPP P       +ATQK+P LV  FHSLRK+E KRD P    
Sbjct: 242 PPPPPPPPPSVRSPTPPPPPPQP-SVRTTTSATQKAPSLVEFFHSLRKQEVKRDSPESRN 301

Query: 306 --KPAAINAHNSIVGEIQNRSAHLLAIKADIETKGEFINGLIDKVLVAAHTDIEDILKFV 365
             KP+AI+AHNSIVGEIQNRSAHLLAIKAD++TKGEFIN LI KVLVAA+TDIED+LKFV
Sbjct: 302 HHKPSAISAHNSIVGEIQNRSAHLLAIKADVQTKGEFINDLIQKVLVAAYTDIEDVLKFV 361

Query: 366 DWLDSQLSSLADERAVLKHFKWPEKKADAMREAAIEYRALKLLENEISFYKDDTNSPCEA 425
           DWLD +LSSLADERAVLKHFKWPE+KADAMREAAIEYR LKLL++EIS YKDDT+ PC A
Sbjct: 362 DWLDGELSSLADERAVLKHFKWPERKADAMREAAIEYRDLKLLQSEISSYKDDTDIPCAA 421

Query: 426 ALKKMASLLDKSERGIQRLITLRSTVMHSYQNLKLPTNWMLDSGIMSKIKQASMNLAKMY 485
           ALKKMA LLDKSER IQRLI LR++VM SYQ LK+P +WMLDSGI+SKIK+ASMNLA +Y
Sbjct: 422 ALKKMAGLLDKSERSIQRLIKLRNSVMRSYQELKIPIDWMLDSGIVSKIKKASMNLANVY 481

Query: 486 MKRVKTELDSVRSSDKESNHESLLLQGIHFAYRTHQFAGGLDSETLCAFEEIKQWVPRRM 522
           MKRV  EL+S+R+SD+E++ ESLLLQG+HF YR HQFAGGLDSETLCAFEEI+Q VP  +
Sbjct: 482 MKRVTMELESIRNSDRETSQESLLLQGVHFVYRAHQFAGGLDSETLCAFEEIRQRVPGHL 541

The following BLAST results are available for this feature:
Match Name        E-value   Identity  Description
CHUP1_ARATH       5.0e-77   49.83     Protein CHUP1, chloroplastic OS=Arabidopsis thaliana GN=CHUP1 PE=1 SV=1

Match Name        E-value   Identity  Description
A0A0A0L5G9_CUCSA  4.6e-295  99.81     Uncharacterized protein OS=Cucumis sativus GN=Csa_3G171180 PE=4 SV=1
E5GC44_CUCME      4.1e-259  96.88     Hydroxyproline-rich glycoprotein family protein OS=Cucumis melo subsp. melo PE=4...
M5XEB8_PRUPE      7.9e-138  53.42     Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa003741mg PE=4 SV=1
A0A067FXY0_CITSI  6.7e-137  52.30     Uncharacterized protein OS=Citrus sinensis GN=CISIN_1g008574mg PE=4 SV=1
V4TZL4_9ROSI      1.1e-136  52.13     Uncharacterized protein OS=Citrus clementina GN=CICLE_v10004653mg PE=4 SV=1

Match Name        E-value   Identity  Description
AT1G48280.1       3.1e-101  60.38     hydroxyproline-rich glycoprotein family protein
AT3G25690.1       2.8e-78   49.83     Hydroxyproline-rich glycoprotein family protein
AT4G18570.1       4.8e-70   47.62     Tetratricopeptide repeat (TPR)-like superfamily protein
AT1G07120.1       4.0e-64   37.01     FUNCTIONS IN: molecular_function unknown

Match Name                        E-value   Identity  Description
gi|449459796|ref|XP_004147632.1|  6.6e-295  99.81     PREDICTED: protein CHUP1, chloroplastic [Cucumis sativus]
gi|659077044|ref|XP_008439002.1|  3.8e-282  96.74     PREDICTED: protein CHUP1, chloroplastic [Cucumis melo]
gi|307136204|gb|ADN34042.1|       5.8e-259  96.88     hydroxyproline-rich glycoprotein family protein [Cucumis melo subsp. melo]
gi|694359364|ref|XP_009359807.1|  1.4e-143  55.76     PREDICTED: protein CHUP1, chloroplastic-like [Pyrus x bretschneideri]
gi|596170181|ref|XP_007223070.1|  1.1e-137  53.42     hypothetical protein PRUPE_ppa003741mg [Prunus persica]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Type
Cucsa.252140.1  Cucsa.252140.1  mRNA
Cucsa.252140.2  Cucsa.252140.2  mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR Term  IPR Description   Source   Source Term     Source Description  Alignment
None      No IPR available  unknown  Coil            Coil                coord: 124..151, scor
None      No IPR available  PANTHER  PTHR31342       FAMILY NOT NAMED    coord: 221..519, score: 8.3E
None      No IPR available  PANTHER  PTHR31342:SF12  F11A17.16           coord: 221..519, score: 8.3E
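
The "coord: start..end" entries in the InterPro table above give the residue range covered by each match. A minimal sketch for reading such an entry, assuming the coordinates are 1-based and inclusive (the page does not state this), is shown below. The helper name is hypothetical.

```python
import re

# Matches entries such as "coord: 221..519".
COORD_RE = re.compile(r"coord: (\d+)\.\.(\d+)")


def coord_span(text):
    """Return (start, end, length) for the first coord entry, or None.

    Length assumes 1-based inclusive coordinates: end - start + 1.
    """
    m = COORD_RE.search(text)
    if m is None:
        return None
    start, end = int(m.group(1)), int(m.group(2))
    return start, end, end - start + 1


if __name__ == "__main__":
    # Coordinate entries copied from the InterPro rows above.
    for entry in ["coord: 124..151", "coord: 221..519"]:
        print(entry, "->", coord_span(entry))
```

Under that assumption the predicted coiled-coil segment spans 28 residues and the PANTHER matches span 299 residues.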