CsGy4G012900 (gene) Cucumber (Gy14) v2.1

Overview
Name: CsGy4G012900
Type: gene
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: PAP_fibrillin domain-containing protein
Location: Gy14Chr4: 18068385 .. 18072493 (+)
RNA-Seq Expression: CsGy4G012900
Synteny: CsGy4G012900
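The Location field gives a chromosome, a start and end coordinate, and a strand. As a minimal sketch, the snippet below parses that string and computes the span of the gene. The function names `parse_location` and `span_length` are illustrative, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re

def parse_location(text):
    """Split a 'Chrom: start .. end (strand)' string into its parts."""
    m = re.fullmatch(r"\s*(\S+):\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    chrom, start, end, strand = m.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

chrom, start, end, strand = parse_location("Gy14Chr4: 18068385 .. 18072493 (+)")
print(chrom, strand, span_length(start, end))  # Gy14Chr4 + 4109
```

Under that assumption the gene spans 4109 bp on the forward strand of Gy14Chr4.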
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

GCTCATCTCATATTGAATTTGAACTATGGTCTTCTCTCAATTTATGTTAATCAGCCCTACTTAGAAGAAGATATAGCAGTTCTTAATTCTTATCGTACCTTGTTGGTTCACGTGTGTGAATCTCATTCAACCACCTCAGATTTCTCCCTTCACAAATGCAAATCAATAAACCCTTGCTTACACTGCCACCACTACTAAAGATAACAAGCAATCTGGGCGCGTACCTCACCTCAGCTTGCTCTGTTTTTGCCTTTCCCATGCCCCCCTTAATGGCTCTTAAAGCTAATGCTATGACGCCCACTTATCTTCACCGTTCATTCTTCACTCCTCAGACTCCATCTCTCCCTCCCATCAAACGGTACCATCGCCTTCACACACATATTCGTCTCCGTTGTCGATCTTCACTTGTTGATGAGCAGCAGAAAGAAGTTGTCTCGTTTTCTCAACCGGAAAATTCTCTCATTGATGCTCTCATTGGTGTCCAAGGTCGAGGCCGCTCTGTTTCATCTCAGCAGCTCAGCGTAATTCTTCTTTTTTTTCGCCTTCTTTTCGTTCTTTTCTGCTCTGTTTATTAAACTTCTTTTAGTGTATTGGTTTCTTCAGAATGTCGAACGGGCTGTCAGTGTTCTCGAAGGGTTGGAAGGCGTGCGAGATCCGGTAATCCGGTTGGCTTTCCCTATCGATTTTTAATAGTTTTGCTACGGTGATAAACTGTTCGAAATATTGCCCGTTAAGTTTTTTATTTGTTTATATCATCTCGTTCGAGCTTTGGAATAGAAAGGAATTGGCTTTCATGGTGGTTGTAGAAGTTTATCTATGATACTCGAAACCCCATATTTCCAGGCGTTCTTAATTACTCTTTGGTTACTCCTTTTGTTGGTTCAGTTTTACAAGTTTGTTAATGTTTCAGTAGTCTTCTGAAGAAATTAGTATGCTTCAAAGTTTTCTTTTATTACTCAATGTTAAAGCAGCAACTTGTTACATTGTGTGGTAATCAGGAGACATCCTTATTTTATTTCTCTGTCCACTACATAGAGATTTAGAAAATTTCATCCTTGGTATTATATCTTCAACATGAATGAAAATGGTAACTGGTGTCTATTTTACAGGTTTTAAAGCTTTGTCTTCGCTAATCCTGACTTCTCTCTTTCATTGCCTGCAGACAAACTCTAGCCTAATTGAAGGACGTTGGCAGTTGGTATTCACAACTAGACCTGGAACAGCGTCCATAATACAGGTTCGATTTCAATTCTTGATACGTGTCCAACAATGGCAATTCCCTTTTCTTATATTGTGATCCTGTACCTAAACACCATTATAAAATCATAGTTCAGAAGTCAGATTCAAGATTTCTTTCTAGTATTTTGATTAAAATTAGGCGAAAGGTGACTTCAAAACAAATCGATTAAAATAACAAAAATGACTATTTCATGAGCCGTAGACAACGAAAAAGATCATGATTATCTGATCTCGTTGCTGAAGAAATTAGCATAGAACAGAAACTGGGACTTGGAAGAAAGAATACACTCTTCAATCTCCACTCGTTCCCTCGCCTCGGATTTGGGACGCACTCCCTATCCATTAGCAAAAGATTCTTAGCCCAGGGAAATAAATATTTTTAGACCCTTGATGAATATAAAACATCTGAATTGTTTTTTTGAGATCTATCCCTGTCGTGGAGAAATTCTAGAATTCTTTGAAGTCCAACGAAACAATATCTTTGAAAATATCTATTTATAAATTTTCTAAGATAATTATAGAAAGGGCTAGTATATATTAGTCATTTAGTTAGTGTTGGTTTTTTTTTAACAGAGGATTTTGAAAATGTTTTATTGCTTCATCTCAGTAACTAATATATAATGAGTTTGTTATATAATAATCTGAAGCATTGATAATGACAAGAAATGGTAAAATCTTGAAACTCTTATCCTTCAGAGAACATTTGTTGGAGTTGATTTCTTCAGTGTGTTTCAAGAGATATTTCTACGAACAAACGACCC
ACGAGTCTCCAATATTGTTAAATTCTCTGATGCAATCGGTGAGCTGAAAGTAGAGGTATGTTTGCATATCTTTTAATAACCGGTTAACAATCTCGGATCCTTGCTGTCTAATATTATTCTCCCCTTATAGGCGGCTGCATCAGTCAAAGATGGCAAACGCATCCTTTTCCAATTTGACAGAGCAGCATTTTCTTTTAAGTTCTTGCCTTTTAAGGTCCCATATCCTGTACCGTTCAAACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCCTCAGGGAACCTCCGCATCTCAAGAGGAAACAAGGTAAATTACTTCACTTTCTTCTGTTACTCAAACATATCTTACATACTTAAAACTTTTCGAGTTGATTTGGTAATTGCTAAGGACGTACAGAACCATTGTAATAGTGTAGGGCACCACGTTTGTGTTGCAAAAGCAAACAGAAGCAAGGCAAAAATTATTGCTAGCAATTTCTACAGATAAAGGAGTTGAAGAGGTAAACCTACTTTCCACACACTCATCATAAAGAAAATGTTCGACCTCATTATATTCTCCTGCGGCTGATTTCTTGAGCGCTTGAATACAGGCAATTGATAAGCTCATCTCTGAAAATCAGAATGAAAATAAATTTGAAGAGGAACTTCTCGAGGGAGGATGGAATATGTTATGGAGTTCACAGGTAGTCTATGGATTCATATTTTCGTATGCTGAACAAAATAAAATAGATTACCAAAATATCAGTGGTAAAATGTGAATAGGCTGTATAAGCATGTGAATAAAGTATAAGACGTAATCGGTTAAGGATATGTGACATGAGATGTCAAGTAGTAGTAGTTCTAGTTAGACATAAAGAGAAATGTCAGCAGATTACATGGGAAGCTGTGATTATTCCAGCTAAATACCTCTCTCAAAAAACAATCTTTTATACGATGTCATCTGGCTTGATTTACTTGTTTTTCCAACTTCTTGATAGATACATATCCATCTCAAAAAGCAATCTTTTATATGATTCCATATGACCGCCCTTGACATTTATCTGGTCCGCTGCAACAGATGTCTAGAGTAGCTTGATGAATCGTATAAAAGATTTCTTTTCGGGAGTAGAAAGAGCAAGGGAAAGATTGCTTCTATTTCATGTACTTATCAATTACTGTAAACTGCAGATGGAAACAGATAGTTGGATAGAGAATGCTGCAAATGGTCTCATGGGGATGCAGGTAGAGTGAATAGGATACCAGCTGTACTAAACTCCAATAATGTATAAACCTTCATTTCTAGACTAAAAGAATCATAATCATTCACTAGTCACTTGCATAAAGTACACTCTTACCCTACCCGACAGGTTATCAAGAACGGGCAAATGAAGTTTGGAGTGGATATGTTGCTAGGACTGAGATTCTCCATGATTGGTACCCTTGTGTAAGTTTCCCAATCATATAGTTCATTTTATTGGTTAGCAAAGCTATGAGAAAAAGGTTTGGAAGAATTGAATCCAACAGGCAAATACACAACACGATACGTTCTCATTTGTTATTGGGTCCTCGATTTGTCATTAACATAAGTTTTCTCTGCGGTAGGAAATCTGGAGACAACGCATACGATGTAACCATGGATGATGCTGCAATTATTGGAGGCCCCTTTGGATATCCTTTGGGAATGGAGAGCCGGTTCAAGCTCCAACTTCTGTGAGATATCTCCATATTTGGTTGTTGTTTCATTTGGCTACTACATAGCTATTTAACTTGCCGGGTTGTTTTCAGATACAATGATGGGAAGATTAGGATCACACGAGGATACAATAACATCTTGTTTGTGCATGTACGGGTGGCTGAATCAAAGCAGGTGTAAGAAGACTTGTCAAAAGAAGGAAGCCCTTGAGGTTAGGAACCCGTGTATAGTAGTAGTAAGACATGCCCTCGGCGTCTTTACAATGTATATTCGTCAATTGAAGTTGTAAAGATCTCGAACGTCAAAATTTCAC
CACCATCCTTCCAATGAAGAATTTCAATCTCAAGTTTTATCAATTAGTTAGACGGGAGTATCTTTCATTTGTGAATCTCAAACCAAGGTGTTCTTTTTTTTTTTTCCAC

mRNA sequence

GCTCATCTCATATTGAATTTGAACTATGGTCTTCTCTCAATTTATGTTAATCAGCCCTACTTAGAAGAAGATATAGCAGTTCTTAATTCTTATCGTACCTTGTTGGTTCACGTGTGTGAATCTCATTCAACCACCTCAGATTTCTCCCTTCACAAATGCAAATCAATAAACCCTTGCTTACACTGCCACCACTACTAAAGATAACAAGCAATCTGGGCGCGTACCTCACCTCAGCTTGCTCTGTTTTTGCCTTTCCCATGCCCCCCTTAATGGCTCTTAAAGCTAATGCTATGACGCCCACTTATCTTCACCGTTCATTCTTCACTCCTCAGACTCCATCTCTCCCTCCCATCAAACGGTACCATCGCCTTCACACACATATTCGTCTCCGTTGTCGATCTTCACTTGTTGATGAGCAGCAGAAAGAAGTTGTCTCGTTTTCTCAACCGGAAAATTCTCTCATTGATGCTCTCATTGGTGTCCAAGGTCGAGGCCGCTCTGTTTCATCTCAGCAGCTCAGCAATGTCGAACGGGCTGTCAGTGTTCTCGAAGGGTTGGAAGGCGTGCGAGATCCGACAAACTCTAGCCTAATTGAAGGACGTTGGCAGTTGGTATTCACAACTAGACCTGGAACAGCGTCCATAATACAGAGAACATTTGTTGGAGTTGATTTCTTCAGTGTGTTTCAAGAGATATTTCTACGAACAAACGACCCACGAGTCTCCAATATTGTTAAATTCTCTGATGCAATCGGTGAGCTGAAAGTAGAGGCGGCTGCATCAGTCAAAGATGGCAAACGCATCCTTTTCCAATTTGACAGAGCAGCATTTTCTTTTAAGTTCTTGCCTTTTAAGGTCCCATATCCTGTACCGTTCAAACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCCTCAGGGAACCTCCGCATCTCAAGAGGAAACAAGGGCACCACGTTTGTGTTGCAAAAGCAAACAGAAGCAAGGCAAAAATTATTGCTAGCAATTTCTACAGATAAAGGAGTTGAAGAGGCAATTGATAAGCTCATCTCTGAAAATCAGAATGAAAATAAATTTGAAGAGGAACTTCTCGAGGGAGGATGGAATATGTTATGGAGTTCACAGATGGAAACAGATAGTTGGATAGAGAATGCTGCAAATGGTCTCATGGGGATGCAGGTTATCAAGAACGGGCAAATGAAGTTTGGAGTGGATATGTTGCTAGGACTGAGATTCTCCATGATTGGTACCCTTGTGAAATCTGGAGACAACGCATACGATGTAACCATGGATGATGCTGCAATTATTGGAGGCCCCTTTGGATATCCTTTGGGAATGGAGAGCCGGTTCAAGCTCCAACTTCTATACAATGATGGGAAGATTAGGATCACACGAGGATACAATAACATCTTGTTTGTGCATGTACGGGTGGCTGAATCAAAGCAGGTGTAAGAAGACTTGTCAAAAGAAGGAAGCCCTTGAGGTTAGGAACCCGTGTATAGTAGTAGTAAGACATGCCCTCGGCGTCTTTACAATGTATATTCGTCAATTGAAGTTGTAAAGATCTCGAACGTCAAAATTTCACCACCATCCTTCCAATGAAGAATTTCAATCTCAAGTTTTATCAATTAGTTAGACGGGAGTATCTTTCATTTGTGAATCTCAAACCAAGGTGTTCTTTTTTTTTTTTCCAC

Coding sequence (CDS)

ATGCAAATCAATAAACCCTTGCTTACACTGCCACCACTACTAAAGATAACAAGCAATCTGGGCGCGTACCTCACCTCAGCTTGCTCTGTTTTTGCCTTTCCCATGCCCCCCTTAATGGCTCTTAAAGCTAATGCTATGACGCCCACTTATCTTCACCGTTCATTCTTCACTCCTCAGACTCCATCTCTCCCTCCCATCAAACGGTACCATCGCCTTCACACACATATTCGTCTCCGTTGTCGATCTTCACTTGTTGATGAGCAGCAGAAAGAAGTTGTCTCGTTTTCTCAACCGGAAAATTCTCTCATTGATGCTCTCATTGGTGTCCAAGGTCGAGGCCGCTCTGTTTCATCTCAGCAGCTCAGCAATGTCGAACGGGCTGTCAGTGTTCTCGAAGGGTTGGAAGGCGTGCGAGATCCGACAAACTCTAGCCTAATTGAAGGACGTTGGCAGTTGGTATTCACAACTAGACCTGGAACAGCGTCCATAATACAGAGAACATTTGTTGGAGTTGATTTCTTCAGTGTGTTTCAAGAGATATTTCTACGAACAAACGACCCACGAGTCTCCAATATTGTTAAATTCTCTGATGCAATCGGTGAGCTGAAAGTAGAGGCGGCTGCATCAGTCAAAGATGGCAAACGCATCCTTTTCCAATTTGACAGAGCAGCATTTTCTTTTAAGTTCTTGCCTTTTAAGGTCCCATATCCTGTACCGTTCAAACTTCTTGGAGATGAAGCAAAGGGCTGGTTGGACACCACATATTTATCTCCCTCAGGGAACCTCCGCATCTCAAGAGGAAACAAGGGCACCACGTTTGTGTTGCAAAAGCAAACAGAAGCAAGGCAAAAATTATTGCTAGCAATTTCTACAGATAAAGGAGTTGAAGAGGCAATTGATAAGCTCATCTCTGAAAATCAGAATGAAAATAAATTTGAAGAGGAACTTCTCGAGGGAGGATGGAATATGTTATGGAGTTCACAGATGGAAACAGATAGTTGGATAGAGAATGCTGCAAATGGTCTCATGGGGATGCAGGTTATCAAGAACGGGCAAATGAAGTTTGGAGTGGATATGTTGCTAGGACTGAGATTCTCCATGATTGGTACCCTTGTGAAATCTGGAGACAACGCATACGATGTAACCATGGATGATGCTGCAATTATTGGAGGCCCCTTTGGATATCCTTTGGGAATGGAGAGCCGGTTCAAGCTCCAACTTCTATACAATGATGGGAAGATTAGGATCACACGAGGATACAATAACATCTTGTTTGTGCATGTACGGGTGGCTGAATCAAAGCAGGTGTAA

Protein sequence

MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV*
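The protein sequence above appears to be the conceptual translation of the CDS, with `*` marking the stop codon. The sketch below shows how such a translation can be reproduced using the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative, and the choice of the standard table (rather than another translation table) is an assumption.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First six codons and last four codons of the CDS listed above.
print(translate("ATGCAAATCAATAAACCC"))  # MQINKP
print(translate("AAGCAGGTGTAA"))        # KQV*
```

Both fragments agree with the start (`MQINKP...`) and end (`...KQV*`) of the listed protein sequence.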
Homology
BLAST of CsGy4G012900 vs. ExPASy Swiss-Prot
Match: Q8LAP6 (Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PAP12 PE=1 SV=1)

HSP 1 Score: 486.1 bits (1250), Expect = 4.1e-136
Identity = 240/356 (67.42%), Positives = 292/356 (82.02%), Query Frame = 0

Query: 76  IRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLE 135
           +R+ C SS     Q +  SF+  E  LIDALIG+QGRG+S S +QL++VE AV VLEGLE
Sbjct: 52  LRISCSSSSTVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKVLEGLE 111

Query: 136 GVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLR-TNDPRVSNIVK 195
           G+++PT+S LIEGRW+L+FTTRPGTAS IQRTF GVD F+VFQ+++L+ TNDPRVSNIVK
Sbjct: 112 GIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRVSNIVK 171

Query: 196 FSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTT 255
           FSD IGELKVEA AS+KDGKR+LF+FDRAAF  KFLPFKVPYPVPF+LLGDEAKGWLDTT
Sbjct: 172 FSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKGWLDTT 231

Query: 256 YLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAIDKLISENQNENKFEE 315
           YLSPSGNLRISRGNKGTTFVLQK+T  RQKLL  IS DKGV EAID+ ++ N N  +   
Sbjct: 232 YLSPSGNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNSAEDNY 291

Query: 316 ELLEGGWNMLWSSQMETDSWIENAANGLMGMQVI-KNGQMKFGVDMLLGLRFSMIGTLVK 375
           ELLEG W M+WSSQM TDSWIENAANGLMG Q+I K+G++KF V+++   RFSM G  +K
Sbjct: 292 ELLEGSWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMKGKFIK 351

Query: 376 SGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGYNNILFVHVR 430
           S  + YD+ MDDAAIIGG FGYP+ + +  +L++LY D K+RI+RG++NI+FVH+R
Sbjct: 352 SESSTYDLKMDDAAIIGGAFGYPVDITNNIELKILYTDEKMRISRGFDNIIFVHIR 407
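The percentages in each HSP summary are simple ratios of matching columns to alignment length. As a rough sketch, the snippet below parses a `Label = a/b (p%)` field and recomputes the percentage for the first Swiss-Prot hit (Q8LAP6). The helper names `parse_fraction_fields` and `percent` are illustrative and not part of any BLAST tool.

```python
import re

FIELD = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\((\d+(?:\.\d+)?)%\)")

def parse_fraction_fields(line):
    """Return {label: (matches, length, reported_percent)} for each 'Label = a/b (p%)' field."""
    return {label: (int(a), int(b), float(p)) for label, a, b, p in FIELD.findall(line)}

def percent(matches, length):
    """Percentage of alignment columns that match, rounded to two decimals."""
    return round(100.0 * matches / length, 2)

fields = parse_fraction_fields("Identity = 240/356 (67.42%)")
matches, length, reported = fields["Identity"]
print(percent(matches, length), reported)  # 67.42 67.42
print(percent(292, 356))                   # 82.02
```

The same calculation reproduces the reported share of positive (identical or conservatively substituted) columns, 292/356.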

BLAST of CsGy4G012900 vs. ExPASy Swiss-Prot
Match: O49629 (Probable plastid-lipid-associated protein 2, chloroplastic OS=Arabidopsis thaliana OX=3702 GN=PAP2 PE=1 SV=1)

HSP 1 Score: 58.9 bits (141), Expect = 1.6e-07
Identity = 55/234 (23.50%), Positives = 99/234 (42.31%), Query Frame = 0

Query: 86  DEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNS-S 145
           +E  ++V    + + SL+D+L G   RG S SS+  + +   ++ LE       PT +  
Sbjct: 74  EEAIEDVEETERLKRSLVDSLYGTD-RGLSASSETRAEIGDLITQLESKNPTPAPTEALF 133

Query: 146 LIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIG--EL 205
           L+ G+W L +T+      ++ R  V +       +  + +++  V N V+F+  +G   +
Sbjct: 134 LLNGKWILAYTSFVNLFPLLSRGIVPLIKVDEISQT-IDSDNFTVQNSVRFAGPLGTNSI 193

Query: 206 KVEAAASVKDGKRILFQFDRAAFSFKFL--PFKVPY------------------------ 265
              A   ++  KR+  +F++       L    ++P                         
Sbjct: 194 STNAKFEIRSPKRVQIKFEQGVIGTPQLTDSIEIPEYVEVLGQKIDLNPIRGLLTSVQDT 253

Query: 266 ------------PVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 279
                       P+ F L  D A+ WL TTYL    ++RISRG+ G+ FVL K+
Sbjct: 254 ASSVARTISSQPPLKFSLPADNAQSWLLTTYLDK--DIRISRGDGGSVFVLIKE 303

BLAST of CsGy4G012900 vs. ExPASy Swiss-Prot
Match: Q96398 (Chromoplast-specific carotenoid-associated protein, chromoplastic OS=Cucumis sativus OX=3659 GN=CHRC PE=1 SV=1)

HSP 1 Score: 56.2 bits (134), Expect = 1.1e-06
Identity = 56/234 (23.93%), Positives = 93/234 (39.74%), Query Frame = 0

Query: 86  DEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNS-S 145
           +E+  E     + + +L+D+  G   RG  VS    + +   ++ LE       PT + +
Sbjct: 87  EEKPLEPSEIYKLKKALVDSFYGTD-RGLRVSRDTRAEIVELITQLESKNPTPAPTEALT 146

Query: 146 LIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSD--AIGEL 205
           L+ G+W L +TT  G   ++ R    V    + Q I   + +  V N V+FS   A   +
Sbjct: 147 LLNGKWILAYTTFAGLFPLLSRNLPLVKVEEISQTI--DSENLTVQNSVQFSGPLATTSI 206

Query: 206 KVEAAASVKDGKRILFQFDRAAF-------------SFKFLPFKVPY------------- 265
              A   V+   R+  +F+                 +  FL  K+ +             
Sbjct: 207 TTNAKFEVRSPLRVHIKFEEGVIGTPQLTDSIVIPDNVDFLGQKIDFTPFNGIISSLQDT 266

Query: 266 ------------PVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 279
                       P+ F +     + WL TTYL    +LRISRG+ G+ FVL K+
Sbjct: 267 ASNVAKTISSQPPIKFSISNTRVESWLLTTYLDE--DLRISRGDGGSVFVLLKE 315

BLAST of CsGy4G012900 vs. ExPASy Swiss-Prot
Match: P80471 (Light-induced protein, chloroplastic OS=Solanum tuberosum OX=4113 PE=1 SV=2)

HSP 1 Score: 55.5 bits (132), Expect = 1.8e-06
Identity = 61/237 (25.74%), Positives = 97/237 (40.93%), Query Frame = 0

Query: 86  DEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNS-S 145
           +E  KE       +  L D+L G   RG S SS+  + +   ++ LE       PT + +
Sbjct: 90  EEPPKEPSEIELLKKQLADSLYGT-NRGLSASSETRAEIVELITQLESKNPNPAPTEALT 149

Query: 146 LIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLRTNDPR---VSNIVKFSD--AI 205
           L+ G+W L +T+  G   ++ R  + +    V  E   +T D     V N V F+   A 
Sbjct: 150 LLNGKWILAYTSFSGLFPLLSRGNLPL----VRVEEISQTIDSESFTVQNSVVFAGPLAT 209

Query: 206 GELKVEAAASVKDGKRILFQFDRAAF-------------SFKFLPFKV---PY------- 265
             +   A   V+  KR+  +F+                 + +FL  K+   P+       
Sbjct: 210 TSISTNAKFEVRSPKRVQIKFEEGIIGTPQLTDSIVLPENVEFLGQKIDLSPFKGLITSV 269

Query: 266 ---------------PVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 279
                          P+ F +  + A+ WL TTYL     LRISRG+ G+ FVL K+
Sbjct: 270 QDTASSVAKSISSQPPIKFPITNNNAQSWLLTTYL--DDELRISRGDAGSVFVLIKE 319

BLAST of CsGy4G012900 vs. ExPASy Swiss-Prot
Match: Q94FZ9 (Plastid lipid-associated protein 1, chloroplastic OS=Brassica campestris OX=3711 GN=PAP1 PE=1 SV=1)

HSP 1 Score: 55.1 bits (131), Expect = 2.4e-06
Identity = 61/244 (25.00%), Positives = 99/244 (40.57%), Query Frame = 0

Query: 82  SSLVDEQQKEVVSFSQPENSLIDALIGV---QGRGRSVSSQQLSNVERAVSVLEGLEGVR 141
           SS+ ++  +E +  ++    L   L G      RG S SS+  + +   ++ LE      
Sbjct: 83  SSVAEKVAEEAIESAEETERLKRVLAGSLYGTDRGLSASSETRAEISELITQLESKNPNP 142

Query: 142 DPTNS-SLIEGRWQLVFTTRPGTASIIQR---TFVGVDFFSVFQEIFLRTNDPRVSNIVK 201
            P  +  L+ G+W LV+T+  G   ++ R     V VD  S      + ++   V N V+
Sbjct: 143 APNEALFLLNGKWILVYTSFVGLFPLLSRRISPLVKVDEISQ----TIDSDSFTVHNSVR 202

Query: 202 FSD--AIGELKVEAAASVKDGKRILFQFDRAAFSFKFL--PFKVPY-------------- 261
           F+   A   L   A   V+  KR+  +F++       L    ++P               
Sbjct: 203 FASPLATTSLSTNAKFEVRSPKRVQVKFEQGVIGTPQLTDSIEIPEFVEVLGQKIDLNPI 262

Query: 262 ----------------------PVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 279
                                 P+ F L GD A+ WL TTYL    +LRISRG+ G+ FV
Sbjct: 263 KGLLTSVQDTASSVARTISSQPPLKFSLPGDSAQSWLLTTYLDK--DLRISRGDGGSVFV 320

BLAST of CsGy4G012900 vs. NCBI nr
Match: XP_004142141.1 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucumis sativus] >KGN54108.1 hypothetical protein Csa_018101 [Cucumis sativus])

HSP 1 Score: 861 bits (2224), Expect = 5.29e-315
Identity = 436/436 (100.00%), Positives = 436/436 (100.00%), Query Frame = 0

Query: 1   MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQT 60
           MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQT
Sbjct: 1   MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQT 60

Query: 61  PSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQ 120
           PSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQ
Sbjct: 61  PSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQ 120

Query: 121 LSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEI 180
           LSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEI
Sbjct: 121 LSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEI 180

Query: 181 FLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPF 240
           FLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPF
Sbjct: 181 FLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPF 240

Query: 241 KLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAID 300
           KLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAID
Sbjct: 241 KLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAID 300

Query: 301 KLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDML 360
           KLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDML
Sbjct: 301 KLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDML 360

Query: 361 LGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGY 420
           LGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGY
Sbjct: 361 LGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGY 420

Query: 421 NNILFVHVRVAESKQV 436
           NNILFVHVRVAESKQV
Sbjct: 421 NNILFVHVRVAESKQV 436

BLAST of CsGy4G012900 vs. NCBI nr
Match: XP_008449765.1 (PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Cucumis melo] >KAA0041357.1 putative plastid-lipid-associated protein 12 [Cucumis melo var. makuwa] >TYK21659.1 putative plastid-lipid-associated protein 12 [Cucumis melo var. makuwa])

HSP 1 Score: 773 bits (1995), Expect = 1.10e-280
Identity = 390/398 (97.99%), Positives = 393/398 (98.74%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANAMTPTYLHRSFFTP TPSLPPIKRYHR HTHIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAMTPTYLHRSFFTPHTPSLPPIKRYHRHHTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEA ASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAVASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIG+LVKSGDN YDVTMDDAAIIGGPFGYPLG
Sbjct: 301 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGSLVKSGDNTYDVTMDDAAIIGGPFGYPLG 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RVAESKQV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVAESKQV 398

BLAST of CsGy4G012900 vs. NCBI nr
Match: XP_038902721.1 (probable plastid-lipid-associated protein 12, chloroplastic [Benincasa hispida])

HSP 1 Score: 735 bits (1898), Expect = 6.63e-266
Identity = 371/398 (93.22%), Positives = 384/398 (96.48%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           M LK NA+TPTYLH SFFTP TPSLPPIKR+HR HTHIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MTLKPNAITPTYLHLSFFTPLTPSLPPIKRFHRHHTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLI ALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPT+SSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIQALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTDSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAAS+KDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASIKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFDRAAFSFKFLPFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDRAAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAIST KGVEEAIDKLISEN+NENKFEEEL+EG WNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISTGKGVEEAIDKLISENRNENKFEEELVEGEWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQVIK+GQMK+ VDMLLGLRFSM GTLVKSGD+ YDVTMDDAAIIGGPFGYPL 
Sbjct: 301 ANGLMGMQVIKSGQMKYRVDMLLGLRFSMTGTLVKSGDDIYDVTMDDAAIIGGPFGYPLE 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RV E+KQV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVGETKQV 398

BLAST of CsGy4G012900 vs. NCBI nr
Match: KAG6570366.1 (putative plastid-lipid-associated protein 12, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia])

HSP 1 Score: 726 bits (1875), Expect = 2.11e-262
Identity = 365/398 (91.71%), Positives = 381/398 (95.73%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANA+ PTYLHRSFFTPQ PSLPPIKRYHR +THIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAIAPTYLHRSFFTPQAPSLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLI+ALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIEALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFD+AAFSFKFLPFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQK+
Sbjct: 181 QFDKAAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKR 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAIS  K VEEAIDKLISE +NENKF++ELLEG WNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISAGKRVEEAIDKLISEYRNENKFQQELLEGDWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQ+IKNGQMKF VDMLLG+RFSM GT VKS D+ YDV+MDDAAIIGGPFGYP+ 
Sbjct: 301 ANGLMGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSADDTYDVSMDDAAIIGGPFGYPVE 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RV E KQV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVGEPKQV 398

BLAST of CsGy4G012900 vs. NCBI nr
Match: XP_022987045.1 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucurbita maxima])

HSP 1 Score: 725 bits (1871), Expect = 8.60e-262
Identity = 363/398 (91.21%), Positives = 381/398 (95.73%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANA+ PTYLHRSFFTPQ PSLPPIKRYHR +THIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAIAPTYLHRSFFTPQAPSLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLI+ALIGVQGRGR+VSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIEALIGVQGRGRAVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFD+AAFSFKFLPFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDKAAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAIS  K VEEAIDKL+SE +NENKF++ELLEG WNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISAGKRVEEAIDKLVSEYRNENKFQQELLEGDWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQ+IKNGQMKF VDMLLG+RFSM GT VKS D+ YDV+MDDAAIIGGPFGYP+ 
Sbjct: 301 ANGLMGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSADDTYDVSMDDAAIIGGPFGYPVE 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RV E +QV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVGEPQQV 398

BLAST of CsGy4G012900 vs. ExPASy TrEMBL
Match: A0A0A0KX56 (PAP_fibrillin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G286330 PE=3 SV=1)

HSP 1 Score: 861 bits (2224), Expect = 2.56e-315
Identity = 436/436 (100.00%), Positives = 436/436 (100.00%), Query Frame = 0

Query: 1   MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQT 60
           MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQT
Sbjct: 1   MQINKPLLTLPPLLKITSNLGAYLTSACSVFAFPMPPLMALKANAMTPTYLHRSFFTPQT 60

Query: 61  PSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQ 120
           PSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQ
Sbjct: 61  PSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQ 120

Query: 121 LSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEI 180
           LSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEI
Sbjct: 121 LSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEI 180

Query: 181 FLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPF 240
           FLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPF
Sbjct: 181 FLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPF 240

Query: 241 KLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAID 300
           KLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAID
Sbjct: 241 KLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAID 300

Query: 301 KLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDML 360
           KLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDML
Sbjct: 301 KLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENAANGLMGMQVIKNGQMKFGVDML 360

Query: 361 LGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGY 420
           LGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGY
Sbjct: 361 LGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGY 420

Query: 421 NNILFVHVRVAESKQV 436
           NNILFVHVRVAESKQV
Sbjct: 421 NNILFVHVRVAESKQV 436

BLAST of CsGy4G012900 vs. ExPASy TrEMBL
Match: A0A5D3DEA0 (Putative plastid-lipid-associated protein 12 OS=Cucumis melo var. makuwa OX=1194695 GN=E5676_scaffold859G00220 PE=3 SV=1)

HSP 1 Score: 773 bits (1995), Expect = 5.34e-281
Identity = 390/398 (97.99%), Positives = 393/398 (98.74%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANAMTPTYLHRSFFTP TPSLPPIKRYHR HTHIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAMTPTYLHRSFFTPHTPSLPPIKRYHRHHTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEA ASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAVASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIG+LVKSGDN YDVTMDDAAIIGGPFGYPLG
Sbjct: 301 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGSLVKSGDNTYDVTMDDAAIIGGPFGYPLG 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RVAESKQV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVAESKQV 398

BLAST of CsGy4G012900 vs. ExPASy TrEMBL
Match: A0A1S3BNF7 (probable plastid-lipid-associated protein 12, chloroplastic OS=Cucumis melo OX=3656 GN=LOC103491552 PE=3 SV=1)

HSP 1 Score: 773 bits (1995), Expect = 5.34e-281
Identity = 390/398 (97.99%), Positives = 393/398 (98.74%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANAMTPTYLHRSFFTP TPSLPPIKRYHR HTHIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAMTPTYLHRSFFTPHTPSLPPIKRYHRHHTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEA ASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAVASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIG+LVKSGDN YDVTMDDAAIIGGPFGYPLG
Sbjct: 301 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGSLVKSGDNTYDVTMDDAAIIGGPFGYPLG 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RVAESKQV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVAESKQV 398

BLAST of CsGy4G012900 vs. ExPASy TrEMBL
Match: A0A6J1JFQ6 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurbita maxima OX=3661 GN=LOC111484606 PE=3 SV=1)

HSP 1 Score: 725 bits (1871), Expect = 4.16e-262
Identity = 363/398 (91.21%), Positives = 381/398 (95.73%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANA+ PTYLHRSFFTPQ PSLPPIKRYHR +THIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAIAPTYLHRSFFTPQAPSLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           ENSLI+ALIGVQGRGR+VSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  ENSLIEALIGVQGRGRAVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFD+AAFSFKFLPFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDKAAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAIS  K VEEAIDKL+SE +NENKF++ELLEG WNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISAGKRVEEAIDKLVSEYRNENKFQQELLEGDWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQ+IKNGQMKF VDMLLG+RFSM GT VKS D+ YDV+MDDAAIIGGPFGYP+ 
Sbjct: 301 ANGLMGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSADDTYDVSMDDAAIIGGPFGYPVE 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RV E +QV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVGEPQQV 398

BLAST of CsGy4G012900 vs. ExPASy TrEMBL
Match: A0A6J1FUZ3 (probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurbita moschata OX=3662 GN=LOC111448350 PE=3 SV=1)

HSP 1 Score: 720 bits (1859), Expect = 2.80e-260
Identity = 362/398 (90.95%), Positives = 379/398 (95.23%), Query Frame = 0

Query: 39  MALKANAMTPTYLHRSFFTPQTPSLPPIKRYHRLHTHIRLRCRSSLVDEQQKEVVSFSQP 98
           MALKANA+ PTYLHRSFFTPQ P LPPIKRYHR +THIRLRCRSSLVDEQQKEVVSFS+P
Sbjct: 1   MALKANAIAPTYLHRSFFTPQAPPLPPIKRYHRYNTHIRLRCRSSLVDEQQKEVVSFSEP 60

Query: 99  ENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 158
           E SLI+ALIGVQGRGRSVSSQQLSNVE+AVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP
Sbjct: 61  EKSLIEALIGVQGRGRSVSSQQLSNVEQAVSVLEGLEGVRDPTNSSLIEGRWQLVFTTRP 120

Query: 159 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 218
           GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF
Sbjct: 121 GTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVKFSDAIGELKVEAAASVKDGKRILF 180

Query: 219 QFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 278
           QFD+AAFSFKFLPFKVPYPVPF+LLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ
Sbjct: 181 QFDKAAFSFKFLPFKVPYPVPFRLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFVLQKQ 240

Query: 279 TEARQKLLLAISTDKGVEEAIDKLISENQNENKFEEELLEGGWNMLWSSQMETDSWIENA 338
           TEARQ LLLAIS  K V+EAIDKLISE +NENKF++ELLEG WNMLWSSQMETDSWIENA
Sbjct: 241 TEARQNLLLAISAGKRVDEAIDKLISEYRNENKFQQELLEGDWNMLWSSQMETDSWIENA 300

Query: 339 ANGLMGMQVIKNGQMKFGVDMLLGLRFSMIGTLVKSGDNAYDVTMDDAAIIGGPFGYPLG 398
           ANGLMGMQ+IKNGQMKF VDMLLG+RFSM GT VKS D+ YDV+MDDAAIIGGPFGYP+ 
Sbjct: 301 ANGLMGMQIIKNGQMKFRVDMLLGMRFSMTGTFVKSEDDTYDVSMDDAAIIGGPFGYPVE 360

Query: 399 MESRFKLQLLYNDGKIRITRGYNNILFVHVRVAESKQV 436
           MESRFKLQLLYNDGKIRITRGYNNILFVH+RV E KQV
Sbjct: 361 MESRFKLQLLYNDGKIRITRGYNNILFVHLRVGEPKQV 398

BLAST of CsGy4G012900 vs. TAIR 10
Match: AT1G51110.1 (Plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 486.1 bits (1250), Expect = 2.9e-137
Identity = 240/356 (67.42%), Positives = 292/356 (82.02%), Query Frame = 0

Query: 76  IRLRCRSSLVDEQQKEVVSFSQPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLE 135
           +R+ C SS     Q +  SF+  E  LIDALIG+QGRG+S S +QL++VE AV VLEGLE
Sbjct: 52  LRISCSSSSTVTDQTQQSSFNDAELKLIDALIGIQGRGKSASPKQLNDVESAVKVLEGLE 111

Query: 136 GVRDPTNSSLIEGRWQLVFTTRPGTASIIQRTFVGVDFFSVFQEIFLR-TNDPRVSNIVK 195
           G+++PT+S LIEGRW+L+FTTRPGTAS IQRTF GVD F+VFQ+++L+ TNDPRVSNIVK
Sbjct: 112 GIQNPTDSDLIEGRWRLMFTTRPGTASPIQRTFTGVDVFTVFQDVYLKATNDPRVSNIVK 171

Query: 196 FSDAIGELKVEAAASVKDGKRILFQFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTT 255
           FSD IGELKVEA AS+KDGKR+LF+FDRAAF  KFLPFKVPYPVPF+LLGDEAKGWLDTT
Sbjct: 172 FSDFIGELKVEAVASIKDGKRVLFRFDRAAFDLKFLPFKVPYPVPFRLLGDEAKGWLDTT 231

Query: 256 YLSPSGNLRISRGNKGTTFVLQKQTEARQKLLLAISTDKGVEEAIDKLISENQNENKFEE 315
           YLSPSGNLRISRGNKGTTFVLQK+T  RQKLL  IS DKGV EAID+ ++ N N  +   
Sbjct: 232 YLSPSGNLRISRGNKGTTFVLQKETVPRQKLLATISQDKGVAEAIDEFLASNSNSAEDNY 291

Query: 316 ELLEGGWNMLWSSQMETDSWIENAANGLMGMQVI-KNGQMKFGVDMLLGLRFSMIGTLVK 375
           ELLEG W M+WSSQM TDSWIENAANGLMG Q+I K+G++KF V+++   RFSM G  +K
Sbjct: 292 ELLEGSWQMIWSSQMYTDSWIENAANGLMGRQIIEKDGRIKFEVNIIPAFRFSMKGKFIK 351

Query: 376 SGDNAYDVTMDDAAIIGGPFGYPLGMESRFKLQLLYNDGKIRITRGYNNILFVHVR 430
           S  + YD+ MDDAAIIGG FGYP+ + +  +L++LY D K+RI+RG++NI+FVH+R
Sbjct: 352 SESSTYDLKMDDAAIIGGAFGYPVDITNNIELKILYTDEKMRISRGFDNIIFVHIR 407

BLAST of CsGy4G012900 vs. TAIR 10
Match: AT3G26070.1 (Plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 53.1 bits (126), Expect = 6.4e-07
Identity = 46/183 (25.14%), Positives = 88/183 (48.09%), Query Frame = 0

Query: 97  QPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTT 156
           Q +  L++A+  ++ RG + S      +++    +E +   ++P  S L+ G+W+L++TT
Sbjct: 73  QLKQELLEAIEPLE-RGATASPDDQLRIDQLARKVEAVNPTKEPLKSDLVNGKWELIYTT 132

Query: 157 RPGTASIIQRTFVGVDFFSVFQEIFLRTNDPRVSNIVK---FSDAIGELKVEAAASVKDG 216
              +ASI+Q         S+     +  +  +V N+     ++   G++K        + 
Sbjct: 133 ---SASILQAKKPRF-LRSITNYQSINVDTLKVQNMETWPFYNSVTGDIK------PLNS 192

Query: 217 KRILFQFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTF 276
           K++  +       FK L F +P   P     D A+G L+ TY+     LR+SRG+KG  F
Sbjct: 193 KKVAVKLQ----VFKILGF-IPIKAP-----DSARGELEITYVDE--ELRLSRGDKGNLF 232

BLAST of CsGy4G012900 vs. TAIR 10
Match: AT3G26080.1 (plastid-lipid associated protein PAP / fibrillin family protein )

HSP 1 Score: 48.1 bits (113), Expect = 2.0e-05
Identity = 45/182 (24.73%), Positives = 82/182 (45.05%), Query Frame = 0

Query: 97  QPENSLIDALIGVQGRGRSVSSQQLSNVERAVSVLEGLEGVRDPTNSSLIEGRWQLVFTT 156
           Q ++ L++A+  ++ RG + S      +++    +E +   ++P  S LI G+W+L++TT
Sbjct: 64  QLKHELVEAIEPLE-RGATASPDDQLLIDQLARKVEAVNPTKEPLKSDLINGKWELIYTT 123

Query: 157 RPGTASIIQRTFVGVDFFSVFQEIFLR--TNDPRVSNIVKFSDAIGELKVEAAASVKDGK 216
              +A+I+Q            +  FLR  TN   ++      D +   ++E         
Sbjct: 124 ---SAAILQAK----------KPRFLRSLTNYQCIN-----MDTLKVQRMETWPFYNSVT 183

Query: 217 RILFQFDRAAFSFKFLPFKVPYPVPFKLLGDEAKGWLDTTYLSPSGNLRISRGNKGTTFV 276
             L   +    + K   FK+   +P K     A+G L+ TY+     LRISRG     F+
Sbjct: 184 GDLTPLNSKTVAVKLQVFKILGFIPVKAPDGTARGELEITYVDE--ELRISRGKGNLLFI 224
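
The percentages in each HSP summary line above are the identical and positive residue counts divided by the aligned length. The following is a minimal sketch, not part of the source database's tooling, that parses such a line and recomputes both values. The function name `parse_hsp_stats` and the regular expression are assumptions based only on the line layout shown on this page.

```python
import re

# Hypothetical pattern inferred from the summary lines on this page.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and recomputed percentages from one HSP summary line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("not an HSP summary line: %r" % line)
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identical": int(ident),
        "positives": int(pos),
        "aligned_length": int(ident_len),
        # Recomputed from the raw counts, rounded to two decimals.
        "identity_pct": round(100 * int(ident) / int(ident_len), 2),
        "positives_pct": round(100 * int(pos) / int(pos_len), 2),
        # Values as printed in the report, kept for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 240/356 (67.42%), Positives = 292/356 (82.02%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported fields is a quick consistency check when these summary lines are copied between tools.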

The following BLAST results are available for this feature:
Match Name	E-value	Identity	Description
Q8LAP6	4.1e-136	67.42	Probable plastid-lipid-associated protein 12, chloroplastic OS=Arabidopsis thali...
O49629	1.6e-07	23.50	Probable plastid-lipid-associated protein 2, chloroplastic OS=Arabidopsis thalia...
Q96398	1.1e-06	23.93	Chromoplast-specific carotenoid-associated protein, chromoplastic OS=Cucumis sat...
P80471	1.8e-06	25.74	Light-induced protein, chloroplastic OS=Solanum tuberosum OX=4113 PE=1 SV=2
Q94FZ9	2.4e-06	25.00	Plastid lipid-associated protein 1, chloroplastic OS=Brassica campestris OX=3711...
Match Name	E-value	Identity	Description
XP_004142141.1	5.29e-315	100.00	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucumis ...
XP_008449765.1	1.10e-280	97.99	PREDICTED: probable plastid-lipid-associated protein 12, chloroplastic [Cucumis ...
XP_038902721.1	6.63e-266	93.22	probable plastid-lipid-associated protein 12, chloroplastic [Benincasa hispida]
KAG6570366.1	2.11e-262	91.71	putative plastid-lipid-associated protein 12, chloroplastic, partial [Cucurbita ...
XP_022987045.1	8.60e-262	91.21	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 [Cucurbit...
Match Name	E-value	Identity	Description
A0A0A0KX56	2.56e-315	100.00	PAP_fibrillin domain-containing protein OS=Cucumis sativus OX=3659 GN=Csa_4G2863...
A0A5D3DEA0	5.34e-281	97.99	Putative plastid-lipid-associated protein 12 OS=Cucumis melo var. makuwa OX=1194...
A0A1S3BNF7	5.34e-281	97.99	probable plastid-lipid-associated protein 12, chloroplastic OS=Cucumis melo OX=3...
A0A6J1JFQ6	4.16e-262	91.21	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurb...
A0A6J1FUZ3	2.80e-260	90.95	probable plastid-lipid-associated protein 12, chloroplastic isoform X1 OS=Cucurb...
Match Name	E-value	Identity	Description
AT1G51110.1	2.9e-137	67.42	Plastid-lipid associated protein PAP / fibrillin family protein
AT3G26070.1	6.4e-07	25.14	Plastid-lipid associated protein PAP / fibrillin family protein
AT3G26080.1	2.0e-05	24.73	plastid-lipid associated protein PAP / fibrillin family protein
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
None	No IPR available	COILS	Coil	Coil	coord: 295..315
None	No IPR available	PANTHER	PTHR31906:SF30	PLASTID-LIPID-ASSOCIATED PROTEIN 12, CHLOROPLASTIC-RELATED	coord: 69..292
None	No IPR available	PROSITE	PS51257	PROKAR_LIPOPROTEIN	coord: 1..28; score: 5.0
IPR006843	Plastid lipid-associated protein/fibrillin conserved domain	PFAM	PF04755	PAP_fibrillin	coord: 99..275; e-value: 8.5E-43; score: 146.6
IPR039633	Plastid-lipid-associated protein	PANTHER	PTHR31906	FAMILY NOT NAMED	coord: 69..292

Relationships

The following mRNA feature(s) are a part of this gene:

Feature Name	Unique Name	Type
CsGy4G012900.1	CsGy4G012900.1	mRNA