Cucsa.281100 (gene) Cucumber (Gy14) v1

NameCucsa.281100
Typegene
OrganismCucumis sativus (Cucumber (Gy14) v1)
DescriptionProtein of unknown function (DUF3531)
Locationscaffold02633 : 1003088 .. 1008206 (+)
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTRCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
GCGAGGGAGGGAACCAACTTTCTCCTAAGAATTAATCAGCGCGAGAGTTTCTGAAGAACAAAGAGAAAAGCAGCTGAAAGGAAAGAAGACACGAGGACATGTCCGTGTTCAATGGCATCGGATTAGGGTTAGCTTTTACGAATCCCAATTCCACTTGCATTTTTCATTCTAATACAAGATTTTTCCCACAATCCCTCACTTCAATTCCTGAATTTCGTCCAATTTCACTTCGTTCTCGTGCATTGCTTTCTGAAAATGGCGATGATTCTAAGTTTGACGCTGTGAGTACTTCTACACCCACTGCTACTGATGCTAAGAAGAGCTCTGGAACCTCTGCTAGAAGTCGTCGATTGCTTAaGCTTCGTGAAGAGAAACGCAAACGAGAACATGATCGTCTCCACAATTATCCTGCTTGGGCgAAGTCTCTCTCTCTCTCTctctctCTcTctCTcNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNgAGTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTGTGTTTTTtCTGTATGATTTGTGTAGCGTTTTATTTGATTATTGGTAATGAAATTGCTGGGAAAACGACAGAGTGTTGGAAGATGCCTGCAAAAACGATGCGGAATTACGAGCTGTTCTTGGCGATAGCATTGGCAATCCAGAGGAGATGAGAAAAAAGGTATATCAGTTTGGCTTATATTCATCTTCTTCATTTCATTTTTTGAGTATCAGTTTGTTGTAATTGAATACATTTCCTGTTTCCACATTTCTTGTTGTTTCATTTTTCTAACTCTTTGTTCAGAATTCTTTCACCTCCGTTTGAGCGTCGAGAAACCTTAACAAAATGATTGGAACTAGTTTGTATCATATTTGAAATTTACTATTTAATGTTTCCTTTATATATTCATCAGTCGGACGTCTGAGAAGATAAGATTCACCTTATTATCTTCTCGCAATTACGAGGATTAAGTAGGGTCCTTGTTATAATATATCTTGTTGCTATAGTTGAATATGGTGCTTGCTATGAGAGAATTGGGAAGTTCTAGGTAGTGATTTTATCTCAATGAGAGAGCATTTTCTGGATAATCAACGGGACTGTTTTATTAGTTTGTCGTTTTATAAGCCGCTCAGTTTACTTATATTCCAGTATTGGCTTTTCAAAACATTTGGCTGGAGAAGTTTCATGTGACTAATAATTCATATAGAAAATTCTTGTAATTAAACTTATTTAGCGTCTATCGTCAATCACAAGCCGATAGCTTCATTTTAGAGTATCAAAAGTCGTTTGAATATCTTGTCTTGAACATTTGTCTGTGATTACATTAACATAGATGATTGAACTGATGAACAGAATAAGCTTCTTTCTGTGAGGTATCAACCAGAAGAGAAAAGGCTTCTAGATTTTTTTCATGTAATTTCTTTTAAATAATGGTGTTTTTTtATCTTTGTGGATAGTTACATTTTCCAATGTGATTGCAACAGGTTGAAGAGAGAGTACGGAGGAAGGGTAGAGATTTCCAAAAGTCTAAAACGGGTTCCATCCTTGCCTTCAAAGTCAGCTTTAGAGAGTAATCTATTCAATAAATGGCCCCTTATTTGTTTTTCTGCCCTTTTCTGTCCTCGCTAGTATAACCTTATTTTTGGTGCCAAAATAAAAGCTGGATCCTCCACCATAAAAGAAAAGACTTCACCTCATTGCCAATTTCATTTTGTTCTTGCAATCGGCCATTACTCTTTAAATATGATTATGTTCTTCTCAATTGCTCAGGTTTCAAATCTGTTTTGAAGAAAATATTTAAGACCCAAAAGTTGTTAAAAGTCCTGGCTGGTTCAGCACAAAGTCTTATTTAAAGCAGCTGTGTCTTAAGAAAAAAAAaTTCATTGTGGCTTCAAATCACGATAGGGATTTAGTTTCAACTTTGGGTTTCCTAAATTTGCATGTCAGAAATTTTTTATTCATTAAAGAATGGTTGGATATTAATACTGTTTTACGGTTCTTATCATGGTTAAATTTCTTGTCTTCTGAGCTATTTTGAGGCTGAAATATGCTTTCTGGTTGTTTATTAGTTTTTTTGCTCACGAATGTACAAGTGAACTATATGAAATATATTGATGGGGATGCTGGTCATAATTGAAATTTTGGATCATGACTAATTTATACGTCAATAACTATATAATCACCTAAAACCCCATAAATTAGTTTTTCTTCTTTTTGGAAAATTTGGTGTAGTTCAGTGCAAAACTGAACAATCCCGGGATACAGGTATTCTGGAGTTCTGCTATACATTTGTCCTCGTTTCGTGATTAACCGGTCTTTAGGTATAGCCATTGTTTTTTATGCACTAGATATATATAATACCTGTTTGCTACGTGTCTTGGGAATTTTTCAGTCATCCTTATGCTGCTAAGTGCTAATGAAGTGAAAGTTGTTAGTATTGCCATTGATGTGGTGATAAATTGTTCATTTAACTTTTTAGACGTTCAACTTTACCATTTTAAGTAACTGATCATTCTTTTCCCTCTGTTTTGCCGCACAGCTTCAATCCTCTTGATTCCTACATATGGTTTGAGCTGATTGGATCACCAACTGATCGAGATGTTGATCTTATTGGCAGTGTAAGTTAATTCTTATTGCTGTAGAATGAGTTTCTTTGTCAGAATTATTGGTATACCTGAGCTGCTCAAACTTTACAAGGACCATATTACTTACATTACAACTAATCGTGCATCCATGAGTTACAAGGACCATATGAGTTTCTTCTCTGTACATCCATGAATGACTTTTAACTTGTAGTTTATATTACTTACATTACAACTAATCGTGCAGGTTATCCAGTCATGGTATGTCATGGGTCGATTGGGGGCCTTTAACTCTTCAAATTTGCAGGTTACACCATATATCCTTGAACCTACTTGATTTATAGTATACTTCATTAAAGCCCCTTACCGTGATGATCTGAAGTCAATTACTCCTTTTATCTTACGTAATACGAATCTATGAATTACTTCTTTAGTTATTCTGTTTAGTTGTGGATTGGAAAGAGGGAAAGTTTTTCTGCCAGTTTCTTAATATTTTCTACATGACATTATTTCTTGTGGAGATAACACTTACCTGATAAATGAATTGCACAACTGTTTAATTAGCTTGTCAGCCTTATATGGTTGAAGGTAGTCACTCTGCTTTGGATGGACTCTTGATCACTATCCACTCATGCTCAGACTACTGAAAAAACGGAACTATGGTCCATGTTAAAAGCATTTTTTTTTtCTTGTAAAAGAAATAACACATTCAATTAATTATGAATTCGATTTATTCGAAGGCTGTTTATATATGAGTCCGGCGGATGGATGACCCAAAAATTGGAGAAGAAATATGGAATGACCTGTTCTTTTTAAAATCACAAATAACCTTTTGTTAATGTCAAATTCTAGTGGAACTACCAAAATGTTCTCGCTTGCACTCGGAAAAACAGTCCCGTCTAGCTTTCTTCCGTCTCGCTCAATCTCTCTTAGTCTTGCACGGTTGGTGTCGCTCTCGCCCATCTCATTTGGTCTCCTTGTTGTCGTTCAGTCTTGCCCATTTTGGTTTGTGTCGCTTTCACCCATTAGTTTGTGTCACTCGATGTTGGAGAAGTGGTCGGAGGAGAAGAAGAAGATGATGAGAAAAAGAGAAAAATGTGTGGATGCTGGAGAAGAACGTGGGGAGATAAGAAAAAACTTGTGTGGCGGCTGGGAGGAAGATAAAAAAAAaCCTAAATCAATATTGAGAGCAATGTTGTAATTTCAGACAAGGATGGACATCATCGAAGGGGTATTTGATGTTTTCTAAGAAATTTGGGTCATTTGCCATTTGAGTGCCTTTAAAATGGGTCTTTTAGACAATTGAATCTTTACATATCTCGCTACATTCTATTCTATCCTTTTTTtCCCTTTTGTGTGATACCCAAAAGAAAGGCGGTTAGTCTGTAACTCCTTTACATAGTGACTGCTGCGTGAATTGCAAAATTGTTGGATTCAAATTTTCTGTTGAATTAAATGCAGCTGGCGAATTCATCCATGGAGTACAATCCTGTCTACGATGCAGATAAAGGGTTTAAAGTGATGCAGTCATCATTTCATGATATCAGTGATGTTGAGTTTCAGGACAACTGGGGCCGGGTTTGGTAAGAGTTCTTGAAACTAACTTTATTCTATTGTTCAAACAAGTTGTTATTATGTTGTTGTTTAATGCGTGGCAGGGTCGACCTCGGTACGTCTGATTATTTCGCCATCGATGTTCTATTGAACTGCTTGACTGTTCTAAGCTCAGAGTAAGTTCGACTTTGTCAACTTTGTATTATACTCTTTTACGACAGCTTCTATGCCATGATACATTGTAAAGTTTTAGATACTCATATTTATCTAATAATAGAGCAACATTGACGTGCAGATATTTAGGCATCCAACAAGTTGTGTTTGGTGGACGTCGAATGGGCGATTGGGAAGAAGGGATGACAAGTCCTGACTATGGGTACAAGTCTTTCAAAATCTAAACTTTTACTTTTAAAATCCTGAGTGTATATATAATGCCTAAATCTTTATCGTTTAAATTTCTTCCCTAATAGTAGGAACTTTTTTtAGTACGCGGCGGGTAGGAAGATTCGAACCTTAGATCTTTTAACCGCTAACACATTTTTGTATTAGTGGACAGTTATATCTACTTGCTTTAGCAATCATAGGAAGCTTTTAGGCTTTAGTTCAAGTAGTTTAAAAATGGCCCCACACATTCTTCTCCTCTAAGAAAGGGAACACCAATTAGACGTAGGAATCATCTTTCTCTTAATATTAAAAAAGTAGTCAAGTTATACTCGATGAGA

mRNA sequence

GCGAGGGAGGGAACCAACTTTCTCCTAAGAATTAATCAGCGCGAGAGTTTCTGAAGAACAAAGAGAAAAGCAGCTGAAAGGAAAGAAGACACGAGGACATGTCCGTGTTCAATGGCATCGGATTAGGGTTAGCTTTTACGAATCCCAATTCCACTTGCATTTTTCATTCTAATACAAGATTTTTCCCACAATCCCTCACTTCAATTCCTGAATTTCGTCCAATTTCACTTCGTTCTCGTGCATTGCTTTCTGAAAATGGCGATGATTCTAAGTTTGACGCTGTGAGTACTTCTACACCCACTGCTACTGATGCTAAGAAGAGCTCTGGAACCTCTGCTAGAAGTCGTCGATTGCTTAAGCTTCGTGAAGAGAAACGCAAACGAGAACATGATCGTCTCCACAATTATCCTGCTTGGGCGAAAGTGTTGGAAGATGCCTGCAAAAACGATGCGGAATTACGAGCTGTTCTTGGCGATAGCATTGGCAATCCAGAGGAGATGAGAAAAAAGGTTGAAGAGAGAGTACGGAGGAAGGGTAGAGATTTCCAAAAGTCTAAAACGGGTTCCATCCTTGCCTTCAAAGTCAGCTTTAGAGACTTCAATCCTCTTGATTCCTACATATGGTTTGAGCTGATTGGATCACCAACTGATCGAGATGTTGATCTTATTGGCAGTGTTATCCAGTCATGGTATGTCATGGGTCGATTGGGGGCCTTTAACTCTTCAAATTTGCAGCTGGCGAATTCATCCATGGAGTACAATCCTGTCTACGATGCAGATAAAGGGTTTAAAGTGATGCAGTCATCATTTCATGATATCAGTGATGTTGAGTTTCAGGACAACTGGGGCCGGGTTTGGGTCGACCTCGGTACGTCTGATTATTTCGCCATCGATGTTCTATTGAACTGCTTGACTGTTCTAAGCTCAGAATATTTAGGCATCCAACAAGTTGTGTTTGGTGGACGTCGAATGGGCGATTGGGAAGAAGGGATGACAAGTCCTGACTATGGGTACAAGTCTTTCAAAATCTAAACTTTTACTTTTAAAATCCTGAGTGTATATATAATGCCTAAATCTTTATCGTTTAAATTTCTTCCCTAATAGTAGGAACTTTTTTTAGTACGCGGCGGGTAGGAAGATTCGAACCTTAGATCTTTTAACCGCTAACACATTTTTGTATTAGTGGACAGTTATATCTACTTGCTTTAGCAATCATAGGAAGCTTTTAGGCTTTAGTTCAAGTAGTTTAAAAATGGCCCCACACATTCTTCTCCTCTAAGAAAGGGAACACCAATTAGACGTAGGAATCATCTTTCTCTTAATATTAAAAAAGTAGTCAAGTTATACTCGATGAGA

Coding sequence (CDS)

ATGTCCGTGTTCAATGGCATCGGATTAGGGTTAGCTTTTACGAATCCCAATTCCACTTGCATTTTTCATTCTAATACAAGATTTTTCCCACAATCCCTCACTTCAATTCCTGAATTTCGTCCAATTTCACTTCGTTCTCGTGCATTGCTTTCTGAAAATGGCGATGATTCTAAGTTTGACGCTGTGAGTACTTCTACACCCACTGCTACTGATGCTAAGAAGAGCTCTGGAACCTCTGCTAGAAGTCGTCGATTGCTTAaGCTTCGTGAAGAGAAACGCAAACGAGAACATGATCGTCTCCACAATTATCCTGCTTGGGCgAAAGTGTTGGAAGATGCCTGCAAAAACGATGCGGAATTACGAGCTGTTCTTGGCGATAGCATTGGCAATCCAGAGGAGATGAGAAAAAAGGTTGAAGAGAGAGTACGGAGGAAGGGTAGAGATTTCCAAAAGTCTAAAACGGGTTCCATCCTTGCCTTCAAAGTCAGCTTTAGAGACTTCAATCCTCTTGATTCCTACATATGGTTTGAGCTGATTGGATCACCAACTGATCGAGATGTTGATCTTATTGGCAGTGTTATCCAGTCATGGTATGTCATGGGTCGATTGGGGGCCTTTAACTCTTCAAATTTGCAGCTGGCGAATTCATCCATGGAGTACAATCCTGTCTACGATGCAGATAAAGGGTTTAAAGTGATGCAGTCATCATTTCATGATATCAGTGATGTTGAGTTTCAGGACAACTGGGGCCGGGTTTGGGTCGACCTCGGTACGTCTGATTATTTCGCCATCGATGTTCTATTGAACTGCTTGACTGTTCTAAGCTCAGAATATTTAGGCATCCAACAAGTTGTGTTTGGTGGACGTCGAATGGGCGATTGGGAAGAAGGGATGACAAGTCCTGACTATGGGTACAAGTCTTTCAAAATCTAA

Protein sequence

MSVFNGIGLGLAFTNPNSTCIFHSNTRFFPQSLTSIPEFRPISLRSRALLSENGDDSKFDAVSTSTPTATDAKKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI*
BLAST of Cucsa.281100 vs. TrEMBL
Match: A0A0B2SCW4_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_021604 PE=4 SV=1)

HSP 1 Score: 448.7 bits (1153), Expect = 5.6e-123
Identity = 219/283 (77.39%), Postives = 251/283 (88.69%), Query Frame = 1

Query: 28  FFPQSLTSIPEFRPISLRSRALLSENGDDSKFDAVSTSTPTATDAKKSSGTSARSRRLLK 87
           +FP S T    FRP+ L S + +S++   S   + S       +  K SGT+AR RRLL+
Sbjct: 24  WFPSSTTK-RSFRPV-LVSVSAISDDNSHSYTSSSSNGRKLEEEGIKGSGTTARDRRLLR 83

Query: 88  LREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGR 147
           +R+EKR+RE+DRL+NYPAWAKVLE+ACK+DAELRAVLGDSIGNPE MRK+VE+RVR+KGR
Sbjct: 84  IRQEKRQREYDRLNNYPAWAKVLENACKDDAELRAVLGDSIGNPELMRKRVEDRVRKKGR 143

Query: 148 DFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFN 207
           DFQKSKTGS+LAFKV+FRDFNPLDSYIWFEL GSP+DRDV+LIG+VIQSWYVMGRLGAFN
Sbjct: 144 DFQKSKTGSVLAFKVTFRDFNPLDSYIWFELFGSPSDRDVNLIGNVIQSWYVMGRLGAFN 203

Query: 208 SSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVL 267
           SSNLQLANSS+EY+P+YDADKGFKVM SSFHDISD+EFQ+NWGRVWVDLGTSDYFAIDVL
Sbjct: 204 SSNLQLANSSVEYDPLYDADKGFKVMPSSFHDISDIEFQENWGRVWVDLGTSDYFAIDVL 263

Query: 268 LNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           LNCLTVLSSEYLGIQQ+VFGGRRMGDWEEGMTSP+YGYK FKI
Sbjct: 264 LNCLTVLSSEYLGIQQIVFGGRRMGDWEEGMTSPEYGYKYFKI 304

BLAST of Cucsa.281100 vs. TrEMBL
Match: A0A068V8Y6_COFCA (Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00018256001 PE=4 SV=1)

HSP 1 Score: 448.0 bits (1151), Expect = 9.6e-123
Identity = 209/237 (88.19%), Postives = 231/237 (97.47%), Query Frame = 1

Query: 74  KSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEE 133
           K SGT+AR RRL+K+REEKRKRE+DRLHNYPAWAKVLEDACKNDAELRAVLGD+IGNPE 
Sbjct: 24  KGSGTTARGRRLIKVREEKRKREYDRLHNYPAWAKVLEDACKNDAELRAVLGDTIGNPEL 83

Query: 134 MRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSV 193
           MRK+VEERVRRKGRDFQKSKTGS+LAFKVSFRDFNPLDSYIWFEL GSP+DRDVDL+GSV
Sbjct: 84  MRKRVEERVRRKGRDFQKSKTGSVLAFKVSFRDFNPLDSYIWFELYGSPSDRDVDLLGSV 143

Query: 194 IQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVW 253
           IQSWY+MGR+GAFNSSNLQLANSSMEY+P+YDADKGFKVM SSFHDISDVEFQDNWGR+W
Sbjct: 144 IQSWYIMGRIGAFNSSNLQLANSSMEYDPLYDADKGFKVMPSSFHDISDVEFQDNWGRIW 203

Query: 254 VDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           VDLGTSD+F+ID+LLNCLTVLSSEY+GIQQV+FGGR++GDWEEGMTSP+YGYK FKI
Sbjct: 204 VDLGTSDFFSIDILLNCLTVLSSEYVGIQQVIFGGRKIGDWEEGMTSPEYGYKFFKI 260

BLAST of Cucsa.281100 vs. TrEMBL
Match: F6HVX8_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0071g01140 PE=4 SV=1)

HSP 1 Score: 447.6 bits (1150), Expect = 1.2e-122
Identity = 213/237 (89.87%), Postives = 229/237 (96.62%), Query Frame = 1

Query: 74  KSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEE 133
           K SGT+AR RRLLKLREEKRKRE+DRLHNYPAWAKV+EDACK+D+ELRAVLGDSIGNPE 
Sbjct: 61  KGSGTTARGRRLLKLREEKRKREYDRLHNYPAWAKVMEDACKDDSELRAVLGDSIGNPEL 120

Query: 134 MRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSV 193
           MRK+VEERVR+KGRDF+KSKTGS+LA+KVSFRDFNP+DSYIWFEL GSP+DRDVDLIGSV
Sbjct: 121 MRKRVEERVRKKGRDFRKSKTGSVLAYKVSFRDFNPVDSYIWFELYGSPSDRDVDLIGSV 180

Query: 194 IQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVW 253
           IQSWYVMGRLGAFNSSNLQLANSSMEYNP+YDADKGFK+M SSFHDI DVEFQDNWGRVW
Sbjct: 181 IQSWYVMGRLGAFNSSNLQLANSSMEYNPLYDADKGFKLMPSSFHDIGDVEFQDNWGRVW 240

Query: 254 VDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           VDLGTSD+FAIDVLLNCLTVLSSEYLGIQQVVFGGR MGDWEEGMTSP+YGYK FKI
Sbjct: 241 VDLGTSDFFAIDVLLNCLTVLSSEYLGIQQVVFGGRNMGDWEEGMTSPEYGYKYFKI 297

BLAST of Cucsa.281100 vs. TrEMBL
Match: K7K0P8_SOYBN (Uncharacterized protein OS=Glycine max GN=GLYMA_03G000100 PE=4 SV=1)

HSP 1 Score: 447.2 bits (1149), Expect = 1.6e-122
Identity = 218/283 (77.03%), Postives = 250/283 (88.34%), Query Frame = 1

Query: 28  FFPQSLTSIPEFRPISLRSRALLSENGDDSKFDAVSTSTPTATDAKKSSGTSARSRRLLK 87
           +FP S T    FRP+ L S + +S++   S   + S       +  K SGT+AR RRLL+
Sbjct: 24  WFPSSTTK-RSFRPV-LVSVSAISDDNSHSYTSSSSNGRKLEEEGIKGSGTTARDRRLLR 83

Query: 88  LREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGR 147
           +R+EKR+RE+DRL+NYPAWAKVLE+ACK+DAELRAVLGDSIGNPE MRK+VE+RVR+KGR
Sbjct: 84  IRQEKRQREYDRLNNYPAWAKVLENACKDDAELRAVLGDSIGNPELMRKRVEDRVRKKGR 143

Query: 148 DFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFN 207
           DFQKSKTGS+LAFKV+FRDFNPLDSYIWFEL GSP+DRDV+LIG+VIQSWYVMGRLGAFN
Sbjct: 144 DFQKSKTGSVLAFKVTFRDFNPLDSYIWFELFGSPSDRDVNLIGNVIQSWYVMGRLGAFN 203

Query: 208 SSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVL 267
           SSNLQLANSS+EY+P+YDADKGFKVM SSFHDISD+EFQ+NWGRVWVDLGTSDYFAIDVL
Sbjct: 204 SSNLQLANSSVEYDPLYDADKGFKVMPSSFHDISDIEFQENWGRVWVDLGTSDYFAIDVL 263

Query: 268 LNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           LNCLT LSSEYLGIQQ+VFGGRRMGDWEEGMTSP+YGYK FKI
Sbjct: 264 LNCLTALSSEYLGIQQIVFGGRRMGDWEEGMTSPEYGYKYFKI 304

BLAST of Cucsa.281100 vs. TrEMBL
Match: A0A0B2QHK6_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_037211 PE=4 SV=1)

HSP 1 Score: 446.0 bits (1146), Expect = 3.6e-122
Identity = 217/283 (76.68%), Postives = 248/283 (87.63%), Query Frame = 1

Query: 28  FFPQSLTSIPEFRPISLRSRALLSENGDDSKFDAVSTSTPTATDAKKSSGTSARSRRLLK 87
           +FP S T    FRP+ +   A+     DD+     S+      +  K SGT+AR RRLL+
Sbjct: 24  WFPSSTTK-RSFRPVIVSVSAI----SDDNSHSYTSSGRKLEEEGIKGSGTTARDRRLLR 83

Query: 88  LREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGR 147
           +R+EKR+RE+D L+NYPAWAKVLE+ACK+DAELRAVLGDSIGNPE MRK+VE+RVR+KGR
Sbjct: 84  IRQEKRQREYDLLNNYPAWAKVLENACKDDAELRAVLGDSIGNPELMRKRVEDRVRKKGR 143

Query: 148 DFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFN 207
           DFQKSKTGS+LAFKV+FRDFNPLDSYIWFEL GSP+DRDV+LIG+VIQSWYVMGRLGAFN
Sbjct: 144 DFQKSKTGSVLAFKVTFRDFNPLDSYIWFELFGSPSDRDVNLIGNVIQSWYVMGRLGAFN 203

Query: 208 SSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVL 267
           SSNLQLANSS+EY+P+YDADKGFKVM SSFHDISD+EFQ+NWGRVWVDLGTSDYFAIDVL
Sbjct: 204 SSNLQLANSSVEYDPLYDADKGFKVMPSSFHDISDIEFQENWGRVWVDLGTSDYFAIDVL 263

Query: 268 LNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           LNCLTVLSSEYLGIQQ+VFGGRRMGDWEEGMTSP+YGYK FKI
Sbjct: 264 LNCLTVLSSEYLGIQQIVFGGRRMGDWEEGMTSPEYGYKYFKI 301

BLAST of Cucsa.281100 vs. TAIR10
Match: AT5G08400.1 (AT5G08400.1 Protein of unknown function (DUF3531))

HSP 1 Score: 412.1 bits (1058), Expect = 2.9e-115
Identity = 207/326 (63.50%), Postives = 245/326 (75.15%), Query Frame = 1

Query: 18  STCIFHSNTRFFPQSLTSIPEFRPISLRSRALLSENGDDSKFDAVSTSTPTATDAKKSSG 77
           S+C  + N  F P  ++    F         L++ + +            +A  A K SG
Sbjct: 10  SSCTMNLNFAFSPFLVSQRQPFSSHKRNLHTLVAVSANSDNLAGEDNGGISA--ANKGSG 69

Query: 78  TSARSRRLLKLREEKRKREHDRLHNYPAWA------------------------------ 137
           T+AR RRLLK+REEKRKR++DRLH+YP+WA                              
Sbjct: 70  TTARGRRLLKVREEKRKRDYDRLHDYPSWAKYLFLSFSFALQVFVFLPKSRESVNLFLVN 129

Query: 138 ---KVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSF 197
              +VLE ACK+D ELRAVLGDSIGNPE MRKKVEERVR+KG+DFQK KTGS+L+FKV+F
Sbjct: 130 DKCRVLESACKDDEELRAVLGDSIGNPELMRKKVEERVRKKGKDFQKQKTGSVLSFKVNF 189

Query: 198 RDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVY 257
           RDFNP+DS+IWFEL G+P+DRDVDLIGSVIQ+WYVMGRLGAFN+SNLQLAN+S+EY+P+Y
Sbjct: 190 RDFNPVDSFIWFELYGTPSDRDVDLIGSVIQAWYVMGRLGAFNTSNLQLANTSLEYDPLY 249

Query: 258 DADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQV 311
           DA+KGFKVM SSFHDISDVEFQDNWGRVWVDLGTSD FA+DVLLNCLTV+SSEYLGIQQV
Sbjct: 250 DAEKGFKVMPSSFHDISDVEFQDNWGRVWVDLGTSDIFALDVLLNCLTVMSSEYLGIQQV 309

BLAST of Cucsa.281100 vs. TAIR10
Match: AT4G29400.1 (AT4G29400.1 Protein of unknown function (DUF3531))

HSP 1 Score: 188.7 bits (478), Expect = 5.3e-48
Identity = 85/194 (43.81%), Postives = 131/194 (67.53%), Query Frame = 1

Query: 117 DAELRAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWF 176
           D E   +LGD + NP++ +KK+EER+R+K      +KTGS  +  V+F  F   +SY+W 
Sbjct: 109 DPEFADILGDCLDNPDKAQKKMEERLRKKRNKILHTKTGSATSMPVTFNKFEYSNSYMWL 168

Query: 177 ELIGSPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSS 236
           E   +P D+D+ LI   I+SW+++GRLG +NS N+QL+ + ++  P YDA  G  V  ++
Sbjct: 169 EFYNTPLDKDIALISDTIRSWHILGRLGGYNSMNMQLSQAPLDKRPNYDAILGANVEPTT 228

Query: 237 FHDISDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEE 296
           F++I D+E QDN  R+W+D+GTS+   +DVL+N LT +SS+Y+GI++VVFGG     W+E
Sbjct: 229 FYNIGDLEVQDNVARIWLDIGTSEPLILDVLINALTQISSDYVGIKKVVFGGSEFESWKE 288

Query: 297 GMTSPDYGYKSFKI 311
            MTS + G++  KI
Sbjct: 289 NMTSEESGFRVHKI 302

BLAST of Cucsa.281100 vs. NCBI nr
Match: gi|449458716|ref|XP_004147093.1| (PREDICTED: uncharacterized protein LOC101211689 [Cucumis sativus])

HSP 1 Score: 628.6 bits (1620), Expect = 5.7e-177
Identity = 310/310 (100.00%), Postives = 310/310 (100.00%), Query Frame = 1

Query: 1   MSVFNGIGLGLAFTNPNSTCIFHSNTRFFPQSLTSIPEFRPISLRSRALLSENGDDSKFD 60
           MSVFNGIGLGLAFTNPNSTCIFHSNTRFFPQSLTSIPEFRPISLRSRALLSENGDDSKFD
Sbjct: 1   MSVFNGIGLGLAFTNPNSTCIFHSNTRFFPQSLTSIPEFRPISLRSRALLSENGDDSKFD 60

Query: 61  AVSTSTPTATDAKKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAEL 120
           AVSTSTPTATDAKKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAEL
Sbjct: 61  AVSTSTPTATDAKKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAEL 120

Query: 121 RAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIG 180
           RAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIG
Sbjct: 121 RAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIG 180

Query: 181 SPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDI 240
           SPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDI
Sbjct: 181 SPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDI 240

Query: 241 SDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTS 300
           SDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTS
Sbjct: 241 SDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTS 300

Query: 301 PDYGYKSFKI 311
           PDYGYKSFKI
Sbjct: 301 PDYGYKSFKI 310

BLAST of Cucsa.281100 vs. NCBI nr
Match: gi|659090247|ref|XP_008445913.1| (PREDICTED: uncharacterized protein LOC103488796 [Cucumis melo])

HSP 1 Score: 609.4 bits (1570), Expect = 3.6e-171
Identity = 297/310 (95.81%), Postives = 304/310 (98.06%), Query Frame = 1

Query: 1   MSVFNGIGLGLAFTNPNSTCIFHSNTRFFPQSLTSIPEFRPISLRSRALLSENGDDSKFD 60
           MSV +G+GLGLAFTNPNSTC FHSNTRFFPQSL S+PEF PISLRSRALLSENGDDSKFD
Sbjct: 1   MSVLHGVGLGLAFTNPNSTCNFHSNTRFFPQSLASVPEFHPISLRSRALLSENGDDSKFD 60

Query: 61  AVSTSTPTATDAKKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAEL 120
           A+STSTPT TD KKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAEL
Sbjct: 61  AMSTSTPTTTDPKKSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAEL 120

Query: 121 RAVLGDSIGNPEEMRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIG 180
           RAVLGDSIGNPEEMRKKVE+RVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIG
Sbjct: 121 RAVLGDSIGNPEEMRKKVEDRVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIG 180

Query: 181 SPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDI 240
           SPTDRDVDLIGS+IQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDI
Sbjct: 181 SPTDRDVDLIGSIIQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDI 240

Query: 241 SDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTS 300
           SDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTS
Sbjct: 241 SDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTS 300

Query: 301 PDYGYKSFKI 311
           P+YGYKSFKI
Sbjct: 301 PEYGYKSFKI 310

BLAST of Cucsa.281100 vs. NCBI nr
Match: gi|720060625|ref|XP_010274923.1| (PREDICTED: uncharacterized protein LOC104610134 [Nelumbo nucifera])

HSP 1 Score: 454.9 bits (1169), Expect = 1.1e-124
Identity = 221/278 (79.50%), Postives = 247/278 (88.85%), Query Frame = 1

Query: 35  SIPEFRPISLRSR--ALLSENGDDSKFDAVSTSTPTATDAKKSSGTSARSRRLLKLREEK 94
           S+P    ++  SR  A++     D+  D ++      T A K SGT+ARSRRLLK++EEK
Sbjct: 50  SLPIIGDVNSNSRRLAMIRAAASDNSGDTINNKD---TMAAKGSGTTARSRRLLKVKEEK 109

Query: 95  RKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGRDFQKS 154
           RKRE+DRLHNYP+WAK+LEDAC+ND+ELRAVLGDSIGNPE+MRKKVEERVR+KGRDF+KS
Sbjct: 110 RKREYDRLHNYPSWAKILEDACRNDSELRAVLGDSIGNPEQMRKKVEERVRKKGRDFRKS 169

Query: 155 KTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFNSSNLQ 214
           KTGS+LAFKVSFRDFNPLDSYIWFEL GSP+DRDVDLIGSVIQSWYVMGRLGAFNSSNLQ
Sbjct: 170 KTGSVLAFKVSFRDFNPLDSYIWFELYGSPSDRDVDLIGSVIQSWYVMGRLGAFNSSNLQ 229

Query: 215 LANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVLLNCLT 274
           LANSS EYNP+YDADKGFKVM SSFHDISDVEFQDNWGRVWVDLGT D+FA+DVLLNCLT
Sbjct: 230 LANSSFEYNPLYDADKGFKVMPSSFHDISDVEFQDNWGRVWVDLGTCDFFAVDVLLNCLT 289

Query: 275 VLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           VLSSEYLGIQQVVFGG RMGDWEEGMT+P+YGYK FKI
Sbjct: 290 VLSSEYLGIQQVVFGGHRMGDWEEGMTNPEYGYKHFKI 324

BLAST of Cucsa.281100 vs. NCBI nr
Match: gi|734423210|gb|KHN42114.1| (hypothetical protein glysoja_021604 [Glycine soja])

HSP 1 Score: 448.7 bits (1153), Expect = 8.0e-123
Identity = 219/283 (77.39%), Postives = 251/283 (88.69%), Query Frame = 1

Query: 28  FFPQSLTSIPEFRPISLRSRALLSENGDDSKFDAVSTSTPTATDAKKSSGTSARSRRLLK 87
           +FP S T    FRP+ L S + +S++   S   + S       +  K SGT+AR RRLL+
Sbjct: 24  WFPSSTTK-RSFRPV-LVSVSAISDDNSHSYTSSSSNGRKLEEEGIKGSGTTARDRRLLR 83

Query: 88  LREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEEMRKKVEERVRRKGR 147
           +R+EKR+RE+DRL+NYPAWAKVLE+ACK+DAELRAVLGDSIGNPE MRK+VE+RVR+KGR
Sbjct: 84  IRQEKRQREYDRLNNYPAWAKVLENACKDDAELRAVLGDSIGNPELMRKRVEDRVRKKGR 143

Query: 148 DFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSVIQSWYVMGRLGAFN 207
           DFQKSKTGS+LAFKV+FRDFNPLDSYIWFEL GSP+DRDV+LIG+VIQSWYVMGRLGAFN
Sbjct: 144 DFQKSKTGSVLAFKVTFRDFNPLDSYIWFELFGSPSDRDVNLIGNVIQSWYVMGRLGAFN 203

Query: 208 SSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVWVDLGTSDYFAIDVL 267
           SSNLQLANSS+EY+P+YDADKGFKVM SSFHDISD+EFQ+NWGRVWVDLGTSDYFAIDVL
Sbjct: 204 SSNLQLANSSVEYDPLYDADKGFKVMPSSFHDISDIEFQENWGRVWVDLGTSDYFAIDVL 263

Query: 268 LNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           LNCLTVLSSEYLGIQQ+VFGGRRMGDWEEGMTSP+YGYK FKI
Sbjct: 264 LNCLTVLSSEYLGIQQIVFGGRRMGDWEEGMTSPEYGYKYFKI 304

BLAST of Cucsa.281100 vs. NCBI nr
Match: gi|661879946|emb|CDP16398.1| (unnamed protein product [Coffea canephora])

HSP 1 Score: 448.0 bits (1151), Expect = 1.4e-122
Identity = 209/237 (88.19%), Postives = 231/237 (97.47%), Query Frame = 1

Query: 74  KSSGTSARSRRLLKLREEKRKREHDRLHNYPAWAKVLEDACKNDAELRAVLGDSIGNPEE 133
           K SGT+AR RRL+K+REEKRKRE+DRLHNYPAWAKVLEDACKNDAELRAVLGD+IGNPE 
Sbjct: 24  KGSGTTARGRRLIKVREEKRKREYDRLHNYPAWAKVLEDACKNDAELRAVLGDTIGNPEL 83

Query: 134 MRKKVEERVRRKGRDFQKSKTGSILAFKVSFRDFNPLDSYIWFELIGSPTDRDVDLIGSV 193
           MRK+VEERVRRKGRDFQKSKTGS+LAFKVSFRDFNPLDSYIWFEL GSP+DRDVDL+GSV
Sbjct: 84  MRKRVEERVRRKGRDFQKSKTGSVLAFKVSFRDFNPLDSYIWFELYGSPSDRDVDLLGSV 143

Query: 194 IQSWYVMGRLGAFNSSNLQLANSSMEYNPVYDADKGFKVMQSSFHDISDVEFQDNWGRVW 253
           IQSWY+MGR+GAFNSSNLQLANSSMEY+P+YDADKGFKVM SSFHDISDVEFQDNWGR+W
Sbjct: 144 IQSWYIMGRIGAFNSSNLQLANSSMEYDPLYDADKGFKVMPSSFHDISDVEFQDNWGRIW 203

Query: 254 VDLGTSDYFAIDVLLNCLTVLSSEYLGIQQVVFGGRRMGDWEEGMTSPDYGYKSFKI 311
           VDLGTSD+F+ID+LLNCLTVLSSEY+GIQQV+FGGR++GDWEEGMTSP+YGYK FKI
Sbjct: 204 VDLGTSDFFSIDILLNCLTVLSSEYVGIQQVIFGGRKIGDWEEGMTSPEYGYKFFKI 260

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
A0A0B2SCW4_GLYSO5.6e-12377.39Uncharacterized protein OS=Glycine soja GN=glysoja_021604 PE=4 SV=1[more]
A0A068V8Y6_COFCA9.6e-12388.19Uncharacterized protein OS=Coffea canephora GN=GSCOC_T00018256001 PE=4 SV=1[more]
F6HVX8_VITVI1.2e-12289.87Putative uncharacterized protein OS=Vitis vinifera GN=VIT_10s0071g01140 PE=4 SV=... [more]
K7K0P8_SOYBN1.6e-12277.03Uncharacterized protein OS=Glycine max GN=GLYMA_03G000100 PE=4 SV=1[more]
A0A0B2QHK6_GLYSO3.6e-12276.68Uncharacterized protein OS=Glycine soja GN=glysoja_037211 PE=4 SV=1[more]
Match NameE-valueIdentityDescription
AT5G08400.12.9e-11563.50 Protein of unknown function (DUF3531)[more]
AT4G29400.15.3e-4843.81 Protein of unknown function (DUF3531)[more]
Match NameE-valueIdentityDescription
gi|449458716|ref|XP_004147093.1|5.7e-177100.00PREDICTED: uncharacterized protein LOC101211689 [Cucumis sativus][more]
gi|659090247|ref|XP_008445913.1|3.6e-17195.81PREDICTED: uncharacterized protein LOC103488796 [Cucumis melo][more]
gi|720060625|ref|XP_010274923.1|1.1e-12479.50PREDICTED: uncharacterized protein LOC104610134 [Nelumbo nucifera][more]
gi|734423210|gb|KHN42114.1|8.0e-12377.39hypothetical protein glysoja_021604 [Glycine soja][more]
gi|661879946|emb|CDP16398.1|1.4e-12288.19unnamed protein product [Coffea canephora][more]
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR021920DUF3531
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0010207 photosystem II assembly
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
Cucsa.281100.1Cucsa.281100.1mRNA


Analysis Name: InterPro Annotations of cucumber (Gy14)
Date Performed: 2017-01-17
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR021920Protein of unknown function DUF3531PFAMPF12049DUF3531coord: 161..299
score: 1.1
NoneNo IPR availablePANTHERPTHR33102FAMILY NOT NAMEDcoord: 30..310
score: 2.3E
NoneNo IPR availablePANTHERPTHR33102:SF13DVL17coord: 30..310
score: 2.3E