Csa1G042880.2 (mRNA) Cucumber (Chinese Long) v2

Name: Csa1G042880.2
Type: mRNA
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Surfeit locus protein, putative; contains IPR002994 (Surfeit locus 1/Shy1)
Location: Chr1: 4556343..4560455 (-)
Sequence length: 765
The following sequences are available for this feature:

Gene sequence (with intron)

TCTAACTCTTCGTCTCACTCTCACAGAAAACGCGCGCCTCCCTAGCTCAGTTTTCCCTCGCCGTCCGGTGACCCTCCGCCCAGCTCGTCGTCGTCAAAAGTCCGGTAACCCTTGCCCAGCTTGTCGTTCGCCTCTGCTGTCCCTCCCAATCAACAGAGGAGCCAGGTGAAGACGAAGACTGAGAAACTCTTCAAATGTCAGCATCAAGAACATGGCATCTTCTTCCTTAGCTAAATCCATCACAAAATTTCGCCCTTGTTTTTCCCTTTCTGGCCATTCTTCGACGCCTTTACCTTCATCTTCTTCTTCCTTCAGTTCTGCCGCGGTAGTTTCTTCTACTCCTGATCCCAACTCATCTTCCCTTTCGCAACCTCAACGTGAGTTTCTCGACTTTATTCTATCTGGGTTGCTCTTAATTTCGGGAAATGATTTAGATTCATGTGATTTTGATTGTTGAAAATGGCGTATTGCTTTCTGGGTGGTGTTCTTAGAGAAACAAAGAGAGTCGAGATTGTCGAAATGGCTACTGTTTCTACCTGGTGCTCTCACGTTTGGCCTTGGAACGTGGCAGATTTTCAGGAGGCAAGAGAAGGTATTTCCATTTACTGTTAGATTTTATACTGTGCTTGTTTGAATTATCTGATTTCAGTTTGGTTTATCAATTTCTTTGTCTCGCCCCTAACGAATTAAGCTGTAATTGATCTAATTCAAGTATGCCTGAACACTTTCTAGTGTTCAAAACTAAAGATGCCTCCCCTCCCTCTTTCGAGCAATCGGCTCTGGAAATAGGGATATACCTTTCATTACTAGTTTTTATCATTGATCCAGTTGAAATGAGTAAATTATCTCCATTTGAAAGAACGCACATCTCCAAAAGAATGTAGGTCTTCAAGAAGTTTCTTGACGCTTCTGTGCTCTCTATATACTATTCTAGATGGTTGATCAAGTTTAGTCGCTTGGATCATATTATAGATGGGTGATGGATTTCTAACTTTTATAATTGAGGCACGAATTTTTTCATGACTTAGAGAATATGGCAACTGGTAAATGCAGATACTCTATTTAATTAAACATATGCATGTTAGGGTGTTGGAACTTTTTCATCAATTTAATCTCAGGCTTCTTAGGAGGCCAAAAGTAATTCCGGTTGTTAAGTGTCAAAAAATTTGCTTCCTTTTCACATTCGTCAAAGCTGAAAAACATTCTTTTGAACATCAAATTATTTAAGTAACATTGCAGGGAGCTGATTTGCACACATTTATCATTCAAGTACTTATCGTCTTTGACATTATATCTTGCAGATAGAAATGCTAGATTACAGGCGGAAGCGGTTGTTAATGGAACCTGTGAACATAAACAACTTATTGTCATTGGAAGACAAGCTCGATGATCTGGAGTTCAGGAGAGTGATCTGTAAAGGGGTTTTTGATGAGAAAAAGTCAATCTATGTTGGTCCACGTTCAAGAAGTATTTCAGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGATGCCAATTCCTGGCCTTCCTGATAGGTATTTTCCAGTGAAACTTGATGTTGACTATACCATTTTACTATCAGCATTTTGTGAGAGAGAACCTGCATTGCTCATATTCCTATGATAATGATCAAAGATTTTAATATCATCAATATCAAACAATCTAATAATATTAAAAACATCAATATCTGATTCAGAACATCGCATGAAGTTTCAATTCCAAAAAAAATTCTAGCAGAGAAGTGAATAAAACACACAATGGTTTGCTATTATCTGCAATTTTACATCAATTTAAACCAAAGAGGAACTTATTTTATTTTTTCTTATCAACAATGGCTTGATAAATTTATTTTCATTCTATAATTGTTCCACTCAATAGACTGGTGTATTTTTGAATTTTTACTATCTATCTTCTTGCATTATTATTATTATTATCGTATGCTTTGCAAGTTCATGGTAGTTTAAGTTCCTTCCAACTAATAACCATACATTCATG
ATTTCTAATATGAAATTATGAATGGCAATTTTTTGCTGGAACTCTTTCATAGTAATGGAGCCAAGTTAGTGGAACCATGATAACCTCTACCTTGTCATGATTGAACAACATATGAGACTATAGATAGATATGTGCAAAGAATTCTGAAGGACATAAAAGACCAAATATCGCCTTCATTTCTGGCAATTCCCATTTTCATGTGGTGTAAAAATGAAATTATGTACTTCTGGATGAATTTAGTAGTTTGATATTTGTTTTATGATTCCATTTGACCAAGAATGTACCGAATAACCCTCCTCCAGTGTGCAGTCACCAGTTCTGGTTAATAGAGGCTGGGCTCCACGCACTTGGAAGGAGAAGGCTTTAGAAGTAAATCAACAAGGTAGTGAGCAGTCTTCAGATATTGTACCATCCTTGGTTCAAGGGGGTGAGAGAAGCTCTTGGTGGAAGTTCTGGTCAAAAAAGACTGAAAGTCTAGAGGTTCTTTCTGATATTTTCACATGTTAAATGTCAAATAACATCCATTCAAACACTATAATTCTTACTTTACTTGATGTACAGAATGAAATCACTCCCATTACTCCAGTAGAAGTTATCGGAGTAGTCCGTACAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGATCCTGGCTCTAGGCAATGGTTTTATGTTGATGTTCCAGCAATTGCACGCTCTTCTGGTCTACCTGAGGATACTATTTATGTTGAAGACATAAATGAGAATGTGAATCCAAGTGACCCTTATCCCATTCCAAAGGATGTTAATACCTTGATACGGAGTTCTGTTATGCCACAGGATCATCTGAATTACACGTTAACATGGTATGTCGTTTGTCTTTTTCATATATTCTTTCCATTTAAAGAGTTCTATATTTCCTACTATGAATGCGAAGTACTTTCTAAGCATTGATTGATTTTACACGTGGAAAAGTTTCAGTTTCTTTTACTTGTGATTAATTATCATTCCAATTATATTTAGAATTAAGATGCTGCATCTGATCCACTACTAAATGTTGGTTGATGGCATAACCCTGGCACCTCTTCGTGCTGCCAAGAAAGTGGATGGAATTAGTTCTCAAGCTCATAATCGCTCAAAATTTTCTAAGAATTCAAACTTTTCTGAATTTCCCCCAATTCTTGTAGATTTCATTTTAAAAAAATGAATCTATAATTCCTTTAGAAGGGTCATGAACTTTTTGAGCCGCTTCAGTTTTAGCTTAACCTATGATTTTTAAGTTTACGCTACAAGCTTATATCAACTTATAATCCTTGCGAGAAGATAACAAAAGTAAACTAAAAACTATTTCAATGTGAAATAAGTTTTATTTGGAAGTCATACACATTCTTATTCTTGCAATGGCTGATCTTTGTGCAGGTACTCTCTCTCAGCTGCTGTAACCTTTATGGCTTTCAAAAGACTGAGACAAAAAACAAGTCGAAGATAACGAGAACTCATGCTTGACATTAGAGCAGGAATGAATCCACCCGGAATTCGTAAATGTATGTGAGTTAATTTTGTTGGTATTGATTTTCATAATGCTTTTCACTAGAGAAGCCCGAATACCTCATTTCTTAAGGACCTGAATACCTCATTTCTTAAGGACTTTTTGGACTAGAAAACTGCTCCGATATCCTCAATCCCTTTTGTGATAGATACAGCGTGGGGTGTAGACAACAAAGCTTTCTGCAACCACTAAAACAGAGCAAGCATCTTGATTTTTTCATAACTAATAAATTTGTTAGCCAATTTGAAGCATTGTTTCTTTTTTTAACTGTCGTGTTTTCTGGTGTATGAACAGCTAACTTGCCAATATGAGCATAGTTCAGCGGTAATTGGTTGTGTTGTTCAAACTACCTACCCTCTATTAACTTGAAAAAAAAAGGTTGGTCGTCTCCATCATATTCTTTGTCCATGTTTCTCTTTTTCCTTAAATCTTTCATTTTCAAGTTTATTAGGCATAGTTGAATGTGAAAGTG
ACATTAAGATATCACTTGTTTGGTGGGTTACCATTGATGTCTACAGATATTGAAAACTTGTTAGGTTGGATGCTTTCAACTTCTTATTGGATTTTAACTTTTTATCCTCTCCC

mRNA sequence

ATGCTAGATTACAGGCGGAAGCGGTTGTTAATGGAACCTGTGAACATAAACAACTTATTGTCATTGGAAGACAAGCTCGATGATCTGGAGTTCAGGAGAGTGATCTGTAAAGGGGTTTTTGATGAGAAAAAGTCAATCTATGTTGGTCCACGTTCAAGAAGTATTTCAGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGATGCCAATTCCTGGCCTTCCTGATAGTGTGCAGTCACCAGTTCTGGTTAATAGAGGCTGGGCTCCACGCACTTGGAAGGAGAAGGCTTTAGAAGTAAATCAACAAGGTAGTGAGCAGTCTTCAGATATTGTACCATCCTTGGTTCAAGGGGGTGAGAGAAGCTCTTGGTGGAAGTTCTGGTCAAAAAAGACTGAAAGTCTAGAGAATGAAATCACTCCCATTACTCCAGTAGAAGTTATCGGAGTAGTCCGTACAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGATCCTGGCTCTAGGCAATGGTTTTATGTTGATGTTCCAGCAATTGCACGCTCTTCTGGTCTACCTGAGGATACTATTTATGTTGAAGACATAAATGAGAATGTGAATCCAAGTGACCCTTATCCCATTCCAAAGGATGTTAATACCTTGATACGGAGTTCTGTTATGCCACAGGATCATCTGAATTACACGTTAACATGGTACTCTCTCTCAGCTGCTGTAACCTTTATGGCTTTCAAAAGACTGAGACAAAAAACAAGTCGAAGATAA

Coding sequence (CDS)

ATGCTAGATTACAGGCGGAAGCGGTTGTTAATGGAACCTGTGAACATAAACAACTTATTGTCATTGGAAGACAAGCTCGATGATCTGGAGTTCAGGAGAGTGATCTGTAAAGGGGTTTTTGATGAGAAAAAGTCAATCTATGTTGGTCCACGTTCAAGAAGTATTTCAGGAGTGACTGAAAATGGCCATTATGTGATTACCCCATTGATGCCAATTCCTGGCCTTCCTGATAGTGTGCAGTCACCAGTTCTGGTTAATAGAGGCTGGGCTCCACGCACTTGGAAGGAGAAGGCTTTAGAAGTAAATCAACAAGGTAGTGAGCAGTCTTCAGATATTGTACCATCCTTGGTTCAAGGGGGTGAGAGAAGCTCTTGGTGGAAGTTCTGGTCAAAAAAGACTGAAAGTCTAGAGAATGAAATCACTCCCATTACTCCAGTAGAAGTTATCGGAGTAGTCCGTACAAGTGAGAAGCCTAGCATATTTGTGCCAGCAAATGATCCTGGCTCTAGGCAATGGTTTTATGTTGATGTTCCAGCAATTGCACGCTCTTCTGGTCTACCTGAGGATACTATTTATGTTGAAGACATAAATGAGAATGTGAATCCAAGTGACCCTTATCCCATTCCAAAGGATGTTAATACCTTGATACGGAGTTCTGTTATGCCACAGGATCATCTGAATTACACGTTAACATGGTACTCTCTCTCAGCTGCTGTAACCTTTATGGCTTTCAAAAGACTGAGACAAAAAACAAGTCGAAGATAA

Protein sequence

MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGGERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKTSRR*
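
As a quick consistency check, the protein sequence above can be reproduced by translating the CDS with the standard genetic code; the 765 nt of the CDS correspond to 254 codons plus a stop codon. The sketch below is illustrative only and is not part of the database record: `translate` is a hypothetical helper name, and only the first eight and last five codons of the CDS are used as input.

```python
# Standard genetic code (NCBI translation table 1), listed in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First eight codons and last five codons of the CDS listed above.
print(translate("ATGCTAGATTACAGGCGGAAGCGG"))  # MLDYRRKR
print(translate("ACAAGTCGAAGATAA"))           # TSRR*
```

Passing the full CDS string to `translate` should give the complete protein, ending in `*`.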
BLAST of Csa1G042880.2 vs. Swiss-Prot
Match: SURF1_ARATH (Surfeit locus protein 1 OS=Arabidopsis thaliana GN=SURF1 PE=2 SV=1)

HSP 1 Score: 331.6 bits (849), Expect = 7.4e-90
Identity = 157/253 (62.06%), Positives = 199/253 (78.66%), Query Frame = 1

Query: 2   LDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTEN 61
           L+Y+++RL MEP+ +N    L+  L+ LEFRRV CKGVFDE++SIY+GPRSRSISG+TEN
Sbjct: 103 LEYQQQRLNMEPIKLNIDHPLDKNLNALEFRRVSCKGVFDEQRSIYLGPRSRSISGITEN 162

Query: 62  GHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGGE 121
           G +VITPLMPIPG  DS+QSP+LVNRGW PR+W+EK+ E + +    ++    +     E
Sbjct: 163 GFFVITPLMPIPGDLDSMQSPILVNRGWVPRSWREKSQE-SAEAEFIANQSTKAKSPSNE 222

Query: 122 RSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIA 181
             SWWKFWSK     +  I+ + PVEV+GV+R  E PSIFVP+NDP + QWFYVDVPA+A
Sbjct: 223 PKSWWKFWSKTPVITKEHISAVKPVEVVGVIRGGENPSIFVPSNDPSTGQWFYVDVPAMA 282

Query: 182 RSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTF 241
           R+ GLPE+TIYVED++E+V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWYSLSAAVTF
Sbjct: 283 RAVGLPENTIYVEDVHEHVDRSRPYPVPKDINTLIRSKVMPQDHLNYSITWYSLSAAVTF 342

Query: 242 MAFKRLRQKTSRR 255
           MA+KRL+ K  RR
Sbjct: 343 MAYKRLKAKPVRR 354
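
The identity and positives percentages in an HSP header are the match counts divided by the alignment length. The sketch below, offered under stated assumptions rather than as the database's own tooling, parses one such header line and recomputes both values. `parse_hsp_stats` is a hypothetical helper name, and the pattern also tolerates the "Postives" spelling that some report generators emit.

```python
import re

# Matches e.g. "Identity = 157/253 (62.06%), Positives = 199/253 (78.66%)".
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Recompute identity/positives percentages from an HSP header line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    return {
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        "reported_identity": float(ident_pct),
        "reported_positives": float(pos_pct),
    }


stats = parse_hsp_stats(
    "Identity = 157/253 (62.06%), Positives = 199/253 (78.66%), Query Frame = 1"
)
print(stats["identity"], stats["positives"])  # 62.06 78.66
```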

BLAST of Csa1G042880.2 vs. Swiss-Prot
Match: SURFL_ARATH (Surfeit locus protein 1-like OS=Arabidopsis thaliana GN=At1g48510 PE=2 SV=2)

HSP 1 Score: 200.7 bits (509), Expect = 2.0e-50
Identity = 107/257 (41.63%), Positives = 153/257 (59.53%), Query Frame = 1

Query: 2   LDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTEN 61
           LD R++ L M+P+ +N    L    D L FRRV+CKG+FDE++SIYVGP+ RS+S  +E 
Sbjct: 82  LDSRKQCLEMKPMKLNTTKDL----DGLGFRRVVCKGIFDEQRSIYVGPKPRSMSKSSEI 141

Query: 62  GHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQG---SEQSSDIVPSLVQ 121
           G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE    G   +        + + 
Sbjct: 142 GFYVITPLLPIPNEPNSMKSPILVNRGWVPSDWKENSLESLGTGGLVAAAKESRKANKLL 201

Query: 122 GGERSSWWKFWSKKTESL--ENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVD 181
             ++S   KFW K    +  E++++    VEV+GVVR SE P I+   N P S  WFY+D
Sbjct: 202 SSQQSLLSKFWYKLNNPMIVEDQVSRAMHVEVVGVVRKSETPGIYTLVNYPSSLAWFYLD 261

Query: 182 VPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS 241
           VP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P D+  YT+ W+  S
Sbjct: 262 VPKLALAMGFGEDTMYIESTYTDMDESRTYPVPRDVENLTRSKDIPLDYHLYTVLWHWSS 321

Query: 242 AAVTFMAFKRLRQKTSR 254
                 A   L ++ ++
Sbjct: 322 LTCFIKASSILMRRLTK 334

BLAST of Csa1G042880.2 vs. Swiss-Prot
Match: SURF1_DICDI (SURF1-like protein OS=Dictyostelium discoideum GN=surf1-1 PE=3 SV=2)

HSP 1 Score: 83.6 bits (205), Expect = 3.5e-15
Identity = 75/267 (28.09%), Positives = 122/267 (45.69%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLL------SLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRS 60
           ++   + R+  +P+ ++N        S    L+  EFRRV   G   + + + +GP  RS
Sbjct: 35  LIQRAKDRMEEDPIELSNSFIKNFKGSSFGDLNKYEFRRVYLNGKVIDNQYVLLGP--RS 94

Query: 61  ISGVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWK---------EKALEVNQQG 120
           I G    G+YVI+PL    G      + +L+NRGW+  T K         E+   ++Q+ 
Sbjct: 95  IDGTL--GYYVISPLQLSDG------TRILLNRGWSASTPKSNYKIPYAIEELKLIHQKE 154

Query: 121 SEQSSDIVPSLVQGGERSSWWKFWSKKTESLENEITPITPVEVIGVV-RTSEKPSIFVPA 180
            EQ         QG + S  +++++                 ++GV+ +T E+ S F P 
Sbjct: 155 KEQGQQ------QGNQESILYRYFN-----------------ILGVISKTKERGSAFTPT 214

Query: 181 NDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINE-NVNPSD-PYPIPKDVNTLIRSSVMP 240
           N P   QW+ +DV A+A         I   D  E N  PS  P P  K  +  + SS   
Sbjct: 215 NQPEKGQWYSLDVDAMADQLNTEPLMINTMDETEINSKPSSLPNPQFKRFDNDVESS-FH 267

Query: 241 QDHLNYTLTWYSLSAAVTFMAFKRLRQ 250
             H++Y  TWY+LSA++ F+ F+ +R+
Sbjct: 275 NKHMSYIGTWYTLSASLFFIYFRYMRK 267

BLAST of Csa1G042880.2 vs. Swiss-Prot
Match: SURF1_HUMAN (Surfeit locus protein 1 OS=Homo sapiens GN=SURF1 PE=1 SV=1)

HSP 1 Score: 72.4 bits (176), Expect = 8.0e-12
Identity = 71/255 (27.84%), Positives = 109/255 (42.75%), Query Frame = 1

Query: 8   RLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRS-----------IS 67
           R+L EPV    L +   +L +LE+R V  +G FD  K +Y+ PR+             IS
Sbjct: 97  RVLAEPVP---LPADPMELKNLEYRPVKVRGCFDHSKELYMMPRTMVDPVREAREGGLIS 156

Query: 68  GVTENGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSL 127
             T++G YV+TP                        T     + VN+ G      + P  
Sbjct: 157 SSTQSGAYVVTPFHC---------------------TDLGVTILVNR-GFVPRKKVNPET 216

Query: 128 VQGGERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVD 187
            Q G+              +E E      V++IG+VR +E    FVP N+P    W Y D
Sbjct: 217 RQKGQ--------------IEGE------VDLIGMVRLTETRQPFVPENNPERNHWHYRD 276

Query: 188 VPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS 247
           + A+AR +G   + I+++   ++  P  P  I       +R      +HL Y +TWY LS
Sbjct: 277 LEAMARITGA--EPIFIDANFQSTVPGGP--IGGQTRVTLR-----NEHLQYIVTWYGLS 297

Query: 248 AAVTFMAFKRLRQKT 252
           AA +++ FK+  + T
Sbjct: 337 AATSYLWFKKFLRGT 297

BLAST of Csa1G042880.2 vs. Swiss-Prot
Match: SURF1_MOUSE (Surfeit locus protein 1 OS=Mus musculus GN=Surf1 PE=1 SV=3)

HSP 1 Score: 62.8 bits (151), Expect = 6.4e-09
Identity = 33/106 (31.13%), Positives = 56/106 (52.83%), Query Frame = 1

Query: 146 VEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIARSSGLPEDTIYVEDINENVNPSDP 205
           V+++G+VR +E    FVP N P    W+Y D+ A+A+ +G   D I+++    +  P  P
Sbjct: 207 VDLVGIVRLTENRKPFVPENSPERNHWYYRDLEAMAKITGA--DPIFIDADFHSTAPGGP 266

Query: 206 YPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRQKT 252
             I       +R      +H+ Y LTWY L AA +++ F++  ++T
Sbjct: 267 --IGGQTRVTLR-----NEHMQYILTWYGLCAATSYLWFQKFVRRT 303


HSP 2 Score: 39.3 bits (90), Expect = 7.6e-02
Identity = 29/91 (31.87%), Positives = 42/91 (46.15%), Query Frame = 1

Query: 25  KLDDLEFRRVICKGVFDEKKSIYVGPRS----------RSISGVTENGHYVITPLMPIPG 84
           +L +LE+R V  +G FD  K +Y+ PR+                TE+G +V+TP      
Sbjct: 118 ELKNLEYRPVKVRGHFDHSKELYIMPRTMVDPVREARDAGRLSSTESGAHVVTPFH---- 177

Query: 85  LPDSVQSPVLVNRGWAPRTWKEKALEVNQQG 106
               +   +LVNRG+ PR  K+   E  Q+G
Sbjct: 178 -CSDLGVTILVNRGFVPR--KKVNPETRQKG 201

BLAST of Csa1G042880.2 vs. TrEMBL
Match: A0A0A0LVW3_CUCSA (SURF1-like protein OS=Cucumis sativus GN=Csa_1G042880 PE=3 SV=1)

HSP 1 Score: 517.7 bits (1332), Expect = 8.1e-144
Identity = 254/254 (100.00%), Positives = 254/254 (100.00%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE
Sbjct: 92  MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 151

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG
Sbjct: 152 NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 211

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI
Sbjct: 212 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 271

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT
Sbjct: 272 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 331

Query: 241 FMAFKRLRQKTSRR 255
           FMAFKRLRQKTSRR
Sbjct: 332 FMAFKRLRQKTSRR 345

BLAST of Csa1G042880.2 vs. TrEMBL
Match: M5WUQ5_PRUPE (SURF1-like protein OS=Prunus persica GN=PRUPE_ppa007867mg PE=3 SV=1)

HSP 1 Score: 386.3 bits (991), Expect = 2.8e-104
Identity = 187/256 (73.05%), Positives = 217/256 (84.77%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYR+KRL MEPVN NN+    ++LD LEFRRVICKG FDE++SIYVGPRSRSISGVTE
Sbjct: 99  MLDYRQKRLEMEPVNFNNVSLSSEELDHLEFRRVICKGYFDEERSIYVGPRSRSISGVTE 158

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+YVITPL+P+   P+ VQ P+LVNRGW PR+WKEK+ EV++ G EQ S++ PS VQ  
Sbjct: 159 NGYYVITPLVPVSDKPERVQPPILVNRGWVPRSWKEKSSEVHEDG-EQPSNVAPSSVQEN 218

Query: 121 ERSSWWKFWSKKTESLE-NEITP-ITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVP 180
           ER SWW+FW KK++ +E ++ TP   PVE++GVVR SEKPSIFVP NDP S QWFYVDVP
Sbjct: 219 ERRSWWRFWMKKSKVVEVDQQTPAFAPVEIVGVVRGSEKPSIFVPPNDPKSSQWFYVDVP 278

Query: 181 AIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAA 240
           AIAR+ GLPEDT+Y+EDINENVNPS+PYP+PKDV  LIRSSVMPQDHLNYTLTWYSLSAA
Sbjct: 279 AIARTCGLPEDTVYIEDINENVNPSNPYPVPKDVGALIRSSVMPQDHLNYTLTWYSLSAA 338

Query: 241 VTFMAFKRLRQKTSRR 255
           VTFMAFKRLR K SRR
Sbjct: 339 VTFMAFKRLRPKKSRR 353

BLAST of Csa1G042880.2 vs. TrEMBL
Match: D7SJD1_VITVI (SURF1-like protein OS=Vitis vinifera GN=VIT_17s0000g02190 PE=3 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 1.8e-103
Identity = 186/254 (73.23%), Positives = 218/254 (85.83%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYRRKRL +EP+  +NL SL +KLD LEFRRV  KG FDEKKSIYVGPRSRSISGVTE
Sbjct: 97  MLDYRRKRLDLEPIPGSNLYSLNEKLDSLEFRRVKAKGFFDEKKSIYVGPRSRSISGVTE 156

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+Y+ITPLMPIP  PDSVQSP+LVNRGW PR+W++K L+ +    EQS +I    +Q  
Sbjct: 157 NGYYLITPLMPIPDDPDSVQSPILVNRGWVPRSWRDKFLQ-DLPVDEQSKNIASPSIQES 216

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           ERSSWW+FWSKK +++E+++  +TPVEV+GVVR SEKPSIFVP ND  SRQWFYVDVPAI
Sbjct: 217 ERSSWWRFWSKKPKTVEDQVPAVTPVEVVGVVRGSEKPSIFVPENDLCSRQWFYVDVPAI 276

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           +R+SGL E+TIYV+DINENVNPS+PYP+PK+V+TLIRSSVMPQDHLNYTLTWYSLSAAVT
Sbjct: 277 SRASGLAENTIYVDDINENVNPSNPYPVPKEVSTLIRSSVMPQDHLNYTLTWYSLSAAVT 336

Query: 241 FMAFKRLRQKTSRR 255
           FMAFKR+  K SRR
Sbjct: 337 FMAFKRITPKKSRR 349

BLAST of Csa1G042880.2 vs. TrEMBL
Match: A0A067L7C9_JATCU (SURF1-like protein OS=Jatropha curcas GN=JCGZ_22926 PE=3 SV=1)

HSP 1 Score: 383.6 bits (984), Expect = 1.8e-103
Identity = 185/254 (72.83%), Positives = 214/254 (84.25%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           ML+YR+KRL M P+  N++    ++LD LEFRRV CKGVFDEK+SIYVGPRSRSISGVTE
Sbjct: 88  MLEYRQKRLEMVPMKFNDVTPSSEQLDTLEFRRVACKGVFDEKRSIYVGPRSRSISGVTE 147

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+YVITPL+PI   P+SV+SP+LVNRGW PR WKE++LE++Q   EQ S I  S VQ G
Sbjct: 148 NGYYVITPLLPIANDPESVRSPILVNRGWVPRIWKERSLEISQD-VEQPSRITSSSVQEG 207

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           ER SWWKFWSKK +  E+++  +TPVEV+GVVR SEKPSIFVP NDP S QWFYVDVPAI
Sbjct: 208 ERISWWKFWSKKQKVTEDQVPSVTPVEVVGVVRGSEKPSIFVPQNDPSSHQWFYVDVPAI 267

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           AR+  LPE+T+Y+EDINENVN + PYP+PKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT
Sbjct: 268 ARACELPENTVYIEDINENVNSACPYPVPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 327

Query: 241 FMAFKRLRQKTSRR 255
           FMAFKRLR K SRR
Sbjct: 328 FMAFKRLRPKRSRR 340

BLAST of Csa1G042880.2 vs. TrEMBL
Match: A0A0B2SEV3_GLYSO (SURF1-like protein OS=Glycine soja GN=glysoja_041932 PE=3 SV=1)

HSP 1 Score: 378.6 bits (971), Expect = 5.8e-102
Identity = 178/254 (70.08%), Positives = 212/254 (83.46%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           ML+YR KRL MEP+  ++  S +++LD LEFR+V+CKG FD+KKS+YVGPRSRSISGVTE
Sbjct: 82  MLEYREKRLQMEPLKFSSAYSSDEELDSLEFRKVVCKGYFDDKKSVYVGPRSRSISGVTE 141

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+Y+ITPLMP+P  PDSV  P+LVNRGW PR+WK+K LE +Q    + +   PS V G 
Sbjct: 142 NGYYIITPLMPVPNCPDSVSIPILVNRGWVPRSWKDKFLEASQDEDLEDALPSPSHVDGS 201

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           +  SWW+FWSKK   +E+++  +TP+EV+GVVR SEKPSIFVPANDPGS QWFYVDVP I
Sbjct: 202 K--SWWRFWSKKPV-IEDQVASVTPIEVVGVVRGSEKPSIFVPANDPGSSQWFYVDVPGI 261

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           AR+ GLPE+TIY ED NENVNPS+PYP+PKDVNTLIRSSVMP+DHLNYTLTWYSLSAAVT
Sbjct: 262 ARACGLPENTIYFEDTNENVNPSNPYPVPKDVNTLIRSSVMPRDHLNYTLTWYSLSAAVT 321

Query: 241 FMAFKRLRQKTSRR 255
           FMAFKRLRQK  RR
Sbjct: 322 FMAFKRLRQKNKRR 332

BLAST of Csa1G042880.2 vs. TAIR10
Match: AT3G17910.1 (AT3G17910.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein)

HSP 1 Score: 331.6 bits (849), Expect = 4.1e-91
Identity = 157/253 (62.06%), Positives = 199/253 (78.66%), Query Frame = 1

Query: 2   LDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTEN 61
           L+Y+++RL MEP+ +N    L+  L+ LEFRRV CKGVFDE++SIY+GPRSRSISG+TEN
Sbjct: 103 LEYQQQRLNMEPIKLNIDHPLDKNLNALEFRRVSCKGVFDEQRSIYLGPRSRSISGITEN 162

Query: 62  GHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGGE 121
           G +VITPLMPIPG  DS+QSP+LVNRGW PR+W+EK+ E + +    ++    +     E
Sbjct: 163 GFFVITPLMPIPGDLDSMQSPILVNRGWVPRSWREKSQE-SAEAEFIANQSTKAKSPSNE 222

Query: 122 RSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAIA 181
             SWWKFWSK     +  I+ + PVEV+GV+R  E PSIFVP+NDP + QWFYVDVPA+A
Sbjct: 223 PKSWWKFWSKTPVITKEHISAVKPVEVVGVIRGGENPSIFVPSNDPSTGQWFYVDVPAMA 282

Query: 182 RSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVTF 241
           R+ GLPE+TIYVED++E+V+ S PYP+PKD+NTLIRS VMPQDHLNY++TWYSLSAAVTF
Sbjct: 283 RAVGLPENTIYVEDVHEHVDRSRPYPVPKDINTLIRSKVMPQDHLNYSITWYSLSAAVTF 342

Query: 242 MAFKRLRQKTSRR 255
           MA+KRL+ K  RR
Sbjct: 343 MAYKRLKAKPVRR 354

BLAST of Csa1G042880.2 vs. TAIR10
Match: AT1G48510.1 (AT1G48510.1 Surfeit locus 1 cytochrome c oxidase biogenesis protein)

HSP 1 Score: 200.7 bits (509), Expect = 1.1e-51
Identity = 107/257 (41.63%), Positives = 153/257 (59.53%), Query Frame = 1

Query: 2   LDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTEN 61
           LD R++ L M+P+ +N    L    D L FRRV+CKG+FDE++SIYVGP+ RS+S  +E 
Sbjct: 82  LDSRKQCLEMKPMKLNTTKDL----DGLGFRRVVCKGIFDEQRSIYVGPKPRSMSKSSEI 141

Query: 62  GHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQG---SEQSSDIVPSLVQ 121
           G YVITPL+PIP  P+S++SP+LVNRGW P  WKE +LE    G   +        + + 
Sbjct: 142 GFYVITPLLPIPNEPNSMKSPILVNRGWVPSDWKENSLESLGTGGLVAAAKESRKANKLL 201

Query: 122 GGERSSWWKFWSKKTESL--ENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVD 181
             ++S   KFW K    +  E++++    VEV+GVVR SE P I+   N P S  WFY+D
Sbjct: 202 SSQQSLLSKFWYKLNNPMIVEDQVSRAMHVEVVGVVRKSETPGIYTLVNYPSSLAWFYLD 261

Query: 182 VPAIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLS 241
           VP +A + G  EDT+Y+E    +++ S  YP+P+DV  L RS  +P D+  YT+ W+  S
Sbjct: 262 VPKLALAMGFGEDTMYIESTYTDMDESRTYPVPRDVENLTRSKDIPLDYHLYTVLWHWSS 321

Query: 242 AAVTFMAFKRLRQKTSR 254
                 A   L ++ ++
Sbjct: 322 LTCFIKASSILMRRLTK 334

BLAST of Csa1G042880.2 vs. NCBI nr
Match: gi|449439471|ref|XP_004137509.1| (PREDICTED: surfeit locus protein 1 [Cucumis sativus])

HSP 1 Score: 517.7 bits (1332), Expect = 1.2e-143
Identity = 254/254 (100.00%), Positives = 254/254 (100.00%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE
Sbjct: 92  MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 151

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG
Sbjct: 152 NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 211

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI
Sbjct: 212 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 271

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT
Sbjct: 272 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 331

Query: 241 FMAFKRLRQKTSRR 255
           FMAFKRLRQKTSRR
Sbjct: 332 FMAFKRLRQKTSRR 345

BLAST of Csa1G042880.2 vs. NCBI nr
Match: gi|659066886|ref|XP_008465733.1| (PREDICTED: surfeit locus protein 1 [Cucumis melo])

HSP 1 Score: 503.1 bits (1294), Expect = 2.9e-139
Identity = 246/254 (96.85%), Positives = 248/254 (97.64%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE
Sbjct: 92  MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 151

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NGHYVITPLMP+PGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSS  VPSLVQ G
Sbjct: 152 NGHYVITPLMPVPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSHTVPSLVQEG 211

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           ERSSWWKFWSKKTESLENEITPITPVEVIGV+RTSEKPSIFVPANDP SRQWFYVDVPAI
Sbjct: 212 ERSSWWKFWSKKTESLENEITPITPVEVIGVIRTSEKPSIFVPANDPDSRQWFYVDVPAI 271

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           ARSSGLPEDT YVEDINENVNPSDPYPIPKDVNTL RSSVMPQDHLNYTLTWYSLSAAVT
Sbjct: 272 ARSSGLPEDTFYVEDINENVNPSDPYPIPKDVNTLTRSSVMPQDHLNYTLTWYSLSAAVT 331

Query: 241 FMAFKRLRQKTSRR 255
           FMAFKRLRQKTSRR
Sbjct: 332 FMAFKRLRQKTSRR 345

BLAST of Csa1G042880.2 vs. NCBI nr
Match: gi|645269731|ref|XP_008240132.1| (PREDICTED: surfeit locus protein 1 [Prunus mume])

HSP 1 Score: 392.9 bits (1008), Expect = 4.3e-106
Identity = 188/255 (73.73%), Positives = 218/255 (85.49%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYR+KRL MEPVN NN+    ++LD LEFRRVICKG FDE++SIYVGPRSRSISGVTE
Sbjct: 99  MLDYRQKRLEMEPVNFNNVSLSSEELDHLEFRRVICKGYFDEERSIYVGPRSRSISGVTE 158

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+YVITPL+P+   P+ VQ P+LVNRGW PR+WKEK+ EV++ G EQ S++ PS VQ  
Sbjct: 159 NGYYVITPLVPVSDKPERVQPPILVNRGWVPRSWKEKSSEVHEDG-EQPSNVAPSSVQEN 218

Query: 121 ERSSWWKFWSKKTESLENEITP-ITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPA 180
           ER SWW+FW+KK + +E++ TP   PVE++GVVR SEKPSIFVP NDP S QWFYVDVPA
Sbjct: 219 ERRSWWRFWTKKPKVVEDQQTPAFAPVEIVGVVRGSEKPSIFVPPNDPKSSQWFYVDVPA 278

Query: 181 IARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAV 240
           IAR+ GLPEDT+Y+EDINENVNPS+PYP+PKDV TLIRSSVMPQDHLNYTLTWYSLSAAV
Sbjct: 279 IARTCGLPEDTVYIEDINENVNPSNPYPVPKDVGTLIRSSVMPQDHLNYTLTWYSLSAAV 338

Query: 241 TFMAFKRLRQKTSRR 255
           TFMAFKRLR K SRR
Sbjct: 339 TFMAFKRLRPKKSRR 352

BLAST of Csa1G042880.2 vs. NCBI nr
Match: gi|694330649|ref|XP_009356019.1| (PREDICTED: surfeit locus protein 1 isoform X3 [Pyrus x bretschneideri])

HSP 1 Score: 386.3 bits (991), Expect = 4.0e-104
Identity = 184/254 (72.44%), Positives = 214/254 (84.25%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYRRKRL +EP+N++N      +LD LEFRRV CKG FDEK+SIYVGPRSRSISGVTE
Sbjct: 165 MLDYRRKRLELEPLNLSNASPSSQELDQLEFRRVKCKGYFDEKRSIYVGPRSRSISGVTE 224

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+Y+ITPL+PIP  PDSVQ P+LVNRGW PR+WK++A +V++ G EQ SDI PS VQ  
Sbjct: 225 NGYYIITPLIPIPEKPDSVQPPILVNRGWVPRSWKDEASKVSKDG-EQPSDINPSSVQET 284

Query: 121 ERSSWWKFWSKKTESLENEITPITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVPAI 180
           ER SWW+ WSKK E +E++   + PVEV+GVVR SEKPSIFVP NDP S QWFYVDVPAI
Sbjct: 285 ERRSWWRLWSKKPEVVEDKTPAVAPVEVVGVVRGSEKPSIFVPPNDPNSGQWFYVDVPAI 344

Query: 181 ARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAAVT 240
           AR  GLPEDT+Y+ED NENVNPS+PYP+PKD+++LIRSSVMPQDHLNYTLTWYSLSAAVT
Sbjct: 345 ARKCGLPEDTVYIEDANENVNPSNPYPLPKDISSLIRSSVMPQDHLNYTLTWYSLSAAVT 404

Query: 241 FMAFKRLRQKTSRR 255
           FMAF RL+ K SRR
Sbjct: 405 FMAFMRLKPKKSRR 417

BLAST of Csa1G042880.2 vs. NCBI nr
Match: gi|595847352|ref|XP_007209306.1| (hypothetical protein PRUPE_ppa007867mg [Prunus persica])

HSP 1 Score: 386.3 bits (991), Expect = 4.0e-104
Identity = 187/256 (73.05%), Positives = 217/256 (84.77%), Query Frame = 1

Query: 1   MLDYRRKRLLMEPVNINNLLSLEDKLDDLEFRRVICKGVFDEKKSIYVGPRSRSISGVTE 60
           MLDYR+KRL MEPVN NN+    ++LD LEFRRVICKG FDE++SIYVGPRSRSISGVTE
Sbjct: 99  MLDYRQKRLEMEPVNFNNVSLSSEELDHLEFRRVICKGYFDEERSIYVGPRSRSISGVTE 158

Query: 61  NGHYVITPLMPIPGLPDSVQSPVLVNRGWAPRTWKEKALEVNQQGSEQSSDIVPSLVQGG 120
           NG+YVITPL+P+   P+ VQ P+LVNRGW PR+WKEK+ EV++ G EQ S++ PS VQ  
Sbjct: 159 NGYYVITPLVPVSDKPERVQPPILVNRGWVPRSWKEKSSEVHEDG-EQPSNVAPSSVQEN 218

Query: 121 ERSSWWKFWSKKTESLE-NEITP-ITPVEVIGVVRTSEKPSIFVPANDPGSRQWFYVDVP 180
           ER SWW+FW KK++ +E ++ TP   PVE++GVVR SEKPSIFVP NDP S QWFYVDVP
Sbjct: 219 ERRSWWRFWMKKSKVVEVDQQTPAFAPVEIVGVVRGSEKPSIFVPPNDPKSSQWFYVDVP 278

Query: 181 AIARSSGLPEDTIYVEDINENVNPSDPYPIPKDVNTLIRSSVMPQDHLNYTLTWYSLSAA 240
           AIAR+ GLPEDT+Y+EDINENVNPS+PYP+PKDV  LIRSSVMPQDHLNYTLTWYSLSAA
Sbjct: 279 AIARTCGLPEDTVYIEDINENVNPSNPYPVPKDVGALIRSSVMPQDHLNYTLTWYSLSAA 338

Query: 241 VTFMAFKRLRQKTSRR 255
           VTFMAFKRLR K SRR
Sbjct: 339 VTFMAFKRLRPKKSRR 353

The following BLAST results are available for this feature:

vs. Swiss-Prot
Match Name   E-value  Identity (%)  Description
SURF1_ARATH  7.4e-90  62.06         Surfeit locus protein 1 OS=Arabidopsis thaliana GN=SURF1 PE=2 SV=1
SURFL_ARATH  2.0e-50  41.63         Surfeit locus protein 1-like OS=Arabidopsis thaliana GN=At1g48510 PE=2 SV=2
SURF1_DICDI  3.5e-15  28.09         SURF1-like protein OS=Dictyostelium discoideum GN=surf1-1 PE=3 SV=2
SURF1_HUMAN  8.0e-12  27.84         Surfeit locus protein 1 OS=Homo sapiens GN=SURF1 PE=1 SV=1
SURF1_MOUSE  6.4e-09  31.13         Surfeit locus protein 1 OS=Mus musculus GN=Surf1 PE=1 SV=3

vs. TrEMBL
Match Name        E-value   Identity (%)  Description
A0A0A0LVW3_CUCSA  8.1e-144  100.00        SURF1-like protein OS=Cucumis sativus GN=Csa_1G042880 PE=3 SV=1
M5WUQ5_PRUPE      2.8e-104  73.05         SURF1-like protein OS=Prunus persica GN=PRUPE_ppa007867mg PE=3 SV=1
D7SJD1_VITVI      1.8e-103  73.23         SURF1-like protein OS=Vitis vinifera GN=VIT_17s0000g02190 PE=3 SV=1
A0A067L7C9_JATCU  1.8e-103  72.83         SURF1-like protein OS=Jatropha curcas GN=JCGZ_22926 PE=3 SV=1
A0A0B2SEV3_GLYSO  5.8e-102  70.08         SURF1-like protein OS=Glycine soja GN=glysoja_041932 PE=3 SV=1

vs. TAIR10
Match Name   E-value  Identity (%)  Description
AT3G17910.1  4.1e-91  62.06         Surfeit locus 1 cytochrome c oxidase biogenesis protein
AT1G48510.1  1.1e-51  41.63         Surfeit locus 1 cytochrome c oxidase biogenesis protein

vs. NCBI nr
Match Name                        E-value   Identity (%)  Description
gi|449439471|ref|XP_004137509.1|  1.2e-143  100.00        PREDICTED: surfeit locus protein 1 [Cucumis sativus]
gi|659066886|ref|XP_008465733.1|  2.9e-139  96.85         PREDICTED: surfeit locus protein 1 [Cucumis melo]
gi|645269731|ref|XP_008240132.1|  4.3e-106  73.73         PREDICTED: surfeit locus protein 1 [Prunus mume]
gi|694330649|ref|XP_009356019.1|  4.0e-104  72.44         PREDICTED: surfeit locus protein 1 isoform X3 [Pyrus x bretschneideri]
gi|595847352|ref|XP_007209306.1|  4.0e-104  73.05         hypothetical protein PRUPE_ppa007867mg [Prunus persica]

The following terms have been associated with this mRNA:
Vocabulary: INTERPRO
Term       Definition
IPR002994  Surf1/Shy1
Vocabulary: Cellular Component
Term        Definition
GO:0016020  membrane
GO Assignments
This mRNA is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0016020      membrane
molecular_function  GO:0003674      molecular_function

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Type
Csa1G042880   Csa1G042880  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name   Unique Name            Type
Csa1G042880.2  Csa1G042880.2-protein  polypeptide


The following three_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa1G042880.2.utr3p1  Csa1G042880.2.utr3p1  three_prime_UTR


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Type
Csa1G042880.2.cds5  Csa1G042880.2.cds5  CDS
Csa1G042880.2.cds4  Csa1G042880.2.cds4  CDS
Csa1G042880.2.cds3  Csa1G042880.2.cds3  CDS
Csa1G042880.2.cds2  Csa1G042880.2.cds2  CDS


The following five_prime_UTR feature(s) are a part of this mRNA:

Feature Name          Unique Name           Type
Csa1G042880.2.utr5p2  Csa1G042880.2.utr5p2  five_prime_UTR
Csa1G042880.2.utr5p1  Csa1G042880.2.utr5p1  five_prime_UTR


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description       Source   Source Term    Source Description       Alignment
IPR002994  Surfeit locus 1/Shy1  PFAM     PF02104        SURF1                    coord: 3..237, score: 9.7
IPR002994  Surfeit locus 1/Shy1  PROFILE  PS50895        SURF1                    coord: 1..251, score: 30
None       No IPR available      PANTHER  PTHR23427      SURFEIT LOCUS PROTEIN    coord: 143..254, score: 2.8E-79; coord: 6..106, score: 2.8
None       No IPR available      PANTHER  PTHR23427:SF2  SURFEIT LOCUS PROTEIN 1  coord: 143..254, score: 2.8E-79; coord: 6..106, score: 2.8