Csa4G515540 (gene) Cucumber (Chinese Long) v2

Name: Csa4G515540
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Endoribonuclease E-like protein
Location: Chr4 : 17979404 .. 17985486 (-)
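
As a rough illustration of how the Location field can be read, the sketch below parses the string into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Parse a location such as 'Chr4 : 17979404 .. 17985486 (-)'.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    pattern = r"\s*(\S+)\s*:\s*(\d+)\s*\.\.\s*(\d+)\s*\(([+-])\)\s*"
    m = re.fullmatch(pattern, text)
    if m is None:
        raise ValueError(f"unrecognised location: {text!r}")
    start, end = int(m.group(2)), int(m.group(3))
    return {
        "chrom": m.group(1),
        "start": start,
        "end": end,
        "strand": m.group(4),  # '-' means the gene lies on the reverse strand
        "length": end - start + 1,
    }

loc = parse_location("Chr4 : 17979404 .. 17985486 (-)")
print(loc["chrom"], loc["strand"], loc["length"])
```

Under that assumption the gene spans 6083 bp on the reverse strand of Chr4.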
   



The following sequences are available for this feature:

Gene sequence (with intron)

AAAACTGGACGGGCTTTACACCGTCGGGCCTCGGCTGAGTTCATTTTCGTAATTTCACCGAAATTTTGGTCTCCCCATTCGTACAAAACCACCCGCATTTGTTTGAGTGTCCAGTCTCCAGCTCCATCCACTTGCCTTTGTCTTGCGTTCTTTCAAAATTTCTATCAGCAATGGCTTTCACAAATCATCTCCCCTTTCAGTTCTACGTTTCCTCAACCAAACCTTTCATCTTCCCCAGCTTTTCCACCACTCTAAACCCACTTCCATCCATTTATTCTGCTTCACCATTCAAACCATCACCCAAAATTTCCAAATCCGACAACCGTACATCAGTTACAATCACAGCCCCATTACAAATATTCAACGCAAGTGAGTTCTCTTTCCTTACCATTTTGTTTTCTCTTCAGTTCTTTCCCTTTGCCCATAGTTAGGGCTTCTGAGTTCTCATCCGTTTGTTTTTAATCTTTAAAAGTAAAACCCTTCGCAATGTATCCAAAGTCACAGTCACACGAAAATCATCAAATGATTGGCTAGCCAATATTTTGTGTGTGTGTTCTTGTCTGCTTCTTGGGCTCTTGCAGAGTTCCAAGGATTCTAATACGAATGCTATCGGCATAGAATTAGGAATACTCTTTGGTTTTTAAAGGTATGTTAGTAATTAGTTTGGGAAGTTGTTATGGTCATTTTCATTTGTTTTAATTTGTGGGTATGTACGGATGAATGTATTCAATGGCTAATTTGGTTTAAGTTGAAGTAACACTACTCGGAATTAGCGAGGCTTCAAGTAACTCGAGTACTTGGTTTATCTTTGTAGTTCTTCTGTCTTTTACATTTCAATATCATTTTAGTTGTTTGGATTCTATCAGATCCATTGCCAGTTGAGAGCAAAAACTTGCTTCTTAGGTGCACGAGTGAATGATGTAGCTACATCTGAAAAGGAAGAGCAAGTAGAGATGGAAGTTGCAAAGGGATATAGCCTCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACCAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATCGTGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCGGATTGGGAGGATGATCCAATTATGAAAGAGAAGTTAATATCACTTAGGAGAAAAGTTAAAAAGGTACGTCCTTGTCCACTTCATTTTCCCTCCATTTTTTATAACTTTCTTGGTATGATTTGTGTTCATGAATACAATATCTTGTTGCATGGGTTTTTGTTTGCAAGAATCTGAAAACAAAATTTAGAGGAGAATTTCGACTGTAAATGTTCATTGCTTTCAATTTGATAATTATTTCCGATGACCAGAGTTTAATTTGTAAACAATTTGAGCTGCGACTTTTGTTTTATATTTCTGCATTTTTTATAACAAGTGAGAAATATACAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCGACTGACATTAATGCGATAGTTGCAAAGCGGCACAAAGAGTTCACAGATGAGTTTTTTAAGTTCCTAACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGGTAATATATTAAAAGTTTCATTACTGATCATAGTTCTGATGTTGGATAACCAAGTGTTTGTATTAGATTAAACCAAGTCTCTTATTAATTCAGTTTAGATTTAATCTCAGATTTCTTCTGGGGGATAAGATAATGTTTAGTTTATATTTATTGCGGCCTCATATTTAAACCAATTTTACAAGTATTAATCTAAATCTAGAAGCCCCATCAAACATATCACTCCTGAGAACTGAATAATAAAATTATCATTTTTTCTCCAATTATTGCGATTTTTTTTCTAGTAAAGTCTGAACTGTGCATTCACACTGGAACATAGATTTCATTCTTTTTCTTTCTTATCTTTATGATTTAGGATCTAGTAATGACATTTATAATCTATAGTAGTTATC
TCAAGAATTGACTAAAGCCCAGAGTTGGAACAGTATTTTAACTTGGCAGATAGAATTGCTCGTCTGAAAAAAAAAAAAATCTTATAACTAGTTCTCTCTTAAACTAATTTTTCTATTTGAGCAGCTGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCGTACAACCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATAATATTCTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAAAATGAGGTAATGTTTAGAATTAAATCGGATCTCACTACTTCAACTGGCATGTTTATTTCATTTTAAATTTAATACCTAAATAATTGCTCTTGTTAACTTTGAAGTTCTTGAAAGTAACTCCCTGCAACAAACCCAGTTTTGCCCAAGTTGGAAACTAACTCTTATTTTCAAGTTAGTCATATTTTTGAGAACACATTCTTTCATGTACGAGAATAAAAACAAAATACACGACTATTATACAAAATAGAAAACACATAAAAAAAACAAAAGCTCAGTCTGAGACTCCAATCCATACGGATCAAGCCTAATTGTTTCATACATTGTGTTGGACATGCTTTTTAGAAAGCTTATTTTAATGTGAAGCTCTGATTATTTCAGCAGCATTGAAAAAATGCTAATCAAATTTTTCCTTTTAAAGCATTTACTGAATTTTTCTAACCACTTTTGATGATAGAAAATCTTTAAAACTTAACCAAGCACTATCAATTTAAAAAAGAAGAAAAATAGAAAATTTCATTTCTAATCACTTTAGAAAGCATGCCAAACACACACTAATACAAAGAAACGAGAACTGTTTAAAACTTCTCATTGTAGTTGATATTTTGAAATTGGTTTACAAACTCGATTGAAAGTCTACGAAGTGCTGACAATCAATTTATAAGAGAAAGACCACATGAAGATTCTATGGCCTTAGTGAAAAAATCTATTAGGATACCACCTAACTGGAATAAGATTTGAAATGGTGAAATCTAAAAAGAGGAACTACTAGATAACTGTTAGAATTTAGAAGACTTCCATGCAACAACATTCAAGAACCAGAAAAAACAAAAGATAGAAATATATACAACATCAATGAAGAAACTACCAACAAAAATTTCATAAAACTCTTCACACCTTACCCCCAATGCACTCCTTCTATTTATAGCTAAAAGTTAACAAACCTACTAGCTATTTACTCAGATGCCCCTTCTAATACTTCAACATTCAAGAACCAGAAAAAACAAAAGATAGAAATATATACAACATCAATGAAGAAACTACCAACAAAAATTTCATAAAACTCTTCACACCTTACCCCCAATGCACTCCTTCTATTTATAGCTAAAAGTTAACAAACCTACTAGCTATTTACTCAGATGCCCCTTCTAATACTTATACTAATATTCTACTAATAATCCTATGATATCTCTAACTAGGTCTCTCACAATAACCCAAGTATAAAGGAACTTGGAACCTCCTCTTGATTATGCTAACCCAAGCCCTAGATCACTCAACAATTTCCTCCCTACCAGCCTATTTCCTCCCTCTATTTATATCCTAATACATTAATTCCCCAATTAATTACCCTTACGTCCCCCACTAACTCCATACTAATATTTCTAATTAAGACCCTTACAAAATCTTCCATCTGTATTGGCTATTGAGTTGAGCAGTCTAAAAGTACGTTCTAAAGCTTTCCAACAGTTTCAGATGCAACTATTATGGCTCATTCTCTGTGTCTCTGTTTGTGAATGTAGGTGAAAGAAATAATGTATCATTTATACAAGGCCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTGAACATCGTAGATCCTGAAGAACGATTT
TCAGCTTTAGCAACAACCTTCTCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACGTGAGTAAGCTTCATATCATAAATTATTGTACATCTTCATGATAATTACATATTTCTCCTATAGGTTTCATTAATCATCGGAGTTTATCTCAAAATATACCTACTACTTGGTTTTGGTCTTTGAACTTTTAAGTTTAGTTCATAATGTTTCCAAAACTTTTAGGAAGTGTATTTTAATTATTGTCAATTATTTATCAGATGGATAACATGGCCTAAATATGGAAGCCATATAGATTAGTTGAGATGAGTAATAGATGGGGAATATGTGAGCGCCCTAGTTAGAGATGTAATAGGATTATTAGTAGAATATTAGAGTGGTTATTAGAAGGGCATATTAGAAATTTGATAGCAAGTTGGTTAGGGTTTTTAGTTTAGTTATAAATAGAGTGAGTGGGTTGAGAGCAAGTTGTGAAGGATATTAAGGGATTCCCTTTGGAGAATTTGGGAGAGTCTAGGCCACTCAAAAGACAAGCAATTACCTTGTTGAGTTTTGAGATGATATTATAACACATGAATTTTTTTTGTGTGTTATTGAGTGTGTTCTTGTTAGGAGGTATTCTAACAGAATACTTTATGCACACACTCATATCGACTCTCAACCCTCATATTAGTTTTAATCTGTATAGCTTTACTTGAAAAAGGAAGTAAAAATAATGTAATATTATTCCTCTACATTATTGAATCAAATACAAGATCTTATATAGAGAAAGACCAACAACAAATAAGGAAAAAATATAGGTATAAATAAAAAAATAATACGGGTCTTTTAAAAAATATAACAAACCGTTAAAATATTTACATCGTATAGAATAATTCTAACAAAGGAAAAAGCCCACAGGCCTACAATGGGAAATACAAAAAATACACTAGTCAATGTGCGATTAATCGATCACATGCGCTCGTGTAATCGTCTTTTAAACGATCATGATACATGATCGTGTAGATAATGATACACGGTTGTGTAGGTAGTATCAACATGATTGTTTTATTCTTTTTAACGATGAGAAAAGAGCTTCAAATCTAAACGATCGTTTAAACGATCATTTAGATCATACCCACCTGATCATGTAGTTCTTTTTTAAACCATGGGAAAAGAGCTTCAAATCTAAACGATTGTGTTGTTATGGTAAACGATTGTTTAGATCATAACTACACGATCTTGTAGTTCTTTTTAAACGGTGGAAAATGGCTTCAACTCTAAAACGATCGTGTTGACCATGGTAAACGATCGTTTAGATCATGTCACGGTGATATTTAAACGATCTTGATATTCTTGAATATAAGGATGATGAAGTTCAATATTTTTTTTAAACGATCTTGAATATTCTTGTGATTGAAAAAGAAGAATAATACGTTTAAGAATAGAAGGAAAAATTGGAAGTAGAGTGAAAGAAGTCTGAAAGAGAAGATGGATAAATTGGATTGTGCTACGGTGGTATGTGGTGAAGACAAGAGAATCTATATATACAATCTTGACTATTACTGACAAGTTGGCGCTAAGTCTTTTGCAGGAGGTCAGTTTCAAGCACCAATAGATATGGTCTATACTCTTGTAGCCCGGATATAGAAGACATGACAAATAGACTCCTAGTGATATGGTCTTTCTTTAATATTCGTCAAGATTTAAATCTAGATTTGGTCATATTTATAAATTCTTTTGTATTGTGTTATAATTGTAAATATTTGGATCTAATTGCTATATTTATAACTGTATCATTTTGTAATTGGCAGAACCCCGAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTAAATCAAGAAGATACGGACATCAGAGAAGCAAGGAATATGACTCAGCCTATTGTTATACAAAGGCTATTCATCCTTAAGGATACTATTGAAACTGAGTATTTGGAACAGAATCAGTTTCAGAATCCTCAATCAAGACCAAGTCATAA
TCATGGTTCTGAGGATGCAATCTCCATATAGTTTTAGTGTAAATGTCTCATATTTGTAATTTAGCCGCCACTAGAACCACTAG

mRNA sequence

ATGGCTTTCACAAATCATCTCCCCTTTCAGTTCTACGTTTCCTCAACCAAACCTTTCATCTTCCCCAGCTTTTCCACCACTCTAAACCCACTTCCATCCATTTATTCTGCTTCACCATTCAAACCATCACCCAAAATTTCCAAATCCGACAACCGTACATCAGTTACAATCACAGCCCCATTACAAATATTCAACGCAAGTGCACGAGTGAATGATGTAGCTACATCTGAAAAGGAAGAGCAAGTAGAGATGGAAGTTGCAAAGGGATATAGCCTCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACCAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATCGTGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCGGATTGGGAGGATGATCCAATTATGAAAGAGAAGTTAATATCACTTAGGAGAAAAGTTAAAAAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCGACTGACATTAATGCGATAGTTGCAAAGCGGCACAAAGAGTTCACAGATGAGTTTTTTAAGTTCCTAACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCTGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCGTACAACCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATAATATTCTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAAAATGAGGTGAAAGAAATAATGTATCATTTATACAAGGCCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCAGCTTTAGCAACAACCTTCTCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACCCCGAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTAAATCAAGAAGATACGGACATCAGAGAAGCAAGGAATATGACTCAGCCTATTGTTATACAAAGGCTATTCATCCTTAAGGATACTATTGAAACTGAGTATTTGGAACAGAATCAGTTTCAGAATCCTCAATCAAGACCAAGTCATAATCATGGTTCTGAGGATGCAATCTCCATATAG

Coding sequence (CDS)

ATGGCTTTCACAAATCATCTCCCCTTTCAGTTCTACGTTTCCTCAACCAAACCTTTCATCTTCCCCAGCTTTTCCACCACTCTAAACCCACTTCCATCCATTTATTCTGCTTCACCATTCAAACCATCACCCAAAATTTCCAAATCCGACAACCGTACATCAGTTACAATCACAGCCCCATTACAAATATTCAACGCAAGTGCACGAGTGAATGATGTAGCTACATCTGAAAAGGAAGAGCAAGTAGAGATGGAAGTTGCAAAGGGATATAGCCTCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACCAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATCGTGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCGGATTGGGAGGATGATCCAATTATGAAAGAGAAGTTAATATCACTTAGGAGAAAAGTTAAAAAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCGACTGACATTAATGCGATAGTTGCAAAGCGGCACAAAGAGTTCACAGATGAGTTTTTTAAGTTCCTAACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCTGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCGTACAACCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATAATATTCTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAAAATGAGGTGAAAGAAATAATGTATCATTTATACAAGGCCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCAGCTTTAGCAACAACCTTCTCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACCCCGAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTAAATCAAGAAGATACGGACATCAGAGAAGCAAGGAATATGACTCAGCCTATTGTTATACAAAGGCTATTCATCCTTAAGGATACTATTGAAACTGAGTATTTGGAACAGAATCAGTTTCAGAATCCTCAATCAAGACCAAGTCATAATCATGGTTCTGAGGATGCAATCTCCATATAG

Protein sequence

MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSRPSHNHGSEDAISI*
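
The relationship between the coding sequence and the protein sequence above can be checked with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and uses only the first and last few codons of the CDS as input; the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

# First seven codons and last five codons of the CDS listed above.
print(translate("ATGGCTTTCACAAATCATCTC"))
print(translate("GCAATCTCCATATAG"))
```

Passing the full CDS string to `translate` in the same way should reproduce the complete protein sequence, including the trailing `*` for the stop codon.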
BLAST of Csa4G515540 vs. Swiss-Prot
Match: Y4920_ARATH (Uncharacterized protein At4g37920, chloroplastic OS=Arabidopsis thaliana GN=At4g37920 PE=1 SV=2)

HSP 1 Score: 517.3 bits (1331), Expect = 1.6e-145
Identity = 268/403 (66.50%), Positives = 321/403 (79.65%), Query Frame = 1

Query: 11  FYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQIFNASARV 70
           F+ S+ K   FP  ++  + LP  +SA       KI KS   T  T T        SA V
Sbjct: 10  FFSSADKLLSFPPKNSQTHHLP--FSAF-INGGRKIRKSSTITFATDTVTYN-GTTSAEV 69

Query: 71  NDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESF 130
                S  E+ +E+EVA+GY+++QFCDKIID+FLNEKPK K+W+ +LV R+EW KY  +F
Sbjct: 70  K----SSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNF 129

Query: 131 YSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHK 190
           Y  C+ RAD E DPI+K+KL+SL  KVKKID EME H++LLKE+Q++PTDINAI AKR +
Sbjct: 130 YKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRR 189

Query: 191 EFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNI 250
           +FT EFF+++TL+SET D LEDRDAVARLA RCL+AVSAY+ TLE+VETLD+AQ KF++I
Sbjct: 190 DFTGEFFRYVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDI 249

Query: 251 LNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKS 310
           LNSPS+D ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMYHLYKATKS
Sbjct: 250 LNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKS 309

Query: 311 SLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKI 370
           SLRS+ PKEIKLLK+LLNI DPEERFSALAT FSPGD  E KDP ALYTTPKELHKWIKI
Sbjct: 310 SLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKI 369

Query: 371 MLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQ 414
           MLD+YHLN+E+TDI+EA+ M+QPIVIQRLFILKDTIE EYL++
Sbjct: 370 MLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDK 404

BLAST of Csa4G515540 vs. TrEMBL
Match: A0A0A0L3X1_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_4G515540 PE=4 SV=1)

HSP 1 Score: 862.8 bits (2228), Expect = 1.7e-247
Identity = 435/435 (100.00%), Positives = 435/435 (100.00%), Query Frame = 1

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP
Sbjct: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL
Sbjct: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 436
           SRPSHNHGSEDAISI
Sbjct: 421 SRPSHNHGSEDAISI 435

BLAST of Csa4G515540 vs. TrEMBL
Match: M5VYC6_PRUPE (Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006103mg PE=4 SV=1)

HSP 1 Score: 565.8 bits (1457), Expect = 4.4e-158
Identity = 286/422 (67.77%), Positives = 343/422 (81.28%), Query Frame = 1

Query: 4   TNHLPFQFYVSSTKPFIFPSFSTTLNPL----PSIYSASPFKPSPKISKSDNRTSVTITA 63
           +N +  +   S TKP IF   +T   P     P+     P   +   SK   +T    ++
Sbjct: 2   SNFMALELSNSITKPSIFFDATTFFPPNENVPPTTLLIPPISFASLKSKHRRKTRAAQSS 61

Query: 64  PLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVF 123
           P      S + +++AT++  EQVE+EVA+GY+++QFCDKIID+F+NEKP+ KEWRKFLVF
Sbjct: 62  P----TTSPQASNIATAQVAEQVEVEVAEGYTMTQFCDKIIDVFMNEKPRAKEWRKFLVF 121

Query: 124 REEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPT 183
           RE+W+KY+ESFY+ C+ RAD E D  MKEK  SL RKVKKIDDEME HSELLKE+QD+PT
Sbjct: 122 REDWEKYKESFYNRCRTRADMEGDQTMKEKFTSLGRKVKKIDDEMERHSELLKEIQDNPT 181

Query: 184 DINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVET 243
           D+NAIVA+R K+FT+EFF+ + L+SE +DSLEDRDA+ARL ARCL+AVSAY+ TLE VET
Sbjct: 182 DVNAIVARRRKDFTEEFFRHVNLLSEVYDSLEDRDAMARLGARCLSAVSAYDNTLEYVET 241

Query: 244 LDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKE 303
           LD+AQ KFD++LNSPS+DVACEKI SLAKAKELDSSL+LLINSAWASAKESTTMKNEVK+
Sbjct: 242 LDTAQAKFDDMLNSPSVDVACEKIKSLAKAKELDSSLVLLINSAWASAKESTTMKNEVKD 301

Query: 304 IMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYT 363
           IMYHLYKATKSSLRS+APKEIKLLKHLLNI DPEERFSALAT FSPGDG E KDP A+YT
Sbjct: 302 IMYHLYKATKSSLRSIAPKEIKLLKHLLNITDPEERFSALATAFSPGDGPEAKDPKAVYT 361

Query: 364 TPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNP 422
           TPKELHKWIKIMLD+YHLN E+TDIREA+ MTQP+VIQRLFILK+TIE EYLE++  Q  
Sbjct: 362 TPKELHKWIKIMLDAYHLNAEETDIREAKQMTQPVVIQRLFILKETIEEEYLERSTDQKS 419

BLAST of Csa4G515540 vs. TrEMBL
Match: D7T5B2_VITVI (Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0211g00010 PE=4 SV=1)

HSP 1 Score: 558.1 bits (1437), Expect = 9.2e-156
Identity = 290/418 (69.38%), Positives = 344/418 (82.30%), Query Frame = 1

Query: 4   TNHLPFQFYV-SSTKPFIFPS--FSTTLNPLPSIYSASPFKPSPKISK--SDNRTSV--- 63
           +N L F+  + ++TKP +     FS ++  LPSI S   F PS +      ++RT     
Sbjct: 2   SNLLGFKLLLCNTTKPSVLNQNLFSASIL-LPSISSPPLFLPSKQSDSLTPNSRTRKGRG 61

Query: 64  TITAPLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRK 123
           T  A L  F A++  N +  +E  EQVE+EVA GY+++QFCDKIID+F+NEKPK KEWRK
Sbjct: 62  TSDAVLSNFRANSTANSIGAAEVAEQVEVEVANGYTITQFCDKIIDVFMNEKPKLKEWRK 121

Query: 124 FLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQ 183
           +LVFREEW KYRE+FY+ CQ RA  E DP++K+KLI L RKVKKIDDEME H+ELL+E+Q
Sbjct: 122 YLVFREEWNKYREAFYNRCQTRAYAETDPVIKKKLIELGRKVKKIDDEMERHTELLEEVQ 181

Query: 184 DSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLE 243
            SP D+NAIV +R K+FT EFF+ L+L+SET+DSLEDRDA+ARL ARCL+AVSAY+ TLE
Sbjct: 182 SSPMDVNAIVVRRRKDFTGEFFRHLSLLSETYDSLEDRDAMARLGARCLSAVSAYDNTLE 241

Query: 244 NVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKN 303
            VETLD AQ KFD+ILNSPS+DVACEKI SLAKAKELDSSLILLINSAW++AKESTTMKN
Sbjct: 242 IVETLDVAQAKFDDILNSPSIDVACEKIKSLAKAKELDSSLILLINSAWSAAKESTTMKN 301

Query: 304 EVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPN 363
           EVK+IMYHLYKATKSSLRS+APKEIKLLKHLLNI DPEERFSALA+ FSPGD  E KDPN
Sbjct: 302 EVKDIMYHLYKATKSSLRSIAPKEIKLLKHLLNITDPEERFSALASAFSPGDDREAKDPN 361

Query: 364 ALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQ 414
           ALYTTPKELHKWIKIMLD+YHLN+E+TDIREAR MT+P+VIQRLFILK+TIE EYLE+
Sbjct: 362 ALYTTPKELHKWIKIMLDAYHLNKEETDIREARQMTEPVVIQRLFILKETIEEEYLER 418

BLAST of Csa4G515540 vs. TrEMBL
Match: A0A151TH46_CAJCA (Uncharacterized protein OS=Cajanus cajan GN=KK1_012672 PE=4 SV=1)

HSP 1 Score: 557.8 bits (1436), Expect = 1.2e-155
Identity = 278/408 (68.14%), Positives = 339/408 (83.09%), Query Frame = 1

Query: 21  FPSFSTTLNPL-PSIYSASPFKPS-----PKISKSDNRTSVTITAPLQIFNASARVNDVA 80
           F  F+  L+P+ PS Y+A  + PS     PK+   + +  +     L     S + +  +
Sbjct: 17  FIPFNNNLSPITPSSYNAPKYSPSFQNQRPKLIPQNFKPHILQCTSLP----SPQASSAS 76

Query: 81  TSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHC 140
           T++ EEQ E+E+A GY+++QFCDK+ID FLNEK K+KEW+K+L+FREEWKKYR+ FYS C
Sbjct: 77  TAQAEEQ-EIEIANGYTMTQFCDKMIDFFLNEKTKSKEWKKYLIFREEWKKYRDRFYSRC 136

Query: 141 QRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHKEFTD 200
           QRRAD E+DPIMKEK ISLRRKVKKIDDEME H ELLKE+QDSPTDINAIVA+R K+FT 
Sbjct: 137 QRRADMENDPIMKEKFISLRRKVKKIDDEMEEHYELLKEIQDSPTDINAIVAQRRKDFTG 196

Query: 201 EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNILNSP 260
           EFF +L+L+S+T+DSLEDRD +ARL +RCL++VSAY+ TLEN+ETLD+AQ KFD+ILNSP
Sbjct: 197 EFFNYLSLLSDTYDSLEDRDGIARLGSRCLSSVSAYDNTLENIETLDTAQAKFDDILNSP 256

Query: 261 SLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRS 320
           S+DVAC+KI SLAKAKELDS+LILLI+SAWA AKESTTMK+EVK+IMY LYKATKSSLRS
Sbjct: 257 SIDVACQKIKSLAKAKELDSTLILLISSAWAKAKESTTMKDEVKDIMYQLYKATKSSLRS 316

Query: 321 MAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKIMLDS 380
           + PKEIKLLKHLLNI+DPEERFSALAT FSPGD  E KDPN LYTTPKELHKWIKIMLD+
Sbjct: 317 ITPKEIKLLKHLLNIIDPEERFSALATAFSPGDELEAKDPNTLYTTPKELHKWIKIMLDA 376

Query: 381 YHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSR 423
           YHLN+E+TD+REA+ MT P+VIQRLFILKDTIE EY+E+   Q  +++
Sbjct: 377 YHLNKEETDLREAKQMTDPVVIQRLFILKDTIEQEYMEKGTPQKSETK 419

BLAST of Csa4G515540 vs. TrEMBL
Match: A0A0B2PBI0_GLYSO (Uncharacterized protein OS=Glycine soja GN=glysoja_024255 PE=4 SV=1)

HSP 1 Score: 556.2 bits (1432), Expect = 3.5e-155
Identity = 278/419 (66.35%), Positives = 340/419 (81.15%), Query Frame = 1

Query: 4   TNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQI 63
           T  +PF   +S   P   PS+ T  N  PS  S  P     K+   + +  + +   L  
Sbjct: 17  TTFIPFNNNLSPNYPLTPPSY-TASNCFPSFQSQRP-----KLIAQNFKPHILLCTSLPS 76

Query: 64  FNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEW 123
             AS+     A++ + E+ E+E+AKGY+++QFCDK+ID FLNEK K+KEWRK+L+FREEW
Sbjct: 77  PQASS-----ASTAQAEEHEVEIAKGYTMTQFCDKMIDFFLNEKTKSKEWRKYLIFREEW 136

Query: 124 KKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINA 183
           KKYR+ FY+ CQRRAD E+DP+MKEK ISLRRK+KKIDDEME H ELL E+QDSP DINA
Sbjct: 137 KKYRDRFYNRCQRRADMENDPVMKEKFISLRRKLKKIDDEMEGHYELLMEIQDSPMDINA 196

Query: 184 IVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSA 243
           IVA+R K+FT EFF +L+LIS+T+DSLEDRD ++RL +RCL+AVSAY+ TLEN+ETLD+A
Sbjct: 197 IVARRRKDFTGEFFHYLSLISDTYDSLEDRDGISRLGSRCLSAVSAYDNTLENIETLDAA 256

Query: 244 QVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYH 303
           Q KFD+ILNSPS+D+AC+KI SLAKAKELDSSLILLI+SAWA AKESTTMKNEVK+IMY 
Sbjct: 257 QAKFDDILNSPSIDIACQKIKSLAKAKELDSSLILLISSAWAKAKESTTMKNEVKDIMYQ 316

Query: 304 LYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKE 363
           LY+ATKSSLRS+ PKEIKLLKHLLNI+DPEERFSALAT F+PGD  E KDPNALYTTPKE
Sbjct: 317 LYRATKSSLRSITPKEIKLLKHLLNIIDPEERFSALATAFTPGDEHEAKDPNALYTTPKE 376

Query: 364 LHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSR 423
           LHKWIKIMLD+YHLN+E+TD+REAR MT P+VIQRLFILKDTIE EY+E++  Q  +++
Sbjct: 377 LHKWIKIMLDAYHLNKEETDLREARQMTDPVVIQRLFILKDTIEQEYMEKDTTQKSETK 424

BLAST of Csa4G515540 vs. TAIR10
Match: AT4G37920.1 (AT4G37920.1 unknown protein)

HSP 1 Score: 517.3 bits (1331), Expect = 9.1e-147
Identity = 268/403 (66.50%), Positives = 321/403 (79.65%), Query Frame = 1

Query: 11  FYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQIFNASARV 70
           F+ S+ K   FP  ++  + LP  +SA       KI KS   T  T T        SA V
Sbjct: 10  FFSSADKLLSFPPKNSQTHHLP--FSAF-INGGRKIRKSSTITFATDTVTYN-GTTSAEV 69

Query: 71  NDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESF 130
                S  E+ +E+EVA+GY+++QFCDKIID+FLNEKPK K+W+ +LV R+EW KY  +F
Sbjct: 70  K----SSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNF 129

Query: 131 YSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHK 190
           Y  C+ RAD E DPI+K+KL+SL  KVKKID EME H++LLKE+Q++PTDINAI AKR +
Sbjct: 130 YKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRR 189

Query: 191 EFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNI 250
           +FT EFF+++TL+SET D LEDRDAVARLA RCL+AVSAY+ TLE+VETLD+AQ KF++I
Sbjct: 190 DFTGEFFRYVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDI 249

Query: 251 LNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKS 310
           LNSPS+D ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMYHLYKATKS
Sbjct: 250 LNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKS 309

Query: 311 SLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKI 370
           SLRS+ PKEIKLLK+LLNI DPEERFSALAT FSPGD  E KDP ALYTTPKELHKWIKI
Sbjct: 310 SLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKI 369

Query: 371 MLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQ 414
           MLD+YHLN+E+TDI+EA+ M+QPIVIQRLFILKDTIE EYL++
Sbjct: 370 MLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDK 404

BLAST of Csa4G515540 vs. TAIR10
Match: AT1G36320.1 (AT1G36320.1 unknown protein)

HSP 1 Score: 306.2 bits (783), Expect = 3.2e-83
Identity = 150/355 (42.25%), Positives = 244/355 (68.73%), Query Frame = 1

Query: 62  QIFNASARVND---VATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLV 121
           Q F  SA V+D   VA  EK++  E  V     + + CDK+I++F+ +KP   +WR+ L 
Sbjct: 60  QRFVISAVVDDKSVVAKEEKKDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLA 119

Query: 122 FREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSP 181
           F +EW   R  FY  CQ RAD ED+P MK K+  L RK+K++D++++ H+ELL  ++ +P
Sbjct: 120 FSKEWDSIRPHFYKRCQERADSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTP 179

Query: 182 -TDINAIVAKRHKEFTDEFFKFLTLISETH-DSLEDRDAVARLAARCLAAVSAYNRTLEN 241
             +I  +VA+R K+FT+EFF+ L  ++E++ D+ ++++A+A L    +AAV AY+ + E+
Sbjct: 180 PAEIGELVARRRKDFTNEFFEHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTES 239

Query: 242 VETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNE 301
           ++ L++A++K  +I+NSPSLD AC KI SLA+  +LDS+L+L+I  AW++AKES  MK E
Sbjct: 240 IDALNAAEMKLQDIINSPSLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEE 299

Query: 302 VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNA 361
           VK+I+YHLY   + +L+ + PKE+++LK+LL+I DP+E+ SAL   F+PGD  E  D + 
Sbjct: 300 VKDILYHLYVTARGNLQRLMPKEVRILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDY 359

Query: 362 LYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYL 412
           LYTTP+ L   +K +L++YH ++E + ++EA+++  P +I ++  LK  +E +Y+
Sbjct: 360 LYTTPEHLQSLMKTVLEAYHFSREGSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414

BLAST of Csa4G515540 vs. NCBI nr
Match: gi|449457285|ref|XP_004146379.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic [Cucumis sativus])

HSP 1 Score: 862.8 bits (2228), Expect = 2.5e-247
Identity = 435/435 (100.00%), Positives = 435/435 (100.00%), Query Frame = 1

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP
Sbjct: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL
Sbjct: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 436
           SRPSHNHGSEDAISI
Sbjct: 421 SRPSHNHGSEDAISI 435

BLAST of Csa4G515540 vs. NCBI nr
Match: gi|659082883|ref|XP_008442081.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 837.0 bits (2161), Expect = 1.5e-239
Identity = 420/435 (96.55%), Positives = 426/435 (97.93%), Query Frame = 1

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFY+SSTK FIFP+FSTTL PLPSIYSASPFKPSPK SKSDNRT+VTITAP
Sbjct: 1   MAFTNHLPFQFYISSTKSFIFPNFSTTLKPLPSIYSASPFKPSPKFSKSDNRTTVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQ EMEVAKGYSLSQFCDKIIDIF+NEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIIDIFMNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVA R KEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETL
Sbjct: 181 INAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQ KFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT FSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 436
           SRP+HNHGSEDAISI
Sbjct: 421 SRPNHNHGSEDAISI 435

BLAST of Csa4G515540 vs. NCBI nr
Match: gi|659082885|ref|XP_008442082.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis melo])

HSP 1 Score: 685.6 bits (1768), Expect = 5.5e-194
Identity = 344/352 (97.73%), Positives = 347/352 (98.58%), Query Frame = 1

Query: 84  MEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWEDD 143
           MEVAKGYSLSQFCDKIIDIF+NEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWE D
Sbjct: 1   MEVAKGYSLSQFCDKIIDIFMNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWESD 60

Query: 144 PIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHKEFTDEFFKFLTLI 203
           PIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVA R KEFTDEFFKFLTLI
Sbjct: 61  PIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVANRRKEFTDEFFKFLTLI 120

Query: 204 SETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNILNSPSLDVACEKI 263
           SETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETLDSAQ KFDNILNSPSLDVACEKI
Sbjct: 121 SETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETLDSAQAKFDNILNSPSLDVACEKI 180

Query: 264 ASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLL 323
           ASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLL
Sbjct: 181 ASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLL 240

Query: 324 KHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTD 383
           KHLLNIVDPEERFSALAT FSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTD
Sbjct: 241 KHLLNIVDPEERFSALATAFSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTD 300

Query: 384 IREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSRPSHNHGSEDAISI 436
           IREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSRP+HNHGSEDAISI
Sbjct: 301 IREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSRPNHNHGSEDAISI 352

BLAST of Csa4G515540 vs. NCBI nr
Match: gi|694437695|ref|XP_009345861.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic [Pyrus x bretschneideri])

HSP 1 Score: 577.0 bits (1486), Expect = 2.7e-161
Identity = 291/429 (67.83%), Positives = 352/429 (82.05%), Query Frame = 1

Query: 4   TNHLPFQFYVSSTKP--FIFPS---FSTTLN------PLPSIYSASPFKPSPKISKSDNR 63
           +N +  +   S TKP  F F +   F  T N      P+PS+ SASPF P     +S +R
Sbjct: 2   SNFMGLELSASITKPSTFFFDAINFFPITENVSPATLPIPSMSSASPFPPPKHRLQSKHR 61

Query: 64  TSVTITAPLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKE 123
               +T    I+    + +++A+++  EQV++EVA+GY+++QFCDK+IDIF+NEKP+TK+
Sbjct: 62  RKAGVTQ--SIYTTRTQASNIASAQVAEQVDVEVAEGYTMTQFCDKVIDIFMNEKPRTKD 121

Query: 124 WRKFLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLK 183
           WRKFLVFRE+WKKYRE+FY+ C+ RAD E DP MKEK  SL RKVKKIDDEME H+ELL 
Sbjct: 122 WRKFLVFREDWKKYRENFYNRCRTRADTEVDPTMKEKFTSLGRKVKKIDDEMERHNELLN 181

Query: 184 ELQDSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNR 243
           E+QD+PTD+NAIVA+R ++FT+EFF+ L L+SE +DSLEDRDA+ARL ARCL+AVSAY+ 
Sbjct: 182 EIQDNPTDVNAIVARRREDFTEEFFRHLNLLSEIYDSLEDRDAMARLGARCLSAVSAYDN 241

Query: 244 TLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTT 303
           TLE VETLD+AQ KFD+ILNSPS+DVACEKI SLAKAKELDSSL+LLINSAWASAKESTT
Sbjct: 242 TLEYVETLDTAQAKFDDILNSPSMDVACEKIKSLAKAKELDSSLVLLINSAWASAKESTT 301

Query: 304 MKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQK 363
           MKNEVK++MYHLYKATKSSLRS+APKEIKLLKHLLNI DPEERFSALAT FSPGDG E K
Sbjct: 302 MKNEVKDVMYHLYKATKSSLRSIAPKEIKLLKHLLNITDPEERFSALATAFSPGDGPEAK 361

Query: 364 DPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLE 422
           DP A+YTTPKELHKWIKIMLD+YHLN E+TDIREA+ MTQP+VIQRLFILK+TIE EYLE
Sbjct: 362 DPKAVYTTPKELHKWIKIMLDAYHLNAEETDIREAKQMTQPVVIQRLFILKETIEQEYLE 421

BLAST of Csa4G515540 vs. NCBI nr
Match: gi|645219622|ref|XP_008236637.1| (PREDICTED: uncharacterized protein At4g37920, chloroplastic [Prunus mume])

HSP 1 Score: 566.2 bits (1458), Expect = 4.8e-158
Identity = 287/422 (68.01%), Positives = 344/422 (81.52%), Query Frame = 1

Query: 4   TNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSAS----PFKPSPKISKSDNRTSVTITA 63
           +N +  +   S TKP IF   +T      ++ + +    P   +   SK   +T    ++
Sbjct: 2   SNFMALELSNSITKPSIFFDATTFFRLNENLPTTTLLIPPISFASLKSKHRRKTRAAQSS 61

Query: 64  PLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVF 123
           P      S + +++AT++  EQVE+EVA+GY+++QFCDKIID+F+NEKP+ KEWRK LVF
Sbjct: 62  P----TTSPQASNIATAQVAEQVEVEVAEGYTMTQFCDKIIDVFMNEKPRAKEWRKLLVF 121

Query: 124 REEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPT 183
           RE+W+KYRESFY+ C+ RAD E DP MKEK  SL RKVKKIDDEME HSELLKE+QD+PT
Sbjct: 122 REDWEKYRESFYNRCRTRADMEGDPTMKEKFTSLGRKVKKIDDEMERHSELLKEIQDNPT 181

Query: 184 DINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVET 243
           DINA+VA+R K+FT+EFF+ L L+SE +DSLEDRDA+ARL ARCL+AVSAY+ TLE VET
Sbjct: 182 DINALVARRRKDFTEEFFRHLNLLSEVYDSLEDRDAMARLGARCLSAVSAYDNTLEYVET 241

Query: 244 LDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKE 303
           LD+AQ KFD+ILNSPS+DVACEKI SLAKAKELDSSL+LLINSAWASAKESTTMKNEVK+
Sbjct: 242 LDTAQAKFDDILNSPSVDVACEKIKSLAKAKELDSSLVLLINSAWASAKESTTMKNEVKD 301

Query: 304 IMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYT 363
           IMYHLYKATKSSLRS+APKEIKLLKHLLNI DPEERFSALAT FSPGDG E KDP A+YT
Sbjct: 302 IMYHLYKATKSSLRSIAPKEIKLLKHLLNITDPEERFSALATAFSPGDGPEAKDPKAVYT 361

Query: 364 TPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNP 422
           TPKELHKWIKIMLD+YHLN E+TDIREA+ MTQP+VIQRLFILK+TIE EYLE++  Q  
Sbjct: 362 TPKELHKWIKIMLDAYHLNAEETDIREAKQMTQPVVIQRLFILKETIEEEYLERSTDQKS 419

The following BLAST results are available for this feature:
Match Name        E-value    Identity (%)  Description
Y4920_ARATH       1.6e-145   66.50         Uncharacterized protein At4g37920, chloroplastic OS=Arabidopsis thaliana GN=At4g...

Match Name        E-value    Identity (%)  Description
A0A0A0L3X1_CUCSA  1.7e-247   100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_4G515540 PE=4 SV=1
M5VYC6_PRUPE      4.4e-158   67.77         Uncharacterized protein OS=Prunus persica GN=PRUPE_ppa006103mg PE=4 SV=1
D7T5B2_VITVI      9.2e-156   69.38         Putative uncharacterized protein OS=Vitis vinifera GN=VIT_00s0211g00010 PE=4 SV=...
A0A151TH46_CAJCA  1.2e-155   68.14         Uncharacterized protein OS=Cajanus cajan GN=KK1_012672 PE=4 SV=1
A0A0B2PBI0_GLYSO  3.5e-155   66.35         Uncharacterized protein OS=Glycine soja GN=glysoja_024255 PE=4 SV=1

Match Name        E-value    Identity (%)  Description
AT4G37920.1       9.1e-147   66.50         unknown protein
AT1G36320.1       3.2e-83    42.25         unknown protein

Match Name                        E-value    Identity (%)  Description
gi|449457285|ref|XP_004146379.1|  2.5e-247   100.00        PREDICTED: uncharacterized protein At4g37920, chloroplastic [Cucumis sativus]
gi|659082883|ref|XP_008442081.1|  1.5e-239   96.55         PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ...
gi|659082885|ref|XP_008442082.1|  5.5e-194   97.73         PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X2 [Cucumis ...
gi|694437695|ref|XP_009345861.1|  2.7e-161   67.83         PREDICTED: uncharacterized protein At4g37920, chloroplastic [Pyrus x bretschneid...
gi|645219622|ref|XP_008236637.1|  4.8e-158   68.01         PREDICTED: uncharacterized protein At4g37920, chloroplastic [Prunus mume]
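
The identity and positives percentages reported for each HSP follow directly from the counts in its header line (matches divided by alignment length). A minimal sketch of that arithmetic, using the Swiss-Prot hit as input; the regular expression and the function name `hsp_percentages` are illustrative assumptions, not part of any BLAST tool.

```python
import re

# Accepts both "Positives" and the "Postives" spelling seen in some reports.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def hsp_percentages(line):
    """Recompute identity and positives percentages from an HSP header line."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError("not an HSP statistics line")
    identical, length = int(m.group(1)), int(m.group(2))
    positives = int(m.group(4))
    return (round(100 * identical / length, 2),
            round(100 * positives / length, 2))

line = "Identity = 268/403 (66.50%), Positives = 321/403 (79.65%), Query Frame = 1"
print(hsp_percentages(line))
```

For this hit, 268 identical and 321 positive positions over a 403-residue alignment give the 66.50% and 79.65% shown in the report.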
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term  Definition
GO Assignments
This gene is annotated with the following GO terms.
Category            Term Accession  Term Name
biological_process  GO:0008150      biological_process
cellular_component  GO:0009941      chloroplast envelope
cellular_component  GO:0009535      chloroplast thylakoid membrane
cellular_component  GO:0005575      cellular_component
molecular_function  GO:0003674      molecular_function
This gene is associated with the following unigenes:
Unigene Name  Analysis Name                        Sequence type in Unigene
CU091566      cucumber EST collection version 3.0  transcribed_cluster

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa4G515540.1  Csa4G515540.1  mRNA


The following transcribed_cluster feature(s) are associated with this gene:

Feature Name  Unique Name  Type
CU091566      CU091566     transcribed_cluster


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term  IPR Description   Source   Source Term    Source Description   Alignment
None      No IPR available  unknown  Coil           Coil                 coord: 157..177; scor
None      No IPR available  PANTHER  PTHR31755      FAMILY NOT NAMED     coord: 14..426; score: 1.8E
None      No IPR available  PANTHER  PTHR31755:SF2  SUBFAMILY NOT NAMED  coord: 14..426; score: 1.8E