CsGy4G018700 (gene) Cucumber (Gy14) v2.1

Overview
NameCsGy4G018700
Typegene
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionUnknown protein
LocationGy14Chr4: 24429463 .. 24435436 (-)
RNA-Seq ExpressionCsGy4G018700
SyntenyCsGy4G018700
Sequences
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: exonfive_prime_UTRpolypeptideCDSthree_prime_UTR
Hold the cursor over a type above to highlight its positions in the sequence below.
CTTTACACCGTCGGGCCTCGGCTGAGTTCATTTTCGTAATTTCACCGAAATTTTGGTCTCCCCATTCGTACAAAACCACCCGCATTTGTTTGAGTGTCCAGTCTCCAGCTCCATCCACTTGCCTTTGTCTTGCGTTCTTTCAAAATTTCTATCAGCAATGGCTTTCACAAATCATCTCCCCTTTCAGTTCTACGTTTCCTCAACCAAACCTTTCATCTTCCCCAGCTTTTCCACCACTCTAAACCCACTTCCATCCATTTATTCTGCTTCACCATTCAAACCATCACCCAAAATTTCCAAATCCGACAACCGTACATCAGTTACAATCACAGCCCCATTACAAATATTCAACGCAAGTGAGTTCTCTTTCCTTACCATTTTGTTTTCTCTTCAGTTCTTTCCCTTTGCCCATAGTTAGGGCTTCTGAGTTCTCATCCGTTTGTTTTTAATCTTTAAAAGTAAAACCCTTCGCAATGTATCCAAAGTCACAGTCACACGAAAATCATCAAATGATTGGCTAGCCAATATTTTGTGTGTGTGTTCTTGTCTGCTTCTTGGGCTCTTGCAGAGTTCCAAGGATTCTAATACGAATGCTATTGGCATAGAATTAGGAATACTCTTTGGTTTTTAAAGGTATGTTAGTAATTAGTTTGGGAAGTTGTTATGGTCATTTTCATTTGTTTTAATTTGTGGGTATGTACGGATGAATGTATTCAATGGCTAATTTGGTTTAAGTTGAAGTAACACTACTCGGAATTAGGGAGGCTTCAAGTAACTCGAGTACTTGGTTTATCTTTGTAGTTCTTCTGTCTTTTACATTTCAATATCATTTTAGTTGTTTGGATTCTATCAGATCCATTGCCAGTTGAGAGCAAAAACTTGCTTCTTAGGTGCACGAGTGAATGATGTAGCTACATCTGAAAAGGAAGAGCAAGTAGAGATGGAAGTTGCAAAGGGATATAGCCTCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACCAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATCGTGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCGGATTGGGAGGATGATCCAATTATGAAAGAGAAGTTAATATCACTTAGGAGAAAAGTTAAAAAGGTACGTCCTTGTCCACTTCATTTTCCCTCCATTTTTTATAACTTTCTTGGTATGATTTGTGTTCATGAATACAATATCTTGTTGCATGGGTTTTTGTTTGCAAGAATCTGAAAACAAAATTTAGAGGAGAATTTCGACTGTAAATGTTCATTGCTTTCAATTTGATAATTATTTCCGATGACCAGAGTTTAATTTGTAAACAATTTGAGCTGCGACTTTTGTTTTATATTTCTGCATTTTTTATAACAAGTGAGAAATATACAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCACAAAGAGTTCACAGATGAGTTTTTTAAGTTCCTAACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGGTAATATATTAAAAGTTTCATTACTGATCATAGTTCTGATGTTGGATAACCAAGTGTTTGTATTAGATTAAACCAAGTCTCTTATTAATTCAGTTTAGATTTAATCTCAGATTTCTTCTGGGGGATAAGATAATGTTTAGTTTATATTTATTGCGTCCTCATATTTAAACCAATTTTACAAGTATTAATCTAAATCTAGAAGCCCCATCAAACATATCACTCCTGAGAACTGAATAATAAAATTATCATTTTTTCTCCAATTATTGCGATTTTTTTTCTAGTAAAGTCTGAACTGTGCATTCACACTGGAACATAGATTTCATTCTTTTTCTTTCTTATCTTTATGATTTAGGATCTAGTAATGACATTTATAATCTATAGTAGTTATCTCAAGAATTGACTAAAGCCCAGAGTTGGAACAGTATTTTAACTTGGCAGATAGAATTGCTCGTCTGAAAAAAAAAAAAAAATCTTATAACTAGTTCTCTCTTAAACTAATTTTTCTATTTGAGCAGCTGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCGTACAACCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATAATATTCTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAAAATGAGGTAATGTTTAGAATTAAATCGGATCTCACTACTTCAACTGGCATGTTTATTTCATTTTAAATTTAATACCTAAATAATTGCTCTTGTTAACTTTGAAGTTCTTGAAAGTAACTCCCTGCAACAAACCCAGTTTTGCCCAAGTTGGAAACTAACTCTTATTTTCAAGTTAGTCATATTTTTGAGAACACATTCTTTCATGTACGAGAATAAAAACAAAATACACGACTATTATACAAAATAGAAAACACATAAAAAAAACAAAAGCTCAGTCTGAGACTCCAATCCATACGGATCAAGCCTAATTGTTTCATACATTGTGTTGGACATGCTTTTTAGAAAGCTTATTTTAATGTGAAGCTCTGATTATTTCAGCAGCATTGAAAAAATGCTAATCAAATTATTCCTTTTAAAGCATTTACTGAATTTTTCTAACCACTTTTGATGATAGAAAATCTTTAAAACTTAACCAAGCACTATCAATTTAAAAAAGAAGAAAAATAGAAAATTTCATTTCTAATCACTTTAGAAAGCATGCCAAACACACACTAATACAAAGAAACGAGAACTGTTTAAAACTTCTCATTGTAGTTGATATTTTGAAATTGGTTTACAAACTCGATTGAAAGTCTATGAAGTGCTGACAATCAATTTATAAGAGAAAGACCACATGAAGATTCTATGGCCTTAGTGAAAAAATCTATTAGGATACCACCTAACTGGAATAAGATTTGAAATGGTGAAATCTAAAAAGAGGAACTACTAGATAACTGTTAGAATTTAGAAGACTTCCATGCAACAACATTCAAGAACCAGAAAAAACAAAAGATAGAAATATATACAACATCAATGAAGAAACTACCAACGAAAATTTCATAAAACTCTTCACACCTTACCCCCAATGCACTCCTTCTATTTATAGCTAAAAGTTAACAAACCTACTAGCTATTTACTCAGATGCCCCTTCTAATACTTATACTAATATTCTACTAATAATCCTATGATATCTCTAACTAGGTCTCTCACAATAACCCAAGTATAAAGGAACTTGGAACCTCCTCTTGATTATGCTAACCCAAGCCCTAGATCACTCAACAATTTCCTCCCCACCAGCCTATTTCCTCCCTCTATTTATATCCTAATACATTAATTCCCCAATTAATTACCCTTACGTCCCCCACTAACTCCATACTAATATTTCTAATTAAGACCCTTACAAAATCTTCCATCTGTATTGACTATTGAGTTGAGCAGTCTAAAAGTACGTTCTAAAGCTTTCCAACAGTTTCAGATGCAACTATTATGGCTCATTCTCTGTGTCTCTGTTTGTGAATATAGGTGAAAGAAATAATGTATCATTTATACAAGGCCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCAGCTTTAGCAACAACCTTCTCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACGTGAGTAAGCTTCATATCATAAATTATTGTACATCTTCATGATAATTACATATTTCTCCTATAGGTTTCATTAATCATCGGAGTTTATCTCAAAATATACCTACTACTTGGTTTTGGTCTTTGAACTTTTAAGTTTAGTTCATAATGTTTCCAAAACTTTTAGGAAGTGTATTTTAATTATTGTCAATTATTTATCAGATGGATAACATGGCCTAAATATGGCAGCCATATAGATTAGTTGAGATGAGTAATAGATGGGGAATATGTGAGCGCCCTAGTTAGAGATGTAATAGGATTATTAGTAGAATATTAGAGTGGTTATTAGAAGGGCATATTAGAAATTTGATAGCAAGTTGGTTAGGGTTTTTAGTTTAGTTATAAATAGAGTGAGTGGGTTGAGAGCAAGTTGTGAAGGATATTAAGGGATTCCCTTTGGAGAGTTTGGGAGAGTCTAGGCCACTCAAAAGATAAGCAATTACCTTGTTGAGTTTTGAGATGATATTATAACACATGAATTTTTTTTGTGTGTTATTGAGTGTGTTCTTGTTAGGAGGTATTCTAACAGAATACTTTATGCACACACTCATATCGACTCTCAACCCTCATATTAGTTTTAATCTGTATAGCTTTACTTGAAAAAGGAAGTAAAAATAATGTAATATTATTCCTCTACATTATTGAATCAAATACAAGATCTTATATAGAGAAAGACCAACAACAAATAAAAAAATAATAGGGGTCTTTTAAAAAATATAACAAACCGTTAAAATATTTACATCGTATAGAATAATTCTAACAAAGGAAAAAGCCCACAGGCCTACAATGGGAAATACAAAAAATACCCTAGTCAATGTGCGATTAATCGATCACATGCGCTCGTGTAATCGTCTTCTAAACGATCATGATACATGATCGTGTAGATAATGATACACGGTTGTGTAGGTAGTATCAACATGATTGTTTTATTCTTTTTAACGATGAGAAAAGAGCTTCAAATCTAAACGATCATTTAGATCATACCCACCTGATCATGTAGTTTTTTTTTAAACCATGGGAAAAGAGCTTCAAATCTAAACGATTGTGTTGTTATGGTAAACGATTGTTTAGATCATAACTACACGATCTTGTAGTTCTTTTTAAACGGTGGAAAATGGCTTCAACTCTAAAACGATCGTGTTGACCATGGTAAACGATCGTTTAGATCATGTCACGGTGATATTTAAACGATCTTGATATTCTTGAATATAAGGATGATGAAGTTCAATATTTTTTTTAAACGATCTTGAATATTCTTGAGATTGAAAAAGAAGAATAATACATTTAAGAATAGAAGGAAAATTTGGAAGTAGAGTGGAAGAAGTCTGAAAGAGAAGATGGATAAATTGGATTGTGCTACGGTGGTATGTGGTGAAGACAAGAGAATCTATATATACAATCTTGACTATTACTGACAAGTTGGCGCTAAGTCTTTTGCAGGAGGTCAGTTTCAAGCACCAATAGATATGGTCTATACTCTTGTAGCCCGGATATAGAAGACATGACAAATAGAGACTCCTAGTGATATGGTCTTTCTTTAATATTCGTCAAGATTTAAATCTAGATTTGGTCATATTTATAAATTCTTTTGTATTGTGTTATAATTGTAAATATTTTGGATCTAATTGCTATATTTATAACTGTATCATTTTGTAATTGGCAGAACCCCGAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTAAACCAAGAAGATACGGACATCAGAGAAGCAAGGAATATGACTCAGCCTATTGTTATACAAAGGCTATTCATCCTTAAGGATACTATTGAAACTGAGTATTTGGAACAGAATCAGTTTCAGAATCCTCAATCAAGACCAAGTCATAATCATGGTTCTGAGGATGCAATCTCCATATAGTTTTAGTGTAAATGTCTCATATTTGTAATTTAGCCGCCACTAGAACCACTAGTTTCAGCATCTGAGTTTAAAGTTAGAATAGTTGCAAGGTCAGGTAGCTCATGTAACAAATATCAACATCTTTCAATTAAATTGATTTTTTTTTTCAGATTTAATATCATA

mRNA sequence

CTTTACACCGTCGGGCCTCGGCTGAGTTCATTTTCGTAATTTCACCGAAATTTTGGTCTCCCCATTCGTACAAAACCACCCGCATTTGTTTGAGTGTCCAGTCTCCAGCTCCATCCACTTGCCTTTGTCTTGCGTTCTTTCAAAATTTCTATCAGCAATGGCTTTCACAAATCATCTCCCCTTTCAGTTCTACGTTTCCTCAACCAAACCTTTCATCTTCCCCAGCTTTTCCACCACTCTAAACCCACTTCCATCCATTTATTCTGCTTCACCATTCAAACCATCACCCAAAATTTCCAAATCCGACAACCGTACATCAGTTACAATCACAGCCCCATTACAAATATTCAACGCAAGTGCACGAGTGAATGATGTAGCTACATCTGAAAAGGAAGAGCAAGTAGAGATGGAAGTTGCAAAGGGATATAGCCTCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACCAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATCGTGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCGGATTGGGAGGATGATCCAATTATGAAAGAGAAGTTAATATCACTTAGGAGAAAAGTTAAAAAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCACAAAGAGTTCACAGATGAGTTTTTTAAGTTCCTAACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCTGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCGTACAACCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATAATATTCTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAAAATGAGGTGAAAGAAATAATGTATCATTTATACAAGGCCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCAGCTTTAGCAACAACCTTCTCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACCCCGAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTAAACCAAGAAGATACGGACATCAGAGAAGCAAGGAATATGACTCAGCCTATTGTTATACAAAGGCTATTCATCCTTAAGGATACTATTGAAACTGAGTATTTGGAACAGAATCAGTTTCAGAATCCTCAATCAAGACCAAGTCATAATCATGGTTCTGAGGATGCAATCTCCATATAGTTTTAGTGTAAATGTCTCATATTTGTAATTTAGCCGCCACTAGAACCACTAGTTTCAGCATCTGAGTTTAAAGTTAGAATAGTTGCAAGGTCAGGTAGCTCATGTAACAAATATCAACATCTTTCAATTAAATTGATTTTTTTTTTCAGATTTAATATCATA

Coding sequence (CDS)

ATGGCTTTCACAAATCATCTCCCCTTTCAGTTCTACGTTTCCTCAACCAAACCTTTCATCTTCCCCAGCTTTTCCACCACTCTAAACCCACTTCCATCCATTTATTCTGCTTCACCATTCAAACCATCACCCAAAATTTCCAAATCCGACAACCGTACATCAGTTACAATCACAGCCCCATTACAAATATTCAACGCAAGTGCACGAGTGAATGATGTAGCTACATCTGAAAAGGAAGAGCAAGTAGAGATGGAAGTTGCAAAGGGATATAGCCTCTCTCAATTTTGTGATAAAATAATTGATATTTTCTTGAATGAGAAGCCCAAGACCAAAGAATGGAGGAAGTTTTTGGTATTTAGGGAGGAGTGGAAAAAGTATCGTGAGAGCTTCTACAGTCATTGCCAAAGGCGGGCGGATTGGGAGGATGATCCAATTATGAAAGAGAAGTTAATATCACTTAGGAGAAAAGTTAAAAAGATTGATGATGAAATGGAAATCCACAGTGAACTTCTCAAGGAATTACAGGACAGCCCAACTGACATTAATGCGATAGTTGCAAAGCGGCACAAAGAGTTCACAGATGAGTTTTTTAAGTTCCTAACTCTGATATCGGAAACCCATGACAGTTTGGAAGATCGTGATGCTGTGGCTCGGCTGGCAGCCAGATGTCTGGCTGCAGTTAGTGCGTACAACCGAACATTAGAAAATGTGGAGACATTGGATTCTGCACAGGTCAAATTTGATAATATTCTGAATTCTCCCTCATTGGATGTGGCTTGTGAGAAGATTGCAAGTCTTGCAAAGGCAAAGGAACTTGATTCATCATTGATCCTTTTGATAAACAGTGCTTGGGCTTCTGCAAAAGAATCCACAACCATGAAAAATGAGGTGAAAGAAATAATGTATCATTTATACAAGGCCACAAAAAGCAGTCTTAGAAGCATGGCCCCTAAAGAAATAAAGCTGTTAAAGCATTTGCTGAACATCGTAGATCCTGAAGAACGATTTTCAGCTTTAGCAACAACCTTCTCCCCAGGTGATGGAAGTGAACAAAAAGATCCAAATGCTTTGTACACAACCCCGAAAGAGCTGCATAAGTGGATAAAGATCATGCTTGATTCATACCATCTAAACCAAGAAGATACGGACATCAGAGAAGCAAGGAATATGACTCAGCCTATTGTTATACAAAGGCTATTCATCCTTAAGGATACTATTGAAACTGAGTATTTGGAACAGAATCAGTTTCAGAATCCTCAATCAAGACCAAGTCATAATCATGGTTCTGAGGATGCAATCTCCATATAG

Protein sequence

MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQSRPSHNHGSEDAISI*
Homology
BLAST of CsGy4G018700 vs. ExPASy Swiss-Prot
Match: Q84WN0 (Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 PE=2 SV=2)

HSP 1 Score: 501.1 bits (1289), Expect = 1.2e-140
Identity = 266/403 (66.00%), Postives = 323/403 (80.15%), Query Frame = 0

Query: 11  FYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQIFNASARV 70
           F+ S+ K   FP  ++  + LP  +SA       KI KS   T  T T     +N +   
Sbjct: 10  FFSSADKLLSFPPKNSQTHHLP--FSAF-INGGRKIRKSSTITFATDTV---TYNGTTSA 69

Query: 71  NDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESF 130
                S  E+ +E+EVA+GY+++QFCDKIID+FLNEKPK K+W+ +LV R+EW KY  +F
Sbjct: 70  E--VKSSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNF 129

Query: 131 YSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHK 190
           Y  C+ RAD E DPI+K+KL+SL  KVKKID EME H++LLKE+Q++PTDINAI AKR +
Sbjct: 130 YKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRR 189

Query: 191 EFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNI 250
           +FT EFF+++TL+SET D LEDRDAVARLA RCL+AVSAY+ TLE+VETLD+AQ KF++I
Sbjct: 190 DFTGEFFRYVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDI 249

Query: 251 LNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKS 310
           LNSPS+D ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMYHLYKATKS
Sbjct: 250 LNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKS 309

Query: 311 SLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKI 370
           SLRS+ PKEIKLLK+LLNI DPEERFSALAT FSPGD  E KDP ALYTTPKELHKWIKI
Sbjct: 310 SLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKI 369

Query: 371 MLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQ 414
           MLD+YHLN+E+TDI+EA+ M+QPIVIQRLFILKDTIE EYL++
Sbjct: 370 MLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDK 404

BLAST of CsGy4G018700 vs. NCBI nr
Match: XP_004146379.1 (uncharacterized protein At4g37920 isoform X1 [Cucumis sativus] >KGN54831.1 hypothetical protein Csa_012204 [Cucumis sativus])

HSP 1 Score: 848 bits (2191), Expect = 5.81e-310
Identity = 435/435 (100.00%), Postives = 435/435 (100.00%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP
Sbjct: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL
Sbjct: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           SRPSHNHGSEDAISI
Sbjct: 421 SRPSHNHGSEDAISI 435

BLAST of CsGy4G018700 vs. NCBI nr
Match: XP_008442081.1 (PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis melo])

HSP 1 Score: 822 bits (2124), Expect = 9.44e-300
Identity = 420/435 (96.55%), Postives = 426/435 (97.93%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFY+SSTK FIFP+FSTTL PLPSIYSASPFKPSPK SKSDNRT+VTITAP
Sbjct: 1   MAFTNHLPFQFYISSTKSFIFPNFSTTLKPLPSIYSASPFKPSPKFSKSDNRTTVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQ EMEVAKGYSLSQFCDKIIDIF+NEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIIDIFMNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVA R KEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETL
Sbjct: 181 INAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQ KFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT FSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           SRP+HNHGSEDAISI
Sbjct: 421 SRPNHNHGSEDAISI 435

BLAST of CsGy4G018700 vs. NCBI nr
Match: XP_038883875.1 (uncharacterized protein At4g37920 isoform X2 [Benincasa hispida])

HSP 1 Score: 756 bits (1952), Expect = 1.36e-273
Identity = 391/435 (89.89%), Postives = 409/435 (94.02%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHL FQ  +SSTK FIFPSFS TL PLPSIYSAS FKPSP+I KSDN T VTIT P
Sbjct: 1   MAFTNHLLFQLSISSTKSFIFPSFSATLKPLPSIYSASLFKPSPEIYKSDNPTPVTITTP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           +Q F ASA VNDVAT+EKEE+ EMEVA+GY++SQFCDKIIDIF+NEKPKTKEWRKFLVFR
Sbjct: 61  MQ-FKASALVNDVATTEKEEEAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVK+IDDEMEIH ELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHGELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETL
Sbjct: 181 INAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQ KFD+IL SPSLDVACEKIASLAKAKELDSSLILLINSAWA+AKESTTMKNEVKEI
Sbjct: 241 DSAQAKFDDILTSPSLDVACEKIASLAKAKELDSSLILLINSAWAAAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSEQKDP ALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEQKDPKALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQP+VIQRLFILKDTIETEYLEQN+FQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVVIQRLFILKDTIETEYLEQNEFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           S P  NH SEDA+SI
Sbjct: 421 STP--NHVSEDAVSI 432

BLAST of CsGy4G018700 vs. NCBI nr
Match: XP_038883874.1 (uncharacterized protein At4g37920 isoform X1 [Benincasa hispida])

HSP 1 Score: 745 bits (1923), Expect = 5.35e-269
Identity = 388/445 (87.19%), Postives = 406/445 (91.24%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHL FQ  +SSTK FIFPSFS TL PLPSIYSAS FKPSP+I KSDN T VTIT P
Sbjct: 1   MAFTNHLLFQLSISSTKSFIFPSFSATLKPLPSIYSASLFKPSPEIYKSDNPTPVTITTP 60

Query: 61  LQI----------FNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKT 120
           +Q               A VNDVAT+EKEE+ EMEVA+GY++SQFCDKIIDIF+NEKPKT
Sbjct: 61  MQFKIHCELRAKTCFLGALVNDVATTEKEEEAEMEVAEGYTISQFCDKIIDIFMNEKPKT 120

Query: 121 KEWRKFLVFREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSEL 180
           KEWRKFLVFREEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVK+IDDEMEIH EL
Sbjct: 121 KEWRKFLVFREEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKRIDDEMEIHGEL 180

Query: 181 LKELQDSPTDINAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY 240
           LKELQDSPTDINAIVAKR KEFT+EFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY
Sbjct: 181 LKELQDSPTDINAIVAKRRKEFTEEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY 240

Query: 241 NRTLENVETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKES 300
           +RTLENVETLDSAQ KFD+IL SPSLDVACEKIASLAKAKELDSSLILLINSAWA+AKES
Sbjct: 241 DRTLENVETLDSAQAKFDDILTSPSLDVACEKIASLAKAKELDSSLILLINSAWAAAKES 300

Query: 301 TTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSE 360
           TTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSE
Sbjct: 301 TTMKNEVKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSE 360

Query: 361 QKDPNALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEY 420
           QKDP ALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQP+VIQRLFILKDTIETEY
Sbjct: 361 QKDPKALYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPVVIQRLFILKDTIETEY 420

Query: 421 LEQNQFQNPQSRPSHNHGSEDAISI 435
           LEQN+FQNPQS P  NH SEDA+SI
Sbjct: 421 LEQNEFQNPQSTP--NHVSEDAVSI 443

BLAST of CsGy4G018700 vs. NCBI nr
Match: XP_022967802.1 (uncharacterized protein At4g37920 [Cucurbita maxima])

HSP 1 Score: 718 bits (1853), Expect = 1.62e-258
Identity = 369/435 (84.83%), Postives = 397/435 (91.26%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MA TN L FQ  +SST+ FIF  FS   NPLPSI SA PFKP+PK SKSDNR + T+  P
Sbjct: 1   MAITNQLAFQLSISSTRTFIFRRFSAAQNPLPSISSAIPFKPAPKNSKSDNRATATVPTP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           +Q FNASAR NDVAT+E EEQ EMEVA+GY++SQFCDKIIDIF+NEKPKTKEWRK LVFR
Sbjct: 61  MQ-FNASARANDVATTEMEEQTEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKL+SL R+VK+IDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKR KEFT++FFKFLTL+SETHDSLED DAVARLAARCL+AVSAY+RTLE+VETL
Sbjct: 181 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFD+ILNSP+LDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDDILNSPTLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MY LYKATKS LRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSE KDPNA+YTT
Sbjct: 301 MYRLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREAR M QPIVIQRLFILKDTIETEYLEQN+ QNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNELQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           S+P  NH S +A+SI
Sbjct: 421 SKP--NHVSANAVSI 432

BLAST of CsGy4G018700 vs. ExPASy TrEMBL
Match: A0A0A0L3X1 (Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1)

HSP 1 Score: 848 bits (2191), Expect = 2.81e-310
Identity = 435/435 (100.00%), Postives = 435/435 (100.00%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP
Sbjct: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL
Sbjct: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           SRPSHNHGSEDAISI
Sbjct: 421 SRPSHNHGSEDAISI 435

BLAST of CsGy4G018700 vs. ExPASy TrEMBL
Match: A0A1S3B4W5 (uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3656 GN=LOC103486045 PE=4 SV=1)

HSP 1 Score: 822 bits (2124), Expect = 4.57e-300
Identity = 420/435 (96.55%), Postives = 426/435 (97.93%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFY+SSTK FIFP+FSTTL PLPSIYSASPFKPSPK SKSDNRT+VTITAP
Sbjct: 1   MAFTNHLPFQFYISSTKSFIFPNFSTTLKPLPSIYSASPFKPSPKFSKSDNRTTVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQ EMEVAKGYSLSQFCDKIIDIF+NEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIIDIFMNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVA R KEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETL
Sbjct: 181 INAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQ KFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT FSPGDGSEQKDPNALYTT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQKDPNALYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           SRP+HNHGSEDAISI
Sbjct: 421 SRPNHNHGSEDAISI 435

BLAST of CsGy4G018700 vs. ExPASy TrEMBL
Match: A0A6J1HRT8 (uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE=4 SV=1)

HSP 1 Score: 718 bits (1853), Expect = 7.87e-259
Identity = 369/435 (84.83%), Postives = 397/435 (91.26%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MA TN L FQ  +SST+ FIF  FS   NPLPSI SA PFKP+PK SKSDNR + T+  P
Sbjct: 1   MAITNQLAFQLSISSTRTFIFRRFSAAQNPLPSISSAIPFKPAPKNSKSDNRATATVPTP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           +Q FNASAR NDVAT+E EEQ EMEVA+GY++SQFCDKIIDIF+NEKPKTKEWRK LVFR
Sbjct: 61  MQ-FNASARANDVATTEMEEQTEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKL+SL R+VK+IDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKR KEFT++FFKFLTL+SETHDSLED DAVARLAARCL+AVSAY+RTLE+VETL
Sbjct: 181 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFD+ILNSP+LDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDDILNSPTLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MY LYKATKS LRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSE KDPNA+YTT
Sbjct: 301 MYRLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREAR M QPIVIQRLFILKDTIETEYLEQN+ QNPQ
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNELQNPQ 420

Query: 421 SRPSHNHGSEDAISI 435
           S+P  NH S +A+SI
Sbjct: 421 SKP--NHVSANAVSI 432

BLAST of CsGy4G018700 vs. ExPASy TrEMBL
Match: A0A6J1F3Z5 (uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 PE=4 SV=1)

HSP 1 Score: 716 bits (1849), Expect = 3.20e-258
Identity = 370/435 (85.06%), Postives = 396/435 (91.03%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MA TN L FQ  +SSTK FIF  FS    PLPSI SA+PFK SPK SKSDNR + T+  P
Sbjct: 1   MAITNQLAFQLSISSTKTFIFRRFSAAQKPLPSISSATPFKSSPKNSKSDNRATATVPTP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           +Q FNASAR NDVAT+E EEQ EMEVA+GY++SQFCDKIIDIF+NEKPKTKEWRK LVFR
Sbjct: 61  MQ-FNASARTNDVATTEMEEQAEMEVAEGYTISQFCDKIIDIFMNEKPKTKEWRKLLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKL+SL R+VK+IDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLLSLGRRVKRIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVAKR KEFT++FFKFLTL+SETHDSLED DAVARLAARCL+AVSAY+RTLE+VETL
Sbjct: 181 INAIVAKRRKEFTEDFFKFLTLVSETHDSLEDHDAVARLAARCLSAVSAYDRTLEHVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQVKFD+ILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQVKFDDILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTT 360
           MYHLYKATKS LRSMAPKEIKLLKHLLNIVDPEERFSALAT F+PGDGSE KDPNA+YTT
Sbjct: 301 MYHLYKATKSGLRSMAPKEIKLLKHLLNIVDPEERFSALATAFAPGDGSEPKDPNAIYTT 360

Query: 361 PKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQNQFQNPQ 420
           PKELHKWIKIMLDSYHLNQEDTDIREAR M QPIVIQRLFILKDTIETEYLEQN+ QN Q
Sbjct: 361 PKELHKWIKIMLDSYHLNQEDTDIREARKMAQPIVIQRLFILKDTIETEYLEQNESQNAQ 420

Query: 421 SRPSHNHGSEDAISI 435
           S+P  NH S +A+SI
Sbjct: 421 SKP--NHVSTNAVSI 432

BLAST of CsGy4G018700 vs. ExPASy TrEMBL
Match: A0A1S4DV48 (uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3656 GN=LOC103486045 PE=4 SV=1)

HSP 1 Score: 669 bits (1725), Expect = 1.61e-240
Identity = 345/359 (96.10%), Postives = 350/359 (97.49%), Query Frame = 0

Query: 1   MAFTNHLPFQFYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAP 60
           MAFTNHLPFQFY+SSTK FIFP+FSTTL PLPSIYSASPFKPSPK SKSDNRT+VTITAP
Sbjct: 1   MAFTNHLPFQFYISSTKSFIFPNFSTTLKPLPSIYSASPFKPSPKFSKSDNRTTVTITAP 60

Query: 61  LQIFNASARVNDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFR 120
           LQIFNASARVNDVATSEKEEQ EMEVAKGYSLSQFCDKIIDIF+NEKPKTKEWRKFLVFR
Sbjct: 61  LQIFNASARVNDVATSEKEEQAEMEVAKGYSLSQFCDKIIDIFMNEKPKTKEWRKFLVFR 120

Query: 121 EEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180
           EEWKKYRESFYSHCQRRADWE DPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD
Sbjct: 121 EEWKKYRESFYSHCQRRADWESDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTD 180

Query: 181 INAIVAKRHKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETL 240
           INAIVA R KEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAY+RTLENVETL
Sbjct: 181 INAIVANRRKEFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYDRTLENVETL 240

Query: 241 DSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300
           DSAQ KFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI
Sbjct: 241 DSAQAKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEI 300

Query: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYT 359
           MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALAT FSPGDGSEQKDPNALYT
Sbjct: 301 MYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATAFSPGDGSEQKDPNALYT 359

BLAST of CsGy4G018700 vs. TAIR 10
Match: AT4G37920.1 (unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast thylakoid membrane, chloroplast, chloroplast envelope; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G36320.1); Has 123 Blast hits to 120 proteins in 40 species: Archae - 2; Bacteria - 11; Metazoa - 8; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 17 (source: NCBI BLink). )

HSP 1 Score: 501.1 bits (1289), Expect = 8.8e-142
Identity = 266/403 (66.00%), Postives = 323/403 (80.15%), Query Frame = 0

Query: 11  FYVSSTKPFIFPSFSTTLNPLPSIYSASPFKPSPKISKSDNRTSVTITAPLQIFNASARV 70
           F+ S+ K   FP  ++  + LP  +SA       KI KS   T  T T     +N +   
Sbjct: 10  FFSSADKLLSFPPKNSQTHHLP--FSAF-INGGRKIRKSSTITFATDTV---TYNGTTSA 69

Query: 71  NDVATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLVFREEWKKYRESF 130
                S  E+ +E+EVA+GY+++QFCDKIID+FLNEKPK K+W+ +LV R+EW KY  +F
Sbjct: 70  E--VKSSVEDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNF 129

Query: 131 YSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDSPTDINAIVAKRHK 190
           Y  C+ RAD E DPI+K+KL+SL  KVKKID EME H++LLKE+Q++PTDINAI AKR +
Sbjct: 130 YKRCRIRADTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRR 189

Query: 191 EFTDEFFKFLTLISETHDSLEDRDAVARLAARCLAAVSAYNRTLENVETLDSAQVKFDNI 250
           +FT EFF+++TL+SET D LEDRDAVARLA RCL+AVSAY+ TLE+VETLD+AQ KF++I
Sbjct: 190 DFTGEFFRYVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDI 249

Query: 251 LNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNEVKEIMYHLYKATKS 310
           LNSPS+D ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE K+IMYHLYKATKS
Sbjct: 250 LNSPSVDSACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKS 309

Query: 311 SLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNALYTTPKELHKWIKI 370
           SLRS+ PKEIKLLK+LLNI DPEERFSALAT FSPGD  E KDP ALYTTPKELHKWIKI
Sbjct: 310 SLRSITPKEIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKI 369

Query: 371 MLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYLEQ 414
           MLD+YHLN+E+TDI+EA+ M+QPIVIQRLFILKDTIE EYL++
Sbjct: 370 MLDAYHLNKEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDK 404

BLAST of CsGy4G018700 vs. TAIR 10
Match: AT1G36320.1 (unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT4G37920.1); Has 93 Blast hits to 90 proteins in 22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 85; Viruses - 0; Other Eukaryotes - 8 (source: NCBI BLink). )

HSP 1 Score: 294.3 bits (752), Expect = 1.6e-79
Identity = 150/355 (42.25%), Postives = 245/355 (69.01%), Query Frame = 0

Query: 62  QIFNASARVND---VATSEKEEQVEMEVAKGYSLSQFCDKIIDIFLNEKPKTKEWRKFLV 121
           Q F  SA V+D   VA  EK++  E  V     + + CDK+I++F+ +KP   +WR+ L 
Sbjct: 60  QRFVISAVVDDKSVVAKEEKKDGSEEVVVDNQRMIKVCDKLIEVFMVDKPTPSDWRRLLA 119

Query: 122 FREEWKKYRESFYSHCQRRADWEDDPIMKEKLISLRRKVKKIDDEMEIHSELLKELQDS- 181
           F +EW   R  FY  CQ RAD ED+P MK K+  L RK+K++D++++ H+ELL  ++ + 
Sbjct: 120 FSKEWDSIRPHFYKRCQERADSEDNPEMKHKVHRLARKLKEVDEDIQRHNELLNVIKRTP 179

Query: 182 PTDINAIVAKRHKEFTDEFFKFLTLISET-HDSLEDRDAVARLAARCLAAVSAYNRTLEN 241
           P +I  +VA+R K+FT+EFF+ L  ++E+ +D+ ++++A+A L    +AAV AY+ + E+
Sbjct: 180 PAEIGELVARRRKDFTNEFFEHLHTVAESYYDNPDEQNALASLGKLSIAAVQAYDTSTES 239

Query: 242 VETLDSAQVKFDNILNSPSLDVACEKIASLAKAKELDSSLILLINSAWASAKESTTMKNE 301
           ++ L++A++K  +I+NSPSLD AC KI SLA+  +LDS+L+L+I  AW++AKES  MK E
Sbjct: 240 IDALNAAEMKLQDIINSPSLDAACRKIDSLAEKNQLDSALVLMITKAWSAAKESNMMKEE 299

Query: 302 VKEIMYHLYKATKSSLRSMAPKEIKLLKHLLNIVDPEERFSALATTFSPGDGSEQKDPNA 361
           VK+I+YHLY   + +L+ + PKE+++LK+LL+I DP+E+ SAL   F+PGD  E  D + 
Sbjct: 300 VKDILYHLYVTARGNLQRLMPKEVRILKYLLSIEDPQEQISALQDAFTPGDELEGTDVDY 359

Query: 362 LYTTPKELHKWIKIMLDSYHLNQEDTDIREARNMTQPIVIQRLFILKDTIETEYL 412
           LYTTP+ L   +K +L++YH ++E + ++EA+++  P +I ++  LK  +E +Y+
Sbjct: 360 LYTTPEHLQSLMKTVLEAYHFSREGSLVKEAKDLMHPELIAKIEQLKKLVEKKYM 414

The following BLAST results are available for this feature:
Match NameE-valueIdentityDescription
Q84WN01.2e-14066.00Uncharacterized protein At4g37920 OS=Arabidopsis thaliana OX=3702 GN=At4g37920 P... [more]
Match NameE-valueIdentityDescription
XP_004146379.15.81e-310100.00uncharacterized protein At4g37920 isoform X1 [Cucumis sativus] >KGN54831.1 hypot... [more]
XP_008442081.19.44e-30096.55PREDICTED: uncharacterized protein At4g37920, chloroplastic isoform X1 [Cucumis ... [more]
XP_038883875.11.36e-27389.89uncharacterized protein At4g37920 isoform X2 [Benincasa hispida][more]
XP_038883874.15.35e-26987.19uncharacterized protein At4g37920 isoform X1 [Benincasa hispida][more]
XP_022967802.11.62e-25884.83uncharacterized protein At4g37920 [Cucurbita maxima][more]
Match NameE-valueIdentityDescription
A0A0A0L3X12.81e-310100.00Uncharacterized protein OS=Cucumis sativus OX=3659 GN=Csa_4G515540 PE=4 SV=1[more]
A0A1S3B4W54.57e-30096.55uncharacterized protein At4g37920, chloroplastic isoform X1 OS=Cucumis melo OX=3... [more]
A0A6J1HRT87.87e-25984.83uncharacterized protein At4g37920 OS=Cucurbita maxima OX=3661 GN=LOC111467208 PE... [more]
A0A6J1F3Z53.20e-25885.06uncharacterized protein At4g37920 OS=Cucurbita moschata OX=3662 GN=LOC111439867 ... [more]
A0A1S4DV481.61e-24096.10uncharacterized protein At4g37920, chloroplastic isoform X2 OS=Cucumis melo OX=3... [more]
Match NameE-valueIdentityDescription
AT4G37920.18.8e-14266.00unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biologic... [more]
AT1G36320.11.6e-7942.25unknown protein; BEST Arabidopsis thaliana protein match is: unknown protein (TA... [more]
InterPro
Analysis Name: InterPro Annotations of Cucumber (Gy14) v2.1
Date Performed: 2021-10-25
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
NoneNo IPR availableCOILSCoilCoilcoord: 157..177
NoneNo IPR availableMOBIDB_LITEmobidb-litedisorder_predictioncoord: 414..435
NoneNo IPR availablePANTHERPTHR31755:SF2ENDORIBONUCLEASE E-LIKE PROTEINcoord: 50..422
IPR040320Uncharacterized protein At4g37920-likePANTHERPTHR31755FOLATE RECEPTOR-LIKEcoord: 50..422

Relationships

The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameType
CsGy4G018700.2CsGy4G018700.2mRNA