Csa1G025090 (gene) Cucumber (Chinese Long) v2

Name: Csa1G025090
Type: gene
Organism: Cucumis sativus (Cucumber (Chinese Long) v2)
Description: Nuclear localized protein 1
Location: Chr1 : 2717104 .. 2721185 (+)
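The location above implies a span length for the gene. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive (the record does not state this). The helper name `span_length` is hypothetical and only illustrates the calculation.

```python
import re


def span_length(location: str) -> int:
    """Return the number of bases covered by a 'start .. end' location string.

    Assumes 1-based, inclusive coordinates (an assumption, not stated in the record).
    """
    match = re.search(r"(\d+)\s*\.\.\s*(\d+)", location)
    if match is None:
        raise ValueError(f"no 'start .. end' range found in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


length = span_length("Chr1 : 2717104 .. 2721185 (+)")
print(length)
```

Under the inclusive-coordinate assumption this gives 4082 bases for the annotated gene region.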
The following sequences are available for this feature:

Gene sequence (with intron)

Legend: five_prime_UTR, CDS, three_prime_UTR
GAATCAAAATGGACGGCGATGGTGTTCTTCATCTTTCAACTTCAAATTCCCGCGCTTCCCTTTCTTCAATCGCCGGCTTCGACAGTTTTCCTCCTAATTCGAAGCTCCAAAAATGGAACCATGGGAAGCCCTTGATCTTGATTACTCCGACGTACACTCCCTCCTCCGCCCTTTGAAGCGTCACCGTAGTCCCCAGCCCCTCTCCCCTTCCTCCGCCTCCACTTCCACTCTCTCTCTGCCGCTTCTTGAAACCTGCTCTCTCCCCCCTTCCCAGTCCCAACCCCGGGTTGATAACCTCCAATCGGAGTTGAGTTTATCACCTCAGGCTTCAATTTGTCGATCCCAACGGATTTCAACTGAATTAGAGGCCTCGTGTCCGTCCGGTGCGTCTACACGTATCATTCCTGGTCCTGCTGGAGCGGTTCAGGTAGCAATGCAGCGTAGAACTCGTGGTGATCACTCTTGTGTCGGTGATGAAGAGCCCGTTCCTACTCAGGAGTACATAAGGAGAGTCATTGAGAATGGGGATGAAGAGGACGATGATTTCAATCGCAGTGCGTGGGTTTGCGCTTTGGATTTTGTCCGCGGCATAGGTTATTTTTCTTTTTCTTTTTCTGTTTGTTTGGTGATTTGAGAAGTGTGGAATTGTTGGGTCTTTAATTCAAGGATTTCTTTTGTAGGTGCGATGGAGGGTAATGGAGCTGTGTCTGAAACTCCTTTGAACTCTATCAAGAATGGCTTCATCGATGAGAAAGTTGGTTTTGTAAGTTTTATTCAGATCCTTTTCGTTTTCTTTTTTTGGCGCGAAAAGATTTTGGAAGTTTGATAAATCAAATTCCTGGCCTATAGCTACAATTGCAATTTTATATGCTGTTCAAATGATTATTACTTAAAACCCATTTGTTGGATGTTCCTTCTTTATATAAGGGCATCAATTATTGCTAATATATTCACTTTGGATTCTGTAATTGCAAATTTTCCATTAGGGTTTTGAGTTTTCTGTTCATGTCATTCCCCTGGCTGCTTTGTTAGGTCGTAGCCATTATCAAATCTTGTACCTCAAATGGTCTGGGTGGAATGATGGTAGCTTTAAAGGTTAGATTTCCGAGTATCCATGTCCTATACGGCGGAAGTAGCATTCTAAGATTTCAATACTCTGTGGTTACGAAAATAAGCACGAACTTCTGGTGTCAGGATCCAACAGGTACAATAGACGCTAGCATCCACCATAGAGTCATTTCTGAAGGAAATTTTGGGAAGGACTTGTCTGTTGGTGCAGTTTTAATATTGCAGAAGGTTTGATTTATTTATTTGTATTTGATGATTGGGTTGAATTTATTATTTGATTCTGTTTGGAACTTATACTTTCGTGAACTAGAGTTCCACCTGCTTTTCTCGATGAATAAATGCTCATCTCATCTTTCATTTATCGTCCAGGTTGCCGTGTTTTCTCCCACTCGTTCTGTACATGTGCTCAATGTAACAAGAAGTAACGTTGTCAAGGTATGGTCTGCTTTAAGAATTGCAGTTTGTGAGGCTTCATACTAGGGAGCATCTTATTCTGATATGAGTATTTTGTTTAATTTTTGTATGTATTAAAAGCAACCTACTACTTTCGTGGAAGAAAAATATATATATATATATATACAAGGACATACAAAAAAATCCTTCCCACAAAAGAAGGGAGTTCTCTCTACAAAGAGGCAACTAGACAAAATAACGCCTAAAGAATATTATAAGAGGTCTTTGCAGTTGAAACTCAAATAGAAACATGGAAATGAATGGAAGAAAATCTAACTAGGCCCTTTCCACACCACTAGACACCCTACTATTCTGGTCAACCCAAAAAACCCAATAAATAACATACACCCCACAAACCATAAAGAGCGACCCTTCTTCCCATGAGGTGAATTGAGAAGAAACTCTTTGATTTCAACACTCGCATTTTTCTAGTGATCTTAATCCTCCAAATGACTACAAATACCTACACTCTTAAGGGA
GAAGGATCAACCAAACAACAAAAGAAAGAACTACATGAGAACCCCTCCAAAAGATTGGGGCTCCAAGTCCTCACGCCCCTTCTCTCATACCTAAAGGGGTGACCCTACTAGAAGATTGGCCATATTCGTCCCTATCTTATTGGAAAGGGGGCGATATTATCAGAAAGAAAAAGAATAAGAGCTTTCAGACCACATTAAAAAATCGAGAAGAAAATGATTTTTGAGGGAAGACAAAGACAAGTAAACAATGAACAGAGGAGGTCCATCTCCCACCAAATGGTCTTCCCAAAAATTCGTATCCTTCCCTCCCCAATACACAACGGACCAAGTGAACAAATGAAGAGAGTTCTTTCAAAATATTCTTCCACAAGTTCTAGTGAGTGCCTTTAACCCTCTTTTGACAGTGGGTAGGGGCCATTTGTAACAATGTGAGTTTCTTGTTTCTAGCCATCTAATTCACGTGTACGCACCTTTTCTACTTGAAAATTAAGGTTCTGAAAGTAAAAGGAATCATTACTGGCTTACTGGACAAAGGAGATTCGTGTTTTCACGGCATCTATTGTGGCAAATCACAAGAGTTAGACAACTTATCCACTCTTTCACTTGGTGACAAACGTCATGTGTGATTGGACAGCCCATATGCATGCGCATAATGATTGATATTTAAGAAAAATCTGACTCCTAGTTTGTTTGACATTGGCAATCATAAGCTTATTTGAACAGAGTTAGATAGCTGATACCTATTGTTGTCTGTGGACTTGTACTTCCCTCTTGTATTCCTGTTGTCTGCTAGAGAATGCACTAGATTCCTTCAGGGAAATTATTGTAGGTTTAAAACTTTCCAGGATAATTTTTCAGGAAACTAATTTTAACCTAGTGTGGATGTAACGTAGATATATATAGTTATAGAACAGTAAGGTCAACTCTTTAAAGATTCATAGAAAATTAGAACGTTTTATTTTTTTTCCCCCCTTTCCTTTATGATTTACTTTAACTTCAAGGCATACACTTGTACTTAACTTCTGCTTATGCAATATGCTTGATACTGTAGGTTATCTCCAAGGACAGTGGACCTCTTATAAAGCATAATAGCCCTACAGCAATCAGACAGTCTGATTCTATAACTGGAGGTTCGCAATGTTTTCTCATTCAAATTTCTTGTCTAAATCCCCCAAATTGATTACGTCTAATAAGATTTTTGGTTTCTCATTAACAGTGTATCACAAGAATGACTAGAATGAGATCCATTTTTTTCAATTAATCTCCTTTCACTGTCCAGATACACATGGAGTACACATGCCGCAGATGAATTCTGATGTATCACGTGAATCAACTCAAAATATCATGAACAATCTAAAGCAAAATTCTAAATTGAGAGGGAATGGACTAGATGATTTACAAACAGGAAAAGGAATTGCTGCATCGTCTAGAAATTGGAAATGGAATGAAACCGTTGGAAACCGACAGTCCATCGAGAAAGAAGGGGGAGTGATAGATGTGGGTATCTCTAAAGGAACCCCAAGTGTTGGTTGTAACACAGTCCATGTTGATCAGGATCAAGGAAGAGGATCGGATGAGCCTATCAACCATCCTATGGGTACAGATCCAGCCAAAGAAAATGGTGCTGCATCCAACACTGTTCAACTCCCAAATAATCAAGAAGTTGAAACAATCAATGAGATGAAAAAGACAGTCACACGAACACAACAACCATTACTTCCACAATGGACAGATGAGCAGTTAGATGAGCTCTTTGTATTTGACTGAGGTAATGTAATCTTGAAGCTTATTTCAGTAGGTTATATCAGTGGAAGTTCTTGCAAAAGATAGGTCCTCTTCATCACTGCCCAAAATCTACAATCTACAATCTCTTTCGACGACATGGGAACTGGTTTTTCTTCATAAATCTACAATCTCTTTCTGTGAAGAAGCAACTGACTAATATTTTACAGCAGTTCCAGCTTGGGAATTTTGAGGGAGAGATTAACTGTAGTTTGATT
GAGATATCAAAGTGTGACAAATGCAATGCACTTGAATTAGGCACGTTTAAGAGGAAGTTTAAGAGAATCAAAATCACTCCAA

mRNA sequence

ATGGAACCATGGGAAGCCCTTGATCTTGATTACTCCGACGTACACTCCCTCCTCCGCCCTTTGAAGCGTCACCGTAGTCCCCAGCCCCTCTCCCCTTCCTCCGCCTCCACTTCCACTCTCTCTCTGCCGCTTCTTGAAACCTGCTCTCTCCCCCCTTCCCAGTCCCAACCCCGGGTTGATAACCTCCAATCGGAGTTGAGTTTATCACCTCAGGCTTCAATTTGTCGATCCCAACGGATTTCAACTGAATTAGAGGCCTCGTGTCCGTCCGGTGCGTCTACACGTATCATTCCTGGTCCTGCTGGAGCGGTTCAGGTAGCAATGCAGCGTAGAACTCGTGGTGATCACTCTTGTGTCGGTGATGAAGAGCCCGTTCCTACTCAGGAGTACATAAGGAGAGTCATTGAGAATGGGGATGAAGAGGACGATGATTTCAATCGCAGTGCGTGGGTTTGCGCTTTGGATTTTGTCCGCGGCATAGGTGCGATGGAGGGTAATGGAGCTGTGTCTGAAACTCCTTTGAACTCTATCAAGAATGGCTTCATCGATGAGAAAGTTGGTTTTGTCGTAGCCATTATCAAATCTTGTACCTCAAATGGTCTGGGTGGAATGATGGTAGCTTTAAAGGATCCAACAGGTACAATAGACGCTAGCATCCACCATAGAGTCATTTCTGAAGGAAATTTTGGGAAGGACTTGTCTGTTGGTGCAGTTTTAATATTGCAGAAGGTTGCCGTGTTTTCTCCCACTCGTTCTGTACATGTGCTCAATGTAACAAGAAGTAACGTTGTCAAGGTTATCTCCAAGGACAGTGGACCTCTTATAAAGCATAATAGCCCTACAGCAATCAGACAGTCTGATTCTATAACTGGAGATACACATGGAGTACACATGCCGCAGATGAATTCTGATGTATCACGTGAATCAACTCAAAATATCATGAACAATCTAAAGCAAAATTCTAAATTGAGAGGGAATGGACTAGATGATTTACAAACAGGAAAAGGAATTGCTGCATCGTCTAGAAATTGGAAATGGAATGAAACCGTTGGAAACCGACAGTCCATCGAGAAAGAAGGGGGAGTGATAGATGTGGGTATCTCTAAAGGAACCCCAAGTGTTGGTTGTAACACAGTCCATGTTGATCAGGATCAAGGAAGAGGATCGGATGAGCCTATCAACCATCCTATGGGTACAGATCCAGCCAAAGAAAATGGTGCTGCATCCAACACTGTTCAACTCCCAAATAATCAAGAAGTTGAAACAATCAATGAGATGAAAAAGACAGTCACACGAACACAACAACCATTACTTCCACAATGGACAGATGAGCAGTTAGATGAGCTCTTTGTATTTGACTGA

Coding sequence (CDS)

ATGGAACCATGGGAAGCCCTTGATCTTGATTACTCCGACGTACACTCCCTCCTCCGCCCTTTGAAGCGTCACCGTAGTCCCCAGCCCCTCTCCCCTTCCTCCGCCTCCACTTCCACTCTCTCTCTGCCGCTTCTTGAAACCTGCTCTCTCCCCCCTTCCCAGTCCCAACCCCGGGTTGATAACCTCCAATCGGAGTTGAGTTTATCACCTCAGGCTTCAATTTGTCGATCCCAACGGATTTCAACTGAATTAGAGGCCTCGTGTCCGTCCGGTGCGTCTACACGTATCATTCCTGGTCCTGCTGGAGCGGTTCAGGTAGCAATGCAGCGTAGAACTCGTGGTGATCACTCTTGTGTCGGTGATGAAGAGCCCGTTCCTACTCAGGAGTACATAAGGAGAGTCATTGAGAATGGGGATGAAGAGGACGATGATTTCAATCGCAGTGCGTGGGTTTGCGCTTTGGATTTTGTCCGCGGCATAGGTGCGATGGAGGGTAATGGAGCTGTGTCTGAAACTCCTTTGAACTCTATCAAGAATGGCTTCATCGATGAGAAAGTTGGTTTTGTCGTAGCCATTATCAAATCTTGTACCTCAAATGGTCTGGGTGGAATGATGGTAGCTTTAAAGGATCCAACAGGTACAATAGACGCTAGCATCCACCATAGAGTCATTTCTGAAGGAAATTTTGGGAAGGACTTGTCTGTTGGTGCAGTTTTAATATTGCAGAAGGTTGCCGTGTTTTCTCCCACTCGTTCTGTACATGTGCTCAATGTAACAAGAAGTAACGTTGTCAAGGTTATCTCCAAGGACAGTGGACCTCTTATAAAGCATAATAGCCCTACAGCAATCAGACAGTCTGATTCTATAACTGGAGATACACATGGAGTACACATGCCGCAGATGAATTCTGATGTATCACGTGAATCAACTCAAAATATCATGAACAATCTAAAGCAAAATTCTAAATTGAGAGGGAATGGACTAGATGATTTACAAACAGGAAAAGGAATTGCTGCATCGTCTAGAAATTGGAAATGGAATGAAACCGTTGGAAACCGACAGTCCATCGAGAAAGAAGGGGGAGTGATAGATGTGGGTATCTCTAAAGGAACCCCAAGTGTTGGTTGTAACACAGTCCATGTTGATCAGGATCAAGGAAGAGGATCGGATGAGCCTATCAACCATCCTATGGGTACAGATCCAGCCAAAGAAAATGGTGCTGCATCCAACACTGTTCAACTCCCAAATAATCAAGAAGTTGAAACAATCAATGAGATGAAAAAGACAGTCACACGAACACAACAACCATTACTTCCACAATGGACAGATGAGCAGTTAGATGAGCTCTTTGTATTTGACTGA

Protein sequence

MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDNLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVGDEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNGFIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEGGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEVETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD*
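The protein sequence should be the translation of the CDS listed above it. The sketch below checks this on the first and last codons of the CDS, assuming the standard genetic code applies. The function name `translate` and the compact codon-table construction are illustrative choices, not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    cds = cds.upper()
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


# First 36 and last 18 nucleotides of the CDS shown above.
cds_start = "ATGGAACCATGGGAAGCCCTTGATCTTGATTACTCC"
cds_end = "CTCTTTGTATTTGACTGA"
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the beginning (MEPWEALDLDYS) and the end (LFVFD*) of the listed protein, which is consistent with the protein being a frame-1 translation of the CDS.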
BLAST of Csa1G025090 vs. Swiss-Prot
Match: CQ053_RAT (Uncharacterized protein C17orf53 homolog OS=Rattus norvegicus PE=2 SV=1)

HSP 1 Score: 73.6 bits (179), Expect = 6.4e-12
Identity = 79/304 (25.99%), Positives = 127/304 (41.78%), Query Frame = 1

Query: 31  SPSSASTSTLSLPLLETCSLPPSQSQPRVDNLQSELSLSPQASICRSQRISTELEASCPS 90
           SPS A  S++  P     S   + +QP    LQ+ +  +    +  +   + +  +    
Sbjct: 284 SPSRAPVSSVESPFSTPRSTSTTVTQPA---LQTPVVTNHLVQLVTATNRTPQQPSRPSI 343

Query: 91  GASTRIIPGPAGAVQVAMQRRTRGDHSCVGDEEPVPTQEYIRR-----VIENGDEEDDDF 150
            A TR  PGPAG +    Q         +      PT   + +     V  +    ++DF
Sbjct: 344 RAKTRRFPGPAGLLP--HQHSGENLEEIMVSTPQTPTHGALAKFQTEIVTSSQGSVEEDF 403

Query: 151 NRSAWVCALDFVRGIGAMEGN----------------GAVSETPLNSIKNGFIDEKVGFV 210
            R  W   L     +G  EG+                 A+ + P N + N         +
Sbjct: 404 GRGPW---LTMKSALGLDEGDPTCFLYTYSIVMVLRKAALKQLPRNKVPN---------M 463

Query: 211 VAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSP 270
             +IKS T + +   +V  KDPTG +  ++H RV+ E +   +L  G+VL+L+++ VFSP
Sbjct: 464 AVMIKSLTRSTMDASVV-FKDPTGEMLGTVH-RVLLETH-QNELKPGSVLLLKQIGVFSP 523

Query: 271 TRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQMNSDVSRES 314
           +   H LNVT +N+V + S DSG       P  + +     G++HG   P    DV+ E 
Sbjct: 524 SLRNHYLNVTPNNLVHIYSLDSGDGDFLKPPQPLPKD---LGNSHGSLQP----DVAAEP 560

BLAST of Csa1G025090 vs. Swiss-Prot
Match: CQ053_HUMAN (Uncharacterized protein C17orf53 OS=Homo sapiens GN=C17orf53 PE=1 SV=1)

HSP 1 Score: 69.7 bits (169), Expect = 9.3e-11
Identity = 65/262 (24.81%), Positives = 106/262 (40.46%), Query Frame = 1

Query: 28  QPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDNLQSELSLSPQASICRSQRISTELEAS 87
           QP +P S+  S +  P     +L        +  L +  S +PQ     S R        
Sbjct: 341 QPQAPVSSIGSPVGTPKGPQGALQTPIVTNHLVQLVTAASRTPQQPTHPSTR-------- 400

Query: 88  CPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVGDEEPV--PTQEYIRRVIENGDEE-DDD 147
               A TR  PGPAG +      R+  D      + P      ++   ++ +     ++D
Sbjct: 401 ----AKTRRFPGPAGILPHQQSGRSLEDIMVSAPQTPTHGALAKFQTEIVASSQASVEED 460

Query: 148 FNRSAWV--------------CALDFVRGIGAMEGNGAVSETPLNSIKNGFIDEKVGFVV 207
           F R  W+              C L     +  +    A+ + P N + N         + 
Sbjct: 461 FGRGPWLTMKSTLGLDERDPSCFLCTYSIVMVLRKQAALKQLPRNKVPN---------MA 520

Query: 208 AIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLILQKVAVFSPT 267
            +IKS T + +   +V  KDPTG +  ++H  ++       +L  G+VL+L+++ VFSP+
Sbjct: 521 VMIKSLTRSTMDASVV-FKDPTGEMQGTVHRLLLETCQ--NELKPGSVLLLKQIGVFSPS 578

Query: 268 RSVHVLNVTRSNVVKVISKDSG 273
              H LNVT +N+V + S DSG
Sbjct: 581 LRNHYLNVTPNNLVHIYSPDSG 578

BLAST of Csa1G025090 vs. Swiss-Prot
Match: CQ053_MOUSE (Uncharacterized protein C17orf53 homolog OS=Mus musculus PE=2 SV=2)

HSP 1 Score: 64.7 bits (156), Expect = 3.0e-09
Identity = 69/254 (27.17%), Positives = 108/254 (42.52%), Query Frame = 1

Query: 82  TELEASCPS-GASTRIIPGPAGAVQVAMQRRTRGDHSCVGDEEPVPTQEYIRR-----VI 141
           T  + S PS  A TR  PGPAG +    Q         +      PT   + +       
Sbjct: 348 TPQQPSRPSIRAKTRRFPGPAGLLP--HQHSGENLEEIMVSTPQTPTHGALAKFQTEIAT 407

Query: 142 ENGDEEDDDFNRSAWVCALDFVRGIGAMEGN----------------GAVSETPLNSIKN 201
            +    ++DF +  W   L     +G  EG+                 A+ + P N + N
Sbjct: 408 SSQGSVEEDFGQGPW---LTMKSALGLDEGDPTCFLYTYSIVMVLRKAALKQLPRNKVPN 467

Query: 202 GFIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVL 261
                    +  +IKS T + +   +V  KDPTG +  ++H RV+ E +   +L  G+VL
Sbjct: 468 ---------MAVMIKSLTRSTMDASVV-FKDPTGEMLGTVH-RVLLETH-QSELRPGSVL 527

Query: 262 ILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMP 314
           +L+++ VFSP+   H LNVT +N+V + S DSG       P  + +     G++HG   P
Sbjct: 528 LLKQIGVFSPSLRNHYLNVTPNNLVHIYSLDSGDGDFLEPPQPLPKD---LGNSHGSLQP 577

BLAST of Csa1G025090 vs. TrEMBL
Match: A0A0A0LSW2_CUCSA (Uncharacterized protein OS=Cucumis sativus GN=Csa_1G025090 PE=4 SV=1)

HSP 1 Score: 909.1 bits (2348), Expect = 2.2e-261
Identity = 453/453 (100.00%), Positives = 453/453 (100.00%), Query Frame = 1

Query: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 60
           MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD
Sbjct: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 60

Query: 61  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120
           NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG
Sbjct: 61  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120

Query: 121 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 180
           DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG
Sbjct: 121 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 180

Query: 181 FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI 240
           FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI
Sbjct: 181 FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI 240

Query: 241 LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQ 300
           LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQ
Sbjct: 241 LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQ 300

Query: 301 MNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEG 360
           MNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEG
Sbjct: 301 MNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEG 360

Query: 361 GVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEV 420
           GVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEV
Sbjct: 361 GVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEV 420

Query: 421 ETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD 454
           ETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD
Sbjct: 421 ETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD 453

BLAST of Csa1G025090 vs. TrEMBL
Match: A0A061FVS5_THECC (Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_012841 PE=4 SV=1)

HSP 1 Score: 290.8 bits (743), Expect = 2.9e-75
Identity = 192/468 (41.03%), Positives = 256/468 (54.70%), Query Frame = 1

Query: 2   EPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDN 61
           +PWEALDLD SD+ SLLRP KR                         S PPS     + N
Sbjct: 44  DPWEALDLDASDLPSLLRPCKRK---------------------PRYSPPPSP----IKN 103

Query: 62  LQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG- 121
           LQ   +  P +S C                    +IPGPAGAVQ AM R+ +   + VG 
Sbjct: 104 LQPTPNSPPPSSPC--------------------LIPGPAGAVQAAMLRKIQNKSNPVGI 163

Query: 122 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 181
            EEP+PTQEYIRR +E+   +DDDF+R+ W+ AL+F+R  G  +  G +  TPL+ IK  
Sbjct: 164 GEEPLPTQEYIRRAVEDPGADDDDFSRAPWLFALEFIRREGLADDGGTIG-TPLSWIKTE 223

Query: 182 --FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAV 241
               + KV  +VA+IKSCT NGLG +MV LKDPTGTIDASIH +V+ EG FGKD+SVG V
Sbjct: 224 PKMGNRKVAQIVAVIKSCTPNGLGDLMVTLKDPTGTIDASIHRKVLVEGGFGKDISVGTV 283

Query: 242 LILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSP-TAIRQSDSITGDTHGVH 301
           LILQKV++FSP+RSVH LN+T SNVVK ISKDSGP  + N P + +  +D    ++   +
Sbjct: 284 LILQKVSIFSPSRSVHYLNITLSNVVKAISKDSGPPSQQNYPASTVIPTDHGVENSKQPY 343

Query: 302 MPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDL------QTGKGIAASSRNWKWNETVG 361
           + Q  S +S+E T+ IMN+L+Q   +RG   +D         G     + RN   N  +G
Sbjct: 344 IQQKVSTLSQERTEGIMNSLRQTGYMRGRVHNDKGIEGNEALGSSCCINERNRNQNAFIG 403

Query: 362 NRQSIEKE--GGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAAS 421
              S+ ++   G+    +  GT     N V   Q   R      NH      + ++   +
Sbjct: 404 KGHSVRQDILSGLKKAAVLAGTNEYEENVVLEKQSSPRNLAASGNHV----ESNQSSGGA 458

Query: 422 NTVQLPNNQEVETINEMKKTVTRTQQPL----LPQWTDEQLDELFVFD 454
           N V + +NQ+  T N  KK   + + P+    LPQWTDEQLDELF FD
Sbjct: 464 NLVGVASNQKTVTDNGDKK---QGRLPISSGSLPQWTDEQLDELFAFD 458

BLAST of Csa1G025090 vs. TrEMBL
Match: A0A061FW98_THECC (Uncharacterized protein isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_012841 PE=4 SV=1)

HSP 1 Score: 290.4 bits (742), Expect = 3.7e-75
Identity = 191/468 (40.81%), Positives = 255/468 (54.49%), Query Frame = 1

Query: 2   EPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDN 61
           +PWEALDLD SD+ SLLRP KR                         S PPS     + N
Sbjct: 3   DPWEALDLDASDLPSLLRPCKRK---------------------PRYSPPPSP----IKN 62

Query: 62  LQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG- 121
           LQ   +  P +S C                    +IPGPAGAVQ AM R+ +   + VG 
Sbjct: 63  LQPTPNSPPPSSPC--------------------LIPGPAGAVQAAMLRKIQNKSNPVGI 122

Query: 122 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 181
            EEP+PTQEYIRR +E+   +DDDF+R+ W+ AL+F+R    +  +G    TPL+ IK  
Sbjct: 123 GEEPLPTQEYIRRAVEDPGADDDDFSRAPWLFALEFIRREEGLADDGGTIGTPLSWIKTE 182

Query: 182 --FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAV 241
               + KV  +VA+IKSCT NGLG +MV LKDPTGTIDASIH +V+ EG FGKD+SVG V
Sbjct: 183 PKMGNRKVAQIVAVIKSCTPNGLGDLMVTLKDPTGTIDASIHRKVLVEGGFGKDISVGTV 242

Query: 242 LILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSP-TAIRQSDSITGDTHGVH 301
           LILQKV++FSP+RSVH LN+T SNVVK ISKDSGP  + N P + +  +D    ++   +
Sbjct: 243 LILQKVSIFSPSRSVHYLNITLSNVVKAISKDSGPPSQQNYPASTVIPTDHGVENSKQPY 302

Query: 302 MPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDL------QTGKGIAASSRNWKWNETVG 361
           + Q  S +S+E T+ IMN+L+Q   +RG   +D         G     + RN   N  +G
Sbjct: 303 IQQKVSTLSQERTEGIMNSLRQTGYMRGRVHNDKGIEGNEALGSSCCINERNRNQNAFIG 362

Query: 362 NRQSIEKE--GGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAAS 421
              S+ ++   G+    +  GT     N V   Q   R      NH      + ++   +
Sbjct: 363 KGHSVRQDILSGLKKAAVLAGTNEYEENVVLEKQSSPRNLAASGNHV----ESNQSSGGA 418

Query: 422 NTVQLPNNQEVETINEMKKTVTRTQQPL----LPQWTDEQLDELFVFD 454
           N V + +NQ+  T N  KK   + + P+    LPQWTDEQLDELF FD
Sbjct: 423 NLVGVASNQKTVTDNGDKK---QGRLPISSGSLPQWTDEQLDELFAFD 418

BLAST of Csa1G025090 vs. TrEMBL
Match: A0A061FWU7_THECC (Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_012841 PE=4 SV=1)

HSP 1 Score: 285.4 bits (729), Expect = 1.2e-73
Identity = 194/482 (40.25%), Positives = 256/482 (53.11%), Query Frame = 1

Query: 2   EPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDN 61
           +PWEALDLD SD+ SLLRP KR                         S PPS     + N
Sbjct: 4   DPWEALDLDASDLPSLLRPCKRK---------------------PRYSPPPSP----IKN 63

Query: 62  LQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG- 121
           LQ   +  P +S C                    +IPGPAGAVQ AM R+ +   + VG 
Sbjct: 64  LQPTPNSPPPSSPC--------------------LIPGPAGAVQAAMLRKIQNKSNPVGI 123

Query: 122 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 181
            EEP+PTQEYIRR +E+   +DDDF+R+ W+ AL+F+R  G  +  G +  TPL+ IK  
Sbjct: 124 GEEPLPTQEYIRRAVEDPGADDDDFSRAPWLFALEFIRREGLADDGGTIG-TPLSWIKTE 183

Query: 182 --FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAV 241
               + KV  +VA+IKSCT NGLG +MV LKDPTGTIDASIH +V+ EG FGKD+SVG V
Sbjct: 184 PKMGNRKVAQIVAVIKSCTPNGLGDLMVTLKDPTGTIDASIHRKVLVEGGFGKDISVGTV 243

Query: 242 LILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSP--TAIRQSDSI------- 301
           LILQKV++FSP+RSVH LN+T SNVVK ISKDSGP  + N P  T I     +       
Sbjct: 244 LILQKVSIFSPSRSVHYLNITLSNVVKAISKDSGPPSQQNYPASTVIPTDHGVVQLIIFF 303

Query: 302 ------TGDTHGVHMPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDL------QTGKGI 361
                 T ++   ++ Q  S +S+E T+ IMN+L+Q   +RG   +D         G   
Sbjct: 304 GFFPYSTENSKQPYIQQKVSTLSQERTEGIMNSLRQTGYMRGRVHNDKGIEGNEALGSSC 363

Query: 362 AASSRNWKWNETVGNRQSIEKE--GGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINH 421
             + RN   N  +G   S+ ++   G+    +  GT     N V   Q   R      NH
Sbjct: 364 CINERNRNQNAFIGKGHSVRQDILSGLKKAAVLAGTNEYEENVVLEKQSSPRNLAASGNH 423

Query: 422 PMGTDPAKENGAASNTVQLPNNQEVETINEMKKTVTRTQQPL----LPQWTDEQLDELFV 454
                 + ++   +N V + +NQ+  T N  KK   + + P+    LPQWTDEQLDELF 
Sbjct: 424 V----ESNQSSGGANLVGVASNQKTVTDNGDKK---QGRLPISSGSLPQWTDEQLDELFA 432

BLAST of Csa1G025090 vs. TrEMBL
Match: W9RTC8_9ROSA (Uncharacterized protein OS=Morus notabilis GN=L484_017363 PE=4 SV=1)

HSP 1 Score: 279.6 bits (714), Expect = 6.6e-72
Identity = 193/488 (39.55%), Positives = 260/488 (53.28%), Query Frame = 1

Query: 2   EPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDN 61
           EPWEALD+D SD    LRP KRH                 LP+L   S  PSQSQ     
Sbjct: 8   EPWEALDIDDSDT-PFLRPCKRHNQ--------------DLPILSQSSSSPSQSQ----- 67

Query: 62  LQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVGD 121
                   P+                 PS  S  +IPGPAGAVQ AM RR R D S  GD
Sbjct: 68  -------QPK-----------------PSPPSPPLIPGPAGAVQAAMHRRARKDWSFAGD 127

Query: 122 EEPVPTQEYIRRVIENGD--EEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKN 181
           E+P+PTQEYIR+V+ENGD  ++DDDF  + W+ ALDFV+     EGN A+S TP  SIK 
Sbjct: 128 EDPIPTQEYIRKVLENGDVCDDDDDFTSNPWLSALDFVQ----REGNMAISGTPPRSIKK 187

Query: 182 GFIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVL 241
           G   +KV +VVA+IKSCT NGLG +MV LKDP+GT+ ASIH +V+ +  FGK +SVGAVL
Sbjct: 188 GIHTDKVDWVVALIKSCTPNGLGDLMVTLKDPSGTMGASIHRKVLLDEEFGKIISVGAVL 247

Query: 242 ILQKVAVFSPTRSVHVLNVTRSNVVK----------------------------VISKDS 301
           +L+KVAVF+P+RS + LN+T +NVVK                            V+SKDS
Sbjct: 248 VLKKVAVFAPSRSAYYLNITLNNVVKVFSYDCEWNDREQSEALQMQRCDKARKRVMSKDS 307

Query: 302 GPLIKHNSP-TAIRQSDSITGDTHGVHMPQMNSDVSRESTQNIMNNLKQNSKLR----GN 361
           GP  K N P +++R +   +   +   MPQ+   V++E T+ IM+NL++ S+ R    GN
Sbjct: 308 GPPSKTNYPSSSVRCAAETSERCNASRMPQIM--VTQERTEGIMSNLRKRSERRGSVHGN 367

Query: 362 GLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEGGVIDVGISKGTPSVGCNTVHVDQDQG 421
            + +  T   I+  S     N T     + EKE     + +++ T    C+   +  D+ 
Sbjct: 368 RVLEGNTIPDISCFSNENSRNPTA----NTEKESSFKKIAVTENT----CSDHVIVTDKQ 427

Query: 422 RGSDEPINHPMGTDPAKENGAASNTVQLPNNQEVETINEMK-KTVTRTQQPLLPQWTDEQ 454
                P      +   +   A +N V++  +QE E  +  K +T     +   P+WTDEQ
Sbjct: 428 PNLWIPAERDNSSHSTQTINATANLVEVSADQETEIASGTKPQTKLPVSRISPPEWTDEQ 437

BLAST of Csa1G025090 vs. TAIR10
Match: AT1G48580.1 (AT1G48580.1 unknown protein)

HSP 1 Score: 214.2 bits (544), Expect = 1.7e-55
Identity = 163/487 (33.47%), Positives = 240/487 (49.28%), Query Frame = 1

Query: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 60
           ++ WEALDL  S++ S LRP KR    + L P +                   Q  P+  
Sbjct: 7   VDQWEALDLGDSELPSFLRPCKRKSPTRSLQPHA------------------QQQNPKA- 66

Query: 61  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120
              S  +  P    C S     E         S  +IPGPAG VQVA++R+   D     
Sbjct: 67  GFNSNTNHRPTLRRCSSPDKFLE------ESYSRSLIPGPAGVVQVAIRRKMNKDPKSFN 126

Query: 121 DE-EPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKN 180
           +  EP+PTQE++R+  E  D ED DF+   WV  +D++R  G +   G    TP++ IK 
Sbjct: 127 EHGEPIPTQEFLRKAAEEPDWEDKDFSEDPWVSTVDYIRSEGLLSNGGNAIGTPVSEIKR 186

Query: 181 GFID-EKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAV 240
                 KV  VVAI+K+CT NGLG +MV LKDPTGTIDAS+H +VISE  FG+D+ VGAV
Sbjct: 187 RCDSWGKVDQVVAIVKTCTPNGLGDVMVTLKDPTGTIDASVHRKVISESEFGRDIRVGAV 246

Query: 241 LILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHN-SPTAIRQSDSITGDTHGVH 300
           +IL++VAV +P+RS   LN+T  N+ KVI+KD+  L   N S  + +   S+ G      
Sbjct: 247 VILKQVAVCAPSRSSTYLNITLKNISKVITKDTPVLPNQNDSEMSAKNHVSVNG------ 306

Query: 301 MPQMNSDVSRESTQNI--MNNLKQNSKLRGNGLD-DLQTGKGIAASSRNWKWNETVGNRQ 360
           +P+ N   S  S +N+  +N  ++  +L+ N    +  T +GI  + R        G+ +
Sbjct: 307 LPKQND--SEMSGKNLVPVNENEEYLRLQPNVFSVEQSTTQGIMNNLR----QNAKGSSE 366

Query: 361 SIEKEGGVIDVGI---SKGTPSVGCNTVHVD----------QDQGRGSDEPINHPMGTD- 420
           ++  +  ++D+     SK +P  G    H +           D    +++ +   + T+ 
Sbjct: 367 ALH-DIEMVDINPAEGSKSSPKKGVTKNHCEVRMEQTLLGKHDSISQTEQQLYEDVATET 426

Query: 421 -------PAKENGAASNTVQLP-------NNQEVETINEMKKTVTRTQQPLLPQWTDEQL 454
                  PAK+   +S + Q+        N  EV T   + K+ +      LPQWTDEQL
Sbjct: 427 DIADCIRPAKQIRRSSQS-QIDEQESVMGNPDEVTTRTTIHKSQSMASTISLPQWTDEQL 454

BLAST of Csa1G025090 vs. NCBI nr
Match: gi|449439563|ref|XP_004137555.1| (PREDICTED: uncharacterized protein C17orf53 homolog [Cucumis sativus])

HSP 1 Score: 909.1 bits (2348), Expect = 3.2e-261
Identity = 453/453 (100.00%), Positives = 453/453 (100.00%), Query Frame = 1

Query: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 60
           MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD
Sbjct: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 60

Query: 61  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120
           NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG
Sbjct: 61  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120

Query: 121 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 180
           DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG
Sbjct: 121 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 180

Query: 181 FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI 240
           FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI
Sbjct: 181 FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI 240

Query: 241 LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQ 300
           LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQ
Sbjct: 241 LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHGVHMPQ 300

Query: 301 MNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEG 360
           MNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEG
Sbjct: 301 MNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQSIEKEG 360

Query: 361 GVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEV 420
           GVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEV
Sbjct: 361 GVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQEV 420

Query: 421 ETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD 454
           ETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD
Sbjct: 421 ETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD 453

BLAST of Csa1G025090 vs. NCBI nr
Match: gi|659106975|ref|XP_008453483.1| (PREDICTED: uncharacterized protein C17orf53 homolog [Cucumis melo])

HSP 1 Score: 823.2 bits (2125), Expect = 2.3e-235
Identity = 418/455 (91.87%), Positives = 429/455 (94.29%), Query Frame = 1

Query: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 60
           MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSS +TSTLSLPLLETCSLPPS+SQPRVD
Sbjct: 1   MEPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSTATSTLSLPLLETCSLPPSKSQPRVD 60

Query: 61  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120
           NLQSELSLSPQAS+CRSQRIST LEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG
Sbjct: 61  NLQSELSLSPQASLCRSQRISTGLEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG 120

Query: 121 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 180
           DEEPVPTQEYIRRV+ENGDEEDDDFNRS WVCALDFVR IGAMEGNGAVSETPLNSIKNG
Sbjct: 121 DEEPVPTQEYIRRVMENGDEEDDDFNRSPWVCALDFVRSIGAMEGNGAVSETPLNSIKNG 180

Query: 181 FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAVLI 240
           FIDEKVG VVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEG FGKDLSVGAVLI
Sbjct: 181 FIDEKVGLVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGIFGKDLSVGAVLI 240

Query: 241 LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTAIRQSDSITGDTHG-VHMP 300
           LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPT IR SD ITGDTHG VHM 
Sbjct: 241 LQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSPTPIRWSDFITGDTHGEVHMQ 300

Query: 301 QMNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQS-IEK 360
           QMNSDVSRESTQNIMNNL+Q+SKLR N L DL+TGKG AASS NW  NETVG+RQS +EK
Sbjct: 301 QMNSDVSRESTQNIMNNLRQSSKLRRNRLGDLRTGKGGAASSSNWNCNETVGSRQSVVEK 360

Query: 361 EGGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAASNTVQLPNNQ 420
           EGGVIDVGISK TPSVGCN V+VDQDQGRGSDEPINHPMGTD  KENGAAS+T QLPNNQ
Sbjct: 361 EGGVIDVGISKRTPSVGCNIVYVDQDQGRGSDEPINHPMGTDSTKENGAASSTAQLPNNQ 420

Query: 421 EVETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD 454
           E ETINEMKKTVTRTQQPLLPQWTDEQLDELF FD
Sbjct: 421 EAETINEMKKTVTRTQQPLLPQWTDEQLDELFEFD 455

BLAST of Csa1G025090 vs. NCBI nr
Match: gi|590665817|ref|XP_007036840.1| (Uncharacterized protein isoform 1 [Theobroma cacao])

HSP 1 Score: 290.8 bits (743), Expect = 4.1e-75
Identity = 192/468 (41.03%), Positives = 256/468 (54.70%), Query Frame = 1

Query: 2   EPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDN 61
           +PWEALDLD SD+ SLLRP KR                         S PPS     + N
Sbjct: 44  DPWEALDLDASDLPSLLRPCKRK---------------------PRYSPPPSP----IKN 103

Query: 62  LQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG- 121
           LQ   +  P +S C                    +IPGPAGAVQ AM R+ +   + VG 
Sbjct: 104 LQPTPNSPPPSSPC--------------------LIPGPAGAVQAAMLRKIQNKSNPVGI 163

Query: 122 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 181
            EEP+PTQEYIRR +E+   +DDDF+R+ W+ AL+F+R  G  +  G +  TPL+ IK  
Sbjct: 164 GEEPLPTQEYIRRAVEDPGADDDDFSRAPWLFALEFIRREGLADDGGTIG-TPLSWIKTE 223

Query: 182 --FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAV 241
               + KV  +VA+IKSCT NGLG +MV LKDPTGTIDASIH +V+ EG FGKD+SVG V
Sbjct: 224 PKMGNRKVAQIVAVIKSCTPNGLGDLMVTLKDPTGTIDASIHRKVLVEGGFGKDISVGTV 283

Query: 242 LILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSP-TAIRQSDSITGDTHGVH 301
           LILQKV++FSP+RSVH LN+T SNVVK ISKDSGP  + N P + +  +D    ++   +
Sbjct: 284 LILQKVSIFSPSRSVHYLNITLSNVVKAISKDSGPPSQQNYPASTVIPTDHGVENSKQPY 343

Query: 302 MPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDL------QTGKGIAASSRNWKWNETVG 361
           + Q  S +S+E T+ IMN+L+Q   +RG   +D         G     + RN   N  +G
Sbjct: 344 IQQKVSTLSQERTEGIMNSLRQTGYMRGRVHNDKGIEGNEALGSSCCINERNRNQNAFIG 403

Query: 362 NRQSIEKE--GGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAAS 421
              S+ ++   G+    +  GT     N V   Q   R      NH      + ++   +
Sbjct: 404 KGHSVRQDILSGLKKAAVLAGTNEYEENVVLEKQSSPRNLAASGNHV----ESNQSSGGA 458

Query: 422 NTVQLPNNQEVETINEMKKTVTRTQQPL----LPQWTDEQLDELFVFD 454
           N V + +NQ+  T N  KK   + + P+    LPQWTDEQLDELF FD
Sbjct: 464 NLVGVASNQKTVTDNGDKK---QGRLPISSGSLPQWTDEQLDELFAFD 458

BLAST of Csa1G025090 vs. NCBI nr
Match: gi|590665823|ref|XP_007036842.1| (Uncharacterized protein isoform 3, partial [Theobroma cacao])

HSP 1 Score: 290.4 bits (742), Expect = 5.3e-75
Identity = 191/468 (40.81%), Positives = 255/468 (54.49%), Query Frame = 1

Query: 2   EPWEALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVDN 61
           +PWEALDLD SD+ SLLRP KR                         S PPS     + N
Sbjct: 3   DPWEALDLDASDLPSLLRPCKRK---------------------PRYSPPPSP----IKN 62

Query: 62  LQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRRTRGDHSCVG- 121
           LQ   +  P +S C                    +IPGPAGAVQ AM R+ +   + VG 
Sbjct: 63  LQPTPNSPPPSSPC--------------------LIPGPAGAVQAAMLRKIQNKSNPVGI 122

Query: 122 DEEPVPTQEYIRRVIENGDEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLNSIKNG 181
            EEP+PTQEYIRR +E+   +DDDF+R+ W+ AL+F+R    +  +G    TPL+ IK  
Sbjct: 123 GEEPLPTQEYIRRAVEDPGADDDDFSRAPWLFALEFIRREEGLADDGGTIGTPLSWIKTE 182

Query: 182 --FIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSVGAV 241
               + KV  +VA+IKSCT NGLG +MV LKDPTGTIDASIH +V+ EG FGKD+SVG V
Sbjct: 183 PKMGNRKVAQIVAVIKSCTPNGLGDLMVTLKDPTGTIDASIHRKVLVEGGFGKDISVGTV 242

Query: 242 LILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHNSP-TAIRQSDSITGDTHGVH 301
           LILQKV++FSP+RSVH LN+T SNVVK ISKDSGP  + N P + +  +D    ++   +
Sbjct: 243 LILQKVSIFSPSRSVHYLNITLSNVVKAISKDSGPPSQQNYPASTVIPTDHGVENSKQPY 302

Query: 302 MPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDL------QTGKGIAASSRNWKWNETVG 361
           + Q  S +S+E T+ IMN+L+Q   +RG   +D         G     + RN   N  +G
Sbjct: 303 IQQKVSTLSQERTEGIMNSLRQTGYMRGRVHNDKGIEGNEALGSSCCINERNRNQNAFIG 362

Query: 362 NRQSIEKE--GGVIDVGISKGTPSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAAS 421
              S+ ++   G+    +  GT     N V   Q   R      NH      + ++   +
Sbjct: 363 KGHSVRQDILSGLKKAAVLAGTNEYEENVVLEKQSSPRNLAASGNHV----ESNQSSGGA 418

Query: 422 NTVQLPNNQEVETINEMKKTVTRTQQPL----LPQWTDEQLDELFVFD 454
           N V + +NQ+  T N  KK   + + P+    LPQWTDEQLDELF FD
Sbjct: 423 NLVGVASNQKTVTDNGDKK---QGRLPISSGSLPQWTDEQLDELFAFD 418

BLAST of Csa1G025090 vs. NCBI nr
Match: gi|802753740|ref|XP_012088560.1| (PREDICTED: uncharacterized protein LOC105647171 [Jatropha curcas])

HSP 1 Score: 288.9 bits (738), Expect = 1.6e-74
Identity = 184/465 (39.57%), Positives = 266/465 (57.20%), Query Frame = 1

Query: 2   EPWE-ALDLDYSDVHSLLRPLKRHRSPQPLSPSSASTSTLSLPLLETCSLPPSQSQPRVD 61
           EPWE ALDLD SD+ SL RP K  ++      ++A++ ++S P L  C+L  SQS     
Sbjct: 3   EPWEEALDLDDSDLSSL-RPFKHRKTT-----TAATSVSVSQPFLHRCTL--SQSSQNSQ 62

Query: 62  NLQSELSLSPQASICRSQRISTELEASCPSGASTRIIPGPAGAVQVAMQRR--TRGDHSC 121
           NL S+    P                  P  AS  +IPGPAG VQ AM RR   + D + 
Sbjct: 63  NLLSQFHPPP------------------PPSASPILIPGPAGTVQAAMLRRRNNQNDGNF 122

Query: 122 VGD--EEPVPTQEYIRRVIENG-DEEDDDFNRSAWVCALDFVRGIGAMEGNGAVSETPLN 181
            GD  EEP+PTQEYIRRV+E G  ++DDDF    W+ A++F+R  G   G+GA+   PL+
Sbjct: 123 TGDFGEEPIPTQEYIRRVVEEGVPQDDDDFTSDPWLYAVNFIRSQGLGYGDGAIG-IPLS 182

Query: 182 SIKNGFIDEKVGFVVAIIKSCTSNGLGGMMVALKDPTGTIDASIHHRVISEGNFGKDLSV 241
           ++K+    ++V  VVAI+KSCT NG G MMV LKDPTGTIDA+IH  V++EG FGK++S+
Sbjct: 183 AVKSRNKMDRVAQVVAIVKSCTPNGFGDMMVTLKDPTGTIDATIHGGVLTEGEFGKNISI 242

Query: 242 GAVLILQKVAVFSPTRSVHVLNVTRSNVVKVISKDSGPLIKHN-SPTAIRQSDSITGDTH 301
           G+V+ILQK+AVFSP+RS H LN+TRSN+VKVISKDS   + HN + + ++ +  ++    
Sbjct: 243 GSVIILQKIAVFSPSRSAHYLNITRSNMVKVISKDSELSLTHNCAASTVKHAAPMSEYNE 302

Query: 302 GVHMPQMNSDVSRESTQNIMNNLKQNSKLRGNGLDDLQTGKGIAASSRNWKWNETVGNRQ 361
              MP     +S+  T+ IMN+L+QN+  RG+ LD           S +   N    N  
Sbjct: 303 KSWMPNYPLSLSQGRTEGIMNSLRQNANKRGSSLDQHMERDNATRDSCHGNGNNEDQNVV 362

Query: 362 SIEKEGGVIDVGISKGT------PSVGCNTVHVDQDQGRGSDEPINHPMGTDPAKENGAA 421
           + ++ G  +   ++ GT       +V C  ++     GRG+         ++  +   AA
Sbjct: 363 AGKQNGANVTAEVADGTDQDNNGKAVVCERLNPSSQAGRGN--------LSEGDQYGSAA 422

Query: 422 SNTVQLPNNQEVETINEMKKTVTRTQQPLLPQWTDEQLDELFVFD 454
           +  + + +NQE+  I+  K     + +  +PQWTDEQLD+LF  D
Sbjct: 423 TGLIDVFDNQEIGNIDGPKWRRPPSSRVSVPQWTDEQLDKLFALD 432

The following BLAST results are available for this feature:
Match Name   E-value   Identity (%)  Description
CQ053_RAT    6.4e-12   25.99         Uncharacterized protein C17orf53 homolog OS=Rattus norvegicus PE=2 SV=1
CQ053_HUMAN  9.3e-11   24.81         Uncharacterized protein C17orf53 OS=Homo sapiens GN=C17orf53 PE=1 SV=1
CQ053_MOUSE  3.0e-09   27.17         Uncharacterized protein C17orf53 homolog OS=Mus musculus PE=2 SV=2

Match Name        E-value   Identity (%)  Description
A0A0A0LSW2_CUCSA  2.2e-261  100.00        Uncharacterized protein OS=Cucumis sativus GN=Csa_1G025090 PE=4 SV=1
A0A061FVS5_THECC  2.9e-75   41.03         Uncharacterized protein isoform 1 OS=Theobroma cacao GN=TCM_012841 PE=4 SV=1
A0A061FW98_THECC  3.7e-75   40.81         Uncharacterized protein isoform 3 (Fragment) OS=Theobroma cacao GN=TCM_012841 PE...
A0A061FWU7_THECC  1.2e-73   40.25         Uncharacterized protein isoform 2 OS=Theobroma cacao GN=TCM_012841 PE=4 SV=1
W9RTC8_9ROSA      6.6e-72   39.55         Uncharacterized protein OS=Morus notabilis GN=L484_017363 PE=4 SV=1

Match Name   E-value   Identity (%)  Description
AT1G48580.1  1.7e-55   33.47         unknown protein

Match Name                        E-value   Identity (%)  Description
gi|449439563|ref|XP_004137555.1|  3.2e-261  100.00        PREDICTED: uncharacterized protein C17orf53 homolog [Cucumis sativus]
gi|659106975|ref|XP_008453483.1|  2.3e-235  91.87         PREDICTED: uncharacterized protein C17orf53 homolog [Cucumis melo]
gi|590665817|ref|XP_007036840.1|  4.1e-75   41.03         Uncharacterized protein isoform 1 [Theobroma cacao]
gi|590665823|ref|XP_007036842.1|  5.3e-75   40.81         Uncharacterized protein isoform 3, partial [Theobroma cacao]
gi|802753740|ref|XP_012088560.1|  1.6e-74   39.57         PREDICTED: uncharacterized protein LOC105647171 [Jatropha curcas]

The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term       Definition
IPR028045  DUF4539
GO Assignments
This gene is annotated with the following GO terms.
Category Term Accession Term Name
biological_process GO:0008150 biological_process
cellular_component GO:0005575 cellular_component
molecular_function GO:0003674 molecular_function

The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Type
Csa1G025090.1  Csa1G025090.1  mRNA


Analysis Name: InterPro Annotations of cucumber (Chinese Long)
Date Performed: 2016-09-28
IPR Term   IPR Description                      Source   Source Term  Source Description  Alignment
IPR028045  Protein of unknown function DUF4539  PFAM     PF15072      DUF4539             coord: 184..269; score: 2.4
None       No IPR available                     PANTHER  PTHR14523    FAMILY NOT NAMED    coord: 51..453; score: 1.2
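
The `coord` values in the InterPro table give the span of each match on the protein. As a rough sketch of how such a span can be turned into a length, assuming the coordinates are 1-based and inclusive at both ends (an assumption, since this page does not state the convention; `span_length` is a hypothetical helper):

```python
import re


def span_length(coord: str) -> int:
    """Return the number of residues covered by a 'start..end' span.

    Assumes 1-based, inclusive coordinates, so the length is end - start + 1.
    Accepts an optional leading 'coord:' label as shown in the table.
    """
    match = re.fullmatch(r"\s*(?:coord:\s*)?(\d+)\.\.(\d+)\s*", coord)
    if match is None:
        raise ValueError(f"unrecognized coordinate string: {coord!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if end < start:
        raise ValueError("end coordinate precedes start coordinate")
    return end - start + 1


# Spans copied from the InterPro table above.
print(span_length("coord: 184..269"))  # PFAM PF15072 (DUF4539)
print(span_length("coord: 51..453"))   # PANTHER PTHR14523
```

Under that assumption the PFAM match covers 86 residues and the PANTHER match covers 403.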